RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12933
(65 letters)
>gnl|CDD|222269 pfam13621, Cupin_8, Cupin-like domain. This cupin like domain
shares similarity to the JmjC domain.
Length = 247
Score = 58.5 bits (142), Expect = 2e-12
Identities = 19/37 (51%), Positives = 24/37 (64%), Gaps = 1/37 (2%)
Query: 30 HLIECTLNPGDMLYIPPKVWHHVRSLS-TSISVSFWF 65
+ L PGD LYIP WHHVRSL +I+V++WF
Sbjct: 205 KALVAELEPGDALYIPAGWWHHVRSLDDFNIAVNYWF 241
>gnl|CDD|219693 pfam08007, Cupin_4, Cupin superfamily protein. This family
contains many hypothetical proteins that belong to the
cupin superfamily.
Length = 319
Score = 47.7 bits (114), Expect = 2e-08
Identities = 13/36 (36%), Positives = 18/36 (50%)
Query: 30 HLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSFWF 65
+ + L PGDMLY+P H +L S+ S F
Sbjct: 174 PVHDFVLEPGDMLYLPRGFIHQGVALDDSLHYSVGF 209
>gnl|CDD|225406 COG2850, COG2850, Uncharacterized conserved protein [Function
unknown].
Length = 383
Score = 38.9 bits (91), Expect = 3e-05
Identities = 12/25 (48%), Positives = 15/25 (60%), Gaps = 1/25 (4%)
Query: 28 PGHL-IECTLNPGDMLYIPPKVWHH 51
P I+ L PGD+LYIPP H+
Sbjct: 174 PFEPDIDEVLEPGDILYIPPGFPHY 198
>gnl|CDD|219618 pfam07883, Cupin_2, Cupin domain. This family represents the
conserved barrel domain of the 'cupin' superfamily
('cupa' is the Latin term for a small barrel).
Length = 70
Score = 34.9 bits (81), Expect = 2e-04
Identities = 9/25 (36%), Positives = 11/25 (44%)
Query: 33 ECTLNPGDMLYIPPKVWHHVRSLST 57
E L GD +Y P V H R+
Sbjct: 37 EVVLKAGDSVYFPAGVPHRFRNTGD 61
>gnl|CDD|202224 pfam02373, JmjC, JmjC domain, hydroxylase. The JmjC domain belongs
to the Cupin superfamily. JmjC-domain proteins may be
protein hydroxylases that catalyze a novel histone
modification. This is confirmed to be a hydroxylase: the
human JmjC protein named Tyw5p unexpectedly acts in the
biosynthesis of a hypermodified nucleoside,
hydroxy-wybutosine, in tRNA-Phe by catalyzing
hydroxylation.
Length = 114
Score = 32.7 bits (75), Expect = 0.004
Identities = 8/34 (23%), Positives = 16/34 (47%)
Query: 32 IECTLNPGDMLYIPPKVWHHVRSLSTSISVSFWF 65
PG+ ++ P +H V +L +I+ + F
Sbjct: 81 YRFVQKPGEFVFTFPGWYHQVFNLGFNIAEAVNF 114
>gnl|CDD|224829 COG1917, COG1917, Uncharacterized conserved protein, contains
double-stranded beta-helix domain [Function unknown].
Length = 131
Score = 30.6 bits (69), Expect = 0.026
Identities = 9/27 (33%), Positives = 15/27 (55%)
Query: 35 TLNPGDMLYIPPKVWHHVRSLSTSISV 61
L GD++ IPP V H ++++ V
Sbjct: 85 ELKAGDVIIIPPGVVHGLKAVEDEPMV 111
>gnl|CDD|223734 COG0662, {ManC}, Mannose-6-phosphate isomerase [Carbohydrate
transport and metabolism].
Length = 127
Score = 30.0 bits (68), Expect = 0.035
Identities = 8/20 (40%), Positives = 11/20 (55%)
Query: 35 TLNPGDMLYIPPKVWHHVRS 54
+ GD +YIP H VR+
Sbjct: 78 EVKAGDSVYIPAGTPHRVRN 97
>gnl|CDD|204196 pfam09313, DUF1971, Domain of unknown function (DUF1971).
Members of this family of functionally uncharacterized
domains are predominantly found in bacterial Tellurite
resistance protein.
Length = 82
Score = 28.3 bits (64), Expect = 0.094
Identities = 9/21 (42%), Positives = 11/21 (52%)
Query: 36 LNPGDMLYIPPKVWHHVRSLS 56
+ G IPP+ WH V LS
Sbjct: 54 IAAGQPAVIPPQYWHKVEPLS 74
>gnl|CDD|226343 COG3822, COG3822, ABC-type sugar transport system, auxiliary
component [General function prediction only].
Length = 225
Score = 29.1 bits (65), Expect = 0.11
Identities = 11/41 (26%), Positives = 19/41 (46%), Gaps = 13/41 (31%)
Query: 24 HNQTPGHLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSFW 64
T G ++ L+PG+ + +PP ++H SFW
Sbjct: 147 QTHTAGSQLK--LSPGESITLPPGLYH-----------SFW 174
>gnl|CDD|226586 COG4101, COG4101, Predicted mannose-6-phosphate isomerase
[Carbohydrate transport and metabolism].
Length = 142
Score = 27.9 bits (62), Expect = 0.21
Identities = 12/27 (44%), Positives = 14/27 (51%)
Query: 35 TLNPGDMLYIPPKVWHHVRSLSTSISV 61
+ PGD YIPP V H +LST
Sbjct: 91 EVGPGDFFYIPPGVPHQPANLSTEPLS 117
>gnl|CDD|225796 COG3257, GlxB, Uncharacterized protein, possibly involved in
glyoxylate utilization [General function prediction
only].
Length = 264
Score = 28.2 bits (63), Expect = 0.21
Identities = 8/31 (25%), Positives = 11/31 (35%)
Query: 35 TLNPGDMLYIPPKVWHHVRSLSTSISVSFWF 65
L G Y+PP +R+ S W
Sbjct: 104 ALREGGYAYLPPGSGWTLRNAQKEDSRFHWI 134
>gnl|CDD|203266 pfam05523, FdtA, WxcM-like, C-terminal. This family includes FdtA
from Aneurinibacillus thermoaerophilus, which has been
characterized as a dtdp-6-deoxy-3,4-keto-hexulose
isomerase. It also includes WxcM from Xanthomonas
campestris (pv. campestris).
Length = 131
Score = 27.9 bits (63), Expect = 0.24
Identities = 9/23 (39%), Positives = 13/23 (56%), Gaps = 1/23 (4%)
Query: 35 TLN-PGDMLYIPPKVWHHVRSLS 56
L+ P LYIPP +W + + S
Sbjct: 79 LLDSPDKGLYIPPMIWREMYNFS 101
>gnl|CDD|183011 PRK11171, PRK11171, hypothetical protein; Provisional.
Length = 266
Score = 27.9 bits (63), Expect = 0.24
Identities = 7/31 (22%), Positives = 12/31 (38%)
Query: 35 TLNPGDMLYIPPKVWHHVRSLSTSISVSFWF 65
L+ G Y+PP +R+ + W
Sbjct: 104 ALSEGGYAYLPPGSDWTLRNAGAEDARFHWI 134
>gnl|CDD|236717 PRK10572, PRK10572, DNA-binding transcriptional regulator AraC;
Provisional.
Length = 290
Score = 27.6 bits (62), Expect = 0.33
Identities = 8/15 (53%), Positives = 9/15 (60%)
Query: 37 NPGDMLYIPPKVWHH 51
PGD+L PP HH
Sbjct: 72 RPGDLLLFPPGEIHH 86
>gnl|CDD|224810 COG1898, RfbC, dTDP-4-dehydrorhamnose 3,5-epimerase and related
enzymes [Cell envelope biogenesis, outer membrane].
Length = 173
Score = 26.9 bits (60), Expect = 0.59
Identities = 12/44 (27%), Positives = 18/44 (40%), Gaps = 2/44 (4%)
Query: 23 PHNQTPGHLIECTLNPGD--MLYIPPKVWHHVRSLSTSISVSFW 64
+ T G + L+ + LYIPP H + LS V +
Sbjct: 87 KDSPTYGKWVGVVLSAENKRQLYIPPGFAHGFQVLSDDAEVVYK 130
>gnl|CDD|214845 smart00835, Cupin_1, Cupin. This family represents the conserved
barrel domain of the 'cupin' superfamily ('cupa' is the
Latin term for a small barrel). This family contains
11S and 7S plant seed storage proteins, and germins.
Plant seed storage proteins provide the major nitrogen
source for the developing plant.
Length = 146
Score = 26.5 bits (59), Expect = 0.70
Identities = 5/25 (20%), Positives = 10/25 (40%)
Query: 30 HLIECTLNPGDMLYIPPKVWHHVRS 54
+ + L GD+ +P H +
Sbjct: 73 KVYDARLREGDVFVVPQGHPHFQVN 97
>gnl|CDD|234250 TIGR03540, DapC_direct, LL-diaminopimelate aminotransferase. This
clade of the pfam00155 superfamily of aminotransferases
includes several which are adjacent to elements of the
lysine biosynthesis via diaminopimelate pathway
(GenProp0125). Every member of this clade is from a
genome which possesses most of the lysine biosynthesis
pathway but lacks any of the known aminotransferases,
succinylases, desuccinylases, acetylases or deacetylases
typical of the acylated versions of this pathway nor do
they have the direct, NADPH-dependent enzyme (ddh).
Although there is no experimental characterization of
any of the sequences in this clade, a direct pathway is
known in plants and Chlamydia and the clade containing
the Chlamydia gene is a neighboring one in the same
pfam00155 superfamily so it seems quite reasonable that
these enzymes catalyze the same transformation.
Length = 383
Score = 26.2 bits (58), Expect = 1.2
Identities = 6/18 (33%), Positives = 11/18 (61%)
Query: 29 GHLIECTLNPGDMLYIPP 46
H+ +NPGD++ +P
Sbjct: 105 AHIPLAFVNPGDIVLVPD 122
>gnl|CDD|219393 pfam07385, DUF1498, Protein of unknown function (DUF1498). This
family consists of several hypothetical bacterial
proteins of around 225 residues in length. The function
of this family is unknown.
Length = 225
Score = 25.9 bits (57), Expect = 1.2
Identities = 11/41 (26%), Positives = 17/41 (41%), Gaps = 13/41 (31%)
Query: 24 HNQTPGHLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSFW 64
G L++ L PG + +PP ++H SFW
Sbjct: 148 VTMKAGGLLK--LKPGQSICLPPGLYH-----------SFW 175
>gnl|CDD|225051 COG2140, COG2140, Thermophilic glucose-6-phosphate isomerase and
related metalloenzymes [Carbohydrate transport and
metabolism / General function prediction only].
Length = 209
Score = 25.9 bits (57), Expect = 1.3
Identities = 7/26 (26%), Positives = 12/26 (46%)
Query: 28 PGHLIECTLNPGDMLYIPPKVWHHVR 53
G + GD++Y+PP H+
Sbjct: 122 EGEARVIAVRAGDVIYVPPGYGHYTI 147
>gnl|CDD|219088 pfam06560, GPI, Glucose-6-phosphate isomerase (GPI). This family
consists of several bacterial and archaeal
glucose-6-phosphate isomerase (GPI) proteins
(EC:5.3.1.9).
Length = 181
Score = 26.0 bits (57), Expect = 1.3
Identities = 6/30 (20%), Positives = 12/30 (40%), Gaps = 1/30 (3%)
Query: 23 PHNQTPGHLIECTLNPGDMLYIPPKVWHHV 52
P + + G ++Y+PP + H
Sbjct: 100 PEGGAVDDVRVIEMEKGTVVYVPPY-YGHR 128
>gnl|CDD|183450 PRK12335, PRK12335, tellurite resistance protein TehB;
Provisional.
Length = 287
Score = 25.7 bits (57), Expect = 1.6
Identities = 5/17 (29%), Positives = 9/17 (52%)
Query: 40 DMLYIPPKVWHHVRSLS 56
+I P+ WH + + S
Sbjct: 66 QPPFIEPQAWHRIEAAS 82
>gnl|CDD|181219 PRK08068, PRK08068, transaminase; Reviewed.
Length = 389
Score = 25.7 bits (57), Expect = 1.7
Identities = 8/18 (44%), Positives = 12/18 (66%), Gaps = 3/18 (16%)
Query: 31 LIE---CTLNPGDMLYIP 45
L+E C +NPGD + +P
Sbjct: 107 LVELPQCLMNPGDTILVP 124
>gnl|CDD|145456 pfam02311, AraC_binding, AraC-like ligand binding domain. This
family represents the arabinose-binding and
dimerisation domain of the bacterial gene regulatory
protein AraC. The domain is found in conjunction with
the helix-turn-helix (HTH) DNA-binding motif pfam00165.
This domain is distantly related to the Cupin domain
pfam00190.
Length = 134
Score = 25.5 bits (56), Expect = 1.8
Identities = 7/22 (31%), Positives = 9/22 (40%)
Query: 33 ECTLNPGDMLYIPPKVWHHVRS 54
L PGD+ +PP H
Sbjct: 42 TYHLGPGDLFLLPPGEPHDYEP 63
>gnl|CDD|226142 COG3615, TehB, Uncharacterized protein/domain, possibly involved
in tellurite resistance [Inorganic ion transport and
metabolism].
Length = 99
Score = 25.1 bits (55), Expect = 2.1
Identities = 9/43 (20%), Positives = 17/43 (39%), Gaps = 5/43 (11%)
Query: 24 HNQTPGHLIECTLNPGDMLYIPPKVWHHVRSLS--TSISVSFW 64
P H+ PP+ WH V +++ ++SF+
Sbjct: 56 GETEPEHVFSIEAQFP---VFPPQAWHRVEAMTDDAEFNLSFY 95
>gnl|CDD|200251 TIGR03214, ura-cupin, putative allantoin catabolism protein. This
model represents a protein containing a tandem
arrangement of cupin domains (N-terminal part of
pfam07883 and C-terminal more distantly related to
pfam00190). This protein is found in the vicinity of
genes involved in the catabolism of allantoin, a
breakdown product of urate and sometimes of urate
iteslf. The distribution of pathway components in the
genomes in which this family is observed suggests that
the function is linked to the allantoate catabolism to
glyoxylate pathway (GenProp0686) since it is sometimes
found in genomes lacking any elements of the
xanthine-to-allantoin pathways (e.g. in Enterococcus
faecalis).
Length = 252
Score = 25.4 bits (56), Expect = 2.1
Identities = 6/31 (19%), Positives = 11/31 (35%)
Query: 35 TLNPGDMLYIPPKVWHHVRSLSTSISVSFWF 65
L G Y+PP + + + F +
Sbjct: 93 ELEEGGYAYLPPGSKWTLENAQDEDARFFLY 123
>gnl|CDD|183930 PRK13264, PRK13264, 3-hydroxyanthranilate 3,4-dioxygenase;
Provisional.
Length = 177
Score = 25.3 bits (56), Expect = 2.3
Identities = 8/25 (32%), Positives = 12/25 (48%)
Query: 26 QTPGHLIECTLNPGDMLYIPPKVWH 50
Q G + + G+M +PP V H
Sbjct: 70 QEDGKRRDVPIREGEMFLLPPHVPH 94
>gnl|CDD|232879 TIGR00218, manA, mannose-6-phosphate isomerase, class I. The names
phosphomannose isomerase and mannose-6-phosphate
isomerase are synonomous. This family contains two
rather deeply branched groups. One group contains an
experimentally determined phosphomannose isomerase of
Streptococcus mutans as well as three uncharacterized
paralogous proteins of Bacillus subtilis, all at more
than 50 % identity to each other, plus a more distant
homolog from Archaeoglobus fulgidus. The other group
contains members from E. coli, budding yeast, Borrelia
burgdorferi, etc [Energy metabolism, Sugars].
Length = 303
Score = 24.7 bits (54), Expect = 3.4
Identities = 6/17 (35%), Positives = 7/17 (41%)
Query: 34 CTLNPGDMLYIPPKVWH 50
L PGD Y+ H
Sbjct: 154 IKLKPGDFFYVRSGTPH 170
>gnl|CDD|183298 PRK11752, PRK11752, putative S-transferase; Provisional.
Length = 264
Score = 24.5 bits (54), Expect = 4.1
Identities = 6/9 (66%), Positives = 6/9 (66%)
Query: 43 YIPPKVWHH 51
Y PPKVW
Sbjct: 6 YQPPKVWTW 14
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.127 0.403
Gapped
Lambda K H
0.267 0.0745 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 2,893,671
Number of extensions: 180549
Number of successful extensions: 202
Number of sequences better than 10.0: 1
Number of HSP's gapped: 202
Number of HSP's successfully gapped: 28
Length of query: 65
Length of database: 10,937,602
Length adjustment: 36
Effective length of query: 29
Effective length of database: 9,340,858
Effective search space: 270884882
Effective search space used: 270884882
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 53 (24.2 bits)