RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= 012855
(455 letters)
>gnl|CDD|220676 pfam10288, DUF2392, Protein of unknown function (DUF2392). This is
a family of proteins conserved from plants to humans.
The function is not known. It carries a characteristic
GRG sequence motif.
Length = 104
Score = 110 bits (278), Expect = 1e-29
Identities = 37/105 (35%), Positives = 55/105 (52%), Gaps = 11/105 (10%)
Query: 231 IACHVITATVKGRGYSLPADIQYADAR-WEIPVVLPLRDCLAQELNMLCQLDCLKTVE-- 287
+A +++ T KGRG S+P D+ D+R +I ++ PLRD L +E+ C L L +
Sbjct: 1 LAAKLLSLTAKGRGSSIPWDVSDGDSRGNDIKILRPLRDLLLKEIEAYCSLKNLPPLLIN 60
Query: 288 -------LLNQTHSGINGLVSSFVKILQEENPSRESTIMRTAGKL 325
L++ S IN L + LQE PS ST++RT KL
Sbjct: 61 SLIQIASKLSKNMS-INELTEQYFDNLQENYPSTVSTVVRTGDKL 104
>gnl|CDD|223115 COG0037, MesJ, tRNA(Ile)-lysidine synthase MesJ [Cell cycle
control, cell division, chromosome partitioning].
Length = 298
Score = 34.5 bits (79), Expect = 0.084
Identities = 36/166 (21%), Positives = 49/166 (29%), Gaps = 39/166 (23%)
Query: 60 CFRSNLFGKFRLAVASNALITPADNVLVAFSGGPSSRVALQFVHELQQRAQKNFDASKDR 119
C R L K + A+ LI +LVA SGG S L + EL +R
Sbjct: 1 CLREKLERKVKRAIREFNLIE--YKILVAVSGGKDSLALLHLLKELGRR----------- 47
Query: 120 SLPVFGVGVVFVDETAYYPVPSSEIDNAIQEIKLIVSNLSPPTKELHVIPIESIFCSNPC 179
V V VD D + ++ + L P V
Sbjct: 48 ----IEVEAVHVDHGL-----RGYSDQEAELVEKLCEKLGIPLIVERVTDD--------- 89
Query: 180 DGRERFKKLVDSVSDATGKEDLLLQLRMLSLQKFASENGYNRLLLG 225
L D +LR L K A E G +++ G
Sbjct: 90 --------LGRETLDGKSICAACRRLRRGLLYKIAKELGADKIATG 127
>gnl|CDD|238951 cd01993, Alpha_ANH_like_II, This is a subfamily of Adenine
nucleotide alpha hydrolases superfamily.Adeninosine
nucleotide alpha hydrolases superfamily includes N type
ATP PPases and ATP sulphurylases. It forms a
apha/beta/apha fold which binds to Adenosine group.
This subfamily of proteins is predicted to bind ATP.
This domainhas a strongly conserved motif SGGKD at the
N terminus.
Length = 185
Score = 33.4 bits (77), Expect = 0.13
Identities = 35/147 (23%), Positives = 49/147 (33%), Gaps = 41/147 (27%)
Query: 85 VLVAFSGGPSSRVALQFVHELQQRAQKNFDASKDRSLPVFGVGVVFVDET-AYYPVPSSE 143
+LVA SGG S V L + +LQ+R F+ + + VDE Y S E
Sbjct: 2 ILVALSGGKDSLVLLHVLKKLQRRYPYGFE-----------LEALTVDEGIPGYRDESLE 50
Query: 144 I--DNAIQ-EIKLIVSNL--SPPTKELHVIPIESIFCSNPCDGRERFKKLVDSVSDATGK 198
+ A + I+L + + CS C G
Sbjct: 51 VVERLAEELGIELEIVSFKEEYTDDIEVKKRGGKSPCS-LC-GV---------------- 92
Query: 199 EDLLLQLRMLSLQKFASENGYNRLLLG 225
LR L K A E G ++L G
Sbjct: 93 ------LRRGLLNKIAKELGADKLATG 113
>gnl|CDD|236737 PRK10696, PRK10696, tRNA 2-thiocytidine biosynthesis protein TtcA;
Provisional.
Length = 258
Score = 33.7 bits (78), Expect = 0.14
Identities = 17/43 (39%), Positives = 23/43 (53%)
Query: 72 AVASNALITPADNVLVAFSGGPSSRVALQFVHELQQRAQKNFD 114
A+A +I D V+V SGG S L + LQ+RA NF+
Sbjct: 19 AIADFNMIEEGDRVMVCLSGGKDSYTLLDILLNLQKRAPINFE 61
>gnl|CDD|226478 COG3969, COG3969, Predicted phosphoadenosine phosphosulfate
sulfotransferase [General function prediction only].
Length = 407
Score = 32.4 bits (74), Expect = 0.56
Identities = 18/55 (32%), Positives = 28/55 (50%), Gaps = 12/55 (21%)
Query: 83 DNVLVAFSGGPSSRVALQFVHELQQRAQKNFDASKDRSLPVFGVGVVFVDETAYY 137
V V+FSGG S + L V E+ A++N +D+ + V+F+D A Y
Sbjct: 28 PRVCVSFSGGKDSGLMLHLVAEV---AREN---GRDK------ISVLFIDWEAQY 70
>gnl|CDD|140324 PTZ00303, PTZ00303, phosphatidylinositol kinase; Provisional.
Length = 1374
Score = 31.2 bits (70), Expect = 1.6
Identities = 20/55 (36%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
Query: 182 RERFKKLVDSVSDATGKEDL-LLQLRMLSLQKFASENGYNRLLLGLCTSRIACHV 235
R RF++L++ V L L+ + +LSLQ S GY+ L GL +RI V
Sbjct: 802 RRRFQQLLERVIPTVAGSSLSLVFIMLLSLQSTYSYYGYDEDLFGLWRTRIVAEV 856
>gnl|CDD|216343 pfam01171, ATP_bind_3, PP-loop family. This family of proteins
belongs to the PP-loop superfamily.
Length = 182
Score = 29.9 bits (68), Expect = 1.9
Identities = 11/31 (35%), Positives = 17/31 (54%)
Query: 85 VLVAFSGGPSSRVALQFVHELQQRAQKNFDA 115
+LVA SGGP S L + +L+ + + A
Sbjct: 2 ILVAVSGGPDSMALLYLLKKLKPKFGIDLTA 32
>gnl|CDD|213357 cd12823, Mrs2_Mfm1p-like, Saccharomyces cerevisiae inner
mitochondrial membrane Mg2+ transporters Mfm1p and
Mrs2p-like family. A eukaryotic subfamily belonging to
the Escherichia coli CorA-Salmonella typhimurium
ZntB_like family (EcCorA_ZntB-like) family of the MIT
superfamily of essential membrane proteins involved in
transporting divalent cations (uptake or efflux) across
membranes. This functionally diverse subfamily includes
the inner mitochondrial membrane Mg2+ transporters
Saccharomyces cerevisiae Mfm1p/Lpe10p, Mrs2p, and human
MRS2/ MRS2L. It also includes a family of Arabidopsis
thaliana proteins (AtMGTs) some of which are localized
to distinct tissues, and not all of which can transport
Mg2+. Structures of the intracellular domain of two
EcCorA_ZntB-like family transporters: Vibrio
parahaemolyticus and Salmonella typhimurium ZntB form
funnel-shaped homopentamers, the tip of the funnel is
formed from two C-terminal transmembrane (TM) helices
from each monomer, and the large opening of the funnel
from the N-terminal cytoplasmic domains. The GMN
signature motif of the MIT superfamily occurs just after
TM1, mutation within this motif is known to abolish Mg2+
transport through Salmonella typhimurium CorA, and
Mrs2p. Natural variants such as GVN and GIN, as in some
ZntB family proteins, may be associated with the
transport of different divalent cations, such as zinc
and cadmium. The functional diversity of MIT
transporters may also be due to minor structural
differences regulating gating, substrate selection, and
transport.
Length = 323
Score = 30.3 bits (69), Expect = 1.9
Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 5/50 (10%)
Query: 77 ALITPADNVLVAFSGGPSSRVALQFVHELQQRAQ----KNFDASKDRSLP 122
A+IT AD VL+ G SS + F+ ELQ+R ++ + SLP
Sbjct: 61 AIIT-ADEVLLFDPDGSSSALVSAFLEELQRRLASSNGSESESGGEDSLP 109
>gnl|CDD|215201 PLN02351, PLN02351, cytochromes b561 family protein.
Length = 242
Score = 29.9 bits (67), Expect = 2.7
Identities = 17/66 (25%), Positives = 32/66 (48%), Gaps = 3/66 (4%)
Query: 195 ATGKEDLLLQLRMLSLQKFASENGYNRLL---LGLCTSRIACHVITATVKGRGYSLPADI 251
AT + LL +L L ++ S++G ++ LGL + ++ VI A V + S + +
Sbjct: 172 ATAETGLLEKLTFLQTKRNVSKHGSESMVVNGLGLGLALLSGIVILAAVLPKYQSHSSKL 231
Query: 252 QYADAR 257
Y+
Sbjct: 232 VYSSQD 237
>gnl|CDD|200217 TIGR02880, cbbX_cfxQ, probable Rubsico expression protein CbbX.
Proteins in this family are now designated CbbX. Some
previously were CfxQ (carbon fixation Q). Its gene is
often found immmediately downstream of the Rubisco large
and small chain genes, and it is suggested to be
necessary for Rubisco expression. CbbX has been shown to
be necessary for photoautotrophic growth. This protein
belongs to the larger family of pfam00004, ATPase family
Associated with various cellular Activities. Within that
larger family, members of this family are most closely
related to the stage V sporulation protein K, or SpoVK,
in endospore-forming bacteria such as Bacillus subtilis.
Length = 284
Score = 29.8 bits (67), Expect = 2.7
Identities = 18/58 (31%), Positives = 30/58 (51%), Gaps = 6/58 (10%)
Query: 127 GVVFVDETAYYPVPSSEIDNAIQEIKLIVSNLSPPTKELHVI------PIESIFCSNP 178
GV+F+DE Y P +E D + I++++ + +L VI ++S F SNP
Sbjct: 123 GVLFIDEAYYLYRPDNERDYGQEAIEILLQVMENNRDDLVVILAGYKDRMDSFFESNP 180
>gnl|CDD|151956 pfam11519, DUF3222, Protein of unknown function (DUF3222). This
family of proteins with unknown function appears to be
restricted to Rhodopseudomonas.
Length = 74
Score = 27.7 bits (61), Expect = 2.8
Identities = 15/40 (37%), Positives = 20/40 (50%), Gaps = 2/40 (5%)
Query: 23 KPATETIS--DSNEQNLCVKCKANEPTPGAGEDGKHCLDC 60
K A ET+S D +N C C A+ P G++ H DC
Sbjct: 18 KTAIETVSEEDGGARNQCKLCNASVPWLQTGDEIVHQPDC 57
>gnl|CDD|163260 TIGR03432, yjhG_yagF, putative dehydratase, YjhG/YagF family. This
homolog of dihydroxy-acid dehydratases has an odd,
sparse distribution. Members are found in two
Acidobacteria, two Planctomycetes, Bacillus clausii
KSM-K16, and (in two copies each) in strains K12-MG1655
and W3110 of Escherichia coli. The local context is not
well conserved, but a few members are adjacent to
homologs of the gluconate:H+ symporter (see TIGR00791)
[Unknown function, Enzymes of unknown specificity].
Length = 640
Score = 29.8 bits (67), Expect = 3.5
Identities = 11/33 (33%), Positives = 18/33 (54%), Gaps = 1/33 (3%)
Query: 150 EIKLIVSNLSPPTKELHVIPIESIFCSNPCDGR 182
E+ L++ + K +P + F S+PCDGR
Sbjct: 85 EVGLLMKAAAEEIKRDGAVPF-AGFVSDPCDGR 116
>gnl|CDD|238950 cd01992, PP-ATPase, N-terminal domain of predicted ATPase of the
PP-loop faimly implicated in cell cycle control [Cell
division and chromosome partitioning]. This is a
subfamily of Adenine nucleotide alpha hydrolases
superfamily.Adeninosine nucleotide alpha hydrolases
superfamily includes N type ATP PPases and ATP
sulphurylases. It forms a apha/beta/apha fold which
binds to Adenosine group. This domain has a strongly
conserved motif SGGXD at the N terminus.
Length = 185
Score = 28.7 bits (65), Expect = 4.3
Identities = 13/31 (41%), Positives = 16/31 (51%)
Query: 85 VLVAFSGGPSSRVALQFVHELQQRAQKNFDA 115
+LVA SGGP S L + EL+ R A
Sbjct: 2 ILVAVSGGPDSMALLHLLSELKPRLGLRLVA 32
>gnl|CDD|223716 COG0643, CheA, Chemotaxis protein histidine kinase and related
kinases [Cell motility and secretion / Signal
transduction mechanisms].
Length = 716
Score = 29.2 bits (66), Expect = 5.0
Identities = 16/66 (24%), Positives = 30/66 (45%), Gaps = 15/66 (22%)
Query: 141 SSEIDNAIQEIKLIVSNLSPPTKELH-------VIPIESIFCSNPCDGRERFKKLVDSVS 193
+SE+D +E+ + LS T +L ++P E +F RF ++V ++
Sbjct: 356 ASELDEVREELDEALRQLSRLTTDLQDEVMKIRMVPFEQVF--------SRFPRMVRDLA 407
Query: 194 DATGKE 199
GK+
Sbjct: 408 RKLGKQ 413
>gnl|CDD|177083 CHL00181, cbbX, CbbX; Provisional.
Length = 287
Score = 28.9 bits (65), Expect = 5.6
Identities = 16/58 (27%), Positives = 29/58 (50%), Gaps = 6/58 (10%)
Query: 127 GVVFVDETAYYPVPSSEIDNAIQEIKLIVSNLSPPTKELHVI------PIESIFCSNP 178
GV+F+DE Y P +E D + I++++ + +L VI ++ + SNP
Sbjct: 124 GVLFIDEAYYLYKPDNERDYGSEAIEILLQVMENQRDDLVVIFAGYKDRMDKFYESNP 181
>gnl|CDD|168478 PRK06234, PRK06234, methionine gamma-lyase; Provisional.
Length = 400
Score = 28.6 bits (64), Expect = 6.6
Identities = 16/53 (30%), Positives = 25/53 (47%), Gaps = 5/53 (9%)
Query: 121 LPVFGVGVVFVDETAYYPVPSSEIDNAIQEIKLIVSNLSPPTKELHVIPIESI 173
L +GV V FVD + E+ NA++ +V +P L V I++I
Sbjct: 124 LTRYGVEVTFVDTSNL-----EEVRNALKANTKVVYLETPANPTLKVTDIKAI 171
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.133 0.393
Gapped
Lambda K H
0.267 0.0556 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,244,685
Number of extensions: 2096717
Number of successful extensions: 1684
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1681
Number of HSP's successfully gapped: 24
Length of query: 455
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 355
Effective length of database: 6,502,202
Effective search space: 2308281710
Effective search space used: 2308281710
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (27.7 bits)