RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy10967
(199 letters)
>gnl|CDD|235381 PRK05269, PRK05269, transaldolase B; Provisional.
Length = 318
Score = 163 bits (414), Expect = 1e-49
Identities = 58/81 (71%), Positives = 63/81 (77%)
Query: 113 YVANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTIG 172
Y NTG K YAP EDPGVVSVTKIYNYYKK GYKTVVMGASFRNTG+IL LAGCD +TI
Sbjct: 186 YKKNTGKKEYAPAEDPGVVSVTKIYNYYKKHGYKTVVMGASFRNTGQILELAGCDRLTIS 245
Query: 173 PKLLEELENSTTPVDQMLSEK 193
P LLEEL S +++ LS
Sbjct: 246 PALLEELAASEGELERKLSPP 266
>gnl|CDD|188644 cd00957, Transaldolase_TalAB, Transaldolases including both TalA
and TalB. Transaldolases including both TalA and TalB.
The enzyme catalyses the reversible transfer of a
dyhydroxyacetone moiety, derived from
fructose-6-phosphate to erythrose-4-phosphate yielding
sedoheptulose-7-phosphate and
glyceraldehyde-3-phosphate. The catalytic mechanism is
similar to other class I aldolases. The enzyme is found
in the non-oxidative branch of the pentose phosphate
pathway and forms a dimer in solution.
Length = 313
Score = 146 bits (370), Expect = 4e-43
Identities = 54/80 (67%), Positives = 61/80 (76%)
Query: 112 RYVANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTI 171
Y ++G K Y EDPGV SV KIYNYYKKFGYKT VMGASFRN G+ILALAGCD +TI
Sbjct: 183 WYKKHSGDKAYTAEEDPGVASVKKIYNYYKKFGYKTKVMGASFRNIGQILALAGCDYLTI 242
Query: 172 GPKLLEELENSTTPVDQMLS 191
P LLEEL+NST V++ L
Sbjct: 243 SPALLEELKNSTAKVERKLD 262
>gnl|CDD|162081 TIGR00874, talAB, transaldolase. This family includes the majority
of known and predicted transaldolase sequences,
including E. coli TalA and TalB. It excluded two other
families. The first includes E. coli transaldolase-like
protein TalC. The second family includes the putative
transaldolases of Helicobacter pylori and Mycobacterium
tuberculosis [Energy metabolism, Pentose phosphate
pathway].
Length = 317
Score = 142 bits (360), Expect = 1e-41
Identities = 55/84 (65%), Positives = 62/84 (73%)
Query: 113 YVANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTIG 172
Y A TG K Y+ EDPGV SV KIYNYYKK GY T VMGASFRN EILALAGCD +TI
Sbjct: 184 YKAATGKKEYSIEEDPGVASVKKIYNYYKKHGYPTEVMGASFRNKEEILALAGCDRLTIS 243
Query: 173 PKLLEELENSTTPVDQMLSEKSGK 196
P LL+EL+ ST PV++ L +S K
Sbjct: 244 PALLDELKESTGPVERKLDPESAK 267
>gnl|CDD|183426 PRK12309, PRK12309, transaldolase/EF-hand domain-containing
protein; Provisional.
Length = 391
Score = 140 bits (354), Expect = 4e-40
Identities = 50/78 (64%), Positives = 59/78 (75%)
Query: 113 YVANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTIG 172
Y TG +Y EDPGV SVT+IYNYYKKFGYKT VMGASFRN GEI+ LAGCDL+TI
Sbjct: 190 YKKETGRDSYPGAEDPGVQSVTQIYNYYKKFGYKTEVMGASFRNIGEIIELAGCDLLTIS 249
Query: 173 PKLLEELENSTTPVDQML 190
PKLLE+L ++ + + L
Sbjct: 250 PKLLEQLRSTEAELPRKL 267
>gnl|CDD|240406 PTZ00411, PTZ00411, transaldolase-like protein; Provisional.
Length = 333
Score = 129 bits (327), Expect = 1e-36
Identities = 52/81 (64%), Positives = 61/81 (75%), Gaps = 1/81 (1%)
Query: 113 YVANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTIG 172
Y ++Y +DPGV+SVTKIYNYYKK GYKT+VMGASFRNTGEIL LAGCD +TI
Sbjct: 196 YKKPEKAESYVGAQDPGVISVTKIYNYYKKHGYKTIVMGASFRNTGEILELAGCDKLTIS 255
Query: 173 PKLLEELENST-TPVDQMLSE 192
PKLLEEL N+ PV++ L
Sbjct: 256 PKLLEELANTEDGPVERKLDP 276
>gnl|CDD|183458 PRK12346, PRK12346, transaldolase A; Provisional.
Length = 316
Score = 123 bits (311), Expect = 2e-34
Identities = 46/85 (54%), Positives = 57/85 (67%)
Query: 113 YVANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTIG 172
Y A Y EDPGV SV IY+YYK+ Y+T+VMGASFR T +ILALAGCD +TI
Sbjct: 185 YQARKPMDPYVVEEDPGVKSVRNIYDYYKQHRYETIVMGASFRRTEQILALAGCDRLTIS 244
Query: 173 PKLLEELENSTTPVDQMLSEKSGKF 197
P LL+EL+ S +PV++ L S F
Sbjct: 245 PNLLKELQESESPVERKLIPSSQTF 269
>gnl|CDD|216195 pfam00923, Transaldolase, Transaldolase.
Length = 244
Score = 93.8 bits (234), Expect = 1e-23
Identities = 31/59 (52%), Positives = 38/59 (64%)
Query: 123 APTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTIGPKLLEELEN 181
A D GV + +IY YYKK+G+ T V+ ASFRN +LALAGCD +TI P LE L
Sbjct: 170 ALRGDDGVANAKEIYQYYKKYGWSTGVLAASFRNVLYVLALAGCDTITIPPDTLEALAK 228
>gnl|CDD|188631 cd00439, Transaldolase, Transaldolase. Transaldolase. Enzymes
found in the non-oxidative branch of the pentose
phosphate pathway, that catalyze the reversible transfer
of a dihydroxyacetone group from fructose-6-phosphate to
erythrose-4-phosphate yielding sedoheptulose-7-phosphate
and glyceraldehyde-3-phosphate. They are members of the
class I aldolases, who are characterized by using a
Schiff-base mechanism for stabilization of the reaction
intermediates.
Length = 252
Score = 88.2 bits (218), Expect = 1e-21
Identities = 20/66 (30%), Positives = 25/66 (37%)
Query: 114 VANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTIGP 173
GV VT Y YK+ K V+ ASF +T + L GCD +T P
Sbjct: 176 DKMLEQIGLDLRGKAGVAQVTLAYKLYKQKFKKQRVLWASFSDTLYVAPLIGCDTVTTMP 235
Query: 174 KLLEEL 179
E
Sbjct: 236 DQALEA 241
>gnl|CDD|223254 COG0176, MipB, Transaldolase [Carbohydrate transport and
metabolism].
Length = 239
Score = 85.4 bits (212), Expect = 1e-20
Identities = 26/66 (39%), Positives = 38/66 (57%)
Query: 128 PGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTIGPKLLEELENSTTPVD 187
G+ +IY+YYK+ G KT+V A F N I ALAG D++TI P LL++L +
Sbjct: 156 LGIAEAREIYDYYKQHGAKTLVASARFPNHVYIAALAGADVLTIPPDLLKQLLKHGGAMA 215
Query: 188 QMLSEK 193
L ++
Sbjct: 216 VPLLDE 221
>gnl|CDD|188643 cd00956, Transaldolase_FSA, Transaldolase-like fructose-6-phosphate
aldolases (FSA) found in bacteria and archaea.
Transaldolase-like fructose-6-phosphate aldolases (FSA)
found in bacteria and archaea, which are member of the
MipB/TalC subfamily of class I aldolases. FSA catalyze
an aldol cleavage of fructose 6-phosphate and do not
utilize fructose, fructose 1-phosphate, fructose
1,6-phosphate, or dihydroxyacetone phosphate. The
enzymes belong to the transaldolase family that serves
in transfer reactions in the pentose phosphate cycle,
and are more distantly related to fructose
1,6-bisphosphate aldolase.
Length = 211
Score = 44.1 bits (105), Expect = 1e-05
Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 17/84 (20%)
Query: 107 AKHLVRYVA-------NTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGE 159
AK YV+ + G D + + +I + +G+ T ++ AS RN
Sbjct: 119 AKAGATYVSPFVGRIDDLG-------GDGMEL-IREIRTIFDNYGFDTKILAASIRNPQH 170
Query: 160 IL--ALAGCDLMTIGPKLLEELEN 181
++ ALAG D +T+ P +LE+L
Sbjct: 171 VIEAALAGADAITLPPDVLEQLLK 194
>gnl|CDD|234947 PRK01362, PRK01362, putative translaldolase; Provisional.
Length = 214
Score = 35.9 bits (84), Expect = 0.006
Identities = 18/46 (39%), Positives = 29/46 (63%), Gaps = 5/46 (10%)
Query: 136 IYNYYKKFGYKTVVMGASFRNTGEIL--ALAGCDLMTIGPKLLEEL 179
IY+ Y G+ T ++ AS R+ +L ALAG D+ TI K++++L
Sbjct: 150 IYDNY---GFDTEIIAASVRHPMHVLEAALAGADIATIPYKVIKQL 192
>gnl|CDD|129953 TIGR00875, fsa_talC_mipB, fructose-6-phosphate aldolase, TalC/MipB
family. This model represents a family that includes
the E. coli transaldolase homologs TalC and MipB, both
shown to be fructose-6-phosphate aldolases rather than
transaldolases as previously thought. It is related to
but distinct from the transaldolase family of E. coli
TalA and TalB. The member from Bacillus subtilis becomes
phosphorylated during early stationary phase but not
during exponential growth [Energy metabolism, Pentose
phosphate pathway].
Length = 213
Score = 35.6 bits (82), Expect = 0.007
Identities = 16/67 (23%), Positives = 34/67 (50%), Gaps = 4/67 (5%)
Query: 129 GVVSVTKIYNYYKKFGYKTVVMGASFRNTGEIL--ALAGCDLMTIGPKLLEELENSTTPV 186
G+ + ++ ++ T V+ AS R+ +L AL G D+ T+ ++++L N P+
Sbjct: 140 GMKLIEEVKTIFENHAPDTEVIAASVRHPRHVLEAALIGADIATMPLDVMQQLFN--HPL 197
Query: 187 DQMLSEK 193
+ E+
Sbjct: 198 TDIGLER 204
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 33.6 bits (78), Expect = 0.054
Identities = 14/56 (25%), Positives = 26/56 (46%), Gaps = 4/56 (7%)
Query: 39 DERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKATQTEDYLD 94
D+ L +L + +L+RE E+K + E L E +K+ K + ++ D
Sbjct: 514 DKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKE----ELEEKKEKLQEEED 565
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 33.1 bits (76), Expect = 0.082
Identities = 12/42 (28%), Positives = 23/42 (54%)
Query: 40 ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVS 81
E ++ L+E+ +L+RE EE + IE L +E+ ++
Sbjct: 428 EETVERLEEENSELKRELEELKREIEKLESELERFRREVRDK 469
Score = 32.4 bits (74), Expect = 0.15
Identities = 12/48 (25%), Positives = 28/48 (58%)
Query: 40 ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKAT 87
+R ++ ++ +L++E EEK K +E L ++ +++ R + + K T
Sbjct: 473 DREIRARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGT 520
>gnl|CDD|183703 PRK12724, PRK12724, flagellar biosynthesis regulator FlhF;
Provisional.
Length = 432
Score = 32.6 bits (74), Expect = 0.11
Identities = 15/42 (35%), Positives = 21/42 (50%)
Query: 27 VGIFLGNDSEYSDERNLQDLKEKVLQLQRESEEKNKIIENLR 68
+GI S ER LQDLKE + Q E+ K ++ L+
Sbjct: 54 IGIPEKQASREKIERKLQDLKELLKQKSYTEPERKKTLQTLK 95
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 31.8 bits (73), Expect = 0.13
Identities = 12/43 (27%), Positives = 26/43 (60%), Gaps = 2/43 (4%)
Query: 40 ERNLQDLKEKVLQLQRESEEKNKIIENLR-NEINAIK-QNRHV 80
E+ + +L+ K+ +++ EE+ +I E +EI +K QN+ +
Sbjct: 140 EKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQNQQL 182
>gnl|CDD|132655 TIGR03616, RutG, pyrimidine utilization transport protein G. This
protein is observed in operons extremely similar to that
characterized in E. coli K-12 responsible for the import
and catabolism of pyrimidines, primarily uracil. This
protein is a member of the uracil-xanthine permease
family defined by TIGR00801. As well as the The
Nucleobase:Cation Symporter-2 (NCS2) Family (TC 2.A.40).
Length = 429
Score = 32.5 bits (74), Expect = 0.13
Identities = 20/46 (43%), Positives = 25/46 (54%), Gaps = 2/46 (4%)
Query: 93 LDVYNNNADGQDENAKHLVRYVANTGTKTYAPTEDPGVVSVTKIYN 138
LD Y A D A L V TG TYA E+ GV++VTK+Y+
Sbjct: 278 LDPYMGRAFVGDGLATMLSGSVGGTGVTTYA--ENIGVMAVTKVYS 321
>gnl|CDD|197664 smart00338, BRLZ, basic region leucin zipper.
Length = 65
Score = 29.8 bits (68), Expect = 0.15
Identities = 14/36 (38%), Positives = 22/36 (61%)
Query: 41 RNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQ 76
+++L+ KV QL+ E+E K IE LR E+ +K
Sbjct: 26 AEIEELERKVEQLEAENERLKKEIERLRRELEKLKS 61
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 32.0 bits (73), Expect = 0.18
Identities = 17/57 (29%), Positives = 31/57 (54%), Gaps = 3/57 (5%)
Query: 28 GIFLGNDSEYSD---ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVS 81
GI L + D + DL+E++L L+ E +E + +E+L EIN +++ +V
Sbjct: 51 GIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVL 107
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 31.6 bits (72), Expect = 0.22
Identities = 18/91 (19%), Positives = 37/91 (40%), Gaps = 9/91 (9%)
Query: 33 NDSEYSD-ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKATQTED 91
DS+ S+ ++ ++++ ++ L + EE I+ L+ EI+ K + +
Sbjct: 36 QDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAE--------IKKLQK 87
Query: 92 YLDVYNNNADGQDENAKHLVRYVANTGTKTY 122
+ N + E K R + GT T
Sbjct: 88 EIAELKENIVERQELLKKRARAMQVNGTATS 118
Score = 27.0 bits (60), Expect = 7.1
Identities = 10/55 (18%), Positives = 24/55 (43%), Gaps = 2/55 (3%)
Query: 43 LQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQ--NRHVSYATKATQTEDYLDV 95
+ +L++++ Q + E ++ K I L+ I ++ + Y+DV
Sbjct: 68 IDELQKEIDQSKAEIKKLQKEIAELKENIVERQELLKKRARAMQVNGTATSYIDV 122
>gnl|CDD|213375 cd12086, DD_cGKI-beta, Dimerization/Docking domain of Cyclic
GMP-dependent Protein Kinase I beta. Cyclic
GMP-dependent Protein Kinase I (PKG1 or cGKI) is a
Serine/Threonine Kinase (STK), catalyzing the transfer
of the gamma-phosphoryl group from ATP to
serine/threonine residues on protein substrates. cGKI
exists as two splice variants, cGKI-alpha and
cGKI-beta. They contain an N-terminal regulatory domain
containing a dimerization/docking region and an
autoinhibitory pseudosubstrate region, two cGMP-binding
domains, and a C-terminal catalytic domain. Binding of
cGMP to both binding sites releases the inhibition of
the catalytic center by the pseudosubstrate region,
allowing autophosphorylation and activation of the
kinase. cGKI is a soluble protein expressed in all
smooth muscles, platelets, cerebellum, and kidney. It
is involved in the regulation of smooth muscle tone,
smooth cell proliferation, and platelet activation. The
dimerization/docking (D/D) domain is a
leucine/isoleucine zipper that mediates both
homodimerization and interaction with isotype-specific
G-kinase-anchoring proteins (GKAPs). The D/D domain of
the two variants (alpha and beta) differ, allowing for
their targeting to different subcellular compartments
and intracellular substrates. cGKI-beta binds
specifically to inositol triphosphate
receptor-associated PKG substrate (IRAG) and the
transcriptional regulator TFII-I. Phosphorylation of
IRAG by cGKI-beta contributes to smooth muscle
relaxation while phosphorylation of TFII-I modulates
its co-activator functions for serum response factor
and Smad transcription factors.
Length = 52
Score = 28.8 bits (65), Expect = 0.28
Identities = 10/46 (21%), Positives = 31/46 (67%), Gaps = 3/46 (6%)
Query: 34 DSEYSDERNLQDLKEK---VLQLQRESEEKNKIIENLRNEINAIKQ 76
D + + + ++L+++ + +L++E +EK+ +I++L+NE++ +
Sbjct: 3 DLQRALQEKTEELRKRDELIKELEQELDEKDALIQHLQNELDKYRS 48
>gnl|CDD|226396 COG3879, COG3879, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 247
Score = 30.8 bits (70), Expect = 0.37
Identities = 14/67 (20%), Positives = 27/67 (40%), Gaps = 5/67 (7%)
Query: 32 GNDSEYSDERN-LQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKATQTE 90
S R DL +++ LQ++ +E+L N+++++ S T E
Sbjct: 40 QTSKGESVRRARDLDLVKELRSLQKKVNTLAAEVEDLENKLDSV----RRSVLTDDAALE 95
Query: 91 DYLDVYN 97
D L+
Sbjct: 96 DRLEKLR 102
>gnl|CDD|213373 cd12083, DD_cGKI, Dimerization/Docking domain of Cyclic
GMP-dependent Protein Kinase I. Cyclic GMP-dependent
Protein Kinase I (PKG1 or cGKI) is a Serine/Threonine
Kinase (STK), catalyzing the transfer of the
gamma-phosphoryl group from ATP to serine/threonine
residues on protein substrates. cGKI exists as two
splice variants, cGKI-alpha and cGKI-beta. They contain
an N-terminal regulatory domain containing a
dimerization/docking region and an autoinhibitory
pseudosubstrate region, two cGMP-binding domains, and a
C-terminal catalytic domain. Binding of cGMP to both
binding sites releases the inhibition of the catalytic
center by the pseudosubstrate region, allowing
autophosphorylation and activation of the kinase. cGKI
is a soluble protein expressed in all smooth muscles,
platelets, cerebellum, and kidney. It is also expressed
at lower concentrations in other tissues. It is
involved in the regulation of smooth muscle tone,
smooth cell proliferation, and platelet activation. The
dimerization/docking (D/D) domain is a
leucine/isoleucine zipper that mediates both
homodimerization and interaction with isotype-specific
G-kinase-anchoring proteins (GKAPs). The D/D domain of
the two variants (alpha and beta) differ, allowing
their targeting to different subcellular compartments
and intracellular substrates.
Length = 48
Score = 28.3 bits (64), Expect = 0.41
Identities = 9/29 (31%), Positives = 22/29 (75%)
Query: 43 LQDLKEKVLQLQRESEEKNKIIENLRNEI 71
L+ E++ +L++E +EK++ I+ LR+++
Sbjct: 11 LRKKDERIRELEQELQEKDEEIQELRSQL 39
>gnl|CDD|232804 TIGR00064, ftsY, signal recognition particle-docking protein FtsY.
There is a weak division between FtsY and SRP54; both
are GTPases. In E.coli, ftsY is an essential gene
located in an operon with cell division genes ftsE and
ftsX, but its apparent function is as the signal
recognition particle docking protein [Protein fate,
Protein and peptide secretion and trafficking].
Length = 272
Score = 29.5 bits (67), Expect = 0.89
Identities = 28/125 (22%), Positives = 43/125 (34%), Gaps = 23/125 (18%)
Query: 39 DERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKA-TQTEDYLDV-Y 96
DE ++L+E +L+ E KIIE L+ E+ + V A ++YL
Sbjct: 3 DEDFFEELEEILLESDVGYEVVEKIIEALKKELK----GKKVKDAELLKEILKEYLKEIL 58
Query: 97 NNNADGQDENAKHLVRY----VANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGA 152
V G T ++ K+ N KK G V++ A
Sbjct: 59 KETDLELIVEENKPNVILFVGVNGVGKTT----------TIAKLANKLKKQGKS-VLLAA 107
Query: 153 S--FR 155
FR
Sbjct: 108 GDTFR 112
>gnl|CDD|225616 COG3074, COG3074, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 79
Score = 28.0 bits (62), Expect = 0.92
Identities = 10/38 (26%), Positives = 19/38 (50%)
Query: 43 LQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHV 80
+++LKEK L +E + E L E +K+ ++
Sbjct: 27 IEELKEKNNSLSQEVQNAQHQREALERENEQLKEEQNG 64
>gnl|CDD|130492 TIGR01425, SRP54_euk, signal recognition particle protein SRP54.
This model represents examples from the eukaryotic
cytosol of the signal recognition particle protein
component, SRP54. This GTP-binding protein is a
component of the eukaryotic signal recognition particle,
along with several other protein subunits and a 7S RNA.
Some species, including Arabidopsis, have several
closely related forms. The extreme C-terminal region is
glycine-rich and lower in complexity, poorly conserved
between species, and excluded from this model.
Length = 429
Score = 29.8 bits (67), Expect = 0.93
Identities = 29/131 (22%), Positives = 49/131 (37%), Gaps = 27/131 (20%)
Query: 39 DERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKATQTEDYLDVYNN 98
DE L + +++ ES+ K++ LR I KA E+ N
Sbjct: 23 DEEVLNAMLKEICTALLESDVNIKLVRQLRENIK------------KAINLEEMASGLNK 70
Query: 99 NADGQDENAKHLVRYVANTGTKTYAPT-EDPGVV------------SVTKIYNYYKKFGY 145
Q K L V + G + + P V+ + TK+ YY++ G+
Sbjct: 71 RKMIQHAVFKELCNLV-DPGVEAFTPKKGKQNVIMFVGLQGSGKTTTCTKLAYYYQRKGF 129
Query: 146 KTVVMGA-SFR 155
K ++ A +FR
Sbjct: 130 KPCLVCADTFR 140
>gnl|CDD|236263 PRK08425, flgE, flagellar hook protein FlgE; Validated.
Length = 731
Score = 30.0 bits (68), Expect = 0.94
Identities = 18/54 (33%), Positives = 25/54 (46%), Gaps = 4/54 (7%)
Query: 66 NLRNEINAIKQNRHVSYATKATQTEDYLDVYNNNADGQDENAKHLVRYVANTGT 119
+N INA K V +A + L + N N+ G E K++V V TGT
Sbjct: 297 AAKNAINAQKSKTGV----EAYLDGNQLRLENTNSLGTTEKMKNIVITVNGTGT 346
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 29.6 bits (67), Expect = 0.97
Identities = 8/35 (22%), Positives = 18/35 (51%)
Query: 43 LQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQN 77
L+ ++++ L + EK K +E L+ E+ +
Sbjct: 209 LEKARQELRSLSVKISEKRKQLEELQQELQELTIA 243
>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16). Autophagy is a
ubiquitous intracellular degradation system for
eukaryotic cells. During autophagy, cytoplasmic
components are enclosed in autophagosomes and delivered
to lysosomes/vacuoles. ATG16 (also known as Apg16) has
been shown to be bind to Apg5 and is required for the
function of the Apg12p-Apg5p conjugate in the yeast
autophagy pathway.
Length = 194
Score = 29.4 bits (66), Expect = 1.0
Identities = 9/35 (25%), Positives = 20/35 (57%)
Query: 40 ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAI 74
+ L+ ++ L+ E +EK K E L++E+ ++
Sbjct: 122 RSEITSLETEIRDLREELQEKEKDNETLQDELISL 156
>gnl|CDD|224346 COG1429, CobN, Cobalamin biosynthesis protein CobN and related
Mg-chelatases [Coenzyme metabolism].
Length = 1388
Score = 29.3 bits (66), Expect = 1.4
Identities = 15/52 (28%), Positives = 22/52 (42%), Gaps = 4/52 (7%)
Query: 90 EDYLDVYNNNADGQDENAKHLVRYVANTGTKTYA----PTEDPGVVSVTKIY 137
EDYL VY + G EN ++L+ Y+A+ +P IY
Sbjct: 124 EDYLRVYLYLSGGGVENLRNLLLYLADLYDDLADGIDYAYIEPVRPPKEGIY 175
>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator. DivIC from B.
subtilis is necessary for both vegetative and
sporulation septum formation. These proteins are mainly
composed of an amino terminal coiled-coil.
Length = 80
Score = 27.5 bits (62), Expect = 1.5
Identities = 8/38 (21%), Positives = 15/38 (39%)
Query: 41 RNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNR 78
L +++ LQ E + E L E+ +K +
Sbjct: 17 SAYYQLNQEIAALQAELAKLKAENEELEAEVKDLKSDP 54
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 28.7 bits (64), Expect = 1.7
Identities = 16/42 (38%), Positives = 27/42 (64%), Gaps = 1/42 (2%)
Query: 36 EYSD-ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQ 76
EY D + + ++LKEK+ +LQ+E EE K +E L E +++
Sbjct: 129 EYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQE 170
>gnl|CDD|237420 PRK13537, PRK13537, nodulation ABC transporter NodI; Provisional.
Length = 306
Score = 28.6 bits (64), Expect = 1.9
Identities = 15/40 (37%), Positives = 18/40 (45%), Gaps = 1/40 (2%)
Query: 127 DPGVVSVTKIYNYYKKFGYKTVVMGASFR-NTGEILALAG 165
P V+ N K++G K VV G SF GE L G
Sbjct: 1 GPMSVAPIDFRNVEKRYGDKLVVDGLSFHVQRGECFGLLG 40
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD
and GSK-3beta in intact cells.
Length = 711
Score = 28.6 bits (64), Expect = 2.2
Identities = 12/42 (28%), Positives = 22/42 (52%), Gaps = 3/42 (7%)
Query: 40 ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVS 81
+Q+L EK+L L++ S E + +L E+ + HV+
Sbjct: 391 RNEIQNLAEKLLSLEKSSHEDQERSGDLEKELRKL---THVA 429
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 28.6 bits (65), Expect = 2.7
Identities = 14/52 (26%), Positives = 26/52 (50%), Gaps = 8/52 (15%)
Query: 34 DSEYSDERN-LQDLKEKVLQ----LQRESEE---KNKIIENLRNEINAIKQN 77
+ E + RN LQ L++++LQ L R+ E + + +E E+ +Q
Sbjct: 74 EKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQE 125
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 28.1 bits (63), Expect = 2.9
Identities = 10/41 (24%), Positives = 25/41 (60%), Gaps = 2/41 (4%)
Query: 36 EYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQ 76
E E + +L E+ ++L ++ +E ++ NL+NE+ +++
Sbjct: 204 ELRKEAD--ELHEEFVELSKKIDELHEEFRNLQNELRELEK 242
>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
production and conversion].
Length = 108
Score = 27.0 bits (60), Expect = 2.9
Identities = 12/49 (24%), Positives = 21/49 (42%), Gaps = 2/49 (4%)
Query: 40 ERNLQDLKEKVLQLQRE--SEEKNKIIENLRNEINAIKQNRHVSYATKA 86
E + L +++L+ RE EE +I+ E +AI +A
Sbjct: 49 EEEAEKLAQEILEEAREEAEEEAEEILAEAEKEASAILSKAAEGKVVEA 97
>gnl|CDD|197544 smart00150, SPEC, Spectrin repeats.
Length = 101
Score = 26.9 bits (60), Expect = 3.3
Identities = 9/50 (18%), Positives = 20/50 (40%)
Query: 30 FLGNDSEYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRH 79
L ++ D +++ L +K + E E + +E L + + H
Sbjct: 20 LLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLIEEGH 69
>gnl|CDD|213185 cd03218, ABC_YhbG, ATP-binding cassette component of YhbG transport
system. The ABC transporters belonging to the YhbG
family are similar to members of the Mj1267_LivG family,
which is involved in the transport of branched-chain
amino acids. The genes yhbG and yhbN are located in a
single operon and may function together in cell envelope
during biogenesis. YhbG is the putative ATP-binding
cassette component and YhbN is the putative
periplasmic-binding protein. Depletion of each gene
product leads to growth arrest, irreversible cell damage
and loss of viability in E. coli. The YhbG homolog
(NtrA) is essential in Rhizobium meliloti, a symbiotic
nitrogen-fixing bacterium.
Length = 232
Score = 27.9 bits (63), Expect = 3.5
Identities = 12/32 (37%), Positives = 17/32 (53%), Gaps = 1/32 (3%)
Query: 135 KIYNYYKKFGYKTVVMGASFR-NTGEILALAG 165
+ N K++G + VV G S GEI+ L G
Sbjct: 2 RAENLSKRYGKRKVVNGVSLSVKQGEIVGLLG 33
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 28.1 bits (63), Expect = 3.5
Identities = 11/40 (27%), Positives = 19/40 (47%)
Query: 36 EYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIK 75
E L++L+E L+ E E + +RNE+ +K
Sbjct: 364 EEKLSALLEELEELFEALREELAELEAELAEIRNELEELK 403
>gnl|CDD|146396 pfam03735, ENT, ENT domain. This presumed domain is named after
Emsy N Terminus (ENT). Emsy is a protein that is
amplified in breast cancer and interacts with BRCA2.
The N terminus of this protein is found to be similar
to other vertebrate and plant proteins of unknown
function. This domain has a completely conserved
histidine residue that may be functionally important.
Length = 74
Score = 26.1 bits (58), Expect = 3.6
Identities = 8/31 (25%), Positives = 13/31 (41%), Gaps = 1/31 (3%)
Query: 57 SEEKNKIIENLRNEINAIKQNRHVSYATKAT 87
S EK +++ LR E+ I H +
Sbjct: 28 SWEKEELLTELRKELR-ISDEEHRELLRRVN 57
>gnl|CDD|223671 COG0598, CorA, Mg2+ and Co2+ transporters [Inorganic ion transport
and metabolism].
Length = 322
Score = 27.7 bits (62), Expect = 4.0
Identities = 8/40 (20%), Positives = 18/40 (45%)
Query: 39 DERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNR 78
D L+D+ + + QL E + + +L + ++ N
Sbjct: 218 DREYLRDVLDHLTQLIEMLEALRERLSSLLDAYLSLINNN 257
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 27.7 bits (62), Expect = 4.0
Identities = 12/39 (30%), Positives = 19/39 (48%)
Query: 39 DERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQN 77
D L KEK+ +L +E K K +E L E+ ++
Sbjct: 202 DPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESK 240
Score = 27.7 bits (62), Expect = 4.5
Identities = 8/38 (21%), Positives = 20/38 (52%)
Query: 41 RNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNR 78
+ L++L+E++ +L+ + E+ L EI ++
Sbjct: 225 KKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKL 262
>gnl|CDD|197699 smart00397, t_SNARE, Helical region found in SNAREs. All
alpha-helical motifs that form twisted and parallel
four-helix bundles in target soluble
N-ethylmaleimide-sensitive factor (NSF) attachment
protein (SNAP) receptor proteins. This motif found in
"Q-SNAREs".
Length = 66
Score = 26.0 bits (58), Expect = 4.2
Identities = 9/38 (23%), Positives = 23/38 (60%)
Query: 40 ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQN 77
E+++Q+LK+ L + E EE+ + ++ + + ++ N
Sbjct: 18 EKSIQELKQIFLDMGTELEEQGEQLDRIEDNVDDADVN 55
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 28.1 bits (63), Expect = 4.2
Identities = 8/40 (20%), Positives = 22/40 (55%)
Query: 40 ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRH 79
E +++L+ ++ QL+ E + + ++ LR E+ + +
Sbjct: 781 EAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAA 820
>gnl|CDD|205299 pfam13118, DUF3972, Protein of unknown function (DUF3972). This is
a Proteobacterial family of unknown function. Some of
the proteins in this family are annotated as being
kinesin-like proteins.
Length = 126
Score = 26.9 bits (60), Expect = 4.4
Identities = 9/29 (31%), Positives = 16/29 (55%)
Query: 44 QDLKEKVLQLQRESEEKNKIIENLRNEIN 72
+ LKE + +Q +E K I+ LR ++
Sbjct: 95 KFLKEALYSMQELYDEDRKTIDLLREQLK 123
>gnl|CDD|130658 TIGR01597, PYST-B, Plasmodium yoelii subtelomeric family PYST-B.
This model represents a paralogous family of Plasmodium
yoelii genes preferentially located in the subtelomeric
regions of the chromosomes. There are no obvious
homologs to these genes in any other organism.
Length = 255
Score = 27.6 bits (61), Expect = 4.4
Identities = 25/127 (19%), Positives = 46/127 (36%), Gaps = 19/127 (14%)
Query: 47 KEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKATQTEDYLDVYNNNADGQDEN 106
K+ + +LQ+E EE K ++N N+ I + + + NN+ + E+
Sbjct: 112 KKLINKLQKELEELKKELDNEMNDELTI----------QPIHDKIIIKKDENNSVSEHED 161
Query: 107 AKHLVRYVANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGC 166
K L ++ ++ D KI KK K +V + C
Sbjct: 162 FKQLENEKNSSVSEHE--EFDIASSDNLKIKRKLKKLVKKLIV-------RCLTFIVIVC 212
Query: 167 DLMTIGP 173
++ GP
Sbjct: 213 SILVSGP 219
>gnl|CDD|240403 PTZ00400, PTZ00400, DnaK-type molecular chaperone; Provisional.
Length = 663
Score = 27.5 bits (61), Expect = 5.3
Identities = 20/84 (23%), Positives = 41/84 (48%), Gaps = 12/84 (14%)
Query: 33 NDSE---YSDERNLQDLKEKVLQLQRESEEKNKIIENLRN-----EINAIK-QNRHVSYA 83
N++E YS E+ L DLK+K+ ++ ++ I LR+ ++++IK + + + A
Sbjct: 571 NEAETLIYSVEKQLSDLKDKISDADKDELKQK--ITKLRSTLSSEDVDSIKDKTKQLQEA 628
Query: 84 TKATQTEDYLDVYNNNA-DGQDEN 106
+ + Y ++N Q N
Sbjct: 629 SWKISQQAYKQGNSDNQQSEQSTN 652
>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
in cytoskeletal structure; family members include
spectrin, alpha-actinin and dystrophin; the spectrin
repeat forms a three helix bundle with the second helix
interrupted by proline in some sequences; the repeats
are independent folding units; tandem repeats are found
in differing numbers and arrange in an antiparallel
manner to form dimers; the repeats are defined by a
characteristic tryptophan (W) residue in helix A and a
leucine (L) at the carboxyl end of helix C and separated
by a linker of 5 residues; two copies of the repeat are
present here.
Length = 213
Score = 27.0 bits (60), Expect = 5.5
Identities = 9/49 (18%), Positives = 23/49 (46%)
Query: 31 LGNDSEYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRH 79
L ++ D ++++L +K +L+ E E +++L + + H
Sbjct: 129 LASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGH 177
>gnl|CDD|177942 PLN02307, PLN02307, phosphoglucomutase.
Length = 579
Score = 27.3 bits (61), Expect = 6.1
Identities = 12/40 (30%), Positives = 22/40 (55%), Gaps = 1/40 (2%)
Query: 56 ESEEKNKIIENLRNEINAIKQNRHVSYATKATQTE-DYLD 94
+SE NK++++LR+ +N K+ T A + +Y D
Sbjct: 456 DSEAANKMMDHLRDLVNKSKKGIKYGVYTLAFADDFEYTD 495
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 27.4 bits (61), Expect = 6.1
Identities = 13/60 (21%), Positives = 28/60 (46%), Gaps = 1/60 (1%)
Query: 34 DSEYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEI-NAIKQNRHVSYATKATQTEDY 92
+ + LQ +KE+ +L++E + + I++ +N++ N KQ + K D
Sbjct: 205 KKDIKNPLELQKIKEEFDKLKKEGKADKQKIKSAKNDLQNDKKQLKADLAELKKAPQNDL 264
>gnl|CDD|165339 PHA03046, PHA03046, Hypothetical protein; Provisional.
Length = 142
Score = 26.6 bits (59), Expect = 6.4
Identities = 21/70 (30%), Positives = 32/70 (45%), Gaps = 7/70 (10%)
Query: 29 IFLGNDSEYSD-----ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYA 83
IF G+D E +D E+ ++ +L E EE N I N+RNEI I ++ ++
Sbjct: 7 IFPGDDDEENDRNEHREKTSEEDGHYKKRLDVEEEEPNLI--NIRNEIKKIINEKYSTFP 64
Query: 84 TKATQTEDYL 93
D L
Sbjct: 65 IHEDDISDIL 74
>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
Nexins. BAR domains are dimerization, lipid binding and
curvature sensing modules found in many different
proteins with diverse functions. Sorting nexins (SNXs)
are Phox homology (PX) domain containing proteins that
are involved in regulating membrane traffic and protein
sorting in the endosomal system. SNXs differ from each
other in their lipid-binding specificity, subcellular
localization and specific function in the endocytic
pathway. A subset of SNXs also contain BAR domains. The
PX-BAR structural unit determines the specific membrane
targeting of SNXs. BAR domains form dimers that bind to
membranes, induce membrane bending and curvature, and
may also be involved in protein-protein interactions.
Length = 218
Score = 26.9 bits (60), Expect = 6.7
Identities = 10/47 (21%), Positives = 21/47 (44%), Gaps = 4/47 (8%)
Query: 40 ERNLQDLKEKVLQLQRESEEKNK----IIENLRNEINAIKQNRHVSY 82
+++L+E++ + + EE K I E L+ E+ + R
Sbjct: 144 PAKVEELEEELEEAESALEEARKRYEEISERLKEELKRFHEERARDL 190
>gnl|CDD|226361 COG3842, PotA, ABC-type spermidine/putrescine transport systems,
ATPase components [Amino acid transport and metabolism].
Length = 352
Score = 27.2 bits (61), Expect = 6.8
Identities = 11/29 (37%), Positives = 12/29 (41%), Gaps = 1/29 (3%)
Query: 138 NYYKKFGYKTVVMGASFR-NTGEILALAG 165
N K FG T V S GE + L G
Sbjct: 10 NVSKSFGDFTAVDDISLDIKKGEFVTLLG 38
>gnl|CDD|107320 cd06325, PBP1_ABC_uncharacterized_transporter, Type I periplasmic
ligand-binding domain of uncharacterized ABC-type
transport systems that are predicted to be involved in
the uptake of amino acids, peptides, or inorganic ions.
This group includes the type I periplasmic
ligand-binding domain of uncharacterized ABC (ATPase
Binding Cassette)-type transport systems that are
predicted to be involved in the uptake of amino acids,
peptides, or inorganic ions. This subgroup has high
sequence similarity to members of the family of
hydrophobic amino acid transporters (HAAT), such as
leucine/isoleucine/valine binding protein (LIVBP); its
ligand specificity has not been determined
experimentally.
Length = 281
Score = 26.8 bits (60), Expect = 6.9
Identities = 11/43 (25%), Positives = 19/43 (44%), Gaps = 2/43 (4%)
Query: 122 YAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALA 164
Y P+E VV V ++ K G + V S ++ ++ A
Sbjct: 138 YNPSEANSVVQVKELKKAAAKLGIEVVEATVS--SSNDVQQAA 178
>gnl|CDD|201159 pfam00329, Complex1_30kDa, Respiratory-chain NADH dehydrogenase, 30
Kd subunit.
Length = 103
Score = 26.0 bits (58), Expect = 7.2
Identities = 9/24 (37%), Positives = 14/24 (58%), Gaps = 3/24 (12%)
Query: 120 KTYAPTEDPGVVSVTKIY---NYY 140
K P +DP + S+T I+ N+Y
Sbjct: 46 KVKLPRDDPEIPSITDIFPGANWY 69
>gnl|CDD|134340 PRK00650, PRK00650, 4-diphosphocytidyl-2-C-methyl-D-erythritol
kinase; Provisional.
Length = 288
Score = 26.7 bits (59), Expect = 7.5
Identities = 13/43 (30%), Positives = 25/43 (58%), Gaps = 4/43 (9%)
Query: 35 SEYSDERNLQ----DLKEKVLQLQRESEEKNKIIENLRNEINA 73
S+ S +NL+ DL++ V +L+ + +EK +E+L E+
Sbjct: 183 SDCSSRKNLEYTQNDLEKPVFRLRLDLKEKKHWLESLWAELPV 225
>gnl|CDD|203371 pfam06005, DUF904, Protein of unknown function (DUF904). This
family consists of several bacterial and archaeal
hypothetical proteins of unknown function.
Length = 72
Score = 25.3 bits (56), Expect = 7.8
Identities = 12/36 (33%), Positives = 22/36 (61%)
Query: 40 ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIK 75
+ +++LKE+ QL E+EE + + L+ E NA +
Sbjct: 24 QMEIEELKEENEQLSEENEELEEENKKLKEERNAWQ 59
>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 758
Score = 27.2 bits (60), Expect = 7.9
Identities = 9/34 (26%), Positives = 20/34 (58%)
Query: 44 QDLKEKVLQLQRESEEKNKIIENLRNEINAIKQN 77
L++K+ L++E + + ++ L NEI I++
Sbjct: 448 CGLEKKINNLEKEIKNNQEEVKKLSNEIKEIEKF 481
>gnl|CDD|222689 pfam14335, DUF4391, Domain of unknown function (DUF4391). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria and archaea.
Proteins in this family are typically between 220 and
257 amino acids in length.
Length = 221
Score = 26.5 bits (59), Expect = 8.1
Identities = 17/59 (28%), Positives = 28/59 (47%), Gaps = 2/59 (3%)
Query: 33 NDSEYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQ-NRHVSYATKATQTE 90
N S + +DLKE+V + + EE K I L+ ++ KQ NR V + + +
Sbjct: 161 NASSNTGVYEKEDLKERV-ERLEQIEELEKEIAKLKKKLKKEKQFNRKVELNLEIKKLK 218
>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
Length = 3164
Score = 27.3 bits (60), Expect = 8.3
Identities = 13/54 (24%), Positives = 24/54 (44%)
Query: 21 KYLPKSVGIFLGNDSEYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAI 74
L S+ D NL+ ++++ + + ES E I+ L++E NA
Sbjct: 2647 AKLMLSISNMEQTDEIAVLLHNLKKSRKEIEEEESESMEIEDRIDALKSEYNAS 2700
>gnl|CDD|238575 cd01170, THZ_kinase, 4-methyl-5-beta-hydroxyethylthiazole (Thz)
kinase catalyzes the phosphorylation of the
hydroxylgroup of Thz. A reaction that allows cells to
recycle Thz into the thiamine biosynthesis pathway, as
an alternative to its synthesis from cysteine, tyrosine
and 1-deoxy-D-xylulose-5-phosphate.
Length = 242
Score = 26.7 bits (60), Expect = 8.7
Identities = 16/54 (29%), Positives = 21/54 (38%), Gaps = 4/54 (7%)
Query: 142 KFGYKTVVMGASFRNTGEILALAGCDLMTIGPKLLEELENSTTPVDQMLSEKSG 195
G TV+ G N EI ALAG + G E + + L+ K G
Sbjct: 104 AEGQPTVIRG----NASEIAALAGLTGLGKGVDSSSSDEEDALELAKALARKYG 153
>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
Length = 865
Score = 27.0 bits (61), Expect = 8.9
Identities = 10/36 (27%), Positives = 17/36 (47%)
Query: 42 NLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQN 77
+L E+V L E +E K +E L+ ++ A
Sbjct: 713 KPSELPERVEALLEELKELEKELEQLKAKLAAAAAG 748
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 26.6 bits (59), Expect = 9.0
Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 2/64 (3%)
Query: 33 NDSEYSD-ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKATQTED 91
++ E +Q KE++ L+ E E + IE L EI + R + E
Sbjct: 87 DERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIED-LKERLERLEKNLAEAEA 145
Query: 92 YLDV 95
L+
Sbjct: 146 RLEE 149
>gnl|CDD|153296 cd07612, BAR_Bin2, The Bin/Amphiphysin/Rvs (BAR) domain of Bridging
integrator 2. BAR domains are dimerization, lipid
binding and curvature sensing modules found in many
different proteins with diverse functions. Bridging
integrator 2 (Bin2) is a BAR domain containing protein
that is mainly expressed in hematopoietic cells. It is
upregulated during granulocyte differentiation and is
thought to function primarily in this lineage. The BAR
domain of Bin2 is closely related to the BAR domains of
amphiphysins, which function primarily in endocytosis
and other membrane remodeling events. Amphiphysins
contain an N-terminal BAR domain with an additional
N-terminal amphipathic helix (an N-BAR), a variable
central domain, and a C-terminal SH3 domain. Unlike
amphiphysins, Bin2 does not appear to contain a
C-terminal SH3 domain. Amphiphysin I proteins, enriched
in the brain and nervous system, function in synaptic
vesicle endocytosis. Some amphiphysin II isoforms, also
called Bridging integrator 1 (Bin1), function in
intracellular vessicle trafficking. Bin2 can form a
stable complex with Bin1 in cells but cannot replace the
function of Bin1, and thus, appears to harbor a
nonredundant function. The N-BAR domain of amphiphysin
forms a curved dimer with a positively-charged concave
face that can drive membrane bending and curvature.
Length = 211
Score = 26.4 bits (58), Expect = 9.7
Identities = 18/75 (24%), Positives = 37/75 (49%), Gaps = 8/75 (10%)
Query: 31 LGNDSEYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKATQTE 90
LG E DE+ ++ + L + + N++ ++L+ +NA+K S + +QT
Sbjct: 3 LGKTVETKDEQ----FEQCAMNLNMQQSDGNRLYKDLKAYLNAVKVMHESS--KRLSQT- 55
Query: 91 DYLDVYNNNADGQDE 105
D+Y + DG ++
Sbjct: 56 -LQDIYEPDWDGHED 69
>gnl|CDD|238453 cd00890, Prefoldin, Prefoldin is a hexameric molecular chaperone
complex, found in both eukaryotes and archaea, that
binds and stabilizes newly synthesized polypeptides
allowing them to fold correctly. The complex contains
two alpha and four beta subunits, the two subunits being
evolutionarily related. In archaea, there is usually
only one gene for each subunit while in eukaryotes there
two or more paralogous genes encoding each subunit
adding heterogeneity to the structure of the hexamer.
The structure of the complex consists of a double beta
barrel assembly with six protruding coiled-coils.
Length = 129
Score = 25.8 bits (57), Expect = 9.7
Identities = 11/42 (26%), Positives = 27/42 (64%)
Query: 36 EYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQN 77
E S E ++ LK+++ L+++ E+ K +E L+++I +++
Sbjct: 82 EKSLEEAIEFLKKRLETLEKQIEKLEKQLEKLQDQITELQEE 123
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 27.0 bits (60), Expect = 9.8
Identities = 11/38 (28%), Positives = 22/38 (57%)
Query: 40 ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQN 77
E+ L+ L+E++ + E E K +E LR E+ +++
Sbjct: 618 EKELKKLEEELDKAFEELAETEKRLEELRKELEELEKK 655
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.132 0.374
Gapped
Lambda K H
0.267 0.0722 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,884,553
Number of extensions: 900230
Number of successful extensions: 1343
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1327
Number of HSP's successfully gapped: 157
Length of query: 199
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 107
Effective length of database: 6,857,034
Effective search space: 733702638
Effective search space used: 733702638
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 56 (25.4 bits)