RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10967
         (199 letters)



>gnl|CDD|235381 PRK05269, PRK05269, transaldolase B; Provisional.
          Length = 318

 Score =  163 bits (414), Expect = 1e-49
 Identities = 58/81 (71%), Positives = 63/81 (77%)

Query: 113 YVANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTIG 172
           Y  NTG K YAP EDPGVVSVTKIYNYYKK GYKTVVMGASFRNTG+IL LAGCD +TI 
Sbjct: 186 YKKNTGKKEYAPAEDPGVVSVTKIYNYYKKHGYKTVVMGASFRNTGQILELAGCDRLTIS 245

Query: 173 PKLLEELENSTTPVDQMLSEK 193
           P LLEEL  S   +++ LS  
Sbjct: 246 PALLEELAASEGELERKLSPP 266


>gnl|CDD|188644 cd00957, Transaldolase_TalAB, Transaldolases including both TalA
           and TalB.  Transaldolases including both TalA and TalB.
           The enzyme catalyses the reversible transfer of a
           dyhydroxyacetone moiety, derived from
           fructose-6-phosphate to erythrose-4-phosphate yielding
           sedoheptulose-7-phosphate and
           glyceraldehyde-3-phosphate. The catalytic mechanism is
           similar to other class I aldolases. The enzyme is found
           in the non-oxidative branch of the pentose phosphate
           pathway and forms a dimer in solution.
          Length = 313

 Score =  146 bits (370), Expect = 4e-43
 Identities = 54/80 (67%), Positives = 61/80 (76%)

Query: 112 RYVANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTI 171
            Y  ++G K Y   EDPGV SV KIYNYYKKFGYKT VMGASFRN G+ILALAGCD +TI
Sbjct: 183 WYKKHSGDKAYTAEEDPGVASVKKIYNYYKKFGYKTKVMGASFRNIGQILALAGCDYLTI 242

Query: 172 GPKLLEELENSTTPVDQMLS 191
            P LLEEL+NST  V++ L 
Sbjct: 243 SPALLEELKNSTAKVERKLD 262


>gnl|CDD|162081 TIGR00874, talAB, transaldolase.  This family includes the majority
           of known and predicted transaldolase sequences,
           including E. coli TalA and TalB. It excluded two other
           families. The first includes E. coli transaldolase-like
           protein TalC. The second family includes the putative
           transaldolases of Helicobacter pylori and Mycobacterium
           tuberculosis [Energy metabolism, Pentose phosphate
           pathway].
          Length = 317

 Score =  142 bits (360), Expect = 1e-41
 Identities = 55/84 (65%), Positives = 62/84 (73%)

Query: 113 YVANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTIG 172
           Y A TG K Y+  EDPGV SV KIYNYYKK GY T VMGASFRN  EILALAGCD +TI 
Sbjct: 184 YKAATGKKEYSIEEDPGVASVKKIYNYYKKHGYPTEVMGASFRNKEEILALAGCDRLTIS 243

Query: 173 PKLLEELENSTTPVDQMLSEKSGK 196
           P LL+EL+ ST PV++ L  +S K
Sbjct: 244 PALLDELKESTGPVERKLDPESAK 267


>gnl|CDD|183426 PRK12309, PRK12309, transaldolase/EF-hand domain-containing
           protein; Provisional.
          Length = 391

 Score =  140 bits (354), Expect = 4e-40
 Identities = 50/78 (64%), Positives = 59/78 (75%)

Query: 113 YVANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTIG 172
           Y   TG  +Y   EDPGV SVT+IYNYYKKFGYKT VMGASFRN GEI+ LAGCDL+TI 
Sbjct: 190 YKKETGRDSYPGAEDPGVQSVTQIYNYYKKFGYKTEVMGASFRNIGEIIELAGCDLLTIS 249

Query: 173 PKLLEELENSTTPVDQML 190
           PKLLE+L ++   + + L
Sbjct: 250 PKLLEQLRSTEAELPRKL 267


>gnl|CDD|240406 PTZ00411, PTZ00411, transaldolase-like protein; Provisional.
          Length = 333

 Score =  129 bits (327), Expect = 1e-36
 Identities = 52/81 (64%), Positives = 61/81 (75%), Gaps = 1/81 (1%)

Query: 113 YVANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTIG 172
           Y      ++Y   +DPGV+SVTKIYNYYKK GYKT+VMGASFRNTGEIL LAGCD +TI 
Sbjct: 196 YKKPEKAESYVGAQDPGVISVTKIYNYYKKHGYKTIVMGASFRNTGEILELAGCDKLTIS 255

Query: 173 PKLLEELENST-TPVDQMLSE 192
           PKLLEEL N+   PV++ L  
Sbjct: 256 PKLLEELANTEDGPVERKLDP 276


>gnl|CDD|183458 PRK12346, PRK12346, transaldolase A; Provisional.
          Length = 316

 Score =  123 bits (311), Expect = 2e-34
 Identities = 46/85 (54%), Positives = 57/85 (67%)

Query: 113 YVANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTIG 172
           Y A      Y   EDPGV SV  IY+YYK+  Y+T+VMGASFR T +ILALAGCD +TI 
Sbjct: 185 YQARKPMDPYVVEEDPGVKSVRNIYDYYKQHRYETIVMGASFRRTEQILALAGCDRLTIS 244

Query: 173 PKLLEELENSTTPVDQMLSEKSGKF 197
           P LL+EL+ S +PV++ L   S  F
Sbjct: 245 PNLLKELQESESPVERKLIPSSQTF 269


>gnl|CDD|216195 pfam00923, Transaldolase, Transaldolase. 
          Length = 244

 Score = 93.8 bits (234), Expect = 1e-23
 Identities = 31/59 (52%), Positives = 38/59 (64%)

Query: 123 APTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTIGPKLLEELEN 181
           A   D GV +  +IY YYKK+G+ T V+ ASFRN   +LALAGCD +TI P  LE L  
Sbjct: 170 ALRGDDGVANAKEIYQYYKKYGWSTGVLAASFRNVLYVLALAGCDTITIPPDTLEALAK 228


>gnl|CDD|188631 cd00439, Transaldolase, Transaldolase.  Transaldolase. Enzymes
           found in the non-oxidative branch of the pentose
           phosphate pathway, that catalyze the reversible transfer
           of a dihydroxyacetone group from fructose-6-phosphate to
           erythrose-4-phosphate yielding sedoheptulose-7-phosphate
           and glyceraldehyde-3-phosphate. They are members of the
           class I aldolases, who are characterized by using a
           Schiff-base mechanism for stabilization of the reaction
           intermediates.
          Length = 252

 Score = 88.2 bits (218), Expect = 1e-21
 Identities = 20/66 (30%), Positives = 25/66 (37%)

Query: 114 VANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTIGP 173
                          GV  VT  Y  YK+   K  V+ ASF +T  +  L GCD +T  P
Sbjct: 176 DKMLEQIGLDLRGKAGVAQVTLAYKLYKQKFKKQRVLWASFSDTLYVAPLIGCDTVTTMP 235

Query: 174 KLLEEL 179
               E 
Sbjct: 236 DQALEA 241


>gnl|CDD|223254 COG0176, MipB, Transaldolase [Carbohydrate transport and
           metabolism].
          Length = 239

 Score = 85.4 bits (212), Expect = 1e-20
 Identities = 26/66 (39%), Positives = 38/66 (57%)

Query: 128 PGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGCDLMTIGPKLLEELENSTTPVD 187
            G+    +IY+YYK+ G KT+V  A F N   I ALAG D++TI P LL++L      + 
Sbjct: 156 LGIAEAREIYDYYKQHGAKTLVASARFPNHVYIAALAGADVLTIPPDLLKQLLKHGGAMA 215

Query: 188 QMLSEK 193
             L ++
Sbjct: 216 VPLLDE 221


>gnl|CDD|188643 cd00956, Transaldolase_FSA, Transaldolase-like fructose-6-phosphate
           aldolases (FSA) found in bacteria and archaea.
           Transaldolase-like fructose-6-phosphate aldolases (FSA)
           found in bacteria and archaea, which are member of the
           MipB/TalC subfamily of class I aldolases. FSA catalyze
           an aldol cleavage of fructose 6-phosphate and do not
           utilize fructose, fructose 1-phosphate, fructose
           1,6-phosphate, or dihydroxyacetone phosphate. The
           enzymes belong to the transaldolase family that serves
           in transfer reactions in the pentose phosphate cycle,
           and are more distantly related to fructose
           1,6-bisphosphate aldolase.
          Length = 211

 Score = 44.1 bits (105), Expect = 1e-05
 Identities = 23/84 (27%), Positives = 39/84 (46%), Gaps = 17/84 (20%)

Query: 107 AKHLVRYVA-------NTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGE 159
           AK    YV+       + G        D   + + +I   +  +G+ T ++ AS RN   
Sbjct: 119 AKAGATYVSPFVGRIDDLG-------GDGMEL-IREIRTIFDNYGFDTKILAASIRNPQH 170

Query: 160 IL--ALAGCDLMTIGPKLLEELEN 181
           ++  ALAG D +T+ P +LE+L  
Sbjct: 171 VIEAALAGADAITLPPDVLEQLLK 194


>gnl|CDD|234947 PRK01362, PRK01362, putative translaldolase; Provisional.
          Length = 214

 Score = 35.9 bits (84), Expect = 0.006
 Identities = 18/46 (39%), Positives = 29/46 (63%), Gaps = 5/46 (10%)

Query: 136 IYNYYKKFGYKTVVMGASFRNTGEIL--ALAGCDLMTIGPKLLEEL 179
           IY+ Y   G+ T ++ AS R+   +L  ALAG D+ TI  K++++L
Sbjct: 150 IYDNY---GFDTEIIAASVRHPMHVLEAALAGADIATIPYKVIKQL 192


>gnl|CDD|129953 TIGR00875, fsa_talC_mipB, fructose-6-phosphate aldolase, TalC/MipB
           family.  This model represents a family that includes
           the E. coli transaldolase homologs TalC and MipB, both
           shown to be fructose-6-phosphate aldolases rather than
           transaldolases as previously thought. It is related to
           but distinct from the transaldolase family of E. coli
           TalA and TalB. The member from Bacillus subtilis becomes
           phosphorylated during early stationary phase but not
           during exponential growth [Energy metabolism, Pentose
           phosphate pathway].
          Length = 213

 Score = 35.6 bits (82), Expect = 0.007
 Identities = 16/67 (23%), Positives = 34/67 (50%), Gaps = 4/67 (5%)

Query: 129 GVVSVTKIYNYYKKFGYKTVVMGASFRNTGEIL--ALAGCDLMTIGPKLLEELENSTTPV 186
           G+  + ++   ++     T V+ AS R+   +L  AL G D+ T+   ++++L N   P+
Sbjct: 140 GMKLIEEVKTIFENHAPDTEVIAASVRHPRHVLEAALIGADIATMPLDVMQQLFN--HPL 197

Query: 187 DQMLSEK 193
             +  E+
Sbjct: 198 TDIGLER 204


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 33.6 bits (78), Expect = 0.054
 Identities = 14/56 (25%), Positives = 26/56 (46%), Gaps = 4/56 (7%)

Query: 39  DERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKATQTEDYLD 94
           D+  L +L   + +L+RE E+K +  E L  E   +K+        K  + ++  D
Sbjct: 514 DKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKE----ELEEKKEKLQEEED 565


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 33.1 bits (76), Expect = 0.082
 Identities = 12/42 (28%), Positives = 23/42 (54%)

Query: 40  ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVS 81
           E  ++ L+E+  +L+RE EE  + IE L +E+   ++     
Sbjct: 428 EETVERLEEENSELKRELEELKREIEKLESELERFRREVRDK 469



 Score = 32.4 bits (74), Expect = 0.15
 Identities = 12/48 (25%), Positives = 28/48 (58%)

Query: 40  ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKAT 87
           +R ++    ++ +L++E EEK K +E L  ++  +++ R +  + K T
Sbjct: 473 DREIRARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGT 520


>gnl|CDD|183703 PRK12724, PRK12724, flagellar biosynthesis regulator FlhF;
          Provisional.
          Length = 432

 Score = 32.6 bits (74), Expect = 0.11
 Identities = 15/42 (35%), Positives = 21/42 (50%)

Query: 27 VGIFLGNDSEYSDERNLQDLKEKVLQLQRESEEKNKIIENLR 68
          +GI     S    ER LQDLKE + Q      E+ K ++ L+
Sbjct: 54 IGIPEKQASREKIERKLQDLKELLKQKSYTEPERKKTLQTLK 95


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 31.8 bits (73), Expect = 0.13
 Identities = 12/43 (27%), Positives = 26/43 (60%), Gaps = 2/43 (4%)

Query: 40  ERNLQDLKEKVLQLQRESEEKNKIIENLR-NEINAIK-QNRHV 80
           E+ + +L+ K+  +++  EE+ +I E    +EI  +K QN+ +
Sbjct: 140 EKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQNQQL 182


>gnl|CDD|132655 TIGR03616, RutG, pyrimidine utilization transport protein G.  This
           protein is observed in operons extremely similar to that
           characterized in E. coli K-12 responsible for the import
           and catabolism of pyrimidines, primarily uracil. This
           protein is a member of the uracil-xanthine permease
           family defined by TIGR00801. As well as the The
           Nucleobase:Cation Symporter-2 (NCS2) Family (TC 2.A.40).
          Length = 429

 Score = 32.5 bits (74), Expect = 0.13
 Identities = 20/46 (43%), Positives = 25/46 (54%), Gaps = 2/46 (4%)

Query: 93  LDVYNNNADGQDENAKHLVRYVANTGTKTYAPTEDPGVVSVTKIYN 138
           LD Y   A   D  A  L   V  TG  TYA  E+ GV++VTK+Y+
Sbjct: 278 LDPYMGRAFVGDGLATMLSGSVGGTGVTTYA--ENIGVMAVTKVYS 321


>gnl|CDD|197664 smart00338, BRLZ, basic region leucin zipper. 
          Length = 65

 Score = 29.8 bits (68), Expect = 0.15
 Identities = 14/36 (38%), Positives = 22/36 (61%)

Query: 41 RNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQ 76
            +++L+ KV QL+ E+E   K IE LR E+  +K 
Sbjct: 26 AEIEELERKVEQLEAENERLKKEIERLRRELEKLKS 61


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 32.0 bits (73), Expect = 0.18
 Identities = 17/57 (29%), Positives = 31/57 (54%), Gaps = 3/57 (5%)

Query: 28  GIFLGNDSEYSD---ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVS 81
           GI L +     D    +   DL+E++L L+ E +E  + +E+L  EIN +++  +V 
Sbjct: 51  GIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVL 107


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 31.6 bits (72), Expect = 0.22
 Identities = 18/91 (19%), Positives = 37/91 (40%), Gaps = 9/91 (9%)

Query: 33  NDSEYSD-ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKATQTED 91
            DS+ S+ ++  ++++ ++  L  + EE    I+ L+ EI+  K            + + 
Sbjct: 36  QDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAE--------IKKLQK 87

Query: 92  YLDVYNNNADGQDENAKHLVRYVANTGTKTY 122
            +     N   + E  K   R +   GT T 
Sbjct: 88  EIAELKENIVERQELLKKRARAMQVNGTATS 118



 Score = 27.0 bits (60), Expect = 7.1
 Identities = 10/55 (18%), Positives = 24/55 (43%), Gaps = 2/55 (3%)

Query: 43  LQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQ--NRHVSYATKATQTEDYLDV 95
           + +L++++ Q + E ++  K I  L+  I   ++   +             Y+DV
Sbjct: 68  IDELQKEIDQSKAEIKKLQKEIAELKENIVERQELLKKRARAMQVNGTATSYIDV 122


>gnl|CDD|213375 cd12086, DD_cGKI-beta, Dimerization/Docking domain of Cyclic
          GMP-dependent Protein Kinase I beta.  Cyclic
          GMP-dependent Protein Kinase I (PKG1 or cGKI) is a
          Serine/Threonine Kinase (STK), catalyzing the transfer
          of the gamma-phosphoryl group from ATP to
          serine/threonine residues on protein substrates. cGKI
          exists as two splice variants, cGKI-alpha and
          cGKI-beta. They contain an N-terminal regulatory domain
          containing a dimerization/docking region and an
          autoinhibitory pseudosubstrate region, two cGMP-binding
          domains, and a C-terminal catalytic domain. Binding of
          cGMP to both binding sites releases the inhibition of
          the catalytic center by the pseudosubstrate region,
          allowing autophosphorylation and activation of the
          kinase. cGKI is a  soluble protein expressed in all
          smooth muscles, platelets, cerebellum, and kidney. It
          is involved in the regulation of smooth muscle tone,
          smooth cell proliferation, and platelet activation. The
          dimerization/docking (D/D) domain is a
          leucine/isoleucine zipper that mediates both
          homodimerization and interaction with isotype-specific
          G-kinase-anchoring proteins (GKAPs). The D/D domain of
          the two variants (alpha and beta) differ, allowing for
          their targeting to different subcellular compartments
          and intracellular substrates. cGKI-beta binds
          specifically to inositol triphosphate
          receptor-associated PKG substrate (IRAG) and the
          transcriptional regulator TFII-I. Phosphorylation of
          IRAG by cGKI-beta contributes to smooth muscle
          relaxation while phosphorylation of TFII-I modulates
          its co-activator functions for serum response factor
          and Smad transcription factors.
          Length = 52

 Score = 28.8 bits (65), Expect = 0.28
 Identities = 10/46 (21%), Positives = 31/46 (67%), Gaps = 3/46 (6%)

Query: 34 DSEYSDERNLQDLKEK---VLQLQRESEEKNKIIENLRNEINAIKQ 76
          D + + +   ++L+++   + +L++E +EK+ +I++L+NE++  + 
Sbjct: 3  DLQRALQEKTEELRKRDELIKELEQELDEKDALIQHLQNELDKYRS 48


>gnl|CDD|226396 COG3879, COG3879, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 247

 Score = 30.8 bits (70), Expect = 0.37
 Identities = 14/67 (20%), Positives = 27/67 (40%), Gaps = 5/67 (7%)

Query: 32  GNDSEYSDERN-LQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKATQTE 90
                 S  R    DL +++  LQ++       +E+L N+++++      S  T     E
Sbjct: 40  QTSKGESVRRARDLDLVKELRSLQKKVNTLAAEVEDLENKLDSV----RRSVLTDDAALE 95

Query: 91  DYLDVYN 97
           D L+   
Sbjct: 96  DRLEKLR 102


>gnl|CDD|213373 cd12083, DD_cGKI, Dimerization/Docking domain of Cyclic
          GMP-dependent Protein Kinase I.  Cyclic GMP-dependent
          Protein Kinase I (PKG1 or cGKI) is a Serine/Threonine
          Kinase (STK), catalyzing the transfer of the
          gamma-phosphoryl group from ATP to serine/threonine
          residues on protein substrates. cGKI exists as two
          splice variants, cGKI-alpha and cGKI-beta. They contain
          an N-terminal regulatory domain containing a
          dimerization/docking region and an autoinhibitory
          pseudosubstrate region, two cGMP-binding domains, and a
          C-terminal catalytic domain. Binding of cGMP to both
          binding sites releases the inhibition of the catalytic
          center by the pseudosubstrate region, allowing
          autophosphorylation and activation of the kinase. cGKI
          is a  soluble protein expressed in all smooth muscles,
          platelets, cerebellum, and kidney. It is also expressed
          at lower concentrations in other tissues. It is
          involved in the regulation of smooth muscle tone,
          smooth cell proliferation, and platelet activation. The
          dimerization/docking (D/D) domain is a
          leucine/isoleucine zipper that mediates both
          homodimerization and interaction with isotype-specific
          G-kinase-anchoring proteins (GKAPs). The D/D domain of
          the two variants (alpha and beta) differ, allowing
          their targeting to different subcellular compartments
          and intracellular substrates.
          Length = 48

 Score = 28.3 bits (64), Expect = 0.41
 Identities = 9/29 (31%), Positives = 22/29 (75%)

Query: 43 LQDLKEKVLQLQRESEEKNKIIENLRNEI 71
          L+   E++ +L++E +EK++ I+ LR+++
Sbjct: 11 LRKKDERIRELEQELQEKDEEIQELRSQL 39


>gnl|CDD|232804 TIGR00064, ftsY, signal recognition particle-docking protein FtsY. 
           There is a weak division between FtsY and SRP54; both
           are GTPases. In E.coli, ftsY is an essential gene
           located in an operon with cell division genes ftsE and
           ftsX, but its apparent function is as the signal
           recognition particle docking protein [Protein fate,
           Protein and peptide secretion and trafficking].
          Length = 272

 Score = 29.5 bits (67), Expect = 0.89
 Identities = 28/125 (22%), Positives = 43/125 (34%), Gaps = 23/125 (18%)

Query: 39  DERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKA-TQTEDYLDV-Y 96
           DE   ++L+E +L+     E   KIIE L+ E+      + V  A       ++YL    
Sbjct: 3   DEDFFEELEEILLESDVGYEVVEKIIEALKKELK----GKKVKDAELLKEILKEYLKEIL 58

Query: 97  NNNADGQDENAKHLVRY----VANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGA 152
                                V   G  T          ++ K+ N  KK G   V++ A
Sbjct: 59  KETDLELIVEENKPNVILFVGVNGVGKTT----------TIAKLANKLKKQGKS-VLLAA 107

Query: 153 S--FR 155
              FR
Sbjct: 108 GDTFR 112


>gnl|CDD|225616 COG3074, COG3074, Uncharacterized protein conserved in bacteria
          [Function unknown].
          Length = 79

 Score = 28.0 bits (62), Expect = 0.92
 Identities = 10/38 (26%), Positives = 19/38 (50%)

Query: 43 LQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHV 80
          +++LKEK   L +E +      E L  E   +K+ ++ 
Sbjct: 27 IEELKEKNNSLSQEVQNAQHQREALERENEQLKEEQNG 64


>gnl|CDD|130492 TIGR01425, SRP54_euk, signal recognition particle protein SRP54.
           This model represents examples from the eukaryotic
           cytosol of the signal recognition particle protein
           component, SRP54. This GTP-binding protein is a
           component of the eukaryotic signal recognition particle,
           along with several other protein subunits and a 7S RNA.
           Some species, including Arabidopsis, have several
           closely related forms. The extreme C-terminal region is
           glycine-rich and lower in complexity, poorly conserved
           between species, and excluded from this model.
          Length = 429

 Score = 29.8 bits (67), Expect = 0.93
 Identities = 29/131 (22%), Positives = 49/131 (37%), Gaps = 27/131 (20%)

Query: 39  DERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKATQTEDYLDVYNN 98
           DE  L  + +++     ES+   K++  LR  I             KA   E+     N 
Sbjct: 23  DEEVLNAMLKEICTALLESDVNIKLVRQLRENIK------------KAINLEEMASGLNK 70

Query: 99  NADGQDENAKHLVRYVANTGTKTYAPT-EDPGVV------------SVTKIYNYYKKFGY 145
               Q    K L   V + G + + P      V+            + TK+  YY++ G+
Sbjct: 71  RKMIQHAVFKELCNLV-DPGVEAFTPKKGKQNVIMFVGLQGSGKTTTCTKLAYYYQRKGF 129

Query: 146 KTVVMGA-SFR 155
           K  ++ A +FR
Sbjct: 130 KPCLVCADTFR 140


>gnl|CDD|236263 PRK08425, flgE, flagellar hook protein FlgE; Validated.
          Length = 731

 Score = 30.0 bits (68), Expect = 0.94
 Identities = 18/54 (33%), Positives = 25/54 (46%), Gaps = 4/54 (7%)

Query: 66  NLRNEINAIKQNRHVSYATKATQTEDYLDVYNNNADGQDENAKHLVRYVANTGT 119
             +N INA K    V    +A    + L + N N+ G  E  K++V  V  TGT
Sbjct: 297 AAKNAINAQKSKTGV----EAYLDGNQLRLENTNSLGTTEKMKNIVITVNGTGT 346


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 29.6 bits (67), Expect = 0.97
 Identities = 8/35 (22%), Positives = 18/35 (51%)

Query: 43  LQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQN 77
           L+  ++++  L  +  EK K +E L+ E+  +   
Sbjct: 209 LEKARQELRSLSVKISEKRKQLEELQQELQELTIA 243


>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16).  Autophagy is a
           ubiquitous intracellular degradation system for
           eukaryotic cells. During autophagy, cytoplasmic
           components are enclosed in autophagosomes and delivered
           to lysosomes/vacuoles. ATG16 (also known as Apg16) has
           been shown to be bind to Apg5 and is required for the
           function of the Apg12p-Apg5p conjugate in the yeast
           autophagy pathway.
          Length = 194

 Score = 29.4 bits (66), Expect = 1.0
 Identities = 9/35 (25%), Positives = 20/35 (57%)

Query: 40  ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAI 74
              +  L+ ++  L+ E +EK K  E L++E+ ++
Sbjct: 122 RSEITSLETEIRDLREELQEKEKDNETLQDELISL 156


>gnl|CDD|224346 COG1429, CobN, Cobalamin biosynthesis protein CobN and related
           Mg-chelatases [Coenzyme metabolism].
          Length = 1388

 Score = 29.3 bits (66), Expect = 1.4
 Identities = 15/52 (28%), Positives = 22/52 (42%), Gaps = 4/52 (7%)

Query: 90  EDYLDVYNNNADGQDENAKHLVRYVANTGTKTYA----PTEDPGVVSVTKIY 137
           EDYL VY   + G  EN ++L+ Y+A+              +P       IY
Sbjct: 124 EDYLRVYLYLSGGGVENLRNLLLYLADLYDDLADGIDYAYIEPVRPPKEGIY 175


>gnl|CDD|203138 pfam04977, DivIC, Septum formation initiator.  DivIC from B.
          subtilis is necessary for both vegetative and
          sporulation septum formation. These proteins are mainly
          composed of an amino terminal coiled-coil.
          Length = 80

 Score = 27.5 bits (62), Expect = 1.5
 Identities = 8/38 (21%), Positives = 15/38 (39%)

Query: 41 RNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNR 78
               L +++  LQ E  +     E L  E+  +K + 
Sbjct: 17 SAYYQLNQEIAALQAELAKLKAENEELEAEVKDLKSDP 54


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 28.7 bits (64), Expect = 1.7
 Identities = 16/42 (38%), Positives = 27/42 (64%), Gaps = 1/42 (2%)

Query: 36  EYSD-ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQ 76
           EY D + + ++LKEK+ +LQ+E EE  K +E L  E   +++
Sbjct: 129 EYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQE 170


>gnl|CDD|237420 PRK13537, PRK13537, nodulation ABC transporter NodI; Provisional.
          Length = 306

 Score = 28.6 bits (64), Expect = 1.9
 Identities = 15/40 (37%), Positives = 18/40 (45%), Gaps = 1/40 (2%)

Query: 127 DPGVVSVTKIYNYYKKFGYKTVVMGASFR-NTGEILALAG 165
            P  V+     N  K++G K VV G SF    GE   L G
Sbjct: 1   GPMSVAPIDFRNVEKRYGDKLVVDGLSFHVQRGECFGLLG 40


>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D.  BicD
           proteins consist of three coiled-coiled domains and are
           involved in dynein-mediated minus end-directed transport
           from the Golgi apparatus to the endoplasmic reticulum
           (ER). For full functioning they bind with GSK-3beta
           pfam05350 to maintain the anchoring of microtubules to
           the centromere. It appears that amino-acid residues
           437-617 of BicD and the kinase activity of GSK-3 are
           necessary for the formation of a complex between BicD
           and GSK-3beta in intact cells.
          Length = 711

 Score = 28.6 bits (64), Expect = 2.2
 Identities = 12/42 (28%), Positives = 22/42 (52%), Gaps = 3/42 (7%)

Query: 40  ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVS 81
              +Q+L EK+L L++ S E  +   +L  E+  +    HV+
Sbjct: 391 RNEIQNLAEKLLSLEKSSHEDQERSGDLEKELRKL---THVA 429


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 28.6 bits (65), Expect = 2.7
 Identities = 14/52 (26%), Positives = 26/52 (50%), Gaps = 8/52 (15%)

Query: 34  DSEYSDERN-LQDLKEKVLQ----LQRESEE---KNKIIENLRNEINAIKQN 77
           + E  + RN LQ L++++LQ    L R+ E    + + +E    E+   +Q 
Sbjct: 74  EKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQE 125


>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
           [Function unknown].
          Length = 294

 Score = 28.1 bits (63), Expect = 2.9
 Identities = 10/41 (24%), Positives = 25/41 (60%), Gaps = 2/41 (4%)

Query: 36  EYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQ 76
           E   E +  +L E+ ++L ++ +E ++   NL+NE+  +++
Sbjct: 204 ELRKEAD--ELHEEFVELSKKIDELHEEFRNLQNELRELEK 242


>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
          production and conversion].
          Length = 108

 Score = 27.0 bits (60), Expect = 2.9
 Identities = 12/49 (24%), Positives = 21/49 (42%), Gaps = 2/49 (4%)

Query: 40 ERNLQDLKEKVLQLQRE--SEEKNKIIENLRNEINAIKQNRHVSYATKA 86
          E   + L +++L+  RE   EE  +I+     E +AI          +A
Sbjct: 49 EEEAEKLAQEILEEAREEAEEEAEEILAEAEKEASAILSKAAEGKVVEA 97


>gnl|CDD|197544 smart00150, SPEC, Spectrin repeats. 
          Length = 101

 Score = 26.9 bits (60), Expect = 3.3
 Identities = 9/50 (18%), Positives = 20/50 (40%)

Query: 30 FLGNDSEYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRH 79
           L ++    D  +++ L +K    + E E   + +E L      + +  H
Sbjct: 20 LLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLIEEGH 69


>gnl|CDD|213185 cd03218, ABC_YhbG, ATP-binding cassette component of YhbG transport
           system.  The ABC transporters belonging to the YhbG
           family are similar to members of the Mj1267_LivG family,
           which is involved in the transport of branched-chain
           amino acids. The genes yhbG and yhbN are located in a
           single operon and may function together in cell envelope
           during biogenesis. YhbG is the putative ATP-binding
           cassette component and YhbN is the putative
           periplasmic-binding protein. Depletion of each gene
           product leads to growth arrest, irreversible cell damage
           and loss of viability in E. coli. The YhbG homolog
           (NtrA) is essential in Rhizobium meliloti, a symbiotic
           nitrogen-fixing bacterium.
          Length = 232

 Score = 27.9 bits (63), Expect = 3.5
 Identities = 12/32 (37%), Positives = 17/32 (53%), Gaps = 1/32 (3%)

Query: 135 KIYNYYKKFGYKTVVMGASFR-NTGEILALAG 165
           +  N  K++G + VV G S     GEI+ L G
Sbjct: 2   RAENLSKRYGKRKVVNGVSLSVKQGEIVGLLG 33


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 28.1 bits (63), Expect = 3.5
 Identities = 11/40 (27%), Positives = 19/40 (47%)

Query: 36  EYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIK 75
           E      L++L+E    L+ E  E    +  +RNE+  +K
Sbjct: 364 EEKLSALLEELEELFEALREELAELEAELAEIRNELEELK 403


>gnl|CDD|146396 pfam03735, ENT, ENT domain.  This presumed domain is named after
          Emsy N Terminus (ENT). Emsy is a protein that is
          amplified in breast cancer and interacts with BRCA2.
          The N terminus of this protein is found to be similar
          to other vertebrate and plant proteins of unknown
          function. This domain has a completely conserved
          histidine residue that may be functionally important.
          Length = 74

 Score = 26.1 bits (58), Expect = 3.6
 Identities = 8/31 (25%), Positives = 13/31 (41%), Gaps = 1/31 (3%)

Query: 57 SEEKNKIIENLRNEINAIKQNRHVSYATKAT 87
          S EK +++  LR E+  I    H     +  
Sbjct: 28 SWEKEELLTELRKELR-ISDEEHRELLRRVN 57


>gnl|CDD|223671 COG0598, CorA, Mg2+ and Co2+ transporters [Inorganic ion transport
           and metabolism].
          Length = 322

 Score = 27.7 bits (62), Expect = 4.0
 Identities = 8/40 (20%), Positives = 18/40 (45%)

Query: 39  DERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNR 78
           D   L+D+ + + QL    E   + + +L +   ++  N 
Sbjct: 218 DREYLRDVLDHLTQLIEMLEALRERLSSLLDAYLSLINNN 257


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 27.7 bits (62), Expect = 4.0
 Identities = 12/39 (30%), Positives = 19/39 (48%)

Query: 39  DERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQN 77
           D   L   KEK+ +L +E   K K +E L  E+  ++  
Sbjct: 202 DPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESK 240



 Score = 27.7 bits (62), Expect = 4.5
 Identities = 8/38 (21%), Positives = 20/38 (52%)

Query: 41  RNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNR 78
           + L++L+E++ +L+ + E+       L  EI   ++  
Sbjct: 225 KKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKL 262


>gnl|CDD|197699 smart00397, t_SNARE, Helical region found in SNAREs.  All
          alpha-helical motifs that form twisted and parallel
          four-helix bundles in target soluble
          N-ethylmaleimide-sensitive factor (NSF) attachment
          protein (SNAP) receptor proteins. This motif found in
          "Q-SNAREs".
          Length = 66

 Score = 26.0 bits (58), Expect = 4.2
 Identities = 9/38 (23%), Positives = 23/38 (60%)

Query: 40 ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQN 77
          E+++Q+LK+  L +  E EE+ + ++ + + ++    N
Sbjct: 18 EKSIQELKQIFLDMGTELEEQGEQLDRIEDNVDDADVN 55


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 28.1 bits (63), Expect = 4.2
 Identities = 8/40 (20%), Positives = 22/40 (55%)

Query: 40  ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRH 79
           E  +++L+ ++ QL+ E +   + ++ LR E+  + +   
Sbjct: 781 EAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAA 820


>gnl|CDD|205299 pfam13118, DUF3972, Protein of unknown function (DUF3972).  This is
           a Proteobacterial family of unknown function. Some of
           the proteins in this family are annotated as being
           kinesin-like proteins.
          Length = 126

 Score = 26.9 bits (60), Expect = 4.4
 Identities = 9/29 (31%), Positives = 16/29 (55%)

Query: 44  QDLKEKVLQLQRESEEKNKIIENLRNEIN 72
           + LKE +  +Q   +E  K I+ LR ++ 
Sbjct: 95  KFLKEALYSMQELYDEDRKTIDLLREQLK 123


>gnl|CDD|130658 TIGR01597, PYST-B, Plasmodium yoelii subtelomeric family PYST-B.
           This model represents a paralogous family of Plasmodium
           yoelii genes preferentially located in the subtelomeric
           regions of the chromosomes. There are no obvious
           homologs to these genes in any other organism.
          Length = 255

 Score = 27.6 bits (61), Expect = 4.4
 Identities = 25/127 (19%), Positives = 46/127 (36%), Gaps = 19/127 (14%)

Query: 47  KEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKATQTEDYLDVYNNNADGQDEN 106
           K+ + +LQ+E EE  K ++N  N+   I          +    +  +    NN+  + E+
Sbjct: 112 KKLINKLQKELEELKKELDNEMNDELTI----------QPIHDKIIIKKDENNSVSEHED 161

Query: 107 AKHLVRYVANTGTKTYAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALAGC 166
            K L     ++ ++      D       KI    KK   K +V             +  C
Sbjct: 162 FKQLENEKNSSVSEHE--EFDIASSDNLKIKRKLKKLVKKLIV-------RCLTFIVIVC 212

Query: 167 DLMTIGP 173
            ++  GP
Sbjct: 213 SILVSGP 219


>gnl|CDD|240403 PTZ00400, PTZ00400, DnaK-type molecular chaperone; Provisional.
          Length = 663

 Score = 27.5 bits (61), Expect = 5.3
 Identities = 20/84 (23%), Positives = 41/84 (48%), Gaps = 12/84 (14%)

Query: 33  NDSE---YSDERNLQDLKEKVLQLQRESEEKNKIIENLRN-----EINAIK-QNRHVSYA 83
           N++E   YS E+ L DLK+K+    ++  ++   I  LR+     ++++IK + + +  A
Sbjct: 571 NEAETLIYSVEKQLSDLKDKISDADKDELKQK--ITKLRSTLSSEDVDSIKDKTKQLQEA 628

Query: 84  TKATQTEDYLDVYNNNA-DGQDEN 106
           +     + Y    ++N    Q  N
Sbjct: 629 SWKISQQAYKQGNSDNQQSEQSTN 652


>gnl|CDD|238103 cd00176, SPEC, Spectrin repeats, found in several proteins involved
           in cytoskeletal structure; family members include
           spectrin, alpha-actinin and dystrophin; the spectrin
           repeat forms a three helix bundle with the second helix
           interrupted by proline in some sequences; the repeats
           are independent folding units; tandem repeats are found
           in differing numbers and arrange in an antiparallel
           manner to form dimers; the repeats are defined by a
           characteristic tryptophan (W) residue in helix A and a
           leucine (L) at the carboxyl end of helix C and separated
           by a linker of 5 residues; two copies of the repeat are
           present here.
          Length = 213

 Score = 27.0 bits (60), Expect = 5.5
 Identities = 9/49 (18%), Positives = 23/49 (46%)

Query: 31  LGNDSEYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRH 79
           L ++    D  ++++L +K  +L+ E E     +++L      + +  H
Sbjct: 129 LASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAEELLEEGH 177


>gnl|CDD|177942 PLN02307, PLN02307, phosphoglucomutase.
          Length = 579

 Score = 27.3 bits (61), Expect = 6.1
 Identities = 12/40 (30%), Positives = 22/40 (55%), Gaps = 1/40 (2%)

Query: 56  ESEEKNKIIENLRNEINAIKQNRHVSYATKATQTE-DYLD 94
           +SE  NK++++LR+ +N  K+       T A   + +Y D
Sbjct: 456 DSEAANKMMDHLRDLVNKSKKGIKYGVYTLAFADDFEYTD 495


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 27.4 bits (61), Expect = 6.1
 Identities = 13/60 (21%), Positives = 28/60 (46%), Gaps = 1/60 (1%)

Query: 34  DSEYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEI-NAIKQNRHVSYATKATQTEDY 92
             +  +   LQ +KE+  +L++E +   + I++ +N++ N  KQ +      K     D 
Sbjct: 205 KKDIKNPLELQKIKEEFDKLKKEGKADKQKIKSAKNDLQNDKKQLKADLAELKKAPQNDL 264


>gnl|CDD|165339 PHA03046, PHA03046, Hypothetical protein; Provisional.
          Length = 142

 Score = 26.6 bits (59), Expect = 6.4
 Identities = 21/70 (30%), Positives = 32/70 (45%), Gaps = 7/70 (10%)

Query: 29 IFLGNDSEYSD-----ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYA 83
          IF G+D E +D     E+  ++      +L  E EE N I  N+RNEI  I   ++ ++ 
Sbjct: 7  IFPGDDDEENDRNEHREKTSEEDGHYKKRLDVEEEEPNLI--NIRNEIKKIINEKYSTFP 64

Query: 84 TKATQTEDYL 93
                 D L
Sbjct: 65 IHEDDISDIL 74


>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
           Nexins.  BAR domains are dimerization, lipid binding and
           curvature sensing modules found in many different
           proteins with diverse functions. Sorting nexins (SNXs)
           are Phox homology (PX) domain containing proteins that
           are involved in regulating membrane traffic and protein
           sorting in the endosomal system. SNXs differ from each
           other in their lipid-binding specificity, subcellular
           localization and specific function in the endocytic
           pathway. A subset of SNXs also contain BAR domains. The
           PX-BAR structural unit determines the specific membrane
           targeting of SNXs. BAR domains form dimers that bind to
           membranes, induce membrane bending and curvature, and
           may also be involved in protein-protein interactions.
          Length = 218

 Score = 26.9 bits (60), Expect = 6.7
 Identities = 10/47 (21%), Positives = 21/47 (44%), Gaps = 4/47 (8%)

Query: 40  ERNLQDLKEKVLQLQRESEEKNK----IIENLRNEINAIKQNRHVSY 82
              +++L+E++ + +   EE  K    I E L+ E+    + R    
Sbjct: 144 PAKVEELEEELEEAESALEEARKRYEEISERLKEELKRFHEERARDL 190


>gnl|CDD|226361 COG3842, PotA, ABC-type spermidine/putrescine transport systems,
           ATPase components [Amino acid transport and metabolism].
          Length = 352

 Score = 27.2 bits (61), Expect = 6.8
 Identities = 11/29 (37%), Positives = 12/29 (41%), Gaps = 1/29 (3%)

Query: 138 NYYKKFGYKTVVMGASFR-NTGEILALAG 165
           N  K FG  T V   S     GE + L G
Sbjct: 10  NVSKSFGDFTAVDDISLDIKKGEFVTLLG 38


>gnl|CDD|107320 cd06325, PBP1_ABC_uncharacterized_transporter, Type I periplasmic
           ligand-binding domain of uncharacterized ABC-type
           transport systems that are predicted to be involved in
           the uptake of amino acids, peptides, or inorganic ions. 
           This group includes the type I periplasmic
           ligand-binding domain of uncharacterized ABC (ATPase
           Binding Cassette)-type transport systems that are
           predicted to be involved in the uptake of amino acids,
           peptides, or inorganic ions. This subgroup has high
           sequence similarity to members of the family of
           hydrophobic amino acid transporters (HAAT), such as
           leucine/isoleucine/valine binding protein (LIVBP); its
           ligand specificity has not been determined
           experimentally.
          Length = 281

 Score = 26.8 bits (60), Expect = 6.9
 Identities = 11/43 (25%), Positives = 19/43 (44%), Gaps = 2/43 (4%)

Query: 122 YAPTEDPGVVSVTKIYNYYKKFGYKTVVMGASFRNTGEILALA 164
           Y P+E   VV V ++     K G + V    S  ++ ++   A
Sbjct: 138 YNPSEANSVVQVKELKKAAAKLGIEVVEATVS--SSNDVQQAA 178


>gnl|CDD|201159 pfam00329, Complex1_30kDa, Respiratory-chain NADH dehydrogenase, 30
           Kd subunit. 
          Length = 103

 Score = 26.0 bits (58), Expect = 7.2
 Identities = 9/24 (37%), Positives = 14/24 (58%), Gaps = 3/24 (12%)

Query: 120 KTYAPTEDPGVVSVTKIY---NYY 140
           K   P +DP + S+T I+   N+Y
Sbjct: 46  KVKLPRDDPEIPSITDIFPGANWY 69


>gnl|CDD|134340 PRK00650, PRK00650, 4-diphosphocytidyl-2-C-methyl-D-erythritol
           kinase; Provisional.
          Length = 288

 Score = 26.7 bits (59), Expect = 7.5
 Identities = 13/43 (30%), Positives = 25/43 (58%), Gaps = 4/43 (9%)

Query: 35  SEYSDERNLQ----DLKEKVLQLQRESEEKNKIIENLRNEINA 73
           S+ S  +NL+    DL++ V +L+ + +EK   +E+L  E+  
Sbjct: 183 SDCSSRKNLEYTQNDLEKPVFRLRLDLKEKKHWLESLWAELPV 225


>gnl|CDD|203371 pfam06005, DUF904, Protein of unknown function (DUF904).  This
          family consists of several bacterial and archaeal
          hypothetical proteins of unknown function.
          Length = 72

 Score = 25.3 bits (56), Expect = 7.8
 Identities = 12/36 (33%), Positives = 22/36 (61%)

Query: 40 ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIK 75
          +  +++LKE+  QL  E+EE  +  + L+ E NA +
Sbjct: 24 QMEIEELKEENEQLSEENEELEEENKKLKEERNAWQ 59


>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 758

 Score = 27.2 bits (60), Expect = 7.9
 Identities = 9/34 (26%), Positives = 20/34 (58%)

Query: 44  QDLKEKVLQLQRESEEKNKIIENLRNEINAIKQN 77
             L++K+  L++E +   + ++ L NEI  I++ 
Sbjct: 448 CGLEKKINNLEKEIKNNQEEVKKLSNEIKEIEKF 481


>gnl|CDD|222689 pfam14335, DUF4391, Domain of unknown function (DUF4391).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and archaea.
           Proteins in this family are typically between 220 and
           257 amino acids in length.
          Length = 221

 Score = 26.5 bits (59), Expect = 8.1
 Identities = 17/59 (28%), Positives = 28/59 (47%), Gaps = 2/59 (3%)

Query: 33  NDSEYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQ-NRHVSYATKATQTE 90
           N S  +     +DLKE+V +   + EE  K I  L+ ++   KQ NR V    +  + +
Sbjct: 161 NASSNTGVYEKEDLKERV-ERLEQIEELEKEIAKLKKKLKKEKQFNRKVELNLEIKKLK 218


>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
          Length = 3164

 Score = 27.3 bits (60), Expect = 8.3
 Identities = 13/54 (24%), Positives = 24/54 (44%)

Query: 21   KYLPKSVGIFLGNDSEYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAI 74
              L  S+      D       NL+  ++++ + + ES E    I+ L++E NA 
Sbjct: 2647 AKLMLSISNMEQTDEIAVLLHNLKKSRKEIEEEESESMEIEDRIDALKSEYNAS 2700


>gnl|CDD|238575 cd01170, THZ_kinase, 4-methyl-5-beta-hydroxyethylthiazole (Thz)
           kinase catalyzes the phosphorylation of the
           hydroxylgroup of Thz. A reaction that allows cells to
           recycle Thz into the thiamine biosynthesis pathway, as
           an alternative to its synthesis from cysteine, tyrosine
           and 1-deoxy-D-xylulose-5-phosphate.
          Length = 242

 Score = 26.7 bits (60), Expect = 8.7
 Identities = 16/54 (29%), Positives = 21/54 (38%), Gaps = 4/54 (7%)

Query: 142 KFGYKTVVMGASFRNTGEILALAGCDLMTIGPKLLEELENSTTPVDQMLSEKSG 195
             G  TV+ G    N  EI ALAG   +  G       E     + + L+ K G
Sbjct: 104 AEGQPTVIRG----NASEIAALAGLTGLGKGVDSSSSDEEDALELAKALARKYG 153


>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
          Length = 865

 Score = 27.0 bits (61), Expect = 8.9
 Identities = 10/36 (27%), Positives = 17/36 (47%)

Query: 42  NLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQN 77
              +L E+V  L  E +E  K +E L+ ++ A    
Sbjct: 713 KPSELPERVEALLEELKELEKELEQLKAKLAAAAAG 748


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 26.6 bits (59), Expect = 9.0
 Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 2/64 (3%)

Query: 33  NDSEYSD-ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKATQTED 91
           ++ E       +Q  KE++  L+ E  E  + IE L  EI    + R         + E 
Sbjct: 87  DERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIED-LKERLERLEKNLAEAEA 145

Query: 92  YLDV 95
            L+ 
Sbjct: 146 RLEE 149


>gnl|CDD|153296 cd07612, BAR_Bin2, The Bin/Amphiphysin/Rvs (BAR) domain of Bridging
           integrator 2.  BAR domains are dimerization, lipid
           binding and curvature sensing modules found in many
           different proteins with diverse functions. Bridging
           integrator 2 (Bin2) is a BAR domain containing protein
           that is mainly expressed in hematopoietic cells. It is
           upregulated during granulocyte differentiation and is
           thought to function primarily in this lineage. The BAR
           domain of Bin2 is closely related to the BAR domains of
           amphiphysins, which function primarily in endocytosis
           and other membrane remodeling events. Amphiphysins
           contain an N-terminal BAR domain with an additional
           N-terminal amphipathic helix (an N-BAR), a variable
           central domain, and a C-terminal SH3 domain. Unlike
           amphiphysins, Bin2 does not appear to contain a
           C-terminal SH3 domain. Amphiphysin I proteins, enriched
           in the brain and nervous system, function in synaptic
           vesicle endocytosis. Some amphiphysin II isoforms, also
           called Bridging integrator 1 (Bin1), function in
           intracellular vessicle trafficking. Bin2 can form a
           stable complex with Bin1 in cells but cannot replace the
           function of Bin1, and thus, appears to harbor a
           nonredundant function. The N-BAR domain of amphiphysin
           forms a curved dimer with a positively-charged concave
           face that can drive membrane bending and curvature.
          Length = 211

 Score = 26.4 bits (58), Expect = 9.7
 Identities = 18/75 (24%), Positives = 37/75 (49%), Gaps = 8/75 (10%)

Query: 31  LGNDSEYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQNRHVSYATKATQTE 90
           LG   E  DE+     ++  + L  +  + N++ ++L+  +NA+K     S   + +QT 
Sbjct: 3   LGKTVETKDEQ----FEQCAMNLNMQQSDGNRLYKDLKAYLNAVKVMHESS--KRLSQT- 55

Query: 91  DYLDVYNNNADGQDE 105
              D+Y  + DG ++
Sbjct: 56  -LQDIYEPDWDGHED 69


>gnl|CDD|238453 cd00890, Prefoldin, Prefoldin is a hexameric molecular chaperone
           complex, found in both eukaryotes and archaea, that
           binds and stabilizes newly synthesized polypeptides
           allowing them to fold correctly.  The complex contains
           two alpha and four beta subunits, the two subunits being
           evolutionarily related. In archaea, there is usually
           only one gene for each subunit while in eukaryotes there
           two or more paralogous genes encoding each subunit
           adding heterogeneity to the structure of the hexamer.
           The structure of the complex consists of a double beta
           barrel assembly with six protruding coiled-coils.
          Length = 129

 Score = 25.8 bits (57), Expect = 9.7
 Identities = 11/42 (26%), Positives = 27/42 (64%)

Query: 36  EYSDERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQN 77
           E S E  ++ LK+++  L+++ E+  K +E L+++I  +++ 
Sbjct: 82  EKSLEEAIEFLKKRLETLEKQIEKLEKQLEKLQDQITELQEE 123


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 27.0 bits (60), Expect = 9.8
 Identities = 11/38 (28%), Positives = 22/38 (57%)

Query: 40  ERNLQDLKEKVLQLQRESEEKNKIIENLRNEINAIKQN 77
           E+ L+ L+E++ +   E  E  K +E LR E+  +++ 
Sbjct: 618 EKELKKLEEELDKAFEELAETEKRLEELRKELEELEKK 655


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.132    0.374 

Gapped
Lambda     K      H
   0.267   0.0722    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,884,553
Number of extensions: 900230
Number of successful extensions: 1343
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1327
Number of HSP's successfully gapped: 157
Length of query: 199
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 107
Effective length of database: 6,857,034
Effective search space: 733702638
Effective search space used: 733702638
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 56 (25.4 bits)