RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy18023
         (151 letters)



>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 37.8 bits (89), Expect = 0.001
 Identities = 28/109 (25%), Positives = 57/109 (52%), Gaps = 10/109 (9%)

Query: 47  IEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESN 106
           +EE + E E       L + K+E+ ++  E ++ ++E    +++      E  L+ KE N
Sbjct: 44  LEEAKKEAEAIKK-EALLEAKEEIHKLRNEFEKELRERRNELQK-----LEKRLLQKEEN 97

Query: 107 ----QNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
                  L++ ++EL++  KEL+   +EL +  +EL ++I+E  Q +E 
Sbjct: 98  LDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELER 146



 Score = 34.4 bits (80), Expect = 0.017
 Identities = 25/101 (24%), Positives = 63/101 (62%), Gaps = 6/101 (5%)

Query: 48  EEQEMEKECKVDFN-KLDQIKKELDQIIKELDQIIKELD--LIIKESNQVIKELNLIIKE 104
           E + ++KE  ++   ++ +++ E ++ ++E    +++L+  L+ KE N + ++L L+ K 
Sbjct: 50  EAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEEN-LDRKLELLEKR 108

Query: 105 SNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKEL 145
             + +L++ +KEL+Q  +EL+   +EL ++I+E  Q ++ +
Sbjct: 109 --EEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERI 147



 Score = 28.2 bits (64), Expect = 1.6
 Identities = 22/78 (28%), Positives = 48/78 (61%), Gaps = 4/78 (5%)

Query: 61  NKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKEL-NLIIKESNQNKLDQIKKELDQ 119
            +L++ +KEL+Q  +EL++  +EL+ +I+E  Q ++ +  L  +E+ +  L+++++E   
Sbjct: 110 EELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEILLEKVEEEA-- 167

Query: 120 MVKELDLTIKELNQIIKE 137
              E  + IKE+ +  KE
Sbjct: 168 -RHEAAVLIKEIEEEAKE 184


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 37.6 bits (88), Expect = 0.001
 Identities = 25/99 (25%), Positives = 55/99 (55%), Gaps = 14/99 (14%)

Query: 53  EKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQ 112
           E E  VDF +L +  KEL++ +KEL++ ++++  ++K            + +  + +L  
Sbjct: 322 EIEVGVDFPELKEELKELEEELKELEEELEKIKKLLK-----------KLPKKARGQLPP 370

Query: 113 IKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
            K+E  + + E   T ++L++ ++EL + +KEL + +E 
Sbjct: 371 EKREQLEKLLE---TKEKLSEELEELEEELKELKEELES 406


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 36.3 bits (85), Expect = 0.003
 Identities = 24/92 (26%), Positives = 50/92 (54%), Gaps = 2/92 (2%)

Query: 62  KLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMV 121
           +L+Q  +E + ++KE +++ +EL+   ++  +   +L    ++  Q  + + KKE D+++
Sbjct: 531 ELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEII 590

Query: 122 KELDLTIKELNQIIK--ELNQIIKELNQIVEE 151
           KEL    K     +K  EL +  K LN+  E+
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEK 622


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 34.3 bits (79), Expect = 0.016
 Identities = 23/93 (24%), Positives = 47/93 (50%), Gaps = 12/93 (12%)

Query: 57  KVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKE 116
            +D  KL  +  E     K+L   I+EL     + N++ K++     +  ++K+++IKKE
Sbjct: 26  DIDLEKLIALDDER----KKLLSEIEELQ---AKRNELSKQIGKAKGQ-KKDKIEEIKKE 77

Query: 117 LDQMVKELDLTIKELNQIIKELNQIIKELNQIV 149
           L ++ +EL     EL+  +K L   +++    +
Sbjct: 78  LKELKEEL----TELSAALKALEAELQDKLLSI 106



 Score = 26.9 bits (60), Expect = 5.3
 Identities = 20/94 (21%), Positives = 45/94 (47%), Gaps = 4/94 (4%)

Query: 61  NKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQM 120
           N  D +K+ L      +D  +++L  +  E  +++ E+  +  + N+    QI K   Q 
Sbjct: 9   NNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELS-KQIGKAKGQK 67

Query: 121 VKELDLTIKELNQI---IKELNQIIKELNQIVEE 151
             +++   KEL ++   + EL+  +K L   +++
Sbjct: 68  KDKIEEIKKELKELKEELTELSAALKALEAELQD 101


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 34.2 bits (79), Expect = 0.018
 Identities = 26/114 (22%), Positives = 56/114 (49%), Gaps = 7/114 (6%)

Query: 32  GSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKES 91
           G+ +E  I + +S+G+ E+E+E+  K       +   E D+  KE  +     D +  ++
Sbjct: 483 GTGKEQSITITASSGLSEEEIERMVKEA-----EANAEEDKKRKEEIEARNNADSLAYQA 537

Query: 92  NQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKEL 145
            + +KE    +    + K+++   EL + +K  D+   E+    +EL Q +++L
Sbjct: 538 EKTLKEAGDKLPAEEKEKIEKAVAELKEALKGEDVE--EIKAKTEELQQALQKL 589


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 33.6 bits (77), Expect = 0.025
 Identities = 25/81 (30%), Positives = 44/81 (54%), Gaps = 9/81 (11%)

Query: 62  KLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMV 121
           KL  ++KEL+Q  + L++++KE + + KE  Q ++EL    +          K EL++  
Sbjct: 519 KLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERN--------KKLELEKEA 570

Query: 122 KE-LDLTIKELNQIIKELNQI 141
           +E L    KE+  II+EL + 
Sbjct: 571 QEALKALKKEVESIIRELKEK 591


>gnl|CDD|216597 pfam01601, Corona_S2, Coronavirus S2 glycoprotein.  The coronavirus
           spike glycoprotein forms the characteristic 'corona'
           after which the group is named. The Spike glycoprotein
           is translated as a large polypeptide that is
           subsequently cleaved to S1 pfam01600 and S2.
          Length = 609

 Score = 33.8 bits (78), Expect = 0.027
 Identities = 25/98 (25%), Positives = 40/98 (40%), Gaps = 23/98 (23%)

Query: 56  CKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKK 115
           C V++  +      L  II +     KELD   K  N  + +L+L          D+   
Sbjct: 460 CDVNYVNIT--NTVLPPIIPDYIDFNKELDKWFKNLNSTVPDLDL----------DKFNA 507

Query: 116 ELDQMVKELDLT--IKELNQIIKELNQII---KELNQI 148
            +      L+LT  I  L ++IK LN  +   + LN+ 
Sbjct: 508 TI------LNLTGEIDRLQEVIKNLNDSLIDLEWLNRY 539


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 33.7 bits (78), Expect = 0.028
 Identities = 24/78 (30%), Positives = 46/78 (58%), Gaps = 6/78 (7%)

Query: 69  ELDQIIKELDQIIKELDLIIKESNQV-IKELNLIIKESNQNKLDQIKKELDQMVKELDLT 127
           +L + + +L   + +L+ + +E  +V +K L  +IK+    +L++I+KE+ ++ +E    
Sbjct: 54  KLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVE-EELEKIEKEIKELEEE---- 108

Query: 128 IKELNQIIKELNQIIKEL 145
           I EL   IKEL Q I+ L
Sbjct: 109 ISELENEIKELEQEIERL 126



 Score = 31.8 bits (73), Expect = 0.11
 Identities = 19/80 (23%), Positives = 40/80 (50%), Gaps = 5/80 (6%)

Query: 74  IKELDQIIKELDLI--IKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKEL 131
              L ++ + LD +         ++E    +       L+++ K++++ +++++  IKEL
Sbjct: 49  RSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVK---SLEELIKDVEEELEKIEKEIKEL 105

Query: 132 NQIIKELNQIIKELNQIVEE 151
            + I EL   IKEL Q +E 
Sbjct: 106 EEEISELENEIKELEQEIER 125



 Score = 26.4 bits (59), Expect = 7.7
 Identities = 21/84 (25%), Positives = 40/84 (47%), Gaps = 6/84 (7%)

Query: 49  EQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQN 108
              + K   +   K     K L+++IK++++ +++++  IKE  + I EL   IKE    
Sbjct: 63  RSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKE---- 118

Query: 109 KLDQIKKELDQMVKELDLTIKELN 132
            L+Q  + L+      DL +  L 
Sbjct: 119 -LEQEIERLEPW-GNFDLDLSLLL 140



 Score = 26.0 bits (58), Expect = 9.5
 Identities = 18/64 (28%), Positives = 35/64 (54%)

Query: 41  LESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNL 100
           L S         E++ KV    L+++ K++++ ++++++ IKEL+  I E    IKEL  
Sbjct: 62  LRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQ 121

Query: 101 IIKE 104
            I+ 
Sbjct: 122 EIER 125


>gnl|CDD|224380 COG1463, Ttg2C, ABC-type transport system involved in resistance to
           organic solvents, periplasmic component [Secondary
           metabolites biosynthesis, transport, and catabolism].
          Length = 359

 Score = 33.2 bits (76), Expect = 0.035
 Identities = 24/121 (19%), Positives = 45/121 (37%), Gaps = 12/121 (9%)

Query: 30  TDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIK 89
           T G   E  I L   +    +               I  E  Q+  +L+ ++   DL++ 
Sbjct: 103 TTGLLGEKYIELTPGDAGGPKLRLG---------ATIPLERTQVPIDLEDLL--GDLLLL 151

Query: 90  ESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIV 149
                   LN I+ E+    L     +L+ ++  L      LN    ++  +I  LNQ++
Sbjct: 152 LGGLDPDRLNAILNEAAAA-LAGTGPQLNALLDNLAQFTDALNARDGDIGALIANLNQLL 210

Query: 150 E 150
           +
Sbjct: 211 D 211


>gnl|CDD|184039 PRK13411, PRK13411, molecular chaperone DnaK; Provisional.
          Length = 653

 Score = 33.2 bits (76), Expect = 0.038
 Identities = 25/121 (20%), Positives = 63/121 (52%), Gaps = 11/121 (9%)

Query: 32  GSTEESEINLESSNGIEEQE---MEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLII 88
           G+  E  I + ++ G+   E   M +E +  + + D+ +K+L ++  + D ++   +  +
Sbjct: 487 GTGREQSIRITNTGGLSSNEIERMRQEAEK-YAEEDRRRKQLIELKNQADSLLYSYESTL 545

Query: 89  KESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQI 148
           KE+ ++I E    +K+  + K++Q++  L       +++++EL Q ++E  Q +  +   
Sbjct: 546 KENGELISE---ELKQRAEQKVEQLEAALTDP----NISLEELKQQLEEFQQALLAIGAE 598

Query: 149 V 149
           V
Sbjct: 599 V 599


>gnl|CDD|215960 pfam00509, Hemagglutinin, Haemagglutinin.  Hemagglutinin from
           influenza virus causes membrane fusion of the viral
           membrane with the host membrane. Fusion occurs after the
           host cell internalises the virus by endocytosis. The
           drop of pH causes release of a hydrophobic fusion
           peptide and a large conformational change leading to
           membrane fusion.
          Length = 550

 Score = 32.7 bits (75), Expect = 0.049
 Identities = 11/45 (24%), Positives = 24/45 (53%), Gaps = 4/45 (8%)

Query: 103 KESNQNKLDQIKKEL----DQMVKELDLTIKELNQIIKELNQIIK 143
            +S Q  +DQI  +L    ++M  + +   KE +++ + +N + K
Sbjct: 366 LKSTQKAIDQITGKLNSLIEKMNTQFEAIEKEFSELERRINNLNK 410



 Score = 26.5 bits (59), Expect = 5.9
 Identities = 11/49 (22%), Positives = 28/49 (57%), Gaps = 3/49 (6%)

Query: 63  LDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLD 111
           +DQI  +L+ +I   +++  + + I KE +++ + +N + K+ +    D
Sbjct: 373 IDQITGKLNSLI---EKMNTQFEAIEKEFSELERRINNLNKKVDDGITD 418


>gnl|CDD|223910 COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility
           and secretion / Signal transduction mechanisms].
          Length = 408

 Score = 32.7 bits (74), Expect = 0.054
 Identities = 20/87 (22%), Positives = 46/87 (52%), Gaps = 4/87 (4%)

Query: 68  KELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIK---KELDQMVKEL 124
           KE+  +I+E+     +    ++ES   + E  + + E   + L +I    +E+ Q++ E+
Sbjct: 292 KEIGLLIEEIQNEAADAVEHMEESASEVSE-GVKLVEETGSSLGEIAAAIEEVSQLISEI 350

Query: 125 DLTIKELNQIIKELNQIIKELNQIVEE 151
               +E   +++E+N  I+EL+ + +E
Sbjct: 351 AAATEEQTAVLEEINASIEELDDVTQE 377



 Score = 32.3 bits (73), Expect = 0.089
 Identities = 22/94 (23%), Positives = 53/94 (56%), Gaps = 4/94 (4%)

Query: 61  NKLDQIKKELDQI---IKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKEL 117
            + DQ  + L+++   I+EL + +KE+    KE+  +  E + + +E  +  + Q  +++
Sbjct: 159 ARADQQAESLEEVASAIEELSETVKEVAFNAKEAAALASEASQVAEEGGEE-VRQAVEQM 217

Query: 118 DQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
            ++ +EL   +K+L++  +E+ +I   +N I E+
Sbjct: 218 QEIAEELAEVVKKLSESSQEIEEITSVINSIAEQ 251



 Score = 31.9 bits (72), Expect = 0.11
 Identities = 19/92 (20%), Positives = 48/92 (52%), Gaps = 5/92 (5%)

Query: 60  FNKLDQIKKELDQIIKELDQI---IKELDLIIKESNQVIKELNLIIKESNQN--KLDQIK 114
            +++ +  K +++    L +I   I+E+  +I E     +E   +++E N +  +LD + 
Sbjct: 316 ASEVSEGVKLVEETGSSLGEIAAAIEEVSQLISEIAAATEEQTAVLEEINASIEELDDVT 375

Query: 115 KELDQMVKELDLTIKELNQIIKELNQIIKELN 146
           +E    V+EL    +EL ++ ++L +++ +  
Sbjct: 376 QENAAAVEELAAASEELKELAEKLLELVAKFK 407



 Score = 31.1 bits (70), Expect = 0.22
 Identities = 18/89 (20%), Positives = 42/89 (47%), Gaps = 2/89 (2%)

Query: 63  LDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVK 122
           +++I+ E    ++ +++   E+   +K   +    L  I     +  + Q+  E+    +
Sbjct: 298 IEEIQNEAADAVEHMEESASEVSEGVKLVEETGSSLGEIAAAIEE--VSQLISEIAAATE 355

Query: 123 ELDLTIKELNQIIKELNQIIKELNQIVEE 151
           E    ++E+N  I+EL+ + +E    VEE
Sbjct: 356 EQTAVLEEINASIEELDDVTQENAAAVEE 384



 Score = 28.0 bits (62), Expect = 2.0
 Identities = 20/112 (17%), Positives = 52/112 (46%), Gaps = 4/112 (3%)

Query: 42  ESSNGIEE--QEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELN 99
           +S+  I    +E++ E       +++   E+ + +K +++    L  I     + + +L 
Sbjct: 289 DSAKEIGLLIEEIQNEAADAVEHMEESASEVSEGVKLVEETGSSLGEIAAAIEE-VSQLI 347

Query: 100 LIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
             I  + +     + +E++  ++ELD   +E    ++EL    +EL ++ E+
Sbjct: 348 SEIAAATEE-QTAVLEEINASIEELDDVTQENAAAVEELAAASEELKELAEK 398


>gnl|CDD|217653 pfam03639, Glyco_hydro_81, Glycosyl hydrolase family 81.  Family of
           eukaryotic beta-1,3-glucanases. Within the Aspergillus
           fumigatus protein two perfectly conserved Glu residues
           (E550 or E554) have been proposed as putative
           nucleophiles of the active site of the Engl1
           endoglucanase, while the proton donor would be D475. The
           endo-beta-1,3-glucanase activity is essential for
           efficient spore release.
          Length = 694

 Score = 32.6 bits (74), Expect = 0.064
 Identities = 13/31 (41%), Positives = 13/31 (41%), Gaps = 9/31 (29%)

Query: 16  YFALFRMFD---------GLTPVTDGSTEES 37
           YF  FR FD         GL    DG  EES
Sbjct: 499 YFPQFRNFDWFNGHSWAGGLFESGDGKNEES 529


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 32.7 bits (75), Expect = 0.068
 Identities = 22/104 (21%), Positives = 56/104 (53%), Gaps = 14/104 (13%)

Query: 61  NKLDQIKKELDQIIKELDQIIKELDLIIK------ESNQVIKELNLIIK------ESNQN 108
             ++Q+++ +D+    +++ ++EL+ I+       E    ++EL  +++      E N  
Sbjct: 282 ELIEQLEELIDKYESHIEKALEELESILDTEKENSEFKLDVEELKALLEALEEILEKNLQ 341

Query: 109 KLDQIKKELDQMVKELDLT--IKELNQIIKELNQIIKELNQIVE 150
           KL++  K+    ++   +T  I+ +N II  +N++I+E N+ ++
Sbjct: 342 KLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKID 385



 Score = 30.0 bits (68), Expect = 0.55
 Identities = 24/91 (26%), Positives = 47/91 (51%), Gaps = 6/91 (6%)

Query: 61  NKLDQIKKELDQIIKELDQ-IIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQ 119
            K+D +KKE ++  K+L   ++ EL   I    +  K L   I  S + ++ Q++ E   
Sbjct: 382 EKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAI-NSLEKEIKQLEAE--- 437

Query: 120 MVKELDLTIKELNQIIKELNQIIKELNQIVE 150
            +K L+  IKEL + +  +     E+N++++
Sbjct: 438 -IKALEKEIKELEKQLTNIEPTADEINKLLK 467



 Score = 29.6 bits (67), Expect = 0.64
 Identities = 28/91 (30%), Positives = 47/91 (51%), Gaps = 9/91 (9%)

Query: 61  NKLDQIKKELDQI---IKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKE- 116
           + ++ I   +D I   I+E ++ I  L    KE N+  K+L L +    +  +D  +KE 
Sbjct: 361 DLIESINDIIDAINELIREHNEKIDNLK---KEKNKAKKKLWLHLVAELKEDIDAYQKEK 417

Query: 117 --LDQMVKELDLTIKELNQIIKELNQIIKEL 145
             L++ +  L+  IK+L   IK L + IKEL
Sbjct: 418 KGLEKAINSLEKEIKQLEAEIKALEKEIKEL 448



 Score = 27.3 bits (61), Expect = 4.5
 Identities = 25/122 (20%), Positives = 59/122 (48%), Gaps = 20/122 (16%)

Query: 50  QEMEKECKVDFNKLDQIKKELDQIIKE---------LDQIIKELDLIIKESNQVIKELNL 100
           +E++   +     L++  ++L++ +K+         +  +I+ ++ II   N++I+E N 
Sbjct: 323 EELKALLEALEEILEKNLQKLEEKLKDPSTSIELESITDLIESINDIIDAINELIREHNE 382

Query: 101 IIK--ESNQNKL---------DQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIV 149
            I   +  +NK           ++K+++D   KE     K +N + KE+ Q+  E+  + 
Sbjct: 383 KIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALE 442

Query: 150 EE 151
           +E
Sbjct: 443 KE 444


>gnl|CDD|214599 smart00283, MA, Methyl-accepting chemotaxis-like domains
           (chemotaxis sensory transducer).  Thought to undergo
           reversible methylation in response to attractants or
           repellants during bacterial chemotaxis.
          Length = 262

 Score = 32.3 bits (74), Expect = 0.069
 Identities = 22/87 (25%), Positives = 48/87 (55%), Gaps = 4/87 (4%)

Query: 68  KELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIK---KELDQMVKEL 124
           KE++ +IKE+ +   E    ++ES+  ++E   +++E+    L++I    +E+  +V+E+
Sbjct: 147 KEIESLIKEIQEETNEAVAAMEESSSEVEEGVELVEETGDA-LEEIVDSVEEIADLVQEI 205

Query: 125 DLTIKELNQIIKELNQIIKELNQIVEE 151
                E     +E+N  I E+ Q+ +E
Sbjct: 206 AAATDEQAAGSEEVNAAIDEIAQVTQE 232


>gnl|CDD|227785 COG5498, ACF2, Predicted glycosyl hydrolase [Cell envelope
           biogenesis, outer membrane].
          Length = 760

 Score = 32.5 bits (74), Expect = 0.071
 Identities = 14/31 (45%), Positives = 14/31 (45%), Gaps = 9/31 (29%)

Query: 16  YFALFRMFD---------GLTPVTDGSTEES 37
           YF  FRMFD         GL    DG  EES
Sbjct: 557 YFPQFRMFDWFNGHSWADGLFESGDGKDEES 587


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 32.3 bits (74), Expect = 0.073
 Identities = 25/109 (22%), Positives = 64/109 (58%), Gaps = 8/109 (7%)

Query: 48  EEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKEL-----NLII 102
           E +E++KE     ++L   +++L+++ +E++++ +ELD + +E  ++ +EL      +  
Sbjct: 372 ELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAG 431

Query: 103 KESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
            E+  N+L++ K++    +K+ +    +L Q+  +L++  +EL  + EE
Sbjct: 432 IEAKINELEEEKEDKALEIKKQE---WKLEQLAADLSKYEQELYDLKEE 477



 Score = 30.8 bits (70), Expect = 0.28
 Identities = 20/104 (19%), Positives = 60/104 (57%), Gaps = 4/104 (3%)

Query: 48  EEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQ 107
           E++ +E++ +    +L  +++EL+++ +E+ ++ K     ++E  Q+++ELN  IK+  +
Sbjct: 231 EKEALERQKEAIERQLASLEEELEKLTEEISELEKR----LEEIEQLLEELNKKIKDLGE 286

Query: 108 NKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
            +  ++K+++ ++  E+    + + +  +EL    + L ++  E
Sbjct: 287 EEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAE 330



 Score = 30.8 bits (70), Expect = 0.30
 Identities = 21/97 (21%), Positives = 51/97 (52%), Gaps = 1/97 (1%)

Query: 55  ECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIK 114
           E +   N+L   K+ L++ I+EL +   +L   IK   + I+ LN   +E  + +L++++
Sbjct: 816 EIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEE-ELEELE 874

Query: 115 KELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
             L  +   L    KE +++  +L ++ +++ ++  +
Sbjct: 875 AALRDLESRLGDLKKERDELEAQLRELERKIEELEAQ 911



 Score = 30.4 bits (69), Expect = 0.33
 Identities = 24/105 (22%), Positives = 54/105 (51%), Gaps = 9/105 (8%)

Query: 47  IEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESN 106
           +EE   + E ++  +++ +I+ EL     +L++ +  ++  ++E  Q +  L L      
Sbjct: 777 LEEALNDLEARLSHSRIPEIQAELS----KLEEEVSRIEARLREIEQKLNRLTL-----E 827

Query: 107 QNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
           +  L++  +EL +   +L   IK + + I+ LN   +EL + +EE
Sbjct: 828 KEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEE 872



 Score = 30.4 bits (69), Expect = 0.39
 Identities = 22/104 (21%), Positives = 61/104 (58%), Gaps = 5/104 (4%)

Query: 48  EEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQ 107
           E ++ E+       ++D++  E++++ +E+++  K  D + +E  ++ +EL  +     +
Sbjct: 316 ELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDL-----R 370

Query: 108 NKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
            +L+++ KE  +   EL    ++L ++ +E+N++ +EL+++ EE
Sbjct: 371 AELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEE 414



 Score = 30.4 bits (69), Expect = 0.40
 Identities = 21/104 (20%), Positives = 56/104 (53%), Gaps = 4/104 (3%)

Query: 48  EEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKE---LNLIIKE 104
             + +++E     ++L +I+  LD++ +EL    +++  I KE  Q+ +E   L   ++E
Sbjct: 682 RLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEE 741

Query: 105 SNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQI 148
             +  L  +++E++ +  EL      + ++ ++L+++ + LN +
Sbjct: 742 L-EEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDL 784



 Score = 29.3 bits (66), Expect = 0.81
 Identities = 25/135 (18%), Positives = 69/135 (51%), Gaps = 4/135 (2%)

Query: 21  RMFDGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDF-NKLDQIKKELDQIIKELDQ 79
           R+ +    ++D S +  EI  E     +E+E  KE   +    L  +++E++ +  EL +
Sbjct: 703 RLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKE 762

Query: 80  IIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQM---VKELDLTIKELNQIIK 136
           +   ++ + ++ +++ + LN +    + +++ +I+ EL ++   V  ++  ++E+ Q + 
Sbjct: 763 LEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLN 822

Query: 137 ELNQIIKELNQIVEE 151
            L    + L + ++E
Sbjct: 823 RLTLEKEYLEKEIQE 837



 Score = 28.5 bits (64), Expect = 1.5
 Identities = 22/126 (17%), Positives = 63/126 (50%), Gaps = 22/126 (17%)

Query: 46  GIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIK--------- 96
           G+ E + +KE      +L+++++ ++++   +D+  ++L+ + +E  +  +         
Sbjct: 164 GVAEFDRKKEKA--LEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKR 221

Query: 97  --ELNLIIKESN---------QNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKEL 145
             E   ++KE           + +L  +++EL+++ +E+    K L +I + L ++ K++
Sbjct: 222 EYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKI 281

Query: 146 NQIVEE 151
             + EE
Sbjct: 282 KDLGEE 287



 Score = 26.6 bits (59), Expect = 7.3
 Identities = 23/121 (19%), Positives = 59/121 (48%), Gaps = 4/121 (3%)

Query: 31  DGSTEESEINLESSNGIEEQEMEK--ECKVDFNKLDQIKKELDQIIKELDQIIKELDLII 88
           +    E E  L      +E   ++  E +     L +  K +++ I+ L+   +EL+  +
Sbjct: 811 EARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEEL 870

Query: 89  KESNQVIKEL--NLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELN 146
           +E    +++L   L   +  +++L+   +EL++ ++EL+  I++  + + EL   ++ L 
Sbjct: 871 EELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALE 930

Query: 147 Q 147
           +
Sbjct: 931 E 931


>gnl|CDD|240411 PTZ00419, PTZ00419, valyl-tRNA synthetase-like protein;
           Provisional.
          Length = 995

 Score = 32.3 bits (74), Expect = 0.084
 Identities = 32/101 (31%), Positives = 53/101 (52%), Gaps = 8/101 (7%)

Query: 48  EEQEMEKECKVDF--NKLDQIKKELDQII---KELDQIIKELDLIIKESNQVIKELNLII 102
           EE E+ K C  D   NK+  I   LD+ I   KEL ++ K+L  + K     +K+++  I
Sbjct: 898 EEAEVPKGCGFDVVDNKVI-IYLNLDEFIDLKKELAKLEKKLAKLQKSLESYLKKIS--I 954

Query: 103 KESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIK 143
                   + ++K  D+ + EL+  IK+L Q I+EL  ++K
Sbjct: 955 PNYEDKVPEDVRKLNDEKIDELNEEIKQLEQAIEELKSLLK 995



 Score = 30.7 bits (70), Expect = 0.31
 Identities = 22/74 (29%), Positives = 40/74 (54%), Gaps = 15/74 (20%)

Query: 88  IKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDL---------TIKELN-QIIKE 137
           + E   + KEL  + K     KL +++K L+  +K++ +          +++LN + I E
Sbjct: 921 LDEFIDLKKELAKLEK-----KLAKLQKSLESYLKKISIPNYEDKVPEDVRKLNDEKIDE 975

Query: 138 LNQIIKELNQIVEE 151
           LN+ IK+L Q +EE
Sbjct: 976 LNEEIKQLEQAIEE 989


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 31.8 bits (73), Expect = 0.11
 Identities = 16/83 (19%), Positives = 47/83 (56%), Gaps = 1/83 (1%)

Query: 69  ELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTI 128
           +L ++ +E  ++++EL+ +  E N++ KE+   +K   ++  +++  E+ ++ ++L    
Sbjct: 30  KLLELDEERRKLLRELEELQAERNELSKEIGRALKR-GEDDAEELIAEVKELKEKLKELE 88

Query: 129 KELNQIIKELNQIIKELNQIVEE 151
             L+++  EL+ ++  +  I  E
Sbjct: 89  AALDELEAELDTLLLTIPNIPHE 111



 Score = 28.3 bits (64), Expect = 1.6
 Identities = 16/75 (21%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 77  LDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIK 136
           +D+++ ELD   +E  ++++EL  +  E N+    +I + L +   + +  I E+ ++ +
Sbjct: 28  VDKLL-ELD---EERRKLLRELEELQAERNELS-KEIGRALKRGEDDAEELIAEVKELKE 82

Query: 137 ELNQIIKELNQIVEE 151
           +L ++   L+++  E
Sbjct: 83  KLKELEAALDELEAE 97


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 32.0 bits (73), Expect = 0.12
 Identities = 34/110 (30%), Positives = 64/110 (58%), Gaps = 6/110 (5%)

Query: 45  NGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKE 104
           NGIEE+  E E K    +L+++KK+L ++ K L + ++E   + +E+    +EL  + K 
Sbjct: 324 NGIEERIKELEEKE--ERLEELKKKLKELEKRL-EELEERHELYEEAKAKKEELERLKKR 380

Query: 105 SNQNKLDQIKKELDQMVK---ELDLTIKELNQIIKELNQIIKELNQIVEE 151
                 ++++KEL+++ K   E++  I ++   I EL + IKEL + +EE
Sbjct: 381 LTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEE 430



 Score = 30.4 bits (69), Expect = 0.36
 Identities = 29/109 (26%), Positives = 64/109 (58%), Gaps = 9/109 (8%)

Query: 48  EEQEMEKECKVDFNKLDQIKKELDQIIKELDQI----IKELDLIIKESNQVIKELNLIIK 103
           + +E++K+      KLD++++EL +++KEL+++    ++EL+  +KE      E   +  
Sbjct: 550 KLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKD 609

Query: 104 -----ESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQ 147
                E  + +L ++++ELD+  +EL  T K L ++ KEL ++ K+ ++
Sbjct: 610 AEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSE 658



 Score = 30.0 bits (68), Expect = 0.54
 Identities = 31/119 (26%), Positives = 66/119 (55%), Gaps = 2/119 (1%)

Query: 33  STEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESN 92
           S+E  E+  E     +E +  +E K +  +L++  + L+   ++L++ I+EL+  I+E  
Sbjct: 213 SSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELK 272

Query: 93  QVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
           + I+EL   +KE    +L +  +E  ++ +  +  + EL +I K L+++ +E+N I E 
Sbjct: 273 KEIEELEEKVKEL--KELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEER 329



 Score = 28.9 bits (65), Expect = 1.2
 Identities = 24/92 (26%), Positives = 58/92 (63%), Gaps = 1/92 (1%)

Query: 61  NKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQ-NKLDQIKKELDQ 119
             + + +KEL+++++E+++I  EL  + +E  ++ KE+  + +   +  +L++  + L+ 
Sbjct: 193 ELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEG 252

Query: 120 MVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
             ++L+  I+EL + I+EL + I+EL + V+E
Sbjct: 253 SKRKLEEKIRELEERIEELKKEIEELEEKVKE 284



 Score = 27.0 bits (60), Expect = 5.8
 Identities = 27/105 (25%), Positives = 59/105 (56%), Gaps = 4/105 (3%)

Query: 47  IEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESN 106
           +E ++ EKE + +  +L ++++ELD+  +EL +  K L+ + KE    ++EL     E  
Sbjct: 605 LELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKE----LEELEKKYSEEE 660

Query: 107 QNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
             +L +   EL + +  L   ++EL +  +E+ + +++L + +EE
Sbjct: 661 YEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEE 705



 Score = 26.6 bits (59), Expect = 5.9
 Identities = 36/127 (28%), Positives = 70/127 (55%), Gaps = 11/127 (8%)

Query: 34  TEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQ 93
            EE E   E    ++ +   K  K +  KL+++KK+L ++ K+LD++ +EL  ++KE  +
Sbjct: 524 AEEYEKLKEKLIKLKGEI--KSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEE 581

Query: 94  V----IKELNLIIKE-----SNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKE 144
           +    ++EL   +KE     +   +L   +KEL++  KEL    +EL++  +EL +  K 
Sbjct: 582 LGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKR 641

Query: 145 LNQIVEE 151
           L ++ +E
Sbjct: 642 LEELRKE 648


>gnl|CDD|218602 pfam05478, Prominin, Prominin.  The prominins are an emerging
           family of proteins that among the multispan membrane
           proteins display a novel topology. Mouse prominin and
           human prominin (mouse)-like 1 (PROML1) are predicted to
           contain five membrane spanning domains, with an
           N-terminal domain exposed to the extracellular space
           followed by four, alternating small cytoplasmic and
           large extracellular, loops and a cytoplasmic C-terminal
           domain. The exact function of prominin is unknown
           although in humans defects in PROM1, the gene coding for
           prominin, cause retinal degeneration.
          Length = 807

 Score = 31.9 bits (73), Expect = 0.12
 Identities = 21/95 (22%), Positives = 46/95 (48%), Gaps = 10/95 (10%)

Query: 59  DFNKLDQIKKELDQIIKELDQIIK-ELDLIIKESNQVIKELNLIIKESNQNKLDQIKKEL 117
           DFN+L  + + L+     L++++K +   I++E N   +E+   +K    + +  +K  L
Sbjct: 312 DFNQLPDVTEILEN----LEEVVKADFSSIVQEGNSTFEEIPSKVKNQTSSVVPDVKAAL 367

Query: 118 DQMVKEL-----DLTIKELNQIIKELNQIIKELNQ 147
           D +  ++     DL ++ L+ + + LN      N 
Sbjct: 368 DSLGTDIKSVAEDLPLQVLSVLSQILNNTQSSSNP 402



 Score = 27.7 bits (62), Expect = 3.2
 Identities = 18/69 (26%), Positives = 36/69 (52%), Gaps = 5/69 (7%)

Query: 57  KVDFNKLDQIKK---ELDQIIKELDQIIKELDLIIKESNQVIKELN--LIIKESNQNKLD 111
             D N+L  IK+   E+DQ ++ L   +  + L+  E+ + +++L    +   +  N L 
Sbjct: 555 SYDLNELLDIKQYTPEIDQELESLKVNLSTITLLTPEARRNLEDLASSGLDDINFSNFLK 614

Query: 112 QIKKELDQM 120
           QI+K L ++
Sbjct: 615 QIQKPLTEV 623


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 31.4 bits (72), Expect = 0.16
 Identities = 24/93 (25%), Positives = 48/93 (51%), Gaps = 5/93 (5%)

Query: 61  NKLDQIKKELDQIIKELDQIIKE--LDLIIKESNQVIKELNLIIKESNQNKLDQIKKELD 118
             L + K+E+ ++  EL++ +KE   +L   E   + +E  L   +     LD+ ++ L+
Sbjct: 51  EALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETL---DRKMESLDKKEENLE 107

Query: 119 QMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
           +  KEL    K L++  +EL ++I E  + +E 
Sbjct: 108 KKEKELSNKEKNLDEKEEELEELIAEQREELER 140



 Score = 29.1 bits (66), Expect = 0.82
 Identities = 28/110 (25%), Positives = 52/110 (47%), Gaps = 14/110 (12%)

Query: 31  DGSTEESEINLESSNGIEE--QEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLII 88
           +  T + E  LE+   + +   E+E+E K   N+L ++++ L Q         +E  L  
Sbjct: 44  EAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQ---------REETLDR 94

Query: 89  KESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKEL 138
           K  +   KE NL   E  + +L   +K LD+  +EL+  I E  + ++ +
Sbjct: 95  KMESLDKKEENL---EKKEKELSNKEKNLDEKEEELEELIAEQREELERI 141


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 31.2 bits (71), Expect = 0.16
 Identities = 23/105 (21%), Positives = 59/105 (56%), Gaps = 2/105 (1%)

Query: 47  IEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESN 106
           ++E+E  +E +    +  +     ++ IK+L++ ++ L+    E  + ++EL   I E  
Sbjct: 397 VKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREI-EKL 455

Query: 107 QNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
           +++L++ ++E+   V++ D  I+  ++ I+ L + ++E  + VEE
Sbjct: 456 ESELERFRREVRDKVRK-DREIRARDRRIERLEKELEEKKKRVEE 499



 Score = 27.0 bits (60), Expect = 5.0
 Identities = 16/96 (16%), Positives = 54/96 (56%), Gaps = 7/96 (7%)

Query: 62  KLDQIKKELDQIIKELDQIIKELDLIIKESNQV------IKELNLIIKESNQNKLDQIKK 115
           +L+++K+E++++  EL++  +E+   +++  ++      I+ L   ++E  + +++++++
Sbjct: 444 ELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKK-RVEELER 502

Query: 116 ELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
           +L ++ K   L +      +K + ++  E  +  EE
Sbjct: 503 KLAELRKMRKLELSGKGTPVKVVEKLTLEAIEEAEE 538


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 31.2 bits (71), Expect = 0.18
 Identities = 9/47 (19%), Positives = 25/47 (53%)

Query: 105 SNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
           ++  +L QI+KE+  + K++     +  ++ K+L  +  E+  +  +
Sbjct: 35  ADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQ 81



 Score = 30.5 bits (69), Expect = 0.34
 Identities = 12/70 (17%), Positives = 33/70 (47%), Gaps = 2/70 (2%)

Query: 57  KVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKE--SNQNKLDQIK 114
             D  +L QI+KE+  + K++ +   +   + K+   +  E+  +  +     + L +++
Sbjct: 34  AADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLR 93

Query: 115 KELDQMVKEL 124
           K++  +   L
Sbjct: 94  KQIADLNARL 103



 Score = 28.1 bits (63), Expect = 1.8
 Identities = 17/81 (20%), Positives = 36/81 (44%), Gaps = 15/81 (18%)

Query: 68  KELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQM---VKEL 124
           K+L QI KE+  + K++     +  ++ K+L              ++ E+  +   + E 
Sbjct: 38  KQLKQIQKEIAALEKKIREQQDQRAKLEKQLK------------SLETEIASLEAQLIET 85

Query: 125 DLTIKELNQIIKELNQIIKEL 145
              +K+L + I +LN  +  L
Sbjct: 86  ADDLKKLRKQIADLNARLNAL 106



 Score = 26.2 bits (58), Expect = 8.5
 Identities = 6/45 (13%), Positives = 22/45 (48%)

Query: 104 ESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQI 148
            + + K+ + + +  ++ K+L     E+  +  +L +   +L ++
Sbjct: 48  AALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKL 92


>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 559

 Score = 31.0 bits (71), Expect = 0.20
 Identities = 19/85 (22%), Positives = 49/85 (57%), Gaps = 6/85 (7%)

Query: 66  IKKELDQIIKELDQII---KELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVK 122
           ++ ++ + +K L++++   K+ +  I++    +++L L++K S + ++     E D++ K
Sbjct: 257 VEGDVAKALKILEELLDEGKDPNRFIEDLIYYLRDL-LLVKTSPELEILDESTENDELFK 315

Query: 123 EL--DLTIKELNQIIKELNQIIKEL 145
           EL   L I+ L ++I  LN   +++
Sbjct: 316 ELSEKLDIERLYRMIDILNDAQQQI 340


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 30.8 bits (70), Expect = 0.21
 Identities = 33/124 (26%), Positives = 54/124 (43%), Gaps = 3/124 (2%)

Query: 30  TDGSTEESEINLESSNGIEEQEMEKECKVD--FNKLDQIKKELDQIIKELDQIIKELDLI 87
            +G  E  E NLE     EE   +    ++    KL +  + L + I  L Q+  EL+L 
Sbjct: 145 LEGLKEGLEENLEGMKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLC 204

Query: 88  IKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQ 147
                +  ++    +      K  Q++ EL Q ++EL + I+ L     EL + I E  +
Sbjct: 205 DPLELEKARQELRSLSVKISEKRKQLE-ELQQELQELTIAIEALTNKKSELLEEIAEAEK 263

Query: 148 IVEE 151
           I EE
Sbjct: 264 IREE 267


>gnl|CDD|182106 PRK09841, PRK09841, cryptic autophosphorylating protein tyrosine
           kinase Etk; Provisional.
          Length = 726

 Score = 31.0 bits (70), Expect = 0.22
 Identities = 30/115 (26%), Positives = 60/115 (52%), Gaps = 11/115 (9%)

Query: 44  SNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNL--- 100
           +N   +Q + ++   D   L+ ++++L ++  ELDQ  ++L++  ++ + V  +LNL   
Sbjct: 250 ANNYLQQNIARQAAQDSQSLEFLQRQLPEVRSELDQAEEKLNVYRQQRDSV--DLNLEAK 307

Query: 101 -----IIKESNQ-NKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIV 149
                I+   NQ N+L   + E+ Q+ K+   T + L +  + L Q  K LN+ V
Sbjct: 308 AVLEQIVNVDNQLNELTFREAEISQLYKKDHPTYRALLEKRQTLEQERKRLNKRV 362


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 30.8 bits (70), Expect = 0.23
 Identities = 24/132 (18%), Positives = 56/132 (42%), Gaps = 1/132 (0%)

Query: 21   RMFDGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKE-LDQIIKELDQ 79
            R+      + +   +   I+      +E +    E K   +       E   ++I  L +
Sbjct: 876  RVELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKK 935

Query: 80   IIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELN 139
            ++  +DL    S + +K   L      ++KL +  +E + ++K+  + ++E N+   EL 
Sbjct: 936  LLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELK 995

Query: 140  QIIKELNQIVEE 151
               KEL ++ ++
Sbjct: 996  NFKKELAELSKQ 1007



 Score = 30.0 bits (68), Expect = 0.47
 Identities = 26/114 (22%), Positives = 53/114 (46%), Gaps = 7/114 (6%)

Query: 38   EINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKE 97
            +  L + +  E   +E     + NKL +++ +L +  +E + ++K+  ++++E N+   E
Sbjct: 934  KKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSE 993

Query: 98   LNLIIKE----SNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQ 147
            L    KE    S Q    Q   +    +KEL + + EL    K ++    EL+ 
Sbjct: 994  LKNFKKELAELSKQYGALQESTKQ---LKELPVEVAELQSASKIISSESTELSI 1044


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 30.8 bits (70), Expect = 0.26
 Identities = 21/98 (21%), Positives = 57/98 (58%), Gaps = 9/98 (9%)

Query: 61  NKLDQIKKELDQI---IKELDQIIKELDLIIKESNQVIKELN----LIIKESNQNKLDQI 113
           +KL +++  L+++   I EL++I+ E +   K+  ++  +++     +I   ++ K  ++
Sbjct: 306 DKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAK--KV 363

Query: 114 KKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
           K  ++++  E     +EL ++  EL++I+K  +++V+E
Sbjct: 364 KAAIEELQAEFVDNAEELAKLQDELDKIVKTKSELVKE 401



 Score = 28.1 bits (63), Expect = 2.1
 Identities = 23/78 (29%), Positives = 35/78 (44%), Gaps = 1/78 (1%)

Query: 74  IKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQ 133
           I+EL+Q I+ LD+ I    Q IK  N  I+E  + K  +          EL    K +  
Sbjct: 176 IRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRK-KNGENIARKQNKYDELVEEAKTIKA 234

Query: 134 IIKELNQIIKELNQIVEE 151
            I+EL   +  L   +E+
Sbjct: 235 EIEELTDELLNLVMDIED 252



 Score = 26.9 bits (60), Expect = 4.9
 Identities = 23/109 (21%), Positives = 43/109 (39%), Gaps = 8/109 (7%)

Query: 29  VTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLII 88
           +T    +  E+           +  +E   +FN+  +   EL   I    Q +  L    
Sbjct: 301 ITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKA 360

Query: 89  KESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKE 137
           K+    I+EL        Q +     +EL ++  ELD  +K  ++++KE
Sbjct: 361 KKVKAAIEEL--------QAEFVDNAEELAKLQDELDKIVKTKSELVKE 401


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 30.8 bits (70), Expect = 0.27
 Identities = 22/112 (19%), Positives = 57/112 (50%), Gaps = 10/112 (8%)

Query: 50  QEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQ-- 107
           +E  +E + +  +L++  +EL + I+ L + ++E + +++E  Q++ EL    +E  +  
Sbjct: 308 RERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKL 367

Query: 108 --------NKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
                      + +++EL ++  EL     EL ++ +E+  + + L ++ E 
Sbjct: 368 SALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSER 419



 Score = 30.5 bits (69), Expect = 0.34
 Identities = 23/116 (19%), Positives = 63/116 (54%), Gaps = 1/116 (0%)

Query: 36  ESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVI 95
           E E+          ++  +E + +  +L++   EL++ ++EL++ ++EL   ++E     
Sbjct: 813 ERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEK 872

Query: 96  KELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
           +EL   +KE  + + +++++EL ++  EL    +E+ ++ + L ++  +L ++  E
Sbjct: 873 EELEDELKE-LEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVE 927



 Score = 30.1 bits (68), Expect = 0.48
 Identities = 22/107 (20%), Positives = 57/107 (53%), Gaps = 3/107 (2%)

Query: 48  EEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELN---LIIKE 104
             +E  +E +    +L+Q+  EL++  +EL++ +  L   ++E  + ++E          
Sbjct: 334 ALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELA 393

Query: 105 SNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
             +N+L+++K+E++ + + L+   + L  + +EL ++  EL ++  E
Sbjct: 394 EIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTE 440



 Score = 29.7 bits (67), Expect = 0.57
 Identities = 30/118 (25%), Positives = 63/118 (53%), Gaps = 2/118 (1%)

Query: 36  ESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVI 95
           E E+        E QE  +E + +   L++   +L + I+EL++  + L   ++E  + +
Sbjct: 743 EEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEEL 802

Query: 96  KELNLII--KESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
           +E    +   E     L+Q ++ L+Q ++EL+  I+EL + + EL + ++EL + +EE
Sbjct: 803 EEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEE 860



 Score = 28.1 bits (63), Expect = 2.1
 Identities = 24/111 (21%), Positives = 60/111 (54%), Gaps = 6/111 (5%)

Query: 42  ESSNGIEEQEMEKE-CKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNL 100
           E    +EE E + E  K +   L++  ++L   ++EL++ ++EL+  ++E  + ++EL  
Sbjct: 706 ELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEEL-- 763

Query: 101 IIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
              E     L++   +L + ++EL+   + L + ++EL + ++E  + ++ 
Sbjct: 764 ---EEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDA 811


>gnl|CDD|189014 cd09607, M3B_PepF_2, Peptidase family M3B Oligopeptidase F (PepF). 
           Peptidase family M3B Oligopeptidase F (PepF;
           Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and
           includes oligoendopeptidase F from Lactococcus lactis.
           This enzyme hydrolyzes peptides containing between 7 and
           17 amino acids with fairly broad specificity. The PepF
           gene is duplicated in L. lactis on the plasmid that
           bears it, while a shortened second copy is found in
           Bacillus subtilis. Most bacterial PepFs are cytoplasmic
           endopeptidases; however, the PepF Bacillus
           amyloliquefaciens oligopeptidase is a secreted protein
           and may facilitate the process of sporulation.
           Specifically, the yjbG gene encoding the homolog of the
           PepF1 and PepF2 oligoendopeptidases of Lactococcus
           lactis has been identified in Bacillus subtilis as an
           inhibitor of sporulation initiation when over expressed
           from a multicopy plasmid.
          Length = 581

 Score = 30.6 bits (70), Expect = 0.29
 Identities = 10/45 (22%), Positives = 23/45 (51%)

Query: 107 QNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
           Q  L ++K+ +D++   LD  +      ++ L  +I  L +++E 
Sbjct: 16  QEDLKKLKQLIDELKSLLDALLPSDEDAVETLEALITLLQELLEL 60


>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 366 and 597 amino
           acids in length. There is a single completely conserved
           residue R that may be functionally important.
          Length = 297

 Score = 30.3 bits (69), Expect = 0.29
 Identities = 14/70 (20%), Positives = 25/70 (35%), Gaps = 9/70 (12%)

Query: 84  LDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQM----VKELDLTIKELNQIIKELN 139
           L+L I E     KE++ ++ E  Q +    + ++       +  L   I  L   I E  
Sbjct: 101 LELKIFE-----KEIDTVLLEIQQEEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQ 155

Query: 140 QIIKELNQIV 149
             +    Q  
Sbjct: 156 AEVNAAYQEA 165


>gnl|CDD|236084 PRK07739, flgK, flagellar hook-associated protein FlgK; Validated.
          Length = 507

 Score = 30.4 bits (69), Expect = 0.30
 Identities = 13/42 (30%), Positives = 24/42 (57%), Gaps = 6/42 (14%)

Query: 108 NKLDQIKKELDQMVKELDLTIKELNQI---IKELNQIIKELN 146
             L  I+ +L     E+D+T+KE+N +   I +LN+ I ++ 
Sbjct: 160 QSLTDIQNDLKS---EIDVTVKEINSLASQISDLNKQIAKVE 198


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 30.4 bits (69), Expect = 0.34
 Identities = 19/93 (20%), Positives = 44/93 (47%), Gaps = 7/93 (7%)

Query: 61  NKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKE--LNLIIKESNQNKLDQIKKELD 118
           N+L++ +K L     EL+++ K+LD  +     +  E    L++K  +    ++  + + 
Sbjct: 105 NQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKLLDAELEEEKAQRVK 164

Query: 119 QMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
           ++ +E DL  +   Q I     + + + +I  E
Sbjct: 165 KIEEEADLEAERKAQNI-----LAQAMQRIASE 192


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 30.1 bits (68), Expect = 0.38
 Identities = 29/140 (20%), Positives = 56/140 (40%), Gaps = 20/140 (14%)

Query: 20  FRMFDGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQ 79
           FR+           TE+ E    ++  +   EM+ E        +   KE +  ++ L +
Sbjct: 226 FRLDSWDKGSRFTRTEKVEQAATAAAEVPPAEMDTE--------EDRTKEREAELEALQE 277

Query: 80  IIKELDLIIKESNQVIKELNLIIKESNQNKLDQ------------IKKELDQMVKELDLT 127
            I EL+  I+E    IK L   IK+ N+                 IKK   +++ + +  
Sbjct: 278 QIDELESSIEEVLSEIKALASKIKQVNEELTTVRQENEELEEEYKIKKRTVELLPDAENN 337

Query: 128 IKELNQIIKELNQIIKELNQ 147
           + +L  ++   ++ + EL Q
Sbjct: 338 VAKLQALVVASSERLLELAQ 357



 Score = 27.8 bits (62), Expect = 2.9
 Identities = 28/118 (23%), Positives = 58/118 (49%), Gaps = 16/118 (13%)

Query: 47  IEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELN------- 99
            E   + +E +    +   IKK   +++ + +  + +L  ++  S++ + EL        
Sbjct: 305 EELTTVRQENEELEEEYK-IKKRTVELLPDAENNVAKLQALVVASSERLLELAQQWEAHR 363

Query: 100 --LI-----IKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVE 150
             LI     +KE N+NK D+ +++LD+ +K+L   I+EL   ++   Q+ K+L    E
Sbjct: 364 TPLIDEYRSLKEKNRNKEDETQRQLDE-IKKLRNKIEELESELQTKEQLYKQLLDEYE 420


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 30.0 bits (68), Expect = 0.43
 Identities = 20/93 (21%), Positives = 40/93 (43%), Gaps = 3/93 (3%)

Query: 62  KLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQI--KKELDQ 119
           +L Q ++ L   + +  Q  ++L     E  +  +          Q +LD    + E   
Sbjct: 608 RLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQQS 667

Query: 120 MVKELDLTIKE-LNQIIKELNQIIKELNQIVEE 151
           +  +L+L I E   Q   +L Q+  +L Q++E+
Sbjct: 668 LKDKLELAIAERKQQAETQLRQLDAQLKQLLEQ 700


>gnl|CDD|214710 smart00533, MUTSd, DNA-binding domain of DNA mismatch repair MUTS
           family. 
          Length = 308

 Score = 30.0 bits (68), Expect = 0.45
 Identities = 19/82 (23%), Positives = 44/82 (53%), Gaps = 6/82 (7%)

Query: 68  KELDQIIKELDQIIKELDL--IIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELD 125
           KE+ Q+++ LD  +  L L  I++   ++++ L  ++ + +  +++      D    ELD
Sbjct: 92  KEIRQLLESLDGPLLGLLLKVILEPLLELLELLLELLNDDDPLEVNDGGLIKDGFDPELD 151

Query: 126 LTIKELNQIIKELNQIIKELNQ 147
               EL + ++EL + ++EL +
Sbjct: 152 ----ELREKLEELEEELEELLK 169



 Score = 27.3 bits (61), Expect = 3.3
 Identities = 12/62 (19%), Positives = 38/62 (61%)

Query: 70  LDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIK 129
           L  I++ L ++++ L  ++ + + +      +IK+    +LD+++++L+++ +EL+  +K
Sbjct: 110 LKVILEPLLELLELLLELLNDDDPLEVNDGGLIKDGFDPELDELREKLEELEEELEELLK 169

Query: 130 EL 131
           + 
Sbjct: 170 KE 171


>gnl|CDD|217011 pfam02388, FemAB, FemAB family.  The femAB operon codes for two
           nearly identical approximately 50-kDa proteins involved
           in the formation of the Staphylococcal pentaglycine
           interpeptide bridge in peptidoglycan. These proteins are
           also considered as a factor influencing the level of
           methicillin resistance.
          Length = 408

 Score = 29.9 bits (68), Expect = 0.45
 Identities = 13/66 (19%), Positives = 36/66 (54%), Gaps = 4/66 (6%)

Query: 77  LDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIK 136
           LD++ ++L+ + K+  ++ ++L        +NKL +++++L  + K +D    E  ++I 
Sbjct: 244 LDELQEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRID----EAKELIA 299

Query: 137 ELNQII 142
           +    +
Sbjct: 300 KYGNEV 305


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 29.6 bits (67), Expect = 0.53
 Identities = 17/89 (19%), Positives = 51/89 (57%), Gaps = 1/89 (1%)

Query: 63  LDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVK 122
              + KEL+ +     ++    D + +E  Q +K+L   +++ +  +LD+ K++L ++++
Sbjct: 160 YKLLMKELELLNSIKPKLRDRKDALEEELRQ-LKQLEDELEDCDPTELDRAKEKLKKLLQ 218

Query: 123 ELDLTIKELNQIIKELNQIIKELNQIVEE 151
           E+ + +K+L ++ +EL ++  ++  +  +
Sbjct: 219 EIMIKVKKLEELEEELQELESKIEDLTNK 247


>gnl|CDD|197699 smart00397, t_SNARE, Helical region found in SNAREs.  All
           alpha-helical motifs that form twisted and parallel
           four-helix bundles in target soluble
           N-ethylmaleimide-sensitive factor (NSF) attachment
           protein (SNAP) receptor proteins. This motif found in
           "Q-SNAREs".
          Length = 66

 Score = 27.9 bits (63), Expect = 0.53
 Identities = 14/67 (20%), Positives = 33/67 (49%), Gaps = 1/67 (1%)

Query: 68  KELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLT 127
           ++     +E D+ +++L+  I+E  Q+  ++   ++E  +  LD+I+  +D     L   
Sbjct: 1   QQALAREEERDEELEQLEKSIQELKQIFLDMGTELEEQGEQ-LDRIEDNVDDADVNLKKA 59

Query: 128 IKELNQI 134
            K L + 
Sbjct: 60  NKRLKKA 66



 Score = 24.9 bits (55), Expect = 8.1
 Identities = 10/37 (27%), Positives = 21/37 (56%)

Query: 115 KELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
           ++     +E D  +++L + I+EL QI  ++   +EE
Sbjct: 1   QQALAREEERDEELEQLEKSIQELKQIFLDMGTELEE 37


>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
          Length = 619

 Score = 29.8 bits (68), Expect = 0.55
 Identities = 19/125 (15%), Positives = 55/125 (44%), Gaps = 5/125 (4%)

Query: 23  FDGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIK 82
            DG              +       +E E E+E + D +      +  ++++++   + K
Sbjct: 170 IDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAK 229

Query: 83  ELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQII 142
           +   + K   + ++      + +   K  +++++L + +K L LT K++++++++L  I 
Sbjct: 230 QYKKLRKAQEKKVEG-----RLAQHKKYAKLREKLKEELKSLRLTSKQIDELVEQLRDIN 284

Query: 143 KELNQ 147
           K +  
Sbjct: 285 KRVRG 289


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 29.6 bits (67), Expect = 0.67
 Identities = 32/128 (25%), Positives = 72/128 (56%), Gaps = 7/128 (5%)

Query: 29  VTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLII 88
           +T GS + +   LE    IEE  +E++ +    K+ +++K L ++ KEL+++ +EL+ + 
Sbjct: 661 ITGGSAKTNSSILERRREIEE--LEEKIEELEEKIAELEKALAELRKELEELEEELEQLR 718

Query: 89  KESNQVIKELN-----LIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIK 143
           KE  ++ ++++     L   E+   +L++   +L + + EL+  I+EL + ++E  + + 
Sbjct: 719 KELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELA 778

Query: 144 ELNQIVEE 151
           E    +EE
Sbjct: 779 EAEAEIEE 786



 Score = 27.7 bits (62), Expect = 2.5
 Identities = 23/96 (23%), Positives = 54/96 (56%), Gaps = 4/96 (4%)

Query: 59  DFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELD 118
              +L +  +EL + +KE ++ ++EL   ++E  + ++EL L + E  + ++++++KEL 
Sbjct: 233 RLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEE-EIEELQKELY 291

Query: 119 QMVKE---LDLTIKELNQIIKELNQIIKELNQIVEE 151
            +  E   L+   + L + +  L + ++EL   +EE
Sbjct: 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEE 327



 Score = 27.7 bits (62), Expect = 2.8
 Identities = 24/97 (24%), Positives = 49/97 (50%), Gaps = 5/97 (5%)

Query: 55  ECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIK 114
             +     L++  +EL + I+ L   I+EL+ +I+E    ++ L L  + S +  L  ++
Sbjct: 835 ATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEAL-LNERASLEEALALLR 893

Query: 115 KELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
            EL+    EL   ++EL     EL + ++EL + + +
Sbjct: 894 SELE----ELSEELRELESKRSELRRELEELREKLAQ 926



 Score = 27.7 bits (62), Expect = 3.2
 Identities = 25/120 (20%), Positives = 56/120 (46%), Gaps = 2/120 (1%)

Query: 34  TEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQ 93
             E  I   S    E +   +E +    + ++   E +  I+EL+  I++L   +K   +
Sbjct: 744 QLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALRE 803

Query: 94  VIKELNLIIKESNQN--KLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
            + EL   +   N+    L +  + L++ +   +  +++L + I+EL++ I+ L   +EE
Sbjct: 804 ALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEE 863


>gnl|CDD|236364 PRK09034, PRK09034, aspartate kinase; Reviewed.
          Length = 454

 Score = 29.4 bits (67), Expect = 0.68
 Identities = 13/63 (20%), Positives = 25/63 (39%), Gaps = 6/63 (9%)

Query: 81  IKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQ 140
            K  DL+I  +  V+   +       ++  + I     ++ KEL L    L +I + L  
Sbjct: 46  TKVTDLLILYAEAVLAGEDY------EDIFEAIIARYAEIAKELGLDADILEKIEEILEH 99

Query: 141 IIK 143
           +  
Sbjct: 100 LAN 102


>gnl|CDD|147975 pfam06103, DUF948, Bacterial protein of unknown function (DUF948). 
           This family consists of bacterial sequences several of
           which are thought to be general stress proteins.
          Length = 90

 Score = 27.9 bits (63), Expect = 0.72
 Identities = 20/74 (27%), Positives = 40/74 (54%), Gaps = 8/74 (10%)

Query: 63  LDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELD---Q 119
           L  + K LD++ K L  + K++D I  E+  ++ + N+++++ N        + LD   Q
Sbjct: 21  LKSLSKTLDEVNKTLAGLTKQVDGITTETEDLLAKTNVLLEDVNGK-----VETLDPVFQ 75

Query: 120 MVKELDLTIKELNQ 133
            V +L  ++ +LNQ
Sbjct: 76  AVADLGESVSDLNQ 89



 Score = 26.4 bits (59), Expect = 3.1
 Identities = 11/42 (26%), Positives = 24/42 (57%)

Query: 110 LDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
           LD++ K L  + K++D    E   ++ + N +++++N  VE 
Sbjct: 28  LDEVNKTLAGLTKQVDGITTETEDLLAKTNVLLEDVNGKVET 69



 Score = 25.6 bits (57), Expect = 5.1
 Identities = 11/58 (18%), Positives = 32/58 (55%), Gaps = 1/58 (1%)

Query: 94  VIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
            +K L+  + E N+  L  + K++D +  E +  + + N +++++N  ++ L+ + + 
Sbjct: 20  TLKSLSKTLDEVNKT-LAGLTKQVDGITTETEDLLAKTNVLLEDVNGKVETLDPVFQA 76


>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
           endonuclease-like fold, RmuC family [General function
           prediction only].
          Length = 448

 Score = 29.3 bits (66), Expect = 0.90
 Identities = 21/117 (17%), Positives = 47/117 (40%), Gaps = 7/117 (5%)

Query: 34  TEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQ 93
                +NLE       QE+ +       +L Q +++L  +I+ L Q+  E   +  E  +
Sbjct: 62  RSLQALNLE-----LIQELNELKARLQQQLLQSREQLQLLIESLAQLSSEFQELANEIFE 116

Query: 94  VIKELNLIIKESNQNKLDQIKKELDQMVKEL--DLTIKELNQIIKELNQIIKELNQI 148
            +      + + N  +L +  +E+ +  +E       +   +    L +I + L +I
Sbjct: 117 ELNRRLAELNQQNLKQLLKPLREVLEKFREQLEQRIHESAEERSTLLEEIDRLLGEI 173


>gnl|CDD|227881 COG5594, COG5594, Uncharacterized integral membrane protein
           [Function unknown].
          Length = 827

 Score = 29.3 bits (66), Expect = 0.93
 Identities = 27/113 (23%), Positives = 53/113 (46%), Gaps = 15/113 (13%)

Query: 54  KECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKE--SNQVIKELNLIIKES-----N 106
           K  ++D + L +    L ++ KE D+I+K+L+  + E  +  + K      K+S     +
Sbjct: 234 KVGEIDSDVLCRDLGTLQELYKERDKILKKLEKALNELLNKLLKKSHLKTNKKSGKLTPS 293

Query: 107 QNKLDQIKKELDQMVKELDLTIKELNQII--KELNQI------IKELNQIVEE 151
           + K  +I  E     K     IK+LN+ I  K+++ I      + +L+  +E 
Sbjct: 294 RKKEFEILPEYVPDKKRPKHRIKKLNKGIFGKKVDAIDYYSAKLTKLDAEIEN 346


>gnl|CDD|214860 smart00856, PMEI, Plant invertase/pectin methylesterase inhibitor. 
           This domain inhibits pectin methylesterases (PMEs) and
           invertases through formation of a non-covalent 1:1
           complex. It has been implicated in the regulation of
           fruit development, carbohydrate metabolism and cell wall
           extension. It may also be involved in inhibiting
           microbial pathogen PMEs. It has been observed that it is
           often expressed as a large inactive preprotein. It is
           also found at the N-termini of PMEs predicted from DNA
           sequences, suggesting that both PMEs and their
           inhibitors are expressed as a single polyprotein and
           subsequently processed. It has two disulphide bridges
           and is mainly alpha-helical.
          Length = 148

 Score = 28.5 bits (64), Expect = 0.94
 Identities = 8/57 (14%), Positives = 28/57 (49%), Gaps = 3/57 (5%)

Query: 84  LDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQ 140
           + + + ++ + +  ++ ++K++   +L   K  L   ++  D  +  L + ++EL  
Sbjct: 43  IKVALSQATKTLSFISKLLKKTKDPRL---KAALKDCLELYDDAVDSLEKALEELKS 96


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 28.8 bits (65), Expect = 1.1
 Identities = 17/75 (22%), Positives = 40/75 (53%), Gaps = 1/75 (1%)

Query: 73  IIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELN 132
           ++ E+ ++ K L  +I E+  +  E N  +++  Q+   QI++ +    +EL   I++L 
Sbjct: 64  LVAEVKELRKRLAKLISENEALKAE-NERLQKREQSIDQQIQQAVQSETQELTKEIEQLK 122

Query: 133 QIIKELNQIIKELNQ 147
              ++L  +I +L +
Sbjct: 123 SERQQLQGLIDQLQR 137


>gnl|CDD|237076 PRK12362, PRK12362, germination protease; Provisional.
          Length = 318

 Score = 28.7 bits (65), Expect = 1.1
 Identities = 23/66 (34%), Positives = 37/66 (56%), Gaps = 2/66 (3%)

Query: 78  DQIIKELDLIIKESNQVIKELNLIIKESNQN-KLDQIKKELDQMVKELDLTIKELNQIIK 136
           D I   LD +I +S +   E   ++K  NQ  K   IK+ L   V  L +T KE++++I+
Sbjct: 240 DTIDLVLDELINQSEEG-SEFYNMLKSLNQEEKQQLIKEVLSPYVGNLIVTPKEIDELIE 298

Query: 137 ELNQII 142
            L++II
Sbjct: 299 NLSKII 304


>gnl|CDD|223716 COG0643, CheA, Chemotaxis protein histidine kinase and related
           kinases [Cell motility and secretion / Signal
           transduction mechanisms].
          Length = 716

 Score = 28.8 bits (65), Expect = 1.1
 Identities = 10/33 (30%), Positives = 24/33 (72%)

Query: 106 NQNKLDQIKKELDQMVKELDLTIKELNQIIKEL 138
            +++L Q+  ELD++ +ELD  +++L+++  +L
Sbjct: 348 TRSRLSQLASELDEVREELDEALRQLSRLTTDL 380


>gnl|CDD|236679 PRK10387, PRK10387, glutaredoxin 2; Provisional.
          Length = 210

 Score = 28.3 bits (64), Expect = 1.2
 Identities = 11/36 (30%), Positives = 19/36 (52%)

Query: 63  LDQIKKELDQIIKELDQIIKELDLIIKESNQVIKEL 98
            D +      +IKE++  ++ LD +I + N V  EL
Sbjct: 133 FDALLAHTPGLIKEINADLRALDPLIVKPNAVNGEL 168


>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
          Length = 621

 Score = 28.9 bits (65), Expect = 1.2
 Identities = 24/125 (19%), Positives = 63/125 (50%), Gaps = 9/125 (7%)

Query: 24  DGLTPVT---DGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQI 80
           +G+  VT    G+ +E  I ++ ++ + + E+E+  K       + K++ ++I      +
Sbjct: 476 NGILSVTAKDKGTGKEQSITIQGASTLPKDEVERMVKEAEKNAAEDKEKREKI-----DL 530

Query: 81  IKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDL-TIKELNQIIKELN 139
             + + +  ++ + +KEL   I E  + K++ + K+L Q ++  +  +IK L + +++  
Sbjct: 531 KNQAESLCYQAEKQLKELKDKISEEKKEKIENLIKKLRQALQNDNYESIKSLLEELQKAL 590

Query: 140 QIIKE 144
             I +
Sbjct: 591 MEIGK 595


>gnl|CDD|203795 pfam07889, DUF1664, Protein of unknown function (DUF1664).  The
           members of this family are hypothetical plant proteins
           of unknown function. The region featured in this family
           is approximately 100 amino acids long.
          Length = 125

 Score = 27.8 bits (62), Expect = 1.3
 Identities = 15/83 (18%), Positives = 34/83 (40%), Gaps = 4/83 (4%)

Query: 52  MEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLD 111
           M         +L+++ + +    K L Q I  LD  + E  ++ +     + E  +  L 
Sbjct: 40  MSDAVASVGKQLEKVSESIAAAKKHLSQRIDNLDDKLDEQKEISESTRDEVTEI-REDLS 98

Query: 112 QIKKELD---QMVKELDLTIKEL 131
            I +++    Q V+ L+  +  +
Sbjct: 99  NIGEDVKSVHQAVEGLEGKLDSI 121


>gnl|CDD|205914 pfam13740, ACT_6, ACT domain.  ACT domains bind to amino acids and
           regulate associated enzyme domains.
          Length = 76

 Score = 27.1 bits (61), Expect = 1.3
 Identities = 13/59 (22%), Positives = 28/59 (47%), Gaps = 3/59 (5%)

Query: 73  IIKELDQIIKELDLIIKESNQVI--KELNLIIK-ESNQNKLDQIKKELDQMVKELDLTI 128
           +   L +++ E    I +S Q +   EL L +      + + +++K L  + +EL L +
Sbjct: 15  LTASLTRVLAEHGCNILDSRQAVIGGELTLGMLVSGPWDAIARLEKALLFLAQELGLQV 73


>gnl|CDD|224428 COG1511, COG1511, Predicted membrane protein [Function unknown].
          Length = 780

 Score = 28.6 bits (64), Expect = 1.4
 Identities = 24/120 (20%), Positives = 49/120 (40%), Gaps = 8/120 (6%)

Query: 25  GLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKEL 84
           GLT +TDG  +        + GI E       K    +L++   E    + EL+  +++L
Sbjct: 228 GLTTLTDGLNQLDSGLGTLAAGIGE------LKQGAEQLNEGIGEFSSGLSELNSGVQDL 281

Query: 85  DLIIKESNQVIKEL--NLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQII 142
              + + NQ I  L   L + +S  ++   +++ L Q+ + L        +  +     +
Sbjct: 282 AAGVPQLNQGISALAAGLSLPDSLGDQFSSLQEALTQIAQGLKQKTSSSLEAAQGSLSSL 341


>gnl|CDD|191096 pfam04799, Fzo_mitofusin, fzo-like conserved region.  Family of
           putative transmembrane GTPase. The fzo protein is a
           mediator of mitochondrial fusion. This conserved region
           is also found in the human mitofusin protein.
          Length = 171

 Score = 28.1 bits (63), Expect = 1.5
 Identities = 18/57 (31%), Positives = 31/57 (54%), Gaps = 4/57 (7%)

Query: 96  KELNLIIKESNQNKLDQIKKELDQMVKEL----DLTIKELNQIIKELNQIIKELNQI 148
           ++L LI+  ++ N   Q+++EL      L    D+T K+L + I EL + I+ L  I
Sbjct: 86  EKLQLIVSLTSSNCSHQVQQELASTFARLCQQVDVTSKDLEEEIAELTKEIQRLETI 142


>gnl|CDD|182747 PRK10807, PRK10807, paraquat-inducible protein B; Provisional.
          Length = 547

 Score = 28.5 bits (64), Expect = 1.6
 Identities = 22/103 (21%), Positives = 51/103 (49%), Gaps = 21/103 (20%)

Query: 63  LDQIKK-ELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMV 121
           LD+I    L+ +I++    + E    ++E    +  LN I   S+Q+ + Q+  ++ + +
Sbjct: 426 LDKINNLPLNPMIEQATSTLSESQRTMRELQTTLDSLNKIT--SSQS-MQQLPADMQKTL 482

Query: 122 KELDLTIK-----------------ELNQIIKELNQIIKELNQ 147
           +EL+ +++                  L+Q+++EL  ++K LN+
Sbjct: 483 RELNRSMQGFQPGSPAYNKMVADMQRLDQVLRELQPVLKTLNE 525



 Score = 26.5 bits (59), Expect = 6.3
 Identities = 12/40 (30%), Positives = 23/40 (57%), Gaps = 1/40 (2%)

Query: 110 LDQIKK-ELDQMVKELDLTIKELNQIIKELNQIIKELNQI 148
           LD+I    L+ M+++   T+ E  + ++EL   +  LN+I
Sbjct: 426 LDKINNLPLNPMIEQATSTLSESQRTMRELQTTLDSLNKI 465


>gnl|CDD|217372 pfam03105, SPX, SPX domain.  We have named this region the SPX
           domain after (SYG1, Pho81 and XPR1). This 180 residue
           length domain is found at the amino terminus of a
           variety of proteins. In the yeast protein SYG1, the
           N-terminus directly binds to the G- protein beta subunit
           and inhibits transduction of the mating pheromone
           signal. This finding suggests that all the members of
           this family are involved in G-protein associated signal
           transduction. The N-termini of several proteins involved
           in the regulation of phosphate transport, including the
           putative phosphate level sensors PHO81 from
           Saccharomyces cerevisiae and NUC-2 from Neurospora
           crassa, are also members of this family. The SPX domain
           of S. cerevisiae low-affinity phosphate transporters
           Pho87 and Pho90 auto-regulates uptake and prevents
           efflux. This SPX dependent inhibition is mediated by the
           physical interaction with Spl2 NUC-2 contains several
           ankyrin repeats pfam00023. Several members of this
           family are annotated as XPR1 proteins: the xenotropic
           and polytropic retrovirus receptor confers
           susceptibility to infection with murine leukaemia
           viruses (MLV). The similarity between SYG1, phosphate
           regulators and XPR1 sequences has been previously noted,
           as has the additional similarity to several predicted
           proteins, of unknown function, from Drosophila
           melanogaster, Arabidopsis thaliana, Caenorhabditis
           elegans, Schizosaccharomyces pombe, and Saccharomyces
           cerevisiae. In addition, given the similarities between
           XPR1 and SYG1 and phosphate regulatory proteins, it has
           been proposed that XPR1 might be involved in G-protein
           associated signal transduction and may itself function
           as a phosphate sensor.
          Length = 176

 Score = 27.8 bits (62), Expect = 1.6
 Identities = 24/126 (19%), Positives = 55/126 (43%), Gaps = 13/126 (10%)

Query: 24  DGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQ-IKKELDQIIKELDQIIK 82
             LT   DG  E S +  E     +E E  +    +  K++   K++  ++++ L+++ K
Sbjct: 37  LSLTTNEDGEDESSNLLNEEELEEDESEFFEALDKELEKVNSFYKEKEAELLERLEELEK 96

Query: 83  ELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQII 142
           +L+ + +  ++     +    ++     +++KK L            EL + ++ L   I
Sbjct: 97  QLEELEERRDETSSAKSSPSDKNLLKAFEELKKAL-----------LELYRELQLLKSYI 145

Query: 143 KELNQI 148
            ELN+ 
Sbjct: 146 -ELNRT 150


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 28.5 bits (64), Expect = 1.7
 Identities = 22/67 (32%), Positives = 37/67 (55%), Gaps = 5/67 (7%)

Query: 71  DQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKE 130
           D +++EL  I  +L   IKE N      +LI   S + +LDQ+ K+L ++  E +   +E
Sbjct: 135 DPVVQELVSIFNDLIDSIKEDNLKDDLESLI--ASAKEELDQLSKKLAELKAEEE---EE 189

Query: 131 LNQIIKE 137
           L + +KE
Sbjct: 190 LERALKE 196



 Score = 27.7 bits (62), Expect = 3.0
 Identities = 19/106 (17%), Positives = 45/106 (42%), Gaps = 17/106 (16%)

Query: 53  EKECKVDFNKL-DQIKKELDQIIKELDQIIKELD----LIIKES---------NQVIKEL 98
           E   K D   L    K+ELDQ+ K+L ++  E +      +KE           +++  L
Sbjct: 154 EDNLKDDLESLIASAKEELDQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLARL 213

Query: 99  NLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKE 144
                   +    + ++E +++ K+ +   ++L Q ++   +  ++
Sbjct: 214 ESKEAALEKQLRLEFEREKEELRKKYE---EKLRQELERQAEAHEQ 256


>gnl|CDD|234403 TIGR03930, WXG100_ESAT6, WXG100 family type VII secretion target.
           Members of this protein family include secretion targets
           for the two main variants of type VII secretion systems
           (T7SS), one found in the Actinobacteria, one found in
           the Firmicutes. This model was derived through iteration
           from pfam06013. The best characterized member of this
           family is ESAT-6 from Mycobacterium tuberculosis.
           Members of this family usually are ~100 amino acids in
           length but occasionally have a long C-terminal
           extension.
          Length = 90

 Score = 27.2 bits (61), Expect = 1.7
 Identities = 18/85 (21%), Positives = 32/85 (37%), Gaps = 10/85 (11%)

Query: 57  KVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKE 116
           KVD  +L Q      +  +EL Q+++ L          I  L    + +  +      ++
Sbjct: 3   KVDPEELRQAAARFGKTAQELQQLLQRLR-------SEIDALQGTWEGAAADAFQAEFQQ 55

Query: 117 LDQMVKELDLTIKELNQIIKELNQI 141
            D   + L   I  L +I + L Q 
Sbjct: 56  WDPAAERL---INALQEISEALRQA 77


>gnl|CDD|227109 COG4768, COG4768, Uncharacterized protein containing a divergent
           version of the methyl-accepting chemotaxis-like domain
           [General function prediction only].
          Length = 139

 Score = 27.8 bits (62), Expect = 1.7
 Identities = 21/78 (26%), Positives = 42/78 (53%), Gaps = 2/78 (2%)

Query: 63  LDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVK 122
           L ++ K LD++ K L  +  ++D I  E+ +++ + N  + E  Q K+  +    D  VK
Sbjct: 26  LKKVSKTLDEVAKTLKGLTSQVDGITHETEELLHKTN-TLAEDVQGKVATLDPVFDA-VK 83

Query: 123 ELDLTIKELNQIIKELNQ 140
           +L  ++ +LNQ ++ L  
Sbjct: 84  DLGQSVSDLNQSVRHLAT 101


>gnl|CDD|233894 TIGR02492, flgK_ends, flagellar hook-associated protein FlgK.  The
           flagellar hook-associated protein FlgK of bacterial
           flagella has conserved N- and C-terminal domains. The
           central region is highly variable in length and
           sequence, and often contains substantial runs of
           low-complexity sequence. This model is built from an
           alignment of FlgK sequences with the central region
           excised. Note that several other proteins of the
           flagellar apparatus also are homologous in the N- and
           C-terminal regions to FlgK, but are excluded from this
           model [Cellular processes, Chemotaxis and motility].
          Length = 324

 Score = 28.1 bits (63), Expect = 1.7
 Identities = 9/41 (21%), Positives = 25/41 (60%), Gaps = 3/41 (7%)

Query: 108 NKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQI 148
           ++L  ++K ++    E+   + E+N ++K++  + KE+ Q+
Sbjct: 148 SELQDLRKNINA---EIKSAVTEINSLLKQIASLNKEIKQV 185


>gnl|CDD|203375 pfam06013, WXG100, Proteins of 100 residues with WXG.  ESAT-6 is a
           small protein appears to be of fundamental importance in
           virulence and protective immunity in Mycobacterium
           tuberculosis. Homologues have been detected in other
           Gram-positive bacterial species. It may represent a
           novel secretion system potentially driven by the
           pfam01580 domains in the YukA-like proteins.
          Length = 85

 Score = 26.9 bits (60), Expect = 1.8
 Identities = 16/82 (19%), Positives = 31/82 (37%), Gaps = 11/82 (13%)

Query: 57  KVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIK----ESNQNKLDQ 112
           KVD  +L Q      +  +EL+ I++ L          +  L         ++ Q    +
Sbjct: 3   KVDPEELRQAAARFQKAAQELEDILQRLK-------AQVAALQGGWSGAAADAFQAAFAE 55

Query: 113 IKKELDQMVKELDLTIKELNQI 134
            K   +++V  L+   + L Q 
Sbjct: 56  WKPAAEKLVDALEEISEALKQA 77


>gnl|CDD|224493 COG1577, ERG12, Mevalonate kinase [Lipid metabolism].
          Length = 307

 Score = 28.0 bits (63), Expect = 1.8
 Identities = 13/74 (17%), Positives = 37/74 (50%), Gaps = 1/74 (1%)

Query: 68  KELDQIIKELDQIIKEL-DLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDL 126
           KEL   + +L +   E+ D I+    ++++E    ++  +  +L ++      ++K L +
Sbjct: 185 KELVAGVAKLLEEEPEVIDPILDAIGELVQEAEAALQTGDFEELGELMNINQGLLKALGV 244

Query: 127 TIKELNQIIKELNQ 140
           +  EL+++++    
Sbjct: 245 STPELDELVEAARS 258


>gnl|CDD|219890 pfam08537, NBP1, Fungal Nap binding protein NBP1.  NBP1 is a
           nuclear protein which has been shown in Saccharomyces
           cerevisiae to be essential for the G2/M transition of
           the cell cycle.
          Length = 313

 Score = 28.3 bits (63), Expect = 1.8
 Identities = 14/39 (35%), Positives = 28/39 (71%), Gaps = 1/39 (2%)

Query: 107 QNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKEL 145
           Q K+D+++++L ++ +EL  T K+L Q +KE N +++ L
Sbjct: 172 QKKIDKLEEKLAELEQELQSTQKKL-QFVKEKNHLLESL 209


>gnl|CDD|223616 COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ
           chaperones [Posttranslational modification, protein
           turnover, chaperones].
          Length = 786

 Score = 28.4 bits (64), Expect = 1.8
 Identities = 17/59 (28%), Positives = 37/59 (62%)

Query: 68  KELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDL 126
           +ELD++ +EL Q+  E + + +E ++  K+L   I +  + ++ +++KEL+  V E D+
Sbjct: 401 EELDELERELAQLEIEKEALEREQDEKEKKLIDEIIKLKEGRIPELEKELEAEVDEDDI 459


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 28.1 bits (63), Expect = 1.8
 Identities = 17/89 (19%), Positives = 41/89 (46%), Gaps = 6/89 (6%)

Query: 62  KLDQIKKELDQIIKELDQIIKEL--DLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQ 119
             ++I+K L  + ++  +  K+L     ++E  + ++ +       N  +L +IK+E D+
Sbjct: 165 TAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIK-KKDIKNPLELQKIKEEFDK 223

Query: 120 MVKELDLTIKELNQIIKELNQIIKELNQI 148
           + KE      +  +I    N +  +  Q+
Sbjct: 224 LKKEGK---ADKQKIKSAKNDLQNDKKQL 249


>gnl|CDD|182386 PRK10334, PRK10334, mechanosensitive channel MscS; Provisional.
          Length = 286

 Score = 28.0 bits (62), Expect = 1.9
 Identities = 15/65 (23%), Positives = 36/65 (55%), Gaps = 9/65 (13%)

Query: 63  LDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNK---------LDQI 113
           +DQ+K+ L  II+  D+I+K+ ++ ++ +      +N +++  + +          L++I
Sbjct: 198 IDQVKQILTNIIQSEDRILKDREMTVRLNELGASSINFVVRVWSNSGDLQNVYWDVLERI 257

Query: 114 KKELD 118
           K+E D
Sbjct: 258 KREFD 262


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 28.2 bits (63), Expect = 1.9
 Identities = 21/110 (19%), Positives = 59/110 (53%)

Query: 32  GSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKES 91
              E  E+  E S   EE E+ +E +    +L ++++EL ++++  + + +EL+  +++ 
Sbjct: 479 YELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKL 538

Query: 92  NQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQI 141
             +++EL  + ++    +L +  ++L+  ++EL   ++EL  +     ++
Sbjct: 539 ENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEEL 588


>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 758

 Score = 28.4 bits (63), Expect = 2.0
 Identities = 20/90 (22%), Positives = 43/90 (47%), Gaps = 7/90 (7%)

Query: 61  NKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLII-KESNQNKLDQIKKELDQ 119
            ++   +K+     ++L++       ++ E    ++E N     E   N L++  K   +
Sbjct: 413 ERIKNFEKQKKSSKEQLEK------FLVNEFKSDVQEYNKYCGLEKKINNLEKEIKNNQE 466

Query: 120 MVKELDLTIKELNQIIKELNQIIKELNQIV 149
            VK+L   IKE+ + +  +  I+KE+NQ +
Sbjct: 467 EVKKLSNEIKEIEKFLVSIKPIVKEINQTL 496


>gnl|CDD|112953 pfam04163, Tht1, Tht1-like nuclear fusion protein. 
          Length = 554

 Score = 28.2 bits (62), Expect = 2.0
 Identities = 23/112 (20%), Positives = 43/112 (38%), Gaps = 1/112 (0%)

Query: 21  RMFDGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELD-QIIKELDQ 79
             F         + E   I LE  +  E Q  ++E KV+ +++ +  ++L+ +I      
Sbjct: 206 GKFRNSDIELRFADENFRIILEIMDLSEFQNHQQEGKVNQDEILETIEQLESEIFDINSF 265

Query: 80  IIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKEL 131
               ++      N +I++L       N   L  I    D  +  L+  IK L
Sbjct: 266 FANFIEESAGVHNHIIEDLADEFASENAIALSAIGDLKDDFLAFLEDRIKNL 317


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 28.1 bits (63), Expect = 2.0
 Identities = 18/79 (22%), Positives = 45/79 (56%), Gaps = 8/79 (10%)

Query: 74  IKELDQIIKELDLIIKESNQVIKELNLIIKE-----SNQNKLDQIKKELDQMVKELDLTI 128
           +KEL++  KE ++  K + ++I++    ++E       + K+++ ++   QM + L+  +
Sbjct: 93  LKELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMKGEAKVEEEEE--KQM-EILEKAL 149

Query: 129 KELNQIIKELNQIIKELNQ 147
           K   +I+KE N+ ++ L +
Sbjct: 150 KSYLKIVKEENKSLQRLAK 168


>gnl|CDD|234955 PRK01433, hscA, chaperone protein HscA; Provisional.
          Length = 595

 Score = 27.9 bits (62), Expect = 2.1
 Identities = 23/108 (21%), Positives = 51/108 (47%), Gaps = 11/108 (10%)

Query: 39  INLESSNGIEEQEMEK--ECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIK 96
           I ++ ++GI++ E++   E      K+D   + L + + E + +I  +        + I 
Sbjct: 474 IEVKPNHGIDKTEIDIMLENAYKNAKIDYTTRLLQEAVIEAEALIFNI-------ERAIA 526

Query: 97  ELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKE 144
           EL  ++ ES  + ++ +   + + V   D+ +  +N  IKE    IK+
Sbjct: 527 ELTTLLSESEISIINSLLDNIKEAVHARDIIL--INNSIKEFKSKIKK 572


>gnl|CDD|235277 PRK04309, PRK04309, DNA-directed RNA polymerase subunit A'';
           Validated.
          Length = 383

 Score = 27.9 bits (63), Expect = 2.1
 Identities = 11/45 (24%), Positives = 28/45 (62%)

Query: 94  VIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKEL 138
             + L   +++++     ++K+EL + ++E  LT +E+ +II+E+
Sbjct: 3   SEETLEEKLEDASLELPQKLKEELREKLEERKLTEEEVEEIIEEV 47


>gnl|CDD|223377 COG0300, DltE, Short-chain dehydrogenases of various substrate
           specificities [General function prediction only].
          Length = 265

 Score = 27.6 bits (62), Expect = 2.2
 Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 7/49 (14%)

Query: 99  NLIIKESNQNKLDQIKKELDQM------VKELDLT-IKELNQIIKELNQ 140
           NLI+    ++KL+ + KEL+        V   DL+  + L ++  EL +
Sbjct: 32  NLILVARREDKLEALAKELEDKTGVEVEVIPADLSDPEALERLEDELKE 80


>gnl|CDD|153165 cd04893, ACT_GcvR_1, ACT domains that comprise the Glycine Cleavage
           System Transcriptional Repressor (GcvR) protein, and
           other related domains.  This CD includes the first of
           the two ACT domains that comprise the Glycine Cleavage
           System Transcriptional Repressor (GcvR) protein, and
           other related domains. The glycine cleavage enzyme
           system in Escherichia coli provides one-carbon units for
           cellular methylation reactions. This enzyme system,
           encoded by the gcvTHP operon and lpd gene, catalyzes the
           cleavage of glycine into CO2 + NH3 and transfers a
           one-carbon unit to tetrahydrofolate, producing
           5,10-methylenetetrahydrofolate. The gcvTHP operon is
           activated by the GcvA protein in response to glycine and
           repressed by a GcvA/GcvR interaction in the absence of
           glycine. It has been proposed that the co-activator
           glycine acts through a mechanism of de-repression by
           binding to GcvR and preventing GcvR from interacting
           with GcvA to block GcvA's activator function. Evidence
           also suggests that GcvR interacts directly with GcvA
           rather than binding to DNA to cause repression. Members
           of this CD belong to the superfamily of ACT regulatory
           domains.
          Length = 77

 Score = 26.5 bits (59), Expect = 2.2
 Identities = 15/59 (25%), Positives = 29/59 (49%), Gaps = 3/59 (5%)

Query: 73  IIKELDQIIKELDLIIKESNQVI--KELNLI-IKESNQNKLDQIKKELDQMVKELDLTI 128
           I+ EL + + E    I +S   I   E  L  + E + + + +++  L  + + LDLT+
Sbjct: 14  ILNELTRAVSESGCNILDSRMAILGTEFALTMLVEGSWDAIAKLEAALPGLARRLDLTL 72


>gnl|CDD|222469 pfam13949, ALIX_LYPXL_bnd, ALIX V-shaped domain binding to HIV.
           The binding of the LYPxL motif of late HIV p6Gag and
           EIAV p9Gag to this domain is necessary for viral
           budding.This domain is generally central between an
           N-terminal Bro1 domain, pfam03097 and a C-terminal
           proline-rich domain. The retroviruses thus used this
           domain to hijack the ESCRT system of the cell.
          Length = 292

 Score = 27.7 bits (62), Expect = 2.2
 Identities = 15/92 (16%), Positives = 45/92 (48%), Gaps = 9/92 (9%)

Query: 69  ELDQIIKELDQIIKELDLIIKESNQVIKELNL---------IIKESNQNKLDQIKKELDQ 119
           E+ + I  L +++ +L+ +  E  ++++EL           ++   N+      ++  ++
Sbjct: 138 EVSEQISRLRELLNKLNELKAEREKLLEELKEKAQDDDISKLLIALNKLGSSNEEQLFEE 197

Query: 120 MVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
            +K+ D   + + Q + +  +++KE+ +  EE
Sbjct: 198 ELKKFDPLQERVEQNLSKQEELLKEIQEANEE 229


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
           cilia and flagella are specialised organelles found at
           the periphery of cells of diverse organisms.
           Intra-flagellar transport (IFT) is required for the
           assembly and maintenance of eukaryotic cilia and
           flagella, and consists of the bidirectional movement of
           large protein particles between the base and the distal
           tip of the organelle. IFT particles contain multiple
           copies of two distinct protein complexes, A and B, which
           contain at least 6 and 11 protein subunits. IFT57 is
           part of complex B but is not, however, required for the
           core subunits to stay associated. This protein is known
           as Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 27.7 bits (62), Expect = 2.3
 Identities = 21/91 (23%), Positives = 46/91 (50%), Gaps = 5/91 (5%)

Query: 61  NKLDQIKKELDQIIKEL-DQIIKELDLIIKES---NQVIKELNLIIKESNQNKLDQIKKE 116
             + +   E  Q +K+L   + K L+ I       N  +++L    +E  +  L Q++++
Sbjct: 226 KAISKSLPETKQYLKKLSQDLSKALEKISSREKHLNNQLEQLVQEYRE-ARRTLSQVQEK 284

Query: 117 LDQMVKELDLTIKELNQIIKELNQIIKELNQ 147
            +Q  + +    +ELN+I +EL Q+ +E+ +
Sbjct: 285 YNQASQGVSELTRELNEISEELEQVKQEMEE 315


>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
           Provisional.
          Length = 2722

 Score = 27.9 bits (62), Expect = 2.5
 Identities = 21/73 (28%), Positives = 42/73 (57%), Gaps = 3/73 (4%)

Query: 62  KLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNK--LDQIKKELDQ 119
            L +++KE ++  + +D IIK+++ + K  N +IK LN+ I  SN NK  ++ +      
Sbjct: 850 NLKELEKEFNENNQIVDNIIKDIENMNKNIN-IIKTLNIAINRSNSNKQLVEHLLNNKID 908

Query: 120 MVKELDLTIKELN 132
           +  +L+  +K +N
Sbjct: 909 LKNKLEQHMKIIN 921


>gnl|CDD|223266 COG0188, GyrA, Type IIA topoisomerase (DNA gyrase/topo II,
           topoisomerase IV), A subunit [DNA replication,
           recombination, and repair].
          Length = 804

 Score = 27.6 bits (62), Expect = 2.6
 Identities = 19/73 (26%), Positives = 40/73 (54%), Gaps = 1/73 (1%)

Query: 64  DQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKE 123
           ++I+KEL ++ KE+  + K L    +  + + KEL  I K+    +  +I +E +  +++
Sbjct: 437 EKIEKELKELEKEIADLEKILASEERLLDIIKKELLEIKKKFGDERRTEIVEEEEDEIED 496

Query: 124 LDLTIKELNQIIK 136
            DL I E + ++ 
Sbjct: 497 EDL-IAEEDVVVT 508


>gnl|CDD|212126 cd10815, GH38N_AMII_EcMngB_like, N-terminal catalytic domain of
           Escherichia coli alpha-mannosidase MngB and its
           bacterial homologs; glycoside hydrolase family 38
           (GH38).  The bacterial subfamily is represented by
           Escherichia coli alpha-mannosidase MngB, which is
           encoded by the mngB gene (previously called ybgG). MngB
           exhibits alpha-mannosidase activity that converts
           2-O-(6-phospho-alpha-mannosyl)-D-glycerate to
           mannose-6-phosphate and glycerate in the pathway which
           enables use of mannosyl-D-glycerate as a sole carbon
           source. A divalent metal ion is required for its
           activity.
          Length = 270

 Score = 27.5 bits (62), Expect = 2.7
 Identities = 18/56 (32%), Positives = 31/56 (55%), Gaps = 7/56 (12%)

Query: 64  DQ--IKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKEL 117
           DQ  I+K L ++I+EL++I  + + +I    +  K L     E N++ L  I+ EL
Sbjct: 216 DQMPIRKNLPEVIEELNEISPDYEYVISSYEEFFKAL-----EKNKDLLPTIEGEL 266


>gnl|CDD|185112 PRK15190, PRK15190, fimbrial protein BcfE; Provisional.
          Length = 181

 Score = 27.4 bits (60), Expect = 2.7
 Identities = 17/54 (31%), Positives = 25/54 (46%), Gaps = 4/54 (7%)

Query: 12 LRRHYFALFRMFDGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQ 65
          + R + ALF M   L P  + S ++   NLE +  I  Q     C +D + L Q
Sbjct: 1  MNRSHCALFAMGLMLCPQINASADDLAYNLEFTGTIVAQ----TCDIDISSLSQ 50


>gnl|CDD|220387 pfam09763, Sec3, Exocyst complex component Sec3.  This entry is the
           conserved middle and C-terminus of the Sec3 protein.
           Sec3 binds to the C-terminal cytoplasmic domain of GLYT1
           (glycine transporter protein 1). Sec3 is the exocyst
           component that is closest to the plasma membrane docking
           site and it serves as a spatial landmark in the plasma
           membrane for incoming secretory vesicles. Sec3 is
           recruited to the sites of polarised membrane growth
           through its interaction with Rho1p, a small GTP-binding
           protein.
          Length = 691

 Score = 27.7 bits (62), Expect = 2.8
 Identities = 16/82 (19%), Positives = 39/82 (47%), Gaps = 11/82 (13%)

Query: 61  NKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIK------ 114
            +++++ K LD+ + E D++   L L   E + + +++  I    +QN   Q++      
Sbjct: 30  KRVNELMKYLDEALAECDELDPTLSLYSMELSSLSEDIEYI---ESQNNGLQVQSANQKL 86

Query: 115 --KELDQMVKELDLTIKELNQI 134
              EL+ ++  + L   +L  +
Sbjct: 87  LWNELESLLNTVSLPESDLRAL 108


>gnl|CDD|215787 pfam00206, Lyase_1, Lyase. 
          Length = 312

 Score = 27.7 bits (62), Expect = 2.8
 Identities = 26/103 (25%), Positives = 43/103 (41%), Gaps = 19/103 (18%)

Query: 67  KKELDQIIKELDQIIKELDL----IIKES------------NQVIKELNLIIKESNQNKL 110
           K+E   IIK LD++ +E        +K              N+VI EL   +   N +K+
Sbjct: 47  KEEAAAIIKALDEVAEEGKADDAFPLKVIQEGSGTAVNMNLNEVIGELLGQLVHPN-DKV 105

Query: 111 DQIKKELDQMVKELDLTIKE--LNQIIKELNQIIKELNQIVEE 151
              +   DQ+   L L +K      ++  L Q+I  L +  +E
Sbjct: 106 HTGQSSNDQVPTALRLALKLALSEVLLPALGQLIDALKEKAKE 148


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 27.2 bits (61), Expect = 2.9
 Identities = 14/65 (21%), Positives = 40/65 (61%)

Query: 61  NKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQM 120
           ++L+Q  K+L++  +EL++ + EL+  ++   +  +E   I ++ + +++  +KK+  Q+
Sbjct: 123 SELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQNQQL 182

Query: 121 VKELD 125
             +L+
Sbjct: 183 KSQLE 187


>gnl|CDD|240403 PTZ00400, PTZ00400, DnaK-type molecular chaperone; Provisional.
          Length = 663

 Score = 27.5 bits (61), Expect = 2.9
 Identities = 29/110 (26%), Positives = 58/110 (52%), Gaps = 7/110 (6%)

Query: 31  DGST-EESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIK 89
           D ST ++ EI ++SS G+ ++E+EK  K       +  KE D+  KEL     E + +I 
Sbjct: 524 DKSTGKKQEITIQSSGGLSDEEIEKMVKE-----AEEYKEQDEKKKELVDAKNEAETLIY 578

Query: 90  ESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDL-TIKELNQIIKEL 138
              + + +L   I ++++++L Q   +L   +   D+ +IK+  + ++E 
Sbjct: 579 SVEKQLSDLKDKISDADKDELKQKITKLRSTLSSEDVDSIKDKTKQLQEA 628


>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein.  This is a
           family of proteins which appears to have a highly
           conserved zinc finger domain at the C terminal end,
           described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
           predicted to contain a coiled coil. Members are
           annotated as being tumour-associated antigen HCA127 in
           humans but this could not confirmed.
          Length = 215

 Score = 27.1 bits (60), Expect = 3.1
 Identities = 11/75 (14%), Positives = 34/75 (45%), Gaps = 6/75 (8%)

Query: 77  LDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIK 136
           + +I  + D + K  +++  E   +  E          KE D+ ++EL     +  + ++
Sbjct: 3   IKEIRNKTDELEKLKDEIKAEEEALESEEK------HLKEYDKEMEELLEEKMQHVEELR 56

Query: 137 ELNQIIKELNQIVEE 151
           +++  I ++   +++
Sbjct: 57  QIHADINDMETEIKQ 71



 Score = 26.4 bits (58), Expect = 6.4
 Identities = 12/62 (19%), Positives = 30/62 (48%)

Query: 36 ESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVI 95
          + EI  E      E++  KE   +  +L + K +  + ++++   I +++  IK+S   +
Sbjct: 17 KDEIKAEEEALESEEKHLKEYDKEMEELLEEKMQHVEELRQIHADINDMETEIKQSKSEL 76

Query: 96 KE 97
          + 
Sbjct: 77 ER 78


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 27.3 bits (61), Expect = 3.5
 Identities = 19/145 (13%), Positives = 61/145 (42%), Gaps = 6/145 (4%)

Query: 9   CLRLRRHYFALFRMFDGLTPVTDGSTE----ESEINLESSNGIEEQEMEKECKVDFNKLD 64
            +        L  + D L  V            E+    +     +E+ K  +   ++++
Sbjct: 53  EVAQISLSSLLSEVLDYLRSVKGLEGRLFILPEEVEKLEAELKSLEEVIKPAEKFSSEVE 112

Query: 65  QIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKEL 124
           ++ ++L++ + ELD+ +++L+ +++E  + +  L+  +      K   ++  L +  ++L
Sbjct: 113 ELTRKLEERLSELDEELEDLEDLLEEL-EPLAYLDFDLSLLRGLKFLLVRLGLVR-REKL 170

Query: 125 DLTIKELNQIIKELNQIIKELNQIV 149
           +  +  +   +    + ++    IV
Sbjct: 171 EALVGVIEDEVALYGENVEASVVIV 195


>gnl|CDD|204831 pfam12137, RapA_C, RNA polymerase recycling family C-terminal.
           This domain is found in bacteria. This domain is about
           360 amino acids in length. This domain is found
           associated with pfam00271, pfam00176. The function of
           this domain is not known, but structurally it forms an
           alpha-beta fold in nature with a central beta-sheet
           flanked by helices and loops, the beta-sheet being
           mainly antiparallel and flanked by four alpha helices,
           among which the two longer helices exhibit a coiled-coil
           arrangement.
          Length = 362

 Score = 27.1 bits (61), Expect = 3.7
 Identities = 19/100 (19%), Positives = 50/100 (50%), Gaps = 11/100 (11%)

Query: 57  KVDFNKLDQ----IKKELD-QIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLD 111
           +V F  L++    + +    +++K L   I++L  + K      ++L  II+++ Q    
Sbjct: 251 QVSFESLNKQLSPVNRHTARKLVKALRPQIEQL--LQKAEKLAEEQLEEIIEQAKQQMQQ 308

Query: 112 QIKKELDQMV--KELDLTIK--ELNQIIKELNQIIKELNQ 147
           Q+  EL ++   K ++  I+  E+  + ++  +++  L++
Sbjct: 309 QLSAELARLKALKAVNPNIRDEEIEALEQQREELLTALDK 348


>gnl|CDD|219289 pfam07083, DUF1351, Protein of unknown function (DUF1351).  This
           family consists of several bacterial and phage proteins
           of around 230 residues in length. The function of this
           family is unknown.
          Length = 220

 Score = 26.9 bits (60), Expect = 3.8
 Identities = 20/96 (20%), Positives = 47/96 (48%), Gaps = 3/96 (3%)

Query: 57  KVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKE 116
            ++F   ++ +K +D+++ +    +   +   K+  +   ELN +IK  N  + + IKK+
Sbjct: 11  TIEFLDFEEFEKAIDEVVAKYKGYVVTEE-NYKDDKKERAELNKLIKALNDKRKE-IKKQ 68

Query: 117 LDQMVKELDLTIKELNQIIKE-LNQIIKELNQIVEE 151
             +   E +  +K+L   IK  ++ I   + +  E+
Sbjct: 69  YSKPADEFEAYMKKLIAPIKAVIDLIDSGIKEFEEK 104


>gnl|CDD|215969 pfam00521, DNA_topoisoIV, DNA gyrase/topoisomerase IV, subunit A. 
          Length = 427

 Score = 27.1 bits (61), Expect = 3.8
 Identities = 18/97 (18%), Positives = 49/97 (50%), Gaps = 16/97 (16%)

Query: 63  LDQIKKELD---QIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQI------ 113
           L+++++ L     ++K L++I   +  +I+ S  + K    +I+E ++ + D +      
Sbjct: 329 LEKLEERLHILEGLLKALNKI-DFVIEVIRGSIDLKKAKKELIEELSEIQADYLLDMRLR 387

Query: 114 ------KKELDQMVKELDLTIKELNQIIKELNQIIKE 144
                  ++L++ ++EL+  I EL +I+    ++  +
Sbjct: 388 RLTKEEIEKLEKEIEELEKEIAELEKILASEKKLWIK 424


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 27.0 bits (61), Expect = 3.9
 Identities = 21/88 (23%), Positives = 45/88 (51%), Gaps = 11/88 (12%)

Query: 70  LDQIIKELDQ----IIKELDLIIKESNQVIKELNLIIKESNQNKLDQIK--KELDQMVKE 123
           +D+++ ELD+    +  EL+ +  E N + KE+    K   ++    I   KEL + +K 
Sbjct: 27  VDELL-ELDEERRELQTELEELQAERNALSKEIG-QAKRKGEDAEALIAEVKELKEEIKA 84

Query: 124 LDLTIKELNQIIKELNQIIKELNQIVEE 151
           L+    EL+++  EL +++  +  +  +
Sbjct: 85  LE---AELDELEAELEELLLRIPNLPHD 109


>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
           protein turnover, chaperones].
          Length = 579

 Score = 27.3 bits (61), Expect = 3.9
 Identities = 20/109 (18%), Positives = 48/109 (44%), Gaps = 17/109 (15%)

Query: 33  STEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESN 92
           S EE E  +E +      + +    V+       + E + +I  L++ +KE+  + +E  
Sbjct: 484 SDEEIERMVEDAEANAALDKKFRELVEA------RNEAESLIYSLEKALKEIVKVSEEE- 536

Query: 93  QVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQI 141
                     KE  +  +  +++ L+   +E+   I+EL ++ ++L + 
Sbjct: 537 ----------KEKIEEAITDLEEALEGEKEEIKAKIEELQEVTQKLAEK 575


>gnl|CDD|147082 pfam04740, LXG, LXG domain of WXG superfamily.  This domain is
           present is the N-terminal region of a group of
           polymorphic toxin proteins in bacteria. It is predicted
           to use Type VII secretion pathway to mediate export of
           bacterial toxins.
          Length = 204

 Score = 26.8 bits (60), Expect = 4.1
 Identities = 29/110 (26%), Positives = 54/110 (49%), Gaps = 12/110 (10%)

Query: 43  SSNG-IEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLI 101
           SSNG I+E+ +E E K    K ++  +EL      ++ I+  +  I+         L   
Sbjct: 90  SSNGYIDEEFLEHELKNGLKKAEEKTEELKD---AINSILASISDIV--------HLPKF 138

Query: 102 IKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
              + Q +LD+ KK+ +  +++L    +EL   + EL  +I+E+ Q V +
Sbjct: 139 STSAFQEQLDKAKKKRNDTLEKLYEFDQELTNELAELEALIQEIQQYVRQ 188


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 27.0 bits (60), Expect = 4.3
 Identities = 23/91 (25%), Positives = 47/91 (51%), Gaps = 23/91 (25%)

Query: 61  NKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQM 120
           +KL +++KE   I  E++ +  +++ I                   Q+K+D+++KE+DQ 
Sbjct: 38  SKLSELQKEKKNIQNEIESLDNQIEEI-------------------QSKIDELQKEIDQS 78

Query: 121 VKELDLTIKELNQIIKELNQIIKELNQIVEE 151
             E    IK+L + I EL + I E  +++++
Sbjct: 79  KAE----IKKLQKEIAELKENIVERQELLKK 105


>gnl|CDD|217273 pfam02895, H-kinase_dim, Signal transducing histidine kinase,
           homodimeric domain.  This helical bundle domain is the
           homodimer interface of the signal transducing histidine
           kinase family.
          Length = 63

 Score = 25.2 bits (56), Expect = 4.3
 Identities = 12/54 (22%), Positives = 29/54 (53%), Gaps = 10/54 (18%)

Query: 106 NQNKLDQI----------KKELDQMVKELDLTIKELNQIIKELNQIIKELNQIV 149
           +  KLD++          +  L+Q+++ L+   +EL + ++ L+++ +EL   V
Sbjct: 7   DVEKLDRLMNLVGELVIARNRLEQLLERLEELDRELKEALERLSRLTRELQDAV 60


>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 782

 Score = 27.1 bits (61), Expect = 4.4
 Identities = 25/110 (22%), Positives = 51/110 (46%), Gaps = 10/110 (9%)

Query: 40  NLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELN 99
            L      EE+E+E   +   ++ ++  K   +I  E  Q +  +D   K ++ +   L 
Sbjct: 116 LLPDEPIDEEREIEALVRSILSEFEEYAKLNKKIPPEELQSLNSIDDPGKLADTIAAHLP 175

Query: 100 LIIKE-----SNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKE 144
           L ++E        +  ++++K LD + KE+D     L Q+ K + + +KE
Sbjct: 176 LKLEEKQEILETLDVKERLEKLLDLLEKEID-----LLQLEKRIRKKVKE 220


>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
           envelope biogenesis, outer membrane].
          Length = 835

 Score = 27.0 bits (60), Expect = 4.7
 Identities = 19/102 (18%), Positives = 41/102 (40%), Gaps = 14/102 (13%)

Query: 60  FNKLDQIKKELDQIIKELDQI------IKELDLIIKESNQVIKELNLIIKESNQ--NKLD 111
               DQ+ + L+Q  + L+ +      +  L L + +S++ I+E   +++ S      L 
Sbjct: 64  LAINDQLSQALNQQTERLNALASDDRQLANLLLQLLQSSRTIREQIAVLRGSLLLSRILL 123

Query: 112 QIKKELDQMVK--ELDLTIKELNQIIKELNQIIKELNQIVEE 151
           Q    L +  +  E     +E + +  E   I    N +  +
Sbjct: 124 QQLGPLPEAGQPQEQFEVTQERDALQAEKAYI----NALEGQ 161


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 26.7 bits (59), Expect = 4.8
 Identities = 18/91 (19%), Positives = 46/91 (50%), Gaps = 6/91 (6%)

Query: 47  IEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESN 106
            + +E   E + +   L +  + L +++K L++ ++ L+  IKE  + ++EL   ++E  
Sbjct: 96  EDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESIKELAKELRELRQDLREEV 155

Query: 107 QNK------LDQIKKELDQMVKELDLTIKEL 131
           +        L +  + L + ++EL   +++L
Sbjct: 156 EELREELERLQENLQRLQEAIQELQSLLEQL 186


>gnl|CDD|193205 pfam12729, 4HB_MCP_1, Four helix bundle sensory module for signal
           transduction.  This family is a four helix bundle that
           operates as a ubiquitous sensory module in prokaryotic
           signal-transduction. The 4HB_MCP is always found between
           two predicted transmembrane helices indicating that it
           detects only extracellular signals. In many cases the
           domain is associated with a cytoplasmic HAMP domain
           suggesting that most proteins carrying the bundle might
           share the mechanism of transmembrane signalling which is
           well-characterized in E coli chemoreceptors.
          Length = 181

 Score = 26.4 bits (59), Expect = 5.0
 Identities = 15/94 (15%), Positives = 43/94 (45%), Gaps = 7/94 (7%)

Query: 61  NKLDQIKKELDQIIKELDQII------KELDLIIKESNQVIKELNLIIKESNQNKLDQIK 114
             +++++ E+D+++K+ ++ I      K  +   ++     K  N ++  +   K D+  
Sbjct: 82  KDIEELRAEIDKLLKKYEKTILTEEEKKLFNEFKEQLKAYRKVRNKVLDLAKAGKNDEAY 141

Query: 115 KELDQMVKEL-DLTIKELNQIIKELNQIIKELNQ 147
                 ++   D  I+ L+++I    ++ KE  +
Sbjct: 142 ALYLTELEPARDAVIEALDELIDYNLKVAKEAYE 175


>gnl|CDD|225087 COG2176, PolC, DNA polymerase III, alpha subunit (gram-positive
           type) [DNA replication, recombination, and repair].
          Length = 1444

 Score = 26.9 bits (60), Expect = 5.0
 Identities = 29/143 (20%), Positives = 60/143 (41%), Gaps = 10/143 (6%)

Query: 17  FALFRMFDGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKE 76
           +    +F    P+      E+ ++ E    + + E + E +       +I+    +II++
Sbjct: 45  WEFHFLFAETLPIEIYKELENALSEEFEQLVAKVEFDIEVRNVDITEQEIQTYFKKIIEQ 104

Query: 77  L---DQIIKEL----DLIIKESNQVI-KELNLIIKESNQNKLDQIKKELDQMVKELDLTI 128
           L     I K L     L +K +N +I + LN    +  +NK  +++K+L        L  
Sbjct: 105 LSVKSPIFKSLLNKLKLKVKGNNILIEQVLNNPEFDHFKNKSPELQKKLQSFGFPQLLIE 164

Query: 129 KELNQIIKELNQIIKELNQIVEE 151
            E+N I +E     ++  + + E
Sbjct: 165 FEVNDISEEQE--FEKFEEAINE 185


>gnl|CDD|236270 PRK08471, flgK, flagellar hook-associated protein FlgK; Validated.
          Length = 613

 Score = 26.9 bits (60), Expect = 5.3
 Identities = 11/44 (25%), Positives = 30/44 (68%), Gaps = 3/44 (6%)

Query: 105 SNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQI 148
             + +LD ++K++++   EL +T+ E+N + K++ +I K++ ++
Sbjct: 150 DTRERLDTLQKKVNE---ELKVTVDEINSLGKQIAEINKQIKEV 190


>gnl|CDD|177094 CHL00195, ycf46, Ycf46; Provisional.
          Length = 489

 Score = 26.9 bits (60), Expect = 5.3
 Identities = 18/84 (21%), Positives = 43/84 (51%), Gaps = 10/84 (11%)

Query: 73  IIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLD-----QIKKELDQMVKELDLT 127
           ++K+ ++ + ++ +  K     ++ L+ I+K   +  +       I KEL  ++  L+  
Sbjct: 86  LLKDFNRFLNDISISRK-----LRNLSRILKTQPKTIIIIASELNIPKELKDLITVLEFP 140

Query: 128 IKELNQIIKELNQIIKELNQIVEE 151
           +   ++I KEL ++IK LN  ++ 
Sbjct: 141 LPTESEIKKELTRLIKSLNIKIDS 164


>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
          Length = 1437

 Score = 26.7 bits (60), Expect = 5.3
 Identities = 19/141 (13%), Positives = 55/141 (39%), Gaps = 8/141 (5%)

Query: 17  FALFRMFDGLTPVTDGSTEESEINLE---SSNGIEEQEMEKECKVDFNKLDQIKKELDQI 73
              F++F      +     + ++       +    E+ +         K  +       +
Sbjct: 54  IEDFKLFKEKLKQSFSHIADIKVTFSIEVENITFTEELLLDYWNEIIEKAKKNSPLFKSL 113

Query: 74  IKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKK---ELDQMVKELDLTIKE 130
           +K+    ++   LIIK +N++ +  + + K+     + Q +K    + ++  E+D + +E
Sbjct: 114 LKKQKVEVEGNKLIIKVNNEIER--DHLKKKHLPKLIKQYEKFGFGILKIDFEIDDSKEE 171

Query: 131 LNQIIKELNQIIKELNQIVEE 151
           L +   +  +  ++L +   E
Sbjct: 172 LEKFEAQKEEEDEKLAKEALE 192


>gnl|CDD|184829 PRK14807, PRK14807, histidinol-phosphate aminotransferase;
           Provisional.
          Length = 351

 Score = 26.7 bits (59), Expect = 5.4
 Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 3/56 (5%)

Query: 93  QVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQI 148
            ++K LNL+    N N L Q+   +   V    +  + +N I+ E  ++IKEL++I
Sbjct: 230 NILKYLNLVKSPYNINSLSQV---IALKVLRTGVLKERVNYILNERERLIKELSKI 282


>gnl|CDD|133381 cd01106, HTH_TipAL-Mta, Helix-Turn-Helix DNA binding domain of the
           transcription regulators TipAL, Mta, and SkgA.
           Helix-turn-helix (HTH) TipAL, Mta, and SkgA
           transcription regulators, and related proteins,
           N-terminal domain. TipAL regulates resistance to and
           activation by numerous cyclic thiopeptide antibiotics,
           such as thiostrepton. Mta is a global transcriptional
           regulator; the N-terminal DNA-binding domain of Mta
           interacts directly with the promoters of mta, bmr, blt,
           and ydfK, and induces transcription of these
           multidrug-efflux transport genes. SkgA has been shown to
           control stationary-phase expression of
           catalase-peroxidase in Caulobacter crescentus. These
           proteins are comprised of distinct domains that harbor
           an  N-terminal active (DNA-binding) site and a
           regulatory (effector-binding) site. The conserved
           N-terminal domain of these transcription regulators
           contains winged HTH motifs that mediate DNA binding.
           These proteins share the N-terminal DNA binding domain
           with other transcription regulators of the MerR
           superfamily that promote transcription by reconfiguring
           the spacer between the -35 and -10 promoter elements.
           Unique to this family, is a TipAL-like, lineage specific
           Bacilli subgroup, which has five conserved cysteines in
           the C-terminus of the protein.
          Length = 103

 Score = 25.9 bits (58), Expect = 5.6
 Identities = 18/70 (25%), Positives = 37/70 (52%), Gaps = 9/70 (12%)

Query: 59  DFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELD 118
           D  +L QI       +KEL   +KE+  ++K+ ++ + E         +  L++ K+ LD
Sbjct: 43  DLERLQQI-----LFLKELGFSLKEIKELLKDPSEDLLE----ALREQKELLEEKKERLD 93

Query: 119 QMVKELDLTI 128
           +++K +D T+
Sbjct: 94  KLIKTIDRTL 103


>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
          Length = 259

 Score = 26.6 bits (59), Expect = 5.9
 Identities = 14/44 (31%), Positives = 26/44 (59%)

Query: 33  STEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKE 76
             E S  +LE+    +++E E + K +  +L  ++K++DQ IKE
Sbjct: 77  EKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKE 120


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 26.9 bits (59), Expect = 6.1
 Identities = 23/90 (25%), Positives = 46/90 (51%), Gaps = 4/90 (4%)

Query: 62  KLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMV 121
            L++  K L   I EL QI+++L+L + E+N   +      KE N+ +L   + E++++ 
Sbjct: 484 TLEEDIKNLKHDINELTQILEKLELELSEANSKFELS----KEENERELVAQRIEIEKLE 539

Query: 122 KELDLTIKELNQIIKELNQIIKELNQIVEE 151
           KEL+         I +  Q+++     ++E
Sbjct: 540 KELNDLNLLSKTSILDAEQLVQSTEIKLDE 569


>gnl|CDD|181490 PRK08588, PRK08588, succinyl-diaminopimelate desuccinylase;
           Reviewed.
          Length = 377

 Score = 26.4 bits (59), Expect = 6.1
 Identities = 11/23 (47%), Positives = 15/23 (65%), Gaps = 2/23 (8%)

Query: 127 TIKELN--QIIKELNQIIKELNQ 147
           TI E +  Q+I  L +II E+NQ
Sbjct: 252 TIPEYDNDQVISLLQEIINEVNQ 274


>gnl|CDD|235308 PRK04570, PRK04570, cell division protein ZipA; Provisional.
          Length = 243

 Score = 26.4 bits (58), Expect = 6.1
 Identities = 13/40 (32%), Positives = 21/40 (52%)

Query: 24 DGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKL 63
          DG         E+SE+NLE+++     E+ K    DF+K+
Sbjct: 59 DGDVDERGEGVEQSELNLEATDAAGGNELGKRPNQDFDKI 98


>gnl|CDD|215363 PLN02677, PLN02677, mevalonate kinase.
          Length = 387

 Score = 26.7 bits (59), Expect = 6.1
 Identities = 20/51 (39%), Positives = 25/51 (49%), Gaps = 8/51 (15%)

Query: 60  FNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKL 110
           FN +D I +EL  II    Q   E +L I E  + +KEL     E NQ  L
Sbjct: 265 FNAVDSISEELATII----QSPAEDELSITEKEEKLKEL----MEMNQGLL 307


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 26.4 bits (58), Expect = 6.2
 Identities = 24/93 (25%), Positives = 44/93 (47%), Gaps = 8/93 (8%)

Query: 63  LDQIKKELDQIIKELDQIIKELDLI------IKESNQVIKELNLIIKESNQ--NKLDQIK 114
           L+++  EL    KE D   + L  +      I     +  EL+ + KE  +  ++L++++
Sbjct: 31  LERLDSELRDAEKERDTYKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELE 90

Query: 115 KELDQMVKELDLTIKELNQIIKELNQIIKELNQ 147
           KE D +  EL    +E  Q+  E  Q ++E N 
Sbjct: 91  KEDDDLDGELVELQEEKEQLENEELQYLREYNL 123


>gnl|CDD|218903 pfam06122, TraH, Conjugative relaxosome accessory transposon
           protein.  The TraH protein is thought to be a relaxosome
           accessory component, also necessary for transfer but not
           for H-pilus synthesis within the conjugative transposon.
          Length = 360

 Score = 26.5 bits (59), Expect = 6.3
 Identities = 6/24 (25%), Positives = 13/24 (54%)

Query: 124 LDLTIKELNQIIKELNQIIKELNQ 147
           L     ++ +++ EL +  + LNQ
Sbjct: 94  LQTLCPQIGKVMDELQKKAQALNQ 117



 Score = 26.1 bits (58), Expect = 9.8
 Identities = 6/24 (25%), Positives = 12/24 (50%)

Query: 117 LDQMVKELDLTIKELNQIIKELNQ 140
           L  +  ++   + EL +  + LNQ
Sbjct: 94  LQTLCPQIGKVMDELQKKAQALNQ 117


>gnl|CDD|223964 COG1033, COG1033, Predicted exporters of the RND superfamily
           [General function prediction only].
          Length = 727

 Score = 26.5 bits (59), Expect = 6.5
 Identities = 17/87 (19%), Positives = 40/87 (45%), Gaps = 5/87 (5%)

Query: 65  QIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKEL 124
           +   EL++ +++ ++ +     I+    QV      II + ++ +   I +EL + +K+ 
Sbjct: 453 RWMDELEEELEKGEEHVFSASSIVDLVKQVNG---GIIPDKSKIQ--IILEELPESIKKR 507

Query: 125 DLTIKELNQIIKELNQIIKELNQIVEE 151
            ++  +LN +   L     EL  +  E
Sbjct: 508 YISGDQLNLLGYSLGDTQGELEDVGRE 534


>gnl|CDD|192445 pfam10018, Med4, Vitamin-D-receptor interacting Mediator subunit 4.
            Members of this family function as part of the Mediator
           (Med) complex, which links DNA-bound transcriptional
           regulators and the general transcription machinery,
           particularly the RNA polymerase II enzyme. They play a
           role in basal transcription by mediating activation or
           repression according to the specific complement of
           transcriptional regulators bound to the promoter.
          Length = 176

 Score = 26.2 bits (58), Expect = 6.9
 Identities = 20/71 (28%), Positives = 35/71 (49%), Gaps = 9/71 (12%)

Query: 79  QIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKEL 138
           ++ ++L     E    +KEL    +  NQ ++ +++ E    V+ LD  IK    I+KEL
Sbjct: 2   ELAEDLISADDELKSALKELEEHQE--NQQRILELRAE----VESLDEKIK---DILKEL 52

Query: 139 NQIIKELNQIV 149
            +  KEL  + 
Sbjct: 53  KEAEKELRTLP 63


>gnl|CDD|187635 cd08930, SDR_c8, classical (c) SDR, subgroup 8.  This subgroup has
           a fairly well conserved active site tetrad and domain
           size of the classical SDRs, but has an atypical
           NAD-binding motif ([ST]G[GA]XGXXG). SDRs are a
           functionally diverse family of oxidoreductases that have
           a single domain with a structurally conserved Rossmann
           fold (alpha/beta folding pattern with a central
           beta-sheet), an NAD(P)(H)-binding region, and a
           structurally diverse C-terminal region. Classical SDRs
           are typically about 250 residues long, while extended
           SDRs are approximately 350 residues. Sequence identity
           between different SDR enzymes are typically in the
           15-30% range, but the enzymes share the Rossmann fold
           NAD-binding motif and characteristic NAD-binding and
           catalytic sequence patterns. These enzymes catalyze a
           wide range of activities including the metabolism of
           steroids, cofactors, carbohydrates, lipids, aromatic
           compounds, and amino acids, and act in redox sensing.
           Classical SDRs have an TGXXX[AG]XG cofactor binding
           motif and a YXXXK active site motif, with the Tyr
           residue of the active site motif serving as a critical
           catalytic residue (Tyr-151, human
           15-hydroxyprostaglandin dehydrogenase (15-PGDH)
           numbering). In addition to the Tyr and Lys, there is
           often an upstream Ser (Ser-138, 15-PGDH numbering)
           and/or an Asn (Asn-107, 15-PGDH numbering) contributing
           to the active site; while substrate binding is in the
           C-terminal region, which determines specificity. The
           standard reaction mechanism is a 4-pro-S hydride
           transfer and proton relay involving the conserved Tyr
           and Lys, a water molecule stabilized by Asn, and
           nicotinamide. Extended SDRs have additional elements in
           the C-terminal region, and typically have a TGXXGXXG
           cofactor binding motif. Complex (multidomain) SDRs such
           as ketoreductase domains of fatty acid synthase have a
           GGXGXXG NAD(P)-binding motif and an altered active site
           motif (YXXXN). Fungal type ketoacyl reductases have a
           TGXXXGX(1-2)G NAD(P)-binding motif. Some atypical SDRs
           have lost catalytic activity and/or have an unusual
           NAD(P)-binding motif and missing or unusual active site
           residues. Reactions catalyzed within the SDR family
           include isomerization, decarboxylation, epimerization,
           C=N bond reduction, dehydratase activity,
           dehalogenation, Enoyl-CoA reduction, and
           carbonyl-alcohol oxidoreduction.
          Length = 250

 Score = 26.1 bits (58), Expect = 6.9
 Identities = 15/58 (25%), Positives = 31/58 (53%), Gaps = 7/58 (12%)

Query: 99  NLIIKESNQNKLDQIKKELDQ------MVKELDLTIKE-LNQIIKELNQIIKELNQIV 149
            LI+ + N   L+Q+K+EL        +  ELD+T KE + ++I+   +    ++ ++
Sbjct: 28  RLILADINAPALEQLKEELTNLYKNRVIALELDITSKESIKELIESYLEKFGRIDILI 85


>gnl|CDD|236047 PRK07560, PRK07560, elongation factor EF-2; Reviewed.
          Length = 731

 Score = 26.4 bits (59), Expect = 7.3
 Identities = 14/30 (46%), Positives = 23/30 (76%), Gaps = 4/30 (13%)

Query: 118 DQMVKELDLTIKELNQ----IIKELNQIIK 143
           D+++KEL LT +E+ Q    IIK++N++IK
Sbjct: 150 DRLIKELKLTPQEMQQRLLKIIKDVNKLIK 179


>gnl|CDD|118727 pfam10200, Ndufs5, NADH:ubiquinone oxidoreductase, NDUFS5-15kDa. 
          This is a family of short, approximately 105 amino acid
          residue, proteins which form part of NADH:ubiquinone
          oxidoreductase complex I. Complex I is the first
          multisubunit inner membrane protein complex of the
          mitochondrial electron transport chain and it transfers
          two electrons from NADH to ubiquinone. The protein
          carries four highly conserved cysteine residues but
          these do not appear to be in a configuration which
          would favour metal binding so the exact function of the
          protein is uncertain.
          Length = 96

 Score = 25.5 bits (56), Expect = 7.3
 Identities = 15/49 (30%), Positives = 27/49 (55%), Gaps = 13/49 (26%)

Query: 41 LESSNGIEEQEMEKECKVDFN-------------KLDQIKKELDQIIKE 76
          +E S+GI +   +KECK++F              +L  I+K+ +++IKE
Sbjct: 41 IECSHGIGQTRAKKECKIEFEDFYECLHRQKTMQRLQAIRKQREKLIKE 89


>gnl|CDD|215974 pfam00529, HlyD, HlyD family secretion protein. 
          Length = 304

 Score = 26.3 bits (58), Expect = 7.4
 Identities = 16/105 (15%), Positives = 31/105 (29%), Gaps = 3/105 (2%)

Query: 48  EEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIK-ESNQVIKELNLIIKESN 106
            E            +L+  +  L   +  L   +  +          +       +    
Sbjct: 91  REALAASRGAASKAQLEAARDNLRSALALLPISLGRIGASQATLGALLNNLQAQGLAARA 150

Query: 107 QNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
           Q  LD+   +L Q   EL   I+   Q  +  ++      Q+ EE
Sbjct: 151 Q--LDEAYLQLAQTAAELLRLIRAPAQGERVQSEAQLRSVQVGEE 193


>gnl|CDD|225089 COG2178, COG2178, Predicted RNA-binding protein of the translin
           family [Translation, ribosomal structure and
           biogenesis].
          Length = 204

 Score = 26.1 bits (58), Expect = 7.6
 Identities = 10/76 (13%), Positives = 39/76 (51%), Gaps = 4/76 (5%)

Query: 66  IKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELD 125
           +++E++ I + L +  K  +  +K S ++++     I   ++   ++ +K+L +  + + 
Sbjct: 1   LREEINSIREVLQEKDKAREEALKLSREIVRLSGEAIFLLHRGDFEEAEKKLKKASEAV- 59

Query: 126 LTIKELNQIIKELNQI 141
              ++L +++    ++
Sbjct: 60  ---EKLKRLLAGFPEL 72


>gnl|CDD|221833 pfam12889, DUF3829, Protein of unknown function (DUF3829).  This is
           a small family of proteins from several bacterial
           species, whose function is not known. It may, however,
           be related to the GvpL_GvpF family of proteins,
           pfam06386.
          Length = 277

 Score = 26.3 bits (58), Expect = 7.8
 Identities = 13/82 (15%), Positives = 29/82 (35%), Gaps = 10/82 (12%)

Query: 59  DFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELD 118
           +F K  ++  +L    +E     +               L  I  E    +L+++KKE  
Sbjct: 110 NFAKGKELHAKLLAAYEEFKAAYEAFL----------AALEKINDERLLEQLEELKKEGK 159

Query: 119 QMVKELDLTIKELNQIIKELNQ 140
           ++       + E  Q++   + 
Sbjct: 160 KIAYYTLELLLEAEQLLNLFSS 181


>gnl|CDD|218912 pfam06148, COG2, COG (conserved oligomeric Golgi) complex
           component, COG2.  The COG complex comprises eight
           proteins COG1-8. The COG complex plays critical roles in
           Golgi structure and function. The proposed function of
           the complex is to mediate the initial physical contact
           between transport vesicles and their membrane targets. A
           comparable role in tethering vesicles has been suggested
           for at least six additional large multisubunit
           complexes, including the exocyst, a complex that
           mediates trafficking to the plasma membrane. COG2
           structure reveals a six-helix bundle with few conserved
           surface features but a general resemblance to recently
           determined crystal structures of four different exocyst
           subunits. These bundles inCOG2 may act as platforms for
           interaction with other trafficing proteins including
           SNAREs (soluble N-ethylmaleimide factor attachment
           protein receptors) and Rabs.
          Length = 133

 Score = 25.7 bits (57), Expect = 8.0
 Identities = 21/86 (24%), Positives = 39/86 (45%), Gaps = 8/86 (9%)

Query: 62  KLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKEL--NLIIKESNQNKLDQIKKELDQ 119
            L+ ++ +L   +K L   + EL   I E       L  NL+       KL++I+  L  
Sbjct: 27  PLESLRDDLRAYLKLLKSELIEL---INEDYADFVSLSTNLV---GLDEKLERIRVPLLA 80

Query: 120 MVKELDLTIKELNQIIKELNQIIKEL 145
           + KE++    EL   ++E+   ++E 
Sbjct: 81  LRKEVESVRSELEARLEEVEAKLEEK 106


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 26.5 bits (59), Expect = 8.0
 Identities = 19/99 (19%), Positives = 39/99 (39%), Gaps = 10/99 (10%)

Query: 61   NKLDQIKKELDQIIKELDQI-IKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQ 119
             +L     +   ++KEL ++       IIK+ ++ I        E +    D+  +E   
Sbjct: 1024 GELVITNAKKKDLVKELKKLGYVRFKDIIKKKSEKITAEEEEGAEEDDEADDEDDEEELG 1083

Query: 120  MVKELD---------LTIKELNQIIKELNQIIKELNQIV 149
                 D         LT +++ ++  EL +  KEL ++ 
Sbjct: 1084 AAVSYDYLLSMPIWSLTKEKVEKLNAELEKKEKELEKLK 1122


>gnl|CDD|151528 pfam11083, Streptin-Immun, Lantibiotic streptin immunity protein.
           Streptococcal species produce a lantibiotic, streptin,
           in a similar manner to the production of nisin and
           subtilin by other lactic acid bacteria, in order to
           compete against competing bacteria within the
           environment. The immunity protein protects the bacterium
           from destruction by its own lantibiotic. In general,
           there is little homology between the immunity proteins
           of different genera of bacteria.
          Length = 99

 Score = 25.4 bits (56), Expect = 8.3
 Identities = 18/68 (26%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 48  EEQEMEKECKVDFNKLDQIKK-ELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESN 106
            + E  K  + DF+KL+  +   L+Q+ +E+ ++  +L L + E  ++++ L   +K  N
Sbjct: 31  ADVESRKLARYDFSKLNLTESITLEQVEEEILKLQNQLSLYLDEYEKLVRRLEKFVKILN 90

Query: 107 QNKLDQIK 114
            +K   IK
Sbjct: 91  TSKETGIK 98


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 26.2 bits (58), Expect = 8.5
 Identities = 14/76 (18%), Positives = 42/76 (55%)

Query: 69  ELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTI 128
           E+++ +++L+  IK+L + +K++               + ++  ++ E+ ++ + L+   
Sbjct: 35  EVERKLRKLESKIKKLGIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLESLE 94

Query: 129 KELNQIIKELNQIIKE 144
           KE+N++ + LN + +E
Sbjct: 95  KEINELEEWLNVLDEE 110


>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
           [DNA replication, recombination, and repair].
          Length = 663

 Score = 26.4 bits (59), Expect = 8.5
 Identities = 15/79 (18%), Positives = 40/79 (50%), Gaps = 2/79 (2%)

Query: 47  IEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESN 106
           I+E E  +E ++ +N+   I  +   I K++  I+       +   ++ K+ + + K+  
Sbjct: 565 IDETERRREIQMAYNEEHGITPQ--TIKKKIRDILDGEYEEDEYKAKIEKKASKMSKKEL 622

Query: 107 QNKLDQIKKELDQMVKELD 125
           +  + +++KE+ +  K L+
Sbjct: 623 EKLIKKLEKEMKEAAKNLE 641


>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1).  This family
           consists of several rhoptry-associated protein 1 (RAP-1)
           sequences which appear to be specific to Plasmodium
           falciparum.
          Length = 790

 Score = 26.2 bits (57), Expect = 8.5
 Identities = 20/117 (17%), Positives = 50/117 (42%)

Query: 30  TDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQIIKELDQIIKELDLIIK 89
           TD     + +        E     K       +L +    L  +   L+++ +E+D++  
Sbjct: 176 TDSGKASASVAGIVGADEEAPPAPKNTLTPLEELYETNVNLFALKHPLEKLEEEIDILKN 235

Query: 90  ESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELN 146
           + ++V +E    + E ++   +  K+ L+++  E D    + ++ IK +   +K+ N
Sbjct: 236 DGDKVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFKFDEEIKFIEHDVKDRN 292


>gnl|CDD|218065 pfam04399, Glutaredoxin2_C, Glutaredoxin 2, C terminal domain.
          Glutaredoxins are a multifunctional family of
          glutathione-dependent disulphide oxidoreductases.
          Unlike other glutaredoxins, glutaredoxin 2 (Grx2)
          cannot reduce ribonucleotide reductase. Grx2 has
          significantly higher catalytic activity in the
          reduction of mixed disulphides with glutathione (GSH)
          compared with other glutaredoxins. The active site
          residues (Cys9-Pro10-Tyr11-Cys12, in Escherichia coli
          Grx2), which are found at the interface between the N-
          and C-terminal domains are identical to other
          glutaredoxins, but there is no other similarity between
          glutaredoxin 2 and other glutaredoxins. Grx2 is
          structurally similar to glutathione-S-transferases
          (GST), but there is no obvious sequence similarity. The
          inter-domain contacts are mainly hydrophobic,
          suggesting that the two domains are unlikely to be
          stable on their own. Both domains are needed for
          correct folding and activity of Grx2. It is thought
          that the primary function of Grx2 is to catalyze
          reversible glutathionylation of proteins with GSH in
          cellular redox regulation including the response to
          oxidative stress.
          Length = 132

 Score = 25.7 bits (57), Expect = 8.6
 Identities = 11/37 (29%), Positives = 21/37 (56%)

Query: 63 LDQIKKELDQIIKELDQIIKELDLIIKESNQVIKELN 99
           +Q+  +  Q I EL+  ++ELD ++  S  V  +L+
Sbjct: 50 FEQLLADTPQYIAELNADLEELDPLLLPSEAVNGQLS 86


>gnl|CDD|150302 pfam09590, Env-gp36, Lentivirus surface glycoprotein.  This
          protein is found in feline immunodeficiency retrovirus.
          It represents the surface glycoprotein which is found
          in the polyprotein C-terminal to the Env protein.
          Length = 591

 Score = 26.0 bits (57), Expect = 8.8
 Identities = 16/51 (31%), Positives = 25/51 (49%)

Query: 23 FDGLTPVTDGSTEESEINLESSNGIEEQEMEKECKVDFNKLDQIKKELDQI 73
          FD  T VT+       +N     GI EQE ++ C +   KL ++K E+ + 
Sbjct: 19 FDIATQVTEEGPLNPGVNPFRVPGITEQEKQEYCVILQPKLQELKGEITEG 69


>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
           (DUF2317).  Members of this family of hypothetical
           bacterial proteins have no known function.
          Length = 541

 Score = 26.0 bits (58), Expect = 8.8
 Identities = 10/57 (17%), Positives = 26/57 (45%), Gaps = 1/57 (1%)

Query: 95  IKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
           + +L L I++   + L+ +K+++ +     ++ I    +   EL    K L +   +
Sbjct: 392 LDKLGLSIEDVFLHGLELLKEKVLEEKANSEIDID-FEEAKAELEAQFKRLREEAAK 447


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 26.0 bits (58), Expect = 9.0
 Identities = 11/44 (25%), Positives = 24/44 (54%)

Query: 108 NKLDQIKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIVEE 151
           +K+D+ K+E +Q+ ++L     +L Q   EL  +  + ++   E
Sbjct: 73  DKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDDNDEETRE 116


>gnl|CDD|206685 cd01898, Obg, Obg GTPase.  The Obg nucleotide binding protein
           subfamily has been implicated in stress response,
           chromosome partitioning, replication initiation,
           mycelium development, and sporulation. Obg proteins are
           among a large group of GTP binding proteins conserved
           from bacteria to humans. The E. coli homolog, ObgE is
           believed to function in ribosomal biogenesis. Members of
           the subfamily contain two equally and highly conserved
           domains, a C-terminal GTP binding domain and an
           N-terminal glycine-rich domain.
          Length = 170

 Score = 25.8 bits (58), Expect = 9.1
 Identities = 18/83 (21%), Positives = 35/83 (42%), Gaps = 18/83 (21%)

Query: 58  VDFNKLDQIKKELDQIIKELDQIIKEL------------DLIIKESNQVIKELNLIIKES 105
           +D +  D   ++ + I  EL+     L            DL+  E     ++L  ++KE 
Sbjct: 86  IDLSGEDDPVEDYETIRNELEAYNPGLAEKPRIVVLNKIDLLDAEERF--EKLKELLKEL 143

Query: 106 NQNKLDQI----KKELDQMVKEL 124
              K+  I     + LD+++K+L
Sbjct: 144 KGKKVFPISALTGEGLDELLKKL 166


>gnl|CDD|149105 pfam07851, TMPIT, TMPIT-like protein.  A number of members of this
           family are annotated as being transmembrane proteins
           induced by tumour necrosis factor alpha, but no
           literature was found to support this.
          Length = 330

 Score = 25.9 bits (57), Expect = 9.1
 Identities = 17/81 (20%), Positives = 37/81 (45%), Gaps = 19/81 (23%)

Query: 66  IKKELDQIIKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELD 125
            ++E  ++ KE  Q+        +E++++ K+           KL++++K  +Q    + 
Sbjct: 2   CEEEWKELEKEFQQL--------QETHRLYKQ-----------KLEEVEKLQEQCTSAIA 42

Query: 126 LTIKELNQIIKELNQIIKELN 146
              K L Q+I  L ++ K L 
Sbjct: 43  RQRKRLKQLIVSLKKLKKSLT 63


>gnl|CDD|237216 PRK12821, PRK12821, aspartyl/glutamyl-tRNA amidotransferase subunit
           C-like protein; Provisional.
          Length = 477

 Score = 26.0 bits (57), Expect = 9.2
 Identities = 14/64 (21%), Positives = 26/64 (40%), Gaps = 4/64 (6%)

Query: 89  KESNQVIKELN-LIIKESNQNKLDQI-KKELDQMVK--ELDLTIKELNQIIKELNQIIKE 144
           K   +   +L   +  +   +K  Q+ K EL ++ +    DL   EL ++  E   I   
Sbjct: 364 KGIKKSFSDLQSPLFPKHWTSKKQQLNKDELKKLARLVMFDLDDAELEKLQVEFKDITSS 423

Query: 145 LNQI 148
             Q+
Sbjct: 424 FKQV 427


>gnl|CDD|233691 TIGR02027, rpoA, DNA-directed RNA polymerase, alpha subunit,
           bacterial and chloroplast-type.  This family consists of
           the bacterial (and chloroplast) DNA-directed RNA
           polymerase alpha subunit, encoded by the rpoA gene. The
           RNA polymerase catalyzes the transcription of DNA into
           RNA using the four ribonucleoside triphosphates as
           substrates. The amino terminal domain is involved in
           dimerizing and assembling the other RNA polymerase
           subunits into a transcriptionally active enzyme. The
           carboxy-terminal domain contains determinants for
           interaction with DNA and with transcriptional activator
           proteins [Transcription, DNA-dependent RNA polymerase].
          Length = 297

 Score = 26.1 bits (58), Expect = 9.3
 Identities = 13/59 (22%), Positives = 26/59 (44%), Gaps = 2/59 (3%)

Query: 74  IKELDQIIKELDLIIKESNQVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELN 132
           I E  +I+ E        ++ I+      +E  +    +  K L   ++ELDL+++  N
Sbjct: 192 IAEAAKILIEHLEPFVNLDEEIEAFEEEQEEEEEE--LEDAKLLSIKIEELDLSVRSYN 248


>gnl|CDD|219009 pfam06404, PSK, Phytosulfokine precursor protein (PSK).  This
          family consists of several plant specific
          phytosulfokine precursor proteins. Phytosulfokines, are
          active as either a pentapeptide or a C-terminally
          truncated tetrapeptide. These compounds were first
          isolated because of their ability to stimulate cell
          division in somatic embryo cultures of Asparagus
          officinalis.
          Length = 74

 Score = 24.6 bits (54), Expect = 9.4
 Identities = 10/28 (35%), Positives = 16/28 (57%)

Query: 30 TDGSTEESEINLESSNGIEEQEMEKECK 57
          TD   E S+I    S+ ++E E+  EC+
Sbjct: 20 TDTGHESSKIEAAESSKVDEDEVVSECE 47


>gnl|CDD|179143 PRK00855, PRK00855, argininosuccinate lyase; Provisional.
          Length = 459

 Score = 25.9 bits (58), Expect = 9.4
 Identities = 6/19 (31%), Positives = 16/19 (84%)

Query: 67 KKELDQIIKELDQIIKELD 85
          ++E ++I+  LD+I++E++
Sbjct: 55 EEEAEKILAGLDEILEEIE 73


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 25.7 bits (57), Expect = 9.4
 Identities = 16/93 (17%), Positives = 41/93 (44%), Gaps = 9/93 (9%)

Query: 64  DQIKKELDQIIKELDQIIKELDLIIKESNQVIKELN----LIIKESNQNKLDQIKKELDQ 119
            +I  +L +     +++ +E   ++ E  Q ++E       II+++ +      ++   +
Sbjct: 40  AKIADDLAEA----ERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAE 95

Query: 120 MVKELDLTIKELNQIIK-ELNQIIKELNQIVEE 151
             +EL+   +     I+ E  + ++EL   V E
Sbjct: 96  AEEELERIKEAAEAEIEAEKERALEELRAEVAE 128


>gnl|CDD|217899 pfam04108, APG17, Autophagy protein Apg17.  Apg17 is required for
           activating Apg1 protein kinases.
          Length = 408

 Score = 26.1 bits (58), Expect = 9.5
 Identities = 12/46 (26%), Positives = 21/46 (45%)

Query: 93  QVIKELNLIIKESNQNKLDQIKKELDQMVKELDLTIKELNQIIKEL 138
           ++++   LI  E  Q+    ++ ELD     L+ T+  L   I  L
Sbjct: 58  KIVRSTALIRTEWGQSVFVNLQNELDAADARLESTLDRLRSTIVPL 103


>gnl|CDD|225553 COG3008, PqiB, Paraquat-inducible protein B [General function
           prediction only].
          Length = 553

 Score = 26.1 bits (58), Expect = 9.5
 Identities = 8/37 (21%), Positives = 18/37 (48%)

Query: 113 IKKELDQMVKELDLTIKELNQIIKELNQIIKELNQIV 149
            K  LD +V ++ +T+  L   +K    ++    +I+
Sbjct: 436 NKLPLDFVVNDVTVTLSGLQFTLKSAQTLLDGGIRIL 472


>gnl|CDD|235591 PRK05755, PRK05755, DNA polymerase I; Provisional.
          Length = 880

 Score = 26.2 bits (59), Expect = 9.9
 Identities = 15/35 (42%), Positives = 20/35 (57%), Gaps = 1/35 (2%)

Query: 51  EMEKE-CKVDFNKLDQIKKELDQIIKELDQIIKEL 84
            ME+   KVD   L ++  EL Q + EL+Q I EL
Sbjct: 492 RMERNGIKVDREYLKELSAELAQRLAELEQEIYEL 526


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.136    0.353 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,851,367
Number of extensions: 782373
Number of successful extensions: 4817
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3675
Number of HSP's successfully gapped: 1248
Length of query: 151
Length of database: 10,937,602
Length adjustment: 88
Effective length of query: 63
Effective length of database: 7,034,450
Effective search space: 443170350
Effective search space used: 443170350
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 54 (24.7 bits)