RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy157
         (347 letters)



>gnl|CDD|219906 pfam08565, CDC37_M, Cdc37 Hsp90 binding domain.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domains
           corresponds to the Hsp90 chaperone (Heat shocked protein
           90) binding domain of Cdc37. It is found between the N
           terminal Cdc37 domain pfam03234, which is predominantly
           involved in kinase binding, and the C terminal domain of
           Cdc37 pfam08564 whose function is unclear.
          Length = 173

 Score =  116 bits (293), Expect = 1e-31
 Identities = 65/176 (36%), Positives = 86/176 (48%), Gaps = 49/176 (27%)

Query: 128 NKKPPLQQGEKSDEEKEQEMKAFVKENEKLIK---------------------------- 159
           NKK      +K+DEE E +MK FV  NEKL                              
Sbjct: 1   NKKKE---KDKTDEEDEDKMKKFVTTNEKLNSAVDKDDDDELKLSPATEKFGKIPAGDYK 57

Query: 160 ---------------QYANYLVIWCINLAMDEKFDLMDHVAHQCICMQYLLELSKSLNI- 203
                          Q  + LV+   +L ++ K +LM  VAHQ I MQY+LEL+KSL + 
Sbjct: 58  DSEKFLSEHPHIVSEQTKDALVMEAFDLQLEGKEELMRQVAHQSILMQYILELAKSLKVD 117

Query: 204 -DPRACIGSFFSKI-QIAEPDYKASFDDELKSFKERIRNRAKEKLDEALKEVEEEE 257
              R CI  FF KI   AEP YK +F DE+++F  RI+ RA+E + EA++E EEEE
Sbjct: 118 PALRNCIRLFFKKIFAKAEPMYKEAFLDEVQAFFNRIKTRAQELIQEAMEEQEEEE 173


>gnl|CDD|215010 smart01070, CDC37_M, Cdc37 Hsp90 binding domain.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domains
           corresponds to the Hsp90 chaperone (Heat shocked protein
           90) binding domain of Cdc37. It is found between the N
           terminal Cdc37 domain which is predominantly involved in
           kinase binding, and the C terminal domain of Cdc37 whose
           function is unclear.
          Length = 155

 Score =  112 bits (283), Expect = 2e-30
 Identities = 52/157 (33%), Positives = 76/157 (48%), Gaps = 33/157 (21%)

Query: 123 TKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKLIKQYA-------------------- 162
           T+T  +K PP      S+EE E + K FV + E   K++                     
Sbjct: 1   TETTSSKNPP---SAPSEEEDEIDDKDFVLKLEPATKKFGMIPAGDYDDSQKFLSEHPHI 57

Query: 163 ------NYLVIWCINLAMDEKFDLMDHVAHQCICMQYLLELSKSLNIDPRACIGSFFSKI 216
                 + LV+W  +L ++ K  LM+ VAHQ I MQY+LEL+KSL +DPR C+  FF KI
Sbjct: 58  VSEETADALVMWAFDLELEGKEALMEQVAHQAILMQYILELAKSLKVDPRNCVRLFFQKI 117

Query: 217 ----QIAEPDYKASFDDELKSFKERIRNRAKEKLDEA 249
                 A+  +      E +  K R+R  A+E+ +EA
Sbjct: 118 KTPSHPAKEGFLEDVQSEFEHIKTRVRIIAQEQAEEA 154


>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases.and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain.
           Expression of a construct consisting of only the
           N-terminal domain of Saccharomyces pombe Cdc37 results
           in cellular viability. This indicates that interactions
           with the cochaperone Hsp90 may not be essential for
           Cdc37 function.
          Length = 154

 Score =  107 bits (268), Expect = 4e-28
 Identities = 61/156 (39%), Positives = 82/156 (52%), Gaps = 33/156 (21%)

Query: 1   MVDYSKWKDIEISDDED-ETHPNIDTPSLFRWR----HQARVERMEE------QRLENEK 49
            +DYSKW  IE+SDD D E HPN+D  S  RW+    HQARVERMEE      + + N+ 
Sbjct: 1   PIDYSKWDKIELSDDSDIEVHPNVDKKSFIRWKQRDIHQARVERMEEIKNLKYELIMNDH 60

Query: 50  FQQEKEKTLKTLEET--------------------KKKLVESEQAGANNLDELKKALAEL 89
             +  +K LK L E                     KK+L E+       L+ELKK   +L
Sbjct: 61  LNKRIDKLLKGLREEELSPETPTYNEMLAELQDQLKKELEEANGDSEGLLEELKKHRDKL 120

Query: 90  EKEAANIRKKEEDLKKKEKITPWNVDTISKPGFTKT 125
           +KE   +RKK ++L+K+EK   W+VDT    GF K+
Sbjct: 121 KKEQKELRKKLDELEKEEKKKIWSVDT--HTGFDKS 154


>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain
           pfam08565. Expression of a construct consisting of only
           the N-terminal domain of Saccharomyces pombe Cdc37
           results in cellular viability. This indicates that
           interactions with the cochaperone Hsp90 may not be
           essential for Cdc37 function.
          Length = 172

 Score =  102 bits (256), Expect = 3e-26
 Identities = 55/174 (31%), Positives = 80/174 (45%), Gaps = 51/174 (29%)

Query: 1   MVDYSKWKDIEISDDED-ETHPNIDTPSLFRWRHQARVERMEEQRLENEKFQQEKEKTLK 59
            +DYSKW  IE+SDD D E HPN+D PS  RW+ QA  ER E+++ E E+ ++E+    +
Sbjct: 1   PIDYSKWDKIELSDDSDIEVHPNVDKPSFIRWKQQAIHERREQRKQEIEELEKERIMNDR 60

Query: 60  TLEETKKKLVE------------------------------------------------S 71
            LE   K L E                                                 
Sbjct: 61  LLERVDKLLSELKEESLDSSQAVMKSLNENFTDKENVEPEQPTYNEMVEDLFDQVKDEVD 120

Query: 72  EQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTISKPGFTKT 125
           E+ GA  ++EL+K   +L+KE   + KK ++L+K+EK   W+VDT    GF K+
Sbjct: 121 EKNGAALIEELQKHRDKLKKEQKELLKKLDELEKEEKKKIWSVDT--HTGFDKS 172


>gnl|CDD|219905 pfam08564, CDC37_C, Cdc37 C terminal domain.  Cdc37 is a protein
           required for the activity of numerous eukaryotic protein
           kinases. This domains corresponds to the C terminal
           domain whose function is unclear. It is found C terminal
           to the Hsp90 chaperone (Heat shocked protein 90) binding
           domain pfam08565 and the N terminal kinase binding
           domain of Cdc37 pfam03234.
          Length = 89

 Score = 77.0 bits (190), Expect = 9e-18
 Identities = 38/78 (48%), Positives = 50/78 (64%)

Query: 263 GPGGLDPVEVFESLPDVLKKCFELQDTAMLQEAIASLSPEEATYHMKRCVDSGLWVPDAK 322
           G  GLDPVEVFESLP  L+K FE +D   + + +A +  EEA YHM+RC+DSGLWVP+AK
Sbjct: 1   GAEGLDPVEVFESLPPELQKAFETKDLDEVNKVLAKMPVEEAEYHMERCIDSGLWVPNAK 60

Query: 323 KAAGESEDPQENTPDAED 340
               +     E   + E+
Sbjct: 61  IEGEKEFKELEEEYNEEE 78


>gnl|CDD|215009 smart01069, CDC37_C, Cdc37 C terminal domain.  Cdc37 is a protein
           required for the activity of numerous eukaryotic protein
           kinases. This domains corresponds to the C terminal
           domain whose function is unclear. It is found C terminal
           to the Hsp90 chaperone (Heat shocked protein 90) binding
           domain pfam08565 and the N terminal kinase binding
           domain of Cdc37.
          Length = 93

 Score = 68.6 bits (168), Expect = 1e-14
 Identities = 39/83 (46%), Positives = 53/83 (63%), Gaps = 3/83 (3%)

Query: 262 LGPGGL--DPVEVFESLPDVLKKCFELQDTAMLQEAIASLSPEEATYHMKRCVDSGLW-V 318
             PGGL  DP EVFESLP  L+K FE +D  ML + +A +  EEA YH++RC++SGLW V
Sbjct: 1   NLPGGLSLDPEEVFESLPPELQKAFETKDLDMLNKVLAKMPVEEAEYHLERCIESGLWGV 60

Query: 319 PDAKKAAGESEDPQENTPDAEDP 341
           P+A +   E ++ QE     E+ 
Sbjct: 61  PNAIEDETEFKELQEQYEVEEEA 83


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 39.4 bits (93), Expect = 0.002
 Identities = 24/71 (33%), Positives = 43/71 (60%), Gaps = 3/71 (4%)

Query: 38  ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
           E + + R E EK  +E+   L+ LE   K+L++ E+     L+ L+K   ELEK+   + 
Sbjct: 64  EEIHKLRNEFEKELRERRNELQKLE---KRLLQKEENLDRKLELLEKREEELEKKEKELE 120

Query: 98  KKEEDLKKKEK 108
           +K+++L+KKE+
Sbjct: 121 QKQQELEKKEE 131



 Score = 35.5 bits (83), Expect = 0.037
 Identities = 27/126 (21%), Positives = 59/126 (46%), Gaps = 23/126 (18%)

Query: 39  RMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRK 98
           + E + ++ E   + KE+  K   E +K+L E          EL+K    L ++  N+ +
Sbjct: 48  KKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRN-------ELQKLEKRLLQKEENLDR 100

Query: 99  KEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKLI 158
           K E L+K+E+                 +  K+  L+Q ++  E+KE+E++  ++E  + +
Sbjct: 101 KLELLEKREEE----------------LEKKEKELEQKQQELEKKEEELEELIEEQLQEL 144

Query: 159 KQYANY 164
           ++ +  
Sbjct: 145 ERISGL 150



 Score = 29.7 bits (68), Expect = 2.6
 Identities = 23/89 (25%), Positives = 46/89 (51%), Gaps = 16/89 (17%)

Query: 32  RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVE-------------SEQAGANN 78
           R    +E+ EE+  + EK  ++K++ L+  EE  ++L+E             +E+A    
Sbjct: 100 RKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEEAKEIL 159

Query: 79  LDELKKALAELEKEAANIRKKEEDLKKKE 107
           L+++++   E   EAA + K+ E+  K+E
Sbjct: 160 LEKVEE---EARHEAAVLIKEIEEEAKEE 185



 Score = 29.0 bits (66), Expect = 4.2
 Identities = 20/74 (27%), Positives = 38/74 (51%), Gaps = 4/74 (5%)

Query: 36  RVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAAN 95
           R  R E Q+LE  +  Q KE+ L    E  +K  E  +     L++ ++ L + E+E   
Sbjct: 78  RERRNELQKLEK-RLLQ-KEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEE 135

Query: 96  IRKKEEDLKKKEKI 109
           + +++  L++ E+I
Sbjct: 136 LIEEQ--LQELERI 147


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 39.4 bits (92), Expect = 0.003
 Identities = 30/133 (22%), Positives = 56/133 (42%), Gaps = 13/133 (9%)

Query: 38  ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
           E +E+   E EK ++E E+ ++ L+E ++   + E        E ++AL  L+KE  +I 
Sbjct: 532 EHLEKLLKEQEKLKKELEQEMEELKERERNK-KLEL-----EKEAQEALKALKKEVESII 585

Query: 98  KKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPP----LQQGEKSDEEKEQEMKAFVKE 153
           ++   LK+K+      + +I      K    K P      Q  +  D+ + +      K 
Sbjct: 586 RE---LKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQADKIGDKVRIRYFGQKGKI 642

Query: 154 NEKLIKQYANYLV 166
            + L     N  V
Sbjct: 643 VQILGGNKWNVTV 655


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 38.2 bits (89), Expect = 0.006
 Identities = 14/76 (18%), Positives = 26/76 (34%)

Query: 32  RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
                +     Q+ +  +  +E++KTL  L        +  +    N   LK  +A  E 
Sbjct: 186 ELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEA 245

Query: 92  EAANIRKKEEDLKKKE 107
            AA  R+     +   
Sbjct: 246 AAAKAREAAAAAEAAA 261



 Score = 32.8 bits (75), Expect = 0.26
 Identities = 21/88 (23%), Positives = 37/88 (42%), Gaps = 11/88 (12%)

Query: 28  LFRWRHQARVERMEEQ-------RLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLD 80
            +   + AR ER++              +   E+ +    L E + +  +  Q     L+
Sbjct: 151 YYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQL----LE 206

Query: 81  ELKKALAELEKEAANIRKKEEDLKKKEK 108
           E KK LA+L  E +  +KK E+L+  E 
Sbjct: 207 ERKKTLAQLNSELSADQKKLEELRANES 234



 Score = 30.8 bits (70), Expect = 0.92
 Identities = 17/68 (25%), Positives = 33/68 (48%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
           Q  +  +E++  E +  + + EK LK+LE     L       A++L +L+K +A+L    
Sbjct: 44  QKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARL 103

Query: 94  ANIRKKEE 101
             +  +E 
Sbjct: 104 NALEVQER 111



 Score = 28.1 bits (63), Expect = 6.6
 Identities = 14/58 (24%), Positives = 27/58 (46%)

Query: 51  QQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEK 108
            ++ ++  K +   +KK+ E +   A    +LK    E+    A + +  +DLKK  K
Sbjct: 37  DKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRK 94


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 37.6 bits (88), Expect = 0.008
 Identities = 41/212 (19%), Positives = 85/212 (40%), Gaps = 49/212 (23%)

Query: 53  EKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKITPW 112
           +K+ ++K+LEE  K + E  +     + EL++ ++ELE E   + ++ E L+      PW
Sbjct: 76  KKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLE------PW 129

Query: 113 ---NVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKLIKQYAN--YLVI 167
              ++D     GF    V        G   +++ E+       EN + I       Y+V+
Sbjct: 130 GNFDLDLSLLLGFKYVSV------FVGTVPEDKLEELKLESDVENVEYISTDKGYVYVVV 183

Query: 168 WCINLAMDEKFDLMDHVAHQCICMQYLLELSKSLNIDPRACIGSFFSKIQIAEPDYKASF 227
             +    DE  +              L +L               F ++++   + + + 
Sbjct: 184 VVLKELSDEVEEE-------------LKKLG--------------FERLEL---EEEGTP 213

Query: 228 DDELKSFKERIR--NRAKEKLDEALKEVEEEE 257
            + ++  KE +    + +E L E LKE+ ++ 
Sbjct: 214 SELIREIKEELEEIEKERESLLEELKELAKKY 245


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 37.4 bits (87), Expect = 0.011
 Identities = 21/75 (28%), Positives = 44/75 (58%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
           +  + R+EE+  E ++  +E EK ++ L+   ++L E  +     L ELK+ + ELE E 
Sbjct: 245 EEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEI 304

Query: 94  ANIRKKEEDLKKKEK 108
           + +R++ E+L+ + +
Sbjct: 305 SLLRERLEELENELE 319



 Score = 33.9 bits (78), Expect = 0.15
 Identities = 30/131 (22%), Positives = 63/131 (48%), Gaps = 3/131 (2%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
           +  +E +E++  E ++  +E E   + LE+  K+L E ++     L EL+  LAEL++E 
Sbjct: 848 EEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEI 907

Query: 94  ANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKE 153
             +R++ E+L+ K +        + +         +     + E+  E  E+E++A    
Sbjct: 908 EKLRERLEELEAKLERLE---VELPELEEELEEEYEDTLETELEREIERLEEEIEALGPV 964

Query: 154 NEKLIKQYANY 164
           N + I++Y   
Sbjct: 965 NLRAIEEYEEV 975



 Score = 33.9 bits (78), Expect = 0.15
 Identities = 18/67 (26%), Positives = 37/67 (55%), Gaps = 7/67 (10%)

Query: 49  KFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAAN---IRKKEEDLKK 105
           K+++ KE+  + LE T++ L   E    + L+EL+K L +LE++A      ++ + +L++
Sbjct: 169 KYKERKEEAERKLERTEENLERLE----DLLEELEKQLEKLERQAEKAERYQELKAELRE 224

Query: 106 KEKITPW 112
            E     
Sbjct: 225 LELALLL 231



 Score = 33.5 bits (77), Expect = 0.16
 Identities = 22/74 (29%), Positives = 43/74 (58%)

Query: 33  HQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKE 92
            + R ER+E++  E E+  +E E+ L  LEE  ++L +  +     L+EL+    ELE E
Sbjct: 819 LEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDE 878

Query: 93  AANIRKKEEDLKKK 106
              + +++E+L+++
Sbjct: 879 LKELEEEKEELEEE 892



 Score = 33.5 bits (77), Expect = 0.17
 Identities = 55/253 (21%), Positives = 110/253 (43%), Gaps = 38/253 (15%)

Query: 23   IDTPSLFRWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDEL 82
            I+     R   Q  +E +EE+  E E+     E+ L++LE+ +++L +  +     ++EL
Sbjct: 781  IEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEEL 840

Query: 83   KKALAELEKEAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEE 142
            ++ L ELE+E   + K+ E+LK++ +                     +  L++ E+  EE
Sbjct: 841  EEKLDELEEELEELEKELEELKEELE------------ELEAEKEELEDELKELEEEKEE 888

Query: 143  KEQEMKAFVKENEKLIKQYANYLVIWCINLAMDEKFDLMDHVAHQCICMQYLLELSKSL- 201
             E+E++    E  +L ++            A  E+ ++      + +  +Y   L   L 
Sbjct: 889  LEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELE 948

Query: 202  --------------NIDPRACIGSFFSKIQIAEPDYKASFDDELKSFKERIRNRAKEKLD 247
                           ++ RA          I E +      +ELKS +E +   AKEKL 
Sbjct: 949  REIERLEEEIEALGPVNLRA----------IEEYEEVEERYEELKSQREDLE-EAKEKLL 997

Query: 248  EALKEVEEEERQK 260
            E ++E+++E+R++
Sbjct: 998  EVIEELDKEKRER 1010



 Score = 32.0 bits (73), Expect = 0.62
 Identities = 21/73 (28%), Positives = 39/73 (53%), Gaps = 3/73 (4%)

Query: 40  MEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANN---LDELKKALAELEKEAANI 96
              Q L+ E  + E    L  L+E +K+L E E+  +     L+EL++ L E EKE   +
Sbjct: 213 ERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEEL 272

Query: 97  RKKEEDLKKKEKI 109
           + + E+L+++ + 
Sbjct: 273 KSELEELREELEE 285



 Score = 30.8 bits (70), Expect = 1.3
 Identities = 18/77 (23%), Positives = 47/77 (61%)

Query: 32  RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
             +  +E+++ +  E E+  +E E+ L+ L+E  ++L E  ++    L +LK+ + ELE+
Sbjct: 727 ALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEE 786

Query: 92  EAANIRKKEEDLKKKEK 108
           +   ++++ E+L+++ +
Sbjct: 787 KRQALQEELEELEEELE 803



 Score = 28.9 bits (65), Expect = 4.6
 Identities = 22/90 (24%), Positives = 44/90 (48%), Gaps = 5/90 (5%)

Query: 30  RWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAEL 89
           R   +    ++EE + E    ++E E+    LEE +++L E E      L+EL++ L EL
Sbjct: 709 RQLEELE-RQLEELKRELAALEEELEQLQSRLEELEEELEELE----EELEELQERLEEL 763

Query: 90  EKEAANIRKKEEDLKKKEKITPWNVDTISK 119
           E+E  ++ +    LK++ +        + +
Sbjct: 764 EEELESLEEALAKLKEEIEELEEKRQALQE 793


>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
          Length = 202

 Score = 36.2 bits (84), Expect = 0.014
 Identities = 15/56 (26%), Positives = 27/56 (48%), Gaps = 3/56 (5%)

Query: 31  WRHQARVERMEEQRLE-NEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKA 85
           W++   V ++EE+R E    + ++K K  K  +E +KK +         +  LKK 
Sbjct: 146 WKYADVVAKLEEKRKERAAAYYKKKVKLRKAWKEARKKALAKLP--KAIVAVLKKF 199


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 35.4 bits (82), Expect = 0.033
 Identities = 24/76 (31%), Positives = 38/76 (50%), Gaps = 9/76 (11%)

Query: 33  HQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKE 92
           H+AR    EE+R   +KF +      K  E+ +K+L E EQ       EL   L E++K+
Sbjct: 307 HEAR--LNEEERELKKKFTE------KIREK-EKRLEELEQNLIEERKELNSKLEEIQKK 357

Query: 93  AANIRKKEEDLKKKEK 108
             ++ K+ E LK  + 
Sbjct: 358 LEDLEKRLEKLKSNKS 373


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 34.9 bits (81), Expect = 0.051
 Identities = 18/78 (23%), Positives = 33/78 (42%), Gaps = 5/78 (6%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
           + + ER+ ++    EK Q+E  K  + +   K    ++++A +       K L +LE   
Sbjct: 240 EQKAERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKS-----RIKRLEKLEARL 294

Query: 94  ANIRKKEEDLKKKEKITP 111
           A  R  EE      +  P
Sbjct: 295 AEERPVEEGKPLAFRFPP 312


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 35.1 bits (80), Expect = 0.060
 Identities = 33/125 (26%), Positives = 62/125 (49%), Gaps = 7/125 (5%)

Query: 34   QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
            Q + +  EE++   E  + E+E  +K  EE KK   + ++A     +E KKA  + +K A
Sbjct: 1637 QLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKA-----EEAKKAEEDEKKAA 1691

Query: 94   ANIRKKEEDLKKKEKITPWNVDTISKPGFTKTV--VNKKPPLQQGEKSDEEKEQEMKAFV 151
              ++K+ E+ KK E++     +   K    K     NK    +  ++++E+K++  +A  
Sbjct: 1692 EALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKK 1751

Query: 152  KENEK 156
             E EK
Sbjct: 1752 DEEEK 1756



 Score = 33.6 bits (76), Expect = 0.20
 Identities = 33/125 (26%), Positives = 61/125 (48%), Gaps = 5/125 (4%)

Query: 32   RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
            R     ++ EE R+E      E+EK +K  E  K +  + +       +E KK + +L+K
Sbjct: 1581 RKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKK 1640

Query: 92   EAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFV 151
            + A  +KK E+LKK E+      + I      K     K   ++ +K++E++++  +A  
Sbjct: 1641 KEAEEKKKAEELKKAEE-----ENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695

Query: 152  KENEK 156
            KE E+
Sbjct: 1696 KEAEE 1700



 Score = 33.2 bits (75), Expect = 0.22
 Identities = 29/123 (23%), Positives = 52/123 (42%), Gaps = 6/123 (4%)

Query: 34   QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
            +A  +  E    + E  +++KE+  K  +  KKK  E ++A     DE KK   E +K+A
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKA-----DEAKKKAEEDKKKA 1407

Query: 94   ANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKE 153
              ++K     KK ++      +   K    K    +     + +K  EE ++  +A  K 
Sbjct: 1408 DELKKAAAAKKKADEAKK-KAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKA 1466

Query: 154  NEK 156
             E 
Sbjct: 1467 EEA 1469



 Score = 31.3 bits (70), Expect = 0.88
 Identities = 35/130 (26%), Positives = 67/130 (51%), Gaps = 9/130 (6%)

Query: 32   RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
            +     ++ EE + + E+ ++ +E   K  E  KK+  E+++A     +ELKK  AE +K
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKA-----EELKKKEAEEKK 1716

Query: 92   EAANIRKKEEDLKKK-EKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAF 150
            +A  ++K EE+ K K E+      +   K    K    +K  +   +K +E+K +E++  
Sbjct: 1717 KAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIR-- 1774

Query: 151  VKENEKLIKQ 160
             KE E +I++
Sbjct: 1775 -KEKEAVIEE 1783



 Score = 31.3 bits (70), Expect = 0.98
 Identities = 22/72 (30%), Positives = 37/72 (51%), Gaps = 7/72 (9%)

Query: 38   ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
            ++ EE +   E  ++ +E   K  +E KKK  E+++A     DE KK   E +K+A   +
Sbjct: 1451 KKAEEAKKAEEAKKKAEEA--KKADEAKKKAEEAKKA-----DEAKKKAEEAKKKADEAK 1503

Query: 98   KKEEDLKKKEKI 109
            K  E  KK ++ 
Sbjct: 1504 KAAEAKKKADEA 1515



 Score = 30.9 bits (69), Expect = 1.4
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 8/78 (10%)

Query: 32   RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
            R    +++ EE++  +E    +K +  K  +E KKK  E+++A     DE KK   E +K
Sbjct: 1278 RKADELKKAEEKKKADE---AKKAEEKKKADEAKKKAEEAKKA-----DEAKKKAEEAKK 1329

Query: 92   EAANIRKKEEDLKKKEKI 109
            +A   +KK E+ KK  + 
Sbjct: 1330 KADAAKKKAEEAKKAAEA 1347



 Score = 29.7 bits (66), Expect = 2.9
 Identities = 24/108 (22%), Positives = 44/108 (40%), Gaps = 4/108 (3%)

Query: 38   ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
            ++ EE +   E  + E E      E  ++K     +A     +E KK     +K+A   +
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEK----AEAAEKKKEEAKKKADAAKKKAEEKK 1391

Query: 98   KKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQ 145
            K +E  KK E+      +        K     K   ++ +K+DE K++
Sbjct: 1392 KADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKK 1439



 Score = 29.7 bits (66), Expect = 3.0
 Identities = 26/114 (22%), Positives = 51/114 (44%), Gaps = 3/114 (2%)

Query: 38   ERMEEQRLENEKFQQEKEKTLKTLEETKKK---LVESEQAGANNLDELKKALAELEKEAA 94
            ++  E   +  +  ++ E+  K   E KKK   L ++E+      +E KK   E +K+A 
Sbjct: 1688 KKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE 1747

Query: 95   NIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMK 148
              +K EE+ KK   +         +    K  V ++   ++ EK   E ++++K
Sbjct: 1748 EAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIK 1801



 Score = 28.6 bits (63), Expect = 6.4
 Identities = 20/91 (21%), Positives = 38/91 (41%), Gaps = 1/91 (1%)

Query: 17   DETHPNIDTPSLFRWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGA 76
            DE              H  + E ++    + +   +E  +  +  EE   K  E+++   
Sbjct: 1049 DEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTET 1108

Query: 77   NNLDELKKALAELEKEAANIRKKEEDLKKKE 107
               +E +KA  E +K+A + RK EE  K ++
Sbjct: 1109 GKAEEARKA-EEAKKKAEDARKAEEARKAED 1138



 Score = 28.2 bits (62), Expect = 9.7
 Identities = 36/119 (30%), Positives = 57/119 (47%), Gaps = 15/119 (12%)

Query: 38   ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
            ++ EE +  +E  ++ +E   K  EE KKK  E+++A     DE KK  AE  K+A   +
Sbjct: 1438 KKAEEAKKADEAKKKAEEA--KKAEEAKKKAEEAKKA-----DEAKKK-AEEAKKADEAK 1489

Query: 98   KKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEK 156
            KK E+ KKK        D   K    K   ++    ++ +K+DE K+ E      E +K
Sbjct: 1490 KKAEEAKKK-------ADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541


>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
          Length = 428

 Score = 34.7 bits (80), Expect = 0.060
 Identities = 26/81 (32%), Positives = 42/81 (51%), Gaps = 14/81 (17%)

Query: 40  MEEQRLENEKFQQ---EKEKTLKTLEETKKK----LVESEQAGANNLDELK------KAL 86
           + EQ+ + +K +Q   E++KTL  LE + +K    L E     +   D +       KA 
Sbjct: 200 LYEQQAQQQKLEQARNERKKTLTGLESSLQKDQQQLSELRANESRLRDSIARAEREAKAR 259

Query: 87  AELE-KEAANIRKKEEDLKKK 106
           AE E +EAA +R K++  K+K
Sbjct: 260 AEREAREAARVRDKQKQAKRK 280


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 35.0 bits (81), Expect = 0.068
 Identities = 26/110 (23%), Positives = 40/110 (36%), Gaps = 3/110 (2%)

Query: 48   EKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKE 107
            EK   E EK  K LE+ K      +     +LD+ ++AL E E+       KE+ LK K 
Sbjct: 1105 EKLNAELEKKEKELEKLKNT--TPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKT 1162

Query: 108  K-ITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEK 156
            K           K    K   +     ++       K  +     K ++K
Sbjct: 1163 KGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDK 1212


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 35.0 bits (81), Expect = 0.069
 Identities = 21/75 (28%), Positives = 35/75 (46%)

Query: 32  RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
             + R+  +E Q  E E   +E E  L  L E   +L E  +     L+ L+  L ELE 
Sbjct: 306 ILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEA 365

Query: 92  EAANIRKKEEDLKKK 106
           E   +  + E+L+++
Sbjct: 366 ELEELESRLEELEEQ 380



 Score = 32.7 bits (75), Expect = 0.30
 Identities = 19/55 (34%), Positives = 32/55 (58%)

Query: 52  QEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKK 106
            E+ + ++ LEE  ++L E        L EL+K L ELE+E   +RK+ E+L ++
Sbjct: 673 LERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQ 727



 Score = 31.2 bits (71), Expect = 1.0
 Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 41  EEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKE 100
           E  RLE +K  Q   + L  LE   ++L    +   + LDEL + LAELE++   ++++ 
Sbjct: 296 EISRLEQQK--QILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353

Query: 101 EDLKKKEK 108
           E L+ + +
Sbjct: 354 ESLEAELE 361



 Score = 30.8 bits (70), Expect = 1.2
 Identities = 22/74 (29%), Positives = 38/74 (51%)

Query: 32  RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
             + ++E +EE+  E EK   E  K L+ LEE  ++L +  +  +  +  L+K LA LE 
Sbjct: 681 ELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEA 740

Query: 92  EAANIRKKEEDLKK 105
           E   + ++   L K
Sbjct: 741 EVEQLEERIAQLSK 754



 Score = 30.4 bits (69), Expect = 1.5
 Identities = 22/69 (31%), Positives = 34/69 (49%)

Query: 38  ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
           E +E Q  E E    E  + L  LEE  ++L E  ++    L+EL+  L ELE     + 
Sbjct: 319 EELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELE 378

Query: 98  KKEEDLKKK 106
           ++ E L+ K
Sbjct: 379 EQLETLRSK 387



 Score = 30.4 bits (69), Expect = 1.6
 Identities = 26/73 (35%), Positives = 44/73 (60%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
           +A +E +EE+  E E+   E E  ++ LE   ++L E  +A    LDEL+  L  L +EA
Sbjct: 760 EAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEA 819

Query: 94  ANIRKKEEDLKKK 106
           AN+R++ E L+++
Sbjct: 820 ANLRERLESLERR 832



 Score = 28.9 bits (65), Expect = 5.5
 Identities = 15/73 (20%), Positives = 33/73 (45%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
           + ++E + E         +E E+ ++ LE   + L+    +    L  L+  L EL +E 
Sbjct: 844 EEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL 903

Query: 94  ANIRKKEEDLKKK 106
             +  K  +L+++
Sbjct: 904 RELESKRSELRRE 916



 Score = 28.5 bits (64), Expect = 7.2
 Identities = 19/75 (25%), Positives = 41/75 (54%)

Query: 32  RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
           R +A VE++EE+  +  K   E E  ++ LEE  ++  E        ++EL+  + +L++
Sbjct: 737 RLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKE 796

Query: 92  EAANIRKKEEDLKKK 106
           E   +R+  ++L+ +
Sbjct: 797 ELKALREALDELRAE 811


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 35.0 bits (81), Expect = 0.070
 Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 3/78 (3%)

Query: 38  ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANN---LDELKKALAELEKEAA 94
            ++ E   E E+ +QE+EK  + LEE ++ L   EQ   N    L EL+  + ELE++  
Sbjct: 716 RKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLH 775

Query: 95  NIRKKEEDLKKKEKITPW 112
            + +   DL+ +   +  
Sbjct: 776 KLEEALNDLEARLSHSRI 793



 Score = 30.0 bits (68), Expect = 2.0
 Identities = 20/75 (26%), Positives = 40/75 (53%), Gaps = 6/75 (8%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
           +  +E++ E+  E EK  +E E+ L+ L +  K L E EQ        +K+ + ELE E 
Sbjct: 250 EEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQL------RVKEKIGELEAEI 303

Query: 94  ANIRKKEEDLKKKEK 108
           A++ +   + +++ +
Sbjct: 304 ASLERSIAEKERELE 318



 Score = 30.0 bits (68), Expect = 2.0
 Identities = 23/115 (20%), Positives = 46/115 (40%), Gaps = 5/115 (4%)

Query: 32  RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
           +     E  E Q    +  +Q K    K +E    K  E E+     L+EL+ AL +LE 
Sbjct: 828 KEYLEKEIQELQEQRIDLKEQIKSIE-KEIENLNGKKEELEE----ELEELEAALRDLES 882

Query: 92  EAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQE 146
              +++K+ ++L+ + +     ++ +      K     +   +     +E  E E
Sbjct: 883 RLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIE 937



 Score = 29.7 bits (67), Expect = 2.6
 Identities = 21/75 (28%), Positives = 40/75 (53%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
           QA + ++EE+    E   +E E+ L  L   K+ L +  Q       +LK+ +  +EKE 
Sbjct: 797 QAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEI 856

Query: 94  ANIRKKEEDLKKKEK 108
            N+  K+E+L+++ +
Sbjct: 857 ENLNGKKEELEEELE 871



 Score = 28.5 bits (64), Expect = 6.8
 Identities = 29/131 (22%), Positives = 53/131 (40%), Gaps = 20/131 (15%)

Query: 32  RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
             +A +  +E +  + +K + E E  L+ LE   ++L    +     L ELK  L  LE+
Sbjct: 872 ELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEE 931

Query: 92  EAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFV 151
           E + I    ED K +++  P                     L+  +   +  E+E++A  
Sbjct: 932 ELSEI----EDPKGEDEEIPEEE----------------LSLEDVQAELQRVEEEIRALE 971

Query: 152 KENEKLIKQYA 162
             N   I++Y 
Sbjct: 972 PVNMLAIQEYE 982



 Score = 28.5 bits (64), Expect = 7.3
 Identities = 28/144 (19%), Positives = 63/144 (43%), Gaps = 21/144 (14%)

Query: 41  EEQRLENEKFQQEKEKTLK------TLEET--------KKKLVESEQAGANNLDELKKAL 86
           +E+R + E+ ++E+EK  +         E         K+ L   ++A    L  L++ L
Sbjct: 194 DEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEEL 253

Query: 87  AELEKEAANIRKKEEDLKKK-----EKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDE 141
            +L +E + + K+ E++++      +KI     +   +    + +   +  +   E+S  
Sbjct: 254 EKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLR--VKEKIGELEAEIASLERSIA 311

Query: 142 EKEQEMKAFVKENEKLIKQYANYL 165
           EKE+E++   +   KL  +    L
Sbjct: 312 EKERELEDAEERLAKLEAEIDKLL 335



 Score = 28.1 bits (63), Expect = 8.9
 Identities = 17/57 (29%), Positives = 32/57 (56%)

Query: 52   QEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEK 108
            Q  E+ ++ LE      ++  +     LDELK+  A+LE+E   I ++ E+ +KK++
Sbjct: 961  QRVEEEIRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKKKR 1017


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 34.1 bits (79), Expect = 0.096
 Identities = 20/89 (22%), Positives = 46/89 (51%), Gaps = 12/89 (13%)

Query: 28  LFRWRHQA----RVERMEEQRLENEKFQQEK--EKTLKTLEETKKKLVESEQAGANNLDE 81
           + + R +     +  R E QRLE    Q+E+  ++ +++L++ ++ L + E+     L  
Sbjct: 60  VHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEK----ELSN 115

Query: 82  LKKALAELEKEAANIRKKEEDLKKKEKIT 110
            +K L E E+E   +  ++   ++ E+I+
Sbjct: 116 KEKNLDEKEEELEELIAEQ--REELERIS 142



 Score = 33.7 bits (78), Expect = 0.15
 Identities = 20/71 (28%), Positives = 39/71 (54%), Gaps = 3/71 (4%)

Query: 38  ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
           E + + R E E+  +E+   L+ LE   ++L++ E+     ++ L K    LEK+   + 
Sbjct: 58  EEVHKLRAELERELKERRNELQRLE---RRLLQREETLDRKMESLDKKEENLEKKEKELS 114

Query: 98  KKEEDLKKKEK 108
            KE++L +KE+
Sbjct: 115 NKEKNLDEKEE 125


>gnl|CDD|238288 cd00520, RRF, Ribosome recycling factor (RRF). Ribosome recycling
           factor dissociates the posttermination complex, composed
           of the ribosome, deacylated tRNA, and mRNA, after
           termination of translation.  Thus ribosomes are
           "recycled" and ready for another round of protein
           synthesis.  RRF is believed to bind the ribosome at the
           A-site in a manner that mimics tRNA, but the specific
           mechanisms remain unclear.  RRF is essential for
           bacterial growth.  It is not necessary for cell growth
           in archaea or eukaryotes, but is found in mitochondria
           or chloroplasts of some eukaryotic species.
          Length = 179

 Score = 33.4 bits (77), Expect = 0.097
 Identities = 23/74 (31%), Positives = 36/74 (48%), Gaps = 6/74 (8%)

Query: 41  EEQRLENEK-----FQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAAN 95
           EE+R E  K      ++ K        +   K+ + E+    + DE+KKA  +L+K    
Sbjct: 103 EERRKELVKDAKKIAEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKKAEEDLQKLTDE 162

Query: 96  -IRKKEEDLKKKEK 108
            I+K +E LK KEK
Sbjct: 163 YIKKIDELLKSKEK 176


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 33.9 bits (78), Expect = 0.14
 Identities = 28/143 (19%), Positives = 59/143 (41%), Gaps = 6/143 (4%)

Query: 32  RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
              A +E  E    E  + +QE  +  + L+  +++LV+ E+      ++L     +LE+
Sbjct: 50  LEAALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQLEE 109

Query: 92  EAANIRKKEEDLKKKEKITPWNVDTI--SKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKA 149
               +  +E +L++ EK     +D       G T     K        + +EEK Q +K 
Sbjct: 110 REKALSARELELEELEK----QLDNELYRVAGLTPEQARKLLLKLLDAELEEEKAQRVKK 165

Query: 150 FVKENEKLIKQYANYLVIWCINL 172
             +E +   ++ A  ++   +  
Sbjct: 166 IEEEADLEAERKAQNILAQAMQR 188


>gnl|CDD|233063 TIGR00631, uvrb, excinuclease ABC, B subunit.  All proteins in this
           family for wich functions are known are DNA helicases
           that function in the nucleotide excision repair and are
           endonucleases that make the 3' incision next to DNA
           damage. They are part of a pathway requiring UvrA, UvrB,
           UvrC, and UvrD homologs. This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 655

 Score = 33.8 bits (78), Expect = 0.15
 Identities = 26/92 (28%), Positives = 43/92 (46%), Gaps = 13/92 (14%)

Query: 80  DELKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTISKP------GFTKTVVNKKPPL 133
           D ++KA+ E E+     RK +    ++  ITP    TI KP         K   +     
Sbjct: 555 DSMQKAIEETERR----RKIQMAYNEEHGITP---QTIRKPIRDILDIELKEKEDAAKKK 607

Query: 134 QQGEKSDEEKEQEMKAFVKENEKLIKQYANYL 165
           ++GE   +  ++E+K  +K+ EK +KQ A  L
Sbjct: 608 KKGEDLSDLSKKELKKLIKQLEKEMKQAARNL 639


>gnl|CDD|218038 pfam04350, PilO, Pilus assembly protein, PilO.  PilO proteins are
          involved in the assembly of pilin. However, the precise
          function of this family of proteins is not known.
          Length = 144

 Score = 32.1 bits (74), Expect = 0.17
 Identities = 17/42 (40%), Positives = 19/42 (45%), Gaps = 7/42 (16%)

Query: 51 QQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKE 92
           Q KE TLK   E K +        A NL   KK L ELE+ 
Sbjct: 4  AQAKEATLKQEFEEKARQ-------AANLPAYKKQLEELEER 38


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 33.4 bits (76), Expect = 0.17
 Identities = 19/116 (16%), Positives = 48/116 (41%), Gaps = 4/116 (3%)

Query: 37  VERMEEQRLENEKFQQE--KEKTLKTLEETKKKLVESEQA--GANNLDELKKALAELEKE 92
           VE + E   +   F+++    K  ++ E+ K+     E+      + D+ ++     +  
Sbjct: 183 VEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDN 242

Query: 93  AANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMK 148
           A   R +    +++ K  P   DT S     +   N+K  +++ +   ++ ++E  
Sbjct: 243 ADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEAL 298


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 33.5 bits (77), Expect = 0.17
 Identities = 23/84 (27%), Positives = 40/84 (47%), Gaps = 9/84 (10%)

Query: 34  QARVERMEEQRLENEKFQQE---------KEKTLKTLEETKKKLVESEQAGANNLDELKK 84
              + +++E+    EK   E          EK +K LEET ++L E        L+ELK+
Sbjct: 391 AEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKR 450

Query: 85  ALAELEKEAANIRKKEEDLKKKEK 108
            + +LE E    R++  D  +K++
Sbjct: 451 EIEKLESELERFRREVRDKVRKDR 474


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 32.2 bits (74), Expect = 0.17
 Identities = 18/59 (30%), Positives = 29/59 (49%), Gaps = 4/59 (6%)

Query: 50  FQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEK 108
           +Q +  K  + L E    L+E E A      EL+  +A  +KE A  R + ++L+ K K
Sbjct: 44  YQAKSIK--QHLAELAADLLELEAAAPR--AELQAKIARYKKEKARYRSEAKELEAKAK 98


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 33.3 bits (76), Expect = 0.18
 Identities = 22/75 (29%), Positives = 35/75 (46%), Gaps = 1/75 (1%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
            A+    ++++ E  K +Q  E   K   E +KK  E  +  A   +   KA AE +K+A
Sbjct: 110 AAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAE-EEAKAKAAAEAKKKA 168

Query: 94  ANIRKKEEDLKKKEK 108
           A  +KK E   K + 
Sbjct: 169 AEAKKKAEAEAKAKA 183



 Score = 27.9 bits (62), Expect = 8.4
 Identities = 17/66 (25%), Positives = 32/66 (48%), Gaps = 1/66 (1%)

Query: 41  EEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKE 100
           E+ R +  + +   EK  K  E+  K+  E ++  A      + A A+ + EA   +K +
Sbjct: 87  EQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQ-AEEAKAKQAAEAKAKAEAEAEKKAK 145

Query: 101 EDLKKK 106
           E+ KK+
Sbjct: 146 EEAKKQ 151


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 33.1 bits (76), Expect = 0.18
 Identities = 20/73 (27%), Positives = 40/73 (54%), Gaps = 5/73 (6%)

Query: 36  RVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAAN 95
           R +R  + +   EK ++E E+  K  ++ KKKL +  +     L E+ K +A+ EK  A 
Sbjct: 274 RPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLAR-----LKEIAKEVAQKEKARAR 328

Query: 96  IRKKEEDLKKKEK 108
            +++ ++  +K+K
Sbjct: 329 KKEQRKERGEKKK 341


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 33.2 bits (76), Expect = 0.19
 Identities = 20/76 (26%), Positives = 39/76 (51%), Gaps = 2/76 (2%)

Query: 33  HQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKE 92
            Q R++++E++RL  ++ +++ E+  K     +K+    E A         KA AE ++ 
Sbjct: 99  EQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQ--AEEAAAKAAAAAKAKAEAEAKRA 156

Query: 93  AANIRKKEEDLKKKEK 108
           AA  +K   + KKK +
Sbjct: 157 AAAAKKAAAEAKKKAE 172



 Score = 27.8 bits (62), Expect = 8.8
 Identities = 19/75 (25%), Positives = 33/75 (44%), Gaps = 4/75 (5%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
           + R+   E+++   E  +Q   K  K  EE   K   + +A A   +   K  A   K+A
Sbjct: 108 KERLAAQEQKKQAEEAAKQAALKQ-KQAEEAAAKAAAAAKAKA---EAEAKRAAAAAKKA 163

Query: 94  ANIRKKEEDLKKKEK 108
           A   KK+ + +  +K
Sbjct: 164 AAEAKKKAEAEAAKK 178


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 32.9 bits (76), Expect = 0.27
 Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 8/80 (10%)

Query: 37  VERMEEQRL-------ENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLD-ELKKALAE 88
           +  +EE          E E   +E EK  + LEE K+KL E E       + E ++A+ E
Sbjct: 522 IASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKE 581

Query: 89  LEKEAANIRKKEEDLKKKEK 108
            +KEA  I K+   L+K   
Sbjct: 582 AKKEADEIIKELRQLQKGGY 601



 Score = 31.3 bits (72), Expect = 0.86
 Identities = 38/170 (22%), Positives = 58/170 (34%), Gaps = 51/170 (30%)

Query: 39  RMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLD-ELKKALAELEKEAANIR 97
            +E++  E E   +E EK  + LEE K+KL E E       + E ++A+ E +KEA  I 
Sbjct: 531 ELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEII 590

Query: 98  K-------------KEEDL-----------------KKKEKITPWNV---DT--ISKPGF 122
           K             K  +L                 KKK+K     +   D       G 
Sbjct: 591 KELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQ 650

Query: 123 TKTVVNKKP---------------PLQQGEKSDEEKEQEMKAFVKENEKL 157
              V++                  PL   EK  + K+++ K       K 
Sbjct: 651 KGEVLSIPDDKEAIVQAGIMKMKVPLSDLEKIQKPKKKKKKKPKTVKPKP 700


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 32.7 bits (75), Expect = 0.28
 Identities = 16/77 (20%), Positives = 31/77 (40%)

Query: 32  RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
           + Q +++ ++ Q    +K     E       + K K     +     + E KK   + EK
Sbjct: 281 KLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEK 340

Query: 92  EAANIRKKEEDLKKKEK 108
           +   I + EE ++K E 
Sbjct: 341 KKKQIERLEERIEKLEV 357


>gnl|CDD|236184 PRK08203, PRK08203, hydroxydechloroatrazine ethylaminohydrolase;
           Reviewed.
          Length = 451

 Score = 32.5 bits (75), Expect = 0.33
 Identities = 16/44 (36%), Positives = 19/44 (43%), Gaps = 7/44 (15%)

Query: 265 GGLDPVEVFESLPDVLKKCFELQDT-------AMLQEAIASLSP 301
           GGL P  V E    +L     L D        AML+ A+A  SP
Sbjct: 165 GGLPPDSVVEDEDAILADSQRLIDRYHDPGPGAMLRIALAPCSP 208


>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
          Length = 535

 Score = 32.5 bits (74), Expect = 0.33
 Identities = 21/64 (32%), Positives = 34/64 (53%)

Query: 46  ENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKK 105
           + E    +KE  L+  EE +K   E EQ   +   ELK+  + L + A ++ +K+E+L  
Sbjct: 63  KRESKALKKELLLEAKEEARKYREEIEQEFKSERQELKQIESRLTERATSLDRKDENLSS 122

Query: 106 KEKI 109
           KEK 
Sbjct: 123 KEKT 126


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 32.7 bits (75), Expect = 0.36
 Identities = 28/132 (21%), Positives = 66/132 (50%), Gaps = 4/132 (3%)

Query: 38  ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGA---NNLDELKKALAELEKEAA 94
           + + E   E ++  +  EK +K  E  ++ + E E+        ++E+   L EL +E  
Sbjct: 165 KNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELE 224

Query: 95  NIRKKEEDLKK-KEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKE 153
            + K+ ++L++ KE+I     +  S  G  + +  K   L++  +  +++ +E++  VKE
Sbjct: 225 KLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKE 284

Query: 154 NEKLIKQYANYL 165
            ++L ++   Y+
Sbjct: 285 LKELKEKAEEYI 296



 Score = 30.4 bits (69), Expect = 1.8
 Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 4/76 (5%)

Query: 35  ARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQ----AGANNLDELKKALAELE 90
            R+E ++++  E EK  +E E+  +  EE K K  E E+          ++L+K L ELE
Sbjct: 338 ERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELE 397

Query: 91  KEAANIRKKEEDLKKK 106
           K    I ++   +  +
Sbjct: 398 KAKEEIEEEISKITAR 413



 Score = 29.3 bits (66), Expect = 3.7
 Identities = 18/70 (25%), Positives = 37/70 (52%), Gaps = 1/70 (1%)

Query: 42  EQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEE 101
           E++   E++++ +E+ L+   E      E E+      +E+KK L +L++E     K ++
Sbjct: 653 EKKYSEEEYEELREEYLELSRELAGLRAELEEL-EKRREEIKKTLEKLKEELEEREKAKK 711

Query: 102 DLKKKEKITP 111
           +L+K EK   
Sbjct: 712 ELEKLEKALE 721



 Score = 28.9 bits (65), Expect = 5.0
 Identities = 27/88 (30%), Positives = 43/88 (48%), Gaps = 10/88 (11%)

Query: 32  RHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESE---------QAGANNLDEL 82
           R +  +  +EE+  E E+ ++  E+  K L+E +K+L E E         +A    L+ L
Sbjct: 318 RLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERL 377

Query: 83  KKALAELEKEAANIRKKEEDLKKKEKIT 110
           KK L  L  E    ++ EE  K KE+I 
Sbjct: 378 KKRLTGLTPEKLE-KELEELEKAKEEIE 404



 Score = 28.5 bits (64), Expect = 5.8
 Identities = 26/74 (35%), Positives = 46/74 (62%), Gaps = 4/74 (5%)

Query: 38  ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDEL---KKALAELEKEAA 94
           ER+EE + E E+ + EK K LK L+E  ++ ++  +     LDEL   +K L+ LE+E  
Sbjct: 266 ERIEELKKEIEELE-EKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEIN 324

Query: 95  NIRKKEEDLKKKEK 108
            I ++ ++L++KE+
Sbjct: 325 GIEERIKELEEKEE 338


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 31.5 bits (72), Expect = 0.38
 Identities = 25/85 (29%), Positives = 44/85 (51%), Gaps = 10/85 (11%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQ------AGANNLD----ELK 83
           Q  +  ++EQ  E ++  QE ++ L TLE   ++L +         A A  LD    EL+
Sbjct: 79  QEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRELR 138

Query: 84  KALAELEKEAANIRKKEEDLKKKEK 108
           + LAEL++E   +  + E L++ E+
Sbjct: 139 EELAELKQENEALEAENERLQENEQ 163



 Score = 28.1 bits (63), Expect = 6.0
 Identities = 16/75 (21%), Positives = 36/75 (48%), Gaps = 6/75 (8%)

Query: 38  ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELE------K 91
           ER+ E + E  + Q+E  +  + L E +++  E +Q  +    EL++   EL        
Sbjct: 66  ERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSA 125

Query: 92  EAANIRKKEEDLKKK 106
            A  + ++  +L+++
Sbjct: 126 NAIELDEENRELREE 140


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 32.6 bits (75), Expect = 0.40
 Identities = 19/78 (24%), Positives = 40/78 (51%), Gaps = 11/78 (14%)

Query: 34  QARVERMEEQRL---ENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELE 90
           QA+++ + +Q+L   E++  QQ+ E+TL  L++  +   + E       ++LK+ LA+  
Sbjct: 42  QAQLDALNKQKLLEAEDKLVQQDLEQTLALLDKIDR--QKEE------TEQLKQQLAQAP 93

Query: 91  KEAANIRKKEEDLKKKEK 108
            +    + + E LK    
Sbjct: 94  AKLRQAQAELEALKDDND 111


>gnl|CDD|220297 pfam09581, Spore_III_AF, Stage III sporulation protein AF
           (Spore_III_AF).  This family represents the stage III
           sporulation protein AF (Spore_III_AF) of the bacterial
           endospore formation program, which exists in some but
           not all members of the Firmicutes (formerly called
           low-GC Gram-positives). The C-terminal region of these
           proteins is poorly conserved.
          Length = 185

 Score = 31.5 bits (72), Expect = 0.43
 Identities = 22/125 (17%), Positives = 50/125 (40%), Gaps = 11/125 (8%)

Query: 41  EEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKE 100
            E  +E++K + +  +    LEE  K+L          +++  K    ++ +   +   E
Sbjct: 61  IENEIESKKKEIQASQRAYILEEYAKQL-------EKQVEKKLKEEYGVKVKDVEVEIDE 113

Query: 101 EDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKLIKQ 160
           +       I    V+   K    +   +K  P+    ++ + KE+E ++  +E EK+   
Sbjct: 114 DLESNNFDIK--EVNVTLKEESKEKQKSKVEPVVIDTQTSKPKEEEEES--EEAEKIKNF 169

Query: 161 YANYL 165
            A+  
Sbjct: 170 LADEY 174


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 32.0 bits (73), Expect = 0.44
 Identities = 21/75 (28%), Positives = 33/75 (44%), Gaps = 6/75 (8%)

Query: 37  VERMEEQRLENEKFQQEKEKTLK------TLEETKKKLVESEQAGANNLDELKKALAELE 90
           VE  EE     +  QQ+ +K  K       LEE KK+L   ++    N  EL+K   E +
Sbjct: 163 VETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFD 222

Query: 91  KEAANIRKKEEDLKK 105
           K     +  ++ +K 
Sbjct: 223 KLKKEGKADKQKIKS 237


>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 753

 Score = 31.9 bits (73), Expect = 0.52
 Identities = 15/70 (21%), Positives = 25/70 (35%), Gaps = 2/70 (2%)

Query: 38  ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
           E +E+   E E           +L E K ++V     G       ++A  E   E   ++
Sbjct: 525 EEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMGLP--VPAEEAKVEAVDEVKFLK 582

Query: 98  KKEEDLKKKE 107
           KK+  L    
Sbjct: 583 KKKGILDAGA 592



 Score = 30.8 bits (70), Expect = 1.4
 Identities = 21/76 (27%), Positives = 28/76 (36%), Gaps = 5/76 (6%)

Query: 36  RVERMEEQRLEN-EKFQQEKEKTLKTLEETKKKLVESEQAGANNLD-ELKKALAELEKEA 93
            V +  E+ LE  EK   E E         K  L+E +          L     E + EA
Sbjct: 515 EVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLAHMGLPVPAEEAKVEA 574

Query: 94  ANIRKKEEDLKKKEKI 109
            +  K    LKKK+ I
Sbjct: 575 VDEVKF---LKKKKGI 587


>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
           eukaryotic snRNP [Transcription].
          Length = 564

 Score = 31.6 bits (72), Expect = 0.58
 Identities = 22/124 (17%), Positives = 52/124 (41%), Gaps = 11/124 (8%)

Query: 50  FQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDL------ 103
            + E++K  +   E +KKL +  +   N L++ +  L ELEK A  +R+K E L      
Sbjct: 273 EKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQL 332

Query: 104 --KKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKLIKQY 161
             +  + +   +     +    K  ++K     +  +   +K +++K      ++ + + 
Sbjct: 333 IEEGLKSVRLADFYGNEE---IKIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSEL 389

Query: 162 ANYL 165
              +
Sbjct: 390 KEAI 393


>gnl|CDD|145900 pfam02994, Transposase_22, L1 transposable element. 
          Length = 370

 Score = 31.5 bits (71), Expect = 0.62
 Identities = 30/122 (24%), Positives = 51/122 (41%), Gaps = 9/122 (7%)

Query: 42  EQRLENEKFQQEKEKTLKTLEETKKKLVES----EQAGANNLDELKKALAELEKEAANIR 97
             R + E   +E +  LK LEE   K VE+     +     L ELKK   EL++E  +I+
Sbjct: 59  SLREDIETKGKEVQNFLKELEECITKQVEAHIENTEKCLKELMELKKKARELKEECRSIK 118

Query: 98  KKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKL 157
           K+  +     +        +       +  N+   +++     E+  +E+K   K NEK 
Sbjct: 119 KEHMETTLDIENQKKRQGVVD-----TSFTNRIQEMEERISGAEDSIEEIKRTGKFNEKR 173

Query: 158 IK 159
            K
Sbjct: 174 KK 175


>gnl|CDD|182767 PRK10836, PRK10836, lysine transporter; Provisional.
          Length = 489

 Score = 31.7 bits (72), Expect = 0.64
 Identities = 11/15 (73%), Positives = 12/15 (80%)

Query: 324 AAGESEDPQENTPDA 338
           AAGESEDP +N P A
Sbjct: 227 AAGESEDPAKNIPRA 241


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 31.5 bits (72), Expect = 0.67
 Identities = 21/51 (41%), Positives = 27/51 (52%), Gaps = 2/51 (3%)

Query: 42  EQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGAN-NLDELKKALAELEK 91
           EQ+L+NE   Q  E   +  +E K+K VE E+ G    L EL   L  LEK
Sbjct: 255 EQKLKNELALQAIELQREFNKEIKEK-VEEERNGRLAKLAELNSRLKGLEK 304


>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
          Length = 470

 Score = 31.2 bits (71), Expect = 0.72
 Identities = 24/104 (23%), Positives = 37/104 (35%), Gaps = 4/104 (3%)

Query: 46  ENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKK 105
           ENE  QQ+++   K  ++  KK  +        L EL     E E +  N +KK++  KK
Sbjct: 7   ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKK 66

Query: 106 KEKITPWNVDTISKPGFTKT----VVNKKPPLQQGEKSDEEKEQ 145
           K+K      D         +      N            E K+ 
Sbjct: 67  KKKNLGEAYDLAYDLPVVWSSAAFQDNSHIRKLGNWPEQEWKQT 110


>gnl|CDD|222290 pfam13654, AAA_32, AAA domain.  This family includes a wide variety
           of AAA domains including some that have lost essential
           nucleotide binding residues in the P-loop.
          Length = 509

 Score = 31.3 bits (72), Expect = 0.77
 Identities = 24/79 (30%), Positives = 39/79 (49%), Gaps = 15/79 (18%)

Query: 235 KERIRNRAKEKLDEALKEVEEEERQK----RLGPGGLD--PV--------EVFESLPDVL 280
           KE I    +EK +EA +E+EEE ++K       PGG    P+        E FE+LP+  
Sbjct: 117 KEEIEEEFQEKREEAFEELEEEAKEKGFALVRTPGGFVFAPLKDGEPLTEEEFEALPEEE 176

Query: 281 KKCFELQDTAMLQEAIASL 299
           ++  E +    L+E +  +
Sbjct: 177 REELE-EKIDELEEELQEI 194



 Score = 28.2 bits (64), Expect = 7.6
 Identities = 20/66 (30%), Positives = 34/66 (51%), Gaps = 10/66 (15%)

Query: 41  EEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKE 100
           + + L  E+F+   E+  + LEE   +L E        L E+ + L ELE+EA   R+K 
Sbjct: 160 DGEPLTEEEFEALPEEEREELEEKIDELEEE-------LQEILRQLRELEREA---REKL 209

Query: 101 EDLKKK 106
            +L ++
Sbjct: 210 RELDRE 215


>gnl|CDD|184539 PRK14150, PRK14150, heat shock protein GrpE; Provisional.
          Length = 193

 Score = 30.7 bits (70), Expect = 0.81
 Identities = 24/81 (29%), Positives = 40/81 (49%), Gaps = 13/81 (16%)

Query: 41  EEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGAN-NLDELKKALAELEK-------- 91
           EEQ+   E+  +E E  ++  EE  +  +E E   A+  + EL+  LAE +         
Sbjct: 4   EEQKTPQEQVSEEIEAEVEQQEEADEAELEDELDEADARIAELEAQLAEAQAEERDSVLR 63

Query: 92  ---EAANIRKK-EEDLKKKEK 108
              E  NIR++ E+D++K  K
Sbjct: 64  ARAEVENIRRRAEQDVEKAHK 84


>gnl|CDD|236681 PRK10401, PRK10401, DNA-binding transcriptional regulator GalS;
           Provisional.
          Length = 346

 Score = 30.9 bits (70), Expect = 0.88
 Identities = 20/66 (30%), Positives = 29/66 (43%), Gaps = 17/66 (25%)

Query: 139 SDEEKEQEMKAFVKENEKLIKQYANYLVIWCINLAMDEKFDLMDHV-------------A 185
           S  E E+E  A     E LI+Q  N L++    L+ DE    MD +             A
Sbjct: 97  SYHEAEKERHAI----EVLIRQRCNALIVHSKALSDDELAQFMDQIPGMVLINRVVPGYA 152

Query: 186 HQCICM 191
           H+C+C+
Sbjct: 153 HRCVCL 158


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 31.1 bits (70), Expect = 0.93
 Identities = 22/83 (26%), Positives = 37/83 (44%), Gaps = 9/83 (10%)

Query: 33  HQARVERMEEQRLENEKFQQEKEKTLKTLEET---------KKKLVESEQAGANNLDELK 83
            Q   ER++ Q  + +  + EK+  L+  ++          +KK  E+ +A A       
Sbjct: 104 KQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAKL 163

Query: 84  KALAELEKEAANIRKKEEDLKKK 106
           KA AE +K+A    K  E+ K K
Sbjct: 164 KAAAEAKKKAEEAAKAAEEAKAK 186


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 31.0 bits (71), Expect = 0.96
 Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 6/69 (8%)

Query: 37  VERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANI 96
           +ERM +   + E++  E +K  K   E K +  E   +   +L E    L E +K+   +
Sbjct: 506 IERMVK---DAEEYAAE-DKKRKERIEAKNEAEEYVYSLEKSLKEEGDKLPEADKKK--V 559

Query: 97  RKKEEDLKK 105
            +  E LK+
Sbjct: 560 EEAIEWLKE 568


>gnl|CDD|234776 PRK00478, scpA, segregation and condensation protein A/unknown
           domain fusion protein; Provisional.
          Length = 505

 Score = 31.1 bits (70), Expect = 0.98
 Identities = 16/52 (30%), Positives = 29/52 (55%), Gaps = 1/52 (1%)

Query: 55  EKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKK 106
              +   +E+ +++V+ +Q     L E  K + E E+  +N +K+EE LKKK
Sbjct: 415 PTKIVNSQESPQEMVQRQQEQEQALKESIKQIQE-ERAISNFQKREEYLKKK 465


>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family.  In a
           subset of endospore-forming members of the Firmcutes,
           members of this protein family are found, several to a
           genome. Two very strongly conserved sequences regions
           are separated by a highly variable linker region. Much
           of the linker region was excised from the seed alignment
           for this model. A characterized member is the
           prespore-specific transcription RsfA from Bacillus
           subtilis, previously called YwfN, which is controlled by
           sigma factor F and seems to fine-tune expression of some
           genes in the sigma-F regulon. A paralog in Bacillus
           subtilis is designated YlbO [Regulatory functions, DNA
           interactions, Cellular processes, Sporulation and
           germination].
          Length = 161

 Score = 30.1 bits (68), Expect = 1.0
 Identities = 24/92 (26%), Positives = 39/92 (42%), Gaps = 13/92 (14%)

Query: 29  FRWRHQARVERMEEQRLENEKFQQEKEKTLK----TLE------ETKKKLVESEQAGANN 78
           FRW   A V +  E+ +E  K +Q KE   +    TL+      +  K    S+QA    
Sbjct: 49  FRW--NAYVRKQYEEAIELAK-KQRKELKREAGSLTLQDVISFLQNLKTTNPSDQALQKE 105

Query: 79  LDELKKALAELEKEAANIRKKEEDLKKKEKIT 110
            + LK     L+K    + K+ E L+++    
Sbjct: 106 NERLKNQNESLQKRNEELEKELEKLRQRLSTI 137


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 30.6 bits (69), Expect = 1.1
 Identities = 21/69 (30%), Positives = 35/69 (50%), Gaps = 4/69 (5%)

Query: 38  ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
           E  EE + + E+ Q+EKE+ LK LEE + +     +     L  L+   + LE+    + 
Sbjct: 135 EDYEELKEKLEELQKEKEELLKELEELEAEY----EEVQERLKRLEVENSRLEEMLKKLP 190

Query: 98  KKEEDLKKK 106
            +  DLKK+
Sbjct: 191 GEVYDLKKR 199


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 30.6 bits (69), Expect = 1.1
 Identities = 13/55 (23%), Positives = 24/55 (43%)

Query: 38  ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKE 92
           ER+ ++  E EK   + +  L  L+E K++L   E       +   +   +LE  
Sbjct: 80  ERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLEDN 134


>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
          Length = 434

 Score = 30.5 bits (69), Expect = 1.2
 Identities = 11/73 (15%), Positives = 31/73 (42%)

Query: 35  ARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAA 94
           A  +  EE +   +    ++++ LK   +  +   +      +  D  ++A A+ ++E  
Sbjct: 348 ALTKLYEEVKSNTDLSGDKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKL 407

Query: 95  NIRKKEEDLKKKE 107
              + E+  K++ 
Sbjct: 408 KQEENEKKQKEQA 420


>gnl|CDD|153327 cd07643, I-BAR_IMD_MIM, Inverse (I)-BAR, also known as the
           IRSp53/MIM homology Domain (IMD), of Missing In
           Metastasis.  The IMD domain, also called
           Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a
           dimerization and lipid-binding module that bends
           membranes and induces membrane protrusions. Members of
           this subfamily include missing in metastasis (MIM) or
           metastasis suppressor 1 (MTSS1), metastasis suppressor
           1-like (MTSSL) or ABBA (Actin-Bundling protein with
           BAIAP2 homology), and similar proteins. They contain an
           N-terminal IMD and a WASP homology 2 (WH2) actin-binding
           motif at the C-terminus. MIM was originally identified
           as a missing transcript from metastatic bladder and
           prostate cancer cells. It is a scaffold protein that
           functions in a signaling pathway between the PDGF
           receptor, Src kinases, and actin assembly. It may also
           function as a cofactor of the Sonic hedgehog (Shh)
           transcriptional pathway and may participate in tumor
           development and progression via this pathway. ABBA
           regulates actin and plasma membrane dynamics to promote
           the extension of radial glia, which is important in
           neuronal migration, axon guidance and neurogenesis. The
           IMD domain of MIM binds and bundles actin filaments,
           binds membranes, and interacts with the small GTPase
           Rac.
          Length = 231

 Score = 30.1 bits (68), Expect = 1.3
 Identities = 34/135 (25%), Positives = 59/135 (43%), Gaps = 21/135 (15%)

Query: 79  LDELKKALAELEKEAANIRKK-EEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGE 137
           ++E KK   +L+K+ A   KK  +++KKK      + DTI      K     K  LQ   
Sbjct: 106 IEEWKKVANQLDKDHAKEYKKARQEIKKK------SSDTIR---LQKKARKGKGDLQPQL 156

Query: 138 KSDEEKEQEMKAFVKENEK------LIKQYANY-LVIWCINLAMDEKFDLMDHVAHQCIC 190
            S  +   +    ++E EK      LI++   +   +  +   +DE+  ++  V H    
Sbjct: 157 DSAMQDVNDKYLLLEETEKKAVRNALIEERGRFCTFVSFLKPVLDEEISMLGEVTH---- 212

Query: 191 MQYLLELSKSLNIDP 205
           +Q ++E   SL  DP
Sbjct: 213 LQTIMEDLASLTADP 227


>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
          Length = 413

 Score = 30.5 bits (69), Expect = 1.4
 Identities = 40/195 (20%), Positives = 67/195 (34%), Gaps = 28/195 (14%)

Query: 58  LKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTI 117
           L T  E + K+  +        D+  ++  + EK+    +KKE+   K E  T       
Sbjct: 39  LSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKK--KKKEKKEPKSEGETK---LGF 93

Query: 118 SKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKL-----IKQYANYLVIWCINL 172
             P  +K    K P  +  E  D    +  +   K N  +          +   I   N+
Sbjct: 94  KTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNVYIGAHVSASGGVDNSPINAYNI 153

Query: 173 AMDEKFDLM-------DHVAHQCICMQYLLELSKSLNIDPRACI--GSFFSKIQIAEPDY 223
           A  + F L        +        +    E  K  N DP+  +  GS+   I +A PD 
Sbjct: 154 A-GQAFALFLKNQRTWNSPPLSDETIDKFKENCKKYNYDPKFILPHGSYL--INLANPDK 210

Query: 224 ------KASFDDELK 232
                   +F D+L+
Sbjct: 211 EKREKSYDAFLDDLQ 225


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 30.7 bits (70), Expect = 1.4
 Identities = 16/61 (26%), Positives = 32/61 (52%)

Query: 48  EKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKE 107
           +K ++  EK  K  EE KK+  +   AG    +E ++   + E+E     ++ E+ K++E
Sbjct: 410 KKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEE 469

Query: 108 K 108
           +
Sbjct: 470 E 470


>gnl|CDD|147523 pfam05384, DegS, Sensor protein DegS.  This is small family of
           Bacillus DegS proteins. The DegS-DegU two-component
           regulatory system of Bacillus subtilis controls various
           processes that characterize the transition from the
           exponential to the stationary growth phase, including
           the induction of extracellular degradative enzymes,
           expression of late competence genes and down-regulation
           of the sigma D regulon. The family also contains one
           sequence from Thermoanaerobacter tengcongensis which is
           described as a sensory transduction histidine kinase.
          Length = 159

 Score = 29.5 bits (67), Expect = 1.4
 Identities = 16/55 (29%), Positives = 36/55 (65%)

Query: 55  EKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKI 109
           EKT+ T+EE+K+++ E  +      + L++ L E+++E + + K+ + L+K+E+ 
Sbjct: 5   EKTIDTVEESKEEIFEIAEQARQEYERLEQELEEVKEEVSEVIKEVDKLEKQERR 59


>gnl|CDD|216654 pfam01702, TGT, Queuine tRNA-ribosyltransferase.  This is a family
           of queuine tRNA-ribosyltransferases EC:2.4.2.29, also
           known as tRNA-guanine transglycosylase and guanine
           insertion enzyme. Queuine tRNA-ribosyltransferase
           modifies tRNAs for asparagine, aspartic acid, histidine
           and tyrosine with queuine. It catalyzes the exchange of
           guanine-34 at the wobble position with
           7-aminomethyl-7-deazaguanine, and the addition of a
           cyclopentenediol moiety to
           7-aminomethyl-7-deazaguanine-34 tRNA; giving a
           hypermodified base queuine in the wobble position. The
           aligned region contains a zinc binding motif
           C-x-C-x2-C-x29-H, and important tRNA and
           7-aminomethyl-7deazaguanine binding residues.
          Length = 237

 Score = 30.0 bits (68), Expect = 1.4
 Identities = 25/108 (23%), Positives = 36/108 (33%), Gaps = 18/108 (16%)

Query: 216 IQIAE---PDYKASFDD--ELKSFKERIR---NRAKEKLDEALKEVEEEERQKRLGP--G 265
           I+I      D     D+     + +ER      R     D  L+  + +  Q   G   G
Sbjct: 4   IEIQNALGSDIIMPLDECTPYPAPRERAEKSVERTLRWADRCLEAHKRKG-QALFGIVQG 62

Query: 266 GLDPVEVFESLPDVLKKCFELQDTAMLQEAIASLSPEEATYHMKRCVD 313
           GL      ES  ++ +  F+         AI  LS  E    M   VD
Sbjct: 63  GLYEDLREESAEELAEDGFDGY-------AIGGLSVGEEKEEMLELVD 103


>gnl|CDD|223520 COG0443, DnaK, Molecular chaperone [Posttranslational modification,
           protein turnover, chaperones].
          Length = 579

 Score = 30.4 bits (69), Expect = 1.5
 Identities = 20/100 (20%), Positives = 43/100 (43%), Gaps = 12/100 (12%)

Query: 9   DIEISDDEDETHPNIDTPSLFRWRHQARVE------RMEEQRLENEKFQQEKEKTLKTLE 62
           D EI    ++   N      FR   +AR E       +E+   E  K  +E+++ ++   
Sbjct: 485 DEEIERMVEDAEANAALDKKFRELVEARNEAESLIYSLEKALKEIVKVSEEEKEKIEEAI 544

Query: 63  ETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEED 102
              ++ +E E+      +E+K  + EL++    + +K+  
Sbjct: 545 TDLEEALEGEK------EEIKAKIEELQEVTQKLAEKKYQ 578


>gnl|CDD|232961 TIGR00408, proS_fam_I, prolyl-tRNA synthetase, family I.
           Prolyl-tRNA synthetase is a class II tRNA synthetase and
           is recognized by pfam model tRNA-synt_2b, which
           recognizes tRNA synthetases for Gly, His, Ser, and Pro.
           The prolyl-tRNA synthetases are divided into two widely
           divergent families. This family includes the archaeal
           enzyme, the Pro-specific domain of a human
           multifunctional tRNA ligase, and the enzyme from the
           spirochete Borrelia burgdorferi. The other family
           includes enzymes from Escherichia coli, Bacillus
           subtilis, Synechocystis PCC6803, and one of the two
           prolyL-tRNA synthetases of Saccharomyces cerevisiae
           [Protein synthesis, tRNA aminoacylation].
          Length = 472

 Score = 30.5 bits (69), Expect = 1.5
 Identities = 13/47 (27%), Positives = 25/47 (53%)

Query: 215 KIQIAEPDYKASFDDELKSFKERIRNRAKEKLDEALKEVEEEERQKR 261
           K Q++    +    + L + +E +RNRA E+ ++ +  VE  E  K+
Sbjct: 365 KYQVSLDQLEERVVELLNNIQENLRNRAWERFEQKIVIVETLEEIKQ 411


>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
           (TBPIP).  This family consists of several eukaryotic
           TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
           been demonstrated to interact with the human
           immunodeficiency virus type 1 (HIV-1) viral protein Tat,
           then modulate the essential replication process of HIV.
           In addition, TBP-1 has been shown to be a component of
           the 26S proteasome, a basic multiprotein complex that
           degrades ubiquitinated proteins in an ATP-dependent
           fashion. Human TBPIP interacts with human TBP-1 then
           modulates the inhibitory action of human TBP-1 on
           HIV-Tat-mediated transactivation.
          Length = 169

 Score = 29.6 bits (67), Expect = 1.5
 Identities = 20/65 (30%), Positives = 30/65 (46%), Gaps = 7/65 (10%)

Query: 40  MEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKK 99
           ME + L  E   Q  ++   TLE   K L           +EL++ + EL+KE   I +K
Sbjct: 79  MEIEELREEV--QLLKQDCSTLEIELKSLTSDLT-----TEELQEEIQELKKEVREIEEK 131

Query: 100 EEDLK 104
            E L+
Sbjct: 132 LESLE 136


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 30.4 bits (69), Expect = 1.6
 Identities = 27/150 (18%), Positives = 59/150 (39%), Gaps = 13/150 (8%)

Query: 8   KDIEISDDEDETHPNIDTPSLFRWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKK 67
           + I I  +      N    +        ++   EE+RL+  +   ++E+ L   EE +  
Sbjct: 782 EKILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALR--IQREEMLMRPEELELI 839

Query: 68  LVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVV 127
             E +        E+K  L+E+++      + +   + K +     V+  + P     ++
Sbjct: 840 NEEQKNL----KQEIKLELSEIQE---AEEEIQNINENKNE----FVEFKNDPKKLNKLI 888

Query: 128 NKKPPLQQGEKSDEEKEQEMKAFVKENEKL 157
             K  L +   S +E +Q+ K   K+ + L
Sbjct: 889 IAKDVLIKLVISSDEIKQDEKTTKKKKKDL 918


>gnl|CDD|149815 pfam08870, DUF1832, Domain of unknown function (DUF1832).  This
           family of proteins are functionally uncharacterized.
          Length = 116

 Score = 28.8 bits (65), Expect = 1.6
 Identities = 15/69 (21%), Positives = 28/69 (40%), Gaps = 5/69 (7%)

Query: 103 LKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKLIKQYA 162
           LK++  +TPWN+  + +  F  ++ +   P     K D   E +   F+ E   +   Y 
Sbjct: 15  LKRRTGLTPWNI--LCRWAFCLSLESGSIPDGAEIKLDSGLEIDKYTFLGEYGDI---YI 69

Query: 163 NYLVIWCIN 171
             L    + 
Sbjct: 70  ALLKQRYVQ 78


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 30.5 bits (69), Expect = 1.7
 Identities = 19/73 (26%), Positives = 39/73 (53%)

Query: 36  RVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAAN 95
           R+E ++ + LE E  + E  K  +      ++L+E  +     L+EL++ + ELE+E   
Sbjct: 250 RLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEG 309

Query: 96  IRKKEEDLKKKEK 108
           +R   E+L++  +
Sbjct: 310 LRALLEELEELLE 322



 Score = 29.3 bits (66), Expect = 3.3
 Identities = 19/76 (25%), Positives = 40/76 (52%), Gaps = 2/76 (2%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
           + ++  + E   +  +  +E+ K LK LEE +++  E E      ++ L++ LAELE+E 
Sbjct: 191 EGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELE--QEIEALEERLAELEEEK 248

Query: 94  ANIRKKEEDLKKKEKI 109
             + + +  L + E +
Sbjct: 249 ERLEELKARLLEIESL 264



 Score = 29.3 bits (66), Expect = 3.5
 Identities = 21/75 (28%), Positives = 38/75 (50%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
           +   E +E++    E+   E E+  + LEE K +L+E E      L   ++ L ELE+  
Sbjct: 224 EQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLL 283

Query: 94  ANIRKKEEDLKKKEK 108
             + +K E L++ E+
Sbjct: 284 EELEEKIERLEELER 298



 Score = 28.2 bits (63), Expect = 7.2
 Identities = 22/72 (30%), Positives = 40/72 (55%)

Query: 38  ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
           ER+EE   E EK  +  ++  + ++E K++L E   A     +EL++   ELE+    + 
Sbjct: 368 ERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELE 427

Query: 98  KKEEDLKKKEKI 109
           + EE++KK E+ 
Sbjct: 428 ELEEEIKKLEEQ 439



 Score = 28.2 bits (63), Expect = 9.3
 Identities = 21/89 (23%), Positives = 39/89 (43%), Gaps = 7/89 (7%)

Query: 27  SLFRWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESE-QAGANNLDELKKA 85
                  +      EE   + EK +   E+  +  E+ + + ++ E +   + L ELK+ 
Sbjct: 515 EELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKEL 574

Query: 86  LAELEK------EAANIRKKEEDLKKKEK 108
           L EL        E   +R++ ++LKKK K
Sbjct: 575 LEELRLLRTRKEELEELRERLKELKKKLK 603


>gnl|CDD|131606 TIGR02555, OrgA_MxiK, type III secretion apparatus protein
           OrgA/MxiK.  This gene is found in type III secretion
           operons and has been shown to be essential for the
           invasion phenotype in Salmonella and a component of the
           secretion apparatus. The protein is known as OrgA in
           Salmonella due to its oxygen-dependent expression
           pattern in which low-oxygen levels up-regulate the gene.
           In Shigella the ghene is called MxiK and has been shown
           to be sessential for the proper assembly of the
           secretion needle complex.
          Length = 185

 Score = 29.8 bits (67), Expect = 1.7
 Identities = 18/75 (24%), Positives = 28/75 (37%), Gaps = 11/75 (14%)

Query: 162 ANYLVIWCINLAMDEKFDLMDHVAHQCICMQYLLELSKSLNIDPRAC--IGSFFSKIQIA 219
           AN  +I    L+ DE F L        +    LL         P+    IG    + ++A
Sbjct: 33  ANEWIINHYRLSCDEDFALRGSPTAMFVDNWNLL---------PQIAYLIGCQRLRARLA 83

Query: 220 EPDYKASFDDELKSF 234
           E     +  D ++SF
Sbjct: 84  ERGRFINLPDWVQSF 98


>gnl|CDD|224937 COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin
           stability system [Translation, ribosomal structure and
           biogenesis / Cell division and chromosome partitioning].
          Length = 90

 Score = 28.4 bits (64), Expect = 1.7
 Identities = 13/47 (27%), Positives = 20/47 (42%)

Query: 222 DYKASFDDELKSFKERIRNRAKEKLDEALKEVEEEERQKRLGPGGLD 268
           +       ELK   ++IR + K+KL E L+        K+L  G   
Sbjct: 6   EIHPKALKELKKLDKKIRKKIKDKLKELLENPPPIRHGKKLRKGLSG 52


>gnl|CDD|225968 COG3434, COG3434, Predicted signal transduction protein containing
           EAL and modified HD-GYP domains [Signal transduction
           mechanisms].
          Length = 407

 Score = 30.1 bits (68), Expect = 1.8
 Identities = 17/73 (23%), Positives = 28/73 (38%), Gaps = 1/73 (1%)

Query: 191 MQYLLELSKSLNIDPRACIGSFFSKIQIAEPDYKASFDDELKSFKERIRNRAKEKLDEAL 250
             YLL L         +          I + D+K    D+ + F   +    K+ L E +
Sbjct: 110 KGYLLALD-DFIFSNVSEWKPLLPLSDIVKIDFKRVTFDKARLFDRDLGYINKKFLAEKV 168

Query: 251 KEVEEEERQKRLG 263
           +  EE E+ K+ G
Sbjct: 169 ETEEEFEQAKKAG 181


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 30.0 bits (68), Expect = 2.0
 Identities = 13/66 (19%), Positives = 24/66 (36%), Gaps = 4/66 (6%)

Query: 47  NEKFQQEKEKTLKTLEETKKKLVESEQAGANNL----DELKKALAELEKEAANIRKKEED 102
            E   +  ++  K  E+ K    E E+A  + L    D+L   +  LE+    +  K   
Sbjct: 5   REALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAA 64

Query: 103 LKKKEK 108
             +   
Sbjct: 65  SGEGGG 70


>gnl|CDD|222499 pfam14022, DUF4238, Protein of unknown function (DUF4238).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria, archaea and
           eukaryotes. Proteins in this family are typically
           between 274 and 374 amino acids in length.
          Length = 268

 Score = 29.4 bits (66), Expect = 2.4
 Identities = 18/108 (16%), Positives = 48/108 (44%), Gaps = 9/108 (8%)

Query: 77  NNLDELKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTISK---------PGFTKTVV 127
            + DE++K L+++E  A+ I  K  D  +   +TP + + ++          P F + ++
Sbjct: 52  GDDDEIEKLLSKIEGRASPIIDKIIDGNRSALLTPEDKEKLALFLALQYLRTPAFRERIL 111

Query: 128 NKKPPLQQGEKSDEEKEQEMKAFVKENEKLIKQYANYLVIWCINLAMD 175
           +    + +     EE ++ +K  +K  E+  ++  +  ++  +     
Sbjct: 112 SFLEAIARQVDELEEADEAIKELLKGIEEDTEELPDLSLLLLLKSLRK 159


>gnl|CDD|204739 pfam11768, DUF3312, Protein of unknown function (DUF3312).  This is
           a eukaryotic family of uncharacterized proteins. This
           family shows similarity to WD40 repeat proteins.
          Length = 544

 Score = 29.7 bits (67), Expect = 2.5
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query: 230 ELKSFKERIRNRAKEKLDEALKEVEEEERQKRL 262
           E + +   ++N+  EKL + LKE+EE  +Q R 
Sbjct: 6   ESRGYPWTLKNKRPEKLRDNLKELEELLQQSRC 38


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 29.6 bits (67), Expect = 2.6
 Identities = 22/96 (22%), Positives = 42/96 (43%), Gaps = 8/96 (8%)

Query: 36  RVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK---- 91
            +E+  ++         EK K L+ L++  ++L  + +A  N   EL + +AE EK    
Sbjct: 208 ELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTNKKSELLEEIAEAEKIREE 267

Query: 92  ----EAANIRKKEEDLKKKEKITPWNVDTISKPGFT 123
                A  I K +  +   +K+T W   ++S    +
Sbjct: 268 CRGWSAKEISKLKAKVSLLQKLTGWKFLSLSGSTLS 303


>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
          Length = 900

 Score = 29.8 bits (68), Expect = 2.6
 Identities = 15/66 (22%), Positives = 35/66 (53%), Gaps = 4/66 (6%)

Query: 48  EKFQQEKEKTLKTLEETKKKLVESEQAGANN----LDELKKALAELEKEAANIRKKEEDL 103
           E+F +E ++  K +E+ +K+L E   +   +    +  +K  + E++ +   +RK   +L
Sbjct: 764 ERFFEEWKEQKKEIEKLRKELAELLASELLSKAEEVGGVKVVVEEVDADEDELRKIANEL 823

Query: 104 KKKEKI 109
            K+ K+
Sbjct: 824 AKEGKV 829


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 29.9 bits (68), Expect = 2.6
 Identities = 21/131 (16%), Positives = 41/131 (31%), Gaps = 9/131 (6%)

Query: 30  RWRHQARVERMEEQRLENE-KFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAE 88
           + R +AR  R+E ++   E + ++  E      ++     +   +A      +     A 
Sbjct: 452 KARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAG 511

Query: 89  LEKEAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVV--------NKKPPLQQGEKSD 140
              + + +    E  K + +         +     K  V         KK   Q      
Sbjct: 512 ARPDNSAVIAAREARKAQARARQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEA 571

Query: 141 EEKEQEMKAFV 151
           EE+    KA V
Sbjct: 572 EEEVDPKKAAV 582


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 29.7 bits (66), Expect = 2.7
 Identities = 17/52 (32%), Positives = 30/52 (57%), Gaps = 1/52 (1%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKA 85
           + R+ER+E +RLE E+ ++E+ +    LE  +   +E E+      D L+KA
Sbjct: 473 RERMERIERERLERERLERERLER-DRLERDRLDRLERERVDRLERDRLEKA 523


>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
          Length = 874

 Score = 29.7 bits (68), Expect = 3.0
 Identities = 24/84 (28%), Positives = 34/84 (40%), Gaps = 30/84 (35%)

Query: 82  LKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDE 141
           L+K LA+LEKE   + KK                 +S  GF    V K P     E  ++
Sbjct: 816 LEKELAKLEKEIERVEKK-----------------LSNEGF----VAKAPE----EVVEK 850

Query: 142 EKE-----QEMKAFVKENEKLIKQ 160
           E+E     +E  A +KE    +K 
Sbjct: 851 EREKLAEYEEKLAKLKERLARLKA 874



 Score = 29.7 bits (68), Expect = 3.0
 Identities = 20/73 (27%), Positives = 34/73 (46%), Gaps = 18/73 (24%)

Query: 41  EEQRLENE--KFQQEKEKTLKTLEETK------KKLVESEQAGANNLDELKKALAELEKE 92
           E  RLE E  K ++E E+  K L          +++VE E          ++ LAE E++
Sbjct: 812 ELARLEKELAKLEKEIERVEKKLSNEGFVAKAPEEVVEKE----------REKLAEYEEK 861

Query: 93  AANIRKKEEDLKK 105
            A ++++   LK 
Sbjct: 862 LAKLKERLARLKA 874


>gnl|CDD|239586 cd03509, DesA_FADS-like, Fatty acid desaturase protein family
           subgroup, a delta-12 acyl-lipid desaturase-like,
           DesA-like, yet uncharacterized subgroup of membrane
           fatty acid desaturase proteins found in alpha-, beta-,
           and gamma-proteobacteria. Sequences of this domain
           family appear to be structurally related to membrane
           fatty acid desaturases and alkane hydroxylases. They all
           share in common extensive hydrophobic regions that would
           be capable of spanning the membrane bilayer at least
           twice. Comparison of these sequences also reveals three
           regions of conserved histidine cluster motifs that
           contain eight histidine residues: HXXXH, HXXHH, and
           HXXHH. These histidine residues are reported to be
           catalytically essential and proposed to be the ligands
           for the iron atoms contained within homologs, stearoyl
           CoA desaturase and alkane hydroxylase.
          Length = 288

 Score = 29.3 bits (66), Expect = 3.0
 Identities = 10/30 (33%), Positives = 13/30 (43%), Gaps = 4/30 (13%)

Query: 15  DEDETHPNIDTPSLF----RWRHQARVERM 40
           DED T P  D  S +    +W    R +R 
Sbjct: 87  DEDLTDPGDDPESNYLSPEQWARLPRWQRA 116


>gnl|CDD|179338 PRK01816, PRK01816, hypothetical protein; Provisional.
          Length = 143

 Score = 28.4 bits (64), Expect = 3.2
 Identities = 19/59 (32%), Positives = 28/59 (47%), Gaps = 4/59 (6%)

Query: 240 NRAKEKLDEALKEVEEEERQKRLGPG-GLDPVE---VFESLPDVLKKCFELQDTAMLQE 294
            R+   L  AL     E R+K    G  L PVE    +++L D+LK+ F+  D   L +
Sbjct: 84  KRSVTPLPPALLNWFYEVREKLQEAGQALAPVEGKPTYQALADLLKRAFKQLDKTFLDD 142


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 28.0 bits (63), Expect = 3.2
 Identities = 18/75 (24%), Positives = 39/75 (52%)

Query: 33  HQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKE 92
             A   + EE     E  +Q +E+  K  EE ++ L++ ++    N  + ++A  + E+E
Sbjct: 13  QLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEE 72

Query: 93  AANIRKKEEDLKKKE 107
               ++KEE++K+ +
Sbjct: 73  KKLRKEKEEEIKELK 87


>gnl|CDD|224644 COG1730, GIM5, Predicted prefoldin, molecular chaperone implicated
           in de novo protein folding [Posttranslational
           modification, protein turnover, chaperones].
          Length = 145

 Score = 28.5 bits (64), Expect = 3.3
 Identities = 12/56 (21%), Positives = 34/56 (60%)

Query: 52  QEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKE 107
           +  ++ ++ L++  ++L ++ +     L EL + + +LE+EA  +++K+   +KK+
Sbjct: 90  KSADEAIEFLKKRIEELEKAIEKLQQALAELAQRIEQLEQEAQQLQQKQAAAQKKQ 145


>gnl|CDD|135648 PRK05898, dnaE, DNA polymerase III DnaE; Validated.
          Length = 971

 Score = 29.4 bits (66), Expect = 3.3
 Identities = 28/84 (33%), Positives = 39/84 (46%), Gaps = 8/84 (9%)

Query: 27  SLFRWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKAL 86
           +LFR   Q  ++   E+R + E+F    E T K LE T   +V  EQ     +  L K +
Sbjct: 561 ALFRPGPQQNIKTFVERRFKREEFSYWNEATKKILEPTHGIIVYQEQ-----VINLVKTI 615

Query: 87  AELE-KEAANIRKKEEDLKKKEKI 109
           A  +   A N R+     KK EKI
Sbjct: 616 ANFDIATADNFRRAIS--KKDEKI 637


>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724).  This
           family contains several uncharacterized proteins found
           in Arabidopsis thaliana and other plants. This region is
           often found associated with Agenet domains and may
           contain coiled-coil.
          Length = 190

 Score = 28.6 bits (64), Expect = 3.4
 Identities = 15/68 (22%), Positives = 31/68 (45%)

Query: 38  ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
           + +E++  E E  +QE +  +  LE    +L           +   K +A L+ EA+ I+
Sbjct: 113 KGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEIARLKSEASKIK 172

Query: 98  KKEEDLKK 105
           ++  D + 
Sbjct: 173 QELVDAEL 180


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 29.3 bits (66), Expect = 3.4
 Identities = 19/69 (27%), Positives = 32/69 (46%), Gaps = 4/69 (5%)

Query: 35  ARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAA 94
             VE++E +    E+  +  EK    +EE  +KL E      + LDE  + L +L +E  
Sbjct: 85  EEVEKLEAELKSLEEVIKPAEKFSSEVEELTRKLEERL----SELDEELEDLEDLLEELE 140

Query: 95  NIRKKEEDL 103
            +   + DL
Sbjct: 141 PLAYLDFDL 149


>gnl|CDD|235255 PRK04204, PRK04204, RNA 3'-terminal-phosphate cyclase; Provisional.
          Length = 343

 Score = 29.0 bits (66), Expect = 3.4
 Identities = 13/34 (38%), Positives = 16/34 (47%), Gaps = 4/34 (11%)

Query: 236 ERIRNRAKEKLDEALK----EVEEEERQKRLGPG 265
           ER    A E L  +L     E+  EE  + LGPG
Sbjct: 203 ERQAKAAAELLALSLGLIEIEINVEELSRGLGPG 236


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 28.7 bits (65), Expect = 3.5
 Identities = 19/78 (24%), Positives = 36/78 (46%), Gaps = 2/78 (2%)

Query: 33  HQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGA--NNLDELKKALAELE 90
                 R+E+ + E E+ +Q   +    +E+ KK   E+E+       L +L+K L +L+
Sbjct: 64  LNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLK 123

Query: 91  KEAANIRKKEEDLKKKEK 108
            E     K + +  +K K
Sbjct: 124 AELEKYEKNDPERIEKLK 141


>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
          Length = 455

 Score = 29.0 bits (65), Expect = 3.7
 Identities = 25/102 (24%), Positives = 48/102 (47%), Gaps = 19/102 (18%)

Query: 59  KTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTIS 118
           K +EE +K +V+          E ++   ELEKE   + K+EE L K+  +         
Sbjct: 368 KDVEEGEKTIVK----------EARQIEEELEKEVEKLGKEEESLFKRVAL--------- 408

Query: 119 KPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVKENEKLIKQ 160
           + G  +   +++  L++  K ++E  +++K    E EKL  +
Sbjct: 409 EEGLKELEQDEENFLKELSKEEKELLEKLKMEASEVEKLFGR 450


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 28.9 bits (65), Expect = 3.7
 Identities = 25/73 (34%), Positives = 39/73 (53%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
           + R+E+MEE     +K  +E E+T   LEE  K+  E  Q      DEL++    LE+EA
Sbjct: 11  EERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEEA 70

Query: 94  ANIRKKEEDLKKK 106
           A   ++ E L+ +
Sbjct: 71  AASEEERERLEAE 83


>gnl|CDD|226064 COG3534, AbfA, Alpha-L-arabinofuranosidase [Carbohydrate transport
           and metabolism].
          Length = 501

 Score = 29.3 bits (66), Expect = 3.8
 Identities = 7/18 (38%), Positives = 11/18 (61%)

Query: 307 HMKRCVDSGLWVPDAKKA 324
           H+ R V  G++ PD+  A
Sbjct: 28  HLGRAVYEGIYEPDSPIA 45


>gnl|CDD|215557 PLN03069, PLN03069, magnesiumprotoporphyrin-IX chelatase subunit H;
           Provisional.
          Length = 1220

 Score = 29.4 bits (66), Expect = 4.0
 Identities = 16/38 (42%), Positives = 23/38 (60%), Gaps = 6/38 (15%)

Query: 244 EKLDEALKEVEEEERQKRLGPGGLDPVEVFESLPDVLK 281
           E+L E +KE  E ER +      LD V VF S+P+V++
Sbjct: 91  EELAEKVKEAVEPERDR------LDAVLVFPSMPEVMR 122


>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
          Length = 865

 Score = 29.3 bits (67), Expect = 4.2
 Identities = 19/65 (29%), Positives = 28/65 (43%), Gaps = 6/65 (9%)

Query: 48  EKFQQEKEKTLKTLEETKKK----LVESEQAGANNLDELKKALAELEKEAANIRKKEEDL 103
            ++  E+E  LK L    K     L E  +A    L EL+K L +L+ + A       DL
Sbjct: 693 LEYLNEQEALLKELAALLKAKPSELPERVEALLEELKELEKELEQLKAKLAA--AAAGDL 750

Query: 104 KKKEK 108
             + K
Sbjct: 751 LAQAK 755


>gnl|CDD|223903 COG0833, LysP, Amino acid transporters [Amino acid transport and
           metabolism].
          Length = 541

 Score = 29.1 bits (66), Expect = 4.2
 Identities = 9/15 (60%), Positives = 13/15 (86%)

Query: 324 AAGESEDPQENTPDA 338
           AAGESE+P+++ P A
Sbjct: 256 AAGESENPRKSIPKA 270


>gnl|CDD|235185 PRK03980, PRK03980, flap endonuclease-1; Provisional.
          Length = 292

 Score = 28.6 bits (65), Expect = 4.3
 Identities = 15/40 (37%), Positives = 26/40 (65%), Gaps = 4/40 (10%)

Query: 221 PDYKASFDDELKSFKERIRNRAKEKLDEALKEVEEEERQK 260
           P+ KA   +E++  +E +R  A+EK +EA +E + EE +K
Sbjct: 37  PELKA---EEIEERRE-VREEAEEKYEEAKEEGDLEEARK 72


>gnl|CDD|214937 smart00957, SecA_DEAD, SecA DEAD-like domain.  SecA protein binds
           to the plasma membrane where it interacts with proOmpA
           to support translocation of proOmpA through the
           membrane. SecA protein achieves this translocation, in
           association with SecY protein, in an ATP dependent
           manner. This domain represents the N-terminal
           ATP-dependent helicase domain, which is related to the.
          Length = 380

 Score = 28.9 bits (66), Expect = 4.5
 Identities = 16/43 (37%), Positives = 24/43 (55%), Gaps = 6/43 (13%)

Query: 215 KIQIAEPDYKASFDDELKS----FKERIRNRAKEKLDEALKEV 253
           +I   EP+ +A  D+EL++    FKER+     E LD+ L E 
Sbjct: 22  QINALEPEMEALSDEELRAKTAEFKERLAE--GESLDDLLPEA 62


>gnl|CDD|215411 PLN02768, PLN02768, AMP deaminase.
          Length = 835

 Score = 29.1 bits (65), Expect = 4.5
 Identities = 10/30 (33%), Positives = 18/30 (60%), Gaps = 4/30 (13%)

Query: 268 DPVEVFESLPDVLKKCFELQDTAMLQEAIA 297
           D VE ++    VL++C EL+   + +E +A
Sbjct: 239 DEVEAYK----VLQECLELRKRYVFREEVA 264


>gnl|CDD|237613 PRK14109, PRK14109, bifunctional glutamine-synthetase
           adenylyltransferase/deadenyltransferase; Provisional.
          Length = 1007

 Score = 29.0 bits (66), Expect = 4.6
 Identities = 12/24 (50%), Positives = 15/24 (62%)

Query: 248 EALKEVEEEERQKRLGPGGLDPVE 271
           E L    E +R+ +LGPGGL  VE
Sbjct: 360 EDLIPAAERDRELKLGPGGLRDVE 383


>gnl|CDD|236327 PRK08661, PRK08661, prolyl-tRNA synthetase; Provisional.
          Length = 477

 Score = 28.9 bits (66), Expect = 4.7
 Identities = 10/40 (25%), Positives = 19/40 (47%)

Query: 215 KIQIAEPDYKASFDDELKSFKERIRNRAKEKLDEALKEVE 254
           K  +   +      + L+  +E +  +AKE L+E   EV+
Sbjct: 371 KETVPLDELVEKVPELLEEIQENLYEKAKEFLEENTVEVD 410


>gnl|CDD|218417 pfam05075, DUF684, Protein of unknown function (DUF684).  This
           family contains several uncharacterized proteins from
           Caenorhabditis elegans. The GO annotation suggests that
           the protein is involved in nematode larval development
           and has a positive regulation on growth rate.
          Length = 346

 Score = 28.6 bits (64), Expect = 4.8
 Identities = 29/95 (30%), Positives = 42/95 (44%), Gaps = 2/95 (2%)

Query: 58  LKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTI 117
           L+ L E KKKL +     ++  D+LK  + E E    N       L K  + T  + D  
Sbjct: 1   LEKLGELKKKLEKLSDKMSSEFDDLKAFIVEHEFYN-NEAATASTLMKYMQDTVKHPDKK 59

Query: 118 SKPGFTKTVVNKKPPLQQGEKSDEEKEQEMKAFVK 152
           +K  F + V  K  PL+  EK     EQE+   +K
Sbjct: 60  TKEIF-REVYQKHSPLELAEKMIILLEQEVTNPLK 93


>gnl|CDD|215314 PLN02576, PLN02576, protoporphyrinogen oxidase.
          Length = 496

 Score = 28.8 bits (65), Expect = 4.8
 Identities = 28/120 (23%), Positives = 44/120 (36%), Gaps = 30/120 (25%)

Query: 213 FSKIQIAEPDYKASFDDELKSFKERIRNRAKEKLDEALKEVEEEERQKRLGP--GGLDPV 270
           F K+   E    +     +K+ +E  +N   E  D  L +     + + +G   GGL   
Sbjct: 186 FPKLWNLEKRGGSIIGGAIKAIQEAKKNPKPEPRDPRLPKP----KGQTVGSFRGGL--- 238

Query: 271 EVFESLPDVLKKCFEL-QDTAMLQEAIASLSPEEATYHMKRCVDSGLWV-----PDAKKA 324
              ++LPD L K   L +D   L   + SLS            D G +      P+ K  
Sbjct: 239 ---QTLPDALAK--RLGKDKVKLNWKVLSLS----------KNDDGGYSLTYDTPEGKVN 283


>gnl|CDD|237259 PRK12904, PRK12904, preprotein translocase subunit SecA; Reviewed.
          Length = 830

 Score = 28.9 bits (66), Expect = 4.9
 Identities = 17/43 (39%), Positives = 23/43 (53%), Gaps = 6/43 (13%)

Query: 215 KIQIAEPDYKASFDDELKS----FKERIRNRAKEKLDEALKEV 253
           KI   EP+ +   D+ELK+    FKER+     E LD+ L E 
Sbjct: 26  KINALEPEMEKLSDEELKAKTAEFKERLAK--GETLDDLLPEA 66


>gnl|CDD|225708 COG3167, PilO, Tfp pilus assembly protein PilO [Cell motility and
          secretion / Intracellular trafficking and secretion].
          Length = 211

 Score = 28.2 bits (63), Expect = 5.1
 Identities = 13/41 (31%), Positives = 23/41 (56%)

Query: 51 QQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEK 91
          + E+ + L+  EE  K   + +   A NL+ L+  LAE+E+
Sbjct: 47 KLEELEELEAEEEELKSTYQQKAIQAANLEALRAQLAEMEE 87


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 28.9 bits (65), Expect = 5.1
 Identities = 18/71 (25%), Positives = 30/71 (42%)

Query: 35  ARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAA 94
            R+     +  E   F QE      +L E +  L    +   ++ +EL++ L  L K+  
Sbjct: 788 ERIAVRRPEVREYRAFMQETWLHRDSLREERPNLAIQLRELESSAEELQQELTRLIKDTK 847

Query: 95  NIRKKEEDLKK 105
             RKK E  +K
Sbjct: 848 LRRKKLEQERK 858


>gnl|CDD|227570 COG5245, DYN1, Dynein, heavy chain [Cytoskeleton].
          Length = 3164

 Score = 28.8 bits (64), Expect = 5.2
 Identities = 9/64 (14%), Positives = 18/64 (28%)

Query: 40   MEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKK 99
            +          +      L  L       +    +     DE+   L  L+K    I ++
Sbjct: 2620 LFVHEKALNALKACGSLFLWVLARYLLAKLMLSISNMEQTDEIAVLLHNLKKSRKEIEEE 2679

Query: 100  EEDL 103
            E + 
Sbjct: 2680 ESES 2683


>gnl|CDD|225281 COG2425, COG2425, Uncharacterized protein containing a von
           Willebrand factor type A (vWA) domain [General function
           prediction only].
          Length = 437

 Score = 28.5 bits (64), Expect = 5.2
 Identities = 18/72 (25%), Positives = 27/72 (37%), Gaps = 4/72 (5%)

Query: 38  ERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIR 97
           E +  ++ E      E E  L+ +EE     +    AG       K  L  L K A  + 
Sbjct: 133 EGLLSEKQEEISLSGEMEGILEEVEELISGRLWDMSAGELK----KGDLELLGKYADFLL 188

Query: 98  KKEEDLKKKEKI 109
           K  E  K  E++
Sbjct: 189 KFPELRKILEQL 200


>gnl|CDD|153338 cd07654, F-BAR_FCHSD, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of FCH and double SH3
           domains proteins (FCHSD).  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. This subfamily is composed of FCH
           and double SH3 domain (FCHSD) proteins, so named as they
           contain an N-terminal F-BAR domain and two SH3 domains
           at the C-terminus. Vertebrates harbor two subfamily
           members, FCHSD1 and FCHSD2, which have been
           characterized only in silico. Their biological function
           is still unknown. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
          Length = 264

 Score = 28.3 bits (63), Expect = 5.3
 Identities = 20/76 (26%), Positives = 31/76 (40%), Gaps = 1/76 (1%)

Query: 34  QARVERMEEQRLENEK-FQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKE 92
           Q R E       E  K  +  KE+ LK   E  ++     Q     L + +K   E E+ 
Sbjct: 87  QNRCEAYRRYISEPAKTGRSAKEQQLKKCTEQLQRAQAEVQQTVRELSKSRKTYFEREQV 146

Query: 93  AANIRKKEEDLKKKEK 108
           A   R+K  D++ +E 
Sbjct: 147 AHLAREKAADVQAREA 162


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 28.9 bits (65), Expect = 5.3
 Identities = 21/131 (16%), Positives = 49/131 (37%), Gaps = 4/131 (3%)

Query: 23   IDTPSLFRWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDEL 82
            ++      +     + ++ E   + ++  +E E  LK      ++  ++       L   
Sbjct: 942  LEEGPSIEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANS----ELKNF 997

Query: 83   KKALAELEKEAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEE 142
            KK LAEL K+   +++  + LK+                   T ++   PLQ+ +     
Sbjct: 998  KKELAELSKQYGALQESTKQLKELPVEVAELQSASKIISSESTELSILKPLQKLKGLLLL 1057

Query: 143  KEQEMKAFVKE 153
            +  +++A  K 
Sbjct: 1058 ENNQLQARYKA 1068


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 28.5 bits (64), Expect = 5.3
 Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 14/52 (26%)

Query: 57  TLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEK 108
           TL+TL                 + EL+K LA+L  E   ++ + E L+K+E+
Sbjct: 60  TLRTL--------------VAEVKELRKRLAKLISENEALKAENERLQKREQ 97


>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
          Length = 895

 Score = 28.7 bits (64), Expect = 5.4
 Identities = 14/45 (31%), Positives = 25/45 (55%)

Query: 66  KKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKIT 110
           K +++  +A  +N+D L++ L     E  NI+K+  D +K   IT
Sbjct: 172 KDVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSIT 216


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 28.8 bits (65), Expect = 5.4
 Identities = 15/66 (22%), Positives = 29/66 (43%), Gaps = 3/66 (4%)

Query: 30  RWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAEL 89
           + + QA  E  +++ +  E    E E+  + LE   ++L E     +    E K+   E+
Sbjct: 165 KAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAETSQ---ERKQKRKEI 221

Query: 90  EKEAAN 95
             +AA 
Sbjct: 222 TDQAAK 227


>gnl|CDD|150467 pfam09803, DUF2346, Uncharacterized conserved protein (DUF2346).
           Members of this family of proteins have no known
           function.
          Length = 80

 Score = 26.7 bits (59), Expect = 5.5
 Identities = 15/46 (32%), Positives = 28/46 (60%), Gaps = 4/46 (8%)

Query: 215 KIQIAEPDYKASFDDELKSFKERIRNRAKEKLDEALKEVEEEERQK 260
           K ++  P+ +     EL+ FKER+R + + +L   L+ +EE E++K
Sbjct: 39  KRELYPPENE-QHRKELEDFKERLREKHEARL---LRAMEEAEQKK 80


>gnl|CDD|216266 pfam01051, Rep_3, Initiator Replication protein.  This protein is
           an initiator of plasmid replication. RepB possesses
           nicking-closing (topoisomerase I) like activity. It is
           also able to perform a strand transfer reaction on ssDNA
           that contains its target. This family also includes RepA
           which is an E.coli protein involved in plasmid
           replication. The RepA protein binds to DNA repeats that
           flank the repA gene.
          Length = 218

 Score = 28.0 bits (63), Expect = 5.7
 Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 2/64 (3%)

Query: 49  KFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEK 108
           K   +     +++ +   +L      GA     LK+AL  L K+   I  +EE   +   
Sbjct: 34  KPDDKDFVEYESISDYALELFGVNSEGAY--KRLKEALERLMKKDFEIEDEEEKGGEIIN 91

Query: 109 ITPW 112
           I  W
Sbjct: 92  IVLW 95


>gnl|CDD|162284 TIGR01282, nifD, nitrogenase molybdenum-iron protein alpha chain.
           Nitrogenase consists of alpha (NifD) and beta (NifK)
           subunits of the molybdenum-iron protein and an
           ATP-binding iron-sulfur protein (NifH). This model
           describes a large clade of NifD proteins, but excludes a
           lineage that contains putative NifD and NifD homologs
           from species with vanadium-dependent nitrogenases
           [Central intermediary metabolism, Nitrogen fixation].
          Length = 466

 Score = 28.5 bits (64), Expect = 5.8
 Identities = 21/72 (29%), Positives = 30/72 (41%), Gaps = 7/72 (9%)

Query: 212 FFSKIQIAEP--DYKASFDDELKSFKERIRNRAKEKLDEALKEVEEEERQKR--LGPGGL 267
           FF   +IAE        FDDE+K   E +  + +  +D  + +       K   L  GGL
Sbjct: 286 FFGPTKIAESLRKIAEFFDDEIKEKAEEVIAKYQPAVDAVIAKYRPRLEGKTVMLYVGGL 345

Query: 268 DP---VEVFESL 276
            P   +  FE L
Sbjct: 346 RPRHVIGAFEDL 357


>gnl|CDD|165085 PHA02711, PHA02711, Toll/IL-receptor-like protein; Provisional.
          Length = 190

 Score = 28.0 bits (62), Expect = 6.0
 Identities = 17/43 (39%), Positives = 26/43 (60%), Gaps = 2/43 (4%)

Query: 39  RMEEQRLENE--KFQQEKEKTLKTLEETKKKLVESEQAGANNL 79
           R++  RL NE  KF  + E+   T+EE K+KL  + + GA+N 
Sbjct: 73  RIKYGRLFNEIKKFDNDAEEQFGTIEELKQKLRLNSEEGADNF 115


>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
           Provisional.
          Length = 693

 Score = 28.5 bits (65), Expect = 6.0
 Identities = 12/39 (30%), Positives = 21/39 (53%), Gaps = 3/39 (7%)

Query: 224 KASFDDELKSFKERIRNRAKEKLDEALKEVEEEERQKRL 262
               D+ELK     ++  A+ KL EA +  +++ER+  L
Sbjct: 234 PPEVDEELK---AAVKELAEAKLKEAYQITDKQEREAAL 269


>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
           binding.  This is the middle region of a family of TATA
           element modulatory factor 1 proteins conserved in
           eukaryotes that contains at its N-terminal section a
           number of leucine zippers that could potentially form
           coiled coil structures. The whole proteins bind to the
           TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells.
          Length = 74

 Score = 26.5 bits (59), Expect = 6.2
 Identities = 15/53 (28%), Positives = 25/53 (47%)

Query: 52  QEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLK 104
           ++  K       T KKL    +     + ELKK L +LEKE  N+ ++ +  +
Sbjct: 22  EKLSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELENLEERLKRAE 74


>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein.  This family of proteins
            is found in bacteria. Proteins in this family are
            typically between 397 and 1543 amino acids in length.
            This family is the SidE protein in the Dot/Icm pathway of
            Legionella pneumophila bacteria. There is little
            literature describing the family.
          Length = 1443

 Score = 28.7 bits (64), Expect = 6.4
 Identities = 20/56 (35%), Positives = 32/56 (57%), Gaps = 5/56 (8%)

Query: 54   KEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKI 109
            +E+    +E T K++ E E+A   NLD +KKA++ L     N  ++E  L + EKI
Sbjct: 1102 RERYDALIENTSKRITELEKAKLANLDVVKKAISNL-----NNLQQEVTLLRNEKI 1152


>gnl|CDD|234306 TIGR03674, fen_arch, flap structure-specific endonuclease.
           Endonuclease that cleaves the 5'-overhanging flap
           structure that is generated by displacement synthesis
           when DNA polymerase encounters the 5'-end of a
           downstream Okazaki fragment. Has 5'-endo-/exonuclease
           and 5'-pseudo-Y-endonuclease activities. Cleaves the
           junction between single and double-stranded regions of
           flap DNA.
          Length = 338

 Score = 28.4 bits (64), Expect = 6.5
 Identities = 15/40 (37%), Positives = 25/40 (62%), Gaps = 4/40 (10%)

Query: 221 PDYKASFDDELKSFKERIRNRAKEKLDEALKEVEEEERQK 260
           P+ K    + L+  +E IR  A+EK +EAL++ + EE +K
Sbjct: 84  PELK---AETLEERRE-IREEAEEKWEEALEKGDLEEARK 119


>gnl|CDD|234659 PRK00137, rplI, 50S ribosomal protein L9; Reviewed.
          Length = 147

 Score = 27.4 bits (62), Expect = 6.8
 Identities = 13/33 (39%), Positives = 18/33 (54%)

Query: 74  AGANNLDELKKALAELEKEAANIRKKEEDLKKK 106
           A   NL +L+   AELE +AA    + E L +K
Sbjct: 39  ATKGNLKQLEARRAELEAKAAEELAEAEALAEK 71


>gnl|CDD|177296 PHA00728, PHA00728, hypothetical protein.
          Length = 151

 Score = 27.6 bits (61), Expect = 6.9
 Identities = 22/48 (45%), Positives = 26/48 (54%), Gaps = 2/48 (4%)

Query: 67  KLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEKITPWNV 114
           KL E EQ    N +ELKK LAELE    N    EED + +E   P+ V
Sbjct: 3   KLTEVEQLKKEN-EELKKKLAELEALMNN-ESAEEDEELQEIENPYTV 48


>gnl|CDD|218813 pfam05929, Phage_GPO, Phage capsid scaffolding protein (GPO) serine
           peptidase.  This family consists of several
           bacteriophage capsid scaffolding proteins (GPO) and some
           related bacterial sequences. GPO is thought to function
           in both the assembly of proheads and the cleavage of
           GPN. The family is found to function as a serine
           peptidase, with a conserved Asp, His and Ser catalytic
           triad, as in subtilisin, and as represented in
           MEROPS:S73. The family includes GpO from Enterobacteria
           phage P2 which cleaves itself and then becomes the
           scaffold protein upon which the bacteriophage prohead is
           built - a mechanism quite common amongst phages.
          Length = 277

 Score = 27.9 bits (62), Expect = 7.0
 Identities = 15/42 (35%), Positives = 21/42 (50%)

Query: 48  EKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAEL 89
           E  Q + E T + L E +K + + EQA  +  DE  KA   L
Sbjct: 206 EHVQGQLEATEQRLSEAEKAVSDLEQALNSQGDERDKAFNAL 247


>gnl|CDD|235179 PRK03947, PRK03947, prefoldin subunit alpha; Reviewed.
          Length = 140

 Score = 27.2 bits (61), Expect = 7.1
 Identities = 11/50 (22%), Positives = 28/50 (56%)

Query: 59  KTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEK 108
           K L+E  + L + ++     L++L++AL +L    A + ++ + L+++  
Sbjct: 90  KDLDEAIEILDKRKEELEKALEKLEEALQKLASRIAQLAQELQQLQQEAA 139



 Score = 27.2 bits (61), Expect = 8.4
 Identities = 8/47 (17%), Positives = 27/47 (57%)

Query: 55  EKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEE 101
           ++ ++ L++ K++L ++ +     L +L   +A+L +E   ++++  
Sbjct: 93  DEAIEILDKRKEELEKALEKLEEALQKLASRIAQLAQELQQLQQEAA 139


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 28.4 bits (63), Expect = 7.3
 Identities = 16/76 (21%), Positives = 38/76 (50%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
           QA+      Q  E  + ++E    L  L+  ++++   ++   +  +E++ +  ELEKE 
Sbjct: 206 QAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEE 265

Query: 94  ANIRKKEEDLKKKEKI 109
             + +  ++ K++EK 
Sbjct: 266 EILAQVLKENKEEEKE 281


>gnl|CDD|237403 PRK13505, PRK13505, formate--tetrahydrofolate ligase; Provisional.
          Length = 557

 Score = 28.2 bits (64), Expect = 7.4
 Identities = 12/30 (40%), Positives = 16/30 (53%)

Query: 69  VESEQAGANNLDELKKALAELEKEAANIRK 98
           V  +     N++ LKK  A LE+   NIRK
Sbjct: 341 VAKDDLKEENVEALKKGFANLERHIENIRK 370


>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
           This family consists of the N-terminal region of the
           prokaryotic fibronectin-binding protein. Fibronectin
           binding is considered to be an important virulence
           factor in streptococcal infections. Fibronectin is a
           dimeric glycoprotein that is present in a soluble form
           in plasma and extracellular fluids; it is also present
           in a fibrillar form on cell surfaces. Both the soluble
           and cellular forms of fibronectin may be incorporated
           into the extracellular tissue matrix. While fibronectin
           has critical roles in eukaryotic cellular processes,
           such as adhesion, migration and differentiation, it is
           also a substrate for the attachment of bacteria. The
           binding of pathogenic Streptococcus pyogenes and
           Staphylococcus aureus to epithelial cells via
           fibronectin facilitates their internalisation and
           systemic spread within the host.
          Length = 447

 Score = 28.0 bits (63), Expect = 7.8
 Identities = 16/69 (23%), Positives = 29/69 (42%), Gaps = 14/69 (20%)

Query: 42  EQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEE 101
            ++ E ++ +Q++    K +E              N L++L+K L +LEKE       E 
Sbjct: 282 SKKAERDRVKQKRSDLEKRVE--------------NELEKLEKKLEKLEKELEEAENAEN 327

Query: 102 DLKKKEKIT 110
                E +T
Sbjct: 328 YRLYGELLT 336


>gnl|CDD|216529 pfam01491, Frataxin_Cyay, Frataxin-like domain.  This family
           contains proteins that have a domain related to the
           globular C-terminus of Frataxin the protein that is
           mutated in Friedreich's ataxia. This domain is found in
           a family of bacterial proteins. The function of this
           domain is currently unknown. It has been suggested that
           this family is involved in iron transport.
          Length = 107

 Score = 26.7 bits (60), Expect = 7.9
 Identities = 9/23 (39%), Positives = 15/23 (65%)

Query: 113 NVDTISKPGFTKTVVNKKPPLQQ 135
            V T++     + V+NK+PPL+Q
Sbjct: 39  GVLTLTFENGGQIVINKQPPLKQ 61


>gnl|CDD|216970 pfam02312, CBF_beta, Core binding factor beta subunit.  Core
           binding factor (CBF) is a heterodimeric transcription
           factor essential for genetic regulation of hematopoiesis
           and osteogenesis. The beta subunit enhances DNA-binding
           ability of the alpha subunit in vitro, and has been show
           to have a structure related to the OB fold.
          Length = 170

 Score = 27.4 bits (61), Expect = 8.0
 Identities = 12/44 (27%), Positives = 20/44 (45%)

Query: 29  FRWRHQARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESE 72
             W    R++ M     + E+ QQE     + LE+  ++L E E
Sbjct: 106 RGWMDLERLDGMGCLEYDEERAQQEDAILREQLEQYNQRLREFE 149


>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
           sigma54-dependent transcription [Transcription / Signal
           transduction mechanisms].
          Length = 225

 Score = 27.7 bits (62), Expect = 8.1
 Identities = 27/80 (33%), Positives = 45/80 (56%), Gaps = 5/80 (6%)

Query: 34  QARVERMEEQ-----RLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAE 88
           QAR E++EE+     +  NE   +E  +  ++LE+  K L    Q     +++LKK LA 
Sbjct: 65  QARAEKLEEKAELALQAGNEDLAREALEEKQSLEDLAKALEAELQQAEEQVEKLKKQLAA 124

Query: 89  LEKEAANIRKKEEDLKKKEK 108
           LE++ A +R K+E LK ++ 
Sbjct: 125 LEQKIAELRAKKEALKARKA 144


>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
           Members of this protein family are homologs of ClpB, an
           ATPase associated with chaperone-related functions.
           These ClpB homologs, designated ClpV1, are a key
           component of the bacterial pathogenicity-associated type
           VI secretion system [Protein fate, Protein and peptide
           secretion and trafficking, Cellular processes,
           Pathogenesis].
          Length = 852

 Score = 28.0 bits (63), Expect = 8.3
 Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 9/76 (11%)

Query: 33  HQARVERMEEQR--LENE------KFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKK 84
           H  R+  +  +   LE E      ++QQEKE  ++ +   + +L     A A++   L+ 
Sbjct: 448 HDERLAELRAELAALEAELAALEARWQQEKE-LVEAILALRAELEADADAPADDDAALRA 506

Query: 85  ALAELEKEAANIRKKE 100
            LAELE   A+ + +E
Sbjct: 507 QLAELEAALASAQGEE 522


>gnl|CDD|188994 cd06455, M3A_TOP, Peptidase M3 Thimet oligopeptidase (TOP) also
           includes neurolysin.  Peptidase M3 Thimet oligopeptidase
           (TOP; PZ-peptidase; endo-oligopeptidase A; endopeptidase
           24.15; soluble metallo-endopeptidase; EC 3.4.24.15)
           family also includes neurolysin (endopeptidase 24.16,
           microsomal endopeptidase, mitochondrial oligopeptidase
           M, neurotensin endopeptidase, soluble angiotensin
           II-binding protein, thimet oligopeptidase II) which
           hydrolyzes oligopeptides such as neurotensin, bradykinin
           and dynorphin A. TOP and neurolysin are neuropeptidases
           expressed abundantly in the testis, but also found in
           the liver, lung and kidney. They are involved in the
           metabolism of neuropeptides under 20 amino acid residues
           long and cleave most bioactive peptides at the same
           sites, but recognize different positions on some
           naturally occurring and synthetic peptides; they cleave
           at distinct sites on the 13-residue bioactive peptide
           neurotensin, which modulates central dopaminergic and
           cholinergic circuits.  TOP has been shown to degrade
           peptides released by the proteasome, limiting the extent
           of antigen presentation by major histocompatibility
           complex class I molecules, and has been associated with
           amyloid protein precursor processing.
          Length = 637

 Score = 27.9 bits (63), Expect = 8.4
 Identities = 15/52 (28%), Positives = 29/52 (55%), Gaps = 3/52 (5%)

Query: 67  KLVESEQAGANNLDELKKALAEL-EKEAANI--RKKEEDLKKKEKITPWNVD 115
           ++ +S +   + LD+L + L  L EKE A +   KK+E  +  ++I  W++ 
Sbjct: 248 RMAKSPETVRSFLDDLAEKLKPLAEKELAELLALKKKECPEAGDRIYAWDLR 299


>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
           [DNA replication, recombination, and repair].
          Length = 663

 Score = 27.9 bits (63), Expect = 8.5
 Identities = 21/91 (23%), Positives = 39/91 (42%), Gaps = 14/91 (15%)

Query: 80  DELKKALAELEKEAANIRKKEEDLKKKEKITPWNVDTISKP-----GFTKTVVNKKPPLQ 134
           D ++KA+ E E+     R+ +    ++  ITP    TI K               K  ++
Sbjct: 559 DSMQKAIDETERR----REIQMAYNEEHGITP---QTIKKKIRDILDGEYEEDEYKAKIE 611

Query: 135 QGEKSDEEKEQEMKAFVKENEKLIKQYANYL 165
           +      +KE  ++  +K+ EK +K+ A  L
Sbjct: 612 KKASKMSKKE--LEKLIKKLEKEMKEAAKNL 640


>gnl|CDD|220719 pfam10368, YkyA, Putative cell-wall binding lipoprotein.  YkyA is a
           family of proteins containing a lipoprotein signal and a
           hydrolase domain. It is similar to cell wall binding
           proteins and might also be recognisable by a host immune
           defence system. It is thus likely to belong to pathways
           important for pathogenicity.
          Length = 205

 Score = 27.7 bits (62), Expect = 8.6
 Identities = 19/68 (27%), Positives = 39/68 (57%), Gaps = 2/68 (2%)

Query: 42  EQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEE 101
           E+  E EK  +E++K L+ LE+ +++L +  Q      D+ +      ++  AN+ K+E+
Sbjct: 26  EKAAEQEKPFEEEQKKLEELEKKEQELYD--QIIQLGKDDNEAIKKLSDQALANVDKREK 83

Query: 102 DLKKKEKI 109
            LKK+++ 
Sbjct: 84  LLKKEKES 91


>gnl|CDD|225177 COG2268, COG2268, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 548

 Score = 27.9 bits (62), Expect = 8.8
 Identities = 17/136 (12%), Positives = 39/136 (28%), Gaps = 15/136 (11%)

Query: 30  RWRHQARVERMEEQRLEN-----EKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKK 84
               +   E+  E  +       +  + E E+        + + V+   A         K
Sbjct: 223 AEIAENEAEKETEIAIAEANRDAKLVELEVEQQPAGKTAEQTREVKIILAETEAEVAAWK 282

Query: 85  ALAELEKEAANIRKKEEDLKKKEKITPWNVDTISKPGFTKTVVNKKPPLQQGEKSDEEKE 144
           A    E E A I  ++   ++K +               +     +    +    + +KE
Sbjct: 283 AETRREAEQAEILAEQAIQEEKAQ----------AEQEVQHAKALEAREMRVGLIERQKE 332

Query: 145 QEMKAFVKENEKLIKQ 160
            E++   +       Q
Sbjct: 333 TELEPQERSYFINAAQ 348


>gnl|CDD|227608 COG5283, COG5283, Phage-related tail protein [Function unknown].
          Length = 1213

 Score = 28.0 bits (62), Expect = 8.8
 Identities = 18/73 (24%), Positives = 31/73 (42%), Gaps = 7/73 (9%)

Query: 54  KEKTLKTL-EETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEE------DLKKK 106
            E +  T+ +   KKL E+ +  A+ + EL ++LA+   + AN   +EE           
Sbjct: 500 VEDSAGTMADAFGKKLGEALKKLADAIAELFQSLADNRGKIANGSMEEELLTLYKATAGI 559

Query: 107 EKITPWNVDTISK 119
            +I    V   S 
Sbjct: 560 WQIYKSGVTEFSD 572


>gnl|CDD|215457 PLN02852, PLN02852, ferredoxin-NADP+ reductase.
          Length = 491

 Score = 28.1 bits (63), Expect = 9.0
 Identities = 11/34 (32%), Positives = 17/34 (50%)

Query: 216 IQIAEPDYKASFDDELKSFKERIRNRAKEKLDEA 249
           ++I E D   S +DE +    R + R  E L +A
Sbjct: 242 VRIKEADLTLSPEDEEELKASRPKRRVYELLSKA 275


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 28.1 bits (63), Expect = 9.2
 Identities = 19/71 (26%), Positives = 40/71 (56%), Gaps = 4/71 (5%)

Query: 34  QARVERMEEQRLENEKFQQEKEKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEA 93
              +ER EEQR +  + + E ++ L+  EE +++L   E      +++L++ +AE E+E 
Sbjct: 219 DEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAE----IEDLRETIAETERER 274

Query: 94  ANIRKKEEDLK 104
             + ++  DL+
Sbjct: 275 EELAEEVRDLR 285


>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase.  The model describes
           alanine--tRNA ligase. This enzyme catalyzes the reaction
           (tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
           pyrophosphate + AMP) [Protein synthesis, tRNA
           aminoacylation].
          Length = 851

 Score = 28.1 bits (63), Expect = 9.5
 Identities = 14/53 (26%), Positives = 23/53 (43%)

Query: 55  EKTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKE 107
              ++ L E + KL E         +EL K +    +E   ++K+ E LK K 
Sbjct: 697 NAAIEYLNEQEDKLKELADILKVTPNELPKKIERFFEEWKALQKELESLKLKI 749


>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
           protein, PEP-CTERM locus subfamily.  Members of this
           protein family belong to the family of polysaccharide
           chain length determinant proteins (pfam02706). All are
           found in species that encode the PEP-CTERM/exosortase
           system predicted to act in protein sorting in a number
           of Gram-negative bacteria, and are found near the epsH
           homolog that is the putative exosortase gene [Cell
           envelope, Biosynthesis and degradation of surface
           polysaccharides and lipopolysaccharides].
          Length = 498

 Score = 27.7 bits (62), Expect = 9.8
 Identities = 12/62 (19%), Positives = 27/62 (43%), Gaps = 1/62 (1%)

Query: 43  QRLENEKFQQEKE-KTLKTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEE 101
           Q+L+ E  + E E  +L+         +E  ++    + E++  L +L ++    +   E
Sbjct: 313 QQLQIELAEAEAEIASLEARVAELTARIERLESLLRTIPEVEAELTQLNRDYEVNKSNYE 372

Query: 102 DL 103
            L
Sbjct: 373 QL 374


>gnl|CDD|222698 pfam14346, DUF4398, Domain of unknown function (DUF4398).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and archaea.
           Proteins in this family are typically between 127 and
           269 amino acids in length.
          Length = 105

 Score = 26.6 bits (59), Expect = 9.9
 Identities = 13/50 (26%), Positives = 22/50 (44%)

Query: 59  KTLEETKKKLVESEQAGANNLDELKKALAELEKEAANIRKKEEDLKKKEK 108
           + L + K  L E +   A  L E  +A A L +  A   K +  L++ + 
Sbjct: 49  EKLAQAKAALDEGKYEEARRLAEQAEADARLAEAKARAAKAQAALEEAKD 98


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.129    0.364 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 18,094,980
Number of extensions: 1806424
Number of successful extensions: 5880
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5014
Number of HSP's successfully gapped: 908
Length of query: 347
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 249
Effective length of database: 6,590,910
Effective search space: 1641136590
Effective search space used: 1641136590
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 59 (26.6 bits)