BLASTP 2.2.22 [Sep-27-2009]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.


Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= gi|254780176|ref|YP_003064589.1| hypothetical protein
CLIBASIA_00295 [Candidatus Liberibacter asiaticus str. psy62]
         (119 letters)

Database: nr 
           13,984,884 sequences; 4,792,584,752 total letters

Searching..................................................done


Results from round 1


>gi|254780176|ref|YP_003064589.1| hypothetical protein CLIBASIA_00295 [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|254039853|gb|ACT56649.1| hypothetical protein CLIBASIA_00295 [Candidatus Liberibacter
           asiaticus str. psy62]
          Length = 119

 Score =  224 bits (571), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 119/119 (100%), Positives = 119/119 (100%)

Query: 1   MAELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAI 60
           MAELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAI
Sbjct: 1   MAELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAI 60

Query: 61  EKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEAGKTARS 119
           EKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEAGKTARS
Sbjct: 61  EKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEAGKTARS 119


>gi|251795508|ref|YP_003010239.1| hypothetical protein Pjdr2_1478 [Paenibacillus sp. JDR-2]
 gi|247543134|gb|ACT00153.1| hypothetical protein Pjdr2_1478 [Paenibacillus sp. JDR-2]
          Length = 736

 Score = 35.4 bits (80), Expect = 2.5,   Method: Composition-based stats.
 Identities = 25/78 (32%), Positives = 42/78 (53%), Gaps = 2/78 (2%)

Query: 23  SGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADAIKTMEDRKAKVGVAR 82
           +G    + ++RS AK   D  R  E KY+ +  +V+   +  A A+K  +DRK+K+  A 
Sbjct: 590 NGGSVHVGMSRSQAKALVDWKRKIERKYRTKIPVVLPHSEISATALK-YKDRKSKLTDAE 648

Query: 83  QERKANESKDAATIITTA 100
           Q  K NE+++   +I  A
Sbjct: 649 QTYK-NEAQERLNVIMQA 665


>gi|227833721|ref|YP_002835428.1| trigger factor [Corynebacterium aurimucosum ATCC 700975]
 gi|262184727|ref|ZP_06044148.1| trigger factor [Corynebacterium aurimucosum ATCC 700975]
 gi|254788997|sp|C3PI36|TIG_CORA7 RecName: Full=Trigger factor; Short=TF
 gi|227454737|gb|ACP33490.1| trigger factor [Corynebacterium aurimucosum ATCC 700975]
          Length = 451

 Score = 34.3 bits (77), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 46/101 (45%), Gaps = 7/101 (6%)

Query: 25  NQAEIDIARSNAKDKTDLVRIAEAKYKYR-SDI------VMAIEKSKADAIKTMEDRKAK 77
            Q EIDIA+    D  +     + + ++   D       V A+E S+ D  K +ED  ++
Sbjct: 89  GQPEIDIAKLEDNDFVEFTAEVDIRPEFEVPDFSKISVKVPALETSEEDVDKALEDLASR 148

Query: 78  VGVARQERKANESKDAATIITTATVENTKVVETKEAGKTAR 118
            G  +  ++  ++ D A I  T  V+ TK+ E    G T R
Sbjct: 149 FGELKDTKRKMKTGDYAIIDITTEVDGTKLDEASHEGMTYR 189


Searching..................................................done


Results from round 2





CONVERGED!
>gi|254780176|ref|YP_003064589.1| hypothetical protein CLIBASIA_00295 [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|254039853|gb|ACT56649.1| hypothetical protein CLIBASIA_00295 [Candidatus Liberibacter
           asiaticus str. psy62]
          Length = 119

 Score =  125 bits (314), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 119/119 (100%), Positives = 119/119 (100%)

Query: 1   MAELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAI 60
           MAELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAI
Sbjct: 1   MAELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAI 60

Query: 61  EKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEAGKTARS 119
           EKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEAGKTARS
Sbjct: 61  EKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEAGKTARS 119


>gi|82541274|ref|XP_724889.1| dihydrolipoamide S-acetyltransferase [Plasmodium yoelii yoelii str.
           17XNL]
 gi|23479697|gb|EAA16454.1| putative dihydrolipoamide S-acetyltransferase [Plasmodium yoelii
           yoelii]
          Length = 561

 Score = 40.6 bits (93), Expect = 0.061,   Method: Composition-based stats.
 Identities = 34/124 (27%), Positives = 58/124 (46%), Gaps = 19/124 (15%)

Query: 1   MAELETAQAE-ADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIA-----EAKYKYRS 54
           M+++ET   E AD +  + +    G   E  I   + + K + VRIA     E ++  +S
Sbjct: 147 MSDVETTSVETADVETTDVE----GESVEKGIYSPSVQSKKNKVRIAKWLCKENEFVNKS 202

Query: 55  DIVMAIEKSKADAIKTMEDRKAKVGV-----ARQERKANESKDAATIITTATVENTKVVE 109
           D++  IE  K+    T+E      G+      ++   A+  K  ATI+ T  +ENT +  
Sbjct: 203 DVIFHIEDDKS----TIEVDSPYTGIIKTILVKEGELADLEKQVATILETNELENTSMNL 258

Query: 110 TKEA 113
           + EA
Sbjct: 259 SSEA 262


>gi|284052568|ref|ZP_06382778.1| SPFH domain-containing protein [Arthrospira platensis str. Paraca]
          Length = 523

 Score = 40.3 bits (92), Expect = 0.10,   Method: Composition-based stats.
 Identities = 30/106 (28%), Positives = 57/106 (53%), Gaps = 17/106 (16%)

Query: 3   ELETAQAEADAKIAEAKAK------DSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDI 56
           E+E ++AEA+ ++A+A  K      +S ++   ++AR+ A+      RI + + + ++D+
Sbjct: 236 EVEVSKAEAERRVADAMTKRAAVVAESESETAAEVARTQAEVSVQKERIKQVEQQLQADV 295

Query: 57  VM--------AIEKSKADAIKTMEDRKAKVGVARQERKANESKDAA 94
           V         AI +++ DA + +ED KA+   A   R+  ES  AA
Sbjct: 296 VAPAEAECKKAIARARGDAAQIIEDGKAQ---AEGTRRLAESWKAA 338


>gi|291571858|dbj|BAI94130.1| band 7 protein [Arthrospira platensis NIES-39]
          Length = 523

 Score = 39.9 bits (91), Expect = 0.10,   Method: Composition-based stats.
 Identities = 30/106 (28%), Positives = 57/106 (53%), Gaps = 17/106 (16%)

Query: 3   ELETAQAEADAKIAEAKAK------DSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDI 56
           E+E ++AEA+ ++A+A  K      +S ++   ++AR+ A+      RI + + + ++D+
Sbjct: 236 EVEVSKAEAERRVADAMTKRAAVVAESESETAAEVARTQAEVSVQKERIKQVEQQLQADV 295

Query: 57  VM--------AIEKSKADAIKTMEDRKAKVGVARQERKANESKDAA 94
           V         AI +++ DA + +ED KA+   A   R+  ES  AA
Sbjct: 296 VAPAEAECKKAIARARGDAAQIIEDGKAQ---AEGTRRLAESWKAA 338


>gi|209524411|ref|ZP_03272960.1| band 7 protein [Arthrospira maxima CS-328]
 gi|209495202|gb|EDZ95508.1| band 7 protein [Arthrospira maxima CS-328]
          Length = 523

 Score = 39.1 bits (89), Expect = 0.18,   Method: Composition-based stats.
 Identities = 30/106 (28%), Positives = 57/106 (53%), Gaps = 17/106 (16%)

Query: 3   ELETAQAEADAKIAEAKAK------DSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDI 56
           E+E ++AEA+ ++A+A  K      +S ++   ++AR+ A+      RI + + + ++D+
Sbjct: 236 EVEVSKAEAERRVADAMTKRAAVVAESESETAAEVARTQAEVSVQKERIKQVEQQLQADV 295

Query: 57  VM--------AIEKSKADAIKTMEDRKAKVGVARQERKANESKDAA 94
           V         AI +++ DA + +ED KA+   A   R+  ES  AA
Sbjct: 296 VAPAEAECKKAIARARGDAAQIIEDGKAQ---AEGTRRLAESWKAA 338


>gi|327539858|gb|EGF26461.1| efflux transporter, RND family, MFP subunit [Rhodopirellula baltica
           WH47]
          Length = 515

 Score = 39.1 bits (89), Expect = 0.21,   Method: Composition-based stats.
 Identities = 31/115 (26%), Positives = 58/115 (50%), Gaps = 11/115 (9%)

Query: 3   ELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEK 62
           E E A A+A  + +E++ K +  +A  ++AR+N       + +A+ +YK   ++V    +
Sbjct: 156 EAELAAAQARLQQSESQFKQA--KAMTEVARANLLQSEAQLNLADVRYKRTQNLV----E 209

Query: 63  SKADAIKTMEDRKAKVGVARQE----RKANESKDAATIITTATVENTKV-VETKE 112
             A +   ++DR+A+   A+ +    R +  S +AA     A +E  K  VET E
Sbjct: 210 RNASSQDELDDREAEFLKAKADIEGVRASLNSSEAAIATAQAEIELAKAGVETAE 264


>gi|307328251|ref|ZP_07607429.1| thymidylate kinase [Streptomyces violaceusniger Tu 4113]
 gi|306886085|gb|EFN17093.1| thymidylate kinase [Streptomyces violaceusniger Tu 4113]
          Length = 1100

 Score = 38.7 bits (88), Expect = 0.23,   Method: Composition-based stats.
 Identities = 22/78 (28%), Positives = 43/78 (55%), Gaps = 3/78 (3%)

Query: 8   QAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADA 67
           +AE  A+ A  +A+D+  +AE D  R  A+D+   V     + +  ++   A+ +++A+A
Sbjct: 777 EAERQAEAARQRAEDARRRAEEDRKRIEAEDRARAVDEERRRLEAEAE---AVRRAEAEA 833

Query: 68  IKTMEDRKAKVGVARQER 85
            +  E RKA+  + R E+
Sbjct: 834 RRQEEQRKAEEALLRAEQ 851


>gi|119495192|ref|XP_001264386.1| intracellular protein transport protein (UsoA), putative
           [Neosartorya fischeri NRRL 181]
 gi|119412548|gb|EAW22489.1| intracellular protein transport protein (UsoA), putative
           [Neosartorya fischeri NRRL 181]
          Length = 1055

 Score = 38.3 bits (87), Expect = 0.36,   Method: Composition-based stats.
 Identities = 31/116 (26%), Positives = 57/116 (49%), Gaps = 4/116 (3%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61
           +EL+ A+   +A++A+ +AK    Q+E+D A+   + +   +R+     +  S++    E
Sbjct: 881 SELDKAKEGHEAELADLRAKAQTVQSELDTAKQEHETEISGLRLKAQSLQ--SELDSGTE 938

Query: 62  KSKADAIKTMEDRKAKVGVARQERKANESK-DAATIITTATVENTKVVETKEAGKT 116
           KSK D     +D  +K+    +  K  ESK + A      + E  K V+  + GKT
Sbjct: 939 KSKEDLQAVHDDYSSKLSELEKRVKLAESKAEKAEADALRSAETLKEVQA-QLGKT 993


>gi|32476774|ref|NP_869768.1| acriflavin resistance protein A [Rhodopirellula baltica SH 1]
 gi|32447320|emb|CAD77146.1| acriflavine resistance protein A [Precursor] [Rhodopirellula
           baltica SH 1]
          Length = 467

 Score = 38.3 bits (87), Expect = 0.36,   Method: Composition-based stats.
 Identities = 31/115 (26%), Positives = 58/115 (50%), Gaps = 11/115 (9%)

Query: 3   ELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEK 62
           E E A A+A  + +E++ K +  +A  ++AR+N       + +A+ +YK   ++V    +
Sbjct: 108 EAELAAAQARLQQSESQFKQA--KAMTEVARANLLQSEAQLNLADVRYKRTQNLV----E 161

Query: 63  SKADAIKTMEDRKAKVGVARQE----RKANESKDAATIITTATVENTKV-VETKE 112
             A +   ++DR+A+   A+ +    R +  S +AA     A +E  K  VET E
Sbjct: 162 RNASSQDELDDREAEFLKAKADIEGVRASLNSSEAAIATAQAEIELAKAGVETAE 216


>gi|168700562|ref|ZP_02732839.1| hypothetical protein GobsU_13612 [Gemmata obscuriglobus UQM 2246]
          Length = 419

 Score = 38.3 bits (87), Expect = 0.38,   Method: Composition-based stats.
 Identities = 29/88 (32%), Positives = 44/88 (50%), Gaps = 6/88 (6%)

Query: 11  ADAKIA--EAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMA----IEKSK 64
            DAKIA  +A  K  G  AE+    +  KDK  L   AEA+ K   + +        KS 
Sbjct: 130 GDAKIAKFDAAGKQLGETAELPHVAAAMKDKDALKTRAEAQLKKEKEQMAQSYGNARKSI 189

Query: 65  ADAIKTMEDRKAKVGVARQERKANESKD 92
           A+ +K +ED+KA+     +ER+  + K+
Sbjct: 190 AEQLKRIEDKKAEERSKTEERQIEQFKN 217


>gi|302670141|ref|YP_003830101.1| ABC transporter permease [Butyrivibrio proteoclasticus B316]
 gi|302394614|gb|ADL33519.1| ABC transporter permease protein [Butyrivibrio proteoclasticus
           B316]
          Length = 1188

 Score = 38.3 bits (87), Expect = 0.39,   Method: Composition-based stats.
 Identities = 34/113 (30%), Positives = 52/113 (46%), Gaps = 12/113 (10%)

Query: 3   ELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAI-- 60
           +L  AQAE D K  EA+A+ +  +AE +  ++   DK     I +AK  Y S+   A   
Sbjct: 533 QLADAQAEYDRKEPEARAELASKEAEFEKQKAEGADK-----IKKAKGTYASNKAKAQEE 587

Query: 61  ----EKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVE 109
               EK   +A+   E++K K G  +  +   E  DA   I  A  +  K +E
Sbjct: 588 LTSREKEFDEAVAEFEEKK-KDGEEQLSQARKEYADAKAEIAKALSDARKEIE 639


>gi|119485088|ref|ZP_01619473.1| Band 7 protein [Lyngbya sp. PCC 8106]
 gi|119457316|gb|EAW38441.1| Band 7 protein [Lyngbya sp. PCC 8106]
          Length = 520

 Score = 37.9 bits (86), Expect = 0.43,   Method: Composition-based stats.
 Identities = 31/106 (29%), Positives = 56/106 (52%), Gaps = 17/106 (16%)

Query: 3   ELETAQAEADAKIAEAKAK------DSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDI 56
           E+E A+AEA+ ++ +A  K      +S ++   ++AR+ A+      RI + + + ++DI
Sbjct: 236 EIEVARAEAERRVKDAMTKRDAVIAESESEIASEVARTQAELPVQKARIIQVEQRLQADI 295

Query: 57  VM--------AIEKSKADAIKTMEDRKAKVGVARQERKANESKDAA 94
           V         AI ++K DA + +ED KA+   A   ++  ES  AA
Sbjct: 296 VAPAEAECKRAIARAKGDAAQIIEDGKAR---AEGTQRLAESWKAA 338


>gi|119356503|ref|YP_911147.1| Smr protein/MutS2 [Chlorobium phaeobacteroides DSM 266]
 gi|119353852|gb|ABL64723.1| Smr protein/MutS2 [Chlorobium phaeobacteroides DSM 266]
          Length = 794

 Score = 37.6 bits (85), Expect = 0.58,   Method: Composition-based stats.
 Identities = 24/91 (26%), Positives = 50/91 (54%), Gaps = 9/91 (9%)

Query: 3   ELETAQAEADAKIAEAKAKDSG---NQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMA 59
           +LE  +A+   ++   +A+++G    Q E+ +  +    K     +  A+ + R +IV  
Sbjct: 541 QLEGERADLAERVIALRAEEAGVERKQRELRLGAARELQK----EVEHARKEIR-EIVQE 595

Query: 60  IEKSKADAIKTMEDRKAKVGVARQERKANES 90
           +  + ADA KT++D + K+G+ +QE + +ES
Sbjct: 596 VRNAPADA-KTVQDSRKKLGLKKQEAEKSES 625


>gi|320108783|ref|YP_004184373.1| hypothetical protein AciPR4_3626 [Terriglobus saanensis SP1PR4]
 gi|319927304|gb|ADV84379.1| hypothetical protein AciPR4_3626 [Terriglobus saanensis SP1PR4]
          Length = 427

 Score = 37.2 bits (84), Expect = 0.68,   Method: Composition-based stats.
 Identities = 28/104 (26%), Positives = 54/104 (51%), Gaps = 4/104 (3%)

Query: 4   LETAQAEADAKIAEAKAKDSGNQAEIDIARSNAK---DKTDLVRIAEAKYKYRSDIVMAI 60
           L  A A  DA+++E + + +G++AE D  R++AK   D+ D +  A+A  K +SD  +A 
Sbjct: 162 LNAAIAVEDARLSEMEQQVAGSKAEADKLRADAKDAQDRLDALTAADATSKSQSDAQVAA 221

Query: 61  EKSKADAIKT-MEDRKAKVGVARQERKANESKDAATIITTATVE 103
              + DA    + D +A     + E  A  ++    ++  A+++
Sbjct: 222 LTQERDATAAKLRDAQASYQTVQYELDALRNQHKQDLLHLASLD 265


>gi|159028037|emb|CAO87997.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
          Length = 475

 Score = 36.8 bits (83), Expect = 1.0,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 46/89 (51%), Gaps = 14/89 (15%)

Query: 3   ELETAQAEADAKIAEAKAKDSGNQAEI------DIARSNAKDKTDLVRIAEAKYKYRSDI 56
           +LE A+A+A+ ++ + + K     AE+      D+A+  A+      RI + K + ++D+
Sbjct: 289 DLEIAKADAEKRVRDTQTKRGAMIAEVESVVMSDLAKVQAEVAVQTARIKQVKQQLQADV 348

Query: 57  V--------MAIEKSKADAIKTMEDRKAK 77
           +         AI K++ +A K +E  KA+
Sbjct: 349 IAPAAAECQQAIAKARGEAAKIIEQGKAQ 377


>gi|197116844|ref|YP_002137271.1| TPR domain-containing protein [Geobacter bemidjiensis Bem]
 gi|197086204|gb|ACH37475.1| TPR domain protein [Geobacter bemidjiensis Bem]
          Length = 2741

 Score = 36.8 bits (83), Expect = 1.1,   Method: Composition-based stats.
 Identities = 28/82 (34%), Positives = 47/82 (57%), Gaps = 10/82 (12%)

Query: 13   AKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADAIKTME 72
            A++A+A   D  NQA+I +A  +A+ K+ L   AEA Y+   ++        ADA+   E
Sbjct: 2189 AQLADAIG-DKVNQAKIHLALGDARAKSLLNAQAEASYRKALEL--------ADAMLVRE 2239

Query: 73   DR-KAKVGVARQERKANESKDA 93
             R +A +G+AR +++  +SK A
Sbjct: 2240 VRWRALLGLARLQQQGGDSKAA 2261


>gi|320164322|gb|EFW41221.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 828

 Score = 36.4 bits (82), Expect = 1.2,   Method: Composition-based stats.
 Identities = 31/93 (33%), Positives = 52/93 (55%), Gaps = 4/93 (4%)

Query: 4   LETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKD-KTDLVRIAEAKYKYRSDIVMAIEK 62
           LE+++A+ DAK+  AKA+ +   AE+D  RS   + K   VR  E   K R+++  A ++
Sbjct: 457 LESSRADMDAKLEIAKAELAQETAELDKLRSAVDEQKMTSVRQEEELRKLRNEVEQA-QR 515

Query: 63  SKADAIKTMEDRKAKVGVARQERKANESKDAAT 95
            +A  ++   D + K  VA+ E K  E+K  A+
Sbjct: 516 EQA-RLREQLDTETKT-VAQLEAKIEEAKSNAS 546


>gi|159131497|gb|EDP56610.1| intracellular protein transport protein (UsoA), putative
           [Aspergillus fumigatus A1163]
          Length = 1061

 Score = 36.4 bits (82), Expect = 1.2,   Method: Composition-based stats.
 Identities = 30/116 (25%), Positives = 57/116 (49%), Gaps = 4/116 (3%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61
           +EL+ A+   +A++A+ +AK    Q+E+D A+   + +   +R+     +  S++    E
Sbjct: 881 SELDKAKEGHEAELADLRAKAQTVQSELDTAKQEHETEISGLRVKAQSLQ--SELDSRTE 938

Query: 62  KSKADAIKTMEDRKAKVGVARQERKANESK-DAATIITTATVENTKVVETKEAGKT 116
           +SK D     +D  +K+    +  K  ESK + A      + E  K V+ +  GKT
Sbjct: 939 RSKEDLQAVHDDYLSKLSELEKRVKLAESKAEKAEADALKSAETLKEVQAR-LGKT 993


>gi|253699111|ref|YP_003020300.1| hypothetical protein GM21_0462 [Geobacter sp. M21]
 gi|251773961|gb|ACT16542.1| Tetratricopeptide TPR_2 repeat protein [Geobacter sp. M21]
          Length = 2741

 Score = 36.4 bits (82), Expect = 1.3,   Method: Composition-based stats.
 Identities = 28/82 (34%), Positives = 47/82 (57%), Gaps = 10/82 (12%)

Query: 13   AKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADAIKTME 72
            A++A+A   D  NQA+I +A  +A+ K+ L   AEA Y+    +        +DA+   E
Sbjct: 2189 AELADAIG-DKVNQAKIHLALGDARAKSRLDAQAEASYRKALGL--------SDAMLVRE 2239

Query: 73   DR-KAKVGVARQERKANESKDA 93
             R +A +G+AR +++A +SK A
Sbjct: 2240 VRWRALLGLARLQQQAGDSKAA 2261


>gi|70995974|ref|XP_752742.1| intracellular protein transport protein (UsoA) [Aspergillus
           fumigatus Af293]
 gi|44889965|emb|CAD29605.2| transport protein, putative [Aspergillus fumigatus]
 gi|66850377|gb|EAL90704.1| intracellular protein transport protein (UsoA), putative
           [Aspergillus fumigatus Af293]
          Length = 1061

 Score = 36.4 bits (82), Expect = 1.3,   Method: Composition-based stats.
 Identities = 30/116 (25%), Positives = 57/116 (49%), Gaps = 4/116 (3%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61
           +EL+ A+   +A++A+ +AK    Q+E+D A+   + +   +R+     +  S++    E
Sbjct: 881 SELDKAKEGHEAELADLRAKAQTVQSELDTAKQEHETEISGLRVKAQSLQ--SELDSRTE 938

Query: 62  KSKADAIKTMEDRKAKVGVARQERKANESK-DAATIITTATVENTKVVETKEAGKT 116
           +SK D     +D  +K+    +  K  ESK + A      + E  K V+  + GKT
Sbjct: 939 RSKEDLQAVHDDYLSKLSELEKRVKLAESKAEKAEADALKSAETLKEVQA-QLGKT 993


>gi|227833721|ref|YP_002835428.1| trigger factor [Corynebacterium aurimucosum ATCC 700975]
 gi|262184727|ref|ZP_06044148.1| trigger factor [Corynebacterium aurimucosum ATCC 700975]
 gi|254788997|sp|C3PI36|TIG_CORA7 RecName: Full=Trigger factor; Short=TF
 gi|227454737|gb|ACP33490.1| trigger factor [Corynebacterium aurimucosum ATCC 700975]
          Length = 451

 Score = 36.0 bits (81), Expect = 1.5,   Method: Composition-based stats.
 Identities = 28/100 (28%), Positives = 46/100 (46%), Gaps = 7/100 (7%)

Query: 26  QAEIDIARSNAKDKTDLVRIAEAKYKYR-------SDIVMAIEKSKADAIKTMEDRKAKV 78
           Q EIDIA+    D  +     + + ++        S  V A+E S+ D  K +ED  ++ 
Sbjct: 90  QPEIDIAKLEDNDFVEFTAEVDIRPEFEVPDFSKISVKVPALETSEEDVDKALEDLASRF 149

Query: 79  GVARQERKANESKDAATIITTATVENTKVVETKEAGKTAR 118
           G  +  ++  ++ D A I  T  V+ TK+ E    G T R
Sbjct: 150 GELKDTKRKMKTGDYAIIDITTEVDGTKLDEASHEGMTYR 189


>gi|78224167|ref|YP_385914.1| MutS 2 protein [Geobacter metallireducens GS-15]
 gi|78195422|gb|ABB33189.1| MutS 2 protein [Geobacter metallireducens GS-15]
          Length = 785

 Score = 35.6 bits (80), Expect = 2.0,   Method: Composition-based stats.
 Identities = 26/102 (25%), Positives = 53/102 (51%), Gaps = 6/102 (5%)

Query: 1   MAELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVR--IAEAKYKYRSDIVM 58
           ++ +ET   E  A++ + + +     AE +  R +A++K  +VR  +AEA+ K R     
Sbjct: 517 LSRMETEFHELLAELKDQRRRHEEALAEAERLRRDAEEKARIVRERLAEAEAKRRE---- 572

Query: 59  AIEKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTA 100
           A+EK+  +A + +   + +V    +E +  +S++A   I  A
Sbjct: 573 AVEKAFQEAKEIVRSARREVNAIIEEARKEKSREARKKIDEA 614


>gi|121701239|ref|XP_001268884.1| intracellular protein transport protein (UsoA), putative
           [Aspergillus clavatus NRRL 1]
 gi|119397027|gb|EAW07458.1| intracellular protein transport protein (UsoA), putative
           [Aspergillus clavatus NRRL 1]
          Length = 1048

 Score = 35.6 bits (80), Expect = 2.0,   Method: Composition-based stats.
 Identities = 29/108 (26%), Positives = 52/108 (48%), Gaps = 14/108 (12%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61
           +EL TA+ E + ++A+ KAK+   QAE +   S+ + K             +S++  A+E
Sbjct: 881 SELATARKEHETEVADLKAKNETLQAEHNTKISDVQAKAQ---------SLQSELESAME 931

Query: 62  KSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVE 109
           KSK D     +D  +K       +  N+++DA + +  A  +  K  E
Sbjct: 932 KSKKDLQVLHDDYSSKC-----SKLENQAEDAKSRVKKAEADAHKSEE 974


>gi|225853742|ref|YP_002735254.1| hypothetical protein SPJ_0142 [Streptococcus pneumoniae JJA]
 gi|225722642|gb|ACO18495.1| conserved hypothetical protein [Streptococcus pneumoniae JJA]
          Length = 450

 Score = 35.6 bits (80), Expect = 2.1,   Method: Composition-based stats.
 Identities = 24/88 (27%), Positives = 45/88 (51%), Gaps = 1/88 (1%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAK-YKYRSDIVMAI 60
           A+LET  +E   KI   K   +   +EID  +SN KD         ++ Y   + +V  +
Sbjct: 217 AKLETKISEITTKIEALKTNITSKNSEIDSQQSNIKDMNRTYNDPTSQAYNIYAQLVSEL 276

Query: 61  EKSKADAIKTMEDRKAKVGVARQERKAN 88
             ++++  K++ + +A +GVA  + KA+
Sbjct: 277 GTARSNNNKSITELEANLGVATGQDKAH 304


>gi|121595243|ref|YP_987139.1| RND efflux system outer membrane lipoprotein [Acidovorax sp. JS42]
 gi|120607323|gb|ABM43063.1| RND efflux system, outer membrane lipoprotein, NodT family
           [Acidovorax sp. JS42]
          Length = 511

 Score = 35.6 bits (80), Expect = 2.1,   Method: Composition-based stats.
 Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 3/71 (4%)

Query: 3   ELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEK 62
           E E A+    A++A A  +  G QA++DI R N +     +R+AE+  + R+ +    E 
Sbjct: 176 EREAARVALSAEVARAYLQLRGAQAQLDITRQNLEVADRTLRLAES--RERNGVATRFET 233

Query: 63  SKADA-IKTME 72
           S A A + T+E
Sbjct: 234 SSARAQLATVE 244


>gi|168487067|ref|ZP_02711575.1| conserved hypothetical protein [Streptococcus pneumoniae
           CDC1087-00]
 gi|225857982|ref|YP_002739492.1| hypothetical protein SP70585_0182 [Streptococcus pneumoniae 70585]
 gi|183570031|gb|EDT90559.1| conserved hypothetical protein [Streptococcus pneumoniae
           CDC1087-00]
 gi|225722007|gb|ACO17861.1| conserved hypothetical protein [Streptococcus pneumoniae 70585]
          Length = 450

 Score = 35.6 bits (80), Expect = 2.4,   Method: Composition-based stats.
 Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 1/88 (1%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAK-YKYRSDIVMAI 60
           A+LET  +E   KI   K   +   +EID  +SN KD         ++ Y   + ++  +
Sbjct: 217 AKLETKISEITTKIEALKTNITSKNSEIDSQQSNIKDMNRTYNDPTSQAYNIYAQLISEL 276

Query: 61  EKSKADAIKTMEDRKAKVGVARQERKAN 88
             ++++  K++ + +A +GVA  + KA+
Sbjct: 277 GTARSNNNKSITELEANLGVATGQDKAH 304


>gi|148983506|ref|ZP_01816825.1| hypothetical protein CGSSp3BS71_05229 [Streptococcus pneumoniae
           SP3-BS71]
 gi|147923653|gb|EDK74765.1| hypothetical protein CGSSp3BS71_05229 [Streptococcus pneumoniae
           SP3-BS71]
 gi|301799286|emb|CBW31812.1| unnamed protein product [Streptococcus pneumoniae OXC141]
          Length = 450

 Score = 35.6 bits (80), Expect = 2.4,   Method: Composition-based stats.
 Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 1/88 (1%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAK-YKYRSDIVMAI 60
           A+LET  +E   KI   K   +   +EID  +SN KD         ++ Y   + ++  +
Sbjct: 217 AKLETKISEITTKIEALKTNITSKNSEIDSQQSNIKDMNRTYNDPTSQAYNIYAQLISEL 276

Query: 61  EKSKADAIKTMEDRKAKVGVARQERKAN 88
             ++++  K++ + +A +GVA  + KA+
Sbjct: 277 GTARSNNNKSITELEANLGVATGQDKAH 304


>gi|89100461|ref|ZP_01173323.1| hypothetical protein B14911_05506 [Bacillus sp. NRRL B-14911]
 gi|89084804|gb|EAR63943.1| hypothetical protein B14911_05506 [Bacillus sp. NRRL B-14911]
          Length = 332

 Score = 35.6 bits (80), Expect = 2.4,   Method: Composition-based stats.
 Identities = 21/50 (42%), Positives = 29/50 (58%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYK 51
           AEL+T QAEAD KIA+AKA++    A        AK +    ++ EA+ K
Sbjct: 234 AELQTEQAEADKKIAQAKAEERRAMAVAQEQEMKAKVEEMRAKVVEAEAK 283


>gi|83649547|ref|YP_437982.1| outer membrane protein [Hahella chejuensis KCTC 2396]
 gi|83637590|gb|ABC33557.1| Outer membrane protein [Hahella chejuensis KCTC 2396]
          Length = 480

 Score = 35.6 bits (80), Expect = 2.5,   Method: Composition-based stats.
 Identities = 26/88 (29%), Positives = 44/88 (50%), Gaps = 5/88 (5%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61
           A L   Q E  A++    A   G QA++ +A+ N ++   +V + EAKYK      + + 
Sbjct: 159 AALRNVQVEIIAEVVRVYADLRGAQAQVQVAQRNLENLKSVVDLTEAKYKSGVGAELDVV 218

Query: 62  KSKAD---AIKTMEDRKAKVGVARQERK 86
           +SKA    A  T+   +A+  +AR E +
Sbjct: 219 RSKAQYAGAQATLAPLQAR--IARDEYR 244


>gi|31414576|dbj|BAC77268.1| UsoAp [Emericella nidulans]
          Length = 1103

 Score = 35.3 bits (79), Expect = 2.9,   Method: Composition-based stats.
 Identities = 32/128 (25%), Positives = 62/128 (48%), Gaps = 12/128 (9%)

Query: 2    AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61
            +E+E+ + +   ++++   K  G Q+E+D AR   + ++++V++  A    R+++    E
Sbjct: 928  SEVESTKEQRANEVSDLHKKIQGLQSELDSARK--QHESEVVKLESANETVRAELNTVKE 985

Query: 62   KSKADAIKTMEDRKAKVGV----ARQ-----ERKANESKDAATIITTATVENTKVV-ETK 111
            +S  D     ED  +K       A+Q     ER   E++ AA  +  A     K + ETK
Sbjct: 986  QSTQDLEAVREDYSSKCSALENRAQQAESEVERLEAEARKAAHALEEAQKALEKALQETK 1045

Query: 112  EAGKTARS 119
            E  +  +S
Sbjct: 1046 EKEEARQS 1053


>gi|91205782|ref|YP_538137.1| multidrug resistance protein A [Rickettsia bellii RML369-C]
 gi|91069326|gb|ABE05048.1| Multidrug resistance protein A [Rickettsia bellii RML369-C]
          Length = 353

 Score = 35.3 bits (79), Expect = 2.9,   Method: Composition-based stats.
 Identities = 30/111 (27%), Positives = 54/111 (48%), Gaps = 6/111 (5%)

Query: 6   TAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLV-RIAEAKYKYRSDIVMAIEKSK 64
           T  A  DA I++  A+ +G    + +  +   +K DL+  I +  YK +   + A+E S 
Sbjct: 45  TDNAYIDADISDVSAEINGVLTNLLVTNNTKVNKGDLIGEIDDRDYKAK---LAALEASI 101

Query: 65  ADAIKTME--DRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEA 113
             + K +E  D+K  +G  + E+   + K AAT     + + T+V E  +A
Sbjct: 102 GASEKNIEIIDQKISIGQNQLEQSGEKLKLAATSFNIVSTDFTRVQELNKA 152


>gi|168489411|ref|ZP_02713610.1| conserved hypothetical protein [Streptococcus pneumoniae SP195]
 gi|183572030|gb|EDT92558.1| conserved hypothetical protein [Streptococcus pneumoniae SP195]
          Length = 449

 Score = 35.3 bits (79), Expect = 3.0,   Method: Composition-based stats.
 Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 1/88 (1%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAK-YKYRSDIVMAI 60
           A+LET  +E   KI   K   +   +EID  +SN KD         ++ Y   + ++  +
Sbjct: 217 AKLETKISEITTKIEALKTNITSKNSEIDSQQSNIKDMNRTYNNPTSQAYNIYAQLISEL 276

Query: 61  EKSKADAIKTMEDRKAKVGVARQERKAN 88
             ++++  K++ + +A +GVA  + KA+
Sbjct: 277 GTARSNNNKSITELEANLGVATGQDKAH 304


>gi|148994314|ref|ZP_01823578.1| hypothetical protein CGSSp9BS68_07267 [Streptococcus pneumoniae
           SP9-BS68]
 gi|147927344|gb|EDK78376.1| hypothetical protein CGSSp9BS68_07267 [Streptococcus pneumoniae
           SP9-BS68]
 gi|332075786|gb|EGI86253.1| hypothetical protein SPAR50_0121 [Streptococcus pneumoniae GA17570]
          Length = 450

 Score = 35.3 bits (79), Expect = 3.1,   Method: Composition-based stats.
 Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 1/88 (1%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAK-YKYRSDIVMAI 60
           A+LET  +E   KI   K   +   +EID  +SN KD         ++ Y   + ++  +
Sbjct: 217 AKLETKISEITTKIEALKTNITSKNSEIDSQQSNIKDMNRTYNNPTSQAYNIYAQLISEL 276

Query: 61  EKSKADAIKTMEDRKAKVGVARQERKAN 88
             ++++  K++ + +A +GVA  + KA+
Sbjct: 277 GTARSNNNKSITELEANLGVATGQDKAH 304


>gi|171912957|ref|ZP_02928427.1| band 7 protein [Verrucomicrobium spinosum DSM 4136]
          Length = 485

 Score = 35.3 bits (79), Expect = 3.3,   Method: Composition-based stats.
 Identities = 39/121 (32%), Positives = 64/121 (52%), Gaps = 14/121 (11%)

Query: 2   AELETAQAEADAKI----AEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIV 57
            E+  AQA+ + KI    A+A+A    N A +DIA SNA     LV+ AEA  +  +   
Sbjct: 217 GEIGRAQAQKEQKIVVAQAQAEATTGENLAAVDIANSNA---NRLVQEAEANRQAEAAQN 273

Query: 58  MAIEKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEAGKTA 117
           +A  + + +A   +  R+A+  VAR E+  +++   A+++  A VE  + +ET  A   A
Sbjct: 274 VANARVQQEAY--LAQREAE--VARAEK--DKASQYASVVVPAEVEKLR-METIAAADAA 326

Query: 118 R 118
           R
Sbjct: 327 R 327


>gi|281357218|ref|ZP_06243707.1| phage tail tape measure protein, TP901 family [Victivallis vadensis
           ATCC BAA-548]
 gi|281316249|gb|EFB00274.1| phage tail tape measure protein, TP901 family [Victivallis vadensis
           ATCC BAA-548]
          Length = 898

 Score = 34.9 bits (78), Expect = 3.6,   Method: Composition-based stats.
 Identities = 31/118 (26%), Positives = 55/118 (46%), Gaps = 12/118 (10%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61
           AE+  A++E  + + E K + +   A+ID A+  ++  T+  R AE ++        A+ 
Sbjct: 786 AEINAAKSEWQSAMDEVKQRAAEKAAQIDEAKEKSEAATENTRNAETRFNSSFGGGKAVG 845

Query: 62  KSKADAIKTMEDRKAKVGVAR--QERKANESKDAATIITTATVENTKVVETKEAGKTA 117
              A+A+  M      +G A   QER A  S+     I + T E  + ++  + G TA
Sbjct: 846 AWSAEALDAM------LGGANNAQERTARASEQ----IVSNTRETNRQIKKLQGGSTA 893


>gi|15902152|ref|NP_357702.1| hypothetical protein spr0108 [Streptococcus pneumoniae R6]
 gi|116516521|ref|YP_815617.1| hypothetical protein SPD_0115 [Streptococcus pneumoniae D39]
 gi|15457645|gb|AAK98912.1| Conserved hypothetical protein [Streptococcus pneumoniae R6]
 gi|116077097|gb|ABJ54817.1| conserved hypothetical protein [Streptococcus pneumoniae D39]
          Length = 450

 Score = 34.9 bits (78), Expect = 4.1,   Method: Composition-based stats.
 Identities = 24/88 (27%), Positives = 44/88 (50%), Gaps = 1/88 (1%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAK-YKYRSDIVMAI 60
           A+LET   E   KI   K   +   +EID  +SN KD         ++ Y   + +V  +
Sbjct: 217 AKLETKILEITTKIEALKTNITSKNSEIDSQQSNIKDMNRTYNDPTSQAYNIYAQLVSEL 276

Query: 61  EKSKADAIKTMEDRKAKVGVARQERKAN 88
             ++++  K++ + +A +GVA  + KA+
Sbjct: 277 GTARSNNNKSITELEANLGVATGQDKAH 304


>gi|75907620|ref|YP_321916.1| hypothetical protein Ava_1398 [Anabaena variabilis ATCC 29413]
 gi|75701345|gb|ABA21021.1| Band 7 protein [Anabaena variabilis ATCC 29413]
          Length = 422

 Score = 34.9 bits (78), Expect = 4.2,   Method: Composition-based stats.
 Identities = 30/106 (28%), Positives = 55/106 (51%), Gaps = 16/106 (15%)

Query: 3   ELETAQAEADAKIAEAKAKDSGNQAEID------IARSNAKDKTDLVRIAEAKYKYRSDI 56
           +L+ A+AEA+ ++ +A  K +   AE++      IA+  A+      RI + + + ++DI
Sbjct: 236 DLQIAKAEAERRVRDAITKRTAVIAEVESVVNSQIAKVQAEVAVQTERIIQVENQLQADI 295

Query: 57  V--------MAIEKSKADAIKTMEDRKAKVGVARQERKANESKDAA 94
           V         AI ++K DA K +E+ KA+   A  +R A   ++A 
Sbjct: 296 VAPAEAECQTAIAQAKGDAAKIIEEGKAQ--AAGTQRLAESWQNAG 339


>gi|19703613|ref|NP_603175.1| DNA repair protein recN [Fusobacterium nucleatum subsp. nucleatum
           ATCC 25586]
 gi|19713719|gb|AAL94474.1| DNA repair protein recN [Fusobacterium nucleatum subsp. nucleatum
           ATCC 25586]
          Length = 558

 Score = 34.9 bits (78), Expect = 4.2,   Method: Composition-based stats.
 Identities = 25/83 (30%), Positives = 38/83 (45%), Gaps = 2/83 (2%)

Query: 22  DSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADAIK--TMEDRKAKVG 79
           DSG+    ++ +   K K +  +IAE     R +I + IE    + +K   MED K KV 
Sbjct: 342 DSGDFKTKELKKELNKIKDEYDKIAEKLTNSRKEIAVKIENELLNELKFLNMEDAKLKVQ 401

Query: 80  VARQERKANESKDAATIITTATV 102
           + + ER  NE  D      +  V
Sbjct: 402 INKLERMTNEGYDDVEFFISTNV 424


>gi|120402439|ref|YP_952268.1| hypothetical protein Mvan_1428 [Mycobacterium vanbaalenii PYR-1]
 gi|119955257|gb|ABM12262.1| band 7 protein [Mycobacterium vanbaalenii PYR-1]
          Length = 477

 Score = 34.9 bits (78), Expect = 4.3,   Method: Composition-based stats.
 Identities = 34/124 (27%), Positives = 57/124 (45%), Gaps = 13/124 (10%)

Query: 2   AELETAQAEADAKIAEAKAKDSG--NQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMA 59
           A + TA+AE DA+I  AKA+ +G   QAE D A + A  K D V +A  + +  ++   A
Sbjct: 200 AAVGTAEAERDAQIQSAKARQAGAVAQAEADTAIATANQKRD-VELARLRAQTEAENAQA 258

Query: 60  ----------IEKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVE 109
                      +K    AI+  E  + +  +  ++R+A +S+ A      A  E  +  +
Sbjct: 259 DQAGPLANARAQKDVGIAIEQAEAARVQARIEVEQRRAEQSQAALQADVIAPAEAQRAAD 318

Query: 110 TKEA 113
              A
Sbjct: 319 IARA 322


>gi|237745213|ref|ZP_04575694.1| DNA repair protein recN [Fusobacterium sp. 7_1]
 gi|229432442|gb|EEO42654.1| DNA repair protein recN [Fusobacterium sp. 7_1]
          Length = 558

 Score = 34.9 bits (78), Expect = 4.3,   Method: Composition-based stats.
 Identities = 23/83 (27%), Positives = 40/83 (48%), Gaps = 2/83 (2%)

Query: 22  DSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADAIK--TMEDRKAKVG 79
           DSG+    ++ +  AK K++  ++AE     R +I + IE    + +K   MED K KV 
Sbjct: 342 DSGDFKTRELKQELAKIKSEYDKLAEKLSNLRKEIAVKIENELLNELKFLNMEDAKLKVQ 401

Query: 80  VARQERKANESKDAATIITTATV 102
           + + E+  N+  D      +  V
Sbjct: 402 INKIEKMTNDGYDEVEFFISTNV 424


>gi|254302670|ref|ZP_04970028.1| DNA repair protein RecN [Fusobacterium nucleatum subsp. polymorphum
           ATCC 10953]
 gi|148322862|gb|EDK88112.1| DNA repair protein RecN [Fusobacterium nucleatum subsp. polymorphum
           ATCC 10953]
          Length = 553

 Score = 34.9 bits (78), Expect = 4.3,   Method: Composition-based stats.
 Identities = 24/83 (28%), Positives = 39/83 (46%), Gaps = 2/83 (2%)

Query: 22  DSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADAIK--TMEDRKAKVG 79
           DSG+    ++ +   K KT+  +IAE     R +I + IE    + +K   MED K KV 
Sbjct: 337 DSGDFKTKELKKELNKIKTEYDKIAEKLTNSRKEIAVKIENELLNELKFLNMEDAKLKVQ 396

Query: 80  VARQERKANESKDAATIITTATV 102
           + + E+  N+  D      +  V
Sbjct: 397 INKLEKMTNDGYDEVEFFISTNV 419


>gi|67470334|ref|XP_651135.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56467830|gb|EAL45749.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
          Length = 2089

 Score = 34.5 bits (77), Expect = 4.4,   Method: Composition-based stats.
 Identities = 26/86 (30%), Positives = 53/86 (61%), Gaps = 7/86 (8%)

Query: 34  SNAKDKTD-LVRIAEAKYKYRSDI---VMAIEKSKADAIKTMEDRKAKVGVARQ--ERKA 87
           +N+++KT    +I+E K K    +   V  ++K   D+ +T++++K K G+A++   +K 
Sbjct: 616 NNSENKTQKTTQISETKKKINHSVKKKVSELKKENKDS-QTIQEKKTKEGIAKKIHLKKV 674

Query: 88  NESKDAATIITTATVENTKVVETKEA 113
           N+ K+    I+ + ++ TK++ETKEA
Sbjct: 675 NQQKEIDQPISESEIKPTKLLETKEA 700


>gi|149001800|ref|ZP_01826773.1| hypothetical protein CGSSp14BS69_08730 [Streptococcus pneumoniae
           SP14-BS69]
 gi|237650847|ref|ZP_04525099.1| hypothetical protein SpneC1_09049 [Streptococcus pneumoniae CCRI
           1974]
 gi|237821338|ref|ZP_04597183.1| hypothetical protein SpneC19_03302 [Streptococcus pneumoniae CCRI
           1974M2]
 gi|147760258|gb|EDK67247.1| hypothetical protein CGSSp14BS69_08730 [Streptococcus pneumoniae
           SP14-BS69]
          Length = 450

 Score = 34.5 bits (77), Expect = 4.6,   Method: Composition-based stats.
 Identities = 24/88 (27%), Positives = 44/88 (50%), Gaps = 1/88 (1%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAK-YKYRSDIVMAI 60
           A+LET   E   KI   K   +   +EID  +SN KD         ++ Y   + +V  +
Sbjct: 217 AKLETKILEITTKIEALKTNITSKNSEIDSQQSNIKDMNRTYNDPTSQAYNIYAQLVSEL 276

Query: 61  EKSKADAIKTMEDRKAKVGVARQERKAN 88
             ++++  K++ + +A +GVA  + KA+
Sbjct: 277 GTARSNNNKSITELEANLGVATGQDKAH 304


>gi|148988834|ref|ZP_01820249.1| choline binding protein A [Streptococcus pneumoniae SP6-BS73]
 gi|147925645|gb|EDK76721.1| choline binding protein A [Streptococcus pneumoniae SP6-BS73]
          Length = 1008

 Score = 34.5 bits (77), Expect = 4.7,   Method: Composition-based stats.
 Identities = 37/133 (27%), Positives = 66/133 (49%), Gaps = 17/133 (12%)

Query: 3   ELETAQAEADAKIAE-----AKAKDSGNQAEIDIARSNAKDK----TDLVRIA----EAK 49
           ELE A+++ + K AE      KAK+S ++ +I  A +  + K    T L +I     EAK
Sbjct: 179 ELEIAESDVEVKKAELELVKVKAKESQDEEKIKQAEAEVESKQAEATRLKKIKTDREEAK 238

Query: 50  YKYRSDIVMAIEKSKADAIKTMEDRKAKVGV----ARQERKANESKDAATIITTATVENT 105
            K  + +  A+EK+ A + +    R+AK GV    A  ++K N++K + + +   T+ + 
Sbjct: 239 RKADAKLKEAVEKNVATSEQDKPKRRAKRGVSGELATPDKKENDAKSSDSSVGEETLPSP 298

Query: 106 KVVETKEAGKTAR 118
            +    E+    R
Sbjct: 299 SLNMANESQTEHR 311


>gi|67516849|ref|XP_658310.1| hypothetical protein AN0706.2 [Aspergillus nidulans FGSC A4]
 gi|40746326|gb|EAA65482.1| hypothetical protein AN0706.2 [Aspergillus nidulans FGSC A4]
 gi|259489020|tpe|CBF88948.1| TPA: hypothetical protein similar to UsoAp (Eurofung) [Aspergillus
           nidulans FGSC A4]
          Length = 1041

 Score = 34.5 bits (77), Expect = 4.7,   Method: Composition-based stats.
 Identities = 32/128 (25%), Positives = 62/128 (48%), Gaps = 12/128 (9%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61
           +E+E+ + +   ++++   K  G Q+E+D AR   + ++++V++  A    R+++    E
Sbjct: 866 SEVESTKEQRANEVSDLHKKIQGLQSELDSARK--QHESEVVKLESANETVRAELNTVKE 923

Query: 62  KSKADAIKTMEDRKAKVGV----ARQ-----ERKANESKDAATIITTATVENTKVV-ETK 111
           +S  D     ED  +K       A+Q     ER   E++ AA  +  A     K + ETK
Sbjct: 924 QSTQDLEAVREDYSSKCSALENRAQQAESEVERLEAEARKAAHALEEAQKALEKALQETK 983

Query: 112 EAGKTARS 119
           E  +  +S
Sbjct: 984 EKEEARQS 991


>gi|158520035|ref|YP_001527905.1| hypothetical protein Dole_0018 [Desulfococcus oleovorans Hxd3]
 gi|254799454|sp|A8ZRR1|Y018_DESOH RecName: Full=UPF0365 protein Dole_0018
 gi|158508861|gb|ABW65828.1| protein of unknown function DUF1432 [Desulfococcus oleovorans Hxd3]
          Length = 326

 Score = 34.5 bits (77), Expect = 4.7,   Method: Composition-based stats.
 Identities = 23/61 (37%), Positives = 33/61 (54%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61
           AELET +AEAD KIA+AKA++    A        A+ +    ++ EA+ K    +  A E
Sbjct: 231 AELETDRAEADKKIAQAKAEERRAMAYAREQEMKAQVEEMRAKVVEAEAKIPLAMANAFE 290

Query: 62  K 62
           K
Sbjct: 291 K 291


>gi|166366427|ref|YP_001658700.1| hypothetical protein MAE_36860 [Microcystis aeruginosa NIES-843]
 gi|166088800|dbj|BAG03508.1| hypothetical protein MAE_36860 [Microcystis aeruginosa NIES-843]
          Length = 427

 Score = 34.5 bits (77), Expect = 4.8,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 46/89 (51%), Gaps = 14/89 (15%)

Query: 3   ELETAQAEADAKIAEAKAKDSGNQAEI------DIARSNAKDKTDLVRIAEAKYKYRSDI 56
           +LE A+A+A+ ++ + + K     AE+      D+A+  A+      RI + K + ++D+
Sbjct: 236 DLEIAKADAEKRVRDTQTKRGAMIAEVESVVMSDLAKVQAEVAVQNARIKQVKQQLQADV 295

Query: 57  V--------MAIEKSKADAIKTMEDRKAK 77
           +         AI K++ +A K +E  KA+
Sbjct: 296 IAPAAAECQQAIAKARGEAAKIIEQGKAQ 324



 Score = 34.1 bits (76), Expect = 5.7,   Method: Composition-based stats.
 Identities = 30/108 (27%), Positives = 50/108 (46%), Gaps = 16/108 (14%)

Query: 5   ETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSK 64
           + A+ + DA+IAEAKA+ +          S  KD  +L   A  +      I   +E +K
Sbjct: 198 QKAELQRDARIAEAKARKT----------SIIKDSENLRLTALRR------IQKDLEIAK 241

Query: 65  ADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKE 112
           ADA K + D + K G    E ++    D A +     V+N ++ + K+
Sbjct: 242 ADAEKRVRDTQTKRGAMIAEVESVVMSDLAKVQAEVAVQNARIKQVKQ 289


>gi|315150240|gb|EFT94256.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus
           faecalis TX0012]
          Length = 711

 Score = 34.5 bits (77), Expect = 5.0,   Method: Composition-based stats.
 Identities = 24/85 (28%), Positives = 42/85 (49%), Gaps = 4/85 (4%)

Query: 25  NQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADAIKTMEDRKAKVGVARQE 84
           N AE+ + + N + +  + +I E       D +M   K+K + I T  +R+ +V  +  +
Sbjct: 564 NGAEVALPKFNNRGEDFVYQIEEVNVPAEFDSIMT--KTKDEFILT--NRRKEVVSSTTD 619

Query: 85  RKANESKDAATIITTATVENTKVVE 109
              NES  + T  T ++ E T VVE
Sbjct: 620 TSTNESLSSETATTNSSAEKTIVVE 644


>gi|325478751|gb|EGC81862.1| RecF/RecN/SMC N-terminal domain protein [Anaerococcus prevotii
           ACS-065-V-Col13]
          Length = 1144

 Score = 34.5 bits (77), Expect = 5.2,   Method: Composition-based stats.
 Identities = 36/113 (31%), Positives = 56/113 (49%), Gaps = 12/113 (10%)

Query: 4   LETAQAEADAKIAEAKA-KDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEK 62
           LE    E  A+IA+ K   +   Q +I I+ +  + + DLV+I+E  Y+Y  +     +K
Sbjct: 321 LEEKLDEIKAQIADNKELSNKLEQDKIAISDNKKRVEVDLVKISEEIYQYEIN-----QK 375

Query: 63  SKADAIKTMEDRKAKVGVARQERKANESKDAAT--IITTATVENTKVVETKEA 113
           SKA     ++ RKAK    R E+  N SK+     I   + +E  K +E K A
Sbjct: 376 SKA----ILDKRKAKEDEIRIEKLNNLSKEIEKLEIDRDSLIEEIKTLEEKNA 424


>gi|257415126|ref|ZP_05592120.1| predicted protein [Enterococcus faecalis AR01/DG]
 gi|257156954|gb|EEU86914.1| predicted protein [Enterococcus faecalis ARO1/DG]
          Length = 711

 Score = 34.5 bits (77), Expect = 5.5,   Method: Composition-based stats.
 Identities = 24/85 (28%), Positives = 42/85 (49%), Gaps = 4/85 (4%)

Query: 25  NQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADAIKTMEDRKAKVGVARQE 84
           N AE+ + + N + +  + +I E       D +M   K+K + I T  +R+ +V  +  +
Sbjct: 564 NGAEVALPKFNNRGEDFVYQIEEVNVPAEFDSIMT--KTKDEFILT--NRRKEVVSSTTD 619

Query: 85  RKANESKDAATIITTATVENTKVVE 109
              NES  + T  T ++ E T VVE
Sbjct: 620 TSTNESLSSETATTNSSAEKTIVVE 644


>gi|17556907|ref|NP_498856.1| UBX-containing protein in Nematode family member (ubxn-4)
           [Caenorhabditis elegans]
 gi|466104|sp|P34631|UBXN4_CAEEL RecName: Full=UBX domain-containing protein 4
 gi|289758|gb|AAA28196.1| Ubx-containing protein in nematodes protein 4, confirmed by
           transcript evidence [Caenorhabditis elegans]
          Length = 469

 Score = 34.1 bits (76), Expect = 5.9,   Method: Composition-based stats.
 Identities = 29/118 (24%), Positives = 53/118 (44%), Gaps = 6/118 (5%)

Query: 3   ELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSD-IVMAIE 61
           EL    A A A + + K KD+  + E D        K ++ +  EAK +  ++ +V A +
Sbjct: 168 ELAEKVARAKALLEQKKQKDAEKKREAD-----KHVKEEMTKAREAKQERDAEALVKAAK 222

Query: 62  KSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEAGKTARS 119
           + K + +    D+K  +   + +R+A + K    + T    ENT+  +    GK   S
Sbjct: 223 QRKMEKLAAESDKKRILAQIKADREAAQKKFGKLVNTENASENTEKKQETTVGKAVPS 280


>gi|297570639|ref|YP_003696413.1| Exo-alpha-sialidase [Arcanobacterium haemolyticum DSM 20595]
 gi|296930986|gb|ADH91794.1| Exo-alpha-sialidase [Arcanobacterium haemolyticum DSM 20595]
          Length = 885

 Score = 34.1 bits (76), Expect = 5.9,   Method: Composition-based stats.
 Identities = 33/112 (29%), Positives = 54/112 (48%), Gaps = 6/112 (5%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61
           A+ ET +AE   K    K  DS N A++++A++  +     + + EAK         A E
Sbjct: 544 ADSETTRAE---KARLQKEFDSIN-AKLEVAKAEKQRVASDLAVKEAKLTESEKQKAAAE 599

Query: 62  KSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEA 113
           K  A+  K +ED  AKV  A  ER  +++ ++A  +     E+ K +E  E 
Sbjct: 600 KKVAEQHKQIEDLNAKVKKAESER--DQATESAKNLEEKAKEDAKSIEGLEG 649


>gi|49481625|ref|YP_038596.1| cell surface protein [Bacillus thuringiensis serovar konkukian str.
           97-27]
 gi|49333181|gb|AAT63827.1| conserved hypothetical protein, possible cell surface protein
           [Bacillus thuringiensis serovar konkukian str. 97-27]
          Length = 953

 Score = 34.1 bits (76), Expect = 6.0,   Method: Composition-based stats.
 Identities = 29/111 (26%), Positives = 48/111 (43%), Gaps = 17/111 (15%)

Query: 11  ADAKIAEAKAKDSGNQAEIDI----ARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKAD 66
           ADAK+     K +    + ++    A+ NAK K D   I E  Y +  D+ +  + +K  
Sbjct: 806 ADAKVVSEDKKANTRVVQFEVSDLFAKLNAKVKVD---IDEMNYHHFYDVQIQFDTTKIG 862

Query: 67  AIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEAGKTA 117
           A          VG  ++E K +   +    +TT  V+N K V T +  + A
Sbjct: 863 A----------VGTVKEEPKNDPKNEPKNPVTTPKVDNVKTVGTPDFNRNA 903


>gi|167574255|ref|ZP_02367129.1| gp14 [Burkholderia oklahomensis C6786]
          Length = 1354

 Score = 34.1 bits (76), Expect = 6.9,   Method: Composition-based stats.
 Identities = 26/80 (32%), Positives = 43/80 (53%), Gaps = 7/80 (8%)

Query: 12  DAKIAEAKAKDSGNQAEIDIARSNAKDKTDL-VRIAEAKYKYRSDIVMAIEKSKADAIKT 70
           +AK+AEA+A ++  QA +  ARSN  +  ++  RIA   Y         I +  A A + 
Sbjct: 401 EAKLAEARAVEASAQAHVATARSNLANSQEIGTRIAGLPY------AAIIARETAAAQQE 454

Query: 71  MEDRKAKVGVARQERKANES 90
           +E  +A + +A+Q R A E+
Sbjct: 455 LERAEASLALAQQRRTALEA 474


>gi|288555735|ref|YP_003427670.1| hypothetical protein BpOF4_13635 [Bacillus pseudofirmus OF4]
 gi|288546895|gb|ADC50778.1| hypothetical protein BpOF4_13635 [Bacillus pseudofirmus OF4]
          Length = 334

 Score = 34.1 bits (76), Expect = 7.2,   Method: Composition-based stats.
 Identities = 27/98 (27%), Positives = 44/98 (44%), Gaps = 8/98 (8%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61
           AEL+T QAEAD KIA+AKA++    A        A+ +    ++ EA+ +    +  A+ 
Sbjct: 235 AELQTDQAEADKKIAQAKAEERRAMAVAQEQEMKARVEEMRAKVVEAEAEVPMALSDALR 294

Query: 62  KSKADAIKTME--------DRKAKVGVARQERKANESK 91
           K     +  M         D +  +  A  E  +NE +
Sbjct: 295 KGNMGVMDYMNYQNVMADTDMRGSISKATGEDDSNEKR 332


>gi|8163699|gb|AAF73809.1|AF154037_1 surface protein PspC [Streptococcus pneumoniae]
          Length = 929

 Score = 33.7 bits (75), Expect = 7.6,   Method: Composition-based stats.
 Identities = 37/133 (27%), Positives = 66/133 (49%), Gaps = 17/133 (12%)

Query: 3   ELETAQAEADAKIAE-----AKAKDSGNQAEIDIARSNAKDK----TDLVRIA----EAK 49
           ELE A+++ + K AE      KAK+S ++ +I  A +  + K    T L +I     EAK
Sbjct: 179 ELEIAESDVEVKKAELELVKVKAKESQDEEKIKQAEAEVESKQAEATRLKKIKTDREEAK 238

Query: 50  YKYRSDIVMAIEKSKADAIKTMEDRKAKVGV----ARQERKANESKDAATIITTATVENT 105
            K  + +  A+EK+ A + +    R+AK GV    A  ++K N++K + + +   T+ + 
Sbjct: 239 RKADAKLKEAVEKNVATSEQDKPKRRAKRGVSGELATPDKKENDAKSSDSSVGEETLPSP 298

Query: 106 KVVETKEAGKTAR 118
            +    E+    R
Sbjct: 299 SLNMANESQTEHR 311


>gi|77864693|ref|YP_355403.1| gp68 [Burkholderia phage Bcep176]
 gi|161520424|ref|YP_001583851.1| phage tape measure protein [Burkholderia multivorans ATCC 17616]
 gi|189353385|ref|YP_001949012.1| bacteriophage tape measure protein [Burkholderia multivorans ATCC
           17616]
 gi|76885879|gb|ABA60069.1| gp68 [Burkholderia phage Bcep176]
 gi|160344474|gb|ABX17559.1| phage tape measure protein [Burkholderia multivorans ATCC 17616]
 gi|189337407|dbj|BAG46476.1| bacteriophage tape measure protein [Burkholderia multivorans ATCC
           17616]
          Length = 1380

 Score = 33.7 bits (75), Expect = 8.4,   Method: Composition-based stats.
 Identities = 28/87 (32%), Positives = 44/87 (50%), Gaps = 7/87 (8%)

Query: 12  DAKIAEAKAKDSGNQAEIDIARSNAKDKTDL-VRIAEAKYKYRSDIVMAIEKSKADAIKT 70
           DAK+AEA+A ++   A++  ARSN  +  ++  RIA   Y         I +  A A   
Sbjct: 401 DAKLAEARAIEASAVAQVATARSNLANSQEIGTRIAGTPY------AAVIARETAAAQGE 454

Query: 71  MEDRKAKVGVARQERKANESKDAATII 97
           +E  +A + +A+Q R A E+  A   I
Sbjct: 455 LERAEASLALAQQRRVALEAAAAKGTI 481


>gi|167566450|ref|ZP_02359366.1| gp14 [Burkholderia oklahomensis EO147]
          Length = 1354

 Score = 33.7 bits (75), Expect = 8.5,   Method: Composition-based stats.
 Identities = 26/81 (32%), Positives = 44/81 (54%), Gaps = 7/81 (8%)

Query: 11  ADAKIAEAKAKDSGNQAEIDIARSNAKDKTDL-VRIAEAKYKYRSDIVMAIEKSKADAIK 69
           ++AK+AEA+A ++  QA +  ARSN  +  ++  RIA   Y         I +  A A +
Sbjct: 400 SEAKLAEARAVEASAQAHVATARSNLANSQEIGTRIAGLPY------AAIIARETAAAQQ 453

Query: 70  TMEDRKAKVGVARQERKANES 90
            +E  +A + +A+Q R A E+
Sbjct: 454 ELERAEASLALAQQRRTALEA 474


>gi|312144008|ref|YP_003995454.1| hypothetical protein Halsa_1677 [Halanaerobium sp. 'sapolanicus']
 gi|311904659|gb|ADQ15100.1| hypothetical protein Halsa_1677 [Halanaerobium sp. 'sapolanicus']
          Length = 327

 Score = 33.7 bits (75), Expect = 8.7,   Method: Composition-based stats.
 Identities = 24/95 (25%), Positives = 45/95 (47%), Gaps = 6/95 (6%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61
           A+L+T QAEAD +IA+AKA++    A  +     A+ +    ++ EA+ +    +  A++
Sbjct: 230 AQLQTDQAEADKEIAQAKAEERRAMAVAEEQEMKARVQEMRAKVVEAEAQVPLAMAEALK 289

Query: 62  KSKADAIKTM------EDRKAKVGVARQERKANES 90
                 +  M       D K +  ++    K NE+
Sbjct: 290 NGNLGVMDYMNLKNIESDTKMRSSISESSEKNNEN 324


>gi|319651603|ref|ZP_08005730.1| hypothetical protein HMPREF1013_02342 [Bacillus sp. 2_A_57_CT2]
 gi|317396670|gb|EFV77381.1| hypothetical protein HMPREF1013_02342 [Bacillus sp. 2_A_57_CT2]
          Length = 332

 Score = 33.7 bits (75), Expect = 8.9,   Method: Composition-based stats.
 Identities = 19/48 (39%), Positives = 28/48 (58%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAK 49
           AEL+T QAEAD KIA+AKA++    A        A+ +    ++ EA+
Sbjct: 234 AELQTEQAEADKKIAQAKAEERRAMAVAQEQEMKARVEEMRAKVVEAE 281


>gi|4097980|gb|AAD00184.1| surface protein C [Streptococcus pneumoniae]
          Length = 929

 Score = 33.7 bits (75), Expect = 9.0,   Method: Composition-based stats.
 Identities = 37/133 (27%), Positives = 66/133 (49%), Gaps = 17/133 (12%)

Query: 3   ELETAQAEADAKIAE-----AKAKDSGNQAEIDIARSNAKDK----TDLVRIA----EAK 49
           ELE A+++ + K AE      KAK+S ++ +I  A +  + K    T L +I     EAK
Sbjct: 179 ELEIAESDVEVKKAELELVKVKAKESQDEEKIKQAEAEVESKQAEATRLKKIKTDREEAK 238

Query: 50  YKYRSDIVMAIEKSKADAIKTMEDRKAKVGV----ARQERKANESKDAATIITTATVENT 105
            K  + +  A+EK+ A + +    R+AK GV    A  ++K N++K + + +   T+ + 
Sbjct: 239 RKADAKLKEAVEKNVATSEQDKPKRRAKRGVSGELATPDKKENDAKSSDSSVGEETLPSP 298

Query: 106 KVVETKEAGKTAR 118
            +    E+    R
Sbjct: 299 SLNMANESQTEHR 311


>gi|260584545|ref|ZP_05852291.1| YSIRK type signal peptide [Granulicatella elegans ATCC 700633]
 gi|260157568|gb|EEW92638.1| YSIRK type signal peptide [Granulicatella elegans ATCC 700633]
          Length = 2668

 Score = 33.7 bits (75), Expect = 9.6,   Method: Composition-based stats.
 Identities = 33/118 (27%), Positives = 53/118 (44%), Gaps = 21/118 (17%)

Query: 6    TAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKA 65
            TA+  A+ K+  AK + +  +AE+D A++   D        EAK K   D  +A EK+  
Sbjct: 991  TAEDLAEKKLKSAKEQQASLKAELDKAKAALPD-------VEAKVKSARDEALAAEKA-- 1041

Query: 66   DAIKTMEDRKAKVGVARQERKANESKDAATI-------ITTATVENTKVVETKEAGKT 116
                 +E  +  +  A ++  AN    A T+       +T   V++  VV T   GKT
Sbjct: 1042 -----VETAREALKTAAEKNLANPEIAAYTLGEYGSYKVTVRAVDSNGVVTTPTVGKT 1094


>gi|311029291|ref|ZP_07707381.1| flotillin-like protein [Bacillus sp. m3-13]
          Length = 511

 Score = 33.3 bits (74), Expect = 9.9,   Method: Composition-based stats.
 Identities = 26/84 (30%), Positives = 49/84 (58%), Gaps = 7/84 (8%)

Query: 2   AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAK-DKTDLVRIAEAKYKYRSDIVMAI 60
           A++ TA+A+ + +I  A+A     +AE++ A   A+ +KT+ +++AE  Y+   DI    
Sbjct: 210 ADIATAEADKETRIKRAEAAKDAQRAELERATEIAEAEKTNQMKVAE--YRREQDIA--- 264

Query: 61  EKSKADAIKTMEDRKAKVGVARQE 84
            K++AD    +E+ +AK  V  Q+
Sbjct: 265 -KARADQAYHLEEARAKQEVTEQQ 287


  Database: nr
    Posted date:  May 13, 2011  4:10 AM
  Number of letters in database: 999,999,932
  Number of sequences in database:  2,987,209
  
  Database: /data/usr2/db/fasta/nr.01
    Posted date:  May 13, 2011  4:17 AM
  Number of letters in database: 999,998,956
  Number of sequences in database:  2,896,973
  
  Database: /data/usr2/db/fasta/nr.02
    Posted date:  May 13, 2011  4:23 AM
  Number of letters in database: 999,999,979
  Number of sequences in database:  2,907,862
  
  Database: /data/usr2/db/fasta/nr.03
    Posted date:  May 13, 2011  4:29 AM
  Number of letters in database: 999,999,513
  Number of sequences in database:  2,932,190
  
  Database: /data/usr2/db/fasta/nr.04
    Posted date:  May 13, 2011  4:33 AM
  Number of letters in database: 792,586,372
  Number of sequences in database:  2,260,650
  
Lambda     K      H
   0.302    0.116    0.279 

Lambda     K      H
   0.267   0.0363    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,360,724,437
Number of Sequences: 13984884
Number of extensions: 38714502
Number of successful extensions: 159683
Number of sequences better than 10.0: 688
Number of HSP's better than 10.0 without gapping: 227
Number of HSP's successfully gapped in prelim test: 2095
Number of HSP's that attempted gapping in prelim test: 154631
Number of HSP's gapped (non-prelim): 6339
length of query: 119
length of database: 4,792,584,752
effective HSP length: 86
effective length of query: 33
effective length of database: 3,589,884,728
effective search space: 118466196024
effective search space used: 118466196024
T: 11
A: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 75 (33.7 bits)