BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780176|ref|YP_003064589.1| hypothetical protein CLIBASIA_00295 [Candidatus Liberibacter asiaticus str. psy62] (119 letters) Database: nr 13,984,884 sequences; 4,792,584,752 total letters Searching..................................................done >gi|254780176|ref|YP_003064589.1| hypothetical protein CLIBASIA_00295 [Candidatus Liberibacter asiaticus str. psy62] gi|254039853|gb|ACT56649.1| hypothetical protein CLIBASIA_00295 [Candidatus Liberibacter asiaticus str. psy62] Length = 119 Score = 125 bits (314), Expect = 2e-27, Method: Composition-based stats. Identities = 119/119 (100%), Positives = 119/119 (100%) Query: 1 MAELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAI 60 MAELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAI Sbjct: 1 MAELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAI 60 Query: 61 EKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEAGKTARS 119 EKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEAGKTARS Sbjct: 61 EKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEAGKTARS 119 >gi|82541274|ref|XP_724889.1| dihydrolipoamide S-acetyltransferase [Plasmodium yoelii yoelii str. 17XNL] gi|23479697|gb|EAA16454.1| putative dihydrolipoamide S-acetyltransferase [Plasmodium yoelii yoelii] Length = 561 Score = 40.6 bits (93), Expect = 0.061, Method: Composition-based stats. Identities = 34/124 (27%), Positives = 58/124 (46%), Gaps = 19/124 (15%) Query: 1 MAELETAQAE-ADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIA-----EAKYKYRS 54 M+++ET E AD + + + G E I + + K + VRIA E ++ +S Sbjct: 147 MSDVETTSVETADVETTDVE----GESVEKGIYSPSVQSKKNKVRIAKWLCKENEFVNKS 202 Query: 55 DIVMAIEKSKADAIKTMEDRKAKVGV-----ARQERKANESKDAATIITTATVENTKVVE 109 D++ IE K+ T+E G+ ++ A+ K ATI+ T +ENT + Sbjct: 203 DVIFHIEDDKS----TIEVDSPYTGIIKTILVKEGELADLEKQVATILETNELENTSMNL 258 Query: 110 TKEA 113 + EA Sbjct: 259 SSEA 262 >gi|284052568|ref|ZP_06382778.1| SPFH domain-containing protein [Arthrospira platensis str. Paraca] Length = 523 Score = 40.3 bits (92), Expect = 0.10, Method: Composition-based stats. Identities = 30/106 (28%), Positives = 57/106 (53%), Gaps = 17/106 (16%) Query: 3 ELETAQAEADAKIAEAKAK------DSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDI 56 E+E ++AEA+ ++A+A K +S ++ ++AR+ A+ RI + + + ++D+ Sbjct: 236 EVEVSKAEAERRVADAMTKRAAVVAESESETAAEVARTQAEVSVQKERIKQVEQQLQADV 295 Query: 57 VM--------AIEKSKADAIKTMEDRKAKVGVARQERKANESKDAA 94 V AI +++ DA + +ED KA+ A R+ ES AA Sbjct: 296 VAPAEAECKKAIARARGDAAQIIEDGKAQ---AEGTRRLAESWKAA 338 >gi|291571858|dbj|BAI94130.1| band 7 protein [Arthrospira platensis NIES-39] Length = 523 Score = 39.9 bits (91), Expect = 0.10, Method: Composition-based stats. Identities = 30/106 (28%), Positives = 57/106 (53%), Gaps = 17/106 (16%) Query: 3 ELETAQAEADAKIAEAKAK------DSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDI 56 E+E ++AEA+ ++A+A K +S ++ ++AR+ A+ RI + + + ++D+ Sbjct: 236 EVEVSKAEAERRVADAMTKRAAVVAESESETAAEVARTQAEVSVQKERIKQVEQQLQADV 295 Query: 57 VM--------AIEKSKADAIKTMEDRKAKVGVARQERKANESKDAA 94 V AI +++ DA + +ED KA+ A R+ ES AA Sbjct: 296 VAPAEAECKKAIARARGDAAQIIEDGKAQ---AEGTRRLAESWKAA 338 >gi|209524411|ref|ZP_03272960.1| band 7 protein [Arthrospira maxima CS-328] gi|209495202|gb|EDZ95508.1| band 7 protein [Arthrospira maxima CS-328] Length = 523 Score = 39.1 bits (89), Expect = 0.18, Method: Composition-based stats. Identities = 30/106 (28%), Positives = 57/106 (53%), Gaps = 17/106 (16%) Query: 3 ELETAQAEADAKIAEAKAK------DSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDI 56 E+E ++AEA+ ++A+A K +S ++ ++AR+ A+ RI + + + ++D+ Sbjct: 236 EVEVSKAEAERRVADAMTKRAAVVAESESETAAEVARTQAEVSVQKERIKQVEQQLQADV 295 Query: 57 VM--------AIEKSKADAIKTMEDRKAKVGVARQERKANESKDAA 94 V AI +++ DA + +ED KA+ A R+ ES AA Sbjct: 296 VAPAEAECKKAIARARGDAAQIIEDGKAQ---AEGTRRLAESWKAA 338 >gi|327539858|gb|EGF26461.1| efflux transporter, RND family, MFP subunit [Rhodopirellula baltica WH47] Length = 515 Score = 39.1 bits (89), Expect = 0.21, Method: Composition-based stats. Identities = 31/115 (26%), Positives = 58/115 (50%), Gaps = 11/115 (9%) Query: 3 ELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEK 62 E E A A+A + +E++ K + +A ++AR+N + +A+ +YK ++V + Sbjct: 156 EAELAAAQARLQQSESQFKQA--KAMTEVARANLLQSEAQLNLADVRYKRTQNLV----E 209 Query: 63 SKADAIKTMEDRKAKVGVARQE----RKANESKDAATIITTATVENTKV-VETKE 112 A + ++DR+A+ A+ + R + S +AA A +E K VET E Sbjct: 210 RNASSQDELDDREAEFLKAKADIEGVRASLNSSEAAIATAQAEIELAKAGVETAE 264 >gi|307328251|ref|ZP_07607429.1| thymidylate kinase [Streptomyces violaceusniger Tu 4113] gi|306886085|gb|EFN17093.1| thymidylate kinase [Streptomyces violaceusniger Tu 4113] Length = 1100 Score = 38.7 bits (88), Expect = 0.23, Method: Composition-based stats. Identities = 22/78 (28%), Positives = 43/78 (55%), Gaps = 3/78 (3%) Query: 8 QAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADA 67 +AE A+ A +A+D+ +AE D R A+D+ V + + ++ A+ +++A+A Sbjct: 777 EAERQAEAARQRAEDARRRAEEDRKRIEAEDRARAVDEERRRLEAEAE---AVRRAEAEA 833 Query: 68 IKTMEDRKAKVGVARQER 85 + E RKA+ + R E+ Sbjct: 834 RRQEEQRKAEEALLRAEQ 851 >gi|119495192|ref|XP_001264386.1| intracellular protein transport protein (UsoA), putative [Neosartorya fischeri NRRL 181] gi|119412548|gb|EAW22489.1| intracellular protein transport protein (UsoA), putative [Neosartorya fischeri NRRL 181] Length = 1055 Score = 38.3 bits (87), Expect = 0.36, Method: Composition-based stats. Identities = 31/116 (26%), Positives = 57/116 (49%), Gaps = 4/116 (3%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61 +EL+ A+ +A++A+ +AK Q+E+D A+ + + +R+ + S++ E Sbjct: 881 SELDKAKEGHEAELADLRAKAQTVQSELDTAKQEHETEISGLRLKAQSLQ--SELDSGTE 938 Query: 62 KSKADAIKTMEDRKAKVGVARQERKANESK-DAATIITTATVENTKVVETKEAGKT 116 KSK D +D +K+ + K ESK + A + E K V+ + GKT Sbjct: 939 KSKEDLQAVHDDYSSKLSELEKRVKLAESKAEKAEADALRSAETLKEVQA-QLGKT 993 >gi|32476774|ref|NP_869768.1| acriflavin resistance protein A [Rhodopirellula baltica SH 1] gi|32447320|emb|CAD77146.1| acriflavine resistance protein A [Precursor] [Rhodopirellula baltica SH 1] Length = 467 Score = 38.3 bits (87), Expect = 0.36, Method: Composition-based stats. Identities = 31/115 (26%), Positives = 58/115 (50%), Gaps = 11/115 (9%) Query: 3 ELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEK 62 E E A A+A + +E++ K + +A ++AR+N + +A+ +YK ++V + Sbjct: 108 EAELAAAQARLQQSESQFKQA--KAMTEVARANLLQSEAQLNLADVRYKRTQNLV----E 161 Query: 63 SKADAIKTMEDRKAKVGVARQE----RKANESKDAATIITTATVENTKV-VETKE 112 A + ++DR+A+ A+ + R + S +AA A +E K VET E Sbjct: 162 RNASSQDELDDREAEFLKAKADIEGVRASLNSSEAAIATAQAEIELAKAGVETAE 216 >gi|168700562|ref|ZP_02732839.1| hypothetical protein GobsU_13612 [Gemmata obscuriglobus UQM 2246] Length = 419 Score = 38.3 bits (87), Expect = 0.38, Method: Composition-based stats. Identities = 29/88 (32%), Positives = 44/88 (50%), Gaps = 6/88 (6%) Query: 11 ADAKIA--EAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMA----IEKSK 64 DAKIA +A K G AE+ + KDK L AEA+ K + + KS Sbjct: 130 GDAKIAKFDAAGKQLGETAELPHVAAAMKDKDALKTRAEAQLKKEKEQMAQSYGNARKSI 189 Query: 65 ADAIKTMEDRKAKVGVARQERKANESKD 92 A+ +K +ED+KA+ +ER+ + K+ Sbjct: 190 AEQLKRIEDKKAEERSKTEERQIEQFKN 217 >gi|302670141|ref|YP_003830101.1| ABC transporter permease [Butyrivibrio proteoclasticus B316] gi|302394614|gb|ADL33519.1| ABC transporter permease protein [Butyrivibrio proteoclasticus B316] Length = 1188 Score = 38.3 bits (87), Expect = 0.39, Method: Composition-based stats. Identities = 34/113 (30%), Positives = 52/113 (46%), Gaps = 12/113 (10%) Query: 3 ELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAI-- 60 +L AQAE D K EA+A+ + +AE + ++ DK I +AK Y S+ A Sbjct: 533 QLADAQAEYDRKEPEARAELASKEAEFEKQKAEGADK-----IKKAKGTYASNKAKAQEE 587 Query: 61 ----EKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVE 109 EK +A+ E++K K G + + E DA I A + K +E Sbjct: 588 LTSREKEFDEAVAEFEEKK-KDGEEQLSQARKEYADAKAEIAKALSDARKEIE 639 >gi|119485088|ref|ZP_01619473.1| Band 7 protein [Lyngbya sp. PCC 8106] gi|119457316|gb|EAW38441.1| Band 7 protein [Lyngbya sp. PCC 8106] Length = 520 Score = 37.9 bits (86), Expect = 0.43, Method: Composition-based stats. Identities = 31/106 (29%), Positives = 56/106 (52%), Gaps = 17/106 (16%) Query: 3 ELETAQAEADAKIAEAKAK------DSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDI 56 E+E A+AEA+ ++ +A K +S ++ ++AR+ A+ RI + + + ++DI Sbjct: 236 EIEVARAEAERRVKDAMTKRDAVIAESESEIASEVARTQAELPVQKARIIQVEQRLQADI 295 Query: 57 VM--------AIEKSKADAIKTMEDRKAKVGVARQERKANESKDAA 94 V AI ++K DA + +ED KA+ A ++ ES AA Sbjct: 296 VAPAEAECKRAIARAKGDAAQIIEDGKAR---AEGTQRLAESWKAA 338 >gi|119356503|ref|YP_911147.1| Smr protein/MutS2 [Chlorobium phaeobacteroides DSM 266] gi|119353852|gb|ABL64723.1| Smr protein/MutS2 [Chlorobium phaeobacteroides DSM 266] Length = 794 Score = 37.6 bits (85), Expect = 0.58, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 50/91 (54%), Gaps = 9/91 (9%) Query: 3 ELETAQAEADAKIAEAKAKDSG---NQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMA 59 +LE +A+ ++ +A+++G Q E+ + + K + A+ + R +IV Sbjct: 541 QLEGERADLAERVIALRAEEAGVERKQRELRLGAARELQK----EVEHARKEIR-EIVQE 595 Query: 60 IEKSKADAIKTMEDRKAKVGVARQERKANES 90 + + ADA KT++D + K+G+ +QE + +ES Sbjct: 596 VRNAPADA-KTVQDSRKKLGLKKQEAEKSES 625 >gi|320108783|ref|YP_004184373.1| hypothetical protein AciPR4_3626 [Terriglobus saanensis SP1PR4] gi|319927304|gb|ADV84379.1| hypothetical protein AciPR4_3626 [Terriglobus saanensis SP1PR4] Length = 427 Score = 37.2 bits (84), Expect = 0.68, Method: Composition-based stats. Identities = 28/104 (26%), Positives = 54/104 (51%), Gaps = 4/104 (3%) Query: 4 LETAQAEADAKIAEAKAKDSGNQAEIDIARSNAK---DKTDLVRIAEAKYKYRSDIVMAI 60 L A A DA+++E + + +G++AE D R++AK D+ D + A+A K +SD +A Sbjct: 162 LNAAIAVEDARLSEMEQQVAGSKAEADKLRADAKDAQDRLDALTAADATSKSQSDAQVAA 221 Query: 61 EKSKADAIKT-MEDRKAKVGVARQERKANESKDAATIITTATVE 103 + DA + D +A + E A ++ ++ A+++ Sbjct: 222 LTQERDATAAKLRDAQASYQTVQYELDALRNQHKQDLLHLASLD 265 >gi|159028037|emb|CAO87997.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 475 Score = 36.8 bits (83), Expect = 1.0, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 46/89 (51%), Gaps = 14/89 (15%) Query: 3 ELETAQAEADAKIAEAKAKDSGNQAEI------DIARSNAKDKTDLVRIAEAKYKYRSDI 56 +LE A+A+A+ ++ + + K AE+ D+A+ A+ RI + K + ++D+ Sbjct: 289 DLEIAKADAEKRVRDTQTKRGAMIAEVESVVMSDLAKVQAEVAVQTARIKQVKQQLQADV 348 Query: 57 V--------MAIEKSKADAIKTMEDRKAK 77 + AI K++ +A K +E KA+ Sbjct: 349 IAPAAAECQQAIAKARGEAAKIIEQGKAQ 377 >gi|197116844|ref|YP_002137271.1| TPR domain-containing protein [Geobacter bemidjiensis Bem] gi|197086204|gb|ACH37475.1| TPR domain protein [Geobacter bemidjiensis Bem] Length = 2741 Score = 36.8 bits (83), Expect = 1.1, Method: Composition-based stats. Identities = 28/82 (34%), Positives = 47/82 (57%), Gaps = 10/82 (12%) Query: 13 AKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADAIKTME 72 A++A+A D NQA+I +A +A+ K+ L AEA Y+ ++ ADA+ E Sbjct: 2189 AQLADAIG-DKVNQAKIHLALGDARAKSLLNAQAEASYRKALEL--------ADAMLVRE 2239 Query: 73 DR-KAKVGVARQERKANESKDA 93 R +A +G+AR +++ +SK A Sbjct: 2240 VRWRALLGLARLQQQGGDSKAA 2261 >gi|320164322|gb|EFW41221.1| predicted protein [Capsaspora owczarzaki ATCC 30864] Length = 828 Score = 36.4 bits (82), Expect = 1.2, Method: Composition-based stats. Identities = 31/93 (33%), Positives = 52/93 (55%), Gaps = 4/93 (4%) Query: 4 LETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKD-KTDLVRIAEAKYKYRSDIVMAIEK 62 LE+++A+ DAK+ AKA+ + AE+D RS + K VR E K R+++ A ++ Sbjct: 457 LESSRADMDAKLEIAKAELAQETAELDKLRSAVDEQKMTSVRQEEELRKLRNEVEQA-QR 515 Query: 63 SKADAIKTMEDRKAKVGVARQERKANESKDAAT 95 +A ++ D + K VA+ E K E+K A+ Sbjct: 516 EQA-RLREQLDTETKT-VAQLEAKIEEAKSNAS 546 >gi|159131497|gb|EDP56610.1| intracellular protein transport protein (UsoA), putative [Aspergillus fumigatus A1163] Length = 1061 Score = 36.4 bits (82), Expect = 1.2, Method: Composition-based stats. Identities = 30/116 (25%), Positives = 57/116 (49%), Gaps = 4/116 (3%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61 +EL+ A+ +A++A+ +AK Q+E+D A+ + + +R+ + S++ E Sbjct: 881 SELDKAKEGHEAELADLRAKAQTVQSELDTAKQEHETEISGLRVKAQSLQ--SELDSRTE 938 Query: 62 KSKADAIKTMEDRKAKVGVARQERKANESK-DAATIITTATVENTKVVETKEAGKT 116 +SK D +D +K+ + K ESK + A + E K V+ + GKT Sbjct: 939 RSKEDLQAVHDDYLSKLSELEKRVKLAESKAEKAEADALKSAETLKEVQAR-LGKT 993 >gi|253699111|ref|YP_003020300.1| hypothetical protein GM21_0462 [Geobacter sp. M21] gi|251773961|gb|ACT16542.1| Tetratricopeptide TPR_2 repeat protein [Geobacter sp. M21] Length = 2741 Score = 36.4 bits (82), Expect = 1.3, Method: Composition-based stats. Identities = 28/82 (34%), Positives = 47/82 (57%), Gaps = 10/82 (12%) Query: 13 AKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADAIKTME 72 A++A+A D NQA+I +A +A+ K+ L AEA Y+ + +DA+ E Sbjct: 2189 AELADAIG-DKVNQAKIHLALGDARAKSRLDAQAEASYRKALGL--------SDAMLVRE 2239 Query: 73 DR-KAKVGVARQERKANESKDA 93 R +A +G+AR +++A +SK A Sbjct: 2240 VRWRALLGLARLQQQAGDSKAA 2261 >gi|70995974|ref|XP_752742.1| intracellular protein transport protein (UsoA) [Aspergillus fumigatus Af293] gi|44889965|emb|CAD29605.2| transport protein, putative [Aspergillus fumigatus] gi|66850377|gb|EAL90704.1| intracellular protein transport protein (UsoA), putative [Aspergillus fumigatus Af293] Length = 1061 Score = 36.4 bits (82), Expect = 1.3, Method: Composition-based stats. Identities = 30/116 (25%), Positives = 57/116 (49%), Gaps = 4/116 (3%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61 +EL+ A+ +A++A+ +AK Q+E+D A+ + + +R+ + S++ E Sbjct: 881 SELDKAKEGHEAELADLRAKAQTVQSELDTAKQEHETEISGLRVKAQSLQ--SELDSRTE 938 Query: 62 KSKADAIKTMEDRKAKVGVARQERKANESK-DAATIITTATVENTKVVETKEAGKT 116 +SK D +D +K+ + K ESK + A + E K V+ + GKT Sbjct: 939 RSKEDLQAVHDDYLSKLSELEKRVKLAESKAEKAEADALKSAETLKEVQA-QLGKT 993 >gi|227833721|ref|YP_002835428.1| trigger factor [Corynebacterium aurimucosum ATCC 700975] gi|262184727|ref|ZP_06044148.1| trigger factor [Corynebacterium aurimucosum ATCC 700975] gi|254788997|sp|C3PI36|TIG_CORA7 RecName: Full=Trigger factor; Short=TF gi|227454737|gb|ACP33490.1| trigger factor [Corynebacterium aurimucosum ATCC 700975] Length = 451 Score = 36.0 bits (81), Expect = 1.5, Method: Composition-based stats. Identities = 28/100 (28%), Positives = 46/100 (46%), Gaps = 7/100 (7%) Query: 26 QAEIDIARSNAKDKTDLVRIAEAKYKYR-------SDIVMAIEKSKADAIKTMEDRKAKV 78 Q EIDIA+ D + + + ++ S V A+E S+ D K +ED ++ Sbjct: 90 QPEIDIAKLEDNDFVEFTAEVDIRPEFEVPDFSKISVKVPALETSEEDVDKALEDLASRF 149 Query: 79 GVARQERKANESKDAATIITTATVENTKVVETKEAGKTAR 118 G + ++ ++ D A I T V+ TK+ E G T R Sbjct: 150 GELKDTKRKMKTGDYAIIDITTEVDGTKLDEASHEGMTYR 189 >gi|78224167|ref|YP_385914.1| MutS 2 protein [Geobacter metallireducens GS-15] gi|78195422|gb|ABB33189.1| MutS 2 protein [Geobacter metallireducens GS-15] Length = 785 Score = 35.6 bits (80), Expect = 2.0, Method: Composition-based stats. Identities = 26/102 (25%), Positives = 53/102 (51%), Gaps = 6/102 (5%) Query: 1 MAELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVR--IAEAKYKYRSDIVM 58 ++ +ET E A++ + + + AE + R +A++K +VR +AEA+ K R Sbjct: 517 LSRMETEFHELLAELKDQRRRHEEALAEAERLRRDAEEKARIVRERLAEAEAKRRE---- 572 Query: 59 AIEKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTA 100 A+EK+ +A + + + +V +E + +S++A I A Sbjct: 573 AVEKAFQEAKEIVRSARREVNAIIEEARKEKSREARKKIDEA 614 >gi|121701239|ref|XP_001268884.1| intracellular protein transport protein (UsoA), putative [Aspergillus clavatus NRRL 1] gi|119397027|gb|EAW07458.1| intracellular protein transport protein (UsoA), putative [Aspergillus clavatus NRRL 1] Length = 1048 Score = 35.6 bits (80), Expect = 2.0, Method: Composition-based stats. Identities = 29/108 (26%), Positives = 52/108 (48%), Gaps = 14/108 (12%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61 +EL TA+ E + ++A+ KAK+ QAE + S+ + K +S++ A+E Sbjct: 881 SELATARKEHETEVADLKAKNETLQAEHNTKISDVQAKAQ---------SLQSELESAME 931 Query: 62 KSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVE 109 KSK D +D +K + N+++DA + + A + K E Sbjct: 932 KSKKDLQVLHDDYSSKC-----SKLENQAEDAKSRVKKAEADAHKSEE 974 >gi|225853742|ref|YP_002735254.1| hypothetical protein SPJ_0142 [Streptococcus pneumoniae JJA] gi|225722642|gb|ACO18495.1| conserved hypothetical protein [Streptococcus pneumoniae JJA] Length = 450 Score = 35.6 bits (80), Expect = 2.1, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 45/88 (51%), Gaps = 1/88 (1%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAK-YKYRSDIVMAI 60 A+LET +E KI K + +EID +SN KD ++ Y + +V + Sbjct: 217 AKLETKISEITTKIEALKTNITSKNSEIDSQQSNIKDMNRTYNDPTSQAYNIYAQLVSEL 276 Query: 61 EKSKADAIKTMEDRKAKVGVARQERKAN 88 ++++ K++ + +A +GVA + KA+ Sbjct: 277 GTARSNNNKSITELEANLGVATGQDKAH 304 >gi|121595243|ref|YP_987139.1| RND efflux system outer membrane lipoprotein [Acidovorax sp. JS42] gi|120607323|gb|ABM43063.1| RND efflux system, outer membrane lipoprotein, NodT family [Acidovorax sp. JS42] Length = 511 Score = 35.6 bits (80), Expect = 2.1, Method: Composition-based stats. Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 3/71 (4%) Query: 3 ELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEK 62 E E A+ A++A A + G QA++DI R N + +R+AE+ + R+ + E Sbjct: 176 EREAARVALSAEVARAYLQLRGAQAQLDITRQNLEVADRTLRLAES--RERNGVATRFET 233 Query: 63 SKADA-IKTME 72 S A A + T+E Sbjct: 234 SSARAQLATVE 244 >gi|168487067|ref|ZP_02711575.1| conserved hypothetical protein [Streptococcus pneumoniae CDC1087-00] gi|225857982|ref|YP_002739492.1| hypothetical protein SP70585_0182 [Streptococcus pneumoniae 70585] gi|183570031|gb|EDT90559.1| conserved hypothetical protein [Streptococcus pneumoniae CDC1087-00] gi|225722007|gb|ACO17861.1| conserved hypothetical protein [Streptococcus pneumoniae 70585] Length = 450 Score = 35.6 bits (80), Expect = 2.4, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 1/88 (1%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAK-YKYRSDIVMAI 60 A+LET +E KI K + +EID +SN KD ++ Y + ++ + Sbjct: 217 AKLETKISEITTKIEALKTNITSKNSEIDSQQSNIKDMNRTYNDPTSQAYNIYAQLISEL 276 Query: 61 EKSKADAIKTMEDRKAKVGVARQERKAN 88 ++++ K++ + +A +GVA + KA+ Sbjct: 277 GTARSNNNKSITELEANLGVATGQDKAH 304 >gi|148983506|ref|ZP_01816825.1| hypothetical protein CGSSp3BS71_05229 [Streptococcus pneumoniae SP3-BS71] gi|147923653|gb|EDK74765.1| hypothetical protein CGSSp3BS71_05229 [Streptococcus pneumoniae SP3-BS71] gi|301799286|emb|CBW31812.1| unnamed protein product [Streptococcus pneumoniae OXC141] Length = 450 Score = 35.6 bits (80), Expect = 2.4, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 1/88 (1%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAK-YKYRSDIVMAI 60 A+LET +E KI K + +EID +SN KD ++ Y + ++ + Sbjct: 217 AKLETKISEITTKIEALKTNITSKNSEIDSQQSNIKDMNRTYNDPTSQAYNIYAQLISEL 276 Query: 61 EKSKADAIKTMEDRKAKVGVARQERKAN 88 ++++ K++ + +A +GVA + KA+ Sbjct: 277 GTARSNNNKSITELEANLGVATGQDKAH 304 >gi|89100461|ref|ZP_01173323.1| hypothetical protein B14911_05506 [Bacillus sp. NRRL B-14911] gi|89084804|gb|EAR63943.1| hypothetical protein B14911_05506 [Bacillus sp. NRRL B-14911] Length = 332 Score = 35.6 bits (80), Expect = 2.4, Method: Composition-based stats. Identities = 21/50 (42%), Positives = 29/50 (58%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYK 51 AEL+T QAEAD KIA+AKA++ A AK + ++ EA+ K Sbjct: 234 AELQTEQAEADKKIAQAKAEERRAMAVAQEQEMKAKVEEMRAKVVEAEAK 283 >gi|83649547|ref|YP_437982.1| outer membrane protein [Hahella chejuensis KCTC 2396] gi|83637590|gb|ABC33557.1| Outer membrane protein [Hahella chejuensis KCTC 2396] Length = 480 Score = 35.6 bits (80), Expect = 2.5, Method: Composition-based stats. Identities = 26/88 (29%), Positives = 44/88 (50%), Gaps = 5/88 (5%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61 A L Q E A++ A G QA++ +A+ N ++ +V + EAKYK + + Sbjct: 159 AALRNVQVEIIAEVVRVYADLRGAQAQVQVAQRNLENLKSVVDLTEAKYKSGVGAELDVV 218 Query: 62 KSKAD---AIKTMEDRKAKVGVARQERK 86 +SKA A T+ +A+ +AR E + Sbjct: 219 RSKAQYAGAQATLAPLQAR--IARDEYR 244 >gi|31414576|dbj|BAC77268.1| UsoAp [Emericella nidulans] Length = 1103 Score = 35.3 bits (79), Expect = 2.9, Method: Composition-based stats. Identities = 32/128 (25%), Positives = 62/128 (48%), Gaps = 12/128 (9%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61 +E+E+ + + ++++ K G Q+E+D AR + ++++V++ A R+++ E Sbjct: 928 SEVESTKEQRANEVSDLHKKIQGLQSELDSARK--QHESEVVKLESANETVRAELNTVKE 985 Query: 62 KSKADAIKTMEDRKAKVGV----ARQ-----ERKANESKDAATIITTATVENTKVV-ETK 111 +S D ED +K A+Q ER E++ AA + A K + ETK Sbjct: 986 QSTQDLEAVREDYSSKCSALENRAQQAESEVERLEAEARKAAHALEEAQKALEKALQETK 1045 Query: 112 EAGKTARS 119 E + +S Sbjct: 1046 EKEEARQS 1053 >gi|91205782|ref|YP_538137.1| multidrug resistance protein A [Rickettsia bellii RML369-C] gi|91069326|gb|ABE05048.1| Multidrug resistance protein A [Rickettsia bellii RML369-C] Length = 353 Score = 35.3 bits (79), Expect = 2.9, Method: Composition-based stats. Identities = 30/111 (27%), Positives = 54/111 (48%), Gaps = 6/111 (5%) Query: 6 TAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLV-RIAEAKYKYRSDIVMAIEKSK 64 T A DA I++ A+ +G + + + +K DL+ I + YK + + A+E S Sbjct: 45 TDNAYIDADISDVSAEINGVLTNLLVTNNTKVNKGDLIGEIDDRDYKAK---LAALEASI 101 Query: 65 ADAIKTME--DRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEA 113 + K +E D+K +G + E+ + K AAT + + T+V E +A Sbjct: 102 GASEKNIEIIDQKISIGQNQLEQSGEKLKLAATSFNIVSTDFTRVQELNKA 152 >gi|168489411|ref|ZP_02713610.1| conserved hypothetical protein [Streptococcus pneumoniae SP195] gi|183572030|gb|EDT92558.1| conserved hypothetical protein [Streptococcus pneumoniae SP195] Length = 449 Score = 35.3 bits (79), Expect = 3.0, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 1/88 (1%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAK-YKYRSDIVMAI 60 A+LET +E KI K + +EID +SN KD ++ Y + ++ + Sbjct: 217 AKLETKISEITTKIEALKTNITSKNSEIDSQQSNIKDMNRTYNNPTSQAYNIYAQLISEL 276 Query: 61 EKSKADAIKTMEDRKAKVGVARQERKAN 88 ++++ K++ + +A +GVA + KA+ Sbjct: 277 GTARSNNNKSITELEANLGVATGQDKAH 304 >gi|148994314|ref|ZP_01823578.1| hypothetical protein CGSSp9BS68_07267 [Streptococcus pneumoniae SP9-BS68] gi|147927344|gb|EDK78376.1| hypothetical protein CGSSp9BS68_07267 [Streptococcus pneumoniae SP9-BS68] gi|332075786|gb|EGI86253.1| hypothetical protein SPAR50_0121 [Streptococcus pneumoniae GA17570] Length = 450 Score = 35.3 bits (79), Expect = 3.1, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 1/88 (1%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAK-YKYRSDIVMAI 60 A+LET +E KI K + +EID +SN KD ++ Y + ++ + Sbjct: 217 AKLETKISEITTKIEALKTNITSKNSEIDSQQSNIKDMNRTYNNPTSQAYNIYAQLISEL 276 Query: 61 EKSKADAIKTMEDRKAKVGVARQERKAN 88 ++++ K++ + +A +GVA + KA+ Sbjct: 277 GTARSNNNKSITELEANLGVATGQDKAH 304 >gi|171912957|ref|ZP_02928427.1| band 7 protein [Verrucomicrobium spinosum DSM 4136] Length = 485 Score = 35.3 bits (79), Expect = 3.3, Method: Composition-based stats. Identities = 39/121 (32%), Positives = 64/121 (52%), Gaps = 14/121 (11%) Query: 2 AELETAQAEADAKI----AEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIV 57 E+ AQA+ + KI A+A+A N A +DIA SNA LV+ AEA + + Sbjct: 217 GEIGRAQAQKEQKIVVAQAQAEATTGENLAAVDIANSNA---NRLVQEAEANRQAEAAQN 273 Query: 58 MAIEKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEAGKTA 117 +A + + +A + R+A+ VAR E+ +++ A+++ A VE + +ET A A Sbjct: 274 VANARVQQEAY--LAQREAE--VARAEK--DKASQYASVVVPAEVEKLR-METIAAADAA 326 Query: 118 R 118 R Sbjct: 327 R 327 >gi|281357218|ref|ZP_06243707.1| phage tail tape measure protein, TP901 family [Victivallis vadensis ATCC BAA-548] gi|281316249|gb|EFB00274.1| phage tail tape measure protein, TP901 family [Victivallis vadensis ATCC BAA-548] Length = 898 Score = 34.9 bits (78), Expect = 3.6, Method: Composition-based stats. Identities = 31/118 (26%), Positives = 55/118 (46%), Gaps = 12/118 (10%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61 AE+ A++E + + E K + + A+ID A+ ++ T+ R AE ++ A+ Sbjct: 786 AEINAAKSEWQSAMDEVKQRAAEKAAQIDEAKEKSEAATENTRNAETRFNSSFGGGKAVG 845 Query: 62 KSKADAIKTMEDRKAKVGVAR--QERKANESKDAATIITTATVENTKVVETKEAGKTA 117 A+A+ M +G A QER A S+ I + T E + ++ + G TA Sbjct: 846 AWSAEALDAM------LGGANNAQERTARASEQ----IVSNTRETNRQIKKLQGGSTA 893 >gi|15902152|ref|NP_357702.1| hypothetical protein spr0108 [Streptococcus pneumoniae R6] gi|116516521|ref|YP_815617.1| hypothetical protein SPD_0115 [Streptococcus pneumoniae D39] gi|15457645|gb|AAK98912.1| Conserved hypothetical protein [Streptococcus pneumoniae R6] gi|116077097|gb|ABJ54817.1| conserved hypothetical protein [Streptococcus pneumoniae D39] Length = 450 Score = 34.9 bits (78), Expect = 4.1, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 44/88 (50%), Gaps = 1/88 (1%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAK-YKYRSDIVMAI 60 A+LET E KI K + +EID +SN KD ++ Y + +V + Sbjct: 217 AKLETKILEITTKIEALKTNITSKNSEIDSQQSNIKDMNRTYNDPTSQAYNIYAQLVSEL 276 Query: 61 EKSKADAIKTMEDRKAKVGVARQERKAN 88 ++++ K++ + +A +GVA + KA+ Sbjct: 277 GTARSNNNKSITELEANLGVATGQDKAH 304 >gi|75907620|ref|YP_321916.1| hypothetical protein Ava_1398 [Anabaena variabilis ATCC 29413] gi|75701345|gb|ABA21021.1| Band 7 protein [Anabaena variabilis ATCC 29413] Length = 422 Score = 34.9 bits (78), Expect = 4.2, Method: Composition-based stats. Identities = 30/106 (28%), Positives = 55/106 (51%), Gaps = 16/106 (15%) Query: 3 ELETAQAEADAKIAEAKAKDSGNQAEID------IARSNAKDKTDLVRIAEAKYKYRSDI 56 +L+ A+AEA+ ++ +A K + AE++ IA+ A+ RI + + + ++DI Sbjct: 236 DLQIAKAEAERRVRDAITKRTAVIAEVESVVNSQIAKVQAEVAVQTERIIQVENQLQADI 295 Query: 57 V--------MAIEKSKADAIKTMEDRKAKVGVARQERKANESKDAA 94 V AI ++K DA K +E+ KA+ A +R A ++A Sbjct: 296 VAPAEAECQTAIAQAKGDAAKIIEEGKAQ--AAGTQRLAESWQNAG 339 >gi|19703613|ref|NP_603175.1| DNA repair protein recN [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] gi|19713719|gb|AAL94474.1| DNA repair protein recN [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] Length = 558 Score = 34.9 bits (78), Expect = 4.2, Method: Composition-based stats. Identities = 25/83 (30%), Positives = 38/83 (45%), Gaps = 2/83 (2%) Query: 22 DSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADAIK--TMEDRKAKVG 79 DSG+ ++ + K K + +IAE R +I + IE + +K MED K KV Sbjct: 342 DSGDFKTKELKKELNKIKDEYDKIAEKLTNSRKEIAVKIENELLNELKFLNMEDAKLKVQ 401 Query: 80 VARQERKANESKDAATIITTATV 102 + + ER NE D + V Sbjct: 402 INKLERMTNEGYDDVEFFISTNV 424 >gi|120402439|ref|YP_952268.1| hypothetical protein Mvan_1428 [Mycobacterium vanbaalenii PYR-1] gi|119955257|gb|ABM12262.1| band 7 protein [Mycobacterium vanbaalenii PYR-1] Length = 477 Score = 34.9 bits (78), Expect = 4.3, Method: Composition-based stats. Identities = 34/124 (27%), Positives = 57/124 (45%), Gaps = 13/124 (10%) Query: 2 AELETAQAEADAKIAEAKAKDSG--NQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMA 59 A + TA+AE DA+I AKA+ +G QAE D A + A K D V +A + + ++ A Sbjct: 200 AAVGTAEAERDAQIQSAKARQAGAVAQAEADTAIATANQKRD-VELARLRAQTEAENAQA 258 Query: 60 ----------IEKSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVE 109 +K AI+ E + + + ++R+A +S+ A A E + + Sbjct: 259 DQAGPLANARAQKDVGIAIEQAEAARVQARIEVEQRRAEQSQAALQADVIAPAEAQRAAD 318 Query: 110 TKEA 113 A Sbjct: 319 IARA 322 >gi|237745213|ref|ZP_04575694.1| DNA repair protein recN [Fusobacterium sp. 7_1] gi|229432442|gb|EEO42654.1| DNA repair protein recN [Fusobacterium sp. 7_1] Length = 558 Score = 34.9 bits (78), Expect = 4.3, Method: Composition-based stats. Identities = 23/83 (27%), Positives = 40/83 (48%), Gaps = 2/83 (2%) Query: 22 DSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADAIK--TMEDRKAKVG 79 DSG+ ++ + AK K++ ++AE R +I + IE + +K MED K KV Sbjct: 342 DSGDFKTRELKQELAKIKSEYDKLAEKLSNLRKEIAVKIENELLNELKFLNMEDAKLKVQ 401 Query: 80 VARQERKANESKDAATIITTATV 102 + + E+ N+ D + V Sbjct: 402 INKIEKMTNDGYDEVEFFISTNV 424 >gi|254302670|ref|ZP_04970028.1| DNA repair protein RecN [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] gi|148322862|gb|EDK88112.1| DNA repair protein RecN [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] Length = 553 Score = 34.9 bits (78), Expect = 4.3, Method: Composition-based stats. Identities = 24/83 (28%), Positives = 39/83 (46%), Gaps = 2/83 (2%) Query: 22 DSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADAIK--TMEDRKAKVG 79 DSG+ ++ + K KT+ +IAE R +I + IE + +K MED K KV Sbjct: 337 DSGDFKTKELKKELNKIKTEYDKIAEKLTNSRKEIAVKIENELLNELKFLNMEDAKLKVQ 396 Query: 80 VARQERKANESKDAATIITTATV 102 + + E+ N+ D + V Sbjct: 397 INKLEKMTNDGYDEVEFFISTNV 419 >gi|67470334|ref|XP_651135.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS] gi|56467830|gb|EAL45749.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS] Length = 2089 Score = 34.5 bits (77), Expect = 4.4, Method: Composition-based stats. Identities = 26/86 (30%), Positives = 53/86 (61%), Gaps = 7/86 (8%) Query: 34 SNAKDKTD-LVRIAEAKYKYRSDI---VMAIEKSKADAIKTMEDRKAKVGVARQ--ERKA 87 +N+++KT +I+E K K + V ++K D+ +T++++K K G+A++ +K Sbjct: 616 NNSENKTQKTTQISETKKKINHSVKKKVSELKKENKDS-QTIQEKKTKEGIAKKIHLKKV 674 Query: 88 NESKDAATIITTATVENTKVVETKEA 113 N+ K+ I+ + ++ TK++ETKEA Sbjct: 675 NQQKEIDQPISESEIKPTKLLETKEA 700 >gi|149001800|ref|ZP_01826773.1| hypothetical protein CGSSp14BS69_08730 [Streptococcus pneumoniae SP14-BS69] gi|237650847|ref|ZP_04525099.1| hypothetical protein SpneC1_09049 [Streptococcus pneumoniae CCRI 1974] gi|237821338|ref|ZP_04597183.1| hypothetical protein SpneC19_03302 [Streptococcus pneumoniae CCRI 1974M2] gi|147760258|gb|EDK67247.1| hypothetical protein CGSSp14BS69_08730 [Streptococcus pneumoniae SP14-BS69] Length = 450 Score = 34.5 bits (77), Expect = 4.6, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 44/88 (50%), Gaps = 1/88 (1%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAK-YKYRSDIVMAI 60 A+LET E KI K + +EID +SN KD ++ Y + +V + Sbjct: 217 AKLETKILEITTKIEALKTNITSKNSEIDSQQSNIKDMNRTYNDPTSQAYNIYAQLVSEL 276 Query: 61 EKSKADAIKTMEDRKAKVGVARQERKAN 88 ++++ K++ + +A +GVA + KA+ Sbjct: 277 GTARSNNNKSITELEANLGVATGQDKAH 304 >gi|148988834|ref|ZP_01820249.1| choline binding protein A [Streptococcus pneumoniae SP6-BS73] gi|147925645|gb|EDK76721.1| choline binding protein A [Streptococcus pneumoniae SP6-BS73] Length = 1008 Score = 34.5 bits (77), Expect = 4.7, Method: Composition-based stats. Identities = 37/133 (27%), Positives = 66/133 (49%), Gaps = 17/133 (12%) Query: 3 ELETAQAEADAKIAE-----AKAKDSGNQAEIDIARSNAKDK----TDLVRIA----EAK 49 ELE A+++ + K AE KAK+S ++ +I A + + K T L +I EAK Sbjct: 179 ELEIAESDVEVKKAELELVKVKAKESQDEEKIKQAEAEVESKQAEATRLKKIKTDREEAK 238 Query: 50 YKYRSDIVMAIEKSKADAIKTMEDRKAKVGV----ARQERKANESKDAATIITTATVENT 105 K + + A+EK+ A + + R+AK GV A ++K N++K + + + T+ + Sbjct: 239 RKADAKLKEAVEKNVATSEQDKPKRRAKRGVSGELATPDKKENDAKSSDSSVGEETLPSP 298 Query: 106 KVVETKEAGKTAR 118 + E+ R Sbjct: 299 SLNMANESQTEHR 311 >gi|67516849|ref|XP_658310.1| hypothetical protein AN0706.2 [Aspergillus nidulans FGSC A4] gi|40746326|gb|EAA65482.1| hypothetical protein AN0706.2 [Aspergillus nidulans FGSC A4] gi|259489020|tpe|CBF88948.1| TPA: hypothetical protein similar to UsoAp (Eurofung) [Aspergillus nidulans FGSC A4] Length = 1041 Score = 34.5 bits (77), Expect = 4.7, Method: Composition-based stats. Identities = 32/128 (25%), Positives = 62/128 (48%), Gaps = 12/128 (9%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61 +E+E+ + + ++++ K G Q+E+D AR + ++++V++ A R+++ E Sbjct: 866 SEVESTKEQRANEVSDLHKKIQGLQSELDSARK--QHESEVVKLESANETVRAELNTVKE 923 Query: 62 KSKADAIKTMEDRKAKVGV----ARQ-----ERKANESKDAATIITTATVENTKVV-ETK 111 +S D ED +K A+Q ER E++ AA + A K + ETK Sbjct: 924 QSTQDLEAVREDYSSKCSALENRAQQAESEVERLEAEARKAAHALEEAQKALEKALQETK 983 Query: 112 EAGKTARS 119 E + +S Sbjct: 984 EKEEARQS 991 >gi|158520035|ref|YP_001527905.1| hypothetical protein Dole_0018 [Desulfococcus oleovorans Hxd3] gi|254799454|sp|A8ZRR1|Y018_DESOH RecName: Full=UPF0365 protein Dole_0018 gi|158508861|gb|ABW65828.1| protein of unknown function DUF1432 [Desulfococcus oleovorans Hxd3] Length = 326 Score = 34.5 bits (77), Expect = 4.7, Method: Composition-based stats. Identities = 23/61 (37%), Positives = 33/61 (54%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61 AELET +AEAD KIA+AKA++ A A+ + ++ EA+ K + A E Sbjct: 231 AELETDRAEADKKIAQAKAEERRAMAYAREQEMKAQVEEMRAKVVEAEAKIPLAMANAFE 290 Query: 62 K 62 K Sbjct: 291 K 291 >gi|166366427|ref|YP_001658700.1| hypothetical protein MAE_36860 [Microcystis aeruginosa NIES-843] gi|166088800|dbj|BAG03508.1| hypothetical protein MAE_36860 [Microcystis aeruginosa NIES-843] Length = 427 Score = 34.5 bits (77), Expect = 4.8, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 46/89 (51%), Gaps = 14/89 (15%) Query: 3 ELETAQAEADAKIAEAKAKDSGNQAEI------DIARSNAKDKTDLVRIAEAKYKYRSDI 56 +LE A+A+A+ ++ + + K AE+ D+A+ A+ RI + K + ++D+ Sbjct: 236 DLEIAKADAEKRVRDTQTKRGAMIAEVESVVMSDLAKVQAEVAVQNARIKQVKQQLQADV 295 Query: 57 V--------MAIEKSKADAIKTMEDRKAK 77 + AI K++ +A K +E KA+ Sbjct: 296 IAPAAAECQQAIAKARGEAAKIIEQGKAQ 324 Score = 34.1 bits (76), Expect = 5.7, Method: Composition-based stats. Identities = 30/108 (27%), Positives = 50/108 (46%), Gaps = 16/108 (14%) Query: 5 ETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSK 64 + A+ + DA+IAEAKA+ + S KD +L A + I +E +K Sbjct: 198 QKAELQRDARIAEAKARKT----------SIIKDSENLRLTALRR------IQKDLEIAK 241 Query: 65 ADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKE 112 ADA K + D + K G E ++ D A + V+N ++ + K+ Sbjct: 242 ADAEKRVRDTQTKRGAMIAEVESVVMSDLAKVQAEVAVQNARIKQVKQ 289 >gi|315150240|gb|EFT94256.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecalis TX0012] Length = 711 Score = 34.5 bits (77), Expect = 5.0, Method: Composition-based stats. Identities = 24/85 (28%), Positives = 42/85 (49%), Gaps = 4/85 (4%) Query: 25 NQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADAIKTMEDRKAKVGVARQE 84 N AE+ + + N + + + +I E D +M K+K + I T +R+ +V + + Sbjct: 564 NGAEVALPKFNNRGEDFVYQIEEVNVPAEFDSIMT--KTKDEFILT--NRRKEVVSSTTD 619 Query: 85 RKANESKDAATIITTATVENTKVVE 109 NES + T T ++ E T VVE Sbjct: 620 TSTNESLSSETATTNSSAEKTIVVE 644 >gi|325478751|gb|EGC81862.1| RecF/RecN/SMC N-terminal domain protein [Anaerococcus prevotii ACS-065-V-Col13] Length = 1144 Score = 34.5 bits (77), Expect = 5.2, Method: Composition-based stats. Identities = 36/113 (31%), Positives = 56/113 (49%), Gaps = 12/113 (10%) Query: 4 LETAQAEADAKIAEAKA-KDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEK 62 LE E A+IA+ K + Q +I I+ + + + DLV+I+E Y+Y + +K Sbjct: 321 LEEKLDEIKAQIADNKELSNKLEQDKIAISDNKKRVEVDLVKISEEIYQYEIN-----QK 375 Query: 63 SKADAIKTMEDRKAKVGVARQERKANESKDAAT--IITTATVENTKVVETKEA 113 SKA ++ RKAK R E+ N SK+ I + +E K +E K A Sbjct: 376 SKA----ILDKRKAKEDEIRIEKLNNLSKEIEKLEIDRDSLIEEIKTLEEKNA 424 >gi|257415126|ref|ZP_05592120.1| predicted protein [Enterococcus faecalis AR01/DG] gi|257156954|gb|EEU86914.1| predicted protein [Enterococcus faecalis ARO1/DG] Length = 711 Score = 34.5 bits (77), Expect = 5.5, Method: Composition-based stats. Identities = 24/85 (28%), Positives = 42/85 (49%), Gaps = 4/85 (4%) Query: 25 NQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKADAIKTMEDRKAKVGVARQE 84 N AE+ + + N + + + +I E D +M K+K + I T +R+ +V + + Sbjct: 564 NGAEVALPKFNNRGEDFVYQIEEVNVPAEFDSIMT--KTKDEFILT--NRRKEVVSSTTD 619 Query: 85 RKANESKDAATIITTATVENTKVVE 109 NES + T T ++ E T VVE Sbjct: 620 TSTNESLSSETATTNSSAEKTIVVE 644 >gi|17556907|ref|NP_498856.1| UBX-containing protein in Nematode family member (ubxn-4) [Caenorhabditis elegans] gi|466104|sp|P34631|UBXN4_CAEEL RecName: Full=UBX domain-containing protein 4 gi|289758|gb|AAA28196.1| Ubx-containing protein in nematodes protein 4, confirmed by transcript evidence [Caenorhabditis elegans] Length = 469 Score = 34.1 bits (76), Expect = 5.9, Method: Composition-based stats. Identities = 29/118 (24%), Positives = 53/118 (44%), Gaps = 6/118 (5%) Query: 3 ELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSD-IVMAIE 61 EL A A A + + K KD+ + E D K ++ + EAK + ++ +V A + Sbjct: 168 ELAEKVARAKALLEQKKQKDAEKKREAD-----KHVKEEMTKAREAKQERDAEALVKAAK 222 Query: 62 KSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEAGKTARS 119 + K + + D+K + + +R+A + K + T ENT+ + GK S Sbjct: 223 QRKMEKLAAESDKKRILAQIKADREAAQKKFGKLVNTENASENTEKKQETTVGKAVPS 280 >gi|297570639|ref|YP_003696413.1| Exo-alpha-sialidase [Arcanobacterium haemolyticum DSM 20595] gi|296930986|gb|ADH91794.1| Exo-alpha-sialidase [Arcanobacterium haemolyticum DSM 20595] Length = 885 Score = 34.1 bits (76), Expect = 5.9, Method: Composition-based stats. Identities = 33/112 (29%), Positives = 54/112 (48%), Gaps = 6/112 (5%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61 A+ ET +AE K K DS N A++++A++ + + + EAK A E Sbjct: 544 ADSETTRAE---KARLQKEFDSIN-AKLEVAKAEKQRVASDLAVKEAKLTESEKQKAAAE 599 Query: 62 KSKADAIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEA 113 K A+ K +ED AKV A ER +++ ++A + E+ K +E E Sbjct: 600 KKVAEQHKQIEDLNAKVKKAESER--DQATESAKNLEEKAKEDAKSIEGLEG 649 >gi|49481625|ref|YP_038596.1| cell surface protein [Bacillus thuringiensis serovar konkukian str. 97-27] gi|49333181|gb|AAT63827.1| conserved hypothetical protein, possible cell surface protein [Bacillus thuringiensis serovar konkukian str. 97-27] Length = 953 Score = 34.1 bits (76), Expect = 6.0, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 48/111 (43%), Gaps = 17/111 (15%) Query: 11 ADAKIAEAKAKDSGNQAEIDI----ARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKAD 66 ADAK+ K + + ++ A+ NAK K D I E Y + D+ + + +K Sbjct: 806 ADAKVVSEDKKANTRVVQFEVSDLFAKLNAKVKVD---IDEMNYHHFYDVQIQFDTTKIG 862 Query: 67 AIKTMEDRKAKVGVARQERKANESKDAATIITTATVENTKVVETKEAGKTA 117 A VG ++E K + + +TT V+N K V T + + A Sbjct: 863 A----------VGTVKEEPKNDPKNEPKNPVTTPKVDNVKTVGTPDFNRNA 903 >gi|167574255|ref|ZP_02367129.1| gp14 [Burkholderia oklahomensis C6786] Length = 1354 Score = 34.1 bits (76), Expect = 6.9, Method: Composition-based stats. Identities = 26/80 (32%), Positives = 43/80 (53%), Gaps = 7/80 (8%) Query: 12 DAKIAEAKAKDSGNQAEIDIARSNAKDKTDL-VRIAEAKYKYRSDIVMAIEKSKADAIKT 70 +AK+AEA+A ++ QA + ARSN + ++ RIA Y I + A A + Sbjct: 401 EAKLAEARAVEASAQAHVATARSNLANSQEIGTRIAGLPY------AAIIARETAAAQQE 454 Query: 71 MEDRKAKVGVARQERKANES 90 +E +A + +A+Q R A E+ Sbjct: 455 LERAEASLALAQQRRTALEA 474 >gi|288555735|ref|YP_003427670.1| hypothetical protein BpOF4_13635 [Bacillus pseudofirmus OF4] gi|288546895|gb|ADC50778.1| hypothetical protein BpOF4_13635 [Bacillus pseudofirmus OF4] Length = 334 Score = 34.1 bits (76), Expect = 7.2, Method: Composition-based stats. Identities = 27/98 (27%), Positives = 44/98 (44%), Gaps = 8/98 (8%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61 AEL+T QAEAD KIA+AKA++ A A+ + ++ EA+ + + A+ Sbjct: 235 AELQTDQAEADKKIAQAKAEERRAMAVAQEQEMKARVEEMRAKVVEAEAEVPMALSDALR 294 Query: 62 KSKADAIKTME--------DRKAKVGVARQERKANESK 91 K + M D + + A E +NE + Sbjct: 295 KGNMGVMDYMNYQNVMADTDMRGSISKATGEDDSNEKR 332 >gi|8163699|gb|AAF73809.1|AF154037_1 surface protein PspC [Streptococcus pneumoniae] Length = 929 Score = 33.7 bits (75), Expect = 7.6, Method: Composition-based stats. Identities = 37/133 (27%), Positives = 66/133 (49%), Gaps = 17/133 (12%) Query: 3 ELETAQAEADAKIAE-----AKAKDSGNQAEIDIARSNAKDK----TDLVRIA----EAK 49 ELE A+++ + K AE KAK+S ++ +I A + + K T L +I EAK Sbjct: 179 ELEIAESDVEVKKAELELVKVKAKESQDEEKIKQAEAEVESKQAEATRLKKIKTDREEAK 238 Query: 50 YKYRSDIVMAIEKSKADAIKTMEDRKAKVGV----ARQERKANESKDAATIITTATVENT 105 K + + A+EK+ A + + R+AK GV A ++K N++K + + + T+ + Sbjct: 239 RKADAKLKEAVEKNVATSEQDKPKRRAKRGVSGELATPDKKENDAKSSDSSVGEETLPSP 298 Query: 106 KVVETKEAGKTAR 118 + E+ R Sbjct: 299 SLNMANESQTEHR 311 >gi|77864693|ref|YP_355403.1| gp68 [Burkholderia phage Bcep176] gi|161520424|ref|YP_001583851.1| phage tape measure protein [Burkholderia multivorans ATCC 17616] gi|189353385|ref|YP_001949012.1| bacteriophage tape measure protein [Burkholderia multivorans ATCC 17616] gi|76885879|gb|ABA60069.1| gp68 [Burkholderia phage Bcep176] gi|160344474|gb|ABX17559.1| phage tape measure protein [Burkholderia multivorans ATCC 17616] gi|189337407|dbj|BAG46476.1| bacteriophage tape measure protein [Burkholderia multivorans ATCC 17616] Length = 1380 Score = 33.7 bits (75), Expect = 8.4, Method: Composition-based stats. Identities = 28/87 (32%), Positives = 44/87 (50%), Gaps = 7/87 (8%) Query: 12 DAKIAEAKAKDSGNQAEIDIARSNAKDKTDL-VRIAEAKYKYRSDIVMAIEKSKADAIKT 70 DAK+AEA+A ++ A++ ARSN + ++ RIA Y I + A A Sbjct: 401 DAKLAEARAIEASAVAQVATARSNLANSQEIGTRIAGTPY------AAVIARETAAAQGE 454 Query: 71 MEDRKAKVGVARQERKANESKDAATII 97 +E +A + +A+Q R A E+ A I Sbjct: 455 LERAEASLALAQQRRVALEAAAAKGTI 481 >gi|167566450|ref|ZP_02359366.1| gp14 [Burkholderia oklahomensis EO147] Length = 1354 Score = 33.7 bits (75), Expect = 8.5, Method: Composition-based stats. Identities = 26/81 (32%), Positives = 44/81 (54%), Gaps = 7/81 (8%) Query: 11 ADAKIAEAKAKDSGNQAEIDIARSNAKDKTDL-VRIAEAKYKYRSDIVMAIEKSKADAIK 69 ++AK+AEA+A ++ QA + ARSN + ++ RIA Y I + A A + Sbjct: 400 SEAKLAEARAVEASAQAHVATARSNLANSQEIGTRIAGLPY------AAIIARETAAAQQ 453 Query: 70 TMEDRKAKVGVARQERKANES 90 +E +A + +A+Q R A E+ Sbjct: 454 ELERAEASLALAQQRRTALEA 474 >gi|312144008|ref|YP_003995454.1| hypothetical protein Halsa_1677 [Halanaerobium sp. 'sapolanicus'] gi|311904659|gb|ADQ15100.1| hypothetical protein Halsa_1677 [Halanaerobium sp. 'sapolanicus'] Length = 327 Score = 33.7 bits (75), Expect = 8.7, Method: Composition-based stats. Identities = 24/95 (25%), Positives = 45/95 (47%), Gaps = 6/95 (6%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIE 61 A+L+T QAEAD +IA+AKA++ A + A+ + ++ EA+ + + A++ Sbjct: 230 AQLQTDQAEADKEIAQAKAEERRAMAVAEEQEMKARVQEMRAKVVEAEAQVPLAMAEALK 289 Query: 62 KSKADAIKTM------EDRKAKVGVARQERKANES 90 + M D K + ++ K NE+ Sbjct: 290 NGNLGVMDYMNLKNIESDTKMRSSISESSEKNNEN 324 >gi|319651603|ref|ZP_08005730.1| hypothetical protein HMPREF1013_02342 [Bacillus sp. 2_A_57_CT2] gi|317396670|gb|EFV77381.1| hypothetical protein HMPREF1013_02342 [Bacillus sp. 2_A_57_CT2] Length = 332 Score = 33.7 bits (75), Expect = 8.9, Method: Composition-based stats. Identities = 19/48 (39%), Positives = 28/48 (58%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAK 49 AEL+T QAEAD KIA+AKA++ A A+ + ++ EA+ Sbjct: 234 AELQTEQAEADKKIAQAKAEERRAMAVAQEQEMKARVEEMRAKVVEAE 281 >gi|4097980|gb|AAD00184.1| surface protein C [Streptococcus pneumoniae] Length = 929 Score = 33.7 bits (75), Expect = 9.0, Method: Composition-based stats. Identities = 37/133 (27%), Positives = 66/133 (49%), Gaps = 17/133 (12%) Query: 3 ELETAQAEADAKIAE-----AKAKDSGNQAEIDIARSNAKDK----TDLVRIA----EAK 49 ELE A+++ + K AE KAK+S ++ +I A + + K T L +I EAK Sbjct: 179 ELEIAESDVEVKKAELELVKVKAKESQDEEKIKQAEAEVESKQAEATRLKKIKTDREEAK 238 Query: 50 YKYRSDIVMAIEKSKADAIKTMEDRKAKVGV----ARQERKANESKDAATIITTATVENT 105 K + + A+EK+ A + + R+AK GV A ++K N++K + + + T+ + Sbjct: 239 RKADAKLKEAVEKNVATSEQDKPKRRAKRGVSGELATPDKKENDAKSSDSSVGEETLPSP 298 Query: 106 KVVETKEAGKTAR 118 + E+ R Sbjct: 299 SLNMANESQTEHR 311 >gi|260584545|ref|ZP_05852291.1| YSIRK type signal peptide [Granulicatella elegans ATCC 700633] gi|260157568|gb|EEW92638.1| YSIRK type signal peptide [Granulicatella elegans ATCC 700633] Length = 2668 Score = 33.7 bits (75), Expect = 9.6, Method: Composition-based stats. Identities = 33/118 (27%), Positives = 53/118 (44%), Gaps = 21/118 (17%) Query: 6 TAQAEADAKIAEAKAKDSGNQAEIDIARSNAKDKTDLVRIAEAKYKYRSDIVMAIEKSKA 65 TA+ A+ K+ AK + + +AE+D A++ D EAK K D +A EK+ Sbjct: 991 TAEDLAEKKLKSAKEQQASLKAELDKAKAALPD-------VEAKVKSARDEALAAEKA-- 1041 Query: 66 DAIKTMEDRKAKVGVARQERKANESKDAATI-------ITTATVENTKVVETKEAGKT 116 +E + + A ++ AN A T+ +T V++ VV T GKT Sbjct: 1042 -----VETAREALKTAAEKNLANPEIAAYTLGEYGSYKVTVRAVDSNGVVTTPTVGKT 1094 >gi|311029291|ref|ZP_07707381.1| flotillin-like protein [Bacillus sp. m3-13] Length = 511 Score = 33.3 bits (74), Expect = 9.9, Method: Composition-based stats. Identities = 26/84 (30%), Positives = 49/84 (58%), Gaps = 7/84 (8%) Query: 2 AELETAQAEADAKIAEAKAKDSGNQAEIDIARSNAK-DKTDLVRIAEAKYKYRSDIVMAI 60 A++ TA+A+ + +I A+A +AE++ A A+ +KT+ +++AE Y+ DI Sbjct: 210 ADIATAEADKETRIKRAEAAKDAQRAELERATEIAEAEKTNQMKVAE--YRREQDIA--- 264 Query: 61 EKSKADAIKTMEDRKAKVGVARQE 84 K++AD +E+ +AK V Q+ Sbjct: 265 -KARADQAYHLEEARAKQEVTEQQ 287 Database: nr Posted date: May 13, 2011 4:10 AM Number of letters in database: 999,999,932 Number of sequences in database: 2,987,209 Database: /data/usr2/db/fasta/nr.01 Posted date: May 13, 2011 4:17 AM Number of letters in database: 999,998,956 Number of sequences in database: 2,896,973 Database: /data/usr2/db/fasta/nr.02 Posted date: May 13, 2011 4:23 AM Number of letters in database: 999,999,979 Number of sequences in database: 2,907,862 Database: /data/usr2/db/fasta/nr.03 Posted date: May 13, 2011 4:29 AM Number of letters in database: 999,999,513 Number of sequences in database: 2,932,190 Database: /data/usr2/db/fasta/nr.04 Posted date: May 13, 2011 4:33 AM Number of letters in database: 792,586,372 Number of sequences in database: 2,260,650 Lambda K H 0.302 0.116 0.279 Lambda K H 0.267 0.0363 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 616,853,644 Number of Sequences: 13984884 Number of extensions: 15906257 Number of successful extensions: 58323 Number of sequences better than 10.0: 688 Number of HSP's better than 10.0 without gapping: 76 Number of HSP's successfully gapped in prelim test: 612 Number of HSP's that attempted gapping in prelim test: 57242 Number of HSP's gapped (non-prelim): 1548 length of query: 119 length of database: 4,792,584,752 effective HSP length: 86 effective length of query: 33 effective length of database: 3,589,884,728 effective search space: 118466196024 effective search space used: 118466196024 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 43 (21.9 bits) S2: 75 (33.7 bits)