BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781191|ref|YP_003065604.1| hypothetical protein CLIBASIA_05490 [Candidatus Liberibacter asiaticus str. psy62] (165 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|291556516|emb|CBL33633.1| Protein of unknown function (DUF2800) [Eubacterium siraeum V10Sc8a] Length = 374 Score = 215 bits (547), Expect = 2e-54, Method: Composition-based stats. Identities = 37/162 (22%), Positives = 70/162 (43%), Gaps = 9/162 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C+FC+AK C A + +L++ ++ +++ + + W+ +K+ AL Sbjct: 217 EWCQFCKAKADCRERANANMELARFEFRQPPLLTDEEVEEILGRIDELIAWVSDIKDYAL 276 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG+ Y+L EGR +R Y ++N V + + Y +L T L+ +K Sbjct: 277 QAAISGKQWSGYKLVEGRS-NRKYTDENAVIAAVTAAGY-DPYEHKILGITAMTSLLGKK 334 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 + ++ L IT+ GK +VP + I +F Sbjct: 335 QFNDI----LGGLITKPQGKPTLVPDSDKRPAMTTIIDDFKE 372 >gi|197303498|ref|ZP_03168537.1| hypothetical protein RUMLAC_02220 [Ruminococcus lactaris ATCC 29176] gi|197297496|gb|EDY32057.1| hypothetical protein RUMLAC_02220 [Ruminococcus lactaris ATCC 29176] Length = 377 Score = 214 bits (545), Expect = 4e-54, Method: Composition-based stats. Identities = 35/162 (21%), Positives = 65/162 (40%), Gaps = 9/162 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + C FC+ K C A L ++ L + +++ ++ + + +W +K+ AL Sbjct: 222 DHCTFCKVKATCRKRAEYNLELAKYDFKMPATLDDTEIAAILEKVDEMISWGNDIKDYAL 281 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG +++ EGR +R Y ++N V + + Y + LL T L+ +K Sbjct: 282 QQAQSGVHFEGWKIVEGRS-NRKYTDENAVADTVKDAGF-DPYEKKLLGITSMSTLLGKK 339 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 K E L I + GK +V + +F Sbjct: 340 KFEEL----LGGLIYKPPGKPTLVLESDKRPAMNTAKDDFKE 377 >gi|317132756|ref|YP_004092070.1| hypothetical protein Ethha_1814 [Ethanoligenens harbinense YUAN-3] gi|315470735|gb|ADU27339.1| hypothetical protein Ethha_1814 [Ethanoligenens harbinense YUAN-3] Length = 380 Score = 213 bits (543), Expect = 7e-54, Method: Composition-based stats. Identities = 41/164 (25%), Positives = 77/164 (46%), Gaps = 9/164 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEEALN 61 CRFC+AK C A + L ++ LS+ ++ +++ + + +W +KE AL Sbjct: 221 WCRFCKAKTICRKRAEENLKLAQHEFRLPPELSDAEIEVILSQVDELVSWASDIKEYALQ 280 Query: 62 VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKK 121 SG++ ++L EGR R Y N+ V + + + Y + LL T ++L+ + + Sbjct: 281 QALSGKEWRGFKLVEGRS-VRKYANETAVAKTVEDAGF-DPYEKKLLGITAMQKLLGKNR 338 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLKD 165 +E L FI + GK +VP + ++F +K+ Sbjct: 339 FNEL----LSGFIEKPQGKPTLVPDSDKRPAMNTAKNDFKEVKN 378 >gi|302876793|ref|YP_003845426.1| hypothetical protein Clocel_3999 [Clostridium cellulovorans 743B] gi|307687475|ref|ZP_07629921.1| hypothetical protein Ccel74_04918 [Clostridium cellulovorans 743B] gi|302579650|gb|ADL53662.1| Protein of unknown function DUF2800 [Clostridium cellulovorans 743B] Length = 380 Score = 213 bits (542), Expect = 7e-54, Method: Composition-based stats. Identities = 37/162 (22%), Positives = 70/162 (43%), Gaps = 9/162 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C+FCR K C A + L ++ L++ +++ +++ + + +W +KE AL Sbjct: 217 EHCQFCRVKATCRKRAEENLKLAQYDFEMPINLNDTEIAAILSQVDELVSWANDIKEYAL 276 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG + +++ EGR R Y +++ V ++ + Y + LL T L+ +K Sbjct: 277 QQALSGTEYEGFKVVEGRS-IRKYTDEDAVAFIVKDHGF-DPYEKKLLGITAMTSLLGKK 334 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 K E L I + GK +VP + +F Sbjct: 335 KFEEL----LSGLIAKPPGKPTLVPISDKRQAMSTAKEDFKE 372 >gi|167746064|ref|ZP_02418191.1| hypothetical protein ANACAC_00759 [Anaerostipes caccae DSM 14662] gi|167654579|gb|EDR98708.1| hypothetical protein ANACAC_00759 [Anaerostipes caccae DSM 14662] Length = 373 Score = 212 bits (541), Expect = 9e-54, Method: Composition-based stats. Identities = 36/162 (22%), Positives = 69/162 (42%), Gaps = 9/162 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEEAL 60 N C+FC+AK C A + ++ L + ++S ++ + W+ +KE AL Sbjct: 217 NHCQFCKAKATCRKRAEYNMELAKYDFEMPSTLDDTEISVILEKADELVAWVTDIKEYAL 276 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 +G +++ EGR +R Y ++ +V + + + Y + LL T L+ + Sbjct: 277 AQAMNGTHYDGFKIVEGRS-NRKYTDEEKVAETVTKAG-HNPYEQRLLGITAMTSLLGKT 334 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 K +E L + + GK +VP + +FS Sbjct: 335 KFNEL----LGNLVYKPQGKPTLVPESDKRPEMNTAKDDFSE 372 >gi|293401131|ref|ZP_06645275.1| phage protein [Erysipelotrichaceae bacterium 5_2_54FAA] gi|291305257|gb|EFE46502.1| phage protein [Erysipelotrichaceae bacterium 5_2_54FAA] Length = 376 Score = 212 bits (541), Expect = 1e-53, Method: Composition-based stats. Identities = 39/160 (24%), Positives = 71/160 (44%), Gaps = 9/160 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FC+AK C A A L +L + ++ +++ + + W +KE AL Sbjct: 219 EWCGFCKAKHECRARAEANLLLAQHDFKLPPLLEDTEIEVILSRVDELVAWASDIKEYAL 278 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG++ ++L EGR +R Y+N+ V + + + Y + LL T ++L+ + Sbjct: 279 QQAISGKEWTGWKLVEGRS-NRKYSNEAAVIEAVTDAGF-DPYEKKLLGITAMQKLLGKS 336 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEF 160 + E L +I + GK +VP + S+F Sbjct: 337 RFDEL----LAAYIEKPQGKPTLVPESDKRPAMNNAKSDF 372 >gi|323693315|ref|ZP_08107533.1| phage protein [Clostridium symbiosum WAL-14673] gi|323502798|gb|EGB18642.1| phage protein [Clostridium symbiosum WAL-14673] Length = 376 Score = 212 bits (540), Expect = 1e-53, Method: Composition-based stats. Identities = 39/160 (24%), Positives = 71/160 (44%), Gaps = 9/160 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FC+AK C A A L +L++ ++ +++ + +W +KE AL Sbjct: 219 EWCGFCKAKHECRARAEANLLLAQHDFKLPPLLTDSEIEVILSRADELISWAGDIKEYAL 278 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG++ ++L EGR +R Y+ND V Q + + Y + LL T ++ + + Sbjct: 279 QQAISGKEWAGWKLVEGRS-NRKYSNDEAVIQAVTDAGF-DPYEKKLLGITAMQKRLGKS 336 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEF 160 + E L +I + GK +VP + ++F Sbjct: 337 RFDEL----LTAYIEKPQGKPTLVPESDKRPAMNNAKTDF 372 >gi|167761127|ref|ZP_02433254.1| hypothetical protein CLOSCI_03525 [Clostridium scindens ATCC 35704] gi|167661246|gb|EDS05376.1| hypothetical protein CLOSCI_03525 [Clostridium scindens ATCC 35704] Length = 376 Score = 212 bits (539), Expect = 2e-53, Method: Composition-based stats. Identities = 38/160 (23%), Positives = 72/160 (45%), Gaps = 9/160 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FC+AK C A A L +L++ ++ ++ + + +W +KE AL Sbjct: 219 EWCGFCKAKNECRARAEANLKLAQHDFKLPPLLTDTEIEVILGKVDELVSWAYDIKEYAL 278 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG++ ++L EGR +R Y+N+ V + + + Y + LL T ++L+ + Sbjct: 279 QQALSGKEWSGFKLVEGRA-NRKYSNETAVIDAVEKAGF-DPYEKKLLGITAMQKLLGKS 336 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEF 160 + E L +I + GK +VP + ++F Sbjct: 337 RFDEL----LSAYIEKPQGKPTLVPESDKRPAMNTAKNDF 372 >gi|325662091|ref|ZP_08150710.1| hypothetical protein HMPREF0490_01448 [Lachnospiraceae bacterium 4_1_37FAA] gi|325471754|gb|EGC74973.1| hypothetical protein HMPREF0490_01448 [Lachnospiraceae bacterium 4_1_37FAA] Length = 373 Score = 212 bits (539), Expect = 2e-53, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 69/162 (42%), Gaps = 9/162 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + C+FC+ K C A L ++ L + +++ ++ + + +W +K+ AL Sbjct: 217 DHCQFCKVKATCRKRAETNLELAKYDFEMPATLDDFEIAAILPRIDQLISWGNDIKDYAL 276 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG +++ EGR +R Y +D+ V + + + Y + LL T L+ +K Sbjct: 277 TQAQSGTHYDGFKIVEGRS-NRKYTDDDAVAEAVTAAGY-DPYEKKLLGITAMSSLLGKK 334 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 K + L + + GK +VP + +F+ Sbjct: 335 KFEQL----LGGLVYKPPGKPALVPESDKRLAMNTAADDFND 372 >gi|227498334|ref|ZP_03928484.1| conserved hypothetical protein [Acidaminococcus sp. D21] gi|226903796|gb|EEH89714.1| conserved hypothetical protein [Acidaminococcus sp. D21] Length = 375 Score = 210 bits (536), Expect = 4e-53, Method: Composition-based stats. Identities = 39/162 (24%), Positives = 64/162 (39%), Gaps = 9/162 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + CRFC+ K C A L L + ++ V+ + W VKE AL Sbjct: 218 DHCRFCKIKATCRKRAEYNLELAQYDFAVPSTLQDEEIEAVLAKADELVNWAGDVKEYAL 277 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG+ ++L EGR +R Y ++ V + + Y R LL T + + +K Sbjct: 278 QQALSGKHWDGWKLVEGRS-NRRYVSEEAVAAKVEDAGF-DPYERKLLGITAMTKQLGKK 335 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 + E L + + GK V+VP + +F+ Sbjct: 336 RFEEL----LSDLVEKPQGKPVLVPESDKRPTMHTAADDFND 373 >gi|260161777|emb|CAZ39321.1| putative phage protein [Streptococcus suis] gi|313575359|emb|CBR26888.1| hypothetical protein [Streptococcus phage phi-SsUD.1] Length = 373 Score = 210 bits (536), Expect = 4e-53, Method: Composition-based stats. Identities = 40/164 (24%), Positives = 74/164 (45%), Gaps = 9/164 (5%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEE 58 C+FC+ K C A L+ L N +++++ L L+ +W +K Sbjct: 215 SGKHCQFCKLKNVCRKRAEDNLALAKMEFADPASLDNEDIAEILPKLDLLISWANDIKAY 274 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVK 118 ALN + G +P Y+L EGR R +++++ V Q ++ + Y + LL+ T +L+ Sbjct: 275 ALNQATDGHPIPGYKLVEGRS-VRKFSDESAVSQAVIEAGY-DPYEKKLLTITAMTKLLG 332 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 +K ++ L + + GK +VP D + +EF Sbjct: 333 KKTFNDL----LGGLVVKPSGKPTLVPIDDSRQEMNLAKNEFKE 372 >gi|50261595|gb|AAT72363.1| unknown [Streptococcus pyogenes] Length = 373 Score = 210 bits (535), Expect = 5e-53, Method: Composition-based stats. Identities = 41/164 (25%), Positives = 74/164 (45%), Gaps = 9/164 (5%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEE 58 C+FC+ K C A L+ L N +++++ L L+ +W +K Sbjct: 215 SGKHCQFCKLKNVCRKRAEDNLALAKMEFADPASLDNEDIAEILPKLDLLISWANDIKAY 274 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVK 118 ALN + G +P Y+L EGR R +++++ V Q ++ + Y + LL+ T +L+ Sbjct: 275 ALNQATDGHPIPGYKLVEGRS-VRKFSDESAVSQAVIEAGY-DPYEKKLLTITAMTKLLG 332 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 +K ++ L I + GK +VP D + +EF Sbjct: 333 KKTFNDL----LGGLIIKPSGKPTLVPIDDSRQEMNLAKNEFKE 372 >gi|226323294|ref|ZP_03798812.1| hypothetical protein COPCOM_01066 [Coprococcus comes ATCC 27758] gi|225208484|gb|EEG90838.1| hypothetical protein COPCOM_01066 [Coprococcus comes ATCC 27758] Length = 373 Score = 210 bits (535), Expect = 5e-53, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 69/162 (42%), Gaps = 9/162 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + C+FC+ K C A L + ++ L + +++ ++ + + TW +K L Sbjct: 217 DHCQFCKVKATCRKRAETNLELVKYNFEMPATLDDFEIAAILPRIDQLITWGNDIKNYTL 276 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG +++ EGR +R Y +D+ V + + + Y + LL T L+ +K Sbjct: 277 TQAQSGTHYDGFKIVEGRS-NRKYTDDDAVAEAVTTAGY-DPYEKKLLGITAMSSLLGKK 334 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 + E L + + GK +VP + + +F+ Sbjct: 335 QFDEI----LGGLVYKPPGKPALVPESDKRSAMNTAADDFND 372 >gi|57234130|ref|YP_181817.1| hypothetical protein DET1102 [Dehalococcoides ethenogenes 195] gi|57224578|gb|AAW39635.1| conserved hypothetical protein [Dehalococcoides ethenogenes 195] Length = 379 Score = 210 bits (534), Expect = 7e-53, Method: Composition-based stats. Identities = 38/160 (23%), Positives = 72/160 (45%), Gaps = 9/160 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FC+AK C A A L +L++ ++ ++ + + +W +KE AL Sbjct: 219 EWCGFCKAKNECRARAEANLKLAQHDFKLPPLLTDTEIEVILGKVDELVSWASDIKEYAL 278 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG++ ++L EGR +R Y+N+ V + + + Y + LL T ++L+ + Sbjct: 279 QQALSGKEWSGFKLVEGRA-NRRYSNEAAVIDAVEKAGF-DPYEKKLLGITAMQKLLGKS 336 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEF 160 + E L +I + GK +VP + ++F Sbjct: 337 RFDEL----LTAYIEKPQGKPTLVPESDKRPAMNTAKNDF 372 >gi|317501101|ref|ZP_07959307.1| phage protein [Lachnospiraceae bacterium 8_1_57FAA] gi|316897488|gb|EFV19553.1| phage protein [Lachnospiraceae bacterium 8_1_57FAA] Length = 379 Score = 209 bits (533), Expect = 8e-53, Method: Composition-based stats. Identities = 37/160 (23%), Positives = 73/160 (45%), Gaps = 9/160 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FC+AK C A A L +L++ ++ ++ + + +W +KE AL Sbjct: 219 EWCGFCKAKNECRARAEANLKLAQHDFKLPPLLTDTEIEVILGKVDELVSWASDIKEYAL 278 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG++ +++ EGR +R Y+N++ V + + + Y + LL T ++L+ + Sbjct: 279 QQALSGKEWTGFKIVEGRS-NRRYSNESAVIDAVEKAGL-DPYEKKLLGITAMQKLLGKS 336 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEF 160 + E L +I + GK +VP + ++F Sbjct: 337 RFDEL----LTAYIEKPQGKPTLVPDSDKRPAMNTAKNDF 372 >gi|50914499|ref|YP_060471.1| unknown phage protein [Streptococcus pyogenes MGAS10394] gi|40218550|gb|AAR83204.1| hypothetical phage protein [Streptococcus pyogenes] gi|50903573|gb|AAT87288.1| unknown phage protein [Streptococcus pyogenes MGAS10394] Length = 381 Score = 209 bits (532), Expect = 1e-52, Method: Composition-based stats. Identities = 41/164 (25%), Positives = 74/164 (45%), Gaps = 9/164 (5%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEE 58 C+FC+ K C A L+ L N +++++ L L+ +W +K Sbjct: 223 SGKHCQFCKLKNVCRKRAEDNLALAKMEFADPASLDNEDIAEILPKLDLLISWANDIKAY 282 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVK 118 ALN + G +P Y+L EGR R +++++ V Q ++ + Y + LL+ T +L+ Sbjct: 283 ALNQATDGHPIPGYKLVEGRS-VRKFSDESAVSQAVIEAGY-DPYEKKLLTITAMTKLLG 340 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 +K ++ L I + GK +VP D + +EF Sbjct: 341 KKTFNDL----LGGLIIKPSGKPTLVPIDDSRQEMNLAKNEFKE 380 >gi|304436357|ref|ZP_07396333.1| phage protein [Selenomonas sp. oral taxon 149 str. 67H29BP] gi|304370626|gb|EFM24275.1| phage protein [Selenomonas sp. oral taxon 149 str. 67H29BP] Length = 374 Score = 208 bits (530), Expect = 2e-52, Method: Composition-based stats. Identities = 37/161 (22%), Positives = 63/161 (39%), Gaps = 9/161 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+FCR K C A L L + ++ V+ ++ W+ +KE AL Sbjct: 219 HCQFCRIKATCRKRAEYNLELARYDFEMPPTLEDSEVEAVLAKADMLAAWVSDIKEYALQ 278 Query: 62 VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKK 121 G+ +++L EGR +R Y ++ V + + E Y + LL T L+ + K Sbjct: 279 RAIQGKRWTDWKLVEGRS-NRKYTDEAAVAKTVKEAGY-EPYEQKLLGITAMTGLLGKNK 336 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 E L F+ + GK + P + +F Sbjct: 337 FEEL----LGGFVIKPQGKPTLAPMSDKRPVMNTAAEDFKE 373 >gi|313898032|ref|ZP_07831571.1| conserved hypothetical protein [Clostridium sp. HGF2] gi|312957060|gb|EFR38689.1| conserved hypothetical protein [Clostridium sp. HGF2] Length = 207 Score = 208 bits (529), Expect = 2e-52, Method: Composition-based stats. Identities = 39/160 (24%), Positives = 73/160 (45%), Gaps = 9/160 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 CRFC+AK C A A L +L + ++ +++ + + +W +KE AL Sbjct: 47 EWCRFCKAKHECRARAEANLLLAQYDFKLPPLLEDSEIEVILSRVDELVSWAGDIKEYAL 106 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG++ ++L EGR +R Y N++ V + + + Y + LL T ++L+ + Sbjct: 107 QQAISGKEWTGWKLVEGRS-NRRYTNEDAVSKAVEAAGF-DPYEKKLLGITAMQKLLGKS 164 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEF 160 + E L +I + GK +VP + ++F Sbjct: 165 RFEEL----LAAYIEKPQGKPTLVPESDKRPAMNTAKNDF 200 >gi|225573256|ref|ZP_03782011.1| hypothetical protein RUMHYD_01447 [Blautia hydrogenotrophica DSM 10507] gi|225039388|gb|EEG49634.1| hypothetical protein RUMHYD_01447 [Blautia hydrogenotrophica DSM 10507] Length = 372 Score = 208 bits (529), Expect = 3e-52, Method: Composition-based stats. Identities = 42/162 (25%), Positives = 76/162 (46%), Gaps = 9/162 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + CRFC+AK C A L ++ L + +++ +++ + + W VKE AL Sbjct: 217 DWCRFCKAKAACRKRAEYNLELAKYDFEMPDTLEDAEIAAILDKVDELTAWAADVKEYAL 276 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG + P Y++ EGR +R Y +++ V + + + Y + +L TE ++L+ +K Sbjct: 277 RQALSGTEYPGYKVVEGRS-NRRYISEDAVADAVSQAGY-DPYAKKVLGLTEMQRLLGKK 334 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 K E L I + GK V+VP + ++F Sbjct: 335 KFDEL----LGGLIEKPQGKPVLVPLSDKRQPMNTAQNDFKD 372 >gi|167630955|ref|YP_001681454.1| hypothetical protein HM1_2934 [Heliobacterium modesticaldum Ice1] gi|167593695|gb|ABZ85443.1| conserved hypothetical protein [Heliobacterium modesticaldum Ice1] Length = 373 Score = 207 bits (528), Expect = 3e-52, Method: Composition-based stats. Identities = 37/163 (22%), Positives = 67/163 (41%), Gaps = 9/163 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C+FC+ K C A A L +L + ++ ++ + + +W +K+ AL Sbjct: 217 EWCQFCKVKHECRARAEHNLELARYDFKLPPLLEDAEVEDILGKIDDLVSWANDIKDYAL 276 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 G+ ++L EGR +R Y +++ V + + Y R +L T L+ +K Sbjct: 277 QAALGGKQWNGWKLVEGRS-NRRYTDESAVADAVSAAGF-DPYERKVLGITAMTSLLGKK 334 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVL 163 + E L FI + GK +VP + +FS Sbjct: 335 RFEEV----LGGFIDKPPGKPTLVPESDKRPAIHTAQQDFSEF 373 >gi|167757897|ref|ZP_02430024.1| hypothetical protein CLOSCI_00228 [Clostridium scindens ATCC 35704] gi|167664551|gb|EDS08681.1| hypothetical protein CLOSCI_00228 [Clostridium scindens ATCC 35704] Length = 387 Score = 207 bits (527), Expect = 5e-52, Method: Composition-based stats. Identities = 42/162 (25%), Positives = 74/162 (45%), Gaps = 9/162 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + CRFC+AK C A L ++ L + +++ +++ + W VKE AL Sbjct: 232 DWCRFCKAKAVCRKRAEYNLELAKYDFEMPDTLEDAEIAAILDKADELTVWAVDVKEYAL 291 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG + P Y++ EGR +R Y +++ V + + + Y + +L TE +L+ +K Sbjct: 292 RQALSGTEYPGYKVVEGRS-NRRYISEDAVADAVSQAGY-DPYAKKVLGLTEMHRLLGKK 349 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 K E L I + GK V+VP + ++F Sbjct: 350 KFDEL----LGGLIEKPQGKPVLVPLSDKRQPMNTAQNDFKD 387 >gi|332523749|ref|ZP_08400001.1| hypothetical protein STRPO_0041 [Streptococcus porcinus str. Jelinkova 176] gi|332315013|gb|EGJ27998.1| hypothetical protein STRPO_0041 [Streptococcus porcinus str. Jelinkova 176] Length = 373 Score = 206 bits (524), Expect = 9e-52, Method: Composition-based stats. Identities = 41/164 (25%), Positives = 75/164 (45%), Gaps = 9/164 (5%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEE 58 N C+FC+ K C A L+ L N +++++ + L+ +W +K Sbjct: 215 SGNHCQFCKLKNVCRKRAEDNLALAKMEFADPASLDNEDIAEILPKVNLLISWANDIKTY 274 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVK 118 ALN + G +P Y+L EGR R +++++ V Q ++ + Y + LL+ T +L+ Sbjct: 275 ALNQATDGHPIPGYKLVEGRS-VRKFSDESAVSQAVIEAGY-DPYEKKLLTITAMTKLLG 332 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 +K ++ L I + GK +VP D + +EF Sbjct: 333 KKTFNDL----LGGLIIKPSGKPTLVPIDDSRQEMNLAKNEFKE 372 >gi|284048477|ref|YP_003398816.1| hypothetical protein Acfer_1134 [Acidaminococcus fermentans DSM 20731] gi|283952698|gb|ADB47501.1| conserved hypothetical protein [Acidaminococcus fermentans DSM 20731] Length = 382 Score = 205 bits (521), Expect = 2e-51, Method: Composition-based stats. Identities = 39/161 (24%), Positives = 67/161 (41%), Gaps = 9/161 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQILS---NRQLSQVMNVLPLIETWMKGVKEEAL 60 + CRFC+ + C A L + S + ++ ++ + TW VKE AL Sbjct: 218 DHCRFCKIRATCRKRAEYNLELAKYDFAMPSTLEDTEVEAILAKADELVTWAGDVKEYAL 277 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG++ +++ EGR +R Y N+ V + Y + LL T +L+ ++ Sbjct: 278 QQALSGKNWAGWKVVEGRS-NRRYVNEEAVAAKVEEAGY-SPYEKKLLGITALTKLLGKR 335 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFS 161 + E L I + GK V+VP + +FS Sbjct: 336 RFDEL----LTDLIEKPQGKPVLVPETDKRPSMHTAADDFS 372 >gi|262113720|emb|CAR95387.1| hypothetical protein [Streptococcus phage phi-m46.1] Length = 400 Score = 205 bits (521), Expect = 2e-51, Method: Composition-based stats. Identities = 40/164 (24%), Positives = 74/164 (45%), Gaps = 9/164 (5%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEE 58 N C+FC+ K C A L+ L N +++++ L L+ +W +K Sbjct: 242 SGNHCQFCKLKNVCRKRAEDNLALAKMEFADPASLDNEDIAEILPKLDLLISWANDIKAY 301 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVK 118 ALN + G +P Y+L E R R +++++ V Q ++ + Y + LL+ T +L+ Sbjct: 302 ALNQATDGHPIPGYKLVESRS-VRKFSDESAVSQAVIEAGF-DPYEKKLLTITAMTKLLG 359 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 +K ++ L + + GK +VP D + +EF Sbjct: 360 KKTFNDL----LGGLVVKPSGKPTLVPIDDSRQEMNLAKNEFKE 399 >gi|313896485|ref|ZP_07830036.1| conserved hypothetical protein [Selenomonas sp. oral taxon 137 str. F0430] gi|312974909|gb|EFR40373.1| conserved hypothetical protein [Selenomonas sp. oral taxon 137 str. F0430] Length = 374 Score = 205 bits (521), Expect = 2e-51, Method: Composition-based stats. Identities = 37/161 (22%), Positives = 62/161 (38%), Gaps = 9/161 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+FC+ K C A L L + ++ V+ + W+ +KE AL Sbjct: 219 HCQFCKIKATCRKRAEYNLELARYDFEMPPTLEDAEVEAVLAKADTLAAWVSDIKEYALQ 278 Query: 62 VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKK 121 G+ +++L EGR +R Y ++ V + + E Y + LL T L+ + K Sbjct: 279 RAIQGKQWADWKLVEGRS-NRKYTDETAVAKTVKEAGF-EPYEQKLLGITAMTALLGKSK 336 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 E L FI + GK + P + +F Sbjct: 337 FEEL----LGGFIVKPQGKPTLAPMSDKRPVMNTAAEDFKE 373 >gi|319757795|gb|ADV69737.1| hypothetical protein SSUJS14_0646 [Streptococcus suis JS14] Length = 373 Score = 204 bits (520), Expect = 3e-51, Method: Composition-based stats. Identities = 39/164 (23%), Positives = 73/164 (44%), Gaps = 9/164 (5%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEE 58 C+FC+ K C A L+ L N +++++ L L+ +W +K Sbjct: 215 SGKHCQFCKLKTVCRKRAEDNLALAKMEFANPATLDNEDIAEILPKLDLLISWANDIKAY 274 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVK 118 ALN + G +P Y+L EGR R +++++ V ++ + Y + LL+ T +L+ Sbjct: 275 ALNQATDGYSIPGYKLVEGRS-VRKFSDESAVSHAVIEAGY-DPYEKKLLTITAMTKLLG 332 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 +K ++ L + + GK +VP D + +EF Sbjct: 333 KKTFNDL----LGGLVIKPTGKPTLVPIDDSRQEMNLAKNEFKE 372 >gi|262043418|ref|ZP_06016543.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039244|gb|EEW40390.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 393 Score = 204 bits (520), Expect = 3e-51, Method: Composition-based stats. Identities = 48/184 (26%), Positives = 94/184 (51%), Gaps = 23/184 (12%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHM------------------QILSNRQLSQVMNV 44 E CRFC+A C A + T + +L+N QL+++ Sbjct: 204 EKQCRFCKASAICTAREQFHMQTVAGEFDNLTAPISELVTSAIARVPMLTNEQLAEIYGQ 263 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM--RELGDEA 102 +E+W+K +++ + L++G +P ++L G++G+R + N+ + E++L R D+ Sbjct: 264 ADFLESWLKAIRDRVNSELNAGHPVPGFKLVTGKQGNRAWRNEVEAEEMLKSFRLKQDQM 323 Query: 103 YNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFS 161 Y++ ++SPT+ E+L+K++ S W +++ ITR DGK I P P L +++F Sbjct: 324 YSQKVISPTQAEKLLKKE--SPRRWTKVEALITRSDGKPTIAPESDPRPALNVNPVNDFD 381 Query: 162 VLKD 165 + + Sbjct: 382 DVSE 385 >gi|167039877|ref|YP_001662862.1| hypothetical protein Teth514_1232 [Thermoanaerobacter sp. X514] gi|300915372|ref|ZP_07132686.1| conserved hypothetical protein [Thermoanaerobacter sp. X561] gi|307724799|ref|YP_003904550.1| hypothetical protein Thet_1677 [Thermoanaerobacter sp. X513] gi|166854117|gb|ABY92526.1| hypothetical protein Teth514_1232 [Thermoanaerobacter sp. X514] gi|300888648|gb|EFK83796.1| conserved hypothetical protein [Thermoanaerobacter sp. X561] gi|307581860|gb|ADN55259.1| hypothetical protein Thet_1677 [Thermoanaerobacter sp. X513] Length = 375 Score = 204 bits (520), Expect = 3e-51, Method: Composition-based stats. Identities = 33/166 (19%), Positives = 76/166 (45%), Gaps = 11/166 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C+FCRA +C A A + + +LS+ +++ +++ + + W + A Sbjct: 215 EWCQFCRAAVKCRARAEAKMKLATFEFALPPLLSDEEIADILSSIGDLTNWANEIIAYAT 274 Query: 61 NVL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKR 119 + + G+ +++ EGR +R Y ++ V + + Y ++L++ TE E+L+ + Sbjct: 275 DAAVNHGKKWTGFKVVEGRS-NRKYKDEEAVAEAAKNAGYRDIYKQSLITITEMEKLMGK 333 Query: 120 KKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFSVL 163 K +E L + + + GK +VP + + ++F + Sbjct: 334 SKFNEI----LGELVMKPPGKPTLVPVSDKRPEMNTSSAKNDFMEV 375 >gi|284048440|ref|YP_003398779.1| hypothetical protein Acfer_1095 [Acidaminococcus fermentans DSM 20731] gi|283952661|gb|ADB47464.1| conserved hypothetical protein [Acidaminococcus fermentans DSM 20731] Length = 382 Score = 204 bits (519), Expect = 3e-51, Method: Composition-based stats. Identities = 37/161 (22%), Positives = 67/161 (41%), Gaps = 9/161 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQILS---NRQLSQVMNVLPLIETWMKGVKEEAL 60 + CRFC+ + C A L + S + ++ ++ + +W+ VK+ AL Sbjct: 218 DHCRFCKIRATCRKRAEYNLELAKYDFAMPSTLEDPEMEAILAKADELVSWVGDVKDYAL 277 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG+ +++ EGR +R Y N+ V + Y + LL T +L+ ++ Sbjct: 278 QQALSGKTWAGWKVVEGRS-NRRYVNEEAVAAKVEEAGY-SPYEKKLLGITALTKLLGKR 335 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFS 161 + E L I + GK V+VP + +FS Sbjct: 336 RFDEL----LTDLIEKPQGKPVLVPETDKRPAMHTAADDFS 372 >gi|331648296|ref|ZP_08349385.1| putative protein p51 [Escherichia coli M605] gi|331042845|gb|EGI14986.1| putative protein p51 [Escherichia coli M605] Length = 433 Score = 203 bits (518), Expect = 5e-51, Method: Composition-based stats. Identities = 52/185 (28%), Positives = 94/185 (50%), Gaps = 24/185 (12%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHM------------------QILSNRQLSQVMNV 44 E CRFC+AK C A ++ L T + +L+ QL+++ + Sbjct: 243 EKQCRFCKAKAICTAHKMQHLQTAASDFEDLTKPVSEIITNASARVPLLTIEQLAEIYSQ 302 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM---RELGDE 101 IE+W+K V++ N L++G +P ++L G++G+R ++++ LL R +E Sbjct: 303 ADFIESWLKAVRDRVHNELNAGHPVPGFKLVTGKQGNRAWSDEEAARALLKDQFRYKTEE 362 Query: 102 AYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEF 160 ++ L+SPT+ E+L+K+ S W +++ ITR DGK +VP P L +++F Sbjct: 363 VFDFKLISPTKAEKLIKKA--SPRRWSKVEALITRADGKPTVVPESDPRPALNINPVNDF 420 Query: 161 SVLKD 165 + D Sbjct: 421 DDVSD 425 >gi|309702937|emb|CBJ02268.1| putative phage protein [Escherichia coli ETEC H10407] Length = 433 Score = 203 bits (518), Expect = 5e-51, Method: Composition-based stats. Identities = 53/185 (28%), Positives = 94/185 (50%), Gaps = 24/185 (12%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHM------------------QILSNRQLSQVMNV 44 E CRFC+AK C A ++ L T + +L+ QL+++ + Sbjct: 243 EKQCRFCKAKAICTAQKMQHLQTAASDFEDLTKPVSEIISNASARVPLLTIEQLAEIYSQ 302 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM---RELGDE 101 IE+W+K V++ N L++G +P ++L G++G+R ++++ LL R +E Sbjct: 303 ADFIESWLKAVRDRVHNELNAGHPVPGFKLVTGKQGNRAWSDEEAARALLKDQFRYKTEE 362 Query: 102 AYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEF 160 ++ L+SPT+ E+L+K+ S W +++ ITR DGK IVP P L +++F Sbjct: 363 VFDFKLISPTKAEKLIKKA--SPRRWSKVEALITRADGKPTIVPESDPRPALNINPVNDF 420 Query: 161 SVLKD 165 + D Sbjct: 421 DDVSD 425 >gi|315654971|ref|ZP_07907876.1| phage protein [Mobiluncus curtisii ATCC 51333] gi|315490932|gb|EFU80552.1| phage protein [Mobiluncus curtisii ATCC 51333] Length = 385 Score = 203 bits (517), Expect = 6e-51, Method: Composition-based stats. Identities = 40/166 (24%), Positives = 73/166 (43%), Gaps = 10/166 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+FC+ C A A L LS ++++V+ +P + W V++ AL+ Sbjct: 219 WCQFCKIASTCRARAEANLELAKFEFAPPAELSAGEVAEVLAQIPELTRWASDVQDYALS 278 Query: 62 VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKK 121 SGE ++L GR R Y ++ V + + Y R+LL+ T E+L+ +K+ Sbjct: 279 QALSGEQYEGFKLVAGRS-IRKYTDETAVAEAAKAAGYRDIYKRSLLTITAMEKLMGKKQ 337 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNH--LKANISEFSVLKD 165 SE L + + +GK +VP + +F+ + + Sbjct: 338 FSEI----LGNLVVKPEGKPTLVPLSDKRPELQISTAADDFTNIDN 379 >gi|331654011|ref|ZP_08355012.1| putative protein p51 [Escherichia coli M718] gi|331048860|gb|EGI20936.1| putative protein p51 [Escherichia coli M718] Length = 433 Score = 203 bits (517), Expect = 7e-51, Method: Composition-based stats. Identities = 53/185 (28%), Positives = 94/185 (50%), Gaps = 24/185 (12%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHM------------------QILSNRQLSQVMNV 44 E CRFC+AK C A ++ L T + +L+ QL+++ + Sbjct: 243 EKQCRFCKAKAICTAQKMQHLQTAASDFEDLTKPVSEIISNASARVPLLTIEQLAEIYSQ 302 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM---RELGDE 101 IE+W+K V++ N L++G +P ++L G++G+R ++++ LL R +E Sbjct: 303 ADFIESWLKAVRDRVHNELNAGHPVPGFKLVTGKQGNRAWSDEEAARALLKDQFRYKTEE 362 Query: 102 AYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEF 160 ++ L+SPT+ E+L+K+ S W +++ ITR DGK IVP P L +++F Sbjct: 363 VFDFKLISPTKAEKLIKKA--SPRRWSKVEALITRADGKPTIVPESDPRPALNINPVNDF 420 Query: 161 SVLKD 165 + D Sbjct: 421 DDVSD 425 >gi|260845236|ref|YP_003223014.1| hypothetical protein ECO103_3129 [Escherichia coli O103:H2 str. 12009] gi|257760383|dbj|BAI31880.1| hypothetical protein ECO103_3129 [Escherichia coli O103:H2 str. 12009] Length = 433 Score = 203 bits (516), Expect = 8e-51, Method: Composition-based stats. Identities = 53/185 (28%), Positives = 94/185 (50%), Gaps = 24/185 (12%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHM------------------QILSNRQLSQVMNV 44 E CRFC+AK C A ++ L T + +L+ QL+++ + Sbjct: 243 EKQCRFCKAKAICTAQKMQHLQTAASDFEDLTKPVSEIISNASARVPLLTIEQLAEIYSQ 302 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM---RELGDE 101 IE+W+K V++ N L++G +P ++L G++G+R ++++ LL R +E Sbjct: 303 ADFIESWLKAVRDRVHNELNAGHPVPGFKLVTGKQGNRAWSDEEAARALLKDQFRYKTEE 362 Query: 102 AYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEF 160 ++ L+SPT+ E+L+K+ S W +++ ITR DGK IVP P L +++F Sbjct: 363 VFDFKLISPTKAEKLIKKA--SPRRWSKVEALITRADGKPTIVPESDPRPALNINPVNDF 420 Query: 161 SVLKD 165 + D Sbjct: 421 DDVSD 425 >gi|291283826|ref|YP_003500644.1| Bbp38 [Escherichia coli O55:H7 str. CB9615] gi|290763699|gb|ADD57660.1| Bbp38 [Escherichia coli O55:H7 str. CB9615] Length = 433 Score = 203 bits (516), Expect = 8e-51, Method: Composition-based stats. Identities = 52/185 (28%), Positives = 94/185 (50%), Gaps = 24/185 (12%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHM------------------QILSNRQLSQVMNV 44 E CRFC+AK C A ++ L T + +L+ QL+++ + Sbjct: 243 EKQCRFCKAKAICTAQKMQHLQTAASDFEDLTKPVSEIITNASARVPLLTIEQLAEIYSQ 302 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM---RELGDE 101 IE+W+K V++ N L++G +P ++L G++G+R ++++ LL R +E Sbjct: 303 ADFIESWLKAVRDRVHNELNAGHPVPGFKLVTGKQGNRAWSDEEAARALLKDQFRYKTEE 362 Query: 102 AYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEF 160 ++ L+SPT+ E+L+K+ S W +++ ITR DGK +VP P L +++F Sbjct: 363 VFDFKLISPTKAEKLIKKA--SPRRWSKVEALITRADGKPTVVPESDPRPALNINPVNDF 420 Query: 161 SVLKD 165 + D Sbjct: 421 DDVSD 425 >gi|76788560|ref|YP_329356.1| hypothetical protein SAK_0728 [Streptococcus agalactiae A909] gi|76563617|gb|ABA46201.1| conserved hypothetical protein [Streptococcus agalactiae A909] Length = 373 Score = 203 bits (516), Expect = 9e-51, Method: Composition-based stats. Identities = 42/164 (25%), Positives = 73/164 (44%), Gaps = 9/164 (5%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEE 58 C+FC+ K C A + L+ L +++++ L L+ +W VK Sbjct: 215 SGKHCQFCKIKNVCRKRAEENLALAKMEFADPATLDYEDIAEILTKLDLLVSWANDVKAY 274 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVK 118 AL + G +P Y+L EGR R ++++ V Q +M + Y + LL+ T +L+ Sbjct: 275 ALKEATEGHSIPGYKLVEGRS-VRKFSDEAAVSQAVMDAGF-DPYEKKLLTITAMTKLLG 332 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 +K ++ L I ++ GK +VP D L +EF Sbjct: 333 KKTFNDL----LGGLIVKQSGKPTLVPLDDSRQELNLATNEFKE 372 >gi|77412081|ref|ZP_00788407.1| phi APSE P51-like protein [Streptococcus agalactiae CJB111] gi|77161886|gb|EAO72871.1| phi APSE P51-like protein [Streptococcus agalactiae CJB111] Length = 373 Score = 202 bits (515), Expect = 1e-50, Method: Composition-based stats. Identities = 42/164 (25%), Positives = 72/164 (43%), Gaps = 9/164 (5%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEE 58 C+FC+ K C A L+ L +++++ L L+ +W VK Sbjct: 215 SGKHCQFCKIKNVCRKRAEDNLALAKMEFADPATLDYEDIAEILPKLDLLVSWANDVKAY 274 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVK 118 AL + G +P Y+L EGR R ++++ V Q +M + Y + LL+ T +L+ Sbjct: 275 ALKEATEGHSIPGYKLVEGRS-VRKFSDEAAVSQAVMDAGF-DPYEKKLLTITAMTKLLG 332 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 +K ++ L I ++ GK +VP D L +EF Sbjct: 333 KKTFNDL----LGGLIVKQSGKPTLVPLDDSRQELNLATNEFKE 372 >gi|218555131|ref|YP_002388044.1| hypothetical protein ECIAI1_2661 [Escherichia coli IAI1] gi|218361899|emb|CAQ99499.1| conserved hypothetical protein from bacteriophage origin [Escherichia coli IAI1] Length = 433 Score = 202 bits (514), Expect = 1e-50, Method: Composition-based stats. Identities = 53/185 (28%), Positives = 94/185 (50%), Gaps = 24/185 (12%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHM------------------QILSNRQLSQVMNV 44 E CRFC+AK C A ++ L T + +L+ QL+++ + Sbjct: 243 EKQCRFCKAKAICTAQKMQHLQTAASDFEDLTKPVSEIITNASARVPLLTIEQLAEIYSQ 302 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM---RELGDE 101 IE+W+K V++ N L++G +P ++L G++G+R ++++ LL R +E Sbjct: 303 ADFIESWLKAVRDRVHNELNAGHPVPGFKLVTGKQGNRAWSDEEAARALLKDQFRYKTEE 362 Query: 102 AYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEF 160 ++ L+SPT+ E+L+K+ S W +++ ITR DGK IVP P L +++F Sbjct: 363 VFDFKLISPTKAEKLIKKA--SPRRWSKVEALITRADGKPTIVPESDPRPALNINPVNDF 420 Query: 161 SVLKD 165 + D Sbjct: 421 DDVSD 425 >gi|332800354|ref|YP_004461853.1| hypothetical protein TepRe1_2445 [Tepidanaerobacter sp. Re1] gi|332698089|gb|AEE92546.1| hypothetical protein TepRe1_2445 [Tepidanaerobacter sp. Re1] Length = 386 Score = 202 bits (513), Expect = 2e-50, Method: Composition-based stats. Identities = 42/166 (25%), Positives = 79/166 (47%), Gaps = 12/166 (7%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 CRFC+ K C A A + L +L++ ++ +++ +++W+ V+ AL Sbjct: 224 EHCRFCKVKATCRARADENLKLAEHDFKKPPLLTDDEIVEILAAADELQSWISDVQAYAL 283 Query: 61 NVL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGD--EAYNRTLLSPTETEQLV 117 + + G + P ++L EGR R Y ++ +V ++L+ D E Y ++LL T E+LV Sbjct: 284 DQAVNHGREWPGFKLIEGRSYRR-YADEAEVTEVLVAAGFDEEEIYTKSLLGITAMEKLV 342 Query: 118 KRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANIS-EFSV 162 +K+ +E L I + GK + P +K+ +F Sbjct: 343 GKKQFNEI----LGTLIIKPPGKPRLAPESDNRPAIKSTAEIDFKE 384 >gi|332088058|gb|EGI93183.1| bbp38 [Shigella boydii 5216-82] Length = 343 Score = 201 bits (511), Expect = 3e-50, Method: Composition-based stats. Identities = 52/185 (28%), Positives = 94/185 (50%), Gaps = 24/185 (12%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHM------------------QILSNRQLSQVMNV 44 E CRFC+AK C A ++ L T + +L+ QL+++ + Sbjct: 153 EKQCRFCKAKAICTAQKMQHLQTAASDFEDLTKPVSEIITNASARVPLLTIEQLAEIYSQ 212 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM---RELGDE 101 IE+W+K V++ N L++G +P ++L G++G+R ++++ LL R +E Sbjct: 213 ADFIESWLKAVRDRVHNELNAGHPVPGFKLVTGKQGNRAWSDEEAARALLKDQFRYKTEE 272 Query: 102 AYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEF 160 ++ L+SPT+ E+L+K+ S W +++ ITR DGK +VP P L +++F Sbjct: 273 VFDFKLISPTKAEKLIKKA--SPRRWSKVEALITRADGKPTVVPESDPRPALNINPVNDF 330 Query: 161 SVLKD 165 + D Sbjct: 331 DDVSD 335 >gi|194429254|ref|ZP_03061781.1| Bbp38 [Escherichia coli B171] gi|194412662|gb|EDX28957.1| Bbp38 [Escherichia coli B171] gi|323159203|gb|EFZ45193.1| bbp38 [Escherichia coli E128010] Length = 433 Score = 201 bits (511), Expect = 3e-50, Method: Composition-based stats. Identities = 51/185 (27%), Positives = 94/185 (50%), Gaps = 24/185 (12%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHM------------------QILSNRQLSQVMNV 44 E CRFC+AK C A ++ L T + +L+ QL+++ + Sbjct: 243 EKQCRFCKAKAICTAQKMQHLQTAASDFEDLTKPVSEIITNASARVPLLTIEQLAEIYSQ 302 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM---RELGDE 101 IE+W+K V++ N L++G +P ++L G++G+R ++++ LL R +E Sbjct: 303 ADFIESWLKAVRDRVHNELNAGHPVPGFKLVTGKQGNRAWSDEEAARALLKDQFRYKTEE 362 Query: 102 AYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEF 160 ++ L+SPT+ E+L+K+ S W +++ ITR DGK ++P P L +++F Sbjct: 363 VFDFKLISPTKAEKLIKKA--SPRRWSKVEALITRADGKPTVIPESDPRPALNINPVNDF 420 Query: 161 SVLKD 165 + D Sbjct: 421 DDVSD 425 >gi|269976592|ref|ZP_06183577.1| phage-associated protein [Mobiluncus mulieris 28-1] gi|269935393|gb|EEZ91942.1| phage-associated protein [Mobiluncus mulieris 28-1] Length = 421 Score = 200 bits (510), Expect = 4e-50, Method: Composition-based stats. Identities = 42/166 (25%), Positives = 71/166 (42%), Gaps = 10/166 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+FCR C A A L LS +++ V+ +P + W V++ AL+ Sbjct: 219 WCQFCRIASTCRARAEANLELAKFEFAPPAELSPAEVADVLAQIPELTRWASDVQDYALS 278 Query: 62 VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKK 121 SGE ++L GR R Y ++ V + + Y R+LL+ T E+L+ +K Sbjct: 279 QALSGELYEGFKLVAGRS-IRKYTDETAVAEAAKAAGYRDIYKRSLLTITAMERLMGKKH 337 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHL--KANISEFSVLKD 165 SE L + + +GK +VP L +F+ + + Sbjct: 338 FSEI----LGNLVVKPEGKPTLVPVTDKRPELQVSTAADDFTNIDN 379 >gi|163814573|ref|ZP_02205962.1| hypothetical protein COPEUT_00724 [Coprococcus eutactus ATCC 27759] gi|158450208|gb|EDP27203.1| hypothetical protein COPEUT_00724 [Coprococcus eutactus ATCC 27759] Length = 172 Score = 200 bits (510), Expect = 4e-50, Method: Composition-based stats. Identities = 35/165 (21%), Positives = 75/165 (45%), Gaps = 9/165 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FC+A +C A A L +L + ++ +++ + + +W +KE AL Sbjct: 8 EWCGFCKANHKCRARAEANLLLAQHDFKLPPLLEDSEIEVILSRVDELISWANDIKEYAL 67 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG++ +++L EGR +R Y +++ V + + + Y + LL T ++L+ + Sbjct: 68 QQAISGKEWTSWKLVEGRS-NRRYTSEDDVSKAVEAAGF-DPYEKKLLGITAMQKLLGKS 125 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLKD 165 + E L +I + GK +VP + +++ + + Sbjct: 126 RFEEL----LAAYIEKPQGKPTLVPESDKRPAMNTAKNDYIEVTN 166 >gi|268610650|ref|ZP_06144377.1| hypothetical protein RflaF_14282 [Ruminococcus flavefaciens FD-1] Length = 371 Score = 200 bits (509), Expect = 5e-50, Method: Composition-based stats. Identities = 38/162 (23%), Positives = 65/162 (40%), Gaps = 11/162 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C+FCR K C A L L++ ++S +++ W+ VKE AL Sbjct: 216 KHCQFCRIKATCRKRAEYNLQLAQYDFAVPDTLADDEISMILDRADTFIGWVNDVKEYAL 275 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG+ P +++ EGR +R Y + + V + + + + L+ T +L+ K Sbjct: 276 EQAISGKCYPGFKVVEGRS-NRRYTDTDAVAAAVTEAGY-DPFEKKLMGVTAMTKLLGAK 333 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 K + L I + GK +VP I +F Sbjct: 334 KFNTL----LGSLIEKPKGKPTLVPESDKRPAWT--IDDFKE 369 >gi|191174017|ref|ZP_03035534.1| Bbp38 [Escherichia coli F11] gi|190905708|gb|EDV65330.1| Bbp38 [Escherichia coli F11] Length = 433 Score = 200 bits (509), Expect = 6e-50, Method: Composition-based stats. Identities = 52/185 (28%), Positives = 94/185 (50%), Gaps = 24/185 (12%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHM------------------QILSNRQLSQVMNV 44 E CRFC+AK C A ++ L T + +L+ QL+++ + Sbjct: 243 EKQCRFCKAKAICTAQKMQHLQTAASDFEDLTKPVSEIITNASARVPLLTIEQLAEIYSQ 302 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM---RELGDE 101 IE+W+K V++ N L++G +P ++L G++G+R ++++ LL R +E Sbjct: 303 TDFIESWLKAVRDRVHNELNAGHPVPGFKLVTGKQGNRAWSDEEAARALLKDQFRYKTEE 362 Query: 102 AYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEF 160 ++ L+SPT+ E+L+K+ S W +++ ITR DGK +VP P L +++F Sbjct: 363 VFDFKLISPTKAEKLIKKA--SPRRWSKVEALITRADGKPTVVPESDPRPALNINPVNDF 420 Query: 161 SVLKD 165 + D Sbjct: 421 DDVSD 425 >gi|324014354|gb|EGB83573.1| hypothetical protein HMPREF9533_01586 [Escherichia coli MS 60-1] Length = 417 Score = 199 bits (506), Expect = 1e-49, Method: Composition-based stats. Identities = 52/185 (28%), Positives = 94/185 (50%), Gaps = 24/185 (12%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHM------------------QILSNRQLSQVMNV 44 E CRFC+AK C A ++ L T + +L+ QL+++ + Sbjct: 227 EKQCRFCKAKAICTAQKMQHLQTAASDFEDLTKPVSEIITNASARVPLLTIEQLAEIYSQ 286 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM---RELGDE 101 IE+W+K V++ N L++G +P ++L G++G+R ++++ LL R +E Sbjct: 287 TDFIESWLKAVRDRVHNELNAGHPVPGFKLVTGKQGNRAWSDEEAARALLKDQFRYKTEE 346 Query: 102 AYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEF 160 ++ L+SPT+ E+L+K+ S W +++ ITR DGK +VP P L +++F Sbjct: 347 VFDFKLISPTKAEKLIKKA--SPRRWSKVEALITRADGKPTVVPESDPRPALNINPVNDF 404 Query: 161 SVLKD 165 + D Sbjct: 405 DDVSD 409 >gi|332160961|ref|YP_004297538.1| hypothetical protein YE105_C1339 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665191|gb|ADZ41835.1| hypothetical protein YE105_C1339 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330862117|emb|CBX72281.1| putative protein p51 [Yersinia enterocolitica W22703] Length = 436 Score = 198 bits (505), Expect = 1e-49, Method: Composition-based stats. Identities = 55/184 (29%), Positives = 93/184 (50%), Gaps = 23/184 (12%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILSN------------------RQLSQVMNV 44 + CRFC+AK C AL +L+ + L++ L +++ Sbjct: 247 DKQCRFCKAKATCKALEQHSLNLVANDFVDLTDNLEPQLASAKERITHCDNAHLGELLGQ 306 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLL--MRELGDEA 102 L L+E W K V+E A + L++G +P Y+L G++G+R +N++ E L MR +E Sbjct: 307 LDLVEGWCKAVRERANSELNAGHPVPGYKLVIGKQGNRAWNSEETAEATLSAMRLKKEEM 366 Query: 103 YNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNH-LKANISEFS 161 YN L+SPT+ E+L+K++ S W +L+ I+R DGK I P P + ++F Sbjct: 367 YNFKLISPTQAEKLLKKE--SPRRWTKLEALISRADGKPTIAPEADPRPAHIVNPENDFE 424 Query: 162 VLKD 165 + + Sbjct: 425 NVDE 428 >gi|323173138|gb|EFZ58769.1| bbp38 [Escherichia coli LT-68] Length = 366 Score = 198 bits (505), Expect = 1e-49, Method: Composition-based stats. Identities = 52/185 (28%), Positives = 94/185 (50%), Gaps = 24/185 (12%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHM------------------QILSNRQLSQVMNV 44 E CRFC+AK C A ++ L T + +L+ QL+++ + Sbjct: 176 EKQCRFCKAKAICTAQKMQHLQTAASDFEDLTKPVSEIITNASARVPLLTIEQLAEIYSQ 235 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM---RELGDE 101 IE+W+K V++ N L++G +P ++L G++G+R ++++ LL R +E Sbjct: 236 ADFIESWLKAVRDRVHNELNAGHPVPGFKLVTGKQGNRAWSDEEAARALLKDQFRYKTEE 295 Query: 102 AYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEF 160 ++ L+SPT+ E+L+K+ S W +++ ITR DGK +VP P L +++F Sbjct: 296 VFDFKLISPTKAEKLIKKA--SPRRWSKVEALITRADGKPTVVPESDPHPALNINPVNDF 353 Query: 161 SVLKD 165 + D Sbjct: 354 DDVSD 358 >gi|170023454|ref|YP_001719959.1| hypothetical protein YPK_1208 [Yersinia pseudotuberculosis YPIII] gi|169749988|gb|ACA67506.1| conserved hypothetical protein [Yersinia pseudotuberculosis YPIII] Length = 436 Score = 198 bits (503), Expect = 3e-49, Method: Composition-based stats. Identities = 53/184 (28%), Positives = 92/184 (50%), Gaps = 23/184 (12%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILSNR------------------QLSQVMNV 44 + CRFC+AK C A +L+ + L++ L +++ Sbjct: 247 DKQCRFCKAKATCNAGEQHSLNLVANDFVDLTDALEPQLSGAKERITHCDNAHLGELLGQ 306 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLL--MRELGDEA 102 L L+E W K V+E A + L++G +P Y+L G++G+R ++++ E L MR +E Sbjct: 307 LDLVEGWCKAVRERANSELNAGHPVPGYKLVIGKQGNRAWSSEETAETTLSAMRLKKEEM 366 Query: 103 YNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNH-LKANISEFS 161 YN L+SPT+ E+L+K++ S W +L+ I+R DGK I P P + ++F Sbjct: 367 YNFKLISPTQAEKLLKKE--SPRRWTKLEALISRADGKPTIAPEADPRPAHIVNPENDFE 424 Query: 162 VLKD 165 + + Sbjct: 425 NVDE 428 >gi|153814239|ref|ZP_01966907.1| hypothetical protein RUMTOR_00448 [Ruminococcus torques ATCC 27756] gi|145848635|gb|EDK25553.1| hypothetical protein RUMTOR_00448 [Ruminococcus torques ATCC 27756] Length = 375 Score = 197 bits (500), Expect = 7e-49, Method: Composition-based stats. Identities = 40/168 (23%), Positives = 81/168 (48%), Gaps = 9/168 (5%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEE 58 + CRFC+A+ C A A + L ++S+ ++++V++ ++ W + V Sbjct: 213 TGDHCRFCKARFTCRARAEEYLKLAQMEFAEPALMSDEEIAEVLSKADALKKWAEEVYTY 272 Query: 59 ALNVLS-SGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLV 117 A N + ++ P Y+L GR +R Y ++++V + + + Y + L+ TE E+L+ Sbjct: 273 AQNEAVVNHKEWPGYKLVLGRS-NRKYTDEDEVAEAAQKAGYTDIYKKNLIGITEMERLM 331 Query: 118 KRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLKD 165 +KK +E L + + DGK +VP +K +E ++ Sbjct: 332 GKKKFNEI----LGSLVYKPDGKVTLVPDSDKREAVKTATAEADFKEE 375 >gi|295114920|emb|CBL35767.1| Protein of unknown function (DUF2800). [butyrate-producing bacterium SM4/1] Length = 375 Score = 195 bits (496), Expect = 2e-48, Method: Composition-based stats. Identities = 38/160 (23%), Positives = 68/160 (42%), Gaps = 9/160 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 CRFC+AK C A L+ +L++ +++ +++ + + W VKE AL Sbjct: 218 EWCRFCKAKAECRERAEANLALARYDFEEPPLLTDEEIAGILDKVDALTAWAADVKEYAL 277 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 SG P ++L EGR +R Y ++ + G Y + LL T ++L+ + Sbjct: 278 QQAVSGTAFPGWKLVEGRS-NRKYTSEAA-VAAAVEGAGFNPYEKKLLGITAMQKLLGKS 335 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEF 160 + E L +I + G+ +V +EF Sbjct: 336 RFEEL----LAPYIEKPQGRPTLVRSSDKRPEWNTAKNEF 371 >gi|295096876|emb|CBK85966.1| Protein of unknown function (DUF2800) [Enterobacter cloacae subsp. cloacae NCTC 9394] Length = 417 Score = 195 bits (495), Expect = 2e-48, Method: Composition-based stats. Identities = 49/185 (26%), Positives = 89/185 (48%), Gaps = 24/185 (12%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHM------------------QILSNRQLSQVMNV 44 E CRFC+A C A T + +L+ QL++V + Sbjct: 227 EKQCRFCKASAICTARQQLHFDTIAGDFVDLTQPTGEQLAEAVKCVPLLTAEQLAEVYSQ 286 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM---RELGDE 101 IE+W+K V++ + L++G +P ++L G++G+R ++++ LL R +E Sbjct: 287 ADFIESWLKAVRDRVNSELNAGHPVPGFKLVTGKQGNRAWSDEEAARALLKDQFRYKTEE 346 Query: 102 AYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEF 160 ++ L+SPT+ E+L+K+ S W +++ ITR DGK + P P L +++F Sbjct: 347 VFDLKLISPTKAEKLIKKA--SPRRWTKVEALITRADGKPTVAPESDPRPALNLNPVNDF 404 Query: 161 SVLKD 165 + D Sbjct: 405 DDVSD 409 >gi|154504842|ref|ZP_02041580.1| hypothetical protein RUMGNA_02352 [Ruminococcus gnavus ATCC 29149] gi|153794725|gb|EDN77145.1| hypothetical protein RUMGNA_02352 [Ruminococcus gnavus ATCC 29149] Length = 375 Score = 194 bits (494), Expect = 3e-48, Method: Composition-based stats. Identities = 40/166 (24%), Positives = 80/166 (48%), Gaps = 9/166 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + CRFC+A+ C A A + L ++S+ ++++V++ ++ W + V A Sbjct: 215 DHCRFCKARFTCRARAEEYLKLAQMEFAEPALMSDAEIAEVLSKADALKKWAEEVYTYAQ 274 Query: 61 NVLS-SGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKR 119 N + ++ P Y+L GR +R Y ++ V + + + + ++L+ TE E+L+ + Sbjct: 275 NEAVVNHKEWPGYKLVLGRS-NRKYTDEEDVAEAAQKAGYTDIFKKSLIGITEMERLMGK 333 Query: 120 KKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLKD 165 KK +E L + + DGK +VP +K +E +D Sbjct: 334 KKFNEI----LGSLVYKPDGKVTLVPDSDKREAVKTATAEADFKED 375 >gi|226949735|ref|YP_002804826.1| conserved phage-associated protein [Clostridium botulinum A2 str. Kyoto] gi|226842707|gb|ACO85373.1| conserved phage-associated protein [Clostridium botulinum A2 str. Kyoto] Length = 381 Score = 194 bits (493), Expect = 4e-48, Method: Composition-based stats. Identities = 39/162 (24%), Positives = 75/162 (46%), Gaps = 10/162 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + C+FCRAK C A A K + Q L N ++ ++ + W K V+E AL Sbjct: 224 DHCKFCRAKAVCKARADKNMELAQYDFQEPNTLDNNDIAFILGKADELINWAKDVQEYAL 283 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRTLLSPTETEQLVK 118 GE+ +++ EGR +R + +++++ ++L+ + E Y + L T E+ + Sbjct: 284 EQALQGEEFDGFKVVEGRS-NRKWTDEDKIGEILLGQGFLENIIYTKKLTGITNMEKAIG 342 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEF 160 +K+V+ L +IT+ GK + + ++F Sbjct: 343 KKEVNRL----LGDYITKPQGKPTLATITDKRPVYNSAEADF 380 >gi|300853537|ref|YP_003778521.1| phage-like protein [Clostridium ljungdahlii DSM 13528] gi|300433652|gb|ADK13419.1| phage-related protein [Clostridium ljungdahlii DSM 13528] Length = 387 Score = 193 bits (491), Expect = 7e-48, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 75/168 (44%), Gaps = 11/168 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FC+A +C A A L +L++ ++ ++++ +P + W + A Sbjct: 215 EWCTFCKASVKCRARAEAKLELARAEFMLPPLLTDMEIEEIIHKIPDLTKWANEIMAYAT 274 Query: 61 NVL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKR 119 + + G+ +++ EGR +R Y +++ V + Y++ L++ TE ++L+ + Sbjct: 275 DAAINHGKQWSGFKVVEGRS-NRKYKDEDAVADAAKTNGYKDIYHQRLITITEMQKLMGK 333 Query: 120 KKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHL--KANISEFSVLKD 165 K+ + L I + GK +VP + +EF+ + + Sbjct: 334 KQFEKI----LGCLIYKPPGKPTLVPITDKRTAMDTSNANNEFNEIME 377 >gi|227497452|ref|ZP_03927684.1| phage-associated protein [Actinomyces urogenitalis DSM 15434] gi|226833077|gb|EEH65460.1| phage-associated protein [Actinomyces urogenitalis DSM 15434] Length = 312 Score = 193 bits (490), Expect = 8e-48, Method: Composition-based stats. Identities = 36/166 (21%), Positives = 73/166 (43%), Gaps = 10/166 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+FCR P C A A L+ LS +++ V+ +P ++ W V+ AL Sbjct: 151 WCQFCRIAPTCRARAESNLALAKHEFAPPAELSIAEVADVLAKIPELKAWASDVEAWALA 210 Query: 62 VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKK 121 +G +P +++ GR R Y ++ V + + +++ L+ T E+L+ ++ Sbjct: 211 KARAGTQIPGFKVVAGRS-IRKYTDEAAVAEAAKAAGYSDIWDKRLIGITAMERLMGKRA 269 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFSVLKD 165 +E L + + +GK +VP L + ++F+ + Sbjct: 270 FTE----TLGDLVIKPEGKPTLVPESDKRPALHRVSAATDFTNTNN 311 >gi|303239092|ref|ZP_07325622.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] gi|302593430|gb|EFL63148.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] Length = 374 Score = 193 bits (490), Expect = 9e-48, Method: Composition-based stats. Identities = 37/164 (22%), Positives = 74/164 (45%), Gaps = 11/164 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 CRFCRAK C A A L ++LS+ ++ +++ ++ W+ + E A Sbjct: 216 EHCRFCRAKAVCRARAKANLELAKYDFADPELLSDEEMGEILAKAEQLQAWVSDLWEYAQ 275 Query: 61 NVLSSG-EDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRTLLSPTETEQLV 117 +G + P +++ GR +R Y+++ + ++L+ E +N+ LL + E+L Sbjct: 276 AEAIAGRKKWPGFKVVAGRS-NRRYSDEEKAAEVLLVNGYTEKQIFNKKLLGIGDMEKLT 334 Query: 118 KRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFS 161 +K+ E L+ +I + GK +V ++F Sbjct: 335 GKKRFEEL----LKDYIEKPAGKPALVSETDKRQEWNRAAADFD 374 >gi|220930429|ref|YP_002507338.1| hypothetical protein Ccel_3066 [Clostridium cellulolyticum H10] gi|220000757|gb|ACL77358.1| conserved hypothetical protein [Clostridium cellulolyticum H10] Length = 411 Score = 193 bits (490), Expect = 1e-47, Method: Composition-based stats. Identities = 39/174 (22%), Positives = 75/174 (43%), Gaps = 15/174 (8%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEE 58 + C+FCRAK C A L + LSN +++ ++ ++ W V+ Sbjct: 227 SGDHCQFCRAKAVCRKRAEDNLEMARYEFEDPNILSNDEIADILAKAAELQKWASDVQAY 286 Query: 59 ALNVL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYN-RTLLSPTETE 114 AL+ + G ++L EGR +R Y +++ V L E D Y + + +E E Sbjct: 287 ALDQAENHGVKFTGWKLVEGRS-NRKYTDEDAVATKLKDEGYASDVIYQPQKIWGISEME 345 Query: 115 QLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANIS---EFSVLKD 165 + + ++ ++ L +F+ + GK +VP + + S +F L + Sbjct: 346 KKIGKRLFAD----YLTEFVVKPAGKATLVPESDKRPEISSVASAVRDFDDLYE 395 >gi|313618509|gb|EFR90508.1| phage-associated protein [Listeria innocua FSL S4-378] Length = 378 Score = 192 bits (488), Expect = 2e-47, Method: Composition-based stats. Identities = 34/167 (20%), Positives = 76/167 (45%), Gaps = 11/167 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C FC+A RC A A + L + ++++ ++ +++ +LP + W + A + Sbjct: 216 WCTFCKAANRCRARAEEKLKLAEKEFKMPPLMTDAEIEEILLILPDLTKWANKITAYATD 275 Query: 62 VL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 + G++ +++ EGR R Y ++ + + + + Y ++L+ TE ++L+ + Sbjct: 276 AAVNHGKEWNGFKVVEGRS-VRKYKDEGAIAEKAVAGGYKDIYRKSLIPLTEMQKLMGKS 334 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFSVLKD 165 K E L I + GK +VP + +EF+ + + Sbjct: 335 KFEE----FLGDLIFKPPGKPTLVPNSDKRPAINVVNAKNEFNEIME 377 >gi|302873660|ref|YP_003842293.1| hypothetical protein Clocel_0759 [Clostridium cellulovorans 743B] gi|307686616|ref|ZP_07629062.1| hypothetical protein Ccel74_00591 [Clostridium cellulovorans 743B] gi|302576517|gb|ADL50529.1| Protein of unknown function DUF2800 [Clostridium cellulovorans 743B] Length = 388 Score = 191 bits (486), Expect = 2e-47, Method: Composition-based stats. Identities = 39/164 (23%), Positives = 75/164 (45%), Gaps = 12/164 (7%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FCRA+ C A A + L+ +LS+ ++++V++V I +W+ V A Sbjct: 226 EHCGFCRARKTCRARADQRLAMTKYDFKLPPLLSDEEIAEVLSVAEGISSWVNDVYAYAT 285 Query: 61 NV-LSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKR 119 N+ ++ G+ ++L EGR +R Y ++ V ++ E Y ++LL T E+L+ + Sbjct: 286 NLSINEGKRWSGFKLVEGRS-NRKYISEEAVIKVCNDNGITEIYTKSLLGITAMEKLLGK 344 Query: 120 KKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHL---KANISEF 160 + L + + GK +VP + ++F Sbjct: 345 DSFNSI----LGDLVEKPKGKPTLVPFSDKRKAIEINNMAEADF 384 >gi|331085758|ref|ZP_08334841.1| hypothetical protein HMPREF0987_01144 [Lachnospiraceae bacterium 9_1_43BFAA] gi|330406681|gb|EGG86186.1| hypothetical protein HMPREF0987_01144 [Lachnospiraceae bacterium 9_1_43BFAA] Length = 376 Score = 191 bits (486), Expect = 3e-47, Method: Composition-based stats. Identities = 40/164 (24%), Positives = 80/164 (48%), Gaps = 11/164 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 CRFC+A+ +C A A + L +LS+ ++++V++ ++ W + + A Sbjct: 215 EWCRFCKARFQCRARAEEYLCLAQMEFSQPALLSDEEIAEVLSKADALKKWAEEIYTYAQ 274 Query: 61 NVLS-SGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKR 119 N + ++ P ++L GR +R Y ++ +V + + Y +L+S TE E+L+ + Sbjct: 275 NEAITNRKEWPGFKLVLGRS-NRKYTDEEEVAEAAKTAGYTDIYRTSLISITEMEKLMGK 333 Query: 120 KKVSETTWEQLQKFITRKDGKQVIVPCDLPVNH--LKANISEFS 161 KK +E L ++ + DGK +VP + ++FS Sbjct: 334 KKFNEI----LGSYVYKPDGKVTLVPDSDKREAIYISTAEADFS 373 >gi|160937657|ref|ZP_02085017.1| hypothetical protein CLOBOL_02547 [Clostridium bolteae ATCC BAA-613] gi|158439302|gb|EDP17054.1| hypothetical protein CLOBOL_02547 [Clostridium bolteae ATCC BAA-613] Length = 391 Score = 191 bits (485), Expect = 3e-47, Method: Composition-based stats. Identities = 38/168 (22%), Positives = 66/168 (39%), Gaps = 11/168 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + CRFC+A+ C A L +L++ ++ +V+ ++ W+K V AL Sbjct: 225 DHCRFCKARVTCRVRAEYNLELTKLDFVDPALLTDEEIGEVLRRADELDHWVKDVTGFAL 284 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMREL--GDEAYN-RTLLSPTETEQLV 117 G ++L EG R Y + + + L E DE Y + L+ TE +L+ Sbjct: 285 AEALKGTKYEGWKLVEGTS-RRRYTDQDAIAMRLTTEGWEEDEIYKPQELIGITEMTKLI 343 Query: 118 KRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLKD 165 +KK E L + + +GK + P L + Sbjct: 344 GKKKFEEL----LSGLVIKPEGKPTLAPESDKRPELNRVAEAKQDFDN 387 >gi|315656938|ref|ZP_07909825.1| phage protein [Mobiluncus curtisii subsp. holmesii ATCC 35242] gi|315492893|gb|EFU82497.1| phage protein [Mobiluncus curtisii subsp. holmesii ATCC 35242] Length = 378 Score = 190 bits (484), Expect = 5e-47, Method: Composition-based stats. Identities = 38/164 (23%), Positives = 74/164 (45%), Gaps = 11/164 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+FC+ P C A A L L++ +++ V+ +P ++TW V+ AL+ Sbjct: 218 WCQFCKITPTCRARAEANLQLAKLEFAPPAELTDVEIADVLTRIPQLKTWASDVEAYALS 277 Query: 62 VL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 + G ++L GR R Y ++ V + + ++R L++ T E+L+ + Sbjct: 278 KAVNQGVVFEGFKLVAGRS-VRKYTSETDVAKAAEAAGYRDIWDRKLITLTAMEKLMGKP 336 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFSV 162 +E L +++T+ GK +VP L + ++FS Sbjct: 337 AFNEI----LSEYVTKPAGKPTLVPASDKRPALDLVSAATDFSN 376 >gi|150391704|ref|YP_001321753.1| hypothetical protein Amet_4012 [Alkaliphilus metalliredigens QYMF] gi|149951566|gb|ABR50094.1| conserved hypothetical protein [Alkaliphilus metalliredigens QYMF] Length = 378 Score = 190 bits (483), Expect = 5e-47, Method: Composition-based stats. Identities = 37/167 (22%), Positives = 76/167 (45%), Gaps = 11/167 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C FCRA RC A A + L +L++ ++ +++ ++P + W + A + Sbjct: 216 WCTFCRASTRCRARADEKLKLAQMEFKMPPLLTDSEIEEILMIIPDLTKWANEITAYATD 275 Query: 62 VL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 + G++ +++ EGR R Y +++ V Q + + Y ++L+ TE ++L+ + Sbjct: 276 AAVNHGKEWSGFKVVEGRS-VRKYKDEDAVAQKAVESGYKDIYRKSLIPLTEMQKLMGKT 334 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFSVLKD 165 K E L I + GK +VP + +EF+ + + Sbjct: 335 KFEEI----LGSLIVKPPGKPTLVPKTDKRVAMNVTNAKNEFNEIME 377 >gi|153940578|ref|YP_001391680.1| hypothetical protein CLI_2438 [Clostridium botulinum F str. Langeland] gi|152936474|gb|ABS41972.1| conserved hypothetical protein [Clostridium botulinum F str. Langeland] gi|295319709|gb|ADG00087.1| conserved hypothetical protein [Clostridium botulinum F str. 230613] Length = 381 Score = 190 bits (483), Expect = 6e-47, Method: Composition-based stats. Identities = 38/162 (23%), Positives = 74/162 (45%), Gaps = 10/162 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + C+FCRAK C A A K + Q L N ++ ++ + W K V+E AL Sbjct: 224 DHCKFCRAKAVCKARADKNMELAKYDFQEPITLDNNDVAFILGKADELINWAKDVQEYAL 283 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRTLLSPTETEQLVK 118 GE+ +++ EGR +R + ++ ++ ++L+ + E Y + L T E+ + Sbjct: 284 EQALQGEEFDGFKVVEGRS-NRKWTDEEKIGKILLGQGFLEDIIYTKKLTGITNMEKAIG 342 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEF 160 +K+V++ L +I + GK + + ++F Sbjct: 343 KKEVTKL----LGDYIIKPQGKPTLATITDKRPVYNSAEADF 380 >gi|168179453|ref|ZP_02614117.1| phage protein [Clostridium botulinum NCTC 2916] gi|182669541|gb|EDT81517.1| phage protein [Clostridium botulinum NCTC 2916] Length = 381 Score = 190 bits (482), Expect = 7e-47, Method: Composition-based stats. Identities = 38/162 (23%), Positives = 74/162 (45%), Gaps = 10/162 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + C+FCRAK C A A K + Q L N ++ ++ + W K V+E AL Sbjct: 224 DHCKFCRAKAVCKARADKNMELAKYDFQEPNTLDNNDIAFILGKADELINWAKDVQEYAL 283 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRTLLSPTETEQLVK 118 GE+ +++ EGR +R + ++ ++ ++L+ + E Y + L T E+ + Sbjct: 284 EQALKGEEFDGFKVVEGRS-NRKWTDEEKIGKILLGQGFLEDIIYTKKLTGITNMEKAIG 342 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEF 160 +K+V++ L +I + GK + + ++F Sbjct: 343 KKEVTKL----LGDYIIKPQGKPTLATITDKRPVYNSAEADF 380 >gi|153955281|ref|YP_001396046.1| hypothetical protein CKL_2663 [Clostridium kluyveri DSM 555] gi|219855704|ref|YP_002472826.1| hypothetical protein CKR_2361 [Clostridium kluyveri NBRC 12016] gi|146348139|gb|EDK34675.1| Hypothetical protein CKL_2663 [Clostridium kluyveri DSM 555] gi|219569428|dbj|BAH07412.1| hypothetical protein [Clostridium kluyveri NBRC 12016] Length = 378 Score = 188 bits (479), Expect = 1e-46, Method: Composition-based stats. Identities = 36/168 (21%), Positives = 75/168 (44%), Gaps = 11/168 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FCRA +C A A + L +L++ ++ +V++ L + W + A Sbjct: 215 EWCTFCRAAVKCRARAEEKLKLAQMEFKLPPLLTDSEIEEVLSKLSDLTKWANEIIAYAT 274 Query: 61 NVL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKR 119 + + G++ +++ EGR R Y ++ V + + Y ++L++ TE ++L+ + Sbjct: 275 DAAVNHGKEWHGFKVVEGRS-VRKYKDEEAVAEAAKANGYKDIYRQSLITLTEMQKLMGK 333 Query: 120 KKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFSVLKD 165 KK + L I + GK +VP + +EF+ + + Sbjct: 334 KKFEQI----LGGLIHKPPGKPTLVPNSDKRPAMNISNVKNEFNEITE 377 >gi|331090259|ref|ZP_08339146.1| hypothetical protein HMPREF1025_02729 [Lachnospiraceae bacterium 3_1_46FAA] gi|330401878|gb|EGG81453.1| hypothetical protein HMPREF1025_02729 [Lachnospiraceae bacterium 3_1_46FAA] Length = 376 Score = 188 bits (479), Expect = 2e-46, Method: Composition-based stats. Identities = 40/166 (24%), Positives = 73/166 (43%), Gaps = 9/166 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + CRFC+AK C A A + L ++LS ++++V+ V + W V A Sbjct: 215 DWCRFCKAKNTCRARAEEYLRLAQMEFKPPELLSEEEIAEVLKVADELAKWSADVYAYAQ 274 Query: 61 NVLS-SGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKR 119 + G ++L EGR +R Y N+ +V ++ Y ++L+ TE E+L+ + Sbjct: 275 DEAITHGRVWNGFKLVEGRS-NRKYVNEEEVADAAKAAGYEDIYKKSLIGITEMEKLMGK 333 Query: 120 KKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLKD 165 K + L + + GK +VP ++ +E +D Sbjct: 334 KDFQKI----LGSLVYKPQGKITLVPESDKRPPIQTETAEADFKED 375 >gi|134299051|ref|YP_001112547.1| hypothetical protein Dred_1188 [Desulfotomaculum reducens MI-1] gi|134051751|gb|ABO49722.1| conserved hypothetical protein [Desulfotomaculum reducens MI-1] Length = 397 Score = 188 bits (479), Expect = 2e-46, Method: Composition-based stats. Identities = 41/169 (24%), Positives = 75/169 (44%), Gaps = 15/169 (8%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 N C+FC+A+ C A A K L +L+ ++++V+ ++ W K V + AL Sbjct: 224 NHCQFCKARFTCRARADKNLELAKLDFQDPPLLTIDEIAEVLAKAEELQKWAKDVSDYAL 283 Query: 61 NVL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRT-LLSPTETEQL 116 + + P ++L EGR R Y+++ +V +L+ E Y L+ T E+ Sbjct: 284 DQAVNHDVKFPGWKLVEGRS-KRVYSDEKEVANVLLAANYAEDIIYKPKELVGITAMEKA 342 Query: 117 VKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHL---KANISEFSV 162 + +KK L I + GK V+VP + + ++F+ Sbjct: 343 LGKKKFGIL----LDGLIIKPAGKPVLVPESDKRPEICSTDSAKADFAE 387 >gi|295114048|emb|CBL32685.1| Protein of unknown function (DUF2800). [Enterococcus sp. 7L76] Length = 380 Score = 188 bits (479), Expect = 2e-46, Method: Composition-based stats. Identities = 40/167 (23%), Positives = 76/167 (45%), Gaps = 11/167 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FC AK C A A L ++LS+ ++ +++ + + +W +K+ AL Sbjct: 218 KWCGFCPAKNSCRARADHNLKLAQYEFKPPELLSDDEIEEIIGKVDDLVSWSNDIKDCAL 277 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 + G+ +++L EGR R Y+NDN V +++ + Y++ LL T + + ++ Sbjct: 278 KLALGGKQWTHHKLVEGRST-RKYSNDNDVAAAVIKAGY-DPYDKKLLGVTAMTKALGKE 335 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNH--LKANISEFSVLKD 165 K +E L ++I + GK +V + EF L + Sbjct: 336 KFNEI----LGEYIIKPKGKLTLVDGSDKRQAVTVNNLNEEFKPLTE 378 >gi|304389853|ref|ZP_07371812.1| phage protein [Mobiluncus curtisii subsp. curtisii ATCC 35241] gi|304327029|gb|EFL94268.1| phage protein [Mobiluncus curtisii subsp. curtisii ATCC 35241] Length = 378 Score = 188 bits (478), Expect = 2e-46, Method: Composition-based stats. Identities = 40/163 (24%), Positives = 71/163 (43%), Gaps = 11/163 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+FC+ P C A A L LS+ +++ V+ +P ++TW V+ AL+ Sbjct: 218 WCQFCKIAPTCRARAEANLQLAKLEFAPPAELSDVEIADVLTRIPQLKTWAADVEAYALS 277 Query: 62 VL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 + G ++L GR R Y+++ V + Y+R L++ T E+L+ + Sbjct: 278 KAVNQGVVFEGFKLVAGRS-VRKYSSETDVAAAAEAAGYRDIYDRKLITLTAMEKLMGKP 336 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFS 161 +E L +T+ GK +VP L + S+F Sbjct: 337 TFNEI----LGDLVTKPAGKPTLVPVSDKRPALDLVSAASDFQ 375 >gi|220930224|ref|YP_002507133.1| hypothetical protein Ccel_2859 [Clostridium cellulolyticum H10] gi|220000552|gb|ACL77153.1| conserved hypothetical protein [Clostridium cellulolyticum H10] Length = 378 Score = 188 bits (477), Expect = 3e-46, Method: Composition-based stats. Identities = 36/168 (21%), Positives = 74/168 (44%), Gaps = 11/168 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FCRA +C A A + L +L++ ++ +V++ L + W + A Sbjct: 215 EWCTFCRAAVKCRARAEEKLKLAQMEFKLPPLLTDSEIEEVLSKLSDLTKWANEIIAYAT 274 Query: 61 NVL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKR 119 + + G++ +++ EGR R Y ++ V + + Y + L++ TE ++L+ + Sbjct: 275 DAAVNHGKEWHGFKVVEGRS-VRKYKDEKAVAEAAKANGYKDIYRQNLITLTEMQKLMGK 333 Query: 120 KKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFSVLKD 165 KK + L I + GK +VP + +EF+ + + Sbjct: 334 KKFEQI----LGGLIHKPPGKPTLVPNSDKRPAMNISNVKNEFNEITE 377 >gi|284024532|ref|ZP_06378930.1| hypothetical protein Saura13_08075 [Staphylococcus aureus subsp. aureus 132] Length = 388 Score = 187 bits (476), Expect = 3e-46, Method: Composition-based stats. Identities = 40/163 (24%), Positives = 75/163 (46%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ K C A + + +LS+ ++++++ LP I+ W V++ ALN Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEQYALNQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + + L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRMITDTKAMLEKLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T + FI + GK + +K + +F L Sbjct: 350 FSKIT----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|300764692|ref|ZP_07074683.1| hypothetical protein LMHG_11076 [Listeria monocytogenes FSL N1-017] gi|300514578|gb|EFK41634.1| hypothetical protein LMHG_11076 [Listeria monocytogenes FSL N1-017] Length = 380 Score = 187 bits (476), Expect = 4e-46, Method: Composition-based stats. Identities = 40/167 (23%), Positives = 75/167 (44%), Gaps = 11/167 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FC AK C A A L ++LS+ ++ +++ + + +W +K+ AL Sbjct: 218 KWCGFCPAKNSCRARADHNLKLAQYEFKPPELLSDDEIEEIIGKVDDLVSWSNDIKDCAL 277 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 + G+ + +L EGR R Y+NDN V +++ + Y++ LL T + + ++ Sbjct: 278 KLALGGKQWTHQKLVEGRST-RKYSNDNDVAAAVIKAGY-DPYDKKLLGVTAMTKALGKE 335 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNH--LKANISEFSVLKD 165 K +E L ++I + GK +V + EF L + Sbjct: 336 KFNEI----LGEYIIKPKGKLTLVDGSDKRQAVTVNNLNEEFKPLTE 378 >gi|332983349|ref|YP_004464790.1| hypothetical protein Mahau_2843 [Mahella australiensis 50-1 BON] gi|332701027|gb|AEE97968.1| hypothetical protein Mahau_2843 [Mahella australiensis 50-1 BON] Length = 378 Score = 187 bits (474), Expect = 5e-46, Method: Composition-based stats. Identities = 35/166 (21%), Positives = 74/166 (44%), Gaps = 11/166 (6%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKE 57 + CRFC+A +C A A + + +LS+ ++S+++ + + +W + Sbjct: 212 VPGEHCRFCKAAVKCRARAESKMKLAAFEFALPPLLSDEEISEILASIGDLTSWANEIIA 271 Query: 58 EALNVL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQL 116 A + + G+ P +++ EGR +R Y+++ V + Y ++L++ TE E+L Sbjct: 272 YATDAAVNRGKQWPGFKVVEGRS-NRKYSDEGAVAAAAKAAGYRDIYRQSLITITEMEKL 330 Query: 117 VKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEF 160 + + K +E L I + K +VP + +F Sbjct: 331 MGKSKFNEV----LGGLIIKPLSKPTLVPLSDKRPPMNISNAKIDF 372 >gi|258455587|ref|ZP_05703544.1| conserved hypothetical protein [Staphylococcus aureus A5937] gi|257862275|gb|EEV85046.1| conserved hypothetical protein [Staphylococcus aureus A5937] Length = 388 Score = 187 bits (474), Expect = 6e-46, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 74/163 (45%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ C A + + +LS+ ++++++ LP I+ W V++ ALN Sbjct: 231 HCRFCKINHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEQYALNQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + + L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRMITDTKAMLEKLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T + FI + GK + +K + +F L Sbjct: 350 FSKIT----EGFIEKPQGKLTLATESDKRPAIKSSAEDDFDEL 388 >gi|257432435|ref|ZP_05608798.1| conserved hypothetical protein [Staphylococcus aureus subsp. aureus E1410] gi|257283314|gb|EEV13446.1| conserved hypothetical protein [Staphylococcus aureus subsp. aureus E1410] Length = 388 Score = 187 bits (474), Expect = 6e-46, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 74/163 (45%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ K C A + + +LS+ ++++++ LP I+ W V++ AL+ Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEQYALDQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRVITDTKATLEKLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T + FI + GK + +K + +F L Sbjct: 350 FSKIT----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|283469576|emb|CAQ48787.1| conserved phage-associated protein [Staphylococcus aureus subsp. aureus ST398] Length = 388 Score = 186 bits (473), Expect = 8e-46, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 72/163 (44%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ K C A + + +LS+ ++++++ LP I+ W V+ AL+ Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEHYALDQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + V L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRMITDTKAVRDRLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ + FI + GK + +K + +F L Sbjct: 350 FSKIA----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|262049892|ref|ZP_06022754.1| hypothetical protein SAD30_0348 [Staphylococcus aureus D30] gi|259162041|gb|EEW46621.1| hypothetical protein SAD30_0348 [Staphylococcus aureus D30] Length = 388 Score = 186 bits (472), Expect = 9e-46, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 74/163 (45%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ K C A + + +LS+ ++++++ LP I+ W V++ AL+ Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEKYALDQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + N + L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRMITDTNATLEKLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ + FI + GK + +K + +F L Sbjct: 350 FSKIA----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|21283156|ref|NP_646244.1| hypothetical protein MW1427 [Staphylococcus aureus subsp. aureus MW2] gi|87160012|ref|YP_494122.1| hypothetical protein SAUSA300_1425 [Staphylococcus aureus subsp. aureus USA300_FPR3757] gi|161509704|ref|YP_001575363.1| hypothetical protein USA300HOU_1475 [Staphylococcus aureus subsp. aureus USA300_TCH1516] gi|294848503|ref|ZP_06789249.1| hypothetical protein SKAG_00567 [Staphylococcus aureus A9754] gi|300911888|ref|ZP_07129331.1| hypothetical protein HMPREF0773_11540 [Staphylococcus aureus subsp. aureus TCH70] gi|21204596|dbj|BAB95292.1| hypothetical protein [Staphylococcus aureus subsp. aureus MW2] gi|87125986|gb|ABD20500.1| conserved hypothetical phage protein [Staphylococcus aureus subsp. aureus USA300_FPR3757] gi|160368513|gb|ABX29484.1| hypothetical bacteriophage protein [Staphylococcus aureus subsp. aureus USA300_TCH1516] gi|294824529|gb|EFG40952.1| hypothetical protein SKAG_00567 [Staphylococcus aureus A9754] gi|300886134|gb|EFK81336.1| hypothetical protein HMPREF0773_11540 [Staphylococcus aureus subsp. aureus TCH70] gi|302333143|gb|ADL23336.1| putative phage protein [Staphylococcus aureus subsp. aureus JKD6159] gi|315197710|gb|EFU28044.1| hypothetical bacteriophage protein [Staphylococcus aureus subsp. aureus CGS01] gi|320140646|gb|EFW32500.1| hypothetical protein HMPREF9528_01270 [Staphylococcus aureus subsp. aureus MRSA131] gi|320142782|gb|EFW34585.1| hypothetical protein HMPREF9529_01882 [Staphylococcus aureus subsp. aureus MRSA177] Length = 388 Score = 186 bits (472), Expect = 9e-46, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 74/163 (45%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ C A + + +LS+ ++++++ LP I+ W V++ ALN Sbjct: 231 HCRFCKINHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEQYALNQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + + L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRMITDTKAMLEKLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T + FI + GK + +K + +F L Sbjct: 350 FSKIT----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|323440321|gb|EGA98035.1| hypothetical protein SAO11_1058 [Staphylococcus aureus O11] Length = 388 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. Identities = 40/164 (24%), Positives = 73/164 (44%), Gaps = 9/164 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL 63 N CRFC+ K C A + + +LS+ ++++++ LP I+ W V++ AL Sbjct: 230 NHCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEQYALEQA 289 Query: 64 -SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRK 120 + ++ P ++L EGR R + + L+ ++ LLS T E+L+ +K Sbjct: 290 KENDKNYPGWKLVEGRS-RRMITDTKATLEKLVEAGYKPEDITETKLLSITNLEKLIGKK 348 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T FI + GK + +K + +F L Sbjct: 349 AFSKIT----DGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|215401123|ref|YP_002332378.1| hypothetical protein SauSIPLA35_gp15 [Staphylococcus phage phiSauS-IPLA35] gi|215260474|gb|ACJ64604.1| gp15 [Staphylococcus phage phiSauS-IPLA35] Length = 388 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. Identities = 40/163 (24%), Positives = 74/163 (45%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ K C A + + +LS+ ++++++ LP I+ W V++ AL+ Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEKYALDQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRMITDTKATLEKLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T + FI R GK + +K + +F L Sbjct: 350 FSKIT----EGFIERPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|148267471|ref|YP_001246414.1| hypothetical protein SaurJH9_1037 [Staphylococcus aureus subsp. aureus JH9] gi|150393524|ref|YP_001316199.1| hypothetical protein SaurJH1_1056 [Staphylococcus aureus subsp. aureus JH1] gi|147740540|gb|ABQ48838.1| hypothetical protein SaurJH9_1037 [Staphylococcus aureus subsp. aureus JH9] gi|149945976|gb|ABR51912.1| conserved hypothetical protein [Staphylococcus aureus subsp. aureus JH1] Length = 388 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 74/163 (45%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLS 64 CRFC+ K C A + + +LS+ ++++++ LP I+ W V+ AL+ Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEHYALDQAK 290 Query: 65 -SGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + L+ ++ LLS T+ E+L+ +K Sbjct: 291 GNDKNYPGWKLVEGRS-RRMITDTKATLEKLVEAGYKPEDITETKLLSITKLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T + FI + GK + +K + +F L Sbjct: 350 FSKIT----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|209363569|ref|YP_002267987.1| hypothetical protein phi2958PVL_gp17 [Staphylococcus phage phi2958PVL] gi|257428255|ref|ZP_05604653.1| conserved hypothetical protein [Staphylococcus aureus subsp. aureus 65-1322] gi|258419944|ref|ZP_05682903.1| conserved hypothetical protein [Staphylococcus aureus A9719] gi|258448863|ref|ZP_05696973.1| conserved hypothetical protein [Staphylococcus aureus A6224] gi|295407653|ref|ZP_06817442.1| hypothetical protein SMAG_02823 [Staphylococcus aureus A8819] gi|297246754|ref|ZP_06930569.1| hypothetical protein SLAG_02805 [Staphylococcus aureus A8796] gi|208973070|dbj|BAG74386.1| hypothetical protein [Staphylococcus phage phi2958PVL] gi|257275096|gb|EEV06583.1| conserved hypothetical protein [Staphylococcus aureus subsp. aureus 65-1322] gi|257844069|gb|EEV68459.1| conserved hypothetical protein [Staphylococcus aureus A9719] gi|257857900|gb|EEV80791.1| conserved hypothetical protein [Staphylococcus aureus A6224] gi|294967467|gb|EFG43507.1| hypothetical protein SMAG_02823 [Staphylococcus aureus A8819] gi|297176380|gb|EFH35653.1| hypothetical protein SLAG_02805 [Staphylococcus aureus A8796] Length = 388 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 74/163 (45%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ K C A + + +LS+ ++++++ LP I+ W V+ AL+ Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEHYALDQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + + L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRMITDTKAMLEKLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T + FI + GK + +K + +F L Sbjct: 350 FSKIT----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|282928295|ref|ZP_06335899.1| conserved hypothetical protein [Staphylococcus aureus A9765] gi|282591966|gb|EFB96999.1| conserved hypothetical protein [Staphylococcus aureus A9765] Length = 388 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 74/163 (45%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ C A + + +LS+ ++++++ LP I+ W V++ ALN Sbjct: 231 HCRFCKINHSCRTRAEYMQNVPQKPPYLLSDEEIAELLYKLPDIKKWADEVEQYALNQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + + L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRMITDTKAMLEKLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T + FI + GK + +K + +F L Sbjct: 350 FSKIT----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|258445213|ref|ZP_05693408.1| conserved hypothetical protein [Staphylococcus aureus A6300] gi|257855974|gb|EEV78894.1| conserved hypothetical protein [Staphylococcus aureus A6300] Length = 388 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 73/163 (44%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ C A + + +LS+ ++++++ LP I+ W V++ ALN Sbjct: 231 HCRFCKINHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEQYALNQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + N + L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRMITDTNATLEKLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ + FI + GK + +K + +F L Sbjct: 350 FSKIA----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|315128372|gb|EFT84383.1| hypothetical protein CGSSa03_07341 [Staphylococcus aureus subsp. aureus CGS03] Length = 388 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 74/163 (45%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ K C A + + +LS+ ++++++ LP I+ W V++ AL+ Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEQYALDQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + L+ E ++ LLS T E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRVITDTKATLEKLVEEGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ + FI + GK + +K + +F L Sbjct: 350 FSKIA----EGFIEKPQGKLTLATESDKRPAIKSSAEDDFDKL 388 >gi|329730598|gb|EGG66983.1| hypothetical protein SA21193_1535 [Staphylococcus aureus subsp. aureus 21193] Length = 388 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 73/163 (44%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ K C A + + +LS+ ++++++ LP I+ W V+ AL+ Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEHYALDQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRVITDTKATLEKLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T + FI + GK + +K + +F L Sbjct: 350 FSKIT----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|253317128|ref|ZP_04840341.1| hypothetical protein SauraC_13554 [Staphylococcus aureus subsp. aureus str. CF-Marseille] gi|258428685|ref|ZP_05688219.1| conserved hypothetical protein [Staphylococcus aureus A9299] gi|282905880|ref|ZP_06313735.1| gp15 [Staphylococcus aureus subsp. aureus Btn1260] gi|282911109|ref|ZP_06318911.1| gp15 [Staphylococcus aureus subsp. aureus WBG10049] gi|297207803|ref|ZP_06924237.1| phage protein [Staphylococcus aureus subsp. aureus ATCC 51811] gi|257849740|gb|EEV73704.1| conserved hypothetical protein [Staphylococcus aureus A9299] gi|282324804|gb|EFB55114.1| gp15 [Staphylococcus aureus subsp. aureus WBG10049] gi|282331172|gb|EFB60686.1| gp15 [Staphylococcus aureus subsp. aureus Btn1260] gi|296887519|gb|EFH26418.1| phage protein [Staphylococcus aureus subsp. aureus ATCC 51811] gi|312438061|gb|ADQ77132.1| phage protein [Staphylococcus aureus subsp. aureus TCH60] Length = 388 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 74/163 (45%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLS 64 CRFC+ K C A + + +LS+ ++++++ LP I+ W V+ AL+ Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEHYALDQAK 290 Query: 65 -SGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + L+ ++ LLS T+ E+L+ +K Sbjct: 291 GNDKNYPGWKLVEGRS-RRMITDTKATLEKLVEAGYKPEDITETKLLSITKLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T + FI + GK + +K + +F L Sbjct: 350 FSKIT----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|282919247|ref|ZP_06326982.1| hypothetical protein SASG_00553 [Staphylococcus aureus subsp. aureus C427] gi|282317057|gb|EFB47431.1| hypothetical protein SASG_00553 [Staphylococcus aureus subsp. aureus C427] Length = 388 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 74/163 (45%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ C A + + +LS+ ++++++ LP I+ W V++ ALN Sbjct: 231 HCRFCKINHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEQYALNQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + + L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRMITDTKAMLEKLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T + FI + GK + +K + +F L Sbjct: 350 FSKIT----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|282926445|ref|ZP_06334077.1| conserved hypothetical protein [Staphylococcus aureus A10102] gi|282591774|gb|EFB96845.1| conserved hypothetical protein [Staphylococcus aureus A10102] Length = 388 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 74/163 (45%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ K C A + + +LS+ ++++++ LP I+ W V+ AL+ Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEHYALDQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + L+ ++ LLS T+ E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRMITDTKATLEKLVEAGYKPEDITETKLLSITKLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T + FI + GK + +K + +F L Sbjct: 350 FSKIT----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|45569531|ref|NP_996600.1| hypothetical protein BMP-1p37 [Bordetella phage BMP-1] Length = 418 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 57/182 (31%), Positives = 92/182 (50%), Gaps = 20/182 (10%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILS----------------NRQLSQVMNVLP 46 E CRFC+AK C AL LST ++ +S N L ++ + Sbjct: 229 EKQCRFCKAKATCPALRDHVLSTVADDFVDVSKPVAPQIEHAAERTVDNAILGNLLGAVD 288 Query: 47 LIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLL--MRELGDEAYN 104 L+E+W K ++ +A L +G +P ++L EGR+G+R + ND +VE L MR ++ Y+ Sbjct: 289 LVESWCKAIRAKAEAELLAGHPVPGFKLVEGRRGARRWTNDAEVEAALKAMRLKVEQMYD 348 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNH--LKANISEFSV 162 +L+SPT E+L K + W +LQ FIT+ DGK + P ++A +F+ Sbjct: 349 LSLISPTTAEKLHKAGDIGPRQWPKLQGFITQSDGKPSVAPESDKRPALVIQAAADDFAD 408 Query: 163 LK 164 + Sbjct: 409 VS 410 >gi|49483721|ref|YP_040945.1| hypothetical protein SAR1543 [Staphylococcus aureus subsp. aureus MRSA252] gi|295428043|ref|ZP_06820675.1| hypothetical protein SIAG_00571 [Staphylococcus aureus subsp. aureus EMRSA16] gi|297590988|ref|ZP_06949626.1| phage protein [Staphylococcus aureus subsp. aureus MN8] gi|49241850|emb|CAG40542.1| hypothetical phage protein [Staphylococcus aureus subsp. aureus MRSA252] gi|295128401|gb|EFG58035.1| hypothetical protein SIAG_00571 [Staphylococcus aureus subsp. aureus EMRSA16] gi|297575874|gb|EFH94590.1| phage protein [Staphylococcus aureus subsp. aureus MN8] Length = 388 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 73/163 (44%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ K C A + +LS+ ++++++ LP I+ W V+ AL+ Sbjct: 231 HCRFCKIKHSCRTRAEYMQDVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEHYALDQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + + L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRMITDTKAMLEKLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T + FI + GK + +K + +F L Sbjct: 350 FSKIT----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|156603968|ref|YP_001429914.1| hypothetical protein SPTP3102_gp19 [Staphylococcus phage tp310-2] gi|154818054|gb|ABS87481.1| hypothetical protein [Staphylococcus phage tp310-2] Length = 388 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 74/163 (45%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ K C A + + +LS+ ++++++ LP I+ W V+ AL+ Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEHYALDQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + L+ ++ LLS T+ E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRMITDTKATLEKLVEAGYKPEDITETKLLSITKLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T + FI + GK + +K + +F L Sbjct: 350 FSKIT----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|71899881|ref|ZP_00682029.1| phage-related protein [Xylella fastidiosa Ann-1] gi|71730321|gb|EAO32404.1| phage-related protein [Xylella fastidiosa Ann-1] Length = 426 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 54/182 (29%), Positives = 85/182 (46%), Gaps = 19/182 (10%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILS----------------NRQLSQVMNVLP 46 E CRFC+AK C ALA L+T ++ L+ N L+ + Sbjct: 225 ETPCRFCKAKASCPALATHVLNTVADDFVDLTKPIVPQLSYAQLRTFDNTTLACLFGATE 284 Query: 47 LIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM--RELGDEAYN 104 LIE+W K +++ A L SG+ +P Y++ +GR G R + + E +L R + Y+ Sbjct: 285 LIESWCKSIRDRAAAELLSGQPVPGYKVVQGRHGPRRWADMTAAETMLKQLRIKFKDMYD 344 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKAN-ISEFSVL 163 +L+SPT E+L + + + W +LQ I R G V+VP L ++F L Sbjct: 345 VSLISPTRAEKLHQAGVIGDRQWPKLQPLIHRAAGAPVVVPTSDKRPALALQDATDFQDL 404 Query: 164 KD 165 D Sbjct: 405 SD 406 >gi|45580782|ref|NP_996648.1| hypothetical protein BIP-1p37 [Bordetella phage BIP-1] Length = 418 Score = 185 bits (469), Expect = 2e-45, Method: Composition-based stats. Identities = 57/182 (31%), Positives = 92/182 (50%), Gaps = 20/182 (10%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILS----------------NRQLSQVMNVLP 46 E CRFC+AK C AL LST ++ +S N L ++ + Sbjct: 229 EKQCRFCKAKATCPALRDHVLSTVADDFVDVSKPVAPQIEHAAERTVDNAILGNLLGAVD 288 Query: 47 LIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLL--MRELGDEAYN 104 L+E+W K ++ +A L +G +P ++L EGR+G+R + ND +VE L MR ++ Y+ Sbjct: 289 LVESWCKAIRAKAEAELLAGHPVPGFKLVEGRRGARRWTNDAEVEAALKAMRLKVEQMYD 348 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNH--LKANISEFSV 162 +L+SPT E+L K + W +LQ FIT+ DGK + P ++A +F+ Sbjct: 349 LSLISPTTAEKLHKAGDIGPRQWPKLQGFITQSDGKPSVAPESDKRPALVIQAAADDFAD 408 Query: 163 LK 164 + Sbjct: 409 VS 410 >gi|283470755|emb|CAQ49966.1| conserved phage-associated protein [Staphylococcus aureus subsp. aureus ST398] Length = 388 Score = 185 bits (469), Expect = 2e-45, Method: Composition-based stats. Identities = 38/163 (23%), Positives = 70/163 (42%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ K C A + + +LS+ ++++++ LP I+ W V++ AL Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEKYALEQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + + P ++L GR R + V L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKTYPGWKLVTGRS-RRVITDTKAVRDRLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ + FI + GK + +K + +F L Sbjct: 350 FSKIA----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|29028627|ref|NP_803316.1| phi APSE P51-like protein [Staphylococcus phage phi 12] gi|66395668|ref|YP_240048.1| ORF012 [Staphylococcus phage 47] gi|88195278|ref|YP_500081.1| phi APSE P51-like protein [Staphylococcus aureus subsp. aureus NCTC 8325] gi|18920551|gb|AAL82291.1| phi APSE P51-like protein [Staphylococcus phage phi 12] gi|62636092|gb|AAX91203.1| ORF012 [Staphylococcus phage 47] gi|87202836|gb|ABD30646.1| phi APSE P51-like protein [Staphylococcus aureus subsp. aureus NCTC 8325] gi|329724425|gb|EGG60935.1| hypothetical protein SA21189_1602 [Staphylococcus aureus subsp. aureus 21189] Length = 388 Score = 185 bits (469), Expect = 2e-45, Method: Composition-based stats. Identities = 38/163 (23%), Positives = 70/163 (42%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ K C A + + +LS+ ++++++ LP I+ W V++ AL Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEKYALEQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + + P ++L GR R + V L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKTYPGWKLVTGRS-RRVITDTKAVRDRLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ + FI + GK + +K + +F L Sbjct: 350 FSKIA----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|41179398|ref|NP_958707.1| Bbp38 [Bordetella phage BPP-1] gi|40950137|gb|AAR97703.1| Bbp38 [Bordetella phage BPP-1] Length = 418 Score = 185 bits (469), Expect = 2e-45, Method: Composition-based stats. Identities = 57/182 (31%), Positives = 92/182 (50%), Gaps = 20/182 (10%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILS----------------NRQLSQVMNVLP 46 E CRFC+AK C AL LST ++ +S N L ++ + Sbjct: 229 EKQCRFCKAKATCPALRDHVLSTVADDFVDVSKPVAPRIEHAAERTVDNAILGNLLGAVD 288 Query: 47 LIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLL--MRELGDEAYN 104 L+E+W K ++ +A L +G +P ++L EGR+G+R + ND +VE L MR ++ Y+ Sbjct: 289 LVESWCKAIRAKAEAELLAGHPVPGFKLVEGRRGARRWTNDAEVEAALKAMRLKVEQMYD 348 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNH--LKANISEFSV 162 +L+SPT E+L K + W +LQ FIT+ DGK + P ++A +F+ Sbjct: 349 LSLISPTTAEKLHKAGDIGPRQWPKLQGFITQSDGKPSVAPESDKRPALVIQAAADDFAD 408 Query: 163 LK 164 + Sbjct: 409 VS 410 >gi|71276264|ref|ZP_00652542.1| phage-related protein [Xylella fastidiosa Dixon] gi|71900319|ref|ZP_00682454.1| phage-related protein [Xylella fastidiosa Ann-1] gi|71162872|gb|EAO12596.1| phage-related protein [Xylella fastidiosa Dixon] gi|71729894|gb|EAO31990.1| phage-related protein [Xylella fastidiosa Ann-1] Length = 425 Score = 185 bits (469), Expect = 2e-45, Method: Composition-based stats. Identities = 54/182 (29%), Positives = 85/182 (46%), Gaps = 19/182 (10%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILS----------------NRQLSQVMNVLP 46 E CRFC+AK C ALA L+T ++ L+ N L+ + Sbjct: 225 EKPCRFCKAKASCPALATHVLNTVADDFVDLTKPIVPQLSYAALRTFDNTTLACLFGATE 284 Query: 47 LIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM--RELGDEAYN 104 LIE+W K +++ A L SG+ +P Y++ +GR G R + ++ E L R + Y+ Sbjct: 285 LIESWCKSIRDRAAAELLSGQPVPGYKVVQGRHGPRRWVDETAAEDALKQLRIKFKDMYD 344 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKAN-ISEFSVL 163 +L+SPT E+L + + + W +LQ I R G V+VP L ++F L Sbjct: 345 VSLISPTRAEKLYQAGVIGDRQWPKLQPLIHRAAGTPVVVPTSDKRPPLALQDATDFQDL 404 Query: 164 KD 165 D Sbjct: 405 SD 406 >gi|300933389|ref|ZP_07148645.1| phage-associated protein [Corynebacterium resistens DSM 45100] Length = 377 Score = 185 bits (469), Expect = 2e-45, Method: Composition-based stats. Identities = 42/167 (25%), Positives = 78/167 (46%), Gaps = 12/167 (7%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 CRFC+ P C A L+ LS+ +++QV+ LP ++ W V+ AL Sbjct: 217 EWCRFCKLAPTCRTRAEANLALAQHEFAPPAELSDAEIAQVLAQLPDLKAWAADVEAHAL 276 Query: 61 NVL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKR 119 ++ + G+ P ++L EGR R Y++++ V Q + ++R L + T E+ + + Sbjct: 277 SLAVNQGKTWPGFKLVEGRS-IRKYSDESAVAQAAEAAGV-DVWDRKLKTITALEKQLGK 334 Query: 120 KKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFSVLK 164 K+ + L + + GK +VP L+ + EF+ +K Sbjct: 335 KRFTTL----LGDLVVKPAGKPTLVPESDKRPALEIQSATDEFTAIK 377 >gi|300856816|ref|YP_003781800.1| phage-like protein [Clostridium ljungdahlii DSM 13528] gi|300436931|gb|ADK16698.1| phage-related protein [Clostridium ljungdahlii DSM 13528] Length = 383 Score = 184 bits (468), Expect = 3e-45, Method: Composition-based stats. Identities = 41/164 (25%), Positives = 74/164 (45%), Gaps = 11/164 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FCRAK C A A K + Q L N ++ ++ + W K V++ AL Sbjct: 225 EHCGFCRAKAVCKARADKNMELAKYEFQEPTTLDNDDIAYILGKAEELAKWAKDVQDYAL 284 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNR-TLLSPTETEQLV 117 + GE+ +++ EGR +R + ++N V + L + + LL + E+ + Sbjct: 285 DQALKGEEFTGFKVVEGRS-NRKFTDENMVAKTLYDNGYTDNVIFKPAQLLGISAMEKAI 343 Query: 118 KRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFS 161 +KK+++ L+ FI + GK V+VP + ++F Sbjct: 344 GKKKLNQL----LKGFIEKPQGKPVLVPETDKREVFNSAKADFQ 383 >gi|49485812|ref|YP_043033.1| hypothetical protein SAS0907 [Staphylococcus aureus subsp. aureus MSSA476] gi|49244255|emb|CAG42682.1| hypothetical phage protein [Staphylococcus aureus subsp. aureus MSSA476] Length = 388 Score = 184 bits (468), Expect = 3e-45, Method: Composition-based stats. Identities = 39/163 (23%), Positives = 74/163 (45%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ K C A + + +LS+ ++++++ LP I+ W V++ AL+ Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEQYALDQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRMITDTKATLEKLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T + FI + GK + +K + +F L Sbjct: 350 FSKIT----EGFIKKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|254781191|ref|YP_003065604.1| hypothetical protein CLIBASIA_05490 [Candidatus Liberibacter asiaticus str. psy62] gi|254040868|gb|ACT57664.1| hypothetical protein CLIBASIA_05490 [Candidatus Liberibacter asiaticus str. psy62] gi|317120753|gb|ADV02574.1| hypothetical protein UF506_005 [Candidatus Liberibacter asiaticus] Length = 165 Score = 184 bits (468), Expect = 3e-45, Method: Composition-based stats. Identities = 165/165 (100%), Positives = 165/165 (100%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 MDENACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEAL Sbjct: 1 MDENACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK Sbjct: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLKD 165 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLKD Sbjct: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLKD 165 >gi|15839112|ref|NP_299800.1| hypothetical protein XF2522 [Xylella fastidiosa 9a5c] gi|9107728|gb|AAF85320.1|AE004059_10 phage-related protein [Xylella fastidiosa 9a5c] Length = 425 Score = 184 bits (468), Expect = 3e-45, Method: Composition-based stats. Identities = 57/182 (31%), Positives = 87/182 (47%), Gaps = 19/182 (10%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILS----------------NRQLSQVMNVLP 46 E CRFC+AK C ALA L+T ++ L+ N L+ + Sbjct: 225 ETPCRFCKAKASCPALATHVLTTVADDFVDLTQPVAPQLSHAALRTFDNTTLASLFGATA 284 Query: 47 LIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLL--MRELGDEAYN 104 LIE+W K +++ A L SG+ +P Y+L +GR+G R + ++ E L MR ++ Sbjct: 285 LIESWCKSIRDRAAAELLSGQPVPGYKLVQGRQGPRRWVDETAAEDALIQMRIGVAHLHD 344 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKAN-ISEFSVL 163 +L+SPT E+L + + E+ W QLQ I R G V+VP L ++F L Sbjct: 345 VSLISPTSAEKLHQAGIIGESQWTQLQPLIHRATGAPVVVPTSDQRPPLALQDATDFEDL 404 Query: 164 KD 165 D Sbjct: 405 SD 406 >gi|307578621|gb|ADN62590.1| hypothetical protein XFLM_02955 [Xylella fastidiosa subsp. fastidiosa GB514] Length = 425 Score = 184 bits (467), Expect = 4e-45, Method: Composition-based stats. Identities = 53/182 (29%), Positives = 86/182 (47%), Gaps = 19/182 (10%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILS----------------NRQLSQVMNVLP 46 E CRFC+AK C ALA L+T ++ L+ N L+ + Sbjct: 225 ETPCRFCKAKASCPALATHVLNTVADDFVDLTKPIVPQLSYAQLRTFDNTTLACLFGATE 284 Query: 47 LIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM--RELGDEAYN 104 LIE+W K +++ A L SG+ +P Y++ +GR+G R + + E +L R + Y+ Sbjct: 285 LIESWCKSIRDRAAAELLSGQPVPGYKVVQGRQGPRRWADMTAAEAMLKQLRIKFKDMYD 344 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKAN-ISEFSVL 163 +L+SPT E+L + + + W +LQ I R G ++VP L ++F L Sbjct: 345 VSLISPTRAEKLHQVGVIGDRQWPKLQPLIHRSTGAPIVVPTSDKRPALALQDATDFQDL 404 Query: 164 KD 165 D Sbjct: 405 SD 406 >gi|182681745|ref|YP_001829905.1| hypothetical protein XfasM23_1203 [Xylella fastidiosa M23] gi|182631855|gb|ACB92631.1| hypothetical protein XfasM23_1203 [Xylella fastidiosa M23] Length = 425 Score = 184 bits (467), Expect = 4e-45, Method: Composition-based stats. Identities = 53/182 (29%), Positives = 86/182 (47%), Gaps = 19/182 (10%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILS----------------NRQLSQVMNVLP 46 E CRFC+AK C ALA L+T ++ L+ N L+ + Sbjct: 225 ETPCRFCKAKASCPALATHVLNTVADDFVDLTKPIVPQLSYAQLRTFDNTTLACLFGATE 284 Query: 47 LIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM--RELGDEAYN 104 LIE+W K +++ A L SG+ +P Y++ +GR+G R + + E +L R + Y+ Sbjct: 285 LIESWCKSIRDRAAAELLSGQPVPGYKVVQGRQGPRRWADMTAAEAMLKQLRIKFKDMYD 344 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKAN-ISEFSVL 163 +L+SPT E+L + + + W +LQ I R G ++VP L ++F L Sbjct: 345 VSLISPTRAEKLHQVGVIGDRQWPKLQPLIHRSTGAPIVVPTSDKRPALALQDATDFQDL 404 Query: 164 KD 165 D Sbjct: 405 SD 406 >gi|323443492|gb|EGB01108.1| hypothetical protein SAO46_0717 [Staphylococcus aureus O46] Length = 388 Score = 184 bits (467), Expect = 4e-45, Method: Composition-based stats. Identities = 40/163 (24%), Positives = 75/163 (46%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ K C A + + +LS+ ++++++ LP I+ W V++ AL+ Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEKYALDQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ P ++L EGR R + + L+ E ++ LLS T E+L+ +K Sbjct: 291 ENDKNYPGWKLVEGRS-RRVITDTKATLEKLVEEGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ T + FI + GK + +K + +F L Sbjct: 350 FSKIT----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|307580174|gb|ADN64143.1| hypothetical protein XFLM_11440 [Xylella fastidiosa subsp. fastidiosa GB514] Length = 425 Score = 184 bits (467), Expect = 4e-45, Method: Composition-based stats. Identities = 53/182 (29%), Positives = 86/182 (47%), Gaps = 19/182 (10%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILS----------------NRQLSQVMNVLP 46 E CRFC+AK C ALA L+T ++ L+ N L+ + Sbjct: 225 ETPCRFCKAKASCPALATHVLNTVADDFVDLTKPIVPQLSYAQLRTFDNTTLACLFGATE 284 Query: 47 LIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM--RELGDEAYN 104 LIE+W K +++ A L SG+ +P Y++ +GR+G R + + E +L R + Y+ Sbjct: 285 LIESWCKSIRDRAAAELLSGQPVPGYKVVQGRQGPRRWADMTAAEAMLKQLRIKFKDMYD 344 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKAN-ISEFSVL 163 +L+SPT E+L + + + W +LQ I R G ++VP L ++F L Sbjct: 345 VSLISPTRAEKLHQVGVIGDRQWPKLQPLIHRSTGAPIVVPTSDKRPALALQDATDFQDL 404 Query: 164 KD 165 D Sbjct: 405 SD 406 >gi|170730327|ref|YP_001775760.1| hypothetical protein Xfasm12_1179 [Xylella fastidiosa M12] gi|167965120|gb|ACA12130.1| conserved hypothetical protein [Xylella fastidiosa M12] Length = 425 Score = 184 bits (467), Expect = 4e-45, Method: Composition-based stats. Identities = 55/182 (30%), Positives = 87/182 (47%), Gaps = 19/182 (10%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILS----------------NRQLSQVMNVLP 46 E CRFC+AK C ALA L+T ++ L+ N L+ + Sbjct: 225 EKPCRFCKAKASCPALATHVLNTVADDFVDLTKPIVPQLSHAALRTFDNTTLACLFGATE 284 Query: 47 LIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM--RELGDEAYN 104 LIE+W K V+ +A L SG+ +P Y++ +GR+G+R + + E +L R + Y+ Sbjct: 285 LIESWCKAVRAKAAAELLSGQPVPGYKVVQGRQGARRWADMTAAEAMLKQLRIKFKDMYD 344 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKAN-ISEFSVL 163 +L+SPT E+L + + + W +LQ I R G V+VP L ++F L Sbjct: 345 VSLISPTTAEKLHQAGVIGDRQWPKLQPLIHRAAGTPVVVPTSDKRPPLALQDATDFQDL 404 Query: 164 KD 165 D Sbjct: 405 SD 406 >gi|260579054|ref|ZP_05846953.1| phage protein [Corynebacterium jeikeium ATCC 43734] gi|300933487|ref|ZP_07148743.1| phage-associated protein [Corynebacterium resistens DSM 45100] gi|258602805|gb|EEW16083.1| phage protein [Corynebacterium jeikeium ATCC 43734] Length = 377 Score = 184 bits (467), Expect = 4e-45, Method: Composition-based stats. Identities = 42/167 (25%), Positives = 77/167 (46%), Gaps = 12/167 (7%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 CRFC+ P C A L+ S+ +++QV+ LP ++ W V+ AL Sbjct: 217 EWCRFCKLAPTCRTRAEANLALAKYEFAPPAEFSDAEIAQVLAQLPDLKAWAADVEAHAL 276 Query: 61 NVL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKR 119 ++ + G+ P ++L EGR R Y+++ V Q + ++R L + T E+ + + Sbjct: 277 SLAVNQGKTWPGFKLVEGRS-IRKYSDEAAVAQTAEAAGV-DVWDRKLKTITALEKQLGK 334 Query: 120 KKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFSVLK 164 K+ S+ L + + GK +VP L+ + EF+ +K Sbjct: 335 KRFSDL----LGDLVVKPAGKPTLVPESDKRPALEIQSATDEFTAIK 377 >gi|28199023|ref|NP_779337.1| hypothetical protein PD1131 [Xylella fastidiosa Temecula1] gi|28199599|ref|NP_779913.1| hypothetical protein PD1724 [Xylella fastidiosa Temecula1] gi|182682340|ref|YP_001830500.1| hypothetical protein XfasM23_1823 [Xylella fastidiosa M23] gi|28057121|gb|AAO28986.1| phage-related protein [Xylella fastidiosa Temecula1] gi|28057714|gb|AAO29562.1| phage-related protein [Xylella fastidiosa Temecula1] gi|182632450|gb|ACB93226.1| hypothetical protein XfasM23_1823 [Xylella fastidiosa M23] Length = 425 Score = 183 bits (466), Expect = 5e-45, Method: Composition-based stats. Identities = 53/182 (29%), Positives = 86/182 (47%), Gaps = 19/182 (10%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILS----------------NRQLSQVMNVLP 46 E CRFC+AK C ALA L+T ++ L+ N L+ + Sbjct: 225 ETPCRFCKAKASCPALATHVLNTVADDFVDLTKPIVPQLSYAQLRTFDNTTLACLFGATE 284 Query: 47 LIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM--RELGDEAYN 104 LIE+W K +++ A L SG+ +P Y++ +GR+G R + + E +L R + Y+ Sbjct: 285 LIESWCKSIRDRAAAELLSGQPVPGYKVVQGRQGPRRWADMTAAEAMLKQLRIKFKDMYD 344 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKAN-ISEFSVL 163 +L+SPT E+L + + + W +LQ I R G ++VP L ++F L Sbjct: 345 VSLISPTRAEKLHQVGVIGDRQWPKLQPLIHRSTGAPIVVPTSDKRPALALQDATDFQDL 404 Query: 164 KD 165 D Sbjct: 405 SD 406 >gi|328913311|gb|AEB64907.1| putative protein p51 [Bacillus amyloliquefaciens LL3] Length = 392 Score = 183 bits (466), Expect = 5e-45, Method: Composition-based stats. Identities = 42/168 (25%), Positives = 74/168 (44%), Gaps = 12/168 (7%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 CRFCRAK C A A K L ++LS L +++ ++ W+K ++E L Sbjct: 230 EHCRFCRAKANCSARAEKNLELARFDFQKPELLSKEDLGKILYEAEELKRWVKDIQEYTL 289 Query: 61 NVL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGD---EAYNRTLLSPTETEQL 116 GE +P ++L EGR +R Y + + V+ L+ E + +L + E+ Sbjct: 290 AQAEHHGEKIPGWKLVEGRS-NRKYADVDAVKNTLLAEGYEPEQVLSKPEVLGVSALEKS 348 Query: 117 VKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLK 164 + +K +E L+ + + GK +VP L + S + + Sbjct: 349 IGKKAFNEL----LKDLVIKPVGKPTLVPESDKRPELNSTESAIADFE 392 >gi|298253793|ref|ZP_06977382.1| DUF2800 family protein [Gardnerella vaginalis 5-1] gi|297532129|gb|EFH71102.1| DUF2800 family protein [Gardnerella vaginalis 5-1] Length = 388 Score = 183 bits (466), Expect = 5e-45, Method: Composition-based stats. Identities = 33/155 (21%), Positives = 70/155 (45%), Gaps = 9/155 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + CRFCRAK C A + L +L++ ++ +V+ ++P + W V A Sbjct: 227 DWCRFCRAKAVCRKRAEENLKLAEFEFKPPSVLTDSEIEEVLTLIPQLTKWADDVLAYAT 286 Query: 61 N-VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKR 119 + ++ G++ ++L EGR R + ++ V + + + +L+ TE ++++ + Sbjct: 287 DSAINHGKEWSGFKLVEGRS-VRKFKDETAVIEKAKAHGFTDIFKTSLIGLTEMQKMMGK 345 Query: 120 KKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK 154 KK + L + + GK +VP + Sbjct: 346 KKFEDI----LGDLVIKPSGKLTLVPDSDKRAKVN 376 >gi|308174989|ref|YP_003921694.1| protein p51 [Bacillus amyloliquefaciens DSM 7] gi|307607853|emb|CBI44224.1| putative protein p51 [Bacillus amyloliquefaciens DSM 7] Length = 392 Score = 183 bits (466), Expect = 5e-45, Method: Composition-based stats. Identities = 42/168 (25%), Positives = 75/168 (44%), Gaps = 12/168 (7%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + CRFCRAK C A A K L ++LS L +++ ++ W+K ++E L Sbjct: 230 DHCRFCRAKANCSARAKKNLELARFDFQKPELLSKEDLGKILYEAEELKRWVKDIQEYTL 289 Query: 61 NVL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGD---EAYNRTLLSPTETEQL 116 GE +P ++L EGR +R Y + + V+ L+ E + +L + E+ Sbjct: 290 AQAEHHGEKIPGWKLVEGRS-NRKYADVDAVKNTLLAEGYEPEQVLSKPEVLGVSALEKS 348 Query: 117 VKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLK 164 + +K +E L+ + + GK +VP L + S + + Sbjct: 349 IGKKAFNEL----LKDLVIKPVGKPTLVPESDKRPELNSTESAIADFE 392 >gi|66395600|ref|YP_239979.1| ORF012 [Staphylococcus phage 3A] gi|62635946|gb|AAX91057.1| ORF012 [Staphylococcus phage 3A] Length = 388 Score = 183 bits (465), Expect = 6e-45, Method: Composition-based stats. Identities = 38/163 (23%), Positives = 73/163 (44%), Gaps = 9/163 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL- 63 CRFC+ K C A + + +LS+ ++++++ LP I+ W V++ AL+ Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVEKYALDQAK 290 Query: 64 SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRKK 121 + ++ ++L EGR R + N + L+ ++ LLS T E+L+ +K Sbjct: 291 ENDKNYSGWKLVEGRS-RRMITDTNATLEKLVEAGYKPEDITETKLLSITNLEKLIGKKA 349 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 S+ + FI + GK + +K + +F L Sbjct: 350 FSKIA----EGFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 388 >gi|331087496|ref|ZP_08336432.1| hypothetical protein HMPREF1025_00015 [Lachnospiraceae bacterium 3_1_46FAA] gi|330410476|gb|EGG89907.1| hypothetical protein HMPREF1025_00015 [Lachnospiraceae bacterium 3_1_46FAA] Length = 176 Score = 183 bits (465), Expect = 7e-45, Method: Composition-based stats. Identities = 35/168 (20%), Positives = 72/168 (42%), Gaps = 11/168 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FC+ +C A A + + +L++ ++ +V++VLP + W + A Sbjct: 13 EWCTFCKVSVKCRARAEEKMKLARLEFKMPPLLTDAEIEEVLDVLPDLTKWANEITAYAT 72 Query: 61 NVL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKR 119 G++ +++ EGR +R Y ++ V + + Y +TL+ TE ++L+ + Sbjct: 73 EAAIHHGKEWNGFKVVEGRS-NRKYRDELLVAEAAREHGYTDIYRQTLIPMTEMQKLMGK 131 Query: 120 KKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFSVLKD 165 E L I + GK ++VP + +EF + + Sbjct: 132 SAFEEI----LGDLIYKPPGKPILVPNTDKRPAMNVTNAENEFDKIME 175 >gi|323484116|ref|ZP_08089486.1| phage-associated protein [Clostridium symbiosum WAL-14163] gi|323402558|gb|EGA94886.1| phage-associated protein [Clostridium symbiosum WAL-14163] Length = 379 Score = 183 bits (464), Expect = 8e-45, Method: Composition-based stats. Identities = 43/165 (26%), Positives = 74/165 (44%), Gaps = 9/165 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 CRFC+AK C A A + L Q +LS+ ++++V+ V + W V A + Sbjct: 217 WCRFCKAKNTCRARAEEYLRLAQMEFQTPALLSDEEIAEVLKVADDLAKWAADVYAFATD 276 Query: 62 VLS-SGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 G+ ++L EGR +R Y ++ +V + + Y TL+ TE E+L+ +K Sbjct: 277 EAITHGKQWAGFKLVEGRS-NRKYTDEEEVAEAAKAAGYTDIYKSTLVGITEMEKLMGKK 335 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLKD 165 K SE L K + + GK +V + +E ++ Sbjct: 336 KFSEV----LGKLVYKPQGKITLVTESDKREAVMTATAEADFKEE 376 >gi|273810429|ref|YP_003344900.1| gp10 [Xylella phage Xfas53] gi|257097804|gb|ACV41110.1| gp10 [Xylella phage Xfas53] Length = 426 Score = 183 bits (464), Expect = 9e-45, Method: Composition-based stats. Identities = 54/182 (29%), Positives = 87/182 (47%), Gaps = 19/182 (10%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILS----------------NRQLSQVMNVLP 46 E CRFC+AK C ALA L+T ++ L+ N L+ + Sbjct: 225 ETPCRFCKAKASCPALATHVLNTVADDFVDLTKPIVPQLSYAQLRTFDNTTLACLFGATE 284 Query: 47 LIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM--RELGDEAYN 104 LIE+W K ++++A L SG+ +P Y++ +GR+G R + + E +L R + Y+ Sbjct: 285 LIESWCKSIRDKAAAQLLSGQPVPGYKVVQGRQGPRRWADVTAAEAMLKQLRIKSKDMYD 344 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKAN-ISEFSVL 163 +L+SPT E+L + + + W +LQ I R G V+VP L ++F L Sbjct: 345 ISLISPTTAEKLHQAGVIGDRQWPKLQPLIHRATGAPVVVPTSDKRPPLTLQDATDFQDL 404 Query: 164 KD 165 D Sbjct: 405 SD 406 >gi|71898930|ref|ZP_00681097.1| phage-related protein [Xylella fastidiosa Ann-1] gi|71731342|gb|EAO33406.1| phage-related protein [Xylella fastidiosa Ann-1] Length = 426 Score = 182 bits (463), Expect = 1e-44, Method: Composition-based stats. Identities = 54/182 (29%), Positives = 85/182 (46%), Gaps = 19/182 (10%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILS----------------NRQLSQVMNVLP 46 E CRFC+AK C ALA L+T ++ L+ N L+ + Sbjct: 225 ETPCRFCKAKASCPALATHVLNTVADDFVDLTKPIVPQLSYAALRTFDNTTLACLFGATE 284 Query: 47 LIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM--RELGDEAYN 104 LIE+W K +++ A L SG+ +P Y++ +GR G R + + E +L R + Y+ Sbjct: 285 LIESWCKSIRDRAAAELLSGQPVPGYKVVQGRHGPRRWADMTAAEAMLKQLRIKFKDMYD 344 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKAN-ISEFSVL 163 +L+SPT E+L + + + W +LQ I R G V+VP L ++F L Sbjct: 345 VSLISPTRAEKLYQAGVIGDRQWPKLQPLIHRAAGTPVVVPTSDKRPPLALQDATDFQDL 404 Query: 164 KD 165 D Sbjct: 405 SD 406 >gi|297242720|ref|ZP_06926658.1| conserved hypothetical protein [Gardnerella vaginalis AMD] gi|296888931|gb|EFH27665.1| conserved hypothetical protein [Gardnerella vaginalis AMD] Length = 376 Score = 182 bits (463), Expect = 1e-44, Method: Composition-based stats. Identities = 35/155 (22%), Positives = 70/155 (45%), Gaps = 9/155 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFS---EHMQILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + CRFCRAK C A + L + +L++ ++ +V+ +P + W V A Sbjct: 215 DWCRFCRAKAVCRKRAEENLKLAELEFKEPSVLTDSEIEEVLKTIPTLTKWADDVLAYAT 274 Query: 61 N-VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKR 119 + ++ G++ ++L EGR R + ++ V + + + +L+ TE ++L+ + Sbjct: 275 DSAINHGKEWSGFKLVEGRS-VRKFKDETAVIEKAKAAGFTDIFKTSLIGLTEMQKLMGK 333 Query: 120 KKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK 154 KK + L I + GK +VP + Sbjct: 334 KKFEDI----LGDLIIKPSGKLTLVPDSDKRAKVN 364 >gi|71901329|ref|ZP_00683425.1| phage-related protein [Xylella fastidiosa Ann-1] gi|71728913|gb|EAO31048.1| phage-related protein [Xylella fastidiosa Ann-1] Length = 426 Score = 182 bits (462), Expect = 1e-44, Method: Composition-based stats. Identities = 54/182 (29%), Positives = 85/182 (46%), Gaps = 19/182 (10%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILS----------------NRQLSQVMNVLP 46 E CRFC+AK C ALA L+T ++ L+ N L+ + Sbjct: 225 ETPCRFCKAKASCPALATHVLNTVADDFVDLTKPIVPQLSYAALRTFDNTTLACLFGATE 284 Query: 47 LIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM--RELGDEAYN 104 LIE+W K +++ A L SG+ +P Y++ +GR G R + + E +L R + Y+ Sbjct: 285 LIESWCKSIRDRAAAELLSGQPVPGYKVVQGRHGPRRWADMTAAEAMLKQLRIKFKDMYD 344 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKAN-ISEFSVL 163 +L+SPT E+L + + + W +LQ I R G V+VP L ++F L Sbjct: 345 VSLISPTRAEKLYQAGVIGDRQWPKLQPLIHRAAGTPVVVPTSDKRPALPLQDATDFQDL 404 Query: 164 KD 165 D Sbjct: 405 SD 406 >gi|298346389|ref|YP_003719076.1| phage-associated protein [Mobiluncus curtisii ATCC 43063] gi|298236450|gb|ADI67582.1| phage-associated protein [Mobiluncus curtisii ATCC 43063] Length = 354 Score = 182 bits (462), Expect = 2e-44, Method: Composition-based stats. Identities = 38/161 (23%), Positives = 66/161 (40%), Gaps = 10/161 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+FC C A A + LS +++ V+ +P + W V++ AL Sbjct: 194 WCQFCPIAATCRARAEANMKLAKHEFAPPAELSADEVANVLTRIPELTKWASDVQDYALG 253 Query: 62 VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKK 121 GE ++L GR R Y ++ V + + Y + LL+ T E+L+ RK+ Sbjct: 254 KALEGEHYAGFKLVAGRS-IRKYADEAAVAEAAKAAGYKDIYKQQLLTITAMEKLMGRKQ 312 Query: 122 VSETTWEQLQKFITRKDGKQVIVPCDLPVNHL--KANISEF 160 +E L + + +GK +VP L + +F Sbjct: 313 FAEV----LGDLVVKPEGKPTLVPLSDKRPELSMSSAADDF 349 >gi|253581723|ref|ZP_04858947.1| phage protein [Fusobacterium varium ATCC 27725] gi|251836072|gb|EES64609.1| phage protein [Fusobacterium varium ATCC 27725] Length = 385 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. Identities = 38/170 (22%), Positives = 73/170 (42%), Gaps = 10/170 (5%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKE 57 + N C+FCRAK C A L +LS+ ++ + + + W + +K+ Sbjct: 219 IAGNHCKFCRAKATCRERARMNLEVAKFDFKEPALLSDEEVGEALKQAQDLAKWAEDLKD 278 Query: 58 EALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRTLLSPTETEQ 115 AL G+ +P ++ EGR GSR + ++ + ++LM DE Y R L+ + E+ Sbjct: 279 YALAESLKGKLIPGWKAVEGR-GSRVFTDNEEALKVLMGSGIDETMLYERKQLTLAQIEK 337 Query: 116 LVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLKD 165 ++ K+ E + + + GK +V + + K+ Sbjct: 338 VLGPKQFKEL----VGNMVEKSPGKPTLVLDTDKREAISNKTTAVEDFKE 383 >gi|304439196|ref|ZP_07399114.1| phage protein [Peptoniphilus duerdenii ATCC BAA-1640] gi|304372328|gb|EFM25916.1| phage protein [Peptoniphilus duerdenii ATCC BAA-1640] Length = 370 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. Identities = 38/162 (23%), Positives = 68/162 (41%), Gaps = 11/162 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FC+AK +C A + L + LS+ ++ +++ L +E W+K +K AL Sbjct: 212 EWCIFCKAKNKCRKRAEENLKLAQDEFTLPPELSDEEIEEILPKLDELEQWVKDIKTYAL 271 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 G + +L EGR +R Y ++++V + + LL T +L+ +K Sbjct: 272 ERAMKGHKWKDLKLVEGRS-NRKYRDEDEVINKVKELGF-NPFEEKLLGITAMTKLLGKK 329 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEF 160 E + + + GK +V D +K EF Sbjct: 330 VFDE----NIVDLLEKPKGKLTLVSIDDKREEVKIDNVKEEF 367 >gi|227875069|ref|ZP_03993214.1| phage-associated protein [Mobiluncus mulieris ATCC 35243] gi|304390312|ref|ZP_07372265.1| phage protein [Mobiluncus curtisii subsp. curtisii ATCC 35241] gi|306817356|ref|ZP_07451101.1| phage protein [Mobiluncus mulieris ATCC 35239] gi|227844347|gb|EEJ54511.1| phage-associated protein [Mobiluncus mulieris ATCC 35243] gi|304326068|gb|EFL93313.1| phage protein [Mobiluncus curtisii subsp. curtisii ATCC 35241] gi|304649797|gb|EFM47077.1| phage protein [Mobiluncus mulieris ATCC 35239] Length = 378 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. Identities = 37/163 (22%), Positives = 70/163 (42%), Gaps = 11/163 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+FC+ P C A A L+ LS+ +++ V+ +P ++TW V+ AL+ Sbjct: 218 WCQFCKIAPTCRARAEANLALARLEFAPPAELSDSEIADVLARIPQLKTWAADVEAYALS 277 Query: 62 VL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 + G ++L GR R Y ++ V + ++R L++ T E+L+ + Sbjct: 278 QAVNQGVVFEGFKLVAGRS-IRKYTSETDVAAAAEAAGYRDIWDRKLITLTAMERLMGKP 336 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFS 161 +E L +T+ GK +V L + ++F Sbjct: 337 AFNEI----LGDLVTKPAGKPTLVLASDKRPALDLVSAATDFQ 375 >gi|212712320|ref|ZP_03320448.1| hypothetical protein PROVALCAL_03408 [Providencia alcalifaciens DSM 30120] gi|212685066|gb|EEB44594.1| hypothetical protein PROVALCAL_03408 [Providencia alcalifaciens DSM 30120] Length = 428 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. Identities = 53/185 (28%), Positives = 85/185 (45%), Gaps = 24/185 (12%) Query: 3 ENACRFCRAKPR-CGALAVKALSTFSEHMQILS------------------NRQLSQVMN 43 E CRFC+AK C A A AL L+ QL+++ Sbjct: 238 EKQCRFCKAKGGLCAAEAQHALELVKGDFVDLTAPLEGQLSEAPQRITVLKPSQLAEIYK 297 Query: 44 VLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM--RELGDE 101 L +E + K ++ + L+SG +P +L G+ G+RT+ N+ + E L R +E Sbjct: 298 GLDFVENFCKTLRIRVSDELNSGHTIPGLKLVTGKLGNRTWGNETEAEATLKAFRLKREE 357 Query: 102 AYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEF 160 YN L+SPT+ E+L+K+ + W +L+ ITR +GK V+ P L ++F Sbjct: 358 MYNMKLISPTQAEKLIKKD--NPRRWTKLESLITRAEGKPVVAFESDPRPALVINPENDF 415 Query: 161 SVLKD 165 + D Sbjct: 416 DDVSD 420 >gi|297587103|ref|ZP_06945748.1| phage protein [Finegoldia magna ATCC 53516] gi|297575084|gb|EFH93803.1| phage protein [Finegoldia magna ATCC 53516] Length = 374 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. Identities = 37/167 (22%), Positives = 70/167 (41%), Gaps = 11/167 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FC+AK +C A + L E LS+ ++ +++ L +E W+K +K AL Sbjct: 212 EWCIFCKAKNKCRKRAEENLKLAQEEFILPPELSDDEIEEILPRLDELEQWVKDIKAYAL 271 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 G + +L EGR +R Y ++++V + + + LL T +L+ +K Sbjct: 272 ERAMKGHRWKDLKLVEGRS-NRKYRDEDEVVKKVKELGF-NPFEEKLLGITAMTKLLGKK 329 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNH--LKANISEFSVLKD 165 E + + + GK +V + ++ EF + Sbjct: 330 VFDE----NISDLLEKPKGKLTLVNINDKREEVVIENVKEEFGGFNN 372 >gi|15837288|ref|NP_297976.1| hypothetical protein XF0686 [Xylella fastidiosa 9a5c] gi|9105568|gb|AAF83496.1|AE003912_8 phage-related protein [Xylella fastidiosa 9a5c] Length = 425 Score = 181 bits (459), Expect = 4e-44, Method: Composition-based stats. Identities = 58/182 (31%), Positives = 86/182 (47%), Gaps = 19/182 (10%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILS----------------NRQLSQVMNVLP 46 E CRFC+AK C ALA L+T ++ L+ N L+ + Sbjct: 225 ETPCRFCKAKASCPALATHVLTTVADDFVDLTQPVAPQLSHAALRTFDNTTLASLFGATE 284 Query: 47 LIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLL--MRELGDEAYN 104 LIE+W K V+ +A L SG +P Y+L +GR+G R + ++ E L MR ++ Sbjct: 285 LIESWCKAVRAKAAAELRSGNAVPGYKLVQGRQGPRRWVDETAAEDALIQMRIGVAHLHD 344 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKAN-ISEFSVL 163 +L+SPT E+L + + E+ W QLQ I R G V+VP L ++F L Sbjct: 345 VSLISPTSAEKLHQAGIIGESQWTQLQPLIHRATGAPVVVPTSDQRPPLALQDATDFEDL 404 Query: 164 KD 165 D Sbjct: 405 SD 406 >gi|205375252|ref|ZP_03228042.1| hypothetical protein Bcoam_19809 [Bacillus coahuilensis m4-4] Length = 384 Score = 180 bits (458), Expect = 4e-44, Method: Composition-based stats. Identities = 47/168 (27%), Positives = 76/168 (45%), Gaps = 12/168 (7%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKE 57 + + CRFC+ K C A A + L +LS+ ++ +V+N + + +W K V+E Sbjct: 222 IAGDHCRFCKVKSTCRARAEENLKLACMDFQKPPLLSDEEVVEVLNTIDQLVSWAKDVQE 281 Query: 58 EALNVLSSGED-LPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRTLLSPTETE 114 AL S P +L GR GSR Y ++ V Q L D Y +TL + T E Sbjct: 282 FALTKAISENKLWPRMKLVXGR-GSRKYADEEAVLQALTDAGYDHDLIYKKTLNTITTLE 340 Query: 115 QLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFS 161 + + +K +E L +T+ GK +VP +K + +F Sbjct: 341 KELGKKTFNEL----LGPLVTKAPGKIKLVPEQDKRPEIKASPAVDFQ 384 >gi|293400010|ref|ZP_06644156.1| phage protein [Erysipelotrichaceae bacterium 5_2_54FAA] gi|291306410|gb|EFE47653.1| phage protein [Erysipelotrichaceae bacterium 5_2_54FAA] Length = 392 Score = 180 bits (458), Expect = 4e-44, Method: Composition-based stats. Identities = 39/166 (23%), Positives = 70/166 (42%), Gaps = 10/166 (6%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSE---HMQILSNRQLSQVMNVLPLIETWMKGVKEE 58 E CR+CRAK C A A +L+N ++S+++ + + W VKE Sbjct: 225 SEETCRWCRAKGACKARAEYNEHLRRYGFMDPDLLNNEEISKILAGVDELIRWATDVKEY 284 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRTLLSPTETEQL 116 AL+ + G +P +++ EGR R ++ + + +E Y + L S T+ E+L Sbjct: 285 ALDAMLKGAQIPGFKVVEGRSL-RKVTDEKLLVNNMKDAGYEEALLYEKKLHSITKLEKL 343 Query: 117 VKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 +K + + I + G I P + +S+F Sbjct: 344 AGKKDFAIIS----AGCIEKPKGSPAIAPMSDKRPEYNSGVSDFQE 385 >gi|297583084|ref|YP_003698864.1| hypothetical protein Bsel_0769 [Bacillus selenitireducens MLS10] gi|297141541|gb|ADH98298.1| conserved hypothetical protein [Bacillus selenitireducens MLS10] Length = 375 Score = 179 bits (454), Expect = 1e-43, Method: Composition-based stats. Identities = 39/165 (23%), Positives = 77/165 (46%), Gaps = 11/165 (6%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEE 58 C+FCRA+ RC A A + L+ S +L++ +++ V+ + + +W + VK Sbjct: 213 SGEWCQFCRAQVRCRAKAEEKLALASYEFVNPNLLTDEEIADVLKRIEDLASWGQKVKTY 272 Query: 59 ALNV-LSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLV 117 A+N ++ G+ P ++L G+ R Y ++ V + L ++ Y ++LL T+ E+ + Sbjct: 273 AINASINQGKQWPGFKLVAGKS-QRKYRDEQAVARALHEAGHEDVYKQSLLPLTKLEKKL 331 Query: 118 KRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNH--LKANISEF 160 R + + +TR K +VP + S+F Sbjct: 332 GRTAFQDL----VGPHLTRSTTKPTLVPVSDERPESLIGTAKSDF 372 >gi|66395460|ref|YP_239829.1| ORF010 [Staphylococcus phage 2638A] gi|62635887|gb|AAX90998.1| ORF010 [Staphylococcus phage 2638A] Length = 388 Score = 178 bits (453), Expect = 1e-43, Method: Composition-based stats. Identities = 35/164 (21%), Positives = 71/164 (43%), Gaps = 9/164 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVL 63 CRFC+ K C A A L ++ +LS+ ++++++ +P + W ++ AL + Sbjct: 230 EHCRFCKIKHSCRARAQYMLDIPNKPAHLLSDNEIAELLYKVPDFKKWADELESYALEQM 289 Query: 64 -SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVKRK 120 ++ ++L EGR R + V+ L+ + LLS T E+ + +K Sbjct: 290 TEHDKNYDGWKLVEGRS-KRVMTDTEAVKDKLIENGFKLENITETKLLSITNLEKKIGKK 348 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 +E + +FI + GK + +K + +F + Sbjct: 349 AFNEI----VGEFIFKPQGKLTLAKESDKRPAIKQSAEEDFDQI 388 >gi|71901488|ref|ZP_00683575.1| phage-related protein [Xylella fastidiosa Ann-1] gi|71728744|gb|EAO30888.1| phage-related protein [Xylella fastidiosa Ann-1] Length = 426 Score = 178 bits (453), Expect = 2e-43, Method: Composition-based stats. Identities = 53/182 (29%), Positives = 84/182 (46%), Gaps = 19/182 (10%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQILS----------------NRQLSQVMNVLP 46 E CRFC+AK C ALA L+T ++ L+ N L+ + Sbjct: 225 ETPCRFCKAKASCPALATHVLNTVADDFVDLTKPIVPQLSYAALRTFDNTTLACLFGATE 284 Query: 47 LIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLL--MRELGDEAYN 104 LIE+W K +++ A L SG+ +P Y++ +GR+G R + ++ E L MR ++ Sbjct: 285 LIESWCKSIRDRAAAELLSGQPVPGYKVVQGRQGPRRWVDETAAEDALIQMRIGLSHLHD 344 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKAN-ISEFSVL 163 +L+SP E+L + + W QLQ I R G ++VP L ++F L Sbjct: 345 VSLISPASAEKLHQAGVLDLQQWVQLQPLIHRSTGAPIVVPTSDKRPALPLQDATDFQDL 404 Query: 164 KD 165 D Sbjct: 405 SD 406 >gi|320530600|ref|ZP_08031652.1| hypothetical protein HMPREF9555_01757 [Selenomonas artemidis F0399] gi|320137127|gb|EFW29057.1| hypothetical protein HMPREF9555_01757 [Selenomonas artemidis F0399] Length = 385 Score = 177 bits (449), Expect = 5e-43, Method: Composition-based stats. Identities = 42/167 (25%), Positives = 73/167 (43%), Gaps = 11/167 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 CRFC+A +C ALA L + +L++ +++ V++ + + + + VK AL Sbjct: 217 EHCRFCKAAVQCKALADHQLELAKLEFKDAGLLTDDEVAFVLSRVDGLVRYAEKVKTFAL 276 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNR-TLLSPTETEQLV 117 + G P Y++ EGR R ++ + LL + + Y + + T E+LV Sbjct: 277 DEALKGHRWPGYKVVEGRSD-RKIGDEAKAVTLLRKAGYSDDVIYKPLEIQTITNLEKLV 335 Query: 118 KRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLK 164 +KK + L IT+ GK + P D EF+ L Sbjct: 336 TKKKFAAL----LADVITKPPGKPALAPVDDKRPEYSPAAMEFNDLD 378 >gi|269120026|ref|YP_003308203.1| hypothetical protein Sterm_1409 [Sebaldella termitidis ATCC 33386] gi|268613904|gb|ACZ08272.1| conserved hypothetical protein [Sebaldella termitidis ATCC 33386] Length = 389 Score = 177 bits (449), Expect = 5e-43, Method: Composition-based stats. Identities = 34/168 (20%), Positives = 70/168 (41%), Gaps = 10/168 (5%) Query: 4 NACRFCRAKPRCGALAV--KALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+FC+AK C A A L E +L+N + ++ + ++ W+ +K E + Sbjct: 222 EHCKFCKAKASCRARAANYFTLEALKEKGPLLTNEEKAEALQRGAELDKWLGELKVEIFS 281 Query: 62 VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRTLLSPTETEQLVKR 119 + GE++ +++ +GR R + + + L E Y R +L+ + E ++ + Sbjct: 282 AIEKGEEVKGWKIVQGRSAGRKFTDTDAAVGKLKEHGIAEELLYERKMLTVPQMETVIGK 341 Query: 120 KKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFSVLKD 165 K E + + GK + P + + FSV+ + Sbjct: 342 KDFKEW----VGDMVETIPGKPTLAPESDKREAISKRKSKDVFSVISN 385 >gi|325478681|gb|EGC81792.1| hypothetical protein HMPREF9290_0162 [Anaerococcus prevotii ACS-065-V-Col13] Length = 370 Score = 175 bits (445), Expect = 1e-42, Method: Composition-based stats. Identities = 42/162 (25%), Positives = 71/162 (43%), Gaps = 11/162 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FC+A RC A + L E LS+ ++ +++ L +E W+K +K AL Sbjct: 212 EWCIFCKANNRCRKRAEENLKLAQEEFILPPELSDDEIEEILPKLDQLEQWVKDIKAYAL 271 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 G + +L EGR +R Y ++++V + + RELG + LL T +L+ +K Sbjct: 272 ERAMRGHRWKDLKLVEGRS-NRRYRDEDEVVKKV-RELGFNPFEEKLLGITAMTKLLGKK 329 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEF 160 E + + + GK +V +K EF Sbjct: 330 VFDE----NITDLLEKPKGKLTLVSVRDKREEVKIDNVKEEF 367 >gi|313813476|gb|EFS51190.1| conserved hypothetical protein [Propionibacterium acnes HL025PA1] gi|315106929|gb|EFT78905.1| conserved hypothetical protein [Propionibacterium acnes HL030PA1] Length = 380 Score = 175 bits (444), Expect = 2e-42, Method: Composition-based stats. Identities = 41/163 (25%), Positives = 73/163 (44%), Gaps = 11/163 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+FCR P C A A L+ LS +++ V+ +P ++TW V+ AL+ Sbjct: 218 WCQFCRIAPTCRARAEANLALAQHEFAPPAELSGSEIADVLTRIPQLKTWAADVEAYALS 277 Query: 62 VL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 + G+ ++L GR R Y +N V + + Y+R L++ T E+L+ +K Sbjct: 278 QAVNQGKHWDGFKLVAGRS-VRKYTEENAVAEAAEAAGYADIYDRRLITLTAMERLMDKK 336 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFS 161 +E L + + GK +V L+ + +EF+ Sbjct: 337 TFNEV----LGDLVVKPVGKPTLVSDTDKRPALEIHSAQTEFT 375 >gi|227530263|ref|ZP_03960312.1| phage-associated protein [Lactobacillus vaginalis ATCC 49540] gi|227349817|gb|EEJ40108.1| phage-associated protein [Lactobacillus vaginalis ATCC 49540] Length = 379 Score = 175 bits (444), Expect = 2e-42, Method: Composition-based stats. Identities = 35/164 (21%), Positives = 67/164 (40%), Gaps = 11/164 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEH---MQILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+F A +L++ ++ +V+ + + W VK+ A N Sbjct: 219 WCQFSACNAVLRARFDYHHKLTRFQLRSPNLLTDAEVIEVLKHIDDLNRWAHEVKDYAAN 278 Query: 62 -VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 ++ G+ P +++ EGR R Y ++N V ++ + Y R LL T+ E+ + +K Sbjct: 279 LAINHGKQWPGFKIVEGRSTRR-YKDENAVAKIAEANGIHDIYQRKLLPITKLEKQLGKK 337 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHL--KANISEFSV 162 K +E ++ I + GK +VP + EF Sbjct: 338 KFTELFSQE----IVKPAGKPTLVPNSDRRQSIGKSNPQDEFKE 377 >gi|50843072|ref|YP_056299.1| phage-associated protein [Propionibacterium acnes KPA171202] gi|50840674|gb|AAT83341.1| conserved phage-associated protein [Propionibacterium acnes KPA171202] Length = 380 Score = 175 bits (443), Expect = 2e-42, Method: Composition-based stats. Identities = 41/163 (25%), Positives = 73/163 (44%), Gaps = 11/163 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+FCR P C A A L+ LS +++ V+ +P ++TW V+ AL+ Sbjct: 218 WCQFCRIAPTCRARAEANLALAQHEFAPPAELSGSEIADVLTRIPQLKTWAADVEAYALS 277 Query: 62 VL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 + G+ ++L GR R Y +N V + + Y+R L++ T E+L+ +K Sbjct: 278 QAVNQGKHWDGFKLVAGRS-VRKYTEENAVAEAAEAAGYADIYDRRLITLTAMERLMDKK 336 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFS 161 +E L + + GK +V L+ + +EF+ Sbjct: 337 TFNEV----LGDLVVKPVGKPTLVSDTDKRPALEIHSVQTEFT 375 >gi|256847834|ref|ZP_05553279.1| conserved hypothetical protein [Lactobacillus coleohominis 101-4-CHN] gi|256715523|gb|EEU30499.1| conserved hypothetical protein [Lactobacillus coleohominis 101-4-CHN] Length = 379 Score = 174 bits (442), Expect = 3e-42, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 67/164 (40%), Gaps = 11/164 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEH---MQILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+F A +L++ Q+++V+ + + W +K+ A + Sbjct: 219 WCQFSACNAVLRARFDYHHKLTRFQLRSPNLLTDSQVTEVLEHIDDLNRWAHEIKDYAAD 278 Query: 62 -VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 ++ G+ P Y++ EGR R Y ++ V ++ + Y + LL T+ E+ V +K Sbjct: 279 LAINHGKQWPGYKIVEGRS-VRHYKDEAAVAKIAEEHGYHDIYQKKLLPITKLEKQVGKK 337 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHL--KANISEFSV 162 K +E ++ I + GK +VP + EF Sbjct: 338 KFTELFSQE----IVKPAGKPTLVPDSDSRQSIGKSNPKDEFKE 377 >gi|256617074|ref|ZP_05473920.1| conserved hypothetical protein [Enterococcus faecalis ATCC 4200] gi|257088363|ref|ZP_05582724.1| conserved hypothetical protein [Enterococcus faecalis D6] gi|256596601|gb|EEU15777.1| conserved hypothetical protein [Enterococcus faecalis ATCC 4200] gi|256996393|gb|EEU83695.1| conserved hypothetical protein [Enterococcus faecalis D6] Length = 380 Score = 174 bits (442), Expect = 3e-42, Method: Composition-based stats. Identities = 42/167 (25%), Positives = 70/167 (41%), Gaps = 11/167 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMN---VLPLIETWMKGVKEEAL 60 C FC AK C A A + L Q + ++ + + +W +KE AL Sbjct: 218 KWCGFCPAKNSCRARAEQNLELSKYEFQKPELLEDDEIEEILEKVDDLISWSNDIKEYAL 277 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 + SG+ N++L EGR R Y+NDN V +++ + Y++ LL T + + + Sbjct: 278 QLALSGKQWANHKLVEGRST-RKYSNDNDVAAAVIKAGY-DPYDKKLLGVTAMTKALGKA 335 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNH--LKANISEFSVLKD 165 K E L +I + K +V D + EF+ L + Sbjct: 336 KFDEL----LSDYIVKPPCKLTLVTNDDKRQAVKINNVNDEFNNLAE 378 >gi|237738626|ref|ZP_04569107.1| conserved hypothetical protein [Fusobacterium sp. 2_1_31] gi|229424109|gb|EEO39156.1| conserved hypothetical protein [Fusobacterium sp. 2_1_31] Length = 388 Score = 174 bits (441), Expect = 4e-42, Method: Composition-based stats. Identities = 37/170 (21%), Positives = 69/170 (40%), Gaps = 13/170 (7%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQI---LSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C+FC+AK C A L + LS ++ +++ + W + + E AL Sbjct: 220 EHCKFCKAKSICKERANVNLELAKYEFKAADQLSLEEIGEILKKAQDLAEWAEDLNEYAL 279 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRTLLSPTETEQLVK 118 G ++P ++ GR GSR++ N ++ ++L E Y R L+ + E+++ Sbjct: 280 AESLKGNNVPGWKAVNGR-GSRSFKNTDEAIKVLKENGIAEELLYERKYLTLAQIEKVIG 338 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKA---NISEFSVLKD 165 +K + + I GK +V + EFS + D Sbjct: 339 KKDFNNL----VGDLIVMNVGKPTLVEASDKREAITNKIKAEDEFSAVDD 384 >gi|312897722|ref|ZP_07757138.1| conserved hypothetical protein [Megasphaera micronuciformis F0359] gi|310621106|gb|EFQ04650.1| conserved hypothetical protein [Megasphaera micronuciformis F0359] Length = 385 Score = 173 bits (440), Expect = 5e-42, Method: Composition-based stats. Identities = 38/169 (22%), Positives = 74/169 (43%), Gaps = 15/169 (8%) Query: 4 NACRFCRAKPRCGALAVKALST-----FSEHMQILSNRQLSQVMNVLPLIETWMKGVKEE 58 + CRFCRAK +C S ++++ +L + + + W++ +KE Sbjct: 222 DWCRFCRAKQQCKTRYESNDSLYPELSARHDPRLITLEELGEYLKRGRDMAAWLEDMKEY 281 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRTLLSPTETEQL 116 AL+ +G D+P ++ EGR GSR + + ++ +L++ DE Y R +L+ + E+ Sbjct: 282 ALSESLAGADVPGWKAVEGR-GSRAFTDTDEAVDILIKNGIDESVLYERRVLTLAQMEKA 340 Query: 117 VKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK---ANISEFSV 162 V +K E + + + GK +V + F+ Sbjct: 341 VGKKAFGEI----VGNLVVKNPGKPTLVEESDKRPKITNQPTAADVFNS 385 >gi|227544662|ref|ZP_03974711.1| phage protein [Lactobacillus reuteri CF48-3A] gi|300909421|ref|ZP_07126882.1| phage protein [Lactobacillus reuteri SD2112] gi|227185387|gb|EEI65458.1| phage protein [Lactobacillus reuteri CF48-3A] gi|300893286|gb|EFK86645.1| phage protein [Lactobacillus reuteri SD2112] Length = 378 Score = 173 bits (440), Expect = 6e-42, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 67/164 (40%), Gaps = 11/164 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEH---MQILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+F A +L++ ++++V+ + + W VK+ A + Sbjct: 219 WCQFSACNAVLRARYNYHHKLTRFQLRSPSLLTDSEVAEVLEHIDDLNRWAHEVKDYAAD 278 Query: 62 -VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 ++ G+ P Y++ EGR R Y ++ V ++ Y + LL T+ E+ + +K Sbjct: 279 LAINHGKRWPGYKIVEGRSIRR-YKDEKAVAKIAETNGIHNIYQQKLLPITKLEKQLGKK 337 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHL--KANISEFSV 162 K +E ++ I + GK ++VP + EF Sbjct: 338 KFAELFSQE----IVKPAGKPILVPNSDRRQSIGKSNPKDEFKE 377 >gi|313621750|gb|EFR92490.1| putative protein p51 [Listeria innocua FSL J1-023] Length = 393 Score = 173 bits (439), Expect = 7e-42, Method: Composition-based stats. Identities = 45/156 (28%), Positives = 79/156 (50%), Gaps = 11/156 (7%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+FCRA+ C A A L+ S ++L ++++V+ + +++W++ VK AL Sbjct: 228 WCKFCRARGICKARAEANLALTSYDFKDPRLLQPIEIAKVLGQVAELKSWVEDVKSFALK 287 Query: 62 VLSSGE-DLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVK 118 SG + P ++L EGR +R Y + V+ +L E +E ++ L+S T+ E+LV Sbjct: 288 EAESGGIEFPGWKLVEGRS-NRRYVDAEMVQAMLELEGYSEEELLSKKLISLTDMEKLVG 346 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK 154 +K+VS L I + GK +V +K Sbjct: 347 KKQVSAI----LGDLIEKPAGKPTLVVETDKRQAIK 378 >gi|227544277|ref|ZP_03974326.1| phage-associated protein [Lactobacillus reuteri CF48-3A] gi|300909466|ref|ZP_07126927.1| phage protein [Lactobacillus reuteri SD2112] gi|227185737|gb|EEI65808.1| phage-associated protein [Lactobacillus reuteri CF48-3A] gi|300893331|gb|EFK86690.1| phage protein [Lactobacillus reuteri SD2112] Length = 379 Score = 173 bits (439), Expect = 8e-42, Method: Composition-based stats. Identities = 32/164 (19%), Positives = 66/164 (40%), Gaps = 11/164 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEH---MQILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+F A +L++ ++++V+ + + W +K+ A + Sbjct: 219 WCQFSACNAVLRARFDYHHKLTRFQLRSPNLLTDSEVTEVLEHIDDLNRWAHEIKDYAAD 278 Query: 62 -VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 ++ G+ P Y++ EGR R Y ++ V ++ Y + LL T+ E+ + +K Sbjct: 279 LAINHGKQWPGYKIVEGRS-VRHYKDEAAVAKIAEANGYHNIYQKKLLPITKLEKQLGKK 337 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHL--KANISEFSV 162 K +E ++ I + GK +VP + EF Sbjct: 338 KFAELFSQE----IVKPAGKPTLVPNSDRRQSIGKSNPQDEFKE 377 >gi|116630094|ref|YP_815266.1| hypothetical protein LGAS_1471 [Lactobacillus gasseri ATCC 33323] gi|238854129|ref|ZP_04644476.1| conserved phage-associated protein [Lactobacillus gasseri 202-4] gi|116095676|gb|ABJ60828.1| hypothetical protein LGAS_1471 [Lactobacillus gasseri ATCC 33323] gi|238833205|gb|EEQ25495.1| conserved phage-associated protein [Lactobacillus gasseri 202-4] Length = 379 Score = 173 bits (439), Expect = 8e-42, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 67/164 (40%), Gaps = 11/164 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEH---MQILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+F A +L+++++++V+ + + W VK+ A + Sbjct: 219 WCQFSACNAVLRARYDYHHKLTRFQLCSPNLLTDKEVTEVLEHIDDLNRWAHEVKDYAAD 278 Query: 62 -VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 ++ G+ P Y++ EGR R Y N+ V ++ Y + LL T+ E+ + +K Sbjct: 279 LAINHGKQWPGYKIVEGRS-VRHYKNEAAVAKIAEANGYHNIYQKKLLPITKLEKQLGKK 337 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHL--KANISEFSV 162 K +E ++ I + GK +VP + EF Sbjct: 338 KFTELFSQE----IVKPAGKPTLVPNSDRRQGIGKSNPKDEFKE 377 >gi|238018838|ref|ZP_04599264.1| hypothetical protein VEIDISOL_00698 [Veillonella dispar ATCC 17748] gi|237864604|gb|EEP65894.1| hypothetical protein VEIDISOL_00698 [Veillonella dispar ATCC 17748] Length = 380 Score = 172 bits (437), Expect = 1e-41, Method: Composition-based stats. Identities = 45/170 (26%), Positives = 84/170 (49%), Gaps = 13/170 (7%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM----QILSNRQLSQVMNVLPLIETWMKGVKEEA 59 + C FC+A+ C ALA L TF ++ Q+L++R++S ++ + LI W+KGV + A Sbjct: 214 DYCNFCKARHTCRALADTCLDTFYKNGGKLNQLLTDREVSDILAMKDLITKWIKGVYDFA 273 Query: 60 LNVLSSG-EDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYN-RTLLSPTETEQ 115 SG + P Y+L EG RT + + + L+ ++ + R L T ++ Sbjct: 274 YEKALSGEKQWPGYKLVEGTS-RRTITDPDAAAKTLLDNGYKEEDIFKPRELEGITNLQK 332 Query: 116 LVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLKD 165 ++ +K V+E L+ +I + +GK +VP + + + +D Sbjct: 333 VLGKKGVAE----YLEAYIEKPEGKPTLVPESDKRPAINTVETMMNEFED 378 >gi|212499745|ref|YP_002308553.1| hypothetical protein APSE241 [Bacteriophage APSE-2] gi|238898736|ref|YP_002924418.1| APSE-2 prophage; hypothetical [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] gi|211731714|gb|ACJ10202.1| conserved hypothetical protein [Bacteriophage APSE-2] gi|229466496|gb|ACQ68270.1| APSE-2 prophage; conserved hypothetical [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] Length = 439 Score = 172 bits (436), Expect = 2e-41, Method: Composition-based stats. Identities = 46/184 (25%), Positives = 91/184 (49%), Gaps = 24/184 (13%) Query: 4 NACRFCRAKPR-CGALAVKA------------------LSTFSEHMQILSNRQLSQVMNV 44 C FC+AK C A A LS + + +L+ Q++++ Sbjct: 250 KQCLFCKAKGGLCFAQAQFVHNEVKGDFVDLTQPLASQLSDAPKRITLLTPEQMAKLYPH 309 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM--RELGDEA 102 + LIE++ K ++ L +G+ +P ++L G++G+RT+ ++ + E LL + ++ Sbjct: 310 VDLIESFCKALRNRVAEALHTGQSVPGFKLVTGKQGNRTWGDEREAETLLKGAKLKQEQI 369 Query: 103 YNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNH-LKANISEFS 161 Y++ ++SP + E+L+K+ K W +L+ I R DGK VI P P + +++F Sbjct: 370 YHKKIISPPQAEKLLKKDK--PHRWAKLEALIERADGKPVIAPESDPRPAIITTPLNDFD 427 Query: 162 VLKD 165 + + Sbjct: 428 DVTE 431 >gi|75906034|gb|ABA29383.1| conserved hypothetical protein [Bacteriophage APSE-2] Length = 448 Score = 172 bits (436), Expect = 2e-41, Method: Composition-based stats. Identities = 46/184 (25%), Positives = 91/184 (49%), Gaps = 24/184 (13%) Query: 4 NACRFCRAKPR-CGALAVKA------------------LSTFSEHMQILSNRQLSQVMNV 44 C FC+AK C A A LS + + +L+ Q++++ Sbjct: 259 KQCLFCKAKGGLCFAQAQFVHNEVKGDFVDLTQPLASQLSDAPKRITLLTPEQMAKLYPH 318 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM--RELGDEA 102 + LIE++ K ++ L +G+ +P ++L G++G+RT+ ++ + E LL + ++ Sbjct: 319 VDLIESFCKALRNRVAEALHTGQSVPGFKLVTGKQGNRTWGDEREAETLLKGAKLKQEQI 378 Query: 103 YNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNH-LKANISEFS 161 Y++ ++SP + E+L+K+ K W +L+ I R DGK VI P P + +++F Sbjct: 379 YHKKIISPPQAEKLLKKDK--PHRWAKLEALIERADGKPVIAPESDPRPAIITTPLNDFD 436 Query: 162 VLKD 165 + + Sbjct: 437 DVTE 440 >gi|297585293|ref|YP_003701073.1| hypothetical protein Bsel_3026 [Bacillus selenitireducens MLS10] gi|297143750|gb|ADI00508.1| conserved hypothetical protein [Bacillus selenitireducens MLS10] Length = 388 Score = 172 bits (435), Expect = 2e-41, Method: Composition-based stats. Identities = 38/165 (23%), Positives = 71/165 (43%), Gaps = 12/165 (7%) Query: 5 ACRFCRAKPRCGALAVKALSTFS---EHMQILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+FC+ C A A + + +L++ ++SQV+ + + W K V+E AL Sbjct: 229 HCQFCKVSATCRARAEERQKLACLDFKEPPLLTDEEVSQVLREVDELVNWAKQVQEYALK 288 Query: 62 VLSS-GEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRTLLSPTETEQLVK 118 + P +L +GR GSR Y ++ + L +E + +TL T E+++ Sbjct: 289 TAMKENKKWPGMKLVQGR-GSRVYTDEKAIISTLKEAGMEEHQLFKQTLKPITNMEKMLG 347 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSV 162 +K + + IT+ GK +V + K + +F Sbjct: 348 KKTFQDL----VGHLITKTPGKLKLVETEDSRPAAKPSAAEDFQN 388 >gi|9633598|ref|NP_051012.1| hypothetical protein APSE-1_51 [Acyrthosiphon pisum bacteriophage APSE-1] gi|9910950|sp|Q9T1P7|VP51_BPAPS RecName: Full=Putative protein p51 gi|6118046|gb|AAF03994.1|AF157835_51 P51 [Endosymbiont phage APSE-1] Length = 439 Score = 171 bits (433), Expect = 3e-41, Method: Composition-based stats. Identities = 45/184 (24%), Positives = 91/184 (49%), Gaps = 24/184 (13%) Query: 4 NACRFCRAKPR-CGALAVKA------------------LSTFSEHMQILSNRQLSQVMNV 44 C FC+AK C A A LS + + +L+ +++++ Sbjct: 250 KQCLFCKAKGGLCFAQAQFVHNEVKGDFVDLTQPLAPQLSDAPKRITLLTPEKMAKLYPH 309 Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM--RELGDEA 102 + LIE++ K ++ L +G+ +P ++L G++G+RT+ ++ + E LL + ++ Sbjct: 310 VDLIESFCKALRNRVAEALHTGQSVPGFKLVTGKQGNRTWGDEREAETLLKGAKLKQEQI 369 Query: 103 YNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNH-LKANISEFS 161 Y++ ++SP + E+L+K+ K W +L+ I R DGK VI P P + +++F Sbjct: 370 YHKKIISPPQAEKLLKKDK--PHRWAKLEALIERADGKPVIAPESDPRPAIITTPLNDFD 427 Query: 162 VLKD 165 + + Sbjct: 428 DVTE 431 >gi|258646419|ref|ZP_05733888.1| phage protein [Dialister invisus DSM 15470] gi|260403822|gb|EEW97369.1| phage protein [Dialister invisus DSM 15470] Length = 389 Score = 171 bits (433), Expect = 4e-41, Method: Composition-based stats. Identities = 37/168 (22%), Positives = 76/168 (45%), Gaps = 12/168 (7%) Query: 4 NACRFCRAKPRCGALAVKALSTFS-----EHMQILSNRQLSQVMNVLPLIETWMKGVKEE 58 CRFCRA+ +C A + + + +++ L + + ++ W + ++ Sbjct: 222 EHCRFCRARAQCKARSEYYAALAETAKENANPALITMADLGEYLKKAGALKKWAEDLQAY 281 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRTLLSPTETEQL 116 AL+ SG+ +P ++ EGR GSR + + ++ ++L DE Y+R + +TE++ Sbjct: 282 ALSSCLSGKTVPGWKAVEGR-GSRVFTSTDEAFKVLTDNGIDESLLYSRVPATLAQTEKI 340 Query: 117 VKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLK 164 V +K L K++ + GK + P + +S + K Sbjct: 341 VGKKVFETL----LSKYVIKNPGKPTLAPESDKREAISNVVSAKDIFK 384 >gi|259502606|ref|ZP_05745508.1| phage protein [Lactobacillus antri DSM 16041] gi|259169421|gb|EEW53916.1| phage protein [Lactobacillus antri DSM 16041] Length = 307 Score = 170 bits (432), Expect = 4e-41, Method: Composition-based stats. Identities = 33/164 (20%), Positives = 68/164 (41%), Gaps = 11/164 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEH---MQILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+F A +L++ ++++V+ + + W +K+ A + Sbjct: 147 WCQFSTCNAVLRARFDYHHKLTRFQLRSPNLLTDSEVTEVLEHIDDLNRWAHEIKDYAAD 206 Query: 62 -VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 +++G+ P Y++ EGR R Y ++ V ++ + Y + LL T+ E+ V +K Sbjct: 207 LAINNGKQWPGYKIVEGRS-VRHYKDEAAVAKIAEEHGYHDIYQKKLLPITKLEKQVGKK 265 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHL--KANISEFSV 162 K +E ++ I + GK +VP + EF Sbjct: 266 KFTELFSQE----IVKPAGKPTLVPNSDQRQSISKSNPQDEFKE 305 >gi|217965859|ref|YP_002351537.1| conserved phage-associated protein [Listeria monocytogenes HCC23] gi|217335129|gb|ACK40923.1| conserved phage-associated protein [Listeria monocytogenes HCC23] gi|307569597|emb|CAR82776.1| phage protein, putative [Listeria monocytogenes L99] Length = 393 Score = 170 bits (431), Expect = 7e-41, Method: Composition-based stats. Identities = 44/156 (28%), Positives = 79/156 (50%), Gaps = 11/156 (7%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+FCRA+ C A A L+ S ++L ++++V+ + +++W++ VK AL Sbjct: 228 WCKFCRARSICKARAEANLALTSYDFKDPRLLQPDEIAKVLGQVAELKSWVEDVKSFALK 287 Query: 62 VLSS-GEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQLVK 118 S G + P ++L EGR +R Y + V+ ++ E +E ++ L+S T+ E+L+ Sbjct: 288 EAESRGIEFPGWKLVEGRS-NRRYADAEMVQAMMELEGYSEEELLSKKLISLTDMEKLIG 346 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK 154 +K+VS L I + GK +V LK Sbjct: 347 KKQVSAI----LGDLIEKPAGKPALVVETDRRQALK 378 >gi|307244285|ref|ZP_07526400.1| conserved hypothetical protein [Peptostreptococcus stomatis DSM 17678] gi|306492435|gb|EFM64473.1| conserved hypothetical protein [Peptostreptococcus stomatis DSM 17678] Length = 391 Score = 170 bits (430), Expect = 7e-41, Method: Composition-based stats. Identities = 36/165 (21%), Positives = 69/165 (41%), Gaps = 11/165 (6%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEE 58 C+FCRA RC A A L E +L++ ++ ++++++P + W + Sbjct: 228 SGEWCKFCRASVRCRARAEDKLKLAKEEFKLPPLLTDEEIEEILSIIPDLTKWANEIMNY 287 Query: 59 ALNVL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLV 117 A + + +++ EGR R Y ++N V Q + + +L++ TE ++L+ Sbjct: 288 ATEAAVNHAKKWTGFKIVEGRS-IRKYKDENAVIQKAKEHGYTDIFKSSLITLTEMQKLM 346 Query: 118 KRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEF 160 + K E L I + GK +V + EF Sbjct: 347 GKAKFEEV----LGDLIIKPSGKPTLVQESDKRKAMNISNINDEF 387 >gi|254975138|ref|ZP_05271610.1| hypothetical protein CdifQC_07480 [Clostridium difficile QCD-66c26] gi|255314265|ref|ZP_05355848.1| hypothetical protein CdifQCD-7_07940 [Clostridium difficile QCD-76w55] gi|255516945|ref|ZP_05384621.1| hypothetical protein CdifQCD-_07514 [Clostridium difficile QCD-97b34] gi|255650047|ref|ZP_05396949.1| hypothetical protein CdifQCD_07659 [Clostridium difficile QCD-37x79] gi|260686785|ref|YP_003217918.1| hypothetical protein CDR20291_1421 [Clostridium difficile R20291] gi|306519582|ref|ZP_07405929.1| hypothetical protein CdifQ_05387 [Clostridium difficile QCD-32g58] gi|260212801|emb|CBE03965.1| putative uncharacterized protein [Clostridium difficile R20291] Length = 388 Score = 169 bits (429), Expect = 1e-40, Method: Composition-based stats. Identities = 41/172 (23%), Positives = 73/172 (42%), Gaps = 16/172 (9%) Query: 1 MDENACRFCRAKPRCGALAVKALSTF-SEHMQI---LSNRQLSQVMNVLPLIETWMKGVK 56 + + C FCRAK C A L L+ +++ ++ I+ W+K V+ Sbjct: 221 VSGSHCGFCRAKNDCRKRAEDNLKLARKYDFADTFALNKYEIADILGFAKNIQDWLKDVQ 280 Query: 57 EEALNVLS-SGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYN-RTLLSPTE 112 AL G P Y+L EGR +R Y ++ +V ++L+ DE Y RTL ++ Sbjct: 281 SYALEQAEKHGVKYPGYKLVEGRS-NRKYVDEQEVAKVLLNSDYDEEKIYKPRTLKGISD 339 Query: 113 TEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK---ANISEFS 161 E+ + +K ++ L I + GK +V + + + +F Sbjct: 340 MEKAIGKKSFAKL----LSDLIIKPVGKATLVVESDKRSEINSIDSAKKDFE 387 >gi|255306555|ref|ZP_05350726.1| hypothetical protein CdifA_08182 [Clostridium difficile ATCC 43255] Length = 388 Score = 169 bits (429), Expect = 1e-40, Method: Composition-based stats. Identities = 41/172 (23%), Positives = 73/172 (42%), Gaps = 16/172 (9%) Query: 1 MDENACRFCRAKPRCGALAVKALSTF-SEHMQI---LSNRQLSQVMNVLPLIETWMKGVK 56 + + C FCRAK C A L L+ +++ ++ I+ W+K V+ Sbjct: 221 VSGSHCGFCRAKNDCRKRAEDNLKLARKYDFADTFALNKYEIADILGFAKNIQDWLKDVQ 280 Query: 57 EEALNVLS-SGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYN-RTLLSPTE 112 AL G P Y+L EGR +R Y ++ +V ++L+ DE Y RTL ++ Sbjct: 281 SYALEQAEKHGVKYPGYKLVEGRS-NRKYIDEQEVAKVLLNSDYDEEKIYKPRTLKGISD 339 Query: 113 TEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK---ANISEFS 161 E+ + +K ++ L I + GK +V + + + +F Sbjct: 340 MEKAIGKKSFAKL----LSDLIIKPVGKATLVVESDKRSEINSIDSAKKDFE 387 >gi|238899050|ref|YP_002924732.1| hypothetical protein HDEF_2004 [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] gi|229466810|gb|ACQ68584.1| conserved hypothetical phage protein [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] Length = 271 Score = 168 bits (427), Expect = 2e-40, Method: Composition-based stats. Identities = 50/172 (29%), Positives = 85/172 (49%), Gaps = 21/172 (12%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHMQ------------ILSNRQLSQVMNVLPLIE 49 D AC+FCRAK RC ALA ++L ++ Q +L+NR ++ ++ L ++ Sbjct: 101 DPKACKFCRAKARCRALAARSLQVAAQEFQNVVTPIALKDITLLNNRDIAALLPQLNMMA 160 Query: 50 TWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMR--ELGDEAYNRTL 107 W+K V+ AL L G D+P Y+L GR R + ++ Q EQ L + E + + L Sbjct: 161 DWIKSVEATALQELEQGRDIPGYKLVTGRS-IRKWRDEAQAEQSLRKTHLKVAEIFTQKL 219 Query: 108 LSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISE 159 +SP + E+L+ +K L + GK ++ P L++++ E Sbjct: 220 VSPAQAEKLLGKKH------PLLDELAIHPQGKPILAPESDKRPALRSSVEE 265 >gi|257088827|ref|ZP_05583188.1| conserved hypothetical protein [Enterococcus faecalis CH188] gi|281416449|ref|YP_003347369.1| hypothetical protein [Enterococcus phage phiFL4A] gi|256997639|gb|EEU84159.1| conserved hypothetical protein [Enterococcus faecalis CH188] gi|270209625|gb|ACZ64164.1| conserved hypothetical protein [Enterococcus phage phiFL4A] gi|315160603|gb|EFU04620.1| conserved hypothetical protein [Enterococcus faecalis TX0645] gi|315579449|gb|EFU91640.1| conserved hypothetical protein [Enterococcus faecalis TX0630] Length = 390 Score = 168 bits (426), Expect = 2e-40, Method: Composition-based stats. Identities = 33/172 (19%), Positives = 80/172 (46%), Gaps = 13/172 (7%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSE----HMQILSNRQLSQVMNVLPLIETWMKGVK 56 + ++ +F + + + A + + +L+N ++++++ P I+ W++ V+ Sbjct: 221 ITDDVVKFSKVRAQLRPRAERNFQLIDKHELKEAPLLTNEEIAEILERAPEIKKWLEHVE 280 Query: 57 EEALNVLSS-GEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE---AYNRTLLSPTE 112 AL GE+ P +++ GR +R +++ + +L E ++ + L + + Sbjct: 281 TYALQKARDEGEEFPGWKVVAGRS-NRKISDNEGLLMVLEAEGFEDEDILKPQELKAIGQ 339 Query: 113 TEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLK 164 E++V +KK +E FI + +GK V+VP L + + F+ + Sbjct: 340 LEKVVGKKKFAELA----ADFIIKPEGKPVLVPESDKRPALNSVENAFNDFE 387 >gi|255092528|ref|ZP_05322006.1| hypothetical protein CdifC_07687 [Clostridium difficile CIP 107932] gi|260683189|ref|YP_003214474.1| hypothetical protein CD196_1446 [Clostridium difficile CD196] gi|260209352|emb|CBA62779.1| putative uncharacterized protein [Clostridium difficile CD196] Length = 388 Score = 168 bits (426), Expect = 2e-40, Method: Composition-based stats. Identities = 41/172 (23%), Positives = 73/172 (42%), Gaps = 16/172 (9%) Query: 1 MDENACRFCRAKPRCGALAVKALSTF-SEHMQI---LSNRQLSQVMNVLPLIETWMKGVK 56 + + C FCRAK C A L L+ +++ ++ I+ W+K V+ Sbjct: 221 VSGSHCGFCRAKNDCRKRAGDNLKLARKYDFADTFALNKYEIADILGFAKNIQDWLKDVQ 280 Query: 57 EEALNVLS-SGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYN-RTLLSPTE 112 AL G P Y+L EGR +R Y ++ +V ++L+ DE Y RTL ++ Sbjct: 281 SYALEQAEKHGVKYPGYKLVEGRS-NRKYVDEQEVAKVLLNSDYDEEKIYKPRTLKGISD 339 Query: 113 TEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK---ANISEFS 161 E+ + +K ++ L I + GK +V + + + +F Sbjct: 340 MEKAIGKKSFAKL----LSDLIIKPVGKATLVVESDKRSEINSIDSAKKDFE 387 >gi|297585233|ref|YP_003701013.1| hypothetical protein Bsel_2962 [Bacillus selenitireducens MLS10] gi|297143690|gb|ADI00448.1| conserved hypothetical protein [Bacillus selenitireducens MLS10] Length = 432 Score = 168 bits (425), Expect = 3e-40, Method: Composition-based stats. Identities = 37/159 (23%), Positives = 69/159 (43%), Gaps = 11/159 (6%) Query: 5 ACRFCRAKPRCGALAVKALSTFS---EHMQILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+FC+ C A A + + +L++ ++SQV+ + + W K V+E AL Sbjct: 229 HCQFCKVSATCRARAEERQKLACLDFKEPPLLTDEEVSQVLREVDELVNWAKQVQEYALK 288 Query: 62 VLSS-GEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRTLLSPTETEQLVK 118 + P +L +GR GSR Y ++ + L +E + +TL T E+++ Sbjct: 289 TAMKENKQWPGMKLVQGR-GSRVYTDEKAIISTLKEAGMEEHQLFKQTLKPITNMEKMLG 347 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANI 157 +K + + IT+ GK +V + K + Sbjct: 348 KKTFQDL----VGHLITKTPGKLKLVETEDGRPAAKTSA 382 >gi|227872578|ref|ZP_03990914.1| phage protein [Oribacterium sinus F0268] gi|227841579|gb|EEJ51873.1| phage protein [Oribacterium sinus F0268] Length = 384 Score = 168 bits (425), Expect = 3e-40, Method: Composition-based stats. Identities = 38/170 (22%), Positives = 80/170 (47%), Gaps = 11/170 (6%) Query: 2 DENACRFCRAKPRCGALAVKALSTF---SEHMQILSNRQLSQVMNVLPLIETWMKGVKEE 58 +E+ CRFC+ K C A A K L+ + ++LSN +L ++ TW+ V+E Sbjct: 219 EEDTCRFCKVKAVCRARAEKNLALMFLEEQDPRLLSNEELGDILTKCSGFPTWLSDVEEC 278 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRE---LGDEAYNRTLLSPTETEQ 115 A + L GE++ +++ EGR R + ++ + + ++ +E + L+ T+ E+ Sbjct: 279 AKDKLLLGEEIKGWKIVEGRST-RVWTDEKEAFKYIVDSEEAKEEELFETVPLTLTKVEK 337 Query: 116 LVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLKD 165 L+ +K+ +K++T+ GK + + + F+ + Sbjct: 338 LLGKKRFKPIA----EKYVTKSKGKPTLTLESDERPAYNSVETMFNEEGE 383 >gi|254780124|ref|YP_003064537.1| hypothetical protein CLIBASIA_00015 [Candidatus Liberibacter asiaticus str. psy62] gi|254039801|gb|ACT56597.1| hypothetical protein CLIBASIA_00015 [Candidatus Liberibacter asiaticus str. psy62] gi|317120695|gb|ADV02518.1| hypothetical protein SC1_gp195 [Liberibacter phage SC1] gi|317120839|gb|ADV02660.1| hypothetical protein SC1_gp195 [Liberibacter phage SC1] Length = 388 Score = 168 bits (425), Expect = 3e-40, Method: Composition-based stats. Identities = 67/165 (40%), Positives = 109/165 (66%), Gaps = 2/165 (1%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 +++++CRFCRAK RC AL+ L ++ + +LS+ + + LI++++K ++E Sbjct: 223 VNDDSCRFCRAKVRCPALSRHVLLEATKDPSTNTTVELSKAYSSISLIKSYVKACEDEMF 282 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 L++G+++ Y+L EGRKG+R++ + N+ ++LL LG+EA+ R L +P E EQL K + Sbjct: 283 KRLNAGDEIQGYQLVEGRKGNRSFKDINRAQELLTSVLGEEAFKRILKTPKELEQLYKEQ 342 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLP--VNHLKANISEFSVL 163 KVS+ WE+LQ+ ITR DGK VI P D+P K+ +SEF VL Sbjct: 343 KVSDEFWEELQELITRGDGKPVIAPRDIPTNKQTQKSQLSEFEVL 387 >gi|317120738|gb|ADV02560.1| hypothetical protein SC2_gp195 [Liberibacter phage SC2] gi|317120799|gb|ADV02620.1| hypothetical protein SC2_gp195 [Liberibacter phage SC2] Length = 388 Score = 168 bits (425), Expect = 3e-40, Method: Composition-based stats. Identities = 67/165 (40%), Positives = 109/165 (66%), Gaps = 2/165 (1%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 +++++CRFCRAK RC AL+ L ++ + +LS+ + + LI++++K ++E Sbjct: 223 VNDDSCRFCRAKVRCPALSRHVLLEATKDPSTNTTVELSKAYSSISLIKSYVKACEDEMF 282 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 L++G+++ Y+L EGRKG+R++ + N+ ++LL LG+EA+ R L +P E EQL K + Sbjct: 283 KRLNAGDEIQGYQLVEGRKGNRSFKDINRAQELLTSVLGEEAFKRILKTPKELEQLYKEQ 342 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLP--VNHLKANISEFSVL 163 KVS+ WE+LQ+ ITR DGK VI P D+P K+ +SEF VL Sbjct: 343 KVSDEFWEELQELITRGDGKPVIAPRDIPTNKQTQKSQLSEFEVL 387 >gi|313895680|ref|ZP_07829236.1| conserved hypothetical protein [Selenomonas sp. oral taxon 137 str. F0430] gi|312975806|gb|EFR41265.1| conserved hypothetical protein [Selenomonas sp. oral taxon 137 str. F0430] Length = 376 Score = 165 bits (419), Expect = 2e-39, Method: Composition-based stats. Identities = 38/168 (22%), Positives = 76/168 (45%), Gaps = 11/168 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FCRA RC A A + L E ++++ ++ V+ +P + W + A Sbjct: 212 KWCTFCRAAVRCRARAEEKLRLAKEEFKYPPLITDEEIEDVLGEIPELIKWANAILAYAT 271 Query: 61 NVL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKR 119 + + G++ +++ EGR R Y +++ V + + ++R L+S T+ E+L+ + Sbjct: 272 DAALNHGKEWTGFKIVEGRSVRR-YKDEDAVAREAESAGYTDIFDRKLISLTQMEKLMGK 330 Query: 120 KKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKA--NISEFSVLKD 165 K ++ L I + GK +VP + + SEF + + Sbjct: 331 KAFTDI----LGGLIEKPPGKPTLVPISDKRSAIHTGNVQSEFKTIME 374 >gi|322382463|ref|ZP_08056358.1| hypothetical protein PL1_2402 [Paenibacillus larvae subsp. larvae B-3650] gi|321153576|gb|EFX45965.1| hypothetical protein PL1_2402 [Paenibacillus larvae subsp. larvae B-3650] Length = 395 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. Identities = 38/167 (22%), Positives = 73/167 (43%), Gaps = 11/167 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + CR+C+ K C A A + + + Q +LS+ ++ + V ++ W K V++ A Sbjct: 231 DHCRWCKVKGNCRARAEENMKAVQQEFQDPALLSDEEIGSTLFVAQQLKAWAKDVEDYAK 290 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGD---EAYNRTLLSPTETEQLV 117 SGE++P ++L EG+ +R + + L D R LL E+ + Sbjct: 291 EKALSGENIPQWKLVEGKS-NRFITDKGKAISKLEAAKIDPDKYLKPRELLGIGALEKQL 349 Query: 118 KRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLK 164 +K+++ + I + GK +VP P + EF+ + Sbjct: 350 DKKQLNNL----IGDLIVKPQGKPTLVPETDPRPEFNSLEQEFANMD 392 >gi|315574073|gb|EFU86264.1| conserved hypothetical protein [Enterococcus faecalis TX0309B] gi|315582018|gb|EFU94209.1| conserved hypothetical protein [Enterococcus faecalis TX0309A] Length = 391 Score = 164 bits (416), Expect = 3e-39, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 79/172 (45%), Gaps = 13/172 (7%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSE----HMQILSNRQLSQVMNVLPLIETWMKGVK 56 + ++ +F + + + A + + +L+N ++++++ P I+ W++ V+ Sbjct: 222 ITDDVVKFSKVRAQLRPRAERNFQLIDKHELKEAPLLTNEEIAEILERAPEIKKWLEHVE 281 Query: 57 EEALNVLSS-GEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE---AYNRTLLSPTE 112 AL GE+ P +++ GR +R +++ + +L E ++ + L + + Sbjct: 282 TYALQKARDEGEEFPGWKVVAGRS-NRKISDNEGLLMVLEAEGFEDEDILKPQELKAIGQ 340 Query: 113 TEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLK 164 E++V +KK +E FI + +GK V+VP L + + + + Sbjct: 341 LEKVVGKKKFAELA----ADFIIKPEGKPVLVPESDKRPALNSVENALNDFE 388 >gi|29374958|ref|NP_814111.1| hypothetical protein EF0319 [Enterococcus faecalis V583] gi|29342416|gb|AAO80182.1| conserved hypothetical protein [Enterococcus faecalis V583] Length = 390 Score = 164 bits (416), Expect = 4e-39, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 79/172 (45%), Gaps = 13/172 (7%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSE----HMQILSNRQLSQVMNVLPLIETWMKGVK 56 + ++ +F + + + A + + +L+N ++++++ P I+ W++ V+ Sbjct: 221 ITDDVVKFSKVRAQLRPRAERNFQLIDKHELKEAPLLTNEEIAEILERAPEIKKWLEHVE 280 Query: 57 EEALNVLSS-GEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE---AYNRTLLSPTE 112 AL GE+ P +++ GR +R +++ + +L E ++ + L + + Sbjct: 281 TYALQKARDEGEEFPGWKVVAGRS-NRKISDNEGLLMVLEAEGFEDEDILKPQELKAIGQ 339 Query: 113 TEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLK 164 E++V +KK +E FI + +GK V+VP L + + + + Sbjct: 340 LEKVVGKKKFAELA----ADFIIKPEGKPVLVPESDKRPALNSVENALNDFE 387 >gi|266623809|ref|ZP_06116744.1| phage protein [Clostridium hathewayi DSM 13479] gi|288864381|gb|EFC96679.1| phage protein [Clostridium hathewayi DSM 13479] Length = 399 Score = 164 bits (416), Expect = 4e-39, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 74/172 (43%), Gaps = 13/172 (7%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEH-----MQILSNRQLSQVMNVLPLIETWMKGVK 56 C++CRA+ +C A A + + ++SN + + + W++ +K Sbjct: 228 SPETCKYCRARGQCRARADENVKLAFSEDLGKLPPLISNAEAGDYLRKGVDVAKWLEALK 287 Query: 57 EEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE---AYNRTLLSPTET 113 + AL +G+++P ++ GR G R + + ++ + L++ E + R LS + Sbjct: 288 DYALKECLAGKEVPGWKAVSGRGG-RDWTDMDKAFETLVKSGVAEEAVLWERKPLSLAQV 346 Query: 114 EQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLKD 165 E V +K ++ + +++ K GK +V + ++ K+ Sbjct: 347 ETTVGKKDFAD----AVGEYVVWKPGKPALVEASDKRPAITNKLTAAEAFKE 394 >gi|227520149|ref|ZP_03950198.1| hypothetical phage associated protein SpyM3_1445 [Enterococcus faecalis TX0104] gi|227072394|gb|EEI10357.1| hypothetical phage associated protein SpyM3_1445 [Enterococcus faecalis TX0104] Length = 390 Score = 164 bits (416), Expect = 4e-39, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 79/172 (45%), Gaps = 13/172 (7%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSE----HMQILSNRQLSQVMNVLPLIETWMKGVK 56 + ++ +F + + + A + + +L+N ++++++ P I+ W++ V+ Sbjct: 221 ITDDVVKFSKVRAQLRPRAERNFQLIDKHELKEAPLLTNEEIAEILERAPEIKKWLEHVE 280 Query: 57 EEALNVLSS-GEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE---AYNRTLLSPTE 112 AL GE+ P +++ GR +R +++ + +L E ++ + L + + Sbjct: 281 TYALQKARDEGEEFPGWKVVAGRS-NRKISDNEGLLMVLEAEGFEDEDILKPQELKAIGQ 339 Query: 113 TEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLK 164 E++V +KK +E FI + +GK V+VP L + + + + Sbjct: 340 LEKVVGKKKFAELA----ADFIIKPEGKPVLVPESDKRPALNSVENALNDFE 387 >gi|48697229|ref|YP_024959.1| hypothetical protein BcepC6B_gp39 [Burkholderia phage BcepC6B] gi|47779035|gb|AAT38398.1| gp39 [Burkholderia phage BcepC6B] Length = 455 Score = 164 bits (415), Expect = 4e-39, Method: Composition-based stats. Identities = 50/181 (27%), Positives = 86/181 (47%), Gaps = 23/181 (12%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQ-----------------ILSNRQLSQVMNVL 45 E C+FC+AK C ALA T + +L N +L + L Sbjct: 267 EKQCKFCKAKAVCPALAAHVEQTIGADFEALADDIYTQAKVGMNVDLLDNERLGVIYASL 326 Query: 46 PLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLL--MRELGDEAY 103 LI++W K V+ + L +P +L GR+G+R +++ + E LL MR D+ Y Sbjct: 327 DLIDSWAKAVRGRIEHELLQARVVPGVKLVAGRRGARQWSDAEKAEALLKSMRLKQDQMY 386 Query: 104 NRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFS 161 N L+SPT+ ++L++ + S W++++ I ++DG+ + P P L+ +F Sbjct: 387 NFKLISPTQADKLLRNE--SPRRWKKVEAEIVQRDGRPSVAPDSDPRPALEIQPPEDDFE 444 Query: 162 V 162 V Sbjct: 445 V 445 >gi|255994006|ref|ZP_05427141.1| hypothetical protein GCWU000322_00063 [Eubacterium saphenum ATCC 49989] gi|255993674|gb|EEU03763.1| hypothetical protein GCWU000322_00063 [Eubacterium saphenum ATCC 49989] Length = 351 Score = 159 bits (403), Expect = 9e-38, Method: Composition-based stats. Identities = 40/162 (24%), Positives = 76/162 (46%), Gaps = 9/162 (5%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSE--HMQILSNRQLSQVMNVLPLIETWMKGVKEE 58 + C+FC AK C A +T + + L+ ++++++ LI W+ K+ Sbjct: 189 VSGEHCKFCSAKAVCTTHAKDIATTEPKMIAPKELTTDEVAELIPRAELIADWLAAAKQY 248 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRTLLSPTETEQL 116 L +G ++ Y+ EGR R +++ ++ +LL + D+ Y LS + E++ Sbjct: 249 LLGECLAGREVKGYKAVEGRA-IRVWSDQDKALELLEQAGYDKALIYESAPLSLAKLEKM 307 Query: 117 VKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANIS 158 V +K+ E + ++ITRK GK + P D LK+ Sbjct: 308 VGKKEFGEL----VGEYITRKTGKPTLAPEDDKRPALKSKAE 345 >gi|225405825|ref|ZP_03761014.1| hypothetical protein CLOSTASPAR_05046 [Clostridium asparagiforme DSM 15981] gi|225042639|gb|EEG52885.1| hypothetical protein CLOSTASPAR_05046 [Clostridium asparagiforme DSM 15981] Length = 394 Score = 159 bits (402), Expect = 1e-37, Method: Composition-based stats. Identities = 36/170 (21%), Positives = 71/170 (41%), Gaps = 12/170 (7%) Query: 3 ENACRFCRAKPRCGALAVKALSTF-----SEHMQILSNRQLSQVMNVLPLIETWMKGVKE 57 E C+FCRAK C A A + + + +LSN ++ + + W+ +++ Sbjct: 229 EKTCQFCRAKGMCRARADENVKLAFSPDIGKKPPLLSNEEMGDYLIRGADVAKWLADLQK 288 Query: 58 EALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETEQ 115 AL +G D+P ++ EGR GSR + + ++ L + R ++P E+ Sbjct: 289 CALAECLAGNDVPGWKAVEGR-GSRDWTDLDEAFAALEAAGISTAILWERKPVTPAGLEK 347 Query: 116 LVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLKD 165 + +K+ + + +K GK +V + +S K+ Sbjct: 348 ALGKKEYT----AHAANYEVKKPGKPALVKESDKREAITNKVSAAEAFKE 393 >gi|218133413|ref|ZP_03462217.1| hypothetical protein BACPEC_01278 [Bacteroides pectinophilus ATCC 43243] gi|217990788|gb|EEC56794.1| hypothetical protein BACPEC_01278 [Bacteroides pectinophilus ATCC 43243] Length = 336 Score = 156 bits (394), Expect = 1e-36, Method: Composition-based stats. Identities = 30/123 (24%), Positives = 66/123 (53%), Gaps = 5/123 (4%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + CRFC+A+ C A A + L ++S+ ++++V++ ++ W + V A Sbjct: 215 DHCRFCKARFTCRARAEEYLKLAQMEFAEPALMSDEEIAEVLSKADALKKWAEEVYTYAQ 274 Query: 61 NVLS-SGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKR 119 N + ++ P Y+L GR +R Y ++++V + + + Y ++L+ TE E+L+ + Sbjct: 275 NEAVVNHKEWPGYKLVLGRS-NRKYTDEDEVAEAAQKAGYTDIYKKSLIGITEMERLMGK 333 Query: 120 KKV 122 K++ Sbjct: 334 KEI 336 >gi|71911266|ref|YP_282816.1| phage protein [Streptococcus pyogenes MGAS5005] gi|94989085|ref|YP_597186.1| phage protein [Streptococcus pyogenes MGAS9429] gi|94992976|ref|YP_601075.1| phage protein [Streptococcus pyogenes MGAS2096] gi|71854048|gb|AAZ52071.1| phage protein [Streptococcus pyogenes MGAS5005] gi|94542593|gb|ABF32642.1| phage protein [Streptococcus pyogenes MGAS9429] gi|94546484|gb|ABF36531.1| phage protein [Streptococcus pyogenes MGAS2096] Length = 384 Score = 155 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 35/169 (20%), Positives = 70/169 (41%), Gaps = 13/169 (7%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSE----HMQILSNRQLSQVMNVLPLIETWMKGVK 56 + E ++ + A + + LS+ ++++++ I+ W++ V+ Sbjct: 221 LSEKVLQWSPVAAKLVPRAQENWELIDKYDYQEPVYLSDEAVAEILDKASAIKKWVESVE 280 Query: 57 EEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRT-LLSPTETEQ 115 AL SG+++P Y++ EGR +R N +L + ++ Y LLS E+ Sbjct: 281 AYALKEALSGKEVPGYKIVEGRS-NRVLTNKEAAATILEKNGFEDIYKPKELLSMGALEK 339 Query: 116 LVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKA---NISEFS 161 +V +K+ + L + GK V+V L + I EF Sbjct: 340 MVGKKRFEDMMAFILD----KPQGKPVLVKNSDKRPALNSLEQAIKEFE 384 >gi|157311148|ref|YP_001469193.1| hypothetical protein P9_gp13 [Streptococcus phage P9] gi|119104297|gb|ABL61042.1| hypothetical protein [Streptococcus phage P9] Length = 385 Score = 155 bits (391), Expect = 2e-36, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 73/170 (42%), Gaps = 14/170 (8%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSE----HMQILSNRQLSQVMNVLPLIETWMKGVK 56 + E ++ + A + + LS+ ++++++ I+ W++ V+ Sbjct: 221 LSEKVLQWSPVAAKLVPRAQENWELIDKYDYQEPVYLSDEAVAEILDKASAIKKWVESVE 280 Query: 57 EEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE-AYNRT-LLSPTETE 114 AL SG+++P Y++ EGR +R + ++ +L D+ + LL+ E Sbjct: 281 AYALKEALSGKEVPGYKIVEGRS-NRVITDKDKAIDILQDNGFDDEIFKPKELLAMGALE 339 Query: 115 QLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKA---NISEFS 161 +L+ + ++ L + I + GK V+VP + + I +F Sbjct: 340 KLIGKTTFADL----LAEVIDKPQGKPVLVPEKDKRPAINSLEQAIKDFE 385 >gi|21910981|ref|NP_665249.1| hypothetical protein SpyM3_1445 [Streptococcus pyogenes MGAS315] gi|28876475|ref|NP_795673.1| hypothetical protein SpyM3_1445 [Streptococcus pyogenes phage 315.6] gi|28895332|ref|NP_801682.1| hypothetical protein SPs0420 [Streptococcus pyogenes SSI-1] gi|50913378|ref|YP_059350.1| unknown phage protein [Streptococcus pyogenes MGAS10394] gi|21905189|gb|AAM80052.1| conserved hypothetical protein - phage-associated [Streptococcus pyogenes MGAS315] gi|28810578|dbj|BAC63515.1| hypothetical protein (phage associated) [Streptococcus pyogenes SSI-1] gi|50902452|gb|AAT86167.1| unknown phage protein [Streptococcus pyogenes MGAS10394] Length = 385 Score = 155 bits (391), Expect = 3e-36, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 73/170 (42%), Gaps = 14/170 (8%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSE----HMQILSNRQLSQVMNVLPLIETWMKGVK 56 + E ++ + A + + LS+ ++++++ I+ W++ V+ Sbjct: 221 LSEKVLQWSPVAAKLVPRAQENWELIDKYDYQEPVYLSDEAVAEILDKASAIKKWVESVE 280 Query: 57 EEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE-AYNRT-LLSPTETE 114 AL SG+++P Y++ EGR +R + ++ +L D+ + LL+ E Sbjct: 281 AYALKEALSGKEVPGYKIVEGRS-NRVITDKDKAIDILQDNGFDDEIFKPKELLAMGALE 339 Query: 115 QLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKA---NISEFS 161 +L+ + ++ L + I + GK V+VP + + I +F Sbjct: 340 KLIGKTTFADL----LAEVIDKPQGKPVLVPEKDKRPAINSLEQAIKDFE 385 >gi|306826826|ref|ZP_07460127.1| phage protein [Streptococcus pyogenes ATCC 10782] gi|304430989|gb|EFM33997.1| phage protein [Streptococcus pyogenes ATCC 10782] Length = 385 Score = 154 bits (390), Expect = 3e-36, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 73/170 (42%), Gaps = 14/170 (8%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSE----HMQILSNRQLSQVMNVLPLIETWMKGVK 56 + E ++ + A + + LS+ ++++++ I+ W++ V+ Sbjct: 221 LSEKVLQWSPVAAKLVPRAQENWELIDKYDYQEPVYLSDEAVAEILDKASAIKKWVESVE 280 Query: 57 EEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE-AYNRT-LLSPTETE 114 AL SG+++P Y++ EGR +R + ++ +L D+ + LL+ E Sbjct: 281 AYALKEALSGKEVPGYKIVEGRS-NRVITDKDKAIDILQDNGFDDEIFKPKELLAMGALE 339 Query: 115 QLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKA---NISEFS 161 +L+ + ++ L + I + GK V+VP + + I +F Sbjct: 340 KLIGKTTFADL----LAEVIDKPQGKPVLVPEKDKRPAINSLEQAIKDFE 385 >gi|317500271|ref|ZP_07958498.1| phage-associated protein [Lachnospiraceae bacterium 8_1_57FAA] gi|316898314|gb|EFV20358.1| phage-associated protein [Lachnospiraceae bacterium 8_1_57FAA] Length = 145 Score = 154 bits (389), Expect = 4e-36, Method: Composition-based stats. Identities = 29/149 (19%), Positives = 63/149 (42%), Gaps = 11/149 (7%) Query: 23 LSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEALNVL-SSGEDLPNYELKEGR 78 + +L++ ++ +V++VLP + W + A G++ +++ EGR Sbjct: 1 MKLARLEFKMPPLLTDAEIEEVLDVLPDLTKWANEITAYATEAAIHHGKEWNGFKVVEGR 60 Query: 79 KGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKD 138 +R Y ++ V + + Y +TL+ TE ++L+ + E L I + Sbjct: 61 S-NRKYRDELLVAEAAREHGYTDIYRQTLIPMTEMQKLMGKSAFEEI----LGDLIYKPP 115 Query: 139 GKQVIVPCDLPVNHLK--ANISEFSVLKD 165 GK ++VP + +EF + + Sbjct: 116 GKPILVPNTDKRPAMNVTNAENEFDKIME 144 >gi|325687768|gb|EGD29789.1| phage protein [Streptococcus sanguinis SK72] Length = 380 Score = 153 bits (387), Expect = 8e-36, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 66/160 (41%), Gaps = 9/160 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEH---MQILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+F A +L++ ++ +++ + + W VK+ A Sbjct: 223 WCQFSPCNAVLRARMDYHKELEQFQLASPHLLTDGEIEEILLHVDDLVKWATEVKDYATK 282 Query: 62 -VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 ++S + ++L EGR R + N+++V +L E Y ++L+S TE E+ + +K Sbjct: 283 VAINSHKSWNGFKLVEGRS-IRQFTNEDEVAKLAEAEGFTNIYKQSLVSLTELEKRMGKK 341 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEF 160 + + L + + GK +VP +EF Sbjct: 342 EFNRV----LGHLVRKPQGKLTLVPESDKRKEYIPAAAEF 377 >gi|327461122|gb|EGF07455.1| phage protein [Streptococcus sanguinis SK1057] Length = 380 Score = 153 bits (386), Expect = 9e-36, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 66/160 (41%), Gaps = 9/160 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEH---MQILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+F A +L++ ++ +++ + + W VK+ A Sbjct: 223 WCQFSPCNAVLRARMDYHKELEQFQLASPHLLTDGEIEEILLHVDDLVKWATEVKDYATK 282 Query: 62 -VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 ++S + ++L EGR R + N+++V +L E Y ++L+S TE E+ + +K Sbjct: 283 VAINSHKSWNGFKLVEGRS-IRQFTNEDEVAKLAEAEGFTNIYKQSLVSLTELEKRMGKK 341 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEF 160 + + L + + GK +VP +EF Sbjct: 342 EFNRV----LGHLVRKPQGKLTLVPESDKRKEYIPAAAEF 377 >gi|209559296|ref|YP_002285768.1| Hypothetical phage associated protein SpyM3_1445 [Streptococcus phage NZ131.2] gi|209540497|gb|ACI61073.1| Hypothetical phage associated protein SpyM3_1445 [Streptococcus phage NZ131.2] Length = 385 Score = 152 bits (384), Expect = 1e-35, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 72/170 (42%), Gaps = 14/170 (8%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSE----HMQILSNRQLSQVMNVLPLIETWMKGVK 56 + E ++ + A + + LS+ ++++++ I+ W++ V+ Sbjct: 221 LSEKVLQWSPVAAKLVPRAQENRELIDKYDYQEPVYLSDEAVAEILDKASAIKKWVESVE 280 Query: 57 EEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE-AYNRT-LLSPTETE 114 AL SG+++P Y++ EGR +R + ++ +L D+ + LL+ E Sbjct: 281 AYALKEALSGKEVPGYKIVEGRS-NRVITDKDKAIDILQDNGFDDEIFKPKELLAMGTLE 339 Query: 115 QLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKA---NISEFS 161 +L+ + ++ L + I + GK V+V L + I +F Sbjct: 340 KLIGKTTFADL----LAEVIDKPQGKPVLVKNSDKRPALNSLEQAIKDFE 385 >gi|325690423|gb|EGD32426.1| phage protein [Streptococcus sanguinis SK115] Length = 380 Score = 152 bits (384), Expect = 2e-35, Method: Composition-based stats. Identities = 31/160 (19%), Positives = 63/160 (39%), Gaps = 9/160 (5%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEH---MQILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 C+F A +L++ ++ +++ + + W +K+ A Sbjct: 223 WCQFSTCNAVLRARMDYHKQLEKFQLSSPHLLTDGEIEEILAHVDDLVKWATEIKDYATK 282 Query: 62 -VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 + S + ++L EGR R + N+ V Q E + Y ++L+S TE E+ + +K Sbjct: 283 VAVESHKSWAGFKLVEGRS-IRQFTNEEAVIQAAEAEGFTDLYKQSLVSLTELEKRMGKK 341 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEF 160 + + L + + GK +VP +EF Sbjct: 342 EFNRV----LGHLVHKPQGKLTLVPESDKRKEYIPAAAEF 377 >gi|139473890|ref|YP_001128606.1| hypothetical protein SpyM51057 [Streptococcus pyogenes str. Manfredo] gi|225871343|ref|YP_002747290.1| phage protein [Streptococcus equi subsp. equi 4047] gi|134272137|emb|CAM30382.1| hypothetical phage protein [Streptococcus pyogenes str. Manfredo] gi|225700747|emb|CAW95391.1| hypothetical phage protein [Streptococcus equi subsp. equi 4047] Length = 385 Score = 151 bits (383), Expect = 2e-35, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 72/170 (42%), Gaps = 14/170 (8%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSE----HMQILSNRQLSQVMNVLPLIETWMKGVK 56 + E ++ + A + + LS+ ++++++ I+ W++ V+ Sbjct: 221 LSEKVLQWSPVAAKLVPRAQENWELIDKYDYQEPVYLSDEAVAEILDKASAIKKWVESVE 280 Query: 57 EEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE-AYNRT-LLSPTETE 114 AL SG+++P Y++ EGR +R + ++ +L D+ + LL+ E Sbjct: 281 AYALKEALSGKEVPGYKIVEGRS-NRVITDKDKAIDILQDNGFDDEIFKPKELLAMGTLE 339 Query: 115 QLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKA---NISEFS 161 +L+ + ++ L + I + GK V+V L + I +F Sbjct: 340 KLIGKTTFADL----LAEVIDKPQGKPVLVKNSDKRPALNSLEQAIKDFE 385 >gi|83311035|ref|YP_421299.1| hypothetical protein amb1936 [Magnetospirillum magneticum AMB-1] gi|82945876|dbj|BAE50740.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 397 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 39/178 (21%), Positives = 78/178 (43%), Gaps = 21/178 (11%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSEHMQI-----------LSNRQLSQVMNVLPLIE 49 + + CRFC+AKP C + AL+T ++ ++ QV+N +IE Sbjct: 227 VAGDHCRFCKAKPVCPEMERMALTTAEAQFTDTTITVPPAPSSMTPARIGQVLNAAEIIE 286 Query: 50 TWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGD--EAYNRTL 107 W+K V+E A L G ++P ++L R R + +D+ L+ + Y R + Sbjct: 287 GWLKSVREHAHASLEQGVEIPGWKLVPKRA-QRKWADDDLASDALLTAGLPILDIYARKI 345 Query: 108 LSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVLK 164 +SP E+L+ ++K + ++ + + + P + ++F+ L+ Sbjct: 346 ISPAAAEKLLGKQK------DAIKHLVVAESSGTTLAPAGDKRQAVAGGAAADFTSLE 397 >gi|315121963|ref|YP_004062452.1| hypothetical protein CKC_01065 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495365|gb|ADR51964.1| hypothetical protein CKC_01065 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 388 Score = 148 bits (375), Expect = 2e-34, Method: Composition-based stats. Identities = 73/166 (43%), Positives = 101/166 (60%), Gaps = 2/166 (1%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 +DE+ACRFCRAK RC AL+ L + LS+ + LPLIE ++K +KEE Sbjct: 223 VDEDACRFCRAKTRCPALSRHVLLETIRDPKSGCEVDLSKAYSSLPLIEQYIKALKEEVF 282 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 LS GE++ Y+L EGRKG+RT+ + Q + L LGD+A+ + LLSP E EQ K + Sbjct: 283 KRLSEGEEVKGYQLVEGRKGNRTFKDIEQATEYLTGVLGDKAFKKILLSPKEVEQFRKDQ 342 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDL-PVNHLKANI-SEFSVLK 164 +S WE+LQ+ ITR DGK VI P D+ PV ++ S+F+ L Sbjct: 343 TLSSDVWEELQELITRGDGKPVIAPRDIPPVPRIQKAEISDFASLD 388 >gi|315122931|ref|YP_004063420.1| hypothetical protein CKC_05935 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496333|gb|ADR52932.1| hypothetical protein CKC_05935 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 388 Score = 148 bits (374), Expect = 2e-34, Method: Composition-based stats. Identities = 73/166 (43%), Positives = 101/166 (60%), Gaps = 2/166 (1%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 +DE+ACRFCRAK RC AL+ L + LS+ + LPLIE ++K +KEE Sbjct: 223 VDEDACRFCRAKTRCPALSRHVLLETIRDPKSGYEVDLSKAYSSLPLIEQYIKSLKEEVF 282 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 LS GE++ Y+L EGRKG+RT+ + Q + L LGD+A+ + LLSP E EQ K + Sbjct: 283 KRLSEGEEVKGYQLVEGRKGNRTFKDIEQATEYLTGVLGDKAFKKILLSPKEVEQFRKDQ 342 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDL-PVNHLKANI-SEFSVLK 164 +S WE+LQ+ ITR DGK VI P D+ PV ++ S+F+ L Sbjct: 343 TLSSDVWEELQELITRGDGKPVIAPRDIPPVPRIQKAEISDFASLD 388 >gi|318064536|gb|ADV36497.1| gp43 [Edwardsiella phage eiDWF] Length = 419 Score = 146 bits (370), Expect = 6e-34, Method: Composition-based stats. Identities = 36/165 (21%), Positives = 74/165 (44%), Gaps = 12/165 (7%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHMQI-----LSNRQLSQVMNVLPLIETWMKGVK 56 +AC++CR + +C A ++ ++ ++ +L++ LP + +K ++ Sbjct: 256 SADACQWCRFREKCNAARKFNEQIAADDLRDESGDEMTPEELAEAYAKLPALRQHIKNIE 315 Query: 57 EEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETE 114 L +G LP +L G+ G+RT++++ V+ L + D Y + LL+PT+ E Sbjct: 316 SATYKALLAGTKLPGLKLVAGKDGNRTWSDEALVQLRLEQGGVTPDAMYTQKLLTPTQAE 375 Query: 115 QLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISE 159 + + E +++ ITRK G+ I D + Sbjct: 376 KALPAGAF-----EWVEELITRKPGEPSIASADDKRPEYVPVKDD 415 >gi|62327347|ref|YP_224060.1| hypothetical protein BPKS7gp38 [Salmonella phage SS3e] gi|57472381|gb|AAW51243.1| hypothetical protein [Salmonella phage SS3e] Length = 427 Score = 146 bits (369), Expect = 9e-34, Method: Composition-based stats. Identities = 40/163 (24%), Positives = 72/163 (44%), Gaps = 11/163 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQI-----LSNRQLSQVMNVLPLIETWMKGVKEE 58 + C++CR +C A S+ +E ++ L+ QL LPL+ + V++ Sbjct: 262 DGCQWCRFSEQCAARTKTVNSSLAEELEDDFALELTPEQLVAEYEKLPLLRQHIDKVEKA 321 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLM--RELGDEAYNRTLLSPTETEQL 116 L SG+ +P Y+L EGR G+R + + + + + L + + L++PTE E+ Sbjct: 322 MATTLHSGKKVPGYKLVEGRPGNRAWKDTDALLETLSHFELGAEMLHKEVLMTPTEAEKQ 381 Query: 117 VKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISE 159 K W L+K +TRK G + + K + Sbjct: 382 HK----GSELWAALEKHVTRKPGAPCVTTVEDKRPEWKNVTED 420 >gi|318064640|gb|ADV36549.1| gp43 [Edwardsiella phage eiMSLS] Length = 419 Score = 146 bits (369), Expect = 9e-34, Method: Composition-based stats. Identities = 36/165 (21%), Positives = 74/165 (44%), Gaps = 12/165 (7%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHMQI-----LSNRQLSQVMNVLPLIETWMKGVK 56 +AC++CR + +C A ++ ++ ++ +L++ LP + +K ++ Sbjct: 256 SADACQWCRFREKCNAARKFNEQIAADDLRDESGDEMTPEELAEAYAKLPALRQHIKNIE 315 Query: 57 EEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETE 114 L +G LP +L G+ G+RT++++ V+ L + D Y + LL+PT+ E Sbjct: 316 SATYKALLAGTKLPGLKLVAGKDGNRTWSDEALVQLRLEQGGVTPDAMYTQKLLTPTQAE 375 Query: 115 QLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISE 159 + + E +++ ITRK G+ I D + Sbjct: 376 KALPAGAF-----EWVEELITRKPGEPSIASADDKRPEYVPVNDD 415 >gi|227496455|ref|ZP_03926739.1| phage-associated protein [Actinomyces urogenitalis DSM 15434] gi|226834037|gb|EEH66420.1| phage-associated protein [Actinomyces urogenitalis DSM 15434] Length = 342 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. Identities = 41/163 (25%), Positives = 71/163 (43%), Gaps = 11/163 (6%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 + CR+CRAK C A A + L+ M L+ +L+ V+ + W+K V+E A Sbjct: 183 DWCRWCRAKAICRARAEENLALARREMAPAPALTPEELADVITRGRRLAAWVKDVEEHAT 242 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRK 120 L++GE +P +L GR G R + + + L + E+ + RK Sbjct: 243 AALTAGEGVPGLKLVAGR-GRRAFTDAEAAADAAEKAGYVS-RETRPLPLSAIEKAMGRK 300 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFS 161 + +E L +T+ +G ++V P + SEF+ Sbjct: 301 RFAEV----LGDLVTKTEGVPLLVAASDPRPAWEPVTPQSEFT 339 >gi|83311819|ref|YP_422083.1| hypothetical protein amb2720 [Magnetospirillum magneticum AMB-1] gi|82946660|dbj|BAE51524.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 396 Score = 145 bits (367), Expect = 1e-33, Method: Composition-based stats. Identities = 38/178 (21%), Positives = 76/178 (42%), Gaps = 21/178 (11%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSEHMQI-----------LSNRQLSQVMNVLPLIE 49 + CRFCRAKP C + AL+T ++ ++ QV+N +IE Sbjct: 226 VAGEHCRFCRAKPVCPEMERMALATAEAQFTDTTITVPPAPSSMAPARIGQVLNAAEIIE 285 Query: 50 TWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGD--EAYNRTL 107 W+K V+E A L G ++P ++L R R + +D+ L+ + Y R + Sbjct: 286 GWLKSVREHAHASLEQGVEIPGWKLVPKRA-QRKWADDDLASDALLTAGLPILDIYTRKI 344 Query: 108 LSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK-ANISEFSVLK 164 ++P ++L+ + K + ++ + + + P + ++F+ L+ Sbjct: 345 ITPAAADKLLGKNK------DVVKHLVVAESSGTTLAPAGDKRQAVAGGAAADFTSLE 396 >gi|282534224|gb|ADA82332.1| hypothetical protein [Escherichia phage K1H] Length = 416 Score = 144 bits (363), Expect = 5e-33, Method: Composition-based stats. Identities = 41/161 (25%), Positives = 72/161 (44%), Gaps = 14/161 (8%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQI-----LSNRQLSQVMNVLPLIETWMKGVKEE 58 + C++CR +C A S +E ++ L+ QL LPL+ + V++ Sbjct: 262 DGCQWCRFSEQCAARTKTVNSVLAEELEDDFVLELTTEQLVVEYEKLPLLRQHIDKVEKA 321 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVK 118 L SG+ +P Y+L EGR+G+R + + + +E GD L++PTE +++ Sbjct: 322 MSAALHSGKKVPGYKLVEGRQGNRAWKDADAIEV----SHGDILKKEVLMTPTEAAKVLS 377 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISE 159 +++ L+ FITRK G + D E Sbjct: 378 ADEMA-----ALEPFITRKPGAPCVATADDKRPEWNQVSEE 413 >gi|282535274|gb|ADA82480.1| hypothetical protein [Escherichia phage K1ind3] gi|282547375|gb|ADA82431.1| hypothetical protein [Escherichia phage K1ind2] Length = 416 Score = 142 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 42/161 (26%), Positives = 71/161 (44%), Gaps = 14/161 (8%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQI-----LSNRQLSQVMNVLPLIETWMKGVKEE 58 + C++CR +C A S +E ++ L+ QL LPL+ + V++ Sbjct: 262 DGCQWCRFSEQCAARTKTVNSVLAEELEDDFVLELTTEQLVVEYEKLPLLRQHIDKVEKA 321 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVK 118 L SG+ +P Y+L EGR G+R + + + +E GD L+SPTE +++ Sbjct: 322 MSATLHSGKKVPGYKLVEGRPGNRAWKDADAIEV----SHGDILKKEVLMSPTEAAKVLS 377 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISE 159 +++ L+ FITRK G + D E Sbjct: 378 ADEMA-----ALEPFITRKPGAPCVATADDKRPEWNQVSEE 413 >gi|288799803|ref|ZP_06405262.1| putative protein p51 [Prevotella sp. oral taxon 299 str. F0039] gi|288333051|gb|EFC71530.1| putative protein p51 [Prevotella sp. oral taxon 299 str. F0039] Length = 376 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 65/162 (40%), Gaps = 11/162 (6%) Query: 5 ACRFCRAKPRCGA--LAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNV 62 C FC+AK C +K L + ++L+ + ++++ P + W+ V ++ Sbjct: 217 WCGFCKAKTSCAEQGRELKGLEELKQK-ELLTKEEQAEIVLKAPQVIAWLNAVVKDVTER 275 Query: 63 LSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRTLLSPTETEQLVKRK 120 GE +L EGR R + + + L + E Y L++ T ++L+ +K Sbjct: 276 AKQGEAFEGLKLVEGRS-VRKIVDPENLVKALTAKGYTEAQLYESKLITLTAMQKLLGKK 334 Query: 121 KVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSV 162 +E + + + G +V D P +F+ Sbjct: 335 AFTE----DVVPYTIKPVGALQLVSLDDPRPE-ATAKDDFNN 371 >gi|304406769|ref|ZP_07388424.1| conserved hypothetical protein [Paenibacillus curdlanolyticus YK9] gi|304344302|gb|EFM10141.1| conserved hypothetical protein [Paenibacillus curdlanolyticus YK9] Length = 395 Score = 141 bits (357), Expect = 2e-32, Method: Composition-based stats. Identities = 35/160 (21%), Positives = 66/160 (41%), Gaps = 7/160 (4%) Query: 4 NACRFCRAKPRCGALAVKALST---FSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FCRA+ C A LS S ++ +++ ++ + W +KE AL Sbjct: 230 EHCGFCRARSTCRARVEGLLSIEDKASLKPPMIGWDEVADILRRGDGLVKWYSSLKEAAL 289 Query: 61 NVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDE--AYNRTLLSPTETEQLVK 118 + G D+P ++ EGR GSR + + + + R D Y R ++ E+L Sbjct: 290 AEVLKGGDVPGWKAVEGR-GSRQFADIDAAFIHMERYGIDGALLYERKPITAAAAEKLFD 348 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANIS 158 KK+ + E + ++ G + P + + ++ Sbjct: 349 SKKMYKQVLED-TGHVVKQSGAPTLAPAEDLRPSINNQVT 387 >gi|282533173|gb|ADA82282.1| hypothetical protein [Escherichia phage K1G] Length = 416 Score = 141 bits (357), Expect = 2e-32, Method: Composition-based stats. Identities = 42/161 (26%), Positives = 70/161 (43%), Gaps = 14/161 (8%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQI-----LSNRQLSQVMNVLPLIETWMKGVKEE 58 + C++CR +C A S +E ++ L+ QL LPL+ + V++ Sbjct: 262 DGCQWCRFSEQCAARTKTVNSVLAEELEDDFVLELTTEQLVVEYEKLPLLRQHIDKVEKA 321 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVK 118 L SG +P Y+L EGR G+R + + + +E GD L+SPTE +++ Sbjct: 322 MSAALHSGNKVPGYKLVEGRLGNRAWKDADAIEV----SHGDILKKEVLMSPTEAAKVLS 377 Query: 119 RKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISE 159 +++ L+ FITRK G + D E Sbjct: 378 ADEMA-----ALEPFITRKPGAPCVATADDKRPEWNQVSEE 413 >gi|318064430|gb|ADV36446.1| hypothetical protein [Edwardsiella phage eiAU] Length = 182 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. Identities = 36/165 (21%), Positives = 74/165 (44%), Gaps = 12/165 (7%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHMQI-----LSNRQLSQVMNVLPLIETWMKGVK 56 +AC++CR + +C A ++ ++ ++ +L++ LP + +K ++ Sbjct: 19 SADACQWCRFREKCNAARKFNEQIAADDLRDESGDEMTPEELAEAYAKLPALRQHIKNIE 78 Query: 57 EEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG--DEAYNRTLLSPTETE 114 L +G LP +L G+ G+RT++++ V+ L + D Y + LL+PT+ E Sbjct: 79 SATYKALLAGTKLPGLKLVAGKDGNRTWSDEALVQLRLEQGGVTPDAMYTQKLLTPTQAE 138 Query: 115 QLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISE 159 + + E +++ ITRK G+ I D + Sbjct: 139 KALPAGAF-----EWVEELITRKPGEPSIASADDKRPEYVPVKDD 178 >gi|91214215|ref|NP_919005.2| conserved phage protein [Burkholderia phage BcepNazgul] gi|88604908|gb|AAQ63372.2| conserved phage protein [Burkholderia phage BcepNazgul] Length = 454 Score = 133 bits (335), Expect = 8e-30, Method: Composition-based stats. Identities = 37/198 (18%), Positives = 68/198 (34%), Gaps = 35/198 (17%) Query: 2 DENACRFCRAKPRCGAL--------------------AVK-----------ALSTFSEHM 30 + CRFC+ + C AL A + LST + Sbjct: 252 SDKGCRFCKVQKTCPALLAMAHRLSDDCFDDLTAEVTAEQMSSSKEVLMRGELSTKLLPV 311 Query: 31 QILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQV 90 LS Q+++++ + E W + E +GED+P +L R R + Sbjct: 312 SELSTEQMAKLLPYRKVFERWFSEMDSELERRAHNGEDIPGMKLVASRSNRRWNVQQAKA 371 Query: 91 EQLLMRELGDE--AYNRTLLSPTETEQLVKRKKVSETTWEQ-LQKFITRKDGKQVIVPCD 147 + L E ++ +SP + E+L+ + + E+ + + + GK + P Sbjct: 372 VETLTFLGVKENALFSTDFVSPAKAEELLVASGMRKKAAEKIIAPLVAKAPGKATLAPVA 431 Query: 148 LPVNHLKANISE-FSVLK 164 L + F L Sbjct: 432 DTREKLGQVADDCFDDLT 449 >gi|42779456|ref|NP_976703.1| hypothetical protein BCE_0375 [Bacillus cereus ATCC 10987] gi|42735372|gb|AAS39311.1| conserved hypothetical protein [Bacillus cereus ATCC 10987] Length = 362 Score = 132 bits (333), Expect = 1e-29, Method: Composition-based stats. Identities = 26/125 (20%), Positives = 58/125 (46%), Gaps = 5/125 (4%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHM---QILSNRQLSQVMNVLPLIETWMKGVKEEAL 60 C FCRA +C A A + L +L++ ++ +V++ L + W + A Sbjct: 215 EWCTFCRAAVKCRARAEEKLKLAQMEFKLPPLLTDSEIEEVLSKLSDLTKWANEIISYAT 274 Query: 61 NVL-SSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKR 119 + + G++ +++ EGR R Y ++ V + + Y ++L++ TE ++L+ Sbjct: 275 DAAVNHGKEWHGFKVVEGRS-IRKYKDEEAVAEAAKANGYKDIYRQSLITLTEMQKLMVS 333 Query: 120 KKVSE 124 + + Sbjct: 334 RNFKK 338 >gi|269975291|gb|ACZ55515.1| conserved phage protein [Staphylococcus phage SA1] Length = 445 Score = 131 bits (330), Expect = 3e-29, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 65/184 (35%), Gaps = 35/184 (19%) Query: 4 NACRFCRAKPRCGALA-------------------VKALSTFSE-----------HMQIL 33 CRFCRA C A+A + +S + L Sbjct: 247 KGCRFCRAAHNCAAIAYMMECAVGGDVEFLESEFGEEEMSVLRDALAQEYKFRRAQFGNL 306 Query: 34 SNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQL 93 + Q+++++ ++E W + E +GE +P +L E R +R + N+ L Sbjct: 307 TTEQMAKILPYRKVVENWFSRLDFELERRAMNGEKVPGQKLVESRT-NRKFANEKDAIAL 365 Query: 94 LMRELGDE--AYNRTLLSPTETEQLVKRKKVSETTWEQ--LQKFITRKDGKQVIVPCDLP 149 +E R L +P + E++++ + + + + +GK + P Sbjct: 366 FKFLDIEEDKYIERKLRTPAQMEEVLRDELGVSRAGAPNIIAGIVYKPEGKPTLAPLTDK 425 Query: 150 VNHL 153 L Sbjct: 426 RPPL 429 >gi|160898857|ref|YP_001564439.1| hypothetical protein Daci_3416 [Delftia acidovorans SPH-1] gi|160364441|gb|ABX36054.1| hypothetical protein Daci_3416 [Delftia acidovorans SPH-1] Length = 463 Score = 131 bits (329), Expect = 4e-29, Method: Composition-based stats. Identities = 45/187 (24%), Positives = 68/187 (36%), Gaps = 27/187 (14%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHM--------QILSNRQLSQVMNVLPLIETWMK 53 D + C FCRAK C A KAL T + + L ++P ++ W K Sbjct: 264 DYDNCFFCRAKGTCYAQTSKALGTVFDGFGEATSGVLRRPDPLMLGSQYALVPFVQQWAK 323 Query: 54 GVKEEALNVLSSGEDL---PN--YELKEGRKGSRTYNNDN--QVEQLLMRELGDEAYNRT 106 V+E L +GE + Y+L GR RT+ ++ R D Y Sbjct: 324 DVEEATYRALQNGEPVVRNDGVPYKLVPGRAAKRTWRDEEAAAAVLHAARIPRDRMYLFQ 383 Query: 107 LLSPTETEQLVKRKK-----------VSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKA 155 L+SP E + K+K+ + + W+ LQ I + + I P L Sbjct: 384 LISPAMAEAMSKQKRPPKGQPPQPAELPPSKWKDLQPLIAQGEPAPQIALATDPRPAL-C 442 Query: 156 NISEFSV 162 F Sbjct: 443 KADGFED 449 >gi|273810597|ref|YP_003344978.1| gp43 [Sodalis phage SO-1] gi|258619882|gb|ACV84135.1| gp43 [Sodalis phage SO-1] Length = 476 Score = 127 bits (320), Expect = 5e-28, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 78/204 (38%), Gaps = 50/204 (24%) Query: 4 NACRFCRAKPRCGALAVKALST----------FSEHMQILSNRQLS-------------- 39 +AC++CR +C A A A+ T + + I ++ + + Sbjct: 270 DACQWCRFADQCSARAKAAIDTMTPPTATDEDLGDDVSIATDEECANHDRNMAREARKAA 329 Query: 40 ---------------------QVMNVLPLIETWMKGVKEEALNVLSSGE-DLPNYELKEG 77 + L + +K ++ + +G+ + ++ G Sbjct: 330 RRAKRGKIEATPGAMSAAELRKAYEGLDAMRQHIKAIEAAVFKAVMAGDGESLGLKMVAG 389 Query: 78 RKGSRTYNNDNQVEQLLMRE--LGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFIT 135 ++G R + ++++V ++ + D Y TLLSPT+ E+++K +K W +L IT Sbjct: 390 KEGIRKWADESEVIEIFTKARIKRDVMYKETLLSPTDAEKVLKDEK--PKVWAKLCDKIT 447 Query: 136 RKDGKQVIVPCDLPVNHLKANISE 159 R K V+ P D P E Sbjct: 448 RAPAKPVLAPIDDPRPAWSEATDE 471 >gi|134288573|ref|YP_001110812.1| hypothetical protein SPSV3_gp12 [Salmonella phage SETP3] gi|125631938|gb|ABN47341.1| hypothetical protein [Salmonella phage SETP3] Length = 475 Score = 121 bits (305), Expect = 3e-26, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 75/213 (35%), Gaps = 59/213 (27%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQ-------------------------------- 31 + C++CR +C A + + ++ ++ Sbjct: 262 DGCQWCRFSEQCQARTRASNAVLAKELENDADVYRRAVERVAKFCNEADYMSVWEDVQPS 321 Query: 32 ----ILSNR--------------------QLSQVMNVLPLIETWMKGVKEEALNVLSSGE 67 +L + QL LP + + V++ L SG+ Sbjct: 322 DILKLLPDETDLPKKPVREDDFALELTTKQLVAEYEKLPFLRQHIDKVEKAMAAALHSGK 381 Query: 68 DLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTL-LSPTETEQLVKRKKVSETT 126 ++P Y+L EGR G+R + +++ V + + ++T+ + P E E+++ ++ Sbjct: 382 NVPGYKLVEGRLGNRAWKDEDAVNKFRENHVNGYLLDKTVTVLPAEAEKIIGKE--DPEL 439 Query: 127 WEQLQKFITRKDGKQVIVPCDLPVNHLKANISE 159 W +L K + RK G + D K + Sbjct: 440 WAELAKLVIRKPGAPCVASADDRRPDWKNITED 472 >gi|225220083|ref|YP_002720050.1| hypothetical protein EpSSL_gp11 [Enterobacteria phage SSL-2009a] gi|224986024|gb|ACN74588.1| hypothetical protein [Enterobacteria phage SSL-2009a] Length = 476 Score = 121 bits (303), Expect = 4e-26, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 75/204 (36%), Gaps = 50/204 (24%) Query: 4 NACRFCRAKPRCGALAVKALST----------FSEHMQILSNRQLS-------------- 39 +AC++CR +C A A+ T + + + + + Sbjct: 270 DACQWCRFADQCSVRAKAAIDTMTPPTATDEDLGDDVSEATEEECANHDRNMAREARKAA 329 Query: 40 ---------------------QVMNVLPLIETWMKGVKEEALNVLSSGE-DLPNYELKEG 77 + L + +K ++ + +G+ + ++ G Sbjct: 330 RRAKRGKKESTPGAMSAAELRKAYEGLDAMRQHIKAIESAVFKAVMAGDGESLGLKMVAG 389 Query: 78 RKGSRTYNNDNQVEQLLMRE--LGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFIT 135 ++G R + ++++V ++ + D Y TLLSPT+ E+++K +K W +L IT Sbjct: 390 KEGIRKWADESEVIEIFTKARIKRDVMYKETLLSPTDAEKVLKDEK--PKVWAKLCAKIT 447 Query: 136 RKDGKQVIVPCDLPVNHLKANISE 159 R K V+ P D P E Sbjct: 448 RAPAKPVLAPIDDPRPAWSEATDE 471 >gi|302206576|gb|ADL10918.1| Phage-associated protein [Corynebacterium pseudotuberculosis C231] gi|308276819|gb|ADO26718.1| phage-associated protein [Corynebacterium pseudotuberculosis I19] Length = 337 Score = 119 bits (300), Expect = 1e-25, Method: Composition-based stats. Identities = 28/135 (20%), Positives = 55/135 (40%), Gaps = 11/135 (8%) Query: 11 AKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEALNVL-SSG 66 A+ A L L + +++QV+ LP + W + AL++ + G Sbjct: 195 AEKVVKPRAEANLVLAKHEFAPPAELGDAEIAQVLYRLPSLSAWATDAEAHALSLAVNQG 254 Query: 67 EDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEA--YNRTLLSPTETEQLVKRKKVSE 124 + P ++L EGR R Y +++ V ++ + Y+ L + T E+ +K+ +E Sbjct: 255 KTWPGFKLVEGRS-IRKYADESAVAEMAGTGGITDIYDYDHKLKTITTLEKQAGKKRFTE 313 Query: 125 TTWEQLQKFITRKDG 139 L + G Sbjct: 314 L----LGDLVVEPAG 324 >gi|302331131|gb|ADL21325.1| phage-associated protein [Corynebacterium pseudotuberculosis 1002] Length = 337 Score = 119 bits (299), Expect = 1e-25, Method: Composition-based stats. Identities = 28/135 (20%), Positives = 55/135 (40%), Gaps = 11/135 (8%) Query: 11 AKPRCGALAVKALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEALNVL-SSG 66 A+ A L L + +++QV+ LP + W + AL++ + G Sbjct: 195 AEKVVKPRAEANLVLAKHEFAPPAELGDAEIAQVLYRLPSLSAWATDAEAHALSLAVNQG 254 Query: 67 EDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEA--YNRTLLSPTETEQLVKRKKVSE 124 + P ++L EGR R Y +++ V ++ + Y+ L + T E+ +K+ +E Sbjct: 255 KTWPGFKLVEGRS-IRKYADESAVAEMAGTGGITDIYDYDHKLKTITTLEKQAGKKRFTE 313 Query: 125 TTWEQLQKFITRKDG 139 L + G Sbjct: 314 L----LGDLVVEPAG 324 >gi|290466793|gb|ADD25723.1| hypothetical protein [Lactococcus phage 1358] Length = 406 Score = 113 bits (282), Expect = 1e-23, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 60/174 (34%), Gaps = 18/174 (10%) Query: 5 ACRFCRAKPRCGALAVKALST------FSEHMQILSNRQLSQVMNVLPLIETWMKGVKEE 58 CR+C+ + + + L R++ Q+ ++ W+ V + Sbjct: 238 TCRYCKHRVTDKKHRDEFIKVLGGWDKIGIRPSALDKREIIQIAENAAALKQWVDDVVKY 297 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELG---DEAYNRTLLSPTETEQ 115 A + G++L +L +G R + ++ +V + L R + R L TE E Sbjct: 298 ATAQVYDGQELEGLKLVKG-ASRRQFTDEKRVRRKLKRLGYNADEYLKPRPLKPLTEIEA 356 Query: 116 LVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLP----VNHLKANISEFSVLKD 165 LV + + + +VP N L+ +F+ + Sbjct: 357 LVGHTAFERDFARS----VIKNEYAPRLVPLSNKGLPAANKLEQAADDFAAFTE 406 >gi|300858869|ref|YP_003783852.1| hypothetical protein cpfrc_01452 [Corynebacterium pseudotuberculosis FRC41] gi|300686323|gb|ADK29245.1| hypothetical protein cpfrc_01452 [Corynebacterium pseudotuberculosis FRC41] Length = 135 Score = 110 bits (275), Expect = 7e-23, Method: Composition-based stats. Identities = 27/125 (21%), Positives = 53/125 (42%), Gaps = 11/125 (8%) Query: 21 KALSTFSEHMQ---ILSNRQLSQVMNVLPLIETWMKGVKEEALNVL-SSGEDLPNYELKE 76 K L L + +++QV+ LP + W + AL++ + G+ P ++L E Sbjct: 3 KNLVLAKHEFAPPAELGDAEIAQVLYRLPSLSAWATDAEAHALSLAVNQGKTWPGFKLVE 62 Query: 77 GRKGSRTYNNDNQVEQLLMRELGDEA--YNRTLLSPTETEQLVKRKKVSETTWEQLQKFI 134 GR R Y +++ V ++ + Y+ L + T E+ +K+ +E L + Sbjct: 63 GRS-IRKYADESAVAEMAGTGGITDIYDYDHKLKTITTLEKQAGKKRFTEL----LGDLV 117 Query: 135 TRKDG 139 G Sbjct: 118 VEPAG 122 >gi|331007089|ref|ZP_08330313.1| Phage protein [gamma proteobacterium IMCC1989] gi|330419106|gb|EGG93548.1| Phage protein [gamma proteobacterium IMCC1989] Length = 388 Score = 109 bits (273), Expect = 1e-22, Method: Composition-based stats. Identities = 39/182 (21%), Positives = 76/182 (41%), Gaps = 29/182 (15%) Query: 3 ENACRFCRAKPRCGALAVKALSTFSEHMQI------------LSNRQLSQVMNVLPLIET 50 E C FC+A C ALA AL ++ + LSN +++ ++ L + Sbjct: 214 EKQCHFCKANGTCKALAEHALQIAADGFEAVEVPLTVKDITKLSNEEIAVLLPQLKTLTK 273 Query: 51 WMKGVKEEALNVLSSGEDLPNYE--LKEGRKGSRTYNNDNQVEQLL----MRELGDEAYN 104 W+ ++ A GE PN++ LK G + SR++ ND+ ++ L + +E Sbjct: 274 WVSALETHAQAEAEKGEKFPNHKVILKNG-QTSRSWKNDDSAKRALNREFKKLGKEEP-T 331 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDG--KQVIVPCDLPVNHLK-ANISEFS 161 +SP + E+++ + T K + + G K +V + ++ +F Sbjct: 332 SNPISPAQAEKIIGK------TNPLFDKHVVKTTGDAKYKLVHESAKGDPVEINPTEDFE 385 Query: 162 VL 163 + Sbjct: 386 EV 387 >gi|227889485|ref|ZP_04007290.1| conserved hypothetical protein [Lactobacillus johnsonii ATCC 33200] gi|227849963|gb|EEJ60049.1| conserved hypothetical protein [Lactobacillus johnsonii ATCC 33200] Length = 115 Score = 106 bits (265), Expect = 9e-22, Method: Composition-based stats. Identities = 23/118 (19%), Positives = 47/118 (39%), Gaps = 8/118 (6%) Query: 48 IETWMKGVKEEALN-VLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRT 106 + W VK+ A + ++ + P+Y++ E R R Y + V ++ + Y + Sbjct: 1 MNRWTHEVKDYAADLAINHDKKWPSYKIVEERS-IRHYKYEAAVAKIAEDHGYHDIYQKK 59 Query: 107 LLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHL--KANISEFSV 162 LL T+ E+ + +K +E ++ I + +VP + EF Sbjct: 60 LLPITKLEKQLGKKIFTELFSQE----IVKTADNPTLVPNSDQRQSISKSNPKDEFKE 113 >gi|282547322|gb|ADA82379.1| hypothetical protein [Escherichia phage K1ind1] Length = 361 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 26/93 (27%), Positives = 46/93 (49%), Gaps = 5/93 (5%) Query: 4 NACRFCRAKPRCGALAVKALSTFSEHMQI-----LSNRQLSQVMNVLPLIETWMKGVKEE 58 + C++CR +C A S +E ++ L+ QL LPL+ + V++ Sbjct: 262 DGCQWCRFSEQCAARTKTVNSVLAEELEDDFVLELTTEQLVVEYEKLPLLRQHIDKVEKA 321 Query: 59 ALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVE 91 L SG+ +P Y+L EGR G+R + + + +E Sbjct: 322 MSAALHSGKKVPGYKLVEGRPGNRAWKDADAIE 354 >gi|331028096|ref|YP_004421810.1| hypothetical protein RDJLphi1_gp42 [Roseobacter phage RDJL Phi 1] gi|301341559|gb|ADK73443.1| hypothetical protein RDJLphi1_gp42 [Roseobacter phage RDJL Phi 1] Length = 480 Score = 96.9 bits (240), Expect = 9e-19, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 65/181 (35%), Gaps = 25/181 (13%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQI--------LSNRQLSQVMNVLPLIETWMKGVK 56 C +C C A K + L+ ++ IE+++K Sbjct: 305 HCNWCPHLNDCPAALNKVQDVAGMDFADEPEELEDPMGPNHLAMILPWKGFIESFLKKAA 364 Query: 57 EEALNVLSSGEDLPNYELKEGRKGSRTYN---NDNQVEQLLMRELGDE-------AYNRT 106 A L GE++P Y++ + G+RT+ ++ +V L+ G + Sbjct: 365 AIAQERLLKGEEVPGYKMVR-KGGNRTWKPDLDETEVTNRLIEHYGVKREKIMNPPAEAK 423 Query: 107 LLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLK--ANISEFSVLK 164 + + + E+ V K E ++F+ + +G +V D + +F ++ Sbjct: 424 MRTGPQIEKAVPAKMREEFA----KEFLWKPEGGLTMVTEDDGREAVTPDQAADDFGDVE 479 Query: 165 D 165 + Sbjct: 480 E 480 >gi|37626198|gb|AAQ96573.1| hypothetical protein [Vibrio phage VP16C] Length = 375 Score = 90.0 bits (222), Expect = 9e-17, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 58/158 (36%), Gaps = 18/158 (11%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSE-----HMQILSNRQLS---QVMNVLP-LIETWM 52 CR+C RC A S S + ++ L ++ +++ + Sbjct: 214 TGKHCRYCPLVGRCSAARQAGYSLISYVKEPFEVDTMTGADLEAERDILTEGSVILKARL 273 Query: 53 KGVKEEALNVLSSGEDLPNYEL--KEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSP 110 + ++E+ N L GE L GR + ++V ++ +G + T ++P Sbjct: 274 EDIEEQLQNRLRKGEKGIGLTLRQTTGRA---KWTQSDKVVITALKSIGLDVSKETTITP 330 Query: 111 TETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDL 148 T+ K K +E ++ R +GK +V Sbjct: 331 TQA----KDKAKTEAQKAAVKALQRRPNGKIELVKLKD 364 >gi|307317151|ref|ZP_07596592.1| APSE-2 prophage [Sinorhizobium meliloti AK83] gi|306897239|gb|EFN27984.1| APSE-2 prophage [Sinorhizobium meliloti AK83] Length = 408 Score = 85.7 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 65/188 (34%), Gaps = 32/188 (17%) Query: 2 DENACRFCRAK---PRCGALAVKALSTFSEHMQILSN------------------RQLSQ 40 + AC +C A C L + L + + Sbjct: 223 SKEACLWCSAAKVDGACPEFEAWNLEFCDIEFENLDDYDEYGIDIETPDVEGLTVARKRA 282 Query: 41 VMNVLPLIETWMKGVKEEALNVLSSGE-DLPNYELKEGRKGSRTYNNDNQVEQLLMRELG 99 + + L +I ++ V++ + SG+ DL +L GR+ R + ++ + E+ L ++ Sbjct: 283 IYDHLGVIRKFLTRVEQSVAEAVRSGQGDLYGKKLVAGRRSHRKHVDETESEKWLRKKGF 342 Query: 100 DE--AYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANI 157 ++ + + L +P +++ + K + + IV + L Sbjct: 343 EDEQIFTKKLKTPAMLDKVCGKGKFP-------TEMVKGGHPTPSIVSIEDARPAL-PVD 394 Query: 158 SEFSVLKD 165 EF L D Sbjct: 395 MEFDNLDD 402 >gi|48697543|ref|YP_024901.1| gp67 [Burkholderia phage BcepB1A] gi|47717513|gb|AAT37759.1| gp67 [Burkholderia phage BcepB1A] Length = 370 Score = 84.6 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 55/160 (34%), Gaps = 19/160 (11%) Query: 1 MDENACRFCRAKPRCGA--LAVKALSTFSEHM------QILSNRQLSQVMNVLPLIETWM 52 + C C A+ C A AV+ S I ++ +L ++ + ++E Sbjct: 213 ITGEWCTNCDARANCDAYDQAVENALDVSFEPINNDLSPIRADTELLRIERAMQILEARQ 272 Query: 53 KGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTE 112 + A + G+ L +YE+K GR + +E L + + ++PT+ Sbjct: 273 SALMARAEMFMRDGQQLRHYEMKPGRS-RPIWKP--GIEASLKQMGV--PFKERPITPTQ 327 Query: 113 TEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNH 152 + RK + + L R + D Sbjct: 328 ---VRDRKLLDPKVIDALSD---RPPAGLKVARVDEERAA 361 >gi|37626132|gb|AAQ96508.1| hypothetical protein [Vibrio phage VP16T] Length = 350 Score = 78.8 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 31/155 (20%), Positives = 56/155 (36%), Gaps = 18/155 (11%) Query: 5 ACRFCRAKPRCGALAVKALSTFSE-----HMQILSNRQLS---QVMNVLP-LIETWMKGV 55 C++C RC A S S + +S L ++ V L+E M+ + Sbjct: 192 HCKYCPIVGRCSAARKAGYSVISYVQEPFEVDTMSGADLEAERDLLAVGKRLLEARMEDI 251 Query: 56 KEEALNVLSSGEDLPNYEL--KEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTET 113 +E+ N L GE + GR + V ++ +G + ++PT+ Sbjct: 252 EEQIFNRLRKGETGMGLTIRTTAGRA---KWKEKESVVIAALKSVGLDVSKEAAITPTQA 308 Query: 114 EQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDL 148 K K ++ E ++ R GK +V Sbjct: 309 ----KDKAKTDAQREAVKGLQHRPGGKAELVKLKD 339 >gi|296277304|ref|ZP_06859811.1| hypothetical protein SauraMR_13202 [Staphylococcus aureus subsp. aureus MR1] Length = 282 Score = 78.4 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 14/52 (26%), Positives = 27/52 (51%) Query: 5 ACRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVK 56 CRFC+ K C A + + +LS+ ++++++ LP I+ W V+ Sbjct: 231 HCRFCKIKHSCRTRAEYMQNVPQKPPHLLSDEEIAELLYKLPDIKKWADEVE 282 >gi|84662668|ref|YP_453632.1| hypothetical protein OP2_ORF15 [Xanthomonas phage OP2] gi|84570717|dbj|BAE72779.1| conserved hypothetical protein [Xanthomonas oryzae phage OP2] Length = 438 Score = 76.1 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 19/151 (12%), Positives = 49/151 (32%), Gaps = 12/151 (7%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSEH-----MQILSNRQLSQVMNVLPLIETWMKGV 55 + C +C + C A A + LS ++ + + + + Sbjct: 276 VPGEHCTYCPSARWCPAAAEFGAKALQAESNPRTVADLSPDEVMALWAQRSAFSAFEEDL 335 Query: 56 KEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQ 115 KE + P RKGS + +D + +M + +S + E Sbjct: 336 KERV--RILHEHKHPAVS-VRYRKGSPKWLSDAAAVEAIMLADRMDLLKPPAIS--KVEG 390 Query: 116 LVKRKKVSET--TWEQLQKFITRKDGKQVIV 144 ++ +++E + + +++ + + Sbjct: 391 VIPPDQMAELTGRYPDIATYVSASGKQPSVA 421 >gi|47842849|ref|NP_958125.2| gp20 [Burkholderia phage Bcep43] gi|47719037|gb|AAR89311.2| gp20 [Burkholderia phage Bcep43] Length = 424 Score = 71.5 bits (174), Expect = 3e-11, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 49/154 (31%), Gaps = 24/154 (15%) Query: 1 MDENACRFCRAKPRCGALAVKALSTF------SEHMQILSNRQLSQVMNVLPLIETWMKG 54 + + CR+C+AKP C AL + + L + + +++ + Sbjct: 261 VAGDHCRYCKAKPACP-RMQDALQIAFDVNAGRRSVLDMPEDDLIALYSARSGVKSLWED 319 Query: 55 VKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLS----- 109 V++ + G D + GR R + N L+ + LS Sbjct: 320 VEQRIELLAQRGHDALTIKTSPGR---RMWRNAKAAALTLLALDRTDLLQPVALSEAIAH 376 Query: 110 PTET--EQLVKRKKVSETTWEQLQKFITRKDGKQ 141 E + L+ + + S Q + + Sbjct: 377 IPEALHDDLIGKSRDS-------QSIVVKTPAAP 403 >gi|149882920|ref|YP_001294857.1| BcepNY3gp19 [Burkholderia phage BcepNY3] gi|148763571|gb|ABR10554.1| BcepNY3gp19 [Burkholderia phage BcepNY3] Length = 417 Score = 70.7 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 48/154 (31%), Gaps = 24/154 (15%) Query: 1 MDENACRFCRAKPRCGALAVKALSTF------SEHMQILSNRQLSQVMNVLPLIETWMKG 54 + + CR+C+AKP C AL + + L + + +++ + Sbjct: 254 VAGDHCRYCKAKPACP-RMQDALQIAFDVNAGRRSILDMPEDDLIALYSARSGVKSLWED 312 Query: 55 VKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLS----- 109 V++ + G D + GR R + N L+ + LS Sbjct: 313 VEQRIELLAQRGHDALTIKTSPGR---RMWRNAKAAALTLLALDRTDLLQPVALSEAIAH 369 Query: 110 PTET--EQLVKRKKVSETTWEQLQKFITRKDGKQ 141 E + L+ + + Q + + Sbjct: 370 IPEALHDDLIGKSR-------DSQSIVVKTPAAP 396 >gi|38638628|ref|NP_944329.1| gp21 [Burkholderia phage Bcep1] gi|34486010|gb|AAQ73367.1| gp21 [Burkholderia phage Bcep1] Length = 417 Score = 70.7 bits (172), Expect = 7e-11, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 48/154 (31%), Gaps = 24/154 (15%) Query: 1 MDENACRFCRAKPRCGALAVKALSTF------SEHMQILSNRQLSQVMNVLPLIETWMKG 54 + + CR+C+AKP C AL + + L + + +++ + Sbjct: 254 VAGDHCRYCKAKPACP-RMQDALQIAFDVNAGRRSILDMPEDDLIALYSARSGVKSLWED 312 Query: 55 VKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLS----- 109 V++ + G D + GR R + N L+ + LS Sbjct: 313 VEQRIELLAQRGHDALTIKTSPGR---RMWRNAKAAALTLLALDRTDLLQPVALSEAIAH 369 Query: 110 PTET--EQLVKRKKVSETTWEQLQKFITRKDGKQ 141 E + L+ + + Q + + Sbjct: 370 IPEALHDDLIGKSR-------DSQSIVVKTPAAP 396 >gi|23752331|ref|NP_705646.1| gp20 [Burkholderia phage Bcep781] gi|23507198|gb|AAN38021.1| gp20 [Burkholderia phage Bcep781] Length = 417 Score = 67.6 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 47/154 (30%), Gaps = 24/154 (15%) Query: 1 MDENACRFCRAKPRCGALAVKALSTF------SEHMQILSNRQLSQVMNVLPLIETWMKG 54 + + CR+C+AKP C AL + + L + + +++ + Sbjct: 254 VAGDHCRYCKAKPACP-RMQDALQIAFDVNAGRRSVLDMPEDDLIALYSARSGVKSLWED 312 Query: 55 VKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLS----- 109 V++ + G D + G R + N L+ + LS Sbjct: 313 VEQRIELLAQRGHDALTIKTSPG---HRMWRNAKAAALTLLALDRTDLLQPVALSEAIAH 369 Query: 110 PTET--EQLVKRKKVSETTWEQLQKFITRKDGKQ 141 E + L+ + + Q + + Sbjct: 370 IPEALHDDLIGKSR-------DSQSIVVKTPAAP 396 >gi|238917460|ref|YP_002930977.1| hypothetical protein EUBELI_01538 [Eubacterium eligens ATCC 27750] gi|238872820|gb|ACR72530.1| Hypothetical protein EUBELI_01538 [Eubacterium eligens ATCC 27750] Length = 48 Score = 66.5 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 8/48 (16%), Positives = 19/48 (39%), Gaps = 4/48 (8%) Query: 113 TEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEF 160 ++ + + + E L +I + GK +VP + ++F Sbjct: 1 MQKCLGKSRFDEL----LTAYIEKPQGKPTLVPESDKRPAMNNAKTDF 44 >gi|159146278|gb|ABW90602.1| conserved hypothetical protein [Bacteriophage APSE-2] Length = 271 Score = 58.8 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 31/78 (39%), Gaps = 19/78 (24%) Query: 4 NACRFCRAKPR-CGALAVKA------------------LSTFSEHMQILSNRQLSQVMNV 44 C+FC+AK C A A LS + + +L+ Q++++ Sbjct: 194 KQCQFCKAKGGLCFAQAQFVHNEVKGDFVDLTQPLASQLSDAPKRITLLTPEQMAKLYPH 253 Query: 45 LPLIETWMKGVKEEALNV 62 + LIE++ K ++ Sbjct: 254 VDLIESFCKALRNRVAEA 271 >gi|159146274|gb|ABW90600.1| conserved hypothetical protein [Bacteriophage APSE-2] Length = 271 Score = 56.8 bits (136), Expect = 9e-07, Method: Composition-based stats. Identities = 14/78 (17%), Positives = 31/78 (39%), Gaps = 19/78 (24%) Query: 4 NACRFCRAKPR-CGALAVKA------------------LSTFSEHMQILSNRQLSQVMNV 44 C+FC+AK C A A ++ + + +L+ Q++++ Sbjct: 194 KQCQFCKAKGGLCFAQAQFVHNEVIGDFVDLTQPLAPQINDAPKRITLLTPEQMAKLYPH 253 Query: 45 LPLIETWMKGVKEEALNV 62 + LIE++ K ++ Sbjct: 254 VDLIESFCKSLRNRVTEA 271 >gi|159146272|gb|ABW90599.1| conserved hypothetical protein [Bacteriophage APSE-2] Length = 271 Score = 56.1 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 30/78 (38%), Gaps = 19/78 (24%) Query: 4 NACRFCRAKPR-CGALAVKA------------------LSTFSEHMQILSNRQLSQVMNV 44 C FC+AK C A A LS + + +L+ Q++++ Sbjct: 194 KQCLFCKAKGGLCFAQAQFVHNEVKGDFVDLTQPLASQLSDAPKRITLLTPEQMAKLYPH 253 Query: 45 LPLIETWMKGVKEEALNV 62 + LIE++ K ++ Sbjct: 254 VDLIESFCKALRNRVAEA 271 >gi|159146266|gb|ABW90596.1| conserved hypothetical protein [Bacteriophage APSE-2] gi|159146268|gb|ABW90597.1| conserved hypothetical protein [Bacteriophage APSE-2] gi|159146270|gb|ABW90598.1| conserved hypothetical protein [Bacteriophage APSE-2] Length = 271 Score = 55.7 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 30/78 (38%), Gaps = 19/78 (24%) Query: 4 NACRFCRAKPR-CGALAVKA------------------LSTFSEHMQILSNRQLSQVMNV 44 C FC+AK C A A LS + + +L+ Q++++ Sbjct: 194 KQCLFCKAKGGLCFAQAQFVHNEVKGDFVDLTQPLAPQLSDAPKRITLLTPEQMAKLYPH 253 Query: 45 LPLIETWMKGVKEEALNV 62 + LIE++ K ++ Sbjct: 254 VDLIESFCKALRNRVAEA 271 >gi|159146276|gb|ABW90601.1| conserved hypothetical protein [Bacteriophage APSE-2] Length = 271 Score = 55.3 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 30/78 (38%), Gaps = 19/78 (24%) Query: 4 NACRFCRAKPR-CGALAVKA------------------LSTFSEHMQILSNRQLSQVMNV 44 C FC+AK C A A LS + + +L+ Q++++ Sbjct: 194 KQCLFCKAKGGLCFAQAQFVHNEVKGDFVDLTQPLASQLSDAPKRITLLTPEQMAKLYPH 253 Query: 45 LPLIETWMKGVKEEALNV 62 + LIE++ K ++ Sbjct: 254 VDLIESFCKALRNRVAEA 271 >gi|295108164|emb|CBL22117.1| Protein of unknown function (DUF2800). [Ruminococcus obeum A2-162] Length = 270 Score = 52.6 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 12/38 (31%), Positives = 18/38 (47%), Gaps = 3/38 (7%) Query: 1 MDENACRFCRAKPRCGALAVKALSTFSEHMQ---ILSN 35 + + CRFC+A+ C A A L Q +LS+ Sbjct: 215 ISGSWCRFCKARNTCRARAESFLELAKMEFQPPALLSD 252 >gi|296277206|ref|ZP_06859713.1| hypothetical protein SauraMR_12698 [Staphylococcus aureus subsp. aureus MR1] Length = 33 Score = 43.7 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 7/33 (21%), Positives = 12/33 (36%), Gaps = 1/33 (3%) Query: 132 KFITRKDGKQVIVPCDLPVNHLK-ANISEFSVL 163 FI + GK + +K + +F L Sbjct: 1 GFIEKPQGKLTLATESDKRPAIKQSAEDDFDKL 33 >gi|225155628|ref|ZP_03724117.1| hypothetical protein ObacDRAFT_9239 [Opitutaceae bacterium TAV2] gi|224803601|gb|EEG21835.1| hypothetical protein ObacDRAFT_9239 [Opitutaceae bacterium TAV2] Length = 414 Score = 43.0 bits (100), Expect = 0.012, Method: Composition-based stats. Identities = 22/131 (16%), Positives = 47/131 (35%), Gaps = 23/131 (17%) Query: 4 NACRFCRAKP-RCGA----------LAVKALSTFSEHMQILSNRQLSQVMNV-LPLIETW 51 CR+C A +C A L + ++ + + L +P++E Sbjct: 241 KQCRYCPAAGKKCPAIDQEIKMKLQLTDEHIAAIAAEPTLEHMLDLEMARKKLVPILERA 300 Query: 52 MKGVKEEALNV---LSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNR--T 106 +K +++G + E R R + + ++L D+ Y+R Sbjct: 301 HDALKTRVRAEGESIAAGHRI----FIETRNSGREITDQAKAAEILSDL-SDDLYHRCHK 355 Query: 107 LLSPTETEQLV 117 + P E E+++ Sbjct: 356 HI-PGEIERVL 365 >gi|256375083|ref|YP_003098743.1| hypothetical protein Amir_0938 [Actinosynnema mirum DSM 43827] gi|255919386|gb|ACU34897.1| conserved hypothetical protein [Actinosynnema mirum DSM 43827] Length = 480 Score = 41.8 bits (97), Expect = 0.028, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 50/160 (31%), Gaps = 26/160 (16%) Query: 16 GALAVKALSTFSEHMQILSNRQ----LSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPN 71 A +A+ + Q + + L+++ +L L+E W+ V EA+ GE LP Sbjct: 275 PASFEEAMKSGMLEPQ--TTEEQKAALARLETLLALVEGWVDVVVAEAV-----GERLPG 327 Query: 72 YELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQ 131 E R + EQ +G E R L S +LV + E Sbjct: 328 AEALRETLRRRR-ASGGPAEQTFATLVGLELRPRRLRSAAALWKLVGDQHGLEAR----D 382 Query: 132 KFITRKDGKQVIVPCDLP----------VNHLKANISEFS 161 D D P L+ + E Sbjct: 383 SLWEHPDLVPTAADLDDPMEFAERFGRTRAALENPMEELE 422 >gi|320539214|ref|ZP_08038885.1| hypothetical protein SSYM_0910 [Serratia symbiotica str. Tucson] gi|320030852|gb|EFW12860.1| hypothetical protein SSYM_0910 [Serratia symbiotica str. Tucson] Length = 66 Score = 40.3 bits (93), Expect = 0.10, Method: Composition-based stats. Identities = 15/46 (32%), Positives = 26/46 (56%), Gaps = 2/46 (4%) Query: 90 VEQLLMRE--LGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKF 133 E+ L G+ Y + L+SP +TE+L+K +SE W++L+ Sbjct: 2 AEETLKALRRKGNPIYTQKLISPPQTEKLLKSGGISERRWKKLEHL 47 >gi|240143309|ref|ZP_04741910.1| carbamoyl-phosphate synthase, large subunit [Roseburia intestinalis L1-82] gi|257204679|gb|EEV02964.1| carbamoyl-phosphate synthase, large subunit [Roseburia intestinalis L1-82] Length = 1068 Score = 39.9 bits (92), Expect = 0.12, Method: Composition-based stats. Identities = 20/121 (16%), Positives = 43/121 (35%), Gaps = 14/121 (11%) Query: 16 GALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELK 75 +L S S LS+++L + ++V+ W+ L G ++ Sbjct: 398 RSLEQHVDSLMSYDFTGLSDKELDEQLHVVDDRRIWV------IAEALRRGVSYD--KIH 449 Query: 76 EGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFIT 135 E K R + + + + +L +E LL + KR + ++ +L Sbjct: 450 EITKIDRWFIDKLAILVEMENQLKNEPLTVELL------KEAKRIEFPDSVIAKLTGKTE 503 Query: 136 R 136 + Sbjct: 504 K 504 >gi|291536829|emb|CBL09941.1| carbamoyl-phosphate synthase, large subunit [Roseburia intestinalis M50/1] gi|291539132|emb|CBL12243.1| carbamoyl-phosphate synthase, large subunit [Roseburia intestinalis XB6B4] Length = 1068 Score = 39.9 bits (92), Expect = 0.12, Method: Composition-based stats. Identities = 20/121 (16%), Positives = 43/121 (35%), Gaps = 14/121 (11%) Query: 16 GALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELK 75 +L S S LS+++L + ++V+ W+ L G ++ Sbjct: 398 RSLEQHVDSLMSYDFTGLSDKELDEQLHVVDDRRIWV------IAEALRRGVSYD--KIH 449 Query: 76 EGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFIT 135 E K R + + + + +L +E LL + KR + ++ +L Sbjct: 450 EITKIDRWFIDKLAILVEMENQLKNEPLTVELL------KEAKRIEFPDSVIAKLTGKTE 503 Query: 136 R 136 + Sbjct: 504 K 504 >gi|154506076|ref|ZP_02042814.1| hypothetical protein RUMGNA_03618 [Ruminococcus gnavus ATCC 29149] gi|153793575|gb|EDN75995.1| hypothetical protein RUMGNA_03618 [Ruminococcus gnavus ATCC 29149] Length = 1067 Score = 39.9 bits (92), Expect = 0.12, Method: Composition-based stats. Identities = 18/121 (14%), Positives = 37/121 (30%), Gaps = 14/121 (11%) Query: 16 GALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELK 75 +L S S LS +L++ + ++ + W + G ++ Sbjct: 398 RSLEQHVDSLMSYDFTDLSVEELTEQLEIVDDMRMW------RIAEAIRRGVSYE--KIH 449 Query: 76 EGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFIT 135 E + + + + + L E LL + KR + + QL Sbjct: 450 EITQIDEWFIDKFAIIVEMENALKSEELTPELL------KEAKRMEFPDNVIAQLTGKTE 503 Query: 136 R 136 R Sbjct: 504 R 504 >gi|154271724|ref|XP_001536715.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1] gi|150409385|gb|EDN04835.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1] Length = 676 Score = 39.1 bits (90), Expect = 0.23, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 54/158 (34%), Gaps = 17/158 (10%) Query: 15 CGALAVKALSTFSEHMQILSNRQLSQ----VMNVLPLIETWMKGVK---EEALNVLSSGE 67 C A +A + F H L+ +LS+ ++ L ++ K A+ L + + Sbjct: 460 CKARLERANTAFQAHFGHLNKEELSKNHSQLLGKLENLQDRTMDAKHKYNRAVRELRNEK 519 Query: 68 DLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTW 127 L R + Y ++ V + + G L+ E L + + Sbjct: 520 QRQRNRLI--RANLKRYRDEQPVIDVERQLAG------KLVDTKVMETLEHKSFMPPEHL 571 Query: 128 EQLQKFITRKDGKQVIVPCDLPVNHLKANISEFSVLKD 165 + ++ G V +N + ++ F +++ Sbjct: 572 IMVDAILSMP-GATVEAEYQRRINAIN-AMAAFCRIEE 607 >gi|25145020|ref|NP_492775.2| LAMinin related. See also lmb- family member (lam-3) [Caenorhabditis elegans] gi|3328188|gb|AAC26793.1| laminin alpha chain [Caenorhabditis elegans] gi|21615495|emb|CAA15432.3| C. elegans protein T22A3.8, confirmed by transcript evidence [Caenorhabditis elegans] gi|21617790|emb|CAB03385.3| C. elegans protein T22A3.8, confirmed by transcript evidence [Caenorhabditis elegans] Length = 3102 Score = 38.7 bits (89), Expect = 0.29, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 53/128 (41%), Gaps = 9/128 (7%) Query: 18 LAVKALSTFSEHMQILS----NRQLSQVMN-VLPLIETWMKGVKEEALNVLSSGE-DLPN 71 A + E +++++ N + + + + ++E WM +E +V D Sbjct: 1854 RAQQEHKKAEELLKMVTAQKLNETIFEDLKNRIDVLEQWMNDYRETIYDVSKKDTADAER 1913 Query: 72 YELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVK---RKKVSETTWE 128 L G++ +R N++E+L + AY+R + +E+L+ K+ T Sbjct: 1914 MSLVVGKRINRYKEVSNEIEKLRVEAEDQIAYSRNSIEKARSEELMNMFEDKEKINMTLA 1973 Query: 129 QLQKFITR 136 +L + + Sbjct: 1974 ELPDLVEQ 1981 >gi|331694986|ref|YP_004331225.1| hypothetical protein Psed_1121 [Pseudonocardia dioxanivorans CB1190] gi|326949675|gb|AEA23372.1| Protein of unknown function DUF2342 [Pseudonocardia dioxanivorans CB1190] Length = 514 Score = 38.3 bits (88), Expect = 0.32, Method: Composition-based stats. Identities = 28/135 (20%), Positives = 46/135 (34%), Gaps = 14/135 (10%) Query: 34 SNRQ---LSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQV 90 + Q L+++ +L LIE W+ V EA+ GE LP E R + Sbjct: 323 TPEQKAALARLETLLALIEGWVDAVVAEAV-----GERLPGAEALRETLRRRR-ASGGPA 376 Query: 91 EQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPV 150 EQ +G E R L + E +L+ ++ + D D P Sbjct: 377 EQAFATLVGLELRPRRLRAAAELWKLLGEQRGTAGR----DALWAHPDLVPTTDDLDDPA 432 Query: 151 NHLKANISEFSVLKD 165 A + + + Sbjct: 433 -AFVAGSDDVDPIAE 446 >gi|119196755|ref|XP_001248981.1| hypothetical protein CIMG_02752 [Coccidioides immitis RS] Length = 804 Score = 38.3 bits (88), Expect = 0.36, Method: Composition-based stats. Identities = 20/130 (15%), Positives = 43/130 (33%), Gaps = 15/130 (11%) Query: 15 CGALAVKALSTFSEHMQILSNRQLSQ----VMNVLPLIETWMKGVK---EEALNVLSSGE 67 C A +A + F H L+ +LS+ ++ L ++ K A+ L + + Sbjct: 518 CKARLERANTAFQAHFGHLNKEELSKYHSQLLEKLENLQDRTMDAKHKYNRAVRELRNEK 577 Query: 68 DLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTW 127 L R + Y ++ V + + G L+ E L + + Sbjct: 578 QRQRNRLI--RANLKRYRDEQPVIDVERQLAG------KLVDTKVMETLEHKSFMPPEHL 629 Query: 128 EQLQKFITRK 137 + ++ Sbjct: 630 IMVDAILSMP 639 >gi|291550825|emb|CBL27087.1| carbamoyl-phosphate synthase, large subunit [Ruminococcus torques L2-14] Length = 1067 Score = 38.3 bits (88), Expect = 0.39, Method: Composition-based stats. Identities = 14/115 (12%), Positives = 29/115 (25%), Gaps = 29/115 (25%) Query: 16 GALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELK 75 +L S S L++ +L + + + W G Sbjct: 398 RSLEQHVDSLMSYDFTDLTDDELIDQLKKVDDMRMW------RIAEACRRGISY------ 445 Query: 76 EGRKGSRTYNNDNQVEQLLMRE-LGDEAYNRTLLSPTETEQLVKRKKVSETTWEQ 129 L+ DE + TE E +K ++++ + Sbjct: 446 ----------------DLIHEITKVDEWFIDKFAIITEMETALKTQELTPELLAE 484 >gi|170582914|ref|XP_001896346.1| laminin alpha chain [Brugia malayi] gi|158596476|gb|EDP34814.1| laminin alpha chain, putative [Brugia malayi] Length = 3357 Score = 38.0 bits (87), Expect = 0.47, Method: Composition-based stats. Identities = 15/114 (13%), Positives = 44/114 (38%), Gaps = 11/114 (9%) Query: 31 QILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYEL---KEGRKGSRTYNND 87 ++L++ ++ L E W+ ++ + + + + ++ R +N+ Sbjct: 1866 RMLNDTSFVELRKQLDAFEQWLHDYRDTIWDRARQ-DTVSSDKVAIIVAKRVNR--FNDV 1922 Query: 88 NQVEQLLMRELGDEAYN-RTLLSPTETEQLVKR----KKVSETTWEQLQKFITR 136 L+ + DE +S + E+++ K ++ET Q+ + + Sbjct: 1923 KANITELLAKATDELSEAENKVSIAKAEKILAMFDDFKLINETVLPQVDEMTRK 1976 >gi|258591266|emb|CBE67563.1| putative 6-phosphogluconate dehydrogenase (gnd-like) [NC10 bacterium 'Dutch sediment'] Length = 327 Score = 38.0 bits (87), Expect = 0.48, Method: Composition-based stats. Identities = 18/126 (14%), Positives = 40/126 (31%), Gaps = 14/126 (11%) Query: 18 LAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELK-- 75 +L E+ L+ +++V ++ +W+ + AL + + + Sbjct: 211 RNALSLDLPEEYRYDLNTADIAEVWRHGSVVSSWLLDLTATALAEDPTLSNYTGF--VQD 268 Query: 76 --EGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKF 133 EGR + +L L +R T E+L+ ++ Sbjct: 269 SGEGRWTIMAAIEEAVPADVLSASLYARFRSRQEH--TFAEKLLSA------MRQKFGGH 320 Query: 134 ITRKDG 139 + R G Sbjct: 321 VERPAG 326 >gi|317047449|ref|YP_004115097.1| argininosuccinate lyase [Pantoea sp. At-9b] gi|316949066|gb|ADU68541.1| argininosuccinate lyase [Pantoea sp. At-9b] Length = 475 Score = 37.6 bits (86), Expect = 0.59, Method: Composition-based stats. Identities = 14/86 (16%), Positives = 27/86 (31%), Gaps = 16/86 (18%) Query: 13 PRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWM-----------KGVKEEALN 61 C A A +LS+ +LS ++ + ++ + V Sbjct: 40 AGCRAHAR---ELARGE--LLSDAELSTMLAAIDELDAEFRAGQLHPIAADEDVHTFVER 94 Query: 62 VLSSGEDLPNYELKEGRKGSRTYNND 87 L+ +L+ GR + ND Sbjct: 95 ALTERLGPLGGKLRAGRSRNDQTVND 120 >gi|154484782|ref|ZP_02027230.1| hypothetical protein EUBVEN_02500 [Eubacterium ventriosum ATCC 27560] gi|149734630|gb|EDM50547.1| hypothetical protein EUBVEN_02500 [Eubacterium ventriosum ATCC 27560] Length = 1065 Score = 37.6 bits (86), Expect = 0.62, Method: Composition-based stats. Identities = 18/117 (15%), Positives = 37/117 (31%), Gaps = 14/117 (11%) Query: 16 GALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELK 75 +L L++ QL ++V+ W+ L G ++ Sbjct: 398 RSLEQHVECLMDYDFTELTDDQLEDQLHVVDDRRIWV------IAEALRRGVSYD--KIH 449 Query: 76 EGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQK 132 E K + + + + + L +E LL + KR + +T +L Sbjct: 450 EITKIDKWFIDKIAIIVEMENSLKNEPLTPELL------KAAKRIEFPDTVISRLTG 500 >gi|18076559|emb|CAC83294.1| T cell transcription factor [Suberites domuncula] Length = 394 Score = 37.2 bits (85), Expect = 0.70, Method: Composition-based stats. Identities = 12/68 (17%), Positives = 26/68 (38%), Gaps = 6/68 (8%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHMQILS-NRQLSQVMNVLPLIETWMKGVKEEAL 60 +N C CR K RC + F+ + ++ N ++ ++ + + ++E A Sbjct: 304 TDNWCGVCRRKKRCVKRTEEDDELFNNDVTDVTKNSNMATILGTIGK-----ESIQESAY 358 Query: 61 NVLSSGED 68 G Sbjct: 359 VTADPGNH 366 >gi|71057430|emb|CAH04889.1| TCF/LEF transcription factor [Suberites domuncula] Length = 416 Score = 37.2 bits (85), Expect = 0.70, Method: Composition-based stats. Identities = 12/68 (17%), Positives = 26/68 (38%), Gaps = 6/68 (8%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEHMQILS-NRQLSQVMNVLPLIETWMKGVKEEAL 60 +N C CR K RC + F+ + ++ N ++ ++ + + ++E A Sbjct: 326 TDNWCGVCRRKKRCVKRTEEDDELFNNDVTDVTKNSNMATILGTIGK-----ESIQESAY 380 Query: 61 NVLSSGED 68 G Sbjct: 381 VTADPGNH 388 >gi|86739649|ref|YP_480049.1| 6-phosphogluconate dehydrogenase-like protein [Frankia sp. CcI3] gi|86566511|gb|ABD10320.1| 6-phosphogluconate dehydrogenase (decarboxylating) [Frankia sp. CcI3] Length = 340 Score = 37.2 bits (85), Expect = 0.79, Method: Composition-based stats. Identities = 22/134 (16%), Positives = 40/134 (29%), Gaps = 27/134 (20%) Query: 19 AVKALSTF-----SEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYE 73 A T + L +++V ++ +W+ + AL + P Sbjct: 221 AEHNAETAPLGNPEYYQYDLDTTAIAEVWRRGSVVSSWLLDLTAAALQ------EDPGLT 274 Query: 74 LKEGR---KGSRTYNNDNQVEQ-----LLMRELGDEAYNRTLLSPTETEQLVKRKKVSET 125 GR G + VE+ +L L + +R E L K +S Sbjct: 275 EFSGRVSDSGEGRWTVLAAVEEGVPAHVLTASLYERFSSRG-------EGLFADKLLSAM 327 Query: 126 TWEQLQKFITRKDG 139 +Q + G Sbjct: 328 R-KQFGGHAEKPAG 340 >gi|197103300|ref|YP_002128678.1| ATP-dependent metalloprotease [Phenylobacterium zucineum HLK1] gi|196480576|gb|ACG80103.1| ATP-dependent metalloprotease [Phenylobacterium zucineum HLK1] Length = 635 Score = 36.8 bits (84), Expect = 0.89, Method: Composition-based stats. Identities = 12/85 (14%), Positives = 37/85 (43%), Gaps = 9/85 (10%) Query: 82 RTYNN-DNQVEQLLMRELGDEAYNR--------TLLSPTETEQLVKRKKVSETTWEQLQK 132 R+Y + + +++ L EA++R + + ++L+ ++ +S+ E++ Sbjct: 551 RSYADGTAEAIDQVVKALVAEAFDRATAILRRNRPVLDSAAQELLAKETLSKLDVERISG 610 Query: 133 FITRKDGKQVIVPCDLPVNHLKANI 157 +T + + P L++ + Sbjct: 611 TVTPEPPADALTPPSAKRPKLQSAV 635 >gi|307565397|ref|ZP_07627886.1| single-strand binding family protein [Prevotella amnii CRIS 21A-A] gi|307345847|gb|EFN91195.1| single-strand binding family protein [Prevotella amnii CRIS 21A-A] Length = 137 Score = 36.8 bits (84), Expect = 1.0, Method: Composition-based stats. Identities = 13/79 (16%), Positives = 30/79 (37%), Gaps = 12/79 (15%) Query: 32 ILSNRQLSQVMN---VLPLIETW-----MKGVKEEALNVLSSGEDLPNYELKEGRKGSRT 83 L+ + V+ +P W K + + A + G+ + EGR +R+ Sbjct: 29 TLATTERGYVLPNGTQVPNRTDWHNIILFKHLAKYAEKYIHKGDKI----FVEGRIRNRS 84 Query: 84 YNNDNQVEQLLMRELGDEA 102 Y++ V +++ + Sbjct: 85 YDDKKGVRRVITEIYAENL 103 >gi|118085982|ref|XP_001235253.1| PREDICTED: hypothetical protein [Gallus gallus] Length = 306 Score = 36.8 bits (84), Expect = 1.0, Method: Composition-based stats. Identities = 11/55 (20%), Positives = 25/55 (45%), Gaps = 1/55 (1%) Query: 19 AVKALSTFSEHMQILS-NRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNY 72 A ++LS + +L++V L +++ + + E AL+ +LP + Sbjct: 48 AGVTARLVRGDGEMLSLDEELAKVQRQLHVLQWRARDITERALHEALRRTELPGF 102 >gi|262274191|ref|ZP_06052003.1| PhnG protein [Grimontia hollisae CIP 101886] gi|262222001|gb|EEY73314.1| PhnG protein [Grimontia hollisae CIP 101886] Length = 150 Score = 36.8 bits (84), Expect = 1.1, Method: Composition-based stats. Identities = 15/58 (25%), Positives = 24/58 (41%), Gaps = 4/58 (6%) Query: 58 EALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQ 115 A+ L SGE Y GR ++ + V LM+ + + L+SP E+ Sbjct: 70 RAVVRLESGE--LGYSYVTGR--NKQHAELAAVVDALMQTVAHDVLQHALISPLAAEK 123 >gi|320159459|ref|YP_004172683.1| hypothetical protein ANT_00490 [Anaerolinea thermophila UNI-1] gi|319993312|dbj|BAJ62083.1| hypothetical protein ANT_00490 [Anaerolinea thermophila UNI-1] Length = 557 Score = 36.8 bits (84), Expect = 1.1, Method: Composition-based stats. Identities = 13/61 (21%), Positives = 23/61 (37%), Gaps = 2/61 (3%) Query: 87 DNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPC 146 + ++L REL T E+ ++ SE ++ + +DGK V P Sbjct: 258 ETGDIEMLTREL--NLKETTPTVLAALERQFGKRWFSEFKNMKVGTKVEGEDGKPVPAPD 315 Query: 147 D 147 Sbjct: 316 S 316 >gi|71008961|ref|XP_758261.1| hypothetical protein UM02114.1 [Ustilago maydis 521] gi|46097936|gb|EAK83169.1| hypothetical protein UM02114.1 [Ustilago maydis 521] Length = 336 Score = 36.8 bits (84), Expect = 1.1, Method: Composition-based stats. Identities = 7/46 (15%), Positives = 17/46 (36%) Query: 16 GALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALN 61 ALA L+ + ++ + +V+ + W + A + Sbjct: 64 RALAKHNLALVAPDFRLCPQVSILEVLEDVADSIRWTLDPEARAAS 109 >gi|307322875|ref|ZP_07602173.1| transposase IS3/IS911 family protein [Sinorhizobium meliloti AK83] gi|306891481|gb|EFN22369.1| transposase IS3/IS911 family protein [Sinorhizobium meliloti AK83] Length = 156 Score = 36.4 bits (83), Expect = 1.2, Method: Composition-based stats. Identities = 10/44 (22%), Positives = 18/44 (40%) Query: 54 GVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRE 97 GV +EA++ EL GR+ R + ++ + L Sbjct: 11 GVMDEAMDEARHDGAYRRIELITGRRQRRNWTDEEKARILAESA 54 >gi|283778133|ref|YP_003368888.1| HhH-GPD family protein [Pirellula staleyi DSM 6068] gi|283436586|gb|ADB15028.1| HhH-GPD family protein [Pirellula staleyi DSM 6068] Length = 398 Score = 36.4 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 12/67 (17%), Positives = 25/67 (37%), Gaps = 7/67 (10%) Query: 6 CRFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSS 65 C C A C A +A+ + +LS R + + + + ++ + ++ L Sbjct: 221 CDQCPASTWCVARREQAVE----QIPLLSKRMVYEYLTEVAVL---VVDRQQVLLRQCQP 273 Query: 66 GEDLPNY 72 GE Sbjct: 274 GERWAGL 280 >gi|313631125|gb|EFR98662.1| phage-associated protein [Listeria seeligeri FSL N1-067] Length = 38 Score = 36.0 bits (82), Expect = 1.6, Method: Composition-based stats. Identities = 5/23 (21%), Positives = 8/23 (34%) Query: 132 KFITRKDGKQVIVPCDLPVNHLK 154 I + GK +V +K Sbjct: 1 DLIEKPAGKPTLVVETDKRQAIK 23 >gi|315126725|ref|YP_004068728.1| carboxy-terminal protease [Pseudoalteromonas sp. SM9913] gi|315015239|gb|ADT68577.1| carboxy-terminal protease [Pseudoalteromonas sp. SM9913] Length = 674 Score = 36.0 bits (82), Expect = 1.7, Method: Composition-based stats. Identities = 19/103 (18%), Positives = 38/103 (36%), Gaps = 18/103 (17%) Query: 18 LAVKALSTFSEHMQILSNRQLSQVMNV---------LPLIETWMKGVKEEALNVLSSGED 68 +LS M + + + L E W + VK +AL + +G+D Sbjct: 125 RYEFSLSLLENEMTF--DEEDEYFYDREDAPWATTQAELDEIWRERVKYDALRLKMTGKD 182 Query: 69 LPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPT 111 P + ++ Y N E+ L++ ++A+ + S Sbjct: 183 WPGIKEV----LTKRYKN---AEKRLVQSKSEDAFQIVMNSLA 218 >gi|169603453|ref|XP_001795148.1| hypothetical protein SNOG_04736 [Phaeosphaeria nodorum SN15] gi|111067376|gb|EAT88496.1| hypothetical protein SNOG_04736 [Phaeosphaeria nodorum SN15] Length = 163 Score = 36.0 bits (82), Expect = 1.7, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 37/90 (41%), Gaps = 11/90 (12%) Query: 22 ALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGS 81 L+T + +++L + S+ E W+ E + + G+ P Y+ GR G+ Sbjct: 67 HLTTVAPSIELLPGIRTSK---SSDRAEDWL----ERRDSRIMEGDPDP-YKTPPGRDGN 118 Query: 82 ---RTYNNDNQVEQLLMRELGDEAYNRTLL 108 R + + V + E G +A R L Sbjct: 119 ASKRKWRDQKDVAKTSEDEKGSKAKERKTL 148 >gi|328951506|ref|YP_004368841.1| (p)ppGpp synthetase I, SpoT/RelA [Marinithermus hydrothermalis DSM 14884] gi|328451830|gb|AEB12731.1| (p)ppGpp synthetase I, SpoT/RelA [Marinithermus hydrothermalis DSM 14884] Length = 735 Score = 36.0 bits (82), Expect = 1.7, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 38/119 (31%), Gaps = 16/119 (13%) Query: 54 GVKEEALNVLSSG--EDLPNY-ELKEGRKGSRTY-----------NNDNQVEQLLMRELG 99 + E + S+G +D NY + + R R + +E+ L R Sbjct: 468 DIVEIITSKASTGPSKDWLNYAKTRSARSKIRHFFREQERGETLVKGQRALEKYLKRRGL 527 Query: 100 DEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANIS 158 L + + +L+ + +Q +T +++VP + + Sbjct: 528 PLPKEAELEAIAQ--KLLGHASPEDLYLAIVQGRVTTAQVARLLVPEEERTPRIAPPPD 584 >gi|260592201|ref|ZP_05857659.1| single-strand binding protein [Prevotella veroralis F0319] gi|260535835|gb|EEX18452.1| single-strand binding protein [Prevotella veroralis F0319] Length = 135 Score = 36.0 bits (82), Expect = 1.8, Method: Composition-based stats. Identities = 14/79 (17%), Positives = 28/79 (35%), Gaps = 12/79 (15%) Query: 32 ILSNRQLSQVMN---VLPLIETW-----MKGVKEEALNVLSSGEDLPNYELKEGRKGSRT 83 L+ + + +P W K + + A + G+ L EGR +R+ Sbjct: 29 TLATTERGYELPGGTKVPDRTDWHNIVLFKSLAKYAEQYIHKGDKL----YVEGRIRNRS 84 Query: 84 YNNDNQVEQLLMRELGDEA 102 Y++ V + + D Sbjct: 85 YDDKKGVRRQVTEIYVDNL 103 >gi|115525707|ref|YP_782618.1| 6-phosphogluconate dehydrogenase-like protein [Rhodopseudomonas palustris BisA53] gi|115519654|gb|ABJ07638.1| 6-phosphogluconate dehydrogenase (decarboxylating) [Rhodopseudomonas palustris BisA53] Length = 345 Score = 36.0 bits (82), Expect = 1.9, Method: Composition-based stats. Identities = 16/97 (16%), Positives = 36/97 (37%), Gaps = 8/97 (8%) Query: 25 TFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELK----EGRKG 80 S+H L +++V +I +W+ + AL + ++ + EGR Sbjct: 218 LASKHRLDLDIADIAEVWRRGSVIPSWLLDLTASALARNDTLDNYSGF--VEDSGEGRWT 275 Query: 81 SRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLV 117 ++ ++L L +R T E+++ Sbjct: 276 VNAAIDEAVPAEVLTAALFARFRSRREH--TFAEKVL 310 >gi|197303253|ref|ZP_03168294.1| hypothetical protein RUMLAC_01976 [Ruminococcus lactaris ATCC 29176] gi|197297679|gb|EDY32238.1| hypothetical protein RUMLAC_01976 [Ruminococcus lactaris ATCC 29176] Length = 1067 Score = 35.6 bits (81), Expect = 2.2, Method: Composition-based stats. Identities = 14/114 (12%), Positives = 34/114 (29%), Gaps = 27/114 (23%) Query: 16 GALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELK 75 +L S S L++ +L + + ++ + W G D Sbjct: 398 RSLEQHVDSLMSYDFTGLTDDELLEQLEIVDDMRMW------RIAEACRRGVDYD----- 446 Query: 76 EGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQ 129 V + + D+ + TE E +K ++++ ++ Sbjct: 447 --------------VIHNITK--VDKWFIDKFAIITEMETALKTQELTPELLKE 484 >gi|213404684|ref|XP_002173114.1| N-acetyltransferase Nat10 [Schizosaccharomyces japonicus yFS275] gi|212001161|gb|EEB06821.1| N-acetyltransferase Nat10 [Schizosaccharomyces japonicus yFS275] Length = 1034 Score = 35.6 bits (81), Expect = 2.3, Method: Composition-based stats. Identities = 22/126 (17%), Positives = 49/126 (38%), Gaps = 6/126 (4%) Query: 18 LAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSG---EDLPNYEL 74 L K + + ++ SN+ L+ ++ +L + + V+E+A+ + + P + Sbjct: 874 LQRKNVDDLEKEFKLPSNQVLALLIKLLRKVVNHLDSVEEQAIEAEMNDSSAQQKPESK- 932 Query: 75 KEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFI 134 K + ++ E L R L++ + E+ + E W +K I Sbjct: 933 VPEFKPLQQNLDEELEEGADEALLQLREKQRELINAVDLEKYS--IRGDEKQWSAAEKQI 990 Query: 135 TRKDGK 140 + GK Sbjct: 991 NKTSGK 996 >gi|327470377|gb|EGF15833.1| recombination regulator RecX [Streptococcus sanguinis SK330] Length = 258 Score = 35.6 bits (81), Expect = 2.4, Method: Composition-based stats. Identities = 21/125 (16%), Positives = 44/125 (35%), Gaps = 18/125 (14%) Query: 21 KALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKG 80 K L+ + + + +++ + ++ E ++ + N+ Sbjct: 59 KNLALYHLSFKQRTAKEVKDYLTQ--------HDIQAEIISQVLDNLKKDNW------IN 104 Query: 81 SRTYNNDNQVEQLLMRELGDEAYNRTL----LSPTETEQLVKRKKVSETTWEQLQKFITR 136 R Y N LL + G + L +S T E+ + R +E T + QK + + Sbjct: 105 DRKYANSFIQSNLLTGDKGAFVLKQKLSQKGISSTIIEEELGRFDFTELTDKVAQKLLKK 164 Query: 137 KDGKQ 141 GK Sbjct: 165 HQGKL 169 >gi|148271487|ref|YP_001221048.1| hypothetical protein CMM_0308 [Clavibacter michiganensis subsp. michiganensis NCPPB 382] gi|147829417|emb|CAN00330.1| conserved hypothetical protein [Clavibacter michiganensis subsp. michiganensis NCPPB 382] Length = 413 Score = 35.6 bits (81), Expect = 2.5, Method: Composition-based stats. Identities = 8/58 (13%), Positives = 22/58 (37%) Query: 52 MKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLS 109 + + ++ + +P +E+ +GR G+R + ++ E + L Sbjct: 242 LAAIDDDDIRAALQDFVVPGWEVPDGRGGTRRIRMEADAASAIVDVCRGEPFLFQLAG 299 >gi|262047478|ref|ZP_06020434.1| glycosyltransferase [Lactobacillus crispatus MV-3A-US] gi|260572248|gb|EEX28812.1| glycosyltransferase [Lactobacillus crispatus MV-3A-US] Length = 302 Score = 35.6 bits (81), Expect = 2.5, Method: Composition-based stats. Identities = 16/72 (22%), Positives = 29/72 (40%), Gaps = 6/72 (8%) Query: 43 NVLPLIETWMKGVKEEALNVLSSG----EDLPNYELKEGRKGSRTYNNDNQVEQLLMREL 98 N + LI W ++ +A + +G P+ ++ GR + + +E+ Sbjct: 99 NAVDLINNWCDDIEHDANSQNFAGVAGLRKYPDNKINGGRGNGKLIIDATNLEREKFNLG 158 Query: 99 GD--EAYNRTLL 108 GD E Y LL Sbjct: 159 GDKAEIYKTNLL 170 >gi|225570405|ref|ZP_03779430.1| hypothetical protein CLOHYLEM_06505 [Clostridium hylemonae DSM 15053] gi|225160776|gb|EEG73395.1| hypothetical protein CLOHYLEM_06505 [Clostridium hylemonae DSM 15053] Length = 1065 Score = 35.6 bits (81), Expect = 2.5, Method: Composition-based stats. Identities = 13/105 (12%), Positives = 33/105 (31%), Gaps = 12/105 (11%) Query: 16 GALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETW--MKGVKE-EALNVLSSGEDLPNY 72 +L S S L+ +L + + + + W + V++ + + + + Sbjct: 398 RSLEQHVDSLMSYDFSGLTKEELLEQLETVDDMRIWRIAEAVRKGLTYDEIHGATKIDRW 457 Query: 73 ---E---LKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPT 111 + L E + N ++L E + + T Sbjct: 458 FIDKIAILVE---MEQKLQNTTLSAEVLREAKRLEFPDNVIAGLT 499 >gi|118086605|ref|XP_001233744.1| PREDICTED: hypothetical protein [Gallus gallus] Length = 306 Score = 35.6 bits (81), Expect = 2.5, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 25/55 (45%), Gaps = 1/55 (1%) Query: 19 AVKALSTFSEHMQILS-NRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNY 72 A +++S + +L++V L +++ + + E AL+ +LP + Sbjct: 48 AGATARLVRGDGEMVSLDEELAKVQRQLHVLQWRARDITERALHEALRRTELPGF 102 >gi|150378295|ref|YP_001314889.1| transposase IS3/IS911 family protein [Sinorhizobium medicae WSM419] gi|150032842|gb|ABR64956.1| transposase IS3/IS911 family protein [Sinorhizobium medicae WSM419] Length = 160 Score = 35.6 bits (81), Expect = 2.5, Method: Composition-based stats. Identities = 9/46 (19%), Positives = 17/46 (36%) Query: 52 MKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRE 97 M +EA++ EL GR+ R + ++ + L Sbjct: 13 MHEAMDEAMDEARHDGAYRRIELITGRRQRRNWTDEEKARILAESA 58 >gi|75676826|ref|YP_319247.1| 6-phosphogluconate dehydrogenase-like protein [Nitrobacter winogradskyi Nb-255] gi|74421696|gb|ABA05895.1| 6-phosphogluconate dehydrogenase (decarboxylating) [Nitrobacter winogradskyi Nb-255] Length = 338 Score = 35.6 bits (81), Expect = 2.6, Method: Composition-based stats. Identities = 15/89 (16%), Positives = 32/89 (35%), Gaps = 8/89 (8%) Query: 33 LSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELK----EGRKGSRTYNNDN 88 L +++V +I +W+ + AL + E + EGR ++ Sbjct: 227 LDLADIAEVWRRGSVISSWLLDLTASALARSETLEAYSGF--VADSGEGRWTVNAAIDEA 284 Query: 89 QVEQLLMRELGDEAYNRTLLSPTETEQLV 117 ++L L +R T E+++ Sbjct: 285 VPAEVLTAALFARFRSRQQH--TFAEKIL 311 >gi|325262943|ref|ZP_08129679.1| carbamoyl-phosphate synthase, large subunit [Clostridium sp. D5] gi|324032037|gb|EGB93316.1| carbamoyl-phosphate synthase, large subunit [Clostridium sp. D5] Length = 1067 Score = 35.3 bits (80), Expect = 2.7, Method: Composition-based stats. Identities = 7/43 (16%), Positives = 19/43 (44%), Gaps = 2/43 (4%) Query: 16 GALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETW--MKGVK 56 +L S S LS+ +L++ + ++ + W + ++ Sbjct: 398 RSLEQHVDSLMSYDFTELSDEELTEKLALVDDMRIWRIAEAIR 440 >gi|331694605|ref|YP_004330844.1| BadM/Rrf2 family transcriptional regulator [Pseudonocardia dioxanivorans CB1190] gi|326949294|gb|AEA22991.1| transcriptional regulator, BadM/Rrf2 family [Pseudonocardia dioxanivorans CB1190] Length = 148 Score = 35.3 bits (80), Expect = 2.7, Method: Composition-based stats. Identities = 10/35 (28%), Positives = 17/35 (48%), Gaps = 1/35 (2%) Query: 7 RFCRAKPRCGALAVKALSTFSEHMQILSNRQLSQV 41 RFCR CGA V + + + +L + L ++ Sbjct: 96 RFCRVAGVCGARNVFSRALDAY-FAVLDDATLEEI 129 >gi|310286600|ref|YP_003937858.1| hypothetical protein BBIF_0079 [Bifidobacterium bifidum S17] gi|309250536|gb|ADO52284.1| hypothetical protein BBIF_0079 [Bifidobacterium bifidum S17] Length = 148 Score = 35.3 bits (80), Expect = 2.8, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 42/129 (32%), Gaps = 10/129 (7%) Query: 18 LAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEG 77 +AL+ E + I +++S++ + ++ +K L P +LK Sbjct: 30 QREEALADQREALLIRQAQRISELDERILVLTQERDELKARILEQHPQPGTYPAGKLKVL 89 Query: 78 RKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRK 137 K R + Y S T +LV +++FITR Sbjct: 90 VKQGRRGLDGKAFTAAFPPAENPRLYELKPKSLTNVAKLVGE--------LAVEQFITRS 141 Query: 138 DGKQVIVPC 146 K +V Sbjct: 142 --KPSVVVE 148 >gi|92118735|ref|YP_578464.1| 6-phosphogluconate dehydrogenase-like protein [Nitrobacter hamburgensis X14] gi|91801629|gb|ABE64004.1| 6-phosphogluconate dehydrogenase (decarboxylating) [Nitrobacter hamburgensis X14] Length = 337 Score = 35.3 bits (80), Expect = 2.8, Method: Composition-based stats. Identities = 15/89 (16%), Positives = 32/89 (35%), Gaps = 8/89 (8%) Query: 33 LSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELK----EGRKGSRTYNNDN 88 L +++V +I +W+ + AL + E + EGR ++ Sbjct: 227 LDLADIAEVWRRGSVIPSWLLDLTASALARSETLEAYSGF--VADSGEGRWTINAAIDEA 284 Query: 89 QVEQLLMRELGDEAYNRTLLSPTETEQLV 117 ++L L +R T E+++ Sbjct: 285 VPAEVLTAALFARFRSRRQH--TFAEKIL 311 >gi|27381870|ref|NP_773399.1| 6-phosphogluconate dehydrogenase-like protein [Bradyrhizobium japonicum USDA 110] gi|27355039|dbj|BAC52024.1| 6-phosphogluconate dehydrogenase [Bradyrhizobium japonicum USDA 110] Length = 332 Score = 35.3 bits (80), Expect = 2.9, Method: Composition-based stats. Identities = 13/95 (13%), Positives = 32/95 (33%), Gaps = 11/95 (11%) Query: 27 SEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELK----EGRKGSR 82 + +++V +I +W+ + AL + + + EGR Sbjct: 223 RYDFDL---ADIAEVWRRGSVIPSWLLDLTSTALADSPALAEYSGF--VEDSGEGRWTVN 277 Query: 83 TYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLV 117 ++ ++L L +R T E+++ Sbjct: 278 AAIDEAVPAEVLTAALYTRFRSRKEH--TFAEKIL 310 >gi|218886805|ref|YP_002436126.1| hypothetical protein DvMF_1712 [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218757759|gb|ACL08658.1| conserved hypothetical protein [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 85 Score = 35.3 bits (80), Expect = 3.0, Method: Composition-based stats. Identities = 13/70 (18%), Positives = 24/70 (34%), Gaps = 15/70 (21%) Query: 5 ACRFCRAKPRCGALAVKA----------LSTFSEHMQI-LSNRQLSQVMNVLPL----IE 49 C FC AK +C ++ +++ Q++ V+ V + Sbjct: 13 GCGFCCAKAQCPPGREAYGDRRRCPGLFWDGARYRCRLVMTDAQVAAVLQVGEGCCRPLN 72 Query: 50 TWMKGVKEEA 59 W K V+E Sbjct: 73 RWRKDVRERV 82 >gi|150377607|ref|YP_001314202.1| transposase IS3/IS911 family protein [Sinorhizobium medicae WSM419] gi|150396438|ref|YP_001326905.1| transposase IS3/IS911 family protein [Sinorhizobium medicae WSM419] gi|150027953|gb|ABR60070.1| transposase IS3/IS911 family protein [Sinorhizobium medicae WSM419] gi|150032154|gb|ABR64269.1| transposase IS3/IS911 family protein [Sinorhizobium medicae WSM419] Length = 156 Score = 35.3 bits (80), Expect = 3.1, Method: Composition-based stats. Identities = 10/44 (22%), Positives = 17/44 (38%) Query: 54 GVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRE 97 GV EA++ EL GR+ R + ++ + L Sbjct: 11 GVMHEAMDEARHDGAYRRIELITGRRQRRNWTDEEKARILAESA 54 >gi|331225185|ref|XP_003325263.1| hypothetical protein PGTG_07096 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] gi|309304253|gb|EFP80844.1| hypothetical protein PGTG_07096 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] Length = 343 Score = 35.3 bits (80), Expect = 3.2, Method: Composition-based stats. Identities = 22/114 (19%), Positives = 41/114 (35%), Gaps = 19/114 (16%) Query: 48 IETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTL 107 +E W +KE SS N L + R + + + R Sbjct: 95 LEVWPDSLKEYTGLDKSSSMIWLNETLLKKRSNRPFLGQEASDLKAS--------FRRIT 146 Query: 108 LSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVPCDLPVNHLKANISEFS 161 +SPT+ E+L K + T+ + L + I +++ K ++ S+F Sbjct: 147 ISPTQAERL---KHATTTSTDWLNQ-IEKEEAK-------EKRPEIELEPSKFD 189 >gi|190409986|ref|YP_001965510.1| putative transposase [Sinorhizobium meliloti] gi|125631016|gb|ABN47017.1| putative transposase [Sinorhizobium meliloti SM11] Length = 156 Score = 35.3 bits (80), Expect = 3.2, Method: Composition-based stats. Identities = 10/47 (21%), Positives = 19/47 (40%) Query: 54 GVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGD 100 GV +EA++ E+ GR+ R + ++ + L D Sbjct: 11 GVMDEAMDEARHDGVYRRIEVITGRRQRRNWTDEEKARILAESAEPD 57 >gi|163800845|ref|ZP_02194745.1| tail-specific protease [Vibrio sp. AND4] gi|159175194|gb|EDP59991.1| tail-specific protease [Vibrio sp. AND4] Length = 664 Score = 35.3 bits (80), Expect = 3.3, Method: Composition-based stats. Identities = 21/102 (20%), Positives = 40/102 (39%), Gaps = 6/102 (5%) Query: 20 VKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELK-EGR 78 K + ++ L + + + L E W K VK +ALN+ +G+ P + E R Sbjct: 135 DKEIKFDTDEFIELDRSEAAWPKDKAELNELWRKRVKYDALNLKMTGKKWPEIKEVLEKR 194 Query: 79 KGS-----RTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQ 115 + N++ + + + + + LSP EQ Sbjct: 195 YNNAMKRITQTRNEDAFQLYMNAFSREVDPHTSYLSPRNAEQ 236 >gi|156974628|ref|YP_001445535.1| carboxy-terminal protease [Vibrio harveyi ATCC BAA-1116] gi|156526222|gb|ABU71308.1| hypothetical protein VIBHAR_02346 [Vibrio harveyi ATCC BAA-1116] Length = 664 Score = 34.9 bits (79), Expect = 3.6, Method: Composition-based stats. Identities = 21/102 (20%), Positives = 41/102 (40%), Gaps = 6/102 (5%) Query: 20 VKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELK-EGR 78 K + ++ L + + + L E W K VK +ALN+ +G++ P + E R Sbjct: 135 DKEIKFDTDEFIELDRSEAAWPKDEAELNELWRKRVKYDALNLKMTGKEWPEIKEVLEKR 194 Query: 79 KGS-----RTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQ 115 + N++ + + + + + LSP EQ Sbjct: 195 YNNAMKRITQTRNEDAFQLYMNAFAREVDPHTSYLSPRNAEQ 236 >gi|108946929|gb|ABG23812.1| putative NBS-LRR disease resistance protein [Malus x domestica] Length = 156 Score = 34.9 bits (79), Expect = 3.8, Method: Composition-based stats. Identities = 16/75 (21%), Positives = 21/75 (28%), Gaps = 8/75 (10%) Query: 52 MKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPT 111 + V + G GSR E LL D+ Y L+ Sbjct: 73 LDDVDQLEQLEALCGHSWFG-------SGSRIIITSRD-EHLLRTYGVDKMYKVKPLTDA 124 Query: 112 ETEQLVKRKKVSETT 126 E QL RK + Sbjct: 125 EVLQLFGRKAFKKDQ 139 >gi|302843948|ref|XP_002953515.1| hypothetical protein VOLCADRAFT_109207 [Volvox carteri f. nagariensis] gi|300261274|gb|EFJ45488.1| hypothetical protein VOLCADRAFT_109207 [Volvox carteri f. nagariensis] Length = 566 Score = 34.5 bits (78), Expect = 4.5, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 41/128 (32%), Gaps = 15/128 (11%) Query: 33 LSNRQLSQVMNV--LPLIETWMKGVKEEALNVLSSGEDLPNY---ELK--EGRKGSRTYN 85 L N +L+ V +++++ + +N + P Y ++ G KG+ + Sbjct: 289 LDNEELAAVFKEWNQGELKSFLVEISAIIMNK--PDDQAPGYLVDKIVDQTGSKGTGKWT 346 Query: 86 NDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDGKQVIVP 145 E + A + LS + E++ K + + + +V Sbjct: 347 VQQAAELAVAAPTMASALDARYLSALKAERVAASKVFASCAQPGAVPRVDKAQ----LV- 401 Query: 146 CDLPVNHL 153 L Sbjct: 402 -ADVRAAL 408 >gi|299138763|ref|ZP_07031941.1| protein of unknown function DUF1905 [Acidobacterium sp. MP5ACTX8] gi|298599399|gb|EFI55559.1| protein of unknown function DUF1905 [Acidobacterium sp. MP5ACTX8] Length = 221 Score = 34.5 bits (78), Expect = 4.6, Method: Composition-based stats. Identities = 17/104 (16%), Positives = 37/104 (35%), Gaps = 12/104 (11%) Query: 35 NRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLL 94 +L+ +++ P + W + E + G+ + + E R + E+LL Sbjct: 100 PDELAVLLDDEPGLREWYDELTEYMRREI--GKWIAGVKSDEARMRR----AEQAAERLL 153 Query: 95 MRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRKD 138 G R L P E +++ + W ++ R + Sbjct: 154 AAMEG----EREL--PPVIEAAFRKRPKARVGWAKMTPVQRRGE 191 >gi|254254137|ref|ZP_04947454.1| hypothetical protein BDAG_03426 [Burkholderia dolosa AUO158] gi|124898782|gb|EAY70625.1| hypothetical protein BDAG_03426 [Burkholderia dolosa AUO158] Length = 185 Score = 34.5 bits (78), Expect = 5.6, Method: Composition-based stats. Identities = 12/92 (13%), Positives = 23/92 (25%), Gaps = 9/92 (9%) Query: 11 AKPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLP 70 A+ AL ++++ V+ IE AL+ S + Sbjct: 15 ARATFRALRSHHADEHFYAFALMTDSGAMTVLPAANSIE---------ALSRAGSIAPID 65 Query: 71 NYELKEGRKGSRTYNNDNQVEQLLMRELGDEA 102 E R + + +E Sbjct: 66 GVEYVPQRAPQQMWRVTEWAYASAHASPFNEI 97 >gi|257462361|ref|ZP_05626776.1| cobaltochelatase, CobN subunit [Fusobacterium sp. D12] gi|317060024|ref|ZP_07924509.1| hydrogenobyrinic acid a,c-diamide cobaltochelatase [Fusobacterium sp. D12] gi|313685700|gb|EFS22535.1| hydrogenobyrinic acid a,c-diamide cobaltochelatase [Fusobacterium sp. D12] Length = 1245 Score = 34.5 bits (78), Expect = 5.6, Method: Composition-based stats. Identities = 17/95 (17%), Positives = 37/95 (38%), Gaps = 13/95 (13%) Query: 45 LPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYN 104 L + W + +G+++ N ++ +G R + N +V + + + E Y Sbjct: 397 LKNMTDWFSEIGIFLEYPFENGQEIIN-KIIQGVSNDRKWLNVEKVMEKSIDTVSKEKYQ 455 Query: 105 RTLLSPTETEQLVKRKKVSETTWEQLQKFITRKDG 139 R L E E V++K +++ + G Sbjct: 456 RWFL---ELENEVQKK---------IEEQWGKAPG 478 >gi|290961738|ref|YP_003492920.1| phenylalanyl-tRNA synthetase subunit beta [Streptomyces scabiei 87.22] gi|260651264|emb|CBG74386.1| putative phenylalanyl-tRNA synthetase beta chain [Streptomyces scabiei 87.22] Length = 840 Score = 34.1 bits (77), Expect = 5.7, Method: Composition-based stats. Identities = 15/127 (11%), Positives = 41/127 (32%), Gaps = 23/127 (18%) Query: 18 LAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEG 77 A + ++ + ++ +++ + LP + V A + G Sbjct: 598 RAEQRVAVAPPVDRRPTDDEIAALNAALPEQPRHVAVVVAGAREQA-------GW---WG 647 Query: 78 RKGSRTYNNDNQVEQLLMRELGDEAYNRTLLSPTETEQLVKRKKVSETTWEQLQKFITRK 137 R + + + + + RE G E +V++ + + + + Sbjct: 648 RGRPADWADTVEAGRAVAREAGAEL-------------IVRKGQYGPWHPGRCAELVIVA 694 Query: 138 DGKQVIV 144 DG + +V Sbjct: 695 DGTERVV 701 >gi|187251475|ref|YP_001875957.1| hypothetical protein Emin_1068 [Elusimicrobium minutum Pei191] gi|186971635|gb|ACC98620.1| Putative RecB family exonuclease [Elusimicrobium minutum Pei191] Length = 411 Score = 34.1 bits (77), Expect = 5.9, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 55/147 (37%), Gaps = 21/147 (14%) Query: 2 DENACRFCRAKPRCGALAVKALSTFSEH----MQIL--SNRQLSQVMNVLPLIE---TWM 52 E+ CR+C K C F + + +N + + + + + Sbjct: 255 SESKCRWCDYKAMCPVFTGMEFEQFQKTEKPVFSDIPVTNEDI--LSSKIDELAETGQKY 312 Query: 53 KGVKEEALNVLSSG----EDLPNYELKEGRKGSRTYNNDNQVEQLLMRELGDEAYNRTLL 108 +K+E ++++ NY+++ +K + + +V + L +TL+ Sbjct: 313 SSLKKEIISLMKQNNYNQHFGSNYKVELKQKEFLDFEDKEKVIEFLKE---KNLIKKTLV 369 Query: 109 SPTE--TEQLVKRKKVSETTWEQLQKF 133 PT+ E L+ V E +L++ Sbjct: 370 -PTQCSIEALLDDPSVPEDDKARLKEL 395 >gi|148724476|ref|YP_001285442.1| ribonucleotide reductase [Cyanophage Syn5] gi|145588121|gb|ABP87940.1| ribonucleotide reductase [Synechococcus phage Syn5] Length = 464 Score = 34.1 bits (77), Expect = 6.8, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 30/78 (38%), Gaps = 6/78 (7%) Query: 13 PRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKG----VKEEALNVLSSGED 68 C + E S +L V + + W V+E+ L + +G+ Sbjct: 104 GACVIHLDANHTDL-EEFVDASRAELPWVKKCIDITTEWWNDLTEPVQEKILKAIQAGDI 162 Query: 69 LPNYELKEGRKGSRTYNN 86 N ++K RKG+R Y N Sbjct: 163 WLN-KVKYDRKGNRIYGN 179 >gi|302410613|ref|XP_003003140.1| insulin-degrading enzyme [Verticillium albo-atrum VaMs.102] gi|261358164|gb|EEY20592.1| insulin-degrading enzyme [Verticillium albo-atrum VaMs.102] Length = 834 Score = 33.7 bits (76), Expect = 7.5, Method: Composition-based stats. Identities = 18/107 (16%), Positives = 39/107 (36%), Gaps = 8/107 (7%) Query: 34 SNRQLSQVMNVLPLIETWMKGVKEEALNVLSSGEDLPNYELKEGRKGSRTYNNDNQVEQL 93 +N +L Q + + TW+ ++ + + +LPN + R + + Sbjct: 422 NNWELQQPFSQVSDYTTWLNSERDYVVEEYLA--ELPNISA----EDIRQFKKQMLAQMR 475 Query: 94 LMRELGDEAYNRTLLSPTET-EQLVKRKKVSETTWEQLQKFITRKDG 139 + Y L T+ E ++K + + +T W + I G Sbjct: 476 IEAYAHGNIYKEDALKLTDMVETILKPRILPQTQWPVTRSLIL-PPG 521 >gi|168705097|ref|ZP_02737374.1| peptide chain release factor 1 [Gemmata obscuriglobus UQM 2246] Length = 358 Score = 33.7 bits (76), Expect = 7.7, Method: Composition-based stats. Identities = 5/43 (11%), Positives = 18/43 (41%) Query: 12 KPRCGALAVKALSTFSEHMQILSNRQLSQVMNVLPLIETWMKG 54 K C A+A + ++ +++ +L+ + + ++ Sbjct: 52 KRLCRAIADAEVMAADPDLKDMADEELADLRPKRDALHARIEE 94 >gi|322372100|ref|ZP_08046642.1| hypothetical protein ZOD2009_21412 [Haladaptatus paucihalophilus DX253] gi|320548522|gb|EFW90194.1| hypothetical protein ZOD2009_21412 [Haladaptatus paucihalophilus DX253] Length = 609 Score = 33.7 bits (76), Expect = 7.8, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 33/77 (42%), Gaps = 6/77 (7%) Query: 29 HMQILSNRQLSQVMNVLPLIETWMKGVKEEALNVLSS----GEDLPN-YELKEGRKGSRT 83 + S +L+ + + ++ ++ ++ E +VLSS G+++ + GR+ +R Sbjct: 477 DFEDRSTAELADMYALFSDVKGQIETLRTEVRDVLSSRVGEGQEVTGSFGSVSGRRRTRK 536 Query: 84 Y-NNDNQVEQLLMRELG 99 + V L R Sbjct: 537 SLKDSETVLTELARHGV 553 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.308 0.151 0.467 Lambda K H 0.267 0.0469 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,719,353,022 Number of Sequences: 14124377 Number of extensions: 66974303 Number of successful extensions: 239330 Number of sequences better than 10.0: 331 Number of HSP's better than 10.0 without gapping: 241 Number of HSP's successfully gapped in prelim test: 90 Number of HSP's that attempted gapping in prelim test: 238292 Number of HSP's gapped (non-prelim): 370 length of query: 165 length of database: 4,842,793,630 effective HSP length: 126 effective length of query: 39 effective length of database: 3,063,122,128 effective search space: 119461762992 effective search space used: 119461762992 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 76 (33.7 bits)