BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= gi|254781210|ref|YP_003065623.1| hypothetical protein CLIBASIA_05585 [Candidatus Liberibacter asiaticus str. psy62]
         (343 letters)

Database: nr
           14,124,377 sequences; 4,842,793,630 total letters

Searching..................................................done

>gi|254781210|ref|YP_003065623.1| hypothetical protein CLIBASIA_05585 [Candidatus Liberibacter asiaticus str. psy62]
 gi|254040887|gb|ACT57683.1| hypothetical protein CLIBASIA_05585 [Candidatus Liberibacter asiaticus str. psy62]
 gi|317120675|gb|ADV02498.1| putative major capsid protein [Liberibacter phage SC1]
 gi|317120819|gb|ADV02640.1| putative major capsid protein [Candidatus Liberibacter asiaticus]
          Length = 343

 Score = 242 bits (617), Expect = 5e-62, Method: Composition-based stats.
 Identities = 343/343 (100%), Positives = 343/343 (100%)

Query: 1   MATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG 60
           MATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG
Sbjct: 1   MATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG 60

Query: 61  DMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAI 120
           DMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAI
Sbjct: 61  DMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAI 120

Query: 121 LKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQ 180
           LKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQ
Sbjct: 121 LKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQ 180

Query: 181 VYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTK 240
           VYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTK
Sbjct: 181 VYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTK 240

Query: 241 FPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKW 300
           FPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKW
Sbjct: 241 FPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKW 300

Query: 301 HAPQITLTSSFGATRIEPDKILGIEISKDSLKGVPVLKGTKAA 343
           HAPQITLTSSFGATRIEPDKILGIEISKDSLKGVPVLKGTKAA
Sbjct: 301 HAPQITLTSSFGATRIEPDKILGIEISKDSLKGVPVLKGTKAA 343

>gi|291334460|gb|ADD94114.1| hypothetical protein ALOHA_HF400048F7ctg1g15 [uncultured phage MedDCM-OCT-S04-C1161]
 gi|291334517|gb|ADD94170.1| hypothetical protein ALOHA_HF400048F7ctg1g15 [uncultured phage MedDCM-OCT-S04-C1201]
 gi|291334663|gb|ADD94310.1| hypothetical protein ALOHA_HF400048F7ctg1g15 [uncultured phage MedDCM-OCT-S04-C695]
 gi|291334717|gb|ADD94363.1| hypothetical protein ALOHA_HF400048F7ctg1g15 [uncultured phage MedDCM-OCT-S04-C890]
 gi|291336443|gb|ADD95998.1| hypothetical protein ALOHA_HF400048F7ctg1g15 [uncultured organism MedDCM-OCT-S04-C1073]
 gi|291336930|gb|ADD96458.1| hypothetical protein ALOHA_HF400048F7ctg1g15 [uncultured organism MedDCM-OCT-S09-C787]
          Length = 287

 Score = 236 bits (601),
Expect = 4e-60, Method: Composition-based stats. Identities = 60/326 (18%), Positives = 109/326 (33%), Gaps = 40/326 (12%) Query: 3 TKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDM 62 ++ A + ++ ++++ Q+ S LR V ++ +A + A Sbjct: 1 MSTEITKAFVEQYSSNIQMLSQQKGSLLRDKVRLESVT-GKNAFFDQIGSVTATVRSTRH 59 Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 DT T RR V + +A+ +D + ++P YA A AM R D+AI+ Sbjct: 60 SDTPQADTPHSRRRVSLVDYEFADLVDDLDKVRMLVDPTSSYAQAAAFAMGRAMDDAIIT 119 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 G G G ++ I +L AK I +D + Sbjct: 120 AATGSADTGVAGGTAVALPSAQKIAE---AGTAGLTIAKLRQAKEILDLASVDPSIPRYI 176 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 V+ P + L TS D+ AL G + F G F ++ Sbjct: 177 VVSPKQI-TDLLGTTEVTSSDFNTVKALAQGDLSTFLGFNFCVSNRL------------- 222 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHA 302 K + + + K + + K +A Sbjct: 223 ----------------------TIASSKRKCFAFAQDGLALAVGKDSTARIDERSDKGYA 260 Query: 303 PQITLTSSFGATRIEPDKILGIEISK 328 Q+ +++FGATR+E +K++ I + Sbjct: 261 TQVYYSAAFGATRMEEEKVVEILAHE 286 >gi|315121935|ref|YP_004062424.1| hypothetical protein CKC_00925 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|315122897|ref|YP_004063386.1| hypothetical protein CKC_05765 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495337|gb|ADR51936.1| hypothetical protein CKC_00925 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496299|gb|ADR52898.1| hypothetical protein CKC_05765 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 343 Score = 228 bits (580), Expect = 1e-57, Method: Composition-based stats. 
Identities = 267/343 (77%), Positives = 303/343 (88%) Query: 1 MATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG 60 MATK+QLATANI EFKKHVELALQ+ SKLRPTVTE++TEGE SA VE+FKP+EAH+I+G Sbjct: 1 MATKQQLATANILEFKKHVELALQQETSKLRPTVTEKSTEGEKSAYVEIFKPSEAHKIIG 60 Query: 61 DMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAI 120 DM DTIYN TDQ RRW+ H QFGWAERIDPFATLDSG+NPLLPYA LATAAMHRKQDE I Sbjct: 61 DMSDTIYNNTDQSRRWISHEQFGWAERIDPFATLDSGLNPLLPYAKLATAAMHRKQDEVI 120 Query: 121 LKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQ 180 L+GMLGVN+ GK E FS +NI+SAV+GDDFF+TFIGQLITAKSIF +R+IDVDSEQ Sbjct: 121 LEGMLGVNQCGKDAKSLEPFSADNIISAVDGDDFFQTFIGQLITAKSIFMERHIDVDSEQ 180 Query: 181 VYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTK 240 +YVL+PSDVWASLFALE+ATSKDYINTAALQAG+IEAFAGV FINMEKVPGN+LFP+GT+ Sbjct: 181 IYVLVPSDVWASLFALEKATSKDYINTAALQAGRIEAFAGVRFINMEKVPGNNLFPSGTQ 240 Query: 241 FPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKW 300 FPGL D K++ G+ V SSAKF + KIKYVLPIYCKSAV FTQRKA++V+HS+DP KW Sbjct: 241 FPGLTDSKIKNVAGQVGVTSSAKFANDKIKYVLPIYCKSAVAFTQRKAVEVKHSEDPSKW 300 Query: 301 HAPQITLTSSFGATRIEPDKILGIEISKDSLKGVPVLKGTKAA 343 HAPQITLT+SFGA R+EP+KILGIEIS SLKGVP L G KAA Sbjct: 301 HAPQITLTASFGAARVEPEKILGIEISHASLKGVPKLVGKKAA 343 >gi|288959326|ref|YP_003449667.1| hypothetical protein AZL_024850 [Azospirillum sp. B510] gi|288911634|dbj|BAI73123.1| hypothetical protein AZL_024850 [Azospirillum sp. B510] Length = 272 Score = 219 bits (556), Expect = 6e-55, Method: Composition-based stats. 
Identities = 48/327 (14%), Positives = 91/327 (27%), Gaps = 55/327 (16%) Query: 3 TKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDM 62 +A A + +F++ V A Q SKLR TV + AS + + A Sbjct: 1 MSTSIAQAFVKQFEREVHEAYQRMGSKLRNTVRSKNNVQGASTVFQKVGKGAASTK-SRH 59 Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 D F + +D L + I+ A+ A+ RK DE ++ Sbjct: 60 GAVPVMNLDHTPVECALYDFYAGDWVDRLDELKTNIDERQIIANAGAYALGRKTDELLIA 119 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + G +++TA + + + D ++ Sbjct: 120 ELDKS-------------------VSYAGAATDGLTKAKILTAFEMMGEADVPDDGQRYA 160 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 V+ L IE FA ++ +++P A Sbjct: 161 VVGWKQWSQLL--------------------GIEEFARSDYVGTDELPW-RGTQAKRWLG 199 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHA 302 L + Y K+AV + + + A Sbjct: 200 TLWLPHSG-------------LTLNGGVRLCHWYHKTAVGHAAGADVKTDITWHGDRA-A 245 Query: 303 PQITLTSSFGATRIEPDKILGIEISKD 329 + S GA I+ ++ + + Sbjct: 246 HFVNNMMSQGAALIDTSGVVTLRCLES 272 >gi|317152367|ref|YP_004120415.1| hypothetical protein Daes_0651 [Desulfovibrio aespoeensis Aspo-2] gi|316942618|gb|ADU61669.1| hypothetical protein Daes_0651 [Desulfovibrio aespoeensis Aspo-2] Length = 276 Score = 213 bits (542), Expect = 3e-53, Method: Composition-based stats. 
Identities = 49/331 (14%), Positives = 98/331 (29%), Gaps = 59/331 (17%) Query: 3 TKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDM 62 + + + E+ + V + Q+ SK+R TV Q +S + + A + Sbjct: 1 MSTTITNSFVTEYAEMVHQSYQQRGSKMRNTVRLQTGVIGSSCVFQRIGRGAAGKKT-RH 59 Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 + D S + AE +D L + A A+ RK DE ++ Sbjct: 60 GNVPLMNLDHTSVSCTLSDWYAAEYVDKLDELKQKQDEHKVAAEAGAWALGRKIDELLIS 119 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + G + G +++ + D + Sbjct: 120 RLTGAANVIEEG-------------------NTGLTKDKILRGFGTLNASDVADDGHRFA 160 Query: 183 VLIPSDVWASLFALERATSKDYINTAA--LQAGKIEAFAGVWFINMEKVPGNDLFPAGTK 240 ++ P W L ++ S DY L + + G+ ++ +P + Sbjct: 161 MVGPHQ-WNELLNIQEFKSSDYAGEQFAWLTGTESRTWLGITWMFHTGLPLIEGV----- 214 Query: 241 FPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKW 300 IY ++++ + + I P K Sbjct: 215 ------------------------------RSCFIYHRNSLGLAEGQDIKAFVDWVPEKA 244 Query: 301 HAPQITLTSSFGATRIEPDKILGIEISKDSL 331 A + S GA I+PD ++ I D++ Sbjct: 245 -AHLVDHMLSAGACLIDPDGVIEIRCDDDAV 274 >gi|209966378|ref|YP_002299293.1| hypothetical protein RC1_3116 [Rhodospirillum centenum SW] gi|209959844|gb|ACJ00481.1| conserved hypothetical protein [Rhodospirillum centenum SW] Length = 272 Score = 212 bits (539), Expect = 6e-53, Method: Composition-based stats. 
Identities = 45/326 (13%), Positives = 90/326 (27%), Gaps = 55/326 (16%) Query: 3 TKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDM 62 + A I +F++ V + Q SKLR TV + AS + +V A Sbjct: 1 MSTTIDQAFIKQFEREVHESYQRMGSKLRATVRHKTDVQGASTVFQVVGRGAASTKA-RH 59 Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 + + + + +D L I+ A+ A+ RK DE I+ Sbjct: 60 GKVPVMNLEHSHVECALADYYAGDWVDRLDELKVNIDERAVVANAGAYALGRKTDELIIA 119 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + G +++ A + + D ++ Sbjct: 120 ELDRSAN-------------------LAGAATDGLTRDKVLAAFEMLGTADVPDDGQRTA 160 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 V+ L + FA ++ +++P A Sbjct: 161 VVGWKQWSQLL--------------------ALPEFADADYVGADELPW-RGTQAKRWLG 199 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHA 302 L T + Y ++AV + + + A Sbjct: 200 TLWMPHSG-------------LTLTGGVRLCHWYHRTAVGHAAGADVATDVTWHGDRA-A 245 Query: 303 PQITLTSSFGATRIEPDKILGIEISK 328 + S GA I+P ++ + + Sbjct: 246 FFVNHMMSQGACLIDPKGVVTLRCKE 271 >gi|296532337|ref|ZP_06895074.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] gi|296267333|gb|EFH13221.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] Length = 277 Score = 210 bits (534), Expect = 2e-52, Method: Composition-based stats. 
Identities = 48/326 (14%), Positives = 86/326 (26%), Gaps = 49/326 (15%) Query: 3 TKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDM 62 + + +F+ V A Q SKLRPTV + AS + A Sbjct: 1 MSASIDQVFVKQFESEVHEAYQRQGSKLRPTVRSKTGVRGASTNFPIVGHGTAAAKA-RN 59 Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 + E ID L I+ AS A+ RK DE I+ Sbjct: 60 GAVPVMNLAHSNVECFLQDYYAGEWIDRLDELKVNIDERQVVASAGAYALGRKTDELIIA 119 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + G D +++ A + + D + Sbjct: 120 ALD-TATEEATGTAA------------GTTDSDGLTKAKVLLAFEMLGAADVPDDGNRFA 166 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 ++ L +IE FA +I + +P A Sbjct: 167 IVGWKQWSNLL--------------------QIEEFANTQYIGDDDLPWKG-TQAKRWLG 205 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHA 302 + Y K+A+ + + + + A Sbjct: 206 ATWMPHSG-------------LTRSGATRFCYFYHKTAIGHAVAQDVTTDVTWHGDRA-A 251 Query: 303 PQITLTSSFGATRIEPDKILGIEISK 328 + S GA I+P ++ + ++ Sbjct: 252 YFVNNMMSQGAVLIDPAGVVRMRCAE 277 >gi|288959385|ref|YP_003449726.1| hypothetical protein AZL_025440 [Azospirillum sp. B510] gi|288911693|dbj|BAI73182.1| hypothetical protein AZL_025440 [Azospirillum sp. B510] Length = 297 Score = 210 bits (533), Expect = 4e-52, Method: Composition-based stats. 
Identities = 77/332 (23%), Positives = 123/332 (37%), Gaps = 39/332 (11%) Query: 3 TKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDM 62 Q+ T ++K++EL LQ+ SKL V + E + PTEA ++ Sbjct: 1 MSSQIPTHYQNTYQKNLELGLQQKTSKLEGCVRTENQSAER-DFYDKIGPTEAEDVTERH 59 Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 DT Y T DRR W++ ID F + +P Y A AA++R++D IL Sbjct: 60 ADTKYANTKHDRRACTIIPATWSDLIDKFDKVQLVTDPTSAYTQNAIAALNRRKDRHILT 119 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAV----EGDDFFKTFIGQLITAKSIFRKRYIDVDS 178 +G GK G F I++ IG+L A+ I D D Sbjct: 120 AAIGTAFTGKEGTTPVAFPSSQIVAVNYVEGGSAANSGMTIGKLRKAREILGLADNDED- 178 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 E Y+ + L TS DY + AL AGKI+ F G F + Sbjct: 179 EDTYLALTETQITDLLKTTEVTSADYNSVQALVAGKIDTFLGFKFKKVS----------- 227 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPG 298 + KS +V + + + ++ Sbjct: 228 ----------------------PKLVVKASTTRKCVAWKKSGIVLAKGLEVQSKVTELAT 265 Query: 299 KWHAPQITLTSSFGATRIEPDKILGIEISKDS 330 K ++ Q+ FGATR++ +K++ I+ + + Sbjct: 266 KNYSTQVWACGMFGATRLDEEKVVEIDCLESA 297 >gi|323699588|ref|ZP_08111500.1| hypothetical protein DND132_2180 [Desulfovibrio sp. ND132] gi|323459520|gb|EGB15385.1| hypothetical protein DND132_2180 [Desulfovibrio desulfuricans ND132] Length = 277 Score = 205 bits (520), Expect = 9e-51, Method: Composition-based stats. 
Identities = 52/330 (15%), Positives = 101/330 (30%), Gaps = 58/330 (17%) Query: 3 TKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDM 62 ++ A + ++ + V A Q SK+R TV Q + + + A + Sbjct: 1 MSTTVSNAFVTQYVEMVHQAYQAQGSKMRQTVRLQTEVEGSKCVFQKIGKGAAGKKT-RH 59 Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 + + S + AE ID L + A+ A+ RK DE ++ Sbjct: 60 GNVPLMNLNHSNVSCTLSDWYAAEYIDKLDELKDKSDEKQVAANAGAWALGRKIDELLIT 119 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + G V G+ +++ A + D + Sbjct: 120 ELDGATN-------------------VVGEAATGLTKDKILQAFGTLNANDVPDDGHRFA 160 Query: 183 VLIPSDVWASLFALERATSKDYINTAA--LQAGKIEAFAGVWFINMEKVPGNDLFPAGTK 240 V+ P W L ++ S DY L+ + + G+ ++ +P ++ Sbjct: 161 VVGPHQ-WNELLNIQEFKSSDYAGEQYAWLKGTESRTWLGITWMFHTGLPLDEAGM---- 215 Query: 241 FPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKW 300 IY ++A + + + P K Sbjct: 216 ------------------------------RKCYIYHRNAAGLAEGQKVQAFVDWVPEKA 245 Query: 301 HAPQITLTSSFGATRIEPDKILGIEISKDS 330 A + S GA I+PD ++ I+ D+ Sbjct: 246 -AHLVDHMLSAGACLIDPDGVVQIQCDDDA 274 >gi|144898780|emb|CAM75644.1| conserved hypothetical protein [Magnetospirillum gryphiswaldense MSR-1] Length = 272 Score = 201 bits (511), Expect = 1e-49, Method: Composition-based stats. 
Identities = 38/326 (11%), Positives = 80/326 (24%), Gaps = 55/326 (16%) Query: 3 TKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDM 62 + ++ V A Q +KLR TV + A A+ + A Sbjct: 1 MSTSVINGYSKDYGAQVHAAYQRQGTKLRNTVRTRNNVTGAIAVFQKVGKGSASTKA-RH 59 Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 D + + +D L + + A+ RK DE I+ Sbjct: 60 GKVPVMNVDHQTVECQLYDYYAGDWLDKLDELKIEHDERAVLVNAGAYALGRKTDELIIA 119 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + G +++ A + + + D E+ Sbjct: 120 ELDKSTNYALDGTT-------------------ALTKDKVLAAFEMLGEADVPDDGERYA 160 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 V+ W+ L + + ++ + +P A Sbjct: 161 VVG-WKQWSDLLQIAEFSDA-------------------DYVGDDDLPWKG-TQAKNWLG 199 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHA 302 L Y K+A+ + + + + A Sbjct: 200 TLWMPHSG-------------LTKAGSIRHCYWYHKTAIGHAVGSEVKSEITYHGDRA-A 245 Query: 303 PQITLTSSFGATRIEPDKILGIEISK 328 S G+ I+P ++ + + Sbjct: 246 WFCNNMMSQGSALIDPAGVVSLRCLE 271 >gi|158425209|ref|YP_001526501.1| minor capsid protein 10 [Azorhizobium caulinodans ORS 571] gi|158332098|dbj|BAF89583.1| minor capsid protein 10 [Azorhizobium caulinodans ORS 571] Length = 331 Score = 193 bits (489), Expect = 4e-47, Method: Composition-based stats. 
Identities = 31/317 (9%), Positives = 82/317 (25%), Gaps = 9/317 (2%) Query: 8 ATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIY 67 + +F V A E + + SA +A+ V Sbjct: 19 DALFLKQFSGEVMTAFSEVN-VMMERHLVRTITNGKSAQFPATWKADAYYHVPGTELQGQ 77 Query: 68 NATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGV 127 + +R + + + Y + A+ + D+ +L+ + Sbjct: 78 SIKHGERVITIDDLLVSPVFVAQIDEAKNHYDVRSIYTNECGYALANQADKNVLQTAVLA 137 Query: 128 NKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPS 187 + S + + + L A ++ + + V P+ Sbjct: 138 ARASATITGGIGGSTLAVGPDIVTNANGALVNA-LYLAAQTLDEKDVPEQG-RFAVFKPA 195 Query: 188 DVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDG 247 + + + ++ GK+ AGV + +P + + Sbjct: 196 QYYKLVLDDKAINRDFTAGNGDIRTGKVFDIAGVQIVKSNHLPTSAIAAPA-----GSAN 250 Query: 248 KVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITL 307 + + K+ + +AV + + V+ + I Sbjct: 251 VPTGVTPQIGPRPLGKYAGDFSNTAGLVMHANAVGTVKLMDLSVEGEYLITR-QGTLIVA 309 Query: 308 TSSFGATRIEPDKILGI 324 + G + P+ + + Sbjct: 310 KYAMGHGILRPECAVEL 326 >gi|298485987|ref|ZP_07004061.1| hypothetical protein PSA3335_1416 [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159464|gb|EFI00511.1| hypothetical protein PSA3335_1416 [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 290 Score = 188 bits (478), Expect = 8e-46, Method: Composition-based stats. 
Identities = 65/329 (19%), Positives = 116/329 (35%), Gaps = 44/329 (13%) Query: 3 TKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDM 62 +Q+ A + +F + Q+++S+L TVT + S V A Sbjct: 4 MSQQITEAFVQQFADNFMHVAQQSQSRLESTVTIEPNIVGMSKSVNRLGQRTATRRTQRH 63 Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 DT N R+V + + +D + ++P Y +++R +D+ I+ Sbjct: 64 GDTPINDQPHSTRYVDLYDWEDGDMVDDQDKIRMLVDPTSDYVKAMVNSLNRAKDDVIIG 123 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVD-SEQV 181 + G ++ S ++I AK IFR D + E++ Sbjct: 124 ALGGFSR-------ATSGQIILPTSQKIAVGGTGLTKAKIIQAKKIFRLNEADEEAGEEL 176 Query: 182 YVLIPSDVWASLFALERATSKDYINTAALQAGKI-EAFAGVWFINMEKVPGNDLFPAGTK 240 Y++ + A + A TS DY+ LQ G + + G +I E++ Sbjct: 177 YMVYSAQAAADILADPTLTSADYLAGQFLQQGSVRGKWMGFNWIPSERMG---------- 226 Query: 241 FPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKW 300 + L Y KS VV + I + +DPGK Sbjct: 227 -------------------------KSGTTRYLNAYAKSGVVLGKGAEITTKVGEDPGKG 261 Query: 301 HAPQITLTSSFGATRIEPDKILGIEISKD 329 +I S GA R+E +K++ I + Sbjct: 262 FNVRIYAKMSIGAVRVEEEKVVEIACLES 290 >gi|317120718|gb|ADV02540.1| putative major capsid protein [Liberibacter phage SC2] gi|317120779|gb|ADV02600.1| putative major capsid protein [Candidatus Liberibacter asiaticus] Length = 306 Score = 184 bits (467), Expect = 1e-44, Method: Composition-based stats. 
Identities = 289/293 (98%), Positives = 290/293 (98%) Query: 1 MATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG 60 MATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG Sbjct: 1 MATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG 60 Query: 61 DMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAI 120 DMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAI Sbjct: 61 DMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAI 120 Query: 121 LKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQ 180 LKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQ Sbjct: 121 LKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQ 180 Query: 181 VYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTK 240 VYVL+PSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTK Sbjct: 181 VYVLVPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTK 240 Query: 241 FPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQH 293 FPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAID Sbjct: 241 FPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDGNI 293 >gi|85059665|ref|YP_455367.1| hypothetical protein SG1687 [Sodalis glossinidius str. 'morsitans'] gi|84780185|dbj|BAE74962.1| hypothetical protein [Sodalis glossinidius str. 'morsitans'] Length = 306 Score = 181 bits (458), Expect = 2e-43, Method: Composition-based stats. 
Identities = 67/327 (20%), Positives = 121/327 (37%), Gaps = 32/327 (9%) Query: 2 ATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD 61 A K + A + +F E+A Q+ S+L+ VT++ AS + E +I Sbjct: 8 ANKNMITAAFVQQFHDSFEIASQQKDSRLQAAVTDRGHITGASFTINDMGTIEMTQITTR 67 Query: 62 MPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAIL 121 DT++N + R + +G ++ +P PY L AA +RK+D+ I Sbjct: 68 FGDTVWNVPEAGTRNALMADYGVFVPVEKRDLRKLIADPQGPYLQLTLAAANRKKDDIIY 127 Query: 122 KGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDV-DSEQ 180 + +L K + + +LI AK++FR+ D + E+ Sbjct: 128 RALLDTVL-RKTSDTGAYAPVALPTTQKIVAGKTGMTKAKLIAAKAMFRRNECDEQNGEE 186 Query: 181 VYVLIPSDVWASLFALERATSKDYINTAALQAGKI-EAFAGVWFINMEKVPGNDLFPAGT 239 +Y+ +D+ + + TS D++ LQ G + + G ++ EK+ Sbjct: 187 LYITYNADMLTQILSDTTLTSADFMAVKMLQEGAVFGNWLGFKWLAYEKLDEAKAGEPAV 246 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 +CKSAV +V K Sbjct: 247 -----------------------------TTKTAAAWCKSAVHLGTGAQYNVDICLRRDK 277 Query: 300 WHAPQITLTSSFGATRIEPDKILGIEI 326 + QI++ +S+GA R K++ IE Sbjct: 278 NNTIQISVDASYGAGRANEKKVVAIEF 304 >gi|85059166|ref|YP_454868.1| hypothetical protein SG1188 [Sodalis glossinidius str. 'morsitans'] gi|84779686|dbj|BAE74463.1| hypothetical protein [Sodalis glossinidius str. 'morsitans'] Length = 306 Score = 181 bits (458), Expect = 2e-43, Method: Composition-based stats. 
Identities = 66/327 (20%), Positives = 121/327 (37%), Gaps = 32/327 (9%) Query: 2 ATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD 61 A K + A + +F E+A Q+ S+L+ VT++ AS + E +I Sbjct: 8 ANKNMITAAFVQQFHDSFEIASQQKDSRLQAAVTDRGHITGASFTINDMGTIEMTQITTR 67 Query: 62 MPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAIL 121 DT+++ + R + +G ++ +P PY L AA +RK+D+ I Sbjct: 68 FGDTVWDVPEAGTRNALMADYGVFVPVEKRDLRKLIADPQGPYLQLTLAAANRKKDDIIY 127 Query: 122 KGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDV-DSEQ 180 + +L K + + +LI AK++FR+ D + E+ Sbjct: 128 RALLDTVL-RKTSDTGAYAPVALPTTQKIVAGKTGMTKAKLIAAKAMFRRNECDEQNGEE 186 Query: 181 VYVLIPSDVWASLFALERATSKDYINTAALQAGKI-EAFAGVWFINMEKVPGNDLFPAGT 239 +Y+ +D+ + + TS D++ LQ G + + G ++ EK+ Sbjct: 187 LYITYNADMLTQILSDTTLTSADFMAVKMLQEGAVSGNWLGFKWLAYEKLDEAKAGEPTV 246 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 +CKSAV +V K Sbjct: 247 -----------------------------TTKTAAAWCKSAVHLGTGAQYNVDIGPRRDK 277 Query: 300 WHAPQITLTSSFGATRIEPDKILGIEI 326 + QI++ +S+GA R K++ IE Sbjct: 278 NNTIQISVDASYGAGRANEKKVVAIEF 304 >gi|262043405|ref|ZP_06016530.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039231|gb|EEW40377.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 330 Score = 180 bits (457), Expect = 2e-43, Method: Composition-based stats. 
Identities = 66/327 (20%), Positives = 123/327 (37%), Gaps = 32/327 (9%) Query: 2 ATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD 61 K + A I +F E+A Q+ S+L+ V ++ AS + E +I Sbjct: 32 TAKNMITAAFIQQFHDSFEIAAQQKDSRLQAAVFDRGNITGASFTINDMGTIEMTQITER 91 Query: 62 MPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAIL 121 DT+++ D R + +G ++ +P PY L AA +RK+D+ I Sbjct: 92 FGDTVWDLPDAGTRNALMADYGVFVPVEKRDLRKLLADPQGPYLQLTLAASNRKKDDVIY 151 Query: 122 KGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDV-DSEQ 180 + +L K + S +LI AK++FR+ D + E+ Sbjct: 152 RALLDTVL-RKTSNTGAYAPVALPASQKIVAGGTGMTKAKLIAAKAMFRRNECDEQNGEE 210 Query: 181 VYVLIPSDVWASLFALERATSKDYINTAALQAGKI-EAFAGVWFINMEKVPGNDLFPAGT 239 +Y+ +D+ + + TS D++ LQ G + + G ++ EK+ + Sbjct: 211 LYITYNADMLTQILSDTTLTSADFMAVKMLQEGAVSGNWLGFKWLAYEKLDSAEAGDPAV 270 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 +CK+AV F + +V K Sbjct: 271 -----------------------------TTKTAVAWCKTAVHFGTGEEYNVDIGPRRDK 301 Query: 300 WHAPQITLTSSFGATRIEPDKILGIEI 326 + QI++ +S+GA R +K++ I+ Sbjct: 302 NNTIQISVDASYGAGRAAENKVVAIDF 328 >gi|254251746|ref|ZP_04945064.1| hypothetical protein BDAG_00943 [Burkholderia dolosa AUO158] gi|124894355|gb|EAY68235.1| hypothetical protein BDAG_00943 [Burkholderia dolosa AUO158] Length = 295 Score = 180 bits (456), Expect = 3e-43, Method: Composition-based stats. 
Identities = 73/328 (22%), Positives = 125/328 (38%), Gaps = 39/328 (11%) Query: 1 MATKE-QLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIV 59 M+T + A + +F +A Q+ +S+L+ TV +S TEA+++ Sbjct: 1 MSTNNETITQAFVQQFADGYIMAAQQKESRLQSTVMAYGDVTGSSFTANNMGATEANDVT 60 Query: 60 GDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEA 119 + DT++N D R W+ ID + NP Y AA++RK+D Sbjct: 61 SRLSDTVWNDNPNDTRVALMQDKDWSTPIDKYDLPKLKANPQGTYMQNGLAALNRKKDAV 120 Query: 120 ILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDV-DS 178 I + ++G + + G + S S D +LITAK +FRK D + Sbjct: 121 IYQALIGNS-ITRAGEALPYGSIALPSSQKILDGGVGMTKAKLITAKKLFRKNEADEQNG 179 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKI-EAFAGVWFINMEKVPGNDLFPA 237 E +Y+L +++ + + TS D++ LQ GK+ + G +I E + Sbjct: 180 EDLYMLYDAEMLEDILSDTTLTSADFMAVQMLQDGKLSGRWLGFNWIPYEAL-------- 231 Query: 238 GTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDP 297 Y KS+ F D+ Sbjct: 232 ---------------------------NTAGTVKTTVAYTKSSTQFGVGLNRDIDIGPRR 264 Query: 298 GKWHAPQITLTSSFGATRIEPDKILGIE 325 K +A QI + S+GA R + K++ I+ Sbjct: 265 DKRNAIQIYIGESYGAVRTDEKKVVTID 292 >gi|325272826|ref|ZP_08139163.1| minor capsid protein 10 [Pseudomonas sp. TJI-51] gi|324102031|gb|EGB99540.1| minor capsid protein 10 [Pseudomonas sp. TJI-51] Length = 322 Score = 177 bits (449), Expect = 2e-42, Method: Composition-based stats. 
Identities = 35/317 (11%), Positives = 86/317 (27%), Gaps = 16/317 (5%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYN 68 + F V A QE + SA + A + Sbjct: 20 ALFLKVFSGEVLTAFQE-SCVTADKHLVRTITSGKSAQFPILGKISAQYHTPGAEIAGLS 78 Query: 69 ATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVN 128 ++ I + + PY++ A+ D+ IL+ + Sbjct: 79 VPANEQVITIDDLLISHAFIASIDEAMNHYDVRGPYSTEMGRALSYTYDKHILQLGVLAA 138 Query: 129 KKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSD 188 + + + + D + + L A +++I +++ Y + Sbjct: 139 RASAPVSTEAGGGSVTDSALLT-DTTGEALVAALFAAAQKLDEKFIP--ADERYAYLTPA 195 Query: 189 VWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGK 248 + L + + + + G++ AG+ + P Sbjct: 196 AYYMLAQNTKLMNSLWGGQGSYAKGELPQVAGINLVKAVHAPFGSN-----------IAT 244 Query: 249 VEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLT 308 V T +S K+ + K+AV + + ++ D + + Sbjct: 245 VANGGTALTAGTSDKYAVDATSTAALVMHKAAVGTVKLMDLAMESDYDI-RRQGTLMVAK 303 Query: 309 SSFGATRIEPDKILGIE 325 + G + P + ++ Sbjct: 304 YAMGHGILRPAAAVELK 320 >gi|83594643|ref|YP_428395.1| minor capsid protein 10 [Rhodospirillum rubrum ATCC 11170] gi|83577557|gb|ABC24108.1| minor capsid protein 10 [Rhodospirillum rubrum ATCC 11170] Length = 309 Score = 177 bits (449), Expect = 2e-42, Method: Composition-based stats. 
Identities = 39/317 (12%), Positives = 84/317 (26%), Gaps = 29/317 (9%) Query: 8 ATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIY 67 + F V E P + SA V A+ V Sbjct: 19 DALFLKVFGGEVLTTFAENN-VFLPLTMSRTITSGKSAQFPVLGKNTAYYHVPGAELNGN 77 Query: 68 NATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGV 127 N + +R I + + Y S A++ + D I + ++ Sbjct: 78 NILNAERVITVDGLLVSPVFIAKIDEAKTHYDVRSQYTSECGASLSNQADRTISQVLINA 137 Query: 128 NKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPS 187 + A L + F ++ + Y + Sbjct: 138 ARST---ATITGGFGGTKLVDAAFGTDGDKLAAGIFGIAQTFDEKDVPET--DRYAAVRP 192 Query: 188 DVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDG 247 + + A + ++D+ + + GK+ AGV + +P + + + Sbjct: 193 AQYYLMVAGTKVLNRDWGGSGSYMDGKVLKVAGVSIVKSNHIPKSVITGSAQ-------- 244 Query: 248 KVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITL 307 A ++ K V + KSAV + + + + I Sbjct: 245 --------------AAYDGDFTKTVAVGFHKSAVGTVKLLDLQTEGEYQIQR-QGTLIVA 289 Query: 308 TSSFGATRIEPDKILGI 324 + G + P+ + + Sbjct: 290 KYAMGHGVLRPEAAVEL 306 >gi|295096864|emb|CBK85954.1| hypothetical protein ENC_24270 [Enterobacter cloacae subsp. cloacae NCTC 9394] Length = 303 Score = 177 bits (449), Expect = 2e-42, Method: Composition-based stats. 
Identities = 63/327 (19%), Positives = 120/327 (36%), Gaps = 32/327 (9%) Query: 2 ATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD 61 K + A I +F E+A Q+ S+L+ V ++ + + E +I Sbjct: 5 TNKNMITAAFITQFHDSFEIAAQQKDSRLQAAVNDRGMITGEAFTINDMGTIEMTQITTR 64 Query: 62 MPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAIL 121 DT+++ + R + +G ++ +P PY L AA +RK+D+ + Sbjct: 65 FGDTVWDLPEAGTRNALMADYGVFVPVEKRDLRKLLADPQGPYLQLTLAAANRKKDDVVY 124 Query: 122 KGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDV-DSEQ 180 + +L K + + S +LI AK++FR+ D + E+ Sbjct: 125 RALLDTVL-RKTSSGGAYAPVALPASQKIVAGGTGMTKAKLIAAKAMFRRNECDEQNGEE 183 Query: 181 VYVLIPSDVWASLFALERATSKDYINTAALQAGKIE-AFAGVWFINMEKVPGNDLFPAGT 239 +Y+ +D+ + + TS D++ LQ G + + G ++ EK+ Sbjct: 184 LYMTYNADMLTQILSDTTLTSADFMAVKMLQEGAVSSKWLGFNWLAYEKLDSVTDGDPAV 243 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 +CKSAV F +V K Sbjct: 244 -----------------------------TTKTAAAWCKSAVHFGTGAEYNVDIGPRRDK 274 Query: 300 WHAPQITLTSSFGATRIEPDKILGIEI 326 + QI++ +S+GA R K++ I+ Sbjct: 275 NNTIQISVDASYGAGRANEKKVVAIDF 301 >gi|26989006|ref|NP_744431.1| minor capsid protein 10 [Pseudomonas putida KT2440] gi|24983827|gb|AAN67895.1|AE016421_7 minor capsid protein 10 [Pseudomonas putida KT2440] Length = 322 Score = 176 bits (446), Expect = 4e-42, Method: Composition-based stats. 
Identities = 35/317 (11%), Positives = 86/317 (27%), Gaps = 16/317 (5%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYN 68 + F V A QE + SA + A + Sbjct: 20 ALFLKVFSGEVLTAFQE-SCVTADKHLVRTITSGKSAQFPILGKISAQYHTPGAEIAGLS 78 Query: 69 ATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVN 128 ++ I + + PY++ A+ D+ IL+ + Sbjct: 79 VPANEQIITIDDLLISHAFIASIDEAMNHYDVRGPYSTEMGRALSYTYDKHILQLGVLAA 138 Query: 129 KKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSD 188 + + + + D + + L A +++I +++ Y + Sbjct: 139 RASAPVSTEAGGGSVTDSALLT-DTTGEALVAALFAAAQKLDEKFIP--ADERYAYLTPA 195 Query: 189 VWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGK 248 + L + + + + G++ AG+ + P Sbjct: 196 AYYMLAQNTKLMNSLWGGQGSYAKGELPQVAGISLVKAVHAPFGSN-----------IAT 244 Query: 249 VEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLT 308 V T +S K+ + K+AV + + ++ D + + Sbjct: 245 VANGGTALTAGTSDKYAVDATSTAALVMHKAAVGTVKLMDLAMESDYDI-RRQGTLMVAK 303 Query: 309 SSFGATRIEPDKILGIE 325 + G + P + ++ Sbjct: 304 YAMGHGILRPAAAVELK 320 >gi|282857733|ref|ZP_06266942.1| minor capsid protein 10 [Pyramidobacter piscolens W5455] gi|282584403|gb|EFB89762.1| minor capsid protein 10 [Pyramidobacter piscolens W5455] Length = 331 Score = 173 bits (438), Expect = 3e-41, Method: Composition-based stats. 
Identities = 37/317 (11%), Positives = 96/317 (30%), Gaps = 27/317 (8%) Query: 11 NIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNAT 70 I +K V A + + +S+ AH N Sbjct: 23 FITNYKLDVMKAFARK-CIFKDLHRIHTIDHGSSSTFYYTGTASAHYHDKGKMILGTNNP 81 Query: 71 DQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNK 129 + + A+ ID ++ ++ A+ DE I + + Sbjct: 82 PISKTIINIDGLLLADIMIDDLEDAMMHLDVRSEFSHQQGVALANAFDERIARLFYLSAR 141 Query: 130 KGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDV 189 G + + +++SA + + + A ++ + D + ++++ Sbjct: 142 S---GPKNKDHPGGSVISAKDAETNGSVLADCIFAAAQTLDEKDVPDD--ERFIVVKPAQ 196 Query: 190 WASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKV 249 + L ++ ++DY + +++ +++ A + +P Sbjct: 197 YYLLCKVKDLINRDYGGSGSIKDVALQSIANMSLKKSMNLPNGKNITTA----------- 245 Query: 250 EYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQH--SKDPGKWHAPQITL 307 + K V + ++AV + K + + S+ + A IT Sbjct: 246 -------DPHEHNDYRGDFTKSVAVVGNRNAVGTVKLKDLTTRMSGSEVKTLFEATLITA 298 Query: 308 TSSFGATRIEPDKILGI 324 + + G ++P + I Sbjct: 299 SYAMGHGILDPRGAVEI 315 >gi|332160972|ref|YP_004297549.1| hypothetical protein YE105_C1350 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665202|gb|ADZ41846.1| hypothetical protein YE105_C1350 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330862128|emb|CBX72292.1| hypothetical protein YEW_AK02290 [Yersinia enterocolitica W22703] Length = 302 Score = 170 bits (431), Expect = 2e-40, Method: Composition-based stats. 
Identities = 63/325 (19%), Positives = 120/325 (36%), Gaps = 34/325 (10%) Query: 2 ATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD 61 A K + A + +F E+A Q+ S+L+ V ++ AS + E + I Sbjct: 5 ANKNMITAAFVQQFHDSFEIASQQKDSRLQAAVHDRGMITGASFTINDMGTIEMNAITTR 64 Query: 62 MPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAIL 121 DT+++ + R + +G ++ P PY L +A +RK+D+ I Sbjct: 65 FGDTVWDVPEAGTRNALMADYGVFVPVEKRDLRKLIAEPQGPYLQLTLSATNRKKDDVIY 124 Query: 122 KGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDV-DSEQ 180 + +L + + + + +LI AK++FR+ D + E+ Sbjct: 125 RALLDPVPRKVENNGA-YTNVVLPAAQKILAGGSGMTKAKLIAAKAMFRRNECDEQNGEE 183 Query: 181 VYVLIPSDVWASLFALERATSKDYINTAALQAGKI-EAFAGVWFINMEKVPGNDLFPAGT 239 +Y+ +D+ + + TS D++ LQ G + + G +I EK+ T Sbjct: 184 LYIAYNADMLTQILSDTTLTSADFMAVKMLQEGALAGNWLGFRWIAYEKLDSVTDTGVTT 243 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 K + K+AV F + K Sbjct: 244 KTTV-------------------------------AWAKTAVHFGTGAEYNTDIGPRRDK 272 Query: 300 WHAPQITLTSSFGATRIEPDKILGI 324 + QI++ +S+GA R K++ I Sbjct: 273 NNTIQISVDASYGAGRANEQKVVSI 297 >gi|61806429|ref|YP_214206.1| T7-like capsid protein [Prochlorococcus phage P-SSP7] gi|298508277|pdb|2XD8|A Chain A, Capsid Structure Of The Infectious Prochlorococcus Cyanophage P-Ssp7 gi|298508278|pdb|2XD8|B Chain B, Capsid Structure Of The Infectious Prochlorococcus Cyanophage P-Ssp7 gi|298508279|pdb|2XD8|C Chain C, Capsid Structure Of The Infectious Prochlorococcus Cyanophage P-Ssp7 gi|298508280|pdb|2XD8|D Chain D, Capsid Structure Of The Infectious Prochlorococcus Cyanophage P-Ssp7 gi|298508281|pdb|2XD8|E Chain E, Capsid Structure Of The Infectious Prochlorococcus Cyanophage P-Ssp7 gi|298508282|pdb|2XD8|F Chain F, Capsid Structure Of The Infectious Prochlorococcus Cyanophage P-Ssp7 gi|298508283|pdb|2XD8|G Chain G, Capsid Structure Of The Infectious Prochlorococcus Cyanophage P-Ssp7 gi|61374354|gb|AAX44208.1| T7-like capsid protein [Prochlorococcus phage P-SSP7] gi|265525466|gb|ACY76232.1| 
predicted protein [Prochlorococcus phage P-SSP7] Length = 375 Score = 165 bits (418), Expect = 7e-39, Method: Composition-based stats. Identities = 33/341 (9%), Positives = 77/341 (22%), Gaps = 28/341 (8%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYN 68 + F + Q ++ R VT++ + S + P Sbjct: 28 ALYLKLFSGEMFKGFQ-HETIARDLVTKRTLKNGKSLQFIYTGRMTSSFHTPGTPILGNA 86 Query: 69 A---TDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGML 125 ++ V + + + + A+ K D I + + Sbjct: 87 DKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSIT 146 Query: 126 GVNKKGKIGAETEFFSKENILSAVEGDDF------FKTFIGQLITAKSIFRKRYIDVDSE 179 + + T F V + A + ++ + Sbjct: 147 RGARSASPVSATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQG- 205 Query: 180 QVYVLIPSDVWASLFAL---ERATSKDYINTAALQAGKIEAFAGVWFINMEKVPG----- 231 ++ + +L ++D +A + AG+ +P Sbjct: 206 -RCAVLNPRQYYALIQDIGSNGLVNRDVQGSALQSGNGVIEIAGIHIYKSMNIPFLGKYG 264 Query: 232 ------NDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQ 285 G + + K I+ K A + Sbjct: 265 VKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKEAAGVVE 324 Query: 286 RKAIDVQHS--KDPGKWHAPQITLTSSFGATRIEPDKILGI 324 VQ + + I + GA + P + + Sbjct: 325 AIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVEL 365 >gi|187735988|ref|YP_001878100.1| hypothetical protein Amuc_1497 [Akkermansia muciniphila ATCC BAA-835] gi|187426040|gb|ACD05319.1| hypothetical protein Amuc_1497 [Akkermansia muciniphila ATCC BAA-835] Length = 349 Score = 165 bits (416), Expect = 1e-38, Method: Composition-based stats. 
Identities = 59/339 (17%), Positives = 107/339 (31%), Gaps = 45/339 (13%) Query: 3 TKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDM 62 ++ ++ + LQ+ S+L V+ ++ F + E + Sbjct: 44 MAVTISDNYQVKYTRKWGSLLQQHASRLDKYVSVMRDLSGKVVFLDQFGILDFTEKTTRV 103 Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGIN--PLLPYASLATAAMHRKQDEAI 120 T+ N RR + F A D F G P+ AA R+ D+ + Sbjct: 104 GQTVLNEAPTTRRSMRPRTFTKAIGYDEFDATRLGDMDLPVSKTIEGLQAAAGRRMDDVM 163 Query: 121 LKGMLGVNKKGKIGAETEFFSKENILS----AVEGDDFFKTFIGQLITAKSIFRKR---- 172 + G L N G+ G F + ++ + +L A +F + Sbjct: 164 ISGFLDTNYVGEDGMTAVPFKESQQIAVDHVDSGTKSASNLTVAKLRAALQLFEENEAWN 223 Query: 173 -YIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPG 231 +Q+ + + S +L +S D+ N AL GKI+ F G FI +++P Sbjct: 224 QDAPQFGDQLVIAVTSSQIMNLLRETEVSSYDFNNVKALVEGKIDTFMGFKFIRTQRLPK 283 Query: 232 NDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDV 291 + + KS F V Sbjct: 284 TEEGV----------------------------------RSCLAWVKSKAQFGIWNDFKV 309 Query: 292 QHSKDPGKWHAPQITLTSSFGATRIEPDKILGIEISKDS 330 + S A QI + GATR++ + + I + + Sbjct: 310 KLSVRDDMEEALQIRAKFACGATRLQEEGFVKILCDEGA 348 >gi|167041087|gb|ABZ05848.1| hypothetical protein ALOHA_HF400048F7ctg1g15 [uncultured marine microorganism HF4000_48F7] Length = 221 Score = 164 bits (415), Expect = 2e-38, Method: Composition-based stats. 
Identities = 54/254 (21%), Positives = 88/254 (34%), Gaps = 34/254 (13%) Query: 76 WVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGA 135 V + + A+ ID L ++P YA A+ R D+ I+ G K G+ G Sbjct: 1 MVTLADYEVADLIDDQDKLRMIVDPTSSYAQAQAFAIGRSMDDVIITAATGDAKTGETGG 60 Query: 136 ETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFA 195 T ++ IG+L AK I +D +V V+ P + L A Sbjct: 61 TTTALPSGQKVAVNLSGSNEGLTIGKLREAKFILDNNSVDPSIPRVMVVGPKQI-QDLLA 119 Query: 196 LERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGK 255 + TS D+ AL G ++ F G FI ++ N Sbjct: 120 TTQITSSDFNTIKALVQGDVDTFMGFQFITSTRLAHNSGT-------------------- 159 Query: 256 PTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATR 315 Y + K + V+ + P K +A Q+ S G+TR Sbjct: 160 -------------DVRTCFAYAVDGITLAVAKDLTVRIDERPDKGYAVQVYACMSIGSTR 206 Query: 316 IEPDKILGIEISKD 329 +E +K++ I + Sbjct: 207 MEEEKVVEISCDES 220 >gi|225158773|ref|ZP_03725090.1| conserved hypothetical protein [Opitutaceae bacterium TAV2] gi|224802608|gb|EEG20863.1| conserved hypothetical protein [Opitutaceae bacterium TAV2] Length = 305 Score = 161 bits (407), Expect = 1e-37, Method: Composition-based stats. 
Identities = 65/324 (20%), Positives = 116/324 (35%), Gaps = 48/324 (14%) Query: 8 ATANIYEFKKHVELALQETKSKLRPT-VTEQATEGEASALVEVFKPTEAHEIVGDMPDTI 66 A + +++ +VE K + + + + H+IVG + D Sbjct: 7 PAAFVEQYRSNVEHLAARQKHIFEGKGIRIETA-NGKVDYFDQIGGLKMHKIVGRLADIT 65 Query: 67 YNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 Y+ + RR V S +G A D + I+P P A A++ +DE I+ LG Sbjct: 66 YDQQEFWRRQVSCSPYGIAVPFDGADKVRGIIDPNAPTAQNQAFAINVSKDEVIVAAALG 125 Query: 127 VNKKGKIGAETEFFSKENILSAVEG---------DDFFKTFIGQLITAKSIFRKRYIDVD 177 K + + E G + +LI KS+ + + Sbjct: 126 TAYKKNDDEGSVPVAVELGDDRKVGVGYNGTGNPGANTGLTLAKLIRLKSLISRDDVQNA 185 Query: 178 SEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPA 237 ++ +V + + L ++ S DY ALQ G I F G+ ++ E++P + Sbjct: 186 KKKYFVHNQAMLDQLLLNVQEVKSTDYAAVKALQEGGITHFLGMEWVKYEELPAVNGI-- 243 Query: 238 GTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFT--QRKAIDVQHSK 295 YC++A++F + + + K Sbjct: 244 ---------------------------------RSCFAYCENAILFANQKNTGVRTEIEK 270 Query: 296 DPGKWHAPQITLTSSFGATRIEPD 319 PGKW+A +T + FGATR+ D Sbjct: 271 IPGKWNAWHVTTQADFGATRMRED 294 >gi|317487278|ref|ZP_07946073.1| hypothetical protein HMPREF0179_03436 [Bilophila wadsworthia 3_1_6] gi|316921468|gb|EFV42759.1| hypothetical protein HMPREF0179_03436 [Bilophila wadsworthia 3_1_6] Length = 321 Score = 159 bits (401), Expect = 6e-37, Method: Composition-based stats. 
Identities = 34/317 (10%), Positives = 90/317 (28%), Gaps = 27/317 (8%) Query: 11 NIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNAT 70 F V A E ++ + SA V A + N Sbjct: 23 FRDVFTGEVITAFDEHN-IMKDWHRMRTITHGKSASFAVMGRANARYHDPGVAILGSNKI 81 Query: 71 DQDRRWVGHSQFGWAER-IDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNK 129 + R + A+ I + + Y+ A+ ++ DE ++ + + Sbjct: 82 AANERTINVDNLLIADVAIYDLEDAMNHYDVRREYSKQLGVALAKRFDETTMRVAVLAAR 141 Query: 130 KGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDV 189 I + S + + + F ++ + ++ +++ Sbjct: 142 SSGIIDDEPGGSVIKGGA--TLATDGEKIAEAVFACSQTFDEKDVPE--QERCLILRPAQ 197 Query: 190 WASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKV 249 + L + ++D++ + GK++ AG+ + +P ++ A Sbjct: 198 FYLLNQTTKVLNRDWLGAGSYSDGKLDKIAGIKILMSNHLPKANITAA------------ 245 Query: 250 EYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKD--PGKWHAPQITL 307 + + +A+ + K + VQ S + + + Sbjct: 246 -------VDGEKNTYYGDFTNTLGLCMQSNAIATVKLKDLTVQQSGHDFNIVYQSTLMVA 298 Query: 308 TSSFGATRIEPDKILGI 324 + G + P + + Sbjct: 299 KYAMGHGVLNPSYAIEL 315 >gi|310005694|gb|ADP00081.1| major capsid protein [Cyanophage NATL1A-7] Length = 338 Score = 158 bits (399), Expect = 1e-36, Method: Composition-based stats. 
Identities = 25/330 (7%), Positives = 80/330 (24%), Gaps = 30/330 (9%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYN 68 + F + A E+ + R TV + + S +A P Sbjct: 30 ATYLKLFSGELFKAY-ESATIARDTVQRRTLKNGKSLQFIFTGRMQAAYHTPGEPILGSG 88 Query: 69 ATD-QDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGV 127 ++ + + + + + A+ D+ + + + Sbjct: 89 DPPVAEKTIQCDDLLISSAFVYDLDETLAHYSLRSEISKKIGHALAEAYDKKVFRTIALA 148 Query: 128 NKKGKIGAETEFFSKENILSAVEGD--DFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLI 185 ++ + + + + A ++ ++ + ++ Sbjct: 149 AREAHPITASPGPEPGGTTIELGVTKEYNAQALVDAFFEAAAVLDEKNLPKTG--RTAVL 206 Query: 186 PSDVWASL---FALERATSKDYINTAALQAGK-IEAFAGVWFINMEKVPGNDLFPAGTKF 241 + +L + + L +G+ + AG+ +P Sbjct: 207 NPRQYYALVSQVSSNILNRDYGNSQGNLNSGEGLVEIAGIQIKRSNNLPFLAGTVN---- 262 Query: 242 PGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHS--KDPGK 299 + + + IY + A + VQ + Sbjct: 263 --------------SVSGENNSYNGDFSTHCGLIYQRDAAGIVEAVGPQVQVTGGDVSVL 308 Query: 300 WHAPQITLTSSFGATRIEPDKILGIEISKD 329 + + + G + P + + ++ Sbjct: 309 YQGDVMVGRLAMGVGTLNPAGAIELTSARS 338 >gi|326633072|ref|YP_004306684.1| predicted major capsid protein [Salmonella phage Vi06] gi|301170545|emb|CBV65233.1| predicted major capsid protein [Salmonella phage Vi06] Length = 350 Score = 158 bits (399), Expect = 1e-36, Method: Composition-based stats. 
Identities = 45/324 (13%), Positives = 90/324 (27%), Gaps = 14/324 (4%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S ++ SA V T+A + D I Sbjct: 28 ALFLKVFGGEVLTAFTRT-SVTASRHMVRSISSGKSAQFPVLGRTQAAYLKPGVNLDDIR 86 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 N + + A+ I + + Y S ++ D A+L + Sbjct: 87 NDIKHTEKVITIDGLLTADVLIYDIDDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAA 146 Query: 127 ------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQ 180 + + G + K I L A++ Y+ Sbjct: 147 LCNAKPNSDENIDGLGHASVIPIKGGKQDDKATLGKNIITALTEARAALTNNYVPASDRV 206 Query: 181 VYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTK 240 Y + ++++ A + +Y + G I G + + + + T Sbjct: 207 FYC--SPENYSAILAALMPNAANYAALIDPERGSIRNVMGFEVVEVPHLTVGGAGESRTG 264 Query: 241 FPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKW 300 G TK V +SAV + + + ++ ++ + Sbjct: 265 MGGQK--HEFPSTTSTKEGDEGNMNVTKGNVVGLFMHRSAVGTVKLRDLALERARRAN-F 321 Query: 301 HAPQITLTSSFGATRIEPDKILGI 324 A QI + G + P+ + Sbjct: 322 QADQIIAKYAMGHGGLRPEAAGAV 345 >gi|291335397|gb|ADD95011.1| T7-like capsid protein [uncultured phage MedDCM-OCT-S04-C24] Length = 379 Score = 157 bits (397), Expect = 2e-36, Method: Composition-based stats. 
Identities = 38/345 (11%), Positives = 88/345 (25%), Gaps = 31/345 (8%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYN 68 + F + Q + R V + S T+A + Sbjct: 26 ALYLKLFSGEMFKGFQ-HNAIARDLVMRRTLTNGKSLQFIYTGHTKAEFHTPGNSILGDS 84 Query: 69 A---TDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGML 125 ++ + + S + + A+ +K D I + + Sbjct: 85 NGAPPVAEKTITVDDLLISSAFLYDLDETLSHYDMRSEISRKIGYALAQKYDRLIFRAIT 144 Query: 126 GVNKKGKIGAETEFFSKENILSAVEGDDFFKT-------FIGQLITAKSIFRKRYIDVDS 178 + ++ + V + + A + ++ + D Sbjct: 145 RGARAASPITKSGYVEPGGTQIRVGSSGTAASDAYDSAKLVTAFYDAAAALDEKGVSQDG 204 Query: 179 EQVYVLIPSDVWASL--FALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGN---- 232 +V +L P +A + ++D +A A I AG+ +P Sbjct: 205 -RVGILNPRQYYALIQEVGSNGLVNRDSQGSALQGAEGIVEIAGIKIYKSMNIPFFSQYG 263 Query: 233 -----------DLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAV 281 + G + + + N + + E I+ + A Sbjct: 264 TKYGTGSATNPGVTDPGNTGSFVSEAIEDAANDVTGINNEYGEETEFANSCGLIFQREAA 323 Query: 282 VFTQRKAIDVQHS--KDPGKWHAPQITLTSSFGATRIEPDKILGI 324 + A VQ + + I + GA + P + + Sbjct: 324 GCVEAIAPQVQVTSGDVSTIYQGDVILGRLAMGADYLNPAASVEL 368 >gi|326633073|ref|YP_004306683.1| predicted minor capsid protein [Salmonella phage Vi06] gi|301170546|emb|CBV65234.1| predicted minor capsid protein [Salmonella phage Vi06] Length = 396 Score = 156 bits (394), Expect = 5e-36, Method: Composition-based stats. 
Identities = 47/337 (13%), Positives = 94/337 (27%), Gaps = 14/337 (4%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S ++ SA V T+A + D I Sbjct: 28 ALFLKVFGGEVLTAFTRT-SVTASRHMVRSISSGKSAQFPVLGRTQAAYLKPGVNLDDIR 86 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 N + + A+ I + + Y S ++ D A+L + Sbjct: 87 NDIKHTEKVITIDGLLTADVLIYDIDDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAA 146 Query: 127 ------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQ 180 + + G + K I L A++ Y+ Sbjct: 147 LCNAKPNSDENIDGLGHASVIPIKGGKQDDKATLGKNIITALTEARAALTNNYVPASDRV 206 Query: 181 VYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTK 240 Y + ++++ A + +Y + G I G + + + + T Sbjct: 207 FYC--SPENYSAILAALMPNAANYAALIDPERGSIRNVMGFEVVEVPHLTVGGAGESRTG 264 Query: 241 FPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKW 300 G TK V +SAV + + + ++ ++ + Sbjct: 265 MGGQK--HEFPSTTSTKEGDEGNMNVTKGNVVGLFMHRSAVGTVKLRDLALERARRAN-F 321 Query: 301 HAPQITLTSSFGATRIEPDKILGIEISKDSLKGVPVL 337 A QI + G + P+ + + K + V Sbjct: 322 QADQIIAKYAMGHGGLRPEAAGAVVLKKGGVTQEVVS 358 >gi|310005783|gb|ADP00169.1| major capsid protein [Cyanophage NATL2A-133] Length = 383 Score = 155 bits (390), Expect = 1e-35, Method: Composition-based stats. 
Identities = 39/349 (11%), Positives = 89/349 (25%), Gaps = 35/349 (10%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYN 68 + F + Q + R V ++ + S T+A + Sbjct: 26 ALYLKLFSGEMFKGFQ-HNAIARDLVMKRTLKNGKSLQFIYTGHTKAEFHTPGNSILGNS 84 Query: 69 A---TDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGML 125 ++ + + + + A+ +K D I + +L Sbjct: 85 DGAPPVAEKTITVDDLLISSAFVYELDETLAHYELRGEISKKIGYALAQKYDRLIFRSIL 144 Query: 126 GVNKKGKIGAETEFFSKENILSAVEGDD------FFKTFIGQLITAKSIFRKRYIDVDSE 179 +K ++ F V + + + A + ++ + + Sbjct: 145 RGARKASPVSKAGFVEPGGTQIRVGSNAQASDAINPDSLVTAFYDAAAALDEKGVSSEG- 203 Query: 180 QVYVLIPSDVWASLFALER------ATSKDYINTAALQAGKIEAFAGVWFINMEKVPGN- 232 +V VL P +A + L+ ++D A I AG+ VP Sbjct: 204 RVAVLNPRQYYALIKGLDGSGIGAYLVNRDSQGDALQSGKGIYEIAGIKIYKSMNVPFFG 263 Query: 233 ---------------DLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYC 277 + G + + N + + + K I+ Sbjct: 264 EYGTKLGGSAGAEVPGITSPGNLGSFVQQSVEDARNSVTGINNEYGQQGDFTKSCGVIFQ 323 Query: 278 KSAVVF--TQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKILGI 324 + A + V + I + GA + P + + Sbjct: 324 REAAGVVEAIGPQVQVTSGDVSVVYQGDVILGRLAMGADYLNPAAAVEL 372 >gi|9627470|ref|NP_041997.1| minor capsid protein [Enterobacteria phage T7] gi|137564|sp|P19727|VC10B_BPT7 RecName: Full=Minor capsid protein 10B gi|431193|emb|CAA24428.1| unnamed protein product [Enterobacteria phage T7] Length = 398 Score = 154 bits (389), Expect = 2e-35, Method: Composition-based stats. 
Identities = 50/337 (14%), Positives = 98/337 (29%), Gaps = 21/337 (6%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S ++ SA V T+A + D Sbjct: 26 ALFLKVFGGEVLTAFART-SVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGENLDDKR 84 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y S ++ D A+L + G Sbjct: 85 KDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAG 144 Query: 127 --------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDS 178 +G T + +N + + K I L A++ K Y+ Sbjct: 145 LCNVESKYNENIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAAD 204 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 Y D ++++ A + +Y + G I G + + + A Sbjct: 205 RVFYC--DPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAR 262 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPG 298 G P K + K + +SAV + + + ++ ++ Sbjct: 263 EGTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 315 Query: 299 KWHAPQITLTSSFGATRIEPDKILGIEISKDSLKGVP 335 + A QI + G + P+ + + GV Sbjct: 316 -FQADQIIAKYAMGHGGLRPEAAGAVVFQSGVMLGVA 351 >gi|310005671|gb|ADP00059.1| major capsid protein [Cyanophage 9515-10a] Length = 383 Score = 153 bits (386), Expect = 3e-35, Method: Composition-based stats. 
Identities = 40/349 (11%), Positives = 88/349 (25%), Gaps = 35/349 (10%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYN 68 + F + Q + R V ++ + S T+A + Sbjct: 26 ALYLKLFSGEMFKGFQ-HNAIARDLVMKRTLKNGKSLQFIYTGHTKAEFHTPGNSILGNS 84 Query: 69 A---TDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGML 125 ++ + + + + A+ +K D I + +L Sbjct: 85 DGAPPVAEKTITVDDLLISSAFVYELDETLAHYELRGEISKKIGYALAQKYDRLIFRSIL 144 Query: 126 GVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIG------QLITAKSIFRKRYIDVDSE 179 +K ++ F V + I A + ++ + + Sbjct: 145 RGARKESPVSKAGFVEPGGTQIRVGSNAQASDAIDPDALVTAFYDAAAALDEKGVSSEG- 203 Query: 180 QVYVLIPSDVWASLFALER------ATSKDYINTAALQAGKIEAFAGVWFINMEKVPGN- 232 +V VL P +A + L+ ++D A I AG+ VP Sbjct: 204 RVAVLNPRQYYALIKGLDGSGIGAYLVNRDSQGDALQSGKGIYEIAGIKIYKSMNVPFFG 263 Query: 233 ---------------DLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYC 277 + G + + N + + + K I+ Sbjct: 264 EYGTKLGGSAGAEVPGITSPGNLGSFVQQSVEDARNSVTGINNEYGQQGDFTKSCGVIFQ 323 Query: 278 KSAVVF--TQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKILGI 324 + A + V + I + GA + P + + Sbjct: 324 REAAGVVEAIGPQVQVTSGDVSVVYQGDVILGRLAMGADYLNPAAAVEL 372 >gi|30387488|ref|NP_848296.1| minor capsid protein [Yersinia pestis phage phiA1122] gi|30314125|gb|AAP20533.1| minor capsid protein [Yersinia pestis phage phiA1122] Length = 397 Score = 153 bits (385), Expect = 5e-35, Method: Composition-based stats. 
Identities = 50/344 (14%), Positives = 103/344 (29%), Gaps = 23/344 (6%) Query: 1 MATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG 60 MA ++L + F V A T S ++ SA V T+A + Sbjct: 19 MAAGDKL-ALFLKVFGGEVLTAFART-SVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAP 76 Query: 61 D-MPDTIYNATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDE 118 D I + + A+ I + + Y S ++ D Sbjct: 77 GENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADG 136 Query: 119 AILKGMLG--------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFR 170 A+L + G +G T + ++ + + K I L A++ Sbjct: 137 AVLAEIAGLCNVESQYNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALT 196 Query: 171 KRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVP 230 K Y+ Y D ++++ A + +Y + G I G + + + Sbjct: 197 KNYVPSSDRVFYC--DPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLT 254 Query: 231 GNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAID 290 + S + K + +SAV + + + Sbjct: 255 AGGAGTSREG--------TTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLA 306 Query: 291 VQHSKDPGKWHAPQITLTSSFGATRIEPDKILGIEISKDSLKGV 334 ++ ++ + A QI + G + P+ + + + GV Sbjct: 307 LERARRAN-FQADQIIAKYAMGHGGLRPEAAGAVVFQDEVMLGV 349 >gi|17570826|ref|NP_523335.1| major capsid protein 10A [Enterobacteria phage T3] gi|137561|sp|P19693|VC10A_BPT3 RecName: Full=Major capsid protein 10A gi|15716|emb|CAA35154.1| 10A [Enterobacteria phage T3] gi|6015600|emb|CAB57820.1| major capsid protein 10A [Enterobacteria phage T3] gi|17384310|emb|CAC86298.1| major capsid protein 10A [Enterobacteria phage T3] Length = 347 Score = 152 bits (384), Expect = 6e-35, Method: Composition-based stats. 
Identities = 43/325 (13%), Positives = 91/325 (28%), Gaps = 18/325 (5%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S P ++ SA V T+A + D Sbjct: 25 ALFLKVFGGEVLTAFART-SVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGENLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y + ++ D A+L + G Sbjct: 84 KDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAG 143 Query: 127 VNK----KGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYID---VDSE 179 + + + ++ G +G+ I A+ + + V + Sbjct: 144 LVNLPDGSNENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAA 203 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 D ++++ A + +Y + G I G + + + Sbjct: 204 DRTFYTTPDNYSAILAALMPNAANYQALLDPERGTIRNVMGFEVVEVPHLTAGGAGDTRE 263 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 SS + V +SAV + K + ++ ++ Sbjct: 264 -------DAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN- 315 Query: 300 WHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ I Sbjct: 316 YQADQIIAKYAMGHGGLRPEAAGAI 340 >gi|291335887|gb|ADD95482.1| T7-like capsid protein [uncultured phage MedDCM-OCT-S08-C41] Length = 339 Score = 152 bits (384), Expect = 7e-35, Method: Composition-based stats. 
Identities = 35/325 (10%), Positives = 76/325 (23%), Gaps = 26/325 (8%) Query: 25 ETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNA---TDQDRRWVGHSQ 81 E + R V ++ + S T A ++ Sbjct: 6 ENNAIARDLVMKRTLKNGKSLQFIYTGRTTAEYHTPGNAILGNGDGAPPVAEKTITVDDL 65 Query: 82 FGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFS 141 + + + + A+ K D I + + + ++ F Sbjct: 66 LISSAFVYELDETLAHYELRGEISKKIGYALAEKYDRLIFRAVTRGARAASPITKSNFVE 125 Query: 142 KENILSAVEGDDF------FKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASL-- 193 V + A + ++ I D +V VL P ++ + Sbjct: 126 PGGTQVRVGASTNESDAYSATALVDSFYDAAAAMDEKGISQDG-RVGVLNPRQYYSLIQQ 184 Query: 194 FALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPN 253 ++D ++ I AG+ +P + + + Sbjct: 185 VGENGLINRDEQGSSRQSGQGIVEIAGIKIYKSMNIPFLGKYGTKYGGTSGVADPGNTGD 244 Query: 254 GKPTVKSSAKFEDTKIK------------YVLPIYCKSAVVFT--QRKAIDVQHSKDPGK 299 +A T I I+ K A + V Sbjct: 245 FIGVTAENASGATTGINNDYGTAAELGAKSCGIIFQKEAAAVVETIGPQVQVTSGDVSVV 304 Query: 300 WHAPQITLTSSFGATRIEPDKILGI 324 + I + GA + P + + Sbjct: 305 YQGDVILGRLAMGADYLNPAAAVEL 329 >gi|37956783|gb|AAP34051.1| gene 10A [Enterobacteria phage T7] Length = 345 Score = 152 bits (383), Expect = 8e-35, Method: Composition-based stats. 
Identities = 48/326 (14%), Positives = 94/326 (28%), Gaps = 21/326 (6%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S ++ SA V T+A + D Sbjct: 26 ALFLKIFGGEVLTAFART-SVTTSHHMVRSISSGKSAQFPVLGRTQAAYLAPGENLDDKR 84 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y S ++ D A+L + G Sbjct: 85 KDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAG 144 Query: 127 --------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDS 178 +G T +N + + K I L A++ K Y+ Sbjct: 145 LCNVESKYNENIEGLGTATVIEITQNKPALTDQVVLGKEIIAALTKARATLTKNYVPAAD 204 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 Y D ++++ A + +Y + G I G + + + A Sbjct: 205 RVFYC--DPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGIAR 262 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPG 298 G P K + K + +SAV + + + ++ ++ Sbjct: 263 EGTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 315 Query: 299 KWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 316 -FQADQIIAKYAMGHGGLRPEAAGAV 340 >gi|326536134|ref|YP_004300568.1| gp10a [Enterobacteria phage 285P] gi|256861523|gb|ACV32479.1| gp10a [Enterobacteria phage 285P] Length = 347 Score = 152 bits (383), Expect = 8e-35, Method: Composition-based stats. 
Identities = 44/326 (13%), Positives = 86/326 (26%), Gaps = 18/326 (5%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S ++ + SA V T+A + D Sbjct: 25 ALFLKVFGGEVLTAFTRT-SVTMNKHLVRSIQSGKSAQFPVLGRTKAAYLQPGENLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 R + A+ I + + Y + ++ D A+L M Sbjct: 84 KDMKHTERTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAK 143 Query: 127 --------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDS 178 +G + + + I QL A++ Y+ Sbjct: 144 LCNLPSASDENIAGLGKAHVLEVGKQSELRGDQVKLGQAIIAQLTLARAKLTGNYVPSA- 202 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 D ++++ A + +Y G I G I + + Sbjct: 203 -DRVFYTTPDNYSAILAALMPNAANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGEDR 261 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPG 298 + SS V +SAV + K + ++ ++ Sbjct: 262 PEEG----ANPTGQKHAFPETSSGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRAN 317 Query: 299 KWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 318 -FQADQIIAKYAMGHGGLRPEACGAL 342 >gi|9627469|ref|NP_041998.1| major capsid protein [Enterobacteria phage T7] gi|137562|sp|P19726|VC10A_BPT7 RecName: Full=Major capsid protein 10A gi|312207821|pdb|3IZG|G Chain G, Bacteriophage T7 Prohead Shell Em-Derived Atomic Model gi|312207822|pdb|3IZG|A Chain A, Bacteriophage T7 Prohead Shell Em-Derived Atomic Model gi|312207823|pdb|3IZG|B Chain B, Bacteriophage T7 Prohead Shell Em-Derived Atomic Model gi|312207824|pdb|3IZG|C Chain C, Bacteriophage T7 Prohead Shell Em-Derived Atomic Model gi|312207825|pdb|3IZG|D Chain D, Bacteriophage T7 Prohead Shell Em-Derived Atomic Model gi|312207826|pdb|3IZG|E Chain E, Bacteriophage T7 Prohead Shell Em-Derived Atomic Model gi|312207827|pdb|3IZG|F Chain F, Bacteriophage T7 Prohead Shell Em-Derived Atomic Model gi|313103524|pdb|2XVR|A Chain A, Phage T7 Empty Mature Head Shell gi|313103525|pdb|2XVR|B Chain B, Phage T7 Empty Mature Head Shell gi|313103526|pdb|2XVR|C Chain C, Phage T7 Empty 
Mature Head Shell gi|313103527|pdb|2XVR|D Chain D, Phage T7 Empty Mature Head Shell gi|313103528|pdb|2XVR|E Chain E, Phage T7 Empty Mature Head Shell gi|313103529|pdb|2XVR|F Chain F, Phage T7 Empty Mature Head Shell gi|313103530|pdb|2XVR|G Chain G, Phage T7 Empty Mature Head Shell gi|15604|emb|CAA24427.1| unnamed protein product [Enterobacteria phage T7] gi|37956680|gb|AAP33950.1| gene 10A [Enterobacteria phage T7] gi|265525001|gb|ACY75864.1| major capsid protein 10A [Enterobacteria phage T7] Length = 345 Score = 151 bits (382), Expect = 1e-34, Method: Composition-based stats. Identities = 48/326 (14%), Positives = 95/326 (29%), Gaps = 21/326 (6%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S ++ SA V T+A + D Sbjct: 26 ALFLKVFGGEVLTAFART-SVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGENLDDKR 84 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y S ++ D A+L + G Sbjct: 85 KDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAG 144 Query: 127 --------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDS 178 +G T + +N + + K I L A++ K Y+ Sbjct: 145 LCNVESKYNENIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAAD 204 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 Y D ++++ A + +Y + G I G + + + A Sbjct: 205 RVFYC--DPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAR 262 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPG 298 G P K + K + +SAV + + + ++ ++ Sbjct: 263 EGTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 315 Query: 299 KWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 316 -FQADQIIAKYAMGHGGLRPEAAGAV 340 >gi|194100288|ref|YP_002003486.1| gp10A [Enterobacteria phage BA14] gi|193201283|gb|ACF15763.1| gp10A [Enterobacteria phage BA14] Length = 347 Score = 151 bits (381), Expect = 1e-34, Method: Composition-based stats. 
Identities = 45/326 (13%), Positives = 87/326 (26%), Gaps = 18/326 (5%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S ++ + SA V T+A + D Sbjct: 25 ALFLKVFGGEVLTAFTRT-SVTMNKHLVRSIQSGKSAQFPVLGRTKAAYLQPGENLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y + ++ D A+L M Sbjct: 84 KDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAK 143 Query: 127 --------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDS 178 +G E + + I QL A++ Y+ Sbjct: 144 LCNLPSASDENIAGLGKAHVLEVGEQSALKGDQVKLGQAIIAQLTLARAKLTSNYVPSS- 202 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 D ++++ A + +Y G I G I + + Sbjct: 203 -DRVFYTTPDNYSAILAALMPNAANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGEDR 261 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPG 298 T+ SS V +SAV + K + ++ ++ Sbjct: 262 TEEG----VNPTGQKHAFPETSSGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRAN 317 Query: 299 KWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 318 -FQADQIIAKYAMGHGGLRPEACGAL 342 >gi|37956733|gb|AAP34002.1| gene 10A [Enterobacteria phage T7] Length = 345 Score = 151 bits (381), Expect = 1e-34, Method: Composition-based stats. 
Identities = 48/326 (14%), Positives = 94/326 (28%), Gaps = 21/326 (6%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S ++ SA V T+A + D Sbjct: 26 ALFLKIFGGEVLTAFART-SVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGENLDDKR 84 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y S ++ D A+L + G Sbjct: 85 KDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAG 144 Query: 127 --------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDS 178 +G T +N + + K I L A++ K Y+ Sbjct: 145 LCNVESKYNENIEGLGTATVIEITQNKTALTDQVVLGKEIIAALTKARATLTKNYVPAAD 204 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 Y D ++++ A + +Y + G I G + + + A Sbjct: 205 RVFYC--DPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGIAR 262 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPG 298 G P K + K + +SAV + + + ++ ++ Sbjct: 263 EGTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 315 Query: 299 KWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 316 -FQADQIIAKYAMGHGGLRPEAAGAV 340 >gi|212671413|ref|YP_002308413.1| major capsid protein 10A [Kluyvera phage Kvp1] gi|211997257|gb|ACJ14574.1| major capsid protein 10A [Kluyvera phage Kvp1] Length = 347 Score = 151 bits (381), Expect = 1e-34, Method: Composition-based stats. 
Identities = 43/326 (13%), Positives = 86/326 (26%), Gaps = 18/326 (5%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S ++ + SA V T+A + D Sbjct: 25 ALFLKVFGGEVLTAFTRT-SVTMNKHLVRSIQSGKSAQFPVLGRTKAAYLQPGENLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y + ++ D A+L M Sbjct: 84 KDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAK 143 Query: 127 --------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDS 178 +G + + + I QL A++ Y+ Sbjct: 144 LCNLPSAKDENIAGLGKAHVLEVGKQSDLRGDQVKLGQAIIAQLTLARAKLTSNYVPSA- 202 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 D ++++ A + +Y G I G I + + Sbjct: 203 -DRVFYTTPDNYSAILAALMPNAANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGEDR 261 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPG 298 + SS V +SAV + K + ++ ++ Sbjct: 262 PEEG----VNPTGQKHAFPETSSGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRAN 317 Query: 299 KWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 318 -FQADQIIAKYAMGHGGLRPEACGAL 342 >gi|189427233|ref|YP_001949782.1| gp10B [Salmonella phage phiSG-JL2] gi|189085886|gb|ACD75701.1| gp10B [Salmonella phage phiSG-JL2] Length = 393 Score = 151 bits (380), Expect = 2e-34, Method: Composition-based stats. 
Identities = 45/325 (13%), Positives = 93/325 (28%), Gaps = 18/325 (5%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S P ++ SA V T+A + D Sbjct: 25 ALFLKVFGGEVLTAFART-SVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGENLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y + ++ D A+L + G Sbjct: 84 KDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAG 143 Query: 127 VNK----KGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYID---VDSE 179 + + + ++ GD +G+ I A+ + + V + Sbjct: 144 LVNLPDASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAA 203 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 D ++++ A + +Y + G I G + + + Sbjct: 204 DRTFYTTPDNYSAILAALMPNAANYQALIDPERGTIRNVMGFEVVEVPHLTAGGAGDTRE 263 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 SSA + V +SAV + K + ++ ++ Sbjct: 264 -------DAPADQKHAFPATSSATVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN- 315 Query: 300 WHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ I Sbjct: 316 YQADQIIAKYAMGHGGLRPEAAGAI 340 >gi|9634035|ref|NP_052109.1| major capsid protein 10A [Yersinia phage phiYeO3-12] gi|6599026|emb|CAB63630.1| major capsid protein 10A [Yersinia phage phiYeO3-12] Length = 347 Score = 151 bits (380), Expect = 2e-34, Method: Composition-based stats. 
Identities = 44/325 (13%), Positives = 92/325 (28%), Gaps = 18/325 (5%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S P ++ SA V T+A + D Sbjct: 25 ALFLKVFGGEVLTAFART-SVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGENLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y + ++ D A+L + G Sbjct: 84 KDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAG 143 Query: 127 VNK----KGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYID---VDSE 179 + + + ++ GD +G+ I A+ + + V + Sbjct: 144 LVNLPDASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAA 203 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 D ++++ A + +Y + G I G + + + Sbjct: 204 DRTFYTTPDNYSAILAALMPNAANYQALIDHERGTIRNVMGFEVVEVPHLTAGGAGDTRE 263 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 SS + V +SAV + K + ++ ++ Sbjct: 264 -------DAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN- 315 Query: 300 WHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ I Sbjct: 316 YQADQIIAKYAMGHGGLRPEAAGAI 340 >gi|189427232|ref|YP_001949783.1| gp10A [Salmonella phage phiSG-JL2] gi|189085885|gb|ACD75700.1| gp10A [Salmonella phage phiSG-JL2] Length = 348 Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats. 
Identities = 45/325 (13%), Positives = 93/325 (28%), Gaps = 18/325 (5%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S P ++ SA V T+A + D Sbjct: 25 ALFLKVFGGEVLTAFART-SVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGENLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y + ++ D A+L + G Sbjct: 84 KDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAG 143 Query: 127 VNK----KGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYID---VDSE 179 + + + ++ GD +G+ I A+ + + V + Sbjct: 144 LVNLPDASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAA 203 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 D ++++ A + +Y + G I G + + + Sbjct: 204 DRTFYTTPDNYSAILAALMPNAANYQALIDPERGTIRNVMGFEVVEVPHLTAGGAGDTRE 263 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 SSA + V +SAV + K + ++ ++ Sbjct: 264 -------DAPADQKHAFPATSSATVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN- 315 Query: 300 WHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ I Sbjct: 316 YQADQIIAKYAMGHGGLRPEAAGAI 340 >gi|212671429|ref|YP_002308412.1| major capsid protein 10B [Kluyvera phage Kvp1] gi|211997273|gb|ACJ14590.1| major capsid protein 10B [Kluyvera phage Kvp1] Length = 392 Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats. 
Identities = 43/331 (12%), Positives = 87/331 (26%), Gaps = 18/331 (5%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S ++ + SA V T+A + D Sbjct: 25 ALFLKVFGGEVLTAFTRT-SVTMNKHLVRSIQSGKSAQFPVLGRTKAAYLQPGENLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y + ++ D A+L M Sbjct: 84 KDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAK 143 Query: 127 --------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDS 178 +G + + + I QL A++ Y+ Sbjct: 144 LCNLPSAKDENIAGLGKAHVLEVGKQSDLRGDQVKLGQAIIAQLTLARAKLTSNYVPSA- 202 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 D ++++ A + +Y G I G I + + Sbjct: 203 -DRVFYTTPDNYSAILAALMPNAANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGEDR 261 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPG 298 + SS V +SAV + K + ++ ++ Sbjct: 262 PEEG----VNPTGQKHAFPETSSGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRAN 317 Query: 299 KWHAPQITLTSSFGATRIEPDKILGIEISKD 329 + A QI + G + P+ + + Sbjct: 318 -FQADQIIAKYAMGHGGLRPEACGALVFQQG 347 >gi|30387487|ref|NP_848297.1| major capsid protein [Yersinia pestis phage phiA1122] gi|30314124|gb|AAP20532.1| major capsid protein [Yersinia pestis phage phiA1122] Length = 344 Score = 150 bits (379), Expect = 2e-34, Method: Composition-based stats. 
Identities = 48/334 (14%), Positives = 99/334 (29%), Gaps = 23/334 (6%) Query: 1 MATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG 60 MA ++L + F V A T S ++ SA V T+A + Sbjct: 19 MAAGDKL-ALFLKVFGGEVLTAFART-SVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAP 76 Query: 61 D-MPDTIYNATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDE 118 D I + + A+ I + + Y S ++ D Sbjct: 77 GENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADG 136 Query: 119 AILKGMLG--------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFR 170 A+L + G +G T + ++ + + K I L A++ Sbjct: 137 AVLAEIAGLCNVESQYNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALT 196 Query: 171 KRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVP 230 K Y+ Y D ++++ A + +Y + G I G + + + Sbjct: 197 KNYVPSSDRVFYC--DPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLT 254 Query: 231 GNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAID 290 + S + K + +SAV + + + Sbjct: 255 AGGAGTSREG--------TTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLA 306 Query: 291 VQHSKDPGKWHAPQITLTSSFGATRIEPDKILGI 324 ++ ++ + A QI + G + P+ + Sbjct: 307 LERARRAN-FQADQIIAKYAMGHGGLRPEAAGAV 339 >gi|37956838|gb|AAP34105.1| gene 10A [Enterobacteria phage T7] gi|37956891|gb|AAP34157.1| gene 10A [Enterobacteria phage T7] Length = 345 Score = 150 bits (378), Expect = 3e-34, Method: Composition-based stats. 
Identities = 47/326 (14%), Positives = 95/326 (29%), Gaps = 21/326 (6%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S ++ SA V T+A + D Sbjct: 26 ALFLKVFGGEVLTAFART-SVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGENLDDKR 84 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y S ++ D A+L + Sbjct: 85 KDIKHTEKVIIIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAD 144 Query: 127 --------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDS 178 +G T + +N + + K I L A+++ K Y+ Sbjct: 145 LCNVESKYNENIEGLGTATVIETTQNKAALTDQIALGKEIIAALTKARAVLTKNYVPAAD 204 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 Y D ++++ A + +Y + G I G + + + A Sbjct: 205 RVFYC--DPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAR 262 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPG 298 G P K + K + +SAV + + + ++ ++ Sbjct: 263 EGTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 315 Query: 299 KWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 316 -FQADQIIAKYAMGHGGLRPEAAGAV 340 >gi|194100498|ref|YP_002003343.1| gp10 [Yersinia phage Yepe2] gi|193201231|gb|ACF15712.1| gp10 [Yersinia phage Yepe2] Length = 353 Score = 149 bits (375), Expect = 6e-34, Method: Composition-based stats. 
Identities = 44/335 (13%), Positives = 88/335 (26%), Gaps = 18/335 (5%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S ++ + SA V T+A + D Sbjct: 25 ALFLKVFGGEVLTAFTRT-SVTMNKHLVRSIQSGKSAQFPVLGRTKAAYLQPGENLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y + ++ D A+L M Sbjct: 84 KDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAK 143 Query: 127 --------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDS 178 +G + + + I QL A++ Y+ Sbjct: 144 LCNLPTANNENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSS- 202 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 D ++++ A + +Y G I G I + + Sbjct: 203 -DRVFYTTPDNYSAILAALMPNAANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGDNR 261 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPG 298 + +S V +SAV + K + ++ ++ Sbjct: 262 AEEG----VAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRAN 317 Query: 299 KWHAPQITLTSSFGATRIEPDKILGIEISKDSLKG 333 + A QI + G + P+ + K L Sbjct: 318 -FQADQIIAKYAMGHGGLRPEACGALVFKKTQLLA 351 >gi|281416197|ref|YP_003347932.1| major capsid protein [Vibrio phage N4] gi|237701504|gb|ACR16497.1| major capsid protein [Vibrio phage N4] Length = 374 Score = 149 bits (375), Expect = 7e-34, Method: Composition-based stats. 
Identities = 46/333 (13%), Positives = 92/333 (27%), Gaps = 12/333 (3%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHE-IVGDMPDTIY 67 + F V A + L V + SA V T+A G D Sbjct: 24 ALFLKVFGGEVLTAFERQAKTL-SKVMTRTITSGKSAQFPVMGRTKARYLKQGQSLDDGR 82 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + + I + + Y++ A+ D A M Sbjct: 83 EDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMGEALAMAADVANYAEMAK 142 Query: 127 VNKKGKIGAETE--FFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDV--DSEQVY 182 + K +++ + + Q+I A + R + + Sbjct: 143 LVNSRKETTNENIAGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRT 202 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 D ++++ A + +Y + G I G + + + F Sbjct: 203 FYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPADAF- 261 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHA 302 DG ++ K V +SAV + K + ++ ++ P + A Sbjct: 262 ---DGTGHIFPSTGDSTTAGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPE-YQA 317 Query: 303 PQITLTSSFGATRIEPDKILGIEISKDSLKGVP 335 QI + G + P+ + I V Sbjct: 318 DQIIAKYAMGHGGLRPEAVGAIIFVDGDTPAVA 350 >gi|194100397|ref|YP_002003972.1| gp10A [Enterobacteria phage 13a] gi|193201444|gb|ACF15921.1| gp10A [Enterobacteria phage 13a] Length = 344 Score = 148 bits (374), Expect = 9e-34, Method: Composition-based stats. 
Identities = 46/329 (13%), Positives = 95/329 (28%), Gaps = 22/329 (6%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S ++ SA V T+A + D Sbjct: 26 ALFLKVFGGEVLTAFART-SVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGENLDDKR 84 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y S ++ D A+L + G Sbjct: 85 KDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAG 144 Query: 127 --------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDS 178 +GA T + +N + + K I L A++ K Y+ Sbjct: 145 LCNVESQYDENIAGLGAATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAAD 204 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 Y D ++++ A + +Y + G I G + + + + Sbjct: 205 RVFYC--DPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSR 262 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPG 298 S + K + +SAV + + + ++ ++ Sbjct: 263 EG--------TAGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 314 Query: 299 KWHAPQITLTSSFGATRIEPDKILGIEIS 327 + A QI + G + P+ + + Sbjct: 315 -FQADQIIAKYAMGHGGLRPEAAGAVVFT 342 >gi|312436376|gb|ADQ83185.1| major capsid protein [Yersinia phage Yep-phi] Length = 347 Score = 148 bits (374), Expect = 1e-33, Method: Composition-based stats. 
Identities = 42/326 (12%), Positives = 86/326 (26%), Gaps = 18/326 (5%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S ++ + SA V T+A + D Sbjct: 25 ALFLKVFGGEVLTAFTRT-SVTMNKHLVRSIQSGKSAQFPVLGRTKAAYLQPGENLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y + ++ D A+L M Sbjct: 84 KDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAK 143 Query: 127 --------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDS 178 +G + + + I QL A++ Y+ Sbjct: 144 LCNLPAANNENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTANYVPSS- 202 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 D ++++ A + +Y G I G I + + Sbjct: 203 -DRVFYTTPDNYSAILAALMPNAANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGDNR 261 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPG 298 + +S V +SAV + K + ++ ++ Sbjct: 262 AEEG----VAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRAN 317 Query: 299 KWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 318 -FQADQIIAKYAMGHGGLRPEACGAL 342 >gi|17570825|ref|NP_523334.1| minor capsid protein 10B [Enterobacteria phage T3] gi|1352833|sp|P19728|VC10B_BPT3 RecName: Full=Minor capsid protein 10B gi|1001910|emb|CAA35155.1| 10B [Enterobacteria phage T3] gi|17384309|emb|CAC86297.1| minor capsid protein 10B [Enterobacteria phage T3] Length = 433 Score = 148 bits (372), Expect = 1e-33, Method: Composition-based stats. 
Identities = 43/325 (13%), Positives = 91/325 (28%), Gaps = 18/325 (5%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S P ++ SA V T+A + D Sbjct: 25 ALFLKVFGGEVLTAFART-SVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGENLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y + ++ D A+L + G Sbjct: 84 KDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAG 143 Query: 127 VNK----KGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYID---VDSE 179 + + + ++ G +G+ I A+ + + V + Sbjct: 144 LVNLPDGSNENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAA 203 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 D ++++ A + +Y + G I G + + + Sbjct: 204 DRTFYTTPDNYSAILAALMPNAANYQALLDPERGTIRNVMGFEVVEVPHLTAGGAGDTRE 263 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 SS + V +SAV + K + ++ ++ Sbjct: 264 -------DAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN- 315 Query: 300 WHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ I Sbjct: 316 YQADQIIAKYAMGHGGLRPEAAGAI 340 >gi|119637776|ref|YP_919012.1| Major capsid protein [Yersinia phage Berlin] gi|119391807|emb|CAJ70680.1| hypothetical protein [Yersinia phage Berlin] Length = 347 Score = 148 bits (372), Expect = 2e-33, Method: Composition-based stats. 
Identities = 42/326 (12%), Positives = 86/326 (26%), Gaps = 18/326 (5%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S ++ + SA V T+A + D Sbjct: 25 ALFLKVFGGEVLTAFTRT-SVTMNKHLVRSIQSGKSAQFPVLGRTKAAYLQPGENLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y + ++ D A+L M Sbjct: 84 KDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAK 143 Query: 127 --------VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDS 178 +G + + + I QL A++ Y+ Sbjct: 144 LCNLPTANNENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSS- 202 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 D ++++ A + +Y G I G I + + Sbjct: 203 -DRVFYTTPDNYSAILAALMPNAANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGDNR 261 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPG 298 + +S V +SAV + K + ++ ++ Sbjct: 262 AEEG----VAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRAN 317 Query: 299 KWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 318 -FQADQIIAKYAMGHGGLRPEACGAL 342 >gi|29366729|ref|NP_813774.1| major capsid protein [Pseudomonas phage gh-1] gi|29243588|gb|AAO73167.1|AF493143_28 major capsid protein A [Pseudomonas phage gh-1] Length = 347 Score = 147 bits (371), Expect = 2e-33, Method: Composition-based stats. 
Identities = 39/331 (11%), Positives = 93/331 (28%), Gaps = 17/331 (5%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A S + + SA V T+ + + D Sbjct: 25 ALFLKVFGGEVLTAFVRR-SVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGML- 125 + + ++ I + + Y++ A+ D A+L M Sbjct: 84 KDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAK 143 Query: 126 -----GVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQ-LITAKSIFRKRYIDVDSE 179 + + G + + + I + L A++ K Y+ Sbjct: 144 LCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDR 203 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 + Y + ++++ + + +Y + G I G I + + Sbjct: 204 RFYCA--PEDYSAILSALMPNAANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNP 261 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 ++ + V +SAV + K + ++ ++ P Sbjct: 262 ADG----VAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE- 316 Query: 300 WHAPQITLTSSFGATRIEPDKILGIEISKDS 330 + A QI + G + P+ + + + Sbjct: 317 FQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 >gi|332875221|ref|ZP_08443054.1| hypothetical protein HMPREF0022_02687 [Acinetobacter baumannii 6014059] gi|332736665|gb|EGJ67659.1| hypothetical protein HMPREF0022_02687 [Acinetobacter baumannii 6014059] Length = 299 Score = 147 bits (371), Expect = 2e-33, Method: Composition-based stats. 
Identities = 63/328 (19%), Positives = 110/328 (33%), Gaps = 35/328 (10%) Query: 1 MATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG 60 MA + ++ A + ++ E+A + +S+L T + S + E Sbjct: 1 MANENKITAAFVIQYHDTYEIAAMQNESRLLKTAVNRGKIQGESFTINDMGQVEMSPSGN 60 Query: 61 DMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAI 120 DT + D R + + I+ P Y A +RK D+ I Sbjct: 61 RFGDTTWTIPDAGVRTALMADYDLFIPIESRDLPKLKAVPTDKYMKNLINARNRKIDDII 120 Query: 121 LKGMLGV-NKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDV-DS 178 + ++G + A + N+ + + T Q+I AKSIFR D + Sbjct: 121 YQALVGGVTRTTVNDAGVKSTGTVNLPAGQIILSGYGTLKQQIIKAKSIFRANECDEHNG 180 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKI-EAFAGVWFINMEKVPGNDLFPA 237 E + ++ + + + TS D++ LQ G + + GV +I EK+ Sbjct: 181 ETLNIIYTASMLEDILGDTTLTSADFMAVKMLQEGAVSGKWLGVNWIPYEKLNNG----- 235 Query: 238 GTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDP 297 + +Y SAV F SK P Sbjct: 236 ---------------------------AGGATEKRTVMYTSSAVHFGDADITGFDISKRP 268 Query: 298 GKWHAPQITLTSSFGATRIEPDKILGIE 325 K + Q+ SF A R K++ I+ Sbjct: 269 DKKNISQVGGVHSFAAGRANEQKVVAID 296 >gi|323512113|gb|ADX87573.1| major capsid protein [Vibrio phage ICP3_2009_A] Length = 374 Score = 147 bits (370), Expect = 3e-33, Method: Composition-based stats. 
Identities = 45/333 (13%), Positives = 91/333 (27%), Gaps = 12/333 (3%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHE-IVGDMPDTIY 67 + F V A + L V + SA V T+A G D Sbjct: 24 ALFLKVFGGEVLTAFERQAKTL-SKVMTRTITSGKSAQFPVMGRTKARYLKQGQSLDDGR 82 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + + I + + Y++ A+ D A M Sbjct: 83 EDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMGEALAMAADVANYAEMAK 142 Query: 127 VNKKGKIGAETE--FFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDV--DSEQVY 182 + K +++ + + Q+I + R + + Sbjct: 143 LVNSRKETTNENIAGLGAASLVKITGKKEDPAKYGTQVIQGLTYARAAFAKKYIPAGDRT 202 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 D ++++ A + +Y + G I G + + + F Sbjct: 203 FYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPADAF- 261 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHA 302 DG ++ K V +SAV + K + ++ ++ P + A Sbjct: 262 ---DGTGHIFPATGDSATTGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPE-YQA 317 Query: 303 PQITLTSSFGATRIEPDKILGIEISKDSLKGVP 335 QI + G + P+ + I V Sbjct: 318 DQIIAKYAMGHGGLRPEAVGAIIFVDGDTPAVA 350 >gi|323512064|gb|ADX87525.1| major capsid protein [Vibrio phage ICP3_2009_B] Length = 375 Score = 147 bits (370), Expect = 3e-33, Method: Composition-based stats. 
Identities = 45/333 (13%), Positives = 91/333 (27%), Gaps = 12/333 (3%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHE-IVGDMPDTIY 67 + F V A + L V + SA V T+A G D Sbjct: 25 ALFLKVFGGEVLTAFERQAKTL-SKVMTRTITSGKSAQFPVMGRTKARYLKQGQSLDDGR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + + I + + Y++ A+ D A M Sbjct: 84 EDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMGEALAMAADVANYAEMAK 143 Query: 127 VNKKGKIGAETE--FFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDV--DSEQVY 182 + K +++ + + Q+I + R + + Sbjct: 144 LVNSRKETTNENIAGLGAASLVKITGKKEDPAKYGTQVIQGLTYARAAFAKKYIPAGDRT 203 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 D ++++ A + +Y + G I G + + + F Sbjct: 204 FYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPADAF- 262 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHA 302 DG ++ K V +SAV + K + ++ ++ P + A Sbjct: 263 ---DGTGHIFPATGDSATTGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPE-YQA 318 Query: 303 PQITLTSSFGATRIEPDKILGIEISKDSLKGVP 335 QI + G + P+ + I V Sbjct: 319 DQIIAKYAMGHGGLRPEAVGAIIFVDGDTPAVA 351 >gi|325171311|ref|YP_004251282.1| major capsid protein [Vibrio phage ICP3] gi|323512017|gb|ADX87479.1| major capsid protein [Vibrio phage ICP3] gi|323512162|gb|ADX87621.1| major capsid protein [Vibrio phage ICP3_2008_A] gi|323512210|gb|ADX87668.1| major capsid protein [Vibrio phage ICP3_2007_A] Length = 374 Score = 147 bits (370), Expect = 3e-33, Method: Composition-based stats. 
Identities = 45/333 (13%), Positives = 91/333 (27%), Gaps = 12/333 (3%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHE-IVGDMPDTIY 67 + F V A + L V + SA V T+A G D Sbjct: 24 ALFLKVFGGEVLTAFERQAKTL-SKVMTRTITSGKSAQFPVMGRTKARYLKQGQSLDDGR 82 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + + I + + Y++ A+ D A M Sbjct: 83 EDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMGEALAMAADVANYAEMAK 142 Query: 127 VNKKGKIGAETE--FFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDV--DSEQVY 182 + K +++ + + Q+I + R + + Sbjct: 143 LVNSRKETTNENIAGLGAASLVKITGKKEDPAKYGTQVIQGLTYARAAFAKKYIPAGDRT 202 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 D ++++ A + +Y + G I G + + + F Sbjct: 203 FYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPADAF- 261 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHA 302 DG ++ K V +SAV + K + ++ ++ P + A Sbjct: 262 ---DGTGHIFPATGDSATTGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPE-YQA 317 Query: 303 PQITLTSSFGATRIEPDKILGIEISKDSLKGVP 335 QI + G + P+ + I V Sbjct: 318 DQIIAKYAMGHGGLRPEAVGAIIFVDGDTPAVA 350 >gi|50282923|ref|YP_052979.1| hypothetical protein VP2p08 [Vibrio phage VP2] gi|50282955|ref|YP_053011.1| hypothetical protein VP5_gp07 [Vibrio phage VP5] Length = 322 Score = 147 bits (370), Expect = 3e-33, Method: Composition-based stats. 
Identities = 54/328 (16%), Positives = 102/328 (31%), Gaps = 36/328 (10%) Query: 7 LATANIYEFKKHVELALQETKSKLRPTVTEQATEGEAS--ALVEVFKPTEAHEIVGDMPD 64 + A + ++ + + Q+ +KL+ + E+ + P Sbjct: 17 IDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQS 76 Query: 65 ------TIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDE 118 T N +R + ++ ++P + AM RK D+ Sbjct: 77 ADGTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDD 136 Query: 119 AILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDS 178 I+ G G EF + + I GD + F + I+ + Sbjct: 137 LIIAGAWKPASIKGTGQPVEFLATQEI-----GDGTKPISFDYVTEITERFLENEIEPEV 191 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQA-GKIEAFAGVWFINMEKVPGNDLFPA 237 +V V+ P+ L + ATS DY + LQ+ G I + G +I ++ D Sbjct: 192 SKVIVIGPTQARKLL-QITEATSADYTSAMDLQSKGIITNWMGYTWIVSTRLDKFDPTQW 250 Query: 238 GTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDP 297 G + P G A+ + K I + ++DP Sbjct: 251 G-------MAAEDGPQGDEIW--------------CIAMTDMALGYHSCKDIWTKVAEDP 289 Query: 298 GKWHAPQITLTSSFGATRIEPDKILGIE 325 A +I + R+E + I + Sbjct: 290 SASFAWRIYSAFTADCVRVEDEHIFKLR 317 >gi|169795388|ref|YP_001713181.1| hypothetical protein ABAYE1259 [Acinetobacter baumannii AYE] gi|169148315|emb|CAM86180.1| conserved hypothetical protein [Acinetobacter baumannii AYE] Length = 299 Score = 146 bits (368), Expect = 4e-33, Method: Composition-based stats. 
Identities = 64/328 (19%), Positives = 111/328 (33%), Gaps = 35/328 (10%) Query: 1 MATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG 60 MA + ++ A + ++ E+A + +S+L T + S + E Sbjct: 1 MANENKITAAFVIQYHDTYEIAAMQNESRLLKTAVNRGKIQGESFTINDMGQVEMSPSGN 60 Query: 61 DMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAI 120 DT + D R + + I+ P Y A +RK D+ I Sbjct: 61 RFGDTTWTIPDAGVRTALMADYDLFIPIESRDLPKLKAVPTDKYMKNLINARNRKIDDII 120 Query: 121 LKGMLGV-NKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDV-DS 178 + ++G + A + + N+ + F T Q+I AKSIFR D + Sbjct: 121 YQALVGGVTRTTVNDAGVKSTATVNLPAGQVILSGFGTLKQQIIKAKSIFRANECDEHNG 180 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKI-EAFAGVWFINMEKVPGNDLFPA 237 E + ++ + + + TS D++ LQ G + + GV +I EK+ Sbjct: 181 ETLNIIYTASMLEDILGDTTLTSADFMAVKMLQEGAVAGKWLGVNWIPYEKLNNG----- 235 Query: 238 GTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDP 297 + +Y SAV F SK P Sbjct: 236 ---------------------------AGGATEKRTVMYTSSAVHFGDADITGFDISKRP 268 Query: 298 GKWHAPQITLTSSFGATRIEPDKILGIE 325 K + Q+ SF A R K++ I+ Sbjct: 269 DKKNISQVGGVHSFAAGRANEQKVVAID 296 >gi|284519689|gb|ACF42037.2| minor capsid protein [Morganella phage MmP1] Length = 385 Score = 145 bits (366), Expect = 8e-33, Method: Composition-based stats. 
Identities = 42/332 (12%), Positives = 90/332 (27%), Gaps = 20/332 (6%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMP-DTIY 67 + F V A T S ++ SA V T A + D Sbjct: 25 ALFLKVFGGEVLTAFART-SVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y S ++ D A+L + G Sbjct: 84 KDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAG 143 Query: 127 ----VNKKGKIGAETEFFSKENILSAVEGDDFFKT---FIGQLITAKSIFRKRYIDVDSE 179 + A S + + + + I QL A++ Y+ Sbjct: 144 LCNMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSA-- 201 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 +V++++ A + +Y + G I G + + + Sbjct: 202 DRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRE 261 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 + + + V +SAV + K + ++ ++ Sbjct: 262 -------DETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE- 313 Query: 300 WHAPQITLTSSFGATRIEPDKILGIEISKDSL 331 + A QI + G + P+ + + Sbjct: 314 YQADQIIARYAMGHGGLRPEAAGALVFHSGLM 345 >gi|9634034|ref|NP_052108.1| minor capsid protein 10B [Yersinia phage phiYeO3-12] gi|6599025|emb|CAB63629.1| minor capsid protein 10B [Yersinia phage phiYeO3-12] Length = 433 Score = 145 bits (366), Expect = 8e-33, Method: Composition-based stats. 
Identities = 44/325 (13%), Positives = 92/325 (28%), Gaps = 18/325 (5%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A T S P ++ SA V T+A + D Sbjct: 25 ALFLKVFGGEVLTAFART-SVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGENLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y + ++ D A+L + G Sbjct: 84 KDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAG 143 Query: 127 VNK----KGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYID---VDSE 179 + + + ++ GD +G+ I A+ + + V + Sbjct: 144 LVNLPDASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAA 203 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 D ++++ A + +Y + G I G + + + Sbjct: 204 DRTFYTTPDNYSAILAALMPNAANYQALIDHERGTIRNVMGFEVVEVPHLTAGGAGDTRE 263 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 SS + V +SAV + K + ++ ++ Sbjct: 264 -------DAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN- 315 Query: 300 WHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ I Sbjct: 316 YQADQIIAKYAMGHGGLRPEAAGAI 340 >gi|293609616|ref|ZP_06691918.1| conserved hypothetical protein [Acinetobacter sp. SH024] gi|292828068|gb|EFF86431.1| conserved hypothetical protein [Acinetobacter sp. SH024] Length = 299 Score = 145 bits (364), Expect = 1e-32, Method: Composition-based stats. 
Identities = 65/328 (19%), Positives = 112/328 (34%), Gaps = 35/328 (10%) Query: 1 MATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG 60 MA + ++ A + ++ E+A + +S+L T + S + E Sbjct: 1 MANENKIMAAFVIQYHDTYEIAAMQNESRLLKTAVNRGKIQGESFTINDMGQVEMSPSGA 60 Query: 61 DMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAI 120 DT + D R + + I+ P Y A +RK D+ I Sbjct: 61 RFGDTNWTIPDAGERTALMADYDLFIPIESRDLPKLKAVPTDKYMKNLINARNRKIDDII 120 Query: 121 LKGMLGV-NKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDV-DS 178 + ++G +K A + S N+ + F + Q+I AKSIFR D + Sbjct: 121 YQALVGGVTRKTVNDAGVKSTSTVNLPAGQIILSGFGSLKQQIIKAKSIFRANECDEHNG 180 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKI-EAFAGVWFINMEKVPGNDLFPA 237 E + ++ + + + TS D++ LQ G + + GV +I EK+ Sbjct: 181 ETLNIIYTASMLEDILGDTTLTSADFMAVKMLQEGAVSGKWLGVNWIPYEKLNNG----- 235 Query: 238 GTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDP 297 + +Y SAV F SK P Sbjct: 236 ---------------------------AGGATEKRTVMYTSSAVHFGDADITGFDISKRP 268 Query: 298 GKWHAPQITLTSSFGATRIEPDKILGIE 325 K + Q+ SF A R K++ I+ Sbjct: 269 DKKNISQVGGVHSFAAGRANEQKVVAID 296 >gi|194473833|ref|YP_002048657.1| major capsid protein [Morganella phage MmP1] gi|194307054|gb|ACF42036.1| major capsid protein [Morganella phage MmP1] Length = 343 Score = 143 bits (361), Expect = 3e-32, Method: Composition-based stats. 
Identities = 42/328 (12%), Positives = 90/328 (27%), Gaps = 20/328 (6%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMP-DTIY 67 + F V A T S ++ SA V T A + D Sbjct: 25 ALFLKVFGGEVLTAFART-SVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y S ++ D A+L + G Sbjct: 84 KDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAG 143 Query: 127 ----VNKKGKIGAETEFFSKENILSAVEGDDFFKT---FIGQLITAKSIFRKRYIDVDSE 179 + A S + + + + I QL A++ Y+ Sbjct: 144 LCNMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSA-- 201 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 +V++++ A + +Y + G I G + + + Sbjct: 202 DRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRE 261 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 + + + V +SAV + K + ++ ++ Sbjct: 262 -------DETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE- 313 Query: 300 WHAPQITLTSSFGATRIEPDKILGIEIS 327 + A QI + G + P+ + + Sbjct: 314 YQADQIIARYAMGHGGLRPEAAGALVFT 341 >gi|315518950|dbj|BAJ51827.1| putative major capsid protein [Ralstonia phage RSB2] Length = 318 Score = 143 bits (360), Expect = 3e-32, Method: Composition-based stats. 
Identities = 44/325 (13%), Positives = 92/325 (28%), Gaps = 33/325 (10%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMP--DTI 66 + F V A T +K + + SA V T A + D Sbjct: 19 ALFLKMFAGEVLTAFART-AKTMDKHISRTIQSGKSAQFPVLGRTTAAYLAAGTSLDDQR 77 Query: 67 YNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 ++ V I + + Y+ A+ D + L + Sbjct: 78 VAIPHNEKVIVIDGLLTADVLITDIDDAMNHYDVRGEYSKQLGEALALTADGSNLAELAT 137 Query: 127 VNKKGKI----GAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + GA + V + + L TA+ K+Y+ Sbjct: 138 LASAAENLPGLGAGSIVELATATSVTVASPTVGQEILSALATARMTLGKKYVPS--GDRV 195 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 + + ++ + + +Y + G ++ G I + + T Sbjct: 196 FYVTPEAYSCILTALMPQAANYQAIVDPETGNLKNIHGFEIIEVPHFELGGVGGKHTFPA 255 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHA 302 GL K V +SA+ + K + ++ ++ P + A Sbjct: 256 GLA-----------------------GKMVGIAAHRSAIGTVKLKDLALERARRPE-YQA 291 Query: 303 PQITLTSSFGATRIEPDKILGIEIS 327 QI + G + P+ ++ I + Sbjct: 292 DQIIAKYAMGHGGLRPEAVVAITVQ 316 >gi|281416308|ref|YP_003347548.1| major capsid protein [Klebsiella phage KP32] gi|262410427|gb|ACY66692.1| major capsid protein [Klebsiella phage KP32] Length = 345 Score = 141 bits (356), Expect = 1e-31, Method: Composition-based stats. 
Identities = 46/328 (14%), Positives = 92/328 (28%), Gaps = 20/328 (6%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMP-DTIY 67 + F V A T S ++ SA V T+A + D Sbjct: 25 ALFLKVFGGEVLTAFART-SVTTNRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y S ++ D A+L + G Sbjct: 84 KDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAG 143 Query: 127 VNKKGKIGAETEFFSKENILSAVEGDDF-------FKTFIGQLITAKSIFRKRYIDVDSE 179 + E + L V + I QL A++ K Y+ + Sbjct: 144 LVNLADSVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPAN-- 201 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 DV++++ A + +Y + G I G + + + Sbjct: 202 DRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRP 261 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 + + K V +SAV + K + ++ ++ Sbjct: 262 DEGADATNQKHAFPATGG-------KVNKENVVGLFQHRSAVGTVKLKDLALERARRAE- 313 Query: 300 WHAPQITLTSSFGATRIEPDKILGIEIS 327 + A QI + G + P+ + + Sbjct: 314 YQADQIIAKYAMGHGGLRPESAGALVFT 341 >gi|326536939|ref|YP_004306347.1| major capsid protein 10A [Pseudomonas phage phiIBB-PF7A] gi|318054515|gb|ADV35691.1| major capsid protein 10A [Pseudomonas phage phiIBB-PF7A] Length = 341 Score = 141 bits (356), Expect = 1e-31, Method: Composition-based stats. 
Identities = 43/329 (13%), Positives = 89/329 (27%), Gaps = 22/329 (6%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A + S + + SA V T + D Sbjct: 25 ALFLKVFGGEVLTAFKRR-SVTMDKHMVRTIQSGKSAQFPVMGRTAGFYLAPGENIDDKQ 83 Query: 68 NATDQDRRWVGHSQFG-WAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A I + + Y++ A+ D A+L M Sbjct: 84 GDIKHTEKVITIDGLLVSAVMIFDIEDAMNHYDVSSEYSAQLGEALAISADGAVLAEMAL 143 Query: 127 V------NKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQ-LITAKSIFRKRYIDVDSE 179 + + + G + + + I + L A++ K Y+ Sbjct: 144 LCNLPEESDENIAGLGKASVLPIGKAADLMDPEARGKAILKGLTLARAKLTKNYVPSS-- 201 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 + + ++++ A + +Y + G I G I + + Sbjct: 202 DRFFYTSPEYYSAILAALMPNAANYAALIDPETGNIRNVMGFTVIEVPHLTVGGSGNDLA 261 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 SS + V +SAV + K + ++ S+ Sbjct: 262 GTSR---------KHAFPQVSSDTVKVAADNVVGLFNHRSAVGTVKLKDMALERSRRAN- 311 Query: 300 WHAPQITLTSSFGATRIEPDKILGIEISK 328 + QI + G + P+ + I K Sbjct: 312 FQGDQIIGKYAMGHGGLRPEAAGALVIEK 340 >gi|194100450|ref|YP_002003823.1| gp10A [Klebsiella phage K11] gi|193201389|gb|ACF15867.1| gp10A [Klebsiella phage K11] Length = 343 Score = 141 bits (356), Expect = 1e-31, Method: Composition-based stats. 
Identities = 46/328 (14%), Positives = 92/328 (28%), Gaps = 20/328 (6%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMP-DTIY 67 + F V A T S ++ SA V T+A + D Sbjct: 25 ALFLKVFGGEVLTAFART-SVTTNRHMQRQISSGKSAQFPVIGRTKAAYLQPGESLDDKR 83 Query: 68 NATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A+ I + + Y S ++ D A+L + G Sbjct: 84 KDIKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAG 143 Query: 127 VNKKGKIGAETEFFSKENILSAVEGDDF-------FKTFIGQLITAKSIFRKRYIDVDSE 179 + E + L V + I QL A++ K Y+ + Sbjct: 144 LVNLADSVNENIAGLGKPSLLEVGAKADLTDPVKLGQAVIAQLTIARAALTKNYVPAN-- 201 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 DV++++ A + +Y + G I G + + + Sbjct: 202 DRTFYTTPDVYSAILAALMPNAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDRP 261 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 + + K V +SAV + K + ++ ++ Sbjct: 262 DEGAEATNQKHAFPATGG-------KVNKENVVGLFQHRSAVGTVKLKDLALERARRTE- 313 Query: 300 WHAPQITLTSSFGATRIEPDKILGIEIS 327 + A QI + G + P+ + + Sbjct: 314 YQADQIVAKYAMGHGGLRPESAGALVFT 341 >gi|83313366|ref|YP_423630.1| hypothetical protein amb4267 [Magnetospirillum magneticum AMB-1] gi|82948207|dbj|BAE53071.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 209 Score = 141 bits (355), Expect = 1e-31, Method: Composition-based stats. 
Identities = 26/263 (9%), Positives = 64/263 (24%), Gaps = 54/263 (20%) Query: 67 YNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 D + + +D L N + A+ RK DE I+ + Sbjct: 1 MMNVDHSAVECQLFDYYAGDWLDKLDELKIEHNEREVLINAGAYALGRKTDELIIAELDK 60 Query: 127 VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIP 186 + G +++ A + + + D + V+ Sbjct: 61 STNYAQDGTT-------------------GLTKAKVLEAFEMLGETEVPDDGNRFAVVG- 100 Query: 187 SDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLID 246 W+ L ++ ++ +++P A L Sbjct: 101 WKQWSDLMSITEFAHA-------------------DYVGSDELPWKG-TQAKHWLGTLWM 140 Query: 247 GKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQIT 306 + Y ++AV + + + + A + Sbjct: 141 PHSG-------------LTKSSNVRYCYWYHQTAVGHACGSEVKSEITYQGDRA-AWFVN 186 Query: 307 LTSSFGATRIEPDKILGIEISKD 329 S GA ++ ++ + + Sbjct: 187 NFMSQGAALVDATGVVSLRCLES 209 >gi|326536940|ref|YP_004306346.1| minor capsid protein 10B [Pseudomonas phage phiIBB-PF7A] gi|318054516|gb|ADV35692.1| minor capsid protein 10B [Pseudomonas phage phiIBB-PF7A] Length = 519 Score = 139 bits (349), Expect = 6e-31, Method: Composition-based stats. 
Identities = 47/348 (13%), Positives = 97/348 (27%), Gaps = 27/348 (7%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD-MPDTIY 67 + F V A + S + + SA V T + D Sbjct: 25 ALFLKVFGGEVLTAFKRR-SVTMDKHMVRTIQSGKSAQFPVMGRTAGFYLAPGENIDDKQ 83 Query: 68 NATDQDRRWVGHSQFG-WAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 + + A I + + Y++ A+ D A+L M Sbjct: 84 GDIKHTEKVITIDGLLVSAVMIFDIEDAMNHYDVSSEYSAQLGEALAISADGAVLAEMAL 143 Query: 127 V------NKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQ-LITAKSIFRKRYIDVDSE 179 + + + G + + + I + L A++ K Y+ Sbjct: 144 LCNLPEESDENIAGLGKASVLPIGKAADLMDPEARGKAILKGLTLARAKLTKNYVPSS-- 201 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 + + ++++ A + +Y + G I G I + + Sbjct: 202 DRFFYTSPEYYSAILAALMPNAANYAALIDPETGNIRNVMGFTVIEVPHLTVGGSGNDLA 261 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 SS + V +SAV + K + ++ S+ Sbjct: 262 GTSR---------KHAFPQVSSDTVKVAADNVVGLFNHRSAVGTVKLKDMALERSRRAN- 311 Query: 300 WHAPQITLTSSFGATRIEPDKILGIEISKDSLK-----GVPVLKGTKA 342 + QI + G + P+ + I K+ + GV + + T A Sbjct: 312 FQGDQIIGKYAMGHGGLRPEAAGALVIEKEGVSVPDPTGVTLSQKTMA 359 >gi|254504590|ref|ZP_05116741.1| hypothetical protein SADFL11_4629 [Labrenzia alexandrii DFL-11] gi|254505320|ref|ZP_05117468.1| hypothetical protein SADFL11_PLAS18 [Labrenzia alexandrii DFL-11] gi|222436164|gb|EEE42846.1| hypothetical protein SADFL11_PLAS18 [Labrenzia alexandrii DFL-11] gi|222440661|gb|EEE47340.1| hypothetical protein SADFL11_4629 [Labrenzia alexandrii DFL-11] Length = 341 Score = 138 bits (348), Expect = 8e-31, Method: Composition-based stats. 
Identities = 30/322 (9%), Positives = 82/322 (25%), Gaps = 29/322 (9%) Query: 11 NIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNAT 70 + F V A QE L Q E SA + V + Sbjct: 37 FLKIFGGEVLTAFQERVLTL-DKHRVQTIEHGKSAQFPKTWKASSEYHVAGKELLGNDID 95 Query: 71 DQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKK 130 + I + + +++ A+ R+ D+ ++ ++ + Sbjct: 96 TGEVTITVDGLLVSHTEIYDLDRKMAHFDVTSEFSNELGRALAREFDKNSMRTIIRSART 155 Query: 131 GKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPS--- 187 G E+ G +I ++ A ++ + D + ++ Sbjct: 156 ASDGPFPGGNVIEDANLTNTGTISGVDWIDGIVQANQELFEKDVPEDHPRFMLVNKKVFD 215 Query: 188 --DVWASLFALERATSKDYIN--TAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPG 243 ++D+ G++ GV + +P Sbjct: 216 AIKYAKDASGNYLVLNRDFGTQAGGIAGRGEVLMVDGVAIMAQRTIP------------- 262 Query: 244 LIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAP 303 + K+ ++ A+ + + ++ ++ + Sbjct: 263 -------GTDESADAGVYPKYRGNYSTTTGVLWTPWALGTVKLMDLAME-TERDVRRQTD 314 Query: 304 QITLTSSFGATRIEPDKILGIE 325 + + G+ + P+ + Sbjct: 315 FMVAKMATGSDPLRPECAVEFR 336 >gi|77118198|ref|YP_338120.1| capsid [Enterobacteria phage K1F] gi|72527942|gb|AAZ72994.1| capsid [Enterobacteria phage K1F] gi|83308150|emb|CAJ29383.1| gp10A protein [Enterobacteria phage K1F] Length = 347 Score = 138 bits (346), Expect = 2e-30, Method: Composition-based stats. 
Identities = 42/325 (12%), Positives = 86/325 (26%), Gaps = 16/325 (4%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG--DMPDTI 66 + F V A S + + SA V T + + D Sbjct: 24 ALFLKVFAGEVLTAFTRR-SVTADKHIVRTIQNGKSAQFPVMGRTSGVYLAPGERLSDKR 82 Query: 67 YNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGML- 125 ++ I + + Y++ A+ D A+L M Sbjct: 83 KGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAI 142 Query: 126 -----GVNKKGKIG-AETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSE 179 + + G + IGQL A++ Y+ + Sbjct: 143 LCNLPAASNENIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVP--AG 200 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 Y D ++++ A + +Y + G I G + + + AG Sbjct: 201 DRYFYTTPDNYSAILAALMPNAANYAALIDPETGNIRNVMGFVVVEVPHLVQGG---AGE 257 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 +S+ + T V +SAV + + + ++ +D Sbjct: 258 TRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDVD- 316 Query: 300 WHAPQITLTSSFGATRIEPDKILGI 324 I + G + P+ + Sbjct: 317 AQGDLIVGKYAMGHGGLRPEAAGAL 341 >gi|326424992|ref|YP_004286214.1| virion structural protei [Pseudomonas phage phi15] gi|325048396|emb|CBZ42009.1| virion structural protei [Pseudomonas phage phi15] Length = 342 Score = 137 bits (345), Expect = 2e-30, Method: Composition-based stats. 
Identities = 38/325 (11%), Positives = 88/325 (27%), Gaps = 22/325 (6%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMP-DTIY 67 + F V A + S + + SA V T ++ D Sbjct: 25 ALFLKVFGGEVLTAFKRR-SVTMDKHMVRTIQSGKSAQFPVMGRTAGFYLLPGEDIDDKQ 83 Query: 68 NATDQDRRWVGHSQFG-WAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGML- 125 + + A I + + Y++ A+ D A+L M Sbjct: 84 GDIKHTEKVITIDGLLVSAVMIFDIEDAMNHYDVSSEYSAQLGEALAISADGAVLAEMAA 143 Query: 126 -----GVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQ-LITAKSIFRKRYIDVDSE 179 + G T + + + + I + L A++ + Y+ Sbjct: 144 LCNLPAATNENIAGLGTASVLEVGKAADLTDPEALGKAILKQLTLARAKLTRNYVPAS-- 201 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 + + ++++ + + +Y + G I G I + + Sbjct: 202 DRFFYTTPENYSAILSALMPNAANYAALIDPETGNIRNVMGFVVIEVPHLVVGGSGDN-- 259 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 + + K V +SAV + K + ++ ++ Sbjct: 260 -------LAGANQKHAFPATAGGDVKVAKDNVVGLFNHRSAVGTVKLKDMALERARRAN- 311 Query: 300 WHAPQITLTSSFGATRIEPDKILGI 324 + QI + G + P+ + Sbjct: 312 YQGDQIIGKYAMGHGGLRPEAAGAL 336 >gi|194100342|ref|YP_002003772.1| gp10A [Enterobacteria phage EcoDS1] gi|193201337|gb|ACF15816.1| gp10A [Enterobacteria phage EcoDS1] Length = 347 Score = 136 bits (341), Expect = 6e-30, Method: Composition-based stats. 
Identities = 40/325 (12%), Positives = 83/325 (25%), Gaps = 16/325 (4%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG--DMPDTI 66 + F V A S + + SA V T + + D Sbjct: 24 ALFLKVFAGEVLTAFTRR-SVTADKHIVRTIQNGKSAQFPVMGRTSGVYLAPGERLSDKR 82 Query: 67 YNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGML- 125 ++ I + + Y++ A+ D A+L M Sbjct: 83 KGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAI 142 Query: 126 ------GVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSE 179 N+ + IGQL A++ Y+ + Sbjct: 143 LCNLPVASNENIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVP--AG 200 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 Y D ++++ A + +Y + G I G + + + AG Sbjct: 201 DRYFYTTPDNYSAILAALMPNAANYAALIDPETGNIRNVMGFVVVEVPHLVQGG---AGE 257 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 ++ + V +SAV + + + ++ +D Sbjct: 258 TRGEDGITIASGQKHAFPATATGDVKVAMDNVVGLFSHRSAVGTVKLRDLALERDRDVD- 316 Query: 300 WHAPQITLTSSFGATRIEPDKILGI 324 I + G + P+ + Sbjct: 317 AQGDLIVGKYAMGHGGLRPEAAGAL 341 >gi|68299740|ref|YP_249589.1| Major capsid protein [Vibriophage VP4] gi|66473279|gb|AAY46288.1| major capsid protein [Vibriophage VP4] Length = 324 Score = 134 bits (337), Expect = 2e-29, Method: Composition-based stats. 
Identities = 41/307 (13%), Positives = 85/307 (27%), Gaps = 11/307 (3%) Query: 35 TEQATEGEASALVEVFKPTEAHE-IVGDMPDTIYNATDQDRRWVGHSQFGWAE-RIDPFA 92 + SA V T+A G D + + + I Sbjct: 1 MTRTITSGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIE 60 Query: 93 TLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETE--FFSKENILSAVE 150 + + Y++ A+ D A M + K +++ Sbjct: 61 DAMNHYDVRSEYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITG 120 Query: 151 GDDFFKTFIGQLITAKSIFRKRYIDV--DSEQVYVLIPSDVWASLFALERATSKDYINTA 208 + + Q+I A + R + + D ++++ A + +Y Sbjct: 121 KKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTFYTDPDTYSAILAALMPNAANYAALI 180 Query: 209 ALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTK 268 + G I G + + + F DG ++ K Sbjct: 181 DPETGNIRNVMGFEVVETPHMTAQMVTNPTDAF----DGTGHIFPATGDSTTTGKMTVGA 236 Query: 269 IKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKILGIEISK 328 V +SAV + K + ++ ++ P + A QI + G + P+ + I Sbjct: 237 DNVVGLFVHRSAVATLKLKDMALERARRPE-YQADQIIAKYAMGHGGLRPEAVGAIIFED 295 Query: 329 DSLKGVP 335 V Sbjct: 296 GETPAVA 302 >gi|320158422|ref|YP_004190800.1| minor capsid protein 10 [Vibrio vulnificus MO6-24/O] gi|319933734|gb|ADV88597.1| minor capsid protein 10 [Vibrio vulnificus MO6-24/O] Length = 365 Score = 134 bits (337), Expect = 2e-29, Method: Composition-based stats. 
Identities = 47/354 (13%), Positives = 95/354 (26%), Gaps = 34/354 (9%) Query: 11 NIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNAT 70 + F V + L + SA + A Sbjct: 25 FLKLFAGEVLTTFKADNIAL-GLTRVRTIRNGKSAEFPMIGKNTARYHTPGQLIDGNKIK 83 Query: 71 DQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKK 130 R I + Y+ +A+ D I + + Sbjct: 84 HAARIVTIDDVAVSPVFIADIDEAMTHYEFRSQYSQEGGSALAELIDRNIFRMVTKAAYI 143 Query: 131 GKIGAETEFFSKENILSAVEGDDFFKTFI------------GQLITAKSIFRKRYIDVDS 178 T + ++L +V D+ F I + A++I +K I Sbjct: 144 TNKTEATNAITDGSMLGSVLDDEDFTANIVVPSAYAGEHIVSAIFKARTILKKANI---K 200 Query: 179 EQVYVLIPSDVWASLFALERA-----TSKDYINTAALQAGKIEAFAGVWFINMEKVPGND 233 + ++P +V+ L ++ +KD T ++ G I AG+ + +P + Sbjct: 201 QVPVCVLPPEVYELLVNIQDTNKVTWMNKDVGGTGSMAEGSIARVAGISILESNHLPQEE 260 Query: 234 LFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQH 293 G + ++ K++ V I+ V + + + Sbjct: 261 --------KGAQTDPKPLADATVGSGNATKYDVEARGLVGLIFTPDCVATVKLMDVQTKD 312 Query: 294 SKDPGKWHAPQITLTSSFGATRIEPDKILGIEIS----KDSLKGVPVLKGTKAA 343 +P I G + P + I + ++ V G AA Sbjct: 313 VPEP-LRLGTTILSKLCVGHNILRPACAIAIVAKGTEAEAAMGANKVEVGAVAA 365 >gi|13186161|emb|CAC33472.1| hypothetical protein [Legionella pneumophila] Length = 289 Score = 133 bits (334), Expect = 4e-29, Method: Composition-based stats. 
Identities = 53/325 (16%), Positives = 99/325 (30%), Gaps = 43/325 (13%) Query: 3 TKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDM 62 L+ I +F Q L+ V ++ + VF A++ Sbjct: 1 MSLALSQIEIKQFLSEAHAEFQSEGFLLQGAVRTKSGTKGSIVHFPVFGEGMANQKAPQD 60 Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 T N +++D V + +E D +N + YA L A+ R+ D+ + Sbjct: 61 DITPMNVSNRDAEAVI-EDWYASEYADRSFQNKLAVNAVEEYAKLCAWAIGRRADQINID 119 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + G + +L A R+R + + Sbjct: 120 TIAGATYSATPNDQQGALVP---------VGTTGFTFEKLRQAHRWLRQRSA--NRGKRT 168 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKI--EAFAGVWFINMEKVPGNDLFPAGTK 240 V+I + L +E+ T+ Y+N L + F G+ FI + + L G Sbjct: 169 VIIDAIAEEQLLNVEQLTNSFYVNQKILDNDGLHGMTFLGMNFIVIPSMQEGGLPTTGGG 228 Query: 241 FPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKW 300 G E AV + Q + + S + K Sbjct: 229 TVGRAFFINEM----------------------------AVGYAQSERLGGDISWENIKT 260 Query: 301 HAPQITLTSSFGATRIEPDKILGIE 325 + I + GA I+P ++ ++ Sbjct: 261 -SYLINMWMEAGAVVIDPKGLVEVD 284 >gi|148724482|ref|YP_001285448.1| major capsid protein [Cyanophage Syn5] gi|145588127|gb|ABP87946.1| major capsid protein [Synechococcus phage Syn5] Length = 332 Score = 131 bits (328), Expect = 2e-28, Method: Composition-based stats. 
Identities = 31/315 (9%), Positives = 80/315 (25%), Gaps = 25/315 (7%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYN 68 + F V A S + V G S A P Sbjct: 27 ATALKLFSGEVFTAF-NNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGTPIVGDA 85 Query: 69 ATDQ-DRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGV 127 ++ V ++ + + S + + A+ DE I + + Sbjct: 86 GIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKA 145 Query: 128 NKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPS 187 + + NI + + + + A ++ +R + +V VL P Sbjct: 146 SAEASPVTGEPGGFHVNIGAGNT--NDAQAIVDGFFEAAAVLDERSAPQEG-RVAVLSPR 202 Query: 188 DVWASLFA-LERATSKDYINTA--ALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGL 244 ++ + + +++ N+ + + AG+ + + Sbjct: 203 QYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIRILKSNNL--------------- 247 Query: 245 IDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVF--TQRKAIDVQHSKDPGKWHA 302 + + + ++ I+ + A + I ++ Sbjct: 248 AGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQG 307 Query: 303 PQITLTSSFGATRIE 317 I + G + Sbjct: 308 DLIVGKLAMGCGSLR 322 >gi|313892480|ref|ZP_07826069.1| minor capsid protein 10 family protein [Dialister microaerophilus UPII 345-E] gi|313119059|gb|EFR42262.1| minor capsid protein 10 family protein [Dialister microaerophilus UPII 345-E] Length = 320 Score = 130 bits (327), Expect = 3e-28, Method: Composition-based stats. 
Identities = 46/316 (14%), Positives = 90/316 (28%), Gaps = 24/316 (7%) Query: 11 NIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHE-IVGDMPDTIYNA 69 + F A + S + E SA VF EA G D I Sbjct: 20 YLKVFAGETITAFER-ASVTMGRHIVRTIEHGKSAQFPVFGRAEAAYLKRGGSLDDIRKK 78 Query: 70 TD-QDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVN 128 ++ V ++ I + + Y+ A+ D A+L + Sbjct: 79 IPGAEKNIVIDGLLTTSQLIADIDEAMTHFDVRSEYSKQMGEALALAADGAVLAEAAKLV 138 Query: 129 KKGKIGAETEFFSKENILSAVEGDDFF--KTFIGQLITAKSIFRKRYIDVDSEQVYVLIP 186 GK + ++ G K + L+ K+ ++ + Y + Sbjct: 139 ADGKENITGLGKGEALTITGTAGITQDFGKAVVESLLNVKAKMSLLHVPAT--ERYCYMT 196 Query: 187 SDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLID 246 +L A A ++DY A + G + AG I + G Sbjct: 197 PIGVNALVASLVAINRDYGAVATITEGNVLRVAGFDIIETPHLTQGGADATGILQGKGHV 256 Query: 247 GKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQIT 306 K K ++AV + K + ++ ++ + A + Sbjct: 257 FP----------------TQYKDKCTFIAMHRTAVGTVKLKDLALEKARRAE-YQADMLV 299 Query: 307 LTSSFGATRIEPDKIL 322 + + G + P+ + Sbjct: 300 ASYAMGHGGLRPEAVF 315 >gi|294648403|ref|ZP_06725902.1| conserved hypothetical protein [Acinetobacter haemolyticus ATCC 19194] gi|292825708|gb|EFF84412.1| conserved hypothetical protein [Acinetobacter haemolyticus ATCC 19194] Length = 290 Score = 129 bits (323), Expect = 7e-28, Method: Composition-based stats. 
Identities = 52/322 (16%), Positives = 97/322 (30%), Gaps = 39/322 (12%) Query: 5 EQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIV-GDMP 63 + + + ++ L++ +SKL TVT S V Sbjct: 4 NTIDSVFVKQYADTYVALLEQKESKLLSTVTNVGAVTGTSFTVNEMGTLGDEFNTLTRFG 63 Query: 64 DTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKG 123 +T Y R + F R+ P A +RK D + Sbjct: 64 ETAYTDASFASRLATMNDFPNFTRLAIQDLYKLKAQPQDQLLQRLHAKWNRKVDSVVYNA 123 Query: 124 MLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYV 183 ++G + +G + + S + GD LI ++ F + D D +Y+ Sbjct: 124 LIGNAARKVVG-ADTYTNVALPASQILGDAAVAPTKKLLIDIRTKFMENECDED---IYI 179 Query: 184 LIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPG 243 S + S+ A TS D++ LQ G++ F G +++ E + D A Sbjct: 180 TYDSSLLNSILADPTLTSSDFLAGQMLQKGEVSNFLGFNWVHAEFIKAADGLSAT----- 234 Query: 244 LIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAP 303 Y +SAV ++ + Sbjct: 235 -----------------------------GVAYTRSAVEVGINSISPLKIVEVETANRYH 265 Query: 304 QITLTSSFGATRIEPDKILGIE 325 I + GA R + +++ + Sbjct: 266 SIGHIEALGAVRTDEKRVVAFK 287 >gi|148747831|ref|YP_001285797.1| capsid protein [Phormidium phage Pf-WMP3] gi|146230064|gb|ABQ12472.1| capsid protein [Phormidium phage Pf-WMP3] Length = 381 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. 
Identities = 41/359 (11%), Positives = 98/359 (27%), Gaps = 50/359 (13%) Query: 9 TANIYE-FKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAH--EIVGDMPDT 65 I E + V + + + L T + EG+ L+ + + A + P Sbjct: 21 QVFIPEVWSSEVRMFRDQKFAALEAT-KKIPFEGKKGDLIHIPNISRAAVYDKQPQTPVN 79 Query: 66 IYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDE--AILKG 123 + TD + + + I+ + Y A A+ R D + Sbjct: 80 LQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRA 139 Query: 124 MLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYV 183 ++ +I + + + + G L+ AK + + + V Sbjct: 140 VINAFPSQRIYSYDTTLGDGTVNAHLTG-TPAPLTYAALLLAKQKLDEADVPQEG--RIV 196 Query: 184 LIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFP------- 236 ++ + L ++ + S D+ + +G + G+ I ++ N L Sbjct: 197 MVSPAQYIDLLSINQFISVDFSQVKPVTSGVVGTILGMEVIVTTQIGINSLTGYVNGQGA 256 Query: 237 ----AGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIY---------------- 276 S++ + + LP++ Sbjct: 257 PTQPTPGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAADGGQTLGS 316 Query: 277 -------------CKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKIL 322 + ++ + + S + A + +GA PD + Sbjct: 317 FGGANRWATAVVCHPDWLAVGVQQNVKSESS-RETMYLADAFVTSCVYGAKVFRPDHCV 374 >gi|291335870|gb|ADD95466.1| minor capsid protein 10 [uncultured phage MedDCM-OCT-S08-C304] Length = 437 Score = 125 bits (313), Expect = 1e-26, Method: Composition-based stats. 
Identities = 34/314 (10%), Positives = 74/314 (23%), Gaps = 13/314 (4%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYN 68 + F + Q + R VT++ + S + P Sbjct: 28 ATYLKLFSGEMFKGFQ-HNTIARDLVTKRTLKNGKSLQFIYTGRMTSDYHTPGTPILGNA 86 Query: 69 A---TDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGML 125 ++ V + + S + + A+ D I + + Sbjct: 87 DKAPPVAEKTIVMDDLLVSSAFVYDLDETLSHYDLRGEISRKIGYALAENYDRKIFRAIA 146 Query: 126 GVNKKGKIGAETEFFSKE------NILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSE 179 ++ + T F N + A S+ ++ + D Sbjct: 147 KGARQASPISATGFVEPGGTQIQLNGTQNNTQATTASNLVTGFYDAASVLDEKGVSSDG- 205 Query: 180 QVYVLIPSDVWASL--FALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPA 237 +V VL P +A + ++D T + + AG+ +P F Sbjct: 206 RVAVLNPRQYYALIQQTGDNGLINRDVQGTGLQSGEGVVSIAGIKIYKSMNLPFLGKFGT 265 Query: 238 GTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDP 297 + + +++ V +V Sbjct: 266 ANTISNAGSFIGQSMDSGSGRQTATYARSGTTITVTLNAHGLSVGDKVVFDATAGGGTSG 325 Query: 298 GKWHAPQITLTSSF 311 A T T + Sbjct: 326 TYTVATVATNTFTI 339 Score = 47.6 bits (111), Expect = 0.002, Method: Composition-based stats. Identities = 13/96 (13%), Positives = 24/96 (25%), Gaps = 2/96 (2%) Query: 231 GNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAID 290 + F G + G V + I+ K A + Sbjct: 333 ATNTFTITDTVSGTVSGGTACAFNIAGVNNGYGEAGDFAGSCGLIFQKEAAGVVEAIGPQ 392 Query: 291 VQHS--KDPGKWHAPQITLTSSFGATRIEPDKILGI 324 VQ + + I + GA + P + + Sbjct: 393 VQVTNGDISVIYQGDVILGRMAMGADYLNPAACVEL 428 >gi|310005863|gb|ADP00248.1| major capsid protein [Cyanophage Syn26] Length = 437 Score = 123 bits (308), Expect = 3e-26, Method: Composition-based stats. 
Identities = 27/285 (9%), Positives = 69/285 (24%), Gaps = 13/285 (4%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYN 68 + F + Q + R V ++ + S + P + Sbjct: 28 ATYLKLFSGEMFKGFQ-HNTIARDLVMKRTLKNGKSLQFIYTGRMTSDYHTPGTPILGNS 86 Query: 69 A---TDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGML 125 ++ V + + S + + A+ D I + + Sbjct: 87 DKAPPVAEKTIVMDDLLVSSAFVYDLDETLSHYDLRGEISRKIGYALAENYDRKIFRAIA 146 Query: 126 GVNKKGKIGAETEFFSKE------NILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSE 179 ++ + + F N + A ++ ++ + + Sbjct: 147 KGARQASPISASGFVEPGGTQIQLNATQNNTQATTASNLVTGFYDAAAVLDEKGVSSEG- 205 Query: 180 QVYVLIPSDVWASL--FALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPA 237 +V VL P +A + ++D + + AG+ +P F Sbjct: 206 RVAVLNPRQYYALIQETGDNGLINRDVQGQGLQSGTGVVSIAGIKIYKSMNLPFLGKFGT 265 Query: 238 GTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVV 282 + + +S+ V +V Sbjct: 266 ANTISNAGSFVGQSMDSAAGKQSATYARSGTTITVTLTAHGISVG 310 Score = 45.3 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 14/103 (13%), Positives = 27/103 (26%), Gaps = 3/103 (2%) Query: 224 INMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVF 283 + VP + F G + N V + I+ + A Sbjct: 327 YTVATVPNANTFTITDTASGTVSSSACTFNIAG-VNNGYGESGDFAGSCGLIFQREAAGV 385 Query: 284 TQRKAIDVQHS--KDPGKWHAPQITLTSSFGATRIEPDKILGI 324 + VQ + + I + GA + P + + Sbjct: 386 VEAIGPQVQVTNGDISVIYQGDVILGRLAMGADYLNPAACVEL 428 >gi|146276489|ref|YP_001166648.1| hypothetical protein Rsph17025_0437 [Rhodobacter sphaeroides ATCC 17025] gi|145554730|gb|ABP69343.1| hypothetical protein Rsph17025_0437 [Rhodobacter sphaeroides ATCC 17025] Length = 308 Score = 119 bits (297), Expect = 7e-25, Method: Composition-based stats. 
Identities = 50/333 (15%), Positives = 98/333 (29%), Gaps = 56/333 (16%) Query: 14 EFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQD 73 + V + Q+T+ VT+ + GEA ++ ++ E Sbjct: 14 MYANSVTMVAQQTRDPFAGAVTDASATGEAQSVTDLVDAGEYAYGEERSRRNPEMPISGG 73 Query: 74 RRWVGHSQF-GWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK---------- 122 RRWV + ID + +P + T + R + + L Sbjct: 74 RRWVVMPPVIESGQYIDKEDKFRTATDPTSVIVTTHTKRVIRGKADRTLGIRKAEDGTYA 133 Query: 123 ----GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYI-DVD 177 G+LG +GK G + + +L A + D Sbjct: 134 VLDGGILGYATEGKRGTSQVGLPSSQFVPV----GTTGLTLDKLRDAVKTLKLADFGMED 189 Query: 178 SEQVYVLIPSDVWASLFALERATSKDYIN--TAALQAGKIEAFAGVWFINMEKVPGNDLF 235 + +Y I + L A+ A+ + L+ GK GV ++ +VP Sbjct: 190 DDPLYCAITPNQEDDLLAIAAASGANLNTFSIDQLRTGKPTMLMGVNWLLTNRVPV---- 245 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 D + PI+ K +V + ++ Sbjct: 246 ------------------------------DAADSRLCPIWSKKNIVRGIWQDVEGDMWN 275 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGIEISK 328 D + P +++ R++ ++ IE + Sbjct: 276 DTHAKNLPYAYVSAYIDCVRVQDKGVIVIECKE 308 >gi|115304375|ref|YP_762667.1| PfWMP4_37 [Cyanophage Pf-WMP4] gi|113201869|gb|ABI33181.1| PfWMP4_37 [Phormidium phage Pf-WMP4] Length = 341 Score = 115 bits (288), Expect = 9e-24, Method: Composition-based stats. 
Identities = 53/353 (15%), Positives = 104/353 (29%), Gaps = 34/353 (9%) Query: 1 MATKEQLA---------TANIY-EFKKHVELALQETKSKLR-PTVTEQATEGEASALVEV 49 MA + I ++ V++ L V + + V Sbjct: 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMF--RKAKMLDTSVVKTWGAQVKKGDTFHV 58 Query: 50 --FKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASL 107 + D+P + D D + A +D + + + PY Sbjct: 59 PRISELGVEDKATDVPVGVQPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEA 118 Query: 108 ATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKS 167 A+ + +LG+ + A FS N + + A+ Sbjct: 119 MGYALAKDM----TGSILGLRAAVQNTASQNVFSSSNG---AITGNGQAFSFAVFLAARR 171 Query: 168 IFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINME 227 + + + E++ +LI ++LF + + SKD+IN A + G+I + GV I Sbjct: 172 LLLEADVPE--EKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRVIRTS 229 Query: 228 KVPGNDLFPAGTKFPGLIDGKVEY--------PNGKPTVKSSAKFEDTK-IKYVLPIYCK 278 + N P + + P A F + + Sbjct: 230 LIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHM 289 Query: 279 SAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKILGIEISKDSL 331 KA V S + + + ++GA P + I + D++ Sbjct: 290 DWAAAVVSKAPRVTQSFE-NREQVWLMVGRQAYGARLYRPLHAVNIHTTGDTV 341 >gi|167565010|ref|ZP_02357926.1| minor capsid protein 10 [Burkholderia oklahomensis EO147] Length = 303 Score = 111 bits (278), Expect = 1e-22, Method: Composition-based stats. 
Identities = 31/319 (9%), Positives = 81/319 (25%), Gaps = 31/319 (9%) Query: 14 EFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQD 73 F V A + + E+ S ++A + + Sbjct: 1 MFSGEVLTAF-TAATLTKGKTREKNITSGKSYQFPRTGTSQAEYLQRGQEMLGNPFATGE 59 Query: 74 RRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKI 133 + F S + P + A+ R D+ + + + + Sbjct: 60 VEVTIDGPLVAHHALWDFDVAMSQFDVRGPMTADMGQALARMYDQNNFRQIALAARTAAV 119 Query: 134 GAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASL 193 G + G I + +++ + Y+++ +V+ ++ Sbjct: 120 GEFPGGDRIVDSSLLSTGTAIDGLAWMDAIRKAKLVKQKKNLPAAAPWYMVVTPEVFDAI 179 Query: 194 ----FALERATSKDYINTAALQA-----GKIEAFAGVWFINMEKVPGNDLFPAGTKFPGL 244 + + + + + A + F GV ++ +P + F Sbjct: 180 KYAKNSAGQFVNLNSLVQLATAGVGAVPTEAIRFEGVTILSSNLLPQANDSANTKVFS-- 237 Query: 245 IDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQ 304 K+ K ++ AV I + ++D + Sbjct: 238 ------------------KYRADFSKLSGLMWQPEAVAVLTLMGISTETTRD-VRRQEDF 278 Query: 305 ITLTSSFGATRIEPDKILG 323 I + G + + + Sbjct: 279 IVSKQAVGHGTLRAECAVE 297 >gi|291334269|gb|ADD93932.1| hypothetical protein amb4267 [uncultured marine bacterium MedDCM-OCT-S08-C235] Length = 175 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. 
Identities = 35/225 (15%), Positives = 64/225 (28%), Gaps = 52/225 (23%) Query: 104 YASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLI 163 A + A+ RK DE I + G + D + ++ Sbjct: 3 VAQSSAGALGRKTDELITTALDGTSNLSGN------------------SDSDGLTLAKIN 44 Query: 164 TAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWF 223 + I D ++ +V+ P L I AFA F Sbjct: 45 GVFGSMGEGDIPDDGDRYFVVSPDGWIDLL--------------------AINAFADADF 84 Query: 224 INMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVF 283 I +++P A L P T + Y +S V Sbjct: 85 IGPDELPYKGGMVAKRWLGFLWMTHSGLP-------------VTGGRRQCFAYHRSGVGV 131 Query: 284 TQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKILGIEISK 328 + + + P + + IT S G I+ + + ++I++ Sbjct: 132 AMGADVTTEINYIPER-VSNLITAYMSLGVVLIDDNAVFEVQITE 175 >gi|291335771|gb|ADD95373.1| minor capsid protein [uncultured phage MedDCM-OCT-S05-C429] Length = 256 Score = 103 bits (255), Expect = 5e-20, Method: Composition-based stats. Identities = 25/248 (10%), Positives = 56/248 (22%), Gaps = 25/248 (10%) Query: 100 PLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKT-- 157 + A+ K D I + + + ++ F V Sbjct: 1 MRGEISKKIGYALAEKYDRLIFRAITRGARAASPITKSNFVEPGGTQIRVGATTNDSDAY 60 Query: 158 ----FIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLF---ALERATSKDYINTAAL 210 + A + ++ + ++ + L ++D TA Sbjct: 61 VASNLVTAFYDAAAALDEKGVSSQG--RCAVLNPRQYYELITGVGTNGLINRDAQGTALQ 118 Query: 211 QAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIK 270 I AG+ +P + + + +A T I Sbjct: 119 SGNGIIEIAGIKIYKSMNIPFLGKYGTAFGGTTGVTSPSNMGSHIGPALENASGASTGIN 178 Query: 271 ------------YVLPIYCKSAVVF--TQRKAIDVQHSKDPGKWHAPQITLTSSFGATRI 316 I+ K A + V + I + G+ + Sbjct: 179 NDYGTATEVAAKSCGLIFQKEAAGVVEAIGPQVQVTSGDVSVVYQGDVILGRMAMGSDYL 238 Query: 317 EPDKILGI 324 P + + Sbjct: 239 NPAAAVEL 246 >gi|307946245|ref|ZP_07661580.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307769909|gb|EFO29135.1| conserved hypothetical protein [Roseibium sp. 
TrichSKD4] Length = 330 Score = 97.7 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 42/327 (12%), Positives = 88/327 (26%), Gaps = 50/327 (15%) Query: 3 TKEQLATANIYEFKKHVELALQETKSKLRPTVTE-QATEGEASALVEVFKPTEAHEIVGD 61 Q ++ + QE ++LRPTV+ EG A+ + ++A + + Sbjct: 40 MTAQAPVWFQTQYPQRAMHIYQEKGNRLRPTVSHPVRFEGSEKAIFYLAGTSKAVKKTRN 99 Query: 62 MPDTIYNATDQDRR--WVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEA 119 +T T R+ V + + ++ + I+ A+ R DE Sbjct: 100 QKNTP---TGGQRKKFEVPLETWTVFDTVEEWDLDRMTIDEREIVYESGAMALGRATDEE 156 Query: 120 ILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSE 179 I M GV G + + + + D Sbjct: 157 IYAKMAGVKSSVDGGLDF---------------SASAFDAANAMVLCEALQDMKVPWDG- 200 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 + +P+ W L A + + +P Sbjct: 201 NTWCGLPAKQWNQLLANKVVNNSQ--------------------HVGSDMPFVKATDTRF 240 Query: 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGK 299 VE +D L + +SA+ + I+++ Sbjct: 241 WNGVNWFLFVEQEPQALYPVPGENKQD------LFAWHQSAIGWAAHTDINMREQWHNE- 293 Query: 300 WHAPQITLTSSFGATRIEP-DKILGIE 325 + I + + A ++ + I+ Sbjct: 294 YDWWSINMKAKGAAKELQEGNGIVRFR 320 >gi|291334892|gb|ADD94530.1| hypothetical protein [uncultured phage MedDCM-OCT-S08-C151] Length = 151 Score = 96.9 bits (239), Expect = 4e-18, Method: Composition-based stats. 
Identities = 23/167 (13%), Positives = 56/167 (33%), Gaps = 24/167 (14%) Query: 163 ITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYI--NTAALQAGKIEAFAG 220 F +R I ++ +L P++ + + R D+ + +GK++ AG Sbjct: 1 YDIAQTFDERDIPPT-DRFCILPPAEYYKLAESATRTVDVDFNPQGNGSFASGKVQQVAG 59 Query: 221 VWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSA 280 + + VP +++ G ++ + K + ++ KSA Sbjct: 60 IPVMMSNNVPQSNV-------------------GSNPSGANNTYSGDDSKTIGLVFHKSA 100 Query: 281 VVFTQRKAIDVQH--SKDPGKWHAPQITLTSSFGATRIEPDKILGIE 325 V + + + S + + + G + P+ I+ Sbjct: 101 VGTVKLMDMTTEISGSDYGIMYQGTLMVAKYALGHGILRPECAATIK 147 >gi|288922767|ref|ZP_06416936.1| hypothetical protein FrEUN1fDRAFT_6634 [Frankia sp. EUN1f] gi|288345880|gb|EFC80240.1| hypothetical protein FrEUN1fDRAFT_6634 [Frankia sp. EUN1f] Length = 277 Score = 95.0 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 35/319 (10%), Positives = 78/319 (24%), Gaps = 57/319 (17%) Query: 11 NIYEFKKHVELALQETKSKLRPTVTE----QATEGEASALVEVFKPTEAHEIVGDMPDTI 66 + L + + + + + + Sbjct: 6 FKPQIWVAALLESIKKNLVYAELCNRDYEGEIRAAGDTVRITSISRPSISTYARNTDISY 65 Query: 67 YNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLG 126 TD R V + W ID + + + + A+ A+ D+ + Sbjct: 66 EELTDAQRTLVVDQEKYWGFTIDDVDAAQARASVVSEAMAEASYALADTVDQFVAGLYTQ 125 Query: 127 VNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIP 186 VN ++G + QL + I +V++P Sbjct: 126 VNTANQLGTVSV--------------TTADLAYTQLRLLSLKLDEANIPTAG--RWVVVP 169 Query: 187 SDVWASLFALERATS-KDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLI 245 + L + + ++ +T L G++ G VP Sbjct: 170 PWYHSLLLENSKFVNYQNSNSTEPLYNGRVGRALGFDIRMSNNVPL-------------- 215 Query: 246 DGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQI 305 T Y + A+ F Q+ + + K + Sbjct: 216 --------------------VTGDDYAVIAGTNRAMTFAQQM-MKTEAG-RSEKRFGDWM 253 Query: 306 TLTSSFGATRIEPDKILGI 324 + +GA + P+ + + Sbjct: 254 RGLAVYGAKVLRPEGLATV 272 >gi|302389838|ref|YP_003825659.1| hypothetical protein Toce_1280 [Thermosediminibacter oceani DSM 16646] 
gi|302200466|gb|ADL08036.1| conserved hypothetical protein [Thermosediminibacter oceani DSM 16646] Length = 276 Score = 92.3 bits (227), Expect = 9e-17, Method: Composition-based stats. Identities = 39/288 (13%), Positives = 74/288 (25%), Gaps = 54/288 (18%) Query: 41 GEASALVEVFKPTEAHEIVGDMPD-TIYNATDQDRRWVGHSQFGWAERIDPFATLDSGIN 99 ++ + + + V + ++D +N Sbjct: 40 KGSTVKINSIGSINIGDYDKNTGIGDPQELDSYQTTLVIDQAKYFNFKVDDVDKAQMNVN 99 Query: 100 PLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFI 159 + A A+ D+ I V IG++ Sbjct: 100 LVDAAMQEAAYALADAMDQYIASLYTEVAPGNTIGSDESPIVP-----------TKDNAY 148 Query: 160 GQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFA 219 L+ + + + +V++PS L R TSK L G+I Sbjct: 149 DYLVDLLVKLDEANVPKNG--RFVVVPSWFAGLLKKDPRFTSKTD----VLITGEIGMVD 202 Query: 220 GVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKS 279 G VP KY + + Sbjct: 203 GATIYESNNVPNVG----------------------------------GQKYKIMAGYRG 228 Query: 280 AVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKILGIEIS 327 A+ F +AI+ + P K A + + +GA I+P+ I + + Sbjct: 229 AIAFV--RAINSIEAYRPEKSFADAVKGLALYGAKVIKPNAIAVMTCN 274 >gi|294478926|gb|ADE87491.1| capsid protein [Deep-sea thermophilic phage D6E] Length = 289 Score = 91.1 bits (224), Expect = 2e-16, Method: Composition-based stats. 
Identities = 40/325 (12%), Positives = 79/325 (24%), Gaps = 55/325 (16%) Query: 1 MATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEG-----EASALVEVFKPTEA 55 MA + T + + LQ T + V + EG + + Sbjct: 1 MAINNFIPT----VWSARLLQNLQRTLVYGQAAVINRDYEGEIRAYGDTVKINNIGRISV 56 Query: 56 HEIVGD-MPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHR 114 + + TD+ R V + ++D + + A A+ Sbjct: 57 GDYTKNANMPDPETLTDETRTLVIDQAKFFNFQVDDVDRIQQNPKLMDEAMREAAYALRN 116 Query: 115 KQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYI 174 D+ I + L+ + + Sbjct: 117 AADQFIASHYVDAAHTIGSDTSPVQP-------------TKTDAYEYLVDLSVKLDEADV 163 Query: 175 DVDSEQVYVLIPSDVWASLFALERATSKDYINTAA-LQAGKIEAFAGVWFINMEKVPGND 233 +V++P + +R ++ L G I AG + VP Sbjct: 164 PEQG--RWVIVPPWFEGLMLKDDRFVKTGSLSAEDRLVNGVIGRAAGFLVLKSNNVP--- 218 Query: 234 LFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQH 293 V + + Y + A F ++ V+ Sbjct: 219 ------------------------VVPANAQSGVQENYKIIAGHPMAWSFAEQVN-QVE- 252 Query: 294 SKDPGKWHAPQITLTSSFGATRIEP 318 + P K A + +GA + P Sbjct: 253 AYRPEKRFADAVKGLHLYGAKTVRP 277 >gi|167841463|ref|ZP_02468147.1| minor capsid protein 10 [Burkholderia thailandensis MSMB43] Length = 337 Score = 88.8 bits (218), Expect = 1e-15, Method: Composition-based stats. 
Identities = 31/317 (9%), Positives = 86/317 (27%), Gaps = 21/317 (6%) Query: 12 IYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATD 71 I E+ VE + S +R V + +G ++ + ++ Sbjct: 37 IEEYGGVVEHTIARR-SIVRNFVPIRNVKGTSTVSNYQVGKSTLAKVTPGTEPDATVNGT 95 Query: 72 QDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKG 131 Q + + + S + + + D++ + Sbjct: 96 QKVKLTIDTLVNARAVVPLLDDFQSSYDARAAIGMEHGIEIAKFFDQSFFIQAVKAAGIT 155 Query: 132 KIGAETEFFSKENILSAVEGDDFFKTFI--GQLITAKSIFRKRYIDVDSEQVYVLIPSDV 189 + + + + D + + + + +D + + ++ Sbjct: 156 DMSQYPAGWQPGSSQTFTAAGDELDPVKLESKFLDLFAQMADKDVDPHDDGLVIVTRPKF 215 Query: 190 WASLFALERATSKD-YINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGK 248 + +L +R ++ + K + AGV +P ++ Sbjct: 216 FYTLLKNDRLVDREMITSDGTTIKTKALSVAGVPIYFSNNLPNTNVT------------- 262 Query: 249 VEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLT 308 + + ++ K V ++ A++ + + DP + I Sbjct: 263 ---GHFLSNAGNGNAYDGDFSKTVAAVFSPRALLAGETIPLTPDVFYDP-RTKMWFIDAH 318 Query: 309 SSFGATRIEPDKILGIE 325 SFG T P ++ Sbjct: 319 LSFGVTPNNPAFAGLLK 335 >gi|169628877|ref|YP_001702526.1| bacteriophage protein [Mycobacterium abscessus ATCC 19977] gi|169240844|emb|CAM61872.1| Bacteriophage protein [Mycobacterium abscessus] Length = 276 Score = 87.7 bits (215), Expect = 3e-15, Method: Composition-based stats. 
Identities = 31/253 (12%), Positives = 63/253 (24%), Gaps = 20/253 (7%) Query: 9 TANIYEFKKHVELALQETKSKLRPTV-TEQATEGEASALVEVFKPTEAH---EIVGDMPD 64 T I E L K+ V + E + + Sbjct: 4 THFIPEIWSSYILERYMAKNVFASLVDRKYEGEARKGNTIHIPGVVAPAVKDYKAASRTT 63 Query: 65 TIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGM 124 + +D + + + +D S N L Y A ++ D+ I + Sbjct: 64 SADAISDTGIDILIDQEKNFDFYVDDIDNAQSNENLLPLYTDAAGDSLATDADQFIANLL 123 Query: 125 LGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVL 184 + + + + A+ + K + D +V V+ Sbjct: 124 VANATGMPWSSNPTTGDGAFNV---------------VKDARKLMNKANVPDDDLRVAVV 168 Query: 185 IPSDVWASLFALERATSKDYINTAA-LQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPG 243 + A + TS D A L+ + G + +P +D A Sbjct: 169 NAEFEALLVGADSKLTSFDSSGDTAGLRNATVGKLLGFRVVTSNNLPESDSPQAVFFHQR 228 Query: 244 LIDGKVEYPNGKP 256 + + Sbjct: 229 AAAFVSQIDEVEG 241 >gi|328553954|gb|AEB24446.1| hypothetical protein BAMTA208_11405 [Bacillus amyloliquefaciens TA208] Length = 286 Score = 87.7 bits (215), Expect = 3e-15, Method: Composition-based stats. 
Identities = 27/282 (9%), Positives = 58/282 (20%), Gaps = 51/282 (18%) Query: 42 EASALVEVFKPTEAHEIVGDMP-DTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINP 100 S + + + D R+ + + +ID + Sbjct: 43 GDSVTINNMGRVSVGDYTKNQDMDNAQTLDSTSRKLLIDQSKYFNFQIDDVDKIQQNPKL 102 Query: 101 LLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIG 160 + A A+ D I + + Sbjct: 103 MDAAMQEAAYALKNTADSYIASHYVDAAHTIGSDTKVVSP-------------TKNDAYE 149 Query: 161 QLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAA-LQAGKIEAFA 219 L+ + + +V++ + +R +++ L G I A Sbjct: 150 YLVDLSVKLDEADVPEQG--RWVVVTPWYEGLMLKDDRFVKAGNMSSEQRLLNGVIGQAA 207 Query: 220 GVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKS 279 G + P + E + + Sbjct: 208 GFTVLKSNNAPLSKP------------------------------EGGTENHKIIAGHGM 237 Query: 280 AVVFTQRKAIDVQHS-KDPGKWHAPQITLTSSFGATRIEPDK 320 A + Q P K A + +GA P+ Sbjct: 238 AWSYA---DQATQVEAYRPEKRFADAVKGLHLYGAKVTRPEA 276 >gi|159897186|ref|YP_001543433.1| hypothetical protein Haur_0657 [Herpetosiphon aurantiacus ATCC 23779] gi|159890225|gb|ABX03305.1| conserved hypothetical protein [Herpetosiphon aurantiacus ATCC 23779] Length = 283 Score = 86.9 bits (213), Expect = 4e-15, Method: Composition-based stats. 
Identities = 34/281 (12%), Positives = 72/281 (25%), Gaps = 51/281 (18%) Query: 42 EASALVEVFKPTEAHEIVGDMPD-TIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINP 100 + + P + TD + + + ++D + Sbjct: 43 GDTVKINSIGPVTIGNYTKNTNIGDPETLTDAQMTLLINQAKYFNFQVDDIDRAQQKPSV 102 Query: 101 LLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIG 160 + A+ + D I GV IG +T + Sbjct: 103 MDEAMKEASYGLRDVSDGFIASLYTGVAAGNVIGNDTTPVTP-----------TSANAYD 151 Query: 161 QLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDY-INTAALQAGKIEAFA 219 L+ ++ + + + + ++P L +R + L+ G+I + A Sbjct: 152 YLVDLGTLLDEANVPSEG--RWTIVPPWFHGLLLKDDRFVGVGSASSDQVLRNGQIGSAA 209 Query: 220 GVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKS 279 G + VP KY + Sbjct: 210 GFSVLKSNSVPN----------------------------------VAGAKYKIMAGHPM 235 Query: 280 AVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDK 320 A+ F ++ + V+ P K A + +GA + P Sbjct: 236 AISFAEQI-VKVE-GYRPEKRFADAVKGLHVYGAKVVRPTA 274 >gi|197935885|ref|YP_002213721.1| major capsid-like protein [Ralstonia phage RSB1] gi|197927048|dbj|BAG70390.1| major capsid-like protein [Ralstonia phage RSB1] Length = 336 Score = 86.5 bits (212), Expect = 6e-15, Method: Composition-based stats. 
Identities = 34/315 (10%), Positives = 84/315 (26%), Gaps = 20/315 (6%) Query: 12 IYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATD 71 I E+ +E + S +R + ++ G + + ++ Sbjct: 37 IEEYGGQIEGTIARK-SIVRNFIPVRSVTGTSILSNFRIGESTLAKVTPGTAPDGTVNQA 95 Query: 72 QDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKG 131 + + + + + + D+A L + + Sbjct: 96 AKVSLRIDTLINARSMVPLLDDFQNSYDARMAIGQEHGKKFAKFIDQAFLIQAVKAAQLS 155 Query: 132 KIGAETE-FFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVW 190 G +A ++ + + +D S+ V V++ + Sbjct: 156 NSGLPAGWSGGTAKTFAAAGDENDPAKLEALFSDLFADMEGKDVDPISDDVVVVLKPAAY 215 Query: 191 ASLFALERATSKDY-INTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKV 249 +L R +D+ ++ K + GV +P ++ Sbjct: 216 YTLLKNNRLVDRDFVLSDGTEIKTKSLSVYGVPVYVSNNLPTTNI--------------- 260 Query: 250 EYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTS 309 + +S ++ K V + A++ + + DP I + Sbjct: 261 -SGHELSNAGNSNAYDGDFTKVVAAAFSPKALLAGETIPLTPDVFYDPISKM-WFIDAHT 318 Query: 310 SFGATRIEPDKILGI 324 SFG T P + Sbjct: 319 SFGVTPDNPAYAGVL 333 >gi|149227912|gb|ABR22956.1| minor capsid protein [Enterobacteria phage T7] Length = 188 Score = 86.1 bits (211), Expect = 7e-15, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 49/160 (30%), Gaps = 8/160 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTVGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREDTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGIEISKDSLKGVP 335 + A QI + G + P+ + + GV Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAVVFQSGVMLGVA 152 >gi|149227927|gb|ABR22966.1| minor capsid protein [Enterobacteria phage T7] Length = 188 Score = 85.7 bits (210), Expect = 8e-15, Method: Composition-based stats. 
Identities = 22/160 (13%), Positives = 49/160 (30%), Gaps = 8/160 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGIEISKDSLKGVP 335 + A QI + G + P+ + + GV Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAVVFQSGVMLGVA 152 >gi|291334268|gb|ADD93931.1| hypothetical protein Magn03010160 [uncultured marine bacterium MedDCM-OCT-S08-C235] Length = 87 Score = 85.7 bits (210), Expect = 9e-15, Method: Composition-based stats. Identities = 15/79 (18%), Positives = 28/79 (35%), Gaps = 1/79 (1%) Query: 3 TKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDM 62 ++TA I +F+ V +A Q SKL+ T+ ++ + A+ + A + Sbjct: 1 MSIGISTAFIKQFESDVHMAYQRMGSKLKDTIRQKPSVNGNQAVFQKVGKGSAVQK-SRH 59 Query: 63 PDTIYNATDQDRRWVGHSQ 81 D V Sbjct: 60 GQVPIMNIDHTNVTVTLQD 78 >gi|149227939|gb|ABR22974.1| minor capsid protein [Enterobacteria phage T7] gi|149227942|gb|ABR22976.1| minor capsid protein [Enterobacteria phage T7] Length = 188 Score = 85.7 bits (210), Expect = 9e-15, Method: Composition-based stats. 
Identities = 22/160 (13%), Positives = 50/160 (31%), Gaps = 8/160 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGIEISKDSLKGVP 335 + A QI + G + P+ + + + GV Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAVVFQSEVMLGVA 152 >gi|149227945|gb|ABR22978.1| minor capsid protein [Enterobacteria phage T7] Length = 188 Score = 85.4 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 49/160 (30%), Gaps = 8/160 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPTADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEDNVKVAKDNIIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGIEISKDSLKGVP 335 + A QI + G + P+ + + GV Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAVVFQSGVMLGVA 152 >gi|149227948|gb|ABR22980.1| minor capsid protein [Enterobacteria phage T7] gi|149227951|gb|ABR22982.1| minor capsid protein [Enterobacteria phage T7] Length = 188 Score = 85.4 bits (209), Expect = 1e-14, Method: Composition-based stats. 
Identities = 22/160 (13%), Positives = 49/160 (30%), Gaps = 8/160 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPTADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEDNVKVAKDNIIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGIEISKDSLKGVP 335 + A QI + G + P+ + + GV Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAVVFQSGVMLGVA 152 >gi|312134863|ref|YP_004002201.1| hypothetical protein Calow_0833 [Caldicellulosiruptor owensensis OL] gi|311774914|gb|ADQ04401.1| hypothetical protein Calow_0833 [Caldicellulosiruptor owensensis OL] Length = 277 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 37/324 (11%), Positives = 71/324 (21%), Gaps = 55/324 (16%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQ---ATEGEASALVEVFKPTEAHEIVGDMPD- 64 T I L + + + + + + Sbjct: 4 TNFIPTIWSARLLENLQKRLVYTNITNNDYEGDVKFGNAVKINAIGRVNIFDYAKYTALP 63 Query: 65 TIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGM 124 + + + +D + +N + A + D+ I Sbjct: 64 DPQVLDSTQQTLLIDQAKAFNFAVDDIDKAQANVNLMDAAMRQAAQDIKDVIDKFIASHY 123 Query: 125 LGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVL 184 T L+ A + + I D + Sbjct: 124 TYAANAIGDDTTPIVP-------------TATTAYELLVDASTKLDEMDIPSDG--RVAI 168 Query: 185 IPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGL 244 +P L +R L+ G + AG VP Sbjct: 169 VPPWFHGLLRKDDRFVKYTSEGQQVLRTGLVGEAAGFQIFISNNVPN------------- 215 Query: 245 IDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQ 304 T KY + A+ F + I+ + P K A Sbjct: 216 ---------------------TTGTKYKILCGHPMAITFA--QQIEKIEAYRPEKLFADA 252 Query: 305 ITLTSSFGATRIEPDKILGIEISK 328 + +GA I P+ ++ I +K Sbjct: 253 VKGLVVYGAKVIRPEALVVITANK 276 >gi|149227936|gb|ABR22972.1| minor capsid protein [Enterobacteria phage T7] Length = 188 Score = 85.0 bits (208), 
Expect = 2e-14, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 50/160 (31%), Gaps = 8/160 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEGNVKVAKDNIIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGIEISKDSLKGVP 335 + A QI + G + P+ + + + GV Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAVVFQSEVMLGVA 152 >gi|149227915|gb|ABR22958.1| minor capsid protein [Enterobacteria phage T7] Length = 188 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 50/160 (31%), Gaps = 8/160 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PVNKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGIEISKDSLKGVP 335 + A QI + G + P+ + + + GV Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAVVFQSEVMLGVA 152 >gi|149227930|gb|ABR22968.1| minor capsid protein [Enterobacteria phage T7] Length = 188 Score = 84.6 bits (207), Expect = 2e-14, Method: Composition-based stats. 
Identities = 22/160 (13%), Positives = 49/160 (30%), Gaps = 8/160 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VSAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGIEISKDSLKGVP 335 + A QI + G + P+ + + GV Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAVVFQSGVMLGVA 152 >gi|149227921|gb|ABR22962.1| minor capsid protein [Enterobacteria phage T7] Length = 188 Score = 83.8 bits (205), Expect = 3e-14, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 49/160 (30%), Gaps = 8/160 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEDNVKVAKDNIIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGIEISKDSLKGVP 335 + A QI + G + P+ + + G+ Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAVVFQSGVMLGMA 152 >gi|149227918|gb|ABR22960.1| minor capsid protein [Enterobacteria phage T7] Length = 188 Score = 83.8 bits (205), Expect = 4e-14, Method: Composition-based stats. 
Identities = 22/159 (13%), Positives = 50/159 (31%), Gaps = 8/159 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIHNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGIEISKDSLKGV 334 + A QI + G + P+ + + + GV Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAVVFQSEVMLGV 151 >gi|149227933|gb|ABR22970.1| minor capsid protein [Enterobacteria phage T7] Length = 188 Score = 83.8 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 50/160 (31%), Gaps = 8/160 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKDEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGIEISKDSLKGVP 335 + A QI + G + P+ + + + GV Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEATGAVVFQSEVMLGVA 152 >gi|149227924|gb|ABR22964.1| minor capsid protein [Enterobacteria phage T7] Length = 188 Score = 83.8 bits (205), Expect = 4e-14, Method: Composition-based stats. 
Identities = 22/160 (13%), Positives = 50/160 (31%), Gaps = 8/160 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGIEISKDSLKGVP 335 + A QI + G + P+ + + + GV Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEATGAVVFQSEVMLGVA 152 >gi|149227916|gb|ABR22959.1| major capsid protein [Enterobacteria phage T7] Length = 146 Score = 83.4 bits (204), Expect = 4e-14, Method: Composition-based stats. Identities = 20/149 (13%), Positives = 46/149 (30%), Gaps = 8/149 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PVNKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAV 141 >gi|149227928|gb|ABR22967.1| major capsid protein [Enterobacteria phage T7] Length = 146 Score = 83.4 bits (204), Expect = 4e-14, Method: Composition-based stats. 
Identities = 20/149 (13%), Positives = 46/149 (30%), Gaps = 8/149 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAV 141 >gi|149227913|gb|ABR22957.1| major capsid protein [Enterobacteria phage T7] Length = 146 Score = 83.4 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 20/149 (13%), Positives = 46/149 (30%), Gaps = 8/149 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTVGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREDTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAV 141 >gi|149227919|gb|ABR22961.1| major capsid protein [Enterobacteria phage T7] Length = 146 Score = 83.4 bits (204), Expect = 5e-14, Method: Composition-based stats. 
Identities = 20/149 (13%), Positives = 46/149 (30%), Gaps = 8/149 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIHNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAV 141 >gi|149227940|gb|ABR22975.1| major capsid protein [Enterobacteria phage T7] gi|149227943|gb|ABR22977.1| major capsid protein [Enterobacteria phage T7] Length = 146 Score = 83.4 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 20/149 (13%), Positives = 46/149 (30%), Gaps = 8/149 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAV 141 >gi|149227937|gb|ABR22973.1| major capsid protein [Enterobacteria phage T7] Length = 146 Score = 83.0 bits (203), Expect = 6e-14, Method: Composition-based stats. 
Identities = 20/149 (13%), Positives = 46/149 (30%), Gaps = 8/149 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEGNVKVAKDNIIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAV 141 >gi|149227946|gb|ABR22979.1| major capsid protein [Enterobacteria phage T7] gi|149227949|gb|ABR22981.1| major capsid protein [Enterobacteria phage T7] gi|149227952|gb|ABR22983.1| major capsid protein [Enterobacteria phage T7] Length = 146 Score = 82.7 bits (202), Expect = 7e-14, Method: Composition-based stats. Identities = 20/149 (13%), Positives = 46/149 (30%), Gaps = 8/149 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPTADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEDNVKVAKDNIIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAV 141 >gi|149227922|gb|ABR22963.1| major capsid protein [Enterobacteria phage T7] Length = 146 Score = 82.7 bits (202), Expect = 8e-14, Method: Composition-based stats. 
Identities = 20/149 (13%), Positives = 46/149 (30%), Gaps = 8/149 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEDNVKVAKDNIIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAV 141 >gi|149227931|gb|ABR22969.1| major capsid protein [Enterobacteria phage T7] Length = 146 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 20/149 (13%), Positives = 46/149 (30%), Gaps = 8/149 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VSAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEAAGAV 141 >gi|149227934|gb|ABR22971.1| major capsid protein [Enterobacteria phage T7] Length = 146 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. 
Identities = 20/149 (13%), Positives = 46/149 (30%), Gaps = 8/149 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKDEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEATGAV 141 >gi|149227925|gb|ABR22965.1| major capsid protein [Enterobacteria phage T7] Length = 146 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 20/149 (13%), Positives = 46/149 (30%), Gaps = 8/149 (5%) Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 V + D ++++ A + +Y + G I G + + + Sbjct: 1 VPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 60 Query: 236 PAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSK 295 A G P K + K + +SAV + + + ++ ++ Sbjct: 61 TAREGTTGQKHVF-------PANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 113 Query: 296 DPGKWHAPQITLTSSFGATRIEPDKILGI 324 + A QI + G + P+ + Sbjct: 114 RAN-FQADQIIAKYAMGHGGLRPEATGAV 141 >gi|327492214|gb|AEA86235.1| P22 coat protein 5 [Clostridium phage CP26F] Length = 276 Score = 78.4 bits (191), Expect = 1e-12, Method: Composition-based stats. 
Identities = 30/287 (10%), Positives = 76/287 (26%), Gaps = 28/287 (9%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEG-----EASALVEVFKPTEAHEIVGDMP 63 ++ I + LA + + V + EG + + ++ + Sbjct: 4 SSFIPKIWSARLLAHLDKAHVVANLV-NRDYEGEIKAYGDTVKINQIGAITVNDYTKNTD 62 Query: 64 D-TIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 + ++ Q + +ID + A A+ + ++ +LK Sbjct: 63 IHDPEELSTTEKVLTIDKQKYFNFQIDDVDAAQVRTPLMDAAMQRAAYALAEETEKVLLK 122 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + + + +L+ K K + + Sbjct: 123 AIDTDA-------THKIVPEATLDPTNI--------YKELVGVKLKLDKANVPTVG--RF 165 Query: 183 VLIPSDVWASLFALERATSK-DYINTAALQAGKIEAFAGVWFINMEK---VPGNDLFPAG 238 ++I + A L R + + L+ G + G+ + + AG Sbjct: 166 LIISPETHALLLQEGRFVATGGAMAEGILKNGLVGQILGMDVYLSNNIDSLTNGNGAIAG 225 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQ 285 K ++ K A + + A+V + Sbjct: 226 VKMACTFAEQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLK 272 >gi|322689249|ref|YP_004208983.1| hypothetical protein BLIF_1063 [Bifidobacterium longum subsp. infantis 157F] gi|320460585|dbj|BAJ71205.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis 157F] Length = 285 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. 
Identities = 33/235 (14%), Positives = 59/235 (25%), Gaps = 22/235 (9%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTE--QATEGEASALVEVFK--PTEAHEIVGDMPD 64 T I E L + V + V + + Sbjct: 4 TNFIPELWSANILLELQKNLVYGSAVNRDYEGDIANYGDTVHITGIAHISIGDYTAHTDI 63 Query: 65 TIYNATDQDR-RWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKG 123 TI ATD+D V + +A ID + N Y+ A + D+ + Sbjct: 64 TIEPATDKDAGELVINQSKYFAFEIDDVEKRQAMNNLTAAYSRDAAYKLRDLTDQYLAGL 123 Query: 124 MLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYV 183 M K ++ + K+ + +V Sbjct: 124 MAAGAKSK---------------LDPISGATATKAYDTIVDLATALDKQNVPDAG--RWV 166 Query: 184 LIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 ++ D + L R + + L G + AG+ + P A Sbjct: 167 IVTPDFYGLLRKDSRFVAGAESAHSTLLNGVVGEAAGMTILKSNNAPAAKGGSAS 221 >gi|169343190|ref|ZP_02864210.1| conserved hypothetical protein [Clostridium perfringens C str. JGS1495] gi|169298715|gb|EDS80792.1| conserved hypothetical protein [Clostridium perfringens C str. JGS1495] Length = 278 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. 
Identities = 27/262 (10%), Positives = 60/262 (22%), Gaps = 23/262 (8%) Query: 11 NIYEFKKHVELALQETKSKLRPTVTEQATEG-----EASALVEVFKPTEAHEIVGDMPDT 65 I + LA + V + EG + + + G + Sbjct: 6 FIPQIWSARLLANLDKNLVYANAV-NRDYEGEIKKFGDTVKINQMGDVTVKDYKGGAIED 64 Query: 66 IYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGML 125 + ++D + + + A+ A+ D+ I + Sbjct: 65 PEELNSNQTILTIDQAKYFNFKVDDVDKAQANVTLVDKGMGRASYAVQDVIDKFIAALVK 124 Query: 126 GVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLI 185 K ++ + N L+ + + +V++ Sbjct: 125 DAKIKVGNTSKPVEITVANA-------------YDTLVDLGVELDNKNVPRVG--RFVIL 169 Query: 186 PSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLI 245 P L R T I + G +G + VP + + + Sbjct: 170 PPFYLGLLSKDPRFTKDFKILENGVVDGA--TVSGFKIMMSNNVPFSANNYSIMAGIDMA 227 Query: 246 DGKVEYPNGKPTVKSSAKFEDT 267 + F D Sbjct: 228 ISFAGQVTEVEAYRPEKSFSDA 249 >gi|208429869|ref|YP_002265422.1| Gp6-like protein [Clostridium phage 39-O] gi|190683352|gb|ACE81996.1| Gp6-like protein [Clostridium phage 39-O] Length = 276 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. 
Identities = 31/287 (10%), Positives = 76/287 (26%), Gaps = 28/287 (9%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEG-----EASALVEVFKPTEAHEIVGDMP 63 T+ I + LA + + V + EG + + ++ + Sbjct: 4 TSFIPKLWSARLLAHLDKAHVVANLV-NRDYEGEIKAYGDTVKINQIGAITVNDYTKNTD 62 Query: 64 D-TIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 + ++ Q + +ID + A A+ + ++ +LK Sbjct: 63 IHDPEELSTTEKVLTIDKQKYFNFQIDDVDAAQVRTPLMDAAMQRAAYALAEETEKVLLK 122 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + + + +L+ K K + + Sbjct: 123 AIDTDA-------THKIVPEATLDPTNI--------YKELVGVKLKLDKANVPTVG--RF 165 Query: 183 VLIPSDVWASLFALERATSK-DYINTAALQAGKIEAFAGVWFINMEK---VPGNDLFPAG 238 ++I + A L R + + L+ G + G+ + + AG Sbjct: 166 LIISPETHALLLQEGRFVATGGAMAEGILKNGLVGQILGMDVYLSNNIDSLTNGNGAIAG 225 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQ 285 K ++ K A + + A+V + Sbjct: 226 VKMACTFAEQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLK 272 >gi|206971573|ref|ZP_03232523.1| conserved hypothetical protein [Bacillus cereus AH1134] gi|206733558|gb|EDZ50730.1| conserved hypothetical protein [Bacillus cereus AH1134] Length = 281 Score = 77.3 bits (188), Expect = 3e-12, Method: Composition-based stats. 
Identities = 38/320 (11%), Positives = 93/320 (29%), Gaps = 56/320 (17%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYN 68 I + +A +S T+ A + + ++ Sbjct: 4 ATFIPTIWEARLMANFHKRSIADLITTKPAKIEGNKIIFNRVGAVNVKDYS---GSVEWD 60 Query: 69 ATDQDRRWVGHSQF-GWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGV 127 T+ + + Q +A ++D + + + + P+ A + + D L G Sbjct: 61 DTNPSKVEINMDQKKYFAFKVDDVDAVQAAGDLIDPHTQEAGSVLQETVDTFTLGLYTGA 120 Query: 128 NKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPS 187 +K IG ++ + ++ + + + + + +I S Sbjct: 121 HKDNVIGTDSAAIELSPKNA-----------YDYIVDLNTKLNVKKVPKT--ERFTIINS 167 Query: 188 DVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDG 247 V L +R T + I + G+I G + E++ Sbjct: 168 QVLGLLSKDDRFTKQPVILENGIVEGQIIN--GSQIVVSEEI------------------ 207 Query: 248 KVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITL 307 +T KY + KS + K ++ ++ A + Sbjct: 208 -----------------HNTSGKYKILALHKSGIGH--GKQLNETEAQRLQNSFADGVRG 248 Query: 308 TSSFGATRIEPDKILGIEIS 327 +GA + P+ + + ++ Sbjct: 249 LMVYGAGVLRPEALAVLTVT 268 >gi|229074029|ref|ZP_04207089.1| hypothetical protein bcere0025_61080 [Bacillus cereus F65185] gi|228709104|gb|EEL61218.1| hypothetical protein bcere0025_61080 [Bacillus cereus F65185] Length = 281 Score = 76.9 bits (187), Expect = 4e-12, Method: Composition-based stats. 
Identities = 38/320 (11%), Positives = 92/320 (28%), Gaps = 56/320 (17%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYN 68 I + +A +S T+ A + + ++ Sbjct: 4 ATFIPTIWEARLMANFHKRSIADLITTKPAKIEGNKIIFNRVGTVNVKDYS---GSVEWD 60 Query: 69 ATDQDRRWVGHSQF-GWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGV 127 T+ + + Q +A ++D + + + + P+ A + + D L G Sbjct: 61 DTNPSKVEINMDQKKYFAFKVDDVDAVQAAGDLIDPHTQEAGSVLQETVDTFTLGLYTGA 120 Query: 128 NKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPS 187 +K IG ++ + ++ + + + + + +I S Sbjct: 121 HKDNVIGTDSAAVELSPKNA-----------YDYIVDLNTKLNVKKVPKT--ERFTIINS 167 Query: 188 DVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDG 247 V L +R T + I + G+I G + E++ Sbjct: 168 QVLGLLSKDDRFTKQPVILENGIIEGQIIN--GSQIVVSEEI------------------ 207 Query: 248 KVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITL 307 +T KY + KS + K ++ ++ A + Sbjct: 208 -----------------HNTSGKYKILALHKSGIGH--GKQLNETEAQRLQNSFADGVRG 248 Query: 308 TSSFGATRIEPDKILGIEIS 327 +GA + P+ + + + Sbjct: 249 LMVYGAGVLRPEALAVLTAT 268 >gi|315122635|ref|YP_004063124.1| hypothetical protein CKC_04435 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496037|gb|ADR52636.1| hypothetical protein CKC_04435 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 336 Score = 76.9 bits (187), Expect = 4e-12, Method: Composition-based stats. 
Identities = 47/275 (17%), Positives = 93/275 (33%), Gaps = 25/275 (9%) Query: 73 DRRWVGHSQFGWAERIDP-FATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKG 131 RR+V + +D + +N L YA A M+R QD I+KG+ N Sbjct: 69 TRRYVQGFPKVTSSLVDKSYDQTTISVNILEGYARSAIKGMNRAQDHMIIKGIFDPNIVD 128 Query: 132 K-----------------IGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYI 174 + + + + +L+ AK++ + Sbjct: 129 DGTERKEKEFDPNMVVALNHGVETAPANSGTVLFQDKFNPKGLTWEKLLRAKTLIGESG- 187 Query: 175 DVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDL 234 S+ + +I + +L R + DY+ + ++ G I A + + ++ Sbjct: 188 --GSDNINAIISHMDYENLLLDPRIKTVDYMKSGRVERGNITRIAKINI----NIYVSEA 241 Query: 235 FPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHS 294 P G D +++ +A + ++P++ K AV K I S Sbjct: 242 IPGGLMAEYESDKELKKDRKDRKPYWTAAKMGAGVSRMIPVFSKDAVTLGIWKEIKKIVS 301 Query: 295 KDPGKWHAPQITLTSSFGATRIEPDKILGIEISKD 329 + Q+ + GATR + + I +S Sbjct: 302 VRTDLHNILQLFYSMKMGATRTNENHVAKILVSDS 336 >gi|298103492|ref|YP_003714734.1| gp27 [Streptomyces phage phiSASD1] gi|293338433|gb|ADE43451.1| gp27 [Streptomyces phage phiSASD1] Length = 291 Score = 75.7 bits (184), Expect = 9e-12, Method: Composition-based stats. 
Identities = 38/329 (11%), Positives = 87/329 (26%), Gaps = 60/329 (18%) Query: 10 ANIYEFKK-HVELA------LQETKSKLRPTVTEQATEGEASALVEVFK---PTEAHEIV 59 I E + +A + R + +A V + PT + Sbjct: 5 TFIPEVWSADLMVALRGAQVFGQLGVINRDY---EGDVSQAGDTVHIGSLSRPTISTYTK 61 Query: 60 GDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEA 119 T D+ + +A +D A +++ DE Sbjct: 62 NSTSIDPQTLTTTDQTLLIDQSKYFAFEVDDVDKRQ---------ARDGGRLLNQAADE- 111 Query: 120 ILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSE 179 GV + + + ++ K K + Sbjct: 112 ---AAFGVADVVDLFLAGLITTSAGNVLTAGDATTPDAAYKIILALKLKLDKAKVPTAG- 167 Query: 180 QVYVLIPSDVWASLFALERATS-KDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 +V++ + +A + +R T Y ++ A++ G++ G + +P AG Sbjct: 168 -RFVIVSPEFYALILQDQRFTDVARYGDSNAIRNGEVGKVLGFDVMVSMNLPQGTAGTAG 226 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPG 298 ++ G + T + I+ + P Sbjct: 227 EVSNFVVAGH-------------------------------GMATTYAEQINNVEAYRPQ 255 Query: 299 KWHAPQITLTSSFGATRIEPDKILGIEIS 327 + I +GA + P+ + +++ Sbjct: 256 NSFSDAIKGLHLYGAKVVRPEALAVMDVD 284 >gi|18640506|ref|NP_570347.1| minor capsid protein [Synechococcus phage P60] gi|18478736|gb|AAL73285.1| minor capsid protein [Synechococcus phage P60] Length = 221 Score = 73.0 bits (177), Expect = 5e-11, Method: Composition-based stats. 
Identities = 20/211 (9%), Positives = 60/211 (28%), Gaps = 16/211 (7%) Query: 78 GHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAET 137 ++ + + + N + A+ DE I + + + Sbjct: 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQ 60 Query: 138 EFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFA-L 196 + NI + + + + A ++ +R +D +V VL P ++ + + Sbjct: 61 DGGFSVNIGAGNT--NNAQAIVDGFFEAAAVLDERSAPMDG-RVAVLSPRQYYSLISSVD 117 Query: 197 ERATSKDYINTAALQAGKIEAF--AGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNG 254 +++ NT + AG+ + + + Sbjct: 118 TNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYGTN----------LVTDPGDA 167 Query: 255 KPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQ 285 + +++ + ++ K A + Sbjct: 168 TTSGENNGSYRPAITDRAGLVFHKEAADTVE 198 >gi|326775602|ref|ZP_08234867.1| hypothetical protein SACT1_1416 [Streptomyces cf. griseus XylebKG-1] gi|326655935|gb|EGE40781.1| hypothetical protein SACT1_1416 [Streptomyces cf. griseus XylebKG-1] Length = 286 Score = 72.6 bits (176), Expect = 7e-11, Method: Composition-based stats. 
Identities = 25/236 (10%), Positives = 56/236 (23%), Gaps = 19/236 (8%) Query: 9 TANIYEFKK-------HVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGD 61 + L + + R E S + Sbjct: 4 ALFKPQIWSAQILAGLDEALVYAQPQIVNRDYEGEIT-SQGQSVRIVTIGDPSIFPYKSG 62 Query: 62 MPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAIL 121 + +A ++D +NP+ A + + D + Sbjct: 63 DTINYEDIDTAGLDLPIDQGDAFAFKLDDVDKAQVALNPMAKTTQRAARKLAAQADRYVA 122 Query: 122 KGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQV 181 GV +G+ + + LI ++ + + Sbjct: 123 SLYTGVAPSNVVGSSGSP--------VNITTNPKDAWDKVLIPLRTKLNRANVP--GMDR 172 Query: 182 YVLIPSDVWASLFALERATSKD-YINTAALQAGKIEAFAGVWFINMEKVPGNDLFP 236 YV++ + +L +R D ++ L+ G + AG + P Sbjct: 173 YVVVSPEFTGALLQDDRFVRVDASGSSEGLRNGIVGKAAGFDVLESNVTPNPSADT 228 >gi|227833744|ref|YP_002835451.1| hypothetical protein cauri_1920 [Corynebacterium aurimucosum ATCC 700975] gi|262184816|ref|ZP_06044237.1| hypothetical protein CaurA7_12538 [Corynebacterium aurimucosum ATCC 700975] gi|227454760|gb|ACP33513.1| hypothetical protein cauri_1920 [Corynebacterium aurimucosum ATCC 700975] Length = 300 Score = 72.3 bits (175), Expect = 1e-10, Method: Composition-based stats. 
Identities = 32/232 (13%), Positives = 54/232 (23%), Gaps = 10/232 (4%) Query: 40 EGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGIN 99 S + + T + + V Q +A I+ + + Sbjct: 41 NSGKSVKINRLGAVKTRTYTQGESITYDTLSTESTELVMDQQEYYAFLIEDIDRAQAAGD 100 Query: 100 PLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFI 159 AM K D + K K+G + F + A Sbjct: 101 FQNESTRQHAYAMAAKVDAHTAGVLKDGAKT-KLGNKAVFDGADFYRPAEGQMTA----W 155 Query: 160 GQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDY-INTAALQAGKIEAF 218 L K +V++ ++ A+L A R T D L+ G+I A Sbjct: 156 DVLREFSKQLNKHSAPS--LDRWVVVGPNMAAALLADRRFTEADKAGTDTILRNGQIGAI 213 Query: 219 --AGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTK 268 G + P + F D Sbjct: 214 KTLGFTVYTSNQAPVTAGRETIIGGAPNALDFASQLQTAEAFRHQDHFADAF 265 >gi|46201220|ref|ZP_00055498.2| hypothetical protein Magn03010160 [Magnetospirillum magnetotacticum MS-1] Length = 79 Score = 71.9 bits (174), Expect = 1e-10, Method: Composition-based stats. Identities = 13/74 (17%), Positives = 20/74 (27%), Gaps = 1/74 (1%) Query: 3 TKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDM 62 + A ++ V A Q +KLR TV + A A+ + A Sbjct: 1 MSTSVINAYSKQYGHEVHAAYQRMGTKLRNTVRSRNNVKGAIAVFQKVGKGTASTKA-RH 59 Query: 63 PDTIYNATDQDRRW 76 D Sbjct: 60 GKVPVMNVDHSAVE 73 >gi|227505825|ref|ZP_03935874.1| conserved hypothetical protein [Corynebacterium striatum ATCC 6940] gi|227197581|gb|EEI77629.1| conserved hypothetical protein [Corynebacterium striatum ATCC 6940] Length = 297 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. 
Identities = 27/286 (9%), Positives = 62/286 (21%), Gaps = 16/286 (5%) Query: 9 TANIYEFKKHVELALQETKSK-----LRPTVTEQATEG-EASALVEVFKPTEAHEIVGDM 62 + + E E + T G + E D Sbjct: 4 ASFVPELWNAAIQEPYEKSLVYGQSSIASTGYFGQITGMGDTVHFNTLTAPTIKEYDKDA 63 Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 TI + T D ++ ++ + + P A + D+ I Sbjct: 64 DLTIEDLTTADNTLKIDQGKYFSFGVNDVDKVQVAGDLQGPATRAAATGLRDGVDKFIAG 123 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + + + L+ + + + Sbjct: 124 KLKEGALSANKIGTLKVVNDDPDKVGNGQTTA----FKTLVKLSEKLNMQSVPTTG--RW 177 Query: 183 VLIPSDVWASLFALERATSKD-YINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKF 241 V++ +++L R T D L+ G + G + P Sbjct: 178 VVVGPKTYSALLMDPRFTKVDASGTAEGLRNGIVGRAIGFEVMVSNNAPSTSGRELAIAG 237 Query: 242 PGLIDGKVEYPNGKPTVKSSAKFED---TKIKYVLPIYCKSAVVFT 284 ++ ++F D Y + + Sbjct: 238 VPGAFVFASQLVETEALRDPSRFRDIVRGLNVYGAGVVRPEGIATA 283 >gi|227498434|ref|ZP_03928580.1| conserved hypothetical protein [Acidaminococcus sp. D21] gi|226903892|gb|EEH89810.1| conserved hypothetical protein [Acidaminococcus sp. D21] Length = 288 Score = 69.9 bits (169), Expect = 5e-10, Method: Composition-based stats. 
Identities = 30/275 (10%), Positives = 65/275 (23%), Gaps = 27/275 (9%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEG-----EASALVEVFKPTEAHEIVGDMP 63 + I LA + L V + EG + + V Sbjct: 4 STFIPALWSARLLAHLDKNLVLGNLV-NRDYEGEIRNFGDRVKINQIADVVVKDYVKGT- 61 Query: 64 DTIYNATDQDRRW-VGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 D Y+ TD V +A +++ + I + A+ A+ D+ I Sbjct: 62 DLAYDDTDGTPTELVIDQSKYFAFKVNDVDAAQANIALMDRSLERASYALRDVIDQRIAG 121 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + ++ + + + + Sbjct: 122 HAKKAGSTLTVKDMESPEQA----------------YDSIVKLGTTLDENNVTRAG--RW 163 Query: 183 VLIPSDVWASLFALERATSKDYINTAA-LQAGKIEAFAGVWFINMEKVPGNDLFPAGTKF 241 +++P ++ L +R L G + + AG + + Sbjct: 164 LVLPPWLYGLLQKDQRFVGTGSAAAENRLTTGNVGSAAGFQIYESNNLLTVKSTNTVSVM 223 Query: 242 PGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIY 276 G T + + L +Y Sbjct: 224 AGTNAAISLAVQILKTESLRLEKDFADAVRGLLVY 258 >gi|319440816|ref|ZP_07989972.1| hypothetical protein CvarD4_03523 [Corynebacterium variabile DSM 44702] Length = 300 Score = 69.9 bits (169), Expect = 5e-10, Method: Composition-based stats. 
Identities = 29/265 (10%), Positives = 64/265 (24%), Gaps = 14/265 (5%) Query: 9 TANIYEFKKHVELALQET----KSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPD 64 + I + A + + T S + H+ + Sbjct: 4 ASFIPKIWAASLEAPYQKSLVYGALADNKFQPMLTNSGNSIEINSIGSAAIHDHDRNTDL 63 Query: 65 TIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGM 124 T + + + + + + R++ L + + M K D + + + Sbjct: 64 TYDDLSVTAQTLLIDQEDYYGFRVNDVDALQAAGDLQSAATEQHGIEMANKVDTFLAEQL 123 Query: 125 LGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVL 184 + GK F + G + K + + + Sbjct: 124 V--ADAGKKITGLTAFDGADFYRPATGQTTA---WDTIRAIVKELDKVSAPSTA--RWAV 176 Query: 185 IPSDVWASLFALERATSKD-YINTAALQAGKIEAF--AGVWFINMEKVPGNDLFPAGTKF 241 + + ++L A R T + G I A G+ P T Sbjct: 177 VGPEFASALLADRRVTDASVTGTDTVARTGMITAIQHLGISVYVSNNTPVKTGAEVITAG 236 Query: 242 PGLIDGKVEYPNGKPTVKSSAKFED 266 V + + +F D Sbjct: 237 VPGALAFVSQLRTIEAFRDTNRFGD 261 >gi|261368733|ref|ZP_05981616.1| hypothetical protein SUBVAR_06993 [Subdoligranulum variabile DSM 15176] gi|282569155|gb|EFB74690.1| hypothetical protein SUBVAR_06993 [Subdoligranulum variabile DSM 15176] Length = 300 Score = 69.6 bits (168), Expect = 7e-10, Method: Composition-based stats. 
Identities = 22/231 (9%), Positives = 54/231 (23%), Gaps = 21/231 (9%) Query: 21 LALQETKSKLRPTVTEQATEGEASALV-------EVFKPTEAHEIVGD---------MPD 64 A QE K+ + + + + Sbjct: 3 HANQERYGKMVDAKLRTNLVTRDNYIFNNKYEGDPKAGKVKIPVRDTEVEVKDYDKANGV 62 Query: 65 TIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGM 124 +T E ID F + A ++ D+ L + Sbjct: 63 DPKASTTTYLDLDIDQDEAVNELIDGFDAESVPDGIVAERLDSAAYSLGLSMDKKSLNAL 122 Query: 125 LGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVL 184 E + +A + + + AK ++ + D ++ +++ Sbjct: 123 EAAGTGEGSVEEGTLANVSTSKTACTSSNA----YKEALAAKRTLSRKGVPNDGQR-WMI 177 Query: 185 IPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 + + L + ++ +Q G + AG + Sbjct: 178 VSPEYLEVLMQDPNFVKQGDLSQELVQEGVVGKVAGFLVFESANLDFESTT 228 >gi|331269401|ref|YP_004395893.1| hypothetical protein CbC4_1216 [Clostridium botulinum BKT015925] gi|329125951|gb|AEB75896.1| conserved hypothetical protein [Clostridium botulinum BKT015925] Length = 278 Score = 69.2 bits (167), Expect = 9e-10, Method: Composition-based stats. 
Identities = 36/322 (11%), Positives = 69/322 (21%), Gaps = 68/322 (21%) Query: 11 NIYEFKKHVELALQETKSKLRPTVTEQATEG-----EASALVEVFKPTEAHEIVGDMPDT 65 I + LA + K V + EG + + + D Sbjct: 6 FIPQIWSARLLANLDKKLVYANAV-NRDYEGEIKKFGDTVKINQMGDVTVKDYKDGKIDD 64 Query: 66 IYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGML 125 + ++D + I + A+ A+ D+ I + Sbjct: 65 PEELKSSQTILTIDQAKYFNFKVDDVDKAQANITLVDKGMGRASYAVQDVIDQFIAAFVK 124 Query: 126 GVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLI 185 K ++ N L+ + + + ++ Sbjct: 125 DAKIKMGSSSKPIELIPTNA-------------YDILVDLGVELDNKNVPRVG--RFAIL 169 Query: 186 PSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLI 245 P L R T + I + G AG V Sbjct: 170 PPFYLGLLSKDARFTKEYKILENGVVEGA--TVAGFSLRMSNNV---------------- 211 Query: 246 DGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHS-KDPGKWHAPQ 304 + Y + A+ F + P K A Sbjct: 212 -------------------SVSSGNYSIMAGTDMAISFA---GQVTEIEAYRPEKSFADA 249 Query: 305 ITLTSSFGATRIEPDKILGIEI 326 + FGA K++ + Sbjct: 250 MKGLYVFGA------KVVQSDC 265 >gi|283783443|ref|YP_003374197.1| hypothetical protein HMPREF0424_0987 [Gardnerella vaginalis 409-05] gi|283441729|gb|ADB14195.1| conserved hypothetical protein [Gardnerella vaginalis 409-05] Length = 284 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. 
Identities = 36/318 (11%), Positives = 71/318 (22%), Gaps = 60/318 (18%) Query: 9 TANIYEFKKHVELALQETKSKLRPTV--TEQATEGEASALVEVFKPTEAH----EIVGDM 62 I E L E + V + + G + Sbjct: 5 NNFIPEIWSANILVTLENSLVFANLANREHEGEIKAYGDTVHITGIGDIQIQDYTKYGKL 64 Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 D +A +D T+ S + + A + + D+ + Sbjct: 65 TIQPVTDIDAG-VLKIDQSKAFAFEVDDLDTVQSRKDLRGKFQERAAYNLAAEVDKYVGG 123 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 M+ + +++ K+ I + Sbjct: 124 LMVTAAAGKALKKTYTKPED---------------VYESIVSLGVRLSKQNIPTTG--RF 166 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 +++ DV+ L +R + +A L G + G +PGN Sbjct: 167 LVVDPDVYGMLLLDDRFVKNTAVESATLHNGFVGNVNGFTVYQTNCMPGNTDTKHTMLAG 226 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHA 302 S + T + I S + + Sbjct: 227 ------------------------------------STIATTFAQQISKMESTRREESFS 250 Query: 303 PQITLTSSFGATRIEPDK 320 I +GA I P+ Sbjct: 251 DLIKGLLVYGAKVIRPEA 268 >gi|330507937|ref|YP_004384365.1| hypothetical protein MCON_1983 [Methanosaeta concilii GP-6] gi|328928745|gb|AEB68547.1| conserved hypothetical protein [Methanosaeta concilii GP-6] Length = 295 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. 
Identities = 33/282 (11%), Positives = 68/282 (24%), Gaps = 40/282 (14%) Query: 41 GEASALVEVFKPTEAHEIVGDMPDT-IYNATDQDRRWVGHSQFGWAERIDPFATLDSGIN 99 ++ + D + D + +D + N Sbjct: 42 KGSTVKITSIGDITVGNYTKDSDISDPEALNDAQATLTATEAKYFNFSVDDVSRAQMSNN 101 Query: 100 PLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFI 159 + A + D+ + G + A + Sbjct: 102 IMDAAMRQAAYNLSDVADQF----IAGSSYVDVATANKIGSDTAGKVPNTTPGTTA---Y 154 Query: 160 GQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKD-YINTAALQAGKIEAF 218 L+ + + + +V++P L A R T +T AL G ++ Sbjct: 155 DYLLQMGTKLSEANVQKQG--RWVVVPPWFVEKLAADARFTDASASGSTDALLNGSVKRA 212 Query: 219 AGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCK 278 AG + VP + + K Y + Sbjct: 213 AGFDILESNNVPTVAG---------------------------SGGDAGKTNYKIIAGVP 245 Query: 279 SAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDK 320 SA+ F +++ + P K A + +G + P Sbjct: 246 SAITFA--DSVNKVEAYRPDKRFADAVKGLHVYGMKVVRPSA 285 >gi|313116021|ref|ZP_07801445.1| hypothetical protein HMPREF9436_03335 [Faecalibacterium cf. prausnitzii KLE1255] gi|310621618|gb|EFQ05149.1| hypothetical protein HMPREF9436_03335 [Faecalibacterium cf. prausnitzii KLE1255] Length = 286 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. 
Identities = 31/242 (12%), Positives = 57/242 (23%), Gaps = 32/242 (13%) Query: 21 LALQETKSKLRPTVTEQATEGEASALV-------EVFKP-------TEAHEIVGDMPDTI 66 A QE S L + + TE D + + Sbjct: 3 HASQERYSALVDAKLRATLVTRDNTIFNNRYEGSPKAGKVKIPVRDTEVAVKAYDKANGV 62 Query: 67 YNATDQDRRWVGHSQFGWA--ERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGM 124 A E ID F A +M D+ ++ + Sbjct: 63 DADAGTTTYLDLDIDNDEAVNEIIDGFDAASVPDGITAERLDSAGYSMALSIDKKSIEAL 122 Query: 125 LGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVL 184 G NI + + L AK + + +++ Sbjct: 123 QGAA-------------GANISATKTACTASTAYKEAL-AAKRTLSRNGVPQAG--RWMI 166 Query: 185 IPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGL 244 + + L +R + ++ +QAG + AG + + TK Sbjct: 167 VSPEYLEILMQDDRFIKQGDLSQQLVQAGAVGQIAGFAVYESNNMDFENATRVATKKTTT 226 Query: 245 ID 246 Sbjct: 227 EF 228 >gi|237743783|ref|ZP_04574264.1| conserved hypothetical protein [Fusobacterium sp. 7_1] gi|229432814|gb|EEO43026.1| conserved hypothetical protein [Fusobacterium sp. 7_1] Length = 275 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. 
Identities = 25/226 (11%), Positives = 52/226 (23%), Gaps = 27/226 (11%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEG-----EASALVEVFKPTEAHEIVGDMP 63 E + + + + EG +S V + Sbjct: 4 QTFKPEVWAELTNRNLNKQLVF-GALANRNYEGKIENMGSSVRVPSIGSVTVGDYT-GAD 61 Query: 64 DTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKG 123 T T + + +A ++D + + A M D + K Sbjct: 62 ITFQEDTGAYQTININKAKYFALKMDDVDKAQAISGVIEALTDQAIYEMADVVDIELAKL 121 Query: 124 MLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYV 183 S V G ++I + + ++ Sbjct: 122 Y------------------AKCKSKVAGVIGSDKVSDKIIDLAVKMDEDNVPTA--NRWL 161 Query: 184 LIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKV 229 +I +++ L SK Q+ I ++ G V Sbjct: 162 VISPEIYGQLIKEVPTISKGENTLGINQSYFIGSWGGFTIYKSNNV 207 >gi|262276629|ref|ZP_06054434.1| hypothetical protein HIMB114_0030 [alpha proteobacterium HIMB114] gi|262225209|gb|EEY75656.1| hypothetical protein HIMB114_0030 [alpha proteobacterium HIMB114] Length = 281 Score = 64.6 bits (155), Expect = 2e-08, Method: Composition-based stats. 
Identities = 40/332 (12%), Positives = 82/332 (24%), Gaps = 62/332 (18%) Query: 2 ATKEQLATANIYEFKKHVELALQETKSKLRPTVTEQATEG-EASALVEVFKPTEAHEIVG 60 T L+ ++ + + S +RP VT G V V+ A + Sbjct: 5 TTSSTLSELYTEIIQEAIFTF--QETSVMRPLVTTYNISGQGKQIAVPVYPAISAAAVAE 62 Query: 61 DMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAI 120 + + S+ G + + N L A+ K D I Sbjct: 63 GTDLSNTAVNPTEATI-TASEVGVMTTLTDLGRDSASRNVAADIGKLFGDAIADKVDTDI 121 Query: 121 LKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQ 180 + G + + A + R + Sbjct: 122 AALFSSFSS-------------------DVGAAATELTPELIFKAVATLRANNVPAPYYG 162 Query: 181 VYVLIPSDVWASLFALERATSKD----YINTAALQAGKIEAFAGVWFINMEKVPGNDLFP 236 V+ + + T+ + AL++G I AGV + Sbjct: 163 VFNPKAAFNLKKVLTNAGYTTSSNAVSDLGNEALRSGYIATVAGVQIFENSNISI----- 217 Query: 237 AGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKD 296 D V ++ +++ ++ ++ +D Sbjct: 218 -----------------------------DAYDDSVGAVFHPASLGLAMKQDFRIETQRD 248 Query: 297 PGKWHAPQITLTSSFGATRIEPDKILGIEISK 328 A +I T + G ++ D + I Sbjct: 249 ASLR-ATEIVATVTKGQGVVKSDYGVKITTDS 279 >gi|118443909|ref|YP_878246.1| hypothetical protein NT01CX_2173 [Clostridium novyi NT] gi|118134365|gb|ABK61409.1| conserved hypothetical protein [Clostridium novyi NT] Length = 230 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. 
Identities = 31/272 (11%), Positives = 62/272 (22%), Gaps = 56/272 (20%) Query: 50 FKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLAT 109 + G + + ++D S I + A+ Sbjct: 1 MGDVTVKDYKGGTIEDPEELKSNQTILTIDQAKYFNFKVDDVDKAQSNILLVDKGMGRAS 60 Query: 110 AAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIF 169 A+ D+ I + K KIG+ ++ L+ Sbjct: 61 YAVQDVIDKFIAALVKDA--KIKIGSTSKPIEI-----------TVANAYDTLVDLGVEL 107 Query: 170 RKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKV 229 + + + ++P L R T I + G +G + V Sbjct: 108 DNKNVPRVG--RFAILPPFYLGLLSKDPRFTKDFKILENGVVEGA--TVSGFKLMMSNNV 163 Query: 230 PGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAI 289 P + Y + A+ + Sbjct: 164 PF-----------------------------------SANNYSIMAGTDMAISYA---GQ 185 Query: 290 DVQHS-KDPGKWHAPQITLTSSFGATRIEPDK 320 + P K + + FGA ++PD Sbjct: 186 VTEIEAYRPEKSFSDAMKGLYVFGAKVVQPDC 217 >gi|295100741|emb|CBK98286.1| hypothetical protein FP2_06660 [Faecalibacterium prausnitzii L2-6] Length = 272 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. 
Identities = 24/221 (10%), Positives = 56/221 (25%), Gaps = 22/221 (9%) Query: 58 IVGDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQD 117 T + T E ID F + + A ++ + D Sbjct: 58 KQTGAELTGGDTTY--LTVNIDKDKAVNEIIDGFDAASVPDDLVADRLDSAGYSLALQVD 115 Query: 118 EAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVD 177 + + KT G ++ A++ ++ + Sbjct: 116 SD----------------GSVELTTAGTAFGTTTALTEKTIYGNVVDARTKLSTVHVPTE 159 Query: 178 SEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPA 237 ++L+ +++ L + A +Q G + AG + + A Sbjct: 160 G--RWLLVSPEIYGLLLKSPEFIKASDLGDAVVQTGAVGRIAGFTVFEDSTLGEGVEYIA 217 Query: 238 GTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIK--YVLPIY 276 G + P + S+K+ + + Sbjct: 218 GHPNWFAFIDEWAVPVHVQDLNGSSKYIGASAVKGRKVYAF 258 >gi|83721100|ref|YP_441470.1| hypothetical protein BTH_I0914 [Burkholderia thailandensis E264] gi|257139849|ref|ZP_05588111.1| hypothetical protein BthaA_11711 [Burkholderia thailandensis E264] gi|83654925|gb|ABC38988.1| hypothetical protein BTH_I0914 [Burkholderia thailandensis E264] Length = 126 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 42/136 (30%), Gaps = 36/136 (26%) Query: 191 ASLFALERATSKDYINTAALQAGKI-EAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKV 249 + + TS D++ LQ GK+ + G ++ E + Sbjct: 23 DYILSDTTLTSADFMAVQMLQDGKLSGHWLGFTWVPYEAL-------------------- 62 Query: 250 EYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTS 309 Y KS+ F D+ K +A QI + Sbjct: 63 ---------------ATNGTVKTTCAYAKSSTQFGVGLNRDIDIGPRRDKRNAIQIYIGE 107 Query: 310 SFGATRIEPDKILGIE 325 S+GA R + K++ I+ Sbjct: 108 SYGAVRTDEKKVVTID 123 >gi|158345177|ref|YP_001522884.1| capsid protein [Enterobacteria phage LKA1] gi|114796473|emb|CAK25011.1| capsid protein [Pseudomonas phage LKA1] Length = 334 Score = 59.9 bits (143), Expect = 5e-07, Method: Composition-based stats. 
Identities = 24/233 (10%), Positives = 57/233 (24%), Gaps = 10/233 (4%) Query: 27 KSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAE 86 SK + ++ G V+ + + + Sbjct: 39 SSKFASWMNVRSLRGTNQLRVDRVGASTIAGRKAGEELVVQKNVSDKLNLTVDTVLYARH 98 Query: 87 RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENIL 146 D F S ++ A A+ R+ D+A + + F +L Sbjct: 99 FFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGILL 158 Query: 147 SAVEGDDFFKTFIGQL------ITAKSIFRKRYI-DVDSEQVYVLIPSDVWASLFALERA 199 + R + D + L+ +++ L +R Sbjct: 159 PSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRL 218 Query: 200 TSKDYI---NTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKV 249 + ++ + G+I GV + + P + + + Sbjct: 219 MNVEFGAKEGGNSFVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAE 271 >gi|289976623|gb|ADD21668.1| putative major capsid protein [Caulobacter phage Cd1] Length = 337 Score = 59.5 bits (142), Expect = 7e-07, Method: Composition-based stats. Identities = 37/328 (11%), Positives = 93/328 (28%), Gaps = 28/328 (8%) Query: 7 LATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTI 66 + + E+ E +L S L V + G + T+ ++ +P Sbjct: 27 IHALAVSEYAGFTETSLNRR-SVLADWVPMRRITGTTTVHNYAIGETKLDKVEPGVPP-P 84 Query: 67 YNATDQDRRWVGHSQFGWAE-RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGML 125 + D R V A + + + + + +D+A+ + Sbjct: 85 SHGVDISRASVTVDTMINARNVLPLLEEFQTQVEVRKHLGEEHGKELAKFRDQALFIQAI 144 Query: 126 GVNKKGKIGAETEFFSKEN---ILSAVEGDDFFKTFIGQLITAK----SIFRKRYIDVDS 178 + + + S G T ++ A + ++ +D Sbjct: 145 KAARMTQSAYAKGGQDVDGFKGGTSIQLGAAGDVTDPAKMYRAVSDLETAMAEKDVDWVE 204 Query: 179 EQVYVLIPSDVWASLFALERATSKDYIN-TAALQAGKIEAFAGVWFINMEKVPGNDLFPA 237 + + + V+ +L E+ + +Y+ + G + G + +P ++ Sbjct: 205 DGIILAFRPKVFQALRDAEKIVNGEYVTADGTTKEGLVFKTFGAPVVKTNNLPNTNIT-- 262 Query: 238 GTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDP 297 + +S ++ K A++ + + D Sbjct: 263 --------------GHLLSNAGNSNGYDVDARKVAGVALSTRALLAGETIPLTSDVFYDK 308 Query: 298 GKWHAPQITLTSSFGATRIEPDKILGIE 325 W + ++F AT + IE Sbjct: 309 I-WKCWFVDSHTAFAATPSRAEFAGIIE 335 
>gi|311899963|dbj|BAJ32371.1| hypothetical protein KSE_66120 [Kitasatospora setae KM-6054] Length = 304 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 28/272 (10%), Positives = 63/272 (23%), Gaps = 21/272 (7%) Query: 8 ATANIYEFKKHVELALQETKSKLRPT-VTEQATEG-----EASALVEVFKPTEAHEIVGD 61 +++ + E L + T + EG + + + Sbjct: 13 SSSFVPEIWDGALLTKFDPLLVWASKICTNRKYEGEIRKQGDTVHINSLSTPTVGDYTLP 72 Query: 62 MPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAIL 121 T +++ ++ + + P A+ R+ D + Sbjct: 73 EGMTAQRPEMVEQKLAITEAKYLQLLVEDIERVQAAGAMESPINQQMVRALAREADTFMG 132 Query: 122 KGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQV 181 + + V + + ++ I + Sbjct: 133 RVIASAA-------------TPMPSVKVTAGNAPQALYSAVLDMMLALDSHDIPD--GRY 177 Query: 182 YVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKF 241 V+ P + A + Y G + AG ++ +P AG Sbjct: 178 VVVSPRVKRHLVEHPAIANAGAYGEAGVTANGVVARLAGFTVLSTTAMPEGSDIVAGHSE 237 Query: 242 PGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVL 273 + + K A F DT Y Sbjct: 238 FATFASQFNGFREGLSEKYRANFVDTLHLYGG 269 >gi|168207210|ref|ZP_02633215.1| conserved hypothetical protein [Clostridium perfringens E str. JGS1987] gi|170661422|gb|EDT14105.1| conserved hypothetical protein [Clostridium perfringens E str. JGS1987] Length = 273 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. 
Identities = 27/299 (9%), Positives = 74/299 (24%), Gaps = 63/299 (21%) Query: 38 ATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQD-RRWVGHSQFGWAERIDPFATLDS 96 + + ++ + V + +A +D Sbjct: 33 TEINGEKVIFNRVANGNLKDYT---GTIAWDDVNTTPIEMVFDQKKYFAFSLDDVDKAQL 89 Query: 97 GINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFK 156 + + P A + D+ + K A Sbjct: 90 KADVMKPTLEEHGAILAETYDKNFFNVLAAGAKSENNIGSKSKKKTVTPKEA-------- 141 Query: 157 TFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIE 216 ++ + K+ + +V + S+ L +R T + + G+ Sbjct: 142 --YDYIVDLGTKLSKKKVPKA--DRFVTVDSEYLGLLSKDDRFTKNPNVLANGIVEGQ-- 195 Query: 217 AFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIY 276 G+ ++ E++P N + + Sbjct: 196 KINGLQVMSSEELPDNTI---------------------------------------IAH 216 Query: 277 CKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKI----LGIEISKDSL 331 KSA+ K + + A I +G+ + + I ++++++++ Sbjct: 217 HKSAIGSA--KQLQKTEAMRLQGSFADGIRGLCVYGSKVLREEAISVLYYELKVAEETV 273 >gi|254391651|ref|ZP_05006849.1| Mycobacterium phage protein [Streptomyces clavuligerus ATCC 27064] gi|294812979|ref|ZP_06771622.1| Gp6-like protein [Streptomyces clavuligerus ATCC 27064] gi|326441473|ref|ZP_08216207.1| hypothetical protein SclaA2_10428 [Streptomyces clavuligerus ATCC 27064] gi|197705336|gb|EDY51148.1| Mycobacterium phage protein [Streptomyces clavuligerus ATCC 27064] gi|294325578|gb|EFG07221.1| Gp6-like protein [Streptomyces clavuligerus ATCC 27064] Length = 304 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. 
Identities = 24/206 (11%), Positives = 50/206 (24%), Gaps = 19/206 (9%) Query: 37 QATEGEASALVEV---FKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDPFAT 93 + + LV + PT A + D T D++ ++ Sbjct: 44 EGDISKQGDLVHINSLVTPTVADYKLPD-GMTFQRPETVDQKLEVSEAKYIQLLVEDAER 102 Query: 94 LDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDD 153 P A+ R+ D + + + + Sbjct: 103 AQVAGTIDSPINQRMIQALARETDTFVGNVIASGA-------------TALPSAKATAQN 149 Query: 154 FFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAG 213 + G ++ + + + V+ P + A + Y G Sbjct: 150 APQVLYGTILDMMLALDDNDVP--TGRYVVVSPRVKRYLIEHPAIANAGAYGEGGVTANG 207 Query: 214 KIEAFAGVWFINMEKVPGNDLFPAGT 239 I AG ++ +P AG Sbjct: 208 LIARLAGFTVVSTTAMPKGVDIVAGH 233 >gi|317499867|ref|ZP_07958105.1| hypothetical protein HMPREF1026_00047 [Lachnospiraceae bacterium 8_1_57FAA] gi|316898769|gb|EFV20802.1| hypothetical protein HMPREF1026_00047 [Lachnospiraceae bacterium 8_1_57FAA] Length = 292 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 25/236 (10%), Positives = 62/236 (26%), Gaps = 20/236 (8%) Query: 11 NIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNAT 70 + LA + S T+ + + ++ Sbjct: 6 FKPTLWEGALLANFHSVSIADVLATKPTEIKGQKVIFNRVAGGTLKDYS---GSVDWDDI 62 Query: 71 DQDRR-WVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNK 129 D V + +A +D + + L A + D+ +L K Sbjct: 63 DTTPVEMVFDKKKYFAFALDDVDKVQLKADLLSATTKEHAAVLAETYDKDFFAALLAGTK 122 Query: 130 KGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDV 189 + + + ++ ++ K+ + +V + +D Sbjct: 123 LLIGSSSAKKKV------------TAASAYDYIVDLGTMLSKKKVPKV--NRFVTVNADY 168 Query: 190 WASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLI 245 L +R T+ + + G+ G+ + E++P N + G Sbjct: 169 LGLLSKDKRFTANPKVLENGVVEGQ--TINGMQVMCSEELPANVIIANHKSAIGAA 222 >gi|157311195|ref|YP_001469239.1| gp6 [Mycobacterium phage Tweety] gi|148540824|gb|ABQ86075.1| gp6 [Mycobacterium phage Tweety] Length = 273 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. 
Identities = 30/318 (9%), Positives = 67/318 (21%), Gaps = 59/318 (18%) Query: 9 TANIYEFKKHVELALQETKSKLRPTV-TEQATEGEASALVEVFKPTEAH---EIVGDMPD 64 I E + L ++ V E +V + Sbjct: 4 NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQT 63 Query: 65 TIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGM 124 + +D + + +D + + Y A+ D+ I + Sbjct: 64 SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLL 122 Query: 125 LGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVL 184 + + + A K + V V Sbjct: 123 VDNG----------------TALSGSAPTDADDAFDLIAKALKELTKANVPNVGRVVVVN 166 Query: 185 IPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGL 244 W + ++ + A L+AG I G + + Sbjct: 167 AEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL--------------- 211 Query: 245 IDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQ 304 + SA + + V+ +D + + Sbjct: 212 ---------------------RDTDDEQFVAFHPSAAAYVSQID-TVEALRDQD-SFSDR 248 Query: 305 ITLTSSFGATRIEPDKIL 322 I +G + P ++ Sbjct: 249 IRALHVYGGKVVRPTGVV 266 >gi|206599886|ref|YP_002241691.1| gp6 [Mycobacterium phage Fruitloop] gi|318065798|ref|YP_004123828.1| gp6 [Mycobacterium phage Wee] gi|206286974|gb|ACI12320.1| gp6 [Mycobacterium phage Fruitloop] gi|315420881|gb|ADU15882.1| gp6 [Mycobacterium phage Wee] Length = 273 Score = 54.5 bits (129), Expect = 2e-05, Method: Composition-based stats. 
Identities = 31/318 (9%), Positives = 68/318 (21%), Gaps = 59/318 (18%) Query: 9 TANIYEFKKHVELALQETKSKLRPTV-TEQATEGEASALVEVFKPTEAH---EIVGDMPD 64 I E + L ++ V E +V + Sbjct: 4 NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQT 63 Query: 65 TIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGM 124 + +D + + +D + + Y A+ D+ I + Sbjct: 64 SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLL 122 Query: 125 LGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVL 184 + + + TA K + V V Sbjct: 123 VDNG----------------TALSGSAPTDADDAFDLIATALKELTKANVPNVGRVVVVN 166 Query: 185 IPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGL 244 W + ++ + A L+AG I G + + Sbjct: 167 AEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL--------------- 211 Query: 245 IDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQ 304 + SA + + V+ +D + + Sbjct: 212 ---------------------RDTDDEQFVAFHPSAAAYVSQID-TVEALRDQD-SFSDR 248 Query: 305 ITLTSSFGATRIEPDKIL 322 I +G + P ++ Sbjct: 249 IRALHVYGGKVVRPTGVV 266 >gi|206600085|ref|YP_002241590.1| gp6 [Mycobacterium phage Pacc40] gi|206287173|gb|ACI12517.1| gp6 [Mycobacterium phage Pacc40] Length = 273 Score = 54.5 bits (129), Expect = 2e-05, Method: Composition-based stats. 
Identities = 30/318 (9%), Positives = 66/318 (20%), Gaps = 59/318 (18%) Query: 9 TANIYEFKKHVELALQETKSKLRPTV-TEQATEGEASALVEVFKPTEAH---EIVGDMPD 64 I E + L ++ V E +V + Sbjct: 4 NNFIPELWSDMLLEEWAAQTVFANLVNREYEGIANKGNVVHIAGVVSPTVKDYKAAGRQT 63 Query: 65 TIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGM 124 + +D + + +D + + Y A+ D+ I + Sbjct: 64 SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALAIDTDKFIADML 122 Query: 125 LGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVL 184 + + A K + V V Sbjct: 123 VDNG----------------TALTGSAPTDADDAFDLIAKALKELTKANVPNVGRVVVVN 166 Query: 185 IPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGL 244 W + ++ + A L+AG I G + + Sbjct: 167 AEMAYWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL--------------- 211 Query: 245 IDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQ 304 + SA + + V+ +D + + Sbjct: 212 ---------------------RDTDDEQFVAFHPSAAAYVSQID-TVEALRDQD-SFSDR 248 Query: 305 ITLTSSFGATRIEPDKIL 322 I +G + P ++ Sbjct: 249 IRALHVYGGKVVRPTGVV 266 >gi|291084865|ref|YP_003495148.1| gp6 [Mycobacterium phage Ardmore] gi|262262701|gb|ACY39889.1| gp6 [Mycobacterium phage Ardmore] Length = 273 Score = 54.5 bits (129), Expect = 2e-05, Method: Composition-based stats. 
Identities = 31/318 (9%), Positives = 68/318 (21%), Gaps = 59/318 (18%) Query: 9 TANIYEFKKHVELALQETKSKLRPTV-TEQATEGEASALVEVFKPTEAH---EIVGDMPD 64 I E + L ++ V E +V + Sbjct: 4 NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQT 63 Query: 65 TIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGM 124 + +D + + +D + + Y A+ D+ I + Sbjct: 64 SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLL 122 Query: 125 LGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVL 184 + + + TA K + V V Sbjct: 123 VDNG----------------TALSGSAPTDADDAFDLIATALKELTKANVPNVGRVVVVN 166 Query: 185 IPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGL 244 W + ++ + A L+AG I G + + Sbjct: 167 AEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL--------------- 211 Query: 245 IDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQ 304 + SA + + V+ +D + + Sbjct: 212 ---------------------RDTDDEQFVAFHPSAAAYVSQID-TVEALRDQD-SFSDR 248 Query: 305 ITLTSSFGATRIEPDKIL 322 I +G + P ++ Sbjct: 249 IRALHVYGGKVVRPTGVV 266 >gi|109392192|ref|YP_655002.1| gp6 [Mycobacterium phage Llij] gi|109522090|ref|YP_655767.1| gp6 [Mycobacterium phage PMC] gi|88910293|gb|ABD58222.1| gp6 [Mycobacterium phage Llij] gi|91980790|gb|ABE67507.1| gp6 [Mycobacterium phage PMC] Length = 273 Score = 54.5 bits (129), Expect = 2e-05, Method: Composition-based stats. 
Identities = 30/318 (9%), Positives = 66/318 (20%), Gaps = 59/318 (18%) Query: 9 TANIYEFKKHVELALQETKSKLRPTV-TEQATEGEASALVEVFKPTEAH---EIVGDMPD 64 I E + L ++ V E +V + Sbjct: 4 NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQT 63 Query: 65 TIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGM 124 + +D + + +D + + Y A+ D+ I + Sbjct: 64 SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADML 122 Query: 125 LGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVL 184 + + A K + V V Sbjct: 123 VDNG----------------TALTGSAPTDADDAFDLIAKALKELTKANVPNVGRVVVVN 166 Query: 185 IPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGL 244 W + ++ + A L+AG I G + + Sbjct: 167 AEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL--------------- 211 Query: 245 IDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQ 304 + SA + + V+ +D + + Sbjct: 212 ---------------------RDTDDEQFVAFHPSAAAYVSQID-TVEALRDQD-SFSDR 248 Query: 305 ITLTSSFGATRIEPDKIL 322 I +G + P ++ Sbjct: 249 IRALHVYGGKVVRPTGVV 266 >gi|294782221|ref|ZP_06747547.1| major head protein [Fusobacterium sp. 1_1_41FAA] gi|294480862|gb|EFG28637.1| major head protein [Fusobacterium sp. 1_1_41FAA] Length = 282 Score = 54.2 bits (128), Expect = 3e-05, Method: Composition-based stats. 
Identities = 37/234 (15%), Positives = 63/234 (26%), Gaps = 31/234 (13%) Query: 1 MATKEQLATANIYEFKKHVELALQETKSKLR--PTVTEQA---TEGEASALVEVFKPTEA 55 MA + ++ I E + + QE KL P V + + Sbjct: 1 MAGETKVEHLIIPEVLED--MVRQELPHKLVFGPLVDINNKLEGVPGNVLTIPKWGLLGI 58 Query: 56 HEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRK 115 E V ++ Y + V + A L +PL S T ++ RK Sbjct: 59 AEDVAELGAVPYENLTTSKTEVTIKKIAKGVHFSDEALLSGYGDPLGEGVSQLTVSIARK 118 Query: 116 QDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYID 175 D +L + K + L A + F ++ Sbjct: 119 IDSDVLDEIKKAKLKYNRKSVK-------------------LSYDVLADALTKFGEK--- 156 Query: 176 VDSEQVYVLIPSDVWASLFALERATSKDYINTAAL-QAGKIEAFAGVWFINMEK 228 + I D +A L + + I L G I G+ + Sbjct: 157 -IDNPRVIFITPDQYAELRKDKNFLALKDIAGKPLMMTGVIGELCGIQLVVTSN 209 >gi|255010170|ref|ZP_05282296.1| hypothetical protein Bfra3_13600 [Bacteroides fragilis 3_1_12] gi|313147965|ref|ZP_07810158.1| predicted protein [Bacteroides fragilis 3_1_12] gi|313136732|gb|EFR54092.1| predicted protein [Bacteroides fragilis 3_1_12] Length = 323 Score = 54.2 bits (128), Expect = 3e-05, Method: Composition-based stats. 
Identities = 31/301 (10%), Positives = 75/301 (24%), Gaps = 33/301 (10%) Query: 39 TEGEASALVEVFKPTEAHEIV-GDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSG 97 + + +P T+ TD D + I T++ Sbjct: 33 VNNGKIVHIPNAGAASGTKKNRTSLPATVTKRTDIDVTFPLDEYTTDPVLIPNADTVELS 92 Query: 98 INPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKT 157 + + QD+ L + + ET + G Sbjct: 93 YDKRESVLRQDKLKL---QDDVALDFVFNWSPAAAQCIETTGTEIDAYTDKATGK-RKGI 148 Query: 158 FIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFAL-ERATSKDYINTAALQAGKIE 216 ++ + F I + Y+L+ + +++ L + ++ +A Q G + Sbjct: 149 CKADVLGLMTKFNNDDIPQEG--RYLLLDAQMYSQLLNSLTENENTAFLASADAQNGILG 206 Query: 217 AFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIY 276 + +V K+ + + Sbjct: 207 KLFSFNIMMRSRV--------------------ALYTAAKAPKAWSAAGAATDLAAGLAW 246 Query: 277 CKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKILGIEISKDSLKGVPV 336 + +V + ++ ++ + G + DK I ++G PV Sbjct: 247 HEQSVCRALG-EVKAFENEGDATYYGDIYSFLVRAGGRIMREDKKGVI----ALVQGTPV 301 Query: 337 L 337 Sbjct: 302 A 302 >gi|29565772|ref|NP_817344.1| gp6 [Mycobacterium phage Che8] gi|29424497|gb|AAN12404.1| gp6 [Mycobacterium phage Che8] Length = 273 Score = 53.8 bits (127), Expect = 3e-05, Method: Composition-based stats. 
Identities = 30/318 (9%), Positives = 67/318 (21%), Gaps = 59/318 (18%) Query: 9 TANIYEFKKHVELALQETKSKLRPTV-TEQATEGEASALVEVFKPTEAH---EIVGDMPD 64 I E + L ++ V E +V + Sbjct: 4 NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQT 63 Query: 65 TIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGM 124 + +D + + +D + + Y A+ D+ I + Sbjct: 64 SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADML 122 Query: 125 LGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVL 184 + + +A K + V V Sbjct: 123 VDNG----------------TALTGSAPSDADDAFDLIASALKELTKANVPNVGRVVVVN 166 Query: 185 IPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGL 244 W + ++ + A L+AG I G + + Sbjct: 167 AEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL--------------- 211 Query: 245 IDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQ 304 + SA + + V+ +D + + Sbjct: 212 ---------------------RDTDDEQFVAFHPSAAAYVSQID-TVEALRDQD-SFSDR 248 Query: 305 ITLTSSFGATRIEPDKIL 322 I +G + P ++ Sbjct: 249 IRALHVYGGKVVRPTGVV 266 >gi|317483978|ref|ZP_07942914.1| hypothetical protein HMPREF0179_00264 [Bilophila wadsworthia 3_1_6] gi|316924767|gb|EFV45917.1| hypothetical protein HMPREF0179_00264 [Bilophila wadsworthia 3_1_6] Length = 350 Score = 53.4 bits (126), Expect = 5e-05, Method: Composition-based stats. 
Identities = 31/301 (10%), Positives = 70/301 (23%), Gaps = 54/301 (17%) Query: 19 VELALQETKSKLRPTVTEQATEG----------EASALVEVFKPTEAHEIVGD-MPDTIY 67 V+ L + LR T AS V + +A + Sbjct: 8 VDKLLAQGLLALRGTCVMPRLVNSDYSNLAAQQGASIDVPIPSAIKAQAVTPGATSQDTG 67 Query: 68 NATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGV 127 + + + +++ + AS A A+ D LG Sbjct: 68 DISPVSATIKLDRWMEAPFYLTDKDLMEANRGVIPMQASEAVKAIAN--DVNATLLGLGR 125 Query: 128 NKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPS 187 G +G + A+ + ++ V+ + +++ Sbjct: 126 KFYGMVGTPGTTPFS---------------TVVDATNARKVLNRQLAPVNDRR--IVLDP 168 Query: 188 DVWASLFALERATS-KDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLID 246 D A+ L + + G I G + ++VP + + Sbjct: 169 DAEAAALGLSGFADVSKSGDARPIIDGTIGRKYGFDWAMDQQVPTFEASVMTEGALTVNG 228 Query: 247 GKVEYPNGKPTVKSSA-----------------------KFEDTKIKYVLPIYCKSAVVF 283 K++ + + + + A+ F Sbjct: 229 ANEAGAQVVSLAKATNAAGLKEGDILTIAGDAQTYVVMEAVTVSGSHVMNLAFHRDAIAF 288 Query: 284 T 284 Sbjct: 289 A 289 >gi|145297109|ref|YP_001139929.1| hypothetical protein cgR_p0014 [Corynebacterium glutamicum R] gi|140847056|dbj|BAF56027.1| hypothetical protein [Corynebacterium glutamicum R] Length = 229 Score = 53.4 bits (126), Expect = 5e-05, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 43/161 (26%), Gaps = 10/161 (6%) Query: 77 VGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAE 136 V Q +A +D + + AM K D + + KIG Sbjct: 4 VMDEQEYYAFLVDDVDKAQAAGDFQGAGTEQHGIAMAAKVDSTVSTKLRDGA-GKKIGNT 62 Query: 137 TEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFAL 196 F + + A L + S +V++ + +L A Sbjct: 63 AIFNGADFYMPASGQATA----WDALRMLSKEL--NKVSAPSLNRWVVVGPEFGDALLAD 116 Query: 197 ERATSKDYINTAAL-QAGKIEAF--AGVWFINMEKVPGNDL 234 T D T A+ + G I G VP Sbjct: 117 RHLTEADKAGTDAVARNGLIATIKTLGFSVFTSNSVPVTAG 157 >gi|333027404|ref|ZP_08455468.1| hypothetical protein STTU_4908 [Streptomyces sp. Tu6071] gi|332747256|gb|EGJ77697.1| hypothetical protein STTU_4908 [Streptomyces sp. 
Tu6071] Length = 316 Score = 52.6 bits (124), Expect = 8e-05, Method: Composition-based stats. Identities = 41/300 (13%), Positives = 87/300 (29%), Gaps = 14/300 (4%) Query: 31 RPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDP 90 R + A + V A E NAT+ V + + + Sbjct: 31 RDYEADFAGRQGDTITVRKPAVFTATEFNRTTGIVPQNATESGFPVVLNHLPDVSFTVTT 90 Query: 91 FATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVE 150 + A AM +K D IL + + AE N Sbjct: 91 EQLTLEIDDFGERLLDPAMEAMAQKIDRDILSLRSDITQTVGEVAENTGGENYN----YP 146 Query: 151 GDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAAL 210 G + + LI A ++ + + V V + + RA+ + Sbjct: 147 GGAYPWSDSRVLIEAGALLDTKNVPAADRNVVVGPRTKARWMAEKIWRASDQRGSTVGLT 206 Query: 211 QAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIK 270 +A +G + + G P + +D ++ Sbjct: 207 EAQFGANASGFTPYMSQNITGPAADPETGEPTTEVDVAFHRTAFALVTRTLEIPPGA--- 263 Query: 271 YVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKILGIEISKDS 330 +A+V + A+ V + D K+ +++ +G ++P++ + I+ + + Sbjct: 264 ------QDAAIVPYKGFALRVVYDYDI-KFKQTVVSVDCLYGVKTLDPNRAVLIKGADAA 316 >gi|33300843|ref|NP_877471.1| capsid protein [Pseudomonas phage phiKMV] gi|167600478|ref|YP_001671977.1| major capsid protein [Pseudomonas phage LUZ19] gi|195546677|ref|YP_002117758.1| major capsid protein [Pseudomonas phage PT5] gi|195546739|ref|YP_002117817.1| capsid protein [Pseudomonas phage PT2] gi|33284814|emb|CAD44223.1| capsid protein [Enterobacteria phage phiKMV] gi|158187638|gb|ABW23115.1| major capsid protein [Pseudomonas phage PT5] gi|161168341|emb|CAP45505.1| major capsid protein [Pseudomonas phage LUZ19] gi|165880748|gb|ABY71003.1| capsid protein [Pseudomonas phage PT2] Length = 335 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. 
Identities = 35/307 (11%), Positives = 79/307 (25%), Gaps = 27/307 (8%) Query: 28 SKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAER 87 SK P + + G ++ EA + + + Sbjct: 38 SKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGEELERSRVVNDKWNLTVDTLLYLRHQ 97 Query: 88 IDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILS 147 D + A L + RK D+A L ++ + FS + Sbjct: 98 FDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLEK 157 Query: 148 AVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYV-----LIPSDVWASLFALERATSK 202 K +++ + +ID D + V++ L ++ + Sbjct: 158 LDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNV 217 Query: 203 DYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSA 262 +Y T A +V + +P G + Sbjct: 218 EYQATGATND-----------YVKSRVAILNGVKVLETPRFATKAIAAHPLG----RHFN 262 Query: 263 KFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWH----APQITLTSSFGATRIEP 318 + + + ++ Q + + +D K+ Q+ GA R + Sbjct: 263 VSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTFQMYN---IGARRPDT 319 Query: 319 DKILGIE 325 + ++ Sbjct: 320 AGAIELK 326 >gi|318057419|ref|ZP_07976142.1| hypothetical protein SSA3_05733 [Streptomyces sp. SA3_actG] gi|318075980|ref|ZP_07983312.1| hypothetical protein SSA3_04519 [Streptomyces sp. SA3_actF] Length = 316 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. 
Identities = 41/300 (13%), Positives = 87/300 (29%), Gaps = 14/300 (4%) Query: 31 RPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDP 90 R + A + V A E NAT+ V + + + Sbjct: 31 RDYEADFAGRQGDTITVRKPAVFTATEFNRTTGIVPQNATESGFPVVLNHLPDVSFTVTT 90 Query: 91 FATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVE 150 + A AM +K D IL + + AE N Sbjct: 91 EQLTLEIDDFGERLLDPAMEAMAQKIDRDILSLRSDITQTVGEVAENTGGENYN----YP 146 Query: 151 GDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAAL 210 G + + LI A ++ + + V V + + RA+ + Sbjct: 147 GGAYPWSDSRVLIEAGALLDTKNVPAADRNVVVGPRTKARWMAEKIWRASDQRGSTVGLT 206 Query: 211 QAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIK 270 +A +G + + G P + +D ++ Sbjct: 207 EAQFGANASGFTPYMSQNIAGPAADPETGEPTTEVDVAFHRTAFALVTRTLEIPPGA--- 263 Query: 271 YVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKILGIEISKDS 330 +A+V + A+ V + D K+ +++ +G ++P++ + I+ + + Sbjct: 264 ------QDAAIVPYKGFALRVVYDYDI-KFKQTVVSVDCLYGVKTLDPNRAVLIKGADAA 316 >gi|270297112|ref|ZP_06203311.1| conserved hypothetical protein [Bacteroides sp. D20] gi|270273099|gb|EFA18962.1| conserved hypothetical protein [Bacteroides sp. D20] Length = 349 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. 
Identities = 21/191 (10%), Positives = 44/191 (23%), Gaps = 12/191 (6%) Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 P + N T+ D A R+ + + A+ + + + Sbjct: 66 PIPVQNLTEGDIPIGLDKYQTKATRVTDDQLYAISYDKFSTDVQRHSNAIDTAKYKKAIH 125 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + + K +I K F K + D ++ Sbjct: 126 ALSPYSNTKTTPVVPTSGE-------ADATGRKKMTRKDVIALKRAFDKAEVPTDGRRLV 178 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 + P + L ++ + Y GK+ G P F Sbjct: 179 LC-PDHINDLLEEDQKFREQYYN----YTTGKVTNMYGFEIYEFVNCPYFTNAGVKVPFG 233 Query: 243 GLIDGKVEYPN 253 + Sbjct: 234 TSPAETDMQAS 244 >gi|307545233|ref|YP_003897712.1| hypothetical protein HELO_2643 [Halomonas elongata DSM 2581] gi|307217257|emb|CBV42527.1| hypothetical protein HELO_2643 [Halomonas elongata DSM 2581] Length = 255 Score = 51.8 bits (122), Expect = 1e-04, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 53/186 (28%), Gaps = 5/186 (2%) Query: 78 GHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIG--A 135 +A ++ S I + ++ A M D IL + G A Sbjct: 34 IDKAKYFAFEVNDIDAYQSDIKLMDDWSDDAGQQMKIAIDTVILGDVYADAAPENAGPDA 93 Query: 136 ETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLF- 194 + S S + ++ S+ ++ Y+++P+ + L Sbjct: 94 GVKSGSYNMGESGAPVSITKSNILDTIVDCGSVLDEQNAPDTG--RYIILPAWMNGMLKK 151 Query: 195 ALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNG 254 + R S +T+ + GK+ + K ++ G + Sbjct: 152 SDLRDASAMGDSTSVYRNGKVGMLDRFDVYVSNNLSTVTDATTSKKATNVLFGHKKALTF 211 Query: 255 KPTVKS 260 + + Sbjct: 212 ASQMTN 217 >gi|261368683|ref|ZP_05981566.1| major head protein [Subdoligranulum variabile DSM 15176] gi|282569278|gb|EFB74813.1| major head protein [Subdoligranulum variabile DSM 15176] Length = 285 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. 
Identities = 38/293 (12%), Positives = 76/293 (25%), Gaps = 48/293 (16%) Query: 36 EQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLD 95 A + V + + V + + R + + A L Sbjct: 40 TLAGVPGDTITVPAYTYIGDADDVAEGGEVAIEKMTTSTRKATIKKAMKGIGLTDEAVLS 99 Query: 96 SGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFF 155 NP+ + A+ K D + +L + Sbjct: 100 GYGNPVGEANTQLALAIAAKIDNDCMDALLTASLSYD-------------------GSAN 140 Query: 156 KTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKI 215 ++ A +F + ++ S + I L S D G+I Sbjct: 141 TISYNGIVDAVDLFEE---EMGSSDKVMFIHPKQVTQLRKNADFISADKYQAGVALTGEI 197 Query: 216 EAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPI 275 AG + +KVP + D +V+ T I Sbjct: 198 GMIAGCRLVPSKKVPLSGGVYTCPIVKLESDPEVDDEIPALT-----------------I 240 Query: 276 YCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKILGIEISK 328 Y K + ++++ + P K +IT + A K++ + + Sbjct: 241 YRK--------RDVNIETERKP-KTRTTEITADEFYVAVLSNEAKVVLAKFKE 284 >gi|281357151|ref|ZP_06243640.1| conserved hypothetical protein [Victivallis vadensis ATCC BAA-548] gi|281316182|gb|EFB00207.1| conserved hypothetical protein [Victivallis vadensis ATCC BAA-548] Length = 299 Score = 51.5 bits (121), Expect = 2e-04, Method: Composition-based stats. 
Identities = 20/234 (8%), Positives = 60/234 (25%), Gaps = 9/234 (3%) Query: 59 VGDMPDTIYNATDQD-RRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQD 117 TI T + + + +A D S + + A + + Sbjct: 64 TPRTDLTIEEITAPELVQLLIDKGKYYAINADEVEQHQSDVPYIQRAVQRAVTKLKETIE 123 Query: 118 EAILKGMLGVNKKGKIGAETEFFSKE--NILSAVEGDDFFKTFIGQLITAKSIFRKRYID 175 + + + GA S ++ + + +I ++ ++ + Sbjct: 124 GEFVNAIYADAAEKNFGATAGEKSGAFNLGITGTPVELTKDNVLDWIIDCGTVLDEQNLP 183 Query: 176 VDSEQVYVLIPSDVWASLF-ALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDL 234 E ++++P V + + + ++L+ G I I ++ Sbjct: 184 N--ENRWMVLPFAVVNRIKKSEIKEVYITGDKQSSLRTGNIGMIDRFNIIATNQLNKTGD 241 Query: 235 FPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIY---CKSAVVFTQ 285 + +F + + A+ ++ Sbjct: 242 NWQPVFGHKDAISFATQIVKNESCPREKRFGKVYKGLTVYGWKTVHPEALGYSV 295 >gi|224542957|ref|ZP_03683496.1| hypothetical protein CATMIT_02151 [Catenibacterium mitsuokai DSM 15897] gi|224524095|gb|EEF93200.1| hypothetical protein CATMIT_02151 [Catenibacterium mitsuokai DSM 15897] Length = 273 Score = 51.1 bits (120), Expect = 2e-04, Method: Composition-based stats. Identities = 29/225 (12%), Positives = 66/225 (29%), Gaps = 23/225 (10%) Query: 12 IYEFKKHVELALQETKS---KLRPTVTEQATEGEASALVEVFKPTEA-HEIVGDMPDTIY 67 + E + LA Q S KL + A + V A + ++ Sbjct: 7 LQERYSSLVLAKQRKTSLFIKLFNKNYDGTPTAGA-VKIPVRDTEVAVNAYDKTNGVSLT 65 Query: 68 NATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGV 127 ++ + V + E ID + A +M K D + ++ Sbjct: 66 SSATSYKVLVIDNDNAVNELIDNHTAASVPDGLVAERLDSAGYSMAMKIDTDLGDELVAK 125 Query: 128 NKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPS 187 T +I A++ RK +I ++++ + + Sbjct: 126 G----------------TAITDTKALTKTTVYDAIIDARTQARKAHIAPS--EMWLAVST 167 Query: 188 DVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGN 232 +++ L ++ + + +Q G I G+ + + Sbjct: 168 EMYGLLLKSDQFVRASDLGDSVVQTGAIGKIGGILVYEADNLTDA 212 >gi|298383667|ref|ZP_06993228.1| hypothetical protein HMPREF9007_00220 [Bacteroides sp. 1_1_14] gi|298263271|gb|EFI06134.1| hypothetical protein HMPREF9007_00220 [Bacteroides sp. 
1_1_14] Length = 307 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 17/205 (8%), Positives = 42/205 (20%), Gaps = 10/205 (4%) Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 P + D+D A I + +++ + Sbjct: 69 PIDVQALEDKDIAIKLDKFQTKATPITDDELYAISYDKTARVKEGHANSINDAKFTKAAH 128 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + + + + L+ K + D ++ Sbjct: 129 ALCANKN------TETTPVLKTTGEKDPATNRLRLTVNDLVEMKRALDNLRVPSDGRRLV 182 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 + D L +A + Y +GK+ G P Sbjct: 183 LC--PDHVNDLLLTSQAFREQYNIDR--NSGKVGNLYGFEIYEYGNNPLYTTAGVKKALG 238 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDT 267 + + + K + Sbjct: 239 TTAEAGEFPCSFAFYKQRVFKATGS 263 >gi|237708128|ref|ZP_04538609.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|229457956|gb|EEO63677.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] Length = 307 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 17/205 (8%), Positives = 42/205 (20%), Gaps = 10/205 (4%) Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 P + D+D A I + +++ + Sbjct: 69 PIDVQALEDKDIAIKLDKFQTKATPITDDELYAISYDKTARVKEGHANSINDAKFTKAAH 128 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + + + + L+ K + D ++ Sbjct: 129 ALCANKN------TATTPVLKTTGEKDPATNRLRLTVNDLVEMKRALDNLRVPSDGRRLV 182 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 + D L +A + Y +GK+ G P Sbjct: 183 LC--PDHVNDLLLTSQAFREQYNIDR--NSGKVGNLYGFEIYEYGNNPLYTTAGVKKALG 238 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDT 267 + + + K + Sbjct: 239 ATAEAGEFPCSFAFYKQRVFKATGS 263 >gi|158345059|ref|YP_001522824.1| capsid protein [Pseudomonas phage LKD16] gi|114796412|emb|CAK25968.1| capsid protein [Pseudomonas phage LKD16] Length = 335 Score = 49.9 bits (117), Expect = 5e-04, Method: Composition-based stats. 
Identities = 34/310 (10%), Positives = 80/310 (25%), Gaps = 27/310 (8%) Query: 28 SKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAER 87 SK P + + G ++ EA + + + Sbjct: 38 SKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGEELERSRVVNDKWNLTVDTLLYLRHQ 97 Query: 88 IDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILS 147 D + A L + RK D+A L ++ + FS + Sbjct: 98 FDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLEK 157 Query: 148 AVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYV-----LIPSDVWASLFALERATSK 202 K +++ + +I+ D + V++ L ++ S Sbjct: 158 LDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSV 217 Query: 203 DYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSA 262 +Y T A +V + +P G + Sbjct: 218 EYQATGATND-----------YVKSRVAILNGVKVLETPRFATKAISAHPLG----RHFN 262 Query: 263 KFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWH----APQITLTSSFGATRIEP 318 + + + ++ Q + + +D ++ Q+ GA R + Sbjct: 263 VSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSWVLDTFQMYN---IGARRPDT 319 Query: 319 DKILGIEISK 328 + ++ + Sbjct: 320 AGAIELKGIE 329 >gi|255994023|ref|ZP_05427158.1| hypothetical protein GCWU000322_00080 [Eubacterium saphenum ATCC 49989] gi|255993691|gb|EEU03780.1| hypothetical protein GCWU000322_00080 [Eubacterium saphenum ATCC 49989] Length = 286 Score = 49.9 bits (117), Expect = 5e-04, Method: Composition-based stats. 
Identities = 21/228 (9%), Positives = 58/228 (25%), Gaps = 32/228 (14%) Query: 24 QETKSKLRPTVTEQATEGEASALVE-----VFKPTEAHEIVGDMPDTIYNATDQD----- 73 QE S L + + + + + V D T+ + + Sbjct: 6 QEKYSSLVDMKLRKTLVTQDNLIFNNRYEGDPAAGVVNIPVRDTEVTVEDYNKSNGMGIK 65 Query: 74 ------RRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGV 127 + + E ID + + A ++ + D + + Sbjct: 66 EGGTTYIKLNLDNDIAVNELIDGYDAAAVPDGIVAERLDSAGYSLSQVVDVRSITALEKA 125 Query: 128 NKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPS 187 ++ ++ +++ A S + + D +++ Sbjct: 126 QDMN--------------IAKLKTATAEGKAYEEVLKAMSTLTRVGVPQDG--RWLIASP 169 Query: 188 DVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLF 235 + +A+L + + G + + AG + D Sbjct: 170 EFYATLLNSPQFIKQTDPAKTLNDLGLVGSVAGFAVYVSNNLAFEDST 217 >gi|332878979|ref|ZP_08446692.1| hypothetical protein HMPREF9074_02443 [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332683086|gb|EGJ55970.1| hypothetical protein HMPREF9074_02443 [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 359 Score = 49.5 bits (116), Expect = 7e-04, Method: Composition-based stats. Identities = 17/205 (8%), Positives = 42/205 (20%), Gaps = 10/205 (4%) Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 P + D+D A I + +++ + Sbjct: 121 PIDVQALEDKDIAIKLDKFQTKATPITDDELYAISYDKTARVKEGHANSINDAKFTKAAH 180 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + + + + L+ K + D ++ Sbjct: 181 ALCANKN------TETTPVLKTTGEKDPATNRLRLTVNDLVEMKRALDNLRVPSDGRRLV 234 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 + D L +A + Y +GK+ G P Sbjct: 235 LC--PDHVNDLLLTSQAFREQYNIDR--NSGKVGNLYGFEIYEYGNNPLYTTAGVKKALG 290 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDT 267 + + + K + Sbjct: 291 TTAEAGEFPCSFAFYKQRVFKATGS 315 >gi|254882974|ref|ZP_05255684.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|319643116|ref|ZP_07997747.1| hypothetical protein HMPREF9011_03348 [Bacteroides sp. 
3_1_40A] gi|254835767|gb|EET16076.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|317385284|gb|EFV66232.1| hypothetical protein HMPREF9011_03348 [Bacteroides sp. 3_1_40A] Length = 354 Score = 49.5 bits (116), Expect = 7e-04, Method: Composition-based stats. Identities = 17/205 (8%), Positives = 42/205 (20%), Gaps = 10/205 (4%) Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 P + D+D A I + +++ + Sbjct: 116 PIDVQALEDKDIAIKLDKFQTKATPITDDELYAISYDKTARVKEGHANSINDAKFTKAAH 175 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + + + + L+ K + D ++ Sbjct: 176 ALCANKN------TETTPVLKTTGEKDPATNRLRLTVNDLVEMKRALDNLRVPSDGRRLV 229 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 + D L +A + Y +GK+ G P Sbjct: 230 LC--PDHVNDLLLTSQAFREQYNIDR--NSGKVGNLYGFEIYEYGNNPLYTTAGVKKALG 285 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDT 267 + + + K + Sbjct: 286 TTAEAGEFPCSFAFYKQRVFKATGS 310 >gi|301309975|ref|ZP_07215914.1| hypothetical protein HMPREF9008_00325 [Bacteroides sp. 20_3] gi|300831549|gb|EFK62180.1| hypothetical protein HMPREF9008_00325 [Bacteroides sp. 20_3] Length = 359 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. 
Identities = 17/205 (8%), Positives = 42/205 (20%), Gaps = 10/205 (4%) Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 P + D+D A I + +++ + Sbjct: 121 PIDVQALEDKDIAIKLDKFQTKATPITDDELYAISYDKTARVKEGHANSINDAKFTKAAH 180 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + + + + L+ K + D ++ Sbjct: 181 ALCANKN------TATTPVLKTTGEKDPATNRLRLTVNDLVEMKRALDNLRVPSDGRRLV 234 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 + D L +A + Y +GK+ G P Sbjct: 235 LC--PDHVNDLLLTSQAFREQYNIDR--NSGKVGNLYGFEIYEYGNNPLYTTAGVKKALG 290 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDT 267 + + + K + Sbjct: 291 ATAEAGEFPCSFAFYKQRVFKATGS 315 >gi|325279518|ref|YP_004252060.1| hypothetical protein Odosp_0803 [Odoribacter splanchnicus DSM 20712] gi|324311327|gb|ADY31880.1| hypothetical protein Odosp_0803 [Odoribacter splanchnicus DSM 20712] Length = 303 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 26/301 (8%), Positives = 74/301 (24%), Gaps = 33/301 (10%) Query: 39 TEGEASALVEVFKPTEAHEIV-GDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSG 97 + + ++P + TD D + I T++ Sbjct: 33 VNNGKIVHIPNAGAASGTKKNRTELPAKVTKRTDIDVTFPLDEYTTDPVLIPNADTVELS 92 Query: 98 INPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKT 157 + + QD+ L + + ++ + + Sbjct: 93 YDKRESVLRQDKLKL---QDDVALDFIFNWSPAA-AQCIETTGAEIDAYTDKATGKRKGI 148 Query: 158 FIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFAL-ERATSKDYINTAALQAGKIE 216 ++ + F I + Y+L+ + +++ L + ++ +A Q G + Sbjct: 149 CKADVLGLMTKFNNDDIPQEG--RYLLLDAQMYSQLLNSLTENENTAFLASADAQNGILG 206 Query: 217 AFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIY 276 + + K+ + + Sbjct: 207 KLFSFNIMMRSR--------------------AALYTAAKAPKTWSTAGAATDLAAGLAW 246 Query: 277 CKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKILGIEISKDSLKGVPV 336 + +V + ++ ++ + G + DK I ++G PV Sbjct: 247 HEQSVCRALG-EVKAFENEGDATYYGDIYSFLVRAGGRIMREDKKGVI----ALVQGTPV 301 Query: 337 L 337 Sbjct: 302 A 302 >gi|291460125|ref|ZP_06599515.1| major head 
protein [Oribacterium sp. oral taxon 078 str. F0262] gi|291417466|gb|EFE91185.1| major head protein [Oribacterium sp. oral taxon 078 str. F0262] Length = 296 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 36/300 (12%), Positives = 79/300 (26%), Gaps = 41/300 (13%) Query: 26 TKSKLRPTVTEQATEGEASALVEVFK-PTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGW 84 S T + + + V + +A ++ + + Sbjct: 29 KFSPFAKVDTTLSGQPGDTITVPKYAYIGDAEDVAEGVAI-GTVVLTASTTTAQVKKAAK 87 Query: 85 AERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKEN 144 A I A L +P+ A+ T A+ K D + + G + Sbjct: 88 AVEITDEAALSGYGDPIGEAANQLTMAIAAKVDNDCYEALKGATLQYD------------ 135 Query: 145 ILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERA--TSK 202 ++ A F + + + L +K Sbjct: 136 -------GSAKIISYEGIVDAVDKFG--DETDAGVNKIIFVHPNQVTQLRKDPNFLDINK 186 Query: 203 DYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSA 262 I + +G I + AG + +KV + ++ PN P +A Sbjct: 187 YPIANGVIMSGTIGSIAGCRVVKSKKVALDSG--NAYYLNPIMVDDSADPNEDPAADKTA 244 Query: 263 KFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKIL 322 SA+ ++ ++ + +D K ++ + A K++ Sbjct: 245 TVS-------------SALTIYLKRDVNTETDRDILKK-TTVLSADEHYTAVLSNESKVV 290 >gi|255527914|ref|ZP_05394757.1| conserved hypothetical protein [Clostridium carboxidivorans P7] gi|255508379|gb|EET84776.1| conserved hypothetical protein [Clostridium carboxidivorans P7] Length = 278 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. 
Identities = 46/324 (14%), Positives = 85/324 (26%), Gaps = 65/324 (20%) Query: 12 IYEFKKHVELALQETKSKLRPTVTEQA------TEGEASAL--VEVFKP-TEAHEIVGDM 62 + E V + ++ SK V A GE K +A E+V Sbjct: 4 VPEIYAQVVI--EKMGSK--ALVKNMATDLGVIISGEKGDTISFPRSKRIGDATEVVKGT 59 Query: 63 PDTIYNATDQDRRWVGHSQFGWA-ERIDPFATLDSGINPLLPYASLATAAMHRKQDEAIL 121 T D D Q RI ++ + A + ++ K D ++ Sbjct: 60 AKTP-AELDFDEVKAVIKQMEAPPVRIYDKTQKEALGYEIQNAAKQQSDSLDYKFDLDLI 118 Query: 122 KGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQV 181 + M V + ++ A +F D Sbjct: 119 EEMD------------------TTDLKVHAANAKAITSNEIDEALLLFGDDRNVEDFTNG 160 Query: 182 YVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKF 241 ++I S + S + TS AL + G Sbjct: 161 GIIIHSALITSFTNMVGFTSASNTTVTALNGIARKNCLGF-------------------- 200 Query: 242 PGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWH 301 G P + ++ E I A+ + R+ + V+ P + Sbjct: 201 ----------YQGIPVIFTNHGTEKNGEYRS-FILKNDAIGYKIRQGLTVE-DFRPEGLY 248 Query: 302 APQITLTSSFGATRIEPDKILGIE 325 A + + + IE + + I+ Sbjct: 249 ATDLYSSMMYAVKLIEEESCVSIK 272 >gi|292496050|gb|ADE29151.1| hypothetical protein [uncultured virus] Length = 279 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. 
Identities = 19/208 (9%), Positives = 50/208 (24%), Gaps = 15/208 (7%) Query: 25 ETKSKLRPTVTEQATEGEASALVEVF----KPTEAHEIVGDMPDTIYNATDQDRRWVGHS 80 + + R + + ++ K E + D + Sbjct: 22 QENTVFRDAFRNISIPDRTGSTFDIPVPEDKLGEPTVREPGAEFDYGRE-EYDAVTLERE 80 Query: 81 QFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFF 140 ++ RI D+ L + M K D + + Sbjct: 81 EYASGSRITEEEIADNSFALLEDHIDRHAQKMAEKLDAEAFEVLNAAATSAA-------- 132 Query: 141 SKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERAT 200 ++++ D+ +I + R + + ++V + + Sbjct: 133 PQDDVALPAGSDNGDDMTFEDVIEGMEVLESREGGYEGDILFVGTDAK--NGIVRDLSDR 190 Query: 201 SKDYINTAALQAGKIEAFAGVWFINMEK 228 + + G + +AGV Sbjct: 191 GTELGDNTITGNGVVTNYAGVDIAFSNN 218 >gi|299142224|ref|ZP_07035357.1| hypothetical protein HMPREF0665_01814 [Prevotella oris C735] gi|298576313|gb|EFI48186.1| hypothetical protein HMPREF0665_01814 [Prevotella oris C735] Length = 344 Score = 47.6 bits (111), Expect = 0.002, Method: Composition-based stats. Identities = 14/189 (7%), Positives = 39/189 (20%), Gaps = 10/189 (5%) Query: 79 HSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETE 138 I + + + A++ + + K Sbjct: 121 DKFQTKVTPITDDELYAASYDKMARVKESHANALNDSKFTKAAHALCAQQDSAKT----- 175 Query: 139 FFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALER 198 + + + ++ K+ K + ++ ++ + D L + + Sbjct: 176 -PILKTTGERDATTGRLRLTMTDVVALKAAMDKLGVPAENRRLVLC--PDHANDLLLVSQ 232 Query: 199 ATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTV 258 + Y A GK+ G P + Sbjct: 233 TFREQYNIDRA--TGKVGKLYGFDVYEYANTPLYTQAGKKKNLGVAAGDGEFNCSFAFYT 290 Query: 259 KSSAKFEDT 267 K + Sbjct: 291 PRVFKATGS 299 >gi|326693187|ref|ZP_08230192.1| hypothetical protein LargK3_05575 [Leuconostoc argentinum KCTC 3773] Length = 271 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. 
Identities = 40/261 (15%), Positives = 68/261 (26%), Gaps = 26/261 (9%) Query: 30 LRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERID 89 L T + F + V + + + V + I Sbjct: 33 LASVDTTLQGRSGDTLKFPAFTYIGDAKDVAEGEAIPLDKLGTTAKSVTIKKAAKGTEIT 92 Query: 90 PFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAV 149 A L +P+ A+ K D IL L ++ K A + Sbjct: 93 DEAVLSGYGDPVGESTKQLGLAIANKVDNDILAAALTASQTVKFFATS------------ 140 Query: 150 EGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAA 209 + A ++F K DS V + P+D A A + ++ A Sbjct: 141 ----------DGVQLALTLFAKNNDQDDSPVVALFNPADAAALRKAARAEGTGSDVSQNA 190 Query: 210 LQAGKIEAFAGVWFINMEKVPGNDLF----PAGTKFPGLIDGKVEYPNGKPTVKSSAKFE 265 L G GV I KV L+ + + + + Sbjct: 191 LVNGTKFEVLGVQIIESNKVTAGQAIYIKVNPSVPALKLVMKRAAEVEDQRNIINKTTVL 250 Query: 266 DTKIKYVLPIYCKSAVVFTQR 286 Y +Y + VV + Sbjct: 251 TADEHYAAYLYDPTKVVVAKG 271 >gi|257464332|ref|ZP_05628710.1| putative major head protein [Fusobacterium sp. D12] gi|317061840|ref|ZP_07926325.1| conserved hypothetical protein [Fusobacterium sp. D12] gi|313687516|gb|EFS24351.1| conserved hypothetical protein [Fusobacterium sp. D12] Length = 280 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. 
Identities = 26/188 (13%), Positives = 50/188 (26%), Gaps = 24/188 (12%) Query: 42 EASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPL 101 + + + + V ++ Y + + G I A L P+ Sbjct: 45 GDTITLPKWGLIGPAQDVAELEQIPYEEMSSSKTTATIKKVGKGIAISDEARLSGLGKPI 104 Query: 102 LPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQ 161 A ++ RK D L + G GA + Sbjct: 105 DEAAEQLAISVARKIDADALTALKGAKLTFGKGATELGYEL------------------- 145 Query: 162 LITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAA-LQAGKIEAFAG 220 L A + F + + + + D ++ L + S + L +G + A AG Sbjct: 146 LCDALTKFGE----EIDTEKVLFVTPDQYSMLRKNKDFLSLKDLAGTPILFSGVVGAIAG 201 Query: 221 VWFINMEK 228 Sbjct: 202 CQIAVTSN 209 >gi|291335772|gb|ADD95374.1| hypothetical protein [uncultured phage MedDCM-OCT-S05-C429] Length = 100 Score = 47.2 bits (110), Expect = 0.003, Method: Composition-based stats. Identities = 7/61 (11%), Positives = 15/61 (24%), Gaps = 1/61 (1%) Query: 9 TANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYN 68 + F + Q ++ R V ++ + S T A + Sbjct: 26 ALYLKLFSGEMFKGFQ-HETIARDMVMKRTLKNGKSLQFIYTGRTTAEFHTPGNSILGNS 84 Query: 69 A 69 Sbjct: 85 D 85 >gi|153806467|ref|ZP_01959135.1| hypothetical protein BACCAC_00731 [Bacteroides caccae ATCC 43185] gi|149131144|gb|EDM22350.1| hypothetical protein BACCAC_00731 [Bacteroides caccae ATCC 43185] Length = 357 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. 
Identities = 26/205 (12%), Positives = 47/205 (22%), Gaps = 9/205 (4%) Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 P I TD D + I + + + A+ K+ + Sbjct: 114 PIAIQQLTDTDAVFSLDKFQTKPTSITDDELYALSYDKMASVKERHSQALLVKKYAKAIH 173 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + K + +I K F + + ++ Sbjct: 174 ALAPDSNAAKT----PVLKTTGDVEGGAATGRRMMQRSDIIALKKKFDVMQVPTE-DRRL 228 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 VL P V L ++ + Y GKI G P F Sbjct: 229 VLCPDHVNDLLMQDQKFAEQYYN----YTTGKIANLYGFQVYEFVNNPVYKAAGTKVAFG 284 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDT 267 + K + K + Sbjct: 285 TAAGANEFQASVAFYGKMTFKATGS 309 >gi|281306689|ref|YP_003345495.1| predicted phage capsid protein [Pseudomonas phage phi-2] gi|271277994|emb|CBH51600.1| predicted phage capsid protein [Pseudomonas phage phi-2] Length = 330 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 29/305 (9%), Positives = 74/305 (24%), Gaps = 21/305 (6%) Query: 27 KSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAE 86 S+L + + G +A ++ + + + + Sbjct: 37 SSQLASVMNIRQLRGTNTARIDRVGAVKIGGRKTGEKLVSSRVVNDKFTLLVDTVLYARH 96 Query: 87 RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKE--N 144 D F S ++ A A+ ++ D+A L F Sbjct: 97 EFDKFDQWTSDLDMRKETAEEDGIALAKQFDQACLIMAAKCADFVAPAGLEGAFHNGILT 156 Query: 145 ILSAVEGDDFFKTFIGQLITAK-----SIFRKRYIDVDSEQVYVLIPSDVWASLFALERA 199 + + L+ A + + D + + ++ L ++ Sbjct: 157 QATVTGLPGNAEADADALVRAHREGIEQLILRDLSDAVYSEGITFVDPRIFTLLLDHKKL 216 Query: 200 TSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVK 259 + ++ G + FA + V L +I T + Sbjct: 217 MNVEFQAL-----GGVNDFARSRIAVLNGV---RLVETPRVVTEVITDNPLGDAFNVTAE 268 Query: 260 SSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPD 319 + + T I + + + + + + I R + Sbjct: 269 EAKRRMITIIPSKTLVSAQVHAITGDYWEDKREFCWVLDTYQSYNIAQR------RADAA 322 Query: 320 KILGI 324 I+ + Sbjct: 323 AIVEV 327 >gi|256379627|ref|YP_003103287.1| 
hypothetical protein Amir_5625 [Actinosynnema mirum DSM 43827] gi|255923930|gb|ACU39441.1| conserved hypothetical protein [Actinosynnema mirum DSM 43827] Length = 451 Score = 46.1 bits (107), Expect = 0.007, Method: Composition-based stats. Identities = 30/255 (11%), Positives = 53/255 (20%), Gaps = 8/255 (3%) Query: 15 FKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDR 74 ++ + SK V A S G T ++D Sbjct: 122 IDTNMMQSAMTIASKYLRDVQTLAIGAGQSLNQIARNKLYRAYSGGRTWATAGGSSDTSI 181 Query: 75 RWVGHSQFG------WAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVN 128 F + L I + + +A Sbjct: 182 TVASVDGFTHVGVNGVPTPVSASTPLTVSIEGVANTVTGVSAQTGPGTLTLGTARADTTG 241 Query: 129 KKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSD 188 + SA + A RK+ + IP D Sbjct: 242 DSVVAANAPVSYRPAAKSSANALTSSDTATLALFRNAVVRLRKQNVPTVGGFYVAHIPPD 301 Query: 189 VWASLFALERATSKDYINTAA--LQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLID 246 LFA + + + FAG+ ++ + P T L+ Sbjct: 302 TEGQLFADPDFKQAAQGAVESPIYRNLSLGRFAGIDWVRNNETPTVTSATGVTVQRPLVV 361 Query: 247 GKVEYPNGKPTVKSS 261 G+ + Sbjct: 362 GEGALTANPFEGNGN 376 >gi|294085821|ref|YP_003552581.1| hypothetical protein SAR116_2254 [Candidatus Puniceispirillum marinum IMCC1322] gi|292665396|gb|ADE40497.1| hypothetical protein SAR116_2254 [Candidatus Puniceispirillum marinum IMCC1322] Length = 394 Score = 46.1 bits (107), Expect = 0.008, Method: Composition-based stats. 
Identities = 28/228 (12%), Positives = 62/228 (27%), Gaps = 13/228 (5%) Query: 42 EASALVEVFKPTEAHEIVGDMPDTIYN-ATDQDRRWVGHSQFGWAERI---DPFATLDSG 97 + V V P ++ T T + + A D A ++S Sbjct: 42 GDAIDVPVSSPVAVSDVTPGKTFTGNIPDTSISSVSITLDNWKRAAFYLTDDEMAKIESS 101 Query: 98 INPLLPYASLATAAMHRKQDEAILKGMLGVNKK-GKIGAETEFFSKENILSAVEGDDFFK 156 + + + A A+ +++I+ + G G I + Sbjct: 102 ADFIPMQMAEAIHALAGAVNQSIIDTHKLIAHGLGLPGEIPFQPMPSTIADVKDWHGAT- 160 Query: 157 TFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDY-INTAALQAGKI 215 I A+ K + +I D+ A+ L + D +T+ G+I Sbjct: 161 ----CAIQARRFLNKAAAPKTG--RFAIIDYDMEANALGLPQFHDADKAGSTSVPMEGEI 214 Query: 216 EAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAK 263 G+ + + + +P + + + Sbjct: 215 GRKFGIDWFSSDLLPNAGNSVGEVAITQTARAQAMTITVNASHSNINP 262 >gi|313148480|ref|ZP_07810673.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313137247|gb|EFR54607.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 313 Score = 45.7 bits (106), Expect = 0.010, Method: Composition-based stats. 
Identities = 26/244 (10%), Positives = 57/244 (23%), Gaps = 20/244 (8%) Query: 73 DRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGK 132 D I + + A++ + + Sbjct: 79 DIAISLDKFQSKVTPITDDELYAISYDKMARVKESHGNAINDAKFAKAAHALCATE---- 134 Query: 133 IGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWAS 192 + + A E + L+ K K + ++ ++ + D Sbjct: 135 --HTAKTPVLKTTGDADEETGRKRLTPNDLVEMKRALDKLKVPSENRRLVLC--PDHVND 190 Query: 193 LFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYP 252 L + + + Y GK+ G P AG K Sbjct: 191 LLLVSQNFREQYNIDR--NTGKVGNLYGFQVYEYGNNPV--YTTAGKKKAVGAASDTGEF 246 Query: 253 NGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFG 312 + F+ T +Y A +K + Q +K + + + + G Sbjct: 247 QCSFAFYTPRVFKATGSTK---MYFSEA-----QKDPEYQRNKINFRHYFICMFKKADAG 298 Query: 313 ATRI 316 + Sbjct: 299 VVMM 302 >gi|226363450|ref|YP_002781232.1| hypothetical protein ROP_40400 [Rhodococcus opacus B4] gi|226241939|dbj|BAH52287.1| hypothetical protein [Rhodococcus opacus B4] Length = 379 Score = 45.7 bits (106), Expect = 0.010, Method: Composition-based stats. 
Identities = 23/223 (10%), Positives = 53/223 (23%), Gaps = 24/223 (10%) Query: 47 VEVFKPTEAHEI---VGDMPDTIYNATDQDRRW--VGHSQFGWAERIDPFATLDSGINPL 101 + V A I T +++ + A N Sbjct: 48 IRVPGRLTARTRELRATGNARNIIMDTLTEQKIDVTLTTDIYSAVPTTDEELTLDIANFG 107 Query: 102 LPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQ 161 + + A+ ++A+ Sbjct: 108 VQILAPQVRAVAEGMEDAVANEFRSAPYT------------------FTIVVDPAKTHDS 149 Query: 162 LITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGV 221 + A+ + ++V V+ A L + + + AAL+ + AG Sbjct: 150 FVDARKALNDENVP-FGQRVLVVGSGIEAAILKDPQFVHADQSGSDAALREAFVGRIAGF 208 Query: 222 WFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKF 264 I + ++ + ++ P+G P S + Sbjct: 209 DVIVSNSLDDDEGYAFHKTAFTMVTRAPVVPDGAPYGASQSYN 251 >gi|225626359|ref|YP_002727855.1| putative capsid protein [Pseudomonas phage phikF77] gi|225594868|emb|CAX63153.1| putative capsid protein [Pseudomonas phage phikF77] Length = 336 Score = 45.3 bits (105), Expect = 0.014, Method: Composition-based stats. Identities = 31/308 (10%), Positives = 80/308 (25%), Gaps = 28/308 (9%) Query: 28 SKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAER 87 SK P + + G ++ + + + + Sbjct: 38 SKFAPLMNIRDLRGSNVVRLDRLGNVQVKGRRAGEELERSRVVNDKWNLTVDTLLYLRHQ 97 Query: 88 IDPFATLDSGINPLLPYASLATAAMHRKQDEAI-LKGMLGVNKKGKIGAETEFFSKENIL 146 D + A L + RK D+A ++ + + E F Sbjct: 98 FDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDSFSPGVLEK 157 Query: 147 SAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYV-----LIPSDVWASLFALERATS 201 + G K +++ + +I+ D + V++ L ++ + Sbjct: 158 LDLTGVTSSKEAANKIVRMHRKVVESFINRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMN 217 Query: 202 KDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSS 261 +Y T A +V + + +P G + Sbjct: 218 VEYQATGATND-----------YVKSRVAILNGVKVLETPRFATEAIAAHPLG----RHF 262 Query: 262 AKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWH----APQITLTSSFGATRIE 317 + + + ++ Q + + +D ++ Q+ GA R + Sbjct: 263 NVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSWVLDTFQMYN---IGARRPD 319 Query: 318 PDKILGIE 325 + ++ Sbjct: 320 TAGAIELK 327 
>gi|255010669|ref|ZP_05282795.1| hypothetical protein Bfra3_16133 [Bacteroides fragilis 3_1_12] Length = 358 Score = 44.9 bits (104), Expect = 0.019, Method: Composition-based stats. Identities = 26/244 (10%), Positives = 57/244 (23%), Gaps = 20/244 (8%) Query: 73 DRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGK 132 D I + + A++ + + Sbjct: 124 DIAISLDKFQSKVTPITDDELYAISYDKMARVKESHGNAINDAKFAKAAHALCATE---- 179 Query: 133 IGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWAS 192 + + A E + L+ K K + ++ ++ + D Sbjct: 180 --HTAKTPVLKTTGDADEETGRKRLTPNDLVEMKRALDKLKVPSENRRLVLC--PDHVND 235 Query: 193 LFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYP 252 L + + + Y GK+ G P AG K Sbjct: 236 LLLVSQNFREQYNIDR--NTGKVGNLYGFQVYEYGNNPV--YTTAGKKKAVGAASDTGEF 291 Query: 253 NGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFG 312 + F+ T +Y A +K + Q +K + + + + G Sbjct: 292 QCSFAFYTPRVFKATGSTK---MYFSEA-----QKDPEYQRNKINFRHYFICMFKKADAG 343 Query: 313 ATRI 316 + Sbjct: 344 VVMM 347 >gi|238019194|ref|ZP_04599620.1| hypothetical protein VEIDISOL_01058 [Veillonella dispar ATCC 17748] gi|237863893|gb|EEP65183.1| hypothetical protein VEIDISOL_01058 [Veillonella dispar ATCC 17748] Length = 282 Score = 44.9 bits (104), Expect = 0.019, Method: Composition-based stats. 
Identities = 22/246 (8%), Positives = 57/246 (23%), Gaps = 24/246 (9%) Query: 39 TEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGI 98 + + + A E V + + + V G A + A Sbjct: 42 GVPGDTVTIPAWAYIGAAEDVAEGAEVTTATMSASTKTVQIKTAGKAITLTDKAVNSGLG 101 Query: 99 NPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTF 158 +P+ + +M K D +L + + + Sbjct: 102 DPVGQATYQLSLSMADKIDNDVLAAL--------------------GTTTLAATSTKVIS 141 Query: 159 IGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAF 218 ++ A + + + +L ++ N + G+I Sbjct: 142 YEGVVAAVDKLNEEGNTD----KVLFVAPSQVTTLRLDPNFIDRNKYNADVMMNGEIGMI 197 Query: 219 AGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCK 278 AG + ++ + + + P + + + + Sbjct: 198 AGCRVVASRRIDDSKATIDNFIVCLTPEVEDGTPALPAVTIYTKAEANLETERHAKALST 257 Query: 279 SAVVFT 284 VV Sbjct: 258 DIVVSA 263 >gi|281416261|ref|YP_003347610.1| coat protein [Enterococcus phage phiFL3A] gi|270209526|gb|ACZ64067.1| coat protein [Enterococcus phage phiFL3A] gi|270209593|gb|ACZ64133.1| coat protein [Enterococcus phage phiFL3B] Length = 345 Score = 43.7 bits (101), Expect = 0.040, Method: Composition-based stats. 
Identities = 30/198 (15%), Positives = 57/198 (28%), Gaps = 16/198 (8%) Query: 88 IDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILS 147 ++ + SG +P+ L A R+Q + L KG GA + ++ ++ Sbjct: 109 VNDLSKALSGDDPMRAIGDLVAAYWARRQ-----QATLLSVLKGVFGAASTKMNENSLDI 163 Query: 148 AVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINT 207 + E + + A + + + + S V+A+L N Sbjct: 164 SAETGNDSAFTGETFLDASYKLGDAEEKLTA----IAVHSSVYANLRKQNLIEFLLDSN- 218 Query: 208 AALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDT 267 KI + G I + +P + F G PT Sbjct: 219 ----NTKIPTYMGKRVIVDDGMPVSGDVFTSYIFGQGAIGLGNGAAPVPTETDRDALAGD 274 Query: 268 --KIKYVLPIYCKSAVVF 283 + + V F Sbjct: 275 DILVNRQHFLLHPRGVKF 292 >gi|307290087|ref|ZP_07570011.1| coat protein [Enterococcus faecalis TX0411] gi|306498929|gb|EFM68423.1| coat protein [Enterococcus faecalis TX0411] Length = 345 Score = 43.7 bits (101), Expect = 0.041, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 57/198 (28%), Gaps = 16/198 (8%) Query: 88 IDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILS 147 ++ + SG +P+ L A R+Q + L KG GA + ++ ++ Sbjct: 109 VNDLSKALSGDDPMRAIGDLVAAYWARRQ-----QATLLSVLKGVFGAASTKMNENSLDI 163 Query: 148 AVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINT 207 + E + + A + + + + S V+A+L N Sbjct: 164 SAETGNDSAFTGETFLDASYKLGDAEEKLTA----IAVHSSVYANLRKQNLIEFLLDSN- 218 Query: 208 AALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDT 267 KI + G I + +P + F G PT Sbjct: 219 ----NTKIPTYMGKRVIVDDGMPVSGDVFTSYIFGQGAIGLGNGAAPVPTETDRDALAGD 274 Query: 268 --KIKYVLPIYCKSAVVF 283 + + V F Sbjct: 275 DILVNRQHFLLHPRGVKF 292 >gi|118466856|ref|YP_880082.1| hypothetical protein MAV_0809 [Mycobacterium avium 104] gi|118168143|gb|ABK69040.1| conserved hypothetical protein [Mycobacterium avium 104] Length = 345 Score = 43.4 bits (100), Expect = 0.057, Method: Composition-based stats. 
Identities = 27/288 (9%), Positives = 66/288 (22%), Gaps = 41/288 (14%) Query: 63 PDTIYNATDQDR-----------RWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAA 111 +T D + ++G I + + + + Sbjct: 72 ANTPQAVQPGDEYPLTPVPTGPAQMANVVKWGLDTPITDESIARQNFDVVARAFIKIVNS 131 Query: 112 MHRKQDEAILKGMLGVNKKGKIGAETE--FFSKENILSAVEGDDFFKTFIGQLITAKSIF 169 M + D ++ M+ + + S + + ++ A+ + Sbjct: 132 MVAQIDSVVMSAMVAAITQSVNAGASTIGGSSPAGGANWNGSGSNAPKILRDVMFAEELM 191 Query: 170 RKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINT------------AALQAGKIEA 217 R + V++ +A++ T+ ++ G Sbjct: 192 R--SLKQGYRANTVVLDLQTFAAVMGDPNITAALPREDMGAQGVTKNPIFEGIETGLAVR 249 Query: 218 FAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYC 277 G +++ +P F P D Sbjct: 250 MLGKTWLSTPNLP-GGPFEPFAAVLDSTIFGAFVDEELPAPGYVGSQSDGSANDDG---- 304 Query: 278 KSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKILGIE 325 +S + + + + I IEP I+ IE Sbjct: 305 RSMIQVKTMREDKNDRWRIRARRVTTPI---------IIEPKAIVQIE 343 >gi|331693822|gb|AED89638.1| major head protein precursor [Escherichia phage EcoS-CEV2] Length = 441 Score = 43.0 bits (99), Expect = 0.072, Method: Composition-based stats. 
Identities = 24/218 (11%), Positives = 59/218 (27%), Gaps = 12/218 (5%) Query: 25 ETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG---DMPDTIYNATDQDRRWVGHSQ 81 + + + E + ++ +A + T + + + S Sbjct: 187 QKELVVGALFEELPMSSKILTMLVEPDAGKATWVAASTYGTDATTGDEVKGALKEIHFST 246 Query: 82 FGWAE--RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEF 139 + A I D+ + L A +E M G G T Sbjct: 247 YKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE---AFMTGDGSGKPKGLLTLA 303 Query: 140 FSKENILSAVEGDDFFKTFIGQLIT-AKSIFRKRYIDVDSEQVYVLIPSDVWASLFALER 198 + D + I+ + + + + ++ +++ D + L E Sbjct: 304 SEDSAKVITEAKADGSVLVTAKTISKLRRKLGRHGLKLS--KLVLIVSMDAYYDLLEDEE 361 Query: 199 ATSKDYINTAALQ-AGKIEAFAGVWFINMEKVPGNDLF 235 + A++ G++ G+ + E P Sbjct: 362 WQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKATS 399 >gi|270291637|ref|ZP_06197857.1| hypothetical protein HMPREF9024_01817 [Pediococcus acidilactici 7_4] gi|270279956|gb|EFA25794.1| hypothetical protein HMPREF9024_01817 [Pediococcus acidilactici 7_4] Length = 272 Score = 42.6 bits (98), Expect = 0.088, Method: Composition-based stats. Identities = 25/207 (12%), Positives = 50/207 (24%), Gaps = 25/207 (12%) Query: 28 SKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAER 87 S L T + + F + +G+ + + + + Sbjct: 33 SPLANVDTTLQGQPGTTLKFPKFTYIGDAQDIGEGEAIPLDKLGTETQEATIKKAAKGTS 92 Query: 88 IDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILS 147 I A L +PL ++ K D+ +L+ Sbjct: 93 ITDEAVLSGYGDPLGESTRQLGLSLANKVDDDVLEAA----------------------K 130 Query: 148 AVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINT 207 F + + A IF V + + V A + + Sbjct: 131 TATQTITFDPTVDGIQAALDIFDDEDDKVVVAIMSPKDAAKVRKDAMAQKLGSEVGANQ- 189 Query: 208 AALQAGKIEAFAGVWFINMEKVPGNDL 234 L G GV + +K+ + Sbjct: 190 --LINGTYLDVLGVQIVRSKKLKEGEA 214 >gi|58039667|ref|YP_191631.1| hypothetical protein GOX1214 [Gluconobacter oxydans 621H] gi|58002081|gb|AAW60975.1| Putative phage protein [Gluconobacter oxydans 621H] Length = 473 Score = 42.6 bits (98), Expect = 0.10, Method: Composition-based stats. 
Identities = 30/240 (12%), Positives = 71/240 (29%), Gaps = 5/240 (2%) Query: 30 LRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERID 89 L AT G A + V ++ + T +AT+ VG + Sbjct: 162 LGGNTRVTATLGSAGDTIAVDDIRGFQSVIVNGQVTPISATNGMTVTVGGDVYTLVSVTA 221 Query: 90 PFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAV 149 + + + T + + + + + + + +A Sbjct: 222 DATNVSTAP---GGVSGQMTFSASVSVADGTEGQAVVASTAPLVIRPNGRLTTAALQTAS 278 Query: 150 EGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATS--KDYINT 207 I Q++ + R+ + + + + + SLF + + Sbjct: 279 SSGLADTLGIQQVLAGVATLRRNNVPMINGAYHCYLDDLQLLSLFRDPDFKHLYRGAYGS 338 Query: 208 AALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDT 267 ++G++ GV FI + P +G+ L+ G+ G + + D+ Sbjct: 339 EEYRSGQVIELLGVRFIPTTEAPQQVSLGSGSIHRALLLGQGALIEGDCALTGHSDIPDS 398 >gi|241895594|ref|ZP_04782890.1| major capsid protein [Weissella paramesenteroides ATCC 33313] gi|241871172|gb|EER74923.1| major capsid protein [Weissella paramesenteroides ATCC 33313] Length = 275 Score = 42.2 bits (97), Expect = 0.11, Method: Composition-based stats. 
Identities = 41/260 (15%), Positives = 65/260 (25%), Gaps = 26/260 (10%) Query: 30 LRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERID 89 L + F + V + + + I Sbjct: 34 LASIDNTLQGTAGNTLTFPAFTYIGDAQDVAEGAPIPLDKLGTSTTSATVKKAAKGTEIT 93 Query: 90 PFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAV 149 A L +P+ ++ K D IL L Sbjct: 94 DEAVLSGYGDPVGESTKQLGLSIANKVDNDILAAAL----------------------TA 131 Query: 150 EGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAA 209 F + +A ++F D DS V V+ P+D A A + ++ A Sbjct: 132 TQSVDFAATSDMVQSALTVFATNSDDDDSPVVAVMSPADAAALRKAARNEGTGSEVSANA 191 Query: 210 LQAGKIEAFAGVWFINMEKVPGNDLF----PAGTKFPGLIDGKVEYPNGKPTVKSSAKFE 265 L G GV I KV A + LI K + + Sbjct: 192 LVNGTKFEVLGVQIIESNKVTAGQAIFIKVNATSPAIKLIMKKSASVETDRNIITKTTVL 251 Query: 266 DTKIKYVLPIYCKSAVVFTQ 285 YV +Y + VV + Sbjct: 252 TADEHYVAYLYDPTKVVVAK 271 >gi|326633030|ref|YP_004306619.1| major head protein precursor [Enterobacteria phage SPC35] gi|321272224|gb|ADW80116.1| major head protein precursor [Enterobacteria phage SPC35] Length = 458 Score = 41.8 bits (96), Expect = 0.14, Method: Composition-based stats. 
Identities = 20/178 (11%), Positives = 50/178 (28%), Gaps = 9/178 (5%) Query: 61 DMPDTIYNATDQDRRWVGHSQFGWAE--RIDPFATLDSGINPLLPYASLATAAMHRKQDE 118 +T+ + + + A I D+ + L A +E Sbjct: 226 GTDETVGSEVKGTLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE 285 Query: 119 AILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLIT-AKSIFRKRYIDVD 177 M G G T ++ D + I+ + + + + Sbjct: 286 ---AFMTGNGTGQPKGLLTLASEDSAKVTTEAKADGSVLVTAKTISKLRRKLGRHGLKLS 342 Query: 178 SEQVYVLIPSDVWASLFALERATSKDYINTAALQ-AGKIEAFAGVWFINMEKVPGNDL 234 ++ +++ D + L E + +++ G++ G+ + E P Sbjct: 343 --KLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKAA 398 >gi|124005679|ref|ZP_01690518.1| hypothetical protein M23134_03905 [Microscilla marina ATCC 23134] gi|123988747|gb|EAY28353.1| hypothetical protein M23134_03905 [Microscilla marina ATCC 23134] Length = 295 Score = 41.8 bits (96), Expect = 0.15, Method: Composition-based stats. Identities = 21/217 (9%), Positives = 52/217 (23%), Gaps = 11/217 (5%) Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 P N D D + I+ + + T +M + + Sbjct: 58 PIATVNDVDSDVAISLDKFDTENTSVSDDTLYAISIDKMGETTTKHTESMREATGDKAIH 117 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + + +G + G + K + + ++ Sbjct: 118 AFAPASNTARTPVIVTSGEA----VSEDGITRNRMVPGDVAHLKRRWDDNNVPKQGREL- 172 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 +L P+ + + E + ++ G+ G +P F Sbjct: 173 ILSPAHIQDLITTHESFRDQYAN----IREGQPLRLYGFMIGEYTSLPYYAATGIKKAFG 228 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKS 279 L + + V S + + +S Sbjct: 229 SLYNPGTDRIASVGFVNSEMFKARDREVK--MYWQRS 263 >gi|291334620|gb|ADD94269.1| hypothetical protein BH3528 [uncultured phage MedDCM-OCT-S04-C231] Length = 359 Score = 41.8 bits (96), Expect = 0.17, Method: Composition-based stats. 
Identities = 27/202 (13%), Positives = 65/202 (32%), Gaps = 19/202 (9%) Query: 53 TEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAM 112 A + +A +D A L +G +P+L + A+ Sbjct: 75 GTATWGTSGAGYLTPQKIGTGTQIATICHRAFAYAVDDLAVLAAGEDPMLHIRNQLADAI 134 Query: 113 HRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKR 172 ++K+ + + G+ G + + + G + + A+S+ +R Sbjct: 135 NKKKSARLFSHLAGLFGTALSGNA----LDKGVAATHGGAEANFLTAATVAEARSLLGER 190 Query: 173 YIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAA-----------LQAGKIEAFAGV 221 ++D+ +++ V L+ + T + + A ++ FAG+ Sbjct: 191 GEELDT----LIVHPSVAYYLYQVGMLTFSTSALATSGAVTWGGGGVGVGAREVGEFAGM 246 Query: 222 WFINMEKVPGNDLFPAGTKFPG 243 I +V +G + Sbjct: 247 NVIVDSQVNTVAPGTSGHQKEF 268 >gi|59897280|gb|AAX12075.1| hypothetical protein-like protein [Enterobacteria phage T5] Length = 423 Score = 41.8 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 23/218 (10%), Positives = 58/218 (26%), Gaps = 12/218 (5%) Query: 25 ETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG---DMPDTIYNATDQDRRWVGHSQ 81 + + + E + ++ +A + T + + S Sbjct: 152 QKELVVGALFEELPMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHFST 211 Query: 82 FGWAE--RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEF 139 + A I D+ + L A +E M G G T Sbjct: 212 YKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE---AFMTGDGSGKPKGLLTLA 268 Query: 140 FSKENILSAVEGDDFFKTFIGQLIT-AKSIFRKRYIDVDSEQVYVLIPSDVWASLFALER 198 + D + I+ + + + + ++ +++ D + L E Sbjct: 269 SEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLS--KLVLIVSMDAYYDLLEDEE 326 Query: 199 ATSKDYINTAALQ-AGKIEAFAGVWFINMEKVPGNDLF 235 + +++ G++ G+ + E P Sbjct: 327 WQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANS 364 >gi|238018854|ref|ZP_04599280.1| hypothetical protein VEIDISOL_00714 [Veillonella dispar ATCC 17748] gi|237864620|gb|EEP65910.1| hypothetical protein VEIDISOL_00714 [Veillonella dispar ATCC 17748] Length = 294 Score = 41.4 bits (95), Expect = 0.19, Method: Composition-based stats. 
Identities = 21/206 (10%), Positives = 49/206 (23%), Gaps = 25/206 (12%) Query: 53 TEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAM 112 +A +I + T + I A L +P+ ++ Sbjct: 69 GDAEDIAEGVEVTA-TQMSTSVAKAKIKKAMKRVDITDEAKLSGYGDPVGEATHQLRLSL 127 Query: 113 HRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKR 172 K D+ ++ + G + + D ++ A + Sbjct: 128 ASKIDQDVVTALGG--------------------ATLAVTDTKVISYEGIVNAVDKLNE- 166 Query: 173 YIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGN 232 D + Y+ + +L K + G+I AG + ++ Sbjct: 167 ---EDYVEKYLFVAPSQITALRKDPNFIDKTKYGNDVMMTGEIGMIAGCRVVTSRRINDT 223 Query: 233 DLFPAGTKFPGLIDGKVEYPNGKPTV 258 + + P Sbjct: 224 GATIDNFIVGVSAEVEDGTPVLPAVT 249 >gi|182682959|ref|YP_001837083.1| major head protein precursor [Enterobacteria phage EPS7] gi|182630671|gb|ACB97603.1| major head protein precursor [Enterobacteria phage EPS7] Length = 458 Score = 41.4 bits (95), Expect = 0.19, Method: Composition-based stats. Identities = 20/179 (11%), Positives = 48/179 (26%), Gaps = 9/179 (5%) Query: 61 DMPDTIYNATDQDRRWVGHSQFGWAE--RIDPFATLDSGINPLLPYASLATAAMHRKQDE 118 +T+ + + + A I D+ + L A +E Sbjct: 226 GTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE 285 Query: 119 AILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLIT-AKSIFRKRYIDVD 177 M G G + D + I+ + + + + Sbjct: 286 ---AFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLS 342 Query: 178 SEQVYVLIPSDVWASLFALERATSKDYINTAALQ-AGKIEAFAGVWFINMEKVPGNDLF 235 ++ +++ D + L E + A++ G++ G+ + E P Sbjct: 343 --KLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAAS 399 >gi|46401878|ref|YP_006977.1| major head protein precursor [Enterobacteria phage T5] gi|45775056|gb|AAS77188.1| major head protein precursor [Enterobacteria phage T5] gi|51512085|gb|AAU05284.1| major head protein pb8 [Enterobacteria phage T5] Length = 458 Score = 41.4 bits (95), Expect = 0.20, Method: Composition-based stats. 
Identities = 23/218 (10%), Positives = 58/218 (26%), Gaps = 12/218 (5%) Query: 25 ETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVG---DMPDTIYNATDQDRRWVGHSQ 81 + + + E + ++ +A + T + + S Sbjct: 187 QKELVVGALFEELPMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHFST 246 Query: 82 FGWAE--RIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEF 139 + A I D+ + L A +E M G G T Sbjct: 247 YKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE---AFMTGDGSGKPKGLLTLA 303 Query: 140 FSKENILSAVEGDDFFKTFIGQLIT-AKSIFRKRYIDVDSEQVYVLIPSDVWASLFALER 198 + D + I+ + + + + ++ +++ D + L E Sbjct: 304 SEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLS--KLVLIVSMDAYYDLLEDEE 361 Query: 199 ATSKDYINTAALQ-AGKIEAFAGVWFINMEKVPGNDLF 235 + +++ G++ G+ + E P Sbjct: 362 WQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANS 399 >gi|124004283|ref|ZP_01689129.1| hypothetical protein M23134_05725 [Microscilla marina ATCC 23134] gi|123990353|gb|EAY29852.1| hypothetical protein M23134_05725 [Microscilla marina ATCC 23134] Length = 295 Score = 41.1 bits (94), Expect = 0.24, Method: Composition-based stats. Identities = 22/217 (10%), Positives = 52/217 (23%), Gaps = 11/217 (5%) Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 P N D D + I+ + + T +M + + Sbjct: 58 PIATVNDVDSDVAISLDKFDTENTSVSDDTLYAISIDKMGETTTKHTESMREATGDKAIH 117 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + + +G + G + K + + ++ Sbjct: 118 AFAPASNTARTPVIATSGEA----VSEDGITRNRMVPGDVAHLKRRWDDANVPKQGREL- 172 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 +L P+ V + E + ++ G+ G +P F Sbjct: 173 ILSPAHVQDLITTHESFRDQYAN----IREGQPLRLYGFMIGEYTSLPYYAASGIKKAFG 228 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKS 279 L + + V S + + +S Sbjct: 229 SLYNPGTDRIASVGFVNSEMFKARDREVK--MYWQRS 263 >gi|156564106|ref|YP_001429616.1| major head protein [Bacillus phage 0305phi8-36] gi|154622803|gb|ABS83683.1| major head protein [Bacillus phage 0305phi8-36] Length = 393 Score = 41.1 bits (94), Expect = 0.27, Method: Composition-based stats. 
Identities = 19/177 (10%), Positives = 45/177 (25%), Gaps = 7/177 (3%) Query: 27 KSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATD---QDRRWVGHSQFG 83 +K+ + ++ S + A+++ ++ D + + + G Sbjct: 101 GTKMLQKIRLKS---GQSMIFPSIGIMRAYDVAEGQEI-PEDSIDWQTHESPEIRVGKSG 156 Query: 84 WAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKE 143 R DS + + A AM R +++ T + Sbjct: 157 IRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHT 216 Query: 144 NILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERAT 200 L + + L ++ Y D + L +A Sbjct: 217 TGLDKNGVQNDTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQAN 273 >gi|124009869|ref|ZP_01694536.1| hypothetical protein M23134_06458 [Microscilla marina ATCC 23134] gi|123984105|gb|EAY24471.1| hypothetical protein M23134_06458 [Microscilla marina ATCC 23134] Length = 295 Score = 41.1 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 21/217 (9%), Positives = 52/217 (23%), Gaps = 11/217 (5%) Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 P N D D + I+ + + T +M + + Sbjct: 58 PIATVNDVDSDVAISLDKFDTENTSVSDDTLYAISIDKMGETTTKHTESMREATGDKAIH 117 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + + +G + G + K + + ++ Sbjct: 118 AFAPASNTARTPVIVTSGEA----VSEDGITRNRMVPGDVAHLKRRWDDNNVPKQGREL- 172 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 +L P+ + + E + ++ G+ G +P F Sbjct: 173 ILSPAHIQDLITTHESFRDQYAN----IREGQPLRLYGFMIGEYTSLPYYADSGIKKAFG 228 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKS 279 L + + V S + + +S Sbjct: 229 SLYNPGTDRIASVGFVNSEMFKARDREVK--MYWQRS 263 >gi|124009661|ref|ZP_01694333.1| hypothetical protein M23134_03006 [Microscilla marina ATCC 23134] gi|123984711|gb|EAY24696.1| hypothetical protein M23134_03006 [Microscilla marina ATCC 23134] Length = 295 Score = 40.7 bits (93), Expect = 0.31, Method: Composition-based stats. 
Identities = 22/217 (10%), Positives = 52/217 (23%), Gaps = 11/217 (5%) Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 P N D D + I+ + + T +M + + Sbjct: 58 PIATVNDVDSDVAISLDKFDTENTSVSDDTLYAISIDKMGETTTKHTESMREATGDKAIH 117 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + + +G + G + K + + ++ Sbjct: 118 AFAPASNTARTPVIVTSGEA----VSEDGITRNRMVPGDVAHLKRRWDDANVPKQGREL- 172 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 +L P+ V + E + ++ G+ G +P F Sbjct: 173 ILSPAHVQDLITTHESFRDQYAN----IREGQPLRLYGFMIGEYTSLPYYATSGIKKAFG 228 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKS 279 L + + V S + + +S Sbjct: 229 SLYNPGTDRIASVGFVNSEMFKARDREVK--MYWQRS 263 >gi|297180879|gb|ADI17083.1| hypothetical protein [uncultured gamma proteobacterium HF0070_03O15] Length = 305 Score = 40.3 bits (92), Expect = 0.46, Method: Composition-based stats. Identities = 25/202 (12%), Positives = 59/202 (29%), Gaps = 16/202 (7%) Query: 53 TEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAM 112 A + + G+A +D A L +G +P+ + A+ Sbjct: 75 GTATWGTSNSGYLTPQKIGTGTQIATICHRGFAYAVDDVAVLAAGEDPMGHIRNQIADAI 134 Query: 113 HRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKR 172 ++ + + G+ G + +A D+ + +S+ R Sbjct: 135 NKLNSARLFSLLDGL-FGSTFGPLGANALDLSKGAASGADETNFLTASTVARGRSLLGSR 193 Query: 173 YIDVDSEQVYVLIPSDVWASLFALERATSKDYINT-----------AALQAGKIEAFAGV 221 ++D+ +++ V L+ + T + + I FAG+ Sbjct: 194 GDELDT----LVVHPSVAYYLYQVGMLTFSTSALSTGTGIQWGGGGVGVTETSIGQFAGM 249 Query: 222 WFINMEKVPGNDLFPAGTKFPG 243 + +V G + Sbjct: 250 TVVIDSQVNTVQPGTTGHQKEF 271 >gi|291335934|gb|ADD95527.1| prophage LambdaCh01 coat protein [uncultured phage MedDCM-OCT-S09-C14] Length = 360 Score = 39.9 bits (91), Expect = 0.56, Method: Composition-based stats. 
Identities = 28/202 (13%), Positives = 62/202 (30%), Gaps = 19/202 (9%) Query: 53 TEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAM 112 A + G+A +D A L +G +P+L + A+ Sbjct: 75 GTATWGTSGAGYLTPQKVGTGTQIASIVHRGFAYAVDDVAVLAAGEDPMLHIRNQLADAI 134 Query: 113 HRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKR 172 ++ + + + G+ G ++ + + + + A+S +R Sbjct: 135 NKLNTARLFEQLTGLFHTALNG----HRLEKQLGGSGSTGEANYLTAATVAEARSKLGER 190 Query: 173 YIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAG-----------KIEAFAGV 221 ++D +++ V L+ + T A+ ++ FAG Sbjct: 191 GEEMD----LLIVHPSVAYYLYQVGLLTFSTSALAASGAVTWGGGGVGIGAREVGEFAGC 246 Query: 222 WFINMEKVPGNDLFPAGTKFPG 243 I +V ND G + Sbjct: 247 RVIVDSQVNINDPTTTGNRQEF 268 >gi|124002915|ref|ZP_01687766.1| hypothetical protein M23134_07380 [Microscilla marina ATCC 23134] gi|123991565|gb|EAY30973.1| hypothetical protein M23134_07380 [Microscilla marina ATCC 23134] Length = 295 Score = 39.9 bits (91), Expect = 0.59, Method: Composition-based stats. Identities = 21/217 (9%), Positives = 51/217 (23%), Gaps = 11/217 (5%) Query: 63 PDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILK 122 P N D D + I+ + + T +M + + Sbjct: 58 PIATVNDVDSDVAISLDKFDTENTSVSDDTLYAISIDKMGETTTKHTESMREATGDKAIH 117 Query: 123 GMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVY 182 + + +G + G + K + + ++ Sbjct: 118 AFAPAINTARTPVIVTSGEA----VSEDGITRNRMVPGDVAHLKRRWDDNNVPKQGREL- 172 Query: 183 VLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFP 242 +L P+ + + E + ++ G+ G +P F Sbjct: 173 ILSPAHIQDLITTHESFRDQYAN----IREGQPLRLYGFMIGEYTSLPYYATSGIKKAFG 228 Query: 243 GLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKS 279 L + + V S + + +S Sbjct: 229 SLYNPGTDRIASVGFVNSEMFKARDREVK--MYWQRS 263 >gi|291336968|gb|ADD96494.1| hypothetical protein [uncultured organism MedDCM-OCT-S11-C1587] Length = 46 Score = 39.9 bits (91), Expect = 0.65, Method: Composition-based stats. 
Identities = 10/47 (21%), Positives = 17/47 (36%), Gaps = 1/47 (2%) Query: 283 FTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKILGIEISKD 329 A + + P K + + S GA I+ D I+ I + Sbjct: 1 MAVNMAQKTEINYVPEKT-SFLVNSMFSAGAVAIDADGIVKITTDES 46 >gi|33865637|ref|NP_897196.1| hypothetical protein SYNW1103 [Synechococcus sp. WH 8102] gi|33632807|emb|CAE07618.1| conserved hypothetical protein [Synechococcus sp. WH 8102] Length = 336 Score = 39.9 bits (91), Expect = 0.65, Method: Composition-based stats. Identities = 28/250 (11%), Positives = 57/250 (22%), Gaps = 26/250 (10%) Query: 40 EGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGI- 98 G + + F +A I T T V S + R+ L + Sbjct: 106 TGGENLTIPRFAKADAGWIAEGADYTALTTTSTS---VDASPKLASARLSFSRRLKVLVP 162 Query: 99 NPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTF 158 + A+ + +G + + Sbjct: 163 DVEGSVLQEVGRAVA--------GLIEKGAIQGTGSNSQPLGLLNLPDALSQTFASATPT 214 Query: 159 IGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAF 218 +L + +D+ +V L+ A L S + + L Sbjct: 215 SDELASMLEKLGDADVDLS--KVVFLMHPSTAADLMKTRVDASSGALVLSDL------KI 266 Query: 219 AGVWFINMEKVPGNDLFPAGTKFPGLIDGKVE----YPNGKPTVKSSAKFEDTKIKYVLP 274 G+ V + + + L+ P + +V Sbjct: 267 HGLPVFITSNVTEDKVIALDPSYSRLVYFGSAQVVVDPFRGAVSGVTHTQILNAADFVC- 325 Query: 275 IYCKSAVVFT 284 +S+VV Sbjct: 326 -SHQSSVVVG 334 >gi|330992645|ref|ZP_08316590.1| hypothetical protein SXCC_02549 [Gluconacetobacter sp. SXCC-1] gi|329760299|gb|EGG76798.1| hypothetical protein SXCC_02549 [Gluconacetobacter sp. SXCC-1] Length = 474 Score = 39.1 bits (89), Expect = 0.85, Method: Composition-based stats. 
Identities = 23/171 (13%), Positives = 41/171 (23%), Gaps = 12/171 (7%) Query: 104 YASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENI---------LSAVEGDDF 154 + A D +L N G + Sbjct: 229 GTTADAANTSTAPDGVSGTLVLSGNVSVSDGTAGNAVMAATAPLVLRPSGRATTAALVTG 288 Query: 155 FKTFIGQLITAKSIFRKRYID-VDSEQVYVLIPSDVWASLFALERA--TSKDYINTAALQ 211 + ++ A + R + D + + + LF + + Q Sbjct: 289 DLLTVQTILAALATLRDNNVPTPDGGVYHCYLDNAQLLGLFRDADFKLLYRGQYGSDTYQ 348 Query: 212 AGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSA 262 G+I GV FI + P AG +I G+ G + Sbjct: 349 TGQIFDLLGVRFIPTTEAPQQASLGAGAIHRAIICGQGALIEGDYANIGTH 399 >gi|266622409|ref|ZP_06115344.1| coat protein [Clostridium hathewayi DSM 13479] gi|288865862|gb|EFC98160.1| coat protein [Clostridium hathewayi DSM 13479] Length = 334 Score = 39.1 bits (89), Expect = 0.94, Method: Composition-based stats. Identities = 23/245 (9%), Positives = 56/245 (22%), Gaps = 23/245 (9%) Query: 64 DTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKG 123 T R + +G +P+ L R+ + +++ Sbjct: 78 ITSNKDVSTTVRRANMWA------ATDLSAALAGSDPMAAIGDLVAGYWAREYQKILIQV 131 Query: 124 MLGV-----NKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDS 178 + GV +T +S K I A + + + Sbjct: 132 LSGVFGSYQTATEPAETKTPLADHILDISTAGSAAAQKISASAFIDALQLLGDAQGQLTA 191 Query: 179 EQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 ++ + + + + + + G I + P D Sbjct: 192 VAMHSATKAFLKKNNLIDTE---------RDSTDVEFDTYQGRRVIVDDGCPVADGVYTT 242 Query: 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFT---QRKAIDVQHSK 295 F + + I K+ ++ + + +H + Sbjct: 243 YLFGQGAIAFGNGSPVGFVATEVDRDKKKGSGVDYLINRKTFIMHARGIKWTDLAREHVE 302 Query: 296 DPGKW 300 P K Sbjct: 303 TPTKA 307 >gi|167757951|ref|ZP_02430078.1| hypothetical protein CLOSCI_00286 [Clostridium scindens ATCC 35704] gi|167664383|gb|EDS08513.1| hypothetical protein CLOSCI_00286 [Clostridium scindens ATCC 35704] Length = 336 Score = 38.4 bits (87), Expect = 1.5, Method: Composition-based stats. 
Identities = 36/226 (15%), Positives = 66/226 (29%), Gaps = 16/226 (7%) Query: 52 PTEAHEIVGDMPDTIY-NATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATA 110 E+ +V T ++QD + W+ + +G +P+L ASL Sbjct: 61 TGESEPVVEGKDLTPSGIESEQDVAVIIRRAKMWSA--TDLSAALAGSDPMLAIASLVAG 118 Query: 111 AMHRKQDEA---ILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKS 167 R + ILKG+ G + + T + + + K I A+ Sbjct: 119 FRARDMQKELVAILKGIFGSYTASEASSATTPLASNILDISGGSGTSAKWSGSAFIDAEQ 178 Query: 168 IFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINME 227 + + V++ S A+L + N + + G I + Sbjct: 179 LLGDNKTALTG----VVMHSATEAALKKQNLIETVQPSNDVSF-----GLYQGKRVIVDD 229 Query: 228 KVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVL 273 P F + G+ G A E + K Sbjct: 230 GCPVTGSGS-NQVFSTYLFGQGAIALGNGNPVGFAPTETDRDKKKG 274 >gi|94970200|ref|YP_592248.1| peptidase S49 [Candidatus Koribacter versatilis Ellin345] gi|94552250|gb|ABF42174.1| peptidase S49 [Candidatus Koribacter versatilis Ellin345] Length = 781 Score = 38.0 bits (86), Expect = 2.3, Method: Composition-based stats. Identities = 22/191 (11%), Positives = 45/191 (23%), Gaps = 13/191 (6%) Query: 60 GDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEA 119 T + T + Q + S I+ D Sbjct: 551 PGSAVTATDQTTGS-VTLSPKQAMAQTAYSRQFIIQSSIDAEQFVREDLANIFALGVD-- 607 Query: 120 ILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSE 179 L ++G + A +G + A+ + I + + Sbjct: 608 -LAALVGSGTSNQPKGIVNQSGVGTEAIATDGG---AITYSIITKAQEDLEESSIPLIAP 663 Query: 180 QVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 + V L ++ + AG ++ ++P N +GT Sbjct: 664 --GIATTPGVKKKLRNTAELSNTISLPIWHSDD----TVAGYPAMSSNQLPSNTSKGSGT 717 Query: 240 KFPGLIDGKVE 250 +I G Sbjct: 718 NLHTMIVGDWA 728 >gi|208386|gb|AAA72920.1| E.coli gene 10/human coagulation factor IX fusion protein [synthetic construct] Length = 144 Score = 36.8 bits (83), Expect = 4.3, Method: Composition-based stats. 
Identities = 8/47 (17%), Positives = 21/47 (44%), Gaps = 2/47 (4%) Query: 278 KSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPDKILGI 324 +SAV + + + ++ ++ + A QI + A ++ + I Sbjct: 53 RSAVGTVKLRDLALERARRAN-FQADQIIAKYAM-AVFLDHENANKI 97 >gi|262040822|ref|ZP_06014050.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259041844|gb|EEW42887.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 378 Score = 36.8 bits (83), Expect = 4.8, Method: Composition-based stats. Identities = 21/174 (12%), Positives = 41/174 (23%), Gaps = 20/174 (11%) Query: 106 SLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITA 165 A A+ + D + ++ F ++ A A Sbjct: 112 KQAFRALANEMDADLAALYFASSRAVGTAGTAPFGIAGDLSDAA--------------NA 157 Query: 166 KSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFIN 225 + + Q+ + + + A L+ G + G Sbjct: 158 RQVLSDNGSPTTDLQMVLGSSAIANLRGKQSVLFKVNESGTDALLREGIVGRLEGFNIHE 217 Query: 226 MEKV------PGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVL 273 V P G K G I ++ G F+ KY++ Sbjct: 218 SAHVKKRAASPAAGYLVNGAKAEGDILISIDTGTGAFAAGDIVTFDGDSNKYLV 271 >gi|300854854|ref|YP_003779838.1| phage-like protein [Clostridium ljungdahlii DSM 13528] gi|300434969|gb|ADK14736.1| phage-related protein [Clostridium ljungdahlii DSM 13528] Length = 276 Score = 36.8 bits (83), Expect = 4.9, Method: Composition-based stats. 
Identities = 15/203 (7%), Positives = 43/203 (21%), Gaps = 30/203 (14%) Query: 41 GEASALVEV---FKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATLDSG 97 + + D + R + ++ Sbjct: 39 QGDTIHFPKWKIIGDATEVVKGTQSAIETLDQDDSTAKI---KFIDKIVRCYDYDSVTEI 95 Query: 98 INPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKT 157 N L +S R D + + T + + Sbjct: 96 GNQLEEASSQQAVVFARALDTD----LCTEASTTDLKTATASATAITAAELDTALANYG- 150 Query: 158 FIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKD----YINTAALQAG 213 +D + +++ S + +S ++++ + ++ G Sbjct: 151 ------------DDADVDDMAG---IVVNSRIDSSFYSMDEFVDVNKTFTQTGNGIVRNG 195 Query: 214 KIEAFAGVWFINMEKVPGNDLFP 236 I F G+ + + Sbjct: 196 MIGYFRGIPVFHSNHGTFDSTTN 218 >gi|224060554|ref|XP_002189652.1| PREDICTED: hypothetical protein [Taeniopygia guttata] Length = 2821 Score = 36.8 bits (83), Expect = 5.2, Method: Composition-based stats. Identities = 17/173 (9%), Positives = 37/173 (21%), Gaps = 19/173 (10%) Query: 21 LALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHS 80 + Q+ L V + + V A V I TD R + Sbjct: 712 ILKQQKSHVLVNNVRQ---------TLPVSAAGGAIT-VSQSGRYIVLETDFSLRVSYDT 761 Query: 81 QFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFF 140 + + +R++DE ++ +G + Sbjct: 762 DHSV--------EVKVPTTYFNLTCGMCGNFNNRREDEYMMPNGQQAADSNALGESWQVP 813 Query: 141 SKENILSAVEGDDFFKTFIGQLITAKSIFRKRYI-DVDSEQVYVLIPSDVWAS 192 + +L + E + +I + Sbjct: 814 DSDPSCGVPGPSTPCSAEEEKLYRSDQFCGMLTTRPSSFESCHSVINPQDYFD 866 >gi|50955858|ref|YP_063146.1| phage-related major capsid protein [Leifsonia xyli subsp. xyli str. CTCB07] gi|50952340|gb|AAT90041.1| phage-related major capsid protein [Leifsonia xyli subsp. xyli str. CTCB07] Length = 435 Score = 36.4 bits (82), Expect = 5.5, Method: Composition-based stats. 
Identities = 25/238 (10%), Positives = 48/238 (20%), Gaps = 12/238 (5%) Query: 13 YEFKKHVELALQETKSKLRPTVTEQATEGE-ASALVEVFKPTEAHEI-VGDMPDTIYNAT 70 E+ +A V +S + A + Sbjct: 134 PEWLIEDFVAFARPGRVYADGVQHDELPSGVSSINLPTVNTGAAVAVQATQNTAVASTDL 193 Query: 71 DQDRRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKK 130 G + + + SGI A D L G + + Sbjct: 194 TTSSVSSGITTIAGQQVVSLQLLQQSGIPFDRVVLGDLARAYASGLDVQTLT-GSGASGQ 252 Query: 131 GKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVL------ 184 + + A F Q+I A + +E Sbjct: 253 LQGVIGLPGVNVITYTQASPAFAGAGQFYSQIIQAINAVNTNRFLPATEIYMHPRRWAWV 312 Query: 185 ---IPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 + + + A ++ + G AG+ +P N Sbjct: 313 LNALDAQNRPLVVPDGPAFNQPAVQGGVTAQGYAGTLAGLPVKVDPNIPTNLGSGTNQ 370 >gi|254521800|ref|ZP_05133855.1| TonB-dependent receptor, Fe transport [Stenotrophomonas sp. SKA14] gi|219719391|gb|EED37916.1| TonB-dependent receptor, Fe transport [Stenotrophomonas sp. SKA14] Length = 972 Score = 36.4 bits (82), Expect = 6.3, Method: Composition-based stats. 
Identities = 39/282 (13%), Positives = 75/282 (26%), Gaps = 24/282 (8%) Query: 30 LRPTVTEQATEGE--ASALVEVF--KPTEAHEIVGDMPDTIYNATDQDR---RWVGHSQF 82 L VT +A G A+A ++ A V T+ AT V + Sbjct: 75 LAQGVTSRAVSGSLSANAALQQLLQGSGLAVRRVSADAVTLEAATSAQAGDGVIVTDTLS 134 Query: 83 GWAERID---PFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEF 139 +R+D + ++ ++R E G KG G Sbjct: 135 VAGDRVDAGATSDEARLLDSYRSVGSTTT---LNRTHLERFRGTSNGDIVKGVAGVTAGD 191 Query: 140 FSKENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERA 199 N + + +I +D+ + Y + + Sbjct: 192 PRVGNGFDVNIRGIQGQGRVPVII------DGGQSSIDTYRGYAGQSQRTYLDPDLISSL 245 Query: 200 TSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVK 259 T + A +G I + ME + D+ G F + G + + Sbjct: 246 TITKGPSLQANASGGIGG-----VVEMETLKIGDVLREGRDFGVRVRGGLANASANNLPA 300 Query: 260 SSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWH 301 SA + + +A R + ++ + Sbjct: 301 YSAVPRTDRSATGNQFFNVAAAGHWDRFDLVAAYAYRDTGNY 342 >gi|291518784|emb|CBK74005.1| methionine adenosyltransferase [Butyrivibrio fibrisolvens 16/4] Length = 399 Score = 36.0 bits (81), Expect = 7.3, Method: Composition-based stats. Identities = 21/95 (22%), Positives = 30/95 (31%), Gaps = 1/95 (1%) Query: 230 PGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAI 289 P D G K G G D Y K+ V K + Sbjct: 246 PQGDAGLTGRKIIVDTYGGTGRHGGGAFSGKDPTKVDRSAAYAARWVAKNLVAAGVAKRL 305 Query: 290 DVQHSKDPGKWHAPQITLTSSFGATRIEPDKILGI 324 +V+ + G I + SFG I+ +KI+ I Sbjct: 306 EVELAYAIGVAKPVSIAVD-SFGTGVIDDEKIVEI 339 >gi|317473199|ref|ZP_07932496.1| phage coat protein [Anaerostipes sp. 3_2_56FAA] gi|316899294|gb|EFV21311.1| phage coat protein [Anaerostipes sp. 3_2_56FAA] Length = 336 Score = 36.0 bits (81), Expect = 7.4, Method: Composition-based stats. 
Identities = 27/189 (14%), Positives = 50/189 (26%), Gaps = 14/189 (7%) Query: 89 DPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKK----GKIGAETEFFSKEN 144 + +G +P+ ASL R + ++ + G+ G G + Sbjct: 97 TDLSAALAGKDPMEAIASLVAGFWARDMQKELVALLNGIFGTIPAQGDSGTAETRLASNI 156 Query: 145 ILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDY 204 + + + K I A+ + + V + S A L + Sbjct: 157 LDISGSSGNAGKWSGAAFIDAEQKLGDNKTALTA----VCMHSATEAELKKQNLIETVQP 212 Query: 205 INTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKF 264 N A + G I + P A F + G G + Sbjct: 213 SNDVAF-----GLYQGKRVIVDDGCPVKGSG-ASQVFSTYLFGTGAVALGNGSPAGFVPT 266 Query: 265 EDTKIKYVL 273 E + K Sbjct: 267 ETDRAKRKG 275 >gi|311895249|dbj|BAJ27657.1| hypothetical protein KSE_18320 [Kitasatospora setae KM-6054] Length = 290 Score = 36.0 bits (81), Expect = 7.4, Method: Composition-based stats. Identities = 23/244 (9%), Positives = 53/244 (21%), Gaps = 7/244 (2%) Query: 48 EVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERI-DPFATLDSGINPLLPYA- 105 + + G T+ + L +P Sbjct: 27 PLVYRDAEKDFGGRSGTTVSIPVPHAIPAADFDGVNKFSAAGEDLVELKITASPYSAVPI 86 Query: 106 --SLATAAMHRKQDEAILKGMLGVNKKGKIGAET--EFFSKENILSAVEGDDFFKTFIGQ 161 T + + + + GV + + + + Sbjct: 87 TDEENTFTLMNYATQVLAPQVDGVARALEAVVAKPMNALIAAVKDTDTAQVIDPARALDF 146 Query: 162 LITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGV 221 + A + +R I + + V+ P L + T L+ G+I G Sbjct: 147 VADASVMLDQRDIPDEG-RYLVVAPEIKAFFLKDEGLRQADKAGGTDELRRGQIADVHGF 205 Query: 222 WFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAV 281 I ++ G + F + + + + S V Sbjct: 206 KVIASNQIKGGAVAFVREAFALAVRAPRAMEGAAWSQAEVQDGYALTVTRDFDLSSHSDV 265 Query: 282 VFTQ 285 + Sbjct: 266 SLVK 269 >gi|288934977|ref|YP_003439036.1| hypothetical protein Kvar_2105 [Klebsiella variicola At-22] gi|288889686|gb|ADC58004.1| conserved hypothetical protein [Klebsiella variicola At-22] Length = 378 Score = 36.0 bits (81), Expect = 7.7, Method: Composition-based stats. 
Identities = 21/174 (12%), Positives = 41/174 (23%), Gaps = 20/174 (11%) Query: 106 SLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITA 165 A A+ + D + ++ F ++ A A Sbjct: 112 KQAFRALANEMDADLAALYFASSRAVGTAGTAPFGIAGDLSDAA--------------NA 157 Query: 166 KSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFIN 225 + + Q+ + + + A L+ G + G Sbjct: 158 RQVLSDNGSPTTDLQMVLGSSAIANLRGKQSVLFKVNESGTDALLREGIVGRLEGFNIHE 217 Query: 226 MEKV------PGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVL 273 V P G K G I ++ G F+ KY++ Sbjct: 218 SAHVKKRAASPAAGYLVNGAKAEGDILIAIDTGTGAFAAGDIVTFDGDSNKYLV 271 >gi|116511877|ref|YP_809093.1| hypothetical protein LACR_1137 [Lactococcus lactis subsp. cremoris SK11] gi|116107531|gb|ABJ72671.1| hypothetical protein LACR_1137 [Lactococcus lactis subsp. cremoris SK11] Length = 272 Score = 36.0 bits (81), Expect = 7.9, Method: Composition-based stats. Identities = 26/213 (12%), Positives = 54/213 (25%), Gaps = 30/213 (14%) Query: 26 TKSKLRPTVTEQATEGEA---SALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQF 82 + P T + F V + + + + V + Sbjct: 27 KALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTTKSVTIKKA 86 Query: 83 GWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSK 142 I A L +P+ ++ K D+ +LK ++ Sbjct: 87 AKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLKAAKTTSQTV----------- 135 Query: 143 ENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSK 202 K + + A IF D D++ +++ A + A + Sbjct: 136 -----------STKANVDGVQAALDIFN----DEDAQAYVLIVNPKDAAKIRKDANAKNI 180 Query: 203 -DYINTAALQAGKIEAFAGVWFINMEKVPGNDL 234 + AL G G + +K+ Sbjct: 181 GSEVGANALINGTYADVLGAQIVRSKKLAEGSA 213 >gi|13786567|ref|NP_112699.1| MHP [Lactococcus phage TP901-1] gi|13661710|gb|AAK38053.1|AF304433_36 MHP [Lactococcus phage TP901-1] Length = 272 Score = 36.0 bits (81), Expect = 7.9, Method: Composition-based stats. 
Identities = 25/213 (11%), Positives = 53/213 (24%), Gaps = 30/213 (14%) Query: 26 TKSKLRPTVTEQATEGEA---SALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQF 82 + P T + F V + + + + V + Sbjct: 27 KALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTTKSVTIKKA 86 Query: 83 GWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSK 142 I A L +P+ ++ K D+ +L ++ Sbjct: 87 AKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQTV----------- 135 Query: 143 ENILSAVEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSK 202 K + + A IF D D++ +++ A + A + Sbjct: 136 -----------STKANVDGVQAALDIFN----DEDAQAYVLIVNPKDAAKIRKDANAKNI 180 Query: 203 -DYINTAALQAGKIEAFAGVWFINMEKVPGNDL 234 + AL G G + +K+ Sbjct: 181 GSEVGANALINGTYADVLGAQIVRSKKLAEGSA 213 >gi|256847407|ref|ZP_05552853.1| major head protein [Lactobacillus coleohominis 101-4-CHN] gi|256716071|gb|EEU31046.1| major head protein [Lactobacillus coleohominis 101-4-CHN] Length = 298 Score = 36.0 bits (81), Expect = 8.8, Method: Composition-based stats. Identities = 24/207 (11%), Positives = 42/207 (20%), Gaps = 18/207 (8%) Query: 38 ATEGEASALVEVFK---PTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERIDPFATL 94 EG+ + + K A E +++ + + A I A Sbjct: 44 TLEGKPGDTITIPKYEFTGTAREYGEGEQID-FDSLKYTTQQAKIKKIVSAYSISDEAAF 102 Query: 95 DSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDF 154 +P A + A+ D+ IL G + D Sbjct: 103 IPFGDPRTEAARQMSMALATYVDDDILNTA-KTAPLQVTGHTP------------DQVDL 149 Query: 155 FKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGK 214 + A + + L T + L G Sbjct: 150 IDDLEDKFANATNAVEGATYPQQGVLYVSYKDAASLRKLAGDN-WTRASDLGDNILINGA 208 Query: 215 IEAFAGVWFINMEKVPGNDLFPAGTKF 241 G I K+ Sbjct: 209 FGELLGWEIIRTAKLTKGHAIAVKPGA 235 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: 
May 22, 2011 12:36 AM
  Number of letters in database: 999,999,281
  Number of sequences in database: 2,904,016

  Database: /data/usr2/db/fasta/nr.03
    Posted date: May 22, 2011 12:41 AM
  Number of letters in database: 999,999,960
  Number of sequences in database: 2,935,328

  Database: /data/usr2/db/fasta/nr.04
    Posted date: May 22, 2011 12:46 AM
  Number of letters in database: 842,794,627
  Number of sequences in database: 2,394,679

Lambda     K      H
   0.309    0.114    0.253

Lambda     K      H
   0.267   0.0346    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,372,826,786
Number of Sequences: 14124377
Number of extensions: 33934216
Number of successful extensions: 110193
Number of sequences better than 10.0: 327
Number of HSP's better than 10.0 without gapping: 166
Number of HSP's successfully gapped in prelim test: 161
Number of HSP's that attempted gapping in prelim test: 109607
Number of HSP's gapped (non-prelim): 389
length of query: 343
length of database: 4,842,793,630
effective HSP length: 140
effective length of query: 203
effective length of database: 2,865,380,850
effective search space: 581672312550
effective search space used: 581672312550
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.4 bits)
S2: 81 (36.1 bits)
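
The search statistics above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the BLAST output. It assumes that the effective lengths are obtained by subtracting the effective HSP length from the query and from every database sequence, and that raw scores are converted to bits with the usual Karlin-Altschul relation (lambda * S - ln K) / ln 2. It also assumes that the first Lambda/K pair is the ungapped one (used for S1) and the second the gapped one (used for S2), since both are labelled only "Lambda K H" in the report. All variable names are hypothetical.

```python
import math

# Values copied from the statistics block of the report.
query_len = 343
db_letters = 4_842_793_630
db_seqs = 14_124_377
hsp_len = 140  # "effective HSP length"

# Assumed length correction: one HSP length per sequence.
eff_query = query_len - hsp_len
eff_db = db_letters - db_seqs * hsp_len
search_space = eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


# Assumption: first Lambda/K pair is ungapped, second is gapped.
s1_bits = bit_score(41, 0.309, 0.114)
s2_bits = bit_score(81, 0.267, 0.0346)

print(eff_query, eff_db, search_space)
print(round(s1_bits, 1), round(s2_bits, 1))
```

Under these assumptions the computed values agree with the reported effective lengths, the effective search space, and the bit equivalents of S1 and S2. The per-hit Expect values are not reproduced here, because the report states that they were computed with composition-based statistics.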