BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= 537021.9.peg.1079_1
         (251 letters)

Database: nr
           13,984,884 sequences; 4,792,584,752 total letters

Searching..................................................done

Results from round 1

>gi|315122536|ref|YP_004063025.1| DNA packaging protein Gp2 [Candidatus Liberibacter solanacearum CLso-ZC1]
 gi|313495938|gb|ADR52537.1| DNA packaging protein Gp2 [Candidatus Liberibacter solanacearum CLso-ZC1]
          Length = 455

 Score = 473 bits (1217), Expect = e-132, Method: Compositional matrix adjust.
Identities = 219/251 (87%), Positives = 233/251 (92%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 LKAYEQGR+KWQSNTV YVWFDEEPPEDVYFEGLTRINATQGLV LTLTPLKGRS I+EH Sbjct: 147 LKAYEQGREKWQSNTVDYVWFDEEPPEDVYFEGLTRINATQGLVALTLTPLKGRSNIVEH 206 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 YLS+SS DRQVIRMT+ ETPHY +ER RII+SYPLHEREARTKGEP+LGSGRIFPI+E+ Sbjct: 207 YLSSSSPDRQVIRMTLEETPHYTAKERIRIINSYPLHEREARTKGEPVLGSGRIFPILEQ 266 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 DIVI S DIPEHW QIGGMDFGWHHPFAA L WNRDSDVIYVVKNYRCREQTP+FH A Sbjct: 267 DIVITSFDIPEHWSQIGGMDFGWHHPFAAVQLAWNRDSDVIYVVKNYRCREQTPLFHAAV 326 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LKSWGKWLPWAWPHDGLQHDK SGEQL+ QYR+QGMKMLPECATFDDGSNGVEAG+SD+L Sbjct: 327 LKSWGKWLPWAWPHDGLQHDKGSGEQLAVQYRQQGMKMLPECATFDDGSNGVEAGVSDIL 386 Query: 241 DRMRSGRWKVF 251 DRMRSGRWKVF Sbjct: 387 DRMRSGRWKVF 397 >gi|150397042|ref|YP_001327509.1| hypothetical protein Smed_1839 [Sinorhizobium medicae WSM419] gi|150028557|gb|ABR60674.1| protein of unknown function DUF264 [Sinorhizobium medicae WSM419] Length = 477 Score = 384 bits (987), Expect = e-105, Method: Compositional matrix adjust. 
Identities = 176/251 (70%), Positives = 201/251 (80%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 KAYEQGR KWQ+NTV YVWFDEEPPEDVYFEG+TR NAT+G + +T TPLKG S ++ Sbjct: 169 FKAYEQGRAKWQANTVDYVWFDEEPPEDVYFEGITRTNATRGSIAVTFTPLKGLSAVVAR 228 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 YL S+DR+V MTI + HY +ER+RIIDSYP HEREARTKG P LGSGRIFP+ EE Sbjct: 229 YLMEKSADREVTTMTIEDAEHYTPEERRRIIDSYPAHEREARTKGVPALGSGRIFPVTEE 288 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + DIP+HWVQIGG+DFGW HPFAA W+RD+DV YV K YR RE TPI H AA Sbjct: 289 SIRADPFDIPKHWVQIGGLDFGWDHPFAAVGCAWDRDADVFYVTKLYRERESTPIIHAAA 348 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LK WG LPWAWPHDGLQHDK SGEQL+AQYR QG+ +LPE ATFDDG+NGVEAG+SDML Sbjct: 349 LKPWGGTLPWAWPHDGLQHDKGSGEQLAAQYRAQGLALLPERATFDDGTNGVEAGLSDML 408 Query: 241 DRMRSGRWKVF 251 RM++GRWKVF Sbjct: 409 QRMQTGRWKVF 419 >gi|227821702|ref|YP_002825672.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] gi|227340701|gb|ACP24919.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] Length = 416 Score = 383 bits (983), Expect = e-104, Method: Compositional matrix adjust. 
Identities = 174/251 (69%), Positives = 201/251 (80%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 KAYEQGR KWQ+NTV YVWFDEEPPEDVYFEG+TR NAT+G + +T TPLKG S ++ Sbjct: 108 FKAYEQGRGKWQANTVDYVWFDEEPPEDVYFEGITRTNATRGSIAVTFTPLKGMSTVVAR 167 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 Y+ S DR+VI MTI++ HY +ER+RIIDSYP HEREARTKG P LGSGRIFP+ EE Sbjct: 168 YILEKSPDREVITMTIDDAEHYTPEERQRIIDSYPAHEREARTKGVPSLGSGRIFPVAEE 227 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I I +IP+HWVQIGG+DFGW HPF A W+RD+DV YV K YR RE TPI H AA Sbjct: 228 SITIAPFEIPKHWVQIGGLDFGWDHPFGAAGCAWDRDADVFYVTKVYREREATPIIHAAA 287 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LK WG WLPW+WPHDGLQHDK SGEQL+ QYR QG+ MLPE ATF+DG+NGVEAG+SDML Sbjct: 288 LKPWGAWLPWSWPHDGLQHDKGSGEQLATQYRAQGLNMLPERATFEDGTNGVEAGLSDML 347 Query: 241 DRMRSGRWKVF 251 RM++GRWKVF Sbjct: 348 QRMQTGRWKVF 358 >gi|15965769|ref|NP_386122.1| DNA packaging protein GP2 [Sinorhizobium meliloti 1021] gi|15075038|emb|CAC46595.1| DNA packaging protein GP2 [Sinorhizobium meliloti 1021] Length = 477 Score = 381 bits (979), Expect = e-104, Method: Compositional matrix adjust. 
Identities = 174/251 (69%), Positives = 201/251 (80%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 KAYEQGR KWQ+NTV YVWFDEEPPEDVYFEG+TR NAT+G + +T TPL+G S ++ Sbjct: 169 FKAYEQGRAKWQANTVDYVWFDEEPPEDVYFEGITRTNATRGAIAVTFTPLRGLSAVVAR 228 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 YL S DR VI MTI + HY QER+R+IDSYP HEREART+G P LGSGRIFP+ EE Sbjct: 229 YLMEKSPDRAVITMTIEDAEHYTPQERQRVIDSYPAHEREARTRGVPALGSGRIFPVTEE 288 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I I+ +IP+HWVQIGG+DFGW HPFAA W+RD+DV +V K YR RE TPI H AA Sbjct: 289 SIRIDPFEIPKHWVQIGGLDFGWDHPFAAVGCAWDRDADVFHVTKIYREREATPIIHAAA 348 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LK WG +PWAWPHDGLQHDK SGEQL+AQYR QG+ +LPE ATFDDG+NGVEAG+SDML Sbjct: 349 LKPWGAAMPWAWPHDGLQHDKGSGEQLAAQYRAQGLALLPERATFDDGTNGVEAGLSDML 408 Query: 241 DRMRSGRWKVF 251 RM+SGRWKVF Sbjct: 409 QRMQSGRWKVF 419 >gi|307315429|ref|ZP_07594994.1| protein of unknown function DUF264 [Sinorhizobium meliloti BL225C] gi|306898808|gb|EFN29464.1| protein of unknown function DUF264 [Sinorhizobium meliloti BL225C] Length = 477 Score = 381 bits (979), Expect = e-104, Method: Compositional matrix adjust. 
Identities = 174/251 (69%), Positives = 201/251 (80%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 KAYEQGR KWQ+NTV YVWFDEEPPEDVYFEG+TR NAT+G + +T TPL+G S ++ Sbjct: 169 FKAYEQGRAKWQANTVDYVWFDEEPPEDVYFEGITRTNATRGAIAVTFTPLRGLSAVVAR 228 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 YL S DR VI MTI + HY QER+R+IDSYP HEREART+G P LGSGRIFP+ EE Sbjct: 229 YLMEKSPDRAVITMTIEDAEHYTPQERQRVIDSYPAHEREARTRGVPALGSGRIFPVTEE 288 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I I+ +IP+HWVQIGG+DFGW HPFAA W+RD+DV +V K YR RE TPI H AA Sbjct: 289 SIRIDPFEIPKHWVQIGGLDFGWDHPFAAVGCAWDRDADVFHVTKIYREREATPIIHAAA 348 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LK WG +PWAWPHDGLQHDK SGEQL+AQYR QG+ +LPE ATFDDG+NGVEAG+SDML Sbjct: 349 LKPWGAAMPWAWPHDGLQHDKGSGEQLAAQYRAQGLALLPERATFDDGTNGVEAGLSDML 408 Query: 241 DRMRSGRWKVF 251 RM+SGRWKVF Sbjct: 409 QRMQSGRWKVF 419 >gi|307318836|ref|ZP_07598268.1| protein of unknown function DUF264 [Sinorhizobium meliloti AK83] gi|306895557|gb|EFN26311.1| protein of unknown function DUF264 [Sinorhizobium meliloti AK83] Length = 477 Score = 381 bits (979), Expect = e-104, Method: Compositional matrix adjust. 
Identities = 174/251 (69%), Positives = 201/251 (80%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 KAYEQGR KWQ+NTV YVWFDEEPPEDVYFEG+TR NAT+G + +T TPL+G S ++ Sbjct: 169 FKAYEQGRAKWQANTVDYVWFDEEPPEDVYFEGITRTNATRGAIAVTFTPLRGLSAVVAR 228 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 YL S DR VI MTI + HY QER+R+IDSYP HEREART+G P LGSGRIFP+ EE Sbjct: 229 YLMEKSPDRAVITMTIEDAEHYTPQERQRVIDSYPAHEREARTRGVPALGSGRIFPVTEE 288 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I I+ +IP+HWVQIGG+DFGW HPFAA W+RD+DV +V K YR RE TPI H AA Sbjct: 289 SIRIDPFEIPKHWVQIGGLDFGWDHPFAAVGCAWDRDADVFHVTKIYREREATPIIHAAA 348 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LK WG +PWAWPHDGLQHDK SGEQL+AQYR QG+ +LPE ATFDDG+NGVEAG+SDML Sbjct: 349 LKPWGAAMPWAWPHDGLQHDKGSGEQLAAQYRAQGLALLPERATFDDGTNGVEAGLSDML 408 Query: 241 DRMRSGRWKVF 251 RM+SGRWKVF Sbjct: 409 QRMQSGRWKVF 419 >gi|227822449|ref|YP_002826421.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] gi|227341450|gb|ACP25668.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] Length = 454 Score = 379 bits (973), Expect = e-103, Method: Compositional matrix adjust. 
Identities = 174/251 (69%), Positives = 201/251 (80%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 KAYEQGR KWQ+NTV YVWFDEEPPEDVYFEG+TR NAT+G + +T TPLKG S ++ Sbjct: 146 FKAYEQGRGKWQANTVDYVWFDEEPPEDVYFEGITRTNATRGAIAVTFTPLKGLSNVVAR 205 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 YL S DR+VI MTI + HY +ER+RII+SYP HEREARTKG P LGSGRIFP+ EE Sbjct: 206 YLMEKSPDREVITMTIEDAEHYTPEERQRIIESYPAHEREARTKGVPALGSGRIFPVTEE 265 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + DIP+HWVQIGG+DFGW HPFAA W+RD+DV YV + YR RE TPI H AA Sbjct: 266 AIRVEPFDIPKHWVQIGGLDFGWDHPFAAVGCAWDRDADVFYVTRIYREREATPIIHAAA 325 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LK WG WLP+AWPHDGLQHDK SGEQL+AQYR QG+ +L E ATFDDG+NGVEAG+SDML Sbjct: 326 LKPWGAWLPFAWPHDGLQHDKGSGEQLAAQYRAQGLPLLAERATFDDGTNGVEAGLSDML 385 Query: 241 DRMRSGRWKVF 251 RM++GRWKVF Sbjct: 386 QRMQTGRWKVF 396 >gi|71274675|ref|ZP_00650963.1| Protein of unknown function DUF264 [Xylella fastidiosa Dixon] gi|71901596|ref|ZP_00683677.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|170730087|ref|YP_001775520.1| putative DNA packaging protein GP2 [Xylella fastidiosa M12] gi|71164407|gb|EAO14121.1| Protein of unknown function DUF264 [Xylella fastidiosa Dixon] gi|71728644|gb|EAO30794.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|167964880|gb|ACA11890.1| putative DNA packaging protein GP2 [Xylella fastidiosa M12] Length = 472 Score = 342 bits (876), Expect = 4e-92, Method: Compositional matrix adjust. 
Identities = 153/251 (60%), Positives = 197/251 (78%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 LK+++QGR+KWQ++TVH+VWFDEEPPEDVYFEG+TR N T G V +T TPLKG S ++ Sbjct: 164 LKSFDQGREKWQADTVHWVWFDEEPPEDVYFEGITRTNRTFGPVFMTFTPLKGMSNVVRR 223 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L+ ++DR ++M+I + HY+ +E RI SYP HER+ART+G P LGSGR+FPI +E Sbjct: 224 FLTEDAADRGYVQMSIEDAEHYSAEECARITASYPPHERDARTQGVPALGSGRVFPIAQE 283 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 +I + IP W IGGMDFG+ HPFAA L W+RD+D++YVV YR RE TPI H AA Sbjct: 284 EISVAPFAIPAQWALIGGMDFGYDHPFAAVKLAWDRDADILYVVCAYRKRESTPIIHAAA 343 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LK WG LPWAWPHDGLQHDK SG+QL+ QYR+QG+ MLP+ ATF+DG+NG+EAG+++ML Sbjct: 344 LKPWGVTLPWAWPHDGLQHDKGSGDQLAEQYRQQGLAMLPQRATFEDGTNGLEAGVTEML 403 Query: 241 DRMRSGRWKVF 251 DRM +GR KVF Sbjct: 404 DRMHTGRLKVF 414 >gi|71897556|ref|ZP_00679801.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|71732459|gb|EAO34512.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] Length = 471 Score = 341 bits (874), Expect = 7e-92, Method: Compositional matrix adjust. 
Identities = 155/251 (61%), Positives = 196/251 (78%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 LK+++QGR+KWQ++TV +VWFDEEPPEDVYFEG+TR N T G V +T TPLKG S ++ Sbjct: 163 LKSFDQGREKWQADTVDWVWFDEEPPEDVYFEGITRTNRTFGPVFMTFTPLKGMSSVVRR 222 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + DR +++MTI++ HY+ ++R RII SYP HEREARTKG P LGSGR+FPI E+ Sbjct: 223 FLLEQAPDRGLVQMTIDDAEHYSPEDRARIIASYPAHEREARTKGTPSLGSGRVFPIAED 282 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I I IPE W IGGMDFG+ HPFAA + W+R++DVIYV+ YR RE TP+ H AA Sbjct: 283 SIAIAPFSIPEEWALIGGMDFGYDHPFAAVKMAWDREADVIYVMCAYRQREATPVIHTAA 342 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 L+ WG LPWAWPHDGLQHDK SGEQL+ QYR+QG+ ML + ATF DG+NG+EAG+++ML Sbjct: 343 LRPWGAHLPWAWPHDGLQHDKGSGEQLAEQYRQQGLSMLGQRATFTDGTNGLEAGVTEML 402 Query: 241 DRMRSGRWKVF 251 DRM +GR KVF Sbjct: 403 DRMHTGRLKVF 413 >gi|273810450|ref|YP_003344921.1| TerL [Xylella phage Xfas53] gi|257097825|gb|ACV41131.1| TerL [Xylella phage Xfas53] Length = 470 Score = 341 bits (874), Expect = 7e-92, Method: Compositional matrix adjust. 
Identities = 155/251 (61%), Positives = 196/251 (78%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 LK+++QGR+KWQ++TV +VWFDEEPPEDVYFEG+TR N T G V +T TPLKG S ++ Sbjct: 162 LKSFDQGREKWQADTVDWVWFDEEPPEDVYFEGITRTNRTFGPVFMTFTPLKGMSSVVRR 221 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + DR +++MTI++ HY+ ++R RII SYP HEREARTKG P LGSGR+FPI E+ Sbjct: 222 FLLEQAPDRGLVQMTIDDAEHYSPEDRARIIASYPAHEREARTKGTPSLGSGRVFPIAED 281 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I I IPE W IGGMDFG+ HPFAA + W+R++DVIYV+ YR RE TP+ H AA Sbjct: 282 SIAIAPFSIPEEWALIGGMDFGYDHPFAAVKMAWDREADVIYVMCAYRQREATPVIHTAA 341 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 L+ WG LPWAWPHDGLQHDK SGEQL+ QYR+QG+ ML + ATF DG+NG+EAG+++ML Sbjct: 342 LRPWGAHLPWAWPHDGLQHDKGSGEQLAEQYRQQGLSMLGQRATFTDGTNGLEAGVTEML 401 Query: 241 DRMRSGRWKVF 251 DRM +GR KVF Sbjct: 402 DRMHTGRLKVF 412 >gi|62178924|ref|YP_215341.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|62126557|gb|AAX64260.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|322713379|gb|EFZ04950.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 499 Score = 258 bits (660), Expect = 4e-67, Method: Compositional matrix adjust. 
Identities = 124/251 (49%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++D+D IYV + ++ +E+T + A Sbjct: 302 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKDADTIYVSRVWKAKEKTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 362 VKSWAHKVPTAWPHDGNQHEKGGGEQLKGQYAEAGFMMLQEHATWPDGGNAVEPGITELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DMMLDGRFKVF 432 >gi|281599695|gb|ADA72679.1| Gp2-like protein [Shigella flexneri 2002017] Length = 441 Score = 258 bits (659), Expect = 5e-67, Method: Compositional matrix adjust. 
Identities = 124/251 (49%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 124 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 183 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 184 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 243 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++D+D IYV + ++ +E+T + A Sbjct: 244 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKDADTIYVSRVWKAKEKTAVQAWGA 303 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 304 VKSWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 363 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 364 DMMLDGRFKVF 374 >gi|218549377|ref|YP_002383168.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia fergusonii ATCC 35469] gi|307311077|ref|ZP_07590721.1| protein of unknown function DUF264 [Escherichia coli W] gi|331669066|ref|ZP_08369914.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA271] gi|218356918|emb|CAQ89550.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia fergusonii ATCC 35469] gi|306908583|gb|EFN39080.1| protein of unknown function DUF264 [Escherichia coli W] gi|312945545|gb|ADR26372.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli O83:H1 str. 
NRG 857C] gi|315061655|gb|ADT75982.1| DNA packaging protein gp2 (terminase large subunit) [Escherichia coli W] gi|323377763|gb|ADX50031.1| DNA packaging protein gp2 (terminase large subunit) [Escherichia coli KO11] gi|324117758|gb|EGC11657.1| terminase [Escherichia coli E1167] gi|331064260|gb|EGI36171.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA271] Length = 499 Score = 258 bits (659), Expect = 6e-67, Method: Compositional matrix adjust. Identities = 124/251 (49%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++D+D IYV + ++ +E+T + A Sbjct: 302 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKDADTIYVSRVWKAKEKTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 362 VKSWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DMMLDGRFKVF 432 >gi|323967108|gb|EGB62533.1| terminase [Escherichia coli M863] Length = 499 Score = 258 bits (658), Expect = 7e-67, Method: Compositional matrix adjust. 
Identities = 124/251 (49%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++D+D IYV + ++ +E+T + A Sbjct: 302 TIKCQPFECPDHFYVIGGMDFGWDHPQAHVQLWWDKDADTIYVSRVWKAKEKTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 362 VKSWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DMMLDGRFKVF 432 >gi|110804280|ref|YP_687800.1| putative terminase large subunit [Shigella flexneri 5 str. 8401] gi|110613828|gb|ABF02495.1| putative terminase large subunit [Shigella flexneri 5 str. 8401] Length = 354 Score = 256 bits (654), Expect = 2e-66, Method: Compositional matrix adjust. 
Identities = 123/251 (49%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 37 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 96 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 97 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 156 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++++D IYV + ++ +E+T + A Sbjct: 157 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKEADTIYVSRVWKAKEKTAVQAWGA 216 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 217 VKSWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 276 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 277 DMMLDGRFKVF 287 >gi|281599578|gb|ADA72562.1| putative terminase large subunit [Shigella flexneri 2002017] Length = 351 Score = 256 bits (654), Expect = 2e-66, Method: Compositional matrix adjust. 
Identities = 123/251 (49%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 34 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 93 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 94 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 153 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++++D IYV + ++ +E+T + A Sbjct: 154 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKEADTIYVSRVWKAKEKTAVQAWGA 213 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 214 VKSWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 273 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 274 DMMLDGRFKVF 284 >gi|198245578|ref|YP_002214540.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|197940094|gb|ACH77427.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] Length = 499 Score = 256 bits (653), Expect = 2e-66, Method: Compositional matrix adjust. 
Identities = 123/251 (49%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 AIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLPE ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPEHATFSDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|157734711|dbj|BAF80717.1| terminase large subunit [Enterobacteria phage P22] gi|169658843|dbj|BAG12600.1| terminase large subunit [Enterobacteria phage P22] Length = 499 Score = 256 bits (653), Expect = 2e-66, Method: Compositional matrix adjust. 
Identities = 123/251 (49%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 AIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLPE ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPEHATFSDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|331657716|ref|ZP_08358678.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA206] gi|331055964|gb|EGI27973.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA206] Length = 499 Score = 256 bits (653), Expect = 3e-66, Method: Compositional matrix adjust. 
Identities = 122/251 (48%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++D+D+IY+ + ++ +E+T + A Sbjct: 302 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKDADIIYLSRVWKAKEKTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +K W +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 362 VKPWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DMMLDGRFKVF 432 >gi|326622293|gb|EGE28638.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 482 Score = 256 bits (653), Expect = 3e-66, Method: Compositional matrix adjust. 
Identities = 123/251 (49%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 165 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 224 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 225 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 284 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 285 AIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 344 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLPE ATF DG N VE+GIS++ Sbjct: 345 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPEHATFSDGGNSVESGISELR 404 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 405 DLMLEGRFKVF 415 >gi|221328620|ref|YP_002533461.1| Terminase, large subunit [Salmonella phage epsilon34] gi|255252684|ref|YP_003090219.1| Terminase, large subunit [Salmonella phage c341] gi|193244688|gb|ACF16628.1| Terminase, large subunit [Salmonella phage epsilon34] gi|223697657|gb|ACN18281.1| Terminase, large subunit [Salmonella phage g341c] Length = 499 Score = 256 bits (653), Expect = 3e-66, Method: Compositional matrix adjust. 
Identities = 123/251 (49%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLPE ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPEHATFPDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|327251967|gb|EGE63639.1| DNA packaging protein gp2 [Escherichia coli STEC_7v] gi|327254495|gb|EGE66117.1| DNA packaging protein gp2 [Escherichia coli STEC_7v] Length = 499 Score = 256 bits (653), Expect = 3e-66, Method: Compositional matrix adjust. 
Identities = 122/251 (48%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++D+D+IY+ + ++ +E+T + A Sbjct: 302 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKDADIIYLSRVWKAKEKTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +K W +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 362 VKPWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DMMLDGRFKVF 432 >gi|293410725|ref|ZP_06654301.1| DNA-packaging protein gp2 [Escherichia coli B354] gi|291471193|gb|EFF13677.1| DNA-packaging protein gp2 [Escherichia coli B354] Length = 499 Score = 255 bits (652), Expect = 3e-66, Method: Compositional matrix adjust. 
Identities = 122/251 (48%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++D+D+IY+ + ++ +E+T + A Sbjct: 302 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKDADIIYLSRVWKAKEKTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +K W +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 362 VKPWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DMMLDGRFKVF 432 >gi|238912312|ref|ZP_04656149.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] gi|261245593|emb|CBG23388.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] Length = 499 Score = 255 bits (652), Expect = 3e-66, Method: Compositional matrix adjust. 
Identities = 123/251 (49%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLPE ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPEHATFPDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|197363441|ref|YP_002143078.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|197094918|emb|CAR60455.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|320086843|emb|CBY96615.1| DNA packaging protein gp2 Terminase large subunit [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] Length = 499 Score = 255 bits (652), Expect = 3e-66, Method: Compositional matrix adjust. 
Identities = 123/251 (49%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLPE ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPEHATFPDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|318065950|ref|YP_004123808.1| Gp2 [Salmonella phage ST160] gi|289066936|gb|ADC81147.1| Gp2 [Salmonella phage ST160] Length = 517 Score = 254 bits (650), Expect = 6e-66, Method: Compositional matrix adjust. 
Identities = 122/251 (48%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 200 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 259 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 260 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 319 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 320 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 379 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLP+ ATF DG N VE+GIS++ Sbjct: 380 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPDHATFPDGGNSVESGISELR 439 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 440 DLMLEGRFKVF 450 >gi|161504537|ref|YP_001571649.1| hypothetical protein SARI_02650 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- str. RSK2980] gi|160865884|gb|ABX22507.1| hypothetical protein SARI_02650 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:--] Length = 499 Score = 254 bits (649), Expect = 7e-66, Method: Compositional matrix adjust. 
Identities = 122/251 (48%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLP+ ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPDHATFPDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|148557334|ref|YP_001264916.1| hypothetical protein Swit_4440 [Sphingomonas wittichii RW1] gi|148502524|gb|ABQ70778.1| hypothetical protein Swit_4440 [Sphingomonas wittichii RW1] Length = 276 Score = 254 bits (649), Expect = 7e-66, Method: Compositional matrix adjust. 
Identities = 116/191 (60%), Positives = 142/191 (74%) Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 YL + R V RMTI++ HY+ ER I+ SYP HER+AR +G P+LGSGR+FP+ E+ Sbjct: 24 YLWETPMTRHVTRMTIDDAEHYSPAERAAIVASYPAHERKARAEGIPMLGSGRVFPVDED 83 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I I + ++P W QIGG+DFGW HP AA L W+RD+D IYV +Y RE TPI H AA Sbjct: 84 VIKIRAFEVPAGWTQIGGIDFGWDHPTAAVRLAWDRDADCIYVTASYGVREATPILHAAA 143 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LK WG WLPWAWPHDGLQHDK SG L+ QYR QG+ +LPE A+F++G NGVEAGI++ML Sbjct: 144 LKPWGNWLPWAWPHDGLQHDKGSGAALAQQYRDQGLSLLPEKASFEEGGNGVEAGIAEML 203 Query: 241 DRMRSGRWKVF 251 DRM SGRWKVF Sbjct: 204 DRMLSGRWKVF 214 >gi|51236724|ref|YP_063734.1| terminase large subunit [Enterobacteria phage P22] gi|137879|sp|P26745|TERL_BPP22 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging protein gp2; AltName: Full=Terminase large subunit gi|21914414|gb|AAM81379.1|AF527608_1 terminase large subunit [Salmonella phage P22-pbi] gi|553005|gb|AAA72959.1| DNA pacaging [Enterobacteria phage P22] gi|8439622|gb|AAF75044.1| terminase large subunit [Enterobacteria phage P22] gi|28394263|tpg|DAA00977.1| TPA_inf: terminase large subunit [Enterobacteria phage P22] Length = 499 Score = 254 bits (649), Expect = 7e-66, Method: Compositional matrix adjust. 
Identities = 122/251 (48%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLP+ ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPDHATFPDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|168240109|ref|ZP_02665041.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] gi|194451817|ref|YP_002044341.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|194410121|gb|ACF70340.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|205340165|gb|EDZ26929.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] Length = 499 Score = 254 bits (649), Expect = 8e-66, Method: Compositional matrix adjust. 
Identities = 122/251 (48%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLP+ ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPDHATFPDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|24371583|ref|NP_720326.1| gp2 [Enterobacteria phage ST64T] gi|24250810|gb|AAL15523.1| gp2 [Salmonella phage ST64T] Length = 517 Score = 254 bits (649), Expect = 8e-66, Method: Compositional matrix adjust. 
Identities = 122/251 (48%), Positives = 160/251 (63%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 200 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 259 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 260 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 319 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 320 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 379 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLPE ATF DG N VE+GI ++ Sbjct: 380 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPEHATFPDGGNSVESGIGELR 439 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 440 DLMLEGRFKVF 450 >gi|60476789|gb|AAX21426.1| gp2 [Enterobacteria phage L] Length = 499 Score = 254 bits (648), Expect = 9e-66, Method: Compositional matrix adjust. 
Identities = 122/251 (48%), Positives = 160/251 (63%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLPE ATF DG N VE+GI ++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPEHATFPDGGNSVESGIGELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|219681243|ref|YP_002455888.1| Gp2 [Salmonella enterica bacteriophage SE1] gi|66473858|gb|AAY46504.1| Gp2 [Salmonella phage SE1] Length = 499 Score = 253 bits (647), Expect = 1e-65, Method: Compositional matrix adjust. 
Identities = 122/251 (48%), Positives = 160/251 (63%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRAAWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLP+ ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPDHATFPDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|315299781|gb|EFU59021.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 16-3] Length = 499 Score = 253 bits (646), Expect = 2e-65, Method: Compositional matrix adjust. 
Identities = 121/251 (48%), Positives = 162/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VW DEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWVDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++D+D+IY+ + ++ +E+T + A Sbjct: 302 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKDADIIYLSRVWKAKEKTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +K W +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 362 VKPWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DMMLDGRFKVF 432 >gi|46358697|ref|YP_006405.1| Gp2 [Enterobacteria phage ST104] gi|46357933|dbj|BAD15212.1| Gp2 [Enterobacteria phage ST104] gi|312911340|dbj|BAJ35314.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] Length = 499 Score = 253 bits (646), Expect = 2e-65, Method: Compositional matrix adjust. 
Identities = 121/251 (48%), Positives = 160/251 (63%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLP+ ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPDHATFPDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+K F Sbjct: 422 DLMLEGRFKAF 432 >gi|300920006|ref|ZP_07136465.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 115-1] gi|300412953|gb|EFJ96263.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 115-1] Length = 498 Score = 247 bits (631), Expect = 9e-64, Method: Compositional matrix adjust. 
Identities = 119/251 (47%), Positives = 160/251 (63%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I MDFGW HP A L W++D DVIY+ + ++ +++ +A Sbjct: 302 TIKCQPFECPDHFYVINAMDFGWDHPQAHIQLWWDKDEDVIYLSRVWKAKQKKATEAWSA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +K+W K P AWPHDG QH+K G QL QY G MLP+ AT+ DG N VE GI+++ Sbjct: 362 VKAWSKNTPTAWPHDGHQHEKGGGAQLKEQYADAGFDMLPDHATWPDGGNAVEPGIAEIR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DMMLDGRFKVF 432 >gi|24111660|ref|NP_706170.1| putative terminase large subunit [Shigella flexneri 2a str. 301] gi|24050435|gb|AAN41877.1| putative terminase large subunit [Shigella flexneri 2a str. 301] gi|313646707|gb|EFS11166.1| DNA packaging gp2 domain protein [Shigella flexneri 2a str. 2457T] Length = 300 Score = 234 bits (597), Expect = 8e-60, Method: Compositional matrix adjust. 
Identities = 113/233 (48%), Positives = 151/233 (64%) Query: 19 VWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINE 78 +WFDEEPP +Y EGLTR N LT TPL G S ++ +L S ++V+ MTI + Sbjct: 1 MWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTKFLKNPSKSQKVVNMTIYD 60 Query: 79 TPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGG 138 HY ++++++II SYP HEREAR +G P +GSGRIF I EE I + P+H+ IGG Sbjct: 61 AEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEETIKCQPFECPDHFYVIGG 120 Query: 139 MDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQ 198 MDFGW HP A L W++++D IYV + ++ +E+T + A+KSW +P AWPHDG Q Sbjct: 121 MDFGWDHPQAQVQLWWDKEADTIYVSRVWKAKEKTAVQAWGAVKSWAHKVPTAWPHDGNQ 180 Query: 199 HDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSGRWKVF 251 H+K GEQL QY G ML E AT+ DG N VE GI+++ D M GR+KVF Sbjct: 181 HEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELRDMMLDGRFKVF 233 >gi|158422463|ref|YP_001523755.1| putative DNA packaging protein GP2 [Azorhizobium caulinodans ORS 571] gi|158329352|dbj|BAF86837.1| putative DNA packaging protein GP2 [Azorhizobium caulinodans ORS 571] Length = 251 Score = 223 bits (567), Expect = 2e-56, Method: Compositional matrix adjust. 
Identities = 100/183 (54%), Positives = 136/183 (74%) Query: 69 RQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINSLD 128 R I+ T ++ PH +Q++ + + P H R+ARTKG P+LGSGR+FPI EE I+ +++ Sbjct: 3 RFCIQATWDDVPHLTQQQKDELWAAIPAHMRDARTKGIPVLGSGRVFPIAEELILCDAMP 62 Query: 129 IPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWL 188 IP+HW +I G+DFGW HPF A + W+RD+DVIYVV YR RE++ + H AA++ WGK + Sbjct: 63 IPKHWARINGLDFGWDHPFGAVSIAWDRDADVIYVVNTYRAREESSVIHAAAIRPWGKKI 122 Query: 189 PWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSGRW 248 P AWPHDG QHDK SG+QL+ QYR G++ML E AT + G NGVEAGI +ML+RM++GR Sbjct: 123 PCAWPHDGFQHDKGSGQQLAEQYRDHGLEMLDEHATHEQGGNGVEAGIMEMLERMQTGRL 182 Query: 249 KVF 251 KVF Sbjct: 183 KVF 185 >gi|137993|sp|P16938|VG2_BPLP7 RecName: Full=Protein GP2 gi|75884|pir||Z2BPL7 gene 2 protein - phage LP-7 (fragment) gi|553003|gb|AAA88220.1| packaging glycoprotein [Enterobacteria phage LP7] Length = 475 Score = 216 bits (550), Expect = 2e-54, Method: Compositional matrix adjust. 
Identities = 106/226 (46%), Positives = 141/226 (62%) Query: 26 PEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNEQ 85 P +Y EGLTR N LT TPL G S + +L S ++V+ MTI + HY ++ Sbjct: 203 PYSIYAEGLTRTNKYGQFSILTFTPLMGMSDGVTKFLKNPSKSQKVVNMTIYDAEHYTDE 262 Query: 86 ERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHH 145 ++++II SYP HEREAR +G P +GSGRIF I EE I + P+H+ I DFGW+H Sbjct: 263 QKEQIIASYPEHEREARARGIPTMGSGRIFQIPEETIKCQPFECPDHFYVIDAQDFGWNH 322 Query: 146 PFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGE 205 P A L W++D+DV Y+ + ++ E T + A+KSW +P AWPHDG QH+K GE Sbjct: 323 PQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGAVKSWANKIPVAWPHDGHQHEKGGGE 382 Query: 206 QLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSGRWKVF 251 QL QY G MLP+ ATF DG N VE+GIS++ D M GR+KVF Sbjct: 383 QLKTQYADAGFSMLPDHATFPDGGNSVESGISELRDLMLEGRFKVF 428 >gi|167041080|gb|ABZ05841.1| hypothetical protein ALOHA_HF400048F7ctg1g8 [uncultured marine microorganism HF4000_48F7] Length = 504 Score = 209 bits (532), Expect = 3e-52, Method: Compositional matrix adjust. 
Identities = 99/251 (39%), Positives = 155/251 (61%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K+Y+ G W V YVW DEEPP+++Y + L + G V+LT TP G + ++ Sbjct: 173 FKSYDAGPASWMGVAVDYVWLDEEPPQEIYSQALRATLKSGGPVSLTFTPEAGVTGVVAM 232 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L+ + +++ T ++ PH + + R+ I+ + P HER R+KG P LGSG++FP+ E+ Sbjct: 233 FLNERKGGQALVQATWDDAPHLSLEVREEILAALPPHERLMRSKGIPTLGSGQVFPVPED 292 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I++++ IP+H+ +I G+DFG+ HP A + +RD+DV+Y+ YR + + H A Sbjct: 293 QIMVSAFAIPDHFSRIAGIDFGFDHPTACVWMAHDRDTDVVYLYDAYREKGSGMLQHAEA 352 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +K G ++P AWPHDG HDK SGE L+ QYRR G+ L T +G VE G+ +L Sbjct: 353 IKHRGGFIPVAWPHDGSIHDKGSGEALATQYRRSGVNFLGSHFTNPEGGIAVEPGLMALL 412 Query: 241 DRMRSGRWKVF 251 RM++GR+KVF Sbjct: 413 TRMQTGRFKVF 423 >gi|94317806|gb|ABF15069.1| terminase large subunit Gp2 [Salmonella enterica subsp. enterica serovar Typhimurium] Length = 278 Score = 204 bits (520), Expect = 7e-51, Method: Compositional matrix adjust. Identities = 98/206 (47%), Positives = 133/206 (64%) Query: 46 LTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKG 105 LT TPL G S ++ +L S ++V+ MTI + HY ++++++II SYP HEREAR +G Sbjct: 10 LTFTPLMGMSDVVTKFLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARG 69 Query: 106 EPILGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVK 165 P +GSGRIF I EE I + P+H+ I DFGW+HP A L W++D+DV Y+ + Sbjct: 70 IPTMGSGRIFQIPEETIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLAR 129 Query: 166 NYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATF 225 ++ E T + A+KSW +P AWPHDG QH+K GEQL QY G MLP+ ATF Sbjct: 130 VWKKSENTAVQAWGAVKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPDHATF 189 Query: 226 DDGSNGVEAGISDMLDRMRSGRWKVF 251 DG N VE+GIS++ D M GR+KVF Sbjct: 190 PDGGNSVESGISELRDLMLEGRFKVF 215 >gi|321225021|gb|EFX50082.1| Phage terminase, large subunit [Salmonella enterica subsp. 
enterica serovar Typhimurium str. TN061786] Length = 267 Score = 194 bits (492), Expect = 1e-47, Method: Compositional matrix adjust. Identities = 93/199 (46%), Positives = 128/199 (64%) Query: 53 GRSPIIEHYLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSG 112 G S ++ +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSG Sbjct: 2 GMSDVVTKFLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSG 61 Query: 113 RIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQ 172 RIF I EE I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E Sbjct: 62 RIFQIPEETIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSEN 121 Query: 173 TPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGV 232 T + A+KSW +P AWPHDG QH+K GEQL QY G MLP+ ATF DG N V Sbjct: 122 TAVQAWGAVKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPDHATFPDGGNSV 181 Query: 233 EAGISDMLDRMRSGRWKVF 251 E+GIS++ D M GR+KVF Sbjct: 182 ESGISELRDLMLEGRFKVF 200 >gi|30061788|ref|NP_835959.1| putative terminase large subunit [Shigella flexneri 2a str. 2457T] gi|30040030|gb|AAP15764.1| putative terminase large subunit [Shigella flexneri 2a str. 2457T] Length = 186 Score = 186 bits (473), Expect = 2e-45, Method: Compositional matrix adjust. 
Identities = 87/183 (47%), Positives = 120/183 (65%) Query: 19 VWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINE 78 +WFDEEPP +Y EGLTR N LT TPL G S ++ +L S ++V+ MTI + Sbjct: 1 MWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTKFLKNPSKSQKVVNMTIYD 60 Query: 79 TPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGG 138 HY ++++++II SYP HEREAR +G P +GSGRIF I EE I + P+H+ IGG Sbjct: 61 AEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEETIKCQPFECPDHFYVIGG 120 Query: 139 MDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQ 198 MDFGW HP A L W++++D IYV + ++ +E+T + A+KSW +P AWPHDG Q Sbjct: 121 MDFGWDHPQAQVQLWWDKEADTIYVSRVWKAKEKTAVQAWGAVKSWAHKVPTAWPHDGNQ 180 Query: 199 HDK 201 H+K Sbjct: 181 HEK 183 >gi|89885991|ref|YP_516188.1| phage terminase large subunit [Sodalis phage phiSG1] gi|89191726|dbj|BAE80473.1| phage terminase large subunit [Sodalis phage phiSG1] gi|125470018|gb|ABN42210.1| gp02 [Sodalis phage phiSG1] Length = 475 Score = 165 bits (417), Expect = 7e-39, Method: Compositional matrix adjust. 
Identities = 90/254 (35%), Positives = 141/254 (55%), Gaps = 10/254 (3%) Query: 3 AYEQGRDKWQSNTVHYVWFDEEPPE-DVYFEGLTRI----NATQGLVTLTLTPLKGRSPI 57 +Y QG+ + V + DEEP + +Y + LTR G LT TP GR+ + Sbjct: 162 SYSQGQHALMGDCVDWFHIDEEPRDPTIYPQVLTRTATGDRGKGGRGILTFTPENGRTDL 221 Query: 58 IEHYLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPI 117 + ++ S + I + ++ PH +++ + ++ S+P H+R+ RTKG P+LG GRI+ + Sbjct: 222 VIGFMDNPSPAQTCINVGWDDAPHLSQKVKNDLLASFPAHQRDMRTKGIPMLGHGRIYDL 281 Query: 118 VEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFH 177 E+ I + +P HW+ I GMDFGW HP A LVW+ ++++ YV + Y+ R+ +P Sbjct: 282 GEDFITCDPFPVPAHWLVIDGMDFGWDHPQAHIQLVWDNENEMFYVTRAYKARQVSPAEA 341 Query: 178 VAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGIS 237 +A+ W + +P AWP DGL +K SG Q Y G ML + A + DGS VE Sbjct: 342 YSAVSIWAENVPTAWPSDGLMTEKGSGIQQKTYYDDAGFCMLRDPAQWPDGSRSVE---- 397 Query: 238 DMLDRMRSGRWKVF 251 + D MR G++KVF Sbjct: 398 -LHDLMRRGKFKVF 410 >gi|49146380|ref|YP_025488.1| putative phage DNA packaging protein Gp2 [Caedibacter taeniospiralis] gi|40458348|gb|AAR87096.1| putative phage DNA packaging protein Gp2 [Caedibacter taeniospiralis] Length = 474 Score = 157 bits (396), Expect = 2e-36, Method: Compositional matrix adjust. 
Identities = 94/285 (32%), Positives = 145/285 (50%), Gaps = 40/285 (14%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQ----GLVTLTLTPLKGRSP 56 K+YEQGR K+Q+ + V DEEPP D+Y E L R +T+ G+V LT+TPL G + Sbjct: 133 FKSYEQGRKKFQAAKLDLVHLDEEPPRDIYVESLMRTMSTEVDNEGIVLLTMTPLLGLTD 192 Query: 57 IIEHYLSAS-----------------------------SSDRQVIRMTINETPHYNEQER 87 +I + + ++R I+ + ++ PH + + Sbjct: 193 LILEFQETTIEREVINGSGVSEMSEMTEEVIKVDEGSIVNNRFYIQASWDDNPHLLDSAK 252 Query: 88 KRIIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPF 147 + + S HE+EAR G P LGSG ++P+ E +V+N IP+HW ++ G+DFGW +P Sbjct: 253 ETLSKSLKPHEKEARKHGIPSLGSGLVYPVSEVAVVVNPFVIPKHWGRVFGLDFGWINPT 312 Query: 148 AAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGK-WLPWAWPHDGLQHDKRSGEQ 206 AA V +RD+DV+Y+ Y E+TP HV LK G + + G Q +R G Sbjct: 313 AALFAVIDRDNDVMYLTGEYYVSERTPQQHVYELKKLGADKINGVYDPAGEQSSQRDGGD 372 Query: 207 LSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSGRWKVF 251 L+ YR G++ L + N E GI +L R ++G+ K+F Sbjct: 373 LAQLYRDSGLRYLYKA------DNAKEEGIMKVLQRFQNGKLKIF 411 >gi|167583562|ref|YP_001671752.1| terminase large subunit [Enterobacteria phage phiEco32] gi|164375400|gb|ABY52808.1| terminase large subunit [Enterobacteria phage phiEco32] Length = 513 Score = 148 bits (373), Expect = 7e-34, Method: Compositional matrix adjust. 
Identities = 82/268 (30%), Positives = 139/268 (51%), Gaps = 17/268 (6%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPED---VYFEGLTRINATQGLVTLTLTPLKGRSPI 57 ++ +QG TV Y+W DEE P + ++ + +TR T+GLVT+T TP G + + Sbjct: 177 FRSTQQGEHTLMGATVDYIWLDEEDPYESMAIFAQCVTRTLTTKGLVTITATPENGLTEL 236 Query: 58 IEHYLSASSSDRQVIRMTINET---------PHYNEQERKRIIDSYPLHEREARTKGEPI 108 ++ ++ + N + H +Q+ K + + P + E R+KG P+ Sbjct: 237 VDKFMKGEGDESTGSLYFQNASWWDAHVDLGGHITDQDIKDMTEGIPAWQLEMRSKGMPL 296 Query: 109 LGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYR 168 LGSG I+ + ++ I +IP+ W ++ +D G HP AA ++ ++D IYV +Y+ Sbjct: 297 LGSGLIYDVSDDTIKCEPFEIPDTWKRVCAIDIGIDHPTAAVWTAYDANTDTIYVYDSYK 356 Query: 169 CREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDG 228 TP++H A+ G+W+P PHD +K SG ++ Y+ G+ + E G Sbjct: 357 EGGFTPVYHAPAINGRGQWIPVILPHDADNTEKGSGSSVAQFYKNAGVNVQSETFYNKIG 416 Query: 229 SNG-----VEAGISDMLDRMRSGRWKVF 251 +G VE GI+D+ +RM SGR+KVF Sbjct: 417 MDGKKNFFVEPGITDIRERMMSGRFKVF 444 >gi|27476053|ref|NP_775255.1| terminase [Pseudomonas phage PaP3] gi|27414483|gb|AAL85569.1| terminase [Pseudomonas phage PaP3] Length = 482 Score = 146 bits (369), Expect = 3e-33, Method: Compositional matrix adjust. 
Identities = 81/260 (31%), Positives = 131/260 (50%), Gaps = 9/260 (3%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K+YE +DK+ + +W DEE P+D+Y + +TR T G+V LT TP G + I++ Sbjct: 154 FKSYEMSQDKFMGTAIDVIWLDEECPKDIYTQCVTRTATTGGIVYLTFTPEHGLTEIVKD 213 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + +I + + PH + + +++++ Y ER R +G P+LGSG +FPI+EE Sbjct: 214 FLQDLKPGQFLIHASWEDAPHLSPEVKEQLLSVYSPAERRMRAEGIPMLGSGVVFPILEE 273 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 V DIP+H+ +I G+D G+ HP A + W+ + D Y+ +T H A Sbjct: 274 KFVCEPFDIPDHFHRIIGIDLGFDHPNAIACVAWDAEKDKYYLYDERSESGETLGMHADA 333 Query: 181 LK-SWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDD--------GSNG 231 + G +P PHD +HD + + + + F + G N Sbjct: 334 IYLKGGHQIPVVVPHDAFKHDGATSGRRFVDLLKDDHNLNVVYEPFSNPPGPDGKHGGNS 393 Query: 232 VEAGISDMLDRMRSGRWKVF 251 VE G++ ML RM +G KVF Sbjct: 394 VEFGVNWMLTRMENGDLKVF 413 >gi|167600439|ref|YP_001671939.1| terminase large subunit [Pseudomonas phage LUZ24] gi|161168302|emb|CAP45467.1| terminase large subunit [Pseudomonas phage LUZ24] Length = 482 Score = 143 bits (360), Expect = 2e-32, Method: Compositional matrix adjust. 
Identities = 79/260 (30%), Positives = 130/260 (50%), Gaps = 9/260 (3%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K+YE +DK+ + +W DEE P+D+Y + +TR T G+V LT TP G + I++ Sbjct: 154 FKSYEMSQDKFMGTAIDVIWLDEECPKDIYTQCVTRTATTGGIVYLTFTPEHGLTEIVKD 213 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + ++ + + PH + + +++++ Y ER R +G P+LGSG +FPI+EE Sbjct: 214 FLQDLKPGQFLVHASWEDAPHLSPEVKEQLLSVYSPAERRMRAEGVPMLGSGVVFPILEE 273 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 V IP+H+ +I G+D G+ HP A + W+ + D Y+ +T H A Sbjct: 274 KFVCEPFQIPDHFHRIIGIDLGFDHPNAIACVAWDPEKDKYYLYDERSESGETLGMHADA 333 Query: 181 LK-SWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDD--------GSNG 231 + G +P PHD +HD + + + + F + G N Sbjct: 334 IYLKGGHQIPVVVPHDAFKHDGATSGRRFVDLLKDDHNLNVVYEPFSNPPGPDGKHGGNS 393 Query: 232 VEAGISDMLDRMRSGRWKVF 251 VE G++ ML RM +G KVF Sbjct: 394 VEFGVNWMLTRMENGDLKVF 413 >gi|264678784|ref|YP_003278691.1| phage DNA packaging protein Gp2 [Comamonas testosteroni CNB-2] gi|262209297|gb|ACY33395.1| putative phage DNA packaging protein Gp2 [Comamonas testosteroni CNB-2] Length = 434 Score = 135 bits (341), Expect = 4e-30, Method: Compositional matrix adjust. 
Identities = 73/223 (32%), Positives = 120/223 (53%), Gaps = 18/223 (8%) Query: 36 RINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTI----NETPHYNEQERKRII 91 R+ +G+ LT TPL G + +++ S + + R + ++ PH E+ + +++ Sbjct: 2 RLMTREGISMLTFTPLSGLTALVQQLTSPDPEGKVIGRAVVQCGWDDVPHLTEEAKAKLL 61 Query: 92 DSYPLHEREARTKGEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGH 151 H+R+ARTKG P LG+G I+P+ E DIV+ +P+ W + GMD GW+ A Sbjct: 62 SRLMPHQRDARTKGVPALGAGAIYPVPESDIVVPDFQLPDFWPRAYGMDVGWNRTSA--- 118 Query: 152 LVW---NRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLS 208 VW +RDSD++Y+ N+ + P H ++K+ G W+P A ++ GEQL Sbjct: 119 -VWGAHDRDSDIVYLYSNHYRGQAEPSVHATSIKARGDWIPGAIDPASRGRSQKDGEQLL 177 Query: 209 AQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSGRWKVF 251 Y G++++ +NGVEAGI + +RM +GR KVF Sbjct: 178 QNYVDLGLQLV-------TANNGVEAGIYQVWERMSTGRLKVF 213 >gi|71898835|ref|ZP_00681003.1| phage-related protein [Xylella fastidiosa Ann-1] gi|71731421|gb|EAO33484.1| phage-related protein [Xylella fastidiosa Ann-1] Length = 291 Score = 132 bits (332), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 62/122 (50%), Positives = 86/122 (70%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 LK++EQG +KWQ++ V+++WFDE+PPEDVYFEG+TR N T GLV +T PLK ++ Sbjct: 41 LKSFEQGGEKWQADPVNWMWFDEQPPEDVYFEGITRTNRTFGLVCMTFAPLKSILTVVWR 100 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + DR ++M I + HY ++ RI SYP +ER+ART+G P LGSGR+FPI E Sbjct: 101 FLLENVPDRADVQMIIEDAEHYLLEDCARITASYPPYERQARTQGVPALGSGRLFPIAGE 160 Query: 121 DI 122 I Sbjct: 161 KI 162 >gi|15837841|ref|NP_298529.1| hypothetical protein XF1239 [Xylella fastidiosa 9a5c] gi|9106220|gb|AAF84049.1|AE003958_3 hypothetical protein XF_1239 [Xylella fastidiosa 9a5c] Length = 135 Score = 132 bits (332), Expect = 4e-29, Method: Compositional matrix adjust. 
Identities = 63/122 (51%), Positives = 86/122 (70%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 L ++EQG +KWQ++ V ++WFDE+PPEDVYFEG+TR N T LV +T TPLK S ++ Sbjct: 7 LTSFEQGGEKWQADPVDWMWFDEQPPEDVYFEGITRTNRTFWLVCMTFTPLKSISTVVWR 66 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + DR ++M+I + HY ++ RI SYP +EREART+G P LGSGR+FPI E Sbjct: 67 FLLENVPDRADMQMSIEDAEHYLVEDCARITASYPPYEREARTQGVPALGSGRVFPIAGE 126 Query: 121 DI 122 I Sbjct: 127 KI 128 >gi|28198423|ref|NP_778737.1| hypothetical protein PD0512 [Xylella fastidiosa Temecula1] gi|28056507|gb|AAO28386.1| phage-related protein [Xylella fastidiosa Temecula1] Length = 257 Score = 130 bits (326), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 61/122 (50%), Positives = 84/122 (68%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 LK++EQG +KWQ++ V ++WFDE+PPEDVYFEG+ R N T GLV +T PLK ++ Sbjct: 7 LKSFEQGGEKWQADPVDWIWFDEQPPEDVYFEGIIRTNRTFGLVCMTFAPLKSILTVVWR 66 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + DR ++M I + HY ++ RI SYP +ER+ART+G P LGSGR+FPI E Sbjct: 67 FLLENVPDRADVQMIIEDAEHYLLEDCARITASYPPYERQARTQGVPALGSGRLFPIAGE 126 Query: 121 DI 122 I Sbjct: 127 KI 128 >gi|182681090|ref|YP_001829250.1| bacteriophage terminase large (ATPase) subunit and inactivated derivatives-like protein [Xylella fastidiosa M23] gi|182631200|gb|ACB91976.1| Bacteriophage terminase large (ATPase) subunit and inactivated derivatives-like protein [Xylella fastidiosa M23] Length = 291 Score = 130 bits (326), Expect = 2e-28, Method: Compositional matrix adjust. 
Identities = 61/122 (50%), Positives = 84/122 (68%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 LK++EQG +KWQ++ V ++WFDE+PPEDVYFEG+ R N T GLV +T PLK ++ Sbjct: 41 LKSFEQGGEKWQADPVDWIWFDEQPPEDVYFEGIIRTNRTFGLVCMTFAPLKSILTVVWR 100 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + DR ++M I + HY ++ RI SYP +ER+ART+G P LGSGR+FPI E Sbjct: 101 FLLENVPDRADVQMIIEDAEHYLLEDCARITASYPPYERQARTQGVPALGSGRLFPIAGE 160 Query: 121 DI 122 I Sbjct: 161 KI 162 >gi|307579537|gb|ADN63506.1| bacteriophage terminase large (ATPase) subunit and inactivated derivatives-like protein [Xylella fastidiosa subsp. fastidiosa GB514] Length = 278 Score = 130 bits (326), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 61/122 (50%), Positives = 84/122 (68%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 LK++EQG +KWQ++ V ++WFDE+PPEDVYFEG+ R N T GLV +T PLK ++ Sbjct: 28 LKSFEQGGEKWQADPVDWIWFDEQPPEDVYFEGIIRTNRTFGLVCMTFAPLKSILTVVWR 87 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + DR ++M I + HY ++ RI SYP +ER+ART+G P LGSGR+FPI E Sbjct: 88 FLLENVPDRADVQMIIEDAEHYLLEDCARITASYPPYERQARTQGVPALGSGRLFPIAGE 147 Query: 121 DI 122 I Sbjct: 148 KI 149 >gi|260463792|ref|ZP_05811989.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030389|gb|EEW31668.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 131 Score = 114 bits (284), Expect = 1e-23, Method: Compositional matrix adjust. 
Identities = 48/102 (47%), Positives = 65/102 (63%) Query: 68 DRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINSL 127 R V MTI++ HY+ ER I+ +YP HEREAR +G P+LGSGRIFP+ E I Sbjct: 2 SRHVTFMTIDDAAHYSPDERAAIVAAYPAHEREARARGIPVLGSGRIFPVAEALIACEPF 61 Query: 128 DIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRC 169 +P +W ++G +DFGW HP AA L W+ ++DV+YV C Sbjct: 62 RLPRYWPRLGALDFGWDHPSAAVELAWDTEADVVYVTNANPC 103 >gi|71274944|ref|ZP_00651232.1| similar to Bacteriophage terminase large (ATPase) subunit and inactivated derivatives [Xylella fastidiosa Dixon] gi|71901567|ref|ZP_00683649.1| similar to Bacteriophage terminase large (ATPase) subunit and inactivated derivatives [Xylella fastidiosa Ann-1] gi|71164676|gb|EAO14390.1| similar to Bacteriophage terminase large (ATPase) subunit and inactivated derivatives [Xylella fastidiosa Dixon] gi|71728653|gb|EAO30802.1| similar to Bacteriophage terminase large (ATPase) subunit and inactivated derivatives [Xylella fastidiosa Ann-1] Length = 142 Score = 89.0 bits (219), Expect = 5e-16, Method: Compositional matrix adjust. Identities = 45/94 (47%), Positives = 59/94 (62%) Query: 29 VYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNEQERK 88 +YFE +TR N T GLV +T PLK ++ +L + DR ++M I + HY ++ Sbjct: 1 MYFEVITRTNRTFGLVCMTFAPLKSILTVVWRFLLENVPDRADVQMIIEDAEHYFLEDCA 60 Query: 89 RIIDSYPLHEREARTKGEPILGSGRIFPIVEEDI 122 RI SYP +EREARTKG P LGSGR+FPI E I Sbjct: 61 RITASYPPYEREARTKGVPALGSGRLFPIAGEKI 94 >gi|148557330|ref|YP_001264912.1| bacteriophage terminase large (ATPase) subunit-like protein [Sphingomonas wittichii RW1] gi|148502520|gb|ABQ70774.1| Bacteriophage terminase large (ATPase) subunit and inactivated derivatives-like protein [Sphingomonas wittichii RW1] Length = 225 Score = 89.0 bits (219), Expect = 6e-16, Method: Compositional matrix adjust. 
Identities = 38/62 (61%), Positives = 46/62 (74%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 KAYEQGR KWQ +T++ +WFDEEPP D+Y EGLTR NAT G LT TPLKG S ++ Sbjct: 161 FKAYEQGRAKWQGDTLNGIWFDEEPPLDIYVEGLTRTNATGGFAMLTFTPLKGMSEVVRM 220 Query: 61 YL 62 +L Sbjct: 221 FL 222 >gi|13471711|ref|NP_103278.1| hypothetical protein msl1767 [Mesorhizobium loti MAFF303099] gi|14022455|dbj|BAB49064.1| msl1767 [Mesorhizobium loti MAFF303099] Length = 90 Score = 88.6 bits (218), Expect = 8e-16, Method: Compositional matrix adjust. Identities = 38/69 (55%), Positives = 50/69 (72%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K++E+GR+KWQ T+H VWFDEEPP D+Y EGLTR NAT G+ +T TPL G S ++ Sbjct: 18 FKSFEKGREKWQGETLHGVWFDEEPPLDIYSEGLTRTNATSGITIVTFTPLLGMSDVVLL 77 Query: 61 YLSASSSDR 69 +LSA +R Sbjct: 78 FLSAGEVER 86 >gi|13471714|ref|NP_103281.1| hypothetical protein mll1771 [Mesorhizobium loti MAFF303099] gi|14022458|dbj|BAB49067.1| mll1771 [Mesorhizobium loti MAFF303099] Length = 254 Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 38/69 (55%), Positives = 50/69 (72%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K++E+GR+KWQ T+H VWFDEEPP D+Y EGLTR NAT G+ +T TPL G S ++ Sbjct: 182 FKSFEKGREKWQGETLHGVWFDEEPPLDIYSEGLTRTNATSGITIVTFTPLLGMSDVVLL 241 Query: 61 YLSASSSDR 69 +LSA +R Sbjct: 242 FLSAGDVER 250 >gi|260463788|ref|ZP_05811985.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030385|gb|EEW31664.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 209 Score = 85.5 bits (210), Expect = 6e-15, Method: Compositional matrix adjust. 
Identities = 37/65 (56%), Positives = 48/65 (73%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K++E+GR+KWQ T+H VWFDEEPP D+Y EGLTR NAT G+ +T TPL G S ++ Sbjct: 137 FKSFEKGREKWQGETLHGVWFDEEPPLDIYSEGLTRTNATGGITIVTFTPLLGMSDVVLL 196 Query: 61 YLSAS 65 +LSA Sbjct: 197 FLSAG 201 >gi|158422462|ref|YP_001523754.1| putative DNA packaging protein GP3 [Azorhizobium caulinodans ORS 571] gi|158329351|dbj|BAF86836.1| putative DNA packaging protein GP3 [Azorhizobium caulinodans ORS 571] Length = 203 Score = 72.0 bits (175), Expect = 8e-11, Method: Compositional matrix adjust. Identities = 31/64 (48%), Positives = 41/64 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K+Y+QGR K+Q H VW DEEPP DVY E L R+ T GL+ T TPL+G + I Sbjct: 137 FKSYDQGRKKFQGTAKHVVWLDEEPPADVYQEALMRLMTTSGLMLCTFTPLEGMTDIAAQ 196 Query: 61 YLSA 64 +++A Sbjct: 197 FIAA 200 >gi|30061789|ref|NP_835960.1| putative terminase large subunit [Shigella flexneri 2a str. 2457T] gi|30040031|gb|AAP15765.1| putative terminase large subunit [Shigella flexneri 2a str. 2457T] Length = 124 Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 26/51 (50%), Positives = 32/51 (62%) Query: 201 KRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSGRWKVF 251 + GEQL QY G ML E AT+ DG N VE GI+++ D M GR+KVF Sbjct: 7 RAGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELRDMMLDGRFKVF 57 >gi|215304|gb|AAA72960.1| unnamed protein product [Enterobacteria phage P22] Length = 101 Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust. 
Identities = 21/34 (61%), Positives = 26/34 (76%) Query: 218 MLPECATFDDGSNGVEAGISDMLDRMRSGRWKVF 251 MLP+ ATF DG N VE+GIS++ D M GR+KVF Sbjct: 1 MLPDHATFPDGGNSVESGISELRDLMLEGRFKVF 34 >gi|71274589|ref|ZP_00650877.1| hypothetical protein XfasaDRAFT_1897 [Xylella fastidiosa Dixon] gi|71898128|ref|ZP_00680314.1| hypothetical protein XfasoDRAFT_3692 [Xylella fastidiosa Ann-1] gi|170730853|ref|YP_001776286.1| hypothetical protein Xfasm12_1760 [Xylella fastidiosa M12] gi|71164321|gb|EAO14035.1| hypothetical protein XfasaDRAFT_1897 [Xylella fastidiosa Dixon] gi|71732102|gb|EAO34158.1| hypothetical protein XfasoDRAFT_3692 [Xylella fastidiosa Ann-1] gi|167965646|gb|ACA12656.1| hypothetical protein Xfasm12_1760 [Xylella fastidiosa M12] Length = 78 Score = 44.7 bits (104), Expect = 0.011, Method: Composition-based stats. Identities = 23/59 (38%), Positives = 35/59 (59%), Gaps = 7/59 (11%) Query: 46 LTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNEQERKRIIDSYP---LHEREA 101 +T TPLKG S ++ +L+ ++DR I+ + HY+ +E RII SYP H+R A Sbjct: 1 MTFTPLKGMSTVVRRFLTEDAADRGYIK----DAEHYSAEECARIIASYPPRSAHQRSA 55 >gi|284008126|emb|CBA74349.1| DNA packaging protein gp2 [Arsenophonus nasoniae] Length = 137 Score = 44.3 bits (103), Expect = 0.014, Method: Compositional matrix adjust. Identities = 18/26 (69%), Positives = 20/26 (76%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPP 26 K Y QGR +WQ +TVH VWFDEEPP Sbjct: 109 FKPYSQGRARWQGDTVHGVWFDEEPP 134 >gi|297565631|ref|YP_003684603.1| hypothetical protein Mesil_1197 [Meiothermus silvanus DSM 9946] gi|296850080|gb|ADH63095.1| hypothetical protein Mesil_1197 [Meiothermus silvanus DSM 9946] Length = 434 Score = 41.6 bits (96), Expect = 0.11, Method: Compositional matrix adjust. 
Identities = 43/186 (23%), Positives = 77/186 (41%), Gaps = 8/186 (4%) Query: 3 AYEQGRDKWQSNTVHYVWFDEEPPE----DVYFEGLTRINATQGLVTLTLTP--LKGRSP 56 + Q D +S T W DE + D + L R++ QG V +T TP L Sbjct: 131 GHAQDPDSLESATAKAAWLDEAGQKKFRRDSWQAILRRLSIHQGRVLITTTPYYLGWLKA 190 Query: 57 IIEHYLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFP 116 + D +++ + P++ E +R + P + + G +G+I+ Sbjct: 191 DLHDPARQGHPDIELVNFKSVDNPNFPRAEYERARATLPRWKFDMFYNGLFTRPAGQIYD 250 Query: 117 IVEEDI-VINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPI 175 + ++ V + ++PE W + G+DFG + AA L N S+ +V Y+ +T Sbjct: 251 CFDPEVHVRPAFNVPEDWPRFIGLDFGGVNT-AAVKLAKNPASEEYFVYAEYKAGGRTAR 309 Query: 176 FHVAAL 181 H L Sbjct: 310 EHAEVL 315 >gi|326570200|gb|EGE20245.1| major facilitator superfamily protein [Moraxella catarrhalis BC8] Length = 509 Score = 39.3 bits (90), Expect = 0.47, Method: Compositional matrix adjust. Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 7/66 (10%) Query: 111 SGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCR 170 SG + ++ +++N LD W Q+ M FGW PF G L+ VI ++ +Y+ Sbjct: 159 SGCVLSLIFLILLMNWLDDTLTWAQM--MSFGWRIPFICGGLL-----SVILILISYKHL 211 Query: 171 EQTPIF 176 ++TPIF Sbjct: 212 QETPIF 217 >gi|326565298|gb|EGE15483.1| major facilitator superfamily protein [Moraxella catarrhalis 12P80B1] gi|326576952|gb|EGE26858.1| major facilitator superfamily protein [Moraxella catarrhalis O35E] Length = 509 Score = 39.3 bits (90), Expect = 0.49, Method: Compositional matrix adjust. 
Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 7/66 (10%) Query: 111 SGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCR 170 SG + ++ +++N LD W Q+ M FGW PF G L+ VI ++ +Y+ Sbjct: 159 SGCVLSLIFLILLMNWLDDTLTWAQM--MSFGWRIPFICGGLL-----SVILILISYKHL 211 Query: 171 EQTPIF 176 ++TPIF Sbjct: 212 QETPIF 217 >gi|296113304|ref|YP_003627242.1| major facilitator superfamily protein [Moraxella catarrhalis RH4] gi|295920998|gb|ADG61349.1| major facilitator superfamily protein [Moraxella catarrhalis RH4] Length = 509 Score = 39.3 bits (90), Expect = 0.50, Method: Compositional matrix adjust. Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 7/66 (10%) Query: 111 SGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCR 170 SG + ++ +++N LD W Q+ M FGW PF G L+ VI ++ +Y+ Sbjct: 159 SGCVLSLIFLILLMNWLDDTLTWAQM--MSFGWRIPFICGGLL-----SVILILISYKHL 211 Query: 171 EQTPIF 176 ++TPIF Sbjct: 212 QETPIF 217 >gi|326566390|gb|EGE16540.1| major facilitator superfamily protein [Moraxella catarrhalis BC1] Length = 509 Score = 39.3 bits (90), Expect = 0.52, Method: Compositional matrix adjust. Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 7/66 (10%) Query: 111 SGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCR 170 SG + ++ +++N LD W Q+ M FGW PF G L+ VI ++ +Y+ Sbjct: 159 SGCVLSLIFLILLMNWLDDTLTWAQM--MSFGWRIPFICGGLL-----SVILILISYKHL 211 Query: 171 EQTPIF 176 ++TPIF Sbjct: 212 QETPIF 217 >gi|326560292|gb|EGE10680.1| major facilitator superfamily protein [Moraxella catarrhalis 7169] gi|326561986|gb|EGE12319.1| major facilitator superfamily protein [Moraxella catarrhalis 46P47B1] Length = 509 Score = 39.3 bits (90), Expect = 0.52, Method: Compositional matrix adjust. 
Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 7/66 (10%) Query: 111 SGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCR 170 SG + ++ +++N LD W Q+ M FGW PF G L+ VI ++ +Y+ Sbjct: 159 SGCVLSLIFLILLMNWLDDTLTWAQM--MSFGWRIPFICGGLL-----SVILILISYKHL 211 Query: 171 EQTPIF 176 ++TPIF Sbjct: 212 QETPIF 217 >gi|291529974|emb|CBK95559.1| Terminase-like family [Eubacterium siraeum 70/3] Length = 487 Score = 37.0 bits (84), Expect = 2.3, Method: Compositional matrix adjust. Identities = 25/101 (24%), Positives = 43/101 (42%), Gaps = 10/101 (9%) Query: 83 NEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEEDI----------VINSLDIPEH 132 N+ E + P ER+A G+ +G++F +D VI +IP H Sbjct: 222 NDPEYLASLSMLPTAERKALLYGDWNSFTGQVFTEWRDDPEHYCDRRWTHVIAPFEIPRH 281 Query: 133 WVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQT 173 W + G DFG+ PF+ G + + + + Y C ++ Sbjct: 282 WEIVRGFDFGYTRPFSVGWYAVDTKGCIYRIREYYGCTDKA 322 >gi|291556861|emb|CBL33978.1| Terminase-like family [Eubacterium siraeum V10Sc8a] Length = 487 Score = 37.0 bits (84), Expect = 2.5, Method: Compositional matrix adjust. Identities = 25/101 (24%), Positives = 43/101 (42%), Gaps = 10/101 (9%) Query: 83 NEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEEDI----------VINSLDIPEH 132 N+ E + P ER+A G+ +G++F +D VI +IP H Sbjct: 222 NDPEYLASLSMLPTAERKALLYGDWNSFTGQVFTEWRDDPEHYCDRRWTHVIAPFEIPRH 281 Query: 133 WVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQT 173 W + G DFG+ PF+ G + + + + Y C ++ Sbjct: 282 WEIVRGFDFGYTRPFSVGWYAVDTKGCIYRIREYYGCTDKA 322 >gi|167749269|ref|ZP_02421396.1| hypothetical protein EUBSIR_00220 [Eubacterium siraeum DSM 15702] gi|167657762|gb|EDS01892.1| hypothetical protein EUBSIR_00220 [Eubacterium siraeum DSM 15702] Length = 487 Score = 37.0 bits (84), Expect = 2.5, Method: Compositional matrix adjust. 
Identities = 25/101 (24%), Positives = 43/101 (42%), Gaps = 10/101 (9%) Query: 83 NEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEEDI----------VINSLDIPEH 132 N+ E + P ER+A G+ +G++F +D VI +IP H Sbjct: 222 NDPEYLASLSMLPTAERKALLYGDWNSFTGQVFTEWRDDPEHYCDRRWTHVIAPFEIPRH 281 Query: 133 WVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQT 173 W + G DFG+ PF+ G + + + + Y C ++ Sbjct: 282 WEIVRGFDFGYTRPFSVGWYAVDTKGCIYRIREYYGCTDKA 322 >gi|313115194|ref|ZP_07800678.1| conserved hypothetical protein [Faecalibacterium cf. prausnitzii KLE1255] gi|310622472|gb|EFQ05943.1| conserved hypothetical protein [Faecalibacterium cf. prausnitzii KLE1255] Length = 482 Score = 35.8 bits (81), Expect = 5.0, Method: Compositional matrix adjust. Identities = 37/142 (26%), Positives = 55/142 (38%), Gaps = 20/142 (14%) Query: 48 LTPLKGRSPIIEHYL--------SASSSDRQVIRMTINETPHY--NEQERKRIIDSYPLH 97 +TP +PI E Y R I +I + P N+ + S P Sbjct: 175 ITPAPPGTPITEEYTVKLPDGTEQKLQRARVFIPSSIFDNPALLANDPGYLASLASMPEA 234 Query: 98 EREARTKGEPILGSGRIF------PIVEEDI----VINSLDIPEHWVQIGGMDFGWHHPF 147 E++A G SG++F P ED VI IP+HW G DFG+ PF Sbjct: 235 EKQALLYGSWDSFSGQVFTEWRNDPAHYEDQRWTHVIAPFTIPKHWQLYRGFDFGFSKPF 294 Query: 148 AAGHLVWNRDSDVIYVVKNYRC 169 + G + + + + + Y C Sbjct: 295 SVGWYAADEEGRLYRIKELYGC 316 >gi|295102643|emb|CBL00188.1| Terminase-like family [Faecalibacterium prausnitzii L2-6] Length = 464 Score = 35.8 bits (81), Expect = 5.2, Method: Compositional matrix adjust. 
Identities = 50/222 (22%), Positives = 87/222 (39%), Gaps = 50/222 (22%) Query: 48 LTPLKGRSPIIEHY-LSASSSDRQVIR-------MTINETPHYNEQERKRI--IDSYPLH 97 +TP +PI+E Y + +V+R +I + P E + + + + P Sbjct: 157 ITPAPPGTPIVEEYPVRMPDGTEKVLRRARVFIPSSIFDNPALLENDPDYLASLAAMPEA 216 Query: 98 EREARTKGEPILGSGRIF------PIVEEDI----VINSLDIPEHWVQIGGMDFGWHHPF 147 E++A G SG++F P ED VI IP+HW G DFG+ PF Sbjct: 217 EKQALLYGSWDSFSGQVFTEWRNDPNHYEDQRWTHVIAPFTIPKHWKIYRGYDFGFSKPF 276 Query: 148 AAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDK------ 201 + G + + + + + Y C + P++GL+ D Sbjct: 277 SVGWYAADEEGRLYRIKELYGCTGR--------------------PNEGLRIDPVEQARR 316 Query: 202 -RSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDR 242 R EQ R + ++ + + A FD+ I+ M++R Sbjct: 317 IREAEQNDPVLRGRVIQGIADPAIFDESRG---ESIASMMER 355 >gi|224108713|ref|XP_002333355.1| predicted protein [Populus trichocarpa] gi|222836304|gb|EEE74725.1| predicted protein [Populus trichocarpa] Length = 294 Score = 35.4 bits (80), Expect = 6.5, Method: Compositional matrix adjust. Identities = 25/91 (27%), Positives = 42/91 (46%), Gaps = 13/91 (14%) Query: 169 CREQTPIFHVAALKSWGKWLPWA--------WPHDGLQHDKRSGEQLSAQYRRQGMKMLP 220 C +P+FHV A +S G W P WP G + +R+ +A ++ K+LP Sbjct: 67 CSSSSPLFHVLAARSAGGWRPAVKEIGCCCWWPVCGCRCWRRA----AADRLKEMTKVLP 122 Query: 221 -ECATFDDGSNGVEAGISDMLDRMRSGRWKV 250 C + GS +S ++ + GRW++ Sbjct: 123 RRCRVWAAGSLVTVEKLSRLVSGWKKGRWRL 153 Searching..................................................done Results from round 2 >gi|227822449|ref|YP_002826421.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] gi|227341450|gb|ACP25668.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] Length = 454 Score = 442 bits (1136), Expect = e-122, Method: Composition-based stats. 
Identities = 174/251 (69%), Positives = 201/251 (80%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 KAYEQGR KWQ+NTV YVWFDEEPPEDVYFEG+TR NAT+G + +T TPLKG S ++ Sbjct: 146 FKAYEQGRGKWQANTVDYVWFDEEPPEDVYFEGITRTNATRGAIAVTFTPLKGLSNVVAR 205 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 YL S DR+VI MTI + HY +ER+RII+SYP HEREARTKG P LGSGRIFP+ EE Sbjct: 206 YLMEKSPDREVITMTIEDAEHYTPEERQRIIESYPAHEREARTKGVPALGSGRIFPVTEE 265 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + DIP+HWVQIGG+DFGW HPFAA W+RD+DV YV + YR RE TPI H AA Sbjct: 266 AIRVEPFDIPKHWVQIGGLDFGWDHPFAAVGCAWDRDADVFYVTRIYREREATPIIHAAA 325 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LK WG WLP+AWPHDGLQHDK SGEQL+AQYR QG+ +L E ATFDDG+NGVEAG+SDML Sbjct: 326 LKPWGAWLPFAWPHDGLQHDKGSGEQLAAQYRAQGLPLLAERATFDDGTNGVEAGLSDML 385 Query: 241 DRMRSGRWKVF 251 RM++GRWKVF Sbjct: 386 QRMQTGRWKVF 396 >gi|227821702|ref|YP_002825672.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] gi|227340701|gb|ACP24919.1| DNA packaging protein Gp2 [Sinorhizobium fredii NGR234] Length = 416 Score = 440 bits (1131), Expect = e-121, Method: Composition-based stats. 
Identities = 174/251 (69%), Positives = 201/251 (80%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 KAYEQGR KWQ+NTV YVWFDEEPPEDVYFEG+TR NAT+G + +T TPLKG S ++ Sbjct: 108 FKAYEQGRGKWQANTVDYVWFDEEPPEDVYFEGITRTNATRGSIAVTFTPLKGMSTVVAR 167 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 Y+ S DR+VI MTI++ HY +ER+RIIDSYP HEREARTKG P LGSGRIFP+ EE Sbjct: 168 YILEKSPDREVITMTIDDAEHYTPEERQRIIDSYPAHEREARTKGVPSLGSGRIFPVAEE 227 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I I +IP+HWVQIGG+DFGW HPF A W+RD+DV YV K YR RE TPI H AA Sbjct: 228 SITIAPFEIPKHWVQIGGLDFGWDHPFGAAGCAWDRDADVFYVTKVYREREATPIIHAAA 287 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LK WG WLPW+WPHDGLQHDK SGEQL+ QYR QG+ MLPE ATF+DG+NGVEAG+SDML Sbjct: 288 LKPWGAWLPWSWPHDGLQHDKGSGEQLATQYRAQGLNMLPERATFEDGTNGVEAGLSDML 347 Query: 241 DRMRSGRWKVF 251 RM++GRWKVF Sbjct: 348 QRMQTGRWKVF 358 >gi|15965769|ref|NP_386122.1| DNA packaging protein GP2 [Sinorhizobium meliloti 1021] gi|15075038|emb|CAC46595.1| DNA packaging protein GP2 [Sinorhizobium meliloti 1021] Length = 477 Score = 439 bits (1129), Expect = e-121, Method: Composition-based stats. 
Identities = 174/251 (69%), Positives = 201/251 (80%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 KAYEQGR KWQ+NTV YVWFDEEPPEDVYFEG+TR NAT+G + +T TPL+G S ++ Sbjct: 169 FKAYEQGRAKWQANTVDYVWFDEEPPEDVYFEGITRTNATRGAIAVTFTPLRGLSAVVAR 228 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 YL S DR VI MTI + HY QER+R+IDSYP HEREART+G P LGSGRIFP+ EE Sbjct: 229 YLMEKSPDRAVITMTIEDAEHYTPQERQRVIDSYPAHEREARTRGVPALGSGRIFPVTEE 288 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I I+ +IP+HWVQIGG+DFGW HPFAA W+RD+DV +V K YR RE TPI H AA Sbjct: 289 SIRIDPFEIPKHWVQIGGLDFGWDHPFAAVGCAWDRDADVFHVTKIYREREATPIIHAAA 348 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LK WG +PWAWPHDGLQHDK SGEQL+AQYR QG+ +LPE ATFDDG+NGVEAG+SDML Sbjct: 349 LKPWGAAMPWAWPHDGLQHDKGSGEQLAAQYRAQGLALLPERATFDDGTNGVEAGLSDML 408 Query: 241 DRMRSGRWKVF 251 RM+SGRWKVF Sbjct: 409 QRMQSGRWKVF 419 >gi|307315429|ref|ZP_07594994.1| protein of unknown function DUF264 [Sinorhizobium meliloti BL225C] gi|306898808|gb|EFN29464.1| protein of unknown function DUF264 [Sinorhizobium meliloti BL225C] Length = 477 Score = 439 bits (1128), Expect = e-121, Method: Composition-based stats. 
Identities = 174/251 (69%), Positives = 201/251 (80%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 KAYEQGR KWQ+NTV YVWFDEEPPEDVYFEG+TR NAT+G + +T TPL+G S ++ Sbjct: 169 FKAYEQGRAKWQANTVDYVWFDEEPPEDVYFEGITRTNATRGAIAVTFTPLRGLSAVVAR 228 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 YL S DR VI MTI + HY QER+R+IDSYP HEREART+G P LGSGRIFP+ EE Sbjct: 229 YLMEKSPDRAVITMTIEDAEHYTPQERQRVIDSYPAHEREARTRGVPALGSGRIFPVTEE 288 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I I+ +IP+HWVQIGG+DFGW HPFAA W+RD+DV +V K YR RE TPI H AA Sbjct: 289 SIRIDPFEIPKHWVQIGGLDFGWDHPFAAVGCAWDRDADVFHVTKIYREREATPIIHAAA 348 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LK WG +PWAWPHDGLQHDK SGEQL+AQYR QG+ +LPE ATFDDG+NGVEAG+SDML Sbjct: 349 LKPWGAAMPWAWPHDGLQHDKGSGEQLAAQYRAQGLALLPERATFDDGTNGVEAGLSDML 408 Query: 241 DRMRSGRWKVF 251 RM+SGRWKVF Sbjct: 409 QRMQSGRWKVF 419 >gi|307318836|ref|ZP_07598268.1| protein of unknown function DUF264 [Sinorhizobium meliloti AK83] gi|306895557|gb|EFN26311.1| protein of unknown function DUF264 [Sinorhizobium meliloti AK83] Length = 477 Score = 438 bits (1127), Expect = e-121, Method: Composition-based stats. 
Identities = 174/251 (69%), Positives = 201/251 (80%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 KAYEQGR KWQ+NTV YVWFDEEPPEDVYFEG+TR NAT+G + +T TPL+G S ++ Sbjct: 169 FKAYEQGRAKWQANTVDYVWFDEEPPEDVYFEGITRTNATRGAIAVTFTPLRGLSAVVAR 228 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 YL S DR VI MTI + HY QER+R+IDSYP HEREART+G P LGSGRIFP+ EE Sbjct: 229 YLMEKSPDRAVITMTIEDAEHYTPQERQRVIDSYPAHEREARTRGVPALGSGRIFPVTEE 288 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I I+ +IP+HWVQIGG+DFGW HPFAA W+RD+DV +V K YR RE TPI H AA Sbjct: 289 SIRIDPFEIPKHWVQIGGLDFGWDHPFAAVGCAWDRDADVFHVTKIYREREATPIIHAAA 348 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LK WG +PWAWPHDGLQHDK SGEQL+AQYR QG+ +LPE ATFDDG+NGVEAG+SDML Sbjct: 349 LKPWGAAMPWAWPHDGLQHDKGSGEQLAAQYRAQGLALLPERATFDDGTNGVEAGLSDML 408 Query: 241 DRMRSGRWKVF 251 RM+SGRWKVF Sbjct: 409 QRMQSGRWKVF 419 >gi|150397042|ref|YP_001327509.1| hypothetical protein Smed_1839 [Sinorhizobium medicae WSM419] gi|150028557|gb|ABR60674.1| protein of unknown function DUF264 [Sinorhizobium medicae WSM419] Length = 477 Score = 437 bits (1124), Expect = e-121, Method: Composition-based stats. 
Identities = 176/251 (70%), Positives = 201/251 (80%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 KAYEQGR KWQ+NTV YVWFDEEPPEDVYFEG+TR NAT+G + +T TPLKG S ++ Sbjct: 169 FKAYEQGRAKWQANTVDYVWFDEEPPEDVYFEGITRTNATRGSIAVTFTPLKGLSAVVAR 228 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 YL S+DR+V MTI + HY +ER+RIIDSYP HEREARTKG P LGSGRIFP+ EE Sbjct: 229 YLMEKSADREVTTMTIEDAEHYTPEERRRIIDSYPAHEREARTKGVPALGSGRIFPVTEE 288 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + DIP+HWVQIGG+DFGW HPFAA W+RD+DV YV K YR RE TPI H AA Sbjct: 289 SIRADPFDIPKHWVQIGGLDFGWDHPFAAVGCAWDRDADVFYVTKLYRERESTPIIHAAA 348 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LK WG LPWAWPHDGLQHDK SGEQL+AQYR QG+ +LPE ATFDDG+NGVEAG+SDML Sbjct: 349 LKPWGGTLPWAWPHDGLQHDKGSGEQLAAQYRAQGLALLPERATFDDGTNGVEAGLSDML 408 Query: 241 DRMRSGRWKVF 251 RM++GRWKVF Sbjct: 409 QRMQTGRWKVF 419 >gi|323967108|gb|EGB62533.1| terminase [Escherichia coli M863] Length = 499 Score = 431 bits (1109), Expect = e-119, Method: Composition-based stats. 
Identities = 124/251 (49%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++D+D IYV + ++ +E+T + A Sbjct: 302 TIKCQPFECPDHFYVIGGMDFGWDHPQAHVQLWWDKDADTIYVSRVWKAKEKTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 362 VKSWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DMMLDGRFKVF 432 >gi|327251967|gb|EGE63639.1| DNA packaging protein gp2 [Escherichia coli STEC_7v] gi|327254495|gb|EGE66117.1| DNA packaging protein gp2 [Escherichia coli STEC_7v] Length = 499 Score = 431 bits (1108), Expect = e-119, Method: Composition-based stats. 
Identities = 122/251 (48%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++D+D+IY+ + ++ +E+T + A Sbjct: 302 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKDADIIYLSRVWKAKEKTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +K W +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 362 VKPWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DMMLDGRFKVF 432 >gi|293410725|ref|ZP_06654301.1| DNA-packaging protein gp2 [Escherichia coli B354] gi|291471193|gb|EFF13677.1| DNA-packaging protein gp2 [Escherichia coli B354] Length = 499 Score = 431 bits (1108), Expect = e-119, Method: Composition-based stats. 
Identities = 122/251 (48%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++D+D+IY+ + ++ +E+T + A Sbjct: 302 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKDADIIYLSRVWKAKEKTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +K W +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 362 VKPWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DMMLDGRFKVF 432 >gi|331657716|ref|ZP_08358678.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA206] gi|331055964|gb|EGI27973.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA206] Length = 499 Score = 431 bits (1108), Expect = e-119, Method: Composition-based stats. 
Identities = 122/251 (48%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++D+D+IY+ + ++ +E+T + A Sbjct: 302 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKDADIIYLSRVWKAKEKTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +K W +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 362 VKPWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DMMLDGRFKVF 432 >gi|315122536|ref|YP_004063025.1| DNA packaging protein Gp2 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495938|gb|ADR52537.1| DNA packaging protein Gp2 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 455 Score = 430 bits (1107), Expect = e-119, Method: Composition-based stats. 
Identities = 219/251 (87%), Positives = 233/251 (92%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 LKAYEQGR+KWQSNTV YVWFDEEPPEDVYFEGLTRINATQGLV LTLTPLKGRS I+EH Sbjct: 147 LKAYEQGREKWQSNTVDYVWFDEEPPEDVYFEGLTRINATQGLVALTLTPLKGRSNIVEH 206 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 YLS+SS DRQVIRMT+ ETPHY +ER RII+SYPLHEREARTKGEP+LGSGRIFPI+E+ Sbjct: 207 YLSSSSPDRQVIRMTLEETPHYTAKERIRIINSYPLHEREARTKGEPVLGSGRIFPILEQ 266 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 DIVI S DIPEHW QIGGMDFGWHHPFAA L WNRDSDVIYVVKNYRCREQTP+FH A Sbjct: 267 DIVITSFDIPEHWSQIGGMDFGWHHPFAAVQLAWNRDSDVIYVVKNYRCREQTPLFHAAV 326 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LKSWGKWLPWAWPHDGLQHDK SGEQL+ QYR+QGMKMLPECATFDDGSNGVEAG+SD+L Sbjct: 327 LKSWGKWLPWAWPHDGLQHDKGSGEQLAVQYRQQGMKMLPECATFDDGSNGVEAGVSDIL 386 Query: 241 DRMRSGRWKVF 251 DRMRSGRWKVF Sbjct: 387 DRMRSGRWKVF 397 >gi|218549377|ref|YP_002383168.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia fergusonii ATCC 35469] gi|307311077|ref|ZP_07590721.1| protein of unknown function DUF264 [Escherichia coli W] gi|331669066|ref|ZP_08369914.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA271] gi|218356918|emb|CAQ89550.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia fergusonii ATCC 35469] gi|306908583|gb|EFN39080.1| protein of unknown function DUF264 [Escherichia coli W] gi|312945545|gb|ADR26372.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli O83:H1 str. 
NRG 857C] gi|315061655|gb|ADT75982.1| DNA packaging protein gp2 (terminase large subunit) [Escherichia coli W] gi|323377763|gb|ADX50031.1| DNA packaging protein gp2 (terminase large subunit) [Escherichia coli KO11] gi|324117758|gb|EGC11657.1| terminase [Escherichia coli E1167] gi|331064260|gb|EGI36171.1| DNA packaging protein gp2 (Terminase large subunit) [Escherichia coli TA271] Length = 499 Score = 430 bits (1106), Expect = e-119, Method: Composition-based stats. Identities = 124/251 (49%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++D+D IYV + ++ +E+T + A Sbjct: 302 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKDADTIYVSRVWKAKEKTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 362 VKSWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DMMLDGRFKVF 432 >gi|315299781|gb|EFU59021.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 16-3] Length = 499 Score = 430 bits (1105), Expect = e-118, Method: Composition-based stats. 
Identities = 121/251 (48%), Positives = 162/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VW DEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWVDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++D+D+IY+ + ++ +E+T + A Sbjct: 302 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKDADIIYLSRVWKAKEKTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +K W +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 362 VKPWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DMMLDGRFKVF 432 >gi|62178924|ref|YP_215341.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|62126557|gb|AAX64260.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|322713379|gb|EFZ04950.1| gp2-like protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 499 Score = 429 bits (1102), Expect = e-118, Method: Composition-based stats. 
Identities = 124/251 (49%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++D+D IYV + ++ +E+T + A Sbjct: 302 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKDADTIYVSRVWKAKEKTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 362 VKSWAHKVPTAWPHDGNQHEKGGGEQLKGQYAEAGFMMLQEHATWPDGGNAVEPGITELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DMMLDGRFKVF 432 >gi|71274675|ref|ZP_00650963.1| Protein of unknown function DUF264 [Xylella fastidiosa Dixon] gi|71901596|ref|ZP_00683677.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|170730087|ref|YP_001775520.1| putative DNA packaging protein GP2 [Xylella fastidiosa M12] gi|71164407|gb|EAO14121.1| Protein of unknown function DUF264 [Xylella fastidiosa Dixon] gi|71728644|gb|EAO30794.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|167964880|gb|ACA11890.1| putative DNA packaging protein GP2 [Xylella fastidiosa M12] Length = 472 Score = 429 bits (1102), Expect = e-118, Method: Composition-based stats. 
Identities = 153/251 (60%), Positives = 197/251 (78%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 LK+++QGR+KWQ++TVH+VWFDEEPPEDVYFEG+TR N T G V +T TPLKG S ++ Sbjct: 164 LKSFDQGREKWQADTVHWVWFDEEPPEDVYFEGITRTNRTFGPVFMTFTPLKGMSNVVRR 223 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L+ ++DR ++M+I + HY+ +E RI SYP HER+ART+G P LGSGR+FPI +E Sbjct: 224 FLTEDAADRGYVQMSIEDAEHYSAEECARITASYPPHERDARTQGVPALGSGRVFPIAQE 283 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 +I + IP W IGGMDFG+ HPFAA L W+RD+D++YVV YR RE TPI H AA Sbjct: 284 EISVAPFAIPAQWALIGGMDFGYDHPFAAVKLAWDRDADILYVVCAYRKRESTPIIHAAA 343 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LK WG LPWAWPHDGLQHDK SG+QL+ QYR+QG+ MLP+ ATF+DG+NG+EAG+++ML Sbjct: 344 LKPWGVTLPWAWPHDGLQHDKGSGDQLAEQYRQQGLAMLPQRATFEDGTNGLEAGVTEML 403 Query: 241 DRMRSGRWKVF 251 DRM +GR KVF Sbjct: 404 DRMHTGRLKVF 414 >gi|318065950|ref|YP_004123808.1| Gp2 [Salmonella phage ST160] gi|289066936|gb|ADC81147.1| Gp2 [Salmonella phage ST160] Length = 517 Score = 428 bits (1101), Expect = e-118, Method: Composition-based stats. 
Identities = 122/251 (48%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 200 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 259 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 260 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 319 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 320 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 379 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLP+ ATF DG N VE+GIS++ Sbjct: 380 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPDHATFPDGGNSVESGISELR 439 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 440 DLMLEGRFKVF 450 >gi|281599695|gb|ADA72679.1| Gp2-like protein [Shigella flexneri 2002017] Length = 441 Score = 428 bits (1100), Expect = e-118, Method: Composition-based stats. 
Identities = 124/251 (49%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 124 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 183 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 184 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 243 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++D+D IYV + ++ +E+T + A Sbjct: 244 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKDADTIYVSRVWKAKEKTAVQAWGA 303 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 304 VKSWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 363 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 364 DMMLDGRFKVF 374 >gi|110804280|ref|YP_687800.1| putative terminase large subunit [Shigella flexneri 5 str. 8401] gi|110613828|gb|ABF02495.1| putative terminase large subunit [Shigella flexneri 5 str. 8401] Length = 354 Score = 428 bits (1100), Expect = e-118, Method: Composition-based stats. 
Identities = 123/251 (49%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 37 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 96 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 97 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 156 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++++D IYV + ++ +E+T + A Sbjct: 157 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKEADTIYVSRVWKAKEKTAVQAWGA 216 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 217 VKSWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 276 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 277 DMMLDGRFKVF 287 >gi|281599578|gb|ADA72562.1| putative terminase large subunit [Shigella flexneri 2002017] Length = 351 Score = 427 bits (1099), Expect = e-118, Method: Composition-based stats. 
Identities = 123/251 (49%), Positives = 163/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 34 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 93 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 94 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 153 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ IGGMDFGW HP A L W++++D IYV + ++ +E+T + A Sbjct: 154 TIKCQPFECPDHFYVIGGMDFGWDHPQAQVQLWWDKEADTIYVSRVWKAKEKTAVQAWGA 213 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G ML E AT+ DG N VE GI+++ Sbjct: 214 VKSWAHKVPTAWPHDGNQHEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELR 273 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 274 DMMLDGRFKVF 284 >gi|24371583|ref|NP_720326.1| gp2 [Enterobacteria phage ST64T] gi|24250810|gb|AAL15523.1| gp2 [Salmonella phage ST64T] Length = 517 Score = 427 bits (1099), Expect = e-118, Method: Composition-based stats. 
Identities = 122/251 (48%), Positives = 160/251 (63%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 200 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 259 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 260 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 319 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 320 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 379 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLPE ATF DG N VE+GI ++ Sbjct: 380 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPEHATFPDGGNSVESGIGELR 439 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 440 DLMLEGRFKVF 450 >gi|238912312|ref|ZP_04656149.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] gi|261245593|emb|CBG23388.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] Length = 499 Score = 427 bits (1098), Expect = e-118, Method: Composition-based stats. 
Identities = 123/251 (49%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLPE ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPEHATFPDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|221328620|ref|YP_002533461.1| Terminase, large subunit [Salmonella phage epsilon34] gi|255252684|ref|YP_003090219.1| Terminase, large subunit [Salmonella phage c341] gi|193244688|gb|ACF16628.1| Terminase, large subunit [Salmonella phage epsilon34] gi|223697657|gb|ACN18281.1| Terminase, large subunit [Salmonella phage g341c] Length = 499 Score = 427 bits (1098), Expect = e-118, Method: Composition-based stats. 
Identities = 123/251 (49%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLPE ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPEHATFPDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|197363441|ref|YP_002143078.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|197094918|emb|CAR60455.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|320086843|emb|CBY96615.1| DNA packaging protein gp2 Terminase large subunit [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] Length = 499 Score = 427 bits (1097), Expect = e-118, Method: Composition-based stats. 
Identities = 123/251 (49%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLPE ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPEHATFPDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|168240109|ref|ZP_02665041.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] gi|194451817|ref|YP_002044341.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|194410121|gb|ACF70340.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|205340165|gb|EDZ26929.1| DNA packaging protein gp2 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] Length = 499 Score = 427 bits (1097), Expect = e-118, Method: Composition-based stats. 
Identities = 122/251 (48%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLP+ ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPDHATFPDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|51236724|ref|YP_063734.1| terminase large subunit [Enterobacteria phage P22] gi|137879|sp|P26745|TERL_BPP22 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging protein gp2; AltName: Full=Terminase large subunit gi|21914414|gb|AAM81379.1|AF527608_1 terminase large subunit [Salmonella phage P22-pbi] gi|553005|gb|AAA72959.1| DNA pacaging [Enterobacteria phage P22] gi|8439622|gb|AAF75044.1| terminase large subunit [Enterobacteria phage P22] gi|28394263|tpg|DAA00977.1| TPA_inf: terminase large subunit [Enterobacteria phage P22] Length = 499 Score = 427 bits (1097), Expect = e-118, Method: Composition-based stats. 
Identities = 122/251 (48%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLP+ ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPDHATFPDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|161504537|ref|YP_001571649.1| hypothetical protein SARI_02650 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- str. RSK2980] gi|160865884|gb|ABX22507.1| hypothetical protein SARI_02650 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:--] Length = 499 Score = 427 bits (1097), Expect = e-118, Method: Composition-based stats. 
Identities = 122/251 (48%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLP+ ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPDHATFPDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|219681243|ref|YP_002455888.1| Gp2 [Salmonella enterica bacteriophage SE1] gi|66473858|gb|AAY46504.1| Gp2 [Salmonella phage SE1] Length = 499 Score = 425 bits (1094), Expect = e-117, Method: Composition-based stats. 
Identities = 122/251 (48%), Positives = 160/251 (63%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRAAWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLP+ ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPDHATFPDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|198245578|ref|YP_002214540.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|197940094|gb|ACH77427.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] Length = 499 Score = 425 bits (1094), Expect = e-117, Method: Composition-based stats. 
Identities = 123/251 (49%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 AIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLPE ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPEHATFSDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|60476789|gb|AAX21426.1| gp2 [Enterobacteria phage L] Length = 499 Score = 425 bits (1094), Expect = e-117, Method: Composition-based stats. 
Identities = 122/251 (48%), Positives = 160/251 (63%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLPE ATF DG N VE+GI ++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPEHATFPDGGNSVESGIGELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|157734711|dbj|BAF80717.1| terminase large subunit [Enterobacteria phage P22] gi|169658843|dbj|BAG12600.1| terminase large subunit [Enterobacteria phage P22] Length = 499 Score = 425 bits (1094), Expect = e-117, Method: Composition-based stats. 
Identities = 123/251 (49%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 AIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLPE ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPEHATFSDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DLMLEGRFKVF 432 >gi|71897556|ref|ZP_00679801.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] gi|71732459|gb|EAO34512.1| Protein of unknown function DUF264 [Xylella fastidiosa Ann-1] Length = 471 Score = 425 bits (1092), Expect = e-117, Method: Composition-based stats. 
Identities = 155/251 (61%), Positives = 196/251 (78%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 LK+++QGR+KWQ++TV +VWFDEEPPEDVYFEG+TR N T G V +T TPLKG S ++ Sbjct: 163 LKSFDQGREKWQADTVDWVWFDEEPPEDVYFEGITRTNRTFGPVFMTFTPLKGMSSVVRR 222 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + DR +++MTI++ HY+ ++R RII SYP HEREARTKG P LGSGR+FPI E+ Sbjct: 223 FLLEQAPDRGLVQMTIDDAEHYSPEDRARIIASYPAHEREARTKGTPSLGSGRVFPIAED 282 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I I IPE W IGGMDFG+ HPFAA + W+R++DVIYV+ YR RE TP+ H AA Sbjct: 283 SIAIAPFSIPEEWALIGGMDFGYDHPFAAVKMAWDREADVIYVMCAYRQREATPVIHTAA 342 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 L+ WG LPWAWPHDGLQHDK SGEQL+ QYR+QG+ ML + ATF DG+NG+EAG+++ML Sbjct: 343 LRPWGAHLPWAWPHDGLQHDKGSGEQLAEQYRQQGLSMLGQRATFTDGTNGLEAGVTEML 402 Query: 241 DRMRSGRWKVF 251 DRM +GR KVF Sbjct: 403 DRMHTGRLKVF 413 >gi|326622293|gb|EGE28638.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 482 Score = 425 bits (1092), Expect = e-117, Method: Composition-based stats. 
Identities = 123/251 (49%), Positives = 161/251 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 165 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 224 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 225 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 284 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 285 AIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 344 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLPE ATF DG N VE+GIS++ Sbjct: 345 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPEHATFSDGGNSVESGISELR 404 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 405 DLMLEGRFKVF 415 >gi|46358697|ref|YP_006405.1| Gp2 [Enterobacteria phage ST104] gi|46357933|dbj|BAD15212.1| Gp2 [Enterobacteria phage ST104] gi|312911340|dbj|BAJ35314.1| putative terminase large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] Length = 499 Score = 424 bits (1090), Expect = e-117, Method: Composition-based stats. 
Identities = 121/251 (48%), Positives = 160/251 (63%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 302 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLP+ ATF DG N VE+GIS++ Sbjct: 362 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPDHATFPDGGNSVESGISELR 421 Query: 241 DRMRSGRWKVF 251 D M GR+K F Sbjct: 422 DLMLEGRFKAF 432 >gi|273810450|ref|YP_003344921.1| TerL [Xylella phage Xfas53] gi|257097825|gb|ACV41131.1| TerL [Xylella phage Xfas53] Length = 470 Score = 424 bits (1090), Expect = e-117, Method: Composition-based stats. 
Identities = 155/251 (61%), Positives = 196/251 (78%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 LK+++QGR+KWQ++TV +VWFDEEPPEDVYFEG+TR N T G V +T TPLKG S ++ Sbjct: 162 LKSFDQGREKWQADTVDWVWFDEEPPEDVYFEGITRTNRTFGPVFMTFTPLKGMSSVVRR 221 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + DR +++MTI++ HY+ ++R RII SYP HEREARTKG P LGSGR+FPI E+ Sbjct: 222 FLLEQAPDRGLVQMTIDDAEHYSPEDRARIIASYPAHEREARTKGTPSLGSGRVFPIAED 281 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I I IPE W IGGMDFG+ HPFAA + W+R++DVIYV+ YR RE TP+ H AA Sbjct: 282 SIAIAPFSIPEEWALIGGMDFGYDHPFAAVKMAWDREADVIYVMCAYRQREATPVIHTAA 341 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 L+ WG LPWAWPHDGLQHDK SGEQL+ QYR+QG+ ML + ATF DG+NG+EAG+++ML Sbjct: 342 LRPWGAHLPWAWPHDGLQHDKGSGEQLAEQYRQQGLSMLGQRATFTDGTNGLEAGVTEML 401 Query: 241 DRMRSGRWKVF 251 DRM +GR KVF Sbjct: 402 DRMHTGRLKVF 412 >gi|300920006|ref|ZP_07136465.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 115-1] gi|300412953|gb|EFJ96263.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 115-1] Length = 498 Score = 420 bits (1080), Expect = e-116, Method: Composition-based stats. 
Identities = 119/251 (47%), Positives = 160/251 (63%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K Y QGR +WQ +T+H VWFDEEPP +Y EGLTR N LT TPL G S ++ Sbjct: 182 FKPYSQGRARWQGDTIHGVWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTK 241 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 242 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 301 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I MDFGW HP A L W++D DVIY+ + ++ +++ +A Sbjct: 302 TIKCQPFECPDHFYVINAMDFGWDHPQAHIQLWWDKDEDVIYLSRVWKAKQKKATEAWSA 361 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +K+W K P AWPHDG QH+K G QL QY G MLP+ AT+ DG N VE GI+++ Sbjct: 362 VKAWSKNTPTAWPHDGHQHEKGGGAQLKEQYADAGFDMLPDHATWPDGGNAVEPGIAEIR 421 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 422 DMMLDGRFKVF 432 >gi|24111660|ref|NP_706170.1| putative terminase large subunit [Shigella flexneri 2a str. 301] gi|24050435|gb|AAN41877.1| putative terminase large subunit [Shigella flexneri 2a str. 301] gi|313646707|gb|EFS11166.1| DNA packaging gp2 domain protein [Shigella flexneri 2a str. 2457T] Length = 300 Score = 394 bits (1012), Expect = e-108, Method: Composition-based stats. 
Identities = 113/233 (48%), Positives = 151/233 (64%) Query: 19 VWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINE 78 +WFDEEPP +Y EGLTR N LT TPL G S ++ +L S ++V+ MTI + Sbjct: 1 MWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTKFLKNPSKSQKVVNMTIYD 60 Query: 79 TPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGG 138 HY ++++++II SYP HEREAR +G P +GSGRIF I EE I + P+H+ IGG Sbjct: 61 AEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEETIKCQPFECPDHFYVIGG 120 Query: 139 MDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQ 198 MDFGW HP A L W++++D IYV + ++ +E+T + A+KSW +P AWPHDG Q Sbjct: 121 MDFGWDHPQAQVQLWWDKEADTIYVSRVWKAKEKTAVQAWGAVKSWAHKVPTAWPHDGNQ 180 Query: 199 HDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSGRWKVF 251 H+K GEQL QY G ML E AT+ DG N VE GI+++ D M GR+KVF Sbjct: 181 HEKGGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELRDMMLDGRFKVF 233 >gi|137993|sp|P16938|VG2_BPLP7 RecName: Full=Protein GP2 gi|75884|pir||Z2BPL7 gene 2 protein - phage LP-7 (fragment) gi|553003|gb|AAA88220.1| packaging glycoprotein [Enterobacteria phage LP7] Length = 475 Score = 386 bits (992), Expect = e-105, Method: Composition-based stats. 
Identities = 109/251 (43%), Positives = 148/251 (58%), Gaps = 1/251 (0%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K + R +T+ + P +Y EGLTR N LT TPL G S + Sbjct: 179 FKHTRRHRHA-AGDTITAYGLTKRLPYSIYAEGLTRTNKYGQFSILTFTPLMGMSDGVTK 237 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GSGRIF I EE Sbjct: 238 FLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEE 297 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E T + A Sbjct: 298 TIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGA 357 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +KSW +P AWPHDG QH+K GEQL QY G MLP+ ATF DG N VE+GIS++ Sbjct: 358 VKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPDHATFPDGGNSVESGISELR 417 Query: 241 DRMRSGRWKVF 251 D M GR+KVF Sbjct: 418 DLMLEGRFKVF 428 >gi|167041080|gb|ABZ05841.1| hypothetical protein ALOHA_HF400048F7ctg1g8 [uncultured marine microorganism HF4000_48F7] Length = 504 Score = 369 bits (948), Expect = e-100, Method: Composition-based stats. 
Identities = 99/251 (39%), Positives = 155/251 (61%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K+Y+ G W V YVW DEEPP+++Y + L + G V+LT TP G + ++ Sbjct: 173 FKSYDAGPASWMGVAVDYVWLDEEPPQEIYSQALRATLKSGGPVSLTFTPEAGVTGVVAM 232 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L+ + +++ T ++ PH + + R+ I+ + P HER R+KG P LGSG++FP+ E+ Sbjct: 233 FLNERKGGQALVQATWDDAPHLSLEVREEILAALPPHERLMRSKGIPTLGSGQVFPVPED 292 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I++++ IP+H+ +I G+DFG+ HP A + +RD+DV+Y+ YR + + H A Sbjct: 293 QIMVSAFAIPDHFSRIAGIDFGFDHPTACVWMAHDRDTDVVYLYDAYREKGSGMLQHAEA 352 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 +K G ++P AWPHDG HDK SGE L+ QYRR G+ L T +G VE G+ +L Sbjct: 353 IKHRGGFIPVAWPHDGSIHDKGSGEALATQYRRSGVNFLGSHFTNPEGGIAVEPGLMALL 412 Query: 241 DRMRSGRWKVF 251 RM++GR+KVF Sbjct: 413 TRMQTGRFKVF 423 >gi|94317806|gb|ABF15069.1| terminase large subunit Gp2 [Salmonella enterica subsp. enterica serovar Typhimurium] Length = 278 Score = 356 bits (915), Expect = 1e-96, Method: Composition-based stats. 
Identities = 99/215 (46%), Positives = 134/215 (62%) Query: 37 INATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPL 96 N LT TPL G S ++ +L S ++V+ MTI + HY ++++++II SYP Sbjct: 1 TNKYGQFSILTFTPLMGMSDVVTKFLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPE 60 Query: 97 HEREARTKGEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNR 156 HEREAR +G P +GSGRIF I EE I + P+H+ I DFGW+HP A L W++ Sbjct: 61 HEREARARGIPTMGSGRIFQIPEETIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDK 120 Query: 157 DSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGM 216 D+DV Y+ + ++ E T + A+KSW +P AWPHDG QH+K GEQL QY G Sbjct: 121 DADVFYLARVWKKSENTAVQAWGAVKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGF 180 Query: 217 KMLPECATFDDGSNGVEAGISDMLDRMRSGRWKVF 251 MLP+ ATF DG N VE+GIS++ D M GR+KVF Sbjct: 181 SMLPDHATFPDGGNSVESGISELRDLMLEGRFKVF 215 >gi|27476053|ref|NP_775255.1| terminase [Pseudomonas phage PaP3] gi|27414483|gb|AAL85569.1| terminase [Pseudomonas phage PaP3] Length = 482 Score = 340 bits (872), Expect = 1e-91, Method: Composition-based stats. Identities = 81/260 (31%), Positives = 131/260 (50%), Gaps = 9/260 (3%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K+YE +DK+ + +W DEE P+D+Y + +TR T G+V LT TP G + I++ Sbjct: 154 FKSYEMSQDKFMGTAIDVIWLDEECPKDIYTQCVTRTATTGGIVYLTFTPEHGLTEIVKD 213 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + +I + + PH + + +++++ Y ER R +G P+LGSG +FPI+EE Sbjct: 214 FLQDLKPGQFLIHASWEDAPHLSPEVKEQLLSVYSPAERRMRAEGIPMLGSGVVFPILEE 273 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 V DIP+H+ +I G+D G+ HP A + W+ + D Y+ +T H A Sbjct: 274 KFVCEPFDIPDHFHRIIGIDLGFDHPNAIACVAWDAEKDKYYLYDERSESGETLGMHADA 333 Query: 181 LK-SWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDD--------GSNG 231 + G +P PHD +HD + + + + F + G N Sbjct: 334 IYLKGGHQIPVVVPHDAFKHDGATSGRRFVDLLKDDHNLNVVYEPFSNPPGPDGKHGGNS 393 Query: 232 VEAGISDMLDRMRSGRWKVF 251 VE G++ ML RM +G KVF Sbjct: 394 VEFGVNWMLTRMENGDLKVF 413 >gi|89885991|ref|YP_516188.1| phage terminase large 
subunit [Sodalis phage phiSG1] gi|89191726|dbj|BAE80473.1| phage terminase large subunit [Sodalis phage phiSG1] gi|125470018|gb|ABN42210.1| gp02 [Sodalis phage phiSG1] Length = 475 Score = 340 bits (871), Expect = 2e-91, Method: Composition-based stats. Identities = 90/256 (35%), Positives = 141/256 (55%), Gaps = 10/256 (3%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPE-DVYFEGLTRINA----TQGLVTLTLTPLKGRS 55 +Y QG+ + V + DEEP + +Y + LTR G LT TP GR+ Sbjct: 160 FWSYSQGQHALMGDCVDWFHIDEEPRDPTIYPQVLTRTATGDRGKGGRGILTFTPENGRT 219 Query: 56 PIIEHYLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIF 115 ++ ++ S + I + ++ PH +++ + ++ S+P H+R+ RTKG P+LG GRI+ Sbjct: 220 DLVIGFMDNPSPAQTCINVGWDDAPHLSQKVKNDLLASFPAHQRDMRTKGIPMLGHGRIY 279 Query: 116 PIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPI 175 + E+ I + +P HW+ I GMDFGW HP A LVW+ ++++ YV + Y+ R+ +P Sbjct: 280 DLGEDFITCDPFPVPAHWLVIDGMDFGWDHPQAHIQLVWDNENEMFYVTRAYKARQVSPA 339 Query: 176 FHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAG 235 +A+ W + +P AWP DGL +K SG Q Y G ML + A + DGS VE Sbjct: 340 EAYSAVSIWAENVPTAWPSDGLMTEKGSGIQQKTYYDDAGFCMLRDPAQWPDGSRSVE-- 397 Query: 236 ISDMLDRMRSGRWKVF 251 + D MR G++KVF Sbjct: 398 ---LHDLMRRGKFKVF 410 >gi|167600439|ref|YP_001671939.1| terminase large subunit [Pseudomonas phage LUZ24] gi|161168302|emb|CAP45467.1| terminase large subunit [Pseudomonas phage LUZ24] Length = 482 Score = 337 bits (865), Expect = 6e-91, Method: Composition-based stats. 
Identities = 79/260 (30%), Positives = 130/260 (50%), Gaps = 9/260 (3%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K+YE +DK+ + +W DEE P+D+Y + +TR T G+V LT TP G + I++ Sbjct: 154 FKSYEMSQDKFMGTAIDVIWLDEECPKDIYTQCVTRTATTGGIVYLTFTPEHGLTEIVKD 213 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + ++ + + PH + + +++++ Y ER R +G P+LGSG +FPI+EE Sbjct: 214 FLQDLKPGQFLVHASWEDAPHLSPEVKEQLLSVYSPAERRMRAEGVPMLGSGVVFPILEE 273 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 V IP+H+ +I G+D G+ HP A + W+ + D Y+ +T H A Sbjct: 274 KFVCEPFQIPDHFHRIIGIDLGFDHPNAIACVAWDPEKDKYYLYDERSESGETLGMHADA 333 Query: 181 LK-SWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDD--------GSNG 231 + G +P PHD +HD + + + + F + G N Sbjct: 334 IYLKGGHQIPVVVPHDAFKHDGATSGRRFVDLLKDDHNLNVVYEPFSNPPGPDGKHGGNS 393 Query: 232 VEAGISDMLDRMRSGRWKVF 251 VE G++ ML RM +G KVF Sbjct: 394 VEFGVNWMLTRMENGDLKVF 413 >gi|167583562|ref|YP_001671752.1| terminase large subunit [Enterobacteria phage phiEco32] gi|164375400|gb|ABY52808.1| terminase large subunit [Enterobacteria phage phiEco32] Length = 513 Score = 331 bits (850), Expect = 4e-89, Method: Composition-based stats. 
Identities = 81/268 (30%), Positives = 139/268 (51%), Gaps = 17/268 (6%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPED---VYFEGLTRINATQGLVTLTLTPLKGRSPI 57 ++ +QG TV Y+W DEE P + ++ + +TR T+GLVT+T TP G + + Sbjct: 177 FRSTQQGEHTLMGATVDYIWLDEEDPYESMAIFAQCVTRTLTTKGLVTITATPENGLTEL 236 Query: 58 IEHYLSASSSDRQVI----RMTINET-----PHYNEQERKRIIDSYPLHEREARTKGEPI 108 ++ ++ + + + H +Q+ K + + P + E R+KG P+ Sbjct: 237 VDKFMKGEGDESTGSLYFQNASWWDAHVDLGGHITDQDIKDMTEGIPAWQLEMRSKGMPL 296 Query: 109 LGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYR 168 LGSG I+ + ++ I +IP+ W ++ +D G HP AA ++ ++D IYV +Y+ Sbjct: 297 LGSGLIYDVSDDTIKCEPFEIPDTWKRVCAIDIGIDHPTAAVWTAYDANTDTIYVYDSYK 356 Query: 169 CREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDG 228 TP++H A+ G+W+P PHD +K SG ++ Y+ G+ + E G Sbjct: 357 EGGFTPVYHAPAINGRGQWIPVILPHDADNTEKGSGSSVAQFYKNAGVNVQSETFYNKIG 416 Query: 229 SNG-----VEAGISDMLDRMRSGRWKVF 251 +G VE GI+D+ +RM SGR+KVF Sbjct: 417 MDGKKNFFVEPGITDIRERMMSGRFKVF 444 >gi|321225021|gb|EFX50082.1| Phage terminase, large subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] Length = 267 Score = 331 bits (849), Expect = 6e-89, Method: Composition-based stats. 
Identities = 93/200 (46%), Positives = 128/200 (64%) Query: 52 KGRSPIIEHYLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGS 111 G S ++ +L S ++V+ MTI + HY ++++++II SYP HEREAR +G P +GS Sbjct: 1 MGMSDVVTKFLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIASYPEHEREARARGIPTMGS 60 Query: 112 GRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCRE 171 GRIF I EE I + P+H+ I DFGW+HP A L W++D+DV Y+ + ++ E Sbjct: 61 GRIFQIPEETIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSE 120 Query: 172 QTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNG 231 T + A+KSW +P AWPHDG QH+K GEQL QY G MLP+ ATF DG N Sbjct: 121 NTAVQAWGAVKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPDHATFPDGGNS 180 Query: 232 VEAGISDMLDRMRSGRWKVF 251 VE+GIS++ D M GR+KVF Sbjct: 181 VESGISELRDLMLEGRFKVF 200 >gi|49146380|ref|YP_025488.1| putative phage DNA packaging protein Gp2 [Caedibacter taeniospiralis] gi|40458348|gb|AAR87096.1| putative phage DNA packaging protein Gp2 [Caedibacter taeniospiralis] Length = 474 Score = 313 bits (802), Expect = 2e-83, Method: Composition-based stats. 
Identities = 94/285 (32%), Positives = 145/285 (50%), Gaps = 40/285 (14%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQ----GLVTLTLTPLKGRSP 56 K+YEQGR K+Q+ + V DEEPP D+Y E L R +T+ G+V LT+TPL G + Sbjct: 133 FKSYEQGRKKFQAAKLDLVHLDEEPPRDIYVESLMRTMSTEVDNEGIVLLTMTPLLGLTD 192 Query: 57 IIEHYLSAS-----------------------------SSDRQVIRMTINETPHYNEQER 87 +I + + ++R I+ + ++ PH + + Sbjct: 193 LILEFQETTIEREVINGSGVSEMSEMTEEVIKVDEGSIVNNRFYIQASWDDNPHLLDSAK 252 Query: 88 KRIIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPF 147 + + S HE+EAR G P LGSG ++P+ E +V+N IP+HW ++ G+DFGW +P Sbjct: 253 ETLSKSLKPHEKEARKHGIPSLGSGLVYPVSEVAVVVNPFVIPKHWGRVFGLDFGWINPT 312 Query: 148 AAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWG-KWLPWAWPHDGLQHDKRSGEQ 206 AA V +RD+DV+Y+ Y E+TP HV LK G + + G Q +R G Sbjct: 313 AALFAVIDRDNDVMYLTGEYYVSERTPQQHVYELKKLGADKINGVYDPAGEQSSQRDGGD 372 Query: 207 LSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSGRWKVF 251 L+ YR G++ L + N E GI +L R ++G+ K+F Sbjct: 373 LAQLYRDSGLRYLYK------ADNAKEEGIMKVLQRFQNGKLKIF 411 >gi|30061788|ref|NP_835959.1| putative terminase large subunit [Shigella flexneri 2a str. 2457T] gi|30040030|gb|AAP15764.1| putative terminase large subunit [Shigella flexneri 2a str. 2457T] Length = 186 Score = 309 bits (793), Expect = 2e-82, Method: Composition-based stats. 
Identities = 87/184 (47%), Positives = 120/184 (65%) Query: 19 VWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINE 78 +WFDEEPP +Y EGLTR N LT TPL G S ++ +L S ++V+ MTI + Sbjct: 1 MWFDEEPPYSIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTKFLKNPSKSQKVVNMTIYD 60 Query: 79 TPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGG 138 HY ++++++II SYP HEREAR +G P +GSGRIF I EE I + P+H+ IGG Sbjct: 61 AEHYTDEQKEQIIASYPEHEREARARGIPTMGSGRIFQIPEETIKCQPFECPDHFYVIGG 120 Query: 139 MDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQ 198 MDFGW HP A L W++++D IYV + ++ +E+T + A+KSW +P AWPHDG Q Sbjct: 121 MDFGWDHPQAQVQLWWDKEADTIYVSRVWKAKEKTAVQAWGAVKSWAHKVPTAWPHDGNQ 180 Query: 199 HDKR 202 H+K Sbjct: 181 HEKG 184 >gi|148557334|ref|YP_001264916.1| hypothetical protein Swit_4440 [Sphingomonas wittichii RW1] gi|148502524|gb|ABQ70778.1| hypothetical protein Swit_4440 [Sphingomonas wittichii RW1] Length = 276 Score = 304 bits (778), Expect = 9e-81, Method: Composition-based stats. Identities = 116/191 (60%), Positives = 142/191 (74%) Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 YL + R V RMTI++ HY+ ER I+ SYP HER+AR +G P+LGSGR+FP+ E+ Sbjct: 24 YLWETPMTRHVTRMTIDDAEHYSPAERAAIVASYPAHERKARAEGIPMLGSGRVFPVDED 83 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I I + ++P W QIGG+DFGW HP AA L W+RD+D IYV +Y RE TPI H AA Sbjct: 84 VIKIRAFEVPAGWTQIGGIDFGWDHPTAAVRLAWDRDADCIYVTASYGVREATPILHAAA 143 Query: 181 LKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LK WG WLPWAWPHDGLQHDK SG L+ QYR QG+ +LPE A+F++G NGVEAGI++ML Sbjct: 144 LKPWGNWLPWAWPHDGLQHDKGSGAALAQQYRDQGLSLLPEKASFEEGGNGVEAGIAEML 203 Query: 241 DRMRSGRWKVF 251 DRM SGRWKVF Sbjct: 204 DRMLSGRWKVF 214 >gi|158422463|ref|YP_001523755.1| putative DNA packaging protein GP2 [Azorhizobium caulinodans ORS 571] gi|158329352|dbj|BAF86837.1| putative DNA packaging protein GP2 [Azorhizobium caulinodans ORS 571] Length = 251 Score = 298 bits (762), Expect = 7e-79, Method: Composition-based 
stats. Identities = 100/185 (54%), Positives = 136/185 (73%) Query: 67 SDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINS 126 R I+ T ++ PH +Q++ + + P H R+ARTKG P+LGSGR+FPI EE I+ ++ Sbjct: 1 MSRFCIQATWDDVPHLTQQQKDELWAAIPAHMRDARTKGIPVLGSGRVFPIAEELILCDA 60 Query: 127 LDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGK 186 + IP+HW +I G+DFGW HPF A + W+RD+DVIYVV YR RE++ + H AA++ WGK Sbjct: 61 MPIPKHWARINGLDFGWDHPFGAVSIAWDRDADVIYVVNTYRAREESSVIHAAAIRPWGK 120 Query: 187 WLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSG 246 +P AWPHDG QHDK SG+QL+ QYR G++ML E AT + G NGVEAGI +ML+RM++G Sbjct: 121 KIPCAWPHDGFQHDKGSGQQLAEQYRDHGLEMLDEHATHEQGGNGVEAGIMEMLERMQTG 180 Query: 247 RWKVF 251 R KVF Sbjct: 181 RLKVF 185 >gi|264678784|ref|YP_003278691.1| phage DNA packaging protein Gp2 [Comamonas testosteroni CNB-2] gi|262209297|gb|ACY33395.1| putative phage DNA packaging protein Gp2 [Comamonas testosteroni CNB-2] Length = 434 Score = 264 bits (676), Expect = 5e-69, Method: Composition-based stats. 
Identities = 72/221 (32%), Positives = 118/221 (53%), Gaps = 12/221 (5%) Query: 35 TRINATQGLVTLTLTPLKGRSPIIEHYLSASSSD----RQVIRMTINETPHYNEQERKRI 90 R+ +G+ LT TPL G + +++ S R V++ ++ PH E+ + ++ Sbjct: 1 MRLMTREGISMLTFTPLSGLTALVQQLTSPDPEGKVIGRAVVQCGWDDVPHLTEEAKAKL 60 Query: 91 IDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAG 150 + H+R+ARTKG P LG+G I+P+ E DIV+ +P+ W + GMD GW+ A Sbjct: 61 LSRLMPHQRDARTKGVPALGAGAIYPVPESDIVVPDFQLPDFWPRAYGMDVGWNRTSA-V 119 Query: 151 HLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQ 210 +RDSD++Y+ N+ + P H ++K+ G W+P A ++ GEQL Sbjct: 120 WGAHDRDSDIVYLYSNHYRGQAEPSVHATSIKARGDWIPGAIDPASRGRSQKDGEQLLQN 179 Query: 211 YRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSGRWKVF 251 Y G++++ +NGVEAGI + +RM +GR KVF Sbjct: 180 YVDLGLQLV-------TANNGVEAGIYQVWERMSTGRLKVF 213 >gi|71898835|ref|ZP_00681003.1| phage-related protein [Xylella fastidiosa Ann-1] gi|71731421|gb|EAO33484.1| phage-related protein [Xylella fastidiosa Ann-1] Length = 291 Score = 211 bits (536), Expect = 1e-52, Method: Composition-based stats. Identities = 62/126 (49%), Positives = 87/126 (69%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 LK++EQG +KWQ++ V+++WFDE+PPEDVYFEG+TR N T GLV +T PLK ++ Sbjct: 41 LKSFEQGGEKWQADPVNWMWFDEQPPEDVYFEGITRTNRTFGLVCMTFAPLKSILTVVWR 100 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + DR ++M I + HY ++ RI SYP +ER+ART+G P LGSGR+FPI E Sbjct: 101 FLLENVPDRADVQMIIEDAEHYLLEDCARITASYPPYERQARTQGVPALGSGRLFPIAGE 160 Query: 121 DIVINS 126 I + Sbjct: 161 KIGVAP 166 >gi|28198423|ref|NP_778737.1| hypothetical protein PD0512 [Xylella fastidiosa Temecula1] gi|28056507|gb|AAO28386.1| phage-related protein [Xylella fastidiosa Temecula1] Length = 257 Score = 210 bits (534), Expect = 2e-52, Method: Composition-based stats. 
Identities = 61/126 (48%), Positives = 85/126 (67%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 LK++EQG +KWQ++ V ++WFDE+PPEDVYFEG+ R N T GLV +T PLK ++ Sbjct: 7 LKSFEQGGEKWQADPVDWIWFDEQPPEDVYFEGIIRTNRTFGLVCMTFAPLKSILTVVWR 66 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + DR ++M I + HY ++ RI SYP +ER+ART+G P LGSGR+FPI E Sbjct: 67 FLLENVPDRADVQMIIEDAEHYLLEDCARITASYPPYERQARTQGVPALGSGRLFPIAGE 126 Query: 121 DIVINS 126 I + Sbjct: 127 KIGVAP 132 >gi|307579537|gb|ADN63506.1| bacteriophage terminase large (ATPase) subunit and inactivated derivatives-like protein [Xylella fastidiosa subsp. fastidiosa GB514] Length = 278 Score = 209 bits (531), Expect = 4e-52, Method: Composition-based stats. Identities = 61/126 (48%), Positives = 85/126 (67%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 LK++EQG +KWQ++ V ++WFDE+PPEDVYFEG+ R N T GLV +T PLK ++ Sbjct: 28 LKSFEQGGEKWQADPVDWIWFDEQPPEDVYFEGIIRTNRTFGLVCMTFAPLKSILTVVWR 87 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + DR ++M I + HY ++ RI SYP +ER+ART+G P LGSGR+FPI E Sbjct: 88 FLLENVPDRADVQMIIEDAEHYLLEDCARITASYPPYERQARTQGVPALGSGRLFPIAGE 147 Query: 121 DIVINS 126 I + Sbjct: 148 KIGVAP 153 >gi|182681090|ref|YP_001829250.1| bacteriophage terminase large (ATPase) subunit and inactivated derivatives-like protein [Xylella fastidiosa M23] gi|182631200|gb|ACB91976.1| Bacteriophage terminase large (ATPase) subunit and inactivated derivatives-like protein [Xylella fastidiosa M23] Length = 291 Score = 209 bits (531), Expect = 4e-52, Method: Composition-based stats. 
Identities = 61/126 (48%), Positives = 85/126 (67%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 LK++EQG +KWQ++ V ++WFDE+PPEDVYFEG+ R N T GLV +T PLK ++ Sbjct: 41 LKSFEQGGEKWQADPVDWIWFDEQPPEDVYFEGIIRTNRTFGLVCMTFAPLKSILTVVWR 100 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + DR ++M I + HY ++ RI SYP +ER+ART+G P LGSGR+FPI E Sbjct: 101 FLLENVPDRADVQMIIEDAEHYLLEDCARITASYPPYERQARTQGVPALGSGRLFPIAGE 160 Query: 121 DIVINS 126 I + Sbjct: 161 KIGVAP 166 >gi|15837841|ref|NP_298529.1| hypothetical protein XF1239 [Xylella fastidiosa 9a5c] gi|9106220|gb|AAF84049.1|AE003958_3 hypothetical protein XF_1239 [Xylella fastidiosa 9a5c] Length = 135 Score = 200 bits (509), Expect = 1e-49, Method: Composition-based stats. Identities = 63/125 (50%), Positives = 87/125 (69%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 L ++EQG +KWQ++ V ++WFDE+PPEDVYFEG+TR N T LV +T TPLK S ++ Sbjct: 7 LTSFEQGGEKWQADPVDWMWFDEQPPEDVYFEGITRTNRTFWLVCMTFTPLKSISTVVWR 66 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 +L + DR ++M+I + HY ++ RI SYP +EREART+G P LGSGR+FPI E Sbjct: 67 FLLENVPDRADMQMSIEDAEHYLVEDCARITASYPPYEREARTQGVPALGSGRVFPIAGE 126 Query: 121 DIVIN 125 I + Sbjct: 127 KIGVA 131 >gi|260463792|ref|ZP_05811989.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030389|gb|EEW31668.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 131 Score = 162 bits (410), Expect = 4e-38, Method: Composition-based stats. 
Identities = 50/114 (43%), Positives = 68/114 (59%), Gaps = 4/114 (3%) Query: 67 SDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINS 126 R V MTI++ HY+ ER I+ +YP HEREAR +G P+LGSGRIFP+ E I Sbjct: 1 MSRHVTFMTIDDAAHYSPDERAAIVAAYPAHEREARARGIPVLGSGRIFPVAEALIACEP 60 Query: 127 LDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVV----KNYRCREQTPIF 176 +P +W ++G +DFGW HP AA L W+ ++DV+YV +R R P Sbjct: 61 FRLPRYWPRLGALDFGWDHPSAAVELAWDTEADVVYVTNANPCAWRPRPTLPSQ 114 >gi|71274944|ref|ZP_00651232.1| similar to Bacteriophage terminase large (ATPase) subunit and inactivated derivatives [Xylella fastidiosa Dixon] gi|71901567|ref|ZP_00683649.1| similar to Bacteriophage terminase large (ATPase) subunit and inactivated derivatives [Xylella fastidiosa Ann-1] gi|71164676|gb|EAO14390.1| similar to Bacteriophage terminase large (ATPase) subunit and inactivated derivatives [Xylella fastidiosa Dixon] gi|71728653|gb|EAO30802.1| similar to Bacteriophage terminase large (ATPase) subunit and inactivated derivatives [Xylella fastidiosa Ann-1] Length = 142 Score = 155 bits (391), Expect = 7e-36, Method: Composition-based stats. Identities = 45/98 (45%), Positives = 60/98 (61%) Query: 29 VYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNEQERK 88 +YFE +TR N T GLV +T PLK ++ +L + DR ++M I + HY ++ Sbjct: 1 MYFEVITRTNRTFGLVCMTFAPLKSILTVVWRFLLENVPDRADVQMIIEDAEHYFLEDCA 60 Query: 89 RIIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINS 126 RI SYP +EREARTKG P LGSGR+FPI E I + Sbjct: 61 RITASYPPYEREARTKGVPALGSGRLFPIAGEKIGVAP 98 >gi|148557330|ref|YP_001264912.1| bacteriophage terminase large (ATPase) subunit-like protein [Sphingomonas wittichii RW1] gi|148502520|gb|ABQ70774.1| Bacteriophage terminase large (ATPase) subunit and inactivated derivatives-like protein [Sphingomonas wittichii RW1] Length = 225 Score = 116 bits (290), Expect = 3e-24, Method: Composition-based stats. 
Identities = 38/64 (59%), Positives = 46/64 (71%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 KAYEQGR KWQ +T++ +WFDEEPP D+Y EGLTR NAT G LT TPLKG S ++ Sbjct: 161 FKAYEQGRAKWQGDTLNGIWFDEEPPLDIYVEGLTRTNATGGFAMLTFTPLKGMSEVVRM 220 Query: 61 YLSA 64 +L Sbjct: 221 FLEE 224 >gi|13471711|ref|NP_103278.1| hypothetical protein msl1767 [Mesorhizobium loti MAFF303099] gi|14022455|dbj|BAB49064.1| msl1767 [Mesorhizobium loti MAFF303099] Length = 90 Score = 115 bits (288), Expect = 5e-24, Method: Composition-based stats. Identities = 38/69 (55%), Positives = 50/69 (72%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K++E+GR+KWQ T+H VWFDEEPP D+Y EGLTR NAT G+ +T TPL G S ++ Sbjct: 18 FKSFEKGREKWQGETLHGVWFDEEPPLDIYSEGLTRTNATSGITIVTFTPLLGMSDVVLL 77 Query: 61 YLSASSSDR 69 +LSA +R Sbjct: 78 FLSAGEVER 86 >gi|13471714|ref|NP_103281.1| hypothetical protein mll1771 [Mesorhizobium loti MAFF303099] gi|14022458|dbj|BAB49067.1| mll1771 [Mesorhizobium loti MAFF303099] Length = 254 Score = 113 bits (282), Expect = 3e-23, Method: Composition-based stats. Identities = 38/69 (55%), Positives = 50/69 (72%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K++E+GR+KWQ T+H VWFDEEPP D+Y EGLTR NAT G+ +T TPL G S ++ Sbjct: 182 FKSFEKGREKWQGETLHGVWFDEEPPLDIYSEGLTRTNATSGITIVTFTPLLGMSDVVLL 241 Query: 61 YLSASSSDR 69 +LSA +R Sbjct: 242 FLSAGDVER 250 >gi|260463788|ref|ZP_05811985.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030385|gb|EEW31664.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 209 Score = 111 bits (277), Expect = 1e-22, Method: Composition-based stats. 
Identities = 37/64 (57%), Positives = 48/64 (75%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K++E+GR+KWQ T+H VWFDEEPP D+Y EGLTR NAT G+ +T TPL G S ++ Sbjct: 137 FKSFEKGREKWQGETLHGVWFDEEPPLDIYSEGLTRTNATGGITIVTFTPLLGMSDVVLL 196 Query: 61 YLSA 64 +LSA Sbjct: 197 FLSA 200 >gi|158422462|ref|YP_001523754.1| putative DNA packaging protein GP3 [Azorhizobium caulinodans ORS 571] gi|158329351|dbj|BAF86836.1| putative DNA packaging protein GP3 [Azorhizobium caulinodans ORS 571] Length = 203 Score = 102 bits (254), Expect = 5e-20, Method: Composition-based stats. Identities = 31/64 (48%), Positives = 41/64 (64%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEH 60 K+Y+QGR K+Q H VW DEEPP DVY E L R+ T GL+ T TPL+G + I Sbjct: 137 FKSYDQGRKKFQGTAKHVVWLDEEPPADVYQEALMRLMTTSGLMLCTFTPLEGMTDIAAQ 196 Query: 61 YLSA 64 +++A Sbjct: 197 FIAA 200 >gi|30061789|ref|NP_835960.1| putative terminase large subunit [Shigella flexneri 2a str. 2457T] gi|30040031|gb|AAP15765.1| putative terminase large subunit [Shigella flexneri 2a str. 2457T] Length = 124 Score = 87.7 bits (216), Expect = 1e-15, Method: Composition-based stats. Identities = 26/53 (49%), Positives = 32/53 (60%) Query: 199 HDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSGRWKVF 251 + GEQL QY G ML E AT+ DG N VE GI+++ D M GR+KVF Sbjct: 5 TRRAGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELRDMMLDGRFKVF 57 >gi|297565631|ref|YP_003684603.1| hypothetical protein Mesil_1197 [Meiothermus silvanus DSM 9946] gi|296850080|gb|ADH63095.1| hypothetical protein Mesil_1197 [Meiothermus silvanus DSM 9946] Length = 434 Score = 84.3 bits (207), Expect = 1e-14, Method: Composition-based stats. 
Identities = 54/257 (21%), Positives = 97/257 (37%), Gaps = 20/257 (7%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDE----EPPEDVYFEGLTRINATQGLVTLTLTP--LKGR 54 + Q D +S T W DE + D + L R++ QG V +T TP L Sbjct: 129 FFGHAQDPDSLESATAKAAWLDEAGQKKFRRDSWQAILRRLSIHQGRVLITTTPYYLGWL 188 Query: 55 SPIIEHYLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRI 114 + D +++ + P++ E +R + P + + G +G+I Sbjct: 189 KADLHDPARQGHPDIELVNFKSVDNPNFPRAEYERARATLPRWKFDMFYNGLFTRPAGQI 248 Query: 115 FPIVEEDIVINS-LDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQT 173 + + ++ + ++PE W + G+DFG + AA L N S+ +V Y+ +T Sbjct: 249 YDCFDPEVHVRPAFNVPEDWPRFIGLDFGGVN-TAAVKLAKNPASEEYFVYAEYKAGGRT 307 Query: 174 PIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVE 233 H L P A +S ++ G+ + VE Sbjct: 308 AREHAEVLLKGEPRAPHAVGGA------KSEGNWRLEFAAAGLGVAAPPVA------DVE 355 Query: 234 AGISDMLDRMRSGRWKV 250 GI+ + ++SGR V Sbjct: 356 VGINRVYGLLKSGRLYV 372 >gi|71274589|ref|ZP_00650877.1| hypothetical protein XfasaDRAFT_1897 [Xylella fastidiosa Dixon] gi|71898128|ref|ZP_00680314.1| hypothetical protein XfasoDRAFT_3692 [Xylella fastidiosa Ann-1] gi|170730853|ref|YP_001776286.1| hypothetical protein Xfasm12_1760 [Xylella fastidiosa M12] gi|71164321|gb|EAO14035.1| hypothetical protein XfasaDRAFT_1897 [Xylella fastidiosa Dixon] gi|71732102|gb|EAO34158.1| hypothetical protein XfasoDRAFT_3692 [Xylella fastidiosa Ann-1] gi|167965646|gb|ACA12656.1| hypothetical protein Xfasm12_1760 [Xylella fastidiosa M12] Length = 78 Score = 71.9 bits (175), Expect = 7e-11, Method: Composition-based stats. Identities = 23/59 (38%), Positives = 35/59 (59%), Gaps = 7/59 (11%) Query: 46 LTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPL---HEREA 101 +T TPLKG S ++ +L+ ++DR I+ + HY+ +E RII SYP H+R A Sbjct: 1 MTFTPLKGMSTVVRRFLTEDAADRGYIK----DAEHYSAEECARIIASYPPRSAHQRSA 55 >gi|215304|gb|AAA72960.1| unnamed protein product [Enterobacteria phage P22] Length = 101 Score = 62.7 bits (151), Expect = 5e-08, Method: Composition-based stats. 
Identities = 21/34 (61%), Positives = 26/34 (76%) Query: 218 MLPECATFDDGSNGVEAGISDMLDRMRSGRWKVF 251 MLP+ ATF DG N VE+GIS++ D M GR+KVF Sbjct: 1 MLPDHATFPDGGNSVESGISELRDLMLEGRFKVF 34 >gi|313115194|ref|ZP_07800678.1| conserved hypothetical protein [Faecalibacterium cf. prausnitzii KLE1255] gi|310622472|gb|EFQ05943.1| conserved hypothetical protein [Faecalibacterium cf. prausnitzii KLE1255] Length = 482 Score = 60.0 bits (144), Expect = 3e-07, Method: Composition-based stats. Identities = 65/309 (21%), Positives = 100/309 (32%), Gaps = 68/309 (22%) Query: 4 YEQGRDKWQSNTVHYVWFDE-------------------EPPEDVYFEGLTRINATQ-GL 43 Y + R +Q ++ FDE P VY T G Sbjct: 110 YTKDRTNYQGKAFDFIGFDELTHFEWEEYSYMMSRNRPTGPGTRVYMRATTNPGGIGHGW 169 Query: 44 VTLTL-TPLKGRSPIIEHYLSASSSD--------RQVIRMTINETPHYNEQERKRI--ID 92 V TP +PI E Y R I +I + P + + + Sbjct: 170 VKARFITPAPPGTPITEEYTVKLPDGTEQKLQRARVFIPSSIFDNPALLANDPGYLASLA 229 Query: 93 SYPLHEREARTKGEPILGSGRIF------PIVEEDI----VINSLDIPEHWVQIGGMDFG 142 S P E++A G SG++F P ED VI IP+HW G DFG Sbjct: 230 SMPEAEKQALLYGSWDSFSGQVFTEWRNDPAHYEDQRWTHVIAPFTIPKHWQLYRGFDFG 289 Query: 143 WHHPFAAGHLVWNRDSDVIYVVKNYRCREQ-------TPIFHVAALKS--------WGKW 187 + PF+ G + + + + + Y C + P+ ++ G+ Sbjct: 290 FSKPFSVGWYAADEEGRLYRIKELYGCTGRPNEGLRIDPVEQARRIREAEQNDPLLRGRV 349 Query: 188 LPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMR--- 244 + D+ GE ++A R P + G N AG R+ Sbjct: 350 IHGIADPAIF--DESRGESIAAMMERS-----PNFLRWSPGDNTRLAGKMQFHYRLNFDA 402 Query: 245 SGR--WKVF 251 GR ++VF Sbjct: 403 DGRPMFQVF 411 >gi|225419955|ref|ZP_03762258.1| hypothetical protein CLOSTASPAR_06296 [Clostridium asparagiforme DSM 15981] gi|225041463|gb|EEG51709.1| hypothetical protein CLOSTASPAR_06296 [Clostridium asparagiforme DSM 15981] Length = 318 Score = 59.6 bits (143), Expect = 3e-07, Method: Composition-based stats. 
Identities = 30/155 (19%), Positives = 52/155 (33%), Gaps = 13/155 (8%) Query: 39 ATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHE 98 +TLT P S + + D V T +E +R + + Sbjct: 64 GYFKQITLTFNPWSATSWLKARFFDTPDEDTFVKTTTWQCNEWLDESDRNIFLKMQKNNP 123 Query: 99 REARTKGEPILG--SGRIFPIVEEDIVINSLDIPEHWVQIGGM------DFGWHHPFAAG 150 R R +GE G G I+ ++V ++ + I G+ DFG+ P A Sbjct: 124 RRYRIEGEGEWGIAEGLIY----TNVVCEDFNV-DEIRAIPGIKSAFNLDFGFTDPNAFV 178 Query: 151 HLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWG 185 + + + IY+ + T +K G Sbjct: 179 CEMVDNAAMRIYIFDEWYQTGVTNKIIAEQIKKMG 213 >gi|284008126|emb|CBA74349.1| DNA packaging protein gp2 [Arsenophonus nasoniae] Length = 137 Score = 59.6 bits (143), Expect = 4e-07, Method: Composition-based stats. Identities = 18/29 (62%), Positives = 21/29 (72%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDV 29 K Y QGR +WQ +TVH VWFDEEPP + Sbjct: 109 FKPYSQGRARWQGDTVHGVWFDEEPPYAI 137 >gi|160945639|ref|ZP_02092865.1| hypothetical protein FAEPRAM212_03168 [Faecalibacterium prausnitzii M21/2] gi|158443370|gb|EDP20375.1| hypothetical protein FAEPRAM212_03168 [Faecalibacterium prausnitzii M21/2] gi|295103135|emb|CBL00679.1| Terminase-like family. [Faecalibacterium prausnitzii SL3/3] Length = 481 Score = 56.9 bits (136), Expect = 3e-06, Method: Composition-based stats. 
Identities = 56/309 (18%), Positives = 97/309 (31%), Gaps = 68/309 (22%) Query: 4 YEQGRDKWQSNTVHYVWFDE----EPPEDVYFEGLTRINATQGLVTLTLTPLKGRSP--- 56 Y + R +Q ++ FDE E E Y R V + T G Sbjct: 109 YTKDRTNYQGKAYDFIGFDELTHFEWDEYSYMMSRNRPTGPGTRVYMRATTNPGGIGHGW 168 Query: 57 IIEHYLSASSSDRQVIRMTINETPHYNEQERKRI------------------------ID 92 + +++ + ++ P +Q+ +R + Sbjct: 169 VKARFITPAPPGTPIVETVTVRLPDGTDQQMERARVFIPSSVFDNPALLANDPGYLASLA 228 Query: 93 SYPLHEREARTKGEPILGSGRIF----------PIVEEDIVINSLDIPEHWVQIGGMDFG 142 S P E++A G SG++F VI IP+HW G DFG Sbjct: 229 SLPEAEKQALLYGSWDSFSGQVFTEWRNDPAHYQDQRWTHVIAPFAIPKHWPIWRGYDFG 288 Query: 143 WHHPFAAGHLVWNRDSDVIYVVKNYRCREQ-------TPIFHVAALKS--------WGKW 187 + PF+ G + + + + + Y C + P+ ++ G+ Sbjct: 289 FSKPFSVGWYAVDEEGRLYRIKELYGCTGRPNEGLRIDPVEQAKRIREAEQNDPLLRGRV 348 Query: 188 LPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMR--- 244 + D+ GE ++A R P + G + AG R+R Sbjct: 349 IHGVADPAIF--DESRGESIAAMMERS-----PHFLHWQPGDHTRLAGKMQFHYRLRFAP 401 Query: 245 SGR--WKVF 251 GR +VF Sbjct: 402 DGRPMLQVF 410 >gi|257438500|ref|ZP_05614255.1| putative phage terminase, large subunit [Faecalibacterium prausnitzii A2-165] gi|257199079|gb|EEU97363.1| putative phage terminase, large subunit [Faecalibacterium prausnitzii A2-165] Length = 477 Score = 56.2 bits (134), Expect = 5e-06, Method: Composition-based stats. 
Identities = 44/210 (20%), Positives = 69/210 (32%), Gaps = 41/210 (19%) Query: 4 YEQGRDKWQSNTVHYVWFDE-------------------EPPEDVYFEGLTRINATQ-GL 43 Y + R +Q ++ FDE P VY T G Sbjct: 105 YTKDRTNYQGKAFDFIGFDELTHFEWEEYSYMMSRNRPTGPGTRVYLRATTNPGGVGHGW 164 Query: 44 VTLTL-TPLKGRSPIIEHYLSASSSD--------RQVIRMTINETPHYNEQERKRI--ID 92 V TP +PI+E + R I ++ + P E + + + Sbjct: 165 VKARFITPAPPGTPIVEQFPVRMPDGTEKVLERARVFIPSSVFDNPALLENDPDYLASLA 224 Query: 93 SYPLHEREARTKGEPILGSGRIF----------PIVEEDIVINSLDIPEHWVQIGGMDFG 142 S P E++A G SG++F VI IP HW G DFG Sbjct: 225 SLPEAEKQALLYGSWDSFSGQVFTEWRNDPGHYQDQRWTHVIAPFAIPRHWKIYRGYDFG 284 Query: 143 WHHPFAAGHLVWNRDSDVIYVVKNYRCREQ 172 + PF+ G + + + + + Y C + Sbjct: 285 FSKPFSVGWYAADEEGRLYRIKELYGCTGR 314 >gi|295102643|emb|CBL00188.1| Terminase-like family [Faecalibacterium prausnitzii L2-6] Length = 464 Score = 55.8 bits (133), Expect = 5e-06, Method: Composition-based stats. Identities = 64/309 (20%), Positives = 102/309 (33%), Gaps = 68/309 (22%) Query: 4 YEQGRDKWQSNTVHYVWFDE-------------------EPPEDVYFEGLTRINATQ-GL 43 Y + R +Q ++ FDE P VY T G Sbjct: 92 YTKDRTNYQGKAFDFIGFDELTHFEWEEYSYMMSRNRPTGPGTRVYLRATTNPGGVGHGW 151 Query: 44 VTLTL-TPLKGRSPIIEHYLSASSSD--------RQVIRMTINETPHYNEQERKRI--ID 92 V TP +PI+E Y R I +I + P E + + + Sbjct: 152 VKARFITPAPPGTPIVEEYPVRMPDGTEKVLRRARVFIPSSIFDNPALLENDPDYLASLA 211 Query: 93 SYPLHEREARTKGEPILGSGRIF------PIVEEDI----VINSLDIPEHWVQIGGMDFG 142 + P E++A G SG++F P ED VI IP+HW G DFG Sbjct: 212 AMPEAEKQALLYGSWDSFSGQVFTEWRNDPNHYEDQRWTHVIAPFTIPKHWKIYRGYDFG 271 Query: 143 WHHPFAAGHLVWNRDSDVIYVVKNYRCREQ-------TPIFHVAALKS--------WGKW 187 + PF+ G + + + + + Y C + P+ ++ G+ Sbjct: 272 FSKPFSVGWYAADEEGRLYRIKELYGCTGRPNEGLRIDPVEQARRIREAEQNDPVLRGRV 331 Query: 188 LPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMR--- 244 + D+ GE +++ R P + G + AG M R+ Sbjct: 332 IQGIADPAIF--DESRGESIASMMERG-----PNFLHWMPGDHTRLAGKMQMHYRLNFDG 384 Query: 245 SGR--WKVF 251 GR +VF Sbjct: 385 EGRPMLQVF 393 >gi|291335182|gb|ADD94806.1| hypothetical protein 
[uncultured phage MedDCM-OCT-S12-C102] gi|291336563|gb|ADD96112.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C6] Length = 555 Score = 55.4 bits (132), Expect = 8e-06, Method: Composition-based stats. Identities = 29/129 (22%), Positives = 52/129 (40%), Gaps = 11/129 (8%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDEEPPEDVYFEGLT-RINATQGLVTLTLTPLKGRSPIIE 59 K Y Q + + +WFDE P+ + E L R+ + +G + +T TP++G S ++ Sbjct: 168 FKNYTQDLSTLEGTELDLIWFDELVPQS-WVETLKYRLVSRKGKMLITFTPIEGYSSAVK 226 Query: 60 HYLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVE 119 + + + I+ + + + + ARTK GSG+IF Sbjct: 227 SAMEGAIIEETREAKLIDPN---SPGNIPGVPKGHMPY--RARTKN----GSGKIFWFYS 277 Query: 120 EDIVINSLD 128 E D Sbjct: 278 EWNPYTPFD 286 >gi|291556861|emb|CBL33978.1| Terminase-like family [Eubacterium siraeum V10Sc8a] Length = 487 Score = 55.0 bits (131), Expect = 9e-06, Method: Composition-based stats. Identities = 37/204 (18%), Positives = 69/204 (33%), Gaps = 34/204 (16%) Query: 67 SDRQVIRMTINETPHYNEQERKRI--IDSYPLHEREARTKGEPILGSGRIF----PIVEE 120 R I ++ + + + + + P ER+A G+ +G++F E Sbjct: 204 KSRVFIPASVFDNKELLRNDPEYLASLSMLPTAERKALLYGDWNSFTGQVFTEWRDDPEH 263 Query: 121 ------DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQ-- 172 VI +IP HW + G DFG+ PF+ G + + + + Y C ++ Sbjct: 264 YCDRRWTHVIAPFEIPRHWEIVRGFDFGYTRPFSVGWYAVDTKGCIYRIREYYGCTDKAN 323 Query: 173 -----TPIFHVAALKS--------WGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKML 219 P ++ G+ + DK GE ++ R Sbjct: 324 EGIRLEPSVIAENIRKIERDDPNIRGRNVYGVADPSIF--DKSRGESVADLMARS----- 376 Query: 220 PECATFDDGSNGVEAGISDMLDRM 243 P + G N +G +R+ Sbjct: 377 PNFIIWSPGDNARISGKMQYHNRL 400 >gi|167749269|ref|ZP_02421396.1| hypothetical protein EUBSIR_00220 [Eubacterium siraeum DSM 15702] gi|167657762|gb|EDS01892.1| hypothetical protein EUBSIR_00220 [Eubacterium siraeum DSM 15702] Length = 487 Score = 55.0 bits (131), Expect = 1e-05, Method: Composition-based stats. 
Identities = 37/204 (18%), Positives = 69/204 (33%), Gaps = 34/204 (16%) Query: 67 SDRQVIRMTINETPHYNEQERKRI--IDSYPLHEREARTKGEPILGSGRIF----PIVEE 120 R I ++ + + + + + P ER+A G+ +G++F E Sbjct: 204 KSRVFIPASVFDNKELLRNDPEYLASLSMLPTAERKALLYGDWNSFTGQVFTEWRDDPEH 263 Query: 121 ------DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQ-- 172 VI +IP HW + G DFG+ PF+ G + + + + Y C ++ Sbjct: 264 YCDRRWTHVIAPFEIPRHWEIVRGFDFGYTRPFSVGWYAVDTKGCIYRIREYYGCTDKAN 323 Query: 173 -----TPIFHVAALKS--------WGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKML 219 P ++ G+ + DK GE ++ R Sbjct: 324 EGIRLEPSVIAENIRKIERDDPNIRGRNVYGVADPSIF--DKSRGESVADLMARS----- 376 Query: 220 PECATFDDGSNGVEAGISDMLDRM 243 P + G N +G +R+ Sbjct: 377 PNFIIWSPGDNARISGKMQYHNRL 400 >gi|194466550|ref|ZP_03072537.1| phage terminase, large subunit, PBSX family [Lactobacillus reuteri 100-23] gi|194453586|gb|EDX42483.1| phage terminase, large subunit, PBSX family [Lactobacillus reuteri 100-23] Length = 421 Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 66/182 (36%), Gaps = 10/182 (5%) Query: 27 EDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQV--IRMTINETPHYNE 84 ++ + E + R + + P + + + Y+ ++ +++ + Sbjct: 136 QEAFQEIIQRCSKPGARIICDTNPDSPQHYLKKDYIDNKDPKARIKTFHFVLDDNTFLPK 195 Query: 85 QERKRIIDSYPLHEREART-KGEPILGSGRIF-PIVEEDIVINSLDIPEHWVQIGGMDFG 142 + + P R+ G + G G ++ E + + D+P+ G+D+G Sbjct: 196 DYVDSLKAATPSGMYYDRSILGLWVTGEGAVYKDFDERTMTVKREDLPDSLTYTAGVDWG 255 Query: 143 WHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAAL-----KSWGKWLPWAWPHDGL 197 + HP A + + D Y+V + + H A+ K +G +P+ Sbjct: 256 YDHPTAIEIIGHD-DKGNYYLVDEAYGQFEQVDPHWIAVAQKFRKKYGLKMPFYADTART 314 Query: 198 QH 199 +H Sbjct: 315 EH 316 >gi|291460129|ref|ZP_06599519.1| phage terminase, large subunit, PBSX family [Oribacterium sp. oral taxon 078 str. F0262] gi|291417470|gb|EFE91189.1| phage terminase, large subunit, PBSX family [Oribacterium sp. oral taxon 078 str. 
F0262] Length = 408 Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats. Identities = 33/181 (18%), Positives = 58/181 (32%), Gaps = 10/181 (5%) Query: 15 TVHYVWFDE--EPPEDVYFEGLTRINA-----TQGLVTLTLTPLKGRSPIIEHYLSASSS 67 + +VW +E E ED + + I +TLT P S + + A Sbjct: 123 CLCFVWIEEAYEVAEDDFNKLDMSIRGEVPEGYFKQLTLTFNPWSATSWLKARFFDAPDD 182 Query: 68 DRQVIRMTINETPHYNEQERK--RIIDSYPLHEREARTKGEPILGSGRIFPIVE-EDIVI 124 T ++ +R ++ GE + G I+P ED + Sbjct: 183 TIFTKTTTWQCNEWLDDADRHIFEMMRRQNPRRYRIEGDGEWGIAEGLIYPNHRMEDFDV 242 Query: 125 NSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSW 184 + E +DFG+ P A + + + IY+ + T +K Sbjct: 243 GEIRAQEGVKAAFNLDFGFTDPNAFVCELVDGQAKKIYIFDEWYQSGVTNRIIAETIKEK 302 Query: 185 G 185 G Sbjct: 303 G 303 >gi|28377533|ref|NP_784425.1| prophage Lp1 protein 38 [Lactobacillus plantarum WCFS1] gi|28270365|emb|CAD63266.1| prophage Lp1 protein 38 [Lactobacillus plantarum WCFS1] Length = 428 Score = 53.5 bits (127), Expect = 3e-05, Method: Composition-based stats. Identities = 47/236 (19%), Positives = 88/236 (37%), Gaps = 25/236 (10%) Query: 18 YVWFDEEPPEDVYFEGLTRINATQG-------LVTLTLTPLKGRSPIIEHYLSASSSDRQ 70 ++W +E + + + T I + +G VTLT P + + D Sbjct: 128 WLWVEEAYEIESFSKLQTVIESLRGNDPQVFYQVTLTFNPWNEHHWLKREFFDQPRDDTF 187 Query: 71 VIRMTINETPHYNEQERKRIIDSYPLHEREART--KGEPILGSGRIFPIVEEDIVINSLD 128 V T+ +++ ++R+ Y + R A+T GE + G +F E I N++D Sbjct: 188 VRTTTVRCNEFVSDEYKQRLYSLYQTNPRRAKTVVDGEWGVAEGLVFEDNIEQIEFNAMD 247 Query: 129 IPEHWVQIG-GMDFGW-HHPFAAGHLVWNRDSDVIYVVKN-YRCREQTPIFHVAALKSWG 185 + Q G G+D+G+ + P A + + + ++V Y + TP Sbjct: 248 KIQECGQTGFGLDYGFSNDPNAFVAVAVDVRNKQLWVYNEMYTYHQTTPHI--------A 299 Query: 186 KWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLD 241 +WL + + + +AQ G+ VEAGI + Sbjct: 300 EWLKVNGYERARIYADSANSERTAQLNDLGITNADSVVKTP-----VEAGIDQLWQ 350 >gi|291529974|emb|CBK95559.1| Terminase-like family [Eubacterium siraeum 70/3] Length = 487 Score = 53.1 bits (126), Expect = 3e-05, Method: Composition-based stats. 
Identities = 37/204 (18%), Positives = 69/204 (33%), Gaps = 34/204 (16%) Query: 67 SDRQVIRMTINETPHYNEQERKRI--IDSYPLHEREARTKGEPILGSGRIF----PIVEE 120 R I ++ + + + + + P ER+A G+ +G++F E Sbjct: 204 KSRVFIPASVFDNKELLRNDPEYLASLSMLPTAERKALLYGDWNSFTGQVFTEWRDDPEH 263 Query: 121 ------DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQ-- 172 VI +IP HW + G DFG+ PF+ G + + + + Y C ++ Sbjct: 264 YCDRRWTHVIAPFEIPRHWEIVRGFDFGYTRPFSVGWYAVDTKGCIYRIREYYGCTDKAN 323 Query: 173 -----TPIFHVAALKS--------WGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKML 219 P ++ G+ + DK GE ++ R Sbjct: 324 EGIRLEPSAIAENIRKIERDDPNIRGRNVYGVADPSIF--DKSRGESVADLMARS----- 376 Query: 220 PECATFDDGSNGVEAGISDMLDRM 243 P + G N +G +R+ Sbjct: 377 PYFIIWSPGDNARISGKMQYHNRL 400 >gi|121534831|ref|ZP_01666651.1| protein of unknown function DUF264 [Thermosinus carboxydivorans Nor1] gi|121306626|gb|EAX47548.1| protein of unknown function DUF264 [Thermosinus carboxydivorans Nor1] Length = 845 Score = 52.7 bits (125), Expect = 5e-05, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 64/175 (36%), Gaps = 24/175 (13%) Query: 57 IIEHYLSASSSDRQVIR-----------MTINETPHYNEQERKRI--IDSYPLHEREART 103 + ++ + + + + + P + + ++S P ER+A Sbjct: 540 VKARFIDVAPPGKTYVDPVTGLTRCFVPARVFDNPILLRADPLYLKRLESLPEAERKALL 599 Query: 104 KGEPILGSGRIFPIVEEDI-VINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIY 162 G+ +G++F D+ V+ IP W++ MD+G+ P+ + + VIY Sbjct: 600 LGDWDAFAGQVFSEWRRDVHVVEPFAIPAGWLRFRAMDWGFSKPYCILWFAVDYNG-VIY 658 Query: 163 VVKNYR---------CREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLS 208 V + ++T +K+ W + + SGE+++ Sbjct: 659 VYRELYGLKPGCVDVGTQETAREVAQKVKAAEDWKNFIADEGVKLETQLSGEKIA 713 >gi|187251258|ref|YP_001875740.1| phage terminase large subunit [Elusimicrobium minutum Pei191] gi|186971418|gb|ACC98403.1| Phage terminase, large subunit [Elusimicrobium minutum Pei191] Length = 397 Score = 52.7 bits (125), Expect = 5e-05, Method: Composition-based stats. 
Identities = 46/224 (20%), Positives = 93/224 (41%), Gaps = 26/224 (11%) Query: 8 RDKWQSNTVHYVWFDE--EPPEDVYFEGLTRINA-----TQGLVTLTLTPLKGRSPIIEH 60 +K +S +Y+W +E E + Y LTR++A + + LTL P S I + Sbjct: 111 PEKIKSAEFNYIWMEEATEFTYEDYVTLLTRLSAPIKEPYKNQIFLTLNPSDSNSWIAKK 170 Query: 61 YLSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE 120 LSA + Q+I+ + + P ++ ++ + E R G+ + Sbjct: 171 LLSAQ--NTQIIKSSYKDNPFLSKDYINTLLGLKDIDENYYRVFALGQWGANK------- 221 Query: 121 DIVINSL----DIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIF 176 +IV ++ +I I G+DFG+++P A L + + +Y + T Sbjct: 222 NIVYDNYTFVDEIKNTDNVIWGLDFGFNNPSALVKLYISDEG--VYTEEKLYKSGLT--- 276 Query: 177 HVAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLP 220 + A +K+ + +P + H+ + D +++ G + P Sbjct: 277 NSALIKNLAEIIPPSQRHESIYADAAEPARIAE-ISEAGFNIHP 319 >gi|313113988|ref|ZP_07799543.1| conserved domain protein [Faecalibacterium cf. prausnitzii KLE1255] gi|310623690|gb|EFQ07090.1| conserved domain protein [Faecalibacterium cf. prausnitzii KLE1255] Length = 486 Score = 51.1 bits (121), Expect = 1e-04, Method: Composition-based stats. 
Identities = 52/307 (16%), Positives = 102/307 (33%), Gaps = 68/307 (22%) Query: 6 QGRDKWQSNTVHYVWFDE--EPPEDVYFEGLTRINATQGLVTL----TLTPL-KGRSPII 58 Q + +Q ++ DE + Y ++R T + T P G + Sbjct: 113 QDKYNYQGKAFDFIGVDELTHFTWEEYSYLMSRNRPTGPGTAVYMRATANPGGIGHGWVK 172 Query: 59 EHYLSASSSDRQVIRM----------------------TINETPHYNEQERKRI--IDSY 94 +++ + +++++ T+ + E + + + S Sbjct: 173 ARFITPAPPGTRMVQLVDVKKPDGSVEKLRRTRVFIPSTVFDNKKLLENDPGYLGTLASL 232 Query: 95 PLHEREARTKGEPILGSGRIF------PIVEEDI----VINSLDIPEHWVQIGGMDFGWH 144 P E++A G+ +G++F P ED VI IP HW G DFG+ Sbjct: 233 PEAEKQALLYGDWDSFNGQVFTEWRNDPAHYEDQRWTHVIKPFRIPAHWRIWRGYDFGYA 292 Query: 145 HPFAAGHLVWNRDSDVIYVVKNYRCREQ-------TPIFHVAALKS--------WGKWLP 189 PF+ G + + + + + Y C P+ +K G+ + Sbjct: 293 KPFSVGWYAADEEGRLYRIKELYGCTGVPNEGLKIDPVEQARRIKEAEENDPMLRGRQIT 352 Query: 190 WAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMR---SG 246 + GE ++A + P + G + AG + R+ G Sbjct: 353 GVADPAIFNESQ--GESIAAMQEKH-----PNYIFWTPGDHTRLAGKMQLHYRLAFDGEG 405 Query: 247 R--WKVF 251 R ++VF Sbjct: 406 RPMFQVF 412 >gi|256847412|ref|ZP_05552858.1| PBSX family phage terminase, large subunit [Lactobacillus coleohominis 101-4-CHN] gi|256716076|gb|EEU31051.1| PBSX family phage terminase, large subunit [Lactobacillus coleohominis 101-4-CHN] Length = 386 Score = 50.4 bits (119), Expect = 2e-04, Method: Composition-based stats. 
Identities = 44/233 (18%), Positives = 88/233 (37%), Gaps = 29/233 (12%) Query: 28 DVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYL--SASSSDR-QVIRMTINETPHYNE 84 DV+ E L R + V P + + Y+ S +R + TI++ P + Sbjct: 139 DVFQEILDRCSVPNARVLCDTNPDNPQHWLKVDYIDKSNEPKNRIKAFHFTIDDNPTLDP 198 Query: 85 QERKRIIDSYPLHEREART-KGEPILGSGRIF-PIVEEDIVINSLDIPEHWVQIGGMDFG 142 + P R+ G + G G ++ E ++++ +P I G+D+G Sbjct: 199 TYVSTLKAVTPSGMYYDRSILGLWVTGEGAVYKDFDERKMIVD--KVPPMARYIAGVDWG 256 Query: 143 WHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAAL-----KSWGKWLPWAWPHDGL 197 + H + ++D D Y+V+ + + + I + + + +GK +P+ Sbjct: 257 YQHYGSIVVFGVDKD-DNWYLVEEHSEKYKE-IDYWTDIAHELQEKYGKNMPFYCDTART 314 Query: 198 QHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSGRWKV 250 +H ++ G+ L G V GI + M+ GR+ V Sbjct: 315 EH--------IDHFKHSGINALY-------GWKSVVPGIEIVASLMKQGRFFV 352 >gi|90592610|ref|YP_529870.1| putative phage terminase large subunit B [Lactobacillus phage KC5a] gi|89891939|gb|ABD78812.1| putative phage terminase large subunit B [Lactobacillus phage KC5a] Length = 409 Score = 50.4 bits (119), Expect = 2e-04, Method: Composition-based stats. 
Identities = 40/232 (17%), Positives = 83/232 (35%), Gaps = 26/232 (11%) Query: 28 DVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQV--IRMTINETPHYNEQ 85 DV+ E L R + + P + Y+ ++ TI++ ++ Sbjct: 139 DVFQEILQRCSIEGARIICDTNPDIPTHWLKTDYIDNHDPKARIKSFTFTIDDNTFLSKD 198 Query: 86 ERKRIIDSYPLHEREART-KGEPILGSGRIF-PIVEEDIVINSLDIPEHWVQIGGMDFGW 143 + I + P R G+ + G G ++ ++ +VI +P+ G+D+G+ Sbjct: 199 YVESIKAATPRGMFYDRGILGQWVTGDGIVYQDFNKDTMVIPKNRVPDGLDYYVGVDWGY 258 Query: 144 HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRS 203 HP L ++D + YV+++Y + K W+ A Q+ + Sbjct: 259 EHPNPIILLGDDKDGNT-YVLEDYTQKH----------KFINYWVKVA------QNLQTR 301 Query: 204 GEQLSAQYRRQGMKMLPECATFD-----DGSNGVEAGISDMLDRMRSGRWKV 250 + Y + + + V GI + +MR G++ V Sbjct: 302 FGRNLIFYADSARPDNVNEFQSNGLNCINANKNVLPGIECVARKMREGKFYV 353 >gi|62327097|ref|YP_223885.1| putative large subunit terminase [Lactobacillus phage phiJL-1] gi|37930114|gb|AAP74512.1| putative large subunit terminase [Lactobacillus phage phiJL-1] Length = 440 Score = 50.0 bits (118), Expect = 3e-04, Method: Composition-based stats. Identities = 43/213 (20%), Positives = 76/213 (35%), Gaps = 21/213 (9%) Query: 44 VTLTLTPLKGRSPIIEHYLSASSSDRQ--VIRMTINETPHYNEQERKRIIDSYPLHEREA 101 +T P R + + + I T + H N + + + A Sbjct: 174 TVITFNPWSDRHWLKHEFFDDKTKRNHSRAITTTYKDNDHLNADYVDSLKEMLVRNPNRA 233 Query: 102 RTK--GEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHH-PFAAGHLVWNRDS 158 R GE + G +F + E + +I + + G+DFG+ H P A + ++D+ Sbjct: 234 RVAVLGEWGIAEGLVFDGLFEQRDFSYDEI-ANLPKSVGLDFGFKHDPTAGEFIAVDQDN 292 Query: 159 DVIYVVKN-YRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMK 217 ++Y+ Y+ T K LP ++R +LS Q+R Sbjct: 293 RIVYIYDEFYKQHLLTNQIAQELAKHKAFGLPITADSA----EQRMIVELSQQHR----- 343 Query: 218 MLPECATFDDGSNGVEAGISDMLDRMRSGRWKV 250 +P G + V GI M+S R+ V Sbjct: 344 -VPNIKPSGKGKDSVIQGI----QYMQSYRFVV 371 >gi|304437549|ref|ZP_07397505.1| PBSX family phage terminase [Selenomonas sp. oral taxon 149 str. 67H29BP] gi|304369471|gb|EFM23140.1| PBSX family phage terminase [Selenomonas sp. 
oral taxon 149 str. 67H29BP] Length = 416 Score = 49.2 bits (116), Expect = 5e-04, Method: Composition-based stats. Identities = 40/264 (15%), Positives = 99/264 (37%), Gaps = 33/264 (12%) Query: 5 EQGRDKW-QSNTVHYVWFDEEP--PEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHY 61 ++G +K+ + T+ + DE PE + + L R++ + T P + + Y Sbjct: 113 DEGSEKFIRGKTLAGAYCDELTLMPERFFKQLLNRLSVPGAKLYSTTNPDSPMHYLYKEY 172 Query: 62 LSASSSDR----QVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPI 117 +++ R V+ +++ P+ + + I SY + G +L G I+ + Sbjct: 173 VTSEQKLRDGLVSVVHFELDDNPNLTDDYKTNIRSSYSGMWFKRMILGLWVLAEGIIYDM 232 Query: 118 VEEDIVINSLDIPEHW----VQIGGMDFGWHHPFAAGHLVWNRDS----DVIYVVKNYRC 169 ++++ + + + D+G +P + + ++ ++ Y + Sbjct: 233 FSDELLFDDAEFTNTLRSTCRRYIACDYGTKNPMVFLDIYDDGETIWIPNLYYWDSRKKQ 292 Query: 170 REQTPIFHVAAL-KSWGKWLP--WAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFD 226 R++T + AL K G+ P S + + +G ++ Sbjct: 293 RQKTDAQYADALEKMVGEEYPDFIVIDP--------SAASFKLECQGRGFRV-------K 337 Query: 227 DGSNGVEAGISDMLDRMRSGRWKV 250 D N V GI ++ + + ++ Sbjct: 338 DADNSVNDGIREVAKLLTKKKIRI 361 >gi|51512091|gb|AAU05290.1| terminase large subunit [Enterobacteria phage T5] Length = 438 Score = 49.2 bits (116), Expect = 6e-04, Method: Composition-based stats. 
Identities = 41/201 (20%), Positives = 78/201 (38%), Gaps = 30/201 (14%) Query: 63 SASSSDRQVIRMTINETPHY----NEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIV 118 + + I T + P E+ R+ + +Y E EA + G+IF Sbjct: 202 DDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQEYEADF----SVFEGQIFDTF 257 Query: 119 EEDIVINSLDIPEHWVQ-------IGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCRE 171 + L H+ + + G+D G+ P A + ++ D+D YV++ Y+ E Sbjct: 258 NATDHVKDLKGMRHFFKDDEAFETLLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAE 317 Query: 172 QTPIFHVAALKSWGKWLPWAWPH--DGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGS 229 +T H A ++ H D + D+ + +AQ+R Q + E A+ Sbjct: 318 KTTAQHAAYIQ-----------HCIDRYKVDRIFVDSAAAQFR-QDLAYEHEIASAP-AK 364 Query: 230 NGVEAGISDMLDRMRSGRWKV 250 V G++ + + G+ V Sbjct: 365 KSVLDGLACLQALFQQGKIIV 385 >gi|41179257|ref|NP_958579.1| putative terminase large subunit [Lactobacillus prophage Lj965] gi|42518398|ref|NP_964328.1| Lj965 prophage terminase large subunit [Lactobacillus johnsonii NCC 533] gi|39652610|gb|AAK27894.2| putative terminase large subunit [Lactobacillus prophage Lj965] gi|41582683|gb|AAS08294.1| Lj965 prophage terminase large subunit [Lactobacillus prophage Lj965] Length = 424 Score = 49.2 bits (116), Expect = 6e-04, Method: Composition-based stats. 
Identities = 37/174 (21%), Positives = 71/174 (40%), Gaps = 9/174 (5%) Query: 5 EQGRDKWQSNTVHYVWFDEEP--PEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYL 62 E +D Q T+ +FDE P+ + R + T + P +++ Sbjct: 121 EASQDLVQGITLAGFFFDEVALMPQSFVNQATARCSVTGSKMWFNCNPSGPFHWFKLNWI 180 Query: 63 SASSSDRQV-IRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFP-IVEE 120 R + I T+++ P + R Y + +G ++ G I+ ++ Sbjct: 181 DQMKDKRALRIHFTMHDNPSLDSVTINRYERMYSGVFYQRYIQGLWVMSEGVIYDNFDKD 240 Query: 121 DIVINSLDIPEHWVQIG-GMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQT 173 +V+N ++P H+ + D+G +P A L+W R+ V Y+VK Y +T Sbjct: 241 TMVVN--ELPNHFEKYYVSCDYGTLNPTA--FLLWGRNHGVWYLVKEYYYSGRT 290 >gi|182682964|ref|YP_001837088.1| terminase, large subunit [Enterobacteria phage EPS7] gi|182630676|gb|ACB97608.1| terminase, large subunit [Enterobacteria phage EPS7] Length = 438 Score = 48.8 bits (115), Expect = 7e-04, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 78/201 (38%), Gaps = 30/201 (14%) Query: 63 SASSSDRQVIRMTINETPHY----NEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIV 118 + + + I T + P E+ R+ + +Y E EA + G+IF Sbjct: 202 NETLPNWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQEYEADF----SVFEGQIFDTF 257 Query: 119 EEDIVINSLDIPEHWVQ-------IGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCRE 171 + L H+ + + G+D G+ P A + ++ D+DV YV++ Y+ E Sbjct: 258 NAIEHVKDLKGMRHFFKDDEAFETLLGIDVGYRDPTAVLTIKYHYDTDVYYVLEEYQQAE 317 Query: 172 QTPIFHVAALKSWGKWLPWAWPH--DGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGS 229 +T H ++ H D D+ + +AQ+R Q + E A+ Sbjct: 318 KTTAQHATYIQ-----------HCIDRYNVDRIFVDSAAAQFR-QDLAYEHEIASAP-AK 364 Query: 230 NGVEAGISDMLDRMRSGRWKV 250 V G++ + + G+ V Sbjct: 365 KSVLDGLACLQALFQQGKIIV 385 >gi|152976656|ref|YP_001376173.1| PBSX family phage terminase large subunit [Bacillus cereus subsp. cytotoxis NVH 391-98] gi|152025408|gb|ABS23178.1| phage terminase, large subunit, PBSX family [Bacillus cytotoxicus NVH 391-98] Length = 434 Score = 48.8 bits (115), Expect = 7e-04, Method: Composition-based stats. 
Identities = 36/175 (20%), Positives = 65/175 (37%), Gaps = 23/175 (13%) Query: 18 YVWFDE--EPPEDVYFEGLTRINATQG-------LVTLTLTPLKGRSPIIEHYLSASSS- 67 + WF+E E + FE + +T+T P + ++ + Sbjct: 137 WAWFEEAYEIEDQHKFETVVESIRGSWDSPDFFKQITVTFNPWSENHWLKSYFFDEETQA 196 Query: 68 -DRQVIRMTINETPHYNEQERKRIIDSYPLHEREAR--TKGEPILGSGRIFPIVEEDIVI 124 D I T ++Q+R R Y + R AR GE + G ++ E+ + Sbjct: 197 YDTFAITTTYKCNEWLDKQDRARYESLYEKNPRRARIVCDGEWGVADGLVY----ENFQV 252 Query: 125 NSLDIPEHWVQ-----IGGMDFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQT 173 DI E + G+DFG+ + P A + + ++ IYV Y + + Sbjct: 253 RDFDIDEIRQRKDVQSAFGLDFGYTNDPTALCCSLVDLRNETIYVFDEYYEKGMS 307 >gi|260887046|ref|ZP_05898309.1| phage terminase, large subunit, PBSX family [Selenomonas sputigena ATCC 35185] gi|330839176|ref|YP_004413756.1| phage terminase, large subunit, PBSX family [Selenomonas sputigena ATCC 35185] gi|260863108|gb|EEX77608.1| phage terminase, large subunit, PBSX family [Selenomonas sputigena ATCC 35185] gi|329746940|gb|AEC00297.1| phage terminase, large subunit, PBSX family [Selenomonas sputigena ATCC 35185] Length = 416 Score = 48.8 bits (115), Expect = 7e-04, Method: Composition-based stats. 
Identities = 44/263 (16%), Positives = 95/263 (36%), Gaps = 30/263 (11%) Query: 5 EQGRDKW-QSNTVHYVWFDEEP--PEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHY 61 ++G +K+ + T+ + DE PE + + L R++ + T P + + Y Sbjct: 113 DEGSEKFIRGKTLAGAYCDELTLMPERFFKQLLNRLSVPGAKLYSTTNPDSPMHYLYKEY 172 Query: 62 LSASSSDR----QVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPI 117 +++ R V+ +++ P+ + + I SY + G +L G I+ + Sbjct: 173 VTSDEKLRDGLVSVVHFELDDNPNLTDDYKTNIRSSYSGMWFKRMILGLWVLAEGVIYDM 232 Query: 118 VEEDIVINSLDIPEHWVQIG----GMDFGWHHPFAAGHLVWNRDSDVIYVVKNYR----- 168 +D++ + + D+G +P L D D I++ Y Sbjct: 233 FSDDLLFDDAEFTNTLKSTCRRHIACDYGTKNPM--VFLDIYDDGDTIWIPAMYYWDSRK 290 Query: 169 -CREQTPIFHVAAL-KSWGKWLP--WAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECAT 224 R++T + L + G+ P S + + +G ++ + Sbjct: 291 EQRQKTDAQYADDLERMLGEEYPDFIVIDP--------SAASFKLECQGRGFRVKDADNS 342 Query: 225 FDDGSNGVEAGISDMLDRMRSGR 247 +DG V ++ RM R Sbjct: 343 VNDGIREVAKLLTKKKIRMHRTR 365 >gi|251810445|ref|ZP_04824918.1| large terminase subunit [Staphylococcus epidermidis BCM-HMP0060] gi|251806049|gb|EES58706.1| large terminase subunit [Staphylococcus epidermidis BCM-HMP0060] Length = 420 Score = 48.8 bits (115), Expect = 7e-04, Method: Composition-based stats. 
Identities = 33/184 (17%), Positives = 68/184 (36%), Gaps = 13/184 (7%) Query: 62 LSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGE-PILGSGRIFPIVEE 120 S + V T + P ++Q + + +E+ R + +GSG + P Sbjct: 185 TSFQPDNTFVHHSTYLDNPFISKQFIQEAESAKERNEQRYRWEYMGEAIGSGVV-PFNNL 243 Query: 121 DIVINSLDIPEHWVQI-GGMDFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHV 178 I ++ + + I G+DFG+ P A +++ VIY + Y + + + Sbjct: 244 QIETIPQEMIDGFDNIRNGLDFGYADDPLAFVRWHYDKKKRVIYAIDEYYGVQISNRQYA 303 Query: 179 AALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISD 238 + W + + D + D + ++ + GMK + G + E G Sbjct: 304 NEM--WKRK----YQSDDIYADHAEPKSIAELKQEHGMKKVR---PVKKGPDSREYGEQW 354 Query: 239 MLDR 242 + D Sbjct: 355 LSDL 358 >gi|227833811|ref|YP_002835518.1| putative phage terminase [Corynebacterium aurimucosum ATCC 700975] gi|262184752|ref|ZP_06044173.1| putative phage terminase [Corynebacterium aurimucosum ATCC 700975] gi|227454827|gb|ACP33580.1| putative phage terminase [Corynebacterium aurimucosum ATCC 700975] Length = 751 Score = 48.4 bits (114), Expect = 0.001, Method: Composition-based stats. 
Identities = 40/201 (19%), Positives = 76/201 (37%), Gaps = 20/201 (9%) Query: 5 EQG-RDKWQSNTVHYVWFDEE--PPEDVYFEGLTRINATQGL----VTLTLTPLKGRSPI 57 +QG + + T +++DE PE+V+ +R+ AT V T P + Sbjct: 109 DQGAEGRIRGGTYQLLFYDELTLCPENVWEMLWSRMRATGNPNPPRVFATTNPATPAHYL 168 Query: 58 IEHYLSASSSDRQVIRM-TINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFP 116 +++ R+ T+++ P E+ ++R+ SY +GE G ++ Sbjct: 169 KTNFIDKPGETDTYARLFTMDDNPGLTEEYKERMKASYTGIFYRRMIRGEWAAAEGAVYE 228 Query: 117 IVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYR-------- 168 + D ++ + V G+D+G +HP A L D + V + Sbjct: 229 SWDPDTMVKGRAVGT--VLAVGIDYGTNHPSAGYALTVTEDG--LQVTHEWSPQTTGLGG 284 Query: 169 CREQTPIFHVAALKSWGKWLP 189 T +L+ W LP Sbjct: 285 RTRLTDGELADSLQEWLSTLP 305 >gi|326633035|ref|YP_004306624.1| terminase large subunit [Enterobacteria phage SPC35] gi|321272229|gb|ADW80121.1| terminase large subunit [Enterobacteria phage SPC35] Length = 438 Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats. Identities = 41/201 (20%), Positives = 78/201 (38%), Gaps = 30/201 (14%) Query: 63 SASSSDRQVIRMTINETPHY----NEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIV 118 + + I T + P E+ R+ + +Y E EA + G+IF Sbjct: 202 DDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQEYEADF----SVFEGQIFDTF 257 Query: 119 EEDIVINSLDIPEHWVQ-------IGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCRE 171 + L H+ + + G+D G+ P A + ++ D+D YV++ Y+ E Sbjct: 258 NAIDHVKDLKGMRHFFKDDEAFETLLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAE 317 Query: 172 QTPIFHVAALKSWGKWLPWAWPH--DGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGS 229 +T H A ++ H D + D+ + +AQ+R Q + E A+ Sbjct: 318 KTTAQHAAYIQ-----------HCIDRYKVDRVFVDSAAAQFR-QDLAYEHEIASAP-AK 364 Query: 230 NGVEAGISDMLDRMRSGRWKV 250 V G++ + + G+ V Sbjct: 365 KSVLDGLACLQALFQQGKIIV 385 >gi|329735579|gb|EGG71866.1| phage terminase, large subunit, PBSX family [Staphylococcus epidermidis VCU028] Length = 420 Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats. 
Identities = 33/184 (17%), Positives = 67/184 (36%), Gaps = 13/184 (7%) Query: 62 LSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGE-PILGSGRIFPIVEE 120 S + V T + P ++Q + + +E R + +GSG + P Sbjct: 185 TSFQPDNTFVHHSTYLDNPFISKQFIQEAESTKERNELRYRWEYMGEAIGSGVV-PFNNL 243 Query: 121 DIVINSLDIPEHWVQI-GGMDFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHV 178 I ++ + + I G+DFG+ P A +++ VIY + Y + + + Sbjct: 244 QIETIPQEMIDGFDNIRNGLDFGYADDPLAFVRWHYDKKKRVIYAIDEYYGVQISNRQYA 303 Query: 179 AALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISD 238 + W + + D + D + ++ + GMK + G + E G Sbjct: 304 NEM--WKRK----YQSDDIYADHAEPKSIAELKQEHGMKKVR---PVKKGPDSREYGEQW 354 Query: 239 MLDR 242 + D Sbjct: 355 LSDL 358 >gi|324325093|gb|ADY20353.1| phage terminase, large subunit, PBSX family [Bacillus thuringiensis serovar finitimus YBT-020] Length = 434 Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats. Identities = 37/187 (19%), Positives = 68/187 (36%), Gaps = 23/187 (12%) Query: 18 YVWFDE--EPPEDVYFEGLTRINA-------TQGLVTLTLTPLKGRSPIIEHYLSASSS- 67 + WF+E E + FE + +T+T P + ++ + Sbjct: 137 WAWFEEAYEIEDQHKFETVVESIRGSYDSPDFFKQITVTFNPWSENHWLKSYFFDEETQA 196 Query: 68 -DRQVIRMTINETPHYNEQERKRIIDSYPLHEREAR--TKGEPILGSGRIFPIVEEDIVI 124 D I T +EQ+R R Y + R AR GE + G ++ E+ + Sbjct: 197 YDTFAITTTYKCNEWLDEQDRARYESLYTKNPRRARIVCDGEWGVADGLVY----ENFQV 252 Query: 125 NSLDIPEHWVQ-----IGGMDFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHV 178 DI E + G+DFG+ + P A + + ++ IYV + + + Sbjct: 253 RDFDIDEIRQRKDVQSAFGLDFGYTNDPTALCCSLVDMRNETIYVFDEHYEKGMSNKRIA 312 Query: 179 AALKSWG 185 ++ G Sbjct: 313 KVIEEKG 319 >gi|326203479|ref|ZP_08193343.1| hypothetical protein Cpap_1523 [Clostridium papyrosolvens DSM 2782] gi|325986299|gb|EGD47131.1| hypothetical protein Cpap_1523 [Clostridium papyrosolvens DSM 2782] Length = 429 Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats. 
Identities = 33/176 (18%), Positives = 59/176 (33%), Gaps = 28/176 (15%) Query: 68 DRQVIRMTINETPHYNEQERKRIID--SYPLHEREARTKGEPILGSGRIFPIVEEDI-VI 124 R I + + E++ I + P ER+A +G + + FP + DI Sbjct: 153 TRCYIPAKVYDNIFLMEKDPNYITNLMQLPERERDALLEGSWDIFDDQAFPEFDPDIHTY 212 Query: 125 NS---L---DIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFH- 177 + IP+HW + D G+ PFA + V + + Y E+ P Sbjct: 213 DPEKTFKNGQIPKHWKRWRSADNGYDDPFAFYWHAIDEHGHV-WTYREYTRSEKDPKVAY 271 Query: 178 ---VAALKSWGKWLP-------------WAWPHDG-LQHDKRSGEQLSAQYRRQGM 216 A + S +L HD +++ + + Y G+ Sbjct: 272 KDQAAEVVSRSTYLNEETGLLEPEKILYTVIGHDAFFSNERLDAKSIEEFYNEGGL 327 >gi|227534164|ref|ZP_03964213.1| large subunit terminase [Lactobacillus paracasei subsp. paracasei ATCC 25302] gi|227188204|gb|EEI68271.1| large subunit terminase [Lactobacillus paracasei subsp. paracasei ATCC 25302] Length = 432 Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 51/147 (34%), Gaps = 10/147 (6%) Query: 44 VTLTLTPLKGRSPIIEHY--LSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREA 101 +T P + + + + + T + P+ ++ + D + A Sbjct: 165 SIITFNPWSDQHWLKREFFDQDTKNPRSKSFTTTYEDNPYLDDDYIASLKDMVKRNPNRA 224 Query: 102 RTK--GEPILGSGRIFPIVEEDIVINSLDIPE--HWVQIGGMDFGWHH-PFAAGHLVWNR 156 R G+ + G +F + E + + + G+DFG+ H P A + ++ Sbjct: 225 RVAVYGDWGIAEGLVFDGLFEQ---RDFSMEDIAALPKAVGLDFGFKHDPTAGEFMAIDQ 281 Query: 157 DSDVIYVVKNYRCREQTPIFHVAALKS 183 + V+Y+ + + L S Sbjct: 282 QNRVVYIYDEFYKQGLLTGQIAKELAS 308 >gi|46401884|ref|YP_006983.1| terminase, large subunit [Enterobacteria phage T5] gi|45775062|gb|AAS77194.1| terminase, large subunit [Enterobacteria phage T5] gi|59897286|gb|AAX12081.1| ORF144 [Enterobacteria phage T5] Length = 438 Score = 47.7 bits (112), Expect = 0.001, Method: Composition-based stats. 
Identities = 41/201 (20%), Positives = 78/201 (38%), Gaps = 30/201 (14%) Query: 63 SASSSDRQVIRMTINETPHY----NEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIV 118 + + I T + P E+ R+ + +Y E EA + G+IF Sbjct: 202 DDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQEYEADF----SVFEGQIFDTF 257 Query: 119 EEDIVINSLDIPEHWVQ-------IGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCRE 171 + L H+ + + G+D G+ P A + ++ D+D YV++ Y+ E Sbjct: 258 NAIDHVKDLKGMRHFFKDDEAFETLLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAE 317 Query: 172 QTPIFHVAALKSWGKWLPWAWPH--DGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGS 229 +T H A ++ H D + D+ + +AQ+R Q + E A+ Sbjct: 318 KTTAQHAAYIQ-----------HCIDRYKVDRIFVDSAAAQFR-QDLAYEHEIASAP-AK 364 Query: 230 NGVEAGISDMLDRMRSGRWKV 250 V G++ + + G+ V Sbjct: 365 KSVLDGLACLQALFQQGKIIV 385 >gi|226305996|ref|YP_002765956.1| hypothetical protein RER_25090 [Rhodococcus erythropolis PR4] gi|226185113|dbj|BAH33217.1| hypothetical protein RER_25090 [Rhodococcus erythropolis PR4] Length = 402 Score = 47.7 bits (112), Expect = 0.001, Method: Composition-based stats. 
Identities = 44/224 (19%), Positives = 81/224 (36%), Gaps = 23/224 (10%) Query: 32 EGLTRINA-TQGLVTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYN----EQE 86 E + A +G TP +S LS ++++ +V T + P + ++ Sbjct: 134 EIIRATLADYRGRALFMGTPKGYKSLYRLEKLSKTNANYEVFHFTSFDNPFLSVEELDEM 193 Query: 87 RKRIIDSYPLHEREARTKGEPILGSGRIFPI--VEEDIVINSLDIPEHWVQIGGMDFGWH 144 R + + E A G I+ ++ I PE W +DFG++ Sbjct: 194 RGEMTVTQYAQEMLAEYHKM----EGLIYEEFNRDQHIKALPFT-PERWAL--SIDFGYN 246 Query: 145 HPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSG 204 HPFAAG D+ +++ + R+ + + A++ D D Sbjct: 247 HPFAAGIFAIGSDNS-LHLDRMVYKRKLSDEQRMNAVRDLIGDTKL----DFQIGDSEDP 301 Query: 205 EQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSGRW 248 + R+ G+K+ P G+ V GI+ + GR Sbjct: 302 LAIDTLNRQLGLKIQPVV----KGAGSVLEGINKSKSLLHQGRL 341 >gi|282851552|ref|ZP_06260917.1| phage terminase, large subunit, PBSX family [Lactobacillus gasseri 224-1] gi|282557520|gb|EFB63117.1| phage terminase, large subunit, PBSX family [Lactobacillus gasseri 224-1] Length = 409 Score = 47.7 bits (112), Expect = 0.001, Method: Composition-based stats. 
Identities = 37/232 (15%), Positives = 84/232 (36%), Gaps = 26/232 (11%) Query: 28 DVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQV--IRMTINETPHYNEQ 85 +V+ E + R + + P + Y+ ++ TI++ ++ Sbjct: 139 EVFQEIVQRCSVRSARIICDTNPDIPTHWLKTDYIDNHDPKARIKAFSFTIDDNTFLSKD 198 Query: 86 ERKRIIDSYPLHEREART-KGEPILGSGRIF-PIVEEDIVINSLDIPEHWVQIGGMDFGW 143 + + + P R+ G+ + G G ++ ++ +VI +P+ G+D+G+ Sbjct: 199 YVEALKAATPRGMFYDRSILGQWVTGDGIVYQDFNKDKMVIPKNRVPDGLDYYVGVDWGY 258 Query: 144 HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRS 203 HP L ++D + YV+++Y + K W+ A Q+ + Sbjct: 259 EHPNPIILLGDDKDGNT-YVLEDYTQKH----------KFINYWVKIA------QNLQTR 301 Query: 204 GEQLSAQYRRQGMKMLPECATFD-----DGSNGVEAGISDMLDRMRSGRWKV 250 + Y + + + V GI + +MR G++ V Sbjct: 302 FGRNLIFYADSARPDNVNEFQSNGLNCINANKNVLPGIECVAKKMREGKFYV 353 >gi|281416362|ref|YP_003347496.1| terminase large subunit [Enterococcus phage phiFL1A] gi|270209192|gb|ACZ63738.1| terminase large subunit [Enterococcus phage phiFL1A] gi|270209254|gb|ACZ63799.1| terminase large subunit [Enterococcus phage phiFL1B] gi|270209324|gb|ACZ63868.1| terminase large subunit [Enterococcus phage phiFL1C] Length = 413 Score = 47.3 bits (111), Expect = 0.002, Method: Composition-based stats. 
Identities = 41/255 (16%), Positives = 99/255 (38%), Gaps = 30/255 (11%) Query: 7 GRDKWQSNTVHYVWFDEEP--PEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSA 64 G + T + + +E ++V+ E ++R +AT + P + + Y+ Sbjct: 116 GVGSIRGMTAYGAYINEASLAKQEVFAEIVSRCSATGARILADTNPDNPEHWLKKEYIDN 175 Query: 65 SSSDRQVIRMTINETPHYNEQERKRIIDSYPL---HEREARTKGEPILGSGRIF-PIVEE 120 SS Q +++ +E+ R I S P ++R+ KG + G ++ Sbjct: 176 SSKSIQSFHFGLDDNTFLSERYRTNIKASTPSGMFYDRDI--KGLWVSADGVVYKDFDAN 233 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA 180 I+S D+P G+D+G+ H + + ++ + Y+V+ + + I + Sbjct: 234 KHYIDSSDLPPLAKYYCGVDWGYDH-WGSIVVIGETEDGTAYLVEEH-ASQYEEIDYWVG 291 Query: 181 LKS-----WGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAG 235 + +G +P+ +H +++ R+G+ + + +G Sbjct: 292 IAKEIQARYGGRIPFYCDSARPEH--------VSRFVREGLNAI-------NAFKARLSG 336 Query: 236 ISDMLDRMRSGRWKV 250 + + R ++ R + Sbjct: 337 VESVAKRFKTNRLYI 351 >gi|196048452|ref|ZP_03115627.1| phage terminase, large subunit, pbsx family [Bacillus cereus 03BB108] gi|196020709|gb|EDX59441.1| phage terminase, large subunit, pbsx family [Bacillus cereus 03BB108] Length = 419 Score = 47.3 bits (111), Expect = 0.002, Method: Composition-based stats. 
Identities = 38/187 (20%), Positives = 69/187 (36%), Gaps = 23/187 (12%) Query: 18 YVWFDE--EPPEDVYFEGLTRINA-------TQGLVTLTLTPLKGRSPIIEHYLSA--SS 66 + WF+E E + FE + +T+T P + ++ + Sbjct: 128 WAWFEEAYEIEDQHKFETVVESIRGSFDAPDFFKQITVTFNPWSENHWLKSYFFDEATQA 187 Query: 67 SDRQVIRMTINETPHYNEQERKRIIDSYPLHEREAR--TKGEPILGSGRIFPIVEEDIVI 124 D I T +EQ+R R Y + R AR GE + G ++ E+ + Sbjct: 188 YDTFAITTTYKCNEWLDEQDRARYESLYVKNPRRARIVCDGEWGVADGLVY----ENFQV 243 Query: 125 NSLDIPEHWVQ-----IGGMDFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHV 178 DI E + G+DFG+ + P A + + ++ IYV + + + Sbjct: 244 RDFDIDEIRQRKDVQSAFGLDFGYTNDPTALTCSLVDLKNETIYVFDEHSQKGMSNKKIA 303 Query: 179 AALKSWG 185 A ++ G Sbjct: 304 AMIEKKG 310 >gi|77405323|ref|ZP_00782419.1| phage terminase, large subunit, PBSX family [Streptococcus agalactiae H36B] gi|77176118|gb|EAO78891.1| phage terminase, large subunit, PBSX family [Streptococcus agalactiae H36B] Length = 426 Score = 46.1 bits (108), Expect = 0.004, Method: Composition-based stats. Identities = 45/225 (20%), Positives = 85/225 (37%), Gaps = 28/225 (12%) Query: 28 DVYFEGLTRINATQGL---VTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNE 84 D Y + R+ + L V L P+ + + + + S + V + T + ++ Sbjct: 148 DDYTQLTLRLRDKKHLEKQVYLMFNPVSKANWVYKAFFIKSPKNTVVYQTTYKDNRFLDD 207 Query: 85 QERKRIIDSYPLHEREARTKGEPILGS-----GRIFPIVEEDIVINSLDIPEHWVQIGGM 139 R+ I + + EA K LG IFP E+ I+ D H G+ Sbjct: 208 VTRENIEEL--ANRNEAYYK-IYALGQFATLEKLIFPKYEKRILNK--DKLSHLPSFFGL 262 Query: 140 DFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQ 198 D+G+ + P A H+ + + +Y+++ Y + T A+KS G + Sbjct: 263 DYGFINDPSAFLHVKIDDANKKLYILEEYVRKNLTNDKIANAIKSLGY---------AKE 313 Query: 199 HDKRSGEQLS--AQYRRQGMKMLPECATFDDGSNGVEAGISDMLD 241 + + + R G++ + + G+ V GI ML Sbjct: 314 EIRGDSAEKKSNQELRNLGIQRMID---VKKGAGSVMQGIQYMLQ 355 >gi|237748196|ref|ZP_04578676.1| conserved hypothetical protein [Oxalobacter formigenes OXCC13] gi|229379558|gb|EEO29649.1| conserved hypothetical protein [Oxalobacter formigenes OXCC13] Length = 474 Score = 46.1 bits (108), 
Expect = 0.005, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 48/144 (33%), Gaps = 20/144 (13%) Query: 123 VINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKN-YRCREQTPIFHVAAL 181 +++ IP W MD+G+ P+A + D IY+ + Y E+ Sbjct: 259 IVDPFPIPSSWTVWKAMDWGYSAPYAVYWFAMDCDG-CIYLWRELYGAGEKAGQGSREGA 317 Query: 182 KSWGKWLPWAWPHDGLQ--------------HDKRSGEQLSAQYRRQGMKMLPECATFDD 227 + + HD + + +R G++ L ++ Sbjct: 318 AEVARKIKRIEEHDNRLGYEYRLNLADPSIFSKNGTDRSIGQIFRDNGVRWL---EAWNA 374 Query: 228 GSNGVEAGISDMLDRMRSGRWKVF 251 + V G +++ + + K+F Sbjct: 375 KGSRVN-GAQEIIRLLAEDKLKIF 397 >gi|329919929|ref|ZP_08276833.1| phage terminase, large subunit, PBSX family [Lactobacillus iners SPIN 1401G] gi|328936867|gb|EGG33301.1| phage terminase, large subunit, PBSX family [Lactobacillus iners SPIN 1401G] Length = 409 Score = 45.8 bits (107), Expect = 0.005, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 70/177 (39%), Gaps = 14/177 (7%) Query: 2 KAY---EQGRDKWQSNTVHYVWFDEEP--PEDVYFEGLTRINATQGLVTLTLTPLKGRSP 56 KAY E+GRD + T + +E V+ E R +A + + P Sbjct: 108 KAYTGSERGRDSIRGMTAWGAYINEASLAKASVFSEIQKRCSAPEARIICDTNPDAPTHW 167 Query: 57 IIEHYLSASSSDRQVIRM--TINETPHYNEQERKRIIDSYPL---HEREARTKGEPILGS 111 + ++Y+ + + T ++ P ++ ++++ S P ++R+ G G Sbjct: 168 LKKNYIDNTDPKAGIKTFFFTFDDNPTLDDDYKEKLKASTPSGVFYDRDI--LGLWCTGE 225 Query: 112 GRIF-PIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNY 167 G ++ + + I+ +P G+D+G+ H + + Y+++ + Sbjct: 226 GVVYRDFDQSTMTIDRYKLPTDLTYYVGVDWGYEHTGTLIVFADDSQGNT-YLIEEH 281 >gi|225871048|ref|YP_002746995.1| phage terminase, large subunit [Streptococcus equi subsp. equi 4047] gi|225700452|emb|CAW94859.1| putative phage terminase, large subunit [Streptococcus equi subsp. equi 4047] Length = 415 Score = 45.8 bits (107), Expect = 0.005, Method: Composition-based stats. 
Identities = 42/225 (18%), Positives = 82/225 (36%), Gaps = 28/225 (12%) Query: 28 DVYFEGLTRINATQGL---VTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNE 84 D Y + R+ + L + L P+ + + + + + + V + T + ++ Sbjct: 134 DDYTQLTLRLRDRKHLKKQIFLMFNPVSKVNWVYKAFFVKTPKNTVVYQTTYKDNRFLDD 193 Query: 85 QERKRIIDSYPLHEREARTKGEPILGS-----GRIFPIVEEDIVINSLDIPEHWVQIGGM 139 R+ I + + EA K LG IFP E+ I+ D H G+ Sbjct: 194 VTRENIEEL--ANRNEAYYK-IYALGQFATLDKLIFPKYEKKILNK--DKLSHLPSFFGL 248 Query: 140 DFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQ 198 D+G+ + P A H+ + ++ +Y+++ Y + T A+K G + Sbjct: 249 DYGFINDPSAFLHVKIDDNNKKLYILEEYVRKNLTNDKIANAIKDLGY---------AKE 299 Query: 199 HDKRSGEQLS--AQYRRQGMKMLPECATFDDGSNGVEAGISDMLD 241 + + + R G +P G V GI +L Sbjct: 300 EIRGDSAEKKSNQELRNLG---IPRIIDVKKGPGSVMQGIQYLLQ 341 >gi|148544063|ref|YP_001271433.1| PBSX family phage terminase large subunit [Lactobacillus reuteri DSM 20016] gi|184153441|ref|YP_001841782.1| putative phage terminase [Lactobacillus reuteri JCM 1112] gi|227365124|ref|ZP_03849143.1| PBSX family phage terminase large subunit [Lactobacillus reuteri MM2-3] gi|325682397|ref|ZP_08161914.1| prophage Lp1 protein 38 [Lactobacillus reuteri MM4-1A] gi|148531097|gb|ABQ83096.1| phage terminase, large subunit, PBSX family [Lactobacillus reuteri DSM 20016] gi|183224785|dbj|BAG25302.1| putative phage terminase [Lactobacillus reuteri JCM 1112] gi|227069827|gb|EEI08231.1| PBSX family phage terminase large subunit [Lactobacillus reuteri MM2-3] gi|324978236|gb|EGC15186.1| prophage Lp1 protein 38 [Lactobacillus reuteri MM4-1A] Length = 431 Score = 45.4 bits (106), Expect = 0.007, Method: Composition-based stats. 
Identities = 40/204 (19%), Positives = 73/204 (35%), Gaps = 18/204 (8%) Query: 44 VTLTLTPLKGRSPIIEHYLSAS--SSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREA 101 +T+T P + + + +D T +EQ+R+R +D Y + R A Sbjct: 163 ITVTFNPWNAQHWLKRTFFDPETRKADTFAQTTTFRCNEWLDEQDRQRYLDLYKTNPRRA 222 Query: 102 R--TKGEPILGSGRIFPIVEEDIVINSLD-IPEHWVQIGGMDFGWH-HPFAAGHLVWNRD 157 + G+ + G +F E + + + + E G+D+G+ P A L + Sbjct: 223 KVAADGDWGVSEGLVFEDNVERVEFDPQEKLTECGHAGFGLDYGFGGDPNAFVALAIDPK 282 Query: 158 SDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMK 217 S I++ QT LK G + H + D S E+ + Q + Sbjct: 283 SKNIWIYDEMYTYHQTTPHIADWLKKNG------YQHASIYADSASPER-TQQLLDLDID 335 Query: 218 MLPECATFDDGSNGVEAGISDMLD 241 + +EAGI + Sbjct: 336 NIQSVVKTP-----IEAGIDQLWQ 354 >gi|300768463|ref|ZP_07078363.1| PBSX family phage terminase [Lactobacillus plantarum subsp. plantarum ATCC 14917] gi|300493981|gb|EFK29149.1| PBSX family phage terminase [Lactobacillus plantarum subsp. plantarum ATCC 14917] Length = 412 Score = 45.4 bits (106), Expect = 0.008, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 58/159 (36%), Gaps = 6/159 (3%) Query: 27 EDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQVI--RMTINETPHYNE 84 E+V+ E L R +A + P + Y+ + TI++ Sbjct: 138 EEVFNEILNRCSAQGARIICDTNPDVPTHYLKASYIDNDDPKAGTVSFHFTIDDNTFLPP 197 Query: 85 QERKRIIDSYPLHEREART-KGEPILGSGRIF-PIVEEDIVINSLDIPEHWVQIGGMDFG 142 Q + P R G + G G ++ +++++I +P G+D+G Sbjct: 198 QYVEHQKAGTPSGMFYDRAILGLWVSGEGMVYKDFNKDEMIIPRAQLPADLTYYAGVDWG 257 Query: 143 WHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAAL 181 + H + +R + Y+++ + R+ I + + Sbjct: 258 YEHKGTIVVMADDRVGNT-YLIEEH-TRQFEEIDYWVEI 294 >gi|163932181|ref|YP_001642371.1| putative phage terminase large subunit B [Lactobacillus johnsonii prophage Lj771] gi|163562135|gb|ABY26991.1| putative phage terminase large subunit B [Lactobacillus johnsonii prophage Lj771] Length = 409 Score = 45.0 bits (105), Expect = 0.010, Method: Composition-based stats. 
Identities = 37/231 (16%), Positives = 80/231 (34%), Gaps = 28/231 (12%) Query: 28 DVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQV--IRMTINETPHYNEQ 85 +V+ E + R + + P + Y+ ++ TI++ + Sbjct: 139 EVFQEIVQRCSVGSARIICDTNPDIPTHWLKTDYIDNHDPKARIKAFSFTIDDNTFLAKD 198 Query: 86 ERKRIIDSYPLHEREART-KGEPILGSGRIF-PIVEEDIVINSLDIPEHWVQIGGMDFGW 143 + + + P R+ G+ + G G ++ +VI+ +IP+ G+D+G+ Sbjct: 199 YVEALKAATPRGMFYDRSILGQWVTGEGIVYQDFNANTMVIDDKNIPDGLNYYCGVDWGF 258 Query: 144 HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRS 203 HP L + + YV+K++ R + I + + LQ + Sbjct: 259 EHPNPILLLGDDNQGNT-YVIKDFTKRHKF-ISYWVDIAKR------------LQTEYG- 303 Query: 204 GEQLSAQYRRQGMKMLPECATFDDGSNGVE------AGISDMLDRMRSGRW 248 + Y +G N + GI + +MR G++ Sbjct: 304 --RNLIFYVDSARPDNLNEFQ-SNGINAINANKNILPGIEFVAQKMRQGKF 351 >gi|329667176|gb|AEB93124.1| phage terminase large subunit [Lactobacillus johnsonii DPC 6026] Length = 424 Score = 44.6 bits (104), Expect = 0.012, Method: Composition-based stats. Identities = 33/174 (18%), Positives = 71/174 (40%), Gaps = 9/174 (5%) Query: 5 EQGRDKWQSNTVHYVWFDEEP--PEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYL 62 E +D Q T+ +FDE P+ + R + + + P ++ Sbjct: 121 EASQDLVQGITLAGFFFDEVALMPQSFVNQATARCSVSGAKMWFNCNPSGPYHWFKLDWI 180 Query: 63 SASSSDRQV-IRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFP-IVEE 120 R + + T+++ P ++ R Y + +G ++ G I+ ++ Sbjct: 181 DRLKGKRALRLHFTMHDNPSLDKATISRYERMYSGVFYQRYIQGLWVMSEGVIYDNFDKD 240 Query: 121 DIVINSLDIPEHWVQIG-GMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQT 173 +V+N ++P H+ + D+G +P A L+W R+ + Y++K Y +T Sbjct: 241 TMVVN--ELPNHFEKYYVSCDYGTLNPTA--FLLWGRNHGIWYLIKEYYYSGRT 290 >gi|260664700|ref|ZP_05865552.1| PBSX family phage terminase, large subunit [Lactobacillus jensenii SJ-7A-US] gi|260561765|gb|EEX27737.1| PBSX family phage terminase, large subunit [Lactobacillus jensenii SJ-7A-US] Length = 408 Score = 44.6 bits (104), Expect = 0.012, Method: Composition-based stats. 
Identities = 22/122 (18%), Positives = 46/122 (37%), Gaps = 4/122 (3%) Query: 27 EDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQ--VIRMTINETPHYNE 84 V+ E + R + + P + Y+ + + V TI++ + Sbjct: 138 YAVFQEIIQRCSQPNARIICDTNPDTPTHWLKAKYIDNKKPEAKIKVYNFTIDDNTFLDP 197 Query: 85 QERKRIIDSYPLHEREART-KGEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGW 143 K + S P R KG + G G ++P ++ ++ +P+ G+D+G+ Sbjct: 198 DYVKTLKASTPSGMFYDRNIKGLWVTGDGVVYPDFDKKRMVVD-KVPDGLTCYCGVDWGF 256 Query: 144 HH 145 H Sbjct: 257 EH 258 >gi|134095464|ref|YP_001100539.1| putative terminase, large subunit [Herminiimonas arsenicoxydans] gi|133739367|emb|CAL62417.1| Conserved hypothetical protein [Herminiimonas arsenicoxydans] Length = 437 Score = 43.8 bits (102), Expect = 0.021, Method: Composition-based stats. Identities = 39/221 (17%), Positives = 71/221 (32%), Gaps = 35/221 (15%) Query: 55 SPIIEHYLSASSSDRQVIRMTINETPHYNEQE---RKRIIDSYPLHEREARTKGEPILGS 111 S + R IR +I+E H E + K + + R A +G+ + Sbjct: 158 SGQVIRLEGEQP--RVRIRSSIHENTHLLESDPDYLKTLEALKDPNRRRAWLEGDWDIHV 215 Query: 112 GRIFPIVEEDIV--INSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKN-YR 168 G F V + ++ IP W MD+G+ P+ L + D IYV + Y Sbjct: 216 GSFFEGVWDAKRHIVDPFPIPASWQVWKAMDWGFAAPYCVLWLAMDPDG-CIYVWRELYG 274 Query: 169 CRE---QTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQ----------- 214 E + H + + + + D+R G + Sbjct: 275 AGEKVGEGSREHADVVAKKVRTI--------EERDERLGYEYRMNLADPSIFSNTGVNTT 326 Query: 215 -GMKMLPECATFDDGSN---GVEAGISDMLDRMRSGRWKVF 251 G + + N + G +++ M + K+F Sbjct: 327 IGAIFRKAGVKWQEAWNAKGSIANGAQEIMRLMGDDKLKIF 367 >gi|326775607|ref|ZP_08234872.1| phage terminase, large subunit, PBSX family [Streptomyces cf. griseus XylebKG-1] gi|326655940|gb|EGE40786.1| phage terminase, large subunit, PBSX family [Streptomyces cf. griseus XylebKG-1] Length = 416 Score = 43.4 bits (101), Expect = 0.025, Method: Composition-based stats. 
Identities = 50/257 (19%), Positives = 94/257 (36%), Gaps = 20/257 (7%) Query: 5 EQGRDKWQSNTVHYVWFDEEP--PEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYL 62 + ++ + T DE P++ + + L R++ + + P + ++ Sbjct: 111 SRAEERLRGMTCAGALVDEATLVPQEFWTQLLGRMSVPGAKLFASTNPGSPAHWLKRDFI 170 Query: 63 SASSS-DRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPI-VEE 120 + +++ P + + I + + GE I G IF + EE Sbjct: 171 DRRDELGIRYWHYVLDDNPSLGDDYKNSIKNEFVGLWYRRFVLGEWIAAEGSIFDMWDEE 230 Query: 121 DIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYR------CREQTP 174 V+++L W+ + G+D+G +PF A L RD +Y +R R+ T Sbjct: 231 KHVVDTLPEIAKWISV-GVDYGQTNPFHATLLGLGRDR-RLYAASEWRYDGRQQRRQLTD 288 Query: 175 IFHVAALKSWGKWLPWAWP-HDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVE 233 I + ++ W + P S SAQ RR + T +N V Sbjct: 289 IEYSERMRGWLSNVAGIGPVRPQFVTVDPSAASFSAQLRR-------DRLTPTPANNAVL 341 Query: 234 AGISDMLDRMRSGRWKV 250 GI M + +G+ V Sbjct: 342 DGIRTMASLLSAGKLVV 358 >gi|302385792|ref|YP_003821614.1| phage terminase, large subunit, PBSX family [Clostridium saccharolyticum WM1] gi|302196420|gb|ADL03991.1| phage terminase, large subunit, PBSX family [Clostridium saccharolyticum WM1] Length = 428 Score = 43.4 bits (101), Expect = 0.026, Method: Composition-based stats. 
Identities = 42/219 (19%), Positives = 83/219 (37%), Gaps = 41/219 (18%) Query: 1 LKAYEQGRDKWQSNTVHYVWFDE--EPPEDVYFEGLTRINATQ--GLVTLTLTPLKGRSP 56 K ++ N V VW +E E + E L R+ + + L+ P+ + Sbjct: 106 FKGMDKPAKLKSLNGVSIVWIEECSEVKYAGFKEILGRLRHPELSNHIILSTNPVSKSNW 165 Query: 57 IIEHYLSASSSDRQVIR-------------------MTINETPHYNEQERKRIIDSYPLH 97 + +H+ S+ +++ T+++ ++ QE +D H Sbjct: 166 VYKHFFQDKSTGYKILNDEELYTRRIVVIGNTYYHHSTVDDN-YFVPQEYVEQLDELQQH 224 Query: 98 E----REARTKGEPILGSGRIFPIVEEDIVINSLDIPEHWVQI-------GGMDFGWHHP 146 + R AR KG + +FP V+ D + +++ GMDFG+ Sbjct: 225 DPDLYRIAR-KGRFGVNGRLVFP----QFVVKPDDEVKTLIKLIRNPVEKNGMDFGFVTS 279 Query: 147 -FAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSW 184 A ++ + D+ ++Y+ Y R++T A L W Sbjct: 280 YNAVVRMMIDHDNKILYLYDEYYSRDKTDPEIAADLAKW 318 >gi|302541176|ref|ZP_07293518.1| prophage terminase large subunit [Streptomyces hygroscopicus ATCC 53653] gi|302458794|gb|EFL21887.1| prophage terminase large subunit [Streptomyces himastatinicus ATCC 53653] Length = 270 Score = 43.4 bits (101), Expect = 0.030, Method: Composition-based stats. Identities = 42/181 (23%), Positives = 68/181 (37%), Gaps = 21/181 (11%) Query: 78 ETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEE--DIVINSLDIPEHWVQ 135 + P + + + Y R G ++ G I+ + +E +V D+ +W+ Sbjct: 36 DNPSLSPEYVADLAAEYVGLWRRRMIDGAWVVAEGAIYDMWDEGRHVVAELPDVRRYWL- 94 Query: 136 IGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYR------CREQTPIFHVAALKSWGKWLP 189 G D+G +PF+A L D D +YV +R R T + AA+++W L Sbjct: 95 --GCDYGTTNPFSAILLGEGVD-DRLYVAAEWRHDSRATHRSMTDAQYSAAVRAWLADL- 150 Query: 190 WAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSGRWK 249 P S S Q + G A SN V GI + + +GR Sbjct: 151 GIVPEWTFI--DPSAASFSTQMWQDG------HAGLARASNDVADGIRSVSSLLAAGRLL 202 Query: 250 V 250 V Sbjct: 203 V 203 >gi|326693191|ref|ZP_08230196.1| phage terminase, large subunit, PBSX family [Leuconostoc argentinum KCTC 3773] Length = 430 Score = 43.1 bits (100), Expect = 0.032, Method: Composition-based stats. 
Identities = 34/200 (17%), Positives = 73/200 (36%), Gaps = 15/200 (7%) Query: 45 TLTLTPLKGRSPIIEHY--LSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREAR 102 +T P R + + + ++ T H N+ + + + + A+ Sbjct: 165 VITFNPWSDRHWLKREFFDVDTRRNNTLAFTTTYKNNHHLNDDFIEAMKEMVVRNPNRAK 224 Query: 103 TK--GEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHH-PFAAGHLVWNRDSD 159 G+ + G +F + E + +I + + G+DFG+ H P A + ++ + Sbjct: 225 VAVFGDWGISEGLVFDGLFEQRDFSMEEIAK-LPKSIGLDFGFKHDPTAGEFMAIDQTNR 283 Query: 160 VIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKML 219 V+YV + + AL + + P ++R +L++ Y + Sbjct: 284 VVYVYDEFYQQGMLTQMIAQAL---AQHKAYGLPITADSAEQRLTTELASVYG------V 334 Query: 220 PECATFDDGSNGVEAGISDM 239 P T G + V G+ M Sbjct: 335 PNLRTAGKGKDSVIQGVQYM 354 >gi|42526667|ref|NP_971765.1| phage terminase, large subunit, putative [Treponema denticola ATCC 35405] gi|41816860|gb|AAS11646.1| phage terminase, large subunit, putative [Treponema denticola ATCC 35405] Length = 417 Score = 43.1 bits (100), Expect = 0.036, Method: Composition-based stats. Identities = 26/113 (23%), Positives = 49/113 (43%), Gaps = 4/113 (3%) Query: 61 YLSASSSDRQVIRMTINETPHYNEQER--KRIIDSYPLHEREARTKGEPILGSGRIFP-I 117 Y + R I +++ + + ++ ++ + P + EA KG + +G F Sbjct: 140 YTDEDGNTRCFIPSRLDDNDYLIKNDKGYEKRLRLLPKYLYEALRKGNWDIIAGSAFEEF 199 Query: 118 VEEDIVINSLDI-PEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRC 169 E VI + + P W + MD+G+ PF+ G +RD +I + Y C Sbjct: 200 SRESHVIKPIALDPGVWFKFCSMDWGYSRPFSIGWWAVSRDGRMIRYRELYGC 252 >gi|299531659|ref|ZP_07045064.1| putative phage associated protein [Comamonas testosteroni S44] gi|298720375|gb|EFI61327.1| putative phage associated protein [Comamonas testosteroni S44] Length = 436 Score = 43.1 bits (100), Expect = 0.041, Method: Composition-based stats. 
Identities = 46/231 (19%), Positives = 87/231 (37%), Gaps = 45/231 (19%) Query: 37 INATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINET---PHYNEQERKR---- 89 I + LTL P + +++ S D V+ + + P ++ER++ Sbjct: 152 IRKEGSEIWLTLNPDMETDETYQRFIATPSPDTWVVEINWRDNPWFPRVLDEERRKAKRT 211 Query: 90 IIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVIN------SLD--IPEH--WVQIGGM 139 ++ H E + + +G I+ E + ++ D +P H W Sbjct: 212 MLADDYAHIWEGKARRV---AAGAIYRHEMESVYLDNRARDVPYDPTLPVHTVW------ 262 Query: 140 DFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAW-----PH 194 D GW+ + + R + ++ + +T ++VA L + LP+ W PH Sbjct: 263 DLGWNDAMSIALV--QRGPQDVRIIGHIEDSHRTLDWYVAKL----EKLPYRWGTDYLPH 316 Query: 195 DGLQHDKRSGEQLSAQYRRQGMK--MLPECATFDDGSNGVEAGISDMLDRM 243 DG + ++G+ R G + M+ AT VE GI + M Sbjct: 317 DGKTKNFQTGKSTEQLLRELGRRSVMVQPRAT------DVEEGIKQVRMLM 361 >gi|300361372|ref|ZP_07057549.1| PBSX family phage terminase [Lactobacillus gasseri JV-V03] gi|300353991|gb|EFJ69862.1| PBSX family phage terminase [Lactobacillus gasseri JV-V03] Length = 412 Score = 42.7 bits (99), Expect = 0.044, Method: Composition-based stats. Identities = 38/231 (16%), Positives = 79/231 (34%), Gaps = 22/231 (9%) Query: 27 EDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQVI--RMTINETPHYNE 84 E V+ E R + + P + +Y+ + D V+ TI++ Sbjct: 141 EAVFNEIQNRCSKGGSHIICDTNPDIPTHWLKTNYIDNKNPDAGVVSFNFTIDDNTTLAS 200 Query: 85 QERKRIIDSYPLHEREARTKGEPILGSGRIF-PIVEEDIVINSLDIPEHWVQIGGMDFGW 143 K + S + G G G ++ ++ +V++ ++P+ G+D+G+ Sbjct: 201 DYVKSMKASKIGVFYDRDILGLWATGDGIVYQDFNKDTMVVD--EVPDDLEYYCGVDWGF 258 Query: 144 --HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDK 201 H + + D+D+ Y++ Y+ K W+ A +Q + Sbjct: 259 AKGHENVITVMGDDPDTDISYLIGVYKSTG----------KYIDYWVDIA---QQIQDKR 305 Query: 202 RSGEQLSAQYRRQGMK--MLPECATFDDGSNGVEAGISDMLDRMRSGRWKV 250 G R + + V GI R++ G++KV Sbjct: 306 GYGINFWCDSARPEYVSYFQQQDIQARNADKSVMDGIEYCSSRIKLGKFKV 356 >gi|251777775|ref|ZP_04820695.1| phage terminase, large subunit, pbsx family [Clostridium botulinum E1 str. 
'BoNT E Beluga'] gi|243082090|gb|EES47980.1| phage terminase, large subunit, pbsx family [Clostridium botulinum E1 str. 'BoNT E Beluga'] Length = 452 Score = 42.7 bits (99), Expect = 0.046, Method: Composition-based stats. Identities = 48/246 (19%), Positives = 83/246 (33%), Gaps = 23/246 (9%) Query: 5 EQGRDKWQSNTVHYVWFDEEP--PEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYL 62 E +D Q T+ V FDE P+ + +R + + P +L Sbjct: 129 EGSQDLIQGITLAGVLFDEVALMPQSFVNQATSRCSVDGAKMWFNCNPDGPYHWFKTEFL 188 Query: 63 SASSSDRQV-IRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPI-VEE 120 V + T+++ +E+ ++R Y + G L G I+ + E+ Sbjct: 189 DKLKEKNAVHLHFTMDDNLSLSERVKERYKRMYSGIFYKRYILGLWCLAEGVIYDMFNED 248 Query: 121 DIVINSLDIPEHWVQIG-GMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVA 179 + + I + + +D+G + A L+W D Y+VK Y + A Sbjct: 249 NHKVE--TIKRRYEKYYVSIDYGTQN--ATVFLLWGLYQDKWYIVKEYYYSGRNTGIQKA 304 Query: 180 AL-------KSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGM-KMLPECATFDDGSNG 231 + K G +P D S Q R G +LP DG Sbjct: 305 DIQYSKDLKKFLGDIIPVKIIVD------PSAASFIKQLRDDGFKNILPANNDVLDGIRT 358 Query: 232 VEAGIS 237 V + +S Sbjct: 359 VASALS 364 >gi|119953680|ref|YP_950600.1| putative large terminase subunit [Staphylococcus phage CNPH82] gi|112361306|gb|ABI15678.1| putative large terminase subunit [Staphylococcus phage CNPH82] gi|329736010|gb|EGG72285.1| phage terminase, large subunit, PBSX family [Staphylococcus epidermidis VCU045] Length = 421 Score = 42.7 bits (99), Expect = 0.046, Method: Composition-based stats. 
Identities = 34/184 (18%), Positives = 67/184 (36%), Gaps = 13/184 (7%) Query: 62 LSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGE-PILGSGRIFPIVEE 120 S + V T + P ++Q + + +E+ R + +GSG + P Sbjct: 186 TSFQPDNTFVHHSTYLDNPFISKQFIQEAESAKERNEQRYRWEYMGEAIGSGVV-PFNNL 244 Query: 121 DIVINSLDIPEHWVQI-GGMDFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHV 178 I D+ + + I +DFG+ P A +++ +IY V + + + Sbjct: 245 QIEKIPDDLYKTFDNIRNAVDFGYATDPLAFVRWHYDKKKRIIYAVDEHYGVQISNREFA 304 Query: 179 AALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISD 238 LK G + D + D + ++ + G+K + G + VE G Sbjct: 305 NWLKRRG------YQSDEIYADSAEPKSIAELKQEHGIKRIK---GVKKGPDSVEHGEQW 355 Query: 239 MLDR 242 + D Sbjct: 356 LDDL 359 >gi|300362021|ref|ZP_07058198.1| PBSX family phage terminase [Lactobacillus gasseri JV-V03] gi|300354640|gb|EFJ70511.1| PBSX family phage terminase [Lactobacillus gasseri JV-V03] Length = 409 Score = 42.7 bits (99), Expect = 0.048, Method: Composition-based stats. Identities = 35/227 (15%), Positives = 90/227 (39%), Gaps = 16/227 (7%) Query: 28 DVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQV--IRMTINETPHYNEQ 85 +V+ E + R + + P + Y+ ++ TI + ++ Sbjct: 139 EVFQEIVQRCSVRSARIICDTNPDIPTHWLKTDYIDNHDPKARIKAFSFTIGDNTFLSKD 198 Query: 86 ERKRIIDSYPLHEREART-KGEPILGSGRIF-PIVEEDIVINSLDIPEHWVQIGGMDFGW 143 + + + P R+ G+ + G G ++ ++ ++I +P+ G+D+G+ Sbjct: 199 YVEALKAATPRGMFYDRSILGQWVTGDGIVYQDFNKDTMIIPRNRVPDGLDYYVGVDWGY 258 Query: 144 HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRS 203 HP L ++D + YV+++Y + + I + + K L + + + + + Sbjct: 259 EHPNPIILLGDDKDGNT-YVLEDYTQKHKF-INYWVEI---AKNLQTRFGRNLIFYADSA 313 Query: 204 GEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSGRWKV 250 +++ G+ + + + V GI + +MR G++ V Sbjct: 314 RPDNVNEFQSNGLNCI-------NANKNVLPGIECVARKMREGKFYV 353 >gi|227498429|ref|ZP_03928575.1| phage terminase large subunit [Acidaminococcus sp. D21] gi|226903887|gb|EEH89805.1| phage terminase large subunit [Acidaminococcus sp. D21] Length = 418 Score = 42.7 bits (99), Expect = 0.053, Method: Composition-based stats. 
Identities = 46/235 (19%), Positives = 82/235 (34%), Gaps = 24/235 (10%) Query: 16 VHYVWFDEEPPEDVYFE--GLTRINATQGL---VTLTLTPLKGRSPIIEHYLSASSSDRQ 70 V VW +E E L + G V T P K RS + + DR Sbjct: 128 VGIVWLEELDQFSGMEEIRNLCQSLLRGGPQYWVFCTYNPPKSRSSWVNEEILVDDPDRM 187 Query: 71 VIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGS-----GRIFPIVEEDIVIN 125 V R T + P E + + ++ L ER LG G +F VE+ + Sbjct: 188 VHRSTYLQVPQKWLGE-QFLQEAEKLKERNEMAYRHEYLGEVTGTGGAVFENVEDLPLTE 246 Query: 126 SLDIPEHWVQIGGMDFGWH-HPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSW 184 + + ++ G+DFG+ P A + ++ + +Y+ ++ T + Sbjct: 247 E-ALGQFDHRLFGLDFGFAVDPLAFVAMHYDAKHEDLYIWGEIYEQKLTNPQAARKISQ- 304 Query: 185 GKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDM 239 P + ++ D + + R M ++ G + VE GI + Sbjct: 305 -----VILPGELVRCDSAEPKSIKEM-RSLDMNIIGA----PKGPDSVEYGIKWL 349 >gi|50913388|ref|YP_059360.1| terminase large subunit [Streptococcus pyogenes MGAS10394] gi|50902462|gb|AAT86177.1| Terminase large subunit [Streptococcus pyogenes MGAS10394] Length = 428 Score = 42.3 bits (98), Expect = 0.059, Method: Composition-based stats. 
Identities = 33/167 (19%), Positives = 67/167 (40%), Gaps = 14/167 (8%) Query: 28 DVYFEGLTRINATQGL---VTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNE 84 D Y + R+ + L + L P+ + + + + + + V + T + ++ Sbjct: 151 DDYTQLTLRLRDKKHLEKQIYLMFNPVSKVNWVYKAFFVKTPKNTVVYQTTYKDNRFLDD 210 Query: 85 QERKRIIDSYPLHEREARTKGEPILGS-----GRIFPIVEEDIVINSLDIPEHWVQIGGM 139 R+ I + + EA K LG IFP ++ I+ D H G+ Sbjct: 211 VTRENIEEL--ANRNEAYYK-IYALGQFATLDKLIFPKYDKQILNK--DKLSHLPSFFGL 265 Query: 140 DFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWG 185 D+G+ + P A H+ + + +Y+++ Y + T A+K G Sbjct: 266 DYGFINDPSALLHVKIDDANKKLYILEEYVRKNLTNDKIANAIKDLG 312 >gi|21910971|ref|NP_665239.1| putative terminase large subunit - phage associated [Streptococcus pyogenes MGAS315] gi|28876465|ref|NP_795683.1| putative terminase large subunit [Streptococcus pyogenes phage 315.6] gi|28895342|ref|NP_801692.1| hypothetical protein SPs0430 [Streptococcus pyogenes SSI-1] gi|157311161|ref|YP_001469206.1| terminase large subunit [Streptococcus phage P9] gi|225871330|ref|YP_002747277.1| phage terminase, large subunit [Streptococcus equi subsp. equi 4047] gi|21905179|gb|AAM80042.1| putative terminase large subunit - phage-associated [Streptococcus pyogenes MGAS315] gi|28810588|dbj|BAC63525.1| hypothetical protein [Streptococcus pyogenes SSI-1] gi|119104310|gb|ABL61055.1| terminase large subunit [Streptococcus phage P9] gi|225700734|emb|CAW95367.1| putative phage terminase, large subunit [Streptococcus equi subsp. equi 4047] Length = 425 Score = 42.3 bits (98), Expect = 0.059, Method: Composition-based stats. 
Identities = 33/167 (19%), Positives = 67/167 (40%), Gaps = 14/167 (8%) Query: 28 DVYFEGLTRINATQGL---VTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNE 84 D Y + R+ + L + L P+ + + + + + + V + T + ++ Sbjct: 148 DDYTQLTLRLRDKKHLEKQIYLMFNPVSKVNWVYKAFFVKTPKNTVVYQTTYKDNRFLDD 207 Query: 85 QERKRIIDSYPLHEREARTKGEPILGS-----GRIFPIVEEDIVINSLDIPEHWVQIGGM 139 R+ I + + EA K LG IFP ++ I+ D H G+ Sbjct: 208 VTRENIEEL--ANRNEAYYK-IYALGQFATLDKLIFPKYDKQILNK--DKLSHLPSFFGL 262 Query: 140 DFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWG 185 D+G+ + P A H+ + + +Y+++ Y + T A+K G Sbjct: 263 DYGFINDPSALLHVKIDDANKKLYILEEYVRKNLTNDKIANAIKDLG 309 >gi|227530602|ref|ZP_03960651.1| PBSX family phage terminase, large subunit [Lactobacillus vaginalis ATCC 49540] gi|227349490|gb|EEJ39781.1| PBSX family phage terminase, large subunit [Lactobacillus vaginalis ATCC 49540] Length = 227 Score = 42.3 bits (98), Expect = 0.065, Method: Composition-based stats. Identities = 34/152 (22%), Positives = 61/152 (40%), Gaps = 25/152 (16%) Query: 105 GEPILGSGRIF-PIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYV 163 G + G G I+ E +V+N D+P+ I G+D+G++HP + + +S+ Y+ Sbjct: 25 GLWVTGEGAIYRDFDERKMVVN--DVPKMVRYIAGIDWGYNHPCSITVFGIDANSN-YYL 81 Query: 164 VKNYRCREQTPIFHVAAL-----KSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKM 218 V R I + + K +G +P+ + ++ G+ Sbjct: 82 VDEKTER-FKEIDYWTKVARKLQKKYGYKMPFYCDTAR--------TEFIDHFKHNGINA 132 Query: 219 LPECATFDDGSNGVEAGISDMLDRMRSGRWKV 250 L G V GI + M+SGR+ V Sbjct: 133 LY-------GWKLVVPGIEIVAGLMKSGRFFV 157 >gi|241895598|ref|ZP_04782894.1| large subunit terminase [Weissella paramesenteroides ATCC 33313] gi|241871176|gb|EER74927.1| large subunit terminase [Weissella paramesenteroides ATCC 33313] Length = 429 Score = 41.9 bits (97), Expect = 0.085, Method: Composition-based stats. 
Identities = 37/202 (18%), Positives = 76/202 (37%), Gaps = 19/202 (9%) Query: 45 TLTLTPLKGRSPIIEHYLSASSSDRQVIRMTI-NETPHYNEQE----RKRIIDSYPLHER 99 +T P R + + A + V+ T + H+ Q+ + ++ P + Sbjct: 164 VITFNPWSDRHWLKREFFDADTRRNHVLSFTTTYKNNHHLNQDFIDSMEEMVIRNPNRAK 223 Query: 100 EARTKGEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHH-PFAAGHLVWNRDS 158 A G+ + G +F + E + +I + + G+DFG+ H P A + ++ + Sbjct: 224 VA-VYGDWGISEGLVFDGLFEQRDFSMEEIAK-LPKSVGLDFGFKHDPTAGEFMAIDQRN 281 Query: 159 DVIYVVKNYRCREQ-TPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMK 217 ++YV + + T + K LP ++R +L++ Y Sbjct: 282 RIVYVYDEFYQQGMLTQAIAQSLAKHKAYGLPITADSA----EQRLTTELASVYN----- 332 Query: 218 MLPECATFDDGSNGVEAGISDM 239 +P T G + V G+ M Sbjct: 333 -VPNLRTAGKGKDSVIQGVQYM 353 >gi|94989072|ref|YP_597173.1| terminase large subunit [Streptococcus pyogenes MGAS9429] gi|94992963|ref|YP_601062.1| terminase large subunit [Streptococcus pyogenes MGAS2096] gi|94542580|gb|ABF32629.1| terminase large subunit [Streptococcus pyogenes MGAS9429] gi|94546471|gb|ABF36518.1| Terminase large subunit [Streptococcus pyogenes MGAS2096] Length = 428 Score = 41.9 bits (97), Expect = 0.089, Method: Composition-based stats. 
Identities = 34/167 (20%), Positives = 66/167 (39%), Gaps = 14/167 (8%) Query: 28 DVYFEGLTRINATQGL---VTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNE 84 D Y + R+ + L + L P+ + + + + + + V + T + ++ Sbjct: 151 DDYTQLTLRLRDKKHLEKQIYLMFNPVSKVNWVYKAFFVKTPKNTVVYQTTYKDNRFLDD 210 Query: 85 QERKRIIDSYPLHEREARTKGEPILGS-----GRIFPIVEEDIVINSLDIPEHWVQIGGM 139 R+ I + EA K LG IFP ++ I+ D H G+ Sbjct: 211 VTRENIEEL--ADRNEAYYK-IYALGQFATLDKLIFPKYDKQILNK--DKLSHLSSFFGL 265 Query: 140 DFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWG 185 D+G+ + P A H+ + + +YV++ Y + T A+K G Sbjct: 266 DYGFINDPSAFLHVKIDDANKKLYVIEEYVRKNLTNDKIANAIKDLG 312 >gi|71911253|ref|YP_282803.1| terminase large subunit [Streptococcus pyogenes MGAS5005] gi|71854035|gb|AAZ52058.1| terminase large subunit [Streptococcus pyogenes MGAS5005] Length = 425 Score = 41.9 bits (97), Expect = 0.089, Method: Composition-based stats. Identities = 34/167 (20%), Positives = 66/167 (39%), Gaps = 14/167 (8%) Query: 28 DVYFEGLTRINATQGL---VTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNE 84 D Y + R+ + L + L P+ + + + + + + V + T + ++ Sbjct: 148 DDYTQLTLRLRDKKHLEKQIYLMFNPVSKVNWVYKAFFVKTPKNTVVYQTTYKDNRFLDD 207 Query: 85 QERKRIIDSYPLHEREARTKGEPILGS-----GRIFPIVEEDIVINSLDIPEHWVQIGGM 139 R+ I + EA K LG IFP ++ I+ D H G+ Sbjct: 208 VTRENIEEL--ADRNEAYYK-IYALGQFATLDKLIFPKYDKQILNK--DKLSHLSSFFGL 262 Query: 140 DFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWG 185 D+G+ + P A H+ + + +YV++ Y + T A+K G Sbjct: 263 DYGFINDPSAFLHVKIDDANKKLYVIEEYVRKNLTNDKIANAIKDLG 309 >gi|157146113|ref|YP_001453432.1| hypothetical protein CKO_01869 [Citrobacter koseri ATCC BAA-895] gi|157083318|gb|ABV12996.1| hypothetical protein CKO_01869 [Citrobacter koseri ATCC BAA-895] Length = 523 Score = 41.5 bits (96), Expect = 0.094, Method: Composition-based stats. 
Identities = 24/91 (26%), Positives = 37/91 (40%), Gaps = 4/91 (4%) Query: 68 DRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFP--IVEEDIVIN 125 R I + E P+ + Q ++ + R+A +G + SG F E VI Sbjct: 219 TRVAIHGSFKENPYLDPQYIATLMSIKDPNRRKAWVEGSWDVTSGGRFDHLWSEALHVIK 278 Query: 126 SLDIPEHWVQIGGMDFGWHHPFAAGHLVWNR 156 IP+ W D+G PF+ +L W R Sbjct: 279 PFRIPDSWTVDRSHDWGESKPFS--NLWWAR 307 >gi|293368016|ref|ZP_06614649.1| large terminase subunit [Staphylococcus epidermidis M23864:W2(grey)] gi|291317838|gb|EFE58251.1| large terminase subunit [Staphylococcus epidermidis M23864:W2(grey)] Length = 421 Score = 41.5 bits (96), Expect = 0.11, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 66/184 (35%), Gaps = 13/184 (7%) Query: 62 LSASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGE-PILGSGRIFPIVEE 120 S + V T + P ++Q + + +E R + +GSG + P Sbjct: 186 TSFQPDNTFVHHSTYLDNPFISKQFIQEAESTKERNELRYRWEYMGEAIGSGVV-PFNNL 244 Query: 121 DIVINSLDIPEHWVQI-GGMDFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHV 178 I ++ + + I +DFG+ P A +++ +IY V + + + Sbjct: 245 QIEKIPDELYKSFDNIRNAVDFGYATDPLAFVRWHYDKKKRIIYAVDEHYGVQISNREFA 304 Query: 179 AALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISD 238 LK G + D + D + ++ + G+K + G + VE G Sbjct: 305 NWLKRRG------YQSDEIFADSAEPKSIAELKQEHGIKRIK---GVKKGPDSVEHGEQW 355 Query: 239 MLDR 242 + D Sbjct: 356 LDDL 359 >gi|306826838|ref|ZP_07460138.1| PBSX family phage terminase [Streptococcus pyogenes ATCC 10782] gi|304430856|gb|EFM33865.1| PBSX family phage terminase [Streptococcus pyogenes ATCC 10782] Length = 425 Score = 41.5 bits (96), Expect = 0.11, Method: Composition-based stats. 
Identities = 34/167 (20%), Positives = 66/167 (39%), Gaps = 14/167 (8%) Query: 28 DVYFEGLTRINATQGL---VTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNE 84 D Y + R+ + L + L P+ + + + + + + V + T + ++ Sbjct: 148 DDYTQLTLRLRDKKHLEKQIYLMFNPVSKVNWVYKAFFVKTPKNTVVYQTTYKDNRFLDD 207 Query: 85 QERKRIIDSYPLHEREARTKGEPILGS-----GRIFPIVEEDIVINSLDIPEHWVQIGGM 139 R+ I + EA K LG IFP ++ I+ D H G+ Sbjct: 208 VTRENIEEL--ADRNEAYYK-IYALGQFATLDKLIFPKYDKQILNK--DKLSHLSSFFGL 262 Query: 140 DFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWG 185 D+G+ + P A H+ + + +YV++ Y + T A+K G Sbjct: 263 DYGFINDPSAFLHVKIDDANKKLYVLEEYVRKNLTNDKIANAIKDLG 309 >gi|295840070|ref|ZP_06827003.1| translation initiation factor IF-2 [Streptomyces sp. SPB74] gi|197697038|gb|EDY43971.1| translation initiation factor IF-2 [Streptomyces sp. SPB74] Length = 489 Score = 41.1 bits (95), Expect = 0.13, Method: Composition-based stats. Identities = 16/85 (18%), Positives = 37/85 (43%), Gaps = 3/85 (3%) Query: 99 REARTK-GEPILGSGRIFPIVEEDI-VINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNR 156 R R + G+ G ++ ++ + +I+ IP+ W + +DFG+ +PF + Sbjct: 217 RRLRLRDGQWAAAEGMVYDGWDDAVHLIDPAPIPKEWPRWWVVDFGFTNPFVLQCWAEDP 276 Query: 157 DSDVIYVVKNYRCREQTPIFHVAAL 181 D +I + + ++ H + Sbjct: 277 DGRLI-MYREIYRTQRLVEDHARDI 300 >gi|145589311|ref|YP_001155908.1| hypothetical protein Pnuc_1128 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] gi|145047717|gb|ABP34344.1| protein of unknown function DUF264 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] Length = 444 Score = 40.7 bits (94), Expect = 0.16, Method: Composition-based stats. 
Identities = 26/146 (17%), Positives = 49/146 (33%), Gaps = 24/146 (16%) Query: 123 VINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKN---YRCREQTPIFHVA 179 V+ IP W MD+G+ P+A + D V Y+ + Y +E T A Sbjct: 250 VVEPFAIPPTWKVWRSMDWGYARPYAVYWFALSNDG-VYYLWRELYGYGDKENTGTREDA 308 Query: 180 ALKSWGKWLPWAWPHD--------------GLQHDKRSGEQLSAQYRRQGMKMLPECATF 225 + + + HD + + + +R +G+K Sbjct: 309 TVV--AEKIKKIEIHDQRLGYEYRMNLADPSIFSKIGAERSIGQIFRDKGVKWTEAY--- 363 Query: 226 DDGSNGVEAGISDMLDRMRSGRWKVF 251 + S G +++ + R K+F Sbjct: 364 -NASRSRVNGAQEIIRLLAEDRLKIF 388 >gi|66395220|ref|YP_239489.1| ORF005 [Staphylococcus phage 187] gi|62635572|gb|AAX90683.1| ORF005 [Staphylococcus phage 187] Length = 446 Score = 40.4 bits (93), Expect = 0.21, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 63/164 (38%), Gaps = 10/164 (6%) Query: 30 YFEGLTRINATQGL---VTLTLTPLKGRSPIIEHYLSASSSDRQVI--RMTINETPHYNE 84 Y + R+ + + + L P+ + + +++ V+ + + + +E Sbjct: 166 YTQLTLRLRERKHMNKQIFLMFNPVSKLNWVYKYFFEHGEPMENVMIRQSSYRDNKFLDE 225 Query: 85 QERK--RIIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFG 142 R+ ++ + + GE +FP E+ I I+ ++ H G+DFG Sbjct: 226 MTRQNLELLANRNPAYYKIYALGEFATLDKLVFPKYEKRI-ISDKEV-GHLPSYFGLDFG 283 Query: 143 W-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWG 185 + + P A H+ + D+ +YV+ Y + + G Sbjct: 284 YVNDPSAFIHVKIDNDNKKLYVISEYVKKGMLNNEIAQVINDLG 327 >gi|52081321|ref|YP_080112.1| Phage terminase, large subunit protein [Bacillus licheniformis ATCC 14580] gi|52786700|ref|YP_092529.1| YqaT [Bacillus licheniformis ATCC 14580] gi|52004532|gb|AAU24474.1| Phage terminase, large subunit protein [Bacillus licheniformis ATCC 14580] gi|52349202|gb|AAU41836.1| YqaT [Bacillus licheniformis ATCC 14580] Length = 431 Score = 40.0 bits (92), Expect = 0.30, Method: Composition-based stats. 
Identities = 35/191 (18%), Positives = 65/191 (34%), Gaps = 35/191 (18%) Query: 56 PIIEHYLSASSSDRQVIRMTINET--PHYNEQERKRIIDSY-----------PLHEREAR 102 + ++ + + +N+T H ++ + +SY P R AR Sbjct: 169 DLNNRFVLDDKELYEKRTIVLNDTYYHHSTAEDNLFLPESYVRQLDELKEYDPDLYRIAR 228 Query: 103 TKGEPILGSGRIFPIVEE------DIVINSLDIPEHWVQIGGMDFGWHHP-FAAGHLVWN 155 KG + R+FP +EE ++++D P ++ GMDFG+ A L + Sbjct: 229 -KGHFGVNGVRVFPQLEEWPHDEVMQAVSNIDCP---IKRVGMDFGFEESYNAVVRLAVD 284 Query: 156 RDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLS--AQYRR 213 +Y+ Y R T L+ + K + +R+ Sbjct: 285 HKKKYLYIYWEYYKRGMTDDRTAEELQEFKNTQELI---------KADSAEPKTIQYFRQ 335 Query: 214 QGMKMLPECAT 224 QG M+ Sbjct: 336 QGFNMVGAHKY 346 >gi|139473881|ref|YP_001128597.1| putative phage terminase, large subunit [Streptococcus pyogenes str. Manfredo] gi|134272128|emb|CAM30373.1| putative phage terminase, large subunit [Streptococcus pyogenes str. Manfredo] Length = 425 Score = 40.0 bits (92), Expect = 0.31, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 67/167 (40%), Gaps = 14/167 (8%) Query: 28 DVYFEGLTRINATQGL---VTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNE 84 D Y + R+ + L + L P+ + + + + + + V + T + ++ Sbjct: 148 DDYTQLTLRLRDKKHLEKQIYLMFNPVSKVNWVYKAFFVKTPKNTVVYQTTYKDNRFLDD 207 Query: 85 QERKRIIDSYPLHEREARTKGEPILGS-----GRIFPIVEEDIVINSLDIPEHWVQIGGM 139 R+ I + + EA K LG IFP ++ I+ D + G+ Sbjct: 208 VTRENIEEL--ANRNEAYYK-IYALGQFATLDKLIFPKYDKQILNK--DKLSYLPSFFGL 262 Query: 140 DFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWG 185 D+G+ + P A H+ + + +Y+++ Y + T A+K G Sbjct: 263 DYGFINDPSALLHVKIDDANKKLYILEEYVRKNLTNDKIANAIKDLG 309 >gi|13470687|ref|NP_102256.1| bacteriophage terminase large subunit-like protein [Mesorhizobium loti MAFF303099] gi|14021429|dbj|BAB48042.1| mll0463 [Mesorhizobium loti MAFF303099] Length = 445 Score = 40.0 bits (92), Expect = 0.32, Method: Composition-based stats. 
Identities = 54/281 (19%), Positives = 88/281 (31%), Gaps = 50/281 (17%) Query: 5 EQGRDKWQSNTVHYVWFDEEPPEDVYFEGLTRINATQ----------------------- 41 E+ R K+Q +H + DE + E + R + Sbjct: 56 EKDRFKYQGAEIHVLLIDE---LTHFTEVIYRFLRNRVRMVGIKLPAKYAGRFPRILCGA 112 Query: 42 -------GLVTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPHYNEQE---RKRII 91 V T + S RQ I + + P N+ + R++ Sbjct: 113 NPGGIGHQFVKATFIDGATAMKVYTTEASEGGMRRQFIPAQLEDNPSMNDNDPGYENRLM 172 Query: 92 DSYPLHEREARTKGEPILGSGRIFPIVEEDI-VINSLDIPEHWVQIGGMDFGWHHPFAAG 150 A G+ + G F E V+ IP+HW++ D+G PFA G Sbjct: 173 GLGSESLVRAMRYGDWDVVEGAYFDNFERRRHVVKPFTIPDHWIRFRAGDWGSAKPFAFG 232 Query: 151 HLVWNRDSDVIYVVKNYRCREQTPIFHVAALK--SWGKWLPWAWPHDGLQ-HDKRSGEQL 207 V D +I +K GK+LP GL+ H + G + Sbjct: 233 WYVVASDDTIIAPGVVVPRGALVKYREWYGVKIDKSGKFLPDV----GLKLHAEAVGAGV 288 Query: 208 SA-QYRRQGMKMLPECATFD-DGSNGVEAGISDMLDRMRSG 246 Y + + + A F DG I++ + R +G Sbjct: 289 RQRDYDDIIVYGVLDPAAFSQDGG----PSIAERMTRGTTG 325 >gi|237746327|ref|ZP_04576807.1| conserved hypothetical protein [Oxalobacter formigenes HOxBLS] gi|229377678|gb|EEO27769.1| conserved hypothetical protein [Oxalobacter formigenes HOxBLS] Length = 489 Score = 40.0 bits (92), Expect = 0.32, Method: Composition-based stats. Identities = 23/146 (15%), Positives = 48/146 (32%), Gaps = 24/146 (16%) Query: 123 VINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFH----- 177 V IP W MD+G+ P+A N D IY+ + + Sbjct: 279 VTEPFAIPPSWTVWKAMDWGYAAPYAVYWFAMNPDG-CIYLWRELYGAGEKAGQGSREDA 337 Query: 178 ---VAALKS---------WGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATF 225 +K + + A P + + + +R +G++ + Sbjct: 338 ADVARKIKKMEERDERFGYDYRINLADP--SIFSKNGTDRSIGQIFRDEGIRWQ---EAW 392 Query: 226 DDGSNGVEAGISDMLDRMRSGRWKVF 251 + + V G +++ + G+ K+F Sbjct: 393 NAKGSRVN-GAQEIIRLLAEGKLKIF 417 >gi|260753972|ref|YP_003226865.1| hypothetical protein Za10_1747 [Zymomonas mobilis subsp. mobilis NCIMB 11163] gi|258553335|gb|ACV76281.1| hypothetical protein Za10_1747 [Zymomonas mobilis subsp. 
mobilis NCIMB 11163] Length = 297 Score = 39.6 bits (91), Expect = 0.36, Method: Composition-based stats. Identities = 22/121 (18%), Positives = 44/121 (36%), Gaps = 21/121 (17%) Query: 130 PEHWVQIGGMDFGWH-----HPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSW 184 P+HW ++ +D G P + W + Y+ + R++ + A + Sbjct: 128 PDHWERLCDIDLGGDIAIDATPVFHDGMWW-----LFYMSGYAKARKKQELHAAYARELT 182 Query: 185 GKWL-----PWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDM 239 GKW P +G + + G + +++G +LP V + + D Sbjct: 183 GKWTVYKNNPVI---EGFAYSRPGGSAVI---KKEGKLVLPVQDCVTTYGRAVRSLLFDH 236 Query: 240 L 240 L Sbjct: 237 L 237 >gi|328772063|gb|EGF82102.1| hypothetical protein BATDEDRAFT_34578 [Batrachochytrium dendrobatidis JAM81] Length = 384 Score = 39.6 bits (91), Expect = 0.36, Method: Composition-based stats. Identities = 27/99 (27%), Positives = 35/99 (35%), Gaps = 25/99 (25%) Query: 92 DSYPLHEREARTKG---EPILGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFA 148 S E EART+G LGSG+ P+ P WV G P Sbjct: 207 ASRSAWESEARTRGQHYMQALGSGQ--PVP-----------PVAWVLTEGSSI----PQN 249 Query: 149 AGHLVWNRDSDVIYVVKNYRCREQTPIFHVAA-LKSWGK 186 A RD IY+ + + HV ++SW K Sbjct: 250 AIQGGNERDGSPIYITRTWHEN----SIHVGKMIRSWSK 284 >gi|302829593|ref|XP_002946363.1| hypothetical protein VOLCADRAFT_115935 [Volvox carteri f. nagariensis] gi|300268109|gb|EFJ52290.1| hypothetical protein VOLCADRAFT_115935 [Volvox carteri f. nagariensis] Length = 2414 Score = 39.6 bits (91), Expect = 0.38, Method: Composition-based stats. 
Identities = 35/209 (16%), Positives = 65/209 (31%), Gaps = 41/209 (19%) Query: 45 TLTLTPLKGRSPIIEHYLSASSSDRQ--VIRMTINETPHYNEQERKRIIDSY--PLHERE 100 +T T + RSP + L+ S R + +PH ++ ++ S P R Sbjct: 369 IVTATAMASRSPGVRRALATSPEARSTDGPLASAAGSPHATFEQVHLLLASLNLPGLGRA 428 Query: 101 ARTKGEPILGSGRIFPIVEEDIV---------INSLDI--------PEHWVQIGGMDFGW 143 + P +G+G EE +++ +W Q+ G + G Sbjct: 429 SMGADRPPMGTGC--DAPEEQQRRDLDNLAPAVSAFTAGLADQHISAAYWPQVSGQEGGL 486 Query: 144 HHPFAAGHLVWNRDSDVIYVVKNYRCREQT----------PIFHVAALKSWGKWL----- 188 +P A + R +D+ + YR P A+ + Sbjct: 487 TNPAAKSMMAAAR-TDLGFTQGMYRRPGAVRVGADAFDEYPPQPAVAMNARCGKNGAYEN 545 Query: 189 --PWAWPHDGLQHDKRSGEQLSAQYRRQG 215 WP D + G + + + G Sbjct: 546 FDAVRWPFDQALAARGPGGPRAHELAKSG 574 >gi|317499861|ref|ZP_07958099.1| pbsx family Phage terminase [Lachnospiraceae bacterium 8_1_57FAA] gi|316898763|gb|EFV20796.1| pbsx family Phage terminase [Lachnospiraceae bacterium 8_1_57FAA] Length = 428 Score = 39.6 bits (91), Expect = 0.39, Method: Composition-based stats. Identities = 26/122 (21%), Positives = 49/122 (40%), Gaps = 14/122 (11%) Query: 73 RMTINE---TPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINSLDI 129 T+++ P ++ + P R AR + GS +FP + +V + + Sbjct: 199 HSTVDDNFFVPKEYVEQLDDLQTHDPDLYRVARQGRFGVNGS-LVFP---QFVVEPANQV 254 Query: 130 PEHWVQI------GGMDFGWHHP-FAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALK 182 + I GMDFG+ AA ++ + D ++Y+ + Y R +T +K Sbjct: 255 EKEIKAIRTPLEKNGMDFGFVTSYNAALRMIVDHDEKILYIYREYYSRNKTDPEIAEDMK 314 Query: 183 SW 184 W Sbjct: 315 DW 316 >gi|227505813|ref|ZP_03935862.1| phage terminase [Corynebacterium striatum ATCC 6940] gi|227197594|gb|EEI77642.1| phage terminase [Corynebacterium striatum ATCC 6940] Length = 378 Score = 39.6 bits (91), Expect = 0.46, Method: Composition-based stats. 
Identities = 25/114 (21%), Positives = 41/114 (35%), Gaps = 3/114 (2%) Query: 73 RMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPI-VEEDIVINSLDIPE 131 T+++ P + + Y +G + G I+ + E V++ D+P Sbjct: 197 HFTMDDNPSLTTAYKNNLKKEYTGMWFLRFIQGLWVAAEGAIYSMWDESKHVVDPEDMPP 256 Query: 132 HWVQIG-GMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSW 184 I G+D+G HP L D +Y V + T A LK W Sbjct: 257 METIIALGIDYGTTHPTTGIMLGMGTDR-KLYAVDEWAPGRLTNAALTADLKEW 309 >gi|221213957|ref|ZP_03586930.1| putative TerL [Burkholderia multivorans CGD1] gi|221166134|gb|EED98607.1| putative TerL [Burkholderia multivorans CGD1] Length = 473 Score = 39.2 bits (90), Expect = 0.48, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 42/141 (29%), Gaps = 14/141 (9%) Query: 114 IFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHL-----VWNRDSDVIYVVKNYR 168 + P + + ++ + GMDFG P A W S+V+ R Sbjct: 260 VHPDYADSLHCKPFELVKTQPLWIGMDFGLT-PAAVIGQRKPMGGWRIRSEVVATSMGAR 318 Query: 169 CREQTPIFHVAALKSWGKWLPWAW-PHDGLQHDKRSGEQL-SAQYRRQGMKMLPECATFD 226 H+A + G + + G Q + E R G E Sbjct: 319 KFGIELKRHLAEIYP-GFEIGGIYGDPAGDQRSQADDEDTPFRILRAAGF----EARPAP 373 Query: 227 DGSNGVEAG-ISDMLDRMRSG 246 + G + + L R+ G Sbjct: 374 TNDTSLRYGAVDEALTRIIDG 394 >gi|319647234|ref|ZP_08001456.1| YqaT protein [Bacillus sp. BT1B_CT2] gi|317390581|gb|EFV71386.1| YqaT protein [Bacillus sp. BT1B_CT2] Length = 431 Score = 39.2 bits (90), Expect = 0.51, Method: Composition-based stats. 
Identities = 34/191 (17%), Positives = 64/191 (33%), Gaps = 35/191 (18%) Query: 56 PIIEHYLSASSSDRQVIRMTINET--PHYNEQERKRIIDSY-----------PLHEREAR 102 + ++ + + +N+T H ++ + +SY P R AR Sbjct: 169 DLNNRFVLDDKELYEKRTIVLNDTYYHHSTAEDNLFLPESYVRQLDELKEYDPDLYRIAR 228 Query: 103 TKGEPILGSGRIFPIVEE------DIVINSLDIPEHWVQIGGMDFGWHHP-FAAGHLVWN 155 KG + R+FP +EE ++++D P ++ GMDFG+ A L + Sbjct: 229 -KGHFGVNGVRVFPQLEEWPHDEVMQAVSNIDCP---IKRVGMDFGFEESYNAVVRLAVD 284 Query: 156 RDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLS--AQYRR 213 +Y+ Y R T L+ + K + +R+ Sbjct: 285 HKKKYLYIYWEYYKRGMTDDRTAEELQEFKNTQELI---------KADSAEPKTIQYFRQ 335 Query: 214 QGMKMLPECAT 224 G M+ Sbjct: 336 HGFNMVGAHKY 346 >gi|295189275|gb|ADF83462.1| putative large terminase [Lactobacillus phage LBR48] Length = 277 Score = 39.2 bits (90), Expect = 0.52, Method: Composition-based stats. Identities = 32/139 (23%), Positives = 55/139 (39%), Gaps = 13/139 (9%) Query: 44 VTLTLTPLKGRSPIIEHYLSAS--SSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREA 101 +TLT P R + + D T ++Q+R+R +D Y + R A Sbjct: 20 ITLTFNPWSERHWLKPMFFDPETRKPDVFARTTTFRVNEWLDKQDRQRYLDLYRTNPRRA 79 Query: 102 R--TKGEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGG----MDFGWHH-PFAAGHLVW 154 + G + G +F E+ V+ D+ + + G MDFG+ H P Sbjct: 80 QIVCDGNWGVAEGLVF----ENWVVEDFDVNKVVAESDGVGHGMDFGFTHDPTTFAEAAI 135 Query: 155 NRDSDVIYVVKNYRCREQT 173 NR++ I++ K + T Sbjct: 136 NRETKDIWIFKELYQKAMT 154 >gi|257413762|ref|ZP_04744154.2| prophage LambdaCh01, terminase, large subunit, PBSX family [Roseburia intestinalis L1-82] gi|257202370|gb|EEV00655.1| prophage LambdaCh01, terminase, large subunit, PBSX family [Roseburia intestinalis L1-82] Length = 644 Score = 39.2 bits (90), Expect = 0.53, Method: Composition-based stats. 
Identities = 17/55 (30%), Positives = 27/55 (49%), Gaps = 1/55 (1%) Query: 112 GRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKN 166 G IF VEE + ++ +I G+DFG+ HP ++ D+D +Y V Sbjct: 444 GGIFDNVEERTITDA-EIENLPFLYYGLDFGFEHPQTFEVAYYDEDTDTLYCVSE 497 >gi|332072381|gb|EGI82864.1| phage terminase large subunit [Streptococcus pneumoniae GA17570] Length = 344 Score = 39.2 bits (90), Expect = 0.57, Method: Composition-based stats. Identities = 38/187 (20%), Positives = 64/187 (34%), Gaps = 23/187 (12%) Query: 18 YVWFDE------EPPEDVYFEGLTRINAT---QGLVTLTLTPLKGRSPIIEHYLSASS-- 66 + WF+E E E + +T+T P R + + + Sbjct: 49 WAWFEEAYQIETEDKFSTVVESIRGSLDVPDFFKQITVTFNPWNERHWLKRVFFDEETSR 108 Query: 67 SDRQVIRMTINETPHYNEQERKRIIDSYPLHEREAR--TKGEPILGSGRIFPIVEEDIVI 124 +D T +E + KR D Y + R AR GE + G I+ E++ + Sbjct: 109 ADTFATTTTYKCNEWLDEVDIKRYEDLYHTNPRRARIVCDGEWGVAEGLIY----ENVTV 164 Query: 125 NSLD----IPEHWVQIG-GMDFGWHH-PFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHV 178 D + + ++ G+DFG+ H P A + N + IYV Sbjct: 165 KDFDKDELLRDSANKLCIGLDFGFTHDPTALCCSLINDTTKEIYVFDEAYKVGLITKEVA 224 Query: 179 AALKSWG 185 +K G Sbjct: 225 KMIKDKG 231 >gi|321157214|emb|CBW39198.1| Phage terminase large subunit [Streptococcus phage 8140] Length = 432 Score = 39.2 bits (90), Expect = 0.57, Method: Composition-based stats. 
Identities = 38/187 (20%), Positives = 64/187 (34%), Gaps = 23/187 (12%) Query: 18 YVWFDE------EPPEDVYFEGLTRINAT---QGLVTLTLTPLKGRSPIIEHYLSASS-- 66 + WF+E E E + +T+T P R + + + Sbjct: 137 WAWFEEAYQIETEDKFSTVVESIRGSLDVPDFFKQITVTFNPWNERHWLKRVFFDEETSR 196 Query: 67 SDRQVIRMTINETPHYNEQERKRIIDSYPLHEREAR--TKGEPILGSGRIFPIVEEDIVI 124 +D T +E + KR D Y + R AR GE + G I+ E++ + Sbjct: 197 ADTFATTTTYKCNEWLDEVDIKRYEDLYHTNPRRARIVCDGEWGVAEGLIY----ENVTV 252 Query: 125 NSLD----IPEHWVQIG-GMDFGWHH-PFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHV 178 D + + ++ G+DFG+ H P A + N + IYV Sbjct: 253 KDFDKDELLRDSANKLCIGLDFGFTHDPTALCCSLINDTTKEIYVFDEAYKVGLITKEVA 312 Query: 179 AALKSWG 185 +K G Sbjct: 313 KMIKDKG 319 >gi|321156947|emb|CBW38936.1| Phage terminase large subunit [Streptococcus phage V22] Length = 432 Score = 39.2 bits (90), Expect = 0.57, Method: Composition-based stats. Identities = 38/187 (20%), Positives = 64/187 (34%), Gaps = 23/187 (12%) Query: 18 YVWFDE------EPPEDVYFEGLTRINAT---QGLVTLTLTPLKGRSPIIEHYLSASS-- 66 + WF+E E E + +T+T P R + + + Sbjct: 137 WAWFEEAYQIETEDKFSTVVESIRGSLDVPDFFKQITVTFNPWNERHWLKRVFFDEETSR 196 Query: 67 SDRQVIRMTINETPHYNEQERKRIIDSYPLHEREAR--TKGEPILGSGRIFPIVEEDIVI 124 +D T +E + KR D Y + R AR GE + G I+ E++ + Sbjct: 197 ADTFATTTTYKCNEWLDEVDIKRYEDLYHTNPRRARIVCDGEWGVAEGLIY----ENVTV 252 Query: 125 NSLD----IPEHWVQIG-GMDFGWHH-PFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHV 178 D + + ++ G+DFG+ H P A + N + IYV Sbjct: 253 KDFDKDELLRDSANKLCIGLDFGFTHDPTALCCSLINDTTKEIYVFDEAYKVGLITKEVA 312 Query: 179 AALKSWG 185 +K G Sbjct: 313 KMIKDKG 319 >gi|183603148|ref|ZP_02712947.2| phage terminase large subunit [Streptococcus pneumoniae SP195] gi|183572674|gb|EDT93202.1| phage terminase large subunit [Streptococcus pneumoniae SP195] Length = 434 Score = 39.2 bits (90), Expect = 0.57, Method: Composition-based stats. 
Identities = 38/187 (20%), Positives = 64/187 (34%), Gaps = 23/187 (12%) Query: 18 YVWFDE------EPPEDVYFEGLTRINAT---QGLVTLTLTPLKGRSPIIEHYLSASS-- 66 + WF+E E E + +T+T P R + + + Sbjct: 139 WAWFEEAYQIETEDKFSTVVESIRGSLDVPDFFKQITVTFNPWNERHWLKRVFFDEETSR 198 Query: 67 SDRQVIRMTINETPHYNEQERKRIIDSYPLHEREAR--TKGEPILGSGRIFPIVEEDIVI 124 +D T +E + KR D Y + R AR GE + G I+ E++ + Sbjct: 199 ADTFATTTTYKCNEWLDEVDIKRYEDLYHTNPRRARIVCDGEWGVAEGLIY----ENVTV 254 Query: 125 NSLD----IPEHWVQIG-GMDFGWHH-PFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHV 178 D + + ++ G+DFG+ H P A + N + IYV Sbjct: 255 KDFDKDELLRDSANKLCIGLDFGFTHDPTALCCSLINDTTKEIYVFDEAYKVGLITKEVA 314 Query: 179 AALKSWG 185 +K G Sbjct: 315 KMIKDKG 321 >gi|148993445|ref|ZP_01822962.1| terminase large subunit [Streptococcus pneumoniae SP9-BS68] gi|147928000|gb|EDK79020.1| terminase large subunit [Streptococcus pneumoniae SP9-BS68] Length = 432 Score = 39.2 bits (90), Expect = 0.57, Method: Composition-based stats. Identities = 38/187 (20%), Positives = 64/187 (34%), Gaps = 23/187 (12%) Query: 18 YVWFDE------EPPEDVYFEGLTRINAT---QGLVTLTLTPLKGRSPIIEHYLSASS-- 66 + WF+E E E + +T+T P R + + + Sbjct: 137 WAWFEEAYQIETEDKFSTVVESIRGSLDVPDFFKQITVTFNPWNERHWLKRVFFDEETSR 196 Query: 67 SDRQVIRMTINETPHYNEQERKRIIDSYPLHEREAR--TKGEPILGSGRIFPIVEEDIVI 124 +D T +E + KR D Y + R AR GE + G I+ E++ + Sbjct: 197 ADTFATTTTYKCNEWLDEVDIKRYEDLYHTNPRRARIVCDGEWGVAEGLIY----ENVTV 252 Query: 125 NSLD----IPEHWVQIG-GMDFGWHH-PFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHV 178 D + + ++ G+DFG+ H P A + N + IYV Sbjct: 253 KDFDKDELLRDSANKLCIGLDFGFTHDPTALCCSLINDTTKEIYVFDEAYKVGLITKEVA 312 Query: 179 AALKSWG 185 +K G Sbjct: 313 KMIKDKG 319 >gi|149004553|ref|ZP_01829252.1| terminase large subunit [Streptococcus pneumoniae SP14-BS69] gi|147757556|gb|EDK64580.1| terminase large subunit [Streptococcus pneumoniae SP14-BS69] Length = 420 Score = 39.2 bits (90), Expect = 0.57, Method: Composition-based stats. 
Identities = 38/187 (20%), Positives = 64/187 (34%), Gaps = 23/187 (12%) Query: 18 YVWFDE------EPPEDVYFEGLTRINAT---QGLVTLTLTPLKGRSPIIEHYLSASS-- 66 + WF+E E E + +T+T P R + + + Sbjct: 137 WAWFEEAYQIETEDKFSTVVESIRGSLDVPDFFKQITVTFNPWNERHWLKRVFFDEETSR 196 Query: 67 SDRQVIRMTINETPHYNEQERKRIIDSYPLHEREAR--TKGEPILGSGRIFPIVEEDIVI 124 +D T +E + KR D Y + R AR GE + G I+ E++ + Sbjct: 197 ADTFATTTTYKCNEWLDEVDIKRYEDLYHTNPRRARIVCDGEWGVAEGLIY----ENVTV 252 Query: 125 NSLD----IPEHWVQIG-GMDFGWHH-PFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHV 178 D + + ++ G+DFG+ H P A + N + IYV Sbjct: 253 KDFDKDELLRDSANKLCIGLDFGFTHDPTALCCSLINDTTKEIYVFDEAYKVGLITKEVA 312 Query: 179 AALKSWG 185 +K G Sbjct: 313 KMIKDKG 319 >gi|289551569|ref|YP_003472473.1| Phage terminase, large subunit [Staphylococcus lugdunensis HKU09-01] gi|289181100|gb|ADC88345.1| Phage terminase, large subunit [Staphylococcus lugdunensis HKU09-01] Length = 424 Score = 38.8 bits (89), Expect = 0.67, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 63/164 (38%), Gaps = 10/164 (6%) Query: 30 YFEGLTRINATQGL---VTLTLTPLKGRSPIIEHYLSASSSDRQVI--RMTINETPHYNE 84 Y + R+ + + + L P+ + + +++ V+ + + + +E Sbjct: 144 YTQLTLRLRERKHINKQIFLMFNPVSKLNWVYKYFFEHDKPMENVMIRQSSYRDNKFLDE 203 Query: 85 QERK--RIIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFG 142 R+ ++ S + GE +FP E+ ++ N ++ H +DFG Sbjct: 204 MTRQNLELLASRNPAYYKIYALGEFATLDKLVFPKYEKKLL-NKQEL-SHLQSYFAIDFG 261 Query: 143 W-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWG 185 + + P A H + ++ +Y+V+ Y + +K G Sbjct: 262 YVNDPSAFIHCKVDMENKKLYIVEEYVKKGMLNNEIAEVIKRLG 305 >gi|241761606|ref|ZP_04759693.1| conserved hypothetical protein [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241373914|gb|EER63447.1| conserved hypothetical protein [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 297 Score = 38.8 bits (89), Expect = 0.67, Method: Composition-based stats. 
Identities = 22/121 (18%), Positives = 43/121 (35%), Gaps = 21/121 (17%) Query: 130 PEHWVQIGGMDFGWH-----HPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSW 184 P+HW ++ +D G P + W + Y+ + R++ + A + Sbjct: 128 PDHWERLCDIDLGGDIAIDATPVFHDGMWW-----LFYMSGYAKARKKQELHAAYARELT 182 Query: 185 GKWL-----PWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDM 239 GKW P +G + + G + ++G +LP V + + D Sbjct: 183 GKWTVYKNNPVI---EGFAYSRPGGSAVIN---KEGKLVLPVQDCVTTYGRAVRSLLFDH 236 Query: 240 L 240 L Sbjct: 237 L 237 >gi|48697193|ref|YP_024923.1| putative TerL [Burkholderia phage BcepC6B] gi|47778999|gb|AAT38362.1| putative TerL [Burkholderia phage BcepC6B] Length = 473 Score = 38.8 bits (89), Expect = 0.67, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 42/141 (29%), Gaps = 14/141 (9%) Query: 114 IFPIVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHL-----VWNRDSDVIYVVKNYR 168 + P + + ++ + GMDFG P A W S+V+ R Sbjct: 260 VHPDYADSLHCKPFELVKSQPLWIGMDFGLT-PAAVIGQRKPMGGWRIRSEVVATSMGAR 318 Query: 169 CREQTPIFHVAALKSWGKWLPWAW-PHDGLQHDKRSGEQL-SAQYRRQGMKMLPECATFD 226 H+A + G + + G Q + E R G E Sbjct: 319 KFGIELKRHLAEIYP-GFEVAGIYGDPAGDQRSQADDEDTPFRILRAAGF----EAKPAP 373 Query: 227 DGSNGVEAG-ISDMLDRMRSG 246 + G + + L R+ G Sbjct: 374 TNDTSLRYGAVDEALTRIIDG 394 >gi|187935777|ref|YP_001886932.1| phage terminase, large subunit, pbsx family [Clostridium botulinum B str. Eklund 17B] gi|187723930|gb|ACD25151.1| phage terminase, large subunit, pbsx family [Clostridium botulinum B str. Eklund 17B] Length = 450 Score = 38.8 bits (89), Expect = 0.68, Method: Composition-based stats. 
Identities = 20/86 (23%), Positives = 32/86 (37%), Gaps = 12/86 (13%) Query: 138 GMDFGWHHP-FAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDG 196 GMDFG+ A L + DS ++Y+ Y + T L+ + K Sbjct: 267 GMDFGFETSYNAVVRLAIDDDSKILYIYWQYYKNQMTDDKTAIELEEFKKT--------- 317 Query: 197 LQHDKRSGEQLS--AQYRRQGMKMLP 220 + K + YR++G ML Sbjct: 318 QERIKADSAEPKTITFYRQEGFNMLG 343 >gi|293374297|ref|ZP_06620625.1| phage terminase, large subunit, PBSX family [Turicibacter sanguinis PC909] gi|292647130|gb|EFF65112.1| phage terminase, large subunit, PBSX family [Turicibacter sanguinis PC909] Length = 418 Score = 38.8 bits (89), Expect = 0.70, Method: Composition-based stats. Identities = 20/78 (25%), Positives = 34/78 (43%), Gaps = 4/78 (5%) Query: 112 GRIFP----IVEEDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLVWNRDSDVIYVVKNY 167 G +F I DI +N++ + W GMD G P A ++N + +Y++K + Sbjct: 226 GLVFKHINFINSHDIDVNAMLKDKSWQVRCGMDIGELDPTAIAVSLFNERTQALYLIKEF 285 Query: 168 RCREQTPIFHVAALKSWG 185 R T A+ + G Sbjct: 286 YQRGATLDEMYEAIINLG 303 >gi|317473130|ref|ZP_07932428.1| phage terminase [Anaerostipes sp. 3_2_56FAA] gi|316899354|gb|EFV21370.1| phage terminase [Anaerostipes sp. 3_2_56FAA] Length = 499 Score = 38.4 bits (88), Expect = 0.90, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 64/187 (34%), Gaps = 31/187 (16%) Query: 19 VWFDEEP----PEDV--YFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQVI 72 +W +E PE V + + R + + P K + Y+ R V Sbjct: 219 LWLEELDQFTGPESVRKIEQSVIRGGEY-AYIFKSFNPPKSKGNWANKYIKIPKETRLVT 277 Query: 73 RMTINETPH------YNEQERKRIIDSYPLHEREARTKGEPILGSGRIFP------IVEE 120 T + P + ++ + + GE G++F I +E Sbjct: 278 HSTYMDIPKKWLGKPFLDE--AEFLKEVNPAAYDNEYMGEANGNGGQVFDNVTIREITDE 335 Query: 121 DIVINSLDIPEHWVQIG-GMDFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHV 178 +I + +I G+D+GW P+ G + ++ +Y+ YRC ++ Sbjct: 336 EI--------AQFDRIYNGVDWGWYPDPYHFGRMHYDAARMTLYIFMEYRCNKKGNKETA 387 Query: 179 AALKSWG 185 LK G Sbjct: 388 EELKKRG 394 >gi|238062875|ref|ZP_04607584.1| phage terminase [Micromonospora sp. 
ATCC 39149] gi|237884686|gb|EEP73514.1| phage terminase [Micromonospora sp. ATCC 39149] Length = 428 Score = 38.0 bits (87), Expect = 1.3, Method: Composition-based stats. Identities = 44/234 (18%), Positives = 90/234 (38%), Gaps = 13/234 (5%) Query: 26 PEDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSS-DRQVIRMTINETPHYNE 84 PE + + L R++ + T P + + YL + + + T+ + PH + Sbjct: 137 PEAFFTQVLARLSVAGAQLFGTTNPDSPNHWLRKKYLLRAGELNLRTWHSTLRDNPHLDP 196 Query: 85 QERKRIIDSYPLHEREARTKGEPILGSGRIFPIVEEDIVINSLDIPEHWVQIGGMDFGWH 144 Q + + Y + G + G +F + +ED + + H G+D+G Sbjct: 197 QYVRNLTTEYVGLWYKRFILGAWVQAEGAVFDMWDEDRHVIPVLPAIHRWISLGIDYGTR 256 Query: 145 HPFAAGHLVWNRDSDVIYVVKNYR------CREQTPIFHVAALKSWGK--WLPWAWPHDG 196 + AA L +D +++ +R R+ T L++W +P A G Sbjct: 257 NATAALILGVGQDG-RLHLTHEWRHDPAVARRQLTDAGLSRELRAWLGRLQVPGATGLTG 315 Query: 197 LQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSGRWKV 250 L+ + + +A R +++ + T N V GI M + + + +V Sbjct: 316 LRPEWTVVDPAAASLR---LQLHEDGMTPALADNAVLDGIRLMSSLLGNDQLRV 366 >gi|262184823|ref|ZP_06044244.1| Phage terminase large subunit [Corynebacterium aurimucosum ATCC 700975] Length = 421 Score = 38.0 bits (87), Expect = 1.3, Method: Composition-based stats. Identities = 27/129 (20%), Positives = 45/129 (34%), Gaps = 7/129 (5%) Query: 62 LSASSSDR----QVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPI 117 D+ T+++ P E + + Y +G + G I+ + Sbjct: 182 TKTQPEDQLADWTYWHFTMDDNPSLTEGYKSNLRKEYTGMWYLRFIQGLWVAAEGAIYQM 241 Query: 118 -VEEDIVINSLDIPEHWVQIG-GMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPI 175 E V++ D+P I G+D+G HP L D+ +Y V + T Sbjct: 242 WDEAKHVVHPEDMPIMRQIIALGIDYGTTHPTTGILLGIGDDN-RLYAVDEWAPGRLTNN 300 Query: 176 FHVAALKSW 184 A LK W Sbjct: 301 ALTADLKEW 309 >gi|227833751|ref|YP_002835458.1| Phage terminase large subunit [Corynebacterium aurimucosum ATCC 700975] gi|227454767|gb|ACP33520.1| Phage terminase large subunit [Corynebacterium aurimucosum ATCC 700975] Length = 414 Score = 38.0 bits (87), Expect = 1.3, Method: Composition-based stats. 
Identities = 27/129 (20%), Positives = 45/129 (34%), Gaps = 7/129 (5%) Query: 62 LSASSSDR----QVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPI 117 D+ T+++ P E + + Y +G + G I+ + Sbjct: 175 TKTQPEDQLADWTYWHFTMDDNPSLTEGYKSNLRKEYTGMWYLRFIQGLWVAAEGAIYQM 234 Query: 118 -VEEDIVINSLDIPEHWVQIG-GMDFGWHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPI 175 E V++ D+P I G+D+G HP L D+ +Y V + T Sbjct: 235 WDEAKHVVHPEDMPIMRQIIALGIDYGTTHPTTGILLGIGDDN-RLYAVDEWAPGRLTNN 293 Query: 176 FHVAALKSW 184 A LK W Sbjct: 294 ALTADLKEW 302 >gi|268319311|ref|YP_003292967.1| phage terminase large subunit [Lactobacillus johnsonii FI9785] gi|262397686|emb|CAX66700.1| phage terminase large subunit [Lactobacillus johnsonii FI9785] Length = 423 Score = 37.7 bits (86), Expect = 1.4, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 59/155 (38%), Gaps = 19/155 (12%) Query: 44 VTLTLTPLKGRSPIIEHYLSASSS---DRQVIRMTINETPHYNEQERKRIIDSY----PL 96 + P+ + + + + R I + + + +++ R I+ P Sbjct: 161 IFCMFNPVSKLNWTYTTWFAPDVNLDKSRVAIHQSTYKDNQFLDEDNIRTIEDLKNTNPA 220 Query: 97 HEREARTKGEPILGSGRIFPIVEEDIVINS-----LDIPEHWVQIGGMDFGW-HHPFAAG 150 + + T GE +FP E ++ ++IP+++ G+DFG+ + P A Sbjct: 221 YYK-IYTLGEFATLDKLVFP-SFETRRLDPHSSDLVNIPDYF----GLDFGYVNDPSAFT 274 Query: 151 HLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWG 185 H + + VIYV+ + + +K G Sbjct: 275 HTKIDMKNKVIYVIDEFVKKGLLNNELAQVIKDLG 309 >gi|317492999|ref|ZP_07951423.1| hypothetical protein HMPREF0864_02187 [Enterobacteriaceae bacterium 9_2_54FAA] gi|316919121|gb|EFV40456.1| hypothetical protein HMPREF0864_02187 [Enterobacteriaceae bacterium 9_2_54FAA] Length = 535 Score = 37.7 bits (86), Expect = 1.6, Method: Composition-based stats. 
Identities = 20/88 (22%), Positives = 35/88 (39%), Gaps = 2/88 (2%) Query: 68 DRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFPIV--EEDIVIN 125 R I + E P+ + +++ ++R+A +G + SG F + E VI Sbjct: 234 TRVAIHGSFKENPYLDPVYIATLMNIKDPNKRKAWVEGSWDVTSGGRFDHLWNESLHVIK 293 Query: 126 SLDIPEHWVQIGGMDFGWHHPFAAGHLV 153 IP+ W D+G PF+ Sbjct: 294 PFKIPDSWTVDRSHDWGESKPFSNLWWA 321 >gi|123442572|ref|YP_001006549.1| putative phage terminase large subunit [Yersinia enterocolitica subsp. enterocolitica 8081] gi|122089533|emb|CAL12381.1| putative phage terminase large subunit [Yersinia enterocolitica subsp. enterocolitica 8081] Length = 520 Score = 37.7 bits (86), Expect = 1.6, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 36/88 (40%), Gaps = 2/88 (2%) Query: 68 DRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGEPILGSGRIFP-IVEEDI-VIN 125 R I + E P+ + +++ ++R+A +G + SG F + E I VI Sbjct: 219 TRVAIHGSFKENPYLDPVYIASLMNIKDPNKRKAWVEGSWDVTSGGRFDHLWNESIHVIK 278 Query: 126 SLDIPEHWVQIGGMDFGWHHPFAAGHLV 153 IP+ W D+G PF+ Sbjct: 279 PFRIPDSWTVDRSHDWGESKPFSNLWWA 306 >gi|167462274|ref|ZP_02327363.1| hypothetical protein Plarl_06915 [Paenibacillus larvae subsp. larvae BRL-230010] gi|322382817|ref|ZP_08056660.1| phage-related terminase-like protein large subunit [Paenibacillus larvae subsp. larvae B-3650] gi|321153200|gb|EFX45647.1| phage-related terminase-like protein large subunit [Paenibacillus larvae subsp. larvae B-3650] Length = 423 Score = 37.3 bits (85), Expect = 1.8, Method: Composition-based stats. 
Identities = 17/86 (19%), Positives = 31/86 (36%), Gaps = 12/86 (13%) Query: 138 GMDFGW-HHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDG 196 GMDFG+ A L + + ++Y+ Y + T AL+ + + Sbjct: 266 GMDFGFEDSYNAVVRLAVDHEQKILYIYWEYYKNQMTDDRTAEALQEFARTKELI----- 320 Query: 197 LQHDKRSGEQLSA--QYRRQGMKMLP 220 K + +R++G M P Sbjct: 321 ----KADSAEPKTIRYFRQKGFNMRP 342 >gi|262039284|ref|ZP_06012601.1| phage terminase, large subunit, pbsx family [Leptotrichia goodfellowii F0264] gi|261746674|gb|EEY34196.1| phage terminase, large subunit, pbsx family [Leptotrichia goodfellowii F0264] Length = 438 Score = 37.3 bits (85), Expect = 1.9, Method: Composition-based stats. Identities = 11/45 (24%), Positives = 20/45 (44%), Gaps = 2/45 (4%) Query: 133 WVQIGGMDFGWH-HPFAAGHLVWNRDSDVIYVVKNYRCREQTPIF 176 W I GMDFG+ A + +++++YV + + T Sbjct: 270 WH-IAGMDFGFSISYTAVVRAAIDYENNILYVYDEFYNKGLTNAQ 313 >gi|317485514|ref|ZP_07944391.1| phage terminase [Bilophila wadsworthia 3_1_6] gi|316923194|gb|EFV44403.1| phage terminase [Bilophila wadsworthia 3_1_6] Length = 434 Score = 37.3 bits (85), Expect = 2.0, Method: Composition-based stats. Identities = 22/103 (21%), Positives = 36/103 (34%), Gaps = 16/103 (15%) Query: 146 PFAAGHLVWN---RDSDVIYVVK---------NYRCREQTPIFHVAA-LKSWGKWLPWAW 192 P A W+ DS I+V + +Y P+ H A ++ G P + Sbjct: 258 PAAPVFTAWDLGMDDSTAIWVAQCVGREIHLIDYYEANGQPLAHYADWVRGRGYGRPTHY 317 Query: 193 -PHDGLQHDKRSGEQLSAQYR--RQGMKMLPECATFDDGSNGV 232 PHD + +G+ G ++ + DG N V Sbjct: 318 LPHDARARELGTGKSREEVLAGLDIGPVLVVPQQSVADGINAV 360 >gi|313498221|gb|ADR59587.1| Hypothetical protein, conserved [Pseudomonas putida BIRD-1] Length = 511 Score = 37.3 bits (85), Expect = 2.3, Method: Composition-based stats. 
Identities = 18/94 (19%), Positives = 36/94 (38%), Gaps = 3/94 (3%) Query: 63 SASSSDRQVIRMTINETPHYNEQERKRIIDSYPLHEREARTKGE-PILGSGRIFPIV--E 119 D+ I T+ E PH N++ + + +R + G+ + +F + + Sbjct: 217 KNGQRDKCAIFGTVFENPHLNDEYKHWLRTISDPAKRASWLLGDWEAVDDTAMFAALWKK 276 Query: 120 EDIVINSLDIPEHWVQIGGMDFGWHHPFAAGHLV 153 + +++ IP HW D+G PF Sbjct: 277 DVLLMQPFTIPAHWKVERSFDYGQSTPFCCLWTA 310 >gi|225419807|ref|ZP_03762110.1| hypothetical protein CLOSTASPAR_06147 [Clostridium asparagiforme DSM 15981] gi|225041548|gb|EEG51794.1| hypothetical protein CLOSTASPAR_06147 [Clostridium asparagiforme DSM 15981] Length = 686 Score = 36.9 bits (84), Expect = 2.3, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 52/150 (34%), Gaps = 18/150 (12%) Query: 47 TLTPLKGRSPIIEHYLSASSSDRQVIRMTINETPH------YNEQE---RKRIIDSYPLH 97 + P K S Y+ R V T + P + ++ ++ D+Y Sbjct: 439 SFNPPKSASNWANKYIKIPKDSRLVTESTYLDVPQKWLGKPFLDEAEFLKETNSDAY--- 495 Query: 98 EREARTKGEPILGSGRIFPIVEEDIV-INSLDIPEHWVQIGGMDFGW-HHPFAAGHLVWN 155 E G G +F I I +I + + G+D+GW P+A ++ Sbjct: 496 --ENEYMGVANGSGGSVFD--NVKIREITDDEIAQFDHVLNGVDWGWYPDPYAFTRSHYD 551 Query: 156 RDSDVIYVVKNYRCREQTPIFHVAALKSWG 185 +Y+ + Y C +Q+ L G Sbjct: 552 PARHTLYIWQEYTCNKQSNQQTAEKLIELG 581 >gi|168207201|ref|ZP_02633206.1| phage terminase, large subunit, pbsx family [Clostridium perfringens E str. JGS1987] gi|170661424|gb|EDT14107.1| phage terminase, large subunit, pbsx family [Clostridium perfringens E str. JGS1987] Length = 427 Score = 36.9 bits (84), Expect = 2.5, Method: Composition-based stats. 
Identities = 14/47 (29%), Positives = 22/47 (46%), Gaps = 1/47 (2%) Query: 137 GGMDFGWHHP-FAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALK 182 GMDFG+ A + + D V+Y+ Y R++T I +K Sbjct: 267 NGMDFGFVTSYNALLRMAIDHDKRVLYIYWEYYTRDKTDIEIAKDIK 313 >gi|312868635|ref|ZP_07728829.1| phage terminase, large subunit, PBSX family [Lactobacillus oris PB013-T2-3] gi|311095844|gb|EFQ54094.1| phage terminase, large subunit, PBSX family [Lactobacillus oris PB013-T2-3] Length = 409 Score = 36.9 bits (84), Expect = 2.8, Method: Composition-based stats. Identities = 25/147 (17%), Positives = 59/147 (40%), Gaps = 9/147 (6%) Query: 27 EDVYFEGLTRINATQGLVTLTLTPLKGRSPIIEHYLSASSSDRQVIRM--TINETPHYNE 84 + V+ E L+R + V P + + + Y+ D + + T+++ Sbjct: 138 QKVFSEILSRCSKAGSHVICDTNPDNPQHWLKKDYIDNDDPDDKTVTFCFTMDDNTFLAP 197 Query: 85 QERKRIIDSYPL---HEREARTKGEPILGSGRIF-PIVEEDIVINSLDIPEHWVQIGGMD 140 K+ P ++RE G + G G ++ + ++IN +P+ G+D Sbjct: 198 DYVKQKKAQTPTGMFYDREI--LGLWVSGDGIVYRDFDQRAMIINQEQLPDGLHIYCGVD 255 Query: 141 FGWHHPFAAGHLVWNRDSDVIYVVKNY 167 +G+ H + + + IY+++ + Sbjct: 256 WGFEH-KGVITVWGDDEDGNIYMLEEH 281 >gi|213018830|ref|ZP_03334638.1| phage uncharacterized protein [Wolbachia endosymbiont of Culex quinquefasciatus JHB] gi|212995781|gb|EEB56421.1| phage uncharacterized protein [Wolbachia endosymbiont of Culex quinquefasciatus JHB] Length = 367 Score = 36.5 bits (83), Expect = 3.7, Method: Composition-based stats. 
Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 4/65 (6%) Query: 149 AGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLS 208 + W + + Y++ YR + + P L +W P H L K SG+QL Sbjct: 225 SVCTTWTKVDNTFYLLDVYRAKLEYPKLKEKVLSLAARWKP----HAILIEAKASGQQLV 280 Query: 209 AQYRR 213 + R+ Sbjct: 281 QELRK 285 >gi|190571432|ref|YP_001975790.1| phage uncharacterized protein [Wolbachia endosymbiont of Culex quinquefasciatus Pel] gi|190357704|emb|CAQ55153.1| phage uncharacterized protein [Wolbachia endosymbiont of Culex quinquefasciatus Pel] Length = 465 Score = 36.5 bits (83), Expect = 3.7, Method: Composition-based stats. Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 4/65 (6%) Query: 149 AGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLS 208 + W + + Y++ YR + + P L +W P H L K SG+QL Sbjct: 323 SVCTTWTKVDNTFYLLDVYRAKLEYPKLKEKVLSLAARWKP----HAILIEAKASGQQLV 378 Query: 209 AQYRR 213 + R+ Sbjct: 379 QELRK 383 >gi|160899533|ref|YP_001565115.1| PBSX family phage terminase large subunit [Delftia acidovorans SPH-1] gi|160365117|gb|ABX36730.1| phage terminase, large subunit, PBSX family [Delftia acidovorans SPH-1] Length = 414 Score = 36.1 bits (82), Expect = 4.3, Method: Composition-based stats. 
Identities = 42/236 (17%), Positives = 83/236 (35%), Gaps = 28/236 (11%) Query: 20 WFDE-EPPEDVYFEGLTRINATQGL-VTLTLTPLKGRSPIIEHYLSASSSDRQVIRMTIN 77 W DE E +V ++ L QG + +T P K SP + + + +V+ + + Sbjct: 127 WVDEAESVSEVAWQKLAPTVREQGSEIWVTWNPEKDGSPTDKRFRKEPPPNSKVVELNYS 186 Query: 78 ETPHYNE-------QERKRIIDSYPLHEREA--RTKGEPILGSGRIFPIVEEDIVINSLD 128 + P + E +R R+ D + R + + SG+ + + E + Sbjct: 187 DNPWFPEVLDQERQADRDRLDDQTYAWVWDGAYRENSDAQILSGK-YRVAE-------FE 238 Query: 129 IPEHW-VQIGGMDFGWHH-PFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKSWGK 186 W G+D+G+ P A W D + + + + + + Sbjct: 239 PQPGWDGPYFGLDWGFSQDPTAGVKC-WVGDGRLWIEYEAGKVGLENDDIADYVI----Q 293 Query: 187 WLPWAWPHDGLQHDKR--SGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDML 240 LP H R + + ++ R + LP+ + VE GI+ + Sbjct: 294 RLPGIEQHTVRADSARPETISHVKSKGRDGKRQCLPKLEAVEKWKGSVEDGIAHLR 349 >gi|168698435|ref|ZP_02730712.1| hypothetical protein GobsU_02875 [Gemmata obscuriglobus UQM 2246] Length = 492 Score = 35.7 bits (81), Expect = 5.7, Method: Composition-based stats. Identities = 25/114 (21%), Positives = 42/114 (36%), Gaps = 13/114 (11%) Query: 143 WHHPFAAGHLVWNRDSDVIYVVKNYRCREQTPIFHVAALKS--WGKWLP--WAWPH---- 194 W FA ++RD+D E + AL+ WG+ P A P Sbjct: 62 WDDTFAKLFAYFDRDAD-----GALDATEAARLPSAFALRQVLWGQSTPFTGAAPPLADI 116 Query: 195 DGLQHDKRSGEQLSAQYRRQGMKMLPECATFDDGSNGVEAGISDMLDRMRSGRW 248 D K S ++L+ YRR G+ + ++ + + LD + G+ Sbjct: 117 DLNGDGKASPDELADFYRRAGLGGVLVGVGRAPATDALTDALLKHLDTNKDGKL 170 >gi|66396341|ref|YP_240671.1| ORF008 [Staphylococcus phage 88] gi|66396415|ref|YP_240743.1| ORF009 [Staphylococcus phage 92] gi|62636756|gb|AAX91867.1| ORF008 [Staphylococcus phage 88] gi|62636829|gb|AAX91940.1| ORF009 [Staphylococcus phage 92] Length = 421 Score = 35.3 bits (80), Expect = 7.5, Method: Composition-based stats. 
Identities = 35/197 (17%), Positives = 74/197 (37%), Gaps = 14/197 (7%) Query: 50 PLKGRSPIIEHYLSASSSDRQVIRMTIN-ETPHYNEQERKRIIDSYPLHEREARTKGE-P 107 P + +S + + Y S+ +D + + P ++Q + + +E+ R + Sbjct: 173 PKRKQSWVNKKYESSFQADNTYVHHSTYLNNPFISKQFIQEAESAKKRNEQRYRWEYMGE 232 Query: 108 ILGSGRIFPIVEEDIVINSLDIPEHWVQI-GGMDFGW-HHPFAAGHLVWNRDSDVIYVVK 165 +GSG + P I + + I +DFG+ P A +++ VIY + Sbjct: 233 AIGSGVV-PFNNLRIEEIPQRQYDTFDNIRNAVDFGYATDPLAFVRWHYDKKKRVIYAMD 291 Query: 166 NYRCREQTPIFHVAALKSWGKWLPWAWPHDGLQHDKRSGEQLSAQYRRQGMKMLPECATF 225 Y + + LK G + + +S +L ++ + +K + + A Sbjct: 292 EYYGVQISNREFANWLKKKGYQSDEIF---ADSAEPKSIAELKQEHGIKKIKGVKKGA-- 346 Query: 226 DDGSNGVEAGISDMLDR 242 + VE G + D Sbjct: 347 ----DSVEFGEQWLDDL 359 Database: nr Posted date: May 13, 2011 4:10 AM Number of letters in database: 999,999,932 Number of sequences in database: 2,987,209 Database: /data/usr2/db/fasta/nr.01 Posted date: May 13, 2011 4:17 AM Number of letters in database: 999,998,956 Number of sequences in database: 2,896,973 Database: /data/usr2/db/fasta/nr.02 Posted date: May 13, 2011 4:23 AM Number of letters in database: 999,999,979 Number of sequences in database: 2,907,862 Database: /data/usr2/db/fasta/nr.03 Posted date: May 13, 2011 4:29 AM Number of letters in database: 999,999,513 Number of sequences in database: 2,932,190 Database: /data/usr2/db/fasta/nr.04 Posted date: May 13, 2011 4:33 AM Number of letters in database: 792,586,372 Number of sequences in database: 2,260,650 Lambda K H 0.308 0.141 0.431 Lambda K H 0.267 0.0435 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 5,210,618,932 Number of Sequences: 13984884 Number of extensions: 224118666 Number of successful extensions: 467583 Number of sequences better than 10.0: 185 Number of HSP's better than 10.0 without gapping: 158 Number of HSP's successfully gapped in prelim test: 115 Number of HSP's that attempted gapping in prelim test: 467367 Number of HSP's gapped (non-prelim): 279 length of 
query: 251
length of database: 4,792,584,752
effective HSP length: 136
effective length of query: 115
effective length of database: 2,890,640,528
effective search space: 332423660720
effective search space used: 332423660720
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.0 bits)
S2: 80 (35.3 bits)
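
The footer figures above can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST report; it is a minimal illustration that assumes the standard Karlin-Altschul relations (effective lengths obtained by subtracting the effective HSP length, bit score S' = (lambda*S - ln K) / ln 2, and E = search space * 2^(-S')). It also assumes that the first "Lambda K H" row (0.308, 0.141) holds the ungapped parameters and the second row (0.267, 0.0435) the gapped ones. The function names are hypothetical.

```python
import math


def effective_search_space(query_len, db_len, num_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumes the effective HSP length is subtracted once from the query
    and once per database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_len - num_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Plain Karlin-Altschul E-value for a given bit score."""
    return search_space * 2.0 ** (-bits)


# Values copied from the report footer.
eff_q, eff_db, space = effective_search_space(
    query_len=251,
    db_len=4_792_584_752,
    num_seqs=13_984_884,
    hsp_len=136,
)
print(eff_q, eff_db, space)

# Gapped parameters (assumed to be the second Lambda/K row).
gapped_bits_91 = bit_score(91, lam=0.267, k=0.0435)
gapped_bits_s2 = bit_score(80, lam=0.267, k=0.0435)
# Ungapped parameters (assumed to be the first Lambda/K row).
ungapped_bits_s1 = bit_score(41, lam=0.308, k=0.141)
print(round(gapped_bits_91, 1), round(gapped_bits_s2, 1), round(ungapped_bits_s1, 1))

# Unadjusted E-value for a raw score of 91.
print(expect_value(gapped_bits_91, space))
```

Under these assumptions the computed effective lengths and search space reproduce the footer values, and the bit scores for raw scores 91, 80 (S2) and 41 (S1) round to the 39.6, 35.3 and 21.0 bits printed in the report. The unadjusted E-value for a raw score of 91 lands near the 0.36 to 0.46 range reported for those hits; exact agreement is not expected, because hits labelled "Composition-based stats" are rescaled per subject sequence.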