BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781215|ref|YP_003065628.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] (511 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781215|ref|YP_003065628.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] gi|254040892|gb|ACT57688.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] gi|317120680|gb|ADV02503.1| putative phage terminase large subunit [Liberibacter phage SC1] gi|317120824|gb|ADV02645.1| putative phage terminase large subunit [Candidatus Liberibacter asiaticus] Length = 511 Score = 1066 bits (2757), Expect = 0.0, Method: Compositional matrix adjust. Identities = 511/511 (100%), Positives = 511/511 (100%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME Sbjct: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 Query: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL Sbjct: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER Sbjct: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI Sbjct: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN Sbjct: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI Sbjct: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM Sbjct: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP Sbjct: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480 Query: 481 PRSDMDFGRCPSYQYEGVDLLIERRFEYDSR 511 PRSDMDFGRCPSYQYEGVDLLIERRFEYDSR Sbjct: 481 PRSDMDFGRCPSYQYEGVDLLIERRFEYDSR 511 >gi|315121940|ref|YP_004062429.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495342|gb|ADR51941.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 509 Score = 796 bits (2056), Expect = 0.0, Method: Compositional matrix adjust. Identities = 376/508 (74%), Positives = 428/508 (84%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 M+RELPT E EQ+L +LM+SD+IKLSF+NFVL FPW E T L FS PR WQL+FME Sbjct: 1 MTRELPTKIEHEQELMELMFSDDIKLSFTNFVLRLFPWSEANTSLANFSRPRRWQLDFME 60 Query: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 VD CL +V+NP+P++FKGA+SAGRGIGKTTLNAW++LWL+STRPG+S++CLANSETQL Sbjct: 61 AVDTDCLFNVDNPDPKIFKGAVSAGRGIGKTTLNAWMMLWLISTRPGMSILCLANSETQL 120 Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 K+TLWAEVSKWLS+LPNKHWFEMQSLSLHPA WY++ L + GIDSKHY+ CRTYSEER Sbjct: 121 KSTLWAEVSKWLSMLPNKHWFEMQSLSLHPAVWYAEALEKNFGIDSKHYTITCRTYSEER 180 Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 PDTFVGHHNTYGMAI NDEASGTPDVIN ILGF TE NANRFW+MTSNPRRL G FY+I Sbjct: 181 PDTFVGHHNTYGMAIFNDEASGTPDVINTSILGFFTENNANRFWVMTSNPRRLKGWFYDI 240 Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 FN PL+DW+RFQIDTRTVEGIDPSFHEGII+RYGLDSDVTRVEV GQFPQQDI+SFIP Sbjct: 241 FNVPLEDWQRFQIDTRTVEGIDPSFHEGIISRYGLDSDVTRVEVLGQFPQQDINSFIPFY 300 Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360 IEEALNREP DPYAPLIMGCDIA EGGDNTVVVLRRG IEH+FDWS + ++ KI Sbjct: 301 RIEEALNREPIKDPYAPLIMGCDIAGEGGDNTVVVLRRGTNIEHIFDWSGLAVNASSRKI 360 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 L+ KY+PDA+++DAN G +T YL GY V+ GQ RA D E RNRRTELHVKM Sbjct: 361 EELINKYKPDAVVVDANGIGVQTYYYLADEGYSVHAEKGQNRADDHESYRNRRTELHVKM 420 Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480 A+WLE AS+ NHSGLIQNLKSL+SFI PNTG+LA+ESKRVKGA STDYSD L YTFA +P Sbjct: 421 AEWLELASIPNHSGLIQNLKSLESFIEPNTGKLALESKRVKGAVSTDYSDALAYTFAVSP 480 Query: 481 PRSDMDFGRCPSYQYEGVDLLIERRFEY 508 RSDM+FGRC SYQYE +LL++RRF Y Sbjct: 481 ARSDMNFGRCRSYQYEADELLVDRRFSY 508 >gi|315122902|ref|YP_004063391.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496304|gb|ADR52903.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 509 Score = 790 bits (2041), Expect = 0.0, Method: Compositional matrix adjust. Identities = 373/508 (73%), Positives = 428/508 (84%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 M+RELPT E EQ+L +LM+SD+IKLSF+NFVL FPW E T L FS PR WQL+FME Sbjct: 1 MTRELPTKIEHEQELMELMFSDDIKLSFTNFVLRLFPWSEANTSLANFSRPRRWQLDFME 60 Query: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 VD CL +V+NP+P++FKGA+SAGRGIGKTTLNAW++LWL+STRPG+S++CLANSETQL Sbjct: 61 AVDTDCLFNVDNPDPKIFKGAVSAGRGIGKTTLNAWMMLWLISTRPGMSILCLANSETQL 120 Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 K+TLWAEVSKWLS+LPNKHWFEMQSLSLHPA WY++ L + GIDSKHY+ CRTYSEER Sbjct: 121 KSTLWAEVSKWLSMLPNKHWFEMQSLSLHPAVWYAEALEKNFGIDSKHYTITCRTYSEER 180 Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 PDTFVGHHNTYGMAI NDEASGTPDVIN ILGF TE NANRFW+MTSNPRRL+G FY+I Sbjct: 181 PDTFVGHHNTYGMAIFNDEASGTPDVINTSILGFFTENNANRFWVMTSNPRRLNGWFYDI 240 Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 FN PL+DW+RFQIDTRTVEGIDP+FHE IIARYGLDSDVTRVEV GQFPQQDI+SFIP Sbjct: 241 FNVPLEDWQRFQIDTRTVEGIDPNFHENIIARYGLDSDVTRVEVLGQFPQQDINSFIPFY 300 Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360 IEEALNREP DPYAPL+MGCDIA EGGDNTVVVLRRG IEH+FDWS + ++ KI Sbjct: 301 RIEEALNREPIKDPYAPLVMGCDIAGEGGDNTVVVLRRGTNIEHIFDWSGLAVNVSSRKI 360 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 L+ KY+PDA+++DAN G +T YL GY V+ GQ RA D E RNRRTELHVKM Sbjct: 361 EELINKYKPDAVVVDANGIGVQTYYYLADEGYSVHPEKGQNRADDHESYRNRRTELHVKM 420 Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480 A+WLE AS+ +HSGLIQNLKSL+SFI PNTG+LA+ESKRVKGA STDYSD L YTFA +P Sbjct: 421 AEWLELASIPHHSGLIQNLKSLESFIEPNTGKLALESKRVKGAVSTDYSDALAYTFAVSP 480 Query: 481 PRSDMDFGRCPSYQYEGVDLLIERRFEY 508 RSDM+FGRC SYQYE +LL++RRF Y Sbjct: 481 ARSDMNFGRCRSYQYEADELLVDRRFSY 508 >gi|317120722|gb|ADV02544.1| putative phage terminase large subunit [Liberibacter phage SC2] gi|317120783|gb|ADV02604.1| putative phage terminase large subunit [Candidatus Liberibacter asiaticus] Length = 516 Score = 778 bits (2009), Expect = 0.0, Method: Compositional matrix adjust. Identities = 396/512 (77%), Positives = 415/512 (81%), Gaps = 19/512 (3%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME Sbjct: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 Query: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL Sbjct: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER Sbjct: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI Sbjct: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP Sbjct: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPQQ 300 Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360 I EAL R PDPYAPLIMGCDIA EG D TVVVLRRG +IE +FDWS + TN KI Sbjct: 301 YIVEALERVAIPDPYAPLIMGCDIAGEGEDKTVVVLRRGNIIERIFDWSGELIEVTNRKI 360 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDY-LEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVK 419 S L+ +Y PDAI+ID N G Y L M V +LGQ+R+ + E N R EL+ Sbjct: 361 SSLINRYNPDAIVIDGNGIGGTVVSYLLNMHHISVEVILGQRRSTEPEQYHNLRAELYDL 420 Query: 420 MADWLEFASLI--NHSGLIQNLKSLKSFIVPNTGELAIESKRVK----GAKSTDYSDGLM 473 M + + + LI LKS+KS I G L IE KR G +S D+ D L Sbjct: 421 MRSAITGGLQLPDDCPDLINELKSIKS-ISDTLGRLLIEKKRQGRSEFGVRSPDFVDALC 479 Query: 474 YTFAENPPRSDMDFGRCPSYQ------YEGVD 499 YTFA +PPR D P YQ YE +D Sbjct: 480 YTFAVDPPRKD-----NPLYQGQDISEYEALD 506 >gi|254781187|ref|YP_003065600.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] gi|254040864|gb|ACT57660.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] Length = 367 Score = 545 bits (1403), Expect = e-152, Method: Compositional matrix adjust. Identities = 252/359 (70%), Positives = 299/359 (83%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 M R + T+ + EQ+L +++ E LSF NFV+ FFPWG KG PLE FS P WQLEFME Sbjct: 1 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 Query: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 VD HC ++VNN NP +FK AISAGRGIGKTTLNAW++LWL+STRPG+S+IC+ANSETQL Sbjct: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 K TLWAEVSKWLS+LP++HWFEMQSLSLHP+ WY+++L S+GIDSKHY+ CRTYSEER Sbjct: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 PDTFVG HNT+GMA+ NDEASGTPD+IN ILGF TE N NRFWIMTSN RRL+G FY+I Sbjct: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 FN PL+DWKR+QIDTRTVEGID FHEGII+RYGLDSDV R+E+ GQFPQQ++++FIP N Sbjct: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK 359 IEEA++RE D YAPLIMGCDIA EGGD TVVV RRG +IEH+FDWS ++ TN + Sbjct: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 >gi|302120432|gb|ADK92426.1| putative phage terminase large subunit [Candidatus Liberibacter asiaticus] Length = 255 Score = 529 bits (1362), Expect = e-148, Method: Compositional matrix adjust. Identities = 250/255 (98%), Positives = 254/255 (99%) Query: 88 IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147 IGKTTLNAWLVLWLMS RPG+S+ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS Sbjct: 1 IGKTTLNAWLVLWLMSIRPGMSIICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 60 Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 207 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI Sbjct: 61 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 120 Query: 208 NLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHE 267 NLGILGFLTE+NANRFWIMTSNPRRLSGKFYEIFN+PLDDWKRFQIDTRTVEGIDPSFHE Sbjct: 121 NLGILGFLTEQNANRFWIMTSNPRRLSGKFYEIFNRPLDDWKRFQIDTRTVEGIDPSFHE 180 Query: 268 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE 327 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE Sbjct: 181 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE 240 Query: 328 GGDNTVVVLRRGPVI 342 GGDNTVVVLRRGPVI Sbjct: 241 GGDNTVVVLRRGPVI 255 >gi|303328395|ref|ZP_07358832.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861389|gb|EFL84326.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 500 Score = 206 bits (525), Expect = 6e-51, Method: Compositional matrix adjust. Identities = 147/469 (31%), Positives = 213/469 (45%), Gaps = 38/469 (8%) Query: 30 NFVLHFFPWGEKGTPLEGF-SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGI 88 FVL FPWG G L + P WQ E + + S V + A+S+G G+ Sbjct: 31 GFVLFAFPWG--GGALADYPDGPDVWQREILRGMGEQL--STGASAASVIREAVSSGHGV 86 Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GK+ L AW++LW MST + AN+E QLK WAE++KW L +WF+ + +L Sbjct: 87 GKSALVAWIILWAMSTFSDTRGVVTANTENQLKGKTWAELAKWHRLCLCGYWFDCTATAL 146 Query: 149 ------HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT-YGMAIINDEAS 201 H W D++ +SE + F G HN + +I DEAS Sbjct: 147 ISTQAGHEKTWRVDMV----------------AWSERNTEAFAGLHNKGRRVLLIFDEAS 190 Query: 202 GTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGI 261 PD I G LT+ + W NP R +G+F E F + W ++D+RT Sbjct: 191 AIPDAIWEVSEGALTDADTEIIWCCFGNPTRNTGRFRECFGRYAHRWNTRRVDSRTAAMT 250 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY--APLI 319 D + + YG DSD RV V G+FP+ FI +I+ EA R PD Y AP I Sbjct: 251 DKNQLAQWVEDYGEDSDFVRVRVRGEFPRAGDRQFISSDIVHEARGRSLKPDQYSFAPRI 310 Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 +G D+A G D +V+ R+G + D T ++ ++ D I +D Sbjct: 311 LGVDVARSGSDQSVITRRQGLACLEQRKFRGLDTVTLAGIVAEECREWGADKIFVDGIGV 370 Query: 380 GARTCDYLEM---LGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL-EFASLINHSGL 435 GA D L LG+ V + A+ E NRR E+ M WL E ++ + + L Sbjct: 371 GAGVVDALRQVYGLGHLVVDAVAGATALQPERFLNRRAEMWTAMRKWLAEGGAVPDDAEL 430 Query: 436 IQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPP 481 + L L+ + V +G+L +ESK + +G S D +D L TF P Sbjct: 431 AEQLCGLE-YAVTVSGKLKLESKDDMKARGLTSPDCADALALTFYAPVP 478 >gi|268589373|ref|ZP_06123594.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] gi|291315400|gb|EFE55853.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] Length = 493 Score = 204 bits (520), Expect = 2e-50, Method: Compositional matrix adjust. Identities = 144/462 (31%), Positives = 216/462 (46%), Gaps = 36/462 (7%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 ++ L+ FPWGE GT LE + PR WQ E + + H N P + A ++G GIG Sbjct: 24 SYALYAFPWGEAGTELENANGPRQWQAEALNEIGEHLRNPETRHQP--LQLARASGHGIG 81 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL- 148 K+ + ++ W M T V+ AN+E QL+T W E++KW L K WF ++ Sbjct: 82 KSAFISMIIKWGMDTCEDCKVVVTANTENQLRTKTWPEIAKWQRLSITKDWFTYTKTAIY 141 Query: 149 -----HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI-INDEASG 202 H W +D + +SE + F G HN I I DEAS Sbjct: 142 SNDPNHANAWRADAV----------------PWSENNTEAFAGLHNQGKRIILIFDEASN 185 Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGID 262 D++ G LT+ N WI NP R +G+F E F K WK QID+RTVEG + Sbjct: 186 IADLVWEVAEGALTDENTEIIWIAFGNPTRNTGRFRECFRKFKHRWKTKQIDSRTVEGTN 245 Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIM 320 E I YG+D D +V V G FP FIP + + A+ R +AP+I+ Sbjct: 246 KEQIEKWIQDYGVDDDFVKVRVRGIFPSTSEKQFIPTGLTDAAMKRTVTQAEVSHAPIII 305 Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT-DLRTTNNKISGLVEKYRPDAIIID-ANN 378 G D A G D+ V+ LR+G + L+ SKT D +I+ ++Y DA+ ID Sbjct: 306 GVDPAYSGDDDAVIYLRQGLHSKCLWTGSKTIDDVIMAKRIADFEDQYGADAVHIDFGYG 365 Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQN 438 TG ++ + + + G + RN+R E++ + WL+ I+ + ++ Sbjct: 366 TGIQSVGMNWGRNWQLVQFNGASTDPQM---RNKRGEMYNNVKSWLKIGGAIDDQEVAED 422 Query: 439 LKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 L S + V +G++ +ESK + + +S D L TFA Sbjct: 423 L-STPEYKVELSGKILLESKDDIKKRIGRSPGKGDALALTFA 463 >gi|167032754|ref|YP_001667985.1| putative phage terminase large subunit [Pseudomonas putida GB-1] gi|166859242|gb|ABY97649.1| putative phage terminase, large subunit [Pseudomonas putida GB-1] Length = 499 Score = 202 bits (513), Expect = 1e-49, Method: Compositional matrix adjust. Identities = 142/465 (30%), Positives = 222/465 (47%), Gaps = 27/465 (5%) Query: 27 SFSN----FVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAI 82 SFS+ +VL+ FPWGE G L + PR WQ E +E + L + EV + A+ Sbjct: 20 SFSDDPLGYVLYAFPWGEAGGELANKTGPRKWQREVLESI-GEQLRAGAKDRGEVIREAV 78 Query: 83 SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFE 142 ++G GIGK+ L +W++ W + T + AN+E+QL+T W EV+KW L HWF+ Sbjct: 79 ASGHGIGKSALVSWVIKWALDTEVDTRGVVTANTESQLRTKTWPEVAKWNRLSITAHWFK 138 Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT-YGMAIINDEAS 201 + +L +D H K++ +S+ + F G HN + +I DEAS Sbjct: 139 LTGTALIS----TDPDH------EKNWRIDAVPWSDTNTEAFAGLHNEGKRILLIFDEAS 188 Query: 202 GTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGI 261 D++ G LT+ + W NP R SG+F E F K W+ Q+D+RTV+G Sbjct: 189 AIADLVWEVAEGALTDADTEIIWAAFGNPTRNSGRFRECFTKFKHRWRHRQVDSRTVDGT 248 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMG 321 + + IA YG DSD R+ V G FP+ IP + + EA+ R+ L+ G Sbjct: 249 NKTQIAKWIADYGEDSDFVRIRVRGMFPRASDLQLIPTDWVAEAMRRDGVYGLDDALVCG 308 Query: 322 CDIAEEGGDNTVVVLRRGPVIEHL--FDWSKTDLRTTN---NKISGLVEKYRPDAIIIDA 376 DIA G DN V+ RRG + + ++ R T K+ LV ++RPDA+ +D+ Sbjct: 309 IDIARGGMDNNVIRFRRGMDAKSIKPIKIPGSETRNTTPFIAKVCTLVVEHRPDAVFVDS 368 Query: 377 NNTGARTCDYLEML--GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434 G D L L G + V +A D + N RT + +M + ++ I Sbjct: 369 TGVGGPVADQLRRLLPGVMIIDVNFASQAPDRHYA-NMRTYIWWRMREAIKLGLAIESDT 427 Query: 435 LIQNLKSLKSFIVPNTGELAIESKRVKGAK---STDYSDGLMYTF 476 ++ + + ++ ++A+E K+ + S D D L TF Sbjct: 428 ELETELTSPEYDHNSSDQIALEKKKDIKKRLGISPDDGDALALTF 472 >gi|212710820|ref|ZP_03318948.1| hypothetical protein PROVALCAL_01888 [Providencia alcalifaciens DSM 30120] gi|212686517|gb|EEB46045.1| hypothetical protein PROVALCAL_01888 [Providencia alcalifaciens DSM 30120] Length = 493 Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust. Identities = 142/462 (30%), Positives = 214/462 (46%), Gaps = 36/462 (7%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 ++ L+ FPWGE GT LE S PR WQ E + + H N P + A ++G GIG Sbjct: 24 SYALYAFPWGEAGTELENASGPRQWQAEALNEIGEHLRNPETRHQP--LQLARASGHGIG 81 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL- 148 K+ + ++ W M T V+ AN+E QL+T W E++KW L K WF ++ Sbjct: 82 KSAFISMIIKWGMDTCEDCKVVVTANTENQLRTKTWPEIAKWQRLSITKDWFTCTKTAIY 141 Query: 149 -----HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI-INDEASG 202 H W +D + +SE + F G HN I + DEAS Sbjct: 142 SNDPNHANAWRADAV----------------PWSENNTEAFAGLHNQGKRIILVFDEASN 185 Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGID 262 D++ G LT+ N WI NP R +G+F E F K WK QID+RTVEG + Sbjct: 186 IADLVWEVAEGALTDENTEIIWIAFGNPTRNTGRFRECFRKFKHRWKTKQIDSRTVEGTN 245 Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIM 320 E I YG+D D +V V G FP FIP + + A+ R +AP+I+ Sbjct: 246 KEQIEKWIQDYGVDDDFVKVRVRGIFPSTSEKQFIPTGLTDAAMKRTVTQAEVSHAPIIL 305 Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT-DLRTTNNKISGLVEKYRPDAIIID-ANN 378 G D A G D+ V+ LR+G + L+ SKT D +I+ ++Y DA+ ID Sbjct: 306 GVDPAYSGDDDAVIYLRQGLHSKCLWTGSKTIDDVIMAKRIADYEDQYGADAVHIDFGYG 365 Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQN 438 TG ++ + + G ++ N+R E++ + WL+ I+ + + Sbjct: 366 TGIQSVGMNWGRNWQLVSFNGASTDPQMQ---NKRGEMYNNVKSWLKIGGAIDDQEVADD 422 Query: 439 LKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 L S + V +G++ +E K + + +S + D L TFA Sbjct: 423 L-STPEYKVQLSGKILLEKKEDIKKRIGRSPNKGDALALTFA 463 >gi|290968649|ref|ZP_06560187.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str. 28L] gi|290781302|gb|EFD93892.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str. 28L] Length = 487 Score = 201 bits (511), Expect = 2e-49, Method: Compositional matrix adjust. Identities = 143/463 (30%), Positives = 230/463 (49%), Gaps = 45/463 (9%) Query: 31 FVLHFFPWGEKGTPLEGFSAPRSWQLEFM-EVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 FV F W + L+G P++WQ++ + EV + L++ + A ++G GIG Sbjct: 22 FVYFAFDWDSE--ELKG-QNPQTWQIKTLKEVGEGLSLSTA-------LQHATASGHGIG 71 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL- 148 K+ L AWL+LW +STRP + AN+ TQL+T WAE+SKW L K +F + S ++ Sbjct: 72 KSALVAWLILWAISTRPDTRGVVTANTATQLETKTWAELSKWYHLFRGKKFFTLTSTAIF 131 Query: 149 -----HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASG 202 H W D + S+ +R ++F G HN + +I DEAS Sbjct: 132 CRQEGHERTWRIDAIPWSV----------------DRTESFAGLHNQGNRLLLIFDEASA 175 Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGID 262 + I G LT+++ W++ NP R +G+F++ F+K W +ID+RTV+ + Sbjct: 176 IDNKIWEVAEGALTDKDTEILWLVFGNPTRSTGRFFDCFHKYKKSWITQKIDSRTVDISN 235 Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP---YAPLI 319 + + I YG+DSD +V V G+FP FI I+ A R P +AP I Sbjct: 236 KTQLQKWIQTYGIDSDFVKVRVLGEFPDTSDTQFISTAIVRTAWERRPLRTAEYDFAPCI 295 Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLR-TTNNKISGLVEKYRPDAIIIDANN 378 +G D A GGD+TV+ LR+G E L ++ + D +++ +KY DA+ ID Sbjct: 296 IGMDPAWTGGDSTVIFLRQGFFSEKLAEYKQNDNDGVMAARLAEFEDKYHADAVFID-KG 354 Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINH-SGLIQ 437 G + +G +R++ + N+R E+ M +WL+ +I GLI+ Sbjct: 355 YGTGIYSFGVTMGRQ-WRLVSFAEKSGAQAYANKRAEMWGNMKEWLQEGGVIPQVDGLIE 413 Query: 438 NLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 L + ++FI GE+ +E K + +G +S + +D L TFA Sbjct: 414 ELTAPQAFINAR-GEIQLEKKEDMKKRGIESPNMADALALTFA 455 >gi|323156136|gb|EFZ42295.1| terminase large subunit [Escherichia coli EPECa14] Length = 491 Score = 196 bits (497), Expect = 9e-48, Method: Compositional matrix adjust. Identities = 138/456 (30%), Positives = 217/456 (47%), Gaps = 24/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE+GT L + PR WQ + + H N P + A+++G GIG Sbjct: 25 GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPATRYQPLML--ALASGHGIG 82 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 83 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208 +D+ H K + +SE + F G HN + ++ DEAS D++ Sbjct: 143 S----NDLGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326 + YG DSD ++ V G FP FIP + +EA+ R YAP+I+G D A Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAYAPVIIGVDPAY 312 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384 G D+ V+ LR+G + L+ +K TD +I+ ++Y+ DA+ ID TG ++ Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + G V + D + N+R E+ WL +++ +L S Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + V G++ IE K + + +S D L+ TFA Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464 >gi|304398406|ref|ZP_07380280.1| terminase, large subunit [Pantoea sp. aB] gi|304354272|gb|EFM18645.1| terminase, large subunit [Pantoea sp. aB] Length = 490 Score = 193 bits (490), Expect = 6e-47, Method: Compositional matrix adjust. Identities = 136/456 (29%), Positives = 215/456 (47%), Gaps = 23/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE+GT L PR WQ + + + AH N P + A +G GIG Sbjct: 24 GYALYAFPWGEEGTDLAYSKGPRQWQEDAFKQIGAHLQNPDTRHQPLMIGRA--SGHGIG 81 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + LV W M T V+ AN+E QL+T W E++KW L + WF + +++ Sbjct: 82 KSAFISMLVKWGMDTCEDCKVVVTANTENQLRTKTWPEIAKWQRLSITQDWFTCTATAIY 141 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI-INDEASGTPDVIN 208 +D H +K + +SE + F G HN I I DEAS D++ Sbjct: 142 S----NDPSH------AKSWRADAIPWSENNTEAFAGLHNERKRIILIFDEASNIADLVW 191 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ N W+ NP R +G+F E F K WK QID+R+VEG + + Sbjct: 192 EVAEGALTDENTEIIWVAFGNPTRNTGRFRECFRKLRHRWKTAQIDSRSVEGTNKEQIQK 251 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD--PYAPLIMGCDIAE 326 + YG DSD +V V G FP FIP + + A+ R P +A ++G D A Sbjct: 252 WVDDYGEDSDFVKVRVRGLFPSASEAQFIPTGLTDAAVGRVITPGQVAHAATVIGVDPAH 311 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI-SGLVEKYRPDAIIID-ANNTGARTC 384 +GGD V+ LR+G + L ++ +T KI + ++YR DA+ ID TG ++ Sbjct: 312 QGGDPAVIYLRQGLHTKKLGEYQRTTDDVLFAKIVASFEDEYRADAVFIDYGYGTGLKSV 371 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + + + G + D + N+R E++ + WL+ ++ + + L + + Sbjct: 372 GDNWGRNWQLIQFGGG--STDPQMA-NKRGEMYNAVKTWLKDGGQLDSQQVAEELSAAEY 428 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + + +E K + + KS + +D L TFA Sbjct: 429 KVRLKDSRIVLEDKTSIKERLGKSPNDADALALTFA 464 >gi|320175050|gb|EFW50163.1| terminase B protein, putative [Shigella dysenteriae CDC 74-1112] Length = 480 Score = 191 bits (485), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 137/456 (30%), Positives = 216/456 (47%), Gaps = 24/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE+GT L + PR WQ + + H N P + A ++G GIG Sbjct: 14 GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 71 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 72 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 131 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208 +D H K + +SE + F G HN + ++ DEAS D++ Sbjct: 132 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 181 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 182 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRTVEGTNKQQLQK 241 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326 + YG DSD ++ V G FP FIP + +EA+ R +AP+I+G D A Sbjct: 242 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 301 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384 G D+ V+ LR+G + L+ +K TD +I+ ++Y+ DA+ ID TG ++ Sbjct: 302 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 361 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + G V + D + N+R E+ + WL +++ +L S Sbjct: 362 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFISCKTWLRLGGMLDDQETADDL-SAAE 417 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + V G++ IE K + + +S D L+ TFA Sbjct: 418 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 453 >gi|332344357|gb|AEE57691.1| terminase, large subunit [Escherichia coli UMNK88] Length = 491 Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust. Identities = 137/456 (30%), Positives = 215/456 (47%), Gaps = 24/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE+GT L + PR WQ + + H N P + A ++G GIG Sbjct: 25 GYALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQNPATRHQPLML--ARASGHGIG 82 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 83 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208 +D H K + +SE + F G HN + ++ DEAS D++ Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326 + YG DSD ++ V G FP FIP + +EA+ R +AP+I+G D A Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384 G D+ V+ LR+G + L+ +K TD +I+ ++Y+ DA+ ID TG ++ Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + G V + D + N+R E+ WL +++ +L S Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + V G++ IE K + + +S D L+ TFA Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464 >gi|327252187|gb|EGE63859.1| terminase large subunit [Escherichia coli STEC_7v] Length = 491 Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust. Identities = 137/456 (30%), Positives = 215/456 (47%), Gaps = 24/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE+GT L + PR WQ + + H N P + A ++G GIG Sbjct: 25 GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPATRYQPLML--ARASGHGIG 82 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 83 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208 +D H K + +SE + F G HN + ++ DEAS D++ Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326 + YG DSD ++ V G FP FIP + +EA+ R +AP+I+G D A Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384 G D+ V+ LR+G + L+ +K TD +I+ ++Y+ DA+ ID TG ++ Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + G V + D + N+R E+ WL +++ +L S Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + V G++ IE K + + +S D L+ TFA Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464 >gi|324008564|gb|EGB77783.1| hypothetical protein HMPREF9532_01752 [Escherichia coli MS 57-2] Length = 491 Score = 191 bits (484), Expect = 3e-46, Method: Compositional matrix adjust. Identities = 137/456 (30%), Positives = 216/456 (47%), Gaps = 24/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE+GT L + PR WQ + + H N P + A ++G GIG Sbjct: 25 GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 83 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208 +D+ H K + +SE + F G HN + ++ DEAS D++ Sbjct: 143 S----NDLGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326 + YG DSD ++ V G FP FIP + +EA+ R +AP+I+G D A Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384 G D+ V+ LR+G + L+ +K TD +I+ ++Y+ DA+ ID TG ++ Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + G V + D + N+R E+ WL +++ +L S Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + V G++ IE K + + +S D L+ TFA Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464 >gi|300898423|ref|ZP_07116764.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357890|gb|EFJ73760.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 491 Score = 190 bits (482), Expect = 6e-46, Method: Compositional matrix adjust. Identities = 137/456 (30%), Positives = 215/456 (47%), Gaps = 24/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE+GT L + PR WQ + + H N P + A ++G GIG Sbjct: 25 GYALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 83 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208 +D H K + +SE + F G HN + ++ DEAS D++ Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326 + YG DSD ++ V G FP FIP + +EA+ R +AP+I+G D A Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384 G D+ V+ LR+G + L+ +K TD +I+ ++Y+ DA+ ID TG ++ Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + G V + D + N+R E+ WL +++ +L S Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + V G++ IE K + + +S D L+ TFA Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464 >gi|309702815|emb|CBJ02146.1| putative terminase, large subunit [Escherichia coli ETEC H10407] Length = 493 Score = 189 bits (481), Expect = 6e-46, Method: Compositional matrix adjust. Identities = 133/472 (28%), Positives = 225/472 (47%), Gaps = 23/472 (4%) Query: 31 FVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGK 90 + L+ FPWGE+GT L + PR WQ + + H N P + A ++G GIGK Sbjct: 26 YALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQNPATRHQPIML--ARASGHGIGK 83 Query: 91 TTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHP 150 + + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 84 SAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKEWFTCTATAMYS 143 Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVINL 209 +D H K + +SE + F G HN + ++ DEAS D++ Sbjct: 144 ----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVWE 193 Query: 210 GILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGI 269 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRTVEGTNKEQLQKW 253 Query: 270 IARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAEE 327 + YG DSD +V V G FP + FIP + + A+ R P +A +++G D + + Sbjct: 254 VDDYGEDSDFVKVRVRGIFPDASENQFIPSGLTQPAVGRVITPAQVQHAAVVLGVDPSHQ 313 Query: 328 GGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTCD 385 G D V+ LR+G + L +W + TD I+ ++Y+ DA+ ID TG ++ Sbjct: 314 GKDPAVIYLRQGLHCKKLGEWQRTTDDVLFAKVIADFEDQYQADAVFIDYGYGTGLKSVG 373 Query: 386 YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSF 445 + G + ++ D E N+R E++ D L+ + ++ L L + + Sbjct: 374 --DNWGRNWTLIMFGSGTADPEMG-NKRGEMYKSARDALKLGAQLDSQELADELSAPEYK 430 Query: 446 I-VPNTGELAIESKRVKG--AKSTDYSDGLMYTFAENPPRSDMDFGRCPSYQ 494 + + ++ ++ + VK +S + +D + T+A + ++G+ S Q Sbjct: 431 VRLKDSRKILQDKDEVKELLGRSPNNADAYVLTYAAPVTKKQFNYGQQQSQQ 482 >gi|298381721|ref|ZP_06991320.1| terminase large subunit protein [Escherichia coli FVEC1302] gi|301019339|ref|ZP_07183525.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|298279163|gb|EFI20677.1| terminase large subunit protein [Escherichia coli FVEC1302] gi|299882256|gb|EFI90467.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|323948690|gb|EGB44595.1| hypothetical protein ERKG_04913 [Escherichia coli H252] Length = 491 Score = 189 bits (481), Expect = 6e-46, Method: Compositional matrix adjust. Identities = 137/456 (30%), Positives = 215/456 (47%), Gaps = 24/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE+GT L + PR WQ + + H N P + A ++G GIG Sbjct: 25 GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 83 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208 +D H K + +SE + F G HN + ++ DEAS D++ Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326 + YG DSD ++ V G FP FIP + +EA+ R +AP+I+G D A Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384 G D+ V+ LR+G + L+ +K TD +I+ ++Y+ DA+ ID TG ++ Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + G V + D + N+R E+ WL +++ +L S Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + V G++ IE K + + +S D L+ TFA Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464 >gi|218700994|ref|YP_002408623.1| putative phage terminase, large subunit [Escherichia coli IAI39] gi|218370980|emb|CAR18807.1| putative phage terminase, large subunit [Escherichia coli IAI39] Length = 491 Score = 189 bits (481), Expect = 6e-46, Method: Compositional matrix adjust. Identities = 137/456 (30%), Positives = 215/456 (47%), Gaps = 24/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE+GT L + PR WQ + + H N P + A ++G GIG Sbjct: 25 GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 83 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208 +D H K + +SE + F G HN + ++ DEAS D++ Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326 + YG DSD ++ V G FP FIP + +EA+ R +AP+I+G D A Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384 G D+ V+ LR+G + L+ +K TD +I+ ++Y+ DA+ ID TG ++ Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + G V + D + N+R E+ WL +++ +L S Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + V G++ IE K + + +S D L+ TFA Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464 >gi|294491573|gb|ADE90329.1| putative phage terminase, large subunit [Escherichia coli IHE3034] Length = 491 Score = 189 bits (481), Expect = 7e-46, Method: Compositional matrix adjust. Identities = 137/456 (30%), Positives = 215/456 (47%), Gaps = 24/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE+GT L + PR WQ + + H N P + A ++G GIG Sbjct: 25 GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 83 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208 +D H K + +SE + F G HN + ++ DEAS D++ Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326 + YG DSD ++ V G FP FIP + +EA+ R +AP+I+G D A Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384 G D+ V+ LR+G + L+ +K TD +I+ ++Y+ DA+ ID TG ++ Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + G V + D + N+R E+ WL +++ +L S Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-STAE 428 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + V G++ IE K + + +S D L+ TFA Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464 >gi|301046412|ref|ZP_07193572.1| conserved hypothetical protein [Escherichia coli MS 185-1] gi|300301638|gb|EFJ58023.1| conserved hypothetical protein [Escherichia coli MS 185-1] Length = 491 Score = 189 bits (480), Expect = 9e-46, Method: Compositional matrix adjust. Identities = 137/456 (30%), Positives = 214/456 (46%), Gaps = 24/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE GT L + PR WQ + + H N P + A ++G GIG Sbjct: 25 GYALYAFPWGEDGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 83 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208 +D H K + +SE + F G HN + ++ DEAS D++ Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326 + YG DSD ++ V G FP FIP + +EA+ R +AP+I+G D A Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384 G D+ V+ LR+G + L+ +K TD +I+ ++Y+ DA+ ID TG ++ Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + G V + D + N+R E+ WL +++ +L S Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + V G++ IE K + + +S D L+ TFA Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464 >gi|330007152|ref|ZP_08305894.1| hypothetical protein HMPREF9538_03583 [Klebsiella sp. MS 92-3] gi|328535499|gb|EGF61959.1| hypothetical protein HMPREF9538_03583 [Klebsiella sp. MS 92-3] Length = 495 Score = 189 bits (480), Expect = 9e-46, Method: Compositional matrix adjust. Identities = 138/456 (30%), Positives = 212/456 (46%), Gaps = 24/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE GT L S PR WQ + + H N P + A +G GIG Sbjct: 29 GYALYAFPWGEDGTELAHASGPRQWQADAFREIGEHLQNPATRHQPLMISRA--SGHGIG 86 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 87 KSAFISMLINWAMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKEWFTCTATAMY 146 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208 +D H K + +SE + F G HN + ++ DEAS D++ Sbjct: 147 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIVVVFDEASNIADLVW 196 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 197 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRTVEGTNKQQLQK 256 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326 + YG DSD +V V G FP FIP + +EA+ R +AP I+G D A Sbjct: 257 WVDDYGEDSDFVKVRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPRIIGVDPAY 316 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384 G D+ V+ LR+G + L+ +K TD +I+ ++Y+ DA+ ID TG ++ Sbjct: 317 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 376 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + G V + D + N+R E+ WL+ ++ +L S Sbjct: 377 G--DGWGRTWQLVPFGGASADPQML-NKRGEMFNACKTWLKLGGALDDQETADDL-SAAE 432 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + V G++ +E K + + +S D L+ TFA Sbjct: 433 YKVRVDGKIVMEPKEDIKERLGRSPGKGDALLLTFA 468 >gi|215487825|ref|YP_002330256.1| predicted terminase, large subunit [Escherichia coli O127:H6 str. E2348/69] gi|215265897|emb|CAS10306.1| predicted terminase, large subunit [Escherichia coli O127:H6 str. E2348/69] Length = 493 Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust. Identities = 133/472 (28%), Positives = 224/472 (47%), Gaps = 23/472 (4%) Query: 31 FVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGK 90 + L+ FPWGE GT L + PR WQ + + H N P + A ++G GIGK Sbjct: 26 YALYAFPWGEDGTELAHATGPRKWQADAFREIRDHLQNPATRHQPLML--ARASGHGIGK 83 Query: 91 TTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHP 150 + + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 84 SAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKEWFTCTATAMYS 143 Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVINL 209 +D H K + +SE + F G HN + ++ DEAS D++ Sbjct: 144 ----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVWE 193 Query: 210 GILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGI 269 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRTVEGTNKQQLQKW 253 Query: 270 IARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAEE 327 + YG DSD +V V G FP + FIP + + A+ R P +A +++G D + + Sbjct: 254 VDDYGEDSDFVKVRVRGIFPDASENQFIPSGLTQPAVGRVITPAQVQHAAVVLGVDPSHQ 313 Query: 328 GGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK-ISGLVEKYRPDAIIID-ANNTGARTCD 385 G D V+ LR+G + L +W +T K I+ ++Y+ DA+ ID TG ++ Sbjct: 314 GKDPAVIYLRQGLHCKKLGEWQRTTDDVLFAKIIADFEDQYQADAVFIDYGYGTGLKSVG 373 Query: 386 YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSF 445 + G + + D E N+R E++ D L+ + ++ L L + + Sbjct: 374 --DNWGRNWTLIQFGSGTADPEMG-NKRGEMYKSARDALKLGAQLDSQNLADELSAPEYK 430 Query: 446 I-VPNTGELAIESKRVKG--AKSTDYSDGLMYTFAENPPRSDMDFGRCPSYQ 494 + + ++ ++ + + VK +S + +D + T+A + ++G+ S Q Sbjct: 431 VRLKDSRKILQDKEEVKELLGRSPNDADAYVLTYAAPVTKKQFNYGQQQSQQ 482 >gi|331648179|ref|ZP_08349269.1| conserved hypothetical protein [Escherichia coli M605] gi|331043039|gb|EGI15179.1| conserved hypothetical protein [Escherichia coli M605] Length = 491 Score = 189 bits (479), Expect = 1e-45, Method: Compositional matrix adjust. Identities = 137/456 (30%), Positives = 215/456 (47%), Gaps = 24/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE+GT L + PR WQ + + H N P + A ++G GIG Sbjct: 25 GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 83 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208 +D H K + +SE + F G HN + ++ DEAS D++ Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326 + YG DSD ++ V G FP FIP + +EA+ R +AP+I+G D A Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384 G D+ V+ LR+G + L+ +K TD +I+ ++Y+ DA+ ID TG ++ Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + G V + D + N+R E+ WL +++ +L S Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNACKIWLRLGGMLDDQETADDL-SAAE 428 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + V G++ IE K + + +S D L+ TFA Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464 >gi|117624715|ref|YP_853628.1| putative phage terminase, large subunit [Escherichia coli APEC O1] gi|115513839|gb|ABJ01914.1| putative phage terminase, large subunit [Escherichia coli APEC O1] Length = 491 Score = 188 bits (478), Expect = 1e-45, Method: Compositional matrix adjust. Identities = 136/456 (29%), Positives = 215/456 (47%), Gaps = 24/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE+GT L + PR WQ + + H N P + A ++G GIG Sbjct: 25 GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 83 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208 +D H K + +SE + F G HN + ++ DEAS D++ Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326 + YG DSD ++ V G FP FIP + +EA+ R ++P+I+G D A Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHSPVIIGVDPAY 312 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384 G D+ V+ LR+G + L+ +K TD +I+ ++Y+ DA+ ID TG ++ Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + G V + D + N+R E+ WL +++ +L S Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + V G++ IE K + + +S D L+ TFA Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464 >gi|30387381|ref|NP_848210.1| terminase large subunit [Enterobacteria phage epsilon15] gi|30266036|gb|AAO06065.1| terminase large subunit [Salmonella phage epsilon15] Length = 491 Score = 187 bits (474), Expect = 4e-45, Method: Compositional matrix adjust. Identities = 136/456 (29%), Positives = 214/456 (46%), Gaps = 24/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE GT L + PR WQ + + H N P + A ++G GIG Sbjct: 25 GYALYAFPWGEDGTELAHATGPRKWQADAFREIRDHLQNPATRHQPLML--ARASGHGIG 82 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 83 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKEWFTCTATAMY 142 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208 +D H K + +SE + F G HN + ++ DEAS D++ Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRTVEGTNKQQLQK 252 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326 + YG +SD +V V G FP FIP + +EA+ R +AP+I+G D A Sbjct: 253 WVDDYGEESDFVKVRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384 G D+ V+ LR+G + L+ +K TD +I+ ++Y+ DA+ ID TG ++ Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + + G + D + N+R E+ WL+ ++ +L S Sbjct: 373 GDGWGRTWQLIPFGGG--STDPQML-NKRGEMFNSCKTWLKLGGALDDQETADDL-SAAE 428 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + V G++ IE K + + +S D L+ TFA Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464 >gi|89152423|ref|YP_512256.1| putative terminase large subunit [Escherichia phage phiV10] gi|74055446|gb|AAZ95895.1| putative terminase large subunit [Escherichia phage phiV10] Length = 491 Score = 187 bits (474), Expect = 4e-45, Method: Compositional matrix adjust. Identities = 136/456 (29%), Positives = 214/456 (46%), Gaps = 24/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE+GT L + PR WQ + + H N P + A ++G GIG Sbjct: 25 GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 83 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208 +D H K + +SE + F G HN + ++ DEAS D++ Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326 + YG SD ++ V G FP FIP + +EA+ R +AP+I+G D A Sbjct: 253 WVDDYGEGSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384 G D+ V+ LR+G + L+ +K TD +I+ ++Y+ DA+ ID TG ++ Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + G V + D + N+R E+ WL +++ +L S Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-STAE 428 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + V G++ IE K + + +S D L+ TFA Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464 >gi|262043569|ref|ZP_06016682.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039103|gb|EEW40261.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 491 Score = 187 bits (474), Expect = 5e-45, Method: Compositional matrix adjust. Identities = 137/456 (30%), Positives = 212/456 (46%), Gaps = 24/456 (5%) Query: 30 NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + L+ FPWGE GT L + PR WQ + + H N P + A ++G GIG Sbjct: 25 GYALYAFPWGEDGTELAHATGPRKWQADAFREIRDHLQNPATRHQPLML--ARASGHGIG 82 Query: 90 KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149 K+ + L+ W MST V+ AN++ QL+T W E+ KW +L K WF + +++ Sbjct: 83 KSAFISMLINWAMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKEWFTCTATAMY 142 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208 +D H K + +SE + F G HN + ++ DEAS D++ Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIVVVFDEASNIADLVW 192 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268 G LT+ + W+ NP R +G+F E F K WK QID+RTVEG + + Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRTVEGTNKQQLQK 252 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326 + YG DSD +V V G FP FIP + +EA+ R +AP I+G D A Sbjct: 253 WVDDYGEDSDFVKVRVRGIFPDASELQFIPTGLTDEAMKRVVTAVQVAHAPRIIGVDPAY 312 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384 G D+ V+ LR+G + L+ +K TD +I+ ++Y DA+ ID TG ++ Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYLADAVFIDFGYGTGLKSI 372 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 + G V + D + N+R E+ WL+ ++ +L S Sbjct: 373 G--DGWGRTWQLVPFGGASADPQML-NKRGEMFNACKTWLKLGGALDDQETADDL-SAAE 428 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + V G++ +E K + + +S D L+ TFA Sbjct: 429 YKVRVDGKIVMEPKEDIKERLGRSPGKGDALLLTFA 464 >gi|227355862|ref|ZP_03840255.1| phage terminase, large subunit [Proteus mirabilis ATCC 29906] gi|227164181|gb|EEI49078.1| phage terminase, large subunit [Proteus mirabilis ATCC 29906] Length = 494 Score = 183 bits (464), Expect = 7e-44, Method: Compositional matrix adjust. Identities = 143/501 (28%), Positives = 223/501 (44%), Gaps = 39/501 (7%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 MS L +PE EQ + D+ L ++ + FPWGE G LE ++ PR WQ E + Sbjct: 1 MSEALQKSPE-EQLIEDIASFTHDPLGYAYYA---FPWGEAGGELEEYNGPRQWQAEALN 56 Query: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 + H N P + A ++G GIGK+ + ++ W M T V+ AN+E QL Sbjct: 57 EIGEHLRNPKTRHQPLLL--ARASGHGIGKSAFISMIIKWGMDTCEDCKVVVTANTENQL 114 Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSL------HPAPWYSDVLHCSLGIDSKHYSTMCR 174 +T W E++KW L +WF ++ H W +D + Sbjct: 115 RTKTWPEIAKWQRLSLTNNWFTCTKTAIYSNDPNHANAWRADAV---------------- 158 Query: 175 TYSEERPDTFVGHHNTYGMAI-INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRL 233 +SE + F G HN I + DEAS D++ G LT+ WI NP R Sbjct: 159 PWSENNTEAFAGLHNKGKRIILVFDEASNIADLVWEVAEGALTDEGTEIIWIAFGNPTRN 218 Query: 234 SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI 293 +G+F E F K W QID+RTVEG + + YG DSD +V V G FP Sbjct: 219 TGRFRECFRKFKHRWNTKQIDSRTVEGSNKEQIKNWEEDYGEDSDFFKVRVRGVFPSASE 278 Query: 294 DSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLF-DWSK 350 FIP + +EA+ R +AP+I+G D A G D+ V+ LR+G + L+ + Sbjct: 279 LQFIPTGLTDEAMKRIVTQAEVAHAPVIIGVDPAYSGIDDAVIYLRQGLFSKCLWTGFKT 338 Query: 351 TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409 TD +I+ ++Y+ DA+ ID TG + + V+R++ A Sbjct: 339 TDDVVMAKRIADFEDQYKADAVHIDFGYGTGIHS---IGTSWGRVWRLVKFGGASTDPQM 395 Query: 410 RNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKST 466 N+R E++ + WL+ I+ +L + + ++ +E K + + +S Sbjct: 396 LNKRGEMYNSVKTWLKIGGAIDDQETADDLSCGEYKVRVIDSKIVLEDKTEIKKRLGRSP 455 Query: 467 DYSDGLMYTFAENPPRSDMDF 487 D L TFA + D ++ Sbjct: 456 GKGDALALTFAYPVTKIDRNY 476 >gi|282848875|ref|ZP_06258265.1| conserved hypothetical protein [Veillonella parvula ATCC 17745] gi|282581380|gb|EFB86773.1| conserved hypothetical protein [Veillonella parvula ATCC 17745] Length = 483 Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust. Identities = 140/475 (29%), Positives = 217/475 (45%), Gaps = 62/475 (13%) Query: 31 FVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGK 90 FV +PWGE GTPLE P WQ++ ++ + + + A+++G GIGK Sbjct: 21 FVYFAYPWGEPGTPLENMEGPDEWQIQILKDIGEQLKKGKDLQT--AIQEAVASGHGIGK 78 Query: 91 TTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHP 150 + L +WL+ + +ST + AN+E QL+T W E+SKW ++ K F + ++ Sbjct: 79 SALISWLIHFAISTHENTRGVVTANTEGQLRTKTWPELSKWHNMFIAKDLFTYTATAIFS 138 Query: 151 APWYSDVLHCSLGIDSKHYSTMCRT----YSEERPDTFVGHHNTYG-MAIINDEASGTPD 205 + K Y R +S+ P++F G HN + ++ DEAS D Sbjct: 139 S--------------DKDYEKTWRIDAIPWSKNSPESFAGLHNQGNRILVLFDEASAIDD 184 Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSF 265 VI G LT+ N W NP R SG+F E F K W +QID+RTV+ + + Sbjct: 185 VIWEVTEGALTDANTEIIWCAFGNPTRNSGRFRECFRKYRKFWNTYQIDSRTVKISNKTK 244 Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCD 323 E + YG DSD +V V G FP FI I ++A + +P + P+I+G D Sbjct: 245 IEEWLEAYGEDSDFFKVRVRGVFPSASDLQFISTEIADKAQKQVYKPGQFEHLPVIIGVD 304 Query: 324 IAEEGGDNTVVVLRRGPVIEHLF-------DWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 A G D+ +V+R+G ++ L DW L I+ ++Y+ DA+ ID Sbjct: 305 PAWTGSDSLEIVMRQGYYMKSLASIPKNDDDWRMAQL------IAQFEDEYKADAVFIDM 358 Query: 377 N-NTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCR--------NRRTELHVKMADWLEFA 427 TG Y + + LG+K + +EF N R + +M +WL Sbjct: 359 GYGTGI----------YSIGKQLGRKWRL-IEFGGKSNDPVYLNMRAYMWGQMKEWLREG 407 Query: 428 SLI--NHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 I N L ++ ++ I N G + +ESK + +G S + D L TFA Sbjct: 408 GSIPPNDQALYDDIVGPEAIIDKN-GRIQLESKKDMKDRGLPSPNKGDALALTFA 461 >gi|54302246|ref|YP_132239.1| terminase large subunit [Photobacterium profundum SS9] gi|46915667|emb|CAG22439.1| hypothetical protein PBPRB0566 [Photobacterium profundum SS9] Length = 513 Score = 164 bits (414), Expect = 4e-38, Method: Compositional matrix adjust. Identities = 125/446 (28%), Positives = 202/446 (45%), Gaps = 27/446 (6%) Query: 37 PWGEKGTPLEGFSAPRSWQLE----FMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTT 92 PW K + G P +W E EV+ + N V+ + F +IS+G GIGK+ Sbjct: 48 PWASKYDSVYG---PDAWFCEMCDQLQEVIRKNDFNGVDPV--DAFLYSISSGHGIGKSC 102 Query: 93 LNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAP 152 ++WL+ ++MSTRP + +N+ QL+T W E+ KW L NKHWF + + Sbjct: 103 ASSWLIHFVMSTRPNSKGVVTSNTSEQLRTKTWGELGKWTKKLINKHWFVYNNGKGNMNF 162 Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-IINDEASGTPDVINLGI 211 ++ D ++ + +T EE ++F G H + DEAS PD I Sbjct: 163 YHKDY--------AETWRVDAQTCREENSESFAGLHCASSTPWYLFDEASAVPDKIWEVA 214 Query: 212 LGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIA 271 G LT+ FW + NP R SG+F E + + W R QID+ TV+ + + Sbjct: 215 EGGLTD--GEPFWFVFGNPTRNSGRFRECWRRFRQRWNRKQIDSSTVQVTNKKKISEWES 272 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN 331 YG DSD RV V G FP + I ++E A++R P +P +M D+A GGDN Sbjct: 273 DYGEDSDFYRVRVKGVFPSASSNQKISGALLEAAMSRTAHVIPGSPRVMSLDVARGGGDN 332 Query: 332 TVVVLRRG--PVIEHLFDWSKTDLRTTNNKISGLVE---KYRPDAIIIDANNTGARTCDY 386 V R G + ++ R + + V+ +++PDA ID G D Sbjct: 333 CVFRFRHGLNGGVRKKVTLPGSEYRDSMKLAAMAVQLCSEFKPDAFFIDETGVGGPVGDR 392 Query: 387 LEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINH-SGLIQNLKSLKSF 445 + LG++ + +A D + N R ++ + +WL+ +++ GL+ + +++ Sbjct: 393 IRQLGFNCIGINFASKAPDPHYA-NMRAYMYHQWGEWLKAGGSLHYDEGLLTEVGAIEYT 451 Query: 446 IVPNTGELAIESKRVKGAKSTDYSDG 471 E+ I +K A DG Sbjct: 452 HDRKDREILIPKDVIKKAIGISTDDG 477 >gi|332981151|ref|YP_004462592.1| hypothetical protein Mahau_0567 [Mahella australiensis 50-1 BON] gi|332698829|gb|AEE95770.1| hypothetical protein Mahau_0567 [Mahella australiensis 50-1 BON] Length = 461 Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust. Identities = 140/452 (30%), Positives = 206/452 (45%), Gaps = 58/452 (12%) Query: 49 SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108 + P WQ E ++ + NP V A+ +G G+GKT L AW +LW + TRP Sbjct: 25 AEPDDWQAETLQAL---------ADNPRV---AVRSGHGVGKTALEAWALLWFLFTRPYP 72 Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPN-KHWFEMQSLSL----HPAPWYSDVLHCSLG 163 + C A + QL LWAE SKWL P K +FE Q + +P W++ Sbjct: 73 KIPCTAPTREQLHDILWAEASKWLERAPALKPYFEWQKTRIVQKQYPGRWFA-------- 124 Query: 164 IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRF 223 RT + +P+ G H + + II DEASG D I I G LT +A Sbjct: 125 --------TARTSN--KPENMAGFHEEHLLFII-DEASGIADNIFETIEGALTTSDAK-- 171 Query: 224 WIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283 +M NP + SG F++ F K + ++ + + + E + +Y DSDV RV Sbjct: 172 LLMCGNPTKNSGVFHDAFFKDRSLYWTRKVSCLDSQRVTLEYAERLKRKYHEDSDVYRVR 231 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIE 343 V G+FP+ + D+FI L+I+E A R+ PD L +G D+A G D TV+ R G + Sbjct: 232 VLGEFPKAEPDTFISLDIVEAATMRDVEPD--GVLEIGVDVARFGDDETVLAARAGLKLV 289 Query: 344 HLFDWSKTDLRTTNNKISGLVEKY-----RPDAII-IDANNTGARTCDYL------EMLG 391 +L ++K D TT L + +P I ID + G D E L Sbjct: 290 YLKAYTKQDTMTTAGYAIALAKDLMKECGKPKCTIKIDDDGVGGGVTDRCREVVREEKLY 349 Query: 392 YHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL--EFASLINHSGLIQNLKSLKSFIVPN 449 V D E N TE + D L E A LIN LI L + K + + + Sbjct: 350 IDVIDCHNGGAPEDKEHYENWGTEAWAYLRDLLQDEQAELINDEDLIGQLTTRK-YRITS 408 Query: 450 TGELAIESK---RVKGAKSTDYSDGLMYTFAE 478 G++A+ESK + +G S D +D ++ +A+ Sbjct: 409 KGKIALESKDEMKRRGLMSPDRADAVVLAYAK 440 >gi|228968731|ref|ZP_04129698.1| hypothetical protein bthur0004_54930 [Bacillus thuringiensis serovar sotto str. T04001] gi|228790961|gb|EEM38595.1| hypothetical protein bthur0004_54930 [Bacillus thuringiensis serovar sotto str. T04001] Length = 459 Score = 157 bits (398), Expect = 3e-36, Method: Compositional matrix adjust. Identities = 137/492 (27%), Positives = 223/492 (45%), Gaps = 77/492 (15%) Query: 14 KLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNP 73 ++ D+ W D + +F+ +L F+P WQ + + ++ +P Sbjct: 2 EIIDVYWDDPV--AFAEDMLGFYP--------------DEWQRKVL-------MDLAQSP 38 Query: 74 NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133 K ++ +G+G+GKT L + +V+W + RP VIC A ++ QL T LWAE++KWL Sbjct: 39 -----KVSVRSGQGVGKTGLESVVVIWFLCCRPNPKVICTAPTKEQLFTVLWAEIAKWLE 93 Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193 K+ + ++ +G + + ++T RT + +P+ G H Y M Sbjct: 94 GSAVKNLLKWTKTRVY-----------MIGSEERWFAT-ARTAT--KPENMQGFHEDY-M 138 Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 + DEASG D I ILG L+ A + NP R SG FY+ N+ D +K ++ Sbjct: 139 LFVCDEASGIADPIMEAILGTLS--GAENKLFLCGNPTRTSGVFYDSHNRDRDLYKIHKV 196 Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313 + E + +YG SDV RV V G+FP+ + D+FIPL I+E+A + + P Sbjct: 197 SSLDSPRTSKDNIEVLKKKYGEGSDVWRVRVLGEFPKAEADAFIPLEIVEQAASCKVEPT 256 Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY------ 367 L +G D+A G D TV+ R G + L + K D T + L ++Y Sbjct: 257 GET-LDLGVDVARFGDDETVIAPRIGNKVFKLLNHYKQDTMETAGHVLKLAKEYMAKYKQ 315 Query: 368 --RPDAIIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVK 419 R D I +D + G D L E L + VY V+ + +D E N TE Sbjct: 316 LKRVD-IKVDDSGVGGGVTDRLKEVIKSERLPFKVYPVVNNGKPLDDEHYDNAGTEGWAV 374 Query: 420 MADWLE------------FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAK 464 + D LE + N +I S K + + + G++A+E K + +G + Sbjct: 375 VRDLLEENMKAFIQGEEPTMEIPNDEKMISQFSSRK-YRITSRGKIALERKEEMKKRGLQ 433 Query: 465 STDYSDGLMYTF 476 S D +D ++ F Sbjct: 434 SPDRADAIVLAF 445 >gi|228911519|ref|ZP_04075310.1| hypothetical protein bthur0013_56490 [Bacillus thuringiensis IBL 200] gi|228848128|gb|EEM92991.1| hypothetical protein bthur0013_56490 [Bacillus thuringiensis IBL 200] Length = 459 Score = 156 bits (394), Expect = 8e-36, Method: Compositional matrix adjust. Identities = 136/492 (27%), Positives = 222/492 (45%), Gaps = 77/492 (15%) Query: 14 KLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNP 73 ++ D+ W D + +F+ +L F+P WQ + + ++ +P Sbjct: 2 EIIDVYWDDPV--AFAEDMLGFYP--------------DEWQRKVL-------MDLAQSP 38 Query: 74 NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133 K ++ +G+G+GKT L + +V+W + RP VIC A ++ QL T LWAE++KWL Sbjct: 39 -----KVSVRSGQGVGKTGLESVVVIWFLCCRPNPKVICTAPTKEQLFTVLWAEIAKWLE 93 Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193 K+ + ++ +G + + ++T RT + +P+ G H Y M Sbjct: 94 GSAVKNLLKWTKTRVY-----------MIGSEERWFAT-ARTAT--KPENMQGFHEDY-M 138 Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 + DEASG D I ILG L+ A + NP R SG FY+ N+ D +K ++ Sbjct: 139 LFVCDEASGIADPIMEAILGTLS--GAENKLFLCGNPTRTSGVFYDSHNRDRDLYKIHKV 196 Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313 + E + +YG SDV RV V G+FP+ + D+FIPL I+E+A + + P Sbjct: 197 SSLDSPRTSKDNIEVLKKKYGEGSDVWRVRVLGEFPKAEADAFIPLEIVEQAASCKVEPT 256 Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY------ 367 L +G D+A G D TV+ R G + L + K D T + L ++Y Sbjct: 257 GET-LDLGVDVARFGDDETVIAPRIGNKVFKLLNHYKQDTMETAGHVLKLAKEYMAKYKQ 315 Query: 368 --RPDAIIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVK 419 R D I +D + G D L E L + VY V+ + +D E N E Sbjct: 316 LKRVD-IKVDDSGVGGGVTDRLKEVIKSERLPFKVYPVVNNGKPLDDEHYDNAGAEGWAV 374 Query: 420 MADWLE------------FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAK 464 + D LE + N +I S K + + + G++A+E K + +G + Sbjct: 375 VRDLLEENMKAFIQGEEPTMEIPNDEKMISQFSSRK-YRITSRGKIALERKEEMKKRGLQ 433 Query: 465 STDYSDGLMYTF 476 S D +D ++ F Sbjct: 434 SPDRADAIVLAF 445 >gi|332976102|gb|EGK12970.1| hypothetical protein HMPREF9374_1123 [Desmospora sp. 8437] Length = 462 Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 129/427 (30%), Positives = 197/427 (46%), Gaps = 55/427 (12%) Query: 81 AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140 A+ AG G+GKT AW VLW + TRP + C A ++ QL LW E++KWL Sbjct: 51 AVRAGHGVGKTATEAWAVLWFLLTRPFPKIPCTAPTKPQLMDVLWPEIAKWL-------- 102 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 M + L P + + + ++T RT + +P+ G H + + +I DEA Sbjct: 103 --MNAPELAPYVEWQKTRVVMKQYEERWFAT-ARTSN--KPENMAGFHEEHLLFVI-DEA 156 Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEG 260 SG + I I G LT A +M NP R +G FY+ F++ D + ++I + Sbjct: 157 SGVDNAIFETIDGALT--TAGSKLVMFGNPTRTNGVFYDAFHQDRDLYWTYKISCLDSKM 214 Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIM 320 + + +YG DSD+ RV V G+FPQ D DSFIPL ++E+A R+ L + Sbjct: 215 ASKDYARNMARKYGEDSDIYRVRVQGEFPQGDPDSFIPLELVEDARVRDLEWIDEDELHI 274 Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG--------LVEKYRPDAI 372 G D+A G D TV+ R GPV F + RT + G L+E++R D Sbjct: 275 GVDVARFGSDETVLAARIGPVA---FRLDRYGGRTPTTETVGRVLALARELMEEHRRDYA 331 Query: 373 IIDANNT--GARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELH--VKMADW----- 423 ++ ++T G D L+ + V + +D+ C N T H DW Sbjct: 332 VVKVDDTGVGGGVTDQLQEI------VAEEGLNIDVIPCNNGATPEHDPDHYHDWGTESW 385 Query: 424 ---------LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDG 471 E A I+ LI L + K + + G++ +ESK + +G +S D +D Sbjct: 386 GTLLDRFKAGEIALKIDDEDLIGQLTTRKKEMT-SKGKIKLESKEKMKKRGQRSPDRADA 444 Query: 472 LMYTFAE 478 L+ FAE Sbjct: 445 LVLAFAE 451 >gi|209901239|ref|YP_002290878.1| putative terminase B [Clostridium phage phiCD27] gi|199612120|gb|ACH91293.1| putative terminase B [Clostridium phage phiCD27] Length = 469 Score = 153 bits (386), Expect = 7e-35, Method: Compositional matrix adjust. Identities = 134/437 (30%), Positives = 202/437 (46%), Gaps = 62/437 (14%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138 K +I +G+G+GKT L + +W +STRP V+ A + QL LWAE++KWLS + Sbjct: 44 KVSIRSGQGVGKTGLESIATVWYLSTRPFPKVVATAPTRQQLYDVLWAEIAKWLSNSKVE 103 Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198 E W ++ G + + ++T RT +P+ G H Y M + D Sbjct: 104 KLLE----------WTKTKVYMK-GFEERWWAT-ARTAV--KPENMQGFHEDY-MLFVVD 148 Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT--- 255 EASG D I ILG L+ A ++ NP R SG FY+ N+ D +K F++ + Sbjct: 149 EASGVADPIMEAILGTLS--GAENKLLLCGNPTRTSGTFYDSHNRDRDLYKTFKVSSLDS 206 Query: 256 -RT----VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 RT +E + +HEG SD RV V G+FP+ + DS I L +E + RE Sbjct: 207 PRTSKDNIEMLKRKYHEG--------SDPWRVRVLGEFPKGESDSLISLEAVETSTIREV 258 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYR-- 368 L +G DIA G D T++ R G + L +SK D T I V+K++ Sbjct: 259 NISNDYILNIGADIARYGDDETIIAPRIGGKVFDLLTYSKKDTMETVGNILRAVDKFKNM 318 Query: 369 -----PDAIIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDL--------EFC 409 I D + GA D L E L Y V + A++ E Sbjct: 319 YHQINRVKIKTDDDGLGAGVTDRLKEVIRHERLKYEVIPIQNGSSAIEKDKYYNKASEMW 378 Query: 410 RNRRTELHVKMADWLE----FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKG 462 N R EL ++ +++ L N LI+ L + K + V + G++ IESK + + Sbjct: 379 DNMREELDANLSSFIQNKEAIIQLPNDDKLIKQLSNRK-YTVDSKGKIQIESKKEMKKRI 437 Query: 463 AKSTDYSDGLMYTFAEN 479 +S D +D ++Y+FAEN Sbjct: 438 GESPDRADAVIYSFAEN 454 >gi|150016512|ref|YP_001308766.1| hypothetical protein Cbei_1636 [Clostridium beijerinckii NCIMB 8052] gi|149902977|gb|ABR33810.1| conserved hypothetical protein [Clostridium beijerinckii NCIMB 8052] Length = 470 Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 132/438 (30%), Positives = 203/438 (46%), Gaps = 63/438 (14%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138 K ++ +G+G+GKT L + +V W + TRP VI A + QL LWAE+SKWL+ + Sbjct: 44 KVSVRSGQGVGKTGLESIVVTWYLCTRPFPKVIATAPTRQQLYDVLWAEISKWLASSKIE 103 Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198 + E ++ + S+ + +T + RP+ G H Y M + D Sbjct: 104 NLLEWTKTKIYMKGY------------SERWWATAKTAT--RPENMQGFHEDY-MLFVVD 148 Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT--- 255 EASG D I ILG LT N+ +M NP R SG FY+ N+ D +K F++ + Sbjct: 149 EASGVADPIMEAILGTLTGYE-NKL-LMCGNPTRTSGTFYDSHNRDRDLYKTFKVSSLES 206 Query: 256 -RT----VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA-LNRE 309 RT +E + +HEG SDV RV V G+FP+ + DS I L E A + + Sbjct: 207 PRTSKDNIEMLKRKYHEG--------SDVWRVRVEGEFPKGESDSLISLEYAETATITKI 258 Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369 L +G DIA G D +V+ R G + L ++K D T I +K++ Sbjct: 259 NNIHNNFTLHIGADIARFGNDESVIAPRIGNKVFDLLTYTKKDTMETTGNILRATDKFKN 318 Query: 370 D-------AIIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDLEFCRNRRTEL 416 + I +D + G D L E LGY V + +A D E ++ E+ Sbjct: 319 EYKHINKVKIRVDDDGLGGGVTDRLREVIRQEGLGYEVMPIKNGSKANDEEHYSDKSAEM 378 Query: 417 HVKMADWLE--FASLI----------NHSGLIQNLKSLKSFIVPNTGELAIESK---RVK 461 M D LE F + + N+ LI+ L + K F + + G + +E K + + Sbjct: 379 WGNMRDILEENFTNFVQGKEPTIELPNNDKLIKQLSNRK-FRIDSKGRIDLEKKEEMKKR 437 Query: 462 GAKSTDYSDGLMYTFAEN 479 +S D +D ++Y+FAEN Sbjct: 438 IGESPDLADAVIYSFAEN 455 >gi|150390341|ref|YP_001320390.1| hypothetical protein Amet_2579 [Alkaliphilus metalliredigens QYMF] gi|149950203|gb|ABR48731.1| conserved hypothetical protein [Alkaliphilus metalliredigens QYMF] Length = 469 Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust. Identities = 127/428 (29%), Positives = 195/428 (45%), Gaps = 44/428 (10%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138 K ++ +G+G+GKT L + + W + TRP VI A + QL LWAE+SKWLS Sbjct: 44 KVSVRSGQGVGKTGLESIAITWYLCTRPFPKVIATAPTRQQLYDVLWAEISKWLS----- 98 Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198 +S W ++ + G + + ++T RT RP+ G H Y M + D Sbjct: 99 -----KSKVDKLLRWTKTKIYMN-GFEERWWAT-ARTAV--RPENMQGFHEDY-MLFVVD 148 Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTV 258 EASG D I ILG LT N+ ++ NP + SG FY+ N+ D +K ++ + Sbjct: 149 EASGVADPIMEAILGTLTGYE-NKL-LLCGNPTKTSGTFYDSHNRDRDTYKSHKVSSMDS 206 Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL 318 E + +YG DSDV RV V G FP+ + DS I L + E+A L Sbjct: 207 PRTSKENIEMLKKKYGADSDVFRVRVLGDFPKGEADSLISLEVTEQAAETVVDISNAYTL 266 Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD-------A 371 +G DIA G D T++ R G + L +SK D T I V++ + Sbjct: 267 NIGADIARFGDDKTIIAPRIGNRVLDLQQYSKKDTMETAGNILRTVDRLKTQHLQINKIV 326 Query: 372 IIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE 425 I ID + G D L + LGY + + +A D E N+ E+ + + L+ Sbjct: 327 IKIDDDGLGGGVTDRLREINRQQSLGYIIVPIKNGSKADDPEHYYNKAAEMWDNIRELLD 386 Query: 426 ---FASLINHSGLIQNLK--------SLKSFIVPNTGELAIESK---RVKGAKSTDYSDG 471 L G+IQ K S + + V + G + +ESK + + +S D +D Sbjct: 387 ENLSKFLQGEPGVIQLPKDDILIKQLSNRKYKVDSKGRIELESKDEMKRRIGESPDRADA 446 Query: 472 LMYTFAEN 479 ++Y+FA + Sbjct: 447 VIYSFASD 454 >gi|153810665|ref|ZP_01963333.1| hypothetical protein RUMOBE_01049 [Ruminococcus obeum ATCC 29174] gi|149833061|gb|EDM88143.1| hypothetical protein RUMOBE_01049 [Ruminococcus obeum ATCC 29174] Length = 469 Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 96/284 (33%), Positives = 142/284 (50%), Gaps = 17/284 (5%) Query: 81 AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140 ++ +G GIGK+ + AW V+W M T P + C A ++ QL LWAE+SKW N Sbjct: 44 SVRSGHGIGKSAVEAWSVIWFMCTHPYPKIPCTAPTQHQLFDILWAEISKWKR---NNKT 100 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 + + + W + L+ + ++ + + RT S PD G H + + II DEA Sbjct: 101 LDSELI------WTKEKLY--MKGHAEEWFAVARTAST--PDALQGFHAEHMLYII-DEA 149 Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEG 260 SG D I +LG L+ A +M NP +LSG FY+ NK + + F ID R Sbjct: 150 SGVEDKIFEPVLGALSTPGAK--LLMCGNPTQLSGFFYDSHNKNREQYSTFHIDGRNSTR 207 Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI- 319 + F + II YG DSDV RV V G FP + D +IPL ++E+++ E P + +I Sbjct: 208 VSQEFVQTIINMYGEDSDVFRVRVAGDFPLAEDDIYIPLPLVEKSIATEYFPRRHPQIIH 267 Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL 363 +GCD+A G D TV+ R ++ D T + I L Sbjct: 268 IGCDVARFGTDKTVIGYRTDEKVQFFKKRVGQDTMKTADDIVSL 311 >gi|228950291|ref|ZP_04112468.1| hypothetical protein bthur0007_63570 [Bacillus thuringiensis serovar monterrey BGSC 4AJ1] gi|228809453|gb|EEM55897.1| hypothetical protein bthur0007_63570 [Bacillus thuringiensis serovar monterrey BGSC 4AJ1] Length = 495 Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust. Identities = 126/470 (26%), Positives = 200/470 (42%), Gaps = 73/470 (15%) Query: 51 PRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISV 110 P WQ E + + H SV +G+G+GKT + +W+ +W + RP + Sbjct: 41 PDPWQKEVLNDIANHSHVSVR------------SGQGVGKTAMESWICIWFLCCRPYPKI 88 Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170 IC A ++ QL LWAE++KWL+ K + W ++ G + + ++ Sbjct: 89 ICTAPTKQQLYDVLWAEIAKWLNSSQVKDLLK----------WTKTKIYMK-GFEDRWFA 137 Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230 T + RP+ G H Y M I DEASG D I ILG L+ F M NP Sbjct: 138 T---AKTATRPENMQGFHEDY-MLFIADEASGIADDIMEAILGTLSGSENKLF--MCGNP 191 Query: 231 RRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290 + SG F++ NK +K ++ + E + +YG SDV RV V G+FP+ Sbjct: 192 TKTSGVFFDSHNKDRALYKSHKVSSADSPRTSKKNIEMLKKKYGEGSDVYRVRVEGEFPR 251 Query: 291 QDIDSFIPLNIIEEALNREP------------------CPDPYAPLIMGCDIAEEGGDNT 332 + D+FI L E A RE PD A + +GCD+A G D T Sbjct: 252 GEADAFISLETAEAARMREVYKVEVIENEEEESTVKEIIPDT-AVVEIGCDVARFGSDET 310 Query: 333 VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY--------RPDAIIIDANNTGARTC 384 ++ RRG + L + D + + +KY + I ID G Sbjct: 311 IIATRRGWKVLPLQVHHQRDTMYVSGLLVQEAKKYFSWCERTGKRIPIRIDDTGVGGGVT 370 Query: 385 DYL-EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA--------DWLEFASLINHSGL 435 D L E++ + Y + + + F E ++ + LEF +L + L Sbjct: 371 DRLKEVVAENDYPI----DVIPINFASKGNAEYACIVSVMYGHFKDNCLEFVALPDDEDL 426 Query: 436 IQNLKSLKSFIVPNTGELAIESKRV---KGAKSTDYSDGLMYTFAENPPR 482 I L S++ + + + G + IE K+ +G KS D ++ ++ FA P+ Sbjct: 427 IAQL-SVRKYQINSDGRIKIEPKKAMKDRGLKSPDRAEAVVMAFAPFYPK 475 >gi|257883493|ref|ZP_05663146.1| conserved hypothetical protein [Enterococcus faecium 1,231,502] gi|294614775|ref|ZP_06694675.1| hypothetical protein EfmE1636_0865 [Enterococcus faecium E1636] gi|294622490|ref|ZP_06701512.1| conserved hypothetical protein [Enterococcus faecium U0317] gi|257819151|gb|EEV46479.1| conserved hypothetical protein [Enterococcus faecium 1,231,502] gi|291592387|gb|EFF23996.1| hypothetical protein EfmE1636_0865 [Enterococcus faecium E1636] gi|291598037|gb|EFF29147.1| conserved hypothetical protein [Enterococcus faecium U0317] Length = 471 Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 131/478 (27%), Positives = 218/478 (45%), Gaps = 49/478 (10%) Query: 34 HFFPWGEKGTPLEGF-SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTT 92 F P+ + G+ ++ + P ++ + + + +V N E K ++ +G+G+GKT Sbjct: 4 EFIPFADIGSAIDYYYDKPVAFCQDILHLNPDEWQENVLNDLAEFSKVSVRSGQGVGKTA 63 Query: 93 LNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAP 152 L A +LW ++ RP VI A + QL LWAEV+KWL+ SL + Sbjct: 64 LEAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLN----------DSLIKNLLK 113 Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGIL 212 W ++ +G DS+ + RT + +P+ G H + M I+ DEASG D I IL Sbjct: 114 WTKTKIYM-VG-DSERWFATARTAT--KPENMQGFHEDH-MLIVVDEASGVSDPIMEAIL 168 Query: 213 GFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR 272 G L+ + +M NP + G FY+ N D ++ ++ + + + E I+ + Sbjct: 169 GTLS--GFDNKLLMCGNPNNIEGVFYDSHNSDRDKYRVHKVSSYDSKRTNKDNIEMILKK 226 Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE---PCPDPYAPLIMGCDIAEEGG 329 YG +SDV RV + G+FP+ +DSFI L +E A ++ + +G D+A G Sbjct: 227 YGKESDVARVRIFGEFPKGALDSFISLETVELATEKQISDSLVNKTTVAHIGVDVARYGD 286 Query: 330 DNTVVVLRRGPVIEHLFDWSK-TDLRTTN---NKISGLVEKY-RPDAIIIDANNT--GAR 382 D+T++ R +SK + + TT N L+ +Y D ++I ++T G Sbjct: 287 DSTILFPRIATRALEYEKYSKRSTMETTGYVINMAKNLMSQYPSIDKVMIKVDDTGVGGG 346 Query: 383 TCDYLEML---GYHVYRVLGQKRAVDLE--FCRNRRTELHVKMADWLE------------ 425 D LE L ++ + V G E F N T+L + + LE Sbjct: 347 VTDRLEELIEDKHYPFEVFGVNNGSTSEDDFYDNLGTQLWGNIKEMLEENMTANLNGEQP 406 Query: 426 FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L + S LI+ L S + F + + + +ESK + + S D +D L F E P Sbjct: 407 VIELPSDSSLIKEL-STRKFKMTSRSRIRLESKDDMKKRNIGSPDIADALALAFYEPP 463 >gi|282598712|ref|YP_003358792.1| putative phage terminase B protein [Enterococcus phage phiEf11] gi|300860603|ref|ZP_07106690.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11] gi|307292389|ref|ZP_07572245.1| hypothetical protein HMPREF9509_02682 [Enterococcus faecalis TX0411] gi|258598082|gb|ACV83339.1| putative phage terminase B protein [Enterococcus phage phiEf11] gi|300849642|gb|EFK77392.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11] gi|306496518|gb|EFM66079.1| hypothetical protein HMPREF9509_02682 [Enterococcus faecalis TX0411] gi|315146097|gb|EFT90113.1| conserved hypothetical protein [Enterococcus faecalis TX2141] Length = 484 Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 123/431 (28%), Positives = 198/431 (45%), Gaps = 50/431 (11%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138 K ++ +G+G+GKT L A +LW ++ RP VI A + QL LWAEV+KWL+ Sbjct: 50 KVSVRSGQGVGKTALEAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLN----- 104 Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198 SL W ++ +G DS+ + RT + +P+ G H + M I+ D Sbjct: 105 -----NSLIKDLLKWTKTKIYM-VG-DSERWFATARTAT--KPENMQGFHEDH-MLIVVD 154 Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTV 258 EASG D I ILG L+ + +M NP + G FY+ N D ++ ++ + Sbjct: 155 EASGVADPIMEAILGTLS--GFDNKLLMCGNPNNIEGVFYDSHNTDRDKYRTHKVSSYDS 212 Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL 318 + + + +I +YG +SDV RV + G+FP+ +DSFI L I+E A + + Sbjct: 213 KRTNKENIQMLIDKYGENSDVARVRIYGEFPKGALDSFISLEIVEFAKDINISDSELKHV 272 Query: 319 I---MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA---- 371 +G D+A G D+T+V R G +SK D T ++ ++ D Sbjct: 273 REGHIGVDVARFGDDSTIVFPRIGAKALPFEKYSKQDTMQTTGRVLKAAKRMMDDYPTIK 332 Query: 372 ---IIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422 I +D G D L E L Y V V + + D ++ N+ T++ + + Sbjct: 333 KVFIKVDDTGVGGGVTDRLKEVISDEKLPYEVIPVNNGESSTD-DYYANKGTQIWGDVKE 391 Query: 423 WLE--FASLINHSG----------LIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTD 467 LE ++ IN G LI+ L S + F + + G++ +ESK + + S D Sbjct: 392 LLEQNISNSINGQGPTIELPDNANLIKEL-STRKFKMTSNGKIRLESKEDMKKRNVGSPD 450 Query: 468 YSDGLMYTFAE 478 +D L F E Sbjct: 451 IADALTLAFYE 461 >gi|261208032|ref|ZP_05922709.1| conserved hypothetical protein [Enterococcus faecium TC 6] gi|289567088|ref|ZP_06447483.1| conserved hypothetical protein [Enterococcus faecium D344SRF] gi|260077749|gb|EEW65463.1| conserved hypothetical protein [Enterococcus faecium TC 6] gi|289161103|gb|EFD09008.1| conserved hypothetical protein [Enterococcus faecium D344SRF] Length = 471 Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 131/478 (27%), Positives = 217/478 (45%), Gaps = 49/478 (10%) Query: 34 HFFPWGEKGTPLEGF-SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTT 92 F P+ + G ++ + P ++ + + + +V N E K ++ +G+G+GKT Sbjct: 4 EFIPFADIGAAIDYYYDKPVAFCQDILHLNPDEWQENVLNDLAEFSKVSVRSGQGVGKTA 63 Query: 93 LNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAP 152 L A +LW ++ RP VI A + QL LWAEV+KWL+ SL + Sbjct: 64 LEAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLN----------DSLIKNLLK 113 Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGIL 212 W ++ +G DS+ + RT + +P+ G H + M I+ DEASG D I IL Sbjct: 114 WTKTKIYM-VG-DSERWFATARTAT--KPENMQGFHEDH-MLIVVDEASGVSDPIMEAIL 168 Query: 213 GFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR 272 G L+ + +M NP + G FY+ N D ++ ++ + + + E I+ + Sbjct: 169 GTLS--GFDNKLLMCGNPNNIEGVFYDSHNSDRDKYRVHKVSSYDSKRTNKDNIEMILKK 226 Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE---PCPDPYAPLIMGCDIAEEGG 329 YG +SDV RV + G+FP+ +DSFI L +E A ++ + +G D+A G Sbjct: 227 YGKESDVARVRIFGEFPKGALDSFISLETVELATEKQISDSLVNKTTVAHIGVDVARYGD 286 Query: 330 DNTVVVLRRGPVIEHLFDWSK-TDLRTTN---NKISGLVEKY-RPDAIIIDANNT--GAR 382 D+T++ R +SK + + TT N L+ +Y D ++I ++T G Sbjct: 287 DSTILFPRIATRALEYEKYSKRSTMETTGYVINMAKNLMSQYPSIDKVMIKVDDTGVGGG 346 Query: 383 TCDYLEML---GYHVYRVLGQKRAVDLE--FCRNRRTELHVKMADWLE------------ 425 D LE L ++ + V G E F N T+L + + LE Sbjct: 347 VTDRLEELIEDKHYPFEVFGVNNGSTSEDDFYDNLGTQLWGNIKEMLEENMTANLNGEQP 406 Query: 426 FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L + S LI+ L S + F + + + +ESK + + S D +D L F E P Sbjct: 407 VIELPSDSSLIKEL-STRKFKMTSRSRIRLESKDDMKKRNIGSPDIADALALAFYEPP 463 >gi|289578588|ref|YP_003477215.1| hypothetical protein Thit_1395 [Thermoanaerobacter italicus Ab9] gi|289528301|gb|ADD02653.1| conserved hypothetical protein [Thermoanaerobacter italicus Ab9] Length = 460 Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 118/416 (28%), Positives = 183/416 (43%), Gaps = 45/416 (10%) Query: 81 AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140 A+ A G+GKT + AW+ LW + T VI A + Q++ LW E+ Sbjct: 49 AVRACHGVGKTKVAAWVALWFLYTHHNSKVITTAPTWHQVENLLWREIH----------- 97 Query: 141 FEMQSLSLHPA---PWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197 + H A P VL + + + ++ T ++P+ F G H + + I+ Sbjct: 98 ------AAHAASRIPLGGKVLQTQIELGEQWFALGLST---DKPERFQGFHAEHILLIV- 147 Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI---D 254 DEASG GFLT A ++ NP +LSG+FY F PL + + I D Sbjct: 148 DEASGVEQYTFDAAEGFLTSIGAK--LLLIGNPTQLSGEFYNAFRSPL--YHKIHISAFD 203 Query: 255 TRTVEG--------IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 + ++ + P + E ++G DS + V G+FP+Q D+ IPL IE A Sbjct: 204 SPNLKAGKIVRPYLVTPEWVEDKRLKWGEDSPLWYSRVLGEFPEQGNDTLIPLAWIEAAQ 263 Query: 307 NREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEK 366 R + P+ +G D+A G D TV++LRRG E ++ D K+ +K Sbjct: 264 QRWHMTEAGEPVEIGADVARYGTDTTVIMLRRGDKAEIVYQLRGQDTMEVTGKVIDAFKK 323 Query: 367 YRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF 426 + I ID GA D L+ GY V + + A D N+R E + + + + Sbjct: 324 TGANVIKIDVVGIGAGVVDRLKEQGYPVQGLNVGESATDKGRFVNKRAEWYWALRERFQE 383 Query: 427 ASLI--NHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 ++ L L SLK + + G + IESK R +G S D +D LM F+ Sbjct: 384 GTIAIPPDDELASQLASLK-YKFDSRGRIQIESKEELRRRGLPSPDKADALMLAFS 438 >gi|323486060|ref|ZP_08091391.1| hypothetical protein HMPREF9474_03142 [Clostridium symbiosum WAL-14163] gi|323400627|gb|EGA92994.1| hypothetical protein HMPREF9474_03142 [Clostridium symbiosum WAL-14163] Length = 476 Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust. Identities = 125/430 (29%), Positives = 188/430 (43%), Gaps = 58/430 (13%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138 K AI +G+G+GKT + A +LW + P ++ A ++ QL LW+EVSKW+S Sbjct: 52 KVAIKSGQGVGKTGMEAVALLWFLCCYPYPRIVATAPTKQQLHDVLWSEVSKWMS----- 106 Query: 139 HWFEMQSLSLHPAPWYSDVL-----HCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193 +P SD+L + + + K + + RT + +P+ G H M Sbjct: 107 -----------KSPLLSDILKWTKTYIYMVGNEKRWFAVARTAT--KPENMQGFHED-NM 152 Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 I DEASG D I ILG L+ AN +M NP R SG FY+ FN ++ + Sbjct: 153 LFIVDEASGVADPIMEAILGTLS--GANNKLLMCGNPTRTSGTFYDAFNVDRSIYRCHTV 210 Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE-PCP 312 + + + E +I +YG DS+V V V G+FP+Q+ D FI L+I+E + P Sbjct: 211 SSADSKRTNKQNIESLIRKYGKDSNVVLVRVFGEFPKQEDDVFIALSIVEHCCMLDLPDD 270 Query: 313 DPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE------- 365 P + G D+A G D TV+ G I + L TT KI L Sbjct: 271 VPIKRISFGVDVARYGSDETVIAKNVGGRITLPVSFRGQSLMTTVGKIVQLYRQAITEFP 330 Query: 366 KYRPDAII-IDANNTGARTCDYLEML-----------------GYHVYRVLGQKRAVDLE 407 +YR I ID G D LE + G LG + + Sbjct: 331 RYRGKIYINIDDCGLGGGVTDRLEEVKQEEKLTRMVIVPVNAAGKVPEETLGDGKQKACD 390 Query: 408 FCRNRRTEL--HVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKG 462 N T L VK A +E SL N + L+ + + + + + G++ +ESK + +G Sbjct: 391 IYDNMTTYLWGTVKDALMMEEVSLENDNELVAQF-TCRKYRLTSRGKMLLESKEEMKKRG 449 Query: 463 AKSTDYSDGL 472 S D +D + Sbjct: 450 IDSPDRADAV 459 >gi|319956916|ref|YP_004168179.1| hypothetical protein Nitsa_1177 [Nitratifractor salsuginis DSM 16511] gi|319419320|gb|ADV46430.1| hypothetical protein Nitsa_1177 [Nitratifractor salsuginis DSM 16511] Length = 462 Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 121/426 (28%), Positives = 197/426 (46%), Gaps = 39/426 (9%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138 K +I +G G GKTTL AW+VLW R + A + QL L E+ KW +P + Sbjct: 45 KISIRSGHGTGKTTLLAWIVLWWGLGREDAKIPMTAPTGHQLYDLLMPEIRKWREKMPVQ 104 Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198 + E++ V + + +++ + RT +++P+ G H T +A I D Sbjct: 105 YQNEVE------------VKTEKIDFANGNFA-VPRTARKDQPEALQGFHAT-NLAFIID 150 Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTV 258 EASG P VI G +T + IM +NP R G FY+ +K W+ FQ + Sbjct: 151 EASGIPQVIFEVAEGAMT--GESTLVIMAANPTRTEGYFYDSHHKNRWQWECFQFNAEES 208 Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL 318 E + + E +YG DSDV RV + G+FP+Q ++ L +++A RE D A Sbjct: 209 ENVSKEWIEEKKRQYGEDSDVYRVRIKGEFPRQSSNAVFSLQEVDDATTREIVDDSGAE- 267 Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY-----RPDAII 373 + G D+A+ G D +V+ R+G +H + + T + L+ +Y +P I Sbjct: 268 VWGLDVADFGDDKSVLAKRKG---KHFHEITARSGLTLPDLAGWLIYEYNQAKRKPAVIF 324 Query: 374 IDANNTGARTCDYLEMLGYH-VYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINH 432 +DA G+ G V V G A + E N+R E + + D LE + + Sbjct: 325 VDAIGIGSSLPAVCFEKGLDIVIGVKGSNSASNSEKYHNKRAEWYYNLKDLLEDGKIPDD 384 Query: 433 SGLIQNLKSLKSFIVPNTGELA-IESKRVKG--AKSTDYSDG-------LMYTFAENP-- 480 L+ L + K + + +TG++ +E K +K +S D +D ++Y EN Sbjct: 385 DELVGELMAQK-YQISSTGKIQLVEKKEIKKELGRSPDKADACALTCERMIYVEEENDDI 443 Query: 481 PRSDMD 486 P +DM+ Sbjct: 444 PEADME 449 >gi|160940775|ref|ZP_02088117.1| hypothetical protein CLOBOL_05669 [Clostridium bolteae ATCC BAA-613] gi|158436295|gb|EDP14062.1| hypothetical protein CLOBOL_05669 [Clostridium bolteae ATCC BAA-613] Length = 484 Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 89/265 (33%), Positives = 133/265 (50%), Gaps = 39/265 (14%) Query: 81 AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWL-------- 132 ++ +G GIGK+ + AW V+W M TRP + C A +E QL LWAE+SKW+ Sbjct: 44 SVRSGHGIGKSAVEAWSVIWYMCTRPFPKIPCTAPTEHQLMDVLWAEISKWMRNNPALRD 103 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 L+ K MQ HP W++ + RT + P+ G H + Sbjct: 104 DLIWTKEKLYMQG---HPEEWFA----------------VPRTATN--PEALQGFHAEHV 142 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + II DEASG D + +LG +T +A +M NP RL+G FY+ ++ + + Sbjct: 143 LYII-DEASGVSDKVFEPVLGAMTGEDAK--LLMMGNPTRLAGFFYDSHHRNREQYSAIH 199 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCP 312 +D R + + +F + II +G DSDV RV V GQFP+ DS I + EEA N + Sbjct: 200 VDGRDSQHVSRTFVQKIIDMFGEDSDVFRVRVAGQFPKSTPDSLIAMEWCEEAANLQ--- 256 Query: 313 DPYAP---LIMGCDIAEEGGDNTVV 334 YAP + +G D+A G D++ + Sbjct: 257 -VYAPGGQIDIGVDVARYGDDSSAL 280 >gi|266623290|ref|ZP_06116225.1| putative terminase B protein [Clostridium hathewayi DSM 13479] gi|288864932|gb|EFC97230.1| putative terminase B protein [Clostridium hathewayi DSM 13479] Length = 484 Score = 126 bits (317), Expect = 8e-27, Method: Compositional matrix adjust. Identities = 84/270 (31%), Positives = 137/270 (50%), Gaps = 31/270 (11%) Query: 81 AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKH- 139 ++ +G G+GK+ + +W V+W + TRP + C A ++ QL LWAE+SKWL P Sbjct: 44 SVRSGHGVGKSAVESWSVIWFLCTRPFPKIPCTAPTQHQLYDILWAEISKWLRNNPELKN 103 Query: 140 ---WFEMQS-LSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 W + + ++ +P W++ + RT + P+ G H + + I Sbjct: 104 DIIWTQQRVYMNGYPEEWFA----------------VPRTATN--PEALQGFHAEHVLYI 145 Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT 255 I DEASG D + +LG +T +A +M NP RLSG F++ +K ++ ID Sbjct: 146 I-DEASGVSDKVFEPVLGAMTGEDAK--LLMMGNPTRLSGFFFDSHHKSRSEYSAMHIDG 202 Query: 256 RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315 R + ++ F + II +G+DSDV RV V GQFP+ DS I ++ E A +P Sbjct: 203 RDSQHVNQKFVQKIINMFGMDSDVFRVRVAGQFPKSTPDSLIMMDWCEAATQLKP-ETVR 261 Query: 316 APLIMGCDIAEEGGDNTVVVLRRGPVIEHL 345 + +G D+A G D++ + PVI+ + Sbjct: 262 NRVDIGVDVARYGDDSSALY----PVIDKV 287 >gi|308069786|ref|YP_003871391.1| hypothetical protein PPE_03030 [Paenibacillus polymyxa E681] gi|305859065|gb|ADM70853.1| Conserved hypothetical protein [Paenibacillus polymyxa E681] Length = 452 Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 134/463 (28%), Positives = 202/463 (43%), Gaps = 72/463 (15%) Query: 51 PRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISV 110 P WQ + ++ NNP + ++ +G+G+GKT L A LW +S P V Sbjct: 6 PDDWQASTL-------MDLANNP-----RVSVRSGQGVGKTGLEAATALWFLSCFPYPKV 53 Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170 IC A + QL LWAE++KW S P + W ++ + + ++ Sbjct: 54 ICTAPTRQQLHDVLWAEINKWQSKSP---------VLKRILKWTKTKIYMK-NYEERWFA 103 Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230 T RT + +P+ G H Y M I DEASG D I ILG L+ N+ +M NP Sbjct: 104 T-ARTAT--KPENMQGLHEDY-MLFIVDEASGVADPIMEAILGTLSGE-FNKI-LMCGNP 157 Query: 231 RRLSGKFYEIFNKPLDDWKRFQIDTRTVEGID-PSFHEGIIA----RYGLDSDVTRVEVC 285 + SG FY+ NK D+K TR V +D P + IA +YG SDV RV V Sbjct: 158 TKTSGVFYDSHNKDRADYK-----TRKVSCLDSPRTSKDNIAMLKRKYGEGSDVWRVRVE 212 Query: 286 GQFPQQDIDSFIPLNIIEEA---LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVI 342 G+FP+ D+FI L + E A + EP D L +G D+A G D T + GP I Sbjct: 213 GEFPRGGSDTFISLEVAEFAAKEVKLEPTGD---MLTIGVDVARFGDDETSMFAGIGPRI 269 Query: 343 EHLFDWSKTDLRTTNNKISGLVEKYRPD-------AIIIDANNTGARTCDYL------EM 389 K T + L ++ + I +D + G D L E Sbjct: 270 VGEHHHFKKGTMVTAGWVINLAKELQVAHPYLNRIRIRVDDSGVGGGVTDRLSEIVAEEG 329 Query: 390 LGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE--FASLINHSGLIQNLK------- 440 L Y + + ++D E N TE+ + + LE ++ +N I L Sbjct: 330 LPYEIIPINNGSSSLD-EHYGNLVTEMWASIKEQLEQNMSNFMNGDSSILQLPDDDVLIT 388 Query: 441 --SLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAE 478 + + + + + G++ +ESK + +G KS D +D + TF E Sbjct: 389 QLTARKWNMTSKGKMLLESKKDMKKRGLKSPDRADAFVLTFGE 431 >gi|255282256|ref|ZP_05346811.1| conserved hypothetical protein [Bryantella formatexigens DSM 14469] gi|255267204|gb|EET60409.1| conserved hypothetical protein [Bryantella formatexigens DSM 14469] Length = 506 Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 81/257 (31%), Positives = 124/257 (48%), Gaps = 19/257 (7%) Query: 81 AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140 A+ +G+G+GKT + A VLW +S V+ A + QL LW+E++KW P Sbjct: 68 AVKSGQGVGKTGIEAVAVLWFLSCFRYARVVATAPTRQQLHDVLWSEIAKWQERSP---- 123 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 L W ++ G + K + + RT + +P+ G H M I DEA Sbjct: 124 -----LLKAILRWTKTYVYVK-GYE-KRWFAVARTAT--KPENMQGFHED-NMLFIVDEA 173 Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEG 260 SG D I +LG L+ N +M NP R +G FY+ F K + + + Sbjct: 174 SGVADPIMEAVLGTLS--GGNNKLLMCGNPTRTTGTFYDAFTKDRSIFACHTVSSLDSSR 231 Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE---PCPDPYAP 317 D + + +I +YG DS++ RV V G FP+QD D FI +I++ +R+ P A Sbjct: 232 TDKNNIDALIRKYGEDSNLVRVRVKGLFPKQDDDVFISQELIDQCTSRQYELPESRGMAQ 291 Query: 318 LIMGCDIAEEGGDNTVV 334 +I+G D+A G D TV+ Sbjct: 292 VILGVDVARYGNDETVI 308 >gi|307308936|ref|ZP_07588619.1| hypothetical protein SinmeBDRAFT_4503 [Sinorhizobium meliloti BL225C] gi|306900570|gb|EFN31183.1| hypothetical protein SinmeBDRAFT_4503 [Sinorhizobium meliloti BL225C] Length = 472 Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust. Identities = 114/427 (26%), Positives = 188/427 (44%), Gaps = 46/427 (10%) Query: 76 EVFKG----AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKW 131 E FK + G GKT ++A + W + + V A SE+ +K+ +W E Sbjct: 42 EAFKNNQTITVKGSSGWGKTFISAISLWWSLIVFDPVKVTIFAPSESTIKSGIWNE---- 97 Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSL-GIDSKHYSTMC----RTYSEERPDTFVG 186 +Q L + AP + ++ S I K C R S++ G Sbjct: 98 -----------LQVLYSNMAPLFRELFEVSATKIFRKSRGETCWAEYRLVSKDNIAAARG 146 Query: 187 HHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP-- 244 H+ + +I DEASG DVI G L + ++ SNP + SG F++ + P Sbjct: 147 FHSKNNI-VIADEASGIEDVIFTGALLNVLNDGPGAKVVLVSNPDKASGFFFKTWRDPEL 205 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303 DW + R P E YG + S V G+FP D+D I ++ Sbjct: 206 SKDWIKVHGSIRDKPNYTPGEEERFARLYGGVTSRDYLTLVEGEFPLSDVDGLISREFLD 265 Query: 304 EAL-NREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 EA+ N++ P+P AP+I G D A G D +V+ +R V+ +W+ + ++ Sbjct: 266 EAVTNKDAIPNPKAPIIWGLDPAGAGKDKSVLAIRHDNVLRGFEEWAGLEPVALALRVKE 325 Query: 363 LV----EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQ---KRAVDLEFCRNRRTE 415 L +K RP I +D N GA D L+ VY+ + KR D + R R + Sbjct: 326 LYLKTSKKDRPAVIAVDGNGLGAGVYDALKHFKIPVYKCMFAEVPKRNPD-RYTR-VRDQ 383 Query: 416 LHVKMADWLEFA--SLINHSGLIQNLKSLKSFIVPNTGELAIESKRV---KGAKSTDYSD 470 + +M +W+ S+ NH LI++L ++ ++ ++ ++ IE K+ + +S DY+D Sbjct: 384 IWFEMREWIHTGDVSIPNHKKLIEDL-AIPTY--EDSPKIKIEDKKSLKKRLGRSPDYAD 440 Query: 471 GLMYTFA 477 L TF+ Sbjct: 441 ALALTFS 447 >gi|253578914|ref|ZP_04856185.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA] gi|251849857|gb|EES77816.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39BFAA] Length = 473 Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust. Identities = 82/264 (31%), Positives = 134/264 (50%), Gaps = 22/264 (8%) Query: 74 NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133 NP+V +I +G+G+GKT L A + LW ++ P ++ A ++ QL LW+E+SKW+S Sbjct: 32 NPKV---SIKSGQGVGKTGLEAAVFLWFVTCFPHPRIVATAPTKQQLHDVLWSEISKWMS 88 Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193 E+ S+ L Y ++ + K + + RT + +P+ G H M Sbjct: 89 K------SELLSILLKWTKTYVYMVG-----EEKRWFGVARTAT--KPENMQGFHED-NM 134 Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 I DEASG D I ILG L+ AN ++ NP + SG FY+ + +K + Sbjct: 135 LFIVDEASGVADPIMEAILGTLS--GANNKLLLCGNPTKTSGTFYDSHTRDRALYKCHTV 192 Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR---EP 310 + + + ++ +YG DS+V RV V G+FP Q+ D FIPL++IE+ ++ Sbjct: 193 SSMDSTRTNKENIDSLVRKYGWDSNVVRVRVRGEFPNQEDDVFIPLSLIEQCSSKLLELD 252 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVV 334 D + +G D+A G D T++ Sbjct: 253 DADGMQFVSLGVDVARFGDDETII 276 >gi|167767949|ref|ZP_02440002.1| hypothetical protein CLOSS21_02492 [Clostridium sp. SS2/1] gi|167710278|gb|EDS20857.1| hypothetical protein CLOSS21_02492 [Clostridium sp. SS2/1] gi|291560988|emb|CBL39788.1| hypothetical protein CL2_30180 [butyrate-producing bacterium SSC/2] Length = 473 Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust. Identities = 95/319 (29%), Positives = 141/319 (44%), Gaps = 24/319 (7%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138 K I +G+G+GKT A +LW +S V+ A + QL LWAEVSKW S P Sbjct: 49 KVTIKSGQGVGKTGFEAATLLWFLSCFENARVVATAPTLHQLNDVLWAEVSKWQSKSP-- 106 Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198 L ++ +G + Y+ + RT + P+ G H M I D Sbjct: 107 --------LLKEILQWTKTKISMIGSKERWYA-VARTAT--TPENMQGFHED-NMLFIVD 154 Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTV 258 EASG D I ILG LT +N ++ NP + SG FY+ + +++ Sbjct: 155 EASGVADPIMEAILGTLT--GSNNKLLLCGNPTKASGTFYDSHTSDRKLYYCITVNSAES 212 Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL 318 + + + +I +YG +S+V RV V G FP+QD D ++PL ++E ++ E P P Sbjct: 213 KRTNKDNIDSLIRKYGEESNVVRVRVKGLFPKQDDDVYMPLEMLEASIILEEIP-PADIC 271 Query: 319 IMGCDIAEEGGDNTVVVLRRG-----PVIEHLFDWSKT--DLRTTNNKISGLVEKYRPDA 371 +G D+A G D+TV+ I H D KT D+ I + + Sbjct: 272 TLGVDVARFGDDDTVIARNMNNKITLEKIRHGQDLMKTVGDVVVECRNIKEKFKYKKTIY 331 Query: 372 IIIDANNTGARTCDYLEML 390 +IID G D L L Sbjct: 332 VIIDDTGLGGGVTDRLNEL 350 >gi|332980681|ref|YP_004462122.1| hypothetical protein Mahau_0077 [Mahella australiensis 50-1 BON] gi|332698359|gb|AEE95300.1| hypothetical protein Mahau_0077 [Mahella australiensis 50-1 BON] Length = 486 Score = 106 bits (265), Expect = 8e-21, Method: Compositional matrix adjust. Identities = 113/442 (25%), Positives = 178/442 (40%), Gaps = 64/442 (14%) Query: 79 KGAISAGRGIGKTTLNAWLVLW-LMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137 + A+ + G GK+ + ++LW L S P I V+ A + Q++ +W EV Sbjct: 46 RTAVRSCHGAGKSFIAGQVILWFLYSFYPSI-VLSTAPTWRQVEKLIWKEVRA------- 97 Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197 S P ++L I S PD F G H + ++ Sbjct: 98 -------SYRRSKVPLGGNLLPKRPEIQIIQDEWYAVGLSTNEPDRFQGFHEE-NILVVV 149 Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRT 257 DEA+G P+ I I G LT +A ++ NP + G FY F P W+ I T Sbjct: 150 DEAAGVPEEIFEAIEGVLTSEHAR--LLLLGNPTSVGGTFYNAFRTP--GWENISISAFT 205 Query: 258 VEG-----------------------------IDPSFHEGIIARYGLDSDVTRVEVCGQF 288 I P++ R+G +S + V GQF Sbjct: 206 TPNFTAFGITEDDIINKTWESKITNSLPNPKLITPAWVADKYRRWGPNSPAYQARVLGQF 265 Query: 289 PQQDIDSFIPLNIIEEALNR-EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347 P + D+ IPL IE A+ R E P+ P+ +G D+A G D TV+ RRG + L Sbjct: 266 PSEGEDTLIPLAWIEAAMARWEDTPE-GEPIEIGVDVARFGSDKTVIAARRGQKVLPLNV 324 Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLE 407 ++K D T I + K +D GA D L+ G+ V + + A D E Sbjct: 325 YAKQDTMETVGCIIMVHRKIGASKTKVDVIGVGAGVVDRLKEQGHPVIGINVAEAATDTE 384 Query: 408 FCRNRRTELHVKMADWLEFASLIN--------HSGLIQNLKSLKSFIVPNTGELAIESK- 458 N R+EL M + L+ +N L+ +L +K + + + G + +ESK Sbjct: 385 KFANLRSELWWNMRELLDPNQRLNPEPIALPPDDELLADLSGVK-YKIDSRGRIQVESKE 443 Query: 459 --RVKGAKSTDYSDGLMYTFAE 478 + + +S D +D ++ FA+ Sbjct: 444 DMKKRLGRSPDRADAVVLAFAK 465 >gi|315122636|ref|YP_004063125.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496038|gb|ADR52637.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 301 Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 61/170 (35%), Positives = 90/170 (52%), Gaps = 8/170 (4%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQ----L 56 M+ N E + L + S I + F + + WGE+GTPL PR+WQ L Sbjct: 1 MNATFQPNIEYDTALLQNVLSPAIAGNPLAFTKYMYRWGEEGTPLANCKGPRAWQTEVFL 60 Query: 57 EFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANS 116 E E ++ + +VFK AI++ RGIGKT L AW+ W +STR G +V+ ANS Sbjct: 61 ELAEFIEKNKEAKRLGKPLQVFKLAIASARGIGKTALVAWITYWFLSTRIGCTVVISANS 120 Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQS----LSLHPAPWYSDVLHCSL 162 + Q KTT +AE+ +W SL N H+FE L+ +PW ++ + +L Sbjct: 121 DDQCKTTSFAEIRRWHSLAKNAHFFEANIAEALLAGGCSPWQAEPVAKTL 170 >gi|83593922|ref|YP_427674.1| hypothetical protein Rru_A2590 [Rhodospirillum rubrum ATCC 11170] gi|83576836|gb|ABC23387.1| hypothetical protein Rru_A2590 [Rhodospirillum rubrum ATCC 11170] Length = 505 Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 121/468 (25%), Positives = 183/468 (39%), Gaps = 72/468 (15%) Query: 75 PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSL 134 P K + AG G+GKTT A + W + C A + +QL+ LW+E+++ Sbjct: 34 PAGAKVTVRAGHGVGKTTATAAAIWWHLECFDYSKTPCTAPTASQLEQILWSELAR---- 89 Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGI------DSKHYSTMCRTYSEERPDTFVGHH 188 L + Q L PA + L G + + + RT ++PD G H Sbjct: 90 LRRRADARAQGTGL-PAALRLEALFAVSGRAIADRGTPREWFVVARTARRDQPDALQGFH 148 Query: 189 ----------------NTYGMAI--INDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230 + G A+ + +EASG PD + G L+ A +M NP Sbjct: 149 ASDIDLEAGAGPRLSAKSGGAALMFVIEEASGVPDAVFEVAEGALSSPGAR--LLMVGNP 206 Query: 231 RRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290 R +G F + + ++ +DP + G++ +YG +S+V RV G FP+ Sbjct: 207 TRNTGFFARSHKRDRASFTALRLRCADSPLVDPGYRAGLVRKYGAESNVVRVRADGAFPR 266 Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAP---LIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347 QD D I L E AL R P P A +G D+A G D TV +LR GPV+ + Sbjct: 267 QDDDVLIALETAEAALAR-PLPARMATEDERRLGVDVARFGDDRTVFLLRIGPVVGAIEV 325 Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG----YHVYRVLGQKRA 403 + D + L E +R I +D GA D L G +RA Sbjct: 326 TAGRDTMAVAGRARRLAEIWRAGRIYVDEIGVGAGVVDRLREDGAPVVAVNVAASAPERA 385 Query: 404 VDLEFCRNRRTELHVKMADWLE-----------------FASLINHSG----------LI 436 E R R L + + WL A L++ G L Sbjct: 386 AGEERGRLLRDHLWLMVRGWLRDEAPVFAGPGGGPASGSAAGLLSGMGSCLVPGVDADLA 445 Query: 437 QNLK---SLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAE 478 Q+L + + +G + +ESK + +G +S D +D L TF E Sbjct: 446 QDLAGELATPRYAFDGSGRVVVESKDAMKRRGLRSPDLADALALTFHE 493 >gi|262316909|emb|CBA18135.1| putative terminase B [Paenibacillus phage phiBP] Length = 248 Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 64/208 (30%), Positives = 96/208 (46%), Gaps = 16/208 (7%) Query: 81 AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140 ++ +G+G+GKT L A + LW + P V+C A + QL LWAE+SKW S P Sbjct: 57 SVRSGQGVGKTALEAAISLWFLCCFPFPRVVCTAPTRQQLNDVLWAEISKWQSQSP---- 112 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 + W ++ + + ++T RT + +P+ G H Y M I DEA Sbjct: 113 -----ILKRILKWTKTKIYMK-NYEERWFAT-ARTAT--KPENMQGFHEDY-MLFIVDEA 162 Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEG 260 SG D I I G L+ F M NP + SG F++ N+ ++ ++ Sbjct: 163 SGVDDRIMAAIFGTLSGDYNKLF--MCGNPTKTSGFFFDSHNRDRAIYRTHRVSCLDSPR 220 Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQF 288 E + A+YG SDV RV V G+F Sbjct: 221 TSKENIEMLKAKYGEGSDVWRVRVLGEF 248 >gi|48697461|ref|YP_024846.1| Pas60 [Actinoplanes phage phiAsp2] gi|47679679|gb|AAT36808.1| Pas60 [Actinoplanes phage phiAsp2] Length = 492 Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust. Identities = 92/361 (25%), Positives = 149/361 (41%), Gaps = 37/361 (10%) Query: 50 APRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGIS 109 +P +W + ++V A + + P + A+ G+GK+ A LV W +TR + Sbjct: 22 SPTAWAADCLDVRLAGYQGEILDAVPRERRVAVRGPHGLGKSFSGAILVNWFATTRDLMG 81 Query: 110 ----VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGID 165 +I A++ L+ LW E+ KW + ++L AP+ L + Sbjct: 82 KDWKIITTASAWRHLEVYLWPEIHKWAG--------RINFVALGRAPYNPRTELLDLRLK 133 Query: 166 SKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA----N 221 H + + +P+ G H + ++ DEA P I G + N Sbjct: 134 LTHGAATA--VASNQPERIEGAHAEELLYLL-DEAKIVPPATWDSIEGAFSNAGVDVADN 190 Query: 222 RFWIMTSNPRRLSGKFYEIFNKP--LDDW--KRFQIDTRTVEG-IDPSFHEGIIARYGLD 276 + S P SG+FY+I + +DW + ++ G I ++ + +++G D Sbjct: 191 AYAFAMSTPGAPSGRFYDIHRRAPGYEDWWTRHVTLEEAIASGRISRAWADQRRSQWGSD 250 Query: 277 SDVTRVEVCGQFPQQDIDSFIPLNIIEEAL------NREPCPDPYAPLIMGCDIAEEGGD 330 S V V G+F D DS IPL +E A+ +R+ P P PL G D+ GGD Sbjct: 251 SAVFHNRVLGEFHASDEDSVIPLAWLEAAIERWHEWDRQGRPSPGGPLWTGVDVG-RGGD 309 Query: 331 NTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390 TV+ R G + +T+ R GL++ R IID GA D L L Sbjct: 310 ETVLAARDGWAVT-----LETNRRRDTMATVGLIQA-REGRAIIDVIGLGAGVFDRLREL 363 Query: 391 G 391 G Sbjct: 364 G 364 >gi|228924410|ref|ZP_04087639.1| hypothetical protein bthur0011_53510 [Bacillus thuringiensis serovar huazhongensis BGSC 4BD1] gi|228835241|gb|EEM80653.1| hypothetical protein bthur0011_53510 [Bacillus thuringiensis serovar huazhongensis BGSC 4BD1] Length = 293 Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust. Identities = 79/280 (28%), Positives = 125/280 (44%), Gaps = 32/280 (11%) Query: 226 MTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC 285 + NP R SG FY+ N+ D +K ++ + E + +YG SDV RV V Sbjct: 3 LCGNPTRTSGVFYDSHNRDRDLYKIHKVSSLDSPRTSKDNIEVLKKKYGEGSDVWRVRVL 62 Query: 286 GQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHL 345 G+FP+ + D+FIPL I+E+A + + P L +G D+A G D TV+ R G + L Sbjct: 63 GEFPKAEADAFIPLEIVEQAASCKVEPTGET-LDLGVDVARFGDDETVIAPRIGNKVFKL 121 Query: 346 FDWSKTDLRTTNNKISGLVEKY--------RPDAIIIDANNTGARTCDYL------EMLG 391 + K D T + L ++Y R D I +D + G D L E L Sbjct: 122 LNHYKQDTMETAGHVLKLAKEYMAKYKQLKRVD-IKVDDSGVGGGVTDRLKEVIKSERLP 180 Query: 392 YHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE------------FASLINHSGLIQNL 439 + VY V+ + +D E N E + D LE + N +I Sbjct: 181 FKVYPVVNNGKPLDDEHYDNAGAEGWAVVRDLLEENMKAFIQGEEPTMEIPNDEKMISQF 240 Query: 440 KSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTF 476 S K + + + G++A+E K + +G +S D +D ++ F Sbjct: 241 SSRK-YRITSRGKIALERKEEMKKRGLQSPDRADAIVLAF 279 >gi|292670767|ref|ZP_06604193.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] gi|292647388|gb|EFF65360.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] Length = 442 Score = 84.0 bits (206), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 58/202 (28%), Positives = 94/202 (46%), Gaps = 5/202 (2%) Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY--APLIMGCDIAEEGGDNTVVVLRR 338 R E+ F D IP++++ A NR D P+I+G D+A G D TV+ +R+ Sbjct: 214 RQELLCDFTASASDVVIPIDLVTAAANRLLKDDDVLGQPVILGVDVARFGDDRTVLCVRQ 273 Query: 339 GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVL 398 G ++ + ++ T +++ + ++ P A IDA GA D L L Y V V Sbjct: 274 GLWLKEVRTFTGLSTMETASRVIDCINQHHPHATFIDAGAMGAGVIDRLRQLRYQVSEVN 333 Query: 399 GQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK 458 + A+D N R E++ K WLE I + ++ S + TG + +E K Sbjct: 334 FGEMAMDAARYANIRAEMYFKCRAWLEAGGAIPQNAELKTELSTVEYKFNPTGRIILEPK 393 Query: 459 ---RVKGAKSTDYSDGLMYTFA 477 + + KS D +DG + TFA Sbjct: 394 DKLKERTGKSPDLADGFVLTFA 415 >gi|315649222|ref|ZP_07902312.1| hypothetical protein PVOR_28644 [Paenibacillus vortex V453] gi|315275441|gb|EFU38799.1| hypothetical protein PVOR_28644 [Paenibacillus vortex V453] Length = 189 Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 66/223 (29%), Positives = 99/223 (44%), Gaps = 45/223 (20%) Query: 15 LFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPN 74 L DL W D + +F+ ++ F P WQ + M ++ P Sbjct: 11 LLDLYWDDPV--AFAEDMMGF--------------DPDDWQCDVM-------MDVTQFP- 46 Query: 75 PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSL 134 + ++ +G+G+GKT L A LV+W + RP V+C A ++ QL LW EVSKWL Sbjct: 47 ----RTSVRSGQGVGKTGLEAALVIWFLCCRPNPKVVCTAPTKQQLHDVLWTEVSKWLE- 101 Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 S+ + W ++ +G + + ++T + +P+ G H Y M Sbjct: 102 ---------NSMVKNLLKWTKTKVY-MIGHEQRWFAT---ARTANKPENMQGFHEDY-ML 147 Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 I DEASG D I ILG L+ A +M NP R SG F Sbjct: 148 FIVDEASGVSDPIMEAILGTLS--GAENKLLMCGNPTRTSGVF 188 >gi|257459276|ref|ZP_05624390.1| phosphatase, Ppx/GppA family [Campylobacter gracilis RM3268] gi|257443289|gb|EEV18418.1| phosphatase, Ppx/GppA family [Campylobacter gracilis RM3268] Length = 431 Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 66/256 (25%), Positives = 113/256 (44%), Gaps = 10/256 (3%) Query: 236 KFYEIFNKPLDD---WKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTRVEVCGQFPQQ 291 KF+++ + + + W+ FQ + + + ++A G DSDV R E+ G+F Sbjct: 161 KFFDLAQRGMRNEKGWRNFQFSSYDNPLLQKEEIDRLVAELGGADSDVARQEIFGEFLDT 220 Query: 292 DIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT 351 +S L IE A ++ D AP+I D+A EG D +V+ R+G +E L + Sbjct: 221 TSNSVFSLAAIEAAFRKQRYFDAGAPVIWALDVAREGDDESVLCKRQGDSVEPLKPYRIA 280 Query: 352 DLRTTNNKISGLVEK--YRPDAIIIDANNTGARTCDYLEMLGYH--VYRVLGQKRAVDLE 407 +I G E+ +P AI ID GA D L LG V G +A D Sbjct: 281 STSELAREIYGEYERTDLKPHAIYIDTIGVGAGVFDTLCDLGLRGIVREAKGSFKASDER 340 Query: 408 FCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKG--AKS 465 N+R E++ + + L ++ L + L+++ + L + + +K +S Sbjct: 341 KYANKRAEMYFNLREKLPLLAIAPDEELKRQLQTIAFYFDKKERYLLMPKEGIKKEYGRS 400 Query: 466 TDYSDGLMYTFAENPP 481 D +D L +F + P Sbjct: 401 PDRADALAMSFFDLCP 416 >gi|226940459|ref|YP_002795533.1| Terminase large subunit [Laribacter hongkongensis HLHK9] gi|226715386|gb|ACO74524.1| Terminase large subunit [Laribacter hongkongensis HLHK9] Length = 272 Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 69/243 (28%), Positives = 103/243 (42%), Gaps = 8/243 (3%) Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 W QID+RTVEG + YG +SD +V V G FP FI + A Sbjct: 14 WVARQIDSRTVEGTNKEQIAKWAEDYGEESDFFKVRVRGMFPSMSARQFISETDVSAAYG 73 Query: 308 REPCPD--PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365 R P+ YAP I+ D A EG D V+ LR+G L +K D ++ E Sbjct: 74 RALRPEQYQYAPKILTVDPAWEGDDEFVIGLRQGLSFRVLHTMAKNDNDLVAAQVIARYE 133 Query: 366 KYR-PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL 424 DA+ +DA G + +G V ++D C N+R E+ DWL Sbjct: 134 DEEGADAVFVDA-GFGTGIVSAGKSMGRDWTLVWFAGNSMDAG-CLNKRAEMWRDARDWL 191 Query: 425 EFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPP 481 + I ++++ + G++ IESK + +G S + +D L+ +FA Sbjct: 192 KSGGAIPDDPVLRDELQAPEIVPRLDGKIQIESKKEMKARGVPSPNRADALILSFAYPVT 251 Query: 482 RSD 484 R D Sbjct: 252 RRD 254 >gi|154175204|ref|YP_001409090.1| Ppx/GppA family phosphatase [Campylobacter curvus 525.92] gi|112803006|gb|EAU00350.1| phosphatase, Ppx/GppA family [Campylobacter curvus 525.92] Length = 433 Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 70/258 (27%), Positives = 121/258 (46%), Gaps = 24/258 (9%) Query: 236 KFYEIFNKPL---DDWKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTRVEVCGQFPQQ 291 +F+++ ++ + DW FQI + + + +IA G +DSDV + E+ G+F Sbjct: 161 RFFDLASRGMRNEKDWVNFQISSFENPLLRKEEIDELIAELGGVDSDVVKQEIYGEFLDT 220 Query: 292 DIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--- 348 ++ PL+ IE A + +P A I G D+A +G D +V+ +R G +++L + Sbjct: 221 TTNALFPLSQIEAAFGKVRAYEPNAVQIWGLDVARDGDDESVLCVREGYHVKNLEGFRIA 280 Query: 349 SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EM-LGYHVYRVLGQKRAVDL 406 S T+L + + EK +P+AI ID+ GA T D L E LG +A + Sbjct: 281 STTELAREIYRRYEMSEK-KPEAIFIDSVGVGAGTFDRLCEFGLGAICREAKASYKATNE 339 Query: 407 EFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSL--------KSFIVPNTGELAIESK 458 N+R E++ + + ++ H L + L+ + + I+P E K Sbjct: 340 AKFANKRAEMYFALKEKFHLLTMNAHEKLKKQLQMIEFQYDRKERYLILPKD-----ELK 394 Query: 459 RVKGAKSTDYSDGLMYTF 476 + G S DY+D L TF Sbjct: 395 KEYGT-SPDYADALALTF 411 >gi|119386463|ref|YP_917518.1| PBSX family phage terminase large subunit [Paracoccus denitrificans PD1222] gi|119377058|gb|ABL71822.1| phage terminase, large subunit, PBSX family [Paracoccus denitrificans PD1222] Length = 441 Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 60/206 (29%), Positives = 92/206 (44%), Gaps = 19/206 (9%) Query: 286 GQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRG------ 339 G + + FI ++ EA+ R+P L++G D+A G D +V+ RRG Sbjct: 214 GDYEAESDMQFIGGGLVREAMARQPFSQIGDELVLGVDVARFGDDRSVIWARRGRDAQTE 273 Query: 340 -PVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRV- 397 P+I D ++ +++ PD + ID G D +GY V V Sbjct: 274 LPIIMK-----GADTMAVAARVMAEIDRLHPDGVFIDEGGVGGGVIDRCRQMGYSVVGVN 328 Query: 398 LGQK--RAVD-LEFCRNRRTELHVKMADWLEFASLINHS-GLIQNLKS-LKSFIVPNTGE 452 G K RA++ + CRN+R ++ M +WL I S L +L L SF V N E Sbjct: 329 FGGKADRAIEGVPKCRNKRAQMWATMREWLRSGGCIPDSRDLEMDLTGPLYSFDVNNAIE 388 Query: 453 LAIESK-RVKGAKSTDYSDGLMYTFA 477 + +S + +G S D +D L TFA Sbjct: 389 IEKKSDMKKRGVSSPDEADALALTFA 414 >gi|56266666|gb|AAV84947.1| DNA pacase B subunit [Enterobacteria phage D6] Length = 502 Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 87/345 (25%), Positives = 144/345 (41%), Gaps = 44/345 (12%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138 + +++G G GK++L A L+L M P VI +AN Q+KT ++ V ++ + + Sbjct: 56 RTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQYWANAVKR 115 Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198 H + L +Y GI + +C+ Y + G H + + +I D Sbjct: 116 HGWLQTYFVLSDTMFYE---RSRKGI----WEVLCKGYRLGNEEALAGEHAAH-LLLILD 167 Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF-------NKPLDDWKRF 251 EASG D + G LTE + NR +M S P R SG FY+ + P W Sbjct: 168 EASGISDKAIGVMTGALTEED-NRM-LMLSQPTRPSGYFYDSHHSQAKTPDNPKGIWTAI 225 Query: 252 QIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 +++ + P F + + Y G DS V+V GQFP++ + + + A R+ Sbjct: 226 VLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECDRAARRKV 285 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI---SGLV--- 364 + + D+ G D +V+ + + V H + R N K+ SG + Sbjct: 286 LLEKNWGWVATADVG-NGRDKSVLNICK--VSGH-----RDKRRVVNFKVMEMSGTMDPL 337 Query: 365 ------------EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRV 397 EKY I +DA+ G+ TC L G + R+ Sbjct: 338 AFADFIYNECTPEKYPNITIAVDADGFGSDTCAQLVRRGANPVRI 382 >gi|303257560|ref|ZP_07343572.1| putative terminase B protein [Burkholderiales bacterium 1_1_47] gi|302859530|gb|EFL82609.1| putative terminase B protein [Burkholderiales bacterium 1_1_47] Length = 330 Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 59/202 (29%), Positives = 90/202 (44%), Gaps = 6/202 (2%) Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY--APLIMGCDIAEEGGDNTVVVLRR 338 R E F + IP++ I A N+ Y APLI G D+A G D +V+ RR Sbjct: 95 RQEFLCDFSAAQDNGLIPIDDIRAAANKFYRESEYMGAPLIYGIDVARFGSDASVIFKRR 154 Query: 339 GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVL 398 G V K D ++I+ + K +PDA+ ID + G D L + + V V Sbjct: 155 GLVAFEPIVIRKFDNMALADRIAVEMAKEKPDAVFID-SGAGQGVIDRLRQMRFDVVEVP 213 Query: 399 GQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK 458 +A+D E NRR E+ MA W++ I ++Q ++ G +E+K Sbjct: 214 FGAQAIDKEQFANRRMEMWWHMAQWIKQGGAIPPDPVLQGDLGAPTYGYTPKGPKILEAK 273 Query: 459 ---RVKGAKSTDYSDGLMYTFA 477 + + +S D +D L TFA Sbjct: 274 DKLKERIGRSPDLADALALTFA 295 >gi|216906085|ref|YP_002333619.1| terminase [Abalone shriveling syndrome-associated virus] gi|216263178|gb|ACJ72002.1| terminase [Abalone shriveling syndrome-associated virus] Length = 507 Score = 73.9 bits (180), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 109/450 (24%), Positives = 182/450 (40%), Gaps = 46/450 (10%) Query: 54 WQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICL 113 WQLE ++ + A ++ V A+S G G GKT L+ L +W PG L Sbjct: 51 WQLEIVDYI-AKFFRKNSDEKHFVCAIAVSGGNGTGKTKLSKALNIWRFCCHPGSRQFIL 109 Query: 114 ANSETQLK----TTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169 NSE Q K T L +SK LS + ++S + + +P +D D Sbjct: 110 TNSERQTKRTGFTMLVRRISKLLSCIA-----ALESSAYYYSPAVADKPEVRTN-DMWDV 163 Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229 + + ++ +E G H+ M DE++ D + + T+ F T N Sbjct: 164 TYLLQSSTEA---ALSGLHHPM-MTFSFDESTYFNDHVWQALENMWTQGQVLCF--CTGN 217 Query: 230 PRRLSGKFY-EIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR-------YGLDSDVTR 281 P + ++ +FNK L + TR V ++ AR YG Sbjct: 218 PSHDNNNYFARLFNKSLHKKDSLWL-TRCVSLLELPLKYRNDARARYIEEHYGKTHPRYI 276 Query: 282 VEVCGQFPQQDIDSFIPLNIIEEALNREPCPD-PYAPLIMGCD--IAEEGGDNTVVVLRR 338 V GQFP+++ + + I EA+ RE + + P+IMG D I+ G + + +R Sbjct: 277 ASVLGQFPKKNTCNPFDITAISEAMEREVREEFIHHPVIMGIDVSISANNGSASAICVRE 336 Query: 339 GPVIEHLFDW--SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-----EMLG 391 G + L ++ T+ R K+ L+++ +P +++DAN G + L E Sbjct: 337 GTAVRVLREYRCHYTEFRI---KLLELLQEIKPTIVVVDANGVGFGLYEELHRTLPETSN 393 Query: 392 YHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL--EFASLINHSGLIQNLKSLKSFIVPN 449 VY V A ++ +EL K ++W E S+ + + L SL Sbjct: 394 VRVYGVRAHAEAFLKSEYADKMSELAKKSSEWFNNELVSIPKNYQFLNALTSLS--FADA 451 Query: 450 TGELAIESKRVKGAK---STDYSDGLMYTF 476 +G++ + K K S D +D TF Sbjct: 452 SGKIKLIGKTDAKKKVDLSMDMADAFFLTF 481 >gi|323179619|gb|EFZ65182.1| terminase B protein [Escherichia coli 1180] Length = 453 Score = 73.6 bits (179), Expect = 7e-11, Method: Compositional matrix adjust. Identities = 85/345 (24%), Positives = 142/345 (41%), Gaps = 44/345 (12%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138 + +++G G GK++L A L+L M P VI +AN Q+KT ++ V ++ + + Sbjct: 7 RTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQYWANAVKR 66 Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198 H + L +Y GI + +C+ Y + G H + + +I D Sbjct: 67 HGWLQTYFVLSDTMFYE---RSRKGI----WEVLCKGYRLGNEEALAGEHAAH-LLLILD 118 Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF-------NKPLDDWKRF 251 EASG D + G LTE + NR +M S P R SG FY+ + P W Sbjct: 119 EASGISDKAIGVMTGALTEED-NRM-LMLSQPTRPSGYFYDSHHSQAKTPDNPKGIWTAI 176 Query: 252 QIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 +++ + P F + + Y G DS V+V GQFP++ + + + A R+ Sbjct: 177 VLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECDRAARRKV 236 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL------- 363 + + D+ G D +V+ + + V H + R N K+ + Sbjct: 237 LLEKNWGWVATADVG-NGRDKSVLNICK--VSGH-----RDKRRVVNFKVMEMPGTMDPL 288 Query: 364 -----------VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRV 397 EKY I +DA+ G+ TC L G + R+ Sbjct: 289 AFADFIYNECTPEKYPNITIAVDADGFGSDTCAQLVRRGANPVRI 333 >gi|323948959|gb|EGB44853.1| terminase B protein [Escherichia coli H252] Length = 502 Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust. Identities = 62/221 (28%), Positives = 100/221 (45%), Gaps = 18/221 (8%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138 + +++G G GK++L A L+L M P VI +AN Q+KT ++ V ++ + + Sbjct: 56 RTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQYWANAVKR 115 Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198 H + L +Y GI + +C+ Y + G H + + +I D Sbjct: 116 HGWLQTYFVLSDTMFYE---RSRKGI----WEVLCKGYRLGNEEALAGEHAAH-LLLILD 167 Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF-------NKPLDDWKRF 251 EASG D + G LTE + NR +M S P R SG FY+ + P W Sbjct: 168 EASGISDKAIGVMTGALTEED-NRM-LMLSQPTRPSGYFYDSHHSRAKTPDNPKGIWTAI 225 Query: 252 QIDTRTVEGIDPSF-HEGIIARYGLDSDVTRVEVCGQFPQQ 291 +++ + P F E ++ G DS V+V GQFP++ Sbjct: 226 VLNSEESPFVTPQFIKEKLLEYGGRDSIEYMVKVLGQFPRE 266 >gi|322656964|gb|EFY53248.1| DNA packaging protein [Salmonella enterica subsp. enterica serovar Montevideo str. CASC_09SCPH15965] Length = 411 Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 62/221 (28%), Positives = 100/221 (45%), Gaps = 18/221 (8%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138 + +++G G GK++L A L+L M P VI +AN Q+KT ++ V ++ + + Sbjct: 56 RTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQYWANAVKR 115 Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198 H + L +Y GI + +C+ Y + G H + + +I D Sbjct: 116 HGWLQTYFVLSDTMFYE---RSRKGI----WEVLCKGYRLGNEEALAGEHAAH-LLLILD 167 Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF-------NKPLDDWKRF 251 EASG D + G LTE + NR +M S P R SG FY+ + P W Sbjct: 168 EASGISDKAIGVMTGALTEED-NRM-LMLSQPTRPSGYFYDSHHSQAKTPDNPKGIWTAI 225 Query: 252 QIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQ 291 +++ + P F + + Y G DS V+V GQFP++ Sbjct: 226 VLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPRE 266 >gi|269119479|ref|YP_003307656.1| hypothetical protein Sterm_0853 [Sebaldella termitidis ATCC 33386] gi|268613357|gb|ACZ07725.1| hypothetical protein Sterm_0853 [Sebaldella termitidis ATCC 33386] Length = 499 Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 107/476 (22%), Positives = 189/476 (39%), Gaps = 81/476 (17%) Query: 58 FMEVVDAHCLNSVNNPNPEVF----KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICL 113 F ++++ H L+ + F + ++ AG GK++L L + + TRP VI Sbjct: 22 FKDILNFHFLSEDQTRVLQAFNEYRRLSVPAGHSTGKSSLAGGLTTYWLITRPKSRVIVT 81 Query: 114 ANSETQLKTTLWAEVSK--------WLSLLP-------------NKHWFEMQSLSLHPAP 152 A + QLKT WAEV+K L+L + WF + + P Sbjct: 82 APTYRQLKTIYWAEVNKIYNRSKLKQLNLFEINDKIMRINDKDLKREWFALPVTASTPEG 141 Query: 153 WYS---------DVLHCSLGI----DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + + LGI D + + + E+ + + + ++ DE Sbjct: 142 MQGQHGDKTEVIEQIMKHLGIEEIGDDETIEIVSQILRGEKQIEGLTKEDKEKLLVMVDE 201 Query: 200 ASGTPDVI----------NLGILGFLTERNANRFWIMTSNPRRLSGKFYEI----FNKPL 245 +SG + I L + G +T +N F+ NP+ KFY++ +N P Sbjct: 202 SSGVKNEIFEVLEGTDYDKLVLFGNMT-KNTGYFYESVYNPK---SKFYKVTMSSYNSPF 257 Query: 246 DDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305 K+ QI H+ + YG DS+V RV + G+ P + +S N I+ A Sbjct: 258 --MKKEQI------------HD-LEETYGPDSNVVRVRLKGEAPDGNENSIFSSNKIDSA 302 Query: 306 LNREPCPDPYAPLIMGCDIAE-EGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLV 364 R Y + +G D+ + GGD++ + ++ + D L +I Sbjct: 303 FQRSLSLSEYETIKLGVDVGKGSGGDSSTIYEKKDNRVRKKLDRKDFTLPDVKREIIQYC 362 Query: 365 EKYRPDAIIIDANNTGART-----CDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVK 419 K R II + + TG T + E+ V + +A + + N+RTE++ + Sbjct: 363 YKNRDKLIIANIDGTGLGTGLVQELEEGEIENLVVNDIQFAGKAKNKKEFNNKRTEMYFE 422 Query: 420 MADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK-RVKG--AKSTDYSDGL 472 ++ L+ L L + L ++ + N G + SK ++K S D SD L Sbjct: 423 LSRNLDKLDLEEDQELKREL-LIQIYEFDNNGRFKLISKDKIKEMLGHSPDKSDAL 477 >gi|153951273|ref|YP_001397540.1| putative terminase B protein [Campylobacter jejuni subsp. doylei 269.97] gi|153951467|ref|YP_001398214.1| putative terminase B protein [Campylobacter jejuni subsp. doylei 269.97] gi|152938719|gb|ABS43460.1| putative terminase B protein [Campylobacter jejuni subsp. doylei 269.97] gi|152938913|gb|ABS43654.1| putative terminase B protein [Campylobacter jejuni subsp. doylei 269.97] Length = 430 Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 70/256 (27%), Positives = 107/256 (41%), Gaps = 20/256 (7%) Query: 237 FYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARY-----GLDSDVTRVEVCGQFP 289 FYE+ K L D WK FQ + +P E I G SDV R E+ G+F Sbjct: 164 FYELCRKELSDKNWKHFQFSSYD----NPFLKEEQIKELIEEVGGESSDVVRQEIYGEFI 219 Query: 290 QQDIDSFIPLNIIEEALNREP--CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347 L+ IE A+++ I G D+A G D +V+ R+G VI+ L Sbjct: 220 DSSSAELFSLSGIENAMSKNSFSTQKMQGENIWGLDVARYGDDKSVLAKRKGFVIDELKK 279 Query: 348 WSKTDLRTTNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405 +S+ NKI ++ +P I ID G D L G V+ A Sbjct: 280 YSQLGTIELANKILAEYKQSEEKPKGIFIDTCGLGVGVYDVLLNYGLPVFEANSANSATS 339 Query: 406 LEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKG 462 ++ N+R +++ A L+ L+ L +++ ++ + + G L I SK + Sbjct: 340 NQYL-NKRAQMYFTFAKNLKHMELVKDEELKNDMRRIE-YEYSDKGLLKIVSKEQLKKNY 397 Query: 463 AKSTDYSDGLMYTFAE 478 KS D SD + TF E Sbjct: 398 GKSPDLSDAVALTFFE 413 >gi|304399103|ref|ZP_07380971.1| DNA packaging protein [Pantoea sp. aB] gi|304353343|gb|EFM17722.1| DNA packaging protein [Pantoea sp. aB] Length = 503 Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 71/275 (25%), Positives = 115/275 (41%), Gaps = 37/275 (13%) Query: 45 LEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMST 104 +E F +WQ E +NSV + +++G G GK++L A ++L M Sbjct: 32 VELFGMIPTWQQE-------EIMNSVQETGSQT---TVTSGHGTGKSSLTAMMLLIYMIM 81 Query: 105 RPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI 164 P VI +AN Q+KT ++ V + + +H + +L +Y GI Sbjct: 82 YPDARVIIVANKIGQVKTGVFKYVKTYWANAARRHPWLQNYFTLTDTMFYE---KSRKGI 138 Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFW 224 + +C+ Y + G H + + I+ DEASG D + G LTE + NR Sbjct: 139 ----WEVLCKGYRLGNEEALAGEHAAHILLIL-DEASGISDKAIAIMRGALTEED-NRM- 191 Query: 225 IMTSNPRRLSGKFYEIF-------NKPLDDWKRFQIDTRTVEGIDPSF-HEGIIARYGLD 276 +M S P R SG FY+ + P W +++ + F E ++ G D Sbjct: 192 LMMSQPTRPSGYFYDSHHSLARHPDNPNGFWNAIVLNSEEAPHVTLKFIREKLVEYGGRD 251 Query: 277 SDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 S V+V G+FP+ N+ L R+ C Sbjct: 252 SLEYMVKVLGRFPR---------NVSGYLLGRDEC 277 >gi|283956317|ref|ZP_06373797.1| terminase B protein, putative [Campylobacter jejuni subsp. jejuni 1336] gi|283792037|gb|EFC30826.1| terminase B protein, putative [Campylobacter jejuni subsp. jejuni 1336] Length = 430 Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 66/256 (25%), Positives = 107/256 (41%), Gaps = 20/256 (7%) Query: 237 FYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARY-----GLDSDVTRVEVCGQFP 289 FYE+ K L D WK FQ + +P E I G DS+V + E+ G+F Sbjct: 164 FYELCRKELSDKNWKHFQFSSYD----NPFLKEEQIKELIEEVGGEDSEVVKQEIYGEFI 219 Query: 290 QQDIDSFIPLNIIEEALNREP--CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347 L IE A+++ I G D+A G D +V+ R+G +++ + Sbjct: 220 DSSSAELFALTEIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSVLAKRKGFIVDEIKK 279 Query: 348 WSKTDLRTTNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405 +S+ N+I + +P I ID G D L G V+ A Sbjct: 280 YSQLGTMELANRILAEYNQSEDKPKGIFIDTCGLGVGVYDVLLNYGLPVFEANSANSATS 339 Query: 406 LEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKG 462 E+ N+R +++ A L+ L+ L ++++ ++ + + G L I SK + Sbjct: 340 NEYL-NKRAQMYFTFAKNLKHMELVKDEELKKDMRMIE-YEYSDKGLLKIVSKEQLKKNY 397 Query: 463 AKSTDYSDGLMYTFAE 478 KS D SD + TF E Sbjct: 398 GKSPDVSDAVALTFFE 413 >gi|212703250|ref|ZP_03311378.1| hypothetical protein DESPIG_01292 [Desulfovibrio piger ATCC 29098] gi|212673294|gb|EEB33777.1| hypothetical protein DESPIG_01292 [Desulfovibrio piger ATCC 29098] Length = 330 Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 59/216 (27%), Positives = 94/216 (43%), Gaps = 12/216 (5%) Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--PLIMGCDIAEEGG 329 R L + R E+ F D IPL + EA R+ D P+I+G D+A G Sbjct: 79 RRELSDNAFRQEMLCDFTASSDDILIPLPDVLEAEARQLAWDDVGGMPVILGVDVARFGA 138 Query: 330 DNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 D++V+V R+G ++ D ++++ + + RP A+ IDA G D L Sbjct: 139 DSSVIVRRQGLKVDGPVVMRGLDNMQLADRVAAAIMENRPHAVFIDAGQ-GQGVIDRLRQ 197 Query: 390 LGYHVYRV-LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG----LIQNLKSLKS 444 LG+ V V G K + F NRR+E+ + WL+ + G ++ S Sbjct: 198 LGHEVIEVPFGGKPLQEGRFA-NRRSEMWYGLRQWLKSGGKLPDEGDDVPRLRAELSAPL 256 Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 + G + +E K + + S D +D L TFA Sbjct: 257 YWYDAAGRMVLEPKDKIKERLGASPDIADALALTFA 292 >gi|315929403|gb|EFV08605.1| phosphatase, Ppx/GppA family [Campylobacter jejuni subsp. jejuni 305] Length = 430 Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 67/256 (26%), Positives = 104/256 (40%), Gaps = 20/256 (7%) Query: 237 FYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARY-----GLDSDVTRVEVCGQFP 289 FYE+ K L D WK FQ + +P E I G S+V + E+ G+F Sbjct: 164 FYELCRKELSDKNWKHFQFSSYD----NPFLKEEQIKELIEEVGGEGSEVVKQEIYGEFI 219 Query: 290 QQDIDSFIPLNIIEEALNREP--CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347 L+ IE A+++ I G D+A G D + + R+G VI + Sbjct: 220 DSSSAELFSLSEIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSALAKRKGFVIYEIKK 279 Query: 348 WSKTDLRTTNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405 +S+ NKI + +P I ID G D L G V+ A Sbjct: 280 YSQLGTIELANKILAEYNQSEDKPKGIFIDTCGLGVGVYDVLLNYGLPVFEANSANSATS 339 Query: 406 LEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKG 462 E+ N+R +++ A L+ L L ++++ ++ + + G L I SK + Sbjct: 340 NEYL-NKRAQMYFTFAKNLKHMELFKDEELKKDMRMIE-YEYSDKGLLKIVSKEYLKKNY 397 Query: 463 AKSTDYSDGLMYTFAE 478 KS D SD + TF E Sbjct: 398 GKSPDVSDAVALTFFE 413 >gi|57237579|ref|YP_178593.1| terminase B protein, putative [Campylobacter jejuni RM1221] gi|57166383|gb|AAW35162.1| terminase B protein, putative [Campylobacter jejuni RM1221] Length = 430 Score = 63.2 bits (152), Expect = 9e-08, Method: Compositional matrix adjust. Identities = 64/252 (25%), Positives = 105/252 (41%), Gaps = 12/252 (4%) Query: 237 FYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARYGLD-SDVTRVEVCGQFPQQDI 293 FYE+ K L D WK FQ + + + +I G + S+V + E+ G+F Sbjct: 164 FYELCRKELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGEGSEVVKQEIYGEFIDSSS 223 Query: 294 DSFIPLNIIEEALNREP--CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT 351 L+ IE A+++ I G D+A G D + + R+G VI + +S+ Sbjct: 224 AELFSLSEIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSALAKRKGFVIYEIKKYSQL 283 Query: 352 DLRTTNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409 NKI + +P I ID G D L G V+ A E+ Sbjct: 284 GTIELANKILAEYNQSEDKPKGIFIDTCGLGVGVYDVLLNYGLPVFEANSANSATSNEYL 343 Query: 410 RNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKST 466 N+R +++ L+ L+ L ++++ ++ + + G L I SK + KS Sbjct: 344 -NKRAQMYFTFTKNLKHMELVKDEELKKDMRMIE-YEYSDKGLLKIVSKEQLKKNYGKSP 401 Query: 467 DYSDGLMYTFAE 478 D SD + TF E Sbjct: 402 DVSDAVALTFFE 413 >gi|168467778|ref|ZP_02701615.1| DNA pacase B subunit [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|195629119|gb|EDX48493.1| DNA pacase B subunit [Salmonella enterica subsp. enterica serovar Newport str. SL317] Length = 494 Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 88/381 (23%), Positives = 154/381 (40%), Gaps = 59/381 (15%) Query: 48 FSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPG 107 F +WQ + + SV P K ++S+G G GK+ + + +++ + PG Sbjct: 30 FGKTPTWQQD-------QIIESVQEPGS---KTSVSSGHGTGKSDMTSIMIMLFIIMFPG 79 Query: 108 ISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSK 167 I +AN Q+ T ++ K+L + +W S + PW ++ + D+ Sbjct: 80 ARAIIVANKIQQVMTGIF----KYLKI----NW----STATSRFPWLAEYFVLT---DTS 124 Query: 168 HYSTMCRTYSEERPDTF--------VGHHNTYGMAIINDEASGTPDVINLGILGFLTERN 219 Y + P F G H + + II DEASG D + G LT ++ Sbjct: 125 FYEITSKGVWTVVPKGFRLGNEEALAGEHADHLLYII-DEASGVSDKAFGIMTGALTGKD 183 Query: 220 ANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDPSFHEGIIAR 272 NR ++ S P R SG FY+ +K P + +++ + P F + +A Sbjct: 184 -NRI-LLLSQPTRPSGYFYDTHHKLAKRPGNPNGIYTAITLNSEESPLVTPEFIKMKLAE 241 Query: 273 Y-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIA-EEGGD 330 Y G DS + ++V G FP+ + + +E A R+ I D+A G D Sbjct: 242 YGGRDSPMYLIKVRGLFPKTQDGFLLGRDEVERASRRKVKIAKGWGWIACVDVAGGTGRD 301 Query: 331 NTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISGLV--EKYRPDAIIIDANNTG 380 +V+ + +R + + ++S KI+ ++Y I+ID + G Sbjct: 302 KSVINIMMVSGERNKRRIIGYRIIEYSDVTETQLAAKINAECSPDRYPNITIVIDGDGLG 361 Query: 381 ARTCDYLEMLGYHVYRVLGQK 401 T D L Y Y + Q+ Sbjct: 362 KSTADLL----YDNYGITAQR 378 >gi|282598783|ref|YP_003359102.1| putative large subunit terminase [Clavibacter phage CMP1] gi|262212571|gb|ACY35907.1| putative large subunit terminase [Clavibacter phage CMP1] Length = 872 Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 84/339 (24%), Positives = 132/339 (38%), Gaps = 49/339 (14%) Query: 183 TFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN 242 +F G H+ + +A++ DEA G P+ + +G T +A I NP + + F+E F Sbjct: 511 SFQGIHDGH-VAVVLDEAGGLPEDLYIGANAVTTNFHARILAI--GNPDKRNTPFHERFT 567 Query: 243 --KPLDDWKRFQI---DTRTVEGI----DPSFHE-----------GIIARYGLDSDVTRV 282 + W RF I DT G DP+ E + R V Sbjct: 568 DTEKFSSWNRFTIGAEDTPNFTGEKIYEDPAKDEDVKKHLVQVSWAVEMRKSARPSVVAA 627 Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVI 342 +V G FP+ D +F ++I + E P+ MG DI+ +G D +V + G I Sbjct: 628 KVDGNFPESDDTTFFDQSVINRGYSTEIEPESTDFKYMGVDISYQGEDQSVAYINHGGQI 687 Query: 343 EHLFDWSKTD--------LRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--- 391 +W++ D +R N V++ R ID TGA L+ML Sbjct: 688 RIADEWNRFDGAEHIESAIRIHNKACQEGVQEVR-----IDMAGTGAGVYSNLKMLDQFK 742 Query: 392 ---YHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL---INHSGLIQNLKSLKSF 445 Y + V G R + N R + + L + I L + ++ L+ Sbjct: 743 DKPYVLIGVNGANRTPNSNRWLNARAWHYDQFRTGLITGKIDITITDVDLKKEME-LQPS 801 Query: 446 IVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPP 481 N G+L I K R G S D+ D +Y+ + P Sbjct: 802 TFTNRGQLQITRKDDMRKMGISSPDHLDAAIYSAIDTTP 840 >gi|148653111|ref|YP_001280204.1| hypothetical protein PsycPRwf_1309 [Psychrobacter sp. PRwf-1] gi|148572195|gb|ABQ94254.1| hypothetical protein PsycPRwf_1309 [Psychrobacter sp. PRwf-1] Length = 520 Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 94/445 (21%), Positives = 178/445 (40%), Gaps = 60/445 (13%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138 + ++++G G GK+ + LW + P ++ A QL+T +W E++ L L N Sbjct: 57 RTSVASGHGTGKSRSAGIIALWHLLFYPESVMLFTAPQIGQLRTVVWKEINICLQRLRNN 116 Query: 139 HWFEMQSLSLHPAPWYSD-VLHCSLGIDSKHYS----TMCRTYSEERPDTFVGHHNTYGM 193 W +D V+ + I K + +T + +P G H + M Sbjct: 117 ----------KALGWLADYVVVLAEKIYIKGFKDTWFVFAKTAPKHQPTNIAGQHGDHYM 166 Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL----DDWK 249 + DEA G D + +G LT N NR ++TS P + +G FY+ +K W Sbjct: 167 -VWADEACGIDDAVMEVAIGALTHEN-NRA-VLTSQPAKNTGFFYDTHHKLSHHNGGKWT 223 Query: 250 RFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308 + + + + +YG +S + + G+FP+ ++ E + + Sbjct: 224 ALEFNGEMSPIVSKDKLIEALYQYGSRNSPGYLIRIRGKFPELK-GEYLLTRTDYENMKQ 282 Query: 309 EPC----PDPYAPLIMGCDIAEEGGDNTVVV--------LRRGPVIEH-------LFDWS 349 +PC D + +I+ D+ + G ++ V+ + +G + H LF + Sbjct: 283 QPCVIEEGDKWG-IIVAVDVGGDVGRDSSVISVMQVVDKMIKGRIERHVHLLDIPLFS-N 340 Query: 350 KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409 + ++ T KI+ ++ Y ++ID G L+ G + V + Sbjct: 341 RANINTLKAKINDVMSDYPGATLVIDPLGAGMGLTQSLKADGVYFDEVHWGSPCFNNTLK 400 Query: 410 R---NRRTELHVKMADWLE---FASLINHSGLIQNLKSLKS------FIVPNTGELAIES 457 R N+R+ +V MA +E F+ + Q + +L+ + + S Sbjct: 401 RYYMNKRSHAYVSMAKAVEKGYFSVSDKVKKMYQVMTNLEEQMTRLPYYFDEKARWCMMS 460 Query: 458 KR---VKGAKSTDYSDGLMYTFAEN 479 K+ KG KS D +D + + F EN Sbjct: 461 KKDMLKKGIKSPDIADTIAFGFMEN 485 >gi|226227228|ref|YP_002761334.1| hypothetical protein GAU_1822 [Gemmatimonas aurantiaca T-27] gi|226090419|dbj|BAH38864.1| hypothetical protein [Gemmatimonas aurantiaca T-27] Length = 549 Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 107/462 (23%), Positives = 179/462 (38%), Gaps = 67/462 (14%) Query: 81 AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSK-WLSLLPNKH 139 A+++G G GKT L A L+LW ++ P +A Q + +W EV++ W Sbjct: 71 AVASGTGTGKTFLEAVLLLWWIAVEPDSIATTVATKADQQEKGIWREVARHWPRFQACFP 130 Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 E+ +L + PW D + GI T EE G H + I+ DE Sbjct: 131 EAELTTLRIRMEPWRGDAWG-AWGI------TAAPKAGEESSSAVQGLHAKR-LLILVDE 182 Query: 200 ASGTPDVINLGILGFLT-ERNANRFWIMTSNPRRLS---GKFYEIFNKPLDDWKRFQID- 254 G P + ++ T E N + NP + G+F E K + + +D Sbjct: 183 TPGVPQPVMTALVNTATGEENVIAAF---GNPDYQADPLGQFAE--TKRVTAIRISALDH 237 Query: 255 ---TRTVEGIDPSFHEGIIA----RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 VE I + IA +YG++S V + V G P+Q + I L A + Sbjct: 238 PNVVLGVERIPGAATRLSIATREDKYGVESGVYQSRVRGIAPEQSASALIHLAWCVAAAD 297 Query: 308 REPCPDPYA----PLIMGCDIAE-EGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 R A P +G D+A+ E GD V + +G + + + + ++ Sbjct: 298 RAESVQHAALALGPKALGVDVAQSENGDKAAVAMGQGARLLSVIAKACPNATKLGAEVWQ 357 Query: 363 LV--EKYRPDAIIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVD--------- 405 L+ E P+ + +D GA T ++L E G V R G +A++ Sbjct: 358 LMRDEGIVPEYVGVDPIGVGAATVNHLDGECEKENAGRSVVRCSGGAKAMEASSRAADGS 417 Query: 406 -LEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNT-------GELAIES 457 +E+ + +++ W + + + GLI + + F T G + +ES Sbjct: 418 AMEWLADANKFKNLRAQMWWQLREDLRN-GLIALPRDRELFRELTTVQFDEDGGIVTLES 476 Query: 458 K---RVKGAKSTDYSDGLMY-------TFAENPPRSDMDFGR 489 K R + +S D +D ++Y T PP D R Sbjct: 477 KDDIRKRLGRSPDRADAVVYWNWVRPRTRVNQPPPEGFDVAR 518 >gi|299769795|ref|YP_003731821.1| hypothetical protein AOLE_07785 [Acinetobacter sp. DR1] gi|298699883|gb|ADI90448.1| hypothetical protein AOLE_07785 [Acinetobacter sp. DR1] Length = 668 Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 101/432 (23%), Positives = 163/432 (37%), Gaps = 61/432 (14%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GKT + LW + ++ A QLK +W E+S + L Sbjct: 211 GKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLARLKQ 259 Query: 149 HPAPWYSD-VLHCSLGIDSKHYS----TMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 P W +D V + S + K Y +T + +P G+H M + DEASG Sbjct: 260 GPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEASGV 318 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVE 259 D + G LT + NR +MTS P R +G FYE +K W + Sbjct: 319 DDAVLDVAFGALTHED-NRA-VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESP 376 Query: 260 GIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA-P 317 + E +YG D ++ V G+FP + I EE D + Sbjct: 377 LVSKQSLEEQRQKYGSRDDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFG 436 Query: 318 LIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKISGL 363 I+ D+ G D++V+V+ RR V++ ++ D+ KI+ L Sbjct: 437 YIITVDVGGGVGRDDSVIVISKVWGEAQWGERARRVEVVDIPLCKNRDDILELFAKINEL 496 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA-D 422 + +Y +++D N G YL+ G V + F + R E K + Sbjct: 497 LLQYPNANLVVDDNGAGKGLGQYLKKQGIFYVPVYWGSQC----FSNDNRKEFTNKRSLA 552 Query: 423 WLEFASLINHSGLIQNLKSLKSFI--------VPNTGE-------LAIESKRVKGAKSTD 467 ++ FA + SG + +K+ K ++ +P + L+ + R G KS D Sbjct: 553 YVGFARAVA-SGRFK-MKTKKHYVKIKDQLIHIPYRFDDFARYKILSKDEMRRMGIKSPD 610 Query: 468 YSDGLMYTFAEN 479 D + F EN Sbjct: 611 LGDAFAFLFLEN 622 >gi|323516996|gb|ADX91377.1| hypothetical protein ABTW07_0941 [Acinetobacter baumannii TCDC-AB0715] gi|323518424|gb|ADX92805.1| hypothetical protein ABTW07_2381 [Acinetobacter baumannii TCDC-AB0715] Length = 663 Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 78/328 (23%), Positives = 125/328 (38%), Gaps = 39/328 (11%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GKT + LW + ++ A QLK +W E+S + L Sbjct: 211 GKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLARLKQ 259 Query: 149 HPAPWYSD-VLHCSLGIDSKHYS----TMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 P W +D V + S + K Y +T + +P G+H M + DEASG Sbjct: 260 GPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEASGV 318 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVE 259 D + G LT + NR +MTS P R +G FYE +K W + Sbjct: 319 DDAVLDVAFGALTHED-NRA-VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESP 376 Query: 260 GIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA-P 317 + E +YG D ++ V G+FP + I EE D + Sbjct: 377 LVSKQSLEEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFG 436 Query: 318 LIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKISGL 363 ++ D+ G D++V+V+ RR V++ ++ D+ KI+ L Sbjct: 437 YVITVDVGGGVGRDDSVIVVSKVWGESQWGERARRVEVVDIPLCKNRDDILELFAKINEL 496 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLG 391 + +Y +++D N G YL+ G Sbjct: 497 LLQYPNANLVVDDNGAGKGLGQYLKKQG 524 >gi|256392042|ref|YP_003113606.1| hypothetical protein Caci_2856 [Catenulispora acidiphila DSM 44928] gi|256358268|gb|ACU71765.1| conserved hypothetical protein [Catenulispora acidiphila DSM 44928] Length = 484 Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 93/443 (20%), Positives = 161/443 (36%), Gaps = 77/443 (17%) Query: 81 AISAGRGIGKTTLNAWLVLWLMSTRP--GISVICLANSETQLKTTLWAEVSKWLSL---- 134 A+ + G GK+ + + L W + T P V+ A + Q+K LWAE++K + Sbjct: 58 AVQSCHGTGKSFVASRLTAWWLDTHPPGEAFVVTTAPTGDQVKAILWAEINKAFAKAEAR 117 Query: 135 ---LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191 LP + ++ W D + G R S+ P F G H Y Sbjct: 118 GTPLPGR---------INETDWKYDKFLVAFG----------RKPSDYNPHAFQGIHAKY 158 Query: 192 GMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF 251 + I+ DEA G L T + I NP F ++ D W Sbjct: 159 VLVIL-DEACGISKQFWTAALAIATGVHCRILAI--GNPDDPGSHFAQVCKS--DRWNMI 213 Query: 252 QIDTR-----TVEGIDPSFHEGIIAR---------YGLDSDVTRVEVCGQFPQQDIDSFI 297 +I R T E + + ++++ +G +S + +V +FP D + Sbjct: 214 KIAARDTPNFTGEEVPDDLADMLVSQAYVLDMAEEFGPESPIYLSKVDAEFPSDASDGVV 273 Query: 298 PLNIIEEALNREP----CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDL 353 L+ + A REP PD P+ +G D+ GGD T + RRG + + D Sbjct: 274 RLSKL-MACTREPVHPYAPDRLVPVELGVDLG-AGGDETCIRERRGIAAGREWRNREKDS 331 Query: 354 RTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM---LGYHVYRVLGQKRAVDLEFCR 410 + I + + + +D+ G L+ G H V+G + E Sbjct: 332 EKVVDHIVRAIRETGATKVKVDSIGIGWGIVGSLQARRKQGLHTAEVVGVNVS---EAST 388 Query: 411 NRRTELHVKMADWLEFASLINHSG--------------LIQNLKSLKSFIVPNTGELAIE 456 ++ W E ++ G L+ L + K + + +G + +E Sbjct: 389 QPEKYARLRSQIWWEVGRKLSEDGGWDLSQLDTTDRDRLVSQLTAPK-YDLDASGRIVVE 447 Query: 457 SK---RVKGAKSTDYSDGLMYTF 476 K + + +S D +D L+ F Sbjct: 448 KKEETKKRIGRSPDNADALLLAF 470 >gi|312964323|ref|ZP_07778627.1| terminase B protein [Escherichia coli 2362-75] gi|331655801|ref|ZP_08356790.1| terminase B protein (PACase B protein) (DNA packaging B protein) [Escherichia coli M718] gi|312291036|gb|EFR18910.1| terminase B protein [Escherichia coli 2362-75] gi|323186470|gb|EFZ71817.1| terminase B protein [Escherichia coli 1357] gi|323969205|gb|EGB64507.1| terminase B protein [Escherichia coli TA007] gi|325495624|gb|EGC93488.1| DNA pacase B subunit [Escherichia fergusonii ECD227] gi|331046575|gb|EGI18664.1| terminase B protein (PACase B protein) (DNA packaging B protein) [Escherichia coli M718] Length = 494 Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 79/340 (23%), Positives = 139/340 (40%), Gaps = 32/340 (9%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS-KWLSLLPN 137 K ++S+G G GK+ + + +++ + PG I +AN Q+ T ++ + W + Sbjct: 51 KTSVSSGHGTGKSDMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSR 110 Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197 W L +Y + K + R SEE G H + + II Sbjct: 111 FPWLA-DYFVLTETAFYEVTGKGVWTVVPKGF----RLGSEE---ALAGEHADHLLYII- 161 Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKR 250 DEASG D I G LT ++ NR ++ S P R SG FY+ +K P + Sbjct: 162 DEASGVSDRAFGIITGALTGQD-NRI-LLLSQPTRPSGYFYDTHHKLAKRPGNPDGVYTA 219 Query: 251 FQIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309 +++ + P+F + +A Y G D+ + ++V G FP+ + + +E A R+ Sbjct: 220 ITLNSEESPLVTPAFIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRK 279 Query: 310 PCPDPYAPLIMGCDIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKI 360 + D+A G D +V+ + +R + + +++ KI Sbjct: 280 VKIAKGWGWLACVDVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKI 339 Query: 361 SGLV--EKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRV 397 E++ I ID + G T D + E G V R+ Sbjct: 340 FAECNPERFPNITIAIDGDGLGKATADLMYEYYGITVQRI 379 >gi|332974843|gb|EGK11758.1| hypothetical protein HMPREF9373_1714 [Psychrobacter sp. 1501(2011)] Length = 520 Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 96/466 (20%), Positives = 183/466 (39%), Gaps = 67/466 (14%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138 + ++++G G GK+ + LW + P ++ A QL+T +W E++ L L N Sbjct: 57 RTSVASGHGTGKSRSAGIIALWHLLFYPESVMLFTAPQIGQLRTVVWKEINICLQRLRNN 116 Query: 139 HWFEMQSLSLHPAPWYSD-VLHCSLGIDSKHYS----TMCRTYSEERPDTFVGHHNTYGM 193 W +D V+ + I K + +T + +P G H + M Sbjct: 117 ----------KALGWLADYVVVLAEKIYIKGFKDTWFVFAKTAPKHQPTNIAGQHGDHYM 166 Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL----DDWK 249 + DEA G D + +G LT N NR ++TS P + +G FY+ +K W Sbjct: 167 -VWADEACGIDDAVMEVAIGALTHEN-NRA-VLTSQPAKNTGFFYDTHHKLSHYNGGKWI 223 Query: 250 RFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308 + + + + +YG +S + + G+FP+ ++ E + Sbjct: 224 ALEFNGEMSPIVSKEKLIEALYQYGSRNSPGYLIRIRGKFPELK-GEYLLTRTDYENMKA 282 Query: 309 EPC----PDPYAPLIMGCDIAEEGGDNTVVV--------LRRGPVIEH-------LFDWS 349 PC D + +I+ D+ + G ++ V+ + +G + H LF + Sbjct: 283 HPCVIKEGDKWG-IIVTVDVGGDVGRDSSVISVLQVVDKMVKGRIERHVHLLDIPLFS-N 340 Query: 350 KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409 + ++ T KI+ ++ Y ++ID G ++ G + V + Sbjct: 341 RANINTLKAKINDVMSDYPGATLVIDPLGAGMGLTQSVKADGVYFDEVHWGSPCFNNTLK 400 Query: 410 R---NRRTELHVKMADWLE---FASLINHSGLIQNLKSLKS------FIVPNTGELAIES 457 R N+R+ +V MA +E F+ + Q + +L+ + + S Sbjct: 401 RYYMNKRSHAYVSMAKAVEKGYFSVSDKIKKMYQVITNLEEQMTRLPYYFDEKARWCMMS 460 Query: 458 KR---VKGAKSTDYSDGLMYTFAENPPRSDMDFGRCPSYQYEGVDL 500 K+ KG KS D +D + + F EN P+ YE +++ Sbjct: 461 KKDMLKKGIKSPDIADTIAFGFMEN-------ISYAPAESYEDLNI 499 >gi|260871239|ref|YP_003238019.1| DNA packaging protein [Escherichia coli O111:H- str. 11128] gi|257767818|dbj|BAI39311.1| DNA packaging protein [Escherichia coli O111:H- str. 11128] Length = 494 Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 77/338 (22%), Positives = 140/338 (41%), Gaps = 32/338 (9%) Query: 81 AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEV-SKWLSLLPNKH 139 ++++G G GK+ + + + + + PG VI +AN Q+ ++ + S W + + Sbjct: 53 SVTSGHGTGKSDMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFP 112 Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 W + L ++ I K CR+ +EE G H + + II DE Sbjct: 113 WLS-KYFILTETSFFEVTGKGVWTILIKS----CRSGNEE---ALAGEHADHLLYII-DE 163 Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQ 252 ASG D I G LT ++ NR ++ S P R SG FY+ ++ P + Sbjct: 164 ASGVSDKAFSVITGALTGKD-NRI-LLLSQPTRPSGYFYDSHHRLAIRPGNPDGLFTAII 221 Query: 253 IDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 +++ +D F +A Y G D+ + ++V G+FP+ + + +E A R+ Sbjct: 222 LNSEESPLVDAKFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVK 281 Query: 312 PDPYAPLIMGCDIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISG 362 + D+A G D +V+ + +R + + +++ KI Sbjct: 282 IAKGWGWVACVDVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKIFA 341 Query: 363 LV--EKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRV 397 E++ I ID + G T D + E G V R+ Sbjct: 342 ECNPERFPNITIAIDGDGLGKSTADLMYERYGITVQRI 379 >gi|56266643|gb|AAV84926.1| DNA pacase B subunit [Enterobacteria phage phiW39] Length = 494 Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 79/340 (23%), Positives = 139/340 (40%), Gaps = 32/340 (9%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS-KWLSLLPN 137 K ++S+G G GK+ + + +++ + PG I +AN Q+ T ++ + W + Sbjct: 51 KTSVSSGHGTGKSDMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSR 110 Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197 W L +Y + K + R SEE G H + + II Sbjct: 111 FPWLA-DYFVLTETAFYEITGKGVWTVVPKGF----RLGSEE---ALAGEHADHLLYII- 161 Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKR 250 DEASG D I G LT ++ NR ++ S P R SG FY+ +K P + Sbjct: 162 DEASGVSDRAFGIITGALTGQD-NRI-LLLSQPTRPSGYFYDTHHKLAKRPGNPDGVYTA 219 Query: 251 FQIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309 +++ + P+F + +A Y G D+ + ++V G FP+ + + +E A R+ Sbjct: 220 ITLNSEESPLVTPAFIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRK 279 Query: 310 PCPDPYAPLIMGCDIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKI 360 + D+A G D +V+ + +R + + +++ KI Sbjct: 280 VKIAKGWGWLACVDVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKI 339 Query: 361 SGLV--EKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRV 397 E++ I ID + G T D + E G V R+ Sbjct: 340 FAECNPERFPNITIAIDGDGLGKATADLMYEYYGITVQRI 379 >gi|324111095|gb|EGC05081.1| terminase B protein [Escherichia fergusonii B253] Length = 494 Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 79/340 (23%), Positives = 139/340 (40%), Gaps = 32/340 (9%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS-KWLSLLPN 137 K ++S+G G GK+ + + +++ + PG I +AN Q+ T ++ + W + Sbjct: 51 KTSVSSGHGTGKSDMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSR 110 Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197 W L +Y + K + R SEE G H + + II Sbjct: 111 FPWLA-DYFVLTETAFYEVTGKGVWTVVPKGF----RLGSEE---ALAGEHADHLLYII- 161 Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKR 250 DEASG D I G LT ++ NR ++ S P R SG FY+ +K P + Sbjct: 162 DEASGVSDRAFGIITGALTGQD-NRI-LLLSQPTRPSGYFYDTHHKLAKRPGNPDGVYTA 219 Query: 251 FQIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309 +++ + P+F + +A Y G D+ + ++V G FP+ + + +E A R+ Sbjct: 220 ITLNSEESPLVTPAFIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRK 279 Query: 310 PCPDPYAPLIMGCDIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKI 360 + D+A G D +V+ + +R + + +++ KI Sbjct: 280 VKIAKGWGWLACVDVAGGTGRDKSVINIMMVSGQRNKRRVINYRILEYTDVTETQLAAKI 339 Query: 361 SGLV--EKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRV 397 E++ I ID + G T D + E G V R+ Sbjct: 340 FAECNPERFPNITIAIDGDGLGKATADLMYEYYGITVQRI 379 >gi|213156231|ref|YP_002318651.1| phage terminase [Acinetobacter baumannii AB0057] gi|301346399|ref|ZP_07227140.1| phage terminase [Acinetobacter baumannii AB056] gi|301594275|ref|ZP_07239283.1| phage terminase [Acinetobacter baumannii AB059] gi|213055391|gb|ACJ40293.1| phage terminase [Acinetobacter baumannii AB0057] Length = 663 Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 77/328 (23%), Positives = 125/328 (38%), Gaps = 39/328 (11%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GKT + LW + ++ A QLK +W E+S + L Sbjct: 211 GKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLARLKQ 259 Query: 149 HPAPWYSD-VLHCSLGIDSKHYS----TMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 P W +D V + S + K Y +T + +P G+H M + DEASG Sbjct: 260 GPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEASGV 318 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVE 259 D + G LT + NR +MTS P R +G FYE +K W + Sbjct: 319 DDAVLDVAFGALTHED-NRA-VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESP 376 Query: 260 GIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA-P 317 + + +YG D ++ V G+FP + I EE D + Sbjct: 377 LVSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFG 436 Query: 318 LIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKISGL 363 ++ D+ G D++V+V+ RR V++ ++ D+ KI+ L Sbjct: 437 YVITVDVGGGVGRDDSVIVVSKVWGEAQWGERARRVEVVDIPLCKNRDDILELFAKINEL 496 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLG 391 + +Y +++D N G YL+ G Sbjct: 497 LLQYPNANLVVDDNGAGKGLGQYLKKQG 524 >gi|260551382|ref|ZP_05825582.1| phage terminase [Acinetobacter sp. RUH2624] gi|260405545|gb|EEW99037.1| phage terminase [Acinetobacter sp. RUH2624] Length = 663 Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 77/328 (23%), Positives = 125/328 (38%), Gaps = 39/328 (11%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GKT + LW + ++ A QLK +W E+S + L Sbjct: 211 GKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLARLKQ 259 Query: 149 HPAPWYSD-VLHCSLGIDSKHYS----TMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 P W +D V + S + K Y +T + +P G+H M + DEASG Sbjct: 260 GPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEASGV 318 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVE 259 D + G LT + NR +MTS P R +G FYE +K W + Sbjct: 319 DDAVLDVAFGALTHED-NRA-VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESP 376 Query: 260 GIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA-P 317 + + +YG D ++ V G+FP + I EE D + Sbjct: 377 LVSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFG 436 Query: 318 LIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKISGL 363 ++ D+ G D++V+V+ RR V++ ++ D+ KI+ L Sbjct: 437 YVITVDVGGGVGRDDSVIVVSKVWGEAQWGERARRVEVVDIPLCKNRDDILELFAKINEL 496 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLG 391 + +Y +++D N G YL+ G Sbjct: 497 LLQYPNANLVVDDNGAGKGLGQYLKKQG 524 >gi|332852816|ref|ZP_08434408.1| intein splicing region-containing protein [Acinetobacter baumannii 6013150] gi|332871045|ref|ZP_08439658.1| intein splicing region-containing protein [Acinetobacter baumannii 6013113] gi|332729027|gb|EGJ60377.1| intein splicing region-containing protein [Acinetobacter baumannii 6013150] gi|332731805|gb|EGJ63085.1| intein splicing region-containing protein [Acinetobacter baumannii 6013113] Length = 663 Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 77/328 (23%), Positives = 125/328 (38%), Gaps = 39/328 (11%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GKT + LW + ++ A QLK +W E+S + L Sbjct: 211 GKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLARLKQ 259 Query: 149 HPAPWYSD-VLHCSLGIDSKHYS----TMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 P W +D V + S + K Y +T + +P G+H M + DEASG Sbjct: 260 GPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEASGV 318 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVE 259 D + G LT + NR +MTS P R +G FYE +K W + Sbjct: 319 DDAVLDVAFGALTHED-NRA-VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESP 376 Query: 260 GIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA-P 317 + + +YG D ++ V G+FP + I EE D + Sbjct: 377 LVSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFG 436 Query: 318 LIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKISGL 363 ++ D+ G D++V+V+ RR V++ ++ D+ KI+ L Sbjct: 437 YVITVDVGGGVGRDDSVIVVSKVWGEAQWGERARRVEVVDIPLCKNRDDILELFAKINEL 496 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLG 391 + +Y +++D N G YL+ G Sbjct: 497 LLQYPNANLVVDDNGAGKGLGQYLKKQG 524 >gi|226940437|ref|YP_002795511.1| Terminase large subunit [Laribacter hongkongensis HLHK9] gi|226715364|gb|ACO74502.1| Terminase large subunit [Laribacter hongkongensis HLHK9] Length = 133 Score = 55.1 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 40/126 (31%), Positives = 53/126 (42%), Gaps = 23/126 (18%) Query: 114 ANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL------HPAPWYSDVLHCSLGIDSK 167 AN++TQL+T EV KW L HWF+ QS S+ H W +D + Sbjct: 4 ANTDTQLRTKTSPEVGKWQRLSITSHWFDPQSASIAARDKEHAKTWRADFV--------- 54 Query: 168 HYSTMCRTYSEERPDTFVGHHNT-YGMAIINDEASGTPDVINLGILGFLTERNANRFWIM 226 +SE + F G HN + +I DEAS D + G LT+ WI Sbjct: 55 -------PWSEHNTEAFAGLHNKGKRIVLIFDEASAIADKVWEVAEGALTDEETEIIWIA 107 Query: 227 TSNPRR 232 NP R Sbjct: 108 FGNPTR 113 >gi|184158505|ref|YP_001846844.1| hypothetical protein ACICU_02185 [Acinetobacter baumannii ACICU] gi|183210099|gb|ACC57497.1| hypothetical protein ACICU_02185 [Acinetobacter baumannii ACICU] Length = 663 Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 77/328 (23%), Positives = 125/328 (38%), Gaps = 39/328 (11%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GKT + LW + ++ A QLK +W E+S + L Sbjct: 211 GKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLARLKQ 259 Query: 149 HPAPWYSD-VLHCSLGIDSKHYS----TMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 P W +D V + S + K Y +T + +P G+H M + DEASG Sbjct: 260 GPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEASGV 318 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVE 259 D + G LT + NR +MTS P R +G FYE +K W + Sbjct: 319 DDAVLDVAFGALTHED-NRA-VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESP 376 Query: 260 GIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA-P 317 + + +YG D ++ V G+FP + I EE D + Sbjct: 377 LVSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFG 436 Query: 318 LIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKISGL 363 ++ D+ G D++V+V+ RR V++ ++ D+ KI+ L Sbjct: 437 YVITVDVGGGVGRDDSVIVVSKVWGESQWGERARRVEVVDIPLCKNRDDILELFAKINEL 496 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLG 391 + +Y +++D N G YL+ G Sbjct: 497 LLQYPNANLVVDDNGAGKGLGQYLKKQG 524 >gi|46401730|ref|YP_006576.1| PacB [Enterobacteria phage P1] gi|301646767|ref|ZP_07246623.1| putative terminase B protein [Escherichia coli MS 146-1] gi|129547|sp|P27753|TERL_BPP1 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging protein B; AltName: Full=PACase B protein; AltName: Full=Terminase B protein; AltName: Full=Terminase large subunit gi|68597607|sp|Q5XLR0|TERL_BPP7 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging protein B; AltName: Full=PACase B protein; AltName: Full=Terminase B protein; AltName: Full=Terminase large subunit gi|33323612|gb|AAQ07582.1|AF503408_106 PacB [Enterobacteria phage P7] gi|215636|gb|AAA21724.1| pacB [Enterobacteria phage P1] gi|33338757|gb|AAQ14080.1| PacB [Enterobacteria phage P1] gi|33338866|gb|AAQ14188.1| PacB [Enterobacteria phage P1] gi|54112354|gb|AAV28854.1| PacB [Enterobacteria phage P7] gi|301075042|gb|EFK89848.1| putative terminase B protein [Escherichia coli MS 146-1] Length = 494 Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 78/343 (22%), Positives = 141/343 (41%), Gaps = 32/343 (9%) Query: 81 AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEV-SKWLSLLPNKH 139 ++++G G GK+ + + + + + PG VI +AN Q+ ++ + S W + + Sbjct: 53 SVTSGHGTGKSDMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFP 112 Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 W + L ++ I K CR +EE G H + + II DE Sbjct: 113 WLS-KYFILTETSFFEVTGKGVWTILIKS----CRPGNEE---ALAGEHADHLLYII-DE 163 Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQ 252 ASG D I G LT ++ NR ++ S P R SG FY+ ++ P + Sbjct: 164 ASGVSDKAFSVITGALTGKD-NRI-LLLSQPTRPSGYFYDSHHRLAIRPGNPDGLFTAII 221 Query: 253 IDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 +++ +D F +A Y G D+ + ++V G+FP+ + + +E A R+ Sbjct: 222 LNSEESPLVDAKFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVK 281 Query: 312 PDPYAPLIMGCDIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISG 362 + D+A G D +V+ + +R + + +++ KI Sbjct: 282 IAKGWGWVACVDVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKIFA 341 Query: 363 LV--EKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKR 402 E++ I ID + G T D + E G V R+ K+ Sbjct: 342 ECNPERFPNITIAIDGDGLGKSTADLMYERYGITVQRIRWGKK 384 >gi|225155389|ref|ZP_03723881.1| hypothetical protein ObacDRAFT_9437 [Opitutaceae bacterium TAV2] gi|224803845|gb|EEG22076.1| hypothetical protein ObacDRAFT_9437 [Opitutaceae bacterium TAV2] Length = 479 Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 56/227 (24%), Positives = 94/227 (41%), Gaps = 13/227 (5%) Query: 176 YSEERPDTFVGHHNTYG--MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRL 233 ++ +R F G H G + II DEA D I + +R + S+ L Sbjct: 129 FATDRGGRFEGFHAYPGRPLLIILDEAKSIADDIFVA-----ADRCQPTMLLYISSWGGL 183 Query: 234 SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI 293 G+F++ F++ D + +FQ I P F E + A+YG DSD+ R + GQ P+ + Sbjct: 184 FGRFHDAFSQ--DRFAQFQAGIADCPHITPEFIEAMRAQYGEDSDIYRSMILGQRPKGNE 241 Query: 294 DSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW-SKTD 352 F+ + E P + CD AE D V+ R G + + W + Sbjct: 242 TGFVVPFVDYERCESNPPVWQEGTKQVFCDFAET-SDECVIAKRDGNRLSIVDAWIPDGN 300 Query: 353 LRTTNNKISGLVEKYRPDAIII--DANNTGARTCDYLEMLGYHVYRV 397 ++ G + + + + +I DA+ TG L + G + V Sbjct: 301 TAGITDRFEGHLRRLQNEGFVIRGDADGTGHGYITALSLRGIKISGV 347 >gi|331649955|ref|ZP_08351031.1| terminase B protein (PACase B protein) (DNA packaging B protein) [Escherichia coli M605] gi|331041212|gb|EGI13366.1| terminase B protein (PACase B protein) (DNA packaging B protein) [Escherichia coli M605] Length = 494 Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 77/338 (22%), Positives = 139/338 (41%), Gaps = 32/338 (9%) Query: 81 AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEV-SKWLSLLPNKH 139 ++++G G GK+ + + + + + PG VI +AN Q+ ++ + S W + + Sbjct: 53 SVTSGHGTGKSDMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFP 112 Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 W + L ++ I K CR +EE G H + + II DE Sbjct: 113 WLS-KYFILTETSFFEVTGKGVWTILIKS----CRPGNEE---ALAGEHADHLLYII-DE 163 Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQ 252 ASG D I G LT ++ NR ++ S P R SG FY+ ++ P + Sbjct: 164 ASGVSDKAFSVITGALTGKD-NRI-LLLSQPTRPSGYFYDSHHRLAIRPGNPDGLFTAII 221 Query: 253 IDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 +++ +D F +A Y G D+ + ++V G+FP+ + + +E A R+ Sbjct: 222 LNSEESPLVDAKFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVK 281 Query: 312 PDPYAPLIMGCDIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISG 362 + D+A G D +V+ + +R + + +++ KI Sbjct: 282 IAKGWGWVACVDVAGGTGRDKSVINIMMVSGQRNKRRVINYRMQEYTDVTETQLAAKIFA 341 Query: 363 LV--EKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRV 397 E++ I ID + G T D + E G V R+ Sbjct: 342 ECNPERFPNITIAIDGDGLGKSTADLMYERYGITVQRI 379 >gi|261381054|ref|ZP_05985627.1| phage terminase, large subunit, PBSX family [Neisseria subflava NJ9703] gi|284796087|gb|EFC51434.1| phage terminase, large subunit, PBSX family [Neisseria subflava NJ9703] Length = 450 Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 45/194 (23%), Positives = 89/194 (45%), Gaps = 35/194 (18%) Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378 I+G D+A+EG D + +LR G V+ + +W D+ + +K+ ++ + D I+ D+ Sbjct: 241 ILGFDVADEGDDASATILRHGSVVIDMDEWRGQDVIYSADKVYLYGQEAKADKIVYDSIG 300 Query: 379 TGA-------RTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW-------- 423 GA R ++ +G++ + + A + +N+ ++K W Sbjct: 301 VGAGVKAQFRRKTGKVQTIGFNAGGSVFKPEARYTDDKKNKDMFSNIKAQAWWMVRERFY 360 Query: 424 -----LEFA------SLINHSGLIQNLKSLKSFI------VPNTGELAIESKR---VKGA 463 +EF LI+ SG +++L+ LK+ + N G + +ESK+ +G Sbjct: 361 KTWRAIEFGDTYPIDELISISGSLKDLEYLKAELSRPRVDYDNNGRVKVESKKDMAKRGI 420 Query: 464 KSTDYSDGLMYTFA 477 S + +D L+ FA Sbjct: 421 PSPNRADALIMAFA 434 >gi|320103661|ref|YP_004179252.1| hypothetical protein Isop_2123 [Isosphaera pallida ATCC 43644] gi|319750943|gb|ADV62703.1| hypothetical protein Isop_2123 [Isosphaera pallida ATCC 43644] Length = 553 Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 77/349 (22%), Positives = 128/349 (36%), Gaps = 39/349 (11%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141 ++ G +GK+ L A L LW + T PG V+ A S+ L T L+ E+ K L+ + Sbjct: 68 VATGNAVGKSYLAAGLTLWWLYTHPGSLVVATAPSQGLLGTVLFRELQKALA-ASRRRGL 126 Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201 + + + L G C + + G H+ M ++ DEAS Sbjct: 127 GLPGMVVGSDRGTPFSLRVGPGRRLAAEGWGCLGIATRGVERLAGRHHADLMVVV-DEAS 185 Query: 202 GT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQID------ 254 G P+ LT N + ++ NP F+++ + L + I Sbjct: 186 GVQPEAWE-----ALTSLNPRKLFV-CGNPLTPGTVFHKLHQRGLTEASDPSIPDHARGV 239 Query: 255 --------------TRTVEGI-DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPL 299 R+ G+ D F ++G S + V G FP + + I Sbjct: 240 ALTIPSTASPDINLERSPRGLADRGFIREAERQWGRGSPLWLSHVEGVFPTVAVHALIEP 299 Query: 300 NIIEEALNREPCP---DPYAPLIMGCDIAEE-GGDNTVVVLRRGPVIEHLFDWSKTDLRT 355 +++A + E +P ++GCD+A G D T +V+R I L + Sbjct: 300 GWLDQAASLERSQTYENPPGQPVLGCDLAAGVGADRTAIVVRDEGGIRELIASDRLAPDE 359 Query: 356 TNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLG---YHVYRVLG 399 I+ L K+ P+ I+ D GA L G H + G Sbjct: 360 AATLIASLARKHLIAPERILYDGAGLGAELTTRLARQGPGFVHARAIFG 408 >gi|298387330|ref|ZP_06996883.1| conserved hypothetical protein [Bacteroides sp. 1_1_14] gi|298259999|gb|EFI02870.1| conserved hypothetical protein [Bacteroides sp. 1_1_14] Length = 500 Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 62/222 (27%), Positives = 93/222 (41%), Gaps = 25/222 (11%) Query: 277 SDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP---LIMGCDIAEEGGDNTV 333 +D+ R++V G FP+ D IP IE A R PY P +G D+A G DN+V Sbjct: 264 NDLFRIKVRGMFPKVAEDVLIPYEWIEIANKRWQENHPYRPRKSCKLGVDVAGMGRDNSV 323 Query: 334 VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYR---PDAIIIDANNTGARTCDYLEML 390 R G + FD ++ + ++ + G Y+ D I ID GA L Sbjct: 324 FCPRYGNYVSQ-FDVFQSAGKASHMHVVGKALSYKRTDRDIIFIDTIGEGAGVYSRLVEQ 382 Query: 391 G----YHVYRVLGQKRAVDL--EFC-RNRRTELHVKMADWLE----FASLINHSGLIQNL 439 G + V G K D+ E+ N R L+ + DWL+ F ++ Sbjct: 383 GIRNIFSVKNSQGAKGLHDITGEYSFANMRAYLYWALRDWLDPKNNFFPMLPPCDQFTEE 442 Query: 440 KSLKSFIVPNTGELAIE-----SKRVKGAKSTDYSDGLMYTF 476 + + + G++ IE KR+K +S DY D L TF Sbjct: 443 ATETKWKFRSDGKILIEPKEEIKKRIK--RSPDYMDALSETF 482 >gi|186682890|ref|YP_001866086.1| hypothetical protein Npun_R2589 [Nostoc punctiforme PCC 73102] gi|186465342|gb|ACC81143.1| hypothetical protein Npun_R2589 [Nostoc punctiforme PCC 73102] Length = 543 Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 88/374 (23%), Positives = 143/374 (38%), Gaps = 85/374 (22%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141 + A G GK+ + + LV++ + G++ I A SE Q+K LWAE+ K L K Sbjct: 64 VKAAHGTGKSFIASLLVIYFLFCVGGVA-ITTAPSEDQVKWILWAELRKIHGLHKTKLGG 122 Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201 + L +S+ ++ + GI S+ YS ++F G H + +I DEA Sbjct: 123 RCDIMQL----LFSETVY-AFGITSRDYSE----------NSFQGQHRQKQL-LIEDEAD 166 Query: 202 GTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI------------FNKPLDDWK 249 G I+ G + LT ++ + NP +F + F+ P W Sbjct: 167 GITPQIDNGFIACLT--GSDNRGLRIGNPVDPQSQFAKTCKLDKRCLTVSAFSHPNVSWA 224 Query: 250 RFQIDTRTVEGIDPSFHEGIIARYG--------------------LDSD-VTRV------ 282 +++ V + P E II G + D + RV Sbjct: 225 -YELCADGVYRLKPEVAEHIINEDGEIKPQQEWPPEFPRDRIPGAISIDWIERVRREKFE 283 Query: 283 -------EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY-------APLIMGCDIAEEG 328 V G++ + D I L ++++A + Y P +G D+ +G Sbjct: 284 TSAYWKGRVMGEYAEDAADGIILLTLLKQARSLYDQNPQYWDAIAKRYPWRLGLDVG-DG 342 Query: 329 GDNTVVVLRRGPVI-EHLFDWSKTDLRTTN-------NKISGLVEKYRPDAIIIDANNTG 380 GD + L RGPV+ E +K DL T ++I L Y +I +D G Sbjct: 343 GDPHALALLRGPVLYEVQIHPTKGDLLDTERAADIAASQIKLLGTGY---SIAVDNTGVG 399 Query: 381 ARTCDYLEMLGYHV 394 A T L+ GY Sbjct: 400 AGTLAKLKKTGYQA 413 >gi|320091491|gb|ADW08983.1| terminase-like protein [Clavibacter phage CN77] Length = 414 Score = 50.1 bits (118), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 55/236 (23%), Positives = 91/236 (38%), Gaps = 47/236 (19%) Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP--LDDWKRFQI-- 253 DEA G P + G +T +++ + NP +F+ IF P +D+W F I Sbjct: 51 DEAGGVPPELFTGAEAVMTGQDSK--IVAIGNPDSRGTEFHRIFTVPALMDEWNTFTISA 108 Query: 254 -DTRTVEG--------------------IDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQ 291 D TV G +D H+ + + G D +V G+FP + Sbjct: 109 YDLPTVTGEVVYPDHPEKQERMLKGLTSLDWIQHKERVWKVGGKPDGRFLAKVLGEFPGE 168 Query: 292 DIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD---- 347 ++F P I+ N P +IMG D+A G D++VV +G + LF Sbjct: 169 TDNAFFPQEAIDRG-NDTTIDKPEKGIIMGVDLARMGDDDSVVYTNQGGRV-RLFKGQVR 226 Query: 348 -------------WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390 WSK + + ++ + + + +D++ G D LE L Sbjct: 227 YSDREGTKTTTGVWSKENTVASARRVHAIAMQIGAKQVRLDSSGIGGAVFDELEQL 282 >gi|134287454|ref|YP_001109621.1| hypothetical protein Bcep1808_7700 [Burkholderia vietnamiensis G4] gi|134131876|gb|ABO60570.1| hypothetical protein Bcep1808_7700 [Burkholderia vietnamiensis G4] Length = 509 Score = 50.1 bits (118), Expect = 9e-04, Method: Compositional matrix adjust. Identities = 79/363 (21%), Positives = 147/363 (40%), Gaps = 54/363 (14%) Query: 65 HCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTL 124 H + ++ + + + ++S+G G GKT+ A + LW + + I A + + + Sbjct: 40 HQIQMFDSVSKQGSRTSVSSGHGTGKTSGFAIIALWHLLCYYLSNTILTAPKISTVSDGV 99 Query: 125 WAEVSKWLSLLPNK------HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178 W E + + + N +F ++S ++ + ++ + ++ Sbjct: 100 WKEFADLSTKISNGPQSWIWEYFVIESERVYVRGY------------KLNWFVIAKSAPR 147 Query: 179 ERPDTFVGHHNTYGMAIINDEASGTPDVINLGIL-GFLTERNANRFWIMTSNPRRLSGKF 237 P+ G H + + + DEASG PD N G++ G LT+ NR + S P R SG F Sbjct: 148 GSPENLAGAHRDW-LLWLADEASGIPD-DNFGVITGSLTDER-NRM-CLASQPTRSSGFF 203 Query: 238 YEIFN----KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLD--SDVTRVEVCGQFPQQ 291 YE + W ++ P IA L + +++V G+FP+ Sbjct: 204 YETHHALSRAEGGPWNNLVFNSE----FSPIVSAKFIAEKKLQYTEEEYQIKVQGRFPEN 259 Query: 292 DIDSFIPLNIIEEALNREPC-PDPYAPLIMGCDIAEEG-GDNTVV----VLRRGPVIEHL 345 + IE + R PD + ++ D+ G D TV+ V+ RG E+ Sbjct: 260 SSKYLVGPQAIEACVGRTVIKPDEHWGWLLPVDVGGGGWRDETVMPALHVIGRG---EYG 316 Query: 346 FDWSKTDLRTT--------NNKISGLV---EKYRPDAI-IIDANNTGARTCDYLEMLGYH 393 D + L + ++ G++ + R +A +IDA G C L++ G+ Sbjct: 317 MDARRAQLISVPLHSNTQDPAQLHGVIVHAARERSNATAMIDAGGMGLIVCKQLDLDGFS 376 Query: 394 VYR 396 YR Sbjct: 377 QYR 379 >gi|284162607|ref|YP_003401230.1| hypothetical protein Arcpr_1511 [Archaeoglobus profundus DSM 5631] gi|284012604|gb|ADB58557.1| protein of unknown function DUF264 [Archaeoglobus profundus DSM 5631] Length = 435 Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust. Identities = 83/347 (23%), Positives = 135/347 (38%), Gaps = 53/347 (15%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141 + AGR GKT A ++ T PG +A S Q ++ ++ ++LS K Sbjct: 44 VVAGRRFGKTECMAVSAIYYALTNPGSIQFVIAPSYDQ-SNIMFGQIVQFLS----KSIL 98 Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201 ++ P++ H DS + S +P+ GH II DEA+ Sbjct: 99 GCMIRRIYKTPFH----HIIFKNDS-----VIHARSASKPEFLRGHK---AHRIILDEAA 146 Query: 202 GTP-DVINLGILGFLTERNANRFWIMTSNPRRLSGK--FYEIFNK----PLDDWKRFQID 254 P DVI+ I L + N + WI P GK FY+ + K D+ ++ Sbjct: 147 FIPDDVISNIIEPMLADYNGS--WIKIGTP---FGKNHFYDTYLKGQSPDFPDYSSYRFP 201 Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFP------------QQDIDSFIPLNII 302 + I F E YG +S + R E +F Q+++D+ I L Sbjct: 202 STVNPHISHEFIEKKKREYGENSIIFRTEYLAEFVEDQNAVFRWADIQKNVDNSIELIDS 261 Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 E ++++ ++GCD+A+ +VVL L + + + R I Sbjct: 262 AENVSKQ--------YVIGCDLAKYQDYTVIVVLDVTEKPYKLVHFERFNRRPYAEVIMR 313 Query: 363 LVEKYRP---DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406 L E YR ++ID+ G + L+ +G Y V K V L Sbjct: 314 LKELYRRFNYAKVLIDSTGVGDPVLEDLQDVGAEGY-VFTPKSKVQL 359 >gi|159897183|ref|YP_001543430.1| hypothetical protein Haur_0654 [Herpetosiphon aurantiacus ATCC 23779] gi|159890222|gb|ABX03302.1| conserved hypothetical protein [Herpetosiphon aurantiacus ATCC 23779] Length = 472 Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust. Identities = 93/394 (23%), Positives = 145/394 (36%), Gaps = 82/394 (20%) Query: 78 FKGAISAGRGIGKTTLNAWLV-LWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLP 136 ++ + A +GKT L LV W S PG+ V+ A ++ Q++ LW EV + Sbjct: 36 YRTLVKACHKVGKTHLGGGLVNWWYDSFDPGL-VLTTAPTDRQVRDLLWKEVR--MQRRG 92 Query: 137 NKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAII 196 + +S L P H++ ++ + D+F GHH+ + + I Sbjct: 93 RAGFTGPKSPRLESTP--------------DHFA---HGFTAKDGDSFQGHHSPHTLFIF 135 Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNP---------RRLSGKFYEI------- 240 DEA G V E A W+ NP LSG ++ I Sbjct: 136 -DEAVGVASVFWETAESMFNEGGA---WLAIFNPTDTSSQAYAEELSGGWHVISMSVLEH 191 Query: 241 ---------FNKPLDDWKRF-QIDT------RTVEGIDPSFHEGIIAR--YGLDSDVTRV 282 P R ++DT R + +P I R + + Sbjct: 192 PNILAELQGLPPPFPSAIRLSRVDTLLKKWCRALSPEEPKRATDIHWRDAWYRPGPIAEA 251 Query: 283 EVCGQFPQQ---DIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRG 339 + G++P Q ++ S + E L P P +GCD+A G D T + +RRG Sbjct: 252 RLLGRWPSQATNNVWSDGAFQVAESLL----LPASDEPCELGCDVARYGDDFTEIHVRRG 307 Query: 340 P---VIEHLFDWSKTDLRTTNNKISGLVEKY--------RPDAIIIDANNTGARTCDYLE 388 E WS + T ++ L +Y R A+ ID + G D + Sbjct: 308 GHSLYHEAANGWSTVE---TAGRLKQLANEYGRRCGVDGRAVAVKIDDDGIGGGVVDLAD 364 Query: 389 MLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422 GY V G + A D E NRR+EL +A+ Sbjct: 365 --GYTFLGVSGARTAYDPEKYPNRRSELWFSVAE 396 >gi|241763591|ref|ZP_04761642.1| phage terminase large subunit [Acidovorax delafieldii 2AN] gi|241367184|gb|EER61538.1| phage terminase large subunit [Acidovorax delafieldii 2AN] Length = 521 Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust. Identities = 54/209 (25%), Positives = 88/209 (42%), Gaps = 21/209 (10%) Query: 295 SFIPLNIIEEALNR-EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS---K 350 IP ++ A R +P D ++G D A G D T V R + L Sbjct: 290 QLIPTEWVKAAQARWQPRQDKGPMTVLGLDPARGGTDKTSVARRHDCWFDVLISEPGIVT 349 Query: 351 TDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC- 409 D TT + LV P I +DA G+ D+++ LG VY V+G +R+ ++ Sbjct: 350 KDGPTTAAFTAPLVRNGAP--IAVDAIGIGSSALDFIQGLGLLVYAVVGSERSDHMDKAG 407 Query: 410 ----RNRRTELHVKMADWLEFA-----SLINHSGLIQNLKSLKSFIVPNTGELAIESK-- 458 RNRR E++ ++ + L+ +L L+ +L +++ +V AI+ + Sbjct: 408 TMRFRNRRAEMYWRLREALDPTAEQPIALPPDQELLGDLTAVRYKVVTMGQGAAIQIRDK 467 Query: 459 ---RVKGAKSTDYSDGLMYTFAENPPRSD 484 R +S D D + TF E P D Sbjct: 468 DEIREALGRSPDKGDSVAMTFCEGIPLLD 496 >gi|161789175|ref|YP_001595730.1| PacB [Vibrio sp. 0908] gi|161761461|gb|ABX77106.1| PacB [Vibrio sp. 0908] Length = 572 Score = 47.8 bits (112), Expect = 0.004, Method: Compositional matrix adjust. Identities = 43/172 (25%), Positives = 77/172 (44%), Gaps = 12/172 (6%) Query: 67 LNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA 126 + +N P + ++++G G GK+ L A L L + T P + ANS Q+ +++ Sbjct: 50 IEVINALTPVGARVSVASGHGTGKSHLTAALCLHFIITHPESLCMLTANSLDQVTNVVFS 109 Query: 127 EVSK-WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 + + W+ + + W E Q + +Y+ G+ + +T S+ + Sbjct: 110 YIKRCWVKICQRQPWLE-QYFVITAKSFYAKGYK---GV----WQIFGKTCSKGNEEGLA 161 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 G H M ++ DEASG D + G LTE N N+ ++ S R +G F Sbjct: 162 GQHRRDYMVVV-DEASGVSDRAFEVLRGALTEDN-NKM-LLISQFTRPTGHF 210 >gi|260580755|ref|ZP_05848581.1| phage terminase large subunit [Haemophilus influenzae RdAW] gi|260092572|gb|EEW76509.1| phage terminase large subunit [Haemophilus influenzae RdAW] Length = 447 Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust. Identities = 50/203 (24%), Positives = 87/203 (42%), Gaps = 35/203 (17%) Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 +G D+A+EG D+ G V+ + W D+ + N+ + K++ D II D+ Sbjct: 245 VGFDVADEGADSNANAFVHGSVVLDIEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGV 304 Query: 380 GA-------RTCDYLEMLGYHV---------YRVLGQK--------RAVDLEFCRNR--R 413 GA R L++ G++ + G+K +A R+R + Sbjct: 305 GAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRDRFYK 364 Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFI------VPNTGELAIESK---RVKGAK 464 T VK D LI+ S I+ L+ LK+ + N G + +ESK + +G Sbjct: 365 TYRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIP 424 Query: 465 STDYSDGLMYTFAENPPRSDMDF 487 S + +D L+ +A P+S +D Sbjct: 425 SPNMADALVMCYAPTKPKSLLDL 447 >gi|16273317|ref|NP_439561.1| terminase large subunit-like protein [Haemophilus influenzae Rd KW20] gi|1175785|sp|P44184|Y1410_HAEIN RecName: Full=Uncharacterized protein HI_1410 gi|1574247|gb|AAC23058.1| predicted coding region HI1410 [Haemophilus influenzae Rd KW20] Length = 394 Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust. Identities = 50/203 (24%), Positives = 87/203 (42%), Gaps = 35/203 (17%) Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 +G D+A+EG D+ G V+ + W D+ + N+ + K++ D II D+ Sbjct: 192 VGFDVADEGADSNANAFVHGSVVLDIEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGV 251 Query: 380 GA-------RTCDYLEMLGYHV---------YRVLGQK--------RAVDLEFCRNR--R 413 GA R L++ G++ + G+K +A R+R + Sbjct: 252 GAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRDRFYK 311 Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFI------VPNTGELAIESK---RVKGAK 464 T VK D LI+ S I+ L+ LK+ + N G + +ESK + +G Sbjct: 312 TYRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIP 371 Query: 465 STDYSDGLMYTFAENPPRSDMDF 487 S + +D L+ +A P+S +D Sbjct: 372 SPNMADALVMCYAPTKPKSLLDL 394 >gi|85058727|ref|YP_454429.1| phage terminase large subunit [Sodalis glossinidius str. 'morsitans'] gi|84779247|dbj|BAE74024.1| phage terminase large subunit [Sodalis glossinidius str. 'morsitans'] Length = 456 Score = 47.0 bits (110), Expect = 0.007, Method: Compositional matrix adjust. Identities = 22/69 (31%), Positives = 37/69 (53%) Query: 313 DPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI 372 +P +G D+A+EG D+ ++L G V+ HL W+K D+ + +++ E D I Sbjct: 234 EPAGKKRIGFDVADEGEDSNALILSHGSVVMHLETWNKGDVIQSADRVKNYAESVIADEI 293 Query: 373 IIDANNTGA 381 I D+ GA Sbjct: 294 IFDSIGVGA 302 >gi|282880015|ref|ZP_06288737.1| hypothetical protein HMPREF9019_0946 [Prevotella timonensis CRIS 5C-B1] gi|281306129|gb|EFA98167.1| hypothetical protein HMPREF9019_0946 [Prevotella timonensis CRIS 5C-B1] Length = 459 Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust. Identities = 65/230 (28%), Positives = 100/230 (43%), Gaps = 33/230 (14%) Query: 277 SDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-------EPCPDPYAPLIMGCDIAEEGG 329 +D+ R++V G FP+ D+ IP +E A +R + P YA + G D+A G Sbjct: 221 NDLFRIKVLGLFPKASEDTLIPFEWLELAHDRWKKLNAEDFVPRKYARV--GIDVAGMGR 278 Query: 330 DNTVVVLRRG---PVIEHLFDWSKTD-LRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385 D++ VLR G P I+ K D ++ + LVEK ++ID GA Sbjct: 279 DSSCFVLRYGNYVPEIKIHQSGGKADHMKVAGEAVQWLVEK--NTKVMIDTIGEGAGVYS 336 Query: 386 YLEMLGY-HVYRVL---GQKRAVDL----EFCRNRRTELHVKMADWLEFASLINHS---- 433 L LGY + Y G K D+ EF N R + + DWL + N + Sbjct: 337 RLLELGYDNAYSCKFSEGTKGLHDITGQYEFA-NMRAYCYWAVRDWLNPKNGFNPALPPC 395 Query: 434 -GLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479 L L + + ++G + IE K + + +S D +D L+ TF N Sbjct: 396 DELDAELTEVH-WSFQSSGSIIIEPKENIKSRLKRSPDRADALISTFYPN 444 >gi|68250076|ref|YP_249188.1| phage terminase large subunit [Haemophilus influenzae 86-028NP] gi|68058275|gb|AAX88528.1| predicted phage terminase large subunit [Haemophilus influenzae 86-028NP] Length = 447 Score = 45.8 bits (107), Expect = 0.016, Method: Compositional matrix adjust. Identities = 50/203 (24%), Positives = 87/203 (42%), Gaps = 35/203 (17%) Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 +G D+A+EG D+ G V+ + W D+ + N+ + K++ D II D+ Sbjct: 245 VGFDVADEGADSNDNAFVHGSVVLDIEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGV 304 Query: 380 GA-------RTCDYLEMLGYHV---------YRVLGQK--------RAVDLEFCRNR--R 413 GA R L++ G++ + G+K +A R+R + Sbjct: 305 GAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRDRFYK 364 Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFI------VPNTGELAIESK---RVKGAK 464 T VK D LI+ S I+ L+ LK+ + N G + +ESK + +G Sbjct: 365 TYRAVKHGDVYPDDELISLSSNIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIP 424 Query: 465 STDYSDGLMYTFAENPPRSDMDF 487 S + +D L+ +A P+S +D Sbjct: 425 SPNMADALVMCYATTKPKSLLDL 447 >gi|319776448|ref|YP_004138936.1| phage terminase large subunit [Haemophilus influenzae F3047] gi|319897217|ref|YP_004135412.1| phage terminase large subunit [Haemophilus influenzae F3031] gi|329123931|ref|ZP_08252483.1| phage terminase large subunit [Haemophilus aegyptius ATCC 11116] gi|317432721|emb|CBY81084.1| predicted phage terminase large subunit [Haemophilus influenzae F3031] gi|317451039|emb|CBY87270.1| predicted phage terminase large subunit [Haemophilus influenzae F3047] gi|327468126|gb|EGF13613.1| phage terminase large subunit [Haemophilus aegyptius ATCC 11116] Length = 447 Score = 45.8 bits (107), Expect = 0.018, Method: Compositional matrix adjust. Identities = 49/203 (24%), Positives = 85/203 (41%), Gaps = 35/203 (17%) Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 +G D+A+EG D G V+ + W D+ + N+ + K++ D II D+ Sbjct: 245 VGFDVADEGADANANAFVHGSVVLGVEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGV 304 Query: 380 GA-------RTCDYLEMLGYHV--------YRVLGQKRAVDLE---------FCRNR--R 413 GA R L++ G++ + K+ D+ R+R + Sbjct: 305 GAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKDKKNQDMFSNIKAQSWWALRDRFYK 364 Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFI------VPNTGELAIESK---RVKGAK 464 T VK D LI+ S I+ L+ LK+ + N G + +ESK + +G Sbjct: 365 TYRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIP 424 Query: 465 STDYSDGLMYTFAENPPRSDMDF 487 S + +D L+ +A P+S +D Sbjct: 425 SPNMADALVMCYAPTKPKSLLDL 447 >gi|145629503|ref|ZP_01785301.1| predicted phage terminase large subunit [Haemophilus influenzae 22.1-21] gi|145641440|ref|ZP_01797019.1| predicted phage terminase large subunit [Haemophilus influenzae R3021] gi|144978346|gb|EDJ88110.1| predicted phage terminase large subunit [Haemophilus influenzae 22.1-21] gi|145273983|gb|EDK13850.1| predicted phage terminase large subunit [Haemophilus influenzae 22.4-21] gi|309750959|gb|ADO80943.1| Probable bacteriophage terminase, large subunit [Haemophilus influenzae R2866] Length = 447 Score = 45.4 bits (106), Expect = 0.019, Method: Compositional matrix adjust. Identities = 49/203 (24%), Positives = 85/203 (41%), Gaps = 35/203 (17%) Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 +G D+A+EG D G V+ + W D+ + N+ + K++ D II D+ Sbjct: 245 VGFDVADEGADANANAFVHGSVVLGVEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGV 304 Query: 380 GA-------RTCDYLEMLGYHV--------YRVLGQKRAVDLE---------FCRNR--R 413 GA R L++ G++ + K+ D+ R+R + Sbjct: 305 GAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKDKKNQDMFSNIKAQSWWALRDRFYK 364 Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFI------VPNTGELAIESK---RVKGAK 464 T VK D LI+ S I+ L+ LK+ + N G + +ESK + +G Sbjct: 365 TYRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIP 424 Query: 465 STDYSDGLMYTFAENPPRSDMDF 487 S + +D L+ +A P+S +D Sbjct: 425 SPNMADALVMCYAPTKPKSLLDL 447 >gi|145638997|ref|ZP_01794605.1| terminase large subunit-like protein [Haemophilus influenzae PittII] gi|145271969|gb|EDK11878.1| terminase large subunit-like protein [Haemophilus influenzae PittII] Length = 379 Score = 45.4 bits (106), Expect = 0.022, Method: Compositional matrix adjust. Identities = 49/203 (24%), Positives = 85/203 (41%), Gaps = 35/203 (17%) Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 +G D+A+EG D G V+ + W D+ + N+ + K++ D II D+ Sbjct: 177 VGFDVADEGADANANAFVHGSVVLGVEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGV 236 Query: 380 GA-------RTCDYLEMLGYHV--------YRVLGQKRAVDLE---------FCRNR--R 413 GA R L++ G++ + K+ D+ R+R + Sbjct: 237 GAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKDKKNQDMFSNIKAQSWWALRDRFYK 296 Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFI------VPNTGELAIESK---RVKGAK 464 T VK D LI+ S I+ L+ LK+ + N G + +ESK + +G Sbjct: 297 TYRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIP 356 Query: 465 STDYSDGLMYTFAENPPRSDMDF 487 S + +D L+ +A P+S +D Sbjct: 357 SPNMADALVMCYAPTKPKSLLDL 379 >gi|189460514|ref|ZP_03009299.1| hypothetical protein BACCOP_01155 [Bacteroides coprocola DSM 17136] gi|189432758|gb|EDV01743.1| hypothetical protein BACCOP_01155 [Bacteroides coprocola DSM 17136] Length = 556 Score = 44.7 bits (104), Expect = 0.042, Method: Compositional matrix adjust. Identities = 62/235 (26%), Positives = 89/235 (37%), Gaps = 43/235 (18%) Query: 278 DVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL-----IMGCDIAEEGGDNT 332 D+ R +V G FP+ D D+ IP +EEA R PL I+G D+A G D T Sbjct: 309 DLFRKKVLGLFPKVDEDTLIPRQWLEEAHERWKQAKGREPLRADLNILGVDVAGMGRDAT 368 Query: 333 VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI----IIDANNTGAR------ 382 VLRR + FD + + K++G + R I ID GA Sbjct: 369 CYVLRRDNWVAS-FDTHNSGGVADHMKVAGKIMVARRQNIGLYVSIDTIGEGAGVYSRCV 427 Query: 383 ----------TCDYLEML----GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL---- 424 +C Y E G + + GQ + N R L + DWL Sbjct: 428 ELEDEPHYILSCKYSESAKTPNGRELSDITGQNKFF------NMRAYLFWAVRDWLNPRN 481 Query: 425 EFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTF 476 +++ + F V + G+L IE K + + +S D D L TF Sbjct: 482 NTGAMLPPDDKFDEEATEIKFSVKSNGKLYIEPKEDIKERLGRSPDKFDALANTF 536 >gi|53793591|ref|YP_112491.1| terminase large subunit [Flavobacterium phage 11b] gi|53748181|emb|CAH56642.1| terminase large subunit [Flavobacterium phage 11b] Length = 432 Score = 44.3 bits (103), Expect = 0.049, Method: Compositional matrix adjust. Identities = 44/176 (25%), Positives = 84/176 (47%), Gaps = 21/176 (11%) Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP--DA 371 P+ + + DIA G D V+ + G + +F +K+ + + GL K++ Sbjct: 248 PFGEMYISADIARFGSDKMVICVWSGFRVVEIFSMAKSSITEIAEAVRGLSIKHKVPLSN 307 Query: 372 IIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLE----FCRNRRTELHVKMADWLEFA 427 +I D + G D L G+ + RA++++ +N +T+ + K+A+ ++ Sbjct: 308 VICDEDGVGGGVVDVLGCTGF-----INNSRAMEVDNQVVQYQNLKTQCYYKLAEVIQSN 362 Query: 428 SLINHS-------GLIQNLKSLKSFIVPNTGELAIESK-RVKGA--KSTDYSDGLM 473 +L HS + + L+ +K + + G+L + SK +VK A +S DYSD LM Sbjct: 363 NLYIHSEDATVNDEITKELEQVKRDKIDSDGKLQLISKDKVKQAIGRSPDYSDALM 418 >gi|301170180|emb|CBW29784.1| predicted phage terminase large subunit [Haemophilus influenzae 10810] Length = 447 Score = 44.3 bits (103), Expect = 0.053, Method: Compositional matrix adjust. Identities = 49/203 (24%), Positives = 86/203 (42%), Gaps = 35/203 (17%) Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 +G D+A+EG D+ G V+ + W + + N+ + K++ D II D+ Sbjct: 245 VGFDVADEGADSNANAFVHGSVVLDIEVWKNGYVIDSANRTNQSAVKFKADLIIFDSIGV 304 Query: 380 GA-------RTCDYLEMLGYHV---------YRVLGQK--------RAVDLEFCRNR--R 413 GA R L++ G++ + G+K +A R+R + Sbjct: 305 GAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRDRFYK 364 Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFI------VPNTGELAIESK---RVKGAK 464 T VK D LI+ S I+ L+ LK+ + N G + +ESK + +G Sbjct: 365 TYRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIP 424 Query: 465 STDYSDGLMYTFAENPPRSDMDF 487 S + +D L+ +A P+S +D Sbjct: 425 SPNMADALVMCYAPTKPKSLLDL 447 >gi|329119006|ref|ZP_08247700.1| phage terminase large subunit [Neisseria bacilliformis ATCC BAA-1200] gi|327464879|gb|EGF11170.1| phage terminase large subunit [Neisseria bacilliformis ATCC BAA-1200] Length = 449 Score = 42.7 bits (99), Expect = 0.13, Method: Compositional matrix adjust. Identities = 28/112 (25%), Positives = 50/112 (44%), Gaps = 7/112 (6%) Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378 I+G D+A+EG D VLR G V+ + W D+ + +K+ ++ D I+ D Sbjct: 240 ILGFDVADEGDDANATVLRHGSVVTDMQQWRGQDVIYSADKVYLYAQEQNVDRIVYDNIG 299 Query: 379 TGA-------RTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 GA R ++ LG++ + + A + +NR ++K W Sbjct: 300 VGAGVKAQFRRKNGKVQTLGFNAGGAVYKPDAKYTDDKKNRDMFANIKAQAW 351 >gi|254781186|ref|YP_003065599.1| hypothetical protein CLIBASIA_05465 [Candidatus Liberibacter asiaticus str. psy62] gi|254040863|gb|ACT57659.1| hypothetical protein CLIBASIA_05465 [Candidatus Liberibacter asiaticus str. psy62] Length = 45 Score = 42.4 bits (98), Expect = 0.21, Method: Composition-based stats. Identities = 19/43 (44%), Positives = 29/43 (67%), Gaps = 1/43 (2%) Query: 363 LVEKYRPDAIIIDANNTGARTCDYLEMLGYH-VYRVLGQKRAV 404 + +Y PDAI++ AN GA T +YLE L Y + ++LGQ+ +V Sbjct: 1 MAHQYNPDAIVLYANGIGAVTANYLENLNYSPIEKILGQRSSV 43 >gi|153806881|ref|ZP_01959549.1| hypothetical protein BACCAC_01156 [Bacteroides caccae ATCC 43185] gi|149131558|gb|EDM22764.1| hypothetical protein BACCAC_01156 [Bacteroides caccae ATCC 43185] Length = 513 Score = 42.0 bits (97), Expect = 0.22, Method: Compositional matrix adjust. Identities = 67/234 (28%), Positives = 98/234 (41%), Gaps = 47/234 (20%) Query: 277 SDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR---EPCPDPYAP---LIMGCDIAEEGGD 330 +D+ RV+V G FP+ D IP IE A NR E + P +G D+A G D Sbjct: 275 NDLFRVKVLGMFPKVSEDVLIPYEWIEIA-NRNWQELQASGFIPAKSCKLGVDVAGMGRD 333 Query: 331 NTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP--------DAI---------I 373 N+V+ R G + FD ++ R + + G+ Y D I + Sbjct: 334 NSVLCPRYGNYVPQ-FDVHQSAGRADHMHVVGMTIPYLKKKGAKAFIDTIGEGAGVYSRL 392 Query: 374 IDANNTGARTCDYLEML-GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE-----FA 427 ++ T A +C Y E G H + G+ EF N R L+ + DWL A Sbjct: 393 LEEEFTNAFSCKYSEGTDGLH--DITGE-----YEFA-NMRAYLYWALRDWLNPKNGFGA 444 Query: 428 SLINHSGLIQNLKSLKSFIVPNTGELAIE-----SKRVKGAKSTDYSDGLMYTF 476 +L L++ K + N G++ IE KR+K +S DY D L TF Sbjct: 445 ALPPCDQLMEEATETKWKFLSN-GKVIIEPKEDVKKRIK--RSPDYMDALANTF 495 >gi|309379923|emb|CBX21334.1| unnamed protein product [Neisseria lactamica Y92-1009] Length = 449 Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust. Identities = 28/112 (25%), Positives = 50/112 (44%), Gaps = 7/112 (6%) Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378 I+G D+A+EG D VLR G V+ + W D+ + +K+ ++ D I+ D Sbjct: 240 ILGFDVADEGDDANATVLRHGSVVTDMRQWRGQDVIYSADKVYLYAQEQDIDRIVYDNIG 299 Query: 379 TGA-------RTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 GA R ++ LG++ + + A + +NR ++K W Sbjct: 300 VGAGVKAQFRRKRGKVQTLGFNAGGAVYKPDAKYTDDKKNRDMFANIKAQAW 351 >gi|303243859|ref|ZP_07330199.1| protein of unknown function DUF264 [Methanothermococcus okinawensis IH1] gi|302485795|gb|EFL48719.1| protein of unknown function DUF264 [Methanothermococcus okinawensis IH1] Length = 445 Score = 40.0 bits (92), Expect = 0.86, Method: Compositional matrix adjust. Identities = 71/328 (21%), Positives = 126/328 (38%), Gaps = 45/328 (13%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141 ++AGR GK+ L A+L+++L ST+ +A + ++ E+ K++ + Sbjct: 56 VAAGRRFGKSKLMAFLLIFLCSTQKNKKYAVIAPFYANAR-IIFRELKKYIE---KSNVL 111 Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201 + +P+ + ID + S + P + G +Y + I+++ A Sbjct: 112 SRLVKRMVESPYMAIEFKTGCTIDFR---------SADNPTSIRGE--SYHLVILDEAAF 160 Query: 202 GTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR---FQIDTRTV 258 DV+ I L + +A I T N FYE F + R F+ T T Sbjct: 161 IKDDVVKYVIKPLLLDYDAPLIEISTPNGH---NHFYESFLMGKNKQNRHISFRFPTWTN 217 Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQF------------PQQDIDSFIPLNIIEEAL 306 + + E I G DS V + E C +F QQ ID I L E+ Sbjct: 218 PFLPKNAIEEIKQEVGEDSPVWKQEYCAEFIDNNEAVFNWEYIQQCIDGTIKLLKSGESG 277 Query: 307 NREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDL---RTTNNKISGL 363 ++ +MG D+A+ + VL L + + +L +K+ L Sbjct: 278 HQ---------YVMGVDLAKFEDYTVITVLDVSVKPYKLVYFERFNLMPYSFVADKVKEL 328 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLG 391 + + + +DA GA + +E L Sbjct: 329 YQLFNKPQVCMDATGPGAAVVEQVESLN 356 >gi|310641214|ref|YP_003945972.1| malate dehydrogenase, nad-dependent [Paenibacillus polymyxa SC2] gi|309246164|gb|ADO55731.1| malate dehydrogenase, NAD-dependent [Paenibacillus polymyxa SC2] Length = 313 Score = 40.0 bits (92), Expect = 0.92, Method: Compositional matrix adjust. Identities = 36/114 (31%), Positives = 53/114 (46%), Gaps = 8/114 (7%) Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLV----EKYRPDAIII 374 IMG E+ D+ +V++ G I S+ DL TN I V +KY PD+I+I Sbjct: 64 IMGTSNYEDAADSDIVIITAG--IARKPGMSRDDLVNTNAGIVKSVCENVKKYAPDSIVI 121 Query: 375 DANN-TGARTCDYLEMLGYHVYRVLGQKRAVD-LEFCRNRRTELHVKMADWLEF 426 +N A T + L + RV+GQ +D +C EL+V + D F Sbjct: 122 ILSNPVDAMTYTAYQTLDFPKNRVIGQSGVLDTARYCTFIAQELNVSVEDVRGF 175 >gi|226940436|ref|YP_002795510.1| Terminase large subunit [Laribacter hongkongensis HLHK9] gi|226715363|gb|ACO74501.1| Terminase large subunit [Laribacter hongkongensis HLHK9] Length = 93 Score = 39.7 bits (91), Expect = 1.2, Method: Composition-based stats. Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 8/59 (13%) Query: 31 FVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89 + LH + WG LEG + PR+WQ E M + H NP A AGRG+G Sbjct: 22 WALHAYDWGRG--ELEGVTGPRAWQREVMSDIGNHL------KNPATRFSAFDAGRGLG 72 >gi|325295250|ref|YP_004281764.1| mutual gliding protein A [Desulfurobacterium thermolithotrophum DSM 11699] gi|325065698|gb|ADY73705.1| mutual gliding protein A [Desulfurobacterium thermolithotrophum DSM 11699] Length = 193 Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust. Identities = 21/66 (31%), Positives = 38/66 (57%), Gaps = 2/66 (3%) Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNT 332 YG+D + + + Q+ ++D+ + +P+ I+++ LNR CPD A I G + E + T Sbjct: 127 YGID--IKEIPLVFQYNKRDLPNVLPIEILKKDLNRWKCPDFEAIAIKGIGVLETFKEIT 184 Query: 333 VVVLRR 338 VLR+ Sbjct: 185 KQVLRK 190 >gi|308068360|ref|YP_003869965.1| Malate dehydrogenase (Vegetative protein 69) [Paenibacillus polymyxa E681] gi|305857639|gb|ADM69427.1| Malate dehydrogenase (Vegetative protein 69) [Paenibacillus polymyxa E681] Length = 313 Score = 38.9 bits (89), Expect = 1.8, Method: Compositional matrix adjust. Identities = 35/114 (30%), Positives = 53/114 (46%), Gaps = 8/114 (7%) Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLV----EKYRPDAIII 374 I G E+ ++ +V++ G I S+ DL TN I V +KY PD+I+I Sbjct: 64 ITGTSNYEDAANSDIVIITAG--IARKPGMSRDDLVNTNAGIVKSVCENVKKYAPDSIVI 121 Query: 375 DANN-TGARTCDYLEMLGYHVYRVLGQKRAVD-LEFCRNRRTELHVKMADWLEF 426 +N A T + LG+ RV+GQ +D +C EL+V + D F Sbjct: 122 ILSNPVDAMTYTAYQTLGFPKNRVIGQSGVLDTARYCTFIAQELNVSVEDVRGF 175 >gi|22074007|gb|AAL05293.1| replication-associated protein [Tomato yellow leaf curl virus - Gezira] Length = 359 Score = 38.5 bits (88), Expect = 2.4, Method: Compositional matrix adjust. Identities = 38/150 (25%), Positives = 61/150 (40%), Gaps = 23/150 (15%) Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF-PQQDIDSFIPLNIIEEA 305 DW +FQID R+ G S ++ A S + V + P+ I F LN + Sbjct: 112 DWGQFQIDGRSARGGQQSANDAYAAAINSGSKAEALRVLRELAPRDYILQFHNLNSNLDR 171 Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365 + +EP P PY+ + + V E L W + N +S Sbjct: 172 IFQEP-PAPYSSPFLSSSFNQ--------------VPEELEVW------VSENVMSSAAR 210 Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVY 395 +RP++III+ ++ +T + LG H Y Sbjct: 211 PWRPNSIIIEGDSRTGKTM-WARSLGPHNY 239 >gi|148826888|ref|YP_001291641.1| phage terminase large subunit [Haemophilus influenzae PittGG] gi|148718130|gb|ABQ99257.1| predicted phage terminase large subunit [Haemophilus influenzae PittGG] Length = 366 Score = 38.1 bits (87), Expect = 3.9, Method: Compositional matrix adjust. Identities = 19/71 (26%), Positives = 34/71 (47%) Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 +G D+A+EG D+ G V+ + W D+ + N+ + K++ D II D+ Sbjct: 245 VGFDVADEGADSNANAFVHGSVVLDIEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGV 304 Query: 380 GARTCDYLEML 390 GA + + L Sbjct: 305 GAGVKAHFKRL 315 >gi|2497856|sp|Q59202|MDH_BACIS RecName: Full=Malate dehydrogenase gi|963019|emb|CAA62129.1| malate dehydrogenase [Bacillus israeli] Length = 312 Score = 37.7 bits (86), Expect = 4.0, Method: Compositional matrix adjust. Identities = 36/114 (31%), Positives = 53/114 (46%), Gaps = 8/114 (7%) Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI----SGLVEKYRPDAIII 374 I+G EE D+ +VV+ G I S+ DL TN K+ + V KY P++III Sbjct: 64 IIGTSNYEETADSDIVVITAG--IARKPGMSRDDLVQTNQKVMKSVTKEVVKYSPNSIII 121 Query: 375 DANN-TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRN-RRTELHVKMADWLEF 426 N A T + G+ +RV+GQ +D R EL++ + D F Sbjct: 122 VLTNPVDAMTYTVYKESGFPKHRVIGQSGVLDTARFRTFVAQELNLSVKDITGF 175 >gi|40737892|gb|AAR89439.1| replication associated protein C1 [Tomato yellow leaf curl Mali virus] Length = 359 Score = 37.7 bits (86), Expect = 4.3, Method: Compositional matrix adjust. Identities = 37/150 (24%), Positives = 59/150 (39%), Gaps = 23/150 (15%) Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF-PQQDIDSFIPLNIIEEA 305 DW FQID R+ G S ++ A S + V + P+ + F LN + Sbjct: 112 DWGEFQIDGRSARGGQQSANDAYAAALNSGSKSEALRVIKELAPKDYVLQFHNLNSNLDR 171 Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365 + +EP P PY + + V E L W + N +S Sbjct: 172 IFQEP-PAPYISPFLSSSFNQ--------------VPEELEVW------VSENVMSSAAR 210 Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVY 395 +RPD+I+I+ ++ +T + LG H Y Sbjct: 211 PWRPDSIVIEGDSRTGKTM-WARSLGPHNY 239 >gi|219965987|emb|CAR82110.1| replication associated protein (Rep) [Tomato yellow leaf curl Mali virus] Length = 359 Score = 37.7 bits (86), Expect = 5.1, Method: Compositional matrix adjust. Identities = 37/150 (24%), Positives = 59/150 (39%), Gaps = 23/150 (15%) Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF-PQQDIDSFIPLNIIEEA 305 DW FQID R+ G S ++ A S + V + P+ + F LN + Sbjct: 112 DWGEFQIDGRSARGGQQSANDAYAAAINAGSKSEALRVIRELAPKDYVLQFHNLNSNLDR 171 Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365 + +EP P PY + + V E L W + N +S Sbjct: 172 IFQEP-PAPYISPFLSSSFNQ--------------VPEELEIW------VSENVMSSAAR 210 Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVY 395 +RPD+I+I+ ++ +T + LG H Y Sbjct: 211 PWRPDSIVIEGDSRTGKTM-WARSLGPHNY 239 >gi|219965994|emb|CAR82116.1| replication associated protein (Rep) [Tomato yellow leaf curl Mali virus] Length = 359 Score = 37.4 bits (85), Expect = 5.2, Method: Compositional matrix adjust. Identities = 37/150 (24%), Positives = 59/150 (39%), Gaps = 23/150 (15%) Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF-PQQDIDSFIPLNIIEEA 305 DW FQID R+ G S ++ A S + V + P+ + F LN + Sbjct: 112 DWGEFQIDGRSARGGQQSANDAYAAAINAGSKSEALRVIRELAPKDYVLQFHNLNSNLDR 171 Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365 + +EP P PY + + V E L W + N +S Sbjct: 172 IFQEP-PAPYISPFLSSSFNQ--------------VPEELEIW------VSENVMSSAAR 210 Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVY 395 +RPD+I+I+ ++ +T + LG H Y Sbjct: 211 PWRPDSIVIEGDSRTGKTM-WARSLGPHNY 239 >gi|260945527|ref|XP_002617061.1| hypothetical protein CLUG_02505 [Clavispora lusitaniae ATCC 42720] gi|238848915|gb|EEQ38379.1| hypothetical protein CLUG_02505 [Clavispora lusitaniae ATCC 42720] Length = 348 Score = 37.4 bits (85), Expect = 6.2, Method: Compositional matrix adjust. Identities = 25/84 (29%), Positives = 37/84 (44%), Gaps = 3/84 (3%) Query: 382 RTCDYLEMLGYHVYRVLGQKRA---VDLEFCRNRRTELHVKMADWLEFASLINHSGLIQN 438 RT +YLE G V RA V FCR W E A++++HS + Sbjct: 190 RTMEYLETQGVLVSTFNDDGRANIEVPSFFCRESGVRSPYSFTSWKEIAAVVHHSNNLMQ 249 Query: 439 LKSLKSFIVPNTGELAIESKRVKG 462 L+S +P E+A+ S+ + G Sbjct: 250 LQSGNLLCIPPPAEIALSSELMSG 273 Searching..................................................done Results from round 2 >gi|254781215|ref|YP_003065628.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] gi|254040892|gb|ACT57688.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] gi|317120680|gb|ADV02503.1| putative phage terminase large subunit [Liberibacter phage SC1] gi|317120824|gb|ADV02645.1| putative phage terminase large subunit [Candidatus Liberibacter asiaticus] Length = 511 Score = 727 bits (1876), Expect = 0.0, Method: Composition-based stats. Identities = 511/511 (100%), Positives = 511/511 (100%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME Sbjct: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 Query: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL Sbjct: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER Sbjct: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI Sbjct: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN Sbjct: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI Sbjct: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM Sbjct: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP Sbjct: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480 Query: 481 PRSDMDFGRCPSYQYEGVDLLIERRFEYDSR 511 PRSDMDFGRCPSYQYEGVDLLIERRFEYDSR Sbjct: 481 PRSDMDFGRCPSYQYEGVDLLIERRFEYDSR 511 >gi|315122902|ref|YP_004063391.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496304|gb|ADR52903.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 509 Score = 647 bits (1669), Expect = 0.0, Method: Composition-based stats. Identities = 373/508 (73%), Positives = 428/508 (84%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 M+RELPT E EQ+L +LM+SD+IKLSF+NFVL FPW E T L FS PR WQL+FME Sbjct: 1 MTRELPTKIEHEQELMELMFSDDIKLSFTNFVLRLFPWSEANTSLANFSRPRRWQLDFME 60 Query: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 VD CL +V+NP+P++FKGA+SAGRGIGKTTLNAW++LWL+STRPG+S++CLANSETQL Sbjct: 61 AVDTDCLFNVDNPDPKIFKGAVSAGRGIGKTTLNAWMMLWLISTRPGMSILCLANSETQL 120 Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 K+TLWAEVSKWLS+LPNKHWFEMQSLSLHPA WY++ L + GIDSKHY+ CRTYSEER Sbjct: 121 KSTLWAEVSKWLSMLPNKHWFEMQSLSLHPAVWYAEALEKNFGIDSKHYTITCRTYSEER 180 Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 PDTFVGHHNTYGMAI NDEASGTPDVIN ILGF TE NANRFW+MTSNPRRL+G FY+I Sbjct: 181 PDTFVGHHNTYGMAIFNDEASGTPDVINTSILGFFTENNANRFWVMTSNPRRLNGWFYDI 240 Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 FN PL+DW+RFQIDTRTVEGIDP+FHE IIARYGLDSDVTRVEV GQFPQQDI+SFIP Sbjct: 241 FNVPLEDWQRFQIDTRTVEGIDPNFHENIIARYGLDSDVTRVEVLGQFPQQDINSFIPFY 300 Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360 IEEALNREP DPYAPL+MGCDIA EGGDNTVVVLRRG IEH+FDWS + ++ KI Sbjct: 301 RIEEALNREPIKDPYAPLVMGCDIAGEGGDNTVVVLRRGTNIEHIFDWSGLAVNVSSRKI 360 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 L+ KY+PDA+++DAN G +T YL GY V+ GQ RA D E RNRRTELHVKM Sbjct: 361 EELINKYKPDAVVVDANGIGVQTYYYLADEGYSVHPEKGQNRADDHESYRNRRTELHVKM 420 Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480 A+WLE AS+ +HSGLIQNLKSL+SFI PNTG+LA+ESKRVKGA STDYSD L YTFA +P Sbjct: 421 AEWLELASIPHHSGLIQNLKSLESFIEPNTGKLALESKRVKGAVSTDYSDALAYTFAVSP 480 Query: 481 PRSDMDFGRCPSYQYEGVDLLIERRFEY 508 RSDM+FGRC SYQYE +LL++RRF Y Sbjct: 481 ARSDMNFGRCRSYQYEADELLVDRRFSY 508 >gi|315121940|ref|YP_004062429.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495342|gb|ADR51941.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 509 Score = 644 bits (1662), Expect = 0.0, Method: Composition-based stats. Identities = 376/508 (74%), Positives = 428/508 (84%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 M+RELPT E EQ+L +LM+SD+IKLSF+NFVL FPW E T L FS PR WQL+FME Sbjct: 1 MTRELPTKIEHEQELMELMFSDDIKLSFTNFVLRLFPWSEANTSLANFSRPRRWQLDFME 60 Query: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 VD CL +V+NP+P++FKGA+SAGRGIGKTTLNAW++LWL+STRPG+S++CLANSETQL Sbjct: 61 AVDTDCLFNVDNPDPKIFKGAVSAGRGIGKTTLNAWMMLWLISTRPGMSILCLANSETQL 120 Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 K+TLWAEVSKWLS+LPNKHWFEMQSLSLHPA WY++ L + GIDSKHY+ CRTYSEER Sbjct: 121 KSTLWAEVSKWLSMLPNKHWFEMQSLSLHPAVWYAEALEKNFGIDSKHYTITCRTYSEER 180 Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 PDTFVGHHNTYGMAI NDEASGTPDVIN ILGF TE NANRFW+MTSNPRRL G FY+I Sbjct: 181 PDTFVGHHNTYGMAIFNDEASGTPDVINTSILGFFTENNANRFWVMTSNPRRLKGWFYDI 240 Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 FN PL+DW+RFQIDTRTVEGIDPSFHEGII+RYGLDSDVTRVEV GQFPQQDI+SFIP Sbjct: 241 FNVPLEDWQRFQIDTRTVEGIDPSFHEGIISRYGLDSDVTRVEVLGQFPQQDINSFIPFY 300 Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360 IEEALNREP DPYAPLIMGCDIA EGGDNTVVVLRRG IEH+FDWS + ++ KI Sbjct: 301 RIEEALNREPIKDPYAPLIMGCDIAGEGGDNTVVVLRRGTNIEHIFDWSGLAVNASSRKI 360 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 L+ KY+PDA+++DAN G +T YL GY V+ GQ RA D E RNRRTELHVKM Sbjct: 361 EELINKYKPDAVVVDANGIGVQTYYYLADEGYSVHAEKGQNRADDHESYRNRRTELHVKM 420 Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480 A+WLE AS+ NHSGLIQNLKSL+SFI PNTG+LA+ESKRVKGA STDYSD L YTFA +P Sbjct: 421 AEWLELASIPNHSGLIQNLKSLESFIEPNTGKLALESKRVKGAVSTDYSDALAYTFAVSP 480 Query: 481 PRSDMDFGRCPSYQYEGVDLLIERRFEY 508 RSDM+FGRC SYQYE +LL++RRF Y Sbjct: 481 ARSDMNFGRCRSYQYEADELLVDRRFSY 508 >gi|317120722|gb|ADV02544.1| putative phage terminase large subunit [Liberibacter phage SC2] gi|317120783|gb|ADV02604.1| putative phage terminase large subunit [Candidatus Liberibacter asiaticus] Length = 516 Score = 621 bits (1601), Expect = e-176, Method: Composition-based stats. Identities = 392/507 (77%), Positives = 414/507 (81%), Gaps = 9/507 (1%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME Sbjct: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 Query: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL Sbjct: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER Sbjct: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI Sbjct: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP Sbjct: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPQQ 300 Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360 I EAL R PDPYAPLIMGCDIA EG D TVVVLRRG +IE +FDWS + TN KI Sbjct: 301 YIVEALERVAIPDPYAPLIMGCDIAGEGEDKTVVVLRRGNIIERIFDWSGELIEVTNRKI 360 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGY-HVYRVLGQKRAVDLEFCRNRRTELHVK 419 S L+ +Y PDAI+ID N G YL + + V +LGQ+R+ + E N R EL+ Sbjct: 361 SSLINRYNPDAIVIDGNGIGGTVVSYLLNMHHISVEVILGQRRSTEPEQYHNLRAELYDL 420 Query: 420 MADWLEFASLINHS--GLIQNLKSLKSFIVPNTGELAIESKRVK----GAKSTDYSDGLM 473 M + + LI LKS+KS I G L IE KR G +S D+ D L Sbjct: 421 MRSAITGGLQLPDDCPDLINELKSIKS-ISDTLGRLLIEKKRQGRSEFGVRSPDFVDALC 479 Query: 474 YTFAENPPRSDMDFGRCPS-YQYEGVD 499 YTFA +PPR D + +YE +D Sbjct: 480 YTFAVDPPRKDNPLYQGQDISEYEALD 506 >gi|227355862|ref|ZP_03840255.1| phage terminase, large subunit [Proteus mirabilis ATCC 29906] gi|227164181|gb|EEI49078.1| phage terminase, large subunit [Proteus mirabilis ATCC 29906] Length = 494 Score = 495 bits (1275), Expect = e-138, Method: Composition-based stats. Identities = 138/495 (27%), Positives = 217/495 (43%), Gaps = 25/495 (5%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 MS L +PE EQ + D+ L + + FPWGE G LE ++ PR WQ E + Sbjct: 1 MSEALQKSPE-EQLIEDIASFTHDPL---GYAYYAFPWGEAGGELEEYNGPRQWQAEALN 56 Query: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 + H N P + A ++G GIGK+ + ++ W M T V+ AN+E QL Sbjct: 57 EIGEHLRNPKTRHQPLLL--ARASGHGIGKSAFISMIIKWGMDTCEDCKVVVTANTENQL 114 Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 +T W E++KW L +WF +++ + + +SE Sbjct: 115 RTKTWPEIAKWQRLSLTNNWFTCTKTAIYSND----------PNHANAWRADAVPWSENN 164 Query: 181 PDTFVGHHNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYE 239 + F G HN + ++ DEAS D++ G LT+ WI NP R +G+F E Sbjct: 165 TEAFAGLHNKGKRIILVFDEASNIADLVWEVAEGALTDEGTEIIWIAFGNPTRNTGRFRE 224 Query: 240 IFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPL 299 F K W QID+RTVEG + + YG DSD +V V G FP FIP Sbjct: 225 CFRKFKHRWNTKQIDSRTVEGSNKEQIKNWEEDYGEDSDFFKVRVRGVFPSASELQFIPT 284 Query: 300 NIIEEALNR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD-WSKTDLRTT 356 + +EA+ R +AP+I+G D A G D+ V+ LR+G + L+ + TD Sbjct: 285 GLTDEAMKRIVTQAEVAHAPVIIGVDPAYSGIDDAVIYLRQGLFSKCLWTGFKTTDDVVM 344 Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTEL 416 +I+ ++Y+ DA+ ID G G V + D + N+R E+ Sbjct: 345 AKRIADFEDQYKADAVHID-FGYGTGIHSIGTSWGRVWRLVKFGGASTDPQML-NKRGEM 402 Query: 417 HVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLM 473 + + WL+ I+ +L + + ++ +E K + + +S D L Sbjct: 403 YNSVKTWLKIGGAIDDQETADDLSCGEYKVRVIDSKIVLEDKTEIKKRLGRSPGKGDALA 462 Query: 474 YTFAENPPRSDMDFG 488 TFA + D ++ Sbjct: 463 LTFAYPVTKIDRNYS 477 >gi|268589373|ref|ZP_06123594.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] gi|291315400|gb|EFE55853.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] Length = 493 Score = 492 bits (1266), Expect = e-137, Method: Composition-based stats. Identities = 147/486 (30%), Positives = 226/486 (46%), Gaps = 26/486 (5%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 +PE EQ + D+ LS + L+ FPWGE GT LE + PR WQ E + + H Sbjct: 6 SPE-EQLINDIGMFTHDPLS---YALYAFPWGEAGTELENANGPRQWQAEALNEIGEHLR 61 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 N P + A ++G GIGK+ + ++ W M T V+ AN+E QL+T W E Sbjct: 62 NPETRHQP--LQLARASGHGIGKSAFISMIIKWGMDTCEDCKVVVTANTENQLRTKTWPE 119 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 ++KW L K WF +++ + + +SE + F G Sbjct: 120 IAKWQRLSITKDWFTYTKTAIYSND----------PNHANAWRADAVPWSENNTEAFAGL 169 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + +I DEAS D++ G LT+ N WI NP R +G+F E F K Sbjct: 170 HNQGKRIILIFDEASNIADLVWEVAEGALTDENTEIIWIAFGNPTRNTGRFRECFRKFKH 229 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 WK QID+RTVEG + E I YG+D D +V V G FP FIP + + A+ Sbjct: 230 RWKTKQIDSRTVEGTNKEQIEKWIQDYGVDDDFVKVRVRGIFPSTSEKQFIPTGLTDAAM 289 Query: 307 NREPCPDP--YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT-DLRTTNNKISGL 363 R +AP+I+G D A G D+ V+ LR+G + L+ SKT D +I+ Sbjct: 290 KRTVTQAEVSHAPIIIGVDPAYSGDDDAVIYLRQGLHSKCLWTGSKTIDDVIMAKRIADF 349 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 ++Y DA+ ID G G + V + D + RN+R E++ + W Sbjct: 350 EDQYGADAVHID-FGYGTGIQSVGMNWGRNWQLVQFNGASTDPQM-RNKRGEMYNNVKSW 407 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L+ I+ + ++L + + + V +G++ +ESK + + +S D L TFA Sbjct: 408 LKIGGAIDDQEVAEDLSTPE-YKVELSGKILLESKDDIKKRIGRSPGKGDALALTFAYPV 466 Query: 481 PRSDMD 486 + + + Sbjct: 467 TKKERN 472 >gi|212710820|ref|ZP_03318948.1| hypothetical protein PROVALCAL_01888 [Providencia alcalifaciens DSM 30120] gi|212686517|gb|EEB46045.1| hypothetical protein PROVALCAL_01888 [Providencia alcalifaciens DSM 30120] Length = 493 Score = 491 bits (1263), Expect = e-136, Method: Composition-based stats. Identities = 144/492 (29%), Positives = 223/492 (45%), Gaps = 28/492 (5%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 M + EQ + D+ LS + L+ FPWGE GT LE S PR WQ E + Sbjct: 1 MIETMSPE---EQLINDIGMFTHDPLS---YALYAFPWGEAGTELENASGPRQWQAEALN 54 Query: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 + H N P + A ++G GIGK+ + ++ W M T V+ AN+E QL Sbjct: 55 EIGEHLRNPETRHQP--LQLARASGHGIGKSAFISMIIKWGMDTCEDCKVVVTANTENQL 112 Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 +T W E++KW L K WF +++ + + +SE Sbjct: 113 RTKTWPEIAKWQRLSITKDWFTCTKTAIYSND----------PNHANAWRADAVPWSENN 162 Query: 181 PDTFVGHHNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYE 239 + F G HN + ++ DEAS D++ G LT+ N WI NP R +G+F E Sbjct: 163 TEAFAGLHNQGKRIILVFDEASNIADLVWEVAEGALTDENTEIIWIAFGNPTRNTGRFRE 222 Query: 240 IFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPL 299 F K WK QID+RTVEG + E I YG+D D +V V G FP FIP Sbjct: 223 CFRKFKHRWKTKQIDSRTVEGTNKEQIEKWIQDYGVDDDFVKVRVRGIFPSTSEKQFIPT 282 Query: 300 NIIEEALNREPCPDP--YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT-DLRTT 356 + + A+ R +AP+I+G D A G D+ V+ LR+G + L+ SKT D Sbjct: 283 GLTDAAMKRTVTQAEVSHAPIILGVDPAYSGDDDAVIYLRQGLHSKCLWTGSKTIDDVIM 342 Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTEL 416 +I+ ++Y DA+ ID G G + V + D + +N+R E+ Sbjct: 343 AKRIADYEDQYGADAVHID-FGYGTGIQSVGMNWGRNWQLVSFNGASTDPQM-QNKRGEM 400 Query: 417 HVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLM 473 + + WL+ I+ + +L + + + V +G++ +E K + + +S + D L Sbjct: 401 YNNVKSWLKIGGAIDDQEVADDLSTPE-YKVQLSGKILLEKKEDIKKRIGRSPNKGDALA 459 Query: 474 YTFAENPPRSDM 485 TFA + + Sbjct: 460 LTFAYPVTKKER 471 >gi|323156136|gb|EFZ42295.1| terminase large subunit [Escherichia coli EPECa14] Length = 491 Score = 483 bits (1244), Expect = e-134, Method: Composition-based stats. Identities = 142/493 (28%), Positives = 224/493 (45%), Gaps = 26/493 (5%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 +PE EQ + D+ L + L+ FPWGE+GT L + PR WQ + + H Sbjct: 7 SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 N P + A+++G GIGK+ + L+ W MST V+ AN++ QL+T W E Sbjct: 63 NPATRYQPLML--ALASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 + KW +L K WF + +++ D K + +SE + F G Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSNDLGHD----------KRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 WK QID+RTVEG + + + YG DSD ++ V G FP FIP + +EA+ Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290 Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363 R YAP+I+G D A G D+ V+ LR+G + L+ +K TD +I+ Sbjct: 291 KRVVTAAQVAYAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 ++Y+ DA+ ID G + G V + D + N+R E+ W Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L +++ +L + + + V G++ IE K + + +S D L+ TFA Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467 Query: 481 PRSDMDFGRCPSY 493 + G+ Sbjct: 468 SKRLRLPGQQNQQ 480 >gi|324008564|gb|EGB77783.1| hypothetical protein HMPREF9532_01752 [Escherichia coli MS 57-2] Length = 491 Score = 483 bits (1244), Expect = e-134, Method: Composition-based stats. Identities = 142/498 (28%), Positives = 226/498 (45%), Gaps = 27/498 (5%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 +PE EQ + D+ L + L+ FPWGE+GT L + PR WQ + + H Sbjct: 7 SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 N P + A ++G GIGK+ + L+ W MST V+ AN++ QL+T W E Sbjct: 63 NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 + KW +L K WF + +++ D K + +SE + F G Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSNDLGHD----------KRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 WK QID+RTVEG + + + YG DSD ++ V G FP FIP + +EA+ Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290 Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363 R +AP+I+G D A G D+ V+ LR+G + L+ +K TD +I+ Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 ++Y+ DA+ ID G + G V + D + N+R E+ W Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L +++ +L + + + V G++ IE K + + +S D L+ TFA Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467 Query: 481 PRSDMDFGRCPSYQYEGV 498 + ++ S Q + Sbjct: 468 SKR-INIPGQQSQQGRAI 484 >gi|327252187|gb|EGE63859.1| terminase large subunit [Escherichia coli STEC_7v] Length = 491 Score = 482 bits (1240), Expect = e-134, Method: Composition-based stats. Identities = 142/493 (28%), Positives = 224/493 (45%), Gaps = 26/493 (5%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 +PE EQ + D+ L + L+ FPWGE+GT L + PR WQ + + H Sbjct: 7 SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 N P + A ++G GIGK+ + L+ W MST V+ AN++ QL+T W E Sbjct: 63 NPATRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 + KW +L K WF + +++ D H K + +SE + F G Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 WK QID+RTVEG + + + YG DSD ++ V G FP FIP + +EA+ Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290 Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363 R +AP+I+G D A G D+ V+ LR+G + L+ +K TD +I+ Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 ++Y+ DA+ ID G + G V + D + N+R E+ W Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L +++ +L + + + V G++ IE K + + +S D L+ TFA Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467 Query: 481 PRSDMDFGRCPSY 493 + G+ Sbjct: 468 SKRLRIPGQQNQQ 480 >gi|332344357|gb|AEE57691.1| terminase, large subunit [Escherichia coli UMNK88] Length = 491 Score = 481 bits (1239), Expect = e-134, Method: Composition-based stats. Identities = 142/493 (28%), Positives = 224/493 (45%), Gaps = 26/493 (5%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 +PE EQ + D+ L + L+ FPWGE+GT L + PR WQ + + H Sbjct: 7 SPE-EQLVEDIASFTYDPL---GYALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQ 62 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 N P + A ++G GIGK+ + L+ W MST V+ AN++ QL+T W E Sbjct: 63 NPATRHQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 + KW +L K WF + +++ D H K + +SE + F G Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 WK QID+RTVEG + + + YG DSD ++ V G FP FIP + +EA+ Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290 Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363 R +AP+I+G D A G D+ V+ LR+G + L+ +K TD +I+ Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 ++Y+ DA+ ID G + G V + D + N+R E+ W Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L +++ +L + + + V G++ IE K + + +S D L+ TFA Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467 Query: 481 PRSDMDFGRCPSY 493 + G+ Sbjct: 468 SKRLRLPGQQNQQ 480 >gi|294491573|gb|ADE90329.1| putative phage terminase, large subunit [Escherichia coli IHE3034] Length = 491 Score = 481 bits (1239), Expect = e-134, Method: Composition-based stats. Identities = 142/493 (28%), Positives = 224/493 (45%), Gaps = 26/493 (5%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 +PE EQ + D+ L + L+ FPWGE+GT L + PR WQ + + H Sbjct: 7 SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 N P + A ++G GIGK+ + L+ W MST V+ AN++ QL+T W E Sbjct: 63 NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 + KW +L K WF + +++ D H K + +SE + F G Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 WK QID+RTVEG + + + YG DSD ++ V G FP FIP + +EA+ Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290 Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363 R +AP+I+G D A G D+ V+ LR+G + L+ +K TD +I+ Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 ++Y+ DA+ ID G + G V + D + N+R E+ W Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L +++ +L + + + V G++ IE K + + +S D L+ TFA Sbjct: 409 LRLGGMLDDQETADDLSTAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467 Query: 481 PRSDMDFGRCPSY 493 + G+ Sbjct: 468 SKRLRIPGQQNQQ 480 >gi|330007152|ref|ZP_08305894.1| hypothetical protein HMPREF9538_03583 [Klebsiella sp. MS 92-3] gi|328535499|gb|EGF61959.1| hypothetical protein HMPREF9538_03583 [Klebsiella sp. MS 92-3] Length = 495 Score = 481 bits (1238), Expect = e-133, Method: Composition-based stats. Identities = 140/485 (28%), Positives = 218/485 (44%), Gaps = 25/485 (5%) Query: 6 PTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAH 65 P EQ + D+ L + L+ FPWGE GT L S PR WQ + + H Sbjct: 8 PEEQLKEQLIDDIASFTHDPL---GYALYAFPWGEDGTELAHASGPRQWQADAFREIGEH 64 Query: 66 CLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 N P + + ++G GIGK+ + L+ W MST V+ AN++ QL+T W Sbjct: 65 LQNPATRHQPLM--ISRASGHGIGKSAFISMLINWAMSTCEDCKVVVTANTDNQLRTKTW 122 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ KW +L K WF + +++ D H K + +SE + F Sbjct: 123 PEIIKWSNLAITKEWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFA 172 Query: 186 GHHNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP 244 G HN + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K Sbjct: 173 GLHNERKRIVVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKY 232 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 WK QID+RTVEG + + + YG DSD +V V G FP FIP + +E Sbjct: 233 KHRWKCAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKVRVRGIFPDASELQFIPTGLTDE 292 Query: 305 ALNR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKIS 361 A+ R +AP I+G D A G D+ V+ LR+G + L+ +K TD +I+ Sbjct: 293 AMKRVVTAAQVAHAPRIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIA 352 Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421 ++Y+ DA+ ID G + G V + D + N+R E+ Sbjct: 353 DFEDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASADPQML-NKRGEMFNACK 410 Query: 422 DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAE 478 WL+ ++ +L + + + V G++ +E K + + +S D L+ TFA Sbjct: 411 TWLKLGGALDDQETADDLSAAE-YKVRVDGKIVMEPKEDIKERLGRSPGKGDALLLTFAY 469 Query: 479 NPPRS 483 + Sbjct: 470 PVTKR 474 >gi|218700994|ref|YP_002408623.1| putative phage terminase, large subunit [Escherichia coli IAI39] gi|218370980|emb|CAR18807.1| putative phage terminase, large subunit [Escherichia coli IAI39] Length = 491 Score = 481 bits (1237), Expect = e-133, Method: Composition-based stats. Identities = 143/498 (28%), Positives = 227/498 (45%), Gaps = 27/498 (5%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 +PE EQ + D+ L + L+ FPWGE+GT L + PR WQ + + H Sbjct: 7 SPE-EQLIDDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 N P + A ++G GIGK+ + L+ W MST V+ AN++ QL+T W E Sbjct: 63 NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 + KW +L K WF + +++ D H K + +SE + F G Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 WK QID+RTVEG + + + YG DSD ++ V G FP FIP + +EA+ Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290 Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363 R +AP+I+G D A G D+ V+ LR+G + L+ +K TD +I+ Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 ++Y+ DA+ ID G + G V + D + N+R E+ W Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L +++ +L + + + V G++ IE K + + +S D L+ TFA Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467 Query: 481 PRSDMDFGRCPSYQYEGV 498 + ++ S Q + Sbjct: 468 SKR-INIPGQQSQQGRAI 484 >gi|309702815|emb|CBJ02146.1| putative terminase, large subunit [Escherichia coli ETEC H10407] Length = 493 Score = 481 bits (1237), Expect = e-133, Method: Composition-based stats. Identities = 137/498 (27%), Positives = 228/498 (45%), Gaps = 25/498 (5%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 +PE EQ + D+ L + L+ FPWGE+GT L + PR WQ + + H Sbjct: 7 SPE-EQLVEDIAGFTYDPL---GYALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQ 62 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 N P + A ++G GIGK+ + L+ W MST V+ AN++ QL+T W E Sbjct: 63 NPATRHQPIML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 + KW +L K WF + +++ D H K + +SE + F G Sbjct: 121 IIKWSNLAITKEWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 WK QID+RTVEG + + + YG DSD +V V G FP + FIP + + A+ Sbjct: 231 RWKCAQIDSRTVEGTNKEQLQKWVDDYGEDSDFVKVRVRGIFPDASENQFIPSGLTQPAV 290 Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK-ISGL 363 R P +A +++G D + +G D V+ LR+G + L +W +T K I+ Sbjct: 291 GRVITPAQVQHAAVVLGVDPSHQGKDPAVIYLRQGLHCKKLGEWQRTTDDVLFAKVIADF 350 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 ++Y+ DA+ ID G + G + ++ D E N+R E++ D Sbjct: 351 EDQYQADAVFID-YGYGTGLKSVGDNWGRNWTLIMFGSGTADPEM-GNKRGEMYKSARDA 408 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L+ + ++ L L + + + ++ K + +S + +D + T+A Sbjct: 409 LKLGAQLDSQELADELSAPEYKVRLKDSRKILQDKDEVKELLGRSPNNADAYVLTYAAPV 468 Query: 481 PRSDMDFGRCPSYQYEGV 498 + ++G+ S Q + + Sbjct: 469 TKKQFNYGQQQSQQGKAL 486 >gi|298381721|ref|ZP_06991320.1| terminase large subunit protein [Escherichia coli FVEC1302] gi|301019339|ref|ZP_07183525.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|298279163|gb|EFI20677.1| terminase large subunit protein [Escherichia coli FVEC1302] gi|299882256|gb|EFI90467.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|323948690|gb|EGB44595.1| hypothetical protein ERKG_04913 [Escherichia coli H252] Length = 491 Score = 481 bits (1237), Expect = e-133, Method: Composition-based stats. Identities = 142/493 (28%), Positives = 224/493 (45%), Gaps = 26/493 (5%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 +PE EQ + D+ L + L+ FPWGE+GT L + PR WQ + + H Sbjct: 7 SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 N P + A ++G GIGK+ + L+ W MST V+ AN++ QL+T W E Sbjct: 63 NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 + KW +L K WF + +++ D H K + +SE + F G Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 WK QID+RTVEG + + + YG DSD ++ V G FP FIP + +EA+ Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290 Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363 R +AP+I+G D A G D+ V+ LR+G + L+ +K TD +I+ Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 ++Y+ DA+ ID G + G V + D + N+R E+ W Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L +++ +L + + + V G++ IE K + + +S D L+ TFA Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467 Query: 481 PRSDMDFGRCPSY 493 + G+ Sbjct: 468 SKRLRIPGQQNQQ 480 >gi|300898423|ref|ZP_07116764.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357890|gb|EFJ73760.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 491 Score = 480 bits (1236), Expect = e-133, Method: Composition-based stats. Identities = 142/493 (28%), Positives = 224/493 (45%), Gaps = 26/493 (5%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 +PE EQ + D+ L + L+ FPWGE+GT L + PR WQ + + H Sbjct: 7 SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQ 62 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 N P + A ++G GIGK+ + L+ W MST V+ AN++ QL+T W E Sbjct: 63 NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 + KW +L K WF + +++ D H K + +SE + F G Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 WK QID+RTVEG + + + YG DSD ++ V G FP FIP + +EA+ Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290 Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363 R +AP+I+G D A G D+ V+ LR+G + L+ +K TD +I+ Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 ++Y+ DA+ ID G + G V + D + N+R E+ W Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L +++ +L + + + V G++ IE K + + +S D L+ TFA Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467 Query: 481 PRSDMDFGRCPSY 493 + G+ Sbjct: 468 SKRLRIPGQQNQQ 480 >gi|117624715|ref|YP_853628.1| putative phage terminase, large subunit [Escherichia coli APEC O1] gi|115513839|gb|ABJ01914.1| putative phage terminase, large subunit [Escherichia coli APEC O1] Length = 491 Score = 480 bits (1235), Expect = e-133, Method: Composition-based stats. Identities = 141/493 (28%), Positives = 224/493 (45%), Gaps = 26/493 (5%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 +PE EQ + D+ L + L+ FPWGE+GT L + PR WQ + + H Sbjct: 7 SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 N P + A ++G GIGK+ + L+ W MST V+ AN++ QL+T W E Sbjct: 63 NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 + KW +L K WF + +++ D H K + +SE + F G Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 WK QID+RTVEG + + + YG DSD ++ V G FP FIP + +EA+ Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290 Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363 R ++P+I+G D A G D+ V+ LR+G + L+ +K TD +I+ Sbjct: 291 KRVVTAAQVAHSPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 ++Y+ DA+ ID G + G V + D + N+R E+ W Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L +++ +L + + + V G++ IE K + + +S D L+ TFA Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467 Query: 481 PRSDMDFGRCPSY 493 + G+ Sbjct: 468 SKRLRIPGQQNQQ 480 >gi|89152423|ref|YP_512256.1| putative terminase large subunit [Escherichia phage phiV10] gi|74055446|gb|AAZ95895.1| putative terminase large subunit [Escherichia phage phiV10] Length = 491 Score = 480 bits (1235), Expect = e-133, Method: Composition-based stats. Identities = 141/493 (28%), Positives = 223/493 (45%), Gaps = 26/493 (5%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 +PE EQ + D+ L + L+ FPWGE+GT L + PR WQ + + H Sbjct: 7 SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 N P + A ++G GIGK+ + L+ W MST V+ AN++ QL+T W E Sbjct: 63 NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 + KW +L K WF + +++ D H K + +SE + F G Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 WK QID+RTVEG + + + YG SD ++ V G FP FIP + +EA+ Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEGSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290 Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363 R +AP+I+G D A G D+ V+ LR+G + L+ +K TD +I+ Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 ++Y+ DA+ ID G + G V + D + N+R E+ W Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L +++ +L + + + V G++ IE K + + +S D L+ TFA Sbjct: 409 LRLGGMLDDQETADDLSTAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467 Query: 481 PRSDMDFGRCPSY 493 + G+ Sbjct: 468 SKRLRIPGQQNQQ 480 >gi|331648179|ref|ZP_08349269.1| conserved hypothetical protein [Escherichia coli M605] gi|331043039|gb|EGI15179.1| conserved hypothetical protein [Escherichia coli M605] Length = 491 Score = 479 bits (1233), Expect = e-133, Method: Composition-based stats. Identities = 142/493 (28%), Positives = 224/493 (45%), Gaps = 26/493 (5%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 +PE EQ + D+ L + L+ FPWGE+GT L + PR WQ + + H Sbjct: 7 SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 N P + A ++G GIGK+ + L+ W MST V+ AN++ QL+T W E Sbjct: 63 NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 + KW +L K WF + +++ D H K + +SE + F G Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 WK QID+RTVEG + + + YG DSD ++ V G FP FIP + +EA+ Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290 Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363 R +AP+I+G D A G D+ V+ LR+G + L+ +K TD +I+ Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 ++Y+ DA+ ID G + G V + D + N+R E+ W Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNACKIW 408 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L +++ +L + + + V G++ IE K + + +S D L+ TFA Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467 Query: 481 PRSDMDFGRCPSY 493 + G+ Sbjct: 468 SKRLRIPGQQNQQ 480 >gi|30387381|ref|NP_848210.1| terminase large subunit [Enterobacteria phage epsilon15] gi|30266036|gb|AAO06065.1| terminase large subunit [Salmonella phage epsilon15] Length = 491 Score = 479 bits (1233), Expect = e-133, Method: Composition-based stats. Identities = 141/494 (28%), Positives = 223/494 (45%), Gaps = 26/494 (5%) Query: 12 EQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVN 71 EQ + D+ L + L+ FPWGE GT L + PR WQ + + H N Sbjct: 10 EQLVEDIASFTYDPL---GYALYAFPWGEDGTELAHATGPRKWQADAFREIRDHLQNPAT 66 Query: 72 NPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKW 131 P + A ++G GIGK+ + L+ W MST V+ AN++ QL+T W E+ KW Sbjct: 67 RHQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKW 124 Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191 +L K WF + +++ D H K + +SE + F G HN Sbjct: 125 SNLAITKEWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGLHNER 174 Query: 192 G-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250 + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K WK Sbjct: 175 KRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKC 234 Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-- 308 QID+RTVEG + + + YG +SD +V V G FP FIP + +EA+ R Sbjct: 235 AQIDSRTVEGTNKQQLQKWVDDYGEESDFVKVRVRGIFPDASELQFIPTGLTDEAMKRVV 294 Query: 309 EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKY 367 +AP+I+G D A G D+ V+ LR+G + L+ +K TD +I+ ++Y Sbjct: 295 TAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQY 354 Query: 368 RPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFA 427 + DA+ ID G + G + + D + N+R E+ WL+ Sbjct: 355 QADAVFID-FGYGTGLKSIGDGWGRTWQLIPFGGGSTDPQML-NKRGEMFNSCKTWLKLG 412 Query: 428 SLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPRSD 484 ++ +L + + + V G++ IE K + + +S D L+ TFA + Sbjct: 413 GALDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPVTKH- 470 Query: 485 MDFGRCPSYQYEGV 498 + S Q + V Sbjct: 471 LRIPGQESQQGKAV 484 >gi|301046412|ref|ZP_07193572.1| conserved hypothetical protein [Escherichia coli MS 185-1] gi|300301638|gb|EFJ58023.1| conserved hypothetical protein [Escherichia coli MS 185-1] Length = 491 Score = 479 bits (1233), Expect = e-133, Method: Composition-based stats. Identities = 142/493 (28%), Positives = 223/493 (45%), Gaps = 26/493 (5%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 +PE EQ + D+ L + L+ FPWGE GT L + PR WQ + + H Sbjct: 7 SPE-EQLVEDIASFTYDPL---GYALYAFPWGEDGTELAHATGPRQWQADAFREIRDHLQ 62 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 N P + A ++G GIGK+ + L+ W MST V+ AN++ QL+T W E Sbjct: 63 NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 + KW +L K WF + +++ D H K + +SE + F G Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 WK QID+RTVEG + + + YG DSD ++ V G FP FIP + +EA+ Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290 Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363 R +AP+I+G D A G D+ V+ LR+G + L+ +K TD +I+ Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 ++Y+ DA+ ID G + G V + D + N+R E+ W Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L +++ +L + + + V G++ IE K + + +S D L+ TFA Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467 Query: 481 PRSDMDFGRCPSY 493 + G+ Sbjct: 468 SKRLRLPGQQNQQ 480 >gi|215487825|ref|YP_002330256.1| predicted terminase, large subunit [Escherichia coli O127:H6 str. E2348/69] gi|215265897|emb|CAS10306.1| predicted terminase, large subunit [Escherichia coli O127:H6 str. E2348/69] Length = 493 Score = 479 bits (1232), Expect = e-133, Method: Composition-based stats. Identities = 137/498 (27%), Positives = 226/498 (45%), Gaps = 25/498 (5%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 +PE EQ + D+ L + L+ FPWGE GT L + PR WQ + + H Sbjct: 7 SPE-EQLVEDIASFTYDPL---GYALYAFPWGEDGTELAHATGPRKWQADAFREIRDHLQ 62 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 N P + A ++G GIGK+ + L+ W MST V+ AN++ QL+T W E Sbjct: 63 NPATRHQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 + KW +L K WF + +++ D H K + +SE + F G Sbjct: 121 IIKWSNLAITKEWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 WK QID+RTVEG + + + YG DSD +V V G FP + FIP + + A+ Sbjct: 231 RWKCAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKVRVRGIFPDASENQFIPSGLTQPAV 290 Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK-ISGL 363 R P +A +++G D + +G D V+ LR+G + L +W +T K I+ Sbjct: 291 GRVITPAQVQHAAVVLGVDPSHQGKDPAVIYLRQGLHCKKLGEWQRTTDDVLFAKIIADF 350 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 ++Y+ DA+ ID G + G + + D E N+R E++ D Sbjct: 351 EDQYQADAVFID-YGYGTGLKSVGDNWGRNWTLIQFGSGTADPEM-GNKRGEMYKSARDA 408 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L+ + ++ L L + + + ++ K + +S + +D + T+A Sbjct: 409 LKLGAQLDSQNLADELSAPEYKVRLKDSRKILQDKEEVKELLGRSPNDADAYVLTYAAPV 468 Query: 481 PRSDMDFGRCPSYQYEGV 498 + ++G+ S Q + + Sbjct: 469 TKKQFNYGQQQSQQGKAL 486 >gi|262043569|ref|ZP_06016682.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039103|gb|EEW40261.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 491 Score = 478 bits (1230), Expect = e-132, Method: Composition-based stats. Identities = 141/483 (29%), Positives = 219/483 (45%), Gaps = 26/483 (5%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 +PE EQ + D+ L + L+ FPWGE GT L + PR WQ + + H Sbjct: 7 SPE-EQLIDDIASFTHDPL---GYALYAFPWGEDGTELAHATGPRKWQADAFREIRDHLQ 62 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 N P + A ++G GIGK+ + L+ W MST V+ AN++ QL+T W E Sbjct: 63 NPATRHQPLML--ARASGHGIGKSAFISMLINWAMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 + KW +L K WF + +++ D H K + +SE + F G Sbjct: 121 IIKWSNLAITKEWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + ++ DEAS D++ G LT+ + W+ NP R +G+F E F K Sbjct: 171 HNERKRIVVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 WK QID+RTVEG + + + YG DSD +V V G FP FIP + +EA+ Sbjct: 231 RWKCAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKVRVRGIFPDASELQFIPTGLTDEAM 290 Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363 R +AP I+G D A G D+ V+ LR+G + L+ +K TD +I+ Sbjct: 291 KRVVTAVQVAHAPRIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 ++Y DA+ ID G + G V + D + N+R E+ W Sbjct: 351 EDQYLADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASADPQML-NKRGEMFNACKTW 408 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L+ ++ +L + + + V G++ +E K + + +S D L+ TFA Sbjct: 409 LKLGGALDDQETADDLSAAE-YKVRVDGKIVMEPKEDIKERLGRSPGKGDALLLTFAYPV 467 Query: 481 PRS 483 + Sbjct: 468 TKR 470 >gi|320175050|gb|EFW50163.1| terminase B protein, putative [Shigella dysenteriae CDC 74-1112] Length = 480 Score = 476 bits (1225), Expect = e-132, Method: Composition-based stats. Identities = 138/486 (28%), Positives = 220/486 (45%), Gaps = 25/486 (5%) Query: 15 LFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPN 74 + D+ L + L+ FPWGE+GT L + PR WQ + + H N Sbjct: 2 IEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQ 58 Query: 75 PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSL 134 P + A ++G GIGK+ + L+ W MST V+ AN++ QL+T W E+ KW +L Sbjct: 59 PLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNL 116 Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-M 193 K WF + +++ D H K + +SE + F G HN + Sbjct: 117 AITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRI 166 Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 ++ DEAS D++ G LT+ + W+ NP R +G+F E F K WK QI Sbjct: 167 IVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQI 226 Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPC 311 D+RTVEG + + + YG DSD ++ V G FP FIP + +EA+ R Sbjct: 227 DSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAA 286 Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPD 370 +AP+I+G D A G D+ V+ LR+G + L+ +K TD +I+ ++Y+ D Sbjct: 287 QVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQAD 346 Query: 371 AIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI 430 A+ ID G + G V + D + N+R E+ + WL ++ Sbjct: 347 AVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFISCKTWLRLGGML 404 Query: 431 NHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPRSDMDF 487 + +L + + + V G++ IE K + + +S D L+ TFA + Sbjct: 405 DDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPVSKRLRIP 463 Query: 488 GRCPSY 493 G+ Sbjct: 464 GQQNQQ 469 >gi|304398406|ref|ZP_07380280.1| terminase, large subunit [Pantoea sp. aB] gi|304354272|gb|EFM18645.1| terminase, large subunit [Pantoea sp. aB] Length = 490 Score = 474 bits (1220), Expect = e-131, Method: Composition-based stats. Identities = 135/485 (27%), Positives = 215/485 (44%), Gaps = 24/485 (4%) Query: 13 QKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNN 72 Q + D+ + L+ FPWGE+GT L PR WQ + + + AH N Sbjct: 10 QLIEDIGAFTHDPF---GYALYAFPWGEEGTDLAYSKGPRQWQEDAFKQIGAHLQNPDTR 66 Query: 73 PNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWL 132 P + A +G GIGK+ + LV W M T V+ AN+E QL+T W E++KW Sbjct: 67 HQPLMIGRA--SGHGIGKSAFISMLVKWGMDTCEDCKVVVTANTENQLRTKTWPEIAKWQ 124 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 L + WF + +++ +K + +SE + F G HN Sbjct: 125 RLSITQDWFTCTATAIYSND----------PSHAKSWRADAIPWSENNTEAFAGLHNERK 174 Query: 193 -MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF 251 + +I DEAS D++ G LT+ N W+ NP R +G+F E F K WK Sbjct: 175 RIILIFDEASNIADLVWEVAEGALTDENTEIIWVAFGNPTRNTGRFRECFRKLRHRWKTA 234 Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 QID+R+VEG + + + YG DSD +V V G FP FIP + + A+ R Sbjct: 235 QIDSRSVEGTNKEQIQKWVDDYGEDSDFVKVRVRGLFPSASEAQFIPTGLTDAAVGRVIT 294 Query: 312 PDP--YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG-LVEKYR 368 P +A ++G D A +GGD V+ LR+G + L ++ +T KI ++YR Sbjct: 295 PGQVAHAATVIGVDPAHQGGDPAVIYLRQGLHTKKLGEYQRTTDDVLFAKIVASFEDEYR 354 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 DA+ ID G + G + + + D + N+R E++ + WL+ Sbjct: 355 ADAVFID-YGYGTGLKSVGDNWGRNWQLIQFGGGSTDPQM-ANKRGEMYNAVKTWLKDGG 412 Query: 429 LINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPRSDM 485 ++ + + L + + + + +E K + + KS + +D L TFA + Sbjct: 413 QLDSQQVAEELSAAEYKVRLKDSRIVLEDKTSIKERLGKSPNDADALALTFAFPVVKKLH 472 Query: 486 DFGRC 490 G Sbjct: 473 YVGSN 477 >gi|303328395|ref|ZP_07358832.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861389|gb|EFL84326.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 500 Score = 468 bits (1204), Expect = e-129, Method: Composition-based stats. Identities = 144/465 (30%), Positives = 206/465 (44%), Gaps = 26/465 (5%) Query: 28 FSNFVLHFFPWGEKGTPLEGF-SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGR 86 FVL FPWG G L + P WQ E + + S V + A+S+G Sbjct: 29 PLGFVLFAFPWG--GGALADYPDGPDVWQREILRGMGEQL--STGASAASVIREAVSSGH 84 Query: 87 GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSL 146 G+GK+ L AW++LW MST + AN+E QLK WAE++KW L +WF+ + Sbjct: 85 GVGKSALVAWIILWAMSTFSDTRGVVTANTENQLKGKTWAELAKWHRLCLCGYWFDCTAT 144 Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT-YGMAIINDEASGTPD 205 +L K + +SE + F G HN + +I DEAS PD Sbjct: 145 ALIST----------QAGHEKTWRVDMVAWSERNTEAFAGLHNKGRRVLLIFDEASAIPD 194 Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSF 265 I G LT+ + W NP R +G+F E F + W ++D+RT D + Sbjct: 195 AIWEVSEGALTDADTEIIWCCFGNPTRNTGRFRECFGRYAHRWNTRRVDSRTAAMTDKNQ 254 Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY--APLIMGCD 323 + YG DSD RV V G+FP+ FI +I+ EA R PD Y AP I+G D Sbjct: 255 LAQWVEDYGEDSDFVRVRVRGEFPRAGDRQFISSDIVHEARGRSLKPDQYSFAPRILGVD 314 Query: 324 IAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGART 383 +A G D +V+ R+G + D T ++ ++ D I +D GA Sbjct: 315 VARSGSDQSVITRRQGLACLEQRKFRGLDTVTLAGIVAEECREWGADKIFVDGIGVGAGV 374 Query: 384 CDYLEM---LGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHS-GLIQNL 439 D L LG+ V + A+ E NRR E+ M WL + L + L Sbjct: 375 VDALRQVYGLGHLVVDAVAGATALQPERFLNRRAEMWTAMRKWLAEGGAVPDDAELAEQL 434 Query: 440 KSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPP 481 L+ + V +G+L +ESK + +G S D +D L TF P Sbjct: 435 CGLE-YAVTVSGKLKLESKDDMKARGLTSPDCADALALTFYAPVP 478 >gi|167032754|ref|YP_001667985.1| putative phage terminase large subunit [Pseudomonas putida GB-1] gi|166859242|gb|ABY97649.1| putative phage terminase, large subunit [Pseudomonas putida GB-1] Length = 499 Score = 461 bits (1187), Expect = e-127, Method: Composition-based stats. Identities = 143/491 (29%), Positives = 225/491 (45%), Gaps = 27/491 (5%) Query: 8 NPETEQKLF-DLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHC 66 + EQ+L D+ + L +VL+ FPWGE G L + PR WQ E +E + Sbjct: 7 EIDYEQELANDIASFSDDPL---GYVLYAFPWGEAGGELANKTGPRKWQREVLESIGEQL 63 Query: 67 LNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA 126 + EV + A+++G GIGK+ L +W++ W + T + AN+E+QL+T W Sbjct: 64 RAGAKDRG-EVIREAVASGHGIGKSALVSWVIKWALDTEVDTRGVVTANTESQLRTKTWP 122 Query: 127 EVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVG 186 EV+KW L HWF++ +L D H K++ +S+ + F G Sbjct: 123 EVAKWNRLSITAHWFKLTGTALIST----DPDH------EKNWRIDAVPWSDTNTEAFAG 172 Query: 187 HHNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL 245 HN + +I DEAS D++ G LT+ + W NP R SG+F E F K Sbjct: 173 LHNEGKRILLIFDEASAIADLVWEVAEGALTDADTEIIWAAFGNPTRNSGRFRECFTKFK 232 Query: 246 DDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305 W+ Q+D+RTV+G + + IA YG DSD R+ V G FP+ IP + + EA Sbjct: 233 HRWRHRQVDSRTVDGTNKTQIAKWIADYGEDSDFVRIRVRGMFPRASDLQLIPTDWVAEA 292 Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIE--HLFDWSKTDLRTTN---NKI 360 + R+ L+ G DIA G DN V+ RRG + ++ R T K+ Sbjct: 293 MRRDGVYGLDDALVCGIDIARGGMDNNVIRFRRGMDAKSIKPIKIPGSETRNTTPFIAKV 352 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEML--GYHVYRVLGQKRAVDLEFCRNRRTELHV 418 LV ++RPDA+ +D+ G D L L G + V +A D N RT + Sbjct: 353 CTLVVEHRPDAVFVDSTGVGGPVADQLRRLLPGVMIIDVNFASQAPD-RHYANMRTYIWW 411 Query: 419 KMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYT 475 +M + ++ I ++ + + ++ ++A+E K + + S D D L T Sbjct: 412 RMREAIKLGLAIESDTELETELTSPEYDHNSSDQIALEKKKDIKKRLGISPDDGDALALT 471 Query: 476 FAENPPRSDMD 486 F ++ Sbjct: 472 FTMPVMKAQYQ 482 >gi|228911519|ref|ZP_04075310.1| hypothetical protein bthur0013_56490 [Bacillus thuringiensis IBL 200] gi|228848128|gb|EEM92991.1| hypothetical protein bthur0013_56490 [Bacillus thuringiensis IBL 200] Length = 459 Score = 454 bits (1168), Expect = e-125, Method: Composition-based stats. Identities = 132/494 (26%), Positives = 216/494 (43%), Gaps = 75/494 (15%) Query: 14 KLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNP 73 ++ D+ W D + +F+ +L F+ P WQ + + ++ +P Sbjct: 2 EIIDVYWDDPV--AFAEDMLGFY--------------PDEWQRKVL-------MDLAQSP 38 Query: 74 NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133 K ++ +G+G+GKT L + +V+W + RP VIC A ++ QL T LWAE++KWL Sbjct: 39 -----KVSVRSGQGVGKTGLESVVVIWFLCCRPNPKVICTAPTKEQLFTVLWAEIAKWLE 93 Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193 K+ + ++ + + RT + +P+ G H Y M Sbjct: 94 GSAVKNLLKWTKTRVYMIG------------SEERWFATARTAT--KPENMQGFHEDY-M 138 Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 + DEASG D I ILG L+ A + NP R SG FY+ N+ D +K ++ Sbjct: 139 LFVCDEASGIADPIMEAILGTLS--GAENKLFLCGNPTRTSGVFYDSHNRDRDLYKIHKV 196 Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313 + E + +YG SDV RV V G+FP+ + D+FIPL I+E+A + + P Sbjct: 197 SSLDSPRTSKDNIEVLKKKYGEGSDVWRVRVLGEFPKAEADAFIPLEIVEQAASCKVEPT 256 Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA-- 371 L +G D+A G D TV+ R G + L + K D T + L ++Y Sbjct: 257 -GETLDLGVDVARFGDDETVIAPRIGNKVFKLLNHYKQDTMETAGHVLKLAKEYMAKYKQ 315 Query: 372 -----IIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 I +D + G D L E L + VY V+ + +D E N E + Sbjct: 316 LKRVDIKVDDSGVGGGVTDRLKEVIKSERLPFKVYPVVNNGKPLDDEHYDNAGAEGWAVV 375 Query: 421 ADWLE------------FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKS 465 D LE + N +I S K + + + G++A+E K + +G +S Sbjct: 376 RDLLEENMKAFIQGEEPTMEIPNDEKMISQFSSRK-YRITSRGKIALERKEEMKKRGLQS 434 Query: 466 TDYSDGLMYTFAEN 479 D +D ++ F + Sbjct: 435 PDRADAIVLAFYKP 448 >gi|228968731|ref|ZP_04129698.1| hypothetical protein bthur0004_54930 [Bacillus thuringiensis serovar sotto str. T04001] gi|228790961|gb|EEM38595.1| hypothetical protein bthur0004_54930 [Bacillus thuringiensis serovar sotto str. T04001] Length = 459 Score = 454 bits (1168), Expect = e-125, Method: Composition-based stats. Identities = 133/494 (26%), Positives = 217/494 (43%), Gaps = 75/494 (15%) Query: 14 KLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNP 73 ++ D+ W D + +F+ +L F+ P WQ + + ++ +P Sbjct: 2 EIIDVYWDDPV--AFAEDMLGFY--------------PDEWQRKVL-------MDLAQSP 38 Query: 74 NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133 K ++ +G+G+GKT L + +V+W + RP VIC A ++ QL T LWAE++KWL Sbjct: 39 -----KVSVRSGQGVGKTGLESVVVIWFLCCRPNPKVICTAPTKEQLFTVLWAEIAKWLE 93 Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193 K+ + ++ + + RT + +P+ G H Y M Sbjct: 94 GSAVKNLLKWTKTRVYMIG------------SEERWFATARTAT--KPENMQGFHEDY-M 138 Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 + DEASG D I ILG L+ A + NP R SG FY+ N+ D +K ++ Sbjct: 139 LFVCDEASGIADPIMEAILGTLS--GAENKLFLCGNPTRTSGVFYDSHNRDRDLYKIHKV 196 Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313 + E + +YG SDV RV V G+FP+ + D+FIPL I+E+A + + P Sbjct: 197 SSLDSPRTSKDNIEVLKKKYGEGSDVWRVRVLGEFPKAEADAFIPLEIVEQAASCKVEPT 256 Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA-- 371 L +G D+A G D TV+ R G + L + K D T + L ++Y Sbjct: 257 -GETLDLGVDVARFGDDETVIAPRIGNKVFKLLNHYKQDTMETAGHVLKLAKEYMAKYKQ 315 Query: 372 -----IIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 I +D + G D L E L + VY V+ + +D E N TE + Sbjct: 316 LKRVDIKVDDSGVGGGVTDRLKEVIKSERLPFKVYPVVNNGKPLDDEHYDNAGTEGWAVV 375 Query: 421 ADWLE------------FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKS 465 D LE + N +I S K + + + G++A+E K + +G +S Sbjct: 376 RDLLEENMKAFIQGEEPTMEIPNDEKMISQFSSRK-YRITSRGKIALERKEEMKKRGLQS 434 Query: 466 TDYSDGLMYTFAEN 479 D +D ++ F + Sbjct: 435 PDRADAIVLAFYKP 448 >gi|254781187|ref|YP_003065600.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] gi|254040864|gb|ACT57660.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] Length = 367 Score = 453 bits (1165), Expect = e-125, Method: Composition-based stats. Identities = 252/359 (70%), Positives = 299/359 (83%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 M R + T+ + EQ+L +++ E LSF NFV+ FFPWG KG PLE FS P WQLEFME Sbjct: 1 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 Query: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 VD HC ++VNN NP +FK AISAGRGIGKTTLNAW++LWL+STRPG+S+IC+ANSETQL Sbjct: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 K TLWAEVSKWLS+LP++HWFEMQSLSLHP+ WY+++L S+GIDSKHY+ CRTYSEER Sbjct: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 PDTFVG HNT+GMA+ NDEASGTPD+IN ILGF TE N NRFWIMTSN RRL+G FY+I Sbjct: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 FN PL+DWKR+QIDTRTVEGID FHEGII+RYGLDSDV R+E+ GQFPQQ++++FIP N Sbjct: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK 359 IEEA++RE D YAPLIMGCDIA EGGD TVVV RRG +IEH+FDWS ++ TN + Sbjct: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 >gi|150390341|ref|YP_001320390.1| hypothetical protein Amet_2579 [Alkaliphilus metalliredigens QYMF] gi|149950203|gb|ABR48731.1| conserved hypothetical protein [Alkaliphilus metalliredigens QYMF] Length = 469 Score = 451 bits (1161), Expect = e-124, Method: Composition-based stats. Identities = 131/494 (26%), Positives = 202/494 (40%), Gaps = 74/494 (14%) Query: 14 KLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNP 73 L D W + + F+ +L F+ P WQ + + + H Sbjct: 7 ALLDNYWDNPVW--FAEDMLGFY--------------PDPWQAKVLMDLAQH-------- 42 Query: 74 NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133 K ++ +G+G+GKT L + + W + TRP VI A + QL LWAE+SKWLS Sbjct: 43 ----PKVSVRSGQGVGKTGLESIAITWYLCTRPFPKVIATAPTRQQLYDVLWAEISKWLS 98 Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193 ++ + + + RT RP+ G H Y M Sbjct: 99 KSKVDKLLRWTKTKIYMNGF------------EERWWATARTAV--RPENMQGFHEDY-M 143 Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 + DEASG D I ILG LT ++ NP + SG FY+ N+ D +K ++ Sbjct: 144 LFVVDEASGVADPIMEAILGTLTGY--ENKLLLCGNPTKTSGTFYDSHNRDRDTYKSHKV 201 Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313 + E + +YG DSDV RV V G FP+ + DS I L + E+A Sbjct: 202 SSMDSPRTSKENIEMLKKKYGADSDVFRVRVLGDFPKGEADSLISLEVTEQAAETVVDIS 261 Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP---- 369 L +G DIA G D T++ R G + L +SK D T I V++ + Sbjct: 262 NAYTLNIGADIARFGDDKTIIAPRIGNRVLDLQQYSKKDTMETAGNILRTVDRLKTQHLQ 321 Query: 370 ---DAIIIDANNTGARTCDYLEM------LGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 I ID + G D L LGY + + +A D E N+ E+ + Sbjct: 322 INKIVIKIDDDGLGGGVTDRLREINRQQSLGYIIVPIKNGSKADDPEHYYNKAAEMWDNI 381 Query: 421 ADWLEF------------ASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKS 465 + L+ L LI+ L + K + V + G + +ESK + + +S Sbjct: 382 RELLDENLSKFLQGEPGVIQLPKDDILIKQLSNRK-YKVDSKGRIELESKDEMKRRIGES 440 Query: 466 TDYSDGLMYTFAEN 479 D +D ++Y+FA + Sbjct: 441 PDRADAVIYSFASD 454 >gi|282848875|ref|ZP_06258265.1| conserved hypothetical protein [Veillonella parvula ATCC 17745] gi|282581380|gb|EFB86773.1| conserved hypothetical protein [Veillonella parvula ATCC 17745] Length = 483 Score = 448 bits (1153), Expect = e-124, Method: Composition-based stats. Identities = 134/483 (27%), Positives = 217/483 (44%), Gaps = 27/483 (5%) Query: 10 ETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNS 69 + ++ + L L+F V +PWGE GTPLE P WQ++ ++ + Sbjct: 3 KHDELIEALGALTHDPLAF---VYFAYPWGEPGTPLENMEGPDEWQIQILKDIGEQLKKG 59 Query: 70 VNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS 129 + + A+++G GIGK+ L +WL+ + +ST + AN+E QL+T W E+S Sbjct: 60 KDLQT--AIQEAVASGHGIGKSALISWLIHFAISTHENTRGVVTANTEGQLRTKTWPELS 117 Query: 130 KWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189 KW ++ K F + ++ + K + +S+ P++F G HN Sbjct: 118 KWHNMFIAKDLFTYTATAIFSSD----------KDYEKTWRIDAIPWSKNSPESFAGLHN 167 Query: 190 TYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDW 248 + ++ DEAS DVI G LT+ N W NP R SG+F E F K W Sbjct: 168 QGNRILVLFDEASAIDDVIWEVTEGALTDANTEIIWCAFGNPTRNSGRFRECFRKYRKFW 227 Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308 +QID+RTV+ + + E + YG DSD +V V G FP FI I ++A + Sbjct: 228 NTYQIDSRTVKISNKTKIEEWLEAYGEDSDFFKVRVRGVFPSASDLQFISTEIADKAQKQ 287 Query: 309 EPCPDP--YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLR-TTNNKISGLVE 365 P + P+I+G D A G D+ +V+R+G ++ L K D I+ + Sbjct: 288 VYKPGQFEHLPVIIGVDPAWTGSDSLEIVMRQGYYMKSLASIPKNDDDWRMAQLIAQFED 347 Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE 425 +Y+ DA+ ID G + LG + ++ D N R + +M +WL Sbjct: 348 EYKADAVFIDM-GYGTGIYSIGKQLGRKWRLIEFGGKSNDP-VYLNMRAYMWGQMKEWLR 405 Query: 426 FASLI--NHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 I N L ++ ++ I+ G + +ESK + +G S + D L TFA Sbjct: 406 EGGSIPPNDQALYDDIVGPEA-IIDKNGRIQLESKKDMKDRGLPSPNKGDALALTFAARV 464 Query: 481 PRS 483 + Sbjct: 465 VKK 467 >gi|150016512|ref|YP_001308766.1| hypothetical protein Cbei_1636 [Clostridium beijerinckii NCIMB 8052] gi|149902977|gb|ABR33810.1| conserved hypothetical protein [Clostridium beijerinckii NCIMB 8052] Length = 470 Score = 438 bits (1125), Expect = e-120, Method: Composition-based stats. Identities = 129/462 (27%), Positives = 201/462 (43%), Gaps = 47/462 (10%) Query: 47 GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106 + P + + M + V + K ++ +G+G+GKT L + +V W + TRP Sbjct: 12 YWDNPVWFAEDMMNFHADKWQSEVLMALAQSPKVSVRSGQGVGKTGLESIVVTWYLCTRP 71 Query: 107 GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166 VI A + QL LWAE+SKWL+ ++ E ++ + S Sbjct: 72 FPKVIATAPTRQQLYDVLWAEISKWLASSKIENLLEWTKTKIYMKGY------------S 119 Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIM 226 + + +T + RP+ G H Y M + DEASG D I ILG LT +M Sbjct: 120 ERWWATAKTAT--RPENMQGFHEDY-MLFVVDEASGVADPIMEAILGTLTGY--ENKLLM 174 Query: 227 TSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCG 286 NP R SG FY+ N+ D +K F++ + E + +Y SDV RV V G Sbjct: 175 CGNPTRTSGTFYDSHNRDRDLYKTFKVSSLESPRTSKDNIEMLKRKYHEGSDVWRVRVEG 234 Query: 287 QFPQQDIDSFIPLNIIEEA-LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHL 345 +FP+ + DS I L E A + + L +G DIA G D +V+ R G + L Sbjct: 235 EFPKGESDSLISLEYAETATITKINNIHNNFTLHIGADIARFGNDESVIAPRIGNKVFDL 294 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPD-------AIIIDANNTGARTCDYLEM------LGY 392 ++K D T I +K++ + I +D + G D L LGY Sbjct: 295 LTYTKKDTMETTGNILRATDKFKNEYKHINKVKIRVDDDGLGGGVTDRLREVIRQEGLGY 354 Query: 393 HVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE------------FASLINHSGLIQNLK 440 V + +A D E ++ E+ M D LE L N+ LI+ L Sbjct: 355 EVMPIKNGSKANDEEHYSDKSAEMWGNMRDILEENFTNFVQGKEPTIELPNNDKLIKQLS 414 Query: 441 SLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479 + K F + + G + +E K + + +S D +D ++Y+FAEN Sbjct: 415 NRK-FRIDSKGRIDLEKKEEMKKRIGESPDLADAVIYSFAEN 455 >gi|209901239|ref|YP_002290878.1| putative terminase B [Clostridium phage phiCD27] gi|199612120|gb|ACH91293.1| putative terminase B [Clostridium phage phiCD27] Length = 469 Score = 436 bits (1121), Expect = e-120, Method: Composition-based stats. Identities = 134/493 (27%), Positives = 210/493 (42%), Gaps = 74/493 (15%) Query: 15 LFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPN 74 L D W + + F+ +L+F WQ + + + Sbjct: 8 LLDCYWDNPVW--FAEDMLNF--------------KADKWQSDVLMAL------------ 39 Query: 75 PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSL 134 + K +I +G+G+GKT L + +W +STRP V+ A + QL LWAE++KWLS Sbjct: 40 AQTPKVSIRSGQGVGKTGLESIATVWYLSTRPFPKVVATAPTRQQLYDVLWAEIAKWLSN 99 Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 + E ++ + + + RT +P+ G H Y M Sbjct: 100 SKVEKLLEWTKTKVYMKGF------------EERWWATARTAV--KPENMQGFHEDY-ML 144 Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQID 254 + DEASG D I ILG L+ A ++ NP R SG FY+ N+ D +K F++ Sbjct: 145 FVVDEASGVADPIMEAILGTLS--GAENKLLLCGNPTRTSGTFYDSHNRDRDLYKTFKVS 202 Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP 314 + E + +Y SD RV V G+FP+ + DS I L +E + RE Sbjct: 203 SLDSPRTSKDNIEMLKRKYHEGSDPWRVRVLGEFPKGESDSLISLEAVETSTIREVNISN 262 Query: 315 YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP----- 369 L +G DIA G D T++ R G + L +SK D T I V+K++ Sbjct: 263 DYILNIGADIARYGDDETIIAPRIGGKVFDLLTYSKKDTMETVGNILRAVDKFKNMYHQI 322 Query: 370 --DAIIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421 I D + GA D L E L Y V + A++ + N+ +E+ M Sbjct: 323 NRVKIKTDDDGLGAGVTDRLKEVIRHERLKYEVIPIQNGSSAIEKDKYYNKASEMWDNMR 382 Query: 422 DWLE------------FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKST 466 + L+ L N LI+ L + K + V + G++ IESK + + +S Sbjct: 383 EELDANLSSFIQNKEAIIQLPNDDKLIKQLSNRK-YTVDSKGKIQIESKKEMKKRIGESP 441 Query: 467 DYSDGLMYTFAEN 479 D +D ++Y+FAEN Sbjct: 442 DRADAVIYSFAEN 454 >gi|290968649|ref|ZP_06560187.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str. 28L] gi|290781302|gb|EFD93892.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str. 28L] Length = 487 Score = 434 bits (1117), Expect = e-119, Method: Composition-based stats. Identities = 142/486 (29%), Positives = 232/486 (47%), Gaps = 37/486 (7%) Query: 8 NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67 + E Q L L FV F W + L+G P++WQ++ ++ V Sbjct: 5 DIELLQALGSLASDP------VAFVYFAFDWDSE--ELKG-QNPQTWQIKTLKEVGEGL- 54 Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 + A ++G GIGK+ L AWL+LW +STRP + AN+ TQL+T WAE Sbjct: 55 -----SLSTALQHATASGHGIGKSALVAWLILWAISTRPDTRGVVTANTATQLETKTWAE 109 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 +SKW L K +F + S ++ C + + +S +R ++F G Sbjct: 110 LSKWYHLFRGKKFFTLTSTAI----------FCRQEGHERTWRIDAIPWSVDRTESFAGL 159 Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 HN + +I DEAS + I G LT+++ W++ NP R +G+F++ F+K Sbjct: 160 HNQGNRLLLIFDEASAIDNKIWEVAEGALTDKDTEILWLVFGNPTRSTGRFFDCFHKYKK 219 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 W +ID+RTV+ + + + I YG+DSD +V V G+FP FI I+ A Sbjct: 220 SWITQKIDSRTVDISNKTQLQKWIQTYGIDSDFVKVRVLGEFPDTSDTQFISTAIVRTAW 279 Query: 307 NREP---CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLR-TTNNKISG 362 R P +AP I+G D A GGD+TV+ LR+G E L ++ + D +++ Sbjct: 280 ERRPLRTAEYDFAPCIIGMDPAWTGGDSTVIFLRQGFFSEKLAEYKQNDNDGVMAARLAE 339 Query: 363 LVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422 +KY DA+ ID G + +G V +++ + N+R E+ M + Sbjct: 340 FEDKYHADAVFID-KGYGTGIYSFGVTMGRQWRLVSFAEKS-GAQAYANKRAEMWGNMKE 397 Query: 423 WLEFASLINH-SGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAE 478 WL+ +I GLI+ L + ++F + GE+ +E K + +G +S + +D L TFA Sbjct: 398 WLQEGGVIPQVDGLIEELTAPQAF-INARGEIQLEKKEDMKKRGIESPNMADALALTFAY 456 Query: 479 NPPRSD 484 + + Sbjct: 457 PVLQRN 462 >gi|282598712|ref|YP_003358792.1| putative phage terminase B protein [Enterococcus phage phiEf11] gi|300860603|ref|ZP_07106690.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11] gi|307292389|ref|ZP_07572245.1| hypothetical protein HMPREF9509_02682 [Enterococcus faecalis TX0411] gi|258598082|gb|ACV83339.1| putative phage terminase B protein [Enterococcus phage phiEf11] gi|300849642|gb|EFK77392.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11] gi|306496518|gb|EFM66079.1| hypothetical protein HMPREF9509_02682 [Enterococcus faecalis TX0411] gi|315146097|gb|EFT90113.1| conserved hypothetical protein [Enterococcus faecalis TX2141] Length = 484 Score = 433 bits (1113), Expect = e-119, Method: Composition-based stats. Identities = 122/490 (24%), Positives = 216/490 (44%), Gaps = 51/490 (10%) Query: 33 LHFFPWGEKGTPLEGF-SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91 F P+ + G ++ + P ++ + + + + V + + K ++ +G+G+GKT Sbjct: 3 KEFIPFADIGAAIDYYYDKPVAFCQDILHLDPDEWQDKVLDDLAKFPKVSVRSGQGVGKT 62 Query: 92 TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPA 151 L A +LW ++ RP VI A + QL LWAEV+KWL+ K + ++ Sbjct: 63 ALEAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLNNSLIKDLLKWTKTKIYMV 122 Query: 152 PWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGI 211 DS+ + RT + +P+ G H + M I+ DEASG D I I Sbjct: 123 G------------DSERWFATARTAT--KPENMQGFHEDH-MLIVVDEASGVADPIMEAI 167 Query: 212 LGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIA 271 LG L+ + +M NP + G FY+ N D ++ ++ + + + + +I Sbjct: 168 LGTLSGFD--NKLLMCGNPNNIEGVFYDSHNTDRDKYRTHKVSSYDSKRTNKENIQMLID 225 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL---IMGCDIAEEG 328 +YG +SDV RV + G+FP+ +DSFI L I+E A + + +G D+A G Sbjct: 226 KYGENSDVARVRIYGEFPKGALDSFISLEIVEFAKDINISDSELKHVREGHIGVDVARFG 285 Query: 329 GDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI----SGLVEKYRPDA---IIIDANNTGA 381 D+T+V R G +SK D T ++ +++ Y I +D G Sbjct: 286 DDSTIVFPRIGAKALPFEKYSKQDTMQTTGRVLKAAKRMMDDYPTIKKVFIKVDDTGVGG 345 Query: 382 RTCDYLEM------LGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE---------- 425 D L+ L Y V V + + D ++ N+ T++ + + LE Sbjct: 346 GVTDRLKEVISDEKLPYEVIPVNNGESSTD-DYYANKGTQIWGDVKELLEQNISNSINGQ 404 Query: 426 --FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 L +++ LI+ L + K F + + G++ +ESK + + S D +D L F E Sbjct: 405 GPTIELPDNANLIKELSTRK-FKMTSNGKIRLESKEDMKKRNVGSPDIADALTLAFYEPF 463 Query: 481 PRSDMDFGRC 490 ++ + Sbjct: 464 RPEPINVKKA 473 >gi|257883493|ref|ZP_05663146.1| conserved hypothetical protein [Enterococcus faecium 1,231,502] gi|294614775|ref|ZP_06694675.1| hypothetical protein EfmE1636_0865 [Enterococcus faecium E1636] gi|294622490|ref|ZP_06701512.1| conserved hypothetical protein [Enterococcus faecium U0317] gi|257819151|gb|EEV46479.1| conserved hypothetical protein [Enterococcus faecium 1,231,502] gi|291592387|gb|EFF23996.1| hypothetical protein EfmE1636_0865 [Enterococcus faecium E1636] gi|291598037|gb|EFF29147.1| conserved hypothetical protein [Enterococcus faecium U0317] Length = 471 Score = 429 bits (1104), Expect = e-118, Method: Composition-based stats. Identities = 125/484 (25%), Positives = 208/484 (42%), Gaps = 51/484 (10%) Query: 34 HFFPWGEKGTPLEGF-SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTT 92 F P+ + G+ ++ + P ++ + + + +V N E K ++ +G+G+GKT Sbjct: 4 EFIPFADIGSAIDYYYDKPVAFCQDILHLNPDEWQENVLNDLAEFSKVSVRSGQGVGKTA 63 Query: 93 LNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAP 152 L A +LW ++ RP VI A + QL LWAEV+KWL+ K+ + ++ Sbjct: 64 LEAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLNDSLIKNLLKWTKTKIYMVG 123 Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGIL 212 DS+ + RT + +P+ G H + M I+ DEASG D I IL Sbjct: 124 ------------DSERWFATARTAT--KPENMQGFHEDH-MLIVVDEASGVSDPIMEAIL 168 Query: 213 GFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR 272 G L+ + +M NP + G FY+ N D ++ ++ + + + E I+ + Sbjct: 169 GTLSGFD--NKLLMCGNPNNIEGVFYDSHNSDRDKYRVHKVSSYDSKRTNKDNIEMILKK 226 Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD---PYAPLIMGCDIAEEGG 329 YG +SDV RV + G+FP+ +DSFI L +E A ++ +G D+A G Sbjct: 227 YGKESDVARVRIFGEFPKGALDSFISLETVELATEKQISDSLVNKTTVAHIGVDVARYGD 286 Query: 330 DNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS----GLVEKYRPD---AIIIDANNTGAR 382 D+T++ R +SK T + L+ +Y I +D G Sbjct: 287 DSTILFPRIATRALEYEKYSKRSTMETTGYVINMAKNLMSQYPSIDKVMIKVDDTGVGGG 346 Query: 383 TCDYLEML------GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF---------- 426 D LE L + V+ V + D +F N T+L + + LE Sbjct: 347 VTDRLEELIEDKHYPFEVFGVNNGSTSED-DFYDNLGTQLWGNIKEMLEENMTANLNGEQ 405 Query: 427 --ASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPP 481 L + S LI+ L + K F + + + +ESK + + S D +D L F E P Sbjct: 406 PVIELPSDSSLIKELSTRK-FKMTSRSRIRLESKDDMKKRNIGSPDIADALALAFYEPPS 464 Query: 482 RSDM 485 Sbjct: 465 HYQF 468 >gi|261208032|ref|ZP_05922709.1| conserved hypothetical protein [Enterococcus faecium TC 6] gi|289567088|ref|ZP_06447483.1| conserved hypothetical protein [Enterococcus faecium D344SRF] gi|260077749|gb|EEW65463.1| conserved hypothetical protein [Enterococcus faecium TC 6] gi|289161103|gb|EFD09008.1| conserved hypothetical protein [Enterococcus faecium D344SRF] Length = 471 Score = 429 bits (1102), Expect = e-118, Method: Composition-based stats. Identities = 125/484 (25%), Positives = 207/484 (42%), Gaps = 51/484 (10%) Query: 34 HFFPWGEKGTPLEGF-SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTT 92 F P+ + G ++ + P ++ + + + +V N E K ++ +G+G+GKT Sbjct: 4 EFIPFADIGAAIDYYYDKPVAFCQDILHLNPDEWQENVLNDLAEFSKVSVRSGQGVGKTA 63 Query: 93 LNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAP 152 L A +LW ++ RP VI A + QL LWAEV+KWL+ K+ + ++ Sbjct: 64 LEAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLNDSLIKNLLKWTKTKIYMVG 123 Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGIL 212 DS+ + RT + +P+ G H + M I+ DEASG D I IL Sbjct: 124 ------------DSERWFATARTAT--KPENMQGFHEDH-MLIVVDEASGVSDPIMEAIL 168 Query: 213 GFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR 272 G L+ + +M NP + G FY+ N D ++ ++ + + + E I+ + Sbjct: 169 GTLSGFD--NKLLMCGNPNNIEGVFYDSHNSDRDKYRVHKVSSYDSKRTNKDNIEMILKK 226 Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD---PYAPLIMGCDIAEEGG 329 YG +SDV RV + G+FP+ +DSFI L +E A ++ +G D+A G Sbjct: 227 YGKESDVARVRIFGEFPKGALDSFISLETVELATEKQISDSLVNKTTVAHIGVDVARYGD 286 Query: 330 DNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS----GLVEKYRPD---AIIIDANNTGAR 382 D+T++ R +SK T + L+ +Y I +D G Sbjct: 287 DSTILFPRIATRALEYEKYSKRSTMETTGYVINMAKNLMSQYPSIDKVMIKVDDTGVGGG 346 Query: 383 TCDYLEML------GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF---------- 426 D LE L + V+ V + D +F N T+L + + LE Sbjct: 347 VTDRLEELIEDKHYPFEVFGVNNGSTSED-DFYDNLGTQLWGNIKEMLEENMTANLNGEQ 405 Query: 427 --ASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPP 481 L + S LI+ L + K F + + + +ESK + + S D +D L F E P Sbjct: 406 PVIELPSDSSLIKELSTRK-FKMTSRSRIRLESKDDMKKRNIGSPDIADALALAFYEPPS 464 Query: 482 RSDM 485 Sbjct: 465 HYQF 468 >gi|228950291|ref|ZP_04112468.1| hypothetical protein bthur0007_63570 [Bacillus thuringiensis serovar monterrey BGSC 4AJ1] gi|228809453|gb|EEM55897.1| hypothetical protein bthur0007_63570 [Bacillus thuringiensis serovar monterrey BGSC 4AJ1] Length = 495 Score = 427 bits (1099), Expect = e-117, Method: Composition-based stats. Identities = 121/505 (23%), Positives = 200/505 (39%), Gaps = 83/505 (16%) Query: 13 QKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNN 72 +L ++ D + +F +L P WQ E + + H Sbjct: 19 TQLLEIYVDDPV--AFVEDILEV--------------EPDPWQKEVLNDIANHSH----- 57 Query: 73 PNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWL 132 ++ +G+G+GKT + +W+ +W + RP +IC A ++ QL LWAE++KWL Sbjct: 58 -------VSVRSGQGVGKTAMESWICIWFLCCRPYPKIICTAPTKQQLYDVLWAEIAKWL 110 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 + K + ++ + + +T + RP+ G H Y Sbjct: 111 NSSQVKDLLKWTKTKIYMKGF------------EDRWFATAKTAT--RPENMQGFHEDY- 155 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 M I DEASG D I ILG L+ + M NP + SG F++ NK +K + Sbjct: 156 MLFIADEASGIADDIMEAILGTLS--GSENKLFMCGNPTKTSGVFFDSHNKDRALYKSHK 213 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP-- 310 + + E + +YG SDV RV V G+FP+ + D+FI L E A RE Sbjct: 214 VSSADSPRTSKKNIEMLKKKYGEGSDVYRVRVEGEFPRGEADAFISLETAEAARMREVYK 273 Query: 311 ---------------CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRT 355 A + +GCD+A G D T++ RRG + L + D Sbjct: 274 VEVIENEEEESTVKEIIPDTAVVEIGCDVARFGSDETIIATRRGWKVLPLQVHHQRDTMY 333 Query: 356 TNNKISGLVEKY--------RPDAIIIDANNTGARTCDYLEM------LGYHVYRVLGQK 401 + + +KY + I ID G D L+ V + Sbjct: 334 VSGLLVQEAKKYFSWCERTGKRIPIRIDDTGVGGGVTDRLKEVVAENDYPIDVIPINFAS 393 Query: 402 RAVDLEFCRNRRTELHVKMAD-WLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK-- 458 + + ++ D LEF +L + LI L S++ + + + G + IE K Sbjct: 394 K--GNAEYACIVSVMYGHFKDNCLEFVALPDDEDLIAQL-SVRKYQINSDGRIKIEPKKA 450 Query: 459 -RVKGAKSTDYSDGLMYTFAENPPR 482 + +G KS D ++ ++ FA P+ Sbjct: 451 MKDRGLKSPDRAEAVVMAFAPFYPK 475 >gi|332981151|ref|YP_004462592.1| hypothetical protein Mahau_0567 [Mahella australiensis 50-1 BON] gi|332698829|gb|AEE95770.1| hypothetical protein Mahau_0567 [Mahella australiensis 50-1 BON] Length = 461 Score = 424 bits (1090), Expect = e-116, Method: Composition-based stats. Identities = 133/448 (29%), Positives = 198/448 (44%), Gaps = 50/448 (11%) Query: 49 SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108 + P WQ E ++ + + + A+ +G G+GKT L AW +LW + TRP Sbjct: 25 AEPDDWQAETLQALADN------------PRVAVRSGHGVGKTALEAWALLWFLFTRPYP 72 Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPN-KHWFEMQSLSLHPAPWYSDVLHCSLGIDSK 167 + C A + QL LWAE SKWL P K +FE Q + Sbjct: 73 KIPCTAPTREQLHDILWAEASKWLERAPALKPYFEWQKTRI------------VQKQYPG 120 Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMT 227 + RT +P+ G H + + I DEASG D I I G LT +A +M Sbjct: 121 RWFATART--SNKPENMAGFHEEH-LLFIIDEASGIADNIFETIEGALTTSDAK--LLMC 175 Query: 228 SNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQ 287 NP + SG F++ F K + ++ + + + E + +Y DSDV RV V G+ Sbjct: 176 GNPTKNSGVFHDAFFKDRSLYWTRKVSCLDSQRVTLEYAERLKRKYHEDSDVYRVRVLGE 235 Query: 288 FPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347 FP+ + D+FI L+I+E A R+ PD L +G D+A G D TV+ R G + +L Sbjct: 236 FPKAEPDTFISLDIVEAATMRDVEPD--GVLEIGVDVARFGDDETVLAARAGLKLVYLKA 293 Query: 348 WSKTDLRTTNNKISGLVEKY-----RPD-AIIIDANNTGARTCDYLEM------LGYHVY 395 ++K D TT L + +P I ID + G D L V Sbjct: 294 YTKQDTMTTAGYAIALAKDLMKECGKPKCTIKIDDDGVGGGVTDRCREVVREEKLYIDVI 353 Query: 396 RVLGQKRAVDLEFCRNRRTELHVKMADWL--EFASLINHSGLIQNLKSLKSFIVPNTGEL 453 D E N TE + D L E A LIN LI L + K + + + G++ Sbjct: 354 DCHNGGAPEDKEHYENWGTEAWAYLRDLLQDEQAELINDEDLIGQLTTRK-YRITSKGKI 412 Query: 454 AIESK---RVKGAKSTDYSDGLMYTFAE 478 A+ESK + +G S D +D ++ +A+ Sbjct: 413 ALESKDEMKRRGLMSPDRADAVVLAYAK 440 >gi|308069786|ref|YP_003871391.1| hypothetical protein PPE_03030 [Paenibacillus polymyxa E681] gi|305859065|gb|ADM70853.1| Conserved hypothetical protein [Paenibacillus polymyxa E681] Length = 452 Score = 416 bits (1070), Expect = e-114, Method: Composition-based stats. Identities = 124/456 (27%), Positives = 186/456 (40%), Gaps = 58/456 (12%) Query: 51 PRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISV 110 P WQ + ++ NNP + ++ +G+G+GKT L A LW +S P V Sbjct: 6 PDDWQASTL-------MDLANNP-----RVSVRSGQGVGKTGLEAATALWFLSCFPYPKV 53 Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170 IC A + QL LWAE++KW S P + W ++ + + Sbjct: 54 ICTAPTRQQLHDVLWAEINKWQSKSP---------VLKRILKWTKTKIYM--KNYEERWF 102 Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230 RT + +P+ G H Y M I DEASG D I ILG L+ +M NP Sbjct: 103 ATARTAT--KPENMQGLHEDY-MLFIVDEASGVADPIMEAILGTLSGE--FNKILMCGNP 157 Query: 231 RRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290 + SG FY+ NK D+K ++ + +YG SDV RV V G+FP+ Sbjct: 158 TKTSGVFYDSHNKDRADYKTRKVSCLDSPRTSKDNIAMLKRKYGEGSDVWRVRVEGEFPR 217 Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK 350 D+FI L + E A L +G D+A G D T + GP I K Sbjct: 218 GGSDTFISLEVAEFAAKEVKLEPTGDMLTIGVDVARFGDDETSMFAGIGPRIVGEHHHFK 277 Query: 351 TDLRTTNNKISGLVEKYR-------PDAIIIDANNTGARTCDYL------EMLGYHVYRV 397 T + L ++ + I +D + G D L E L Y + + Sbjct: 278 KGTMVTAGWVINLAKELQVAHPYLNRIRIRVDDSGVGGGVTDRLSEIVAEEGLPYEIIPI 337 Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLE------------FASLINHSGLIQNLKSLKSF 445 ++D E N TE+ + + LE L + LI L + K + Sbjct: 338 NNGSSSLD-EHYGNLVTEMWASIKEQLEQNMSNFMNGDSSILQLPDDDVLITQLTARK-W 395 Query: 446 IVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAE 478 + + G++ +ESK + +G KS D +D + TF E Sbjct: 396 NMTSKGKMLLESKKDMKKRGLKSPDRADAFVLTFGE 431 >gi|54302246|ref|YP_132239.1| terminase large subunit [Photobacterium profundum SS9] gi|46915667|emb|CAG22439.1| hypothetical protein PBPRB0566 [Photobacterium profundum SS9] Length = 513 Score = 414 bits (1064), Expect = e-113, Method: Composition-based stats. Identities = 132/515 (25%), Positives = 213/515 (41%), Gaps = 42/515 (8%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEK------------GTPLEGF 48 M+++ N E Q D+ + L FV++ +PW + + Sbjct: 1 MAKKEEINYEH-QLAIDIGGFYDDPL---GFVMYAYPWDTDPDLQIVKLPEPWASKYDSV 56 Query: 49 SAPRSWQLE----FMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMST 104 P +W E EV+ + N V+ + F +IS+G GIGK+ ++WL+ ++MST Sbjct: 57 YGPDAWFCEMCDQLQEVIRKNDFNGVDPV--DAFLYSISSGHGIGKSCASSWLIHFVMST 114 Query: 105 RPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI 164 RP + +N+ QL+T W E+ KW L NKHWF + + ++ D Sbjct: 115 RPNSKGVVTSNTSEQLRTKTWGELGKWTKKLINKHWFVYNNGKGNMNFYHKDY------- 167 Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAI-INDEASGTPDVINLGILGFLTERNANRF 223 ++ + +T EE ++F G H + DEAS PD I G LT+ F Sbjct: 168 -AETWRVDAQTCREENSESFAGLHCASSTPWYLFDEASAVPDKIWEVAEGGLTDGEP--F 224 Query: 224 WIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283 W + NP R SG+F E + + W R QID+ TV+ + + YG DSD RV Sbjct: 225 WFVFGNPTRNSGRFRECWRRFRQRWNRKQIDSSTVQVTNKKKISEWESDYGEDSDFYRVR 284 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRG--PV 341 V G FP + I ++E A++R P +P +M D+A GGDN V R G Sbjct: 285 VKGVFPSASSNQKISGALLEAAMSRTAHVIPGSPRVMSLDVARGGGDNCVFRFRHGLNGG 344 Query: 342 IEHLFDWSKT---DLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVL 398 + + D L +++PDA ID G D + LG++ + Sbjct: 345 VRKKVTLPGSEYRDSMKLAAMAVQLCSEFKPDAFFIDETGVGGPVGDRIRQLGFNCIGIN 404 Query: 399 GQKRAVDLEFCRNRRTELHVKMADWLEFASLIN-HSGLIQNLKSLKSFIVPNTGELAIES 457 +A D N R ++ + +WL+ ++ GL+ + +++ E+ I Sbjct: 405 FASKAPDP-HYANMRAYMYHQWGEWLKAGGSLHYDEGLLTEVGAIEYTHDRKDREILIPK 463 Query: 458 K--RVKGAKSTDYSDGLMYTFAENPPRSDMDFGRC 490 + STD D A + Sbjct: 464 DVIKKAIGISTDDGDACALLHAYPVAPRQQGYNSA 498 >gi|323486060|ref|ZP_08091391.1| hypothetical protein HMPREF9474_03142 [Clostridium symbiosum WAL-14163] gi|323400627|gb|EGA92994.1| hypothetical protein HMPREF9474_03142 [Clostridium symbiosum WAL-14163] Length = 476 Score = 409 bits (1051), Expect = e-112, Method: Composition-based stats. Identities = 125/473 (26%), Positives = 193/473 (40%), Gaps = 51/473 (10%) Query: 47 GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106 P + E + E K AI +G+G+GKT + A +LW + P Sbjct: 20 YRKNPVLFAQEVLLFEPDDWQKQALMDLAESPKVAIKSGQGVGKTGMEAVALLWFLCCYP 79 Query: 107 GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166 ++ A ++ QL LW+EVSKW+S P L W ++ Sbjct: 80 YPRIVATAPTKQQLHDVLWSEVSKWMSKSP---------LLSDILKWTKTYIYMVGN--E 128 Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIM 226 K + + RT + +P+ G H M I DEASG D I ILG L+ AN +M Sbjct: 129 KRWFAVARTAT--KPENMQGFHED-NMLFIVDEASGVADPIMEAILGTLS--GANNKLLM 183 Query: 227 TSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCG 286 NP R SG FY+ FN ++ + + + + E +I +YG DS+V V V G Sbjct: 184 CGNPTRTSGTFYDAFNVDRSIYRCHTVSSADSKRTNKQNIESLIRKYGKDSNVVLVRVFG 243 Query: 287 QFPQQDIDSFIPLNIIEEALNREPCPD-PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHL 345 +FP+Q+ D FI L+I+E + D P + G D+A G D TV+ G I Sbjct: 244 EFPKQEDDVFIALSIVEHCCMLDLPDDVPIKRISFGVDVARYGSDETVIAKNVGGRITLP 303 Query: 346 FDWSKTDLRTTNNKISGLVEK-------YRPD-AIIIDANNTGARTCDYLEMLG------ 391 + L TT KI L + YR I ID G D LE + Sbjct: 304 VSFRGQSLMTTVGKIVQLYRQAITEFPRYRGKIYINIDDCGLGGGVTDRLEEVKQEEKLT 363 Query: 392 -YHVYRVLGQKRAVDL----------EFCRNRRTELHVKMADWL--EFASLINHSGLIQN 438 + V + + + N T L + D L E SL N + L+ Sbjct: 364 RMVIVPVNAAGKVPEETLGDGKQKACDIYDNMTTYLWGTVKDALMMEEVSLENDNELVAQ 423 Query: 439 LKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPRSDMDFG 488 + + + + + G++ +ESK + +G S D +D + + + + + G Sbjct: 424 F-TCRKYRLTSRGKMLLESKEEMKKRGIDSPDRADAVALSCYQ---KKTFNIG 472 >gi|332976102|gb|EGK12970.1| hypothetical protein HMPREF9374_1123 [Desmospora sp. 8437] Length = 462 Score = 406 bits (1043), Expect = e-111, Method: Composition-based stats. Identities = 122/459 (26%), Positives = 195/459 (42%), Gaps = 39/459 (8%) Query: 47 GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106 P + E ++ + + + A+ AG G+GKT AW VLW + TRP Sbjct: 17 YIRKPGLFVREVLKAEPDEWQDIALQALADNQRVAVRAGHGVGKTATEAWAVLWFLLTRP 76 Query: 107 GISVICLANSETQLKTTLWAEVSKWLSLLPN-KHWFEMQSLSLHPAPWYSDVLHCSLGID 165 + C A ++ QL LW E++KWL P + E Q + Sbjct: 77 FPKIPCTAPTKPQLMDVLWPEIAKWLMNAPELAPYVEWQKTR------------VVMKQY 124 Query: 166 SKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWI 225 + + RT +P+ G H + + + DEASG + I I G LT + + Sbjct: 125 EERWFATARTS--NKPENMAGFHEEH-LLFVIDEASGVDNAIFETIDGALTTAGSK--LV 179 Query: 226 MTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC 285 M NP R +G FY+ F++ D + ++I + + + +YG DSD+ RV V Sbjct: 180 MFGNPTRTNGVFYDAFHQDRDLYWTYKISCLDSKMASKDYARNMARKYGEDSDIYRVRVQ 239 Query: 286 GQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHL 345 G+FPQ D DSFIPL ++E+A R+ L +G D+A G D TV+ R GPV L Sbjct: 240 GEFPQGDPDSFIPLELVEDARVRDLEWIDEDELHIGVDVARFGSDETVLAARIGPVAFRL 299 Query: 346 FDWSKTD-LRTTNNKIS----GLVEKYRPDA--IIIDANNTGARTCDYLEM------LGY 392 + T ++ L+E++R D + +D G D L+ L Sbjct: 300 DRYGGRTPTTETVGRVLALARELMEEHRRDYAVVKVDDTGVGGGVTDQLQEIVAEEGLNI 359 Query: 393 HVYRVLGQKRA-VDLEFCRNRRTELHVKMADWLEFASL---INHSGLIQNLKSLKSFIVP 448 V D + + TE + D + + I+ LI L + K + Sbjct: 360 DVIPCNNGATPEHDPDHYHDWGTESWGTLLDRFKAGEIALKIDDEDLIGQLTTRKK-EMT 418 Query: 449 NTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPRSD 484 + G++ +ESK + +G +S D +D L+ FAE + Sbjct: 419 SKGKIKLESKEKMKKRGQRSPDRADALVLAFAEAATETG 457 >gi|289578588|ref|YP_003477215.1| hypothetical protein Thit_1395 [Thermoanaerobacter italicus Ab9] gi|289528301|gb|ADD02653.1| conserved hypothetical protein [Thermoanaerobacter italicus Ab9] Length = 460 Score = 398 bits (1023), Expect = e-109, Method: Composition-based stats. Identities = 123/464 (26%), Positives = 188/464 (40%), Gaps = 57/464 (12%) Query: 52 RSW--QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGIS 109 W Q E ++ V H + A+ A G+GKT + AW+ LW + T Sbjct: 30 DPWEKQEEILKAVRDHK------------RVAVRACHGVGKTKVAAWVALWFLYTHHNSK 77 Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169 VI A + Q++ LW E+ + P VL + + + + Sbjct: 78 VITTAPTWHQVENLLWREIHAAHAASR--------------IPLGGKVLQTQIELGEQWF 123 Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229 S ++P+ F G H + + I+ DEASG GFLT A ++ N Sbjct: 124 ---ALGLSTDKPERFQGFHAEHILLIV-DEASGVEQYTFDAAEGFLTSIGAK--LLLIGN 177 Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVE-----------GIDPSFHEGIIARYGLDSD 278 P +LSG+FY F PL + + I + P + E ++G DS Sbjct: 178 PTQLSGEFYNAFRSPL--YHKIHISAFDSPNLKAGKIVRPYLVTPEWVEDKRLKWGEDSP 235 Query: 279 VTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRR 338 + V G+FP+Q D+ IPL IE A R + P+ +G D+A G D TV++LRR Sbjct: 236 LWYSRVLGEFPEQGNDTLIPLAWIEAAQQRWHMTEAGEPVEIGADVARYGTDTTVIMLRR 295 Query: 339 GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVL 398 G E ++ D K+ +K + I ID GA D L+ GY V + Sbjct: 296 GDKAEIVYQLRGQDTMEVTGKVIDAFKKTGANVIKIDVVGIGAGVVDRLKEQGYPVQGLN 355 Query: 399 GQKRAVDLEFCRNRRTELHVKMADWLEFA--SLINHSGLIQNLKSLKSFIVPNTGELAIE 456 + A D N+R E + + + + ++ L L SLK + + G + IE Sbjct: 356 VGESATDKGRFVNKRAEWYWALRERFQEGTIAIPPDDELASQLASLK-YKFDSRGRIQIE 414 Query: 457 SK---RVKGAKSTDYSDGLMYTF----AENPPRSDMDFGRCPSY 493 SK R +G S D +D LM F + D R S+ Sbjct: 415 SKEELRRRGLPSPDKADALMLAFSSTGMKPVDEKIKDIFRRASF 458 >gi|255282256|ref|ZP_05346811.1| conserved hypothetical protein [Bryantella formatexigens DSM 14469] gi|255267204|gb|EET60409.1| conserved hypothetical protein [Bryantella formatexigens DSM 14469] Length = 506 Score = 388 bits (996), Expect = e-105, Method: Composition-based stats. Identities = 111/460 (24%), Positives = 184/460 (40%), Gaps = 48/460 (10%) Query: 47 GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106 P + E ++ E + A+ +G+G+GKT + A VLW +S Sbjct: 34 YRKDPVLYAREVLQFEPDEWQRDALMDLAEESRVAVKSGQGVGKTGIEAVAVLWFLSCFR 93 Query: 107 GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166 V+ A + QL LW+E++KW P L W ++ Sbjct: 94 YARVVATAPTRQQLHDVLWSEIAKWQERSP---------LLKAILRWTKTYVYV--KGYE 142 Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIM 226 K + + RT + +P+ G H M I DEASG D I +LG L+ N +M Sbjct: 143 KRWFAVARTAT--KPENMQGFHED-NMLFIVDEASGVADPIMEAVLGTLS--GGNNKLLM 197 Query: 227 TSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCG 286 NP R +G FY+ F K + + + D + + +I +YG DS++ RV V G Sbjct: 198 CGNPTRTTGTFYDAFTKDRSIFACHTVSSLDSSRTDKNNIDALIRKYGEDSNLVRVRVKG 257 Query: 287 QFPQQDIDSFIPLNIIEEALNRE---PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIE 343 FP+QD D FI +I++ +R+ P A +I+G D+A G D TV+ I+ Sbjct: 258 LFPKQDDDVFISQELIDQCTSRQYELPESRGMAQVILGVDVARYGNDETVIYRNFKGRIK 317 Query: 344 HLFDWSKTDLRTTNNKISGLV----EKYRP----DAIIIDANNTGARTCDYLEMLGYH-- 393 + + +L T I + Y I ID G D L + Sbjct: 318 MVRNRRGQNLMATAGDIVREYRHIVDGYPGFDGKIYINIDDTGLGGGVTDRLREVKKEQK 377 Query: 394 -----VYRVLGQKR--------AVDLEFCRNRRTELHVKMADWLEFASLI--NHSGLIQN 438 + + ++ E+ N T + + + LE ++ + + + Sbjct: 378 LTRMVIIPINAAEKIETDTKAGKEAAEYYNNLTTHMWAAVRELLEKREIVIEDDAETVAQ 437 Query: 439 LKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYT 475 L K + V + G++ IE K + +G S D +D L + Sbjct: 438 LSMRK-YTVASNGKIEIEPKKEMKKRGLDSPDRADALTLS 476 >gi|167767949|ref|ZP_02440002.1| hypothetical protein CLOSS21_02492 [Clostridium sp. SS2/1] gi|167710278|gb|EDS20857.1| hypothetical protein CLOSS21_02492 [Clostridium sp. SS2/1] gi|291560988|emb|CBL39788.1| hypothetical protein CL2_30180 [butyrate-producing bacterium SSC/2] Length = 473 Score = 384 bits (987), Expect = e-104, Method: Composition-based stats. Identities = 121/488 (24%), Positives = 193/488 (39%), Gaps = 68/488 (13%) Query: 11 TEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSV 70 + + + + + F VL F+ P WQ E + + Sbjct: 7 HDFLVESIPLWQQNPVQFFEEVLFFY--------------PDEWQKEAAFALRDN----- 47 Query: 71 NNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSK 130 K I +G+G+GKT A +LW +S V+ A + QL LWAEVSK Sbjct: 48 -------SKVTIKSGQGVGKTGFEAATLLWFLSCFENARVVATAPTLHQLNDVLWAEVSK 100 Query: 131 WLSLLPN-KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189 W S P K + + + + + RT + P+ G H Sbjct: 101 WQSKSPLLKEILQWTKTKISMIG------------SKERWYAVARTATT--PENMQGFHE 146 Query: 190 TYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWK 249 M I DEASG D I ILG LT +N ++ NP + SG FY+ + Sbjct: 147 D-NMLFIVDEASGVADPIMEAILGTLT--GSNNKLLLCGNPTKASGTFYDSHTSDRKLYY 203 Query: 250 RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309 +++ + + + +I +YG +S+V RV V G FP+QD D ++PL ++E ++ E Sbjct: 204 CITVNSAESKRTNKDNIDSLIRKYGEESNVVRVRVKGLFPKQDDDVYMPLEMLEASIILE 263 Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE---- 365 P P +G D+A G D+TV+ I DL T + Sbjct: 264 EIP-PADICTLGVDVARFGDDDTVIARNMNNKITLEKIRHGQDLMKTVGDVVVECRNIKE 322 Query: 366 --KY-RPDAIIIDANNTGARTCDYLEML-------GYHVYRVLGQKRAVDL---EFCRNR 412 KY + +IID G D L L G + V D E + Sbjct: 323 KFKYKKTIYVIIDDTGLGGGVTDRLNELKSEGKLSGVVIVPVNFSAAVPDKKAAEKYHDI 382 Query: 413 RTELHVKMADWLE--FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTD 467 + + D LE A L N + LI L S + + + ++G++ +ESK + + +S D Sbjct: 383 TSYAWSILRDMLEEKEAVLPNDTELIAQL-SARKYDLSSSGKIRLESKKAMKERIGESPD 441 Query: 468 YSDGLMYT 475 +D ++ + Sbjct: 442 RADAVVLS 449 >gi|332980681|ref|YP_004462122.1| hypothetical protein Mahau_0077 [Mahella australiensis 50-1 BON] gi|332698359|gb|AEE95300.1| hypothetical protein Mahau_0077 [Mahella australiensis 50-1 BON] Length = 486 Score = 384 bits (987), Expect = e-104, Method: Composition-based stats. Identities = 110/470 (23%), Positives = 182/470 (38%), Gaps = 60/470 (12%) Query: 49 SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108 + P + +E + + + + + A+ + G GK+ + ++LW + + Sbjct: 16 NDPVWFVIEILGTRPWKKQIDIISAVRDNPRTAVRSCHGAGKSFIAGQVILWFLYSFYPS 75 Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168 V+ A + Q++ +W EV S P ++L I Sbjct: 76 IVLSTAPTWRQVEKLIWKEVR--------------ASYRRSKVPLGGNLLPKRPEIQIIQ 121 Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTS 228 S PD F G H + ++ DEA+G P+ I I G LT +A ++ Sbjct: 122 DEWYAVGLSTNEPDRFQGFHEE-NILVVVDEAAGVPEEIFEAIEGVLTSEHAR--LLLLG 178 Query: 229 NPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE----------------------------- 259 NP + G FY F P W+ I T Sbjct: 179 NPTSVGGTFYNAFRTPG--WENISISAFTTPNFTAFGITEDDIINKTWESKITNSLPNPK 236 Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI 319 I P++ R+G +S + V GQFP + D+ IPL IE A+ R P+ Sbjct: 237 LITPAWVADKYRRWGPNSPAYQARVLGQFPSEGEDTLIPLAWIEAAMARWEDTPEGEPIE 296 Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 +G D+A G D TV+ RRG + L ++K D T I + K +D Sbjct: 297 IGVDVARFGSDKTVIAARRGQKVLPLNVYAKQDTMETVGCIIMVHRKIGASKTKVDVIGV 356 Query: 380 GARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL--------EFASLIN 431 GA D L+ G+ V + + A D E N R+EL M + L E +L Sbjct: 357 GAGVVDRLKEQGHPVIGINVAEAATDTEKFANLRSELWWNMRELLDPNQRLNPEPIALPP 416 Query: 432 HSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAE 478 L+ +L +K + + + G + +ESK + + +S D +D ++ FA+ Sbjct: 417 DDELLADLSGVK-YKIDSRGRIQVESKEDMKKRLGRSPDRADAVVLAFAK 465 >gi|319956916|ref|YP_004168179.1| hypothetical protein Nitsa_1177 [Nitratifractor salsuginis DSM 16511] gi|319419320|gb|ADV46430.1| hypothetical protein Nitsa_1177 [Nitratifractor salsuginis DSM 16511] Length = 462 Score = 377 bits (969), Expect = e-102, Method: Composition-based stats. Identities = 113/419 (26%), Positives = 185/419 (44%), Gaps = 26/419 (6%) Query: 64 AHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTT 123 + ++ + K +I +G G GKTTL AW+VLW R + A + QL Sbjct: 30 KQQMKAIRAIDQGKKKISIRSGHGTGKTTLLAWIVLWWGLGREDAKIPMTAPTGHQLYDL 89 Query: 124 LWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY-STMCRTYSEERPD 182 L E+ KW +P + + ++V + ID + + RT +++P+ Sbjct: 90 LMPEIRKWREKMPVQ--------------YQNEVEVKTEKIDFANGNFAVPRTARKDQPE 135 Query: 183 TFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN 242 G H T +A I DEASG P VI G +T + IM +NP R G FY+ + Sbjct: 136 ALQGFHAT-NLAFIIDEASGIPQVIFEVAEGAMTGE--STLVIMAANPTRTEGYFYDSHH 192 Query: 243 KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 K W+ FQ + E + + E +YG DSDV RV + G+FP+Q ++ L + Sbjct: 193 KNRWQWECFQFNAEESENVSKEWIEEKKRQYGEDSDVYRVRIKGEFPRQSSNAVFSLQEV 252 Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 ++A RE D A + G D+A+ G D +V+ R+G + S L + Sbjct: 253 DDATTREIVDDSGAEV-WGLDVADFGDDKSVLAKRKGKHFHEITARSGLTLPDLAGWLIY 311 Query: 363 LVEKY--RPDAIIIDANNTGARTCDYLEMLGYH-VYRVLGQKRAVDLEFCRNRRTELHVK 419 + +P I +DA G+ G V V G A + E N+R E + Sbjct: 312 EYNQAKRKPAVIFVDAIGIGSSLPAVCFEKGLDIVIGVKGSNSASNSEKYHNKRAEWYYN 371 Query: 420 MADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAI-ESK--RVKGAKSTDYSDGLMYT 475 + D LE + + L+ L + + + + +TG++ + E K + + +S D +D T Sbjct: 372 LKDLLEDGKIPDDDELVGELMA-QKYQISSTGKIQLVEKKEIKKELGRSPDKADACALT 429 >gi|253578914|ref|ZP_04856185.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA] gi|251849857|gb|EES77816.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39BFAA] Length = 473 Score = 375 bits (963), Expect = e-102, Method: Composition-based stats. Identities = 109/451 (24%), Positives = 180/451 (39%), Gaps = 48/451 (10%) Query: 49 SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108 P + E + K +I +G+G+GKT L A + LW ++ P Sbjct: 4 DDPVMFFREVLNFEPDEWQAQAARDLAANPKVSIKSGQGVGKTGLEAAVFLWFVTCFPHP 63 Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168 ++ A ++ QL LW+E+SKW+S W ++ + K Sbjct: 64 RIVATAPTKQQLHDVLWSEISKWMSKSELLSIL---------LKWTKTYVYMV--GEEKR 112 Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTS 228 + + RT + +P+ G H M I DEASG D I ILG L+ AN ++ Sbjct: 113 WFGVARTAT--KPENMQGFHED-NMLFIVDEASGVADPIMEAILGTLS--GANNKLLLCG 167 Query: 229 NPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF 288 NP + SG FY+ + +K + + + + ++ +YG DS+V RV V G+F Sbjct: 168 NPTKTSGTFYDSHTRDRALYKCHTVSSMDSTRTNKENIDSLVRKYGWDSNVVRVRVRGEF 227 Query: 289 PQQDIDSFIPLNIIEEALNREPCPDPYAP---LIMGCDIAEEGGDNTVVVLRRGPVIEHL 345 P Q+ D FIPL++IE+ ++ D + +G D+A G D T++ + + Sbjct: 228 PNQEDDVFIPLSLIEQCSSKLLELDDADGMQFVSLGVDVARFGDDETIIYRNYHGHCKIV 287 Query: 346 FDWSKTDLRTTNNKISGLVEK-YRPD-------AIIIDANNTGARTCDYLEM-------L 390 + +L T I +K YR + ID G D L+ Sbjct: 288 RNRRGQNLMATVGDIVQEFKKIYREHPTYESKVYVQIDDTGLGGGVTDRLKEVRKEQKLY 347 Query: 391 GYHVYRVLGQKR--------AVDLEFCRNRRTELHVKMADWL--EFASLINHSGLIQNLK 440 V + ++ E N T + M D L + + + I L Sbjct: 348 KMQVIPINAAEKIETDTAAGKDAAERYNNLTTAMWASMRDLLDNKQIVIEDDEQTIGQLS 407 Query: 441 SLKSFIVPNTGELAIESK---RVKGAKSTDY 468 S K + + + G+L IE K + +G S D Sbjct: 408 SRK-YTMASNGKLEIEPKKEMKKRGLDSPDR 437 >gi|160940775|ref|ZP_02088117.1| hypothetical protein CLOBOL_05669 [Clostridium bolteae ATCC BAA-613] gi|158436295|gb|EDP14062.1| hypothetical protein CLOBOL_05669 [Clostridium bolteae ATCC BAA-613] Length = 484 Score = 370 bits (951), Expect = e-100, Method: Composition-based stats. Identities = 113/487 (23%), Positives = 185/487 (37%), Gaps = 70/487 (14%) Query: 45 LEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMST 104 L P + + + + + ++ +G GIGK+ + AW V+W M T Sbjct: 8 LFYADNPIYFVEDVIRAKPDEKQRDILRSLRDYPMTSVRSGHGIGKSAVEAWSVIWYMCT 67 Query: 105 RPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI 164 RP + C A +E QL LWAE+SKW+ P W + L+ Sbjct: 68 RPFPKIPCTAPTEHQLMDVLWAEISKWMRNNPALRD---------DLIWTKEKLYMQ--G 116 Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFW 224 + + + RT + P+ G H + + II DEASG D + +LG +T +A Sbjct: 117 HPEEWFAVPRTAT--NPEALQGFHAEHVLYII-DEASGVSDKVFEPVLGAMTGEDAK--L 171 Query: 225 IMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV 284 +M NP RL+G FY+ ++ + + +D R + + +F + II +G DSDV RV V Sbjct: 172 LMMGNPTRLAGFFYDSHHRNREQYSAIHVDGRDSQHVSRTFVQKIIDMFGEDSDVFRVRV 231 Query: 285 CGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPV--- 341 GQFP+ DS I + EEA N + P + +G D+A G D++ + Sbjct: 232 AGQFPKSTPDSLIAMEWCEEAANLQV-YAPGGQIDIGVDVARYGDDSSALYPLIDKKQSL 290 Query: 342 IEHLFDWSKTDLRT--TNNKISGLVEKYRPDAI--IIDANNTGARTCD------------ 385 L+ ++T I Y AI +D + G D Sbjct: 291 PYELYHHNRTTEIAGYVVIMIKQFAMDYPDAAIRVKVDCDGLGVGVYDNLYDQRDQIIDA 350 Query: 386 ----YLEMLGYH-------------------VYRVLGQKRA-----VDLEFCRNRRTELH 417 G + + D N + Sbjct: 351 IWYDRCRRAGINPEDGNQWNECQNVPKLDLEIIECHFGGSGGKVDDNDPVEYSNSTGLMW 410 Query: 418 VKMADWLEFASL--INHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGL 472 K+ +L+ L + L+ L + + ++V G+L +E K + +G S D +D L Sbjct: 411 GKVRKYLQEGKLQLPDDDTLVSQLCNRR-YLVNKDGKLELERKESMKKRGLTSPDIADAL 469 Query: 473 MYTFAEN 479 E Sbjct: 470 ALALYEP 476 >gi|302120432|gb|ADK92426.1| putative phage terminase large subunit [Candidatus Liberibacter asiaticus] Length = 255 Score = 365 bits (936), Expect = 1e-98, Method: Composition-based stats. Identities = 250/255 (98%), Positives = 254/255 (99%) Query: 88 IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147 IGKTTLNAWLVLWLMS RPG+S+ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS Sbjct: 1 IGKTTLNAWLVLWLMSIRPGMSIICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 60 Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 207 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI Sbjct: 61 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 120 Query: 208 NLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHE 267 NLGILGFLTE+NANRFWIMTSNPRRLSGKFYEIFN+PLDDWKRFQIDTRTVEGIDPSFHE Sbjct: 121 NLGILGFLTEQNANRFWIMTSNPRRLSGKFYEIFNRPLDDWKRFQIDTRTVEGIDPSFHE 180 Query: 268 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE 327 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE Sbjct: 181 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE 240 Query: 328 GGDNTVVVLRRGPVI 342 GGDNTVVVLRRGPVI Sbjct: 241 GGDNTVVVLRRGPVI 255 >gi|266623290|ref|ZP_06116225.1| putative terminase B protein [Clostridium hathewayi DSM 13479] gi|288864932|gb|EFC97230.1| putative terminase B protein [Clostridium hathewayi DSM 13479] Length = 484 Score = 364 bits (935), Expect = 1e-98, Method: Composition-based stats. Identities = 107/488 (21%), Positives = 191/488 (39%), Gaps = 72/488 (14%) Query: 45 LEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMST 104 L P + + + V + + ++ +G G+GK+ + +W V+W + T Sbjct: 8 LFYADEPIYFVEDIIRVTPDQKQRDILRSLRDYPMTSVRSGHGVGKSAVESWSVIWFLCT 67 Query: 105 RPGISVICLANSETQLKTTLWAEVSKWLSLLPN-KHWFEMQSLSLHPAPWYSDVLHCSLG 163 RP + C A ++ QL LWAE+SKWL P K+ ++ + Sbjct: 68 RPFPKIPCTAPTQHQLYDILWAEISKWLRNNPELKNDIIWTQQRVYMNGY---------- 117 Query: 164 IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRF 223 + + + RT + P+ G H + + II DEASG D + +LG +T +A Sbjct: 118 --PEEWFAVPRTAT--NPEALQGFHAEHVLYII-DEASGVSDKVFEPVLGAMTGEDAK-- 170 Query: 224 WIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283 +M NP RLSG F++ +K ++ ID R + ++ F + II +G+DSDV RV Sbjct: 171 LLMMGNPTRLSGFFFDSHHKSRSEYSAMHIDGRDSQHVNQKFVQKIINMFGMDSDVFRVR 230 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIE 343 V GQFP+ DS I ++ E A +P + +G D+A G D++ + V Sbjct: 231 VAGQFPKSTPDSLIMMDWCEAATQLKPETVRN-RVDIGVDVARYGDDSSALYPVIDKVQS 289 Query: 344 HLFD-WSKTDLRTTNNKISGLVEKYRPD------AIIIDANNTGARTCDYLE-------- 388 ++ + + + ++++Y + + +D + G D L Sbjct: 290 DGYELYHHNRTTEISGYVVQMIKRYAVECLDAVIRVKVDCDGLGVGVYDNLYDLTDQIID 349 Query: 389 ---------------------------MLGYHVYRVLGQKRA-----VDLEFCRNRRTEL 416 L + D N + Sbjct: 350 EVWRDRCRREGLDPDNGNQWNECQRIPQLDLEIVECHFGAAGGKIDEDDPVEYSNSTGLM 409 Query: 417 HVKMADWLEFAS--LINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDG 471 K+ L+ + + + LI L + + +IV G+L +E K + +G S D +D Sbjct: 410 WGKIRKLLQTGALQIPDDDALISQLSNRR-YIVNKDGKLELERKEAMKKRGLPSPDIADA 468 Query: 472 LMYTFAEN 479 L + Sbjct: 469 LALALYDP 476 >gi|153810665|ref|ZP_01963333.1| hypothetical protein RUMOBE_01049 [Ruminococcus obeum ATCC 29174] gi|149833061|gb|EDM88143.1| hypothetical protein RUMOBE_01049 [Ruminococcus obeum ATCC 29174] Length = 469 Score = 347 bits (889), Expect = 3e-93, Method: Composition-based stats. Identities = 123/465 (26%), Positives = 195/465 (41%), Gaps = 53/465 (11%) Query: 45 LEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMST 104 L + P + + ++ + E ++ +G GIGK+ + AW V+W M T Sbjct: 8 LYYANHPVEFVQDILKADPDPEQKKILRSLVENQMTSVRSGHGIGKSAVEAWSVIWFMCT 67 Query: 105 RPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI 164 P + C A ++ QL LWAE+SKW W + L+ Sbjct: 68 HPYPKIPCTAPTQHQLFDILWAEISKWKRN---------NKTLDSELIWTKEKLYM--KG 116 Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFW 224 ++ + + RT S PD G H + M I DEASG D I +LG L+ A Sbjct: 117 HAEEWFAVARTAST--PDALQGFHAEH-MLYIIDEASGVEDKIFEPVLGALSTPGAK--L 171 Query: 225 IMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV 284 +M NP +LSG FY+ NK + + F ID R + F + II YG DSDV RV V Sbjct: 172 LMCGNPTQLSGFFYDSHNKNREQYSTFHIDGRNSTRVSQEFVQTIINMYGEDSDVFRVRV 231 Query: 285 CGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI-MGCDIAEEGGDNTVVVLRRGPVIE 343 G FP + D +IPL ++E+++ E P + +I +GCD+A G D TV+ R ++ Sbjct: 232 AGDFPLAEDDIYIPLPLVEKSIATEYFPRRHPQIIHIGCDVARFGTDKTVIGYRTDEKVQ 291 Query: 344 HLFDWSKTDLRTTNNKISG----LVEKYR-------PDAIIIDANNTGARTCDYLEMLGY 392 D T + I LV +Y P I ID G D L + Sbjct: 292 FFKKRVGQDTMKTADDIVSLGMLLVYQYGLKPDIDEPIPIKIDDGGVGGGVVDRLRQIKR 351 Query: 393 H---------VYRVLGQKRAVDLEFCRNRRTELHVKMADWLE-----------FASLINH 432 + VY V ++ + +F + T + + L+ L + Sbjct: 352 NNPERFWWMEVYPVKFGQK-IRHKFFDDSTTYMMSVLKKLLQPFDDNGLPKDVEIILPDD 410 Query: 433 SGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMY 474 L+ + K + + ++ +ESK + +G +S D +D ++ Sbjct: 411 DALVAQISGRK-YEMTENSKIRVESKKVMKARGVQSPDEADCILL 454 >gi|148653111|ref|YP_001280204.1| hypothetical protein PsycPRwf_1309 [Psychrobacter sp. PRwf-1] gi|148572195|gb|ABQ94254.1| hypothetical protein PsycPRwf_1309 [Psychrobacter sp. PRwf-1] Length = 520 Score = 340 bits (871), Expect = 4e-91, Method: Composition-based stats. Identities = 91/490 (18%), Positives = 181/490 (36%), Gaps = 73/490 (14%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 +WQ E + + + ++++G G GK+ + LW + P ++ Sbjct: 41 TWQQELL----------FKSIVVPGSRTSVASGHGTGKSRSAGIIALWHLLFYPESVMLF 90 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHC-----SLGIDSK 167 A QL+T +W E++ L L N W +D + + Sbjct: 91 TAPQIGQLRTVVWKEINICLQRLRNNK----------ALGWLADYVVVLAEKIYIKGFKD 140 Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMT 227 + +T + +P G H + M + DEA G D + +G LT N ++T Sbjct: 141 TWFVFAKTAPKHQPTNIAGQHGDHYM-VWADEACGIDDAVMEVAIGALTHENNRA--VLT 197 Query: 228 SNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTRV 282 S P + +G FY+ +K W + + + + +YG +S + Sbjct: 198 SQPAKNTGFFYDTHHKLSHHNGGKWTALEFNGEMSPIVSKDKLIEALYQYGSRNSPGYLI 257 Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY---APLIMGCDIAEE-GGDNTVVV--- 335 + G+FP+ ++ E + ++PC +I+ D+ + G D++V+ Sbjct: 258 RIRGKFPELK-GEYLLTRTDYENMKQQPCVIEEGDKWGIIVAVDVGGDVGRDSSVISVMQ 316 Query: 336 ----LRRGPVIEHLFDW------SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385 + +G + H+ ++ ++ T KI+ ++ Y ++ID G Sbjct: 317 VVDKMIKGRIERHVHLLDIPLFSNRANINTLKAKINDVMSDYPGATLVIDPLGAGMGLTQ 376 Query: 386 YLEMLGYHVYRVLGQKRAVD---LEFCRNRRTELHVKMADWLEFA---SLINHSGLIQNL 439 L+ G + V + + N+R+ +V MA +E + Q + Sbjct: 377 SLKADGVYFDEVHWGSPCFNNTLKRYYMNKRSHAYVSMAKAVEKGYFSVSDKVKKMYQVM 436 Query: 440 KSLKS------FIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSDMDFGRC 490 +L+ + + SK+ KG KS D +D + + F EN Sbjct: 437 TNLEEQMTRLPYYFDEKARWCMMSKKDMLKKGIKSPDIADTIAFGFMEN-------ISYA 489 Query: 491 PSYQYEGVDL 500 P YE +++ Sbjct: 490 PVESYEDLNI 499 >gi|332974843|gb|EGK11758.1| hypothetical protein HMPREF9373_1714 [Psychrobacter sp. 1501(2011)] Length = 520 Score = 337 bits (865), Expect = 2e-90, Method: Composition-based stats. Identities = 90/490 (18%), Positives = 180/490 (36%), Gaps = 73/490 (14%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 +WQ E + + + ++++G G GK+ + LW + P ++ Sbjct: 41 TWQQELL----------FKSIVVPGSRTSVASGHGTGKSRSAGIIALWHLLFYPESVMLF 90 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHC-----SLGIDSK 167 A QL+T +W E++ L L N W +D + + Sbjct: 91 TAPQIGQLRTVVWKEINICLQRLRNNK----------ALGWLADYVVVLAEKIYIKGFKD 140 Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMT 227 + +T + +P G H + M + DEA G D + +G LT N ++T Sbjct: 141 TWFVFAKTAPKHQPTNIAGQHGDHYM-VWADEACGIDDAVMEVAIGALTHENNRA--VLT 197 Query: 228 SNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTRV 282 S P + +G FY+ +K W + + + + +YG +S + Sbjct: 198 SQPAKNTGFFYDTHHKLSHYNGGKWIALEFNGEMSPIVSKEKLIEALYQYGSRNSPGYLI 257 Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY---APLIMGCDIAEE-GGDNTVVV--- 335 + G+FP+ ++ E + PC +I+ D+ + G D++V+ Sbjct: 258 RIRGKFPELK-GEYLLTRTDYENMKAHPCVIKEGDKWGIIVTVDVGGDVGRDSSVISVLQ 316 Query: 336 ----LRRGPVIEHLFDW------SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385 + +G + H+ ++ ++ T KI+ ++ Y ++ID G Sbjct: 317 VVDKMVKGRIERHVHLLDIPLFSNRANINTLKAKINDVMSDYPGATLVIDPLGAGMGLTQ 376 Query: 386 YLEMLGYHVYRVLGQKRAVD---LEFCRNRRTELHVKMADWLEFA---SLINHSGLIQNL 439 ++ G + V + + N+R+ +V MA +E + Q + Sbjct: 377 SVKADGVYFDEVHWGSPCFNNTLKRYYMNKRSHAYVSMAKAVEKGYFSVSDKIKKMYQVI 436 Query: 440 KSLKS------FIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSDMDFGRC 490 +L+ + + SK+ KG KS D +D + + F EN Sbjct: 437 TNLEEQMTRLPYYFDEKARWCMMSKKDMLKKGIKSPDIADTIAFGFMEN-------ISYA 489 Query: 491 PSYQYEGVDL 500 P+ YE +++ Sbjct: 490 PAESYEDLNI 499 >gi|83593922|ref|YP_427674.1| hypothetical protein Rru_A2590 [Rhodospirillum rubrum ATCC 11170] gi|83576836|gb|ABC23387.1| hypothetical protein Rru_A2590 [Rhodospirillum rubrum ATCC 11170] Length = 505 Score = 332 bits (850), Expect = 1e-88, Method: Composition-based stats. Identities = 111/463 (23%), Positives = 176/463 (38%), Gaps = 62/463 (13%) Query: 75 PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSL 134 P K + AG G+GKTT A + W + C A + +QL+ LW+E+++ Sbjct: 34 PAGAKVTVRAGHGVGKTTATAAAIWWHLECFDYSKTPCTAPTASQLEQILWSELARLRRR 93 Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLH-CSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY-- 191 + +L ++ + + + + RT ++PD G H + Sbjct: 94 ADARAQGTGLPAALRLEALFAVSGRAIADRGTPREWFVVARTARRDQPDALQGFHASDID 153 Query: 192 ----------------GMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 + + +EASG PD + G L+ A +M NP R +G Sbjct: 154 LEAGAGPRLSAKSGGAALMFVIEEASGVPDAVFEVAEGALSSPGAR--LLMVGNPTRNTG 211 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDS 295 F + + ++ +DP + G++ +YG +S+V RV G FP+QD D Sbjct: 212 FFARSHKRDRASFTALRLRCADSPLVDPGYRAGLVRKYGAESNVVRVRADGAFPRQDDDV 271 Query: 296 FIPLNIIEEALNREPCPDPYAP---LIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTD 352 I L E AL R P P A +G D+A G D TV +LR GPV+ + + D Sbjct: 272 LIALETAEAALAR-PLPARMATEDERRLGVDVARFGDDRTVFLLRIGPVVGAIEVTAGRD 330 Query: 353 LRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG----YHVYRVLGQKRAVDLEF 408 + L E +R I +D GA D L G +RA E Sbjct: 331 TMAVAGRARRLAEIWRAGRIYVDEIGVGAGVVDRLREDGAPVVAVNVAASAPERAAGEER 390 Query: 409 CRNRRTELHVKMADWLE------------------------FASLI---NHSGLIQNLK- 440 R R L + + WL S + + L Q+L Sbjct: 391 GRLLRDHLWLMVRGWLRDEAPVFAGPGGGPASGSAAGLLSGMGSCLVPGVDADLAQDLAG 450 Query: 441 --SLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAE 478 + + +G + +ESK + +G +S D +D L TF E Sbjct: 451 ELATPRYAFDGSGRVVVESKDAMKRRGLRSPDLADALALTFHE 493 >gi|269119479|ref|YP_003307656.1| hypothetical protein Sterm_0853 [Sebaldella termitidis ATCC 33386] gi|268613357|gb|ACZ07725.1| hypothetical protein Sterm_0853 [Sebaldella termitidis ATCC 33386] Length = 499 Score = 323 bits (829), Expect = 3e-86, Method: Composition-based stats. Identities = 99/472 (20%), Positives = 177/472 (37%), Gaps = 53/472 (11%) Query: 58 FMEVVDAHCLNSVNNPNPEVF----KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICL 113 F ++++ H L+ + F + ++ AG GK++L L + + TRP VI Sbjct: 22 FKDILNFHFLSEDQTRVLQAFNEYRRLSVPAGHSTGKSSLAGGLTTYWLITRPKSRVIVT 81 Query: 114 ANSETQLKTTLWAEVSKWLSLLP---------------------NKHWFEMQSLSLHPAP 152 A + QLKT WAEV+K + + WF + + P Sbjct: 82 APTYRQLKTIYWAEVNKIYNRSKLKQLNLFEINDKIMRINDKDLKREWFALPVTASTPEG 141 Query: 153 WYS---------DVLHCSLGI----DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + + LGI D + + + E+ + + + ++ DE Sbjct: 142 MQGQHGDKTEVIEQIMKHLGIEEIGDDETIEIVSQILRGEKQIEGLTKEDKEKLLVMVDE 201 Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE 259 +SG + I + G T+ + ++ N + +G FYE P + + + + Sbjct: 202 SSGVKNEIFEVLEG--TDYD---KLVLFGNMTKNTGYFYESVYNPKSKFYKVTMSSYNSP 256 Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI 319 + + YG DS+V RV + G+ P + +S N I+ A R Y + Sbjct: 257 FMKKEQIHDLEETYGPDSNVVRVRLKGEAPDGNENSIFSSNKIDSAFQRSLSLSEYETIK 316 Query: 320 MGCDIA-EEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII--IDA 376 +G D+ GGD++ + ++ + D L +I K R II ID Sbjct: 317 LGVDVGKGSGGDSSTIYEKKDNRVRKKLDRKDFTLPDVKREIIQYCYKNRDKLIIANIDG 376 Query: 377 NNTGARTCDYLEM---LGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHS 433 G LE V + +A + + N+RTE++ +++ L+ L Sbjct: 377 TGLGTGLVQELEEGEIENLVVNDIQFAGKAKNKKEFNNKRTEMYFELSRNLDKLDLEEDQ 436 Query: 434 GLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPR 482 L + L ++ + N G + SK + S D SD L E R Sbjct: 437 ELKRELL-IQIYEFDNNGRFKLISKDKIKEMLGHSPDKSDALALCNYEAETR 487 >gi|312964323|ref|ZP_07778627.1| terminase B protein [Escherichia coli 2362-75] gi|331655801|ref|ZP_08356790.1| terminase B protein (PACase B protein) (DNA packaging B protein) [Escherichia coli M718] gi|312291036|gb|EFR18910.1| terminase B protein [Escherichia coli 2362-75] gi|323186470|gb|EFZ71817.1| terminase B protein [Escherichia coli 1357] gi|323969205|gb|EGB64507.1| terminase B protein [Escherichia coli TA007] gi|325495624|gb|EGC93488.1| DNA pacase B subunit [Escherichia fergusonii ECD227] gi|331046575|gb|EGI18664.1| terminase B protein (PACase B protein) (DNA packaging B protein) [Escherichia coli M718] Length = 494 Score = 315 bits (807), Expect = 1e-83, Method: Composition-based stats. Identities = 92/483 (19%), Positives = 182/483 (37%), Gaps = 54/483 (11%) Query: 32 VLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91 L+ + W L F +WQ + + + + + K ++S+G G GK+ Sbjct: 16 ALYRYDWIAAADVL--FGKTPTWQQDLI----------IESVQEQGSKTSVSSGHGTGKS 63 Query: 92 TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS-KWLSLLPNKHWFEMQSLSLHP 150 + + +++ + PG I +AN Q+ T ++ + W + W L Sbjct: 64 DMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSRFPWLA-DYFVLTE 122 Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG 210 +Y ++ + + + + G H + + II DEASG D Sbjct: 123 TAFYEVTGKGV-------WTVVPKGFRLGSEEALAGEHADHLLYII-DEASGVSDRAFGI 174 Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDP 263 I G LT ++ + S P R SG FY+ +K P + +++ + P Sbjct: 175 ITGALTGQDNRILLL--SQPTRPSGYFYDTHHKLAKRPGNPDGVYTAITLNSEESPLVTP 232 Query: 264 SFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGC 322 +F + +A Y G D+ + ++V G FP+ + + +E A R+ + Sbjct: 233 AFIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRKVKIAKGWGWLACV 292 Query: 323 DIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISGLV--EKYRPDA 371 D+A G D +V+ + +R + + +++ KI E++ Sbjct: 293 DVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKIFAECNPERFPNIT 352 Query: 372 IIIDANNTGARTCDYL-EMLGYHVYRVLGQKR---AVDLEFCRNRRTELHVKMADWLEFA 427 I ID + G T D + E G V R+ K+ D ++R +V+ A+ ++ Sbjct: 353 IAIDGDGLGKATADLMYEYYGITVQRIRWGKKMHSREDKSLYFDKRAYANVQAAEAVKSG 412 Query: 428 --SLINHSGLIQNLKSLKSFIVPNTGELAIES----KRVKGAKSTDYSDGLMYTFAENPP 481 L + I+ + + + G+ + S K+ S D+ D + + Sbjct: 413 RMRLDKGNETIEEASKIPV-GINSAGQWKVMSKEDMKKKLNLHSPDHWDTYCFAMLADYV 471 Query: 482 RSD 484 D Sbjct: 472 PQD 474 >gi|324111095|gb|EGC05081.1| terminase B protein [Escherichia fergusonii B253] Length = 494 Score = 315 bits (806), Expect = 1e-83, Method: Composition-based stats. Identities = 92/483 (19%), Positives = 182/483 (37%), Gaps = 54/483 (11%) Query: 32 VLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91 L+ + W L F +WQ + + + + + K ++S+G G GK+ Sbjct: 16 ALYRYDWIAAADVL--FGKTPTWQQDLI----------IESVQEQGSKTSVSSGHGTGKS 63 Query: 92 TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS-KWLSLLPNKHWFEMQSLSLHP 150 + + +++ + PG I +AN Q+ T ++ + W + W L Sbjct: 64 DMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSRFPWLA-DYFVLTE 122 Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG 210 +Y ++ + + + + G H + + II DEASG D Sbjct: 123 TAFYEVTGKGV-------WTVVPKGFRLGSEEALAGEHADHLLYII-DEASGVSDRAFGI 174 Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDP 263 I G LT ++ + S P R SG FY+ +K P + +++ + P Sbjct: 175 ITGALTGQDNRILLL--SQPTRPSGYFYDTHHKLAKRPGNPDGVYTAITLNSEESPLVTP 232 Query: 264 SFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGC 322 +F + +A Y G D+ + ++V G FP+ + + +E A R+ + Sbjct: 233 AFIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRKVKIAKGWGWLACV 292 Query: 323 DIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISGLV--EKYRPDA 371 D+A G D +V+ + +R + + +++ KI E++ Sbjct: 293 DVAGGTGRDKSVINIMMVSGQRNKRRVINYRILEYTDVTETQLAAKIFAECNPERFPNIT 352 Query: 372 IIIDANNTGARTCDYL-EMLGYHVYRVLGQKR---AVDLEFCRNRRTELHVKMADWLEFA 427 I ID + G T D + E G V R+ K+ D ++R +V+ A+ ++ Sbjct: 353 IAIDGDGLGKATADLMYEYYGITVQRIRWGKKMHSREDKSLYFDKRAYANVQAAEAVKSG 412 Query: 428 --SLINHSGLIQNLKSLKSFIVPNTGELAIES----KRVKGAKSTDYSDGLMYTFAENPP 481 L + I+ + + + G+ + S K+ S D+ D + + Sbjct: 413 RMRLDKGNETIEEASKIPV-GINSAGQWKVMSKEDMKKKLNLHSPDHWDTYCFAMLADYV 471 Query: 482 RSD 484 D Sbjct: 472 PQD 474 >gi|56266643|gb|AAV84926.1| DNA pacase B subunit [Enterobacteria phage phiW39] Length = 494 Score = 314 bits (805), Expect = 2e-83, Method: Composition-based stats. Identities = 92/483 (19%), Positives = 182/483 (37%), Gaps = 54/483 (11%) Query: 32 VLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91 L+ + W L F +WQ + + + + + K ++S+G G GK+ Sbjct: 16 ALYRYDWIAAADVL--FGKTPTWQQDLI----------IESVQEQGSKTSVSSGHGTGKS 63 Query: 92 TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS-KWLSLLPNKHWFEMQSLSLHP 150 + + +++ + PG I +AN Q+ T ++ + W + W L Sbjct: 64 DMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSRFPWLA-DYFVLTE 122 Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG 210 +Y ++ + + + + G H + + II DEASG D Sbjct: 123 TAFYEITGKGV-------WTVVPKGFRLGSEEALAGEHADHLLYII-DEASGVSDRAFGI 174 Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDP 263 I G LT ++ + S P R SG FY+ +K P + +++ + P Sbjct: 175 ITGALTGQDNRILLL--SQPTRPSGYFYDTHHKLAKRPGNPDGVYTAITLNSEESPLVTP 232 Query: 264 SFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGC 322 +F + +A Y G D+ + ++V G FP+ + + +E A R+ + Sbjct: 233 AFIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRKVKIAKGWGWLACV 292 Query: 323 DIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISGLV--EKYRPDA 371 D+A G D +V+ + +R + + +++ KI E++ Sbjct: 293 DVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKIFAECNPERFPNIT 352 Query: 372 IIIDANNTGARTCDYL-EMLGYHVYRVLGQKR---AVDLEFCRNRRTELHVKMADWLEFA 427 I ID + G T D + E G V R+ K+ D ++R +V+ A+ ++ Sbjct: 353 IAIDGDGLGKATADLMYEYYGITVQRIRWGKKMHSREDKSLYFDKRAYANVQAAEAVKSG 412 Query: 428 --SLINHSGLIQNLKSLKSFIVPNTGELAIES----KRVKGAKSTDYSDGLMYTFAENPP 481 L + I+ + + + G+ + S K+ S D+ D + + Sbjct: 413 RMRLDKGNETIEEASKIPV-GINSAGQWKVMSKEDMKKKLNLHSPDHWDTYCFAMLADYV 471 Query: 482 RSD 484 D Sbjct: 472 PQD 474 >gi|168467778|ref|ZP_02701615.1| DNA pacase B subunit [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|195629119|gb|EDX48493.1| DNA pacase B subunit [Salmonella enterica subsp. enterica serovar Newport str. SL317] Length = 494 Score = 313 bits (803), Expect = 4e-83, Method: Composition-based stats. Identities = 94/482 (19%), Positives = 181/482 (37%), Gaps = 52/482 (10%) Query: 32 VLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91 + + W + F +WQ + + SV P K ++S+G G GK+ Sbjct: 16 AQYRYDWIAAADVM--FGKTPTWQQD-------QIIESVQEPGS---KTSVSSGHGTGKS 63 Query: 92 TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS-KWLSLLPNKHWFEMQSLSLHP 150 + + +++ + PG I +AN Q+ T ++ + W + W + L Sbjct: 64 DMTSIMIMLFIIMFPGARAIIVANKIQQVMTGIFKYLKINWSTATSRFPWLA-EYFVLTD 122 Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG 210 +Y ++ + + + + G H + + II DEASG D Sbjct: 123 TSFYEITSKGV-------WTVVPKGFRLGNEEALAGEHADHLLYII-DEASGVSDKAFGI 174 Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDP 263 + G LT ++ + S P R SG FY+ +K P + +++ + P Sbjct: 175 MTGALTGKDNRILLL--SQPTRPSGYFYDTHHKLAKRPGNPNGIYTAITLNSEESPLVTP 232 Query: 264 SFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGC 322 F + +A Y G DS + ++V G FP+ + + +E A R+ I Sbjct: 233 EFIKMKLAEYGGRDSPMYLIKVRGLFPKTQDGFLLGRDEVERASRRKVKIAKGWGWIACV 292 Query: 323 DIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISGLV--EKYRPDA 371 D+A G D +V+ + +R + + ++S KI+ ++Y Sbjct: 293 DVAGGTGRDKSVINIMMVSGERNKRRIIGYRIIEYSDVTETQLAAKINAECSPDRYPNIT 352 Query: 372 IIIDANNTGARTCDYL-EMLGYHVYRVLGQKR---AVDLEFCRNRRTELHVKMADWLEFA 427 I+ID + G T D L + G R+ K+ D ++R +V+ A+ ++ Sbjct: 353 IVIDGDGLGKSTADLLYDNYGITAQRIRWGKKMHSREDRSLYFDQRAYANVQAAEAVKSG 412 Query: 428 SLINHSGLIQ-NLKSLKSFIVPNTGELAIES----KRVKGAKSTDYSDGLMYTFAENPPR 482 + G S + + G+ + S K+ +S D+ D + N Sbjct: 413 RMRLDKGDATIEEASKIPVGINSAGQWKVMSKEDMKKKLNLRSPDHWDTYCFGMLANYVP 472 Query: 483 SD 484 + Sbjct: 473 QN 474 >gi|56266666|gb|AAV84947.1| DNA pacase B subunit [Enterobacteria phage D6] Length = 502 Score = 309 bits (792), Expect = 5e-82, Method: Composition-based stats. Identities = 101/444 (22%), Positives = 176/444 (39%), Gaps = 38/444 (8%) Query: 72 NPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKW 131 + + +++G G GK++L A L+L M P VI +AN Q+KT ++ V ++ Sbjct: 49 SVQETGSRTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQY 108 Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191 + +H + L +Y + +C+ Y + G H + Sbjct: 109 WANAVKRHGWLQTYFVLSDTMFYERSRKGI-------WEVLCKGYRLGNEEALAGEHAAH 161 Query: 192 GMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------P 244 + +I DEASG D + G LTE + +M S P R SG FY+ + P Sbjct: 162 -LLLILDEASGISDKAIGVMTGALTEEDNR--MLMLSQPTRPSGYFYDSHHSQAKTPDNP 218 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303 W +++ + P F + + Y G DS V+V GQFP++ + + + Sbjct: 219 KGIWTAIVLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECD 278 Query: 304 EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL--------RRGPVIEHLFDWSKT-DLR 354 A R+ + + D+ G D +V+ + +R V + + S T D Sbjct: 279 RAARRKVLLEKNWGWVATADVG-NGRDKSVLNICKVSGHRDKRRVVNFKVMEMSGTMDPL 337 Query: 355 TTNNKISGLV--EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKR---AVDLEFC 409 + I EKY I +DA+ G+ TC L G + R+ K D E Sbjct: 338 AFADFIYNECTPEKYPNITIAVDADGFGSDTCAQLVRRGANPVRIRWGKPMFANKDRERF 397 Query: 410 RNRRTELHVKMADWLEFASLINHSGLIQNLKSLK-SFIVPNTGELAIESKR----VKGAK 464 N+R ++ D ++ + S ++ K F++ G++A+ K K Sbjct: 398 VNQRAYANIMARDAIKSGRMRIDSDPKTAEQASKIPFLLNEEGKMAMMRKEHMRQKLNIK 457 Query: 465 STDYSDGLMYTFAENPPRSDMDFG 488 S D D +T + ++ D G Sbjct: 458 SPDRWDTYCFTMLVDYVPANEDIG 481 >gi|323179619|gb|EFZ65182.1| terminase B protein [Escherichia coli 1180] Length = 453 Score = 309 bits (791), Expect = 8e-82, Method: Composition-based stats. Identities = 100/442 (22%), Positives = 174/442 (39%), Gaps = 38/442 (8%) Query: 74 NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133 + +++G G GK++L A L+L M P VI +AN Q+KT ++ V ++ + Sbjct: 2 QETGSRTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQYWA 61 Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193 +H + L +Y + +C+ Y + G H + + Sbjct: 62 NAVKRHGWLQTYFVLSDTMFYERSRKGI-------WEVLCKGYRLGNEEALAGEHAAH-L 113 Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLD 246 +I DEASG D + G LTE + +M S P R SG FY+ + P Sbjct: 114 LLILDEASGISDKAIGVMTGALTEEDNR--MLMLSQPTRPSGYFYDSHHSQAKTPDNPKG 171 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305 W +++ + P F + + Y G DS V+V GQFP++ + + + A Sbjct: 172 IWTAIVLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECDRA 231 Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVL--------RRGPVIEHLFDWSKT-DLRTT 356 R+ + + D+ G D +V+ + +R V + + T D Sbjct: 232 ARRKVLLEKNWGWVATADVG-NGRDKSVLNICKVSGHRDKRRVVNFKVMEMPGTMDPLAF 290 Query: 357 NNKISGLV--EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKR---AVDLEFCRN 411 + I EKY I +DA+ G+ TC L G + R+ K D E N Sbjct: 291 ADFIYNECTPEKYPNITIAVDADGFGSDTCAQLVRRGANPVRIRWGKPMFANKDRERFVN 350 Query: 412 RRTELHVKMADWLEFASLINHSGLIQNLKSLK-SFIVPNTGELAIESKR----VKGAKST 466 +R ++ D ++ + S ++ K F++ G++A+ K KS Sbjct: 351 QRAYANIMARDAIKSGRMRIDSDPKTAEQASKIPFLLNEEGKMAMMRKEHMRQKLNIKSP 410 Query: 467 DYSDGLMYTFAENPPRSDMDFG 488 D D +T + ++ D G Sbjct: 411 DRWDTYCFTMLVDYVPANEDIG 432 >gi|304399103|ref|ZP_07380971.1| DNA packaging protein [Pantoea sp. aB] gi|304353343|gb|EFM17722.1| DNA packaging protein [Pantoea sp. aB] Length = 503 Score = 308 bits (790), Expect = 1e-81, Method: Composition-based stats. Identities = 107/503 (21%), Positives = 189/503 (37%), Gaps = 57/503 (11%) Query: 34 HFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTL 93 + + W +E F +WQ E +NSV + +++G G GK++L Sbjct: 23 YRYNWALA--VVELFGMIPTWQQE-------EIMNSVQETGSQ---TTVTSGHGTGKSSL 70 Query: 94 NAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPW 153 A ++L M P VI +AN Q+KT ++ V + + +H + +L + Sbjct: 71 TAMMLLIYMIMYPDARVIIVANKIGQVKTGVFKYVKTYWANAARRHPWLQNYFTLTDTMF 130 Query: 154 YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILG 213 Y + +C+ Y + G H + + I+ DEASG D + G Sbjct: 131 YEKSRKGI-------WEVLCKGYRLGNEEALAGEHAAHILLIL-DEASGISDKAIAIMRG 182 Query: 214 FLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDPSFH 266 LTE + +M S P R SG FY+ + P W +++ + F Sbjct: 183 ALTEEDNR--MLMMSQPTRPSGYFYDSHHSLARHPDNPNGFWNAIVLNSEEAPHVTLKFI 240 Query: 267 EGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIA 325 + Y G DS V+V G+FP+ + + + A R+ + + D+ Sbjct: 241 REKLVEYGGRDSLEYMVKVLGRFPRNVSGYLLGRDECDRAARRKVYLEKGWGWVATADVG 300 Query: 326 EEGGDNTVVVLR--------RGPVIEHLFDWSKT-DLRTTNNKISGLV--EKYRPDAIII 374 G D +++ + R V L + T D + + I+ E+Y I + Sbjct: 301 -NGRDKSILNICKVSGYGDARRVVSFKLLEMPGTMDPISFGDYIANECTQERYPGITIAV 359 Query: 375 DANNTGARTCDYLEMLGYHVYRVLGQKRAVDL---EFCRNRRTELHVKMADWLEFASL-I 430 D + G+ T LE G + + + E +N+R ++ AD + + I Sbjct: 360 DGDGVGSGTLKQLERRGVNAISIRWGQPPFSKKVRERFKNQRAWSNIMAADAIRSGRMRI 419 Query: 431 NHSGLIQNLKSLKSFIVPNTGELAIESK----RVKGAKSTDYSDGLMYTF------AENP 480 + S S + + G + + K + KS D D + F AE Sbjct: 420 DMSQHTAEQASKIPYFMDEMGRIMMVPKPQMRQKLNIKSPDRWDTYCFIFLIGYRPAEAE 479 Query: 481 PRSDM-DFGRCPSYQYEGVDLLI 502 DM DF + + +D L+ Sbjct: 480 LSEDMADFTQSKLDELSELDALL 502 >gi|323948959|gb|EGB44853.1| terminase B protein [Escherichia coli H252] Length = 502 Score = 307 bits (787), Expect = 3e-81, Method: Composition-based stats. Identities = 99/444 (22%), Positives = 175/444 (39%), Gaps = 38/444 (8%) Query: 72 NPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKW 131 + + +++G G GK++L A L+L M P VI +AN Q+KT ++ V ++ Sbjct: 49 SVQETGSRTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQY 108 Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191 + +H + L +Y + +C+ Y + G H + Sbjct: 109 WANAVKRHGWLQTYFVLSDTMFYERSRKGI-------WEVLCKGYRLGNEEALAGEHAAH 161 Query: 192 GMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------P 244 + +I DEASG D + G LTE + +M S P R SG FY+ + P Sbjct: 162 -LLLILDEASGISDKAIGVMTGALTEEDNR--MLMLSQPTRPSGYFYDSHHSRAKTPDNP 218 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303 W +++ + P F + + Y G DS V+V GQFP++ + + + Sbjct: 219 KGIWTAIVLNSEESPFVTPQFIKEKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECD 278 Query: 304 EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL--------RRGPVIEHLFDWSKT-DLR 354 + R+ + + D+ G D +V+ + +R V + + T D Sbjct: 279 RSARRKVLLEKNWGWVATADVG-NGRDKSVLNICKVSGHRDKRRVVNFKVMEMPGTMDPL 337 Query: 355 TTNNKISGLV--EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKR---AVDLEFC 409 + I EKY I +DA+ G+ TC L G + R+ K D E Sbjct: 338 AFADFIYNECTPEKYPNITIAVDADGFGSDTCAQLVRRGANPVRIRWGKPMFANKDRERF 397 Query: 410 RNRRTELHVKMADWLEFASLINHSGLIQNLKSLK-SFIVPNTGELAIESKR----VKGAK 464 N+R ++ D ++ + S ++ K F++ G++A+ K K Sbjct: 398 VNQRAYANIMARDAIKSGRMRIDSDPKTAEQASKIPFLLNEEGKMAMMRKEHMRQKLNIK 457 Query: 465 STDYSDGLMYTFAENPPRSDMDFG 488 S D D +T + ++ D G Sbjct: 458 SPDRWDTYCFTMLVDYVPANEDIG 481 >gi|307308936|ref|ZP_07588619.1| hypothetical protein SinmeBDRAFT_4503 [Sinorhizobium meliloti BL225C] gi|306900570|gb|EFN31183.1| hypothetical protein SinmeBDRAFT_4503 [Sinorhizobium meliloti BL225C] Length = 472 Score = 307 bits (785), Expect = 4e-81, Method: Composition-based stats. Identities = 109/439 (24%), Positives = 190/439 (43%), Gaps = 38/439 (8%) Query: 80 GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKH 139 + G GKT ++A + W + + V A SE+ +K+ +W E Sbjct: 50 ITVKGSSGWGKTFISAISLWWSLIVFDPVKVTIFAPSESTIKSGIWNE------------ 97 Query: 140 WFEMQSLSLHPAPWYSDVLHCS-LGIDSKHYSTMC----RTYSEERPDTFVGHHNTYGMA 194 +Q L + AP + ++ S I K C R S++ G H+ + Sbjct: 98 ---LQVLYSNMAPLFRELFEVSATKIFRKSRGETCWAEYRLVSKDNIAAARGFHSKNNI- 153 Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP--LDDWKRFQ 252 +I DEASG DVI G L + ++ SNP + SG F++ + P DW + Sbjct: 154 VIADEASGIEDVIFTGALLNVLNDGPGAKVVLVSNPDKASGFFFKTWRDPELSKDWIKVH 213 Query: 253 IDTRTVEGIDPSFHEGIIARYG-LDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-NREP 310 R P E YG + S V G+FP D+D I ++EA+ N++ Sbjct: 214 GSIRDKPNYTPGEEERFARLYGGVTSRDYLTLVEGEFPLSDVDGLISREFLDEAVTNKDA 273 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLV----EK 366 P+P AP+I G D A G D +V+ +R V+ +W+ + ++ L +K Sbjct: 274 IPNPKAPIIWGLDPAGAGKDKSVLAIRHDNVLRGFEEWAGLEPVALALRVKELYLKTSKK 333 Query: 367 YRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQK-RAVDLEFCRNRRTELHVKMADWLE 425 RP I +D N GA D L+ VY+ + + + + R ++ +M +W+ Sbjct: 334 DRPAVIAVDGNGLGAGVYDALKHFKIPVYKCMFAEVPKRNPDRYTRVRDQIWFEMREWIH 393 Query: 426 FA--SLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 S+ NH LI++L ++ ++ ++ ++ IE K + + +S DY+D L TF+ + Sbjct: 394 TGDVSIPNHKKLIEDL-AIPTYE--DSPKIKIEDKKSLKKRLGRSPDYADALALTFSVSH 450 Query: 481 PRSDMDFGRCPSYQYEGVD 499 R + +Y+ + Sbjct: 451 TRYASKYQWDKPIEYDNLS 469 >gi|260871239|ref|YP_003238019.1| DNA packaging protein [Escherichia coli O111:H- str. 11128] gi|257767818|dbj|BAI39311.1| DNA packaging protein [Escherichia coli O111:H- str. 11128] Length = 494 Score = 306 bits (784), Expect = 5e-81, Method: Composition-based stats. Identities = 90/482 (18%), Positives = 177/482 (36%), Gaps = 52/482 (10%) Query: 32 VLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91 L+ + W L F +WQ + + S ++++G G GK+ Sbjct: 16 ALYRYDWIAAADVL--FGKTPTWQQD-------EIIESTQQDGSW---TSVTSGHGTGKS 63 Query: 92 TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEV-SKWLSLLPNKHWFEMQSLSLHP 150 + + + + + PG VI +AN Q+ ++ + S W + + W + L Sbjct: 64 DMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFPWLS-KYFILTE 122 Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG 210 ++ ++ + ++ + G H + + II DEASG D Sbjct: 123 TSFFEVTGKGV-------WTILIKSCRSGNEEALAGEHADHLLYII-DEASGVSDKAFSV 174 Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDP 263 I G LT ++ + S P R SG FY+ ++ P + +++ +D Sbjct: 175 ITGALTGKDNRILLL--SQPTRPSGYFYDSHHRLAIRPGNPDGLFTAIILNSEESPLVDA 232 Query: 264 SFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGC 322 F +A Y G D+ + ++V G+FP+ + + +E A R+ + Sbjct: 233 KFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVKIAKGWGWVACV 292 Query: 323 DIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISGLV--EKYRPDA 371 D+A G D +V+ + +R + + +++ KI E++ Sbjct: 293 DVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKIFAECNPERFPNIT 352 Query: 372 IIIDANNTGARTCDYL-EMLGYHVYRVLGQKR---AVDLEFCRNRRTELHVKMADWLEFA 427 I ID + G T D + E G V R+ K+ D + R +++ A+ ++ Sbjct: 353 IAIDGDGLGKSTADLMYERYGITVQRIRWGKKMHSREDKSLYFDMRAFANIQAAEAVKSG 412 Query: 428 SLINHSGLIQ-NLKSLKSFIVPNTGELAIES----KRVKGAKSTDYSDGLMYTFAENPPR 482 + G S + + G+ + S K+ S D+ D + N Sbjct: 413 RMRLDKGAATIEEASKIPVGINSAGQWKVMSKEDMKKKLNLHSPDHWDTYCFAMLANYVP 472 Query: 483 SD 484 D Sbjct: 473 QD 474 >gi|46401730|ref|YP_006576.1| PacB [Enterobacteria phage P1] gi|301646767|ref|ZP_07246623.1| putative terminase B protein [Escherichia coli MS 146-1] gi|129547|sp|P27753|TERL_BPP1 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging protein B; AltName: Full=PACase B protein; AltName: Full=Terminase B protein; AltName: Full=Terminase large subunit gi|68597607|sp|Q5XLR0|TERL_BPP7 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging protein B; AltName: Full=PACase B protein; AltName: Full=Terminase B protein; AltName: Full=Terminase large subunit gi|33323612|gb|AAQ07582.1|AF503408_106 PacB [Enterobacteria phage P7] gi|215636|gb|AAA21724.1| pacB [Enterobacteria phage P1] gi|33338757|gb|AAQ14080.1| PacB [Enterobacteria phage P1] gi|33338866|gb|AAQ14188.1| PacB [Enterobacteria phage P1] gi|54112354|gb|AAV28854.1| PacB [Enterobacteria phage P7] gi|301075042|gb|EFK89848.1| putative terminase B protein [Escherichia coli MS 146-1] Length = 494 Score = 306 bits (783), Expect = 7e-81, Method: Composition-based stats. Identities = 90/482 (18%), Positives = 177/482 (36%), Gaps = 52/482 (10%) Query: 32 VLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91 L+ + W L F +WQ + + S ++++G G GK+ Sbjct: 16 ALYRYDWIAAADVL--FGKTPTWQQD-------EIIESTQQDGSW---TSVTSGHGTGKS 63 Query: 92 TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEV-SKWLSLLPNKHWFEMQSLSLHP 150 + + + + + PG VI +AN Q+ ++ + S W + + W + L Sbjct: 64 DMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFPWLS-KYFILTE 122 Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG 210 ++ ++ + ++ + G H + + II DEASG D Sbjct: 123 TSFFEVTGKGV-------WTILIKSCRPGNEEALAGEHADHLLYII-DEASGVSDKAFSV 174 Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDP 263 I G LT ++ + S P R SG FY+ ++ P + +++ +D Sbjct: 175 ITGALTGKDNRILLL--SQPTRPSGYFYDSHHRLAIRPGNPDGLFTAIILNSEESPLVDA 232 Query: 264 SFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGC 322 F +A Y G D+ + ++V G+FP+ + + +E A R+ + Sbjct: 233 KFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVKIAKGWGWVACV 292 Query: 323 DIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISGLV--EKYRPDA 371 D+A G D +V+ + +R + + +++ KI E++ Sbjct: 293 DVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKIFAECNPERFPNIT 352 Query: 372 IIIDANNTGARTCDYL-EMLGYHVYRVLGQKR---AVDLEFCRNRRTELHVKMADWLEFA 427 I ID + G T D + E G V R+ K+ D + R +++ A+ ++ Sbjct: 353 IAIDGDGLGKSTADLMYERYGITVQRIRWGKKMHSREDKSLYFDMRAFANIQAAEAVKSG 412 Query: 428 SLINHSGLIQ-NLKSLKSFIVPNTGELAIES----KRVKGAKSTDYSDGLMYTFAENPPR 482 + G S + + G+ + S K+ S D+ D + N Sbjct: 413 RMRLDKGAATIEEASKIPVGINSAGQWKVMSKEDMKKKLNLHSPDHWDTYCFAMLANYVP 472 Query: 483 SD 484 D Sbjct: 473 QD 474 >gi|331649955|ref|ZP_08351031.1| terminase B protein (PACase B protein) (DNA packaging B protein) [Escherichia coli M605] gi|331041212|gb|EGI13366.1| terminase B protein (PACase B protein) (DNA packaging B protein) [Escherichia coli M605] Length = 494 Score = 305 bits (782), Expect = 9e-81, Method: Composition-based stats. Identities = 90/482 (18%), Positives = 177/482 (36%), Gaps = 52/482 (10%) Query: 32 VLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91 L+ + W L F +WQ + + S ++++G G GK+ Sbjct: 16 ALYRYDWIAAADVL--FGKTPTWQQD-------EIIESTQQDGSW---TSVTSGHGTGKS 63 Query: 92 TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEV-SKWLSLLPNKHWFEMQSLSLHP 150 + + + + + PG VI +AN Q+ ++ + S W + + W + L Sbjct: 64 DMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFPWLS-KYFILTE 122 Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG 210 ++ ++ + ++ + G H + + II DEASG D Sbjct: 123 TSFFEVTGKGV-------WTILIKSCRPGNEEALAGEHADHLLYII-DEASGVSDKAFSV 174 Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDP 263 I G LT ++ + S P R SG FY+ ++ P + +++ +D Sbjct: 175 ITGALTGKDNRILLL--SQPTRPSGYFYDSHHRLAIRPGNPDGLFTAIILNSEESPLVDA 232 Query: 264 SFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGC 322 F +A Y G D+ + ++V G+FP+ + + +E A R+ + Sbjct: 233 KFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVKIAKGWGWVACV 292 Query: 323 DIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISGLV--EKYRPDA 371 D+A G D +V+ + +R + + +++ KI E++ Sbjct: 293 DVAGGTGRDKSVINIMMVSGQRNKRRVINYRMQEYTDVTETQLAAKIFAECNPERFPNIT 352 Query: 372 IIIDANNTGARTCDYL-EMLGYHVYRVLGQKR---AVDLEFCRNRRTELHVKMADWLEFA 427 I ID + G T D + E G V R+ K+ D + R +++ A+ ++ Sbjct: 353 IAIDGDGLGKSTADLMYERYGITVQRIRWGKKMHSREDKSLYFDMRAFANIQAAEAVKSG 412 Query: 428 SLINHSGLIQ-NLKSLKSFIVPNTGELAIES----KRVKGAKSTDYSDGLMYTFAENPPR 482 + G S + + G+ + S K+ S D+ D + N Sbjct: 413 RMRLDKGAATIEEASKIPVGINSAGQWKVMSKEDMKKKLNLHSPDHWDTYCFAMLANYVP 472 Query: 483 SD 484 D Sbjct: 473 QD 474 >gi|48697461|ref|YP_024846.1| Pas60 [Actinoplanes phage phiAsp2] gi|47679679|gb|AAT36808.1| Pas60 [Actinoplanes phage phiAsp2] Length = 492 Score = 304 bits (779), Expect = 2e-80, Method: Composition-based stats. Identities = 105/461 (22%), Positives = 173/461 (37%), Gaps = 53/461 (11%) Query: 49 SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108 +P +W + ++V A + + P + A+ G+GK+ A LV W +TR + Sbjct: 21 DSPTAWAADCLDVRLAGYQGEILDAVPRERRVAVRGPHGLGKSFSGAILVNWFATTRDLM 80 Query: 109 ----SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI 164 +I A++ L+ LW E+ KW + ++L AP+ L + Sbjct: 81 GKDWKIITTASAWRHLEVYLWPEIHKWAG--------RINFVALGRAPYNPRTELLDLRL 132 Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA---- 220 H + + +P+ G H + + DEA P I G + Sbjct: 133 KLTHGAATA--VASNQPERIEGAHAEE-LLYLLDEAKIVPPATWDSIEGAFSNAGVDVAD 189 Query: 221 NRFWIMTSNPRRLSGKFYEIFNK--PLDDWKRFQIDTRT---VEGIDPSFHEGIIARYGL 275 N + S P SG+FY+I + +DW + I ++ + +++G Sbjct: 190 NAYAFAMSTPGAPSGRFYDIHRRAPGYEDWWTRHVTLEEAIASGRISRAWADQRRSQWGS 249 Query: 276 DSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP------CPDPYAPLIMGCDIAEEGG 329 DS V V G+F D DS IPL +E A+ R P P PL G D+ GG Sbjct: 250 DSAVFHNRVLGEFHASDEDSVIPLAWLEAAIERWHEWDRQGRPSPGGPLWTGVDVGR-GG 308 Query: 330 DNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 D TV+ R G + L + D T I + R IID GA D L Sbjct: 309 DETVLAARDGWAVT-LETNRRRDTMATVGLI-----QAREGRAIIDVIGLGAGVFDRLRE 362 Query: 390 LGYHVYRVLGQKRAVDLEF-----CRNRRTELHVKMADWLE-----FASLINHSGLIQNL 439 LG G A + N R+ + + + L+ +L +I +L Sbjct: 363 LGTRPLAYTGSAGATVRDRSGKFGFTNTRSAAYWNLRELLDPAFDPVLALPPDDLMISDL 422 Query: 440 KSLKSFIVPNT--GELAIESKRV---KGAKSTDYSDGLMYT 475 + + V ++ +E K + +S D D + + Sbjct: 423 TT-PHWEVTTGVPPKIKVEPKDKVVERLGRSPDRGDAIAMS 462 >gi|323516996|gb|ADX91377.1| hypothetical protein ABTW07_0941 [Acinetobacter baumannii TCDC-AB0715] gi|323518424|gb|ADX92805.1| hypothetical protein ABTW07_2381 [Acinetobacter baumannii TCDC-AB0715] Length = 663 Score = 297 bits (761), Expect = 2e-78, Method: Composition-based stats. Identities = 89/431 (20%), Positives = 154/431 (35%), Gaps = 51/431 (11%) Query: 86 RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145 GKT + LW + ++ A QLK +W E+S + Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLAR 256 Query: 146 LSLHPAPWYSDVL-----HCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 L P W +D + + + + +T + +P G+H M + DEA Sbjct: 257 LKQGPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEA 315 Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD----WKRFQIDTR 256 SG D + G LT + +MTS P R +G FYE +K W + Sbjct: 316 SGVDDAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGE 373 Query: 257 TVEGIDPSFHEGIIARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315 + E +YG D ++ V G+FP + I EE D + Sbjct: 374 ESPLVSKQSLEEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDH 433 Query: 316 A-PLIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKI 360 ++ D+ G D++V+V+ RR V++ ++ D+ KI Sbjct: 434 QFGYVITVDVGGGVGRDDSVIVVSKVWGESQWGERARRVEVVDIPLCKNRDDILELFAKI 493 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAV---DLEFCRNRRTELH 417 + L+ +Y +++D N G YL+ G V + + + N+R+ + Sbjct: 494 NELLLQYPNANLVVDDNGAGKGLGQYLKKQGIFYVPVYWGSQCFSNDNRKEFTNKRSLAY 553 Query: 418 VKMADWLEFAS-----LINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYS 469 V +A + ++ + L + + + I SK + G KS D Sbjct: 554 VGLARAIASGRFKIKTKKHNVKIKDQLIHVP-YRFDDFARYKILSKDEMKRMGIKSPDIG 612 Query: 470 DGLMYTFAENP 480 D + F EN Sbjct: 613 DAFAFLFLENV 623 >gi|299769795|ref|YP_003731821.1| hypothetical protein AOLE_07785 [Acinetobacter sp. DR1] gi|298699883|gb|ADI90448.1| hypothetical protein AOLE_07785 [Acinetobacter sp. DR1] Length = 668 Score = 297 bits (759), Expect = 4e-78, Method: Composition-based stats. Identities = 91/431 (21%), Positives = 153/431 (35%), Gaps = 51/431 (11%) Query: 86 RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145 GKT + LW + ++ A QLK +W E+S + Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLAR 256 Query: 146 LSLHPAPWYSDVL-----HCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 L P W +D + + + + +T + +P G+H M + DEA Sbjct: 257 LKQGPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEA 315 Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD----WKRFQIDTR 256 SG D + G LT + +MTS P R +G FYE +K W + Sbjct: 316 SGVDDAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGE 373 Query: 257 TVEGIDPSFHEGIIARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315 + E +YG D ++ V G+FP + I EE D + Sbjct: 374 ESPLVSKQSLEEQRQKYGSRDDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDH 433 Query: 316 A-PLIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKI 360 I+ D+ G D++V+V+ RR V++ ++ D+ KI Sbjct: 434 QFGYIITVDVGGGVGRDDSVIVISKVWGEAQWGERARRVEVVDIPLCKNRDDILELFAKI 493 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAV---DLEFCRNRRTELH 417 + L+ +Y +++D N G YL+ G V + + + N+R+ + Sbjct: 494 NELLLQYPNANLVVDDNGAGKGLGQYLKKQGIFYVPVYWGSQCFSNDNRKEFTNKRSLAY 553 Query: 418 VKMADWLEFAS-----LINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYS 469 V A + ++ + L + + + I SK R G KS D Sbjct: 554 VGFARAVASGRFKMKTKKHYVKIKDQLIHIP-YRFDDFARYKILSKDEMRRMGIKSPDLG 612 Query: 470 DGLMYTFAENP 480 D + F EN Sbjct: 613 DAFAFLFLENV 623 >gi|256392042|ref|YP_003113606.1| hypothetical protein Caci_2856 [Catenulispora acidiphila DSM 44928] gi|256358268|gb|ACU71765.1| conserved hypothetical protein [Catenulispora acidiphila DSM 44928] Length = 484 Score = 295 bits (755), Expect = 1e-77, Method: Composition-based stats. Identities = 88/479 (18%), Positives = 164/479 (34%), Gaps = 58/479 (12%) Query: 47 GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106 + P W + + + + A+ + G GK+ + + L W + T P Sbjct: 24 YLADPARWVDDKLGEYLWSRQVDIATSVRDQRLTAVQSCHGTGKSFVASRLTAWWLDTHP 83 Query: 107 --GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI 164 V+ A + Q+K LWAE++K + + ++ W D + G Sbjct: 84 PGEAFVVTTAPTGDQVKAILWAEINKAFAKAEARG--TPLPGRINETDWKYDKFLVAFG- 140 Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFW 224 R S+ P F G H Y + +I DEA G L T + Sbjct: 141 ---------RKPSDYNPHAFQGIHAKYVL-VILDEACGISKQFWTAALAIATGVHCRI-- 188 Query: 225 IMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE--------------GIDPSFHEGII 270 + NP F ++ W +I R + ++ + Sbjct: 189 LAIGNPDDPGSHFAQVCKSDR--WNMIKIAARDTPNFTGEEVPDDLADMLVSQAYVLDMA 246 Query: 271 ARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP----CPDPYAPLIMGCDIAE 326 +G +S + +V +FP D + L+ + A REP PD P+ +G D+ Sbjct: 247 EEFGPESPIYLSKVDAEFPSDASDGVVRLSKL-MACTREPVHPYAPDRLVPVELGVDLGA 305 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY 386 GGD T + RRG + + D + I + + + +D+ G Sbjct: 306 -GGDETCIRERRGIAAGREWRNREKDSEKVVDHIVRAIRETGATKVKVDSIGIGWGIVGS 364 Query: 387 LEMLG------YHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL-EFAS-------LINH 432 L+ V V + + E R+++ ++ L E + Sbjct: 365 LQARRKQGLHTAEVVGVNVSEASTQPEKYARLRSQIWWEVGRKLSEDGGWDLSQLDTTDR 424 Query: 433 SGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN-PPRSDMDF 487 L+ L + K + + +G + +E K + + +S D +D L+ F P+ + Sbjct: 425 DRLVSQLTAPK-YDLDASGRIVVEKKEETKKRIGRSPDNADALLLAFYTPSVPKPGIRV 482 >gi|184158505|ref|YP_001846844.1| hypothetical protein ACICU_02185 [Acinetobacter baumannii ACICU] gi|183210099|gb|ACC57497.1| hypothetical protein ACICU_02185 [Acinetobacter baumannii ACICU] Length = 663 Score = 295 bits (755), Expect = 1e-77, Method: Composition-based stats. Identities = 87/431 (20%), Positives = 153/431 (35%), Gaps = 51/431 (11%) Query: 86 RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145 GKT + LW + ++ A QLK +W E+S + Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLAR 256 Query: 146 LSLHPAPWYSDVL-----HCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 L P W +D + + + + +T + +P G+H M + DEA Sbjct: 257 LKQGPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEA 315 Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD----WKRFQIDTR 256 SG D + G LT + +MTS P R +G FYE +K W + Sbjct: 316 SGVDDAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGE 373 Query: 257 TVEGIDPSFHEGIIARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315 + + +YG D ++ V G+FP + I EE D + Sbjct: 374 ESPLVSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDH 433 Query: 316 A-PLIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKI 360 ++ D+ G D++V+V+ RR V++ ++ D+ KI Sbjct: 434 QFGYVITVDVGGGVGRDDSVIVVSKVWGESQWGERARRVEVVDIPLCKNRDDILELFAKI 493 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAV---DLEFCRNRRTELH 417 + L+ +Y +++D N G YL+ G V + + + N+R+ + Sbjct: 494 NELLLQYPNANLVVDDNGAGKGLGQYLKKQGIFYVPVYWGSQCFSNDNRKEFTNKRSLAY 553 Query: 418 VKMADWLEFAS-----LINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYS 469 V + + ++ + L + + + I SK + G KS D Sbjct: 554 VGLQRAIASGRFKIKTKKHNVKIKDQLIHVP-YRFDDFARYKILSKDEMKRMGIKSPDIG 612 Query: 470 DGLMYTFAENP 480 D + F EN Sbjct: 613 DAFAFLFLENV 623 >gi|213156231|ref|YP_002318651.1| phage terminase [Acinetobacter baumannii AB0057] gi|301346399|ref|ZP_07227140.1| phage terminase [Acinetobacter baumannii AB056] gi|301594275|ref|ZP_07239283.1| phage terminase [Acinetobacter baumannii AB059] gi|213055391|gb|ACJ40293.1| phage terminase [Acinetobacter baumannii AB0057] Length = 663 Score = 295 bits (754), Expect = 1e-77, Method: Composition-based stats. Identities = 88/431 (20%), Positives = 156/431 (36%), Gaps = 51/431 (11%) Query: 86 RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145 GKT + LW + ++ A QLK +W E+S + Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLAR 256 Query: 146 LSLHPAPWYSDVL-----HCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 L P W +D + + + + +T + +P G+H M + DEA Sbjct: 257 LKQGPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEA 315 Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD----WKRFQIDTR 256 SG D + G LT + +MTS P R +G FYE +K W + Sbjct: 316 SGVDDAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGE 373 Query: 257 TVEGIDPSFHEGIIARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315 + + +YG D ++ V G+FP + I EE D + Sbjct: 374 ESPLVSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDH 433 Query: 316 A-PLIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKI 360 ++ D+ G D++V+V+ RR V++ ++ D+ KI Sbjct: 434 QFGYVITVDVGGGVGRDDSVIVVSKVWGEAQWGERARRVEVVDIPLCKNRDDILELFAKI 493 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAV---DLEFCRNRRTELH 417 + L+ +Y +++D N G YL+ G V + + + N+R+ + Sbjct: 494 NELLLQYPNANLVVDDNGAGKGLGQYLKKQGIFYVPVYWGSQCFSNDNRKEFTNKRSLAY 553 Query: 418 VKMADWL-----EFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYS 469 V +A + + + ++ + L + + + I SK + G KS D Sbjct: 554 VGLARAIANGRFKIKTKKHNVKIKDQLIHVP-YRFDDFARYKILSKDEMKRMGIKSPDIG 612 Query: 470 DGLMYTFAENP 480 D + F EN Sbjct: 613 DAFAFLFLENV 623 >gi|332852816|ref|ZP_08434408.1| intein splicing region-containing protein [Acinetobacter baumannii 6013150] gi|332871045|ref|ZP_08439658.1| intein splicing region-containing protein [Acinetobacter baumannii 6013113] gi|332729027|gb|EGJ60377.1| intein splicing region-containing protein [Acinetobacter baumannii 6013150] gi|332731805|gb|EGJ63085.1| intein splicing region-containing protein [Acinetobacter baumannii 6013113] Length = 663 Score = 295 bits (754), Expect = 2e-77, Method: Composition-based stats. Identities = 88/431 (20%), Positives = 154/431 (35%), Gaps = 51/431 (11%) Query: 86 RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145 GKT + LW + ++ A QLK +W E+S + Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLAR 256 Query: 146 LSLHPAPWYSDVL-----HCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 L P W +D + + + + +T + +P G+H M + DEA Sbjct: 257 LKQGPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEA 315 Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD----WKRFQIDTR 256 SG D + G LT + +MTS P R +G FYE +K W + Sbjct: 316 SGVDDAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGE 373 Query: 257 TVEGIDPSFHEGIIARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315 + + +YG D ++ V G+FP + I EE D + Sbjct: 374 ESPLVSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDH 433 Query: 316 A-PLIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKI 360 ++ D+ G D++V+V+ RR V++ ++ D+ KI Sbjct: 434 QFGYVITVDVGGGVGRDDSVIVVSKVWGEAQWGERARRVEVVDIPLCKNRDDILELFAKI 493 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAV---DLEFCRNRRTELH 417 + L+ +Y +++D N G YL+ G V + + + N+R+ + Sbjct: 494 NELLLQYPNANLVVDDNGAGKGLGQYLKKQGIFYVPVYWGSQCFSNDNRKEFTNKRSLAY 553 Query: 418 VKMADWLEFAS-----LINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYS 469 V +A + ++ + L + + + I SK + G KS D Sbjct: 554 VGLARAIASGRFKIKTKKHNVKIKDQLIHVP-YRFDDFARYKILSKDEMKRMGIKSPDIG 612 Query: 470 DGLMYTFAENP 480 D + F EN Sbjct: 613 DAFAFLFLENV 623 >gi|260551382|ref|ZP_05825582.1| phage terminase [Acinetobacter sp. RUH2624] gi|260405545|gb|EEW99037.1| phage terminase [Acinetobacter sp. RUH2624] Length = 663 Score = 295 bits (754), Expect = 2e-77, Method: Composition-based stats. Identities = 88/431 (20%), Positives = 154/431 (35%), Gaps = 51/431 (11%) Query: 86 RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145 GKT + LW + ++ A QLK +W E+S + Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLAR 256 Query: 146 LSLHPAPWYSDVL-----HCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 L P W +D + + + + +T + +P G+H M + DEA Sbjct: 257 LKQGPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEA 315 Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD----WKRFQIDTR 256 SG D + G LT + +MTS P R +G FYE +K W + Sbjct: 316 SGVDDAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGE 373 Query: 257 TVEGIDPSFHEGIIARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315 + + +YG D ++ V G+FP + I EE D + Sbjct: 374 ESPLVSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDH 433 Query: 316 A-PLIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKI 360 ++ D+ G D++V+V+ RR V++ ++ D+ KI Sbjct: 434 QFGYVITVDVGGGVGRDDSVIVVSKVWGEAQWGERARRVEVVDIPLCKNRDDILELFAKI 493 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAV---DLEFCRNRRTELH 417 + L+ +Y +++D N G YL+ G V + + + N+R+ + Sbjct: 494 NELLLQYPNANLVVDDNGAGKGLGQYLKKQGIFYVPVYWGSQCFSNDNRKEFTNKRSLAY 553 Query: 418 VKMADWLEFAS-----LINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYS 469 V +A + ++ + L + + + I SK + G KS D Sbjct: 554 VGLARAIASGRFKIKTKKHNVKIKDQLIHVP-YRFDDFARYKILSKDEMKRMGIKSPDIG 612 Query: 470 DGLMYTFAENP 480 D + F EN Sbjct: 613 DAFAFLFLENV 623 >gi|216906085|ref|YP_002333619.1| terminase [Abalone shriveling syndrome-associated virus] gi|216263178|gb|ACJ72002.1| terminase [Abalone shriveling syndrome-associated virus] Length = 507 Score = 284 bits (727), Expect = 2e-74, Method: Composition-based stats. Identities = 109/470 (23%), Positives = 187/470 (39%), Gaps = 45/470 (9%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQLE ++ + A ++ V A+S G G GKT L+ L +W PG Sbjct: 50 DWQLEIVDYI-AKFFRKNSDEKHFVCAIAVSGGNGTGKTKLSKALNIWRFCCHPGSRQFI 108 Query: 113 LANSETQLK----TTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168 L NSE Q K T L +SK LS + ++S + + +P +D D Sbjct: 109 LTNSERQTKRTGFTMLVRRISKLLSCIA-----ALESSAYYYSPAVADKPEVRTN-DMWD 162 Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTS 228 + + ++ +E G H+ M DE++ D + + T+ F T Sbjct: 163 VTYLLQSSTEA---ALSGLHHPM-MTFSFDESTYFNDHVWQALENMWTQGQVLCF--CTG 216 Query: 229 NPRRLSGKFY-EIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR-------YGLDSDVT 280 NP + ++ +FNK L + TR V ++ AR YG Sbjct: 217 NPSHDNNNYFARLFNKSLHKKDSLWL-TRCVSLLELPLKYRNDARARYIEEHYGKTHPRY 275 Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD-PYAPLIMGCDI--AEEGGDNTVVVLR 337 V GQFP+++ + + I EA+ RE + + P+IMG D+ + G + + +R Sbjct: 276 IASVLGQFPKKNTCNPFDITAISEAMEREVREEFIHHPVIMGIDVSISANNGSASAICVR 335 Query: 338 RGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-----EMLGY 392 G + L ++ K+ L+++ +P +++DAN G + L E Sbjct: 336 EGTAVRVLREYRCH-YTEFRIKLLELLQEIKPTIVVVDANGVGFGLYEELHRTLPETSNV 394 Query: 393 HVYRVLGQKRAVDLEFCRNRRTELHVKMADWL--EFASLINHSGLIQNLKSLKSFIVPNT 450 VY V A ++ +EL K ++W E S+ + + L SL + Sbjct: 395 RVYGVRAHAEAFLKSEYADKMSELAKKSSEWFNNELVSIPKNYQFLNALTSLS--FADAS 452 Query: 451 GELAIESK---RVKGAKSTDYSDGLMYTFAENPPRSDMDFGRCPSYQYEG 497 G++ + K + K S D +D TF + +MD+ + Y Sbjct: 453 GKIKLIGKTDAKKKVDLSMDMADAFFLTFLDGV---EMDWAQGVKDNYLD 499 >gi|134287454|ref|YP_001109621.1| hypothetical protein Bcep1808_7700 [Burkholderia vietnamiensis G4] gi|134131876|gb|ABO60570.1| hypothetical protein Bcep1808_7700 [Burkholderia vietnamiensis G4] Length = 509 Score = 274 bits (701), Expect = 2e-71, Method: Composition-based stats. Identities = 84/457 (18%), Positives = 157/457 (34%), Gaps = 49/457 (10%) Query: 59 MEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET 118 ++ H + ++ + + + ++S+G G GKT+ A + LW + + I A + Sbjct: 34 LKAPTHHQIQMFDSVSKQGSRTSVSSGHGTGKTSGFAIIALWHLLCYYLSNTILTAPKIS 93 Query: 119 QLKTTLWAEVSKWLSLLPNKHW-FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS 177 + +W E + + + N + + + + + ++ + ++ Sbjct: 94 TVSDGVWKEFADLSTKISNGPQSWIWEYFVI-------ESERVYVRGYKLNWFVIAKSAP 146 Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 P+ G H + + + DEASG PD I G LT+ + S P R SG F Sbjct: 147 RGSPENLAGAHRDW-LLWLADEASGIPDDNFGVITGSLTDE--RNRMCLASQPTRSSGFF 203 Query: 238 YEIFN----KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI 293 YE + W ++ + F +Y + +++V G+FP+ Sbjct: 204 YETHHALSRAEGGPWNNLVFNSEFSPIVSAKFIAEKKLQYTEEE--YQIKVQGRFPENSS 261 Query: 294 DSFIPLNIIEEALNREPC-PDPYAPLIMGCDIAEEG-GDNTVV----VLRRGPV------ 341 + IE + R PD + ++ D+ G D TV+ V+ RG Sbjct: 262 KYLVGPQAIEACVGRTVIKPDEHWGWLLPVDVGGGGWRDETVMPALHVIGRGEYGMDARR 321 Query: 342 ---IEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYR-V 397 I + D + I + +IDA G C L++ G+ YR V Sbjct: 322 AQLISVPLHSNTQDPAQLHGVIVHAARERSNATAMIDAGGMGLIVCKQLDLDGFSQYRKV 381 Query: 398 LGQKRAVDLEF---CRNRRTELHVKMADWLEFA--SLINH------SGLIQNLKSLKSFI 446 E+ N+R + A + + L++ + F Sbjct: 382 NWGNPNFAKEYKDRYVNQRAQACCGFARAITEGRFGINPDVPKSFVKKLVKQGSRIPYFW 441 Query: 447 VPNTGELAIESKRVK----GAKSTDYSDGLMYTFAEN 479 I K S D D L + F E+ Sbjct: 442 -DEKARRQIMKKEDMREKENLPSPDVFDALSFAFLED 477 >gi|228924410|ref|ZP_04087639.1| hypothetical protein bthur0011_53510 [Bacillus thuringiensis serovar huazhongensis BGSC 4BD1] gi|228835241|gb|EEM80653.1| hypothetical protein bthur0011_53510 [Bacillus thuringiensis serovar huazhongensis BGSC 4BD1] Length = 293 Score = 274 bits (701), Expect = 2e-71, Method: Composition-based stats. Identities = 77/283 (27%), Positives = 124/283 (43%), Gaps = 30/283 (10%) Query: 225 IMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV 284 + NP R SG FY+ N+ D +K ++ + E + +YG SDV RV V Sbjct: 2 FLCGNPTRTSGVFYDSHNRDRDLYKIHKVSSLDSPRTSKDNIEVLKKKYGEGSDVWRVRV 61 Query: 285 CGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEH 344 G+FP+ + D+FIPL I+E+A + + P L +G D+A G D TV+ R G + Sbjct: 62 LGEFPKAEADAFIPLEIVEQAASCKVEPT-GETLDLGVDVARFGDDETVIAPRIGNKVFK 120 Query: 345 LFDWSKTDLRTTNNKISGLVEKYRPDA-------IIIDANNTGARTCDYL------EMLG 391 L + K D T + L ++Y I +D + G D L E L Sbjct: 121 LLNHYKQDTMETAGHVLKLAKEYMAKYKQLKRVDIKVDDSGVGGGVTDRLKEVIKSERLP 180 Query: 392 YHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE------------FASLINHSGLIQNL 439 + VY V+ + +D E N E + D LE + N +I Sbjct: 181 FKVYPVVNNGKPLDDEHYDNAGAEGWAVVRDLLEENMKAFIQGEEPTMEIPNDEKMISQF 240 Query: 440 KSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479 S K + + + G++A+E K + +G +S D +D ++ F + Sbjct: 241 SSRK-YRITSRGKIALERKEEMKKRGLQSPDRADAIVLAFYKP 282 >gi|226227228|ref|YP_002761334.1| hypothetical protein GAU_1822 [Gemmatimonas aurantiaca T-27] gi|226090419|dbj|BAH38864.1| hypothetical protein [Gemmatimonas aurantiaca T-27] Length = 549 Score = 273 bits (697), Expect = 6e-71, Method: Composition-based stats. Identities = 106/544 (19%), Positives = 177/544 (32%), Gaps = 91/544 (16%) Query: 13 QKLFDLMWSDEIKLSFSNFVLHFFP----WGEKGTPLEGFSAPRSWQLEFMEVVDAHCLN 68 + D L ++ L W L W L Sbjct: 10 SLVIDHSAYRHDPLGWAEVALGVSRETLLW-----SLFDAYGTHEW------DGTPDPLA 58 Query: 69 SVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEV 128 +V + A+++G G GKT L A L+LW ++ P +A Q + +W EV Sbjct: 59 TVLEAIAKNQWVAVASGTGTGKTFLEAVLLLWWIAVEPDSIATTVATKADQQEKGIWREV 118 Query: 129 SK-WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 ++ W E+ +L + PW D T EE G Sbjct: 119 ARHWPRFQACFPEAELTTLRIRMEPWRGDAWGA-------WGITAAPKAGEESSSAVQGL 171 Query: 188 HNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPR---RLSGKFYEIFNKP 244 H + I+ DE G P + ++ T NP G+F E Sbjct: 172 HAKR-LLILVDETPGVPQPVMTALVNTATGE--ENVIAAFGNPDYQADPLGQFAET---- 224 Query: 245 LDDWKRFQIDTRTVEGI-----------DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI 293 +I + +YG++S V + V G P+Q Sbjct: 225 -KRVTAIRISALDHPNVVLGVERIPGAATRLSIATREDKYGVESGVYQSRVRGIAPEQSA 283 Query: 294 DSFIPLNIIEEALNREPCPDPYA----PLIMGCDIAE-EGGDNTVVVLRRGPVIEHLFDW 348 + I L A +R A P +G D+A+ E GD V + +G + + Sbjct: 284 SALIHLAWCVAAADRAESVQHAALALGPKALGVDVAQSENGDKAAVAMGQGARLLSVIAK 343 Query: 349 SKTDLRTTNNKISGLVEKYR--PDAIIIDANNTGARTCDYL------EMLGYHVYRVLGQ 400 + + ++ L+ P+ + +D GA T ++L E G V R G Sbjct: 344 ACPNATKLGAEVWQLMRDEGIVPEYVGVDPIGVGAATVNHLDGECEKENAGRSVVRCSGG 403 Query: 401 KRAV----------------DLEFCRNRRTELHVKMADWLEFA--SLINHSGLIQNLKSL 442 +A+ D +N R ++ ++ + L +L L + L ++ Sbjct: 404 AKAMEASSRAADGSAMEWLADANKFKNLRAQMWWQLREDLRNGLIALPRDRELFRELTTV 463 Query: 443 KSFIVPNTGELA-IESK---RVKGAKSTDYSDGLMY-------TFAENPPRSDMDFGRCP 491 + G + +ESK R + +S D +D ++Y T PP D R P Sbjct: 464 Q---FDEDGGIVTLESKDDIRKRLGRSPDRADAVVYWNWVRPRTRVNQPPPEGFDVAR-P 519 Query: 492 SYQY 495 Y Sbjct: 520 IRNY 523 >gi|159897183|ref|YP_001543430.1| hypothetical protein Haur_0654 [Herpetosiphon aurantiacus ATCC 23779] gi|159890222|gb|ABX03302.1| conserved hypothetical protein [Herpetosiphon aurantiacus ATCC 23779] Length = 472 Score = 258 bits (660), Expect = 1e-66, Method: Composition-based stats. Identities = 100/490 (20%), Positives = 169/490 (34%), Gaps = 82/490 (16%) Query: 45 LEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEV-FKGAISAGRGIGKTTLNAWLVLWLMS 103 L P ++ E + V + ++ + A +GKT L LV W Sbjct: 2 LPYAHDPVAYAREVLGEVWWTKQELIARSLLTPPYRTLVKACHKVGKTHLGGGLVNWWYD 61 Query: 104 TRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLG 163 + V+ A ++ Q++ LW EV + + +S L P + Sbjct: 62 SFDPGLVLTTAPTDRQVRDLLWKEVR--MQRRGRAGFTGPKSPRLESTPDH--------- 110 Query: 164 IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRF 223 ++ + D+F GHH+ + + I DEA G V E A Sbjct: 111 --------FAHGFTAKDGDSFQGHHSPHTL-FIFDEAVGVASVFWETAESMFNEGGA--- 158 Query: 224 WIMTSNPR---------RLSGKFYEI----------------FNKPLDDWKRF-QIDT-- 255 W+ NP LSG ++ I P R ++DT Sbjct: 159 WLAIFNPTDTSSQAYAEELSGGWHVISMSVLEHPNILAELQGLPPPFPSAIRLSRVDTLL 218 Query: 256 ----RTVEGIDPSFHEGIIAR--YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309 R + +P I R + + + G++P Q ++ + A + Sbjct: 219 KKWCRALSPEEPKRATDIHWRDAWYRPGPIAEARLLGRWPSQATNNVWSDGAFQVAESL- 277 Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY-- 367 P P +GCD+A G D T + +RRG + + T ++ L +Y Sbjct: 278 LLPASDEPCELGCDVARYGDDFTEIHVRRGGHSLYHEAANGWSTVETAGRLKQLANEYGR 337 Query: 368 ------RPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421 R A+ ID + G D GY V G + A D E NRR+EL +A Sbjct: 338 RCGVDGRAVAVKIDDDGIGGGVVDL--ADGYTFLGVSGARTAYDPEKYPNRRSELWFSVA 395 Query: 422 D-----WLEFASLINHSGLIQNLK---SLKSFIVPNTGELAIESK---RVKGAKSTDYSD 470 + L F +L + + L+ ++ + G +E K + + +S D D Sbjct: 396 ERAMEQRLSFVAL--DAETRRELRRQAMAPTWKQDSQGRRVVEPKADTKKRIKRSPDGMD 453 Query: 471 GLMYTFAENP 480 + +A P Sbjct: 454 AVNLAYAPAP 463 >gi|322656964|gb|EFY53248.1| DNA packaging protein [Salmonella enterica subsp. enterica serovar Montevideo str. CASC_09SCPH15965] Length = 411 Score = 255 bits (652), Expect = 1e-65, Method: Composition-based stats. Identities = 77/327 (23%), Positives = 132/327 (40%), Gaps = 30/327 (9%) Query: 72 NPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKW 131 + + +++G G GK++L A L+L M P VI +AN Q+KT ++ V ++ Sbjct: 49 SVQETGSRTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQY 108 Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191 + +H + L +Y GI + +C+ Y + G H + Sbjct: 109 WANAVKRHGWLQTYFVLSDTMFYE---RSRKGI----WEVLCKGYRLGNEEALAGEHAAH 161 Query: 192 GMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------P 244 + +I DEASG D + G LTE + +M S P R SG FY+ + P Sbjct: 162 -LLLILDEASGISDKAIGVMTGALTEEDNR--MLMLSQPTRPSGYFYDSHHSQAKTPDNP 218 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303 W +++ + P F + + Y G DS V+V GQFP++ + + + Sbjct: 219 KGIWTAIVLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECD 278 Query: 304 EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL--------RRGPVIEHLFDWSKT-DLR 354 A R+ + + D+ G D +V+ + +R V + + T D Sbjct: 279 RAARRKVLLEKNWGWVATADVG-NGRDKSVLNICKVSGHRDKRRVVNFKVMEMPGTMDPL 337 Query: 355 TTNNKISGLV--EKYRPDAIIIDANNT 379 + I EKY I +DA+ Sbjct: 338 AFADFIYNECTPEKYPNITIAVDADGL 364 >gi|262316909|emb|CBA18135.1| putative terminase B [Paenibacillus phage phiBP] Length = 248 Score = 252 bits (644), Expect = 9e-65, Method: Composition-based stats. Identities = 66/242 (27%), Positives = 104/242 (42%), Gaps = 16/242 (6%) Query: 47 GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106 +P+++ E + SV++ + ++ +G+G+GKT L A + LW + P Sbjct: 23 YRKSPKTFFKEILNFSPDKWQESVSDDIAKYRFVSVRSGQGVGKTALEAAISLWFLCCFP 82 Query: 107 GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166 V+C A + QL LWAE+SKW S P + W ++ Sbjct: 83 FPRVVCTAPTRQQLNDVLWAEISKWQSQSP---------ILKRILKWTKTKIYM--KNYE 131 Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIM 226 + + RT + +P+ G H Y M I DEASG D I I G L+ M Sbjct: 132 ERWFATARTAT--KPENMQGFHEDY-MLFIVDEASGVDDRIMAAIFGTLSGDY--NKLFM 186 Query: 227 TSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCG 286 NP + SG F++ N+ ++ ++ E + A+YG SDV RV V G Sbjct: 187 CGNPTKTSGFFFDSHNRDRAIYRTHRVSCLDSPRTSKENIEMLKAKYGEGSDVWRVRVLG 246 Query: 287 QF 288 +F Sbjct: 247 EF 248 >gi|111222161|ref|YP_712955.1| hypothetical protein FRAAL2741 [Frankia alni ACN14a] gi|111149693|emb|CAJ61385.1| hypothetical protein FRAAL2741 [Frankia alni ACN14a] Length = 535 Score = 247 bits (631), Expect = 3e-63, Method: Composition-based stats. Identities = 92/467 (19%), Positives = 151/467 (32%), Gaps = 59/467 (12%) Query: 47 GFSAPRSWQLEFMEVV-DAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTR 105 P W + + V + N K A+ + GK+ + A V + T Sbjct: 52 YRDEPVRWARDRLGGVHLWSKQQEIINALRVHRKVAVPSCHDAGKSFVAAAAVAHWLDTH 111 Query: 106 PG--ISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLG 163 P I A + Q++ LW E+ + L+ ++ W D + G Sbjct: 112 PPGSAFAITTAPTFPQVRAILWREIRRLSRLM------NPPLGRVNQTEWLIDDDLVAFG 165 Query: 164 IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRF 223 R ++ F G H Y + ++ DEA G P + + T NA Sbjct: 166 ----------RKPADHDEGGFQGIHAQYPL-VVLDEAGGIPQQLWIAADSIATNENARI- 213 Query: 224 WIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE--------------GIDPSFHEGI 269 + NP + F ++ L W I + ++ E Sbjct: 214 -LAIGNPDDPTSYFAQVC--ELPSWHVITIPAAETPAFTGEQIPDDLRQALLSRAWAEEK 270 Query: 270 IARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA---PLIMGCDIAE 326 +G D+ V +V QFP+ I + + + P P + P+ +G D+ Sbjct: 271 RREWGEDNPVYISKVLAQFPKDVAWKVIKASDVAKRRIGRDEPWPASKLRPVCLGVDVG- 329 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY 386 EG D TVV RRG + + I V + IDA G Sbjct: 330 EGRDWTVVRERRGVQAGREWQARTPEPEQAVKLIGQAVLITGAKTVNIDAGGPGWGIAAA 389 Query: 387 LEML-------GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL------EFASLINHS 433 L G V + ++ + E N R EL + L + + + N Sbjct: 390 LRGWLKQHKVRGVAVNPIRFGAKSREPEKYLNMRAELWWGVGRLLSEQGGWDLSVMENAD 449 Query: 434 GLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477 L + + IESK R + +S D +D L+ FA Sbjct: 450 DTTAQLLD-PIWREGAGDRIVIESKEELRKRTGRSPDNADALLLAFA 495 >gi|161789175|ref|YP_001595730.1| PacB [Vibrio sp. 0908] gi|161761461|gb|ABX77106.1| PacB [Vibrio sp. 0908] Length = 572 Score = 246 bits (628), Expect = 7e-63, Method: Composition-based stats. Identities = 81/438 (18%), Positives = 155/438 (35%), Gaps = 38/438 (8%) Query: 64 AHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTT 123 + +N P + ++++G G GK+ L A L L + T P + ANS Q+ Sbjct: 47 FQQIEVINALTPVGARVSVASGHGTGKSHLTAALCLHFIITHPESLCMLTANSLDQVTNV 106 Query: 124 LWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDT 183 +++ + + + + + Q + +Y+ + +T S+ + Sbjct: 107 VFSYIKRCWVKICQRQPWLEQYFVITAKSFYA-------KGYKGVWQIFGKTCSKGNEEG 159 Query: 184 FVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK 243 G H M ++ DEASG D + G LTE N ++ S R +G F + + Sbjct: 160 LAGQHRRDYM-VVVDEASGVSDRAFEVLRGALTEDN--NKMLLISQFTRPTGHFADSQME 216 Query: 244 --PLDDWKRFQIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLN 300 + +++ ++ F Y G+ S + V G P I + Sbjct: 217 LAEQGLYTAITLNSEMSPFVNLKFIREKRIEYGGVTSPEYGIRVLGVCPDDASGFLISRS 276 Query: 301 IIEEALNREPCPDPYAPLIMGCDIA-EEGGDNTVVVL---------RRGPVIEHLFDWSK 350 ++++ + D+A EG D++V+ + R+ V++ + + Sbjct: 277 LVDKGFEAVIEFADEWGWVAVADVAGGEGRDSSVLKIGKVCGFGSERQVEVVKAIEAPAD 336 Query: 351 TDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD---LE 407 D I Y ++ IDA+ G T E LG +V R+ + + Sbjct: 337 MDGVQFARFIHQETAGYTNISVGIDADGYGLTTAQECEKLGVNVTRIHWGRPPHANSVKQ 396 Query: 408 FCRNRRTELHVKMADWLEFASLINH--------SGLIQNLKSLKSFIVPNTGELAIESKR 459 + V + + L L H L + + + G I SK+ Sbjct: 397 RFPKEKDFACVMVKEALGTGRLKLHRGETKQFEKKLQKQFVKIP-YEFDELGRWRIFSKK 455 Query: 460 V---KGAKSTDYSDGLMY 474 +G KS D D + Sbjct: 456 QLRSEGIKSPDIFDATAF 473 >gi|257459276|ref|ZP_05624390.1| phosphatase, Ppx/GppA family [Campylobacter gracilis RM3268] gi|257443289|gb|EEV18418.1| phosphatase, Ppx/GppA family [Campylobacter gracilis RM3268] Length = 431 Score = 245 bits (625), Expect = 2e-62, Method: Composition-based stats. Identities = 76/318 (23%), Positives = 131/318 (41%), Gaps = 18/318 (5%) Query: 177 SEERPDTFVGHHNTYGMAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLS 234 S ERP+ G +I +EA + + + N N + P+ + Sbjct: 104 SAERPENIEGFGYD---TVILNEAGIILKDPYLWDNAISPMLLDNPNSRAFIGGVPKGKN 160 Query: 235 GKFYEIFNKPL---DDWKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTRVEVCGQFPQ 290 KF+++ + + W+ FQ + + + ++A G DSDV R E+ G+F Sbjct: 161 -KFFDLAQRGMRNEKGWRNFQFSSYDNPLLQKEEIDRLVAELGGADSDVARQEIFGEFLD 219 Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK 350 +S L IE A ++ D AP+I D+A EG D +V+ R+G +E L + Sbjct: 220 TTSNSVFSLAAIEAAFRKQRYFDAGAPVIWALDVAREGDDESVLCKRQGDSVEPLKPYRI 279 Query: 351 TDLRTTNNKISGLVEK--YRPDAIIIDANNTGARTCDYLEMLGYH--VYRVLGQKRAVDL 406 +I G E+ +P AI ID GA D L LG V G +A D Sbjct: 280 ASTSELAREIYGEYERTDLKPHAIYIDTIGVGAGVFDTLCDLGLRGIVREAKGSFKASDE 339 Query: 407 EFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGA 463 N+R E++ + + L ++ L + L+++ + + K + + Sbjct: 340 RKYANKRAEMYFNLREKLPLLAIAPDEELKRQLQTIAFY-FDKKERYLLMPKEGIKKEYG 398 Query: 464 KSTDYSDGLMYTFAENPP 481 +S D +D L +F + P Sbjct: 399 RSPDRADALAMSFFDLCP 416 >gi|292670767|ref|ZP_06604193.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] gi|292647388|gb|EFF65360.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] Length = 442 Score = 243 bits (621), Expect = 5e-62, Method: Composition-based stats. Identities = 80/376 (21%), Positives = 147/376 (39%), Gaps = 28/376 (7%) Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 +A Q K W + + + +P + ++ + Y ++ ++ Sbjct: 63 VAPYRNQAKRVAWEYLKYYTNPIPGR--------VVNESELYIEL----PTRHARSPGAR 110 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINL-GILGFLTERNANRFWIMTSNPR 231 + PD G + +I DE + + I L +R + + P+ Sbjct: 111 LYIIGADHPDALRGIYLDG---VILDEYADIKPELWGGVIRPALADRQG--WAVFIGTPK 165 Query: 232 RLSGKFYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFP 289 + +FYE++ W T + + + A+ + R E+ F Sbjct: 166 GQN-QFYEMYQHAEKSAGWYSCIYRTDETGVLPAEELKDMQAQ--MTEMEIRQELLCDFT 222 Query: 290 QQDIDSFIPLNIIEEALNREPCPDP--YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347 D IP++++ A NR D P+I+G D+A G D TV+ +R+G ++ + Sbjct: 223 ASASDVVIPIDLVTAAANRLLKDDDVLGQPVILGVDVARFGDDRTVLCVRQGLWLKEVRT 282 Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLE 407 ++ T +++ + ++ P A IDA GA D L L Y V V + A+D Sbjct: 283 FTGLSTMETASRVIDCINQHHPHATFIDAGAMGAGVIDRLRQLRYQVSEVNFGEMAMDAA 342 Query: 408 FCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAK 464 N R E++ K WLE I + ++ S + TG + +E K + + K Sbjct: 343 RYANIRAEMYFKCRAWLEAGGAIPQNAELKTELSTVEYKFNPTGRIILEPKDKLKERTGK 402 Query: 465 STDYSDGLMYTFAENP 480 S D +DG + TFA Sbjct: 403 SPDLADGFVLTFARPV 418 >gi|298387330|ref|ZP_06996883.1| conserved hypothetical protein [Bacteroides sp. 1_1_14] gi|298259999|gb|EFI02870.1| conserved hypothetical protein [Bacteroides sp. 1_1_14] Length = 500 Score = 242 bits (618), Expect = 8e-62, Method: Composition-based stats. Identities = 93/491 (18%), Positives = 160/491 (32%), Gaps = 88/491 (17%) Query: 53 SWQL---EFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP--- 106 W + + +V + A+++G GK + A L M P Sbjct: 15 DWCAFASDVLRANLDEEQKAVLRSVQKNPMTALASGTSRGKDFVAACAALCFMYLTPEWD 74 Query: 107 -------GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLH 159 + A S+ Q++ + EV + ++ Sbjct: 75 DDGNLIRNTKIALSAPSQRQVENIMTPEVRRLFRNAGILP---------------GRLVA 119 Query: 160 CSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERN 219 + D + Y + + + G H M +I EASG + I I G L Sbjct: 120 NDIRTDYEEYFLTGFKADNKNQEVWSGFHAANVMFVIT-EASGVSETIFSAIEGNL---Q 175 Query: 220 ANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDP-----------SFHEG 268 N ++ NP +G + +F++D+ + + E Sbjct: 176 GNSRLLLVFNPNITTGYAANAMKSDR--FAKFRLDSLNATNVTAKREIIPGQVNYEWVED 233 Query: 269 IIARY----------------------GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 + + +D+ R++V G FP+ D IP IE A Sbjct: 234 KVKHWCTPITKEEYNEGEGDFLFENNLYRPNDLFRIKVRGMFPKVAEDVLIPYEWIEIAN 293 Query: 307 NREPCPDPYAP---LIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL 363 R PY P +G D+A G DN+V R G + + ++ + ++ + G Sbjct: 294 KRWQENHPYRPRKSCKLGVDVAGMGRDNSVFCPRYGNYVSQFDVF-QSAGKASHMHVVGK 352 Query: 364 VEKYR---PDAIIIDANNTGARTCDYLEMLG----YHVYRVLGQKRAVDLE---FCRNRR 413 Y+ D I ID GA L G + V G K D+ N R Sbjct: 353 ALSYKRTDRDIIFIDTIGEGAGVYSRLVEQGIRNIFSVKNSQGAKGLHDITGEYSFANMR 412 Query: 414 TELHVKMADWLE----FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKST 466 L+ + DWL+ F ++ + + + G++ IE K + + +S Sbjct: 413 AYLYWALRDWLDPKNNFFPMLPPCDQFTEEATETKWKFRSDGKILIEPKEEIKKRIKRSP 472 Query: 467 DYSDGLMYTFA 477 DY D L TF Sbjct: 473 DYMDALSETFY 483 >gi|225155389|ref|ZP_03723881.1| hypothetical protein ObacDRAFT_9437 [Opitutaceae bacterium TAV2] gi|224803845|gb|EEG22076.1| hypothetical protein ObacDRAFT_9437 [Opitutaceae bacterium TAV2] Length = 479 Score = 241 bits (616), Expect = 1e-61, Method: Composition-based stats. Identities = 92/451 (20%), Positives = 166/451 (36%), Gaps = 48/451 (10%) Query: 42 GTPLEGFSA--PRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTT-LNAWLV 98 GTP P ++ + +++ + + G GKT+ + L Sbjct: 12 GTPAPHAEKLNPITFAVAVLKLRIYSWQAKIMASVWSGKPTVAATPNGAGKTSVIIVALA 71 Query: 99 LWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVL 158 L L+ PG +V+ + + + ++A SL++H A + + Sbjct: 72 LTLLHEFPGATVVLTSATYRAVCDQIFA------------------SLAVHQAKFSAWKW 113 Query: 159 HCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG--MAIINDEASGTPDVINLGILGFLT 216 + + D + + ++ +R F G H G + II DEA D I + Sbjct: 114 NDTEINDGQGGRII--GFATDRGGRFEGFHAYPGRPLLIILDEAKSIADDIFVAA----- 166 Query: 217 ERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLD 276 +R + S+ L G+F++ F++ + +FQ I P F E + A+YG D Sbjct: 167 DRCQPTMLLYISSWGGLFGRFHDAFSQDR--FAQFQAGIADCPHITPEFIEAMRAQYGED 224 Query: 277 SDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL 336 SD+ R + GQ P+ + F+ + E P + CD AE D V+ Sbjct: 225 SDIYRSMILGQRPKGNETGFVVPFVDYERCESNPPVWQEGTKQVFCDFAET-SDECVIAK 283 Query: 337 RRGPVIEHLFDW-SKTDLRTTNNKISGLVEKYRPDAIII--DANNTGARTCDYLEMLGYH 393 R G + + W + ++ G + + + + +I DA+ TG L + G Sbjct: 284 RDGNRLSIVDAWIPDGNTAGITDRFEGHLRRLQNEGFVIRGDADGTGHGYITALSLRGIK 343 Query: 394 VYRVLGQKRAVDLEFCRNRRTELHVKMADWLE--FASLINHSGLIQNLKSLKSFIV---- 447 + V +D + N E A ++ F L + L + L S + Sbjct: 344 ISGVKNNDAPMDNHYF-NLAAEHWWTFAKKVKSNFWILPHDEVLKRQLCSREEVYRKVGD 402 Query: 448 -----PNTGELAIESKRVKGAKSTDYSDGLM 473 G L + K KS D +D L+ Sbjct: 403 KKVYGREDGRLQLMPKSRLSTKSPDRADALV 433 >gi|283956317|ref|ZP_06373797.1| terminase B protein, putative [Campylobacter jejuni subsp. jejuni 1336] gi|283792037|gb|EFC30826.1| terminase B protein, putative [Campylobacter jejuni subsp. jejuni 1336] Length = 430 Score = 241 bits (615), Expect = 2e-61, Method: Composition-based stats. Identities = 74/324 (22%), Positives = 127/324 (39%), Gaps = 21/324 (6%) Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGT-----PDVINLGILGFLTERNANRFW 224 + S ER + G +I +EA + + + + N Sbjct: 96 GAVLHMRSAERSENIEGFGYD---LVILNEAGIILKGSKGEYLWYNAIRPMLLDNPKSRA 152 Query: 225 IMTSNPRRLSGKFYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTR 281 I+ P+ + FYE+ K L D WK FQ + + + +I G DS+V + Sbjct: 153 IIGGVPKGKN-LFYELCRKELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGEDSEVVK 211 Query: 282 VEVCGQFPQQDIDSFIPLNIIEEALNREP--CPDPYAPLIMGCDIAEEGGDNTVVVLRRG 339 E+ G+F L IE A+++ I G D+A G D +V+ R+G Sbjct: 212 QEIYGEFIDSSSAELFALTEIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSVLAKRKG 271 Query: 340 PVIEHLFDWSKTDLRTTNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRV 397 +++ + +S+ N+I + +P I ID G D L G V+ Sbjct: 272 FIVDEIKKYSQLGTMELANRILAEYNQSEDKPKGIFIDTCGLGVGVYDVLLNYGLPVFEA 331 Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIES 457 A E N+R +++ A L+ L+ L ++++ ++ + + G L I S Sbjct: 332 NSANSATSNE-YLNKRAQMYFTFAKNLKHMELVKDEELKKDMRMIE-YEYSDKGLLKIVS 389 Query: 458 K---RVKGAKSTDYSDGLMYTFAE 478 K + KS D SD + TF E Sbjct: 390 KEQLKKNYGKSPDVSDAVALTFFE 413 >gi|153951273|ref|YP_001397540.1| putative terminase B protein [Campylobacter jejuni subsp. doylei 269.97] gi|153951467|ref|YP_001398214.1| putative terminase B protein [Campylobacter jejuni subsp. doylei 269.97] gi|152938719|gb|ABS43460.1| putative terminase B protein [Campylobacter jejuni subsp. doylei 269.97] gi|152938913|gb|ABS43654.1| putative terminase B protein [Campylobacter jejuni subsp. doylei 269.97] Length = 430 Score = 241 bits (614), Expect = 3e-61, Method: Composition-based stats. Identities = 80/325 (24%), Positives = 126/325 (38%), Gaps = 23/325 (7%) Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDV------INLGILGFLTERNANRF 223 + S ER + G +I +EA I L + N Sbjct: 96 GAVLHMRSAERSENIEGFAYD---LVILNEAGIILKDSKGGYLWYNSIRPMLLD-NPKSR 151 Query: 224 WIMTSNPRRLSGKFYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVT 280 I+ P+ + FYE+ K L D WK FQ + + + +I G SDV Sbjct: 152 AIIGGVPKGKN-LFYELCRKELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGESSDVV 210 Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREP--CPDPYAPLIMGCDIAEEGGDNTVVVLRR 338 R E+ G+F L+ IE A+++ I G D+A G D +V+ R+ Sbjct: 211 RQEIYGEFIDSSSAELFSLSGIENAMSKNSFSTQKMQGENIWGLDVARYGDDKSVLAKRK 270 Query: 339 GPVIEHLFDWSKTDLRTTNNKISGLVE--KYRPDAIIIDANNTGARTCDYLEMLGYHVYR 396 G VI+ L +S+ NKI + + +P I ID G D L G V+ Sbjct: 271 GFVIDELKKYSQLGTIELANKILAEYKQSEEKPKGIFIDTCGLGVGVYDVLLNYGLPVFE 330 Query: 397 VLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIE 456 A + N+R +++ A L+ L+ L +++ ++ + + G L I Sbjct: 331 ANSANSATSNQ-YLNKRAQMYFTFAKNLKHMELVKDEELKNDMRRIE-YEYSDKGLLKIV 388 Query: 457 SK---RVKGAKSTDYSDGLMYTFAE 478 SK + KS D SD + TF E Sbjct: 389 SKEQLKKNYGKSPDLSDAVALTFFE 413 >gi|226940459|ref|YP_002795533.1| Terminase large subunit [Laribacter hongkongensis HLHK9] gi|226715386|gb|ACO74524.1| Terminase large subunit [Laribacter hongkongensis HLHK9] Length = 272 Score = 237 bits (605), Expect = 3e-60, Method: Composition-based stats. Identities = 73/265 (27%), Positives = 113/265 (42%), Gaps = 9/265 (3%) Query: 239 EIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298 + + W QID+RTVEG + YG +SD +V V G FP FI Sbjct: 5 KCGRRFRHRWVARQIDSRTVEGTNKEQIAKWAEDYGEESDFFKVRVRGMFPSMSARQFIS 64 Query: 299 LNIIEEALNREPCPD--PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTT 356 + A R P+ YAP I+ D A EG D V+ LR+G L +K D Sbjct: 65 ETDVSAAYGRALRPEQYQYAPKILTVDPAWEGDDEFVIGLRQGLSFRVLHTMAKNDNDLV 124 Query: 357 NNK-ISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTE 415 + I+ ++ DA+ +DA G + +G V ++D C N+R E Sbjct: 125 AAQVIARYEDEEGADAVFVDA-GFGTGIVSAGKSMGRDWTLVWFAGNSMDAG-CLNKRAE 182 Query: 416 LHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRV---KGAKSTDYSDGL 472 + DWL+ I ++++ + G++ IESK+ +G S + +D L Sbjct: 183 MWRDARDWLKSGGAIPDDPVLRDELQAPEIVPRLDGKIQIESKKEMKARGVPSPNRADAL 242 Query: 473 MYTFAENPPRSD-MDFGRCPSYQYE 496 + +FA R D +D R S + E Sbjct: 243 ILSFAYPVTRRDPLDALRNHSERRE 267 >gi|57237579|ref|YP_178593.1| terminase B protein, putative [Campylobacter jejuni RM1221] gi|57166383|gb|AAW35162.1| terminase B protein, putative [Campylobacter jejuni RM1221] Length = 430 Score = 237 bits (604), Expect = 4e-60, Method: Composition-based stats. Identities = 74/324 (22%), Positives = 124/324 (38%), Gaps = 21/324 (6%) Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGT-----PDVINLGILGFLTERNANRFW 224 + S ER + G +I +EA + + + + N Sbjct: 96 GAVLHMRSAERSENIEGFGYD---LVILNEAGIILKGSKGEYLWYNAIRPMLLDNPKSRA 152 Query: 225 IMTSNPRRLSGKFYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTR 281 I+ P+ + FYE+ K L D WK FQ + + + +I G S+V + Sbjct: 153 IIGGVPKGKN-LFYELCRKELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGEGSEVVK 211 Query: 282 VEVCGQFPQQDIDSFIPLNIIEEALNREP--CPDPYAPLIMGCDIAEEGGDNTVVVLRRG 339 E+ G+F L+ IE A+++ I G D+A G D + + R+G Sbjct: 212 QEIYGEFIDSSSAELFSLSEIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSALAKRKG 271 Query: 340 PVIEHLFDWSKTDLRTTNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRV 397 VI + +S+ NKI + +P I ID G D L G V+ Sbjct: 272 FVIYEIKKYSQLGTIELANKILAEYNQSEDKPKGIFIDTCGLGVGVYDVLLNYGLPVFEA 331 Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIES 457 A E N+R +++ L+ L+ L ++++ ++ + + G L I S Sbjct: 332 NSANSATSNE-YLNKRAQMYFTFTKNLKHMELVKDEELKKDMRMIE-YEYSDKGLLKIVS 389 Query: 458 K---RVKGAKSTDYSDGLMYTFAE 478 K + KS D SD + TF E Sbjct: 390 KEQLKKNYGKSPDVSDAVALTFFE 413 >gi|315929403|gb|EFV08605.1| phosphatase, Ppx/GppA family [Campylobacter jejuni subsp. jejuni 305] Length = 430 Score = 236 bits (603), Expect = 5e-60, Method: Composition-based stats. Identities = 75/324 (23%), Positives = 124/324 (38%), Gaps = 21/324 (6%) Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGT-----PDVINLGILGFLTERNANRFW 224 + S ER + G +I +EA + + + + N Sbjct: 96 GAVLHMRSAERSENIEGFGYD---LVILNEAGIILKGSKGEYLWYNAIRPMLLDNPKSRA 152 Query: 225 IMTSNPRRLSGKFYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTR 281 I+ P+ + FYE+ K L D WK FQ + + + +I G S+V + Sbjct: 153 IIGGVPKGKN-LFYELCRKELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGEGSEVVK 211 Query: 282 VEVCGQFPQQDIDSFIPLNIIEEALNREP--CPDPYAPLIMGCDIAEEGGDNTVVVLRRG 339 E+ G+F L+ IE A+++ I G D+A G D + + R+G Sbjct: 212 QEIYGEFIDSSSAELFSLSEIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSALAKRKG 271 Query: 340 PVIEHLFDWSKTDLRTTNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRV 397 VI + +S+ NKI + +P I ID G D L G V+ Sbjct: 272 FVIYEIKKYSQLGTIELANKILAEYNQSEDKPKGIFIDTCGLGVGVYDVLLNYGLPVFEA 331 Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIES 457 A E N+R +++ A L+ L L ++++ ++ + + G L I S Sbjct: 332 NSANSATSNE-YLNKRAQMYFTFAKNLKHMELFKDEELKKDMRMIE-YEYSDKGLLKIVS 389 Query: 458 K---RVKGAKSTDYSDGLMYTFAE 478 K + KS D SD + TF E Sbjct: 390 KEYLKKNYGKSPDVSDAVALTFFE 413 >gi|189460514|ref|ZP_03009299.1| hypothetical protein BACCOP_01155 [Bacteroides coprocola DSM 17136] gi|189432758|gb|EDV01743.1| hypothetical protein BACCOP_01155 [Bacteroides coprocola DSM 17136] Length = 556 Score = 231 bits (590), Expect = 1e-58, Method: Composition-based stats. Identities = 90/510 (17%), Positives = 161/510 (31%), Gaps = 93/510 (18%) Query: 56 LEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP--------- 106 E + V + + + ++++G GK + A + + P Sbjct: 57 REALGVTLDKEQQEILSSVQYNRRTSVASGTARGKDFVAACAAICFLYLTPRWRKNSLGE 116 Query: 107 -----GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCS 161 V A ++ Q+K + E+S+ + + + L+ + +D Sbjct: 117 IELVENTKVALTAPTDRQVKNIMMPEISRLFNRAKARGVELIGKLNAYDIRTNND----- 171 Query: 162 LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNAN 221 + E + + G H + M ++ EA+G D I G L + Sbjct: 172 ------EWFLTGFKADEHNHEAWSGFHAVHTMFVVT-EATGIGDDTFAAIEGNL---QGD 221 Query: 222 RFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHE-------------- 267 ++ NP + G + + D W ++++++ T I Sbjct: 222 SRILLVFNPNKTVGYAAKS--QKGDRWHKYRLNSLTAPNIASKKIIIPGQVDYDWVLDKL 279 Query: 268 -------------------GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308 ++ D+ R +V G FP+ D D+ IP +EEA R Sbjct: 280 ENWCEKISPDEIISEMDDFEFEGQWYRPEDLFRKKVLGLFPKVDEDTLIPRQWLEEAHER 339 Query: 309 EPCPDPYAPL-----IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK---TDLRTTNNKI 360 PL I+G D+A G D T VLRR + + D KI Sbjct: 340 WKQAKGREPLRADLNILGVDVAGMGRDATCYVLRRDNWVASFDTHNSGGVADHMKVAGKI 399 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDY---LEMLGYHVYRVLGQKRAVDL----------- 406 + + ID GA LE +++ + A Sbjct: 400 MVARRQNIGLYVSIDTIGEGAGVYSRCVELEDEPHYILSCKYSESAKTPNGRELSDITGQ 459 Query: 407 EFCRNRRTELHVKMADWL----EFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---R 459 N R L + DWL +++ + F V + G+L IE K + Sbjct: 460 NKFFNMRAYLFWAVRDWLNPRNNTGAMLPPDDKFDEEATEIKFSVKSNGKLYIEPKEDIK 519 Query: 460 VKGAKSTDYSDGLMYTFAENPPRSDMDFGR 489 + +S D D L TF ++ R Sbjct: 520 ERLGRSPDKFDALANTFYPVRYAKPINVNR 549 >gi|154175204|ref|YP_001409090.1| Ppx/GppA family phosphatase [Campylobacter curvus 525.92] gi|112803006|gb|EAU00350.1| phosphatase, Ppx/GppA family [Campylobacter curvus 525.92] Length = 433 Score = 229 bits (584), Expect = 8e-58, Method: Composition-based stats. Identities = 89/458 (19%), Positives = 164/458 (35%), Gaps = 56/458 (12%) Query: 52 RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVI 111 WQ E A I GR G T A + + G ++ Sbjct: 11 TDWQREVFFKNKAKF-------------TTIEKGRRSGFTKGMANACIEWLI--EGKKIL 55 Query: 112 ----CLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSK 167 AN + + E+ + + + H L G Sbjct: 56 WVDTVTANLQRYFERYFVPELKQLPADMWKFH-------------AQDKKLTVGEGYLDM 102 Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT--PDVINLGILGFLTERNANRFWI 225 S ERP+ G +I +EA + + + N Sbjct: 103 R--------SAERPENIEGFGYD---VVILNEAGIILKNSYLWDNAIRPMLLDYPNSRAF 151 Query: 226 MTSNPRRLSGKFYEIFNKPL---DDWKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTR 281 + P+ + +F+++ ++ + DW FQI + + + +IA G +DSDV + Sbjct: 152 IGGVPKGKN-RFFDLASRGMRNEKDWVNFQISSFENPLLRKEEIDELIAELGGVDSDVVK 210 Query: 282 VEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPV 341 E+ G+F ++ PL+ IE A + +P A I G D+A +G D +V+ +R G Sbjct: 211 QEIYGEFLDTTTNALFPLSQIEAAFGKVRAYEPNAVQIWGLDVARDGDDESVLCVREGYH 270 Query: 342 IEHLFDWSKTDLRTTNNKISG--LVEKYRPDAIIIDANNTGARTCDYLEM--LGYHVYRV 397 +++L + +I + + +P+AI ID+ GA T D L LG Sbjct: 271 VKNLEGFRIASTTELAREIYRRYEMSEKKPEAIFIDSVGVGAGTFDRLCEFGLGAICREA 330 Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIES 457 +A + N+R E++ + + ++ H L + L+ ++ L + Sbjct: 331 KASYKATNEAKFANKRAEMYFALKEKFHLLTMNAHEKLKKQLQMIEFQYDRKERYLILPK 390 Query: 458 K--RVKGAKSTDYSDGLMYTFAENPPRSDMDFGRCPSY 493 + + S DY+D L TF ++ + + Y Sbjct: 391 DELKKEYGTSPDYADALALTFFDDVMSARRTEEKRQRY 428 >gi|153806881|ref|ZP_01959549.1| hypothetical protein BACCAC_01156 [Bacteroides caccae ATCC 43185] gi|149131558|gb|EDM22764.1| hypothetical protein BACCAC_01156 [Bacteroides caccae ATCC 43185] Length = 513 Score = 229 bits (584), Expect = 8e-58, Method: Composition-based stats. Identities = 85/492 (17%), Positives = 150/492 (30%), Gaps = 92/492 (18%) Query: 56 LEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP--------- 106 + + ++ A+++G GK + A L M P Sbjct: 27 RDALCARLDREQQAIIESVQHNPMTAVASGTARGKDFVAACASLCFMYLTPRFNEKGVLV 86 Query: 107 -GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGID 165 V A + Q+K + E+ + + K F ++ + D Sbjct: 87 GNTKVAMTAPTGRQVKNIMTPEIRRLIRAARTKFPFCCP----------GRLVADDIRTD 136 Query: 166 SKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWI 225 + + + +++ G H M +I EASG +++ I G L N + Sbjct: 137 YEEWFLTGFKADDNATESWSGFHAANTMFVIT-EASGISEIVYNAIEGNL---QGNSRML 192 Query: 226 MTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHE------------------ 267 + NP +G + +F++ + E + Sbjct: 193 IVFNPNITTGYAARAMKSDR--FAKFRLSSLNAENVVKKQIVIPGQVDYEWVKDKVINWC 250 Query: 268 ---------------GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCP 312 + +D+ RV+V G FP+ D IP IE A Sbjct: 251 SPIQQTDFNEGEGDFNWEGKLYRPNDLFRVKVLGMFPKVSEDVLIPYEWIEIANRNWQEL 310 Query: 313 D-----PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL---V 364 P +G D+A G DN+V+ R G + FD ++ R + + G+ Sbjct: 311 QASGFIPAKSCKLGVDVAGMGRDNSVLCPRYGNYV-PQFDVHQSAGRADHMHVVGMTIPY 369 Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLE------------FCRNR 412 K + ID GA L E N Sbjct: 370 LKKKGAKAFIDTIGEGAGVYSRLLEE-----EFTNAFSCKYSEGTDGLHDITGEYEFANM 424 Query: 413 RTELHVKMADWL----EFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKS 465 R L+ + DWL F + + + + + + G++ IE K + + +S Sbjct: 425 RAYLYWALRDWLNPKNGFGAALPPCDQLMEEATETKWKFLSNGKVIIEPKEDVKKRIKRS 484 Query: 466 TDYSDGLMYTFA 477 DY D L TF Sbjct: 485 PDYMDALANTFY 496 >gi|282598783|ref|YP_003359102.1| putative large subunit terminase [Clavibacter phage CMP1] gi|262212571|gb|ACY35907.1| putative large subunit terminase [Clavibacter phage CMP1] Length = 872 Score = 229 bits (584), Expect = 8e-58, Method: Composition-based stats. Identities = 88/428 (20%), Positives = 150/428 (35%), Gaps = 48/428 (11%) Query: 91 TTLNAWLVLWLMSTRP--GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 T L LV W +S P SV+ A Q+ ++ + +L + + + Sbjct: 424 TRLAGDLVTWFVSVFPPEETSVMVSAPIREQIDVMMFRYLRDNYNLAIERE--QPLIGEI 481 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVIN 208 P++ + R +F G H+ + +A++ DEA G P+ + Sbjct: 482 TKWPYWQVGAPLDKKLVMPK-----RPADGNLISSFQGIHDGH-VAVVLDEAGGLPEDLY 535 Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN--KPLDDWKRFQIDTRTVEGIDPSFH 266 +G T +A + NP + + F+E F + W RF I Sbjct: 536 IGANAVTTNFHARI--LAIGNPDKRNTPFHERFTDTEKFSSWNRFTIGAEDTPNFTGEKI 593 Query: 267 EG------------------IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308 + R V +V G FP+ D +F ++I + Sbjct: 594 YEDPAKDEDVKKHLVQVSWAVEMRKSARPSVVAAKVDGNFPESDDTTFFDQSVINRGYST 653 Query: 309 EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDL---RTTNNKISGLVE 365 E P+ MG DI+ +G D +V + G I +W++ D + +I Sbjct: 654 EIEPESTDFKYMGVDISYQGEDQSVAYINHGGQIRIADEWNRFDGAEHIESAIRIHNKAC 713 Query: 366 KYRPDAIIIDANNTGARTCDYLEML------GYHVYRVLGQKRAVDLEFCRNRRTELHVK 419 + + ID TGA L+ML Y + V G R + N R + + Sbjct: 714 QEGVQEVRIDMAGTGAGVYSNLKMLDQFKDKPYVLIGVNGANRTPNSNRWLNARAWHYDQ 773 Query: 420 MADWLEFASL---INHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLM 473 L + I L + ++ L+ N G+L I K R G S D+ D + Sbjct: 774 FRTGLITGKIDITITDVDLKKEME-LQPSTFTNRGQLQITRKDDMRKMGISSPDHLDAAI 832 Query: 474 YTFAENPP 481 Y+ + P Sbjct: 833 YSAIDTTP 840 >gi|303257560|ref|ZP_07343572.1| putative terminase B protein [Burkholderiales bacterium 1_1_47] gi|302859530|gb|EFL82609.1| putative terminase B protein [Burkholderiales bacterium 1_1_47] Length = 330 Score = 228 bits (582), Expect = 1e-57, Method: Composition-based stats. Identities = 72/301 (23%), Positives = 118/301 (39%), Gaps = 17/301 (5%) Query: 195 IINDEASGTPDVIN-LGILGFLTERNANRFWIMTSNPRR--LSGKFYE----IFNKPLDD 247 ++ DE + + I L +R + P+ L + Y+ + +K D Sbjct: 6 VVIDEVAQIKPTLWGEVIRPALADRKGWAAF--IGTPKGINLFSQLYDQALNLMSKGDPD 63 Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 W ID + + + + R E F + IP++ I A N Sbjct: 64 WIAMLYSVEQTHVIDEKELAAL--KVEMSENEFRQEFLCDFSAAQDNGLIPIDDIRAAAN 121 Query: 308 REPCPDPY--APLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365 + Y APLI G D+A G D +V+ RRG V K D ++I+ + Sbjct: 122 KFYRESEYMGAPLIYGIDVARFGSDASVIFKRRGLVAFEPIVIRKFDNMALADRIAVEMA 181 Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE 425 K +PDA+ ID+ G D L + + V V +A+D E NRR E+ MA W++ Sbjct: 182 KEKPDAVFIDS-GAGQGVIDRLRQMRFDVVEVPFGAQAIDKEQFANRRMEMWWHMAQWIK 240 Query: 426 FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPR 482 I ++Q ++ G +E+K + + +S D +D L TFA Sbjct: 241 QGGAIPPDPVLQGDLGAPTYGYTPKGPKILEAKDKLKERIGRSPDLADALALTFAAPVAP 300 Query: 483 S 483 Sbjct: 301 K 301 >gi|282880015|ref|ZP_06288737.1| hypothetical protein HMPREF9019_0946 [Prevotella timonensis CRIS 5C-B1] gi|281306129|gb|EFA98167.1| hypothetical protein HMPREF9019_0946 [Prevotella timonensis CRIS 5C-B1] Length = 459 Score = 225 bits (573), Expect = 1e-56, Method: Composition-based stats. Identities = 81/466 (17%), Positives = 156/466 (33%), Gaps = 87/466 (18%) Query: 80 GAISAGRGIGKTTLNAWLVLWLMSTRP----------GISVICLANSETQLKTTLWAEVS 129 A+++G GK + A + M P + A + Q + EV+ Sbjct: 2 VAVASGTSRGKDFVAACAAMCFMYLTPRWNINHRLIQNTKIAMTAPTGRQCINIMIPEVA 61 Query: 130 KWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189 + +L + ++ + S++ + + G H Sbjct: 62 RLFRNASVLP---------------GRMLSDGIRTNNAEWFLTAFKASDDNTEAWSGFHA 106 Query: 190 TYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWK 249 M ++ EASG + I G L N ++ NP +G + +K Sbjct: 107 VNTMFVVT-EASGVSETTFNAIEGNL---QGNSRLLLVFNPNVTTGYAAKAMKSSR--FK 160 Query: 250 RFQIDTRTVEGI-----------DPSFHEGIIARY----------------------GLD 276 +F++++ E + D + + + + Sbjct: 161 KFRLNSLNAENVIKKKNVIPGQVDYEWVKDKVHNWCELIQKEDFNNGEGDFMFEDSFYRP 220 Query: 277 SDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL-----IMGCDIAEEGGDN 331 +D+ R++V G FP+ D+ IP +E A +R + + +G D+A G D+ Sbjct: 221 NDLFRIKVLGLFPKASEDTLIPFEWLELAHDRWKKLNAEDFVPRKYARVGIDVAGMGRDS 280 Query: 332 TVVVLRRGPVIEHLFDWS---KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 + VLR G + + K D + + + + ++ID GA L Sbjct: 281 SCFVLRYGNYVPEIKIHQSGGKADHMKVAGEAVQWLVE-KNTKVMIDTIGEGAGVYSRLL 339 Query: 389 MLGY-HVYRVLGQKRAVDLE------FCRNRRTELHVKMADWL----EFASLINHSGLIQ 437 LGY + Y + L N R + + DWL F + + Sbjct: 340 ELGYDNAYSCKFSEGTKGLHDITGQYEFANMRAYCYWAVRDWLNPKNGFNPALPPCDELD 399 Query: 438 NLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480 + + ++G + IE K + + +S D +D L+ TF N Sbjct: 400 AELTEVHWSFQSSGSIIIEPKENIKSRLKRSPDRADALISTFYPNT 445 >gi|212703250|ref|ZP_03311378.1| hypothetical protein DESPIG_01292 [Desulfovibrio piger ATCC 29098] gi|212673294|gb|EEB33777.1| hypothetical protein DESPIG_01292 [Desulfovibrio piger ATCC 29098] Length = 330 Score = 225 bits (573), Expect = 1e-56, Method: Composition-based stats. Identities = 64/301 (21%), Positives = 116/301 (38%), Gaps = 23/301 (7%) Query: 197 NDEASGTPDVIN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD-------W 248 DE + + + L +R + + P+ + F E++ + + W Sbjct: 1 MDEVAQMKPEVWGEVVQPALADRRGSA--VFIGTPKG-ANLFAELYQRGMAAQAQGDAAW 57 Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308 + + + E + L + R E+ F D IPL + EA R Sbjct: 58 CALSYPVTSTDVLPAEDVERLRRE--LSDNAFRQEMLCDFTASSDDILIPLPDVLEAEAR 115 Query: 309 EPCPDPYA--PLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEK 366 + D P+I+G D+A G D++V+V R+G ++ D ++++ + + Sbjct: 116 QLAWDDVGGMPVILGVDVARFGADSSVIVRRQGLKVDGPVVMRGLDNMQLADRVAAAIME 175 Query: 367 YRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF 426 RP A+ IDA G D L LG+ V V + + NRR+E+ + WL+ Sbjct: 176 NRPHAVFIDA-GQGQGVIDRLRQLGHEVIEVPFGGKPLQEGRFANRRSEMWYGLRQWLKS 234 Query: 427 ASLINHSG----LIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479 + G ++ S + G + +E K + + S D +D L TFA Sbjct: 235 GGKLPDEGDDVPRLRAELSAPLYWYDAAGRMVLEPKDKIKERLGASPDIADALALTFAAP 294 Query: 480 P 480 Sbjct: 295 V 295 >gi|320103661|ref|YP_004179252.1| hypothetical protein Isop_2123 [Isosphaera pallida ATCC 43644] gi|319750943|gb|ADV62703.1| hypothetical protein Isop_2123 [Isosphaera pallida ATCC 43644] Length = 553 Score = 220 bits (560), Expect = 4e-55, Method: Composition-based stats. Identities = 81/407 (19%), Positives = 133/407 (32%), Gaps = 49/407 (12%) Query: 49 SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108 P W ++ G +GK+ L A L LW + T PG Sbjct: 45 GRPDYW----------EGQRRAALALTRARSVVVATGNAVGKSYLAAGLTLWWLYTHPGS 94 Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168 V+ A S+ L T L+ E+ K L+ + + + + L G Sbjct: 95 LVVATAPSQGLLGTVLFRELQKALA-ASRRRGLGLPGMVVGSDRGTPFSLRVGPGRRLAA 153 Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTS 228 C + + G H+ M ++ DEASG LT N + ++ Sbjct: 154 EGWGCLGIATRGVERLAGRHHADLM-VVVDEASGVQPEAWE----ALTSLNPRKLFV-CG 207 Query: 229 NPRRLSGKFYEIFNKPLDDWK-----------RFQIDTRTVEGI----------DPSFHE 267 NP F+++ + L + I + I D F Sbjct: 208 NPLTPGTVFHKLHQRGLTEASDPSIPDHARGVALTIPSTASPDINLERSPRGLADRGFIR 267 Query: 268 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCP---DPYAPLIMGCDI 324 ++G S + V G FP + + I +++A + E +P ++GCD+ Sbjct: 268 EAERQWGRGSPLWLSHVEGVFPTVAVHALIEPGWLDQAASLERSQTYENPPGQPVLGCDL 327 Query: 325 AEE-GGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY--RPDAIIIDANNTGA 381 A G D T +V+R I L + I+ L K+ P+ I+ D GA Sbjct: 328 AAGVGADRTAIVVRDEGGIRELIASDRLAPDEAATLIASLARKHLIAPERILYDGAGLGA 387 Query: 382 RTCDYLEMLG---YHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE 425 L G H + G A N R ++ L+ Sbjct: 388 ELTTRLARQGPGFVHARAIFGA--ASGGAGFLNHRAWCGWRLRQRLD 432 Score = 42.4 bits (98), Expect = 0.17, Method: Composition-based stats. Identities = 15/48 (31%), Positives = 28/48 (58%), Gaps = 5/48 (10%) Query: 435 LIQNLKSLKSFIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAEN 479 L + L++L+ +V +LA+E KR + +S D +D L+ TF+ + Sbjct: 508 LREELEALRYRLVGT--KLALEDKRETRRRLGRSPDLADALLITFSVD 553 >gi|186682890|ref|YP_001866086.1| hypothetical protein Npun_R2589 [Nostoc punctiforme PCC 73102] gi|186465342|gb|ACC81143.1| hypothetical protein Npun_R2589 [Nostoc punctiforme PCC 73102] Length = 543 Score = 216 bits (551), Expect = 5e-54, Method: Composition-based stats. Identities = 98/512 (19%), Positives = 176/512 (34%), Gaps = 104/512 (20%) Query: 46 EGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTR 105 + P + + + + + + + A G GK+ + + LV++ + Sbjct: 28 QYADDPVGFFKNELGIELTNEQTIIAESVRDRPITNVKAAHGTGKSFIASLLVIYFLFCV 87 Query: 106 PGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGID 165 G+ I A SE Q+K LWAE+ K L K + L +S+ ++ Sbjct: 88 GGV-AITTAPSEDQVKWILWAELRKIHGLHKTKLGGRCDIMQL----LFSETVYA----- 137 Query: 166 SKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWI 225 + R YSE +F G H + I DEA G I+ G + LT ++ + Sbjct: 138 ---FGITSRDYSEN---SFQGQHRQKQLL-IEDEADGITPQIDNGFIACLT--GSDNRGL 188 Query: 226 MTSNPRRLSGKFYEI------------FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARY 273 NP +F + F+ P W +++ V + P E II Sbjct: 189 RIGNPVDPQSQFAKTCKLDKRCLTVSAFSHPNVSW-AYELCADGVYRLKPEVAEHIINED 247 Query: 274 G----------------------------------LDSDVTRVEVCGQFPQQDIDSFIPL 299 G S + V G++ + D I L Sbjct: 248 GEIKPQQEWPPEFPRDRIPGAISIDWIERVRREKFETSAYWKGRVMGEYAEDAADGIILL 307 Query: 300 NIIEEALNREPCPDPYA-------PLIMGCDIAEEGGDNTVVVLRRGPVIEHL-FDWSKT 351 ++++A + Y P +G D+ +GGD + L RGPV+ + +K Sbjct: 308 TLLKQARSLYDQNPQYWDAIAKRYPWRLGLDVG-DGGDPHALALLRGPVLYEVQIHPTKG 366 Query: 352 DLRTT-------NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQ---- 400 DL T ++I L Y +I +D GA T L+ GY Sbjct: 367 DLLDTERAADIAASQIKLLGTGY---SIAVDNTGVGAGTLAKLKKTGYQALPCRFGDVPS 423 Query: 401 -----KRAVDLEFCRNRRTELHVKMADWLEFASL-----INHSGLIQNLKSLKSFIVPNT 450 ++ + N + EL+ + + L + N + Q+L + + + Sbjct: 424 YKKKKQKEEPKQKFTNLKAELYWQFRELLMGGRIAIAPLENEEYVFQDLTATR-YSTNTK 482 Query: 451 GELAIESK---RVKGAKSTDYSDGLMYTFAEN 479 E+ E K + + +S D S+ ++ Sbjct: 483 DEIFCEPKDKTKSRLGRSPD-SEAVIIALTNP 513 >gi|294789575|ref|ZP_06754810.1| putative terminase B protein [Simonsiella muelleri ATCC 29453] gi|294482512|gb|EFG30204.1| putative terminase B protein [Simonsiella muelleri ATCC 29453] Length = 516 Score = 215 bits (548), Expect = 1e-53, Method: Composition-based stats. Identities = 78/450 (17%), Positives = 147/450 (32%), Gaps = 63/450 (14%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRP----------GISVICLANSETQLKTTLWAEV 128 K ++ +G G GKT + LW + P G + A + Q+ +W E+ Sbjct: 49 KVSVVSGTGTGKTMSFGRIALWHLLCFPVAKYDGKIEIGSNTYIGAPAIKQVGDGVWKEI 108 Query: 129 SKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI-----DSKHYSTMCRTYSEERPDT 183 + + + W ++ + + + + + Sbjct: 109 TDAVQAMRAN----------RATAWLAEYIVVQAERVYIIDYKATWFITKFAMQQGQSVS 158 Query: 184 FVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK 243 G H Y + II DEA+G D I G T+ ++ S + G FYE +K Sbjct: 159 IAGKHRFYQL-IIIDEAAGVSDEHYEVINGTQTQGGNRT--LLASQGVKQGGFFYETHHK 215 Query: 244 ----PLDDWKRFQIDTRTVEGIDPSFHEGIIAR-YGLDSDVTRVEVCGQFPQQDIDSFIP 298 +W + + + E + + G ++ RV V G+F + + ++ + Sbjct: 216 LNKENGGNWTALCFSSENSPFVTTEWLENVALQAGGKNTTEYRVRVLGKFAENEHENLLT 275 Query: 299 LNIIEEALNREPCPDPYAP--LIMGCDIAEE--------------GGDNTVVVLRRGPVI 342 IE ++ P + P ++ D+ G D+ RR Sbjct: 276 RAQIEPRIDTLPIIEKGEPFGWLLLVDVGAGEYRDDSVCIAAKVIGDDDFGENARRVEYE 335 Query: 343 EHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKR 402 + + ++ I + I++DA G C LE G+ V R+ Sbjct: 336 ANPIITNTKNIHEFRGLIVEKAAQLSNVRILVDAGGIGLELCKMLENDGFDVERINWGNP 395 Query: 403 AV---DLEFCRNRRTELHVKMADWLEFASLINH-------SGLIQNLKSLKS-FIVPNTG 451 E N+R V+ D + ++ + + F T Sbjct: 396 CFKRAYKERFFNQRACAMVRWRDAIRQGRVLFPKMENGLREKFLMQASRIPYGFTDTGTA 455 Query: 452 ELAIESK---RVKGAKSTDYSDGLMYTFAE 478 I K R +G KS D +D + + F + Sbjct: 456 RYQIAQKAEMRKRGIKSPDIADAMSFAFLD 485 >gi|315649222|ref|ZP_07902312.1| hypothetical protein PVOR_28644 [Paenibacillus vortex V453] gi|315275441|gb|EFU38799.1| hypothetical protein PVOR_28644 [Paenibacillus vortex V453] Length = 189 Score = 211 bits (538), Expect = 2e-52, Method: Composition-based stats. Identities = 65/225 (28%), Positives = 93/225 (41%), Gaps = 45/225 (20%) Query: 13 QKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNN 72 L DL W D + +F+ ++ F P WQ + M V Sbjct: 9 TDLLDLYWDDPV--AFAEDMMGF--------------DPDDWQCDVMMDVT--------- 43 Query: 73 PNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWL 132 + + ++ +G+G+GKT L A LV+W + RP V+C A ++ QL LW EVSKWL Sbjct: 44 ---QFPRTSVRSGQGVGKTGLEAALVIWFLCCRPNPKVVCTAPTKQQLHDVLWTEVSKWL 100 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 K+ + ++ + + RT +P+ G H Y Sbjct: 101 ENSMVKNLLKWTKTKVYMIG------------HEQRWFATARTA--NKPENMQGFHEDY- 145 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 M I DEASG D I ILG L+ A +M NP R SG F Sbjct: 146 MLFIVDEASGVSDPIMEAILGTLS--GAENKLLMCGNPTRTSGVF 188 >gi|119386463|ref|YP_917518.1| PBSX family phage terminase large subunit [Paracoccus denitrificans PD1222] gi|119377058|gb|ABL71822.1| phage terminase, large subunit, PBSX family [Paracoccus denitrificans PD1222] Length = 441 Score = 208 bits (529), Expect = 2e-51, Method: Composition-based stats. Identities = 88/424 (20%), Positives = 153/424 (36%), Gaps = 30/424 (7%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GK+ A ++ T PG+S ICL + + L +++ + + + L Sbjct: 26 GGRGSGKSWDRAMHMIVRHLTEPGLSSICLRDVQKSLDQSVFKLLVETAARLGVAEAIR- 84 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 P SD + + G ++ M ++ E + G +EA+ Sbjct: 85 --------PVESDRIIRTPGNGIIAFNGMNE-FNAENIKSLEGFD-----IAWWEEAATA 130 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI---DTRTVEG 260 + L + + ++ T NPR S + + + + R Sbjct: 131 GQGPLDMLRPTLRKPGSQIWF--TYNPRLRSDPVDVMMRQDARFADSRTVVEANWRDNPF 188 Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIM 320 P E + D R G + + FI ++ EA+ R+P L++ Sbjct: 189 RGPELEEERLLDLAGDEARYRHIWEGDYEAESDMQFIGGGLVREAMARQPFSQIGDELVL 248 Query: 321 GCDIAEEGGDNTVVVLRRGPVI--EHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378 G D+A G D +V+ RRG E D ++ +++ PD + ID Sbjct: 249 GVDVARFGDDRSVIWARRGRDAQTELPIIMKGADTMAVAARVMAEIDRLHPDGVFIDEGG 308 Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAV----DLEFCRNRRTELHVKMADWLEFASLINHSG 434 G D +GY V V +A + CRN+R ++ M +WL I S Sbjct: 309 VGGGVIDRCRQMGYSVVGVNFGGKADRAIEGVPKCRNKRAQMWATMREWLRSGGCIPDSR 368 Query: 435 LIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN-PPRSDMDFGRC 490 ++ + + + IE K + +G S D +D L TFA PRS Sbjct: 369 DLEMDLTGPLYSFDVNNAIEIEKKSDMKKRGVSSPDEADALALTFAYPVVPRSIQRQQEA 428 Query: 491 PSYQ 494 + + Sbjct: 429 RAQE 432 >gi|284162607|ref|YP_003401230.1| hypothetical protein Arcpr_1511 [Archaeoglobus profundus DSM 5631] gi|284012604|gb|ADB58557.1| protein of unknown function DUF264 [Archaeoglobus profundus DSM 5631] Length = 435 Score = 205 bits (522), Expect = 1e-50, Method: Composition-based stats. Identities = 91/449 (20%), Positives = 162/449 (36%), Gaps = 68/449 (15%) Query: 49 SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108 S P ++ F++ + + + AGR GKT A ++ T PG Sbjct: 13 SDPVTFAKVFLDWGAHPAQAQILRDRHQF--ITVVAGRRFGKTECMAVSAIYYALTNPGS 70 Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168 +A S Q ++ ++ ++LS ++ P++ H DS Sbjct: 71 IQFVIAPSYDQ-SNIMFGQIVQFLSKSI----LGCMIRRIYKTPFH----HIIFKNDS-- 119 Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDV-INLGILGFLTERNANRFWIMT 227 + S +P+ GH II DEA+ PD I+ I L + N + WI Sbjct: 120 ---VIHARSASKPEFLRGHKA---HRIILDEAAFIPDDVISNIIEPMLADYNGS--WIKI 171 Query: 228 SNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283 P + FY+ + K D+ ++ + I F E YG +S + R E Sbjct: 172 GTPFGKN-HFYDTYLKGQSPDFPDYSSYRFPSTVNPHISHEFIEKKKREYGENSIIFRTE 230 Query: 284 VCGQFPQQ------------DIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN 331 +F + ++D+ I L E ++++ ++GCD+A+ Sbjct: 231 YLAEFVEDQNAVFRWADIQKNVDNSIELIDSAENVSKQ--------YVIGCDLAKYQDYT 282 Query: 332 TVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP---DAIIIDANNTGARTCDYLE 388 +VVL L + + + R I L E YR ++ID+ G + L+ Sbjct: 283 VIVVLDVTEKPYKLVHFERFNRRPYAEVIMRLKELYRRFNYAKVLIDSTGVGDPVLEDLQ 342 Query: 389 MLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL--INHSGLIQNLKSLKSFI 446 +G Y V K V L ++ LE + L++ L+ + + Sbjct: 343 DVGAEGY-VFTPKSKVQLIQ----------RLQAALENGEIRYPYIEELVKELQFFE-YQ 390 Query: 447 VPNTGELAIESKRVKGAKSTDYSDGLMYT 475 + TG + +E + DY L Sbjct: 391 LTRTG-IKME---ARQGFHDDYVIALALA 415 >gi|168704975|ref|ZP_02737252.1| hypothetical protein GobsU_35915 [Gemmata obscuriglobus UQM 2246] Length = 519 Score = 186 bits (473), Expect = 5e-45, Method: Composition-based stats. Identities = 84/507 (16%), Positives = 153/507 (30%), Gaps = 94/507 (18%) Query: 46 EGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEV-FKGAISAGRGIGKTTLNAWLVLWLMST 104 + + P + + ++V + + ++ + A +GK+ L LV W T Sbjct: 29 KYRTDPAGYARDILKVKWWAKQVEIAEALCKPPYRVLVKASHSVGKSHLAGGLVNWWYDT 88 Query: 105 RPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI 164 R + A ++ Q+K LW EV + P +M L P + Sbjct: 89 RFPGVCLTTAPTDRQVKDVLWKEVRRQRRKRPGFVGPKMPRLESDPTHF----------- 137 Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFW 224 ++ +F G H + +I DEA G + A W Sbjct: 138 --------AHGFTARDATSFQGQHEA-SILLIFDEAVGIDGDFWEAAESMC--QGAEYGW 186 Query: 225 IMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGID-------------------PSF 265 + NP + + Y + + W I I Sbjct: 187 LAIFNPTDTTSRAY-LEEQAGSRWTVIDIPATEHPNIAAELVARPPEYPSAVRLNWLRDR 245 Query: 266 HEGIIAR---------------------YGLDSDVTRVEVCGQFPQQDIDSFIPLNI--I 302 E R + + + ++P + + Sbjct: 246 LEQWAERIEPGDATPTDIQFPNPDGSPQWWRPGPLADARLLARWPASGCGVWSDPVWRSV 305 Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 E A +P P+ + P +GCD+A G D T + +R G V H + D + T ++ Sbjct: 306 ERAAP-DPVPERWLP-QIGCDVARFGEDWTELHVRCGNVSLHHEAHNGWDTKRTTERLKQ 363 Query: 363 LVEKYRPDAIIIDANNT---------------GARTCDYLEMLGYHVYRVLGQKRAVDLE 407 + ++ A + G + G++ V A D E Sbjct: 364 MCGEWAQWATQLRDRGADPIDPRRIPVKVDDDGVGGGVTDQRGGFNFQAVSSASNANDKE 423 Query: 408 FCRNRRTELHVKMADWLEFAS-----LINH--SGLIQNLKSLKSFIVPNTGELAIESK-- 458 NRR+EL +AD + L H L + ++ + G +E K Sbjct: 424 AYPNRRSELWFTVADRAKRGELFLSNLPAHVRQELKRQ-AMAPTYKLDAAGRRVVEPKED 482 Query: 459 -RVKGAKSTDYSDGLMYTFAENPPRSD 484 + + +S D D + + E R Sbjct: 483 TKERIGRSPDGMDAVNLAYYEPSGRGG 509 >gi|320091491|gb|ADW08983.1| terminase-like protein [Clavibacter phage CN77] Length = 414 Score = 184 bits (468), Expect = 2e-44, Method: Composition-based stats. Identities = 73/393 (18%), Positives = 137/393 (34%), Gaps = 60/393 (15%) Query: 157 VLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLT 216 G ++ + R ++ TF G + DEA G P + G +T Sbjct: 11 KYKKMDGSGNEAIAFGKRPTDQDIVSTFQGT-RKLRTFVALDEAGGVPPELFTGAEAVMT 69 Query: 217 ERNANRFWIMTSNPRRLSGKFYEIFNKP--LDDWKRFQIDTRTVEGIDPS---------- 264 +++ + NP +F+ IF P +D+W F I + + Sbjct: 70 GQDSKI--VAIGNPDSRGTEFHRIFTVPALMDEWNTFTISAYDLPTVTGEVVYPDHPEKQ 127 Query: 265 -------------FHEGIIARYGLDSD-VTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 H+ + + G D +V G+FP + ++F P I+ N Sbjct: 128 ERMLKGLTSLDWIQHKERVWKVGGKPDGRFLAKVLGEFPGETDNAFFPQEAIDRG-NDTT 186 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD----------------WSKTDLR 354 P +IMG D+A G D++VV +G + WSK + Sbjct: 187 IDKPEKGIIMGVDLARMGDDDSVVYTNQGGRVRLFKGQVRYSDREGTKTTTGVWSKENTV 246 Query: 355 TTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG------YHVYRVLGQKRAVDLEF 408 + ++ + + + +D++ G D LE L Y + + + + Sbjct: 247 ASARRVHAIAMQIGAKQVRLDSSGIGGAVFDELEQLEEFDGKCYTLVGINNANSSSNNMR 306 Query: 409 CRNRRTELHVKMADWLEFA--SLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAK-- 464 N R E H + D L L ++++ + ++ + G + I K ++ Sbjct: 307 WANIRAENHDNLRDMLIKGYLDLDPEDTMLRDELLVITYKLNLRGAVQITPKDEMKSELN 366 Query: 465 -STDYSDGLMYTFAENPPRSDMDFGRCPSYQYE 496 S D D ++Y+ A+ D G P + E Sbjct: 367 GSPDRLDAVIYSLADLDHIVD---GPQPGERIE 396 >gi|315122636|ref|YP_004063125.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496038|gb|ADR52637.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 301 Score = 170 bits (430), Expect = 5e-40, Method: Composition-based stats. Identities = 61/170 (35%), Positives = 90/170 (52%), Gaps = 8/170 (4%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQ----L 56 M+ N E + L + S I + F + + WGE+GTPL PR+WQ L Sbjct: 1 MNATFQPNIEYDTALLQNVLSPAIAGNPLAFTKYMYRWGEEGTPLANCKGPRAWQTEVFL 60 Query: 57 EFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANS 116 E E ++ + +VFK AI++ RGIGKT L AW+ W +STR G +V+ ANS Sbjct: 61 ELAEFIEKNKEAKRLGKPLQVFKLAIASARGIGKTALVAWITYWFLSTRIGCTVVISANS 120 Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQS----LSLHPAPWYSDVLHCSL 162 + Q KTT +AE+ +W SL N H+FE L+ +PW ++ + +L Sbjct: 121 DDQCKTTSFAEIRRWHSLAKNAHFFEANIAEALLAGGCSPWQAEPVAKTL 170 >gi|261381054|ref|ZP_05985627.1| phage terminase, large subunit, PBSX family [Neisseria subflava NJ9703] gi|284796087|gb|EFC51434.1| phage terminase, large subunit, PBSX family [Neisseria subflava NJ9703] Length = 450 Score = 161 bits (408), Expect = 2e-37, Method: Composition-based stats. Identities = 57/320 (17%), Positives = 116/320 (36%), Gaps = 40/320 (12%) Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQID 254 +EA D ++ + + + + T NP+ + Y+ F P DD ++ Sbjct: 117 WIEEAENVSDESWNILIPTIRKAGSEIWL--TWNPKNILDPTYQRFVVNPPDDMVDIVVN 174 Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC--P 312 + + D D+ R G+ S I I+ A++ Sbjct: 175 YTDNIYLPEVLRLEAESCKARDYDLYRHIWLGEPVADSELSVIKPKWIDAAIDSHIKLGF 234 Query: 313 DPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI 372 + I+G D+A+EG D + +LR G V+ + +W D+ + +K+ ++ + D I Sbjct: 235 EATGQRILGFDVADEGDDASATILRHGSVVIDMDEWRGQDVIYSADKVYLYGQEAKADKI 294 Query: 373 IIDANNTGART-------CDYLEMLGYHVYRVLGQKRA------VDLEFCRNRRTELHVK 419 + D+ GA ++ +G++ + + A + + N + + Sbjct: 295 VYDSIGVGAGVKAQFRRKTGKVQTIGFNAGGSVFKPEARYTDDKKNKDMFSNIKAQAWWM 354 Query: 420 MAD-------WLEFASLINHSGLI------------QNLKSLKSFIVPNTGELAIESKR- 459 + + +EF LI + S N G + +ESK+ Sbjct: 355 VRERFYKTWRAIEFGDTYPIDELISISGSLKDLEYLKAELSRPRVDYDNNGRVKVESKKD 414 Query: 460 --VKGAKSTDYSDGLMYTFA 477 +G S + +D L+ FA Sbjct: 415 MAKRGIPSPNRADALIMAFA 434 >gi|329122215|ref|ZP_08250807.1| phage terminase large subunit [Haemophilus aegyptius ATCC 11116] gi|327474100|gb|EGF19511.1| phage terminase large subunit [Haemophilus aegyptius ATCC 11116] Length = 452 Score = 160 bits (404), Expect = 7e-37, Method: Composition-based stats. Identities = 65/440 (14%), Positives = 143/440 (32%), Gaps = 63/440 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GK+ A ++ T+P I V+C E+ K +S + + Sbjct: 27 GGRGSGKSFSIARALVLRAYTQP-IRVLCC------------REIQKSISDSVIQMLAD- 72 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 Q L ++ +G + ++ + + G + +E Sbjct: 73 QIEMLGLQAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGID 262 ++ + E + I++ NP+ + Y+ F P + K ++ + Sbjct: 128 SKESWDVLIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185 Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIM 320 E + D ++ R G+ I I+ A++ ++ I+ Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGEPVADSDKVIIKPLWIDAAVDAHKKLGFVAAGRKII 245 Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380 G D+A+EG D G V+ + +W D+ + ++ ++ + I+ D+ G Sbjct: 246 GFDVADEGSDANANAFVHGSVVLRMDEWRGEDVIGSADRTRLNALEFGANEIVYDSIGVG 305 Query: 381 ART---CDYLEMLGYHVYRVLGQK-----------RAVDLEFCRNRRTELHVKMA----- 421 A L+ + + + N + + ++ Sbjct: 306 AGVKAHYHRLDDKSIRINGFNAGGAVFEPDVEYVYGKTNRDMFANIKAQAWWRLRDRFYK 365 Query: 422 --------------DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAK 464 + + +S I ++ + G + +ESK + +G Sbjct: 366 TYRAITYEEQYPVDEMISLSSDIRDLEYLKAELARPYVDYDGNGRVKVESKKDMKKRGIP 425 Query: 465 STDYSDGLMYTFAENPPRSD 484 S + +D L+ FA P+ D Sbjct: 426 SPNKADALVMCFA---PKED 442 >gi|229844502|ref|ZP_04464642.1| predicted phage terminase large subunit [Haemophilus influenzae 6P18H1] gi|229812751|gb|EEP48440.1| predicted phage terminase large subunit [Haemophilus influenzae 6P18H1] Length = 452 Score = 158 bits (400), Expect = 2e-36, Method: Composition-based stats. Identities = 65/440 (14%), Positives = 144/440 (32%), Gaps = 63/440 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GK+ A ++ T+P I V+C E+ K +S + + Sbjct: 27 GGRGSGKSFSIARALVLRAYTQP-IRVLCC------------REIQKSISDSVIQMLAD- 72 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 Q L ++ +G + ++ + + G + +E Sbjct: 73 QIEMLGLQAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGID 262 ++ + E + I++ NP+ + Y+ F P + K ++ + Sbjct: 128 SKESWDVLIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185 Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIM 320 E ++ D ++ R G+ I I+ A++ ++ I+ Sbjct: 186 KELMEDMVQMRERDYELYRHVYEGEPVADSDKVIIKPLWIDAAVDAHKKLGFVAAGRKII 245 Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380 G D+A+EG D G V+ + +W D+ + ++ ++ + I+ D+ G Sbjct: 246 GFDVADEGSDANANAFVHGSVVLRMDEWHGEDVIGSADRTRLNALEFGTNEIVYDSIGVG 305 Query: 381 ART---CDYLEMLGYHVYRVLGQK-----------RAVDLEFCRNRRTELHVKMA----- 421 A L+ + + + N + + ++ Sbjct: 306 AGVKAHYHRLDDKSIRINGFNAGGAVFEPDAEYVYGKTNRDMFANIKAQAWWRLRDRFYK 365 Query: 422 --------------DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAK 464 + + +S I ++ + G + +ESK + +G Sbjct: 366 TYRAITYEEQYPVDEMISLSSDIRDLEYLKAELARPYVDYDGNGRVKVESKKDMKKRGIP 425 Query: 465 STDYSDGLMYTFAENPPRSD 484 S + +D L+ FA P+ D Sbjct: 426 SPNKADALVMCFA---PKED 442 >gi|260580755|ref|ZP_05848581.1| phage terminase large subunit [Haemophilus influenzae RdAW] gi|260092572|gb|EEW76509.1| phage terminase large subunit [Haemophilus influenzae RdAW] Length = 447 Score = 153 bits (386), Expect = 7e-35, Method: Composition-based stats. Identities = 72/442 (16%), Positives = 145/442 (32%), Gaps = 59/442 (13%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GK+ A ++ P + V+C E+ K +S + + Sbjct: 27 GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQMLAD- 72 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 Q L ++ +G + ++ + + G + +E Sbjct: 73 QIEMLSLQAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGID 262 ++ + E + I++ NP+ + Y+ F P + K ++ + Sbjct: 128 SKESWDILIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185 Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIM 320 E + D ++ R G+ + I IE A++ + + Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGEPVADSDLAIIKPVWIEYAVDAHLKLGFTAKGMKKV 245 Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380 G D+A+EG D+ G V+ + W D+ + N+ + K++ D II D+ G Sbjct: 246 GFDVADEGADSNANAFVHGSVVLDIEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGVG 305 Query: 381 ARTCDYLEMLG--YHVYRVLGQKRAVDLE-----------FCRNRRTELHVKMAD----- 422 A + + L V E N + + + D Sbjct: 306 AGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRDRFYKT 365 Query: 423 --WLEFASLINHSGLI------------QNLKSLKSFIVPNTGELAIESK---RVKGAKS 465 +++ + LI + S N G + +ESK + +G S Sbjct: 366 YRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIPS 425 Query: 466 TDYSDGLMYTFAENPPRSDMDF 487 + +D L+ +A P+S +D Sbjct: 426 PNMADALVMCYAPTKPKSLLDL 447 >gi|319776448|ref|YP_004138936.1| phage terminase large subunit [Haemophilus influenzae F3047] gi|319897217|ref|YP_004135412.1| phage terminase large subunit [Haemophilus influenzae F3031] gi|329123931|ref|ZP_08252483.1| phage terminase large subunit [Haemophilus aegyptius ATCC 11116] gi|317432721|emb|CBY81084.1| predicted phage terminase large subunit [Haemophilus influenzae F3031] gi|317451039|emb|CBY87270.1| predicted phage terminase large subunit [Haemophilus influenzae F3047] gi|327468126|gb|EGF13613.1| phage terminase large subunit [Haemophilus aegyptius ATCC 11116] Length = 447 Score = 152 bits (383), Expect = 1e-34, Method: Composition-based stats. Identities = 72/442 (16%), Positives = 144/442 (32%), Gaps = 59/442 (13%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GK+ A ++ P + V+C E+ K +S + + Sbjct: 27 GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQMLAD- 72 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 Q L ++ +G + ++ + + G + +E Sbjct: 73 QIEMLGLQAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGID 262 ++ + E + I++ NP+ + Y+ F P + K ++ + Sbjct: 128 SKESWDILIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185 Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIM 320 E + D ++ R G+ + I IE A++ + + Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGEPVADSDLAIIKPVWIESAVDAHLKLGFTTKGMKKV 245 Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380 G D+A+EG D G V+ + W D+ + N+ + K++ D II D+ G Sbjct: 246 GFDVADEGADANANAFVHGSVVLGVEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGVG 305 Query: 381 ARTCDYLEMLG--YHVYRVLGQKRAVDLE-----------FCRNRRTELHVKMAD----- 422 A + + L V E N + + + D Sbjct: 306 AGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKDKKNQDMFSNIKAQSWWALRDRFYKT 365 Query: 423 --WLEFASLINHSGLI------------QNLKSLKSFIVPNTGELAIESK---RVKGAKS 465 +++ + LI + S N G + +ESK + +G S Sbjct: 366 YRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIPS 425 Query: 466 TDYSDGLMYTFAENPPRSDMDF 487 + +D L+ +A P+S +D Sbjct: 426 PNMADALVMCYAPTKPKSLLDL 447 >gi|145629503|ref|ZP_01785301.1| predicted phage terminase large subunit [Haemophilus influenzae 22.1-21] gi|145641440|ref|ZP_01797019.1| predicted phage terminase large subunit [Haemophilus influenzae R3021] gi|144978346|gb|EDJ88110.1| predicted phage terminase large subunit [Haemophilus influenzae 22.1-21] gi|145273983|gb|EDK13850.1| predicted phage terminase large subunit [Haemophilus influenzae 22.4-21] gi|309750959|gb|ADO80943.1| Probable bacteriophage terminase, large subunit [Haemophilus influenzae R2866] Length = 447 Score = 151 bits (381), Expect = 2e-34, Method: Composition-based stats. Identities = 72/442 (16%), Positives = 144/442 (32%), Gaps = 59/442 (13%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GK+ A ++ P + V+C E+ K +S + + Sbjct: 27 GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQMLAD- 72 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 Q L ++ +G + ++ + + G + +E Sbjct: 73 QVEMLGLQDFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGID 262 ++ + E + I++ NP+ + Y+ F P + K ++ + Sbjct: 128 SKESWDILIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185 Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIM 320 E + D ++ R G+ + I IE A++ + + Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGEPVADSDLAIIKPVWIESAVDAHLKLGFTTKGMKKV 245 Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380 G D+A+EG D G V+ + W D+ + N+ + K++ D II D+ G Sbjct: 246 GFDVADEGADANANAFVHGSVVLGVEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGVG 305 Query: 381 ARTCDYLEMLG--YHVYRVLGQKRAVDLE-----------FCRNRRTELHVKMAD----- 422 A + + L V E N + + + D Sbjct: 306 AGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKDKKNQDMFSNIKAQSWWALRDRFYKT 365 Query: 423 --WLEFASLINHSGLI------------QNLKSLKSFIVPNTGELAIESK---RVKGAKS 465 +++ + LI + S N G + +ESK + +G S Sbjct: 366 YRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIPS 425 Query: 466 TDYSDGLMYTFAENPPRSDMDF 487 + +D L+ +A P+S +D Sbjct: 426 PNMADALVMCYAPTKPKSLLDL 447 >gi|330958838|gb|EGH59098.1| hypothetical protein PMA4326_09820 [Pseudomonas syringae pv. maculicola str. ES4326] Length = 512 Score = 151 bits (381), Expect = 3e-34, Method: Composition-based stats. Identities = 55/239 (23%), Positives = 89/239 (37%), Gaps = 22/239 (9%) Query: 267 EGIIARYGLDSDVTRVEVC---GQFPQQDIDSF--------IPLNIIEEALNREPC-PDP 314 E + R G S +V ++P +F I + A +E Sbjct: 253 EQMAWRAGKISSDFANDVDFFNQEYPATPDLAFQKVGHKPLIKTVKVSLARKKEIKHERR 312 Query: 315 YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI-I 373 ++G D A GGD + + R+G V + + D + + ++ + + Sbjct: 313 IGAHVVGLDPAR-GGDTSTFIHRQGRVAWGIERNNIPDTMAVVGQAARMLMDDKTIRMMF 371 Query: 374 IDANNTGARTCDYLEMLGY--HVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE---FAS 428 ID GA D L LG+ V V A D N+R E+ +MA+W+ S Sbjct: 372 IDIGGLGAGIYDRLVELGFGDRVTAVNFGSSASDSRKYANKRCEMWGEMAEWIHDDITPS 431 Query: 429 LINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPRSD 484 + + L +L S + G+L + K + K +S D D L TFAE D Sbjct: 432 IPDDDQLHSDLTSAAKDKYTSNGQLKLLPKEDAKKKIGRSPDDGDALALTFAEPVSADD 490 >gi|68250076|ref|YP_249188.1| phage terminase large subunit [Haemophilus influenzae 86-028NP] gi|68058275|gb|AAX88528.1| predicted phage terminase large subunit [Haemophilus influenzae 86-028NP] Length = 447 Score = 149 bits (377), Expect = 8e-34, Method: Composition-based stats. Identities = 72/442 (16%), Positives = 144/442 (32%), Gaps = 59/442 (13%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GK+ A ++ P + V+C E+ K +S + + Sbjct: 27 GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQMLAD- 72 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 Q L ++ +G + ++ + + G + +E Sbjct: 73 QIEMLGLQNFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGID 262 ++ + E + I++ NP+ + Y+ F P + K ++ + Sbjct: 128 SKESWDILIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185 Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIM 320 E + D ++ R G+ + I IE A++ + + Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGEPVADSDLAIIKPVWIECAVDAHLKLGFTAKGMKKV 245 Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380 G D+A+EG D+ G V+ + W D+ + N+ + K++ D II D+ G Sbjct: 246 GFDVADEGADSNDNAFVHGSVVLDIEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGVG 305 Query: 381 ARTCDYLEMLG--YHVYRVLGQKRAVDLE-----------FCRNRRTELHVKMAD----- 422 A + + L V E N + + + D Sbjct: 306 AGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRDRFYKT 365 Query: 423 --WLEFASLINHSGLI------------QNLKSLKSFIVPNTGELAIESK---RVKGAKS 465 ++ + LI + S N G + +ESK + +G S Sbjct: 366 YRAVKHGDVYPDDELISLSSNIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIPS 425 Query: 466 TDYSDGLMYTFAENPPRSDMDF 487 + +D L+ +A P+S +D Sbjct: 426 PNMADALVMCYATTKPKSLLDL 447 >gi|301170180|emb|CBW29784.1| predicted phage terminase large subunit [Haemophilus influenzae 10810] Length = 447 Score = 147 bits (372), Expect = 3e-33, Method: Composition-based stats. Identities = 70/442 (15%), Positives = 141/442 (31%), Gaps = 59/442 (13%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GK+ A ++ P + V+C E+ K +S + + Sbjct: 27 GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQMLADQ 73 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 + + S+ +T + + G + +E Sbjct: 74 VEMLGLQDFFDVQKTQIIEQNGSRFTFAGLKT-NITSIKSMTGID-----VVWVEEGENV 127 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGID 262 ++ + E + I++ NP+ + Y+ F P + K ++ + Sbjct: 128 SKESWDILIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185 Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIM 320 E + D ++ R G+ + I IE A++ + + Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGEPVADSDLAIIKPVWIESAVDAHLKLGFTTKGMKKV 245 Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380 G D+A+EG D+ G V+ + W + + N+ + K++ D II D+ G Sbjct: 246 GFDVADEGADSNANAFVHGSVVLDIEVWKNGYVIDSANRTNQSAVKFKADLIIFDSIGVG 305 Query: 381 ARTCDYLEMLG--YHVYRVLGQKRAVDLE-----------FCRNRRTELHVKMAD----- 422 A + + L V E N + + + D Sbjct: 306 AGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRDRFYKT 365 Query: 423 --WLEFASLINHSGLI------------QNLKSLKSFIVPNTGELAIESK---RVKGAKS 465 +++ + LI + S N G + +ESK + +G S Sbjct: 366 YRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIPS 425 Query: 466 TDYSDGLMYTFAENPPRSDMDF 487 + +D L+ +A P+S +D Sbjct: 426 PNMADALVMCYAPTKPKSLLDL 447 >gi|16273317|ref|NP_439561.1| terminase large subunit-like protein [Haemophilus influenzae Rd KW20] gi|1175785|sp|P44184|Y1410_HAEIN RecName: Full=Uncharacterized protein HI_1410 gi|1574247|gb|AAC23058.1| predicted coding region HI1410 [Haemophilus influenzae Rd KW20] Length = 394 Score = 146 bits (369), Expect = 7e-33, Method: Composition-based stats. Identities = 63/402 (15%), Positives = 133/402 (33%), Gaps = 46/402 (11%) Query: 124 LWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDT 183 ++ E+ K +S + + Q L ++ +G + ++ + + Sbjct: 1 MFREIQKSISDSVIQMLAD-QIEMLSLQAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKS 59 Query: 184 FVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN- 242 G + +E ++ + E + I++ NP+ + Y+ F Sbjct: 60 MTGID-----VVWVEEGENVSKESWDILIPTIREDGSQI--IVSFNPKNILDDTYQRFVI 112 Query: 243 KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P + K ++ + E + D ++ R G+ + I I Sbjct: 113 HPPERCKSVLVNWQDNPYFPKELMEDMEQMRERDYELYRHVYEGEPVADSDLAIIKPVWI 172 Query: 303 EEALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360 E A++ + +G D+A+EG D+ G V+ + W D+ + N+ Sbjct: 173 EYAVDAHLKLGFTAKGMKKVGFDVADEGADSNANAFVHGSVVLDIEVWKNGDVIDSANRT 232 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLE----------- 407 + K++ D II D+ GA + + L V E Sbjct: 233 NQSAVKFKADLIIFDSIGVGAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQD 292 Query: 408 FCRNRRTELHVKMAD-------WLEFASLINHSGLI------------QNLKSLKSFIVP 448 N + + + D +++ + LI + S Sbjct: 293 MFSNIKAQSWWALRDRFYKTYRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYD 352 Query: 449 NTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPRSDMDF 487 N G + +ESK + +G S + +D L+ +A P+S +D Sbjct: 353 NNGRVKVESKKDMKKRGIPSPNMADALVMCYAPTKPKSLLDL 394 >gi|68249883|ref|YP_248995.1| phage terminase large subunit [Haemophilus influenzae 86-028NP] gi|68058082|gb|AAX88335.1| predicted phage terminase large subunit [Haemophilus influenzae 86-028NP] Length = 438 Score = 145 bits (366), Expect = 2e-32, Method: Composition-based stats. Identities = 65/441 (14%), Positives = 152/441 (34%), Gaps = 64/441 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GK+ A L++ ++ R + V C + + ++ ++ + L FE+ Sbjct: 12 GGRGSGKSWGVAQLLI-EIAVRTKVRVFCGRELQNSMSDSVIKLIADTIEDLGYLEEFEV 70 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 Q +++ S+ + + + + + G + +EA Sbjct: 71 QRNAIYCLKTGSEFMFYGIKNNP------------NKIKSLEGID-----LVWIEEAENV 113 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQIDTRTVEGI 261 + ++ + + + W+ T NP+ + Y+ F P + + R +I+ Sbjct: 114 SNESWDILIPTIRKERSE-IWV-TFNPKNILDPTYQRFVIAPPKNSFVR-KINYDENPYF 170 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLI 319 + + D ++ R G+ I IE A++ ++ P I Sbjct: 171 PETLRLEMEECKERDYELYRHIWLGEPVADSDKVIIKPVWIECAVDAHKKLGFLPAGRKI 230 Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 +G D+A++G D+ G V+ + +W D+ + ++ ++ + I+ D+ Sbjct: 231 VGFDVADDGVDSNANAFVHGSVVLRVDEWRGEDVIGSADRTRLNALEFGANEIVYDSIGV 290 Query: 380 GART---CDYLEMLGYHVYRVLGQK-----------RAVDLEFCRNRRTELHVKMA---- 421 GA L+ + + + N + + ++ Sbjct: 291 GAGVKAHYHRLDDKSIRINGFNAGGAVFEPDAEYVYGKTNRDMFANIKAQAWWRLRDRFY 350 Query: 422 ---------------DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGA 463 + + +S I ++ + G + +ESK + +G Sbjct: 351 KTYRAITYEEQYPVDEMISLSSDIRDLEYLKAELARPYVDYDGNGRVKVESKKDMKKRGI 410 Query: 464 KSTDYSDGLMYTFAENPPRSD 484 S + +D L+ FA P+ D Sbjct: 411 PSPNKADALVMCFA---PKED 428 >gi|329119006|ref|ZP_08247700.1| phage terminase large subunit [Neisseria bacilliformis ATCC BAA-1200] gi|327464879|gb|EGF11170.1| phage terminase large subunit [Neisseria bacilliformis ATCC BAA-1200] Length = 449 Score = 145 bits (366), Expect = 2e-32, Method: Composition-based stats. Identities = 56/322 (17%), Positives = 105/322 (32%), Gaps = 42/322 (13%) Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQI 253 +EA ++ + W+ NP+ + Y+ F + P D + Sbjct: 114 WVEEAEAVTKNSWDVLIPSIRGDKNAEIWVSF-NPKNILDDTYQRFIVHPPKDS-IVLKA 171 Query: 254 DTRTVEGI-DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 + D ++ D D+ R G+ + I + IE A++ Sbjct: 172 NYDINPHFADTPLLADMLECKERDEDLYRHIWLGEPVADSELAIIKPSWIEAAIDAHEKL 231 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370 I+G D+A+EG D VLR G V+ + W D+ + +K+ ++ D Sbjct: 232 GFSAAGRRILGFDVADEGDDANATVLRHGSVVTDMQQWRGQDVIYSADKVYLYAQEQNVD 291 Query: 371 AIIIDANNTGART-------CDYLEMLGYHVYRVLGQKRA------VDLEFCRNRRTELH 417 I+ D GA ++ LG++ + + A + + N + + Sbjct: 292 RIVYDNIGVGAGVKAQFRRKNGKVQTLGFNAGGAVYKPDAKYTDDKKNRDMFANIKAQAW 351 Query: 418 VKMAD-------WLEFASLINHSGLIQ------------NLKSLKSFIVPNTGELAIESK 458 + D + LI S G + ESK Sbjct: 352 WMVRDRFYKTWRAVHHGDSYPEDQLISLSSSLHELEYLTAELSRPQVDYDQNGRVKAESK 411 Query: 459 ---RVKGAKSTDYSDGLMYTFA 477 + +G S + +D L+ FA Sbjct: 412 KDMKKRGIPSPNRADALVMVFA 433 >gi|309379923|emb|CBX21334.1| unnamed protein product [Neisseria lactamica Y92-1009] Length = 449 Score = 144 bits (362), Expect = 4e-32, Method: Composition-based stats. Identities = 55/322 (17%), Positives = 104/322 (32%), Gaps = 42/322 (13%) Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQI 253 +EA ++ + W+ NP+ + Y F + P D + Sbjct: 114 WVEEAEAVTKNSWDVLIPSIRGDKNAEIWVSF-NPKNILDDTYRRFIVHPPQDS-IVLKA 171 Query: 254 DTRTVEGI-DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 + D ++ D D+ R G+ + I + IE A++ Sbjct: 172 NYDINPHFADTPLLADMLECKERDEDLYRHIWLGEPVADSELAIIKPSWIEAAIDAHEKL 231 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370 I+G D+A+EG D VLR G V+ + W D+ + +K+ ++ D Sbjct: 232 GFQAAGKRILGFDVADEGDDANATVLRHGSVVTDMRQWRGQDVIYSADKVYLYAQEQDID 291 Query: 371 AIIIDANNTGARTC-------DYLEMLGYHVYRVLGQKRA------VDLEFCRNRRTELH 417 I+ D GA ++ LG++ + + A + + N + + Sbjct: 292 RIVYDNIGVGAGVKAQFRRKRGKVQTLGFNAGGAVYKPDAKYTDDKKNRDMFANIKAQAW 351 Query: 418 VKMAD-------WLEFASLINHSGL------------IQNLKSLKSFIVPNTGELAIESK 458 + D + L + S G + ESK Sbjct: 352 WMVRDRFYKTWRAVHHGDSYPEDQLVSLSSSLHELEYLTAELSRPQVDYDQNGRVKAESK 411 Query: 459 ---RVKGAKSTDYSDGLMYTFA 477 + +G S + +D L+ FA Sbjct: 412 KDMKKRGIPSPNRADALVMAFA 433 >gi|145629819|ref|ZP_01785613.1| predicted phage terminase large subunit [Haemophilus influenzae 22.1-21] gi|148827544|ref|YP_001292297.1| hypothetical protein CGSHiGG_04845 [Haemophilus influenzae PittGG] gi|144977965|gb|EDJ87753.1| predicted phage terminase large subunit [Haemophilus influenzae 22.1-21] gi|148718786|gb|ABQ99913.1| hypothetical protein CGSHiGG_04845 [Haemophilus influenzae PittGG] Length = 449 Score = 143 bits (361), Expect = 6e-32, Method: Composition-based stats. Identities = 65/441 (14%), Positives = 151/441 (34%), Gaps = 64/441 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GK+ A L++ ++ R + V C + + ++ ++ + L FE+ Sbjct: 23 GGRGSGKSWGVAQLLV-EIAVRTKVRVFCGRELQNSMSDSVIKLIADTIEDLGYLEEFEV 81 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 Q +++ S+ + + + + + G + +EA Sbjct: 82 QRNAIYCLKTGSEFMFYGIKNNP------------NKIKSLEGID-----LVWIEEAENV 124 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQIDTRTVEGI 261 + ++ + + + W+ T NP+ + Y+ F P + + R +I+ Sbjct: 125 SNESWDILIPTIRKERSE-IWV-TFNPKNILDPTYQRFVIAPPKNSFVR-KINYDENPYF 181 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLI 319 + + D ++ R G+ I IE A++ ++ P I Sbjct: 182 PETLRLEMEECKERDYELYRHIWLGEPVADSDKVIIKPVWIECAVDAHKKLGFLPAGRKI 241 Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 +G D+A++G D+ G V+ + +W D+ + ++ ++ + I+ D+ Sbjct: 242 VGFDVADDGVDSNANAFVHGSVVLRVDEWHGEDVIGSADRTRLNALEFGANEIVYDSIGV 301 Query: 380 GART---CDYLEMLGYHVYRVLGQK-----------RAVDLEFCRNRRTELHVKMA---- 421 GA L+ + + + N + + + Sbjct: 302 GAGVKAHYHRLDDKSIRINGFNAGGAVFEPDAEYVYGKTNRDMFANIKAQAWWCLRDRFY 361 Query: 422 ---------------DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGA 463 + + +S I ++ + G + +ESK + +G Sbjct: 362 KTYRAITYEEQYPVDEMISLSSDIRDLEYLKAELARPYVDYDGNGRVKVESKKDMKKRGI 421 Query: 464 KSTDYSDGLMYTFAENPPRSD 484 S + +D L+ FA P+ D Sbjct: 422 PSPNKADALVMCFA---PKED 439 >gi|319775727|ref|YP_004138215.1| phage terminase large subunit [Haemophilus influenzae F3047] gi|319896735|ref|YP_004134928.1| phage terminase large subunit [Haemophilus influenzae F3031] gi|317432237|emb|CBY80589.1| predicted phage terminase large subunit [Haemophilus influenzae F3031] gi|317450318|emb|CBY86534.1| predicted phage terminase large subunit [Haemophilus influenzae F3047] Length = 449 Score = 143 bits (360), Expect = 7e-32, Method: Composition-based stats. Identities = 65/441 (14%), Positives = 151/441 (34%), Gaps = 64/441 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GK+ A L++ ++ R + V C + + ++ ++ + L FE+ Sbjct: 23 GGRGSGKSWGVAQLLV-EIAVRTKVRVFCGRELQNSMSDSVIKLIADTIEDLGYLEDFEV 81 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 Q +++ S+ + + + + + G + +EA Sbjct: 82 QRNAIYCLKTGSEFMFYGIKNNP------------NKIKSLEGID-----LVWIEEAENV 124 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQIDTRTVEGI 261 + ++ + + + W+ T NP+ + Y+ F P + + R +I+ Sbjct: 125 SNESWDILIPTIRKERSE-IWV-TFNPKNILDPTYQRFVIAPPKNSFVR-KINYDENPYF 181 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLI 319 + + D ++ R G+ I IE A++ ++ P I Sbjct: 182 PETLRLEMEECKERDYELYRHIWLGEPVADSDKVIIKPVWIECAVDAHKKLGFLPAGRKI 241 Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 +G D+A++G D+ G V+ + +W D+ + ++ ++ + I+ D+ Sbjct: 242 VGFDVADDGVDSNANAFVHGSVVLRVDEWRGEDVIGSADRTRLNALEFGANEIVYDSIGV 301 Query: 380 GART---CDYLEMLGYHVYRVLGQK-----------RAVDLEFCRNRRTELHVKMA---- 421 GA L+ + + + N + + + Sbjct: 302 GAGVKAHYHRLDDKSIRINGFNAGGAVFEPDAEYVYGKTNRDMFANIKAQAWWCLRDRFY 361 Query: 422 ---------------DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGA 463 + + +S I ++ + G + +ESK + +G Sbjct: 362 KTYRAITYEEQYPVDEMISLSSDIRDLEYLKAELARPYVDYDGNGRVKVESKKDMKKRGI 421 Query: 464 KSTDYSDGLMYTFAENPPRSD 484 S + +D L+ FA P+ D Sbjct: 422 PSPNKADALIMCFA---PKED 439 >gi|260583110|ref|ZP_05850891.1| phage terminase large subunit [Haemophilus influenzae NT127] gi|260093822|gb|EEW77729.1| phage terminase large subunit [Haemophilus influenzae NT127] Length = 445 Score = 143 bits (360), Expect = 7e-32, Method: Composition-based stats. Identities = 65/441 (14%), Positives = 151/441 (34%), Gaps = 64/441 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GK+ A L++ ++ R + V C + + ++ ++ + L FE+ Sbjct: 19 GGRGSGKSWGVAQLLV-EIAVRTKVRVFCGRELQNSMSDSVIKLIADTIEDLGYLEEFEV 77 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 Q +++ S+ + + + + + G + +EA Sbjct: 78 QRNAIYCLKTGSEFMFYGIKNNP------------NKIKSLEGID-----LVWIEEAENV 120 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQIDTRTVEGI 261 + ++ + + + W+ T NP+ + Y+ F P + + R +I+ Sbjct: 121 SNESWDILIPTIRKERSE-IWV-TFNPKNILDPTYQRFVIAPPKNSFVR-KINYDENPYF 177 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLI 319 + + D ++ R G+ I IE A++ ++ P I Sbjct: 178 PETLRLEMEECKERDYELYRHIWLGEPVADSDKVIIKPVWIECAVDAHKKLGFLPAGRKI 237 Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 +G D+A++G D+ G V+ + +W D+ + ++ ++ + I+ D+ Sbjct: 238 VGFDVADDGVDSNANAFVHGSVVLRVDEWHGEDVIGSADRTRLNALEFGANEIVYDSIGV 297 Query: 380 GART---CDYLEMLGYHVYRVLGQK-----------RAVDLEFCRNRRTELHVKMA---- 421 GA L+ + + + N + + + Sbjct: 298 GAGVKAHYHRLDDKSIRINGFNAGGAVFEPDAEYVYGKTNRDMFANIKAQAWWCLRDRFY 357 Query: 422 ---------------DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGA 463 + + +S I ++ + G + +ESK + +G Sbjct: 358 KTYRAITYEEQYPVDEMISLSSDIRDLEYLKAELARPYVDYDGNGRVKVESKKDMKKRGI 417 Query: 464 KSTDYSDGLMYTFAENPPRSD 484 S + +D L+ FA P+ D Sbjct: 418 PSPNKADALVMCFA---PKED 435 >gi|145638997|ref|ZP_01794605.1| terminase large subunit-like protein [Haemophilus influenzae PittII] gi|145271969|gb|EDK11878.1| terminase large subunit-like protein [Haemophilus influenzae PittII] Length = 379 Score = 142 bits (359), Expect = 1e-31, Method: Composition-based stats. Identities = 56/332 (16%), Positives = 111/332 (33%), Gaps = 40/332 (12%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252 + +E ++ + E + I++ NP+ + Y+ F P + K Sbjct: 50 VVWVEEGENVSKESWDILIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVL 107 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 ++ + E + D ++ R G+ + I IE A++ + Sbjct: 108 VNWQDNPYFPKELMEDMEQMRERDYELYRHVYEGEPVADSDLAIIKPVWIESAVDAHLKL 167 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370 +G D+A+EG D G V+ + W D+ + N+ + K++ D Sbjct: 168 GFTTKGMKKVGFDVADEGADANANAFVHGSVVLGVEVWKNGDVIDSANRTNQSAVKFKAD 227 Query: 371 AIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLE-----------FCRNRRTELH 417 II D+ GA + + L V E N + + Sbjct: 228 LIIFDSIGVGAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKDKKNQDMFSNIKAQSW 287 Query: 418 VKMAD-------WLEFASLINHSGLI------------QNLKSLKSFIVPNTGELAIESK 458 + D +++ + LI + S N G + +ESK Sbjct: 288 WALRDRFYKTYRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESK 347 Query: 459 ---RVKGAKSTDYSDGLMYTFAENPPRSDMDF 487 + +G S + +D L+ +A P+S +D Sbjct: 348 KDMKKRGIPSPNMADALVMCYAPTKPKSLLDL 379 >gi|307251380|ref|ZP_07533296.1| hypothetical protein appser4_21360 [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|306856621|gb|EFM88761.1| hypothetical protein appser4_21360 [Actinobacillus pleuropneumoniae serovar 4 str. M62] Length = 384 Score = 135 bits (340), Expect = 1e-29, Method: Composition-based stats. Identities = 57/376 (15%), Positives = 121/376 (32%), Gaps = 46/376 (12%) Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 E Q L+ P++ +G + ++ + + G + +E Sbjct: 2 LEDQIEILNLKPFFEVQKTQIIGRNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEG 56 Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVE 259 ++ + E + I++ NP+ L Y+ F P + ++ + Sbjct: 57 ENVSKESWDVLIPTIREDGSQI--IVSFNPKNLLDDTYQRFVINPPERCCSVLVNWQDNP 114 Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAP 317 E + D ++ R GQ + I IE+A++ ++ Sbjct: 115 YFPKELMEDMKQMKERDFELYRHVYEGQPVADSDLAIIKPLWIEKAVDAHKKLGFTASGR 174 Query: 318 LIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDAN 377 ++G D+A+EG D G V+ + +W D+ + ++ + D I+ D+ Sbjct: 175 KVVGFDVADEGIDANANCFAHGSVVLQVDEWRGDDVIQSAHRTHTNAVMWGVDEIVFDSI 234 Query: 378 NTGART---CDYLEMLGYHVYRVLGQKRAVDL-----------EFCRNRRTELHVKMAD- 422 GA ++ + E N + + + D Sbjct: 235 GVGAGVKAEYRRMDTKRILCSGFNAGASVFEPDEYYTQDKTNGEMFANIKAQAWWLLRDR 294 Query: 423 ------WLEFASLINHSGLI------------QNLKSLKSFIVPNTGELAIESKR---VK 461 +EF + +I + S N G++ +ESK+ + Sbjct: 295 FYKTYRAIEFGDVYPVDEMISLSSDIKDLEYLKAELSRPRVDHDNNGKVRVESKKDMRKR 354 Query: 462 GAKSTDYSDGLMYTFA 477 G S + +D L+ FA Sbjct: 355 GIPSPNKADSLVMCFA 370 >gi|85058727|ref|YP_454429.1| phage terminase large subunit [Sodalis glossinidius str. 'morsitans'] gi|84779247|dbj|BAE74024.1| phage terminase large subunit [Sodalis glossinidius str. 'morsitans'] Length = 456 Score = 135 bits (340), Expect = 2e-29, Method: Composition-based stats. Identities = 73/456 (16%), Positives = 141/456 (30%), Gaps = 68/456 (14%) Query: 74 NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133 P +K A GRG GK+ A L + R G A E ++ Sbjct: 13 QPHRYKIA-KGGRGSGKSW--AIARLLVEIARRGTYRFLCA-----------REFQASMA 58 Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193 + + + + + + + + + G Sbjct: 59 DSVIQLIADTIQREGYLKEFEIQKAYIRYLATDSLFMFYGIKNNVTKIKSLEGID----- 113 Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252 +EA ++ + + + W+ NP+ + Y+ F PLDD Sbjct: 114 IAWVEEAEAVTKESWDILIPTIRKPGSE-IWVSF-NPKNILDDTYQRFVVNPLDDICLLT 171 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 + + D D+ G+ + I I A++ Sbjct: 172 VHYTDNPHFPEVLRLEMEECKCKDYDLYLHIWEGEPVADSDLAIIKPLWIAAAVDAHITL 231 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370 +P +G D+A+EG D+ ++L G V+ HL W+K D+ + +++ E D Sbjct: 232 GFEPAGKKRIGFDVADEGEDSNALILSHGSVVMHLETWNKGDVIQSADRVKNYAESVIAD 291 Query: 371 AIIIDANNTGARTCDYLEML------GYHVYRVLGQKRA------VDLEFCRNRRTELHV 418 II D+ GA L + G++ + + A + + N + + Sbjct: 292 EIIFDSIGVGAGVKARLRRVSRITASGFNAGGGVFKPDAKYVDGKTNKDMFVNLKAQAWW 351 Query: 419 KMADW----LEFASLI----NHSGLIQNLK---------------------SLKSFIVPN 449 + + I + S ++ L+ S N Sbjct: 352 GVRERFYNTWHAVEYIKHHPDDSDFVKGLRDDQLISLSSRLSSLDYLKAELSRPWVDYDN 411 Query: 450 TGELAIESK---RVKGAKSTDYSDGLMYTFAENPPR 482 G + +ESK + +G S + +D L+ FA Sbjct: 412 NGRVKVESKKDMKKRGIPSPNRADALIMAFAPTYKP 447 >gi|149174861|ref|ZP_01853485.1| hypothetical protein PM8797T_10814 [Planctomyces maris DSM 8797] gi|148846198|gb|EDL60537.1| hypothetical protein PM8797T_10814 [Planctomyces maris DSM 8797] Length = 568 Score = 132 bits (332), Expect = 1e-28, Method: Composition-based stats. Identities = 84/530 (15%), Positives = 153/530 (28%), Gaps = 141/530 (26%) Query: 52 RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVI 111 WQ + +E + + + + G GK +I Sbjct: 57 DDWQWDILESLFD----------LTIRRVFVKGNTGCGKGAAAGIACCTYFHIWNDAKII 106 Query: 112 CLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYST 171 +S + + EV KW + K ++ + + +S L Sbjct: 107 ITRDSVRTAQKIAFGEVDKWWRKMRFKPPGKLLTSGVFDNNQHSISL------------- 153 Query: 172 MCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPR 231 + + + F G H+ + + DEA+ + T+ + ++ SNP Sbjct: 154 ----ANPQHIEGFRGAHSPH-VFFWFDEAT--APNLEDKYKLANTQA---KKFLALSNPS 203 Query: 232 RLSGKFYEIFNKPLDDW-----------KRFQIDTRTVEGIDPSFHEGIIARYG------ 274 LSG F + F D + + + E +A G Sbjct: 204 TLSGTFRDSFPVVNPDKTQTIIDQYGNTRCITVSGWECTNVKEKCLEQPVAPIGGIKISD 263 Query: 275 ------------------------------------LDSDVTRVEVCGQFPQQDID-SFI 297 D + V G+FP QD D I Sbjct: 264 NYYPHGSPIAADDFEKVQPRIPGQTCYDEFMALLNDADPLIRNVYALGKFPDQDPDKQVI 323 Query: 298 PLNIIEE------ALNREPCPDPYAPLIM--------------GCDIA--EEGGDNTVVV 335 + + E NR I+ G D+A G D +V+ Sbjct: 324 LPDWLIEPVKFWTRWNRLCLRAREQFHILALKLLEQILPVEGFGLDVAASRFG-DASVLA 382 Query: 336 LRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA------IIID-ANNTGARTCDYLE 388 + I + + +D + T + + + D I ID G D L+ Sbjct: 383 VGGRYGIRAIHECQFSDTQQTMSWVLETANSHGVDLEQGIVPIAIDWGGGYGNAVGDPLK 442 Query: 389 MLGYHVYRVLG-QKRAVDLEFCRNRRTELHVKMADWLEFAS--------LINHSGLIQNL 439 +V + G +D + N+R EL+ + A L+ A L ++ L L Sbjct: 443 KRNVNVIEIHGNASSNLDSKKYANKRAELYGEAARRLDPAGDFRMMPFALPDNQRLKAEL 502 Query: 440 KSLKSFIVPNTG-ELAIESKRVKG--------------AKSTDYSDGLMY 474 + + + G + I K +G +S D +D ++Y Sbjct: 503 VAPEKIYAGHDGEKYYITPKGRRGSDANYNGKTLHEILGRSPDRADAVVY 552 >gi|261402679|ref|YP_003246903.1| protein of unknown function DUF264 [Methanocaldococcus vulcanius M7] gi|261369672|gb|ACX72421.1| protein of unknown function DUF264 [Methanocaldococcus vulcanius M7] Length = 437 Score = 132 bits (332), Expect = 1e-28, Method: Composition-based stats. Identities = 74/390 (18%), Positives = 138/390 (35%), Gaps = 47/390 (12%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141 ++AGR GK+ L +L+++L T+ +A + ++ E+ ++ Sbjct: 50 VAAGRRFGKSKLMCFLLIFLSCTQKDKKFAVIAPYYANAR-IIFKELRTYIEKNKTLQKL 108 Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201 + +P+ ID + S + P + G +I DEA+ Sbjct: 109 ---VKRITESPYMVIEFKTGCIIDFR---------SADNPTSIRG---ESYHLVILDEAA 153 Query: 202 GT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF---NKPLDDWKRFQIDTRT 257 DV+ I L + +A I S P + FYE F + F+ T + Sbjct: 154 FIKDDVVKYVIKPLLIDYDAP--LIEISTPNGHN-HFYESFLMGENRQNRHISFRFPTWS 210 Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI----DSFIPLNIIEEALNREPCPD 313 + S E I +G DS V + E C +F +I I+ + + Sbjct: 211 NPFLPKSVIEEIKREFGEDSLVWKQEFCAEFIDDQDAVFKWEYI-QQCIDSNIELLTVGE 269 Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRT---TNNKISGLVEKYRPD 370 +MG D+A+ +++L L + + + +I L K++P Sbjct: 270 KGHRYVMGVDLAKYQDYTVIIILDVSENPYKLVYFERFKDKPYSYVVERIKELYIKFKP- 328 Query: 371 AIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI 430 + +D+ G + LE ++ Q + +L K+ LE +I Sbjct: 329 VVCVDSTGVGDPVVEQLEDCNPIPFKFTNQSK-----------MQLITKLQTALERKEVI 377 Query: 431 --NHSGLIQNLKSLKSFIVPNTGELAIESK 458 LI LK + V ++ E+K Sbjct: 378 FPYIDTLITELKYFRY--VKKKTTISFEAK 405 >gi|241763591|ref|ZP_04761642.1| phage terminase large subunit [Acidovorax delafieldii 2AN] gi|241367184|gb|EER61538.1| phage terminase large subunit [Acidovorax delafieldii 2AN] Length = 521 Score = 130 bits (327), Expect = 5e-28, Method: Composition-based stats. Identities = 58/233 (24%), Positives = 94/233 (40%), Gaps = 26/233 (11%) Query: 275 LDSDVTRVEVCGQFPQQDID---SFIPLNIIEEALNR-EPCPDPYAPLIMGCDIAEEGGD 330 L + + G F D IP ++ A R +P D ++G D A G D Sbjct: 267 LPEPLRSQMLRGDFSAGAADPAWQLIPTEWVKAAQARWQPRQDKGPMTVLGLDPARGGTD 326 Query: 331 NTVVVLRRGPVIEHLFDWSK---TDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387 T V R + L D TT + LV I +DA G+ D++ Sbjct: 327 KTSVARRHDCWFDVLISEPGIVTKDGPTTAAFTAPLVR--NGAPIAVDAIGIGSSALDFI 384 Query: 388 EMLGYHVYRVLGQKRAVDLE-----FCRNRRTELHVKMADWL-----EFASLINHSGLIQ 437 + LG VY V+G +R+ ++ RNRR E++ ++ + L + +L L+ Sbjct: 385 QGLGLLVYAVVGSERSDHMDKAGTMRFRNRRAEMYWRLREALDPTAEQPIALPPDQELLG 444 Query: 438 NLKSLKSFIVPNTGE---LAIESK---RVKGAKSTDYSDGLMYTFAENPPRSD 484 +L +++ + V G+ + I K R +S D D + TF E P D Sbjct: 445 DLTAVR-YKVVTMGQGAAIQIRDKDEIREALGRSPDKGDSVAMTFCEGIPLLD 496 >gi|303243859|ref|ZP_07330199.1| protein of unknown function DUF264 [Methanothermococcus okinawensis IH1] gi|302485795|gb|EFL48719.1| protein of unknown function DUF264 [Methanothermococcus okinawensis IH1] Length = 445 Score = 128 bits (322), Expect = 2e-27, Method: Composition-based stats. Identities = 66/321 (20%), Positives = 121/321 (37%), Gaps = 31/321 (9%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141 ++AGR GK+ L A+L+++L ST+ +A + ++ E+ K++ + Sbjct: 56 VAAGRRFGKSKLMAFLLIFLCSTQKNKKYAVIAPFYANAR-IIFRELKKYIEKS---NVL 111 Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201 + +P+ + ID + S + P + G +I DEA+ Sbjct: 112 SRLVKRMVESPYMAIEFKTGCTIDFR---------SADNPTSIRG---ESYHLVILDEAA 159 Query: 202 GT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF---NKPLDDWKRFQIDTRT 257 DV+ I L + +A I S P + FYE F + F+ T T Sbjct: 160 FIKDDVVKYVIKPLLLDYDAP--LIEISTPNGHN-HFYESFLMGKNKQNRHISFRFPTWT 216 Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI----DSFIPLNIIEEALNREPCPD 313 + + E I G DS V + E C +F + +I I+ + + Sbjct: 217 NPFLPKNAIEEIKQEVGEDSPVWKQEYCAEFIDNNEAVFNWEYI-QQCIDGTIKLLKSGE 275 Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRT---TNNKISGLVEKYRPD 370 +MG D+A+ + VL L + + +L +K+ L + + Sbjct: 276 SGHQYVMGVDLAKFEDYTVITVLDVSVKPYKLVYFERFNLMPYSFVADKVKELYQLFNKP 335 Query: 371 AIIIDANNTGARTCDYLEMLG 391 + +DA GA + +E L Sbjct: 336 QVCMDATGPGAAVVEQVESLN 356 >gi|187476925|ref|YP_784949.1| phage terminase large subunit [Bordetella avium 197N] gi|115421511|emb|CAJ48020.1| Putative phage terminase large subunit [Bordetella avium 197N] Length = 512 Score = 120 bits (302), Expect = 4e-25, Method: Composition-based stats. Identities = 75/359 (20%), Positives = 123/359 (34%), Gaps = 60/359 (16%) Query: 194 AIINDEASGTPDVINLGILG--FLTERNANRFWIMTSNP-RRLSGKFYEIFNKPLDDWKR 250 I+ DEA+ + +LG T+ +MT NP + G++ + P D K Sbjct: 144 LIVLDEATELREHQARFVLGWNRTTKAGQRCRVLMTFNPPTTVEGRWVVEYFAPWLDPKH 203 Query: 251 FQ------------IDTRTVEGI---------------DPSFHEGIIAR----------- 272 ID + VE +F IA Sbjct: 204 PHPAKPGELRWFAVIDGKEVEVEGGAPFAHNGETIVPRSRTFIPSRIADNPFLMGTGYES 263 Query: 273 --YGLDSDVTRVEVCGQF---PQQDIDSFIPLNIIEEALNREPCPDPYAPLI-MGCDIAE 326 L + + G F + D IP +E A R PD AP+ +G D+A Sbjct: 264 VLQSLPEPLRSQMLYGDFNAGIEDDPWQVIPTAWVEAAQARWKRPDRLAPMDSLGLDVAR 323 Query: 327 EGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII-IDANNTGARTCD 385 G D T++ R G + + D + R A+I +D GA D Sbjct: 324 GGRDKTILARRHGWWFDEPLVYPGKDTPDGPTVAGLAISALRDHAVIHLDVIGVGASPYD 383 Query: 386 YLEMLGYHVYRVLGQKRAVDLE-----FCRNRRTELHVKMADWLE-----FASLINHSGL 435 +L V V + A + NRR+EL +M + L+ +L L Sbjct: 384 FLVTAKQQVVGVNVAEAACGTDKSGRLRFFNRRSELWWRMREALDPIHNTGIALPPDPRL 443 Query: 436 IQNLKSLKSFIVPNTGELA-IESKRVKGAKSTDYSDGLMYTFAENPPRSDMD-FGRCPS 492 + +L + + T ++A E K +S D+ + + P R+ ++ G+ S Sbjct: 444 LADLTAPTWSLSGATLKVASREDIIDKIGRSPDFGSAYVLALMDTPKRAAVEALGQARS 502 >gi|41179386|ref|NP_958694.1| Bbp25 [Bordetella phage BPP-1] gi|45569518|ref|NP_996587.1| hypothetical protein BMP-1p24 [Bordetella phage BMP-1] gi|45580769|ref|NP_996635.1| hypothetical protein BIP-1p24 [Bordetella phage BIP-1] gi|40950125|gb|AAR97691.1| Bbp25 [Bordetella phage BPP-1] Length = 533 Score = 118 bits (295), Expect = 2e-24, Method: Composition-based stats. Identities = 51/237 (21%), Positives = 87/237 (36%), Gaps = 21/237 (8%) Query: 275 LDSDVTRVEVCGQF---PQQDIDSFIPLNIIEEALNREPCPDPYAPLI-MGCDIAEEGGD 330 L + + G F + D IP +E A R PD AP+ +G D+A G D Sbjct: 289 LPEPLRSQMLYGDFNAGIEDDPWQVIPTAWVEAAQARWKRPDRLAPMDSLGVDVARGGRD 348 Query: 331 NTVVVLRRGPVIEHLFDWSKTDL---RTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387 NT++ R + + D T + + I +D GA D+L Sbjct: 349 NTILARRHAMWFDVPLTYPGKDTPDGPTVAGLAIAALRDH--AVIHLDVIGVGASPYDFL 406 Query: 388 EMLGYHVYRVLGQKRAVDLE-----FCRNRRTELHVKMADWLE-----FASLINHSGLIQ 437 V V + A + N R+EL +M + L+ +L L+ Sbjct: 407 AQAKQQVVGVNVAEAARGTDKSGRLRFFNLRSELWWRMREALDPTNNTGIALPPDPRLLA 466 Query: 438 NLKSLKSFIVPNTGELA-IESKRVKGAKSTDYSDGLMYTFAENPPRSDMD-FGRCPS 492 +L + + T ++A E K +S D+ + + P R+ ++ G+ S Sbjct: 467 DLTAPTWSLSGATLKVASREDIIEKIGRSPDFGSAYVLALMDTPKRAAVEALGQARS 523 >gi|300907068|ref|ZP_07124735.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 84-1] gi|301304068|ref|ZP_07210185.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 124-1] gi|300401186|gb|EFJ84724.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 84-1] gi|300840675|gb|EFK68435.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 124-1] gi|315257729|gb|EFU37697.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 85-1] Length = 440 Score = 114 bits (285), Expect = 4e-23, Method: Composition-based stats. Identities = 52/340 (15%), Positives = 105/340 (30%), Gaps = 52/340 (15%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252 + +EA + + + + + ++ NP ++ + P +D + Sbjct: 96 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 153 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D D + G D + I L+ IE A++ + Sbjct: 154 INYDENPFLSDTMLKVIEAAKRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 213 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368 +P +G D+A+ G D V R G V+ +W + +L + + + R Sbjct: 214 NFEPSGRKRIGFDVADSGADKCANVYRHGSVVYWADEWKAKEDELLKSCQRTYQAALE-R 272 Query: 369 PDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQKRA----------VDL 406 I+ D+ GA + + R + Sbjct: 273 DADIVYDSIGVGASAGAKFAEINEDRKRENMNASRINYQRFNAGAGVNEPDYEYIGIPNK 332 Query: 407 EFCRNRRTELHVKMAD-------WLEFASLINHSGLIQNLKSLKSF------------IV 447 +F N + + +AD ++ LI S Sbjct: 333 DFFANLKAQAWWLVADRFRNTFNAVKNGEQYPVDELISIDSSCPLLEKLKLELTTPHRDF 392 Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 393 DKNGRVMVESKKDLAKRDVPSPNVADAFIMAFAPTDTAMD 432 >gi|226940437|ref|YP_002795511.1| Terminase large subunit [Laribacter hongkongensis HLHK9] gi|226715364|gb|ACO74502.1| Terminase large subunit [Laribacter hongkongensis HLHK9] Length = 133 Score = 113 bits (282), Expect = 8e-23, Method: Composition-based stats. Identities = 39/126 (30%), Positives = 53/126 (42%), Gaps = 11/126 (8%) Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170 + AN++TQL+T EV KW L HWF+ QS S+ +K + Sbjct: 1 MITANTDTQLRTKTSPEVGKWQRLSITSHWFDPQSASIAA----------RDKEHAKTWR 50 Query: 171 TMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229 +SE + F G HN + +I DEAS D + G LT+ WI N Sbjct: 51 ADFVPWSEHNTEAFAGLHNKGKRIVLIFDEASAIADKVWEVAEGALTDEETEIIWIAFGN 110 Query: 230 PRRLSG 235 P R G Sbjct: 111 PTRNIG 116 >gi|229125159|ref|ZP_04254306.1| hypothetical protein bcere0016_54220 [Bacillus cereus 95/8201] gi|228658294|gb|EEL13987.1| hypothetical protein bcere0016_54220 [Bacillus cereus 95/8201] Length = 164 Score = 112 bits (280), Expect = 1e-22, Method: Composition-based stats. Identities = 29/147 (19%), Positives = 53/147 (36%), Gaps = 21/147 (14%) Query: 354 RTTNNKISGLVEKY--------RPDAIIIDANNTGARTCDYLEM------LGYHVYRVLG 399 + +KY + I ID G D L+ V + Sbjct: 1 MYVTGLLIKEAKKYFSWCERTGKRIPIRIDDTGVGGGVTDRLKEVVAENDYPIDVIPINF 60 Query: 400 QKRAVDLEFCRNRRTELHVKMADW-LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK 458 + + ++ D LEF S+ + LI L S++ + + + G + IE K Sbjct: 61 ASK--GNAEYACIVSVMYGHFKDNCLEFVSIPDDEDLIAQL-SVRKYQINSDGRIKIEPK 117 Query: 459 ---RVKGAKSTDYSDGLMYTFAENPPR 482 + +G KS D ++ ++ FA P+ Sbjct: 118 KAMKDRGLKSPDRAEAVVMAFAPFYPK 144 >gi|218290759|ref|ZP_03494841.1| protein of unknown function DUF264 [Alicyclobacillus acidocaldarius LAA1] gi|218239297|gb|EED06496.1| protein of unknown function DUF264 [Alicyclobacillus acidocaldarius LAA1] Length = 422 Score = 111 bits (277), Expect = 3e-22, Method: Composition-based stats. Identities = 65/365 (17%), Positives = 125/365 (34%), Gaps = 42/365 (11%) Query: 49 SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108 S P S QL + + H + + F+ A + GR GKT A + PG Sbjct: 7 SEPTSKQLR-LRLYTPHSGQVALHRSTARFRVA-TCGRRWGKTYACANEIAKWAWEHPGA 64 Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168 +A + Q + + + ++ + + Sbjct: 65 MTWWVAPTYRQ----------------------TLTAYRIITRNFHGAIEKATTTHMRIE 102 Query: 169 YSTMCRT--YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGIL-GFLTERNANRFWI 225 + + T S E D G ++ DEA+ P L L+++ I Sbjct: 103 WKSGSITEFRSTENFDALRG---EGLDFLVVDEAAMVPKEAWEAALRPTLSDKAGRA--I 157 Query: 226 MTSNPRRLSGKFYEIFNKP----LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTR 281 + S P+ + FY ++ + +W+ F+ T I P E AR L SDV R Sbjct: 158 IVSTPKGRN-WFYHVWARGQDPAFPEWESFRFPTLANPYIPPEEVEE--ARTTLPSDVFR 214 Query: 282 VEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLR-RGP 340 E +F + F + +E P P ++G D+A+ + +VV+ Sbjct: 215 QEYEAEFLEDSAGVFRGIRDCIS--GQEEEPQPGRRYVVGWDVAKHQDFSVLVVMDLERA 272 Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQ 400 + + +++ D ++ + ++Y +++DA G + + +G Sbjct: 273 HVVKMDRFNQVDYALQLERVKHICQRYNNARLLMDATGVGDPLLEQVRRMGIQAEGYSLS 332 Query: 401 KRAVD 405 A Sbjct: 333 NTAKQ 337 >gi|74311301|ref|YP_309720.1| putative bacteriophage protein [Shigella sonnei Ss046] gi|73854778|gb|AAZ87485.1| putative bacteriophage protein [Shigella sonnei Ss046] Length = 473 Score = 110 bits (274), Expect = 7e-22, Method: Composition-based stats. Identities = 50/340 (14%), Positives = 103/340 (30%), Gaps = 52/340 (15%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-Q 252 + +EA + + + + + ++ NP ++ + F + + Sbjct: 128 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEGTLIRK 185 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D D + G D + I L+ IE A++ + Sbjct: 186 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKTL 245 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368 +P +G D+A+ G D V R G V+ +W + +L + + + Sbjct: 246 NFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAALERE 305 Query: 369 PDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRAVDL 406 D I+ D+ GA + + R + Sbjct: 306 AD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 364 Query: 407 EFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ------------NLKSLKSFIV 447 +F N + + +AD + LI + Sbjct: 365 DFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSRCPLLEKLKLELTTPHRDF 424 Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 425 DRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 464 >gi|188492395|ref|ZP_02999665.1| phage terminase large subunit [Escherichia coli 53638] gi|188487594|gb|EDU62697.1| phage terminase large subunit [Escherichia coli 53638] Length = 467 Score = 110 bits (274), Expect = 8e-22, Method: Composition-based stats. Identities = 50/340 (14%), Positives = 103/340 (30%), Gaps = 52/340 (15%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-Q 252 + +EA + + + + + ++ NP ++ + F + + Sbjct: 122 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEGTLIRK 179 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D D + G D + I L+ IE A++ + Sbjct: 180 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKTL 239 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368 +P +G D+A+ G D V R G V+ +W + +L + + + Sbjct: 240 NFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAALERE 299 Query: 369 PDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRAVDL 406 D I+ D+ GA + + R + Sbjct: 300 AD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 358 Query: 407 EFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ------------NLKSLKSFIV 447 +F N + + +AD + LI + Sbjct: 359 DFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSRCPLLEKLKLELTTPHRDF 418 Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 419 DRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 458 >gi|16759908|ref|NP_455525.1| prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. CT18] gi|29142320|ref|NP_805662.1| prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. Ty2] gi|213583175|ref|ZP_03365001.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. E98-0664] gi|213647535|ref|ZP_03377588.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. J185] gi|213855100|ref|ZP_03383340.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. M223] gi|25512685|pir||AF0621 probable prophage terminase large chain STY1047 [imported] - Salmonella enterica subsp. enterica serovar Typhi (strain CT18) gi|16502201|emb|CAD05440.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi] gi|29137950|gb|AAO69511.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. Ty2] Length = 467 Score = 110 bits (274), Expect = 8e-22, Method: Composition-based stats. Identities = 57/340 (16%), Positives = 110/340 (32%), Gaps = 52/340 (15%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252 + +EA + + + + + ++ NP ++ + P +D + Sbjct: 122 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 179 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D + G D + I L+ IE A++ + Sbjct: 180 INYDENPFLSDTMLKVIDAARRRDPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 239 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P +G D+A+ G D V R G VI +W + +L + + + R Sbjct: 240 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKEDELLKSCQRTYQAAME-R 298 Query: 369 PDAIIIDANNTGAR-------TCDYLEMLGYHVYRVLGQ---------------KRAVDL 406 I+ D+ GA D + + RV Q + Sbjct: 299 DADIVYDSIGVGASAGAKFSEINDDRKRENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 358 Query: 407 EFCRNRRTELHVKMADWLE-FASLINHSG--LIQNLKSL----------------KSFIV 447 +F N + + +AD + IN+ L+ L S+ Sbjct: 359 DFFANLKAQAWWLVADRFRNTFNAINNGEQYLVDELISIDSRCPLLEKLKLELTTPHRDF 418 Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 419 DRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 458 >gi|16760783|ref|NP_456400.1| bacteriophage protein [Salmonella enterica subsp. enterica serovar Typhi str. CT18] gi|25512494|pir||AE0735 probable bacteriophage protein STY2040 [imported] - Salmonella enterica subsp. enterica serovar Typhi (strain CT18) gi|16503080|emb|CAD05583.1| putative bacteriophage protein [Salmonella enterica subsp. enterica serovar Typhi] Length = 467 Score = 109 bits (273), Expect = 8e-22, Method: Composition-based stats. Identities = 50/340 (14%), Positives = 103/340 (30%), Gaps = 52/340 (15%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-Q 252 + +EA + + + + + ++ NP ++ + F + + Sbjct: 122 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEGTLIRK 179 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D D + G D + I L+ IE A++ + Sbjct: 180 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKTL 239 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368 +P +G D+A+ G D V R G V+ +W + +L + + + Sbjct: 240 NFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAALERE 299 Query: 369 PDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRAVDL 406 D I+ D+ GA + + R + Sbjct: 300 AD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 358 Query: 407 EFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ------------NLKSLKSFIV 447 +F N + + +AD + LI + Sbjct: 359 DFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSRCPLLEKLKLELTTPHRDF 418 Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 419 DRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 458 >gi|213161040|ref|ZP_03346750.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. E00-7866] Length = 421 Score = 109 bits (273), Expect = 9e-22, Method: Composition-based stats. Identities = 57/340 (16%), Positives = 110/340 (32%), Gaps = 52/340 (15%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252 + +EA + + + + + ++ NP ++ + P +D + Sbjct: 76 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 133 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D + G D + I L+ IE A++ + Sbjct: 134 INYDENPFLSDTMLKVIDAARRRDPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 193 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P +G D+A+ G D V R G VI +W + +L + + + R Sbjct: 194 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKEDELLKSCQRTYQAAME-R 252 Query: 369 PDAIIIDANNTGAR-------TCDYLEMLGYHVYRVLGQ---------------KRAVDL 406 I+ D+ GA D + + RV Q + Sbjct: 253 DADIVYDSIGVGASAGAKFSEINDDRKRENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 312 Query: 407 EFCRNRRTELHVKMADWLE-FASLINHSG--LIQNLKSL----------------KSFIV 447 +F N + + +AD + IN+ L+ L S+ Sbjct: 313 DFFANLKAQAWWLVADRFRNTFNAINNGEQYLVDELISIDSRCPLLEKLKLELTTPHRDF 372 Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 373 DRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 412 >gi|324012808|gb|EGB82027.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 60-1] Length = 441 Score = 109 bits (273), Expect = 1e-21, Method: Composition-based stats. Identities = 50/340 (14%), Positives = 103/340 (30%), Gaps = 52/340 (15%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-Q 252 + +EA + + + + + ++ NP ++ + F + + Sbjct: 96 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEGTLIRK 153 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D D + G D + I L+ IE A++ + Sbjct: 154 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKTL 213 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368 +P +G D+A+ G D V R G V+ +W + +L + + + Sbjct: 214 NFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAALERE 273 Query: 369 PDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRAVDL 406 D I+ D+ GA + + R + Sbjct: 274 AD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 332 Query: 407 EFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ------------NLKSLKSFIV 447 +F N + + +AD + LI + Sbjct: 333 DFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSRCPLLEKLKLELTTPHRDF 392 Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 393 DRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 432 >gi|194434997|ref|ZP_03067239.1| phage terminase, large subunit, pbsx family [Shigella dysenteriae 1012] gi|194416779|gb|EDX32906.1| phage terminase, large subunit, pbsx family [Shigella dysenteriae 1012] gi|323166781|gb|EFZ52535.1| phage terminase, large subunit, PBSX family [Shigella sonnei 53G] Length = 447 Score = 109 bits (273), Expect = 1e-21, Method: Composition-based stats. Identities = 50/340 (14%), Positives = 103/340 (30%), Gaps = 52/340 (15%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-Q 252 + +EA + + + + + ++ NP ++ + F + + Sbjct: 102 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEGTLIRK 159 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D D + G D + I L+ IE A++ + Sbjct: 160 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKTL 219 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368 +P +G D+A+ G D V R G V+ +W + +L + + + Sbjct: 220 NFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAALERE 279 Query: 369 PDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRAVDL 406 D I+ D+ GA + + R + Sbjct: 280 AD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 338 Query: 407 EFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ------------NLKSLKSFIV 447 +F N + + +AD + LI + Sbjct: 339 DFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSRCPLLEKLKLELTTPHRDF 398 Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 399 DRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 438 >gi|213423381|ref|ZP_03356369.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. E01-6750] Length = 414 Score = 109 bits (273), Expect = 1e-21, Method: Composition-based stats. Identities = 57/340 (16%), Positives = 110/340 (32%), Gaps = 52/340 (15%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252 + +EA + + + + + ++ NP ++ + P +D + Sbjct: 69 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 126 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D + G D + I L+ IE A++ + Sbjct: 127 INYDENPFLSDTMLKVIDAARRRDPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 186 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P +G D+A+ G D V R G VI +W + +L + + + R Sbjct: 187 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKEDELLKSCQRTYQAAME-R 245 Query: 369 PDAIIIDANNTGAR-------TCDYLEMLGYHVYRVLGQ---------------KRAVDL 406 I+ D+ GA D + + RV Q + Sbjct: 246 DADIVYDSIGVGASAGAKFSEINDDRKRENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 305 Query: 407 EFCRNRRTELHVKMADWLE-FASLINHSG--LIQNLKSL----------------KSFIV 447 +F N + + +AD + IN+ L+ L S+ Sbjct: 306 DFFANLKAQAWWLVADRFRNTFNAINNGEQYLVDELISIDSRCPLLEKLKLELTTPHRDF 365 Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 366 DRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 405 >gi|260557981|ref|ZP_05830193.1| phage terminase large subunit [Acinetobacter baumannii ATCC 19606] gi|260408491|gb|EEX01797.1| phage terminase large subunit [Acinetobacter baumannii ATCC 19606] Length = 529 Score = 109 bits (272), Expect = 1e-21, Method: Composition-based stats. Identities = 52/239 (21%), Positives = 90/239 (37%), Gaps = 31/239 (12%) Query: 275 LDSDVTRVEVCGQF---PQQDIDSFIPLNIIEEALNREPCPDPYAPLI--------MGCD 323 L + + G F + D IP +E A R + L G D Sbjct: 280 LPEPLRSQMLYGDFGAGIEDDPWQVIPTEWVEAAQARWKPLEDMRILHRGDFKMDSYGLD 339 Query: 324 IAEEGGDNTVVVLRRGPVIEHLFDWSKTDLR---TTNNKISGLVEKYRPDAIIIDANNTG 380 +A GGDNT+ R G ++ D T+ + V + P I +D G Sbjct: 340 VARGGGDNTIGFARYGYWYDNPNVLEGKDSPDGPTSASFAVSHVRDHAP--IHVDVIGVG 397 Query: 381 ARTCDYLEMLGYHVYRVLGQKRAVDLEF-----CRNRRTELHVKMADWLE-----FASLI 430 A T D+L+ G HV V + A + N R++L + + L+ +L Sbjct: 398 ASTYDFLKQSGIHVVPVDVRNAATAFDRSGQLSFYNLRSQLWWQFREALDPAYGSTVALP 457 Query: 431 NHSGLIQNLKSLKSFIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSDMD 486 L+ +L + + + + T ++ +ES+ + +S DY ++ + P R M Sbjct: 458 PEPKLLADLTAPR-WGLQGT-KIKVESREEIIKRIGRSPDYGSAIINAQIDTPKRHIMQ 514 >gi|293396491|ref|ZP_06640767.1| phage terminase large subunit [Serratia odorifera DSM 4582] gi|291420755|gb|EFE94008.1| phage terminase large subunit [Serratia odorifera DSM 4582] Length = 430 Score = 109 bits (272), Expect = 1e-21, Method: Composition-based stats. Identities = 53/343 (15%), Positives = 107/343 (31%), Gaps = 57/343 (16%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252 + N+EA + + + + + +++ NPR + + P D + Sbjct: 80 VLWNEEAHAMTEAQWEVLEPTIRKEGSECWFL--FNPRLTTDFVWRNFVVAPPPDTLVRK 137 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + I A D+++ G D ++ I L+ IE A++ + Sbjct: 138 INYDENPFLSRTIMNVIEAAKARDAEMFEHVYLGMPRTDDDEAIIKLSWIEAAVDAHKAL 197 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368 +P +G D+A+ G D V G V +W + +L + + + + R Sbjct: 198 NIEPAGHRRVGFDVADSGADKCANVYAHGSVALWADEWKAREDELMKSCKRTYNVALE-R 256 Query: 369 PDAIIIDANNTGARTCDYLEMLG-------------YHVYRVLGQKRAVDLE-------- 407 AII D+ GA + + ++ + E Sbjct: 257 EAAIIYDSIGVGASSGSKFAEINEERESASDWNVRTVDYFKFNAGGAVFEPERDYQPGIT 316 Query: 408 ---FCRNRRTELHVKMADWL----------EFASLINHSGLI------------QNLKSL 442 F N + + +AD E LI + S Sbjct: 317 NKDFFANIKAQAWWLVADRFRNTYNVINGKEKRESFADDQLISIDSACPLLDKLKFELST 376 Query: 443 KSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPR 482 G + +E+K + + S + +D + FA Sbjct: 377 PKRDFDKNGRVKVETKDDLKKRDIPSPNVADAFIMAFAPIETP 419 >gi|323175059|gb|EFZ60673.1| phage terminase large subunit [Escherichia coli LT-68] Length = 399 Score = 109 bits (271), Expect = 2e-21, Method: Composition-based stats. Identities = 50/340 (14%), Positives = 103/340 (30%), Gaps = 52/340 (15%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-Q 252 + +EA + + + + + ++ NP ++ + F + + Sbjct: 54 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEGTLIRK 111 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D D + G D + I L+ IE A++ + Sbjct: 112 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKTL 171 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368 +P +G D+A+ G D V R G V+ +W + +L + + + Sbjct: 172 NFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAALERE 231 Query: 369 PDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRAVDL 406 D I+ D+ GA + + R + Sbjct: 232 AD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 290 Query: 407 EFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ------------NLKSLKSFIV 447 +F N + + +AD + LI + Sbjct: 291 DFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSRCPLLEKLKLELTTPHRDF 350 Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 351 DRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 390 >gi|289581321|ref|YP_003479787.1| hypothetical protein Nmag_1649 [Natrialba magadii ATCC 43099] gi|289530874|gb|ADD05225.1| hypothetical protein Nmag_1649 [Natrialba magadii ATCC 43099] Length = 602 Score = 107 bits (268), Expect = 3e-21, Method: Composition-based stats. Identities = 77/512 (15%), Positives = 152/512 (29%), Gaps = 113/512 (22%) Query: 49 SAPRSWQLEFMEVVDAHCLNSVNNPNPEVF----KGAISAGRGIGKTTLNAWLVLWLMST 104 + +W + +E + + + G+GK+ + A + + ++ Sbjct: 22 AGDETWLEDAIEDYLGITVTGAQAQICRGIAANERLLVVTANGLGKSYILAAITIVWLTV 81 Query: 105 RPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI 164 R + +E ++K T V P + S + Sbjct: 82 RYPACSFATSGTERKMKRTYCKPVENLHGDARVPL----------PGEYKSRPERIEIDG 131 Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA--SGTPDVINLGILGFLTERNANR 222 + +H+ S + G H Y +AII +EA + + +T+ Sbjct: 132 EPEHFFEAA---SPQDAGELEGVHAAYTLAII-EEADKKDVDAEVLDAMKSLVTDEQDRI 187 Query: 223 FWIMTSNP---------------RRLSGKF-------YEIF------------------- 241 I +NP + K+ ++ Sbjct: 188 --IAIANPPKDETNSIYPILDEQDDPTSKWEVLEFSSFDSHNVQVELGNVDDEKVDGLAS 245 Query: 242 -NKPLDDWKRF-----------------QIDTRTVEGI--------DPSFHEGIIARYGL 275 +K DDW+ + ++D +P F + R+ Sbjct: 246 LHKIQDDWEDYNKEPWPGAETARTLSAPKLDADGNPVFSHSDALEDNPEFRTDLDQRWYR 305 Query: 276 DSDVTRVEVCGQFP--QQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTV 333 G P + ++ + A R+ P P G D+A +GGD T Sbjct: 306 -------RRAGIIPPGGASKNRPFTIDDVNAAWGRDWQPV-GRPQATGIDVARDGGDRTP 357 Query: 334 VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYH 393 V+ G V+E ++ D + ++ ++E + + IDA G+ D + Sbjct: 358 VISVDGDVLEVRYEEPCHDYTAHADDVTDVLEDDPDNPMPIDAVGEGSGFADIMHQRFPE 417 Query: 394 VYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSL------KSFIV 447 R A D ++ E + WL+ IN L + L + + Sbjct: 418 TIRFKSLGVAEDSANYKDCWAEGVALLGKWLQNGGSINDRTLREELLVAARTLEYEETHI 477 Query: 448 PNTGE-----LAIESK---RVKGAKSTDYSDG 471 + G L + K + + +S DY D Sbjct: 478 GSRGTNGEDVLKLTPKEKVKERLGRSPDYLDA 509 >gi|213426918|ref|ZP_03359668.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. E02-1180] Length = 374 Score = 107 bits (267), Expect = 4e-21, Method: Composition-based stats. Identities = 57/340 (16%), Positives = 110/340 (32%), Gaps = 52/340 (15%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252 + +EA + + + + + ++ NP ++ + P +D + Sbjct: 29 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 86 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D + G D + I L+ IE A++ + Sbjct: 87 INYDENPFLSDTMLKVIDAARRRDPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 146 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P +G D+A+ G D V R G VI +W + +L + + + R Sbjct: 147 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKEDELLKSCQRTYQAAME-R 205 Query: 369 PDAIIIDANNTGAR-------TCDYLEMLGYHVYRVLGQ---------------KRAVDL 406 I+ D+ GA D + + RV Q + Sbjct: 206 DADIVYDSIGVGASAGAKFSEINDDRKRENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 265 Query: 407 EFCRNRRTELHVKMADWLE-FASLINHSG--LIQNLKSL----------------KSFIV 447 +F N + + +AD + IN+ L+ L S+ Sbjct: 266 DFFANLKAQAWWLVADRFRNTFNAINNGEQYLVDELISIDSRCPLLEKLKLELTTPHRDF 325 Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 326 DRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 365 >gi|332091158|gb|EGI96248.1| phage terminase large subunit [Shigella dysenteriae 155-74] Length = 346 Score = 107 bits (267), Expect = 5e-21, Method: Composition-based stats. Identities = 50/340 (14%), Positives = 103/340 (30%), Gaps = 52/340 (15%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-Q 252 + +EA + + + + + ++ NP ++ + F + + Sbjct: 1 MLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEGTLIRK 58 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D D + G D + I L+ IE A++ + Sbjct: 59 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKTL 118 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368 +P +G D+A+ G D V R G V+ +W + +L + + + Sbjct: 119 NFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAALERE 178 Query: 369 PDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRAVDL 406 D I+ D+ GA + + R + Sbjct: 179 AD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 237 Query: 407 EFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ------------NLKSLKSFIV 447 +F N + + +AD + LI + Sbjct: 238 DFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSRCPLLEKLKLELTTPHRDF 297 Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 298 DRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 337 >gi|328952976|ref|YP_004370310.1| hypothetical protein Desac_1270 [Desulfobacca acetoxidans DSM 11109] gi|328453300|gb|AEB09129.1| hypothetical protein Desac_1270 [Desulfobacca acetoxidans DSM 11109] Length = 466 Score = 107 bits (266), Expect = 6e-21, Method: Composition-based stats. Identities = 74/382 (19%), Positives = 126/382 (32%), Gaps = 58/382 (15%) Query: 51 PRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISV 110 P WQ +F+ V+ P + + + GK+T A L L PG + Sbjct: 27 PDPWQQDFL----------VSRPEQALLLCSRQS----GKSTSAAALALHEALFHPGALI 72 Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170 + L+ S Q L+ + + LP+ + + + H S I Sbjct: 73 LLLSPSLRQ-SQELFRKAAGLYQRLPHAP------AACRTSALRLEFDHGSRIISLPGQE 125 Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230 R +SE R ++ DEA+ PD + + L S P Sbjct: 126 ETIRGFSEVR-------------LLVIDEAALVPDELYYAVRPMLAVSRGR--LTALSTP 170 Query: 231 RRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290 G FY + + D W+R+ I I F + L + R E +F Sbjct: 171 AGKRGWFYHCYTEGGDQWQRYTIPATQCPRISADFLAA--EQRSLPAAWFRAEYFCEF-G 227 Query: 291 QDIDSFIPLNIIEEALNREPCP--------DPYAPLIMGCDIAEEGGDNTVVVLRRGPVI 342 + + P ++++ A + P P +G D+ + + + ++ R P + Sbjct: 228 EAANQLFPAHLLQTAQCSQVSPLFAEITPSPPTGTFFIGLDLGQSQDYSALTIIHRSPAL 287 Query: 343 E----HLFDWSKTDLRTTNNKISGLVEKY-------RPDAIIIDANNTGARTCDYLEMLG 391 HL + LRT I V + +I+D GA D L G Sbjct: 288 PDPPCHLRHLQRFPLRTPYPDIVRQVRELLQQPQIGPNPLLIVDKTGVGAPVVDMLTQAG 347 Query: 392 YHVYRVLGQKRAVDLEFCRNRR 413 + Y V + R+ R Sbjct: 348 MNPYAVTIHGGEAVSQNGRDLR 369 >gi|289829424|ref|ZP_06547036.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. E98-3139] Length = 346 Score = 107 bits (266), Expect = 6e-21, Method: Composition-based stats. Identities = 57/340 (16%), Positives = 110/340 (32%), Gaps = 52/340 (15%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252 + +EA + + + + + ++ NP ++ + P +D + Sbjct: 1 MLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 58 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D + G D + I L+ IE A++ + Sbjct: 59 INYDENPFLSDTMLKVIDAARRRDPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 118 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P +G D+A+ G D V R G VI +W + +L + + + R Sbjct: 119 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKEDELLKSCQRTYQAAME-R 177 Query: 369 PDAIIIDANNTGAR-------TCDYLEMLGYHVYRVLGQ---------------KRAVDL 406 I+ D+ GA D + + RV Q + Sbjct: 178 DADIVYDSIGVGASAGAKFSEINDDRKRENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 237 Query: 407 EFCRNRRTELHVKMADWLE-FASLINHSG--LIQNLKSL----------------KSFIV 447 +F N + + +AD + IN+ L+ L S+ Sbjct: 238 DFFANLKAQAWWLVADRFRNTFNAINNGEQYLVDELISIDSRCPLLEKLKLELTTPHRDF 297 Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 298 DRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 337 >gi|211731806|gb|ACJ10127.1| terminase [Bacteriophage APSE-3] Length = 469 Score = 107 bits (266), Expect = 6e-21, Method: Composition-based stats. Identities = 64/349 (18%), Positives = 102/349 (29%), Gaps = 67/349 (19%) Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ--- 252 +EA + ++ + + + W NP G Y+ F KP Q Sbjct: 105 WVEEAETVSEKSLDTLIPTIRKPGSE-LWFSF-NPAEEDGAVYKRFVKPYKAIIDKQGYY 162 Query: 253 ---------IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303 + + + R G+ D+ I +E Sbjct: 163 EDDDLYVGKVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANYEDALIQPEWVE 222 Query: 304 EALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS 361 A++ P ++ D A+ G D + R G +IE WS+ D+ Sbjct: 223 AAIDAHIKLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSEGDVADATMTAF 282 Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLEMLG----YHVYRVLGQKRAVD------------ 405 YR D I D GA T G V G + D Sbjct: 283 DEAFDYRADDFIYDNIGLGAGTVKTHLRHGNDGNKMVVTGFGAGDSPDYPDEIYVPGNGE 342 Query: 406 ------------LEFCRNRRTELHVKMAD-------WLEFASLINHSGLI---------- 436 + RN+R + V +AD +E ++ LI Sbjct: 343 YLPSSNNDDRTHRDTFRNKRAQYWVYLADRFYKTWRAVEKGEYLDPEALISLSSKIAKLS 402 Query: 437 ---QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479 L + P + + SK R+KG KS + +D LM +FA Sbjct: 403 QLKSELIKQQRKRTPGNRLIQLMSKDEMRLKGIKSPNMADTLMMSFANP 451 >gi|161614489|ref|YP_001588454.1| hypothetical protein SPAB_02238 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] gi|161363853|gb|ABX67621.1| hypothetical protein SPAB_02238 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] Length = 441 Score = 106 bits (265), Expect = 7e-21, Method: Composition-based stats. Identities = 56/340 (16%), Positives = 109/340 (32%), Gaps = 52/340 (15%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252 + +EA + + + + + ++ NP ++ + P +D + Sbjct: 96 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 153 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A + G D + I L+ IE A++ + Sbjct: 154 INYDENPFLSDTMLKVIDAARRRYPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 213 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P +G D+A+ G D V R G VI +W + +L + + + R Sbjct: 214 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKEDELLKSCQRTYQAAME-R 272 Query: 369 PDAIIIDANNTGAR-------TCDYLEMLGYHVYRVLGQ---------------KRAVDL 406 I+ D+ GA D + + RV Q + Sbjct: 273 DADIVYDSIGVGASAGAKFSEINDDRKRENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 332 Query: 407 EFCRNRRTELHVKMADWLE-FASLINHSG--LIQNLKSL----------------KSFIV 447 +F N + + +AD + IN+ L+ L S+ Sbjct: 333 DFFANLKAQAWWLVADRFRNTFNAINNGEQYLVDELISIDSRCPLLEKLKLELTTPHRDF 392 Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 393 DRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 432 >gi|148826888|ref|YP_001291641.1| phage terminase large subunit [Haemophilus influenzae PittGG] gi|148718130|gb|ABQ99257.1| predicted phage terminase large subunit [Haemophilus influenzae PittGG] Length = 366 Score = 106 bits (265), Expect = 7e-21, Method: Composition-based stats. Identities = 56/355 (15%), Positives = 114/355 (32%), Gaps = 37/355 (10%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GK+ A ++ P + V+C E+ K +S + + Sbjct: 27 GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQMLAD- 72 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 Q L ++ +G + ++ + + G + +E Sbjct: 73 QIEMLGLRAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGID 262 ++ + E + I++ NP+ + Y+ F P + K ++ + Sbjct: 128 SKESWDILIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185 Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIM 320 E + D ++ R G+ + I IE A++ + + Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGEPVADSDLAIIKPVWIEYAVDAHLKLGFTAKGMKKV 245 Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380 G D+A+EG D+ G V+ + W D+ + N+ + K++ D II D+ G Sbjct: 246 GFDVADEGADSNANAFVHGSVVLDIEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGVG 305 Query: 381 ARTCDYLEMLG--YHVYRVLGQKRAVDLE-----------FCRNRRTELHVKMAD 422 A + + L V E N + + + D Sbjct: 306 AGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRD 360 >gi|212499721|ref|YP_002308529.1| terminase [Bacteriophage APSE-2] gi|238898754|ref|YP_002924436.1| APSE-2 prophage; terminase [Bacteriophage APSE-2] gi|211731690|gb|ACJ10178.1| terminase [Bacteriophage APSE-2] gi|229466514|gb|ACQ68288.1| APSE-2 prophage; terminase [Bacteriophage APSE-2] Length = 469 Score = 106 bits (264), Expect = 9e-21, Method: Composition-based stats. Identities = 64/349 (18%), Positives = 103/349 (29%), Gaps = 67/349 (19%) Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ--- 252 +EA + ++ + + + W NP G Y+ F KP Q Sbjct: 105 WVEEAETVSEKSLDTLIPTIRKPGSE-LWFSF-NPAEEDGAVYKRFVKPYKAIIDKQGYY 162 Query: 253 ---------IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303 + + + R G+ D+ I +E Sbjct: 163 EDDDLYVGKVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANYEDALIQPEWVE 222 Query: 304 EALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS 361 A++ P ++ D A+ G D + R G +IE WS+ D+ Sbjct: 223 AAIDAHIKLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSEGDVADATMTAF 282 Query: 362 GLVEKYRPDAIIIDANNTGARTC-DYLEML---GYHVYRVLGQKRAVD------------ 405 YR D I D GA T +L V G + D Sbjct: 283 DEAFDYRADDFIYDNIGLGAGTVKTHLRHSNDGNKIVVTGFGAGDSPDYPDEIYVPGNGE 342 Query: 406 ------------LEFCRNRRTELHVKMAD-------WLEFASLINHSGLI---------- 436 + RN+R + V +AD +E ++ LI Sbjct: 343 YLPSSNNDDRTHRDTFRNKRAQYWVYLADRFYKTWRAVEKGEYLDPEALISLSSKIAKLS 402 Query: 437 ---QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479 L + P + + SK R+KG KS + +D LM +FA Sbjct: 403 QLKSELIKQQRKRTPGNRLIQLMSKDEMRLKGIKSPNMADTLMMSFANP 451 >gi|294663744|gb|ADF29298.1| terminase [Pseudomonas phage JG024] Length = 460 Score = 106 bits (264), Expect = 9e-21, Method: Composition-based stats. Identities = 63/351 (17%), Positives = 119/351 (33%), Gaps = 62/351 (17%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252 + +EA I + + N+ WI NP ++ Y+ F KP D Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWI-IFNPNEVTDFVYQNFVVKPPKDSCVKM 172 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ-QDIDSFIPLNIIEEALN--RE 309 I+ + + + I Y D + + G P+ S I L I A++ ++ Sbjct: 173 INWNENPFLSETMLKVIHEAYERDREQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231 Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLVEKY 367 +P +G D+A++G D L G VI + +W +L +++++ L K Sbjct: 232 LGWEPAGSKRIGFDVADDGDDANATTLMHGNVIMEVDEWDGLEDELLKSSSRVYNLA-KL 290 Query: 368 RPDAIIIDANNTGARTCDYLEMLG------YHVYRVLGQKRAVD---------------- 405 + ++ D+ GA L +Y AVD Sbjct: 291 KGASVTYDSIGVGAHVGSKFAELNDASPDFKLIYDPFNAGGAVDKPDDVYMKLPHTTIKN 350 Query: 406 LEFCRNRRTELHVKMA-------DWLEFASLINHSGLIQ----------------NLKSL 442 + N + + ++A + +E + LI L S Sbjct: 351 KDHFSNIKAQKWEEVATRFRKTYEAVEHGKVYPFDELISINSETIHPDKLNQLCIELSSP 410 Query: 443 KSFIVPNTGELAIESKR----VKGAKSTDYSDGLMYTFAENP--PRSDMDF 487 + + G +ESK+ + KS + +D ++ + P+ DF Sbjct: 411 RK-DLDMNGRFKVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF 460 >gi|211731737|gb|ACJ10086.1| terminase [Bacteriophage APSE-5] Length = 469 Score = 105 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 64/349 (18%), Positives = 104/349 (29%), Gaps = 67/349 (19%) Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ--- 252 +EA + ++ + + + W NP G Y+ F KP + Q Sbjct: 105 WVEEAETVSEKSLDTLIPTIRKPGSE-LWFSF-NPAEEDGAVYKRFVKPYKELIDTQGYY 162 Query: 253 ---------IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303 + + + R G+ D+ I +E Sbjct: 163 EDDDLYVGKVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANYEDALIQPEWVE 222 Query: 304 EALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS 361 A++ P ++ D A+ G D + R G +IE WS+ D+ Sbjct: 223 AAIDAHIKLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSEGDVADATMTAF 282 Query: 362 GLVEKYRPDAIIIDANNTGARTC-DYLEML---GYHVYRVLGQKRAVD------------ 405 YR D I D GA T +L V G + D Sbjct: 283 DEAFDYRADDFIYDNIGLGAGTVKTHLRHSNDGNKIVVTGFGAGDSPDYPDEIYVPGNGE 342 Query: 406 ------------LEFCRNRRTELHVKMAD-------WLEFASLINHSGLI---------- 436 + RN+R + V +AD +E ++ LI Sbjct: 343 YLPSSNNDDRTHRDTFRNKRAQYWVYLADRFYKTWRAVEKGEYLDPEALISLSSKIAKLS 402 Query: 437 ---QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479 L + P + + SK R+KG KS + +D LM +FA Sbjct: 403 QLKSELIKQQRKRTPGNRLIQLMSKDEMRLKGIKSPNMADTLMMSFANP 451 >gi|284008456|emb|CBA74928.1| phage terminase large subunit [Arsenophonus nasoniae] Length = 477 Score = 105 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 81/470 (17%), Positives = 137/470 (29%), Gaps = 104/470 (22%) Query: 84 AGRGIGKTTLNAWLVLW--------LMSTRPGISVICLANSETQLKTTLWAEVSKWLSLL 135 GRG KT A + L + R ++ I E + L AE+ L L Sbjct: 21 GGRGGMKTVSFAKIALITASINKRRFLCLREFMNSI-----EDSVHAVLQAEIET-LRLQ 74 Query: 136 PNKHWFEMQSLSLHPAPW-YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 + ++ + + Y + I SKH + Sbjct: 75 NRFRILDNCIKGINDSIFKYGQLARNIASIKSKHDFDVA--------------------- 113 Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ-- 252 +EA + ++ + + + W NP G Y+ F KP D + Sbjct: 114 -WVEEAETVSEKSLDILIPTIRKPGSE-LWFSF-NPAEEDGAVYKRFVKPYKDIIDDKGY 170 Query: 253 ----------IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 + + + G+ D+ I + Sbjct: 171 YEDDDLYVGKVSYLDNPWLPEELKNDAEKMKRDNYKKWLHVYGGECDANYDDAIIQPEWV 230 Query: 303 EEALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360 + A++ P ++ D A+ G D + R G ++E WS+ D+ K Sbjct: 231 DAAIDAHIKLGFKPKGIRVITFDPADSGQDEKALSKRYGVLVEDCVSWSEGDVADATIKA 290 Query: 361 SGLVEKYRPDAIIIDANNTGARTC----------DYLEMLGY------------------ 392 YR D I D GA T + + + G+ Sbjct: 291 FDEAFDYRADDFIYDNIGLGAGTVKTYLRSSNDGNKMVVTGFGAGDSPDYPDEIYVPGNG 350 Query: 393 HVYRVLGQKRAVDLEFCRNRRTELHVKMAD-------WLEFASLINHSGLI--------- 436 L + + RN+R + V +AD +E I+ LI Sbjct: 351 EYIPSLNNDDRTNRDTFRNKRAQYWVYLADRFYKTWCAVEKKEYIDPEELISLSSKIDKL 410 Query: 437 ----QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479 L + P + + SK R KG KS + +D LM +FA Sbjct: 411 SQLKSELVKQQRKRTPGNRLIQLISKEEMRSKGIKSPNMADTLMMSFANP 460 >gi|332884414|gb|EGK04674.1| hypothetical protein HMPREF9456_03377 [Dysgonomonas mossii DSM 22836] Length = 450 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 53/302 (17%), Positives = 108/302 (35%), Gaps = 33/302 (10%) Query: 197 NDEASGTPDVINLGILGFLTERNANRFWI--MTS--NPRRLS--GKFYEIFNKPL--DDW 248 DE S + ++ + A I M NP + +FY+ + DD Sbjct: 133 IDENSQITEKCWNIVMSRIRHDVAKNGLIPKMFGACNPTKNFVYNRFYKPHRDGILPDDK 192 Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDIDSFIPLNIIEEALN 307 Q +D + E + ++R + G++ + D D ++ + + Sbjct: 193 AFIQALVTDNPFVDKFYIENLKNL----DPISRARLLDGEW-EYDDDPYVLMQYEKIVDL 247 Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVI--EHLFDWSKTDLRTTNNKISGLVE 365 P M D+A G D+T + + G + + + + D T + Sbjct: 248 FTNSHVSGGPRYMTIDVARLGKDDTTIRIWEGLISIYKKVIPKCRIDDLTVLARKLQTEY 307 Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAV---DLEFCRNRRTELHVKMAD 422 I D + G D L G+ V K ++ +N R++ + K+A+ Sbjct: 308 SVPNSNTIADEDGVGGGLVDNLRCKGF----VNNSKPLPIYGEVRNYQNLRSQCYFKLAE 363 Query: 423 -------WLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGL 472 +L+ +++ +++ L+ +K +L + +K + KSTD +D L Sbjct: 364 IVNSNLMYLKNEPIVDRERVVKELEQIKQIDADKDTKLKVITKEMLKSILGKSTDEADNL 423 Query: 473 MY 474 M Sbjct: 424 MM 425 >gi|211731828|gb|ACJ10140.1| terminase [Bacteriophage APSE-6] Length = 469 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 82/470 (17%), Positives = 132/470 (28%), Gaps = 104/470 (22%) Query: 84 AGRGIGKTTLNAWLVLW--------LMSTRPGISVICLANSETQLKTTLWAEVSKWLSLL 135 GRG KT A + L + R ++ I E + L AEV L L Sbjct: 12 GGRGGMKTVSFAKIALITASMHKRRFLCLREFMNSI-----EDSVHAVLQAEVET-LGLQ 65 Query: 136 PNKHWFEMQSLSLHPAPW-YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 ++ + + Y + I SKH + Sbjct: 66 VRFRVLNSCIEGINDSIFKYGQLARNIASIKSKHDFDVA--------------------- 104 Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ-- 252 +EA + ++ + + + + + NP G Y+ F KP Q Sbjct: 105 -WVEEAETVSEKSLDTLIPTIRKPGSELRF--SFNPAEEDGAVYKRFVKPYKAIIDKQGY 161 Query: 253 ----------IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 + + + R G+ D+ I + Sbjct: 162 YEDDDLYVGNVSYLDNPWLPVELKNDAQKMKRENYKKWRHVYGGECDANYEDALIQPEWV 221 Query: 303 EEALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360 A++ P ++ D A G D + R G +IE W + D+ Sbjct: 222 GAAIDAHIKLGFKPSGIRVVTFDPAGSGQDEKALSKRYGVLIEDCVSWLEGDVADATMTA 281 Query: 361 SGLVEKYRPDAIIIDANNTGARTC-DYLEMLG---YHVYRVLGQKRAVDLEF-------- 408 YR D I D GA T +L V G + D Sbjct: 282 FDEAFDYRADDFIYDNIGLGAGTVKTHLRHSNDGSKMVVTGFGAGDSPDYPHEIYVPGNG 341 Query: 409 ----------------CRNRRTELHVKMAD-------WLEFASLINHSGLI--------- 436 RN+R + V +AD +E ++ LI Sbjct: 342 EYLPSSNNDDRTHRDTFRNKRAQYWVYLADRFYKTWRAVEKGEYLDPDALISLSSKIAKL 401 Query: 437 ----QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479 L + P + + SK R+KG KS + +D LM +FA Sbjct: 402 SQLKSELIKQQRKRTPGNRLIQLMSKDEMRLKGIKSPNMADTLMMSFANP 451 >gi|238027628|ref|YP_002911859.1| Bbp25 [Burkholderia glumae BGR1] gi|237876822|gb|ACR29155.1| Bbp25 [Burkholderia glumae BGR1] Length = 486 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 43/225 (19%), Positives = 80/225 (35%), Gaps = 30/225 (13%) Query: 275 LDSDVTRVEVCGQFP---QQDIDSFIPLNIIEEALNREPCPDPYAPLI----MGCDIAEE 327 L + + G F + D IP + A R P I +G D+A Sbjct: 255 LPEPLRSKMLYGDFAAGREDDPWQVIPSEWVRLAQERWRARSR--PRIPMTALGVDVARG 312 Query: 328 GGDNTVVVLRRGPVIEHLFDWSKT---DLRTTNNKISGLVEKYRPDAIIIDANNTGARTC 384 G D ++ R G + D ++ L + + +D GA Sbjct: 313 GQDQSIYTPRYGNWFDEQVCQPGLATPDGFVVAQQVFNL--REPSTLVNLDVVGVGASPF 370 Query: 385 DYL-EMLGYHVYRVLGQKRAVDLEF-----CRNRRTELHVKMADWL-----EFASLINHS 433 D + +++G ++ + G R +L+ N R L +M + L E ++ Sbjct: 371 DIIHQVIGDKIWGISGAARTDELDMSGQFGFVNLRALLWWRMREALDPINGEDLAIPPDP 430 Query: 434 GLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYT 475 L +L + + + G + +ESK + + +S D D +Y Sbjct: 431 ALAADLCAPR-YRKAPRG-ILVESKEEIKKRIGRSPDRGDSAVYA 473 >gi|85059798|ref|YP_455500.1| phage terminase large subunit [Sodalis glossinidius str. 'morsitans'] gi|84780318|dbj|BAE75095.1| phage terminase large subunit [Sodalis glossinidius str. 'morsitans'] Length = 483 Score = 104 bits (260), Expect = 3e-20, Method: Composition-based stats. Identities = 48/309 (15%), Positives = 96/309 (31%), Gaps = 44/309 (14%) Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQID 254 +EA ++ + + + W+ NP+ + Y+ F PLDD + Sbjct: 116 WVEEAEAVTKESWDILIPTIRKPGSE-IWVSF-NPKNILDDTYQRFVVNPLDDICLLTVH 173 Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCP 312 + D D+ G+ + I I A++ Sbjct: 174 YTDNPHFPEVLRLEMEECKCKDYDLYLHIWEGEPVADSDLAIIKPLWIAAAVDAHMTLGF 233 Query: 313 DPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI 372 D +G D+A+EG D + +G V+ L +W + D+ ++N+++ + I Sbjct: 234 DAVGEKRLGFDVADEGEDCNALCFVQGSVVLDLDEWHRGDVIASSNRVNRYAIERGITCI 293 Query: 373 IIDANNTGARTCDYLEMLGYHVYRVLGQKRA------------VDLEFCRNRRTELHVKM 420 I D+ GA +L+ + + A + + N + + + Sbjct: 294 IYDSIGVGAGVKAHLKRIAAINVKGFNAGEAVKDPDALYMPGKTNKDMFANIKAQAWWAV 353 Query: 421 ADWL--------------EFASLINHSGLI-------------QNLKSLKSFIVPNTGEL 453 + + A L LI + S N G + Sbjct: 354 RERFYKTWRCIEAKKQDPKAALLYPTDELISLSTTNIKRLEYLKAELSRPRVDYDNNGHV 413 Query: 454 AIESKRVKG 462 +ESK+ Sbjct: 414 KVESKKDMK 422 >gi|218148543|ref|YP_002364311.1| terminase, large subunit [Pseudomonas phage 14-1] gi|218059739|emb|CAU13815.1| terminase, large subunit [Pseudomonas phage 14-1] Length = 460 Score = 104 bits (260), Expect = 3e-20, Method: Composition-based stats. Identities = 63/351 (17%), Positives = 119/351 (33%), Gaps = 62/351 (17%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252 + +EA I + + N+ WI NP ++ Y+ F KP D Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWI-IFNPNEVTDFVYQNFVVKPPKDSCVKM 172 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ-QDIDSFIPLNIIEEALN--RE 309 I+ + + + I Y D + + G P+ S I L I A++ ++ Sbjct: 173 INWNENPFLSETMLKVIHEAYERDREQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231 Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLVEKY 367 +P +G D+A++G D L G VI + +W +L +++++ L K Sbjct: 232 LGWEPAGSKRIGFDVADDGEDANATTLMHGNVIMEVDEWDGLEDELLKSSSRVYNLA-KM 290 Query: 368 RPDAIIIDANNTGARTCDYLEMLG------YHVYRVLGQKRAVD---------------- 405 + ++ D+ GA L +Y AVD Sbjct: 291 KGASVTYDSIGVGAHVGSKFAELNDASPDFKLIYDPFNAGGAVDKPDDIYMKLPHTTIKN 350 Query: 406 LEFCRNRRTELHVKMA-------DWLEFASLINHSGLIQ----------------NLKSL 442 + N + + ++A + +E + LI L S Sbjct: 351 KDHFSNIKAQKWEEVATRFRKTYEAVEHGKVYPFDELISINSETIHPDKLNQLCIELSSP 410 Query: 443 KSFIVPNTGELAIESKR----VKGAKSTDYSDGLMYTFAENP--PRSDMDF 487 + + G +ESK+ + KS + +D ++ + P+ DF Sbjct: 411 RK-DLDMNGRFKVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF 460 >gi|218457805|ref|YP_002418810.1| terminase, large subunit [Pseudomonas phage SN] gi|218379073|emb|CAT99652.1| terminase, large subunit [Pseudomonas phage SN] Length = 460 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 63/351 (17%), Positives = 119/351 (33%), Gaps = 62/351 (17%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252 + +EA I + + N+ WI NP ++ Y+ F KP D Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWI-IFNPNEVTDFVYQNFVVKPPKDSCVKM 172 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ-QDIDSFIPLNIIEEALN--RE 309 I+ + + + I Y D + + G P+ S I L I A++ ++ Sbjct: 173 INWNENPFLSETMLKVIHEAYERDREQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231 Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLVEKY 367 +P +G D+A++G D L G VI + +W +L +++++ L K Sbjct: 232 LGWEPAGSKRIGFDVADDGEDANATTLMHGNVIMEVDEWDGLEDELLKSSSRVYNLA-KV 290 Query: 368 RPDAIIIDANNTGARTCDYLEMLG------YHVYRVLGQKRAVD---------------- 405 + ++ D+ GA L +Y AVD Sbjct: 291 KGASVTYDSIGVGAHVGSKFAELNDASPDFKLIYDPFNAGGAVDKPDDVYMKLPHTTIKN 350 Query: 406 LEFCRNRRTELHVKMA-------DWLEFASLINHSGLIQ----------------NLKSL 442 + N + + ++A + +E + LI L S Sbjct: 351 KDHFSNIKAQKWEEVATRFRKTYEAVEHGKVYPFDELISINSETIHPDKLNQLCIELSSP 410 Query: 443 KSFIVPNTGELAIESKR----VKGAKSTDYSDGLMYTFAENP--PRSDMDF 487 + + G +ESK+ + KS + +D ++ + P+ DF Sbjct: 411 RK-DLDMNGRFKVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF 460 >gi|9633565|ref|NP_050979.1| P18 [Acyrthosiphon pisum bacteriophage APSE-1] gi|6118013|gb|AAF03961.1|AF157835_18 P18 [Endosymbiont phage APSE-1] Length = 469 Score = 104 bits (259), Expect = 3e-20, Method: Composition-based stats. Identities = 64/349 (18%), Positives = 103/349 (29%), Gaps = 67/349 (19%) Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ--- 252 +EA + ++ + + + W NP G Y+ F KP + Q Sbjct: 105 WVEEAETVSEKSLDSLIPTIRKPGSE-LWFSF-NPAEEDGAVYKRFVKPYKELIDTQGYY 162 Query: 253 ---------IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303 + + + R G+ D+ I +E Sbjct: 163 EDDDLYVGKVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANYEDALIQPEWVE 222 Query: 304 EALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS 361 A++ P ++ D A+ G D + R G +IE WS+ D+ Sbjct: 223 AAIDAHIKLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSEGDVADATMTAF 282 Query: 362 GLVEKYRPDAIIIDANNTGARTC-DYLEMLGYHVYRVLGQKRAVDLEFC----------- 409 YR D I D GA T +L V+ A D Sbjct: 283 DDAFDYRADDFIYDNIGLGAGTVKTHLRHSNDGNKMVVTGFGAGDSPDYPDEIYVPGNGE 342 Query: 410 ----------------RNRRTELHVKMAD-------WLEFASLINHSGLI---------- 436 RN+R + V +AD +E ++ LI Sbjct: 343 YLPSSNNDDRTHRDTFRNKRAQYWVYLADRFYKTWRAVEKGEYLDPDALISLSSKIAKLS 402 Query: 437 ---QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479 L + P + + SK R+KG KS + +D LM +FA Sbjct: 403 QLKSELIKQQRKRTPGNRLIQLMSKDEMRLKGIKSPNMADTLMMSFANP 451 >gi|197261331|ref|YP_002154147.1| putative terminase, large subunit [Pseudomonas phage LBL3] gi|197244421|emb|CAR31156.1| putative terminase, large subunit [Pseudomonas phage LBL3] Length = 460 Score = 104 bits (259), Expect = 4e-20, Method: Composition-based stats. Identities = 63/351 (17%), Positives = 119/351 (33%), Gaps = 62/351 (17%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252 + +EA I + + N+ WI NP ++ Y+ F KP D Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWI-IFNPNEVTDFVYQNFVVKPPKDSCVKM 172 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ-QDIDSFIPLNIIEEALN--RE 309 I+ + + + I Y D + + G P+ S I L I A++ ++ Sbjct: 173 INWNENPFLSETMLKVIHEAYERDREQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231 Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLVEKY 367 +P +G D+A++G D L G VI + +W +L +++++ L K Sbjct: 232 LGWEPAGSKRIGFDVADDGEDANATTLMHGNVIMEVDEWDGLEDELLKSSSRVYNLA-KM 290 Query: 368 RPDAIIIDANNTGARTCDYLEMLG------YHVYRVLGQKRAVD---------------- 405 + ++ D+ GA L +Y AVD Sbjct: 291 KGASVTYDSIGVGAHVGSKFAELNDASPDFKLIYDPFNAGGAVDKPDDIYMKLPHTTIKN 350 Query: 406 LEFCRNRRTELHVKMA-------DWLEFASLINHSGLIQ----------------NLKSL 442 + N + + ++A + +E + LI L S Sbjct: 351 KDHFSNIKAQKWEEVATRFRKTYEAVEHGKVYPFDELISINSETIHPDKLNQLCIELSSP 410 Query: 443 KSFIVPNTGELAIESKR----VKGAKSTDYSDGLMYTFAENP--PRSDMDF 487 + + G +ESK+ + KS + +D ++ + P+ DF Sbjct: 411 RK-DLDMNGRFKVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF 460 >gi|149408318|ref|YP_001294421.1| conserved hypothetical protein ORF004 [Pseudomonas phage F8] gi|219523873|ref|YP_002455934.1| terminase large subunit [Pseudomonas phage PB1] gi|190333469|gb|ACE73724.1| terminase large subunit [Pseudomonas phage PB1] Length = 460 Score = 104 bits (259), Expect = 4e-20, Method: Composition-based stats. Identities = 62/351 (17%), Positives = 117/351 (33%), Gaps = 62/351 (17%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252 + +EA I + + N+ WI NP ++ Y+ F KP D Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWI-IFNPNEVTDFVYQNFVVKPPKDAFVKM 172 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ-QDIDSFIPLNIIEEALN--RE 309 I+ + + + I Y D D + G P+ S I L I A++ ++ Sbjct: 173 INWNENPFLSETMLKVIHEAYERDKDQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231 Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLVEKY 367 +P +G D+A++G D L G VI + +W +L +++++ L K Sbjct: 232 LGWEPAGSKRIGFDVADDGEDANATTLMHGNVIMEVDEWDGLEDELLKSSSRVYNLA-KM 290 Query: 368 RPDAIIIDANNTGARTCDYLEMLG---------YHVYRVLGQKRAVD------------- 405 + ++ D+ GA L Y + G D Sbjct: 291 KGASVTYDSIGVGAHVGSKFAELNDSSPDFKLTYDPFNAGGAVDKPDDIYMKLPHTTIKN 350 Query: 406 LEFCRNRRTELHVKMA-------DWLEFASLINHSGLIQ----------------NLKSL 442 + N + + ++A + + + LI L S Sbjct: 351 KDHFSNIKAQKWEEVATRFRKTYEAVVHGKVYPFDELISINSETIHPDKLNQLCIELSSP 410 Query: 443 KSFIVPNTGELAIESKR----VKGAKSTDYSDGLMYTFAENP--PRSDMDF 487 + + G +ESK+ + KS + +D ++ + P+ DF Sbjct: 411 RK-DLDMNGRFKVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF 460 >gi|157265379|ref|YP_001467938.1| terminase large subunit [Thermus phage P23-45] gi|156905274|gb|ABU96918.1| terminase large subunit [Thermus phage P23-45] Length = 485 Score = 104 bits (258), Expect = 4e-20, Method: Composition-based stats. Identities = 69/395 (17%), Positives = 134/395 (33%), Gaps = 47/395 (11%) Query: 85 GRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144 GR GK+ + ++ + RPG +A + Q + V K L E+Q Sbjct: 38 GRQSGKSEAASVEAVFELFARPGSQGWIIAPTYDQAEIIFGRVVEKVERLAEVFPATEVQ 97 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMC-RTYSEERPDTFVGHHNTYGMAIINDEASGT 203 +K +T R S +RPD G +I DEA+ Sbjct: 98 LQRRRLRLLVHHYDRPVNAPGAKRVATSEFRGKSADRPDNLRGATLD---FVILDEAAMI 154 Query: 204 PDVIN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIF-----------------NKPL 245 P + I L+ R+ + ++ S P+ L+ FYE F N+ Sbjct: 155 PFSVWSEAIEPTLSVRDG--WALIISTPKGLN-WFYEFFLMGWRGGLKEGIPNSGVNQTH 211 Query: 246 DDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII--- 302 D++ F + V ++ + R + R E +F F L+++ Sbjct: 212 PDFESFHAASWDVWPERREWY--MERRLYIPDLEFRQEYGAEFVSHSNSVFSGLDMLILL 269 Query: 303 -EEALNREPCPDPYAP---LIMGCDIAEEGGDN--TVVVLRRGPVIEHLFDWSKTDLRTT 356 E + Y P +G D + + +V+ L G ++ L + Sbjct: 270 PYERRGTRLVVEDYRPDHIYCIGADFGKNQDYSVFSVLDLDTGAIV-CLERMNGATWSDQ 328 Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTEL 416 ++ L E Y ++ D G + L+ G + + + +V + N Sbjct: 329 VARLKALSEDYGHAYVVADTWGVGDAIAEELDAQGINYTPLPVKSSSVKEQLISN----- 383 Query: 417 HVKMADWLEFA--SLINHSGLIQNLKSLKSFIVPN 449 +A +E ++ N ++ L++ + + + Sbjct: 384 ---LALLMEKGQVAVPNDKTILDELRNFRYYRTAS 415 >gi|211731785|gb|ACJ10115.1| terminase [Bacteriophage APSE-7] Length = 469 Score = 104 bits (258), Expect = 5e-20, Method: Composition-based stats. Identities = 82/470 (17%), Positives = 133/470 (28%), Gaps = 104/470 (22%) Query: 84 AGRGIGKTTLNAWLVLW--------LMSTRPGISVICLANSETQLKTTLWAEVSKWLSLL 135 GRG KT A + L + R ++ I E + L AEV L L Sbjct: 12 GGRGGMKTVSFAKIALITAAMHKRRFLCLREFMNSI-----EDSVHAVLQAEVET-LGLH 65 Query: 136 PNKHWFEMQSLSLHPAPW-YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 ++ + + Y + I SKH + Sbjct: 66 ARFRVLNSCIEGINASIFKYGQLARNIASIKSKHDFDVA--------------------- 104 Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ-- 252 +EA + ++ + + + W NP G Y+ F KP + Sbjct: 105 -WVEEAETVSEKSLDTLISTIRKPGSE-LWFSF-NPSEEDGAVYQRFVKPYKAIIDKKGY 161 Query: 253 ----------IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 + + + R G+ D+ I + Sbjct: 162 YEDDDLYVGNVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANYDDALIQPEWV 221 Query: 303 EEALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360 + A++ P ++ D A+ G D + R G +IE WS+ D+ Sbjct: 222 DAAIDAHIKLGFPPRGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSEGDVADATITA 281 Query: 361 SGLVEKYRPDAIIIDANNTGARTC-DYLEMLGYHVYRVLGQKRAVDLEFC---------- 409 YR D I D GA T +L V+ A D Sbjct: 282 FDEAFDYRADDFIYDNIGLGAGTVKTHLRHSNDGNKMVVTGFGAGDSPDYPDEVYVPSNA 341 Query: 410 -----------------RNRRTELHVKMAD-------WLEFASLINHSGLI--------- 436 RN+ + V +AD +E ++ LI Sbjct: 342 EYLPSSNNDDRTHRDTFRNKHAQYWVYLADRFYKTWRAVEKGEYLDPDELISLSSKIEKL 401 Query: 437 ----QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479 L +P + + SK R+KG KS + +D LM +FA Sbjct: 402 SQLKSELVKQPRKRMPGNRLIQLMSKDEMRLKGIKSPNMADTLMMSFANP 451 >gi|197261421|ref|YP_002154236.1| putative terminase, large subunit [Pseudomonas phage LMA2] gi|197244511|emb|CAR31245.1| putative terminase, large subunit [Pseudomonas phage LMA2] Length = 460 Score = 104 bits (258), Expect = 5e-20, Method: Composition-based stats. Identities = 62/351 (17%), Positives = 119/351 (33%), Gaps = 62/351 (17%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252 + +EA I + + N+ WI NP ++ Y+ F KP D Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWI-IFNPNEVTDFVYQNFVVKPPKDSCVKM 172 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ-QDIDSFIPLNIIEEALN--RE 309 I+ + + + I Y D + + G P+ S I L I A++ ++ Sbjct: 173 INWNENPFLSETMLKVIHEAYERDREQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231 Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLVEKY 367 +P +G D+A++G D L G +I + +W +L +++++ L K Sbjct: 232 LGWEPAGSKRIGFDVADDGEDANATTLMHGNIIMEVDEWDGLEDELLKSSSRVYNLA-KM 290 Query: 368 RPDAIIIDANNTGARTCDYLEMLG------YHVYRVLGQKRAVD---------------- 405 + ++ D+ GA L +Y AVD Sbjct: 291 KGTSVTYDSIGVGAHVGSKFAELNDASPDFKLIYDPFNAGGAVDKPDDVYMKLPHTTIKN 350 Query: 406 LEFCRNRRTELHVKMA-------DWLEFASLINHSGLIQ----------------NLKSL 442 + N + + ++A + +E + LI L S Sbjct: 351 KDHFSNIKAQKWEEVATRFRKTYEAVEHGKVYPFDELISINSETIHPDKLNQLCIELSSP 410 Query: 443 KSFIVPNTGELAIESKR----VKGAKSTDYSDGLMYTFAENP--PRSDMDF 487 + + G +ESK+ + KS + +D ++ + P+ DF Sbjct: 411 RK-DLDMNGRFKVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF 460 >gi|157265496|ref|YP_001468054.1| phage terminase large subunit [Thermus phage P74-26] gi|156905391|gb|ABU97034.1| phage terminase large subunit [Thermus phage P74-26] Length = 485 Score = 104 bits (258), Expect = 5e-20, Method: Composition-based stats. Identities = 69/395 (17%), Positives = 134/395 (33%), Gaps = 47/395 (11%) Query: 85 GRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144 GR GK+ + ++ + RPG +A + Q + V K L E+Q Sbjct: 38 GRQSGKSEAASVEAVFELFARPGSQGWIIAPTYDQAEIIFGRVVEKVERLAEVFPATEVQ 97 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMC-RTYSEERPDTFVGHHNTYGMAIINDEASGT 203 +K +T R S +RPD G +I DEA+ Sbjct: 98 LQRRRLRLLVHHYDRPVNAPGAKRVATSEFRGKSADRPDNLRGATLD---FVILDEAAMI 154 Query: 204 PDVIN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIF-----------------NKPL 245 P + I L+ R+ + ++ S P+ L+ FYE F N+ Sbjct: 155 PFSVWSEAIEPTLSVRDG--WALIISTPKGLN-WFYEFFLMGWRGGLKEGIPNSGINQTH 211 Query: 246 DDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII--- 302 D++ F + V ++ + R + R E +F F L+++ Sbjct: 212 PDFESFHAASWDVWPERREWY--MERRLYIPDLEFRQEYGAEFVSHSNSVFSGLDMLILL 269 Query: 303 -EEALNREPCPDPYAP---LIMGCDIAEEGGDN--TVVVLRRGPVIEHLFDWSKTDLRTT 356 E + Y P +G D + + +V+ L G ++ L + Sbjct: 270 PYERRGTRLVVEDYRPDHIYCIGADFGKNQDYSVFSVLDLDTGAIV-CLERMNGATWSDQ 328 Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTEL 416 ++ L E Y ++ D G + L+ G + + + +V + N Sbjct: 329 VARLKALSEDYGHAYVVADTWGVGDAIAEELDAQGINYTPLPVKSSSVKEQLISN----- 383 Query: 417 HVKMADWLEFA--SLINHSGLIQNLKSLKSFIVPN 449 +A +E ++ N ++ L++ + + + Sbjct: 384 ---LALLMEKGQVAVPNDKTILDELRNFRYYRTAS 415 >gi|159904490|ref|YP_001548152.1| hypothetical protein MmarC6_0096 [Methanococcus maripaludis C6] gi|159885983|gb|ABX00920.1| protein of unknown function DUF264 [Methanococcus maripaludis C6] Length = 505 Score = 102 bits (255), Expect = 1e-19, Method: Composition-based stats. Identities = 76/437 (17%), Positives = 138/437 (31%), Gaps = 72/437 (16%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114 Q E E +D+ + I+ GR GKT + + S G SV+ +A Sbjct: 65 QEEIAEAIDSEMYDV----------ITINIGRRGGKTEVMGGVGPKFCSKYRGFSVLVVA 114 Query: 115 NSETQLKTTLWAEVSKWL-SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173 Q KT ++ ++ + L S ++ + + +P+ + I+ K Sbjct: 115 PVYNQAKT-MYKKIKRGLESNKESRQLVKPKKEGFKESPFPLITFYNGSTIEFK------ 167 Query: 174 RTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG-ILGFLTERNANRFWIMTSNPRR 232 S E PD + II DEA+ D I + L + + S P Sbjct: 168 ---SAETPDNLR---SEGYDLIIVDEAAFVDDEIISAVLEPMLMDSGG--ILVKISTPWG 219 Query: 233 LSGKFYEIFNK----------------PLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLD 276 FY+ + K +K F+ + + F G G D Sbjct: 220 TGNHFYDSYIKGELQAKMLEEGEGIPEDELRYKSFKFPSWVNPYLSKRFLMGKKKDLGED 279 Query: 277 SDVTRVEVCGQFPQQD-------------IDSFIPLNIIEEALNREPCPDPYAPLIMGCD 323 + V E C +F + D D+F E + + ++G D Sbjct: 280 NPVWLQEYCAEFIEDDTTVFSTAHVQACLSDAFETHYKTENLIYLIDEGERNKEYVIGLD 339 Query: 324 IAEEGGDNTVVVLRRGPV----IEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 +A+ +VL + + ++ D K L E + +D Sbjct: 340 LAKHNDYTVFIVLDITTGPPYTLVYFERFNGIDYTDIAEKHLALSEAFNDAPACVDQTGI 399 Query: 380 GARTCDYLEMLGY-HVYRVLGQKRAVDLEFCRNRRTELHVKMADWL--EFASLINHSGLI 436 G D + +G ++ + TE+ K++ + + L+ Sbjct: 400 GEAYMDIAKKVGLDNLTGFKFTNESK---------TEIITKLSTSFRNKEVVMPKIRVLL 450 Query: 437 QNLKSLKSFIVPNTGEL 453 LK+ F T +L Sbjct: 451 TELKAFMRFRTKTTFKL 467 >gi|150021340|ref|YP_001306694.1| hypothetical protein Tmel_1462 [Thermosipho melanesiensis BI429] gi|149793861|gb|ABR31309.1| protein of unknown function DUF264 [Thermosipho melanesiensis BI429] Length = 421 Score = 101 bits (251), Expect = 3e-19, Method: Composition-based stats. Identities = 57/316 (18%), Positives = 105/316 (33%), Gaps = 34/316 (10%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141 I AGR GKT A + + + P VI S Q + Sbjct: 39 ICAGRRFGKTNYVAGKIFYYATIHPKSRVIVGGPSLDQ---------------AKIYYDL 83 Query: 142 EMQSLSLHPAPWYSDVLHCS--LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 +++ L P + S I K+ S++ + G ++ E Sbjct: 84 LTEAIELSPLKGFVKKTKDSPFPTIYLKNGSSITVRSTAHNGKYLRGRKVN---LVVLTE 140 Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR---FQIDTR 256 A+ D + ++ + + + I+ S P ++ FYE + + L + K F Sbjct: 141 AAFIKDSVYEQVITPM-KLDTGAPVILESTPNGMN-YFYEEYQRGLKNKKHTISFHATVY 198 Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDP 314 +D E A+ V R E +F D F P I+ EA + Sbjct: 199 DNPFLDQEEIENAKAK--TPDYVWRQEYLAEFVD-DDTVFFPWKILVEAFEDYKPEGYKD 255 Query: 315 YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTD---LRTTNNKISGLVEKYRPDA 371 +G D+A+ ++VL + ++ + + ++ L KYR Sbjct: 256 GRKYSIGVDLAKYRDYTVIIVLDVTEEPFKIAEFHRFNQIPYEEVIRIVNDLQAKYRA-Q 314 Query: 372 IIIDANNTGARTCDYL 387 + +DA G + + Sbjct: 315 VYLDATGVGDPISERI 330 >gi|118590957|ref|ZP_01548357.1| hypothetical protein SIAM614_19891 [Stappia aggregata IAM 12614] gi|118436479|gb|EAV43120.1| hypothetical protein SIAM614_19891 [Stappia aggregata IAM 12614] Length = 526 Score = 101 bits (251), Expect = 3e-19, Method: Composition-based stats. Identities = 43/203 (21%), Positives = 80/203 (39%), Gaps = 18/203 (8%) Query: 290 QQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347 Q IP + ++ A R + ++ D+A+ G D TV+ G E Sbjct: 294 QDHEWQVIPSDWVDLAFERYDQGIDRDEPMTVLAVDVAQGGKDRTVLQPLHGRRFETNIV 353 Query: 348 WSKTDLRTTNNKISGLVEKYRPDA-IIID-ANNTGARTCDYLEMLGYH-----VYRVLGQ 400 TD + + S ++ + R +A I++D G T +L+ V+ Sbjct: 354 RKGTDTKDGADVGSLIIRERRDNAMIVVDCTGGWGGDTVGFLKRENNIPAEKCVFSAQSG 413 Query: 401 KRAVDLE-FCRNRRTELHVKMADWLE----FASLINHSGLIQNLKSLKSFIVPNTGELAI 455 +RA D N R EL+ ++ + L I S ++ + + + N G++ I Sbjct: 414 ERAKDSRIPFYNLRAELYWRLREALHPKSGLGLAIRRSATVKAQLTAHRWKMRN-GKILI 472 Query: 456 ESK---RVKGAKSTDYSDGLMYT 475 ESK + + S D +D ++ Sbjct: 473 ESKEEIKDRLGSSPDEADAIVEA 495 >gi|126011061|ref|YP_001039811.1| TerL-like protein [Burkholderia ambifaria phage BcepF1] gi|119712637|gb|ABL96858.1| TerL-like protein [Burkholderia ambifaria phage BcepF1] Length = 459 Score = 101 bits (251), Expect = 3e-19, Method: Composition-based stats. Identities = 55/339 (16%), Positives = 107/339 (31%), Gaps = 57/339 (16%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252 + +EA + I + + + I NP + + Y+ F P D Q Sbjct: 115 ILWLEEAQYLTEEQWNVINPTIRREGSQIWLIW--NPDQYTDFIYQNFVVNPPADCLSKQ 172 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ-QDIDSFIPLNIIEEALN--RE 309 I+ + + + I Y D + V G P+ + I L + A++ ++ Sbjct: 173 INWTENPFLSDTMLKVIYDEYQRDPKLAE-HVYGGAPKMGGDKAIIQLQYVLAAIDAHKK 231 Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNN--KISGLVEKY 367 G DIA++G D +V G V+ +W + + K+ + Sbjct: 232 LGWKIEGSKRTGFDIADDGDDANAIVDAIGNVVVWAEEWDGLEDELLKSSTKVFNHALE- 290 Query: 368 RPDAIIIDANNTGARTCDYLEMLGY-----HVYRVLGQKRA----------------VDL 406 + +II D+ GA L +Y A + Sbjct: 291 KGSSIIFDSIGVGAHAGSKFSELNEARSLEIIYEPFNAGGAVYDPDGTYMKLPHVVITNR 350 Query: 407 EFCRNRRTELHVKMADWLE-------FASLINHSGLI---------------QNLKSLKS 444 E N + ++ ++A + + H LI + + Sbjct: 351 EHFSNVKAQMWDRVATRFRKTYEVVTYGANHPHDELISISSEHVPAKILDKLKIELASPH 410 Query: 445 FIVPNTGELAIESKR----VKGAKSTDYSDGLMYTFAEN 479 V G+ +ESK+ +G KS + +D + + Sbjct: 411 KDVDGMGKFKVESKKDMREKRGIKSPNIADAFIMAMIQP 449 >gi|211731761|gb|ACJ10100.1| terminase [Bacteriophage APSE-4] Length = 469 Score = 100 bits (250), Expect = 4e-19, Method: Composition-based stats. Identities = 63/349 (18%), Positives = 101/349 (28%), Gaps = 67/349 (19%) Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ--- 252 +EA + ++ + + + W NP G Y F KP Q Sbjct: 105 WVEEAETVSEKSLDTLIPTIRKPGSE-LWFSF-NPAEEDGAVYRRFVKPYKAIIDKQGYY 162 Query: 253 ---------IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303 + + + R G+ D+ I ++ Sbjct: 163 EDDEVYVGKVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANYGDALIQPEWVD 222 Query: 304 EALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS 361 A++ P ++ D A+ G D + R G +IE WS+ D+ Sbjct: 223 AAIDAHIKLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSEGDVADATITAF 282 Query: 362 GLVEKYRPDAIIIDANNTGARTC-DYLEMLGYHVYRVLGQKRAVDLEFC----------- 409 YR D I D GA T +L V+ A D Sbjct: 283 DDAFDYRADDFIYDNIGLGAGTVKTHLRHSNDGTKMVVTGFGAGDSPDYPDEIYVPGNGE 342 Query: 410 ----------------RNRRTELHVKMAD-------WLEFASLINHSGLI---------- 436 RN+R + V +AD +E ++ LI Sbjct: 343 YLPSSNNDDRTHRDTFRNKRAQYWVYLADRFYKTWRAVERGEYLDPDALISLSSKIAKLS 402 Query: 437 ---QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479 L + P + + SK R+KG KS + +D LM +FA Sbjct: 403 QLKSELIKQQRKRTPGNRLIQLMSKDEMRLKGIKSPNMADTLMMSFANP 451 >gi|256422889|ref|YP_003123542.1| hypothetical protein Cpin_3879 [Chitinophaga pinensis DSM 2588] gi|256037797|gb|ACU61341.1| hypothetical protein Cpin_3879 [Chitinophaga pinensis DSM 2588] Length = 471 Score = 100 bits (248), Expect = 7e-19, Method: Composition-based stats. Identities = 52/286 (18%), Positives = 107/286 (37%), Gaps = 38/286 (13%) Query: 229 NPRRLSGKFYEIFNKPL------DDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRV 282 NP++ + +F KP D K Q + IDP + + +++ + V + Sbjct: 196 NPKKN--WCHTVFWKPFKAGQLPDKVKFLQALVQDNPFIDPGYIDNLMS---ITDKVKKQ 250 Query: 283 EVC-GQFP-QQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGP 340 + G F D ++ + + I + E + + DIA G D +VV++ G Sbjct: 251 RLLYGNFDYDDDDNALMEYDSINDIFTNEFVVE--GKKYITADIARFGSDKSVVMVWNGL 308 Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRVL 398 + + + K ++I + KY +++D + G D L+ + + Sbjct: 309 RVVEIRKFEKMRTTKVADEIEKIRNKYGIPLSHVVVDEDGVGGGVVDKLDG----CHGFV 364 Query: 399 GQKRAVDLEF------CRNRRTELHVKMADWL---EFASLINHSG----LIQNLKSLKSF 445 +D +N +++ + +A+ + + + L + L+ +K + Sbjct: 365 NNSAPIDNPQDQQQQNYKNLKSQCYYMLAERINDHKIFVRCDDYEMRELLSEELEQVKKW 424 Query: 446 IVPNTGELAIESK---RVKGAKSTDYSDGLMY-TFAENPPRSDMDF 487 N +L + K + +S DYSD LM F E P Sbjct: 425 DADNDKKLEVMPKKVVKELLGRSPDYSDTLMMRMFFELKPEQRWQI 470 >gi|313760829|gb|ADR79391.1| terminase [APSE phage Eptesicus fuscus/P5/IT/USA/2009] Length = 394 Score = 100 bits (248), Expect = 8e-19, Method: Composition-based stats. Identities = 66/342 (19%), Positives = 102/342 (29%), Gaps = 67/342 (19%) Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT 255 +EA + ++ + + + W NP G Y F KP Q Sbjct: 44 WVEEAETVSEKSLDTLIPTIRKPGSE-LWFSF-NPAEEDGAVYRRFVKPYKAIIDKQGYY 101 Query: 256 RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD-----IDSFIPLNIIEEALNREP 310 E LD+ E+ + + D+ I +E A + Sbjct: 102 EDDEVYVGKVS-------YLDNPWLPAELKNDAQKGECDANYEDALIQPEWVEAATDAHI 154 Query: 311 C--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYR 368 P ++ D A+ G D + R G +IE WS+ D+ YR Sbjct: 155 KLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSEGDVADATMTAFDEAFDYR 214 Query: 369 PDAIIIDANNTGARTC-DYLEML---GYHVYRVLGQKRAVDLEF---------------- 408 D I D GA T +L V G + D Sbjct: 215 ADDFIYDNIGLGAGTVKTHLRHSNDGNKMVVTGFGAGDSPDYPHEIYVPGNGEYLPSSNN 274 Query: 409 --------CRNRRTELHVKMAD-------WLEFASLINHSGLI-------------QNLK 440 RN+R + V +AD +E ++ LI L Sbjct: 275 DDRTHRDTFRNKRAQYWVYLADRFYKTWRAVEKGEYLDPEALISLSSKIAKLSQLKSELI 334 Query: 441 SLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479 + P + + SK R+KG KS + +D LM +FA Sbjct: 335 KQQRKRTPGNRLIQLMSKDEMRLKGIKSPNMADTLMMSFANP 376 >gi|161525001|ref|YP_001580013.1| hypothetical protein Bmul_1828 [Burkholderia multivorans ATCC 17616] gi|189350256|ref|YP_001945884.1| bacteriophage TerL protein [Burkholderia multivorans ATCC 17616] gi|160342430|gb|ABX15516.1| conserved hypothetical protein [Burkholderia multivorans ATCC 17616] gi|189334278|dbj|BAG43348.1| bacteriophage TerL protein [Burkholderia multivorans ATCC 17616] Length = 531 Score = 99.0 bits (245), Expect = 2e-18, Method: Composition-based stats. Identities = 56/332 (16%), Positives = 104/332 (31%), Gaps = 55/332 (16%) Query: 190 TYGMAIINDEASGTP-DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDW 248 I DE++ + L T + S P + F + + Sbjct: 195 DRASFYIVDESAFLERPQLVDASLSATTNCRQD-----ISTPNGMGNSFAQ--RRHSGKV 247 Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN- 307 K F R D +++ A LD V E+ + IP ++ A+ Sbjct: 248 KVFTFHWRDDPRKDDAWYAKQCAE--LDPVVVAQEIDINYAASVEGVVIPSAWVQAAIGA 305 Query: 308 -REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLV 364 + +P G D+A+EG D R G ++ L WS D+ T K G+ Sbjct: 306 HLKLGIEPSGTRRGGLDVADEGKDKNAFAGRYGFLLNFLRSWSGKGGDIYETVEKTFGIC 365 Query: 365 EKYRPDAIIIDANNTGARTCDYLE----------MLGYHVYRVLGQKRAVDLE------- 407 ++ ++ DA+ GA + G D E Sbjct: 366 DELGYESFDYDADGLGAGVRGDARVINEQRIAIGKRPINDEPFRGSGPVHDPEGEMVPER 425 Query: 408 ----FCRNRRTELHVKMADWLEFA-------------SLINHSGLIQNLKSL------KS 444 + N + + + + +I+ ++ L +L + Sbjct: 426 KNKDYFANLKAQSWWALRLRFQATYRAVVEGKPYNPDDIISIDPELKELAALTMELSQPT 485 Query: 445 FIVPNTGELAIESKRVKGAKSTDYSDGLMYTF 476 + V G++ I+ K G KS + +D +M + Sbjct: 486 YTVNGVGKIVID-KAPDGTKSPNLADAVMIAY 516 >gi|255321082|ref|ZP_05362250.1| gp33 TerL [Acinetobacter radioresistens SK82] gi|262379515|ref|ZP_06072671.1| bacteriophage TerL protein [Acinetobacter radioresistens SH164] gi|255301852|gb|EET81101.1| gp33 TerL [Acinetobacter radioresistens SK82] gi|262298972|gb|EEY86885.1| bacteriophage TerL protein [Acinetobacter radioresistens SH164] Length = 558 Score = 98.6 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 53/344 (15%), Positives = 106/344 (30%), Gaps = 61/344 (17%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 DE + + +++ I S P + KF++ ++ + F + Sbjct: 210 MYFLDEWAFVERQ--EAVDAAISQ--NTNVHIKGSTPNGIGDKFHQ--DRFSGRYAVFTM 263 Query: 254 DTRTVEGIDP--SFHEGIIARYGL------DSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305 R + +I + D V EV + IP ++ A Sbjct: 264 AWRDNPDKNWQVELDGKLIYPWYEKQLATLDDIVLAQEVDIDYAASVEGVLIPSAWVQAA 323 Query: 306 LNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKIS 361 ++ +P D+A+EG D R G V+++L WS D+ T K Sbjct: 324 VDAHIKLGIEPSGERNGALDVADEGKDKNSFAARHGIVLQYLDTWSGIGDDIFGTTQKAI 383 Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLEMLG----------YHVYRVLGQKRAVDLE---- 407 + + DA+ GA ++ G + E Sbjct: 384 DACLDLKLNIFFYDADGLGAGVRGDARVINELNKAKGIPEIEANPFRGSGAVHNPEQEMV 443 Query: 408 -------FCRNRRTELHVKMA-------DWLEFASLINHS--------------GLIQNL 439 F N + ++ + L+ S ++ Sbjct: 444 EARKNVDFFANLKAQMWWSLRLRFQNTYRALQGMQYDPDSLISLSTKDINKQELEQLKRE 503 Query: 440 KSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRS 483 S ++ G++ + +K+ GA S + +DG+M F++ P + Sbjct: 504 LSQPTYSKNGAGKILV-NKQPDGALSPNRADGVMICFSDIRPPA 546 >gi|308097723|gb|ADO14402.1| AB1gp31 [Acinetobacter phage AB1] Length = 313 Score = 96.7 bits (239), Expect = 8e-18, Method: Composition-based stats. Identities = 48/292 (16%), Positives = 89/292 (30%), Gaps = 52/292 (17%) Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--RE 309 I+ + + + I + D + G D S I + +E AL+ + Sbjct: 21 HINYNENPFLSQTALDVIADKKRRDPEGFAHIYDGMPRADDDMSIIKASWVEAALDAHKL 80 Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKY 367 D +G D+A+ G D +V R+G V +W + +L + + + Sbjct: 81 LNLDDTGRSYLGFDVADAGKDKCALVHRKGIVAYWSDEWKAREDELLKSATRTYNEAIRL 140 Query: 368 RPDAIIIDANNTGARTCDYLEML-----------------GYHVYRVLGQKRAVDLEFCR 410 I D+ GA + L G H Q + + +F Sbjct: 141 -NALIHYDSTGVGAGVGAKVNELNKEKKTNVQHSKFVAGGGVHEPDKFYQPKITNKDFFA 199 Query: 411 NRRTELHVKMADWLEF-----------ASLINH--SGLI------------QNLKSLKSF 445 N + + +AD + H LI + S+ Sbjct: 200 NAKAQAWWLVADKFRLTYQVIQAIKNGTEIPKHKPEDLISISSDMPNLHRLKVELSIPHR 259 Query: 446 IVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSDMDFGRCPSYQ 494 G + +ESK+ + KS + +D + +A P + M Sbjct: 260 DEDRLGRVMVESKQDLAKRDVKSPNLADAFIMAYA--PVKRSMQINIADVES 309 >gi|169633984|ref|YP_001707720.1| putative bacteriophage protein; putative prophage terminase large subunit [Acinetobacter baumannii SDF] gi|169152776|emb|CAP01795.1| putative bacteriophage protein; putative prophage terminase large subunit [Acinetobacter baumannii] Length = 552 Score = 96.7 bits (239), Expect = 8e-18, Method: Composition-based stats. Identities = 58/337 (17%), Positives = 107/337 (31%), Gaps = 61/337 (18%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 DE + + +++ I S P + +F++ ++ + F + Sbjct: 210 MYFLDEWAFVEQQ--EAVDAAISQ--NTNVHIKGSTPNGIGDRFHQ--DRFSGRYAVFTM 263 Query: 254 DTRTVEGIDPSFHE--GIIARYGL------DSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305 R + + +I + D V EV + IP ++ A Sbjct: 264 PWRDNPDKNWTVTYNGKVIYPWYEKQLATLDDVVLAQEVDINYAASVEGVLIPSTWVQAA 323 Query: 306 LN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKIS 361 ++ ++ +P I G D+A+EG D R G V+ +L WS D+ T K Sbjct: 324 IDAHKKLQIEPTGDRIGGLDVADEGKDKNSFAARHGVVMTYLATWSGKGDDIFGTTQKAM 383 Query: 362 GLVEKYRPDAIIIDANNTGAR-------TCDYLEMLG---YHVYRVLGQKRAVDLE---- 407 L + D + DA+ GA + LG +V G D E Sbjct: 384 DLCFEKSIDTLFYDADGLGAGCRGDARVINEKRRELGLSEINVESFRGSGSVHDPEGEMV 443 Query: 408 -------FCRNRRTELHVKMA-------DWLEFASLINHS--------------GLIQNL 439 F N + + + LE L+ Sbjct: 444 EKRLNKDFFANLKAQSWWSLRLRFQETFRALEGRDYDPDMIISLSSEDIDAKELALLTTE 503 Query: 440 KSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTF 476 S ++ G++ + +K+ G S + +D +M F Sbjct: 504 LSQPTYTKNGVGKILV-NKQPDGTASPNRADSVMICF 539 >gi|298480040|ref|ZP_06998239.1| PBSX family phage terminase [Bacteroides sp. D22] gi|298273849|gb|EFI15411.1| PBSX family phage terminase [Bacteroides sp. D22] Length = 476 Score = 96.7 bits (239), Expect = 9e-18, Method: Composition-based stats. Identities = 55/281 (19%), Positives = 105/281 (37%), Gaps = 33/281 (11%) Query: 216 TERNANRFWIMTSNPRRLSGKFYEIFNKP-----LDDWKRFQID-TRTVEGIDPSFHEGI 269 E R +T NP++ Y+ F KP L ++ + + IDP + EG+ Sbjct: 184 NELGLRRKLFITCNPKKN--WMYDTFYKPDKKGELPEYMYYLACLVQENPFIDPDYIEGL 241 Query: 270 IARYGLDSDVTRVEVC-GQFP-QQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE 327 V R + G + + ++ + I E + I G DIA Sbjct: 242 KTTK---DKVKRERLLKGNWEYDDNPNALCSHDAICEIFGNKISIKTGTNYITG-DIARF 297 Query: 328 GGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYR--PDAIIIDANNTGARTCD 385 G D + + G I L + + I +KYR I+D + G D Sbjct: 298 GADYARLAVWDGWHIIELQCFPVSKTTDIQTWIINKQKKYRIPNHKCIVDEDGVGGGVVD 357 Query: 386 YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL---------EFASLINHSGLI 436 ++ G+ + + E +N +T+ K+AD + + S + +I Sbjct: 358 NCDIQGF-----VNNSTPFNGENYQNLQTQCGYKLADHINATEVGIDEDLISTADKEEII 412 Query: 437 QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMY 474 + L+ L+++ + G+L ++ K ++ S D+ D + Sbjct: 413 RELEQLQTWKADSDGKLKLKPKEEIKMDIGCSPDWRDMFLM 453 >gi|167763812|ref|ZP_02435939.1| hypothetical protein BACSTE_02192 [Bacteroides stercoris ATCC 43183] gi|167697928|gb|EDS14507.1| hypothetical protein BACSTE_02192 [Bacteroides stercoris ATCC 43183] Length = 476 Score = 95.9 bits (237), Expect = 1e-17, Method: Composition-based stats. Identities = 55/281 (19%), Positives = 105/281 (37%), Gaps = 33/281 (11%) Query: 216 TERNANRFWIMTSNPRRLSGKFYEIFNKP-----LDDWKRFQID-TRTVEGIDPSFHEGI 269 E R +T NP++ Y+ F KP L ++ + + IDP + EG+ Sbjct: 184 NELGLRRKLFITCNPKKN--WMYDTFYKPDKKGELPEYMYYLACLVQENPFIDPDYIEGL 241 Query: 270 IARYGLDSDVTRVEVC-GQFP-QQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE 327 V R + G + + ++ + I E + I G DIA Sbjct: 242 KTTK---DKVKRERLLKGNWEYDDNPNALCSHDAICEIFGNKISIKTGTNYITG-DIARF 297 Query: 328 GGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYR--PDAIIIDANNTGARTCD 385 G D + + G I L + + I +KYR I+D + G D Sbjct: 298 GADYARLAVWDGWHIIELQCFPVSKTTDIQTWIINKQKKYRIPNHKCIVDEDGVGGGVVD 357 Query: 386 YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL---------EFASLINHSGLI 436 ++ G+ + + E +N +T+ K+AD + + S + +I Sbjct: 358 NCDIQGF-----VNNSTPFNGENYQNLQTQCGYKLADHINATEVGIDEDLISTADKEEII 412 Query: 437 QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMY 474 + L+ L+++ + G+L ++ K ++ S D+ D + Sbjct: 413 RELEQLQTWEADSDGKLKLKPKEEIKMDIGCSPDWRDMFLM 453 >gi|168260952|ref|ZP_02682925.1| phage terminase, large subunit, pbsx family [Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066] gi|205349913|gb|EDZ36544.1| phage terminase, large subunit, pbsx family [Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066] Length = 471 Score = 95.6 bits (236), Expect = 2e-17, Method: Composition-based stats. Identities = 65/480 (13%), Positives = 137/480 (28%), Gaps = 79/480 (16%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTI-RKTFSEIWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + I +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 A + ++ ++ D ++ G D R G V++ + + D+ + + Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283 Query: 363 LVEKYRPDAIIIDANNTGART--------------------------CDYLEMLGYHVYR 396 L + D + D + GA D L G Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEVFSGKKITATMFKGSESPFDEDALYQAGAWADE 343 Query: 397 -VLGQKRAVDLEFCRNRRTELHVKMADWLEFA---------SLINH-------------- 432 V G + RN+R + + +AD L + + Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403 Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDFG 488 L L ++ N G+L +E K+ G S + +D LM + D+ Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCPESAAQPDYS 462 >gi|163716617|gb|ABY40529.1| putative TerL [Burkholderia phage Bups phi1] Length = 531 Score = 95.2 bits (235), Expect = 3e-17, Method: Composition-based stats. Identities = 63/343 (18%), Positives = 106/343 (30%), Gaps = 57/343 (16%) Query: 190 TYGMAIINDEASGTP-DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDW 248 + DE++ + L T + S P + F + + Sbjct: 195 DRASFYVVDESAFLERPQLVDASLSATTNCRQD-----ISTPNGMGNSFAQ--RRHSGKI 247 Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308 K F R D +++ +A LD V E+ + IP ++ AL Sbjct: 248 KVFTFHWRDDPRKDDAWYAKQVAE--LDPVVVAQEIDINYAASVEGVVIPSAWVQAALGA 305 Query: 309 EPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLV 364 +P G D+A+EG D R G ++EHL WS D+ T +++ G+ Sbjct: 306 HVKLGIEPSGTRRGGLDVADEGKDKNAFAGRYGFLLEHLESWSGVGGDIFGTVDRVLGIC 365 Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLG----------YHVYRVLGQKRAVDLE------- 407 + + DA+ GA +L G D E Sbjct: 366 DVRDYEVFDYDADGLGAGVRGDARVLNEQRVAAGKRSIRNEPFRGSGPVYDPEGEMVKER 425 Query: 408 ----FCRNRRTELHVKMADWL----------------EFASLINHSGLIQNLK---SLKS 444 + N + + + E S+ L S + Sbjct: 426 KNKDYFANLKAQSWWALRLRFQATYRAVVEGKPFDPDEIISIDPDLPERAALSMELSQPT 485 Query: 445 FIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDF 487 F V G++ I+ K G KS + +D +M A P +D Sbjct: 486 FTVNGVGKIVID-KAPDGTKSPNLADAVMI--AYQPAVRGIDI 525 >gi|260868683|ref|YP_003235085.1| putative terminase large subunit [Escherichia coli O111:H- str. 11128] gi|293446697|ref|ZP_06663119.1| phage terminase large subunit [Escherichia coli B088] gi|257765039|dbj|BAI36534.1| putative terminase large subunit [Escherichia coli O111:H- str. 11128] gi|291323527|gb|EFE62955.1| phage terminase large subunit [Escherichia coli B088] gi|323177130|gb|EFZ62720.1| phage terminase, large subunit, PBSX family [Escherichia coli 1180] Length = 471 Score = 94.8 bits (234), Expect = 3e-17, Method: Composition-based stats. Identities = 63/480 (13%), Positives = 137/480 (28%), Gaps = 79/480 (16%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + I +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 A + ++ ++ D ++ G D R G V++ + + D+ + + Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283 Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408 L + D + D + GA T + G + D + Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343 Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432 RN+R + + +AD L + + Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403 Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDFG 488 L L ++ N G+L +E K+ G S + +D LM + D+ Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCPESAAQPDYS 462 >gi|237704849|ref|ZP_04535330.1| terminase large subunit [Escherichia sp. 3_2_53FAA] gi|226901215|gb|EEH87474.1| terminase large subunit [Escherichia sp. 3_2_53FAA] gi|315288241|gb|EFU47640.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 110-3] Length = 471 Score = 94.8 bits (234), Expect = 3e-17, Method: Composition-based stats. Identities = 63/480 (13%), Positives = 137/480 (28%), Gaps = 79/480 (16%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + I +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 A + ++ ++ D ++ G D R G V++ + + D+ + + Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283 Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408 L + D + D + GA T + G + D + Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343 Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432 RN+R + + +AD L + + Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVYGEYADPDDMLSFDKEAIGEKML 403 Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDFG 488 L L ++ N G+L +E K+ G S + +D LM + D+ Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCPESAAQPDYS 462 >gi|324019922|gb|EGB89141.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 117-3] Length = 471 Score = 94.4 bits (233), Expect = 4e-17, Method: Composition-based stats. Identities = 63/480 (13%), Positives = 137/480 (28%), Gaps = 79/480 (16%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + I +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 A + ++ ++ D ++ G D R G V++ + + D+ + + Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKCIAEGLLMDINEGADWATS 283 Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408 L + D + D + GA T + G + D + Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343 Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432 RN+R + + +AD L + + Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVYGEYADPDDMLSFDKEAIGEKML 403 Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDFG 488 L L ++ N G+L +E K+ G S + +D LM + D+ Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCPESAAQPDYS 462 >gi|294492319|gb|ADE91075.1| phage terminase, large subunit, PBSX family [Escherichia coli IHE3034] Length = 471 Score = 94.4 bits (233), Expect = 4e-17, Method: Composition-based stats. Identities = 64/480 (13%), Positives = 137/480 (28%), Gaps = 79/480 (16%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYTAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + I +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 A + ++ ++ D ++ G D R G V++ + + D+ + + Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283 Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408 L + D + D + GA T + G + D + Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343 Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432 RN+R + + +AD L + N Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPNDMLSFDKEAIGEKML 403 Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDFG 488 L L ++ N G+L +E K+ G S + +D LM + D+ Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCPESAAQPDYS 462 >gi|307544683|ref|YP_003897162.1| hypothetical protein HELO_2093 [Halomonas elongata DSM 2581] gi|307216707|emb|CBV41977.1| K06909 [Halomonas elongata DSM 2581] Length = 531 Score = 94.4 bits (233), Expect = 4e-17, Method: Composition-based stats. Identities = 51/338 (15%), Positives = 109/338 (32%), Gaps = 57/338 (16%) Query: 194 AIINDEASGTP-DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 I DE++ + L T + S P + F + + F Sbjct: 199 FYIVDESAFLERPHLVDASLSATTNCRQD-----VSTPNGMGNPFAQ--RRHSGKISVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 R D +++ + LD E+ + IP ++ A++ ++ Sbjct: 252 FHWRDDPRKDDAWYAKQVDE--LDPVTVAQEIDINYSASVEGVLIPSAWVQAAVDAHKKL 309 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368 + + D+A+EG D R G +++ + +W+ +D+ T K +++ Sbjct: 310 GIEITGERLGALDVADEGKDQNAYAGRHGILLDLVDEWTGKGSDIFGTVQKAFDHTDEHG 369 Query: 369 PDAIIIDANNTGARTCDYLEMLG----------YHVYRVLGQK-----------RAVDLE 407 DA+ G+ ++ V G + + + Sbjct: 370 GSRFDYDADGLGSGVRGDARVINEQRAEQKRPKLKVNPFRGSGGVIEPDKEMVPKRKNKD 429 Query: 408 FCRNRRTELHVKM--------ADWLEFASLINHS------------GLIQNLKSLKSFIV 447 F N + + + +E L+ L S ++ V Sbjct: 430 FFANLKAQAWWALRLRFQRTYRAVVEGMEFDPDDIISIDSRLPILSKLMLEL-SQPTYHV 488 Query: 448 PNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSDM 485 TG++ ++ K +G KS + +D +M +A N +D Sbjct: 489 NGTGKVVVD-KAPEGTKSPNLADAVMILYAPNKSVTDR 525 >gi|157159763|ref|YP_001457081.1| PBSX family phage terminase large subunit [Escherichia coli HS] gi|300935792|ref|ZP_07150755.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 21-1] gi|157065443|gb|ABV04698.1| phage terminase, large subunit, pbsx family [Escherichia coli HS] gi|300459025|gb|EFK22518.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 21-1] Length = 471 Score = 94.4 bits (233), Expect = 4e-17, Method: Composition-based stats. Identities = 63/480 (13%), Positives = 137/480 (28%), Gaps = 79/480 (16%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + I +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 A + ++ ++ D ++ G D R G V++ + + D+ + + Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINDGADWATS 283 Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408 L + D + D + GA T + G + D + Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343 Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432 RN+R + + +AD L + + Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403 Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDFG 488 L L ++ N G+L +E K+ G S + +D LM + D+ Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCPESAAQPDYS 462 >gi|91211665|ref|YP_541651.1| terminase large subunit [Escherichia coli UTI89] gi|117624554|ref|YP_853467.1| phage terminase large subunit [Escherichia coli APEC O1] gi|218559279|ref|YP_002392192.1| Terminase large subunit [Escherichia coli S88] gi|91073239|gb|ABE08120.1| terminase large subunit [Escherichia coli UTI89] gi|115513678|gb|ABJ01753.1| phage terminase large subunit [Escherichia coli APEC O1] gi|148566126|gb|ABQ88401.1| phage terminase large subunit [Enterobacteria phage CUS-3] gi|218366048|emb|CAR03793.1| Terminase large subunit [Escherichia coli S88] gi|307626097|gb|ADN70401.1| terminase large subunit [Escherichia coli UM146] gi|323948780|gb|EGB44679.1| phage terminase large subunit [Escherichia coli H252] Length = 471 Score = 94.0 bits (232), Expect = 5e-17, Method: Composition-based stats. Identities = 63/480 (13%), Positives = 137/480 (28%), Gaps = 79/480 (16%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYTAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + I +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 A + ++ ++ D ++ G D R G V++ + + D+ + + Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283 Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408 L + D + D + GA T + G + D + Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343 Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432 RN+R + + +AD L + + Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403 Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDFG 488 L L ++ N G+L +E K+ G S + +D LM + D+ Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCPESAAQPDYS 462 >gi|167725769|ref|ZP_02409005.1| hypothetical protein BpseD_42528 [Burkholderia pseudomallei DM98] Length = 517 Score = 93.6 bits (231), Expect = 7e-17, Method: Composition-based stats. Identities = 63/343 (18%), Positives = 105/343 (30%), Gaps = 57/343 (16%) Query: 190 TYGMAIINDEASGTP-DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDW 248 + DE++ + L T + S P + F + + Sbjct: 181 DRASFYVVDESAFLERPQLVDASLSATTNCRQD-----ISTPNGMGNSFAQ--RRHSGKI 233 Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308 K F R D +++ +A LD V E+ + IP ++ AL Sbjct: 234 KVFTFHWRDDPRKDDAWYAKQVAE--LDPVVVAQEIDINYAASVEGVVIPSAWVQAALGA 291 Query: 309 EPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLV 364 +P G D+A+EG D R G ++EHL WS D+ T ++ G+ Sbjct: 292 HVKLGIEPSGTRRGGLDVADEGKDKNAFAGRYGFLLEHLESWSGVGGDIFGTVDRALGIC 351 Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLG----------YHVYRVLGQKRAVDLE------- 407 + + DA+ GA +L G D E Sbjct: 352 DVRDYEVFDYDADGLGAGVRGDARVLNEQRVAAGKRSIRNEPFRGSGPVYDPEGEMVKER 411 Query: 408 ----FCRNRRTELHVKMADWL----------------EFASLINHSGLIQNLK---SLKS 444 + N + + + E S+ L S + Sbjct: 412 KNKDYFANLKAQSWWALRLRFQATYRAVVEGKPFDPDEIISIDPDLPERAALSMELSQPT 471 Query: 445 FIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDF 487 F V G++ I+ K G KS + +D +M A P +D Sbjct: 472 FTVNGVGKIVID-KAPDGTKSPNLADAVMI--AYQPAVRGIDI 511 >gi|41057280|ref|NP_958178.1| gene 2 protein [Enterobacteria phage Sf6] gi|191165541|ref|ZP_03027382.1| phage terminase, large subunit, pbsx family [Escherichia coli B7A] gi|218695968|ref|YP_002403635.1| Terminase large subunit [Escherichia coli 55989] gi|331678314|ref|ZP_08378989.1| phage terminase, large subunit, PBSX family [Escherichia coli H591] gi|33334159|gb|AAQ12192.1| gene 2 protein [Shigella phage Sf6] gi|190904464|gb|EDV64172.1| phage terminase, large subunit, pbsx family [Escherichia coli B7A] gi|218352700|emb|CAU98482.1| Terminase large subunit [Escherichia coli 55989] gi|324114096|gb|EGC08069.1| phage terminase large subunit [Escherichia fergusonii B253] gi|331074774|gb|EGI46094.1| phage terminase, large subunit, PBSX family [Escherichia coli H591] Length = 470 Score = 93.2 bits (230), Expect = 8e-17, Method: Composition-based stats. Identities = 62/466 (13%), Positives = 134/466 (28%), Gaps = 79/466 (16%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + I +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 A + ++ ++ D ++ G D R G V++ + + D+ + + Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283 Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408 L + D + D + GA T + G + D + Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343 Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432 RN+R + + +AD L + + Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403 Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474 L L ++ N G+L +E K+ G S + +D LM Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448 >gi|13559866|ref|NP_112076.1| terminase large subunit [Enterobacteria phage HK620] gi|13517602|gb|AAK28891.1|AF335538_43 terminase large subunit [Salmonella phage HK620] Length = 470 Score = 93.2 bits (230), Expect = 8e-17, Method: Composition-based stats. Identities = 62/466 (13%), Positives = 134/466 (28%), Gaps = 79/466 (16%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + I +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 A + ++ ++ D ++ G D R G V++ + + D+ + + Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283 Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408 L + D + D + GA T + G + D + Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343 Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432 RN+R + + +AD L + + Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGENIL 403 Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474 L L ++ N G+L +E K+ G S + +D LM Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448 >gi|325497784|gb|EGC95643.1| gene 2 protein [Escherichia fergusonii ECD227] Length = 470 Score = 93.2 bits (230), Expect = 9e-17, Method: Composition-based stats. Identities = 62/466 (13%), Positives = 134/466 (28%), Gaps = 79/466 (16%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + I +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 A + ++ ++ D ++ G D R G V++ + + D+ + + Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283 Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408 L + D + D + GA T + G + D + Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343 Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432 RN+R + + +AD L + + Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVYGEYADPDDMLSFDKEAIGEKML 403 Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474 L L ++ N G+L +E K+ G S + +D LM Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448 >gi|293604595|ref|ZP_06686998.1| phage terminase large subunit [Achromobacter piechaudii ATCC 43553] gi|292817011|gb|EFF76089.1| phage terminase large subunit [Achromobacter piechaudii ATCC 43553] Length = 463 Score = 92.9 bits (229), Expect = 1e-16, Method: Composition-based stats. Identities = 54/321 (16%), Positives = 99/321 (30%), Gaps = 49/321 (15%) Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF--QI 253 +E G + I + + A + + NP L F + L I Sbjct: 135 WIEEGEGLTEEQWSIIDPTIRKEGAEVWVLW--NP-HLITDFVQAKLPALLGADCIIRHI 191 Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPC 311 + + + D D R GQ D S I + IE A++ + Sbjct: 192 NYPDNPFLSATAKRKAERLKEADPDAYRHIYLGQPLSSDDASVIKFHWIEAAVDAHLKLG 251 Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYRP 369 + +G D+A+ G D + G + + L +W + +L + + V R Sbjct: 252 IELGGARTVGYDVADSGADKNACSVFDGAICDELDEWAAPEDELNQSTKRAWAHV---RN 308 Query: 370 DAIIIDANNTGARTCDYLE----MLGYHVYRVLGQKRAVDLEF---------CRNRRTEL 416 ++ D+ GA L GYH + G + D E+ N + + Sbjct: 309 GILVYDSIGVGAHVGSTLADAGIRTGYHKFNAGGAVISPDKEYAPKIKNKEKFENLKAQA 368 Query: 417 HVKMADWLE--------------------FASLINHSGLIQNLKSLKSFIVPNTGELAIE 456 +AD L + + L L + + G +E Sbjct: 369 WQDVADRLRNTYNAVTKGMVFPASELISISSGISKLEQLKIELSAPRK-RYSKRGLDMVE 427 Query: 457 SKR---VKGAKSTDYSDGLMY 474 +K +G S + +D + Sbjct: 428 TKEDMARRGIPSPNLADSFIM 448 >gi|222032743|emb|CAP75482.1| Terminase large subunit [Escherichia coli LF82] Length = 470 Score = 92.9 bits (229), Expect = 1e-16, Method: Composition-based stats. Identities = 62/466 (13%), Positives = 134/466 (28%), Gaps = 79/466 (16%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + I +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 A + ++ ++ D ++ G D R G V++ + + D+ + + Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283 Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408 L + D + D + GA T + G + D + Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343 Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432 RN+R + + +AD L + + Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADSLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403 Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474 L L ++ N G+L +E K+ G S + +D LM Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448 >gi|168239626|ref|ZP_02664684.1| phage terminase, large subunit, pbsx family protein [Salmonella enterica subsp. enterica serovar Schwarzengrund str. SL480] gi|197287704|gb|EDY27095.1| phage terminase, large subunit, pbsx family protein [Salmonella enterica subsp. enterica serovar Schwarzengrund str. SL480] Length = 470 Score = 92.9 bits (229), Expect = 1e-16, Method: Composition-based stats. Identities = 62/466 (13%), Positives = 134/466 (28%), Gaps = 79/466 (16%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYAAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILVPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + I +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 A + ++ ++ D ++ G D R G V++ + + D+ + + Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283 Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408 L + D + D + GA T + G + D + Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343 Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432 RN+R + + +AD L + + Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEVIGEKML 403 Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474 L L ++ N G+L +E K+ G S + +D LM Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448 >gi|323936486|gb|EGB32774.1| phage terminase large [Escherichia coli E1520] Length = 470 Score = 92.9 bits (229), Expect = 1e-16, Method: Composition-based stats. Identities = 62/466 (13%), Positives = 134/466 (28%), Gaps = 79/466 (16%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + I +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 A + ++ ++ D ++ G D R G V++ + + D+ + + Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283 Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDL------------ 406 L + D + D + GA T + G + D Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDGPYQAGAWADE 343 Query: 407 -----------EFCRNRRTELHVKMADWLEFA---------SLINH-------------- 432 + RN+R + + +AD L + + Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIDEKML 403 Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474 L L ++ N G+L +E K+ G S + +D LM Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448 >gi|300897414|ref|ZP_07115839.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 198-1] gi|300358826|gb|EFJ74696.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 198-1] Length = 470 Score = 92.1 bits (227), Expect = 2e-16, Method: Composition-based stats. Identities = 62/466 (13%), Positives = 134/466 (28%), Gaps = 79/466 (16%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + I +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 A + ++ ++ D ++ G D R G V++ + + D+ + + Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283 Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408 L + D + D + GA T + G + D + Sbjct: 284 LAIEDGSDHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343 Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432 RN+R + + +AD L + + Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403 Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474 L L ++ N G+L +E K+ G S + +D LM Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448 >gi|324114526|gb|EGC08494.1| hypothetical protein ERIG_00518 [Escherichia fergusonii B253] Length = 540 Score = 91.7 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 57/396 (14%), Positives = 122/396 (30%), Gaps = 65/396 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R D ++ + + V E+ + IP ++ A++ Sbjct: 252 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P + D+A+EG D R G ++E++ +WS +D+ + K+ G E+ Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 370 Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401 + D + GA + L V GQ Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 430 Query: 402 RAVDLEFCRNRRTELHVKMADWL--------EFASLINH------------SGLIQNLKS 441 ++ +F N + + ++ E + LI L S Sbjct: 431 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMAYNPDEIISISSSMALKDKLIIEL-S 489 Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ + + G++ I+ K+ G +S + +D +M +A Sbjct: 490 QPTYSINSVGKIVID-KQPDGTRSPNLADSVMINYA 524 >gi|238027169|ref|YP_002911400.1| hypothetical protein bglu_1g15550 [Burkholderia glumae BGR1] gi|237876363|gb|ACR28696.1| Hypothetical protein bglu_1g15550 [Burkholderia glumae BGR1] Length = 531 Score = 91.7 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 59/332 (17%), Positives = 101/332 (30%), Gaps = 55/332 (16%) Query: 190 TYGMAIINDEASGTP-DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDW 248 + DE++ + L T + S P + F + + Sbjct: 195 DRASFYVVDESAFLERPQLVDASLSATTNCRQD-----ISTPNGMGNSFAQ--RRHSGKI 247 Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308 K F R D +++ +A LD V E+ + IP ++ AL Sbjct: 248 KVFTFHWRDDPRKDDAWYAKQVAE--LDPVVVAQEIDINYAASVEGVVIPSAWVQAALGA 305 Query: 309 EPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLV 364 P G D+A+EG D R G ++EHL WS D+ T ++ G+ Sbjct: 306 HVKLGISPSGARRGGLDVADEGKDKNAFAGRYGFLLEHLESWSGVGGDIFGTVDRALGIC 365 Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLG----------YHVYRVLGQKRAVDL-------- 406 + + DA+ GA +L G D Sbjct: 366 DVRGYEVFDYDADGLGAGVRGDARVLNEQRAAAGKRSIRSEPFRGSGPVYDPDGEMVKER 425 Query: 407 ---EFCRNRRTELHVKMADWL----------------EFASLINHSGLIQNLK---SLKS 444 ++ N + + + E S+ L S + Sbjct: 426 KNKDYFANLKAQSWWALRLRFQATYRAVVEGKPFDPDEIISIDPDLPERAALSMELSQPT 485 Query: 445 FIVPNTGELAIESKRVKGAKSTDYSDGLMYTF 476 F V G++ I+ K G KS + +D +M + Sbjct: 486 FTVNGVGKIVID-KAPDGTKSPNLADAVMIAY 516 >gi|260856407|ref|YP_003230298.1| putative terminase large subunit [Escherichia coli O26:H11 str. 11368] gi|257755056|dbj|BAI26558.1| putative terminase large subunit [Escherichia coli O26:H11 str. 11368] Length = 470 Score = 91.7 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 62/466 (13%), Positives = 133/466 (28%), Gaps = 79/466 (16%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + I +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 A + + ++ D ++ G D R G V++ + + D+ + + Sbjct: 224 ATDAHTKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283 Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDL------------ 406 L + D + D + GA T + G + D Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDGPYQAGAWADE 343 Query: 407 -----------EFCRNRRTELHVKMADWLEFA---------SLINH-------------- 432 + RN+R + + +AD L + + Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403 Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474 L L ++ N G+L +E K+ G S + +D LM Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448 >gi|330910791|gb|EGH39301.1| phage terminase, large subunit [Escherichia coli AA86] Length = 540 Score = 91.7 bits (226), Expect = 3e-16, Method: Composition-based stats. Identities = 57/397 (14%), Positives = 127/397 (31%), Gaps = 67/397 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 R D ++ + +D+ V E+ + IP ++ A++ Sbjct: 252 FHWRDDPRKDEEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIK 309 Query: 312 --PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKY 367 P + D+A+EG D R G ++E++ +WS +D+ + K+ G E+ Sbjct: 310 LGIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQD 369 Query: 368 RPDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQ 400 + D + GA + L V GQ Sbjct: 370 NLEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQ 429 Query: 401 KRAVDLEFCRNRRTELHVKMADWLE--------------------FASLINHSGLIQNLK 440 ++ +F N + + ++ + +++ + LI L Sbjct: 430 AARLNKDFFANAKAQSWWRLRKLFQNTYRAVVEGMAYNPDEIISISSAMASKDKLIIEL- 488 Query: 441 SLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 S ++ + G++ ++ K+ G KS + +D +M ++A Sbjct: 489 SQPTYSINGVGKIVVD-KQPDGTKSPNLADSVMISYA 524 >gi|319789040|ref|YP_004150673.1| protein of unknown function DUF264 [Thermovibrio ammonificans HB-1] gi|317113542|gb|ADU96032.1| protein of unknown function DUF264 [Thermovibrio ammonificans HB-1] Length = 419 Score = 91.7 bits (226), Expect = 3e-16, Method: Composition-based stats. Identities = 70/419 (16%), Positives = 146/419 (34%), Gaps = 58/419 (13%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 +Q+E ++ +D+H + I R GK+ + ++ +T+P +++ Sbjct: 6 PYQIEIVKGIDSHKFSV------------IKMARQTGKSFVVSYWATRRATTKPNHAIVV 53 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 ++ +E Q L +K ++++ L ++ D L ++ + S + Sbjct: 54 VSPTERQ------------SKLFVDKVKLHIKAMRLTGVKFFEDTELKKLEVNFPNGSQI 101 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNP 230 PD G +I DE + + + + +T + + + S P Sbjct: 102 --IALPANPDGIRGFSGD----VIMDEVAFFKNWQEVYRAVFPIITRK-KDYKLVAISTP 154 Query: 231 RRLSGKFYEIF--NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF 288 + FY ++ ++ W R+ ++ + + D R E +F Sbjct: 155 FGKNDLFYYLWSISENNPKWFRYSLNIFEAVAKGLKVDVEELRAGIKNEDAWRTEYLVEF 214 Query: 289 PQQDIDSFIPLNIIEEA-LNREPCPDPY-----APLIMGCDIAEEGGDNTVVV----LRR 338 + D+ +P +I++ + +E L G D+ D TV+ L Sbjct: 215 IDEA-DAVLPYELIQKCEMPKEELLVEDIKELKGELYCGVDVGRR-KDLTVITLLEKLGD 272 Query: 339 GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML-GYHVYRV 397 + + + SK R IS + + ID G + + L+ G V V Sbjct: 273 VLYVRRIEELSKKPFREQLELISHYA--HYARRLAIDETGLGMQLAEELKERFGSKVIPV 330 Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIE 456 A + E + L K D + L ++L S++ V N G + E Sbjct: 331 YF--SAKNKEELAEK---LRAKFQD--RLIRVPADPDLREDLHSVRK-TVTNAGNVRYE 381 >gi|268589862|ref|ZP_06124083.1| phage terminase, large subunit, PBSX family [Providencia rettgeri DSM 1131] gi|291314845|gb|EFE55298.1| phage terminase, large subunit, PBSX family [Providencia rettgeri DSM 1131] Length = 470 Score = 91.7 bits (226), Expect = 3e-16, Method: Composition-based stats. Identities = 67/487 (13%), Positives = 138/487 (28%), Gaps = 79/487 (16%) Query: 66 CLNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTL 124 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 3 QINPIFMPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILC 49 Query: 125 WAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTF 184 E+ +S + + + + + + + + Sbjct: 50 ARELQNSISDSVIRLLEDTIEREGYNNEFEIQRTMIKHLGTGAEFMFYGIKNNPTKIKSL 109 Query: 185 VGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-K 243 G +EA ++ + + N+ W+ NP+ + Y+ F Sbjct: 110 EGVD-----VCWVEEAEAVTKESWDILIPTIRKPNSE-IWVSF-NPKNILDDTYQRFVVN 162 Query: 244 PLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303 P DD + + + + R G+ + I +E Sbjct: 163 PPDDICLLTANYTDNPHFPDVLRLEMEECKRKNPTLYRHIWLGEPVSASDMAIIKREWLE 222 Query: 304 EALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS 361 A + ++ +I D ++ GGD +R G V++ + + D+ + + Sbjct: 223 AATDAHKKLGWKAKGAIIATHDPSDVGGDAKGYAMRHGSVVKRISEGLLMDVNDGADWAT 282 Query: 362 GLVEKYRPDAIIIDANNTGART--------------------------CDYLEMLGYHVY 395 + D + D + GA D L G Sbjct: 283 EKAIQDGADHFLWDGDGLGAALRRQVTDAFTGKQTTVTMFKGSESPFDEDALYQSGAWAD 342 Query: 396 RVLGQKRAVDL-EFCRNRRTELHVKMADWL-------EFASLINHSGLIQ---------- 437 V+ + + + RN+R + + +AD L E N +I Sbjct: 343 EVVSGDNSRTIGDVFRNKRAQFYYALADRLYLTYRAVEHGEYANPDDMISFDKEAIGEQM 402 Query: 438 ------NLKSLKSFIVPNTGELAIESKRVK----GAKSTDYSDGLMYTFAENPPRSDMDF 487 L ++ G+L + +K G S + +D LM + D Sbjct: 403 LEKLFAELTQIQR-KFNGNGKLELMTKVDMKVKLGIPSPNLADSLMMSMYCPVIIHDDTE 461 Query: 488 GRCPSYQ 494 PS Sbjct: 462 IYVPSSS 468 >gi|85716479|ref|ZP_01047450.1| prophage MuMc02, terminase, ATPase subunit, putative [Nitrobacter sp. Nb-311A] gi|85696668|gb|EAQ34555.1| prophage MuMc02, terminase, ATPase subunit, putative [Nitrobacter sp. Nb-311A] Length = 250 Score = 91.3 bits (225), Expect = 3e-16, Method: Composition-based stats. Identities = 50/262 (19%), Positives = 78/262 (29%), Gaps = 38/262 (14%) Query: 51 PRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISV 110 P WQ E + NP + + + GKTT+ A + L G V Sbjct: 24 PDPWQAELLR----------LNPKRALLLCSRQS----GKTTVTALMALHRAIYETGALV 69 Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170 + ++ S Q L ++ K L ++ Sbjct: 70 VIVSPSNRQSGEML-RQIKKLHGSLKGAPELVGDAVLKVELA--------------NGSR 114 Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230 + +E+ G +I DEAS D + + L R A+ I + P Sbjct: 115 IIALPGTEKTIRGIAG-----VSLVIIDEASRVDDELLAAVRPMLATR-ADGSLIALTTP 168 Query: 231 RRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290 G FYE ++ W R ++ I F + G E F Sbjct: 169 AGKRGFFYEAWHSDDQTWHRVRVAASDCPRISKEFLADELRSLG--PARYSEEYELAFVD 226 Query: 291 QDIDSFIPLNIIEEALNREPCP 312 +F P +IE A E P Sbjct: 227 DAASAF-PTAVIERAFTTEVEP 247 >gi|298381518|ref|ZP_06991117.1| phage terminase large subunit [Escherichia coli FVEC1302] gi|298278960|gb|EFI20474.1| phage terminase large subunit [Escherichia coli FVEC1302] Length = 470 Score = 90.9 bits (224), Expect = 4e-16, Method: Composition-based stats. Identities = 61/466 (13%), Positives = 133/466 (28%), Gaps = 79/466 (16%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIFKREWLEA 223 Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 A + ++ ++ D ++ G D R G V++ + + D+ + + Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283 Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408 L + D + D + GA T + G + D + Sbjct: 284 LAIEDGSDHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343 Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432 RN+R + + +AD L + + Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403 Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474 L L ++ N G+L +E K+ G S + +D LM Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448 >gi|167753387|ref|ZP_02425514.1| hypothetical protein ALIPUT_01661 [Alistipes putredinis DSM 17216] gi|167658012|gb|EDS02142.1| hypothetical protein ALIPUT_01661 [Alistipes putredinis DSM 17216] Length = 472 Score = 90.9 bits (224), Expect = 4e-16, Method: Composition-based stats. Identities = 43/259 (16%), Positives = 92/259 (35%), Gaps = 31/259 (11%) Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFP-QQDIDSFIPLNIIEEALNREPCPDP 314 I+ + E + + V + + G + + ++ + I E + Sbjct: 230 DNPFIEKDYIEALKST---TDKVKKERLLKGNWDYDDNPNALCSYDNIREIFYPKIH-TR 285 Query: 315 YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYR--PDAI 372 + DIA G D +++ G I + ++ I L K+R I Sbjct: 286 TGIKYITADIARFGSDRARILVWDGWAIIEQVSFDRSATTEIAACIESLAAKHRIPRYRI 345 Query: 373 IIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINH 432 I D + G D + G+ + + ++ E N +T+ K+A+ + ++ Sbjct: 346 IADEDGVGGGVVDMCRISGF-----VNNSQCLNGENFSNLQTQCGYKLANKINSFAISFD 400 Query: 433 SGL--------IQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPP 481 L + L+ L+++ V N +L ++ K + +S D+ D L+ Sbjct: 401 CELSDGQKDEITEELEQLQTWNVDNDRKLFLKPKDEIKQDIGRSPDWRDALLM------- 453 Query: 482 RSDMDFGRCPSYQYEGVDL 500 R D+ + E + L Sbjct: 454 RVWFDYKQIIPLSKEDLGL 472 >gi|238765385|ref|ZP_04626308.1| Gp33 TerL [Yersinia kristensenii ATCC 33638] gi|238696377|gb|EEP89171.1| Gp33 TerL [Yersinia kristensenii ATCC 33638] Length = 501 Score = 90.9 bits (224), Expect = 5e-16, Method: Composition-based stats. Identities = 58/402 (14%), Positives = 121/402 (30%), Gaps = 64/402 (15%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F S W + ++ + + + + Sbjct: 106 ALFWKARKFVETLPSEFRGSWSEKKHAPYMRVEFPDTGAVIKGEAGDNIGR-----GDRT 160 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 DE++ + I L++ R I S+ ++ F + + F Sbjct: 161 TLYFVDESAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHGGKIPVFT 214 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCP 312 R+ D + + V E+ + IP ++ A++ Sbjct: 215 FHWRSDPRKDDEW-YRKECEKIDNPVVVAQELDLNYQASAEGILIPSEWVQAAIDAHIHL 273 Query: 313 D--PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNN--KISGLVEKYR 368 D P + D+A+EG D +R G +++ + +WS + K+ G ++Y Sbjct: 274 DIQPSGARLGAMDVADEGRDKNGFAIRYGFLLQDVKEWSGEGSDIYASVVKVFGYCDEYG 333 Query: 369 PDAIIIDANNTGAR------TCDYLEM---------------------LGYHVYRVLGQK 401 D D + GA + L V G+ Sbjct: 334 LDEFRFDEDGLGAGVRGDARVINELRQSERLGPITATPFRGSGAVFDPDDEAVIGDNGKP 393 Query: 402 RAVDLEFCRNRRTELHVKMADWLE-------------------FASLINHSGLIQNLKSL 442 ++ +F N + + + +++ N LI L S Sbjct: 394 ARLNKDFFANAKAQGWWHLRKLFRNTFRAMKGMDYNPDEIISINSTMENKDRLIMEL-SQ 452 Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSD 484 ++ G++ I+ K+ +G KS + +D +M +A D Sbjct: 453 PTWSKNAVGKIVID-KQPEGTKSPNLADAVMINYAPMDSSLD 493 >gi|300824951|ref|ZP_07105051.1| conserved hypothetical protein [Escherichia coli MS 119-7] gi|300522580|gb|EFK43649.1| conserved hypothetical protein [Escherichia coli MS 119-7] Length = 540 Score = 90.9 bits (224), Expect = 5e-16, Method: Composition-based stats. Identities = 58/396 (14%), Positives = 121/396 (30%), Gaps = 65/396 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R D ++ + + V E+ + IP ++ A++ Sbjct: 252 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P + D+A+EG D R G ++E++ +WS +D+ + KI G E+ Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKIFGFCEQDN 370 Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401 + D + GA + L V GQ Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 430 Query: 402 RAVDLEFCRNRRTELHVKMADWL--------EFASLINH------------SGLIQNLKS 441 ++ +F N + + ++ E + LI L S Sbjct: 431 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMAYNPDEIISISSSMALKDKLIIEL-S 489 Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ + G++ I+ K+ G +S + +D +M +A Sbjct: 490 QPTYSINGVGKIVID-KQPDGTRSPNLADSVMINYA 524 >gi|254160843|ref|YP_003043951.1| hypothetical protein ECB_00733 [Escherichia coli B str. REL606] gi|253972744|gb|ACT38415.1| conserved hypothetical protein [Escherichia coli B str. REL606] Length = 540 Score = 90.9 bits (224), Expect = 5e-16, Method: Composition-based stats. Identities = 57/396 (14%), Positives = 122/396 (30%), Gaps = 65/396 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R D ++ + + V E+ + IP ++ A++ Sbjct: 252 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P + D+A+EG D R G ++E++ +WS +D+ + K+ G E+ Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 370 Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401 + D + GA + L + V GQ Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRNVARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 430 Query: 402 RAVDLEFCRNRRTELHVKMADWL--------EFASLINH------------SGLIQNLKS 441 ++ +F N + + ++ E + LI L S Sbjct: 431 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMAYNPDEIISISSSMALKDKLIIEL-S 489 Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ + G++ I+ K+ G +S + +D +M +A Sbjct: 490 QPTYSINGVGKIVID-KQPDGTRSPNLADSVMINYA 524 >gi|168467237|ref|ZP_02701079.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|195630466|gb|EDX49092.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Newport str. SL317] Length = 539 Score = 90.6 bits (223), Expect = 6e-16, Method: Composition-based stats. Identities = 54/394 (13%), Positives = 117/394 (29%), Gaps = 62/394 (15%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F + W + ++ + + + + Sbjct: 143 ALFWKVRKFIATLPAEFRGGWDERKHSRFMSVEFPDTGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYFVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHSGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R+ D + + + E+ + IP ++ A++ Sbjct: 252 FHWRSDPRKDDEW-YRKECEKIDNPIIVAQELDLNYQASAEGILIPSEWVQAAVDAHIKL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368 P + D+A+EG D LR G ++ + +WS +D+ + K+ GL + + Sbjct: 311 GIQPSGQRLGAMDVADEGRDKNACSLRYGFLLSDVQEWSGKGSDIYDSVVKVFGLCDDFG 370 Query: 369 PDAIIIDANNTGART---------------CDYL-----EMLGYHVYRV-------LGQK 401 D D + GA D + G Y G+ Sbjct: 371 ADEFRFDEDGLGAGVRGDARAINELREAEGTDQITATPFRGSGRVFYPENEAVPGDNGKP 430 Query: 402 RAVDLEFCRNRRTELHVKMAD-------WLEFASLINHS-----------GLIQNLKSLK 443 ++ +F N + + + L+ + S Sbjct: 431 SRLNKDFFANAKAQGWWHLRKLFRNTFRALKGMEYDPDEIISISSTMENKDRLLMELSQP 490 Query: 444 SFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ G++ ++ K+ G KS + +D +M +A Sbjct: 491 TWSKNAVGKILVD-KQPDGTKSPNLADSVMIAYA 523 >gi|194430118|ref|ZP_03062621.1| gp33 TerL [Escherichia coli B171] gi|215487586|ref|YP_002330017.1| predicted terminase, large subunit [Escherichia coli O127:H6 str. E2348/69] gi|260845222|ref|YP_003223000.1| putative terminase large subunit [Escherichia coli O103:H2 str. 12009] gi|194411828|gb|EDX28147.1| gp33 TerL [Escherichia coli B171] gi|215265658|emb|CAS10061.1| predicted terminase, large subunit [Escherichia coli O127:H6 str. E2348/69] gi|257760369|dbj|BAI31866.1| predicted terminase large subunit [Escherichia coli O103:H2 str. 12009] gi|309702924|emb|CBJ02255.1| putative phage gp33 TerL [Escherichia coli ETEC H10407] gi|323159191|gb|EFZ45181.1| gp33 TerL [Escherichia coli E128010] Length = 540 Score = 90.2 bits (222), Expect = 7e-16, Method: Composition-based stats. Identities = 57/396 (14%), Positives = 121/396 (30%), Gaps = 65/396 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R D ++ + + V E+ + IP ++ A++ Sbjct: 252 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P + D+A+EG D R G ++E++ +WS +D+ + K+ G E+ Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 370 Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401 + D + GA + L V GQ Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 430 Query: 402 RAVDLEFCRNRRTELHVKMADWL--------EFASLINH------------SGLIQNLKS 441 ++ +F N + + ++ E + LI L S Sbjct: 431 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMAYNPDEIISISSSMALKDKLIIEL-S 489 Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ + G++ I+ K+ G +S + +D +M +A Sbjct: 490 QPTYSINGVGKIVID-KQPDGTRSPNLADSVMINYA 524 >gi|168820654|ref|ZP_02832654.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|205342611|gb|EDZ29375.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] Length = 539 Score = 90.2 bits (222), Expect = 7e-16, Method: Composition-based stats. Identities = 52/394 (13%), Positives = 116/394 (29%), Gaps = 62/394 (15%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F + W + ++ + + + + Sbjct: 143 ALFWKVRKFIATLPAEFRGGWDERKHSRFMSVEFPDTGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYFVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHSGKISVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R+ D + + + E+ + IP ++ A++ Sbjct: 252 FHWRSDPRKDDEW-YRKECEKIDNPIIVAQELDLNYQASAEGILIPSEWVQAAVDAHIKL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368 P + D+A+EG D LR G ++ + +WS +D+ + K+ GL + + Sbjct: 311 GIQPSGQRLGAMDVADEGRDKNACSLRYGFLLSDVQEWSGKGSDIYDSVVKVFGLCDDFG 370 Query: 369 PDAIIIDANNTGART---------------CDYLEMLGYHVYRV------------LGQK 401 D D + GA D + + G+ Sbjct: 371 ADEFRFDEDGLGAGVRGDARAINELREAEGTDQITATPFRGSGSVFYPENEAVPGDNGKP 430 Query: 402 RAVDLEFCRNRRTELHVKMAD-------WLEFASLINHS-----------GLIQNLKSLK 443 ++ +F N + + + L+ + S Sbjct: 431 ARLNKDFFANAKAQGWWHLRKLFRNTFRALKGMEYDPDEIISISSTMENKDRLLMELSQP 490 Query: 444 SFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ G++ ++ K+ G KS + +D +M +A Sbjct: 491 TWSKNAVGKILVD-KQPDGTKSPNLADSVMIAYA 523 >gi|218555117|ref|YP_002388030.1| hypothetical protein ECIAI1_2647 [Escherichia coli IAI1] gi|218361885|emb|CAQ99485.1| conserved hypothetical protein from bacteriophage origin [Escherichia coli IAI1] Length = 540 Score = 90.2 bits (222), Expect = 8e-16, Method: Composition-based stats. Identities = 57/396 (14%), Positives = 121/396 (30%), Gaps = 65/396 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R D ++ + + V E+ + IP ++ A++ Sbjct: 252 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P + D+A+EG D R G ++E++ +WS +D+ + K+ G E+ Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLMENVREWSGVGSDIYQSVEKVFGFCEQDN 370 Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401 + D + GA + L V GQ Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 430 Query: 402 RAVDLEFCRNRRTELHVKMADWL--------EFASLINH------------SGLIQNLKS 441 ++ +F N + + ++ E + LI L S Sbjct: 431 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMAYNPDEIISISSSMALKDKLIIEL-S 489 Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ + G++ I+ K+ G +S + +D +M +A Sbjct: 490 QPTYSINGVGKIVID-KQPDGTRSPNLADSVMINYA 524 >gi|291283815|ref|YP_003500633.1| hypothetical protein G2583_3121 [Escherichia coli O55:H7 str. CB9615] gi|290763688|gb|ADD57649.1| hypothetical protein G2583_3121 [Escherichia coli O55:H7 str. CB9615] Length = 540 Score = 90.2 bits (222), Expect = 8e-16, Method: Composition-based stats. Identities = 57/395 (14%), Positives = 122/395 (30%), Gaps = 63/395 (15%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R D ++ + + V E+ + IP ++ A++ Sbjct: 252 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P + D+A+EG D R G ++E++ +WS +D+ + K+ G E+ Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 370 Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401 + D + GA + L V GQ Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 430 Query: 402 RAVDLEFCRNRRTELHVKMADWL----------------EFASLINHSGLIQNL---KSL 442 ++ +F N + + ++ E S+ + L L S Sbjct: 431 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMAYNPDEIISISSSMALKDKLIIALSQ 490 Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ + G++ I+ K+ G +S + +D +M +A Sbjct: 491 PTYSINGVGKIVID-KQPDGTRSPNLADSVMINYA 524 >gi|62181180|ref|YP_217597.1| hypothetical protein SC2610 [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|62128813|gb|AAX66516.1| orf, partial conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|322715669|gb|EFZ07240.1| hypothetical protein SCA50_2790 [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 540 Score = 89.8 bits (221), Expect = 9e-16, Method: Composition-based stats. Identities = 55/396 (13%), Positives = 125/396 (31%), Gaps = 65/396 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWNEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R+ D ++ + + V E+ + IP + ++ A++ Sbjct: 252 FHWRSDPRKDDEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSDWVQAAVDAHIRL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P + D+A+EG D R G ++E++ +WS +D+ + K+ G E+ Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 370 Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401 + D + GA + L V GQ Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRKAARRPPILATPFRGSGAVFDPDDEAVRGDNGQA 430 Query: 402 RAVDLEFCRNRRTELHVKMADWLE--------------------FASLINHSGLIQNLKS 441 ++ +F N + + + +++ + LI L S Sbjct: 431 ARLNKDFFANAKAQSWWYLRKLFRNTYRAVVEGMAYNPDEIISISSTMESKDKLIIEL-S 489 Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ + G++ ++ K+ G +S + +D +M ++A Sbjct: 490 QPTYSINGVGKIVVD-KQPDGTRSPNLADSVMISYA 524 >gi|194445851|ref|YP_002040314.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Newport str. SL254] gi|194404514|gb|ACF64736.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Newport str. SL254] Length = 540 Score = 89.8 bits (221), Expect = 9e-16, Method: Composition-based stats. Identities = 55/396 (13%), Positives = 125/396 (31%), Gaps = 65/396 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWNEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R+ D ++ + + V E+ + IP + ++ A++ Sbjct: 252 FHWRSDPRKDDEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSDWVQAAVDAHIRL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P + D+A+EG D R G ++E++ +WS +D+ + K+ G E+ Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 370 Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401 + D + GA + L V GQ Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRKAARRPPILATPFRGSGAVFDPDDEAVRGDNGQA 430 Query: 402 RAVDLEFCRNRRTELHVKMADWLE--------------------FASLINHSGLIQNLKS 441 ++ +F N + + + +++ + LI L S Sbjct: 431 ARLNKDFFANAKAQSWWYLRKLFRNTYRAVVEGMAYNPDEIISISSTMESKDKLIIEL-S 489 Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ + G++ ++ K+ G +S + +D +M ++A Sbjct: 490 QPTYSINGVGKIVVD-KQPDGTRSPNLADSVMISYA 524 >gi|188494674|ref|ZP_03001944.1| gp33 TerL [Escherichia coli 53638] gi|188489873|gb|EDU64976.1| gp33 TerL [Escherichia coli 53638] Length = 539 Score = 89.8 bits (221), Expect = 1e-15, Method: Composition-based stats. Identities = 56/395 (14%), Positives = 121/395 (30%), Gaps = 64/395 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHSGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R+ D + + + E+ + IP ++ A++ Sbjct: 252 FHWRSDPRKDDEW-YHKECEKIDNPVIVAQELDLNYQASAEGILIPSEWVQAAVDAHIRL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368 P + D+A+EG D LR G ++ + +WS +D+ + K+ GL + + Sbjct: 311 GIQPGGQRLGAMDVADEGRDKNACSLRYGILLNDVQEWSGKGSDIYDSVVKVFGLCDDFG 370 Query: 369 PDAIIIDANNTGART------CDYLEM---------------------LGYHVYRVLGQK 401 D D + GA + L V G+ Sbjct: 371 ADEFRFDEDGLGAGVRGDARAINELREAEGICQITATPFRGSGSVFHPENEAVPGDNGKP 430 Query: 402 RAVDLEFCRNRRTELHVKMADWLE-------------------FASLINHSGLIQNLKSL 442 ++ +F N + + + +++ N L+ L S Sbjct: 431 ARLNKDFFVNAKAQGWWHLRKLFRNTFRALQGMEYDPDEIISISSTMENKDRLLMEL-SQ 489 Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ TG++ ++ K+ G KS + +D +M +A Sbjct: 490 PTWSKNATGKILVD-KQPDGTKSPNLADSVMIAYA 523 >gi|167553969|ref|ZP_02347711.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] gi|205321713|gb|EDZ09552.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] Length = 539 Score = 89.4 bits (220), Expect = 1e-15, Method: Composition-based stats. Identities = 52/394 (13%), Positives = 116/394 (29%), Gaps = 62/394 (15%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F + W + ++ + + + + Sbjct: 143 ALFWKVRKFIATLPAEFRGGWDERKHSRFMSVEFPDTGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYFVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHSGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R+ D + + + E+ + IP ++ A++ Sbjct: 252 FHWRSDPRKDDEW-YHKECEKIDNPIIVAQELDLNYQASTEGILIPSEWVQAAVDAHIKL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368 P + D+A+EG D LR G ++ + +WS +D+ + K+ GL + + Sbjct: 311 GIQPSGQRLGAMDVADEGRDKNACSLRYGFLLSDVQEWSGKGSDIYDSVVKVFGLCDDFG 370 Query: 369 PDAIIIDANNTGART---------------CDYLEMLGYHVYRV------------LGQK 401 D D + GA D + + G+ Sbjct: 371 ADEFRFDEDGLGAGVRGDARAINELREAEGTDQITATPFRGSGSVFYPENEAVPGDNGKP 430 Query: 402 RAVDLEFCRNRRTELHVKMAD-------WLEFASLINHS-----------GLIQNLKSLK 443 ++ +F N + + + L+ + S Sbjct: 431 SRLNKDFFANAKAQGWWHLRKLFRNTFRALKGMEYDPDEIISISSTMENKDRLLMELSQP 490 Query: 444 SFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ G++ ++ K+ G KS + +D +M +A Sbjct: 491 TWSKNAVGKILVD-KQPDGTKSPNLADSVMIAYA 523 >gi|224582844|ref|YP_002636642.1| hypothetical protein SPC_1035 [Salmonella enterica subsp. enterica serovar Paratyphi C strain RKS4594] gi|224467371|gb|ACN45201.1| hypothetical protein SPC_1035 [Salmonella enterica subsp. enterica serovar Paratyphi C strain RKS4594] Length = 540 Score = 89.4 bits (220), Expect = 1e-15, Method: Composition-based stats. Identities = 55/396 (13%), Positives = 125/396 (31%), Gaps = 65/396 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWNEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R+ D ++ + + V E+ + IP + ++ A++ Sbjct: 252 FHWRSDPRKDDEWYRRECEKI-DNPVVVAQELDLNYSASAEGILIPSDWVQAAVDAHIRL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P + D+A+EG D R G ++E++ +WS +D+ + K+ G E+ Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 370 Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401 + D + GA + L V GQ Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRKAARRPPILATPFRGSGAVFDPDDEAVRGDNGQA 430 Query: 402 RAVDLEFCRNRRTELHVKMADWLE--------------------FASLINHSGLIQNLKS 441 ++ +F N + + + +++ + LI L S Sbjct: 431 ARLNKDFFANAKAQSWWYLRKLFRNTYRAVVEGMAYNPDEIISISSTMESKDKLIIEL-S 489 Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ + G++ ++ K+ G +S + +D +M ++A Sbjct: 490 QPTYSINGVGKIVVD-KQPDGTRSPNLADSVMISYA 524 >gi|332088044|gb|EGI93169.1| gp33 TerL [Shigella boydii 5216-82] Length = 539 Score = 89.4 bits (220), Expect = 1e-15, Method: Composition-based stats. Identities = 56/395 (14%), Positives = 120/395 (30%), Gaps = 64/395 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYFVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHSGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R+ D + + + E+ + IP ++ A++ Sbjct: 252 FHWRSDPRKDDEW-YHKECEKIDNPVIVAQELDLNYQASAEGILIPSEWVQAAVDAHIRL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368 P + D+A+EG D LR G ++ + +WS +D+ + K+ GL + + Sbjct: 311 GIQPGGQRLGAMDVADEGRDKNACSLRYGILLNDVQEWSGKGSDIYDSVVKVFGLCDDFG 370 Query: 369 PDAIIIDANNTGART------CDYLEM---------------------LGYHVYRVLGQK 401 D D + GA + L V G+ Sbjct: 371 ADEFRFDEDGLGAGVRGDARAINELREAEGICQITATPFRGSGSVFHPENEAVPGDNGKP 430 Query: 402 RAVDLEFCRNRRTELHVKMADWLE-------------------FASLINHSGLIQNLKSL 442 ++ +F N + + + +++ N L+ L S Sbjct: 431 ARLNKDFFVNAKAQGWWHLRKLFRNTFRALQGMEYDPDEIISISSTMENKDRLLMEL-SQ 489 Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ TG++ ++ K+ G KS + +D +M +A Sbjct: 490 PTWSKNATGKILVD-KQPDGTKSPNLADSVMIAYA 523 >gi|323173153|gb|EFZ58784.1| gp33 TerL protein [Escherichia coli LT-68] Length = 539 Score = 89.4 bits (220), Expect = 1e-15, Method: Composition-based stats. Identities = 56/395 (14%), Positives = 120/395 (30%), Gaps = 64/395 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYFVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHSGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R+ D + + + E+ + IP ++ A++ Sbjct: 252 FHWRSDPRKDDEW-YHKECEKIDNPVIVAQELDLNYQASAEGILIPSEWVQAAVDAHIRL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368 P + D+A+EG D LR G ++ + +WS +D+ + K+ GL + + Sbjct: 311 GIQPGGQRLGAMDVADEGRDKNACSLRYGILLNDVQEWSGKGSDIYDSVVKVFGLCDDFG 370 Query: 369 PDAIIIDANNTGART------CDYLEM---------------------LGYHVYRVLGQK 401 D D + GA + L V G+ Sbjct: 371 ADEFRFDEDGLGAGVRGDARAINELREAEGICQITATPFRGSGSVFHPENEAVPGDNGKP 430 Query: 402 RAVDLEFCRNRRTELHVKMADWLE-------------------FASLINHSGLIQNLKSL 442 ++ +F N + + + +++ N L+ L S Sbjct: 431 ARLNKDFFVNAKAQGWWHLRKLFRNTFRALQGMEYDPDEIISISSTMENKDRLLMEL-SQ 489 Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ TG++ ++ K+ G KS + +D +M +A Sbjct: 490 PTWSKNATGKILVD-KQPDGTKSPNLADSVMIAYA 523 >gi|332759085|gb|EGJ89395.1| gp33 TerL [Shigella flexneri 4343-70] Length = 519 Score = 89.0 bits (219), Expect = 2e-15, Method: Composition-based stats. Identities = 56/396 (14%), Positives = 121/396 (30%), Gaps = 65/396 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 122 ALFWKARKFVETLPVEFRGSWDEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 176 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + DE++ + I L++ R I S+ ++ F + + F Sbjct: 177 TLYLVDESAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 230 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R D ++ + + V E+ + IP ++ A++ Sbjct: 231 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 289 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P + D+A+EG D R G ++E++ +WS +D+ + K+ G E+ Sbjct: 290 GIQPTGKRLGAMDVADEGRDKNSFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 349 Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401 + D + GA + L V GQ Sbjct: 350 LEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 409 Query: 402 RAVDLEFCRNRRTELHVKMADWL--------EFASLINH------------SGLIQNLKS 441 ++ +F N + + ++ E LI L S Sbjct: 410 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMDYNPDEIISISSSMALKDKLIIEL-S 468 Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ + G++ I+ K+ G +S + +D +M ++A Sbjct: 469 QPTYSINGVGKIVID-KQPDGTRSPNLADSVMISYA 503 >gi|191172603|ref|ZP_03034142.1| gp33 TerL [Escherichia coli F11] gi|190907076|gb|EDV66676.1| gp33 TerL [Escherichia coli F11] gi|324014340|gb|EGB83559.1| hypothetical protein HMPREF9533_01599 [Escherichia coli MS 60-1] Length = 540 Score = 89.0 bits (219), Expect = 2e-15, Method: Composition-based stats. Identities = 56/396 (14%), Positives = 120/396 (30%), Gaps = 65/396 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R D ++ + + V E+ + IP ++ A++ Sbjct: 252 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P + D+A+EG D R G ++E++ +WS +D+ + + G E+ Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVENVFGFCEQDN 370 Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401 + D + GA + L V GQ Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 430 Query: 402 RAVDLEFCRNRRTELHVKMADWL--------EFASLINH------------SGLIQNLKS 441 ++ +F N + + ++ E + LI L S Sbjct: 431 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMAYNPDEIISISSSMALKDKLIIEL-S 489 Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ + G++ I+ K+ G +S + +D +M +A Sbjct: 490 QPTYSINGVGKIVID-KQPDGTRSPNLADSVMINYA 524 >gi|333006277|gb|EGK25786.1| gp33 TerL [Shigella flexneri K-218] Length = 540 Score = 88.6 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 56/395 (14%), Positives = 123/395 (31%), Gaps = 63/395 (15%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWDEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + DE++ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYLVDESAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R D ++ + + V E+ + IP ++ A++ Sbjct: 252 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P + D+A+EG D R G ++E++ +WS +D+ + K+ G E+ Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNSFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 370 Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401 + D + GA + L V GQ Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 430 Query: 402 RAVDLEFCRNRRTELHVKMADWL----------------EFASLINHSGLIQNL---KSL 442 ++ +F N + + ++ E S+ + L L S Sbjct: 431 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMDYNPDEIISISSSMALKDKLIIELSQ 490 Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ + G++ I+ K+ G +S + +D +M ++A Sbjct: 491 PTYSINGVGKIVID-KQPDGTRSPNLADSVMISYA 524 >gi|320179507|gb|EFW54461.1| Phage terminase, large subunit [Shigella boydii ATCC 9905] Length = 539 Score = 88.6 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 56/395 (14%), Positives = 120/395 (30%), Gaps = 64/395 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYFVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHSGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R+ D + + + E+ + IP ++ A++ Sbjct: 252 FHWRSDPRKDDEW-YHKECDKIDNPVIVAQELDLNYQASAEGILIPSEWVQAAVDAHIRL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368 P + D+A+EG D LR G ++ + +WS +D+ + K+ GL + + Sbjct: 311 GIQPGGQRLGAMDVADEGRDKNACSLRYGILLNDVQEWSGKGSDIYDSVVKVFGLCDDFG 370 Query: 369 PDAIIIDANNTGART------CDYLEM---------------------LGYHVYRVLGQK 401 D D + GA + L V G+ Sbjct: 371 ADEFRFDEDGLGAGVRGDARAINELREAEGICQITATPFRGSGSVFHPENEAVPGDNGKP 430 Query: 402 RAVDLEFCRNRRTELHVKMADWLE-------------------FASLINHSGLIQNLKSL 442 ++ +F N + + + +++ N L+ L S Sbjct: 431 ARLNKDFFVNAKAQGWWHLRKLFRNTFRALQGMEYDPDEIISISSTMENKDRLLMEL-SQ 489 Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 ++ TG++ ++ K+ G KS + +D +M +A Sbjct: 490 PTWSKNATGKILVD-KQPDGTKSPNLADSVMIAYA 523 >gi|224583103|ref|YP_002636901.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Paratyphi C strain RKS4594] gi|224467630|gb|ACN45460.1| terminase large subunit [Salmonella enterica subsp. enterica serovar Paratyphi C strain RKS4594] Length = 492 Score = 88.6 bits (218), Expect = 2e-15, Method: Composition-based stats. Identities = 56/384 (14%), Positives = 108/384 (28%), Gaps = 79/384 (20%) Query: 180 RPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL------TERN---ANRFWIMTSNP 230 + G+ N + +EA ++ + E + W+ NP Sbjct: 104 NVENIKGYANFDAALV--EEAENVSKDSWETLIPTVRKEFYSAEYGRVVESEIWVA-YNP 160 Query: 231 RRLSGKFYEIF--NKPLDDW--------KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVT 280 + ++ F N+ D+ QI+ + + + ++ Sbjct: 161 KNRLSDTHQRFVTNRIYPDYDENGNRYCIVKQINYTANPWFPETLRRDMEIMKKANHELY 220 Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRR 338 R G+ + I +E A + +I D ++ G D +R Sbjct: 221 RHVYLGEPVGASEMAIIKFAWLEAATDAHIKLGWKAKGAVIAAHDPSDTGPDAKGYAVRH 280 Query: 339 GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHV 394 G V++ + + D+ + S L D + D + GA DY V Sbjct: 281 GSVVKRVCEGLLMDINEGADWASSLAVIDDVDHFLFDGDGLGAGLRRQITDYFSGKKVTV 340 Query: 395 YRVLGQKRAVDLEF-----------------------CRNRRTELHVKMADWL------- 424 G + D + RN+R + + +AD L Sbjct: 341 TMFKGSESPFDEDAPYQAGAWTDEVVQGDNVRTIGDVFRNKRAQFYYTLADRLYRTYRAV 400 Query: 425 EFASLINHSG----------------LIQNLKSLKSFIVPNTGEL----AIESKRVKGAK 464 E + L L ++ G+L +E K+ G Sbjct: 401 EHGEYADPDEMLSFDKEAIGENILNKLFAELTQIQR-KFNGNGKLELMTKVEMKQKLGIP 459 Query: 465 STDYSDGLMYTFAENPPRSDMDFG 488 S + +D LM + D+ Sbjct: 460 SPNLADALMMCMHCPESVAQPDYS 483 >gi|110804738|ref|YP_688258.1| putative bacteriophage protein [Shigella flexneri 5 str. 8401] gi|110614286|gb|ABF02953.1| putative bacteriophage protein [Shigella flexneri 5 str. 8401] Length = 255 Score = 88.2 bits (217), Expect = 3e-15, Method: Composition-based stats. Identities = 38/238 (15%), Positives = 72/238 (30%), Gaps = 49/238 (20%) Query: 295 SFIPLNIIEEALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SK 350 + I L+ IE A++ + +P +G D+A+ G D V R G V+ +W + Sbjct: 10 AIIKLSWIEAAVDAHKTLNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKE 69 Query: 351 TDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG------------YHVYRVL 398 +L + + + D I+ D+ GA + + R Sbjct: 70 DELLKSCQRTYQAALEREAD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFN 128 Query: 399 GQ----------KRAVDLEFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ---- 437 + +F N + + +AD + LI Sbjct: 129 AGAGVHEPDDEYNGIPNKDFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSR 188 Query: 438 --------NLKSLKSFIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 + G + +ESK+ + S + +D + FA D Sbjct: 189 CPLLEKLKLELTTPHRDFDRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 246 >gi|167993618|ref|ZP_02574712.1| gp33 TerL [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|205328294|gb|EDZ15058.1| gp33 TerL [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] Length = 539 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 56/404 (13%), Positives = 127/404 (31%), Gaps = 67/404 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWNEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 R+ D ++ + +D+ V E+ + IP + ++ A++ Sbjct: 252 FHWRSDPRKDDEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSDWVQAAVDAHIR 309 Query: 312 --PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKY 367 P + D+A+EG D R G ++E++ +WS +D+ + ++ G E+ Sbjct: 310 LGIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVERVFGFCEQD 369 Query: 368 RPDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQ 400 + D + GA + L V GQ Sbjct: 370 NLEEFRFDEDGLGAGVRGDARAINELRKAARRPPILATPFRGSGAVFDPDDEAVRGDNGQ 429 Query: 401 KRAVDLEFCRNRRTELHVKMADWLE--------------------FASLINHSGLIQNLK 440 ++ +F N + + + +++ + LI L Sbjct: 430 AARLNKDFFANAKAQSWWYLRKLFRNTYRAVVEGMAYNPDEIISISSTMESKDKLIIEL- 488 Query: 441 SLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSD 484 S ++ + G++ ++ K+ G +S + +D M ++A D Sbjct: 489 SQPTYSINGVGKIVVD-KQPDGTRSPNLADSAMISYAPMDSSLD 531 >gi|294650848|ref|ZP_06728195.1| bacteriophage terminase large subunit TerL [Acinetobacter haemolyticus ATCC 19194] gi|292823266|gb|EFF82122.1| bacteriophage terminase large subunit TerL [Acinetobacter haemolyticus ATCC 19194] Length = 552 Score = 87.9 bits (216), Expect = 3e-15, Method: Composition-based stats. Identities = 54/337 (16%), Positives = 106/337 (31%), Gaps = 61/337 (18%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 DE + + +++ I S P + +F++ ++ + F + Sbjct: 210 MYFLDEWAFVERQ--EAVDAAISQ--NTNVHIKGSTPNGIGDRFHQ--DRFSGRYAVFSM 263 Query: 254 DTRTVEGIDP--SFHEGIIARYGL------DSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305 R + ++ I + D V EV + IP ++ A Sbjct: 264 PWRANPDKNWTVEYNGKQIHPWYEKQLATLDDVVLAQEVDINYAASVEGVLIPSTWVQLA 323 Query: 306 LNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKIS 361 ++ +P I G D+A+EG D R G V+ +L WS D+ T K Sbjct: 324 IDAHIKLGIEPTGDRIAGLDVADEGKDKNSFASRHGIVMTYLDTWSGKGDDIFGTTQKAM 383 Query: 362 GLVEKYRPDAIIIDANNTGAR------TCDYL-EMLGYHVYRVLGQKRA----------- 403 L D + DA+ GA + L G V + + Sbjct: 384 DLSIDQSIDTLFYDADGLGAGCRGDARVVNELRREQGLSEVDVQPFRGSGAVHEPDEQMV 443 Query: 404 ---VDLEFCRNRRTELHVKMADWLEF-------------------ASLINHSGL--IQNL 439 + +F N + + + + + I+ L + Sbjct: 444 EMRFNKDFFANLKAQSWWSLRLRFQETFRALEGREYDRDMIISFSSEHIDPKELAMLTTE 503 Query: 440 KSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTF 476 S ++ G++ + +K+ G S + +D +M F Sbjct: 504 LSQPTYTKNGVGKILV-NKQPDGTASPNRADSVMICF 539 >gi|322835667|ref|YP_004215693.1| terminase large subunit [Rahnella sp. Y9602] gi|321170868|gb|ADW76566.1| terminase large subunit [Rahnella sp. Y9602] Length = 539 Score = 87.5 bits (215), Expect = 4e-15, Method: Composition-based stats. Identities = 63/404 (15%), Positives = 120/404 (29%), Gaps = 67/404 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + + ++ + + + + Sbjct: 143 ALFWKARKFVEMLPVEFRGGWSAKKHAPYMRVEFPTTGAVLKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYFVDEAAFLQRPLL--IEASLSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGRIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 R+ D +++ A+ +D+ V E+ + IP I A+N Sbjct: 252 FHWRSDPRKDEAWYAKECAK--IDNPVVVAQELDLNYSASAEGVLIPNEWIRAAINAHIK 309 Query: 312 --PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKY 367 P + D+A+EG D R G ++ + +WS +D+ ++ K GL +K+ Sbjct: 310 LGIQPTGKRLGAMDVADEGRDKNAFSARYGFLLTEVEEWSGVGSDIYKSSEKAFGLCDKH 369 Query: 368 RPDAIIIDANNTGARTCDYLEMLG----------YHVYRVLGQKRAVDLE---------- 407 + D + GA + G D E Sbjct: 370 GLEEFRFDEDGLGAGVRGDARAINEIRKAEGARYILATPFRGSASVFDPEAEAVPGDNGQ 429 Query: 408 -------FCRNRRTELHVKMADWLE--------------------FASLINHSGLIQNLK 440 F N + + + + N LI L Sbjct: 430 PARINKDFFANAKAQSWWHLRKLFRNVYRAVEEKMDYNPDEIISISGDIKNLDKLIIEL- 488 Query: 441 SLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSD 484 S ++ + G++ I K+ G KS + SD +M +A D Sbjct: 489 SQPTYSINGVGKI-IVDKQPDGTKSPNLSDSVMINYAPMDTTMD 531 >gi|238790716|ref|ZP_04634478.1| Gp33 TerL [Yersinia frederiksenii ATCC 33641] gi|238721211|gb|EEQ12889.1| Gp33 TerL [Yersinia frederiksenii ATCC 33641] Length = 538 Score = 87.5 bits (215), Expect = 5e-15, Method: Composition-based stats. Identities = 58/356 (16%), Positives = 107/356 (30%), Gaps = 63/356 (17%) Query: 172 MCRTYSEERPDTFVGH-HNTYGMAIINDEASGTP-DVINLGILGFLTERNANRFWIMTSN 229 T S + G I DE++ + L T + S Sbjct: 176 FPETESAMTGEAGDGIGRGDRTSFYIVDESAFLERPYLVDASLSATTNCRQD-----VST 230 Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFP 289 P ++ F E + K F R D ++++ + LD E+ + Sbjct: 231 PNGMANSFAE--RRHSGKIKVFTFHWRDDPRKDDAWYQKQVEN--LDPVTVAQEIDINYS 286 Query: 290 QQDIDSFIPLNIIEEALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347 IP ++ A+N P + DIA+EG D R G ++E + + Sbjct: 287 ASVEGVLIPSAWVQAAINAHEVLGIVPTGQRLGALDIADEGKDTNSFAGRHGFLLESIEE 346 Query: 348 WSKT--DLRTTNNKISGLVEKYRPDAIIIDANNTGAR------TC--DYLEMLGYHVYRV 397 WS D+ T K + + + D + GA E H+ Sbjct: 347 WSGKGDDIFGTVQKAFDICDAQNLETFRFDTDGLGAGARGDARVINEQREEQRRRHIVAT 406 Query: 398 -------------------LGQKRAVDLEFCRNRRTELHVKMADWL-------------- 424 GQ+ ++ +F N + + + Sbjct: 407 PFRGSGGVTDPDDEAVPGDNGQQGRLNKDFFANAKAQGWWSLRTRFQKTYRAVKENMEFD 466 Query: 425 --EFASLINHSGLIQNLK---SLKSFIVPNTGELAIESKRVKGAKSTDYSD-GLMY 474 E S+ + L S ++ V G++ ++ K G KS + +D ++ Sbjct: 467 PDEIISIPKDLKNLTKLTSELSQPTYSVNGVGKIVVDKKPD-GTKSPNLADSAMIL 521 >gi|227113418|ref|ZP_03827074.1| Terminase large subunit [Pectobacterium carotovorum subsp. brasiliensis PBR1692] Length = 472 Score = 87.5 bits (215), Expect = 5e-15, Method: Composition-based stats. Identities = 49/337 (14%), Positives = 94/337 (27%), Gaps = 61/337 (18%) Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQID 254 +EA ++ + + + W+ NP+ + Y+ F P DD ++ Sbjct: 116 WVEEAEAVTKESWDILIPTIRKPGSE-IWVSF-NPKNILDDTYQRFVVTPPDDICLLTVN 173 Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC--P 312 + + + R G+ + I +E A + Sbjct: 174 YTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASEMAIIKREWLEAATDAHIKLGW 233 Query: 313 DPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDA 371 ++ D ++ G D+ +R G V++ + D+ + + L D Sbjct: 234 KAKGAIVAAHDPSDTGPDDKGYAMRHGSVVKRIASPPAPLDVNDGADWATDLAIADGADH 293 Query: 372 IIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF------------------- 408 + D + GA D V G + D + Sbjct: 294 FLFDGDGLGAGLRRQVTDSFTGKKVTVTMFKGSESPFDEDSPYQAGAWFDEVVDGDNIRT 353 Query: 409 ----CRNRRTELHVKMADWL-----------------------EFASLINHSGLIQNLKS 441 RN+R + + +AD L E L L Sbjct: 354 IGDVFRNKRAQFYYTLADRLYLTYRAIVHGEYANPDDMLSFDKEAIGDQMLEKLFAELTQ 413 Query: 442 LKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474 ++ G+L +E K G S + +D LM Sbjct: 414 IQR-KFNGNGKLELMTKVEMKSKLGIPSPNLADSLMM 449 >gi|260906962|ref|ZP_05915284.1| hypothetical protein BlinB_16637 [Brevibacterium linens BL2] Length = 249 Score = 86.7 bits (213), Expect = 8e-15, Method: Composition-based stats. Identities = 45/258 (17%), Positives = 79/258 (30%), Gaps = 40/258 (15%) Query: 50 APRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGIS 109 P WQ + + + + R +GKTT A+ L PG Sbjct: 23 DPELWQERLLRT--------------QEARVLVLCARQVGKTTATAYKALHAAMFNPGRD 68 Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169 V+ ++ S+ Q L + + + P S+ L S+ Sbjct: 69 VLIVSPSQRQSDEML------------RRVASLYRGMKEAPKLSRSNTSEMGLSNGSR-- 114 Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229 + SE F G +I DEAS D + +L + + S Sbjct: 115 -VVSLPGSEGGIRGFAGVK-----LLILDEASRVDDDVFASVLPMVASDGQ---MVALST 165 Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFP 289 P G F+E+ + + W+R ++ + P + A G S V + +F Sbjct: 166 PWGRRGWFHELHQETRNGWERHKVTVYESDQYTPPRIAEVKASLG--SFVFSSDYLCEF- 222 Query: 290 QQDIDSFIPLNIIEEALN 307 + A + Sbjct: 223 GDTDSQLFSTENVRAAFS 240 >gi|315426011|dbj|BAJ47659.1| prophage MuMc02, terminase, ATPase subunit [Candidatus Caldiarchaeum subterraneum] Length = 439 Score = 86.3 bits (212), Expect = 1e-14, Method: Composition-based stats. Identities = 63/333 (18%), Positives = 112/333 (33%), Gaps = 25/333 (7%) Query: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQ- 119 + H +P F+ + RG G T A T P +++ ++ S Q Sbjct: 18 DIRLHPWQKRFIDDPSRFRIILKH-RGAGATFTIAAEACAEALTHPASTILLISYSLRQS 76 Query: 120 LKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE 179 L+ ++ V LS L NK S+ A + + G Sbjct: 77 LE--IFRHVRTILSRLENKRLKHGHSIYRLAAKIGARTVELGNGSRI--------ISLPN 126 Query: 180 RPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYE 239 P++ G+ A+ DEA+ NL T N + S P+ G F+E Sbjct: 127 NPESLRGYRAD---AVYVDEAAFFRGDTNLKTAIMFTTVARNGRVTLVSTPKGKRGWFHE 183 Query: 240 IFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPL 299 + W + + I E + R + R E+ +F ++++FIP Sbjct: 184 AWTTDNT-WSKHLVKLGDSPHITMHDLEEL--RKTMSPLEWRQEMMCEFLD-EVNAFIPY 239 Query: 300 NIIEEALNRE-PCPDPYAPLIMGCDIAEEGGDNTVV--VLRRGPVIE--HLFDWSKTDLR 354 I E + P + +G D D+TV+ V+ G ++ + + Sbjct: 240 EKILECVEDYVPARVVGGRVYVGVDFGRF-RDSTVIIAVVEDGERFRVCYVEELRQKPFA 298 Query: 355 TTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387 I+ P + +D+ GA + L Sbjct: 299 AQLEAINRANMVLHPAIVAVDSTGMGAPLAETL 331 >gi|297520464|ref|ZP_06938850.1| hypothetical protein EcolOP_22727 [Escherichia coli OP50] Length = 313 Score = 86.3 bits (212), Expect = 1e-14, Method: Composition-based stats. Identities = 47/270 (17%), Positives = 90/270 (33%), Gaps = 56/270 (20%) Query: 262 DPSFHEGIIARYGLDSD---VTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC--PDPYA 316 DP E R D V E+ + IP ++ A++ P Sbjct: 30 DPRKDEEWYRRECEKIDNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKLGIQPTG 89 Query: 317 PLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYRPDAIII 374 + D+A+EG D R G ++E++ +WS +D+ + K+ G E+ + Sbjct: 90 KRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDNLEEFRF 149 Query: 375 DANNTGART------CDYLEMLGYH---------------------VYRVLGQKRAVDLE 407 D + GA + L + V GQ ++ + Sbjct: 150 DEDGLGAGVRGDARAINELRNVARRPSILATPFRGSGAVFDPDDEAVRGDNGQAARLNKD 209 Query: 408 FCRNRRTELHVKMADWL--------EFASLINH------------SGLIQNLKSLKSFIV 447 F N + + ++ E + LI L S ++ + Sbjct: 210 FFANAKAQSWWRLRKLFQNTWRAVVEGMAYNPDEIISISSSMALKDKLIIEL-SQPTYSI 268 Query: 448 PNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 G++ I+ K+ G +S + +D +M +A Sbjct: 269 NGVGKIVID-KQPDGTRSPNLADSVMINYA 297 >gi|327191373|gb|EGE58399.1| prophage MuMc02, terminase, ATPase subunit, putative [Rhizobium etli CNPAF512] Length = 248 Score = 86.3 bits (212), Expect = 1e-14, Method: Composition-based stats. Identities = 45/262 (17%), Positives = 86/262 (32%), Gaps = 38/262 (14%) Query: 50 APRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGIS 109 P WQ + NP + + + GK+T+ A+LV+ P Sbjct: 22 EPDPWQANLLRA----------NPRRSMLLCSRQS----GKSTVAAFLVIQTALFVPAAQ 67 Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169 ++ ++ ++ Q L+ + +LS LP +S S Sbjct: 68 IVVVSPTQRQ-SNELFRTIVGFLSRLPGAPRPTAESKQGTEL--------------SNGA 112 Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229 + +E+ G ++ DEA+ D + + + + + + + Sbjct: 113 RVLSLPGTEKTIRGIAGVD-----LVVMDEAARVEDALLTAVRPMMATK-PDARLVALTT 166 Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFP 289 P G FYE + W+R ++ I F + + G E +F Sbjct: 167 PAGKRGWFYEAWVSDDPSWERVRVPASACPRITQQFLDEELKALGA--IKFSEEYGLEFH 224 Query: 290 QQDIDSFIPLNIIEEALNREPC 311 + + PL IIE A +E Sbjct: 225 DPEE-AVFPLAIIEAAFTQEVR 245 >gi|315576663|gb|EFU88854.1| conserved hypothetical protein [Enterococcus faecalis TX0630] Length = 519 Score = 85.9 bits (211), Expect = 2e-14, Method: Composition-based stats. Identities = 67/430 (15%), Positives = 137/430 (31%), Gaps = 62/430 (14%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS----LLPNKHWFEMQ 144 GK+ L++ + +WL +A + + V+ L + K + Sbjct: 92 GKSWLSSRIAVWLA---DHNRRCYVAGGKKDTTDIIMQHVTDTLQTVDESIARKLLEPVD 148 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204 L + S G + S + + +G Y I DE++ Sbjct: 149 KLERLQTGLSKRKISFSGGGSIEGISLGEHFKGNKSGNQAIGRGGDY----IIDESAFVS 204 Query: 205 DVIN--LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL--DDWKRFQIDTRTVEG 260 + LG F N SNP G+FY+ + D RT Sbjct: 205 NETYAELGRRNFANVDGKNYLSFEISNP-HNKGRFYDKLTQENIPKGMLVVWADVRTAFE 263 Query: 261 IDP-SFHEGIIARYGLDSDVTRVE------VCGQFPQQDIDSFIPLNIIEEALNREPCPD 313 D E +I S+ + + + P ++ D EE + Sbjct: 264 EDRVKSIEQVI-----SSEFFQNKSTCQRYFLCELPDENEDGMFGTPQTEE-----EHTE 313 Query: 314 PYAPLIMGCDIAEEGGDN-----TVVVLRRGPVIEHLFDWSK---TDLRTTNNKISGL-- 363 +G D A +G D + + + + + K D T+ I+ L Sbjct: 314 KNWEYFLGVDSAYKGKDKIKATLSALDAQGQVHVIDTIEIEKGDWQDGVTSKKIITQLLM 373 Query: 364 -VEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAV---------DLEFCRN 411 +E + + +D G + L + + ++ + ++ N Sbjct: 374 IIEHFEVKGVCVDV-GYGVYIVEGLAHINGDFELHGINFGAGTTKERVEKKHYSAKYGAN 432 Query: 412 RRTELHVKMADWLEFASLINHSGLIQ---NLKSLKSFIVPNTGELAIESK---RVKGAKS 465 +R E+H+ + + ++ ++ + + + L S + + G+ AI K + K S Sbjct: 433 KRAEMHIDLQENIDNRNIFFTEKVYEEVIDELVLVSSKIKSNGKTAIVPKEEIKAKLGHS 492 Query: 466 TDYSDGLMYT 475 D D ++ + Sbjct: 493 PDTLDSVLLS 502 >gi|255975409|ref|ZP_05425995.1| predicted protein [Enterococcus faecalis T2] gi|255968281|gb|EET98903.1| predicted protein [Enterococcus faecalis T2] Length = 519 Score = 85.9 bits (211), Expect = 2e-14, Method: Composition-based stats. Identities = 67/430 (15%), Positives = 137/430 (31%), Gaps = 62/430 (14%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS----LLPNKHWFEMQ 144 GK+ L++ + +WL +A + + V+ L + K + Sbjct: 92 GKSWLSSRIAVWLA---DHNRRCYVAGGKKDTTDIIMQHVTDTLQTVDESIARKLLEPVD 148 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204 L + S G + S + + +G Y I DE++ Sbjct: 149 KLERLQTGLSKRKISFSGGGSIEGISLGEHFKGNKSGNQAIGRGGDY----IIDESAFVS 204 Query: 205 DVIN--LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL--DDWKRFQIDTRTVEG 260 + LG F N SNP G+FY+ + D RT Sbjct: 205 NETYAELGRRNFANVDGKNYLSFEISNP-HNKGRFYDKLTQENIPKGMLVVWADVRTAFE 263 Query: 261 IDP-SFHEGIIARYGLDSDVTRVE------VCGQFPQQDIDSFIPLNIIEEALNREPCPD 313 D E +I S+ + + + P ++ D EE + Sbjct: 264 EDRVKSIEQVI-----SSEFFQNKSTCQRYFLCELPDENEDGMFGTPQTEE-----EHTE 313 Query: 314 PYAPLIMGCDIAEEGGDN-----TVVVLRRGPVIEHLFDWSK---TDLRTTNNKISGL-- 363 +G D A +G D + + + + + K D T+ I+ L Sbjct: 314 KNWEYFLGVDSAYKGKDKIKATLSALDAQGQVHVIDTIEIEKGNWQDGVTSKKIITQLLM 373 Query: 364 -VEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAV---------DLEFCRN 411 +E + + +D G + L + + ++ + ++ N Sbjct: 374 IIEHFEVKGVCVDV-GYGVYIVEGLAHINGDFELHGINFGAGTTKERVEKNHYSAKYGAN 432 Query: 412 RRTELHVKMADWLEFASLINHSGLIQ---NLKSLKSFIVPNTGELAIESK---RVKGAKS 465 +R E+H+ + + ++ ++ + + + L S + + G+ AI K + K S Sbjct: 433 KRAEMHIDLQENIDNRNIFFTEKVYEEVIDELVLVSSKIKSNGKTAIVPKEEIKAKLGHS 492 Query: 466 TDYSDGLMYT 475 D D ++ + Sbjct: 493 PDTLDSVLLS 502 >gi|29376621|ref|NP_815775.1| hypothetical protein EF2112 [Enterococcus faecalis V583] gi|257090386|ref|ZP_05584747.1| predicted protein [Enterococcus faecalis CH188] gi|307276045|ref|ZP_07557178.1| hypothetical protein HMPREF9521_01673 [Enterococcus faecalis TX2134] gi|29344085|gb|AAO81845.1| hypothetical protein EF_2112 [Enterococcus faecalis V583] gi|256999198|gb|EEU85718.1| predicted protein [Enterococcus faecalis CH188] gi|306507375|gb|EFM76512.1| hypothetical protein HMPREF9521_01673 [Enterococcus faecalis TX2134] Length = 519 Score = 85.9 bits (211), Expect = 2e-14, Method: Composition-based stats. Identities = 67/430 (15%), Positives = 137/430 (31%), Gaps = 62/430 (14%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS----LLPNKHWFEMQ 144 GK+ L++ + +WL +A + + V+ L + K + Sbjct: 92 GKSWLSSRIAVWLA---DHNRRCYVAGGKKDTTDIIMQHVTDTLQTVDESIARKLLEPVD 148 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204 L + S G + S + + +G Y I DE++ Sbjct: 149 KLERLQTGLSKRKISFSGGGSIEGISLGEHFKGNKSGNQAIGRGGDY----IIDESAFVS 204 Query: 205 DVIN--LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL--DDWKRFQIDTRTVEG 260 + LG F N SNP G+FY+ + D RT Sbjct: 205 NETYAELGRRNFANVDGKNYLSFEISNP-HNKGRFYDKLTQENIPKGMLVVWADVRTAFE 263 Query: 261 IDP-SFHEGIIARYGLDSDVTRVE------VCGQFPQQDIDSFIPLNIIEEALNREPCPD 313 D E +I S+ + + + P ++ D EE + Sbjct: 264 EDRVKSIEQVI-----SSEFFQNKSTCQRYFLCELPDENEDGMFGTPQTEE-----EHTE 313 Query: 314 PYAPLIMGCDIAEEGGDN-----TVVVLRRGPVIEHLFDWSK---TDLRTTNNKISGL-- 363 +G D A +G D + + + + + K D T+ I+ L Sbjct: 314 KNWEYFLGVDSAYKGKDKIKATLSALDAQGQVHVIDTIEIEKGDWQDGVTSKKIITQLLM 373 Query: 364 -VEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAV---------DLEFCRN 411 +E + + +D G + L + + ++ + ++ N Sbjct: 374 IIEHFEVKGVCVDV-GYGVYIVEGLAHINGDFELHGINFGAGTTKERVEKNHYSAKYGAN 432 Query: 412 RRTELHVKMADWLEFASLINHSGLIQ---NLKSLKSFIVPNTGELAIESK---RVKGAKS 465 +R E+H+ + + ++ ++ + + + L S + + G+ AI K + K S Sbjct: 433 KRAEMHIDLQENIDNRNIFFTEKVYEEVIDELVLVSSKIKSNGKTAIVPKEEIKAKLGHS 492 Query: 466 TDYSDGLMYT 475 D D ++ + Sbjct: 493 PDTLDSVLLS 502 >gi|315575102|gb|EFU87293.1| conserved hypothetical protein [Enterococcus faecalis TX0309B] gi|315582529|gb|EFU94720.1| conserved hypothetical protein [Enterococcus faecalis TX0309A] Length = 407 Score = 85.5 bits (210), Expect = 2e-14, Method: Composition-based stats. Identities = 52/320 (16%), Positives = 106/320 (33%), Gaps = 51/320 (15%) Query: 195 IINDEASGTPDVIN--LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL--DDWKR 250 I DE++ + LG F N SNP G+FY+ + Sbjct: 83 YIIDESAFVSNETYAELGRRNFANVDGKNYLSFEISNP-HNKGRFYDKLTQENIPKGMLV 141 Query: 251 FQIDTRTVEGIDP-SFHEGIIARYGLDSDVTRVE------VCGQFPQQDIDSFIPLNIIE 303 D RT D E +I S+ + + + P ++ D E Sbjct: 142 VWADVRTAFEEDRVKSIEQVI-----SSEFFQNKSTCQRYFLCELPDENEDGMFGTPQTE 196 Query: 304 EALNREPCPDPYAPLIMGCDIAEEGGDN-----TVVVLRRGPVIEHLFDWSK---TDLRT 355 E + +G D A +G D + + + + + K D T Sbjct: 197 E-----EHTEKNWEYFLGVDSAYKGKDKIKATLSALDAQGQVHVIDTIEIEKGDWQDGVT 251 Query: 356 TNNKISGL---VEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAV------ 404 + I+ L +E + + +D G + L + + ++ + Sbjct: 252 SKKIITQLLMIIEHFEVKGVCVDV-GYGVYIVEGLAHINGDFELHGINFGAGTTKERVEK 310 Query: 405 ---DLEFCRNRRTELHVKMADWLEFASLINHSGLIQ---NLKSLKSFIVPNTGELAIESK 458 ++ N+R E+H+ + + ++ ++ + + + L S + + G+ AI K Sbjct: 311 NHYSAKYGANKRAEMHIDLQENIDNRNIFFTEKVYEEVIDELVLVSSKIKSNGKTAIVPK 370 Query: 459 ---RVKGAKSTDYSDGLMYT 475 + K S D D ++ + Sbjct: 371 EEIKAKLGHSPDTLDSVLLS 390 >gi|315034678|gb|EFT46610.1| conserved hypothetical protein [Enterococcus faecalis TX0027] Length = 519 Score = 85.2 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 68/431 (15%), Positives = 138/431 (32%), Gaps = 64/431 (14%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS----LLPNKHWFEMQ 144 GK+ L++ + +WL +A + + V+ L + K + Sbjct: 92 GKSWLSSRIAVWLA---DHNRRCYVAGGKKDTTDIIMQHVTDTLQTVDESIARKLLEPVD 148 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204 L + S G + S + + +G Y I DE++ Sbjct: 149 KLERLQTGLSKRKISFSGGGSIEGISLGEHFKGNKSGNQAIGRGGDY----IIDESAFVS 204 Query: 205 DVIN--LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL--DDWKRFQIDTRTVEG 260 + LG F N SNP G+FY+ + D RT Sbjct: 205 NETYAELGRRNFANVDGKNYLSFEISNP-HNKGRFYDKLTQENIPKGMLVVWADVRTAFE 263 Query: 261 IDP-SFHEGIIARYGLDSDVTRVE------VCGQFPQQDIDSFIPLNIIEEALNREPCPD 313 D E +I S+ + + + P ++ D EE + Sbjct: 264 EDRVKSIEQVI-----SSEFFQNKSTCQRYFLCELPDENEDGMFGTPQTEE-----EHTE 313 Query: 314 PYAPLIMGCDIAEEGGDN-----TVVVLRRGPVIEHLFDWSK---TDLRTTNNKISGL-- 363 +G D A +G D + + + + + K D T+ I+ L Sbjct: 314 KDWEYFLGVDSAYKGKDKIKATLSALDAQGQVHVIDTIEIEKGDWQDGVTSKKIITQLLM 373 Query: 364 -VEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAV---------DLEFCRN 411 +E + + +D G + L + + ++ + ++ N Sbjct: 374 IIEHFDVKGVCVDV-GYGVYIVEGLAHINGDFELHGINFGAGTTKERVEKNHYSAKYGAN 432 Query: 412 RRTELHVKMADWLEFASLI----NHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAK 464 +R E+H+ + + ++ ++ + +I L + S + + G+ AI K + K Sbjct: 433 KRAEMHIDLQENIDNRNIFFTEKVYEEVIDELVLISS-KIKSNGKTAIVPKEEIKAKLGH 491 Query: 465 STDYSDGLMYT 475 S D D ++ + Sbjct: 492 SPDTLDSVLLS 502 >gi|53793591|ref|YP_112491.1| terminase large subunit [Flavobacterium phage 11b] gi|53748181|emb|CAH56642.1| terminase large subunit [Flavobacterium phage 11b] Length = 432 Score = 83.2 bits (204), Expect = 9e-14, Method: Composition-based stats. Identities = 52/314 (16%), Positives = 114/314 (36%), Gaps = 38/314 (12%) Query: 196 INDEASGTPDVINLGILG----FLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD----- 246 DE + + L + + T NP + + + + K + Sbjct: 126 FIDECNQITYKAWQIVKSRIRYKLNQYGIEPKMLGTCNPAKNW-VYAQFYLKDKNGTLDN 184 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDS-FIPLNIIEEA 305 D K Q + S+ +++ LD + + G + + + I I+ Sbjct: 185 DKKFIQALPTDNPHLPASYLTSLLS---LDENSKQRLYYGNWEYDNDPAKLIDYEKIQNC 241 Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365 P + + + DIA G D V+ + G + +F +K+ + + GL Sbjct: 242 FTNTFIP--FGEMYISADIARFGSDKMVICVWSGFRVVEIFSMAKSSITEIAEAVRGLSI 299 Query: 366 KYRP--DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLE----FCRNRRTELHVK 419 K++ +I D + + + RA++++ +N +T+ + K Sbjct: 300 KHKVPLSNVICDED-----GVGGGVVDVLGCTGFINNSRAMEVDNQVVQYQNLKTQCYYK 354 Query: 420 MAD-------WLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYS 469 +A+ ++ + + + L+ +K + + G+L + SK + +S DYS Sbjct: 355 LAEVIQSNNLYIHSEDATVNDEITKELEQVKRDKIDSDGKLQLISKDKVKQAIGRSPDYS 414 Query: 470 DGLMY-TFAENPPR 482 D LM + E P+ Sbjct: 415 DALMMRMYFEFKPK 428 >gi|312126991|ref|YP_003991865.1| hypothetical protein Calhy_0759 [Caldicellulosiruptor hydrothermalis 108] gi|311777010|gb|ADQ06496.1| conserved hypothetical protein [Caldicellulosiruptor hydrothermalis 108] Length = 444 Score = 80.5 bits (197), Expect = 6e-13, Method: Composition-based stats. Identities = 57/335 (17%), Positives = 108/335 (32%), Gaps = 39/335 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 AGR GK+T+ V+ +T+ A S Q K + E + N + Sbjct: 54 AGRRFGKSTVTLIDVVHECATKTKQVWYITAPSIDQAK-IYFQEFEQ---RAANNSLLDA 109 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 +P+ L I + + G + EA+ Sbjct: 110 LVKDFKWSPFPEITLINGSKILGRS--------TSRNGVYLRGKGADG---VAITEAAFI 158 Query: 204 PDVIN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD----WKRFQIDTRTV 258 D + I + +RN T N Y++F + L+D +K F Sbjct: 159 KDKVYHDVIRAMVLDRNGVLRLETTPN---GMNYVYKLFQEGLNDSTGYYKSFHATVYDN 215 Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFI----PLNIIEEALNREPCPDP 314 E +D E I + R+E +F + DSFI L + + + P Sbjct: 216 ERLDREELERIRRE--IPELAWRIEYLAEFVE--DDSFIFPWNLLCEVFDDYELKKEPQN 271 Query: 315 YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLR---TTNNKISGLVEKYRPDA 371 +G D+A+ ++VL + ++ + R ++ L KY Sbjct: 272 GHRYSIGVDLAKYQDYTVIIVLDITREPYQIVEYHRYQGRLYTDVVAHVNELQAKY-NAR 330 Query: 372 IIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406 + +DA G + + + + +++ + Sbjct: 331 VYLDATGVGDPIAEQVR----NCEPFVFSEKSRNK 361 >gi|333010190|gb|EGK29625.1| phage terminase large subunit domain protein [Shigella flexneri K-272] gi|333021147|gb|EGK40404.1| phage terminase large subunit domain protein [Shigella flexneri K-227] Length = 235 Score = 80.5 bits (197), Expect = 6e-13, Method: Composition-based stats. Identities = 33/223 (14%), Positives = 63/223 (28%), Gaps = 47/223 (21%) Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVE 365 + +P +G D+A+ G D V R G V+ +W + +L + + Sbjct: 5 KTLNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAAL 64 Query: 366 KYRPDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRA 403 + D I+ D+ GA + + R Sbjct: 65 EREAD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGI 123 Query: 404 VDLEFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ------------NLKSLKS 444 + +F N + + +AD + LI + Sbjct: 124 PNKDFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSRCPLLEKLKLELTTPH 183 Query: 445 FIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 184 RDFDRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 226 >gi|48697520|ref|YP_024878.1| gp33 TerL [Burkholderia phage BcepB1A] gi|47717490|gb|AAT37736.1| gp33 TerL [Burkholderia phage BcepB1A] Length = 532 Score = 79.4 bits (194), Expect = 1e-12, Method: Composition-based stats. Identities = 52/338 (15%), Positives = 105/338 (31%), Gaps = 60/338 (17%) Query: 196 INDEASGTPDV-INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQID 254 DEA+ + L T + S+ L+ F E + K + Sbjct: 203 FVDEAAHLENAQAVDTALAATTNCRID-----ISSVNGLNNPFAE--KRFSGRVKVKTMH 255 Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC--P 312 R D +++ ++ ++ V E+ + IPL I+ A++ + Sbjct: 256 WRDDPRKDDEWYKKQKQKF--NALVVAQEIDIDYSASAEGVLIPLEWIDAAIDADVKLGL 313 Query: 313 DPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLR---TTNNKISGLVEKYRP 369 D+A+EG D R G +++ WS TT I ++ + Sbjct: 314 TVTGQRFSSLDVADEGKDMNAFGSRLGIRMDYAESWSGKGSNIYGTTLRTIGLVIAQNGR 373 Query: 370 DAIIIDANNTGARTCDYLEMLGY--------HVYRVLGQKRAV----------------D 405 D D++ G E + + + + + + Sbjct: 374 DFQF-DSDGLGVGVRGDAEAINALPERKAYPKIDAIAFRGSSSVREPDKQVPGAYKGVKN 432 Query: 406 LEFCRNRRTELHVKMADWLE-------------------FASLINHSGLIQNLKSLKSFI 446 ++F +NR+ + + + E +S I I+ + Sbjct: 433 VDFFQNRKAQEYWALRMRFEATYRAVVEKLEYDPDEIISISSRIPDLQKIRMELHQPLYK 492 Query: 447 VPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSD 484 TG++ I+ K G S +Y+D M +A + Sbjct: 493 PSTTGKIMIQ-KTPDGMVSPNYADMTMMLYAPQQTKRG 529 >gi|269941618|emb|CBI50024.1| phage protein [Staphylococcus aureus subsp. aureus TW20] Length = 599 Score = 79.0 bits (193), Expect = 2e-12, Method: Composition-based stats. Identities = 82/446 (18%), Positives = 132/446 (29%), Gaps = 100/446 (22%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 A RG+GKT L+A L PG +I A +++Q L K E+ Sbjct: 82 ASRGLGKTFLSAVYCLTRCILYPGTKIIITAPTKSQGINVL------------EKIENEL 129 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 S +H + + I + S + S D GH ++ DE Sbjct: 130 LSPLIHREIESINTGNQKPMIAFHNGSWIRVVASN---DNARGHRAN---LLLVDEFVKV 183 Query: 204 P-DVINLGILGFLTERNANRFW---------------IMTSNPRRLSGKFYEIFN----- 242 D+I+ LT + F + S+ S Y+ Sbjct: 184 DEDLIDTVFKKMLTSQREPAFLHKAKYKNYPREENTQMYLSSAWMKSHWAYDSMRSFTKQ 243 Query: 243 ----KPLDDWKRF--QIDTRTVEGIDPSFHEGIIAR-------------------YGLDS 277 K DD K F I T H+ + A +G Sbjct: 244 MLKKKSEDDLKSFVCHIPYYTGVMEKLYSHKQMKAEAQAEGFNKMKFAMEMEAVWWGETE 303 Query: 278 DVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGG---DNTVV 334 F ++ +F P ++ +A P +P ++ D+A GG D +V Sbjct: 304 SAFFNFNTIDFNRKLSQAFYPKEVLVQADINNPIKEPKEKRLLAVDVARMGGNSNDASVF 363 Query: 335 VLRR---------GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385 L R + ++ D D +T +I L + + D I++D N GA D Sbjct: 364 SLIRLLPKGKQQYERQLNYMEDMEGIDFQTQAIRIRQLYDDFDCDYIVLDLKNVGAGILD 423 Query: 386 YLE------MLGYHVYRVLG------QKRAVDLEF--------CRNRR-TELHVKMADWL 424 L G + E N R E+ +AD Sbjct: 424 NLRIPLTDIDRGVEYEPLNVSNDDDLASTCKYPEAPRVIHVINATNERNMEMANLLADNF 483 Query: 425 EFASLINHSGLIQNLKSLKSFIVPNT 450 LI+ ++ + F Sbjct: 484 MRGKF---RLLIREEQAEELFRQDKK 506 >gi|57867562|ref|YP_189190.1| prophage, terminase, ATPase subunit [Staphylococcus epidermidis RP62A] gi|57638220|gb|AAW55008.1| prophage, terminase, ATPase subunit, putative [Staphylococcus epidermidis RP62A phage SP-beta] Length = 599 Score = 79.0 bits (193), Expect = 2e-12, Method: Composition-based stats. Identities = 82/446 (18%), Positives = 132/446 (29%), Gaps = 100/446 (22%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 A RG+GKT L+A L PG +I A +++Q L K E+ Sbjct: 82 ASRGLGKTFLSAVYCLTRCILYPGTKIIITAPTKSQGINVL------------EKIENEL 129 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 S +H + + I + S + S D GH ++ DE Sbjct: 130 LSPLIHREIESINTGNQKPMIAFHNGSWIRVVASN---DNARGHRAN---LLLVDEFVKV 183 Query: 204 P-DVINLGILGFLTERNANRFW---------------IMTSNPRRLSGKFYEIFN----- 242 D+I+ LT + F + S+ S Y+ Sbjct: 184 DEDLIDTVFKKMLTSQREPAFLHKAKYKNYPREENTQMYLSSAWMKSHWAYDSMRSFTRQ 243 Query: 243 ----KPLDDWKRF--QIDTRTVEGIDPSFHEGIIAR-------------------YGLDS 277 K DD K F I T H+ + A +G Sbjct: 244 MLKKKSEDDLKSFVCHIPYYTGVMEKLYSHKQMKAEAQAEGFNKMKFAMEMEAVWWGETE 303 Query: 278 DVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGG---DNTVV 334 F ++ +F P ++ +A P +P ++ D+A GG D +V Sbjct: 304 SAFFNFNTIDFNRKLSQAFYPKEVLVQADINNPIKEPKEKRLLAVDVARMGGNSNDASVF 363 Query: 335 VLRR---------GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385 L R + ++ D D +T +I L + + D I++D N GA D Sbjct: 364 SLIRLLPKGKQQYERQLNYMEDMEGIDFQTQAIRIRQLYDDFDCDYIVLDLKNVGAGILD 423 Query: 386 YLE------MLGYHVYRVLG------QKRAVDLEF--------CRNRR-TELHVKMADWL 424 L G + E N R E+ +AD Sbjct: 424 NLRIPLTDIDRGVEYEPLNVSNDDDLASTCKYPEAPRVIHVINATNERNMEMANLLADNF 483 Query: 425 EFASLINHSGLIQNLKSLKSFIVPNT 450 LI+ ++ + F Sbjct: 484 MRGKF---RLLIREEQAEELFRQDKK 506 >gi|326784324|ref|YP_004324722.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-SSM5] gi|310003555|gb|ADO97951.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-SSM5] Length = 549 Score = 79.0 bits (193), Expect = 2e-12, Method: Composition-based stats. Identities = 66/377 (17%), Positives = 126/377 (33%), Gaps = 51/377 (13%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GK+T+ +LW + P ++V LAN + L L + + L Sbjct: 85 GKSTIVTSYLLWYVLFNPNVNVAILANKAATAREML--------QRLQLSYENLPKWLQQ 136 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP---- 204 W L G ST + I DE + P Sbjct: 137 GILQWNRGSLELENGSKIMAASTSASAVRGMSFN-----------VIFLDEFAFIPNHIA 185 Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEGI 261 D + ++ + I+ S P ++ FY++++ + +++ ++ V G Sbjct: 186 DQFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDAERGSNEYVPTEVHWSEVPGR 243 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD-------- 313 D + E I RVE +F +D+ I + + EP Sbjct: 244 DEVWKEQTIKNTSEQQ--FRVEFECEFL-GSVDTLISPSKLRIMPYHEPMNQNRGLAVFE 300 Query: 314 ---PYAPLIMGCDIAEE-GGDNTVV-VLRRGPVIEHLFDWSKTD---LRTTNNKISGLVE 365 P I+ D++ G D + V+ + + K + N I + + Sbjct: 301 QAIPEHNYILTVDVSRGVGNDYSAFTVMDTTTIPYKMVARYKNNEIKPIVLPNIIVDVAK 360 Query: 366 KYRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421 Y I+ + N+ G + D LE + + G+ + ++T+L VKM+ Sbjct: 361 AYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMS 420 Query: 422 DWLEFASLINHSGLIQN 438 ++ N LI++ Sbjct: 421 TAVKQVGCSNLKALIED 437 >gi|158337379|ref|YP_001518554.1| hypothetical protein AM1_4258 [Acaryochloris marina MBIC11017] gi|158307620|gb|ABW29237.1| conserved domain protein [Acaryochloris marina MBIC11017] Length = 476 Score = 78.6 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 71/443 (16%), Positives = 133/443 (30%), Gaps = 77/443 (17%) Query: 38 WGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWL 97 W + G L+ F WQ + ++ ++ S + + K GR +G + L Sbjct: 41 WIKSGGSLKQFILWD-WQKDVVDWIEEPQSLSDSPKLSVIIK-----GRQLGLSQL---C 91 Query: 98 VLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDV 157 W + W + S W+ ++ ++ + L+ S Sbjct: 92 CSWFLY-------------------KAW-QNSAWVGVIISRTQSDSSLLASRMREMASTA 131 Query: 158 LHCSLGIDSKHYSTMCRT----YSEERPDTFVGHHNTYGMAIINDEASGTPDV--INLGI 211 DS + + D G I+ DEA+ ++ Sbjct: 132 GLVDFSTDSLLKLEISGGGTLHFRSAAVDAVRGI--DSVSGILFDEAAFQTNLKLSLSAA 189 Query: 212 LGFLTERNANRFWIMTSNPRRLSGKFYEIFN-----------------KPLDDW------ 248 +++ ++ I+ S P SG F++ N P++ W Sbjct: 190 TPAMSQVGSDARIILCSTPNGASGHFFDTLNGFDNCVSDIERIRSGELPPVNKWQREDGN 249 Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308 I ++V G +PS+ E + L E + ++ A Sbjct: 250 IAIAIHWKSVYGDNPSYLEDLEKSLSLPKAQIAQEYDLSLTESSS-VVFSFAVVRAAATG 308 Query: 309 EPCPD--PYAPLIMGCDIAEEGGDN--TVVVLRRGP--VIEHLFDWSKTDLRTTNNKISG 362 E P +G D A G D +V + + G + L+ L +I Sbjct: 309 EYEPQFTEDELYYVGVDPAGSGADYFCSVFLKKTGETFTVSKLYRKRTGTLEVHMGRIDE 368 Query: 363 LVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422 ++ P + ++ N G + LE V N + L ++ Sbjct: 369 FIKASNPIKVTVETNGLGQFVYESLESRYGSVIERFNTT--------ANSKGALIGRLQL 420 Query: 423 WLEFASL--INHSGLIQNLKSLK 443 LE + S L Q L S + Sbjct: 421 ALERGHISYPAGSPLEQELLSFR 443 >gi|113200627|ref|YP_717790.1| terminase large subunit [Synechococcus phage syn9] gi|76574526|gb|ABA47091.1| terminase large subunit [Synechococcus phage syn9] Length = 549 Score = 77.8 bits (190), Expect = 4e-12, Method: Composition-based stats. Identities = 57/377 (15%), Positives = 126/377 (33%), Gaps = 51/377 (13%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GK+T+ +LW + ++V LAN + L L + + L Sbjct: 85 GKSTIVTSYLLWYVLFNANVNVAILANKAATAREML--------QRLQLSYENLPKWLQQ 136 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP---- 204 W L G ST + I DE + P Sbjct: 137 GILQWNRGSLELENGSKILAASTSASAVRGMSFN-----------VIFLDEFAFVPNHVA 185 Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEGI 261 D + ++ + I+ S P ++ FY++++ + +++ ++ V G Sbjct: 186 DQFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDAERKANEYIPTEVHWSEVPGR 243 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA----- 316 D ++ E I RVE +F +D+ I + + + +P + Sbjct: 244 DAAWKEQTIKNTSEQQ--FRVEFECEFL-GSVDTLISPSKLRTMVYGDPIAEKNGLSMYE 300 Query: 317 ------PLIMGCDIAEE--GGDNTVVVLRRGPVIEHLFDWSKTDLRT---TNNKISGLVE 365 ++ D++ G + +V+ + L + + N I + Sbjct: 301 KTIQGHTYVITADVSRGVSGDYSAFLVIDTTTIPYKLVAKYRNNDIKPILFPNIIVDVAR 360 Query: 366 KYRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421 Y ++++ N+ G + D LE + + G+ + ++T++ +KM+ Sbjct: 361 NYNHAFVLVEVNDVGGQVADIIQYDLEYDNLLMCAMRGRAGQQLGQGFSGKKTQMGIKMS 420 Query: 422 DWLEFASLINHSGLIQN 438 + N L+++ Sbjct: 421 SATKQVGCSNLKALLED 437 >gi|262276634|ref|ZP_06054439.1| P-loop protein [alpha proteobacterium HIMB114] gi|262225214|gb|EEY75661.1| P-loop protein [alpha proteobacterium HIMB114] Length = 409 Score = 77.5 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 46/302 (15%), Positives = 102/302 (33%), Gaps = 32/302 (10%) Query: 78 FKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137 F+ I+ GR GKT L +L + ++ + K +W ++ K Sbjct: 17 FRVLIT-GRRFGKTHLCLVEILRQARHCDNGKIFYVSPTYRMSKEIMWKQIKKL------ 69 Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197 + W + L I + + +++ D G ++ Sbjct: 70 ----------VKELRWDKYINETELTIVLVNNCQISLKGADKSADNLRGV---GLNFLVL 116 Query: 198 DEASGTPDVIN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL---DDWKRFQI 253 DE + P+ + ++++ AN + P+ Y++F + +WK ++ Sbjct: 117 DEFADIPEEAWTEVLRPTISDKYANGKVLFVGTPKGYGNWSYDMFQRGQAGDPEWKSWKY 176 Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313 T ++P E A+ LD+ R E F + N + D Sbjct: 177 TTIEGGQVEPHEIEQ--AKKDLDARSFRQEYEASFETYA--GVVYYNFDRAKNVKPVPYD 232 Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRG-PVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI 372 A + +G D + + +++G ++ + ++I+ +Y P + Sbjct: 233 QNAVIHIGMDFNIDPMSACLFYVKQGISYFFKEIVIYSSNTQEMIDEIT---RQYDPKRV 289 Query: 373 II 374 I+ Sbjct: 290 IV 291 >gi|61806303|ref|YP_214662.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-SSM4] gi|61563847|gb|AAX46902.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-SSM4] Length = 550 Score = 77.1 bits (188), Expect = 7e-12, Method: Composition-based stats. Identities = 57/376 (15%), Positives = 123/376 (32%), Gaps = 49/376 (13%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GK+T+ +LW + ++V LAN + L L + + + Sbjct: 86 GKSTIVTAYLLWYVLFNANVNVAILANKAPTAREML--------GRLQLSYENLPKWMQQ 137 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVIN 208 W L G ST + I DE + P+ I Sbjct: 138 GILGWNKGSLELENGSKILASSTSASAVRGMSFN-----------IIFLDEFAFVPNHIA 186 Query: 209 LGILGFL---TERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEGID 262 + + I+ S P ++ +FY++++ + +++ ++ V G D Sbjct: 187 EQFFASVYPTISSGKSTKVIIISTPHGMN-QFYKLWHDAERGANNYVATEVHWSQVPGRD 245 Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD--------- 313 + + I RVE +F +D+ I + + ++P + Sbjct: 246 DKWKQQTIEN--TSEAQFRVEFECEFL-GSVDTLITPSKLRIMPYKDPIQENRGLAVYEH 302 Query: 314 --PYAPLIMGCDIAEE-GGDNTVVVLRRGPVIEHLFDWSKTDL----RTTNNKISGLVEK 366 I+ D++ G D + + + + + N I + Sbjct: 303 VQENHNYIITVDVSRGVGNDYSAFCVIDTTTVPYKVVARYKNNQIKPLVFPNLIVDVATN 362 Query: 367 YRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422 Y ++ + N+ G + D LE + + G+ + ++T+L +KM+ Sbjct: 363 YNGAYVLCEVNDIGGQVADIIQYDLEYENLLMVSMRGRAGQQLGQGFSGKKTQLGIKMST 422 Query: 423 WLEFASLINHSGLIQN 438 ++ N LI++ Sbjct: 423 AVKQVGCSNLKALIED 438 >gi|326782611|ref|YP_004323017.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-SM1] gi|310002825|gb|ADO97224.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-SM1] Length = 549 Score = 76.3 bits (186), Expect = 1e-11, Method: Composition-based stats. Identities = 60/377 (15%), Positives = 123/377 (32%), Gaps = 51/377 (13%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GK+T+ +LW + ++V LAN + L L + + L Sbjct: 85 GKSTIVTSYLLWYVLFNDNVNVAILANKAATAREML--------QRLQLSYENLPKWLQQ 136 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP---- 204 W L G ST + I DE + P Sbjct: 137 GILQWNRGSLELENGSKIMAASTSASAVRGMSFN-----------VIFLDEFAFIPNHIA 185 Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEGI 261 D + ++ + I+ S P ++ FY++++ + +++ ++ V G Sbjct: 186 DQFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDAERGTNEYIPTEVHWSEVPGR 243 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE---------EALNREPCP 312 D + E I RVE +F +D+ I + + E Sbjct: 244 DDVWKEQTIKNTSEQQ--FRVEFECEFL-GSVDTLISPSKLRIMPYHDPMKENRGLAIFE 300 Query: 313 D--PYAPLIMGCDIAEE-GGDNTVVV-LRRGPVIEHLFDWSKTD---LRTTNNKISGLVE 365 P ++ D++ G D + + + + + + N + + + Sbjct: 301 QSIPDHNYVITVDVSRGVGNDYSAFCVMDTTTIPYKMVARYRNNEIKPIILPNIVVDVAK 360 Query: 366 KYRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421 Y I+ + N+ G + D LE + + G+ + ++T+L VKM+ Sbjct: 361 NYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMS 420 Query: 422 DWLEFASLINHSGLIQN 438 ++ N LI++ Sbjct: 421 TAVKQVGCSNLKALIED 437 >gi|170023468|ref|YP_001719973.1| hypothetical protein YPK_1222 [Yersinia pseudotuberculosis YPIII] gi|169750002|gb|ACA67520.1| conserved hypothetical protein [Yersinia pseudotuberculosis YPIII] Length = 534 Score = 76.3 bits (186), Expect = 1e-11, Method: Composition-based stats. Identities = 54/402 (13%), Positives = 122/402 (30%), Gaps = 65/402 (16%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F + W + + I+ ++ + + + Sbjct: 143 ALFWKARKFIETLPAEFRGSWDNKKHAPYMRIEFPDSGSIIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 DE++ + I L++ R I S+ ++ F + + F Sbjct: 198 TMYFVDESAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHGGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 R+ D ++ + + E+ + IP ++ A+ + Sbjct: 252 FHWRSDPRKDDAW-YKKECEKIDNPVIVAQELDLNYNAAAEGILIPSEWVQAAIGAHTKL 310 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368 P I D+A+EG D R G +++ L WS +D+ T L ++ Sbjct: 311 GITPSGARIGALDVADEGIDLNAFSSRTGVLLDRLKAWSGKGSDIYATTQDAMILSDEND 370 Query: 369 PDAIIIDANNTGARTCDYLEMLG----------YHVYRVLGQKR---------------- 402 D ++ D++ GA ++ + G Sbjct: 371 CDYLLYDSDGLGAGCRGDGRVINETRQKAGQRQVEIKPFRGSGEVIYPDKPVFKADTKRD 430 Query: 403 -AVDLEFCRNRRTELHVKMADWLEFA--------------------SLINHSGLIQNLKS 441 + ++ NR+ + + + +L LI L S Sbjct: 431 ARTNKDYFANRKAQGWWALRMRFQEVYRAVVKGMPFDPDEIISIDENLPEKEKLIAEL-S 489 Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRS 483 ++ + G++ ++ K G +S +++D +M +A R Sbjct: 490 QPTYTINGAGKVTVD-KAPSGTRSPNHADTVMICYAPEKIRR 530 >gi|326783331|ref|YP_004323723.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage Syn33] gi|310005278|gb|ADO99667.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage Syn33] Length = 549 Score = 76.3 bits (186), Expect = 1e-11, Method: Composition-based stats. Identities = 64/393 (16%), Positives = 127/393 (32%), Gaps = 64/393 (16%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GK+T+ +LW + P ++V LAN + L L + + L Sbjct: 85 GKSTIVTAYLLWYVLFNPNVNVAILANKAATAREML--------GRLQLSYENLPKWLQQ 136 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP---- 204 W L G ST + I DE + P Sbjct: 137 GILQWNRGSLELENGSKILAASTSASAVRGMSFN-----------VIFLDEFAFVPNHIA 185 Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEGI 261 D + ++ + I+ S P ++ FY++++ + +++ ++ V G Sbjct: 186 DQFFSSVYPTVSS-GKSTKVIIISTPHGMN-MFYKLWHDAEQGKNEYLPTEVHWSQVPGR 243 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA-----------LNREP 310 D ++ E I +VE +F +D+ I + + L Sbjct: 244 DAAWKEQTIKNTSEQQ--FKVEFECEFL-GSVDTLISPSKLRTMPYVDPVAQNKGLAIYE 300 Query: 311 CPDPYAPLIMGCDIAEE-GGDNTV-VVLRRGPVIEHLFDWSKTD---LRTTNNKISGLVE 365 + I+ D++ G D + VV+ + + + + N I + + Sbjct: 301 RVEAEHNYIITVDVSRGIGNDYSAFVVVDTTTMPYKVVARYRNNEIKPIIFPNIIIDVAK 360 Query: 366 KYRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421 Y I+ + N+ G + D LE + + G+ + ++T+L VKM+ Sbjct: 361 NYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMS 420 Query: 422 DWLEFAS-------------LINHSGLIQNLKS 441 ++ LI I L + Sbjct: 421 SAVKQVGCSNLKALIEEDKLLIPDYETIAELTT 453 >gi|294508906|ref|YP_003566117.1| hypothetical protein PSR_11004 [Salinibacter ruber M8] gi|294342043|emb|CBH22709.1| conserved hypothetical protein [Salinibacter ruber M8] Length = 255 Score = 75.9 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 50/261 (19%), Positives = 79/261 (30%), Gaps = 40/261 (15%) Query: 50 APRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGIS 109 P WQ + ++ + A + GKTT +A L L Sbjct: 7 DPDPWQEALL----------TSDWERALLNCARQS----GKTTASAALALETALEATDSL 52 Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169 V+ LA + Q K L V + QS + + S I Sbjct: 53 VLILAPARRQSKEFL-RSVRSLYRDAAPDGGLDKQS------ELRLRLENESRIIALPGK 105 Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229 R Y+ + +I DEA+ PD + L ++ S Sbjct: 106 EGTVRGYTAD--------------LVIADEAARVPDAAYVATRPMLAVTGGR--FVGLST 149 Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFP 289 P G FYE + P +W++ ++ + + +F E G R E +F Sbjct: 150 PAGQRGWFYEAWTDPGQEWEQVKVTGQDCPRMTEAFLEQERREMG--DWQFRSEYMCEFT 207 Query: 290 QQDIDSFIPLNIIEEALNREP 310 D IE +L E Sbjct: 208 D-TEDQLFATEHIESSLTSEV 227 >gi|323186590|gb|EFZ71927.1| gp33 TerL protein [Escherichia coli 1357] Length = 503 Score = 75.5 bits (184), Expect = 2e-11, Method: Composition-based stats. Identities = 50/369 (13%), Positives = 105/369 (28%), Gaps = 58/369 (15%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 DEA+ + I L++ R I S+ ++ F + + F Sbjct: 198 TLYFVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHSGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311 R+ D + + + E+ + IP ++ A++ Sbjct: 252 FHWRSDPRKDDEW-YHKECEKIDNPVIVAQELDLNYQASAEGILIPSEWVQAAVDAHIRL 310 Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368 P + D+A+EG D LR G ++ + +WS +D+ + K+ GL + + Sbjct: 311 GIQPGGQRLGAMDVADEGRDKNACSLRYGILLNDVQEWSGKGSDIYDSVVKVFGLCDDFG 370 Query: 369 PDAIIIDANNTGART------CDYLEM---------------------LGYHVYRVLGQK 401 D D + GA + L V G+ Sbjct: 371 ADEFRFDEDGLGAGVRGDARAINELREAEGICQITATPFRGSGSVFHPENEAVPGDNGKP 430 Query: 402 RAVDLEFCRNRRTELHVKMADWLE-----FASLINHSGLIQNLKSLKSFIVPNTGELAIE 456 ++ +F N + + + + I ++ S + N L +E Sbjct: 431 ARLNKDFFVNAKAQGWWHLRKLFRNTFRALQGMEYDPDEIISISST----MENKDRLLME 486 Query: 457 ------SKR 459 SK+ Sbjct: 487 LSQPTWSKK 495 >gi|326782863|ref|YP_004323261.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-RSM4] gi|310004122|gb|ADO98516.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-RSM4] Length = 547 Score = 75.1 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 62/377 (16%), Positives = 120/377 (31%), Gaps = 51/377 (13%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GK+T+ +LW + ++V LAN + L L + + Sbjct: 83 GKSTIVTAYLLWYVLFNANVNVAILANKAATAREML--------QRLQLSYENLPNWMQQ 134 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP---- 204 W L G ST + I DE + P Sbjct: 135 GILQWNRGSLELENGSKIMAASTSASAVRGMSFN-----------VIFLDEFAFIPNHIA 183 Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEGI 261 D + ++ + I+ S P ++ FY++++ + +++ ++ V G Sbjct: 184 DQFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDAERGTNEYVPTEVHWSEVPGR 241 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE-----------EALNREP 310 D + E I RVE +F +D+ I + + L Sbjct: 242 DDVWKEQTIKNTSESQ--FRVEFECEFL-GSVDTLIAPSKLRIMPYHDPITSNRGLAVYE 298 Query: 311 CPDPYAPLIMGCDIAEE-GGDNTVVV-LRRGPVIEHLFDWSKTD---LRTTNNKISGLVE 365 P I+ D++ G D + + + + K + N I + + Sbjct: 299 QVIPEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNIIVDIAK 358 Query: 366 KYRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421 Y I+ + N+ G + D LE + + G+ + ++T+L VKM+ Sbjct: 359 NYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMS 418 Query: 422 DWLEFASLINHSGLIQN 438 + N LI+ Sbjct: 419 TATKQVGCSNLKALIEE 435 >gi|326783550|ref|YP_004323947.1| terminase DNA packaging enzyme large subunit [Synechococcus phage Syn19] gi|310005053|gb|ADO99443.1| terminase DNA packaging enzyme large subunit [Synechococcus phage Syn19] Length = 549 Score = 74.8 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 64/377 (16%), Positives = 126/377 (33%), Gaps = 51/377 (13%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GK+T+ +LW + ++V LAN + L L + + L Sbjct: 85 GKSTIVTSYLLWYVLFNQNVNVAILANKAATSREML--------QRLQLSYENLPKWLQQ 136 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP---- 204 W L G + S R +F I DE + P Sbjct: 137 GILQWNRGSLELENGSKI---MAASTSSSAVRGMSFN--------VIFLDEFAFVPNHIA 185 Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEGI 261 D + ++ + I+ S P ++ FY++++ + +++ ++ V G Sbjct: 186 DQFFSSVYPTISS-GQSTKVIIISTPHGMN-MFYKLWHDAERSKNEYIPTEVHWSEVPGR 243 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC---------- 311 D + E IA +VE +F +D+ I + + +P Sbjct: 244 DAKWKEQTIANTSEQQ--FKVEFECEFL-GSVDTLISPSKLRVMPYHDPIAQNKGLAVYK 300 Query: 312 -PDPYAPLIMGCDIAEE--GGDNTVVVLRRGPVIEHLFDWSKTD---LRTTNNKISGLVE 365 +P I+ D+A + V+ V + + + N I + + Sbjct: 301 RAEPDHNYIITVDVARGTSNDYSAFCVMDTTTVPYEMVARYRNNEIKPIVFPNIIVDVAK 360 Query: 366 KYRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421 Y I+ + N+ G + D LE + + G+ + ++T+L VKM+ Sbjct: 361 NYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMS 420 Query: 422 DWLEFASLINHSGLIQN 438 ++ N LI+ Sbjct: 421 TAVKQVGCSNLKALIEE 437 >gi|18138498|ref|NP_542602.1| probable terminase [Halorubrum phage HF2] gi|32453919|ref|NP_861683.1| hypothetical protein HalHV1gp095 [Halovirus HF1] gi|18000439|gb|AAL55022.1| probable terminase [Halorubrum phage HF2] gi|32346487|gb|AAO61393.1| hypothetical protein [Halovirus HF1] Length = 563 Score = 74.8 bits (182), Expect = 3e-11, Method: Composition-based stats. Identities = 74/464 (15%), Positives = 127/464 (27%), Gaps = 82/464 (17%) Query: 85 GRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144 GR IG + + +L +P L+ ++ Q + +S +L+ N Sbjct: 75 GRRIGVSYIIGICILIEALLKPDTFYPILSKTKGQSN----SRISDIKTLIKNAK----- 125 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204 + + D + G K Y+ + E P + DE + Sbjct: 126 -IDIPLEKDNQDEIVLPNGSRIKAYTGDPDSARGEDPPK----------TVFIDEMAFLE 174 Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYE---------------------IFNK 243 D T + + S P+ + +F + F Sbjct: 175 DQSATLDAYLPTISLGSSQMVQVSTPKAQNDEFMDANERGTPDGRNDFGILALKQPTFKN 234 Query: 244 PLDDWKRFQIDTRTVEGIDPSFHEGIIA-RYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 + + + VE + F + D + E + P D F + I Sbjct: 235 ADEIQTDVSLFEQDVEPVRGDFDLMAAETQRASDPNGFAQEYLCR-PVSDEYRFFSMPTI 293 Query: 303 EEALNREPCPD---------PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW----- 348 E+A+ R D L+MG DI D +VV +L Sbjct: 294 EDAMGRGAADDYSYGLRRYDTPNTLVMGVDIGFNSDDTAIVVFEHEGPRRYLRYHEVVND 353 Query: 349 -----------SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML-GYHVYR 396 S+ + +IS + +I+D G D + G Sbjct: 354 RVLEQAGITPSSRQNPAAVAERISQVYNGMGVSNVIMDMTGVGQGFHDEVRRRIGRGYTG 413 Query: 397 VLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIE 456 + + N LH + WL L + L ++ + + Sbjct: 414 FNFSAKDKVEKMMGNMNYALHNDL-VWL-----PEDDSLREQLGAIVKQQKEDWQKPKFT 467 Query: 457 SKRVKGAKSTDYSDGLMYT--FAENPPRSDMDFGRCPSYQYEGV 498 K + D D L A PP D R Q E V Sbjct: 468 GKE----HAPDGKDDLAMATVLAAFPPNFKSDKSRN-LQQREDV 506 >gi|326784562|ref|YP_004324947.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-SSM7] gi|310004595|gb|ADO98987.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-SSM7] Length = 550 Score = 74.4 bits (181), Expect = 4e-11, Method: Composition-based stats. Identities = 61/378 (16%), Positives = 128/378 (33%), Gaps = 51/378 (13%) Query: 88 IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147 GK+T+ +LW + + ++V LAN + L L + + L Sbjct: 85 TGKSTIVTSYLLWYVLFKANVNVAILANKAATSREML--------QRLQLSYENLPKWLQ 136 Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP--- 204 W L G + S R +F I DE + P Sbjct: 137 QGILQWNRGSLELENGSKI---MAASTSSSAVRGMSFN--------VIFLDEFAFVPNHI 185 Query: 205 -DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEG 260 D + ++ + I+ S P ++ FY++++ + +++ ++ V G Sbjct: 186 ADQFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDAERGKNEYIPTEVHWSAVPG 243 Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC--------- 311 D ++ + IA +VE +F +D+ I + + +P Sbjct: 244 RDAAWKDQTIANTSEQQ--FKVEFECEFL-GSVDTLISPSKLRTMPYEDPIIQNRGLAVY 300 Query: 312 --PDPYAPLIMGCDIAEEGG-DNTVVVLRRGPVI--EHLFDWSKTDL--RTTNNKISGLV 364 + I+ D+A D + + + E + + D+ N I + Sbjct: 301 KQVEKDHNYIVTVDVARGVSQDYSAFCIIDTTTVPYELVAKYRNNDIKPIIFPNVIVDVA 360 Query: 365 EKYRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 + Y ++ + N+ G + D LE + G+ + ++T+L VKM Sbjct: 361 KNYNNAYVLCEVNDIGGQVADIIQFDLEYENLLQVAMRGRAGQQLGQGFSGKKTQLGVKM 420 Query: 421 ADWLEFASLINHSGLIQN 438 + ++ N L++ Sbjct: 421 STAVKAVGCSNLKALLEE 438 >gi|218296727|ref|ZP_03497433.1| protein of unknown function DUF264 [Thermus aquaticus Y51MC23] gi|218242816|gb|EED09350.1| protein of unknown function DUF264 [Thermus aquaticus Y51MC23] Length = 425 Score = 74.4 bits (181), Expect = 5e-11, Method: Composition-based stats. Identities = 73/377 (19%), Positives = 127/377 (33%), Gaps = 46/377 (12%) Query: 88 IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147 GK+ G + + L+ E Q + AE +K + M+S Sbjct: 28 TGKSFALTLEAALHAVEHRGSTWVLLSAGERQSREL--AEKAKAHLDAMKQVGTLMES-R 84 Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD-- 205 L L S+ P T G Y ++ DE + D Sbjct: 85 FFEGGESVTQLEIRLPNLSRLIFLPA------NPRTARG----YTGNVVLDEFAFHQDSE 134 Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE----GI 261 I + +T R + + S P GKF+E++ K W R ++ + Sbjct: 135 AIWAAMYPIIT-RRPDLKIRVMSTPNGPRGKFWELWEKGGPAWSRHKVTIYDAVAQGLPV 193 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP--LI 319 DP +A D + + E +F + +F+P ++I EA RE P+ P Sbjct: 194 DPEELRAGLA----DDFIWQQEYLCEFLSAEE-AFLPWSLILEAEAREDPRGPWNPDQAY 248 Query: 320 MGCDIAEEGGDNTVVVL--RRGPV--IEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIID 375 +G D+ D TV V+ R G V + L + ++ L+ + R + D Sbjct: 249 LGVDVGRH-RDLTVFVVLERVGDVYWVRLLETLHRAPFAQQEARLHALLPQVRRACL--D 305 Query: 376 ANNTGARTCDYLEM-LGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF--ASLINH 432 A G + GY V V +L ++ + E + Sbjct: 306 ATGLGEMLAENARRAFGYKVEPVKFTPEVK---------ADLAQRLRLFFEDRRVRIPED 356 Query: 433 SGLIQNLKSLKSFIVPN 449 L ++L S++ + P+ Sbjct: 357 RALREDLHSVRRIVTPS 373 >gi|182682964|ref|YP_001837088.1| terminase, large subunit [Enterobacteria phage EPS7] gi|182630676|gb|ACB97608.1| terminase, large subunit [Enterobacteria phage EPS7] Length = 438 Score = 73.6 bits (179), Expect = 7e-11, Method: Composition-based stats. Identities = 60/329 (18%), Positives = 115/329 (34%), Gaps = 48/329 (14%) Query: 66 CLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N++ +P +S R +GK+ + A+ + +L P + V+ +A + + L W Sbjct: 45 IINALEDPRHRFVTACVS--RRVGKSFI-AYTLGFLKLLEPNVKVLVVAPNYS-LANIGW 100 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 +++ + ++ + ++ + S + D+ V Sbjct: 101 SQIR----------------GLIKKYGLQTERENAKDKEIELANGSLFKLASAAQADSAV 144 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIF-- 241 G II DEA+ DV L T N + S PR G +++ F Sbjct: 145 GRSYD---FIIFDEAA-ISDVGGAAFDIQLRPTLDKPNSKALFISTPRG--GNWFKEFYE 198 Query: 242 ---NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298 N+ L +W R D + E AR + + R E F + F Sbjct: 199 KGFNETLPNWVSIHGTYRDNPRADLNDIEE--ARRTVSKNYFRQEYEADFSVFEGQIFDT 256 Query: 299 LNIIE-----EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRR----GPVIEHLFDWS 349 N IE + + D ++G D+ D T V+ + V L ++ Sbjct: 257 FNAIEHVKDLKGMRHFFKDDEAFETLLGIDVGY--RDPTAVLTIKYHYDTDVYYVLEEYQ 314 Query: 350 KTD--LRTTNNKISGLVEKYRPDAIIIDA 376 + + I +++Y D I +D+ Sbjct: 315 QAEKTTAQHATYIQHCIDRYNVDRIFVDS 343 >gi|46401884|ref|YP_006983.1| terminase, large subunit [Enterobacteria phage T5] gi|45775062|gb|AAS77194.1| terminase, large subunit [Enterobacteria phage T5] gi|59897286|gb|AAX12081.1| ORF144 [Enterobacteria phage T5] Length = 438 Score = 73.6 bits (179), Expect = 7e-11, Method: Composition-based stats. Identities = 56/329 (17%), Positives = 113/329 (34%), Gaps = 48/329 (14%) Query: 66 CLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N++ +P +S R +GK+ + A+ + +L P + V+ +A + + L W Sbjct: 45 IINALEDPRHRFVTACVS--RRVGKSFI-AYTLGFLKLLEPNVKVLVVAPNYS-LANIGW 100 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 +++ + ++ + ++ + S + D+ V Sbjct: 101 SQIR----------------GLIKKYGLQTERENAKDKEIELANGSLFKLASAAQADSAV 144 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIF-- 241 G II DEA+ DV L T N + S PR G +++ F Sbjct: 145 GRSYD---FIIFDEAA-ISDVGGDAFRVQLRPTLDKPNSKALFISTPRG--GNWFKEFYA 198 Query: 242 ---NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298 + L +W R D + E AR + + R E F + F Sbjct: 199 YGFDDTLPNWVSIHGTYRDNPRADLNDIEE--ARRTVSKNYFRQEYEADFSVFEGQIFDT 256 Query: 299 LNIIE-----EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRR------GPVIEHLFD 347 N I+ + + D ++G D+ D T V+ + + + Sbjct: 257 FNAIDHVKDLKGMRHFFKDDEAFETLLGIDVGY--RDPTAVLTIKYHYDTDTYYVLEEYQ 314 Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 ++ I +++Y+ D I +D+ Sbjct: 315 QAEKTTAQHAAYIQHCIDRYKVDRIFVDS 343 >gi|326633035|ref|YP_004306624.1| terminase large subunit [Enterobacteria phage SPC35] gi|321272229|gb|ADW80121.1| terminase large subunit [Enterobacteria phage SPC35] Length = 438 Score = 73.2 bits (178), Expect = 9e-11, Method: Composition-based stats. Identities = 55/329 (16%), Positives = 113/329 (34%), Gaps = 48/329 (14%) Query: 66 CLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N++ +P +S R +GK+ + A+ + +L P + V+ +A + + L W Sbjct: 45 IINALEDPRHRFVTACVS--RRVGKSFI-AYTLGFLKLLEPNVKVLVVAPNYS-LANIGW 100 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 +++ + ++ + ++ + S + D+ V Sbjct: 101 SQIR----------------GLIKKYGLQTERENAKDKEIELANGSLFKLASAAQADSAV 144 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIF-- 241 G II DEA+ DV L T N + S PR G +++ F Sbjct: 145 GRSYD---FIIFDEAA-ISDVGGDAFRVQLRPTLDKPNSKALFISTPRG--GNWFKEFYA 198 Query: 242 ---NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298 + L +W R D + E AR + + R E F + F Sbjct: 199 YGFDDTLPNWVSIHGTYRDNPRADLNDIEE--ARRTVSKNYFRQEYEADFSVFEGQIFDT 256 Query: 299 LNIIE-----EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRR------GPVIEHLFD 347 N I+ + + D ++G D+ D T V+ + + + Sbjct: 257 FNAIDHVKDLKGMRHFFKDDEAFETLLGIDVGY--RDPTAVLTIKYHYDTDTYYVLEEYQ 314 Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 ++ I +++Y+ D + +D+ Sbjct: 315 QAEKTTAQHAAYIQHCIDRYKVDRVFVDS 343 >gi|116624478|ref|YP_826634.1| hypothetical protein Acid_5400 [Candidatus Solibacter usitatus Ellin6076] gi|116227640|gb|ABJ86349.1| hypothetical protein Acid_5400 [Candidatus Solibacter usitatus Ellin6076] Length = 260 Score = 72.8 bits (177), Expect = 1e-10, Method: Composition-based stats. Identities = 46/260 (17%), Positives = 82/260 (31%), Gaps = 27/260 (10%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 W + V + + + ++ R GK+T+ A + G I Sbjct: 25 EWARRALGFEADAAQARVLDTRSK--RVLLNCTRQWGKSTVTAARAVHEAVKNAGSLTIA 82 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 + + Q T + V K + EM+ + S + Sbjct: 83 VTPTARQ--TGEF--VRKAATFAS---GLEMRVKGDGHNEMSLAFPNGSRIVGLPGTEAT 135 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232 R +S ++ DEAS D + + + L +A W+M S P Sbjct: 136 VRGFSA-------------VTLLLIDEASRVGDDLYMAMRPMLA-VSAGTLWLM-STPHG 180 Query: 233 LSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD 292 G FYE + + W+R + + E G + R E C +F + Sbjct: 181 KRGFFYEAWANGGETWERVSVKAEDCPRFKAEYLEEERQVMGER--IYRQEYCCEF-GET 237 Query: 293 IDSFIPLNIIEEALNREPCP 312 + ++IE A + E P Sbjct: 238 SGAVFDRDLIEAAFSDEVTP 257 >gi|114320225|ref|YP_741908.1| hypothetical protein Mlg_1066 [Alkalilimnicola ehrlichii MLHE-1] gi|114226619|gb|ABI56418.1| hypothetical protein Mlg_1066 [Alkalilimnicola ehrlichii MLHE-1] Length = 463 Score = 72.4 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 71/473 (15%), Positives = 133/473 (28%), Gaps = 64/473 (13%) Query: 15 LFDLMWSDEIKLS-FSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNP 73 + D+M + F W L GF E S Sbjct: 5 IRDVMTDPALFGGQFGGDT-----WAAWRALLSGFYGLPLDDAEAQHWHALTDRESAPQS 59 Query: 74 NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSK--- 130 + + GR GK+ A L ++ + + A EV+ Sbjct: 60 AHDELWLVV--GRRGGKSNAAALLAVYEACFKDHRDAL--AP----------GEVATTRV 105 Query: 131 -WLSLLPNKHWFEMQSLSLHPAPWYSDVL-----HCSLGIDSKHYSTMCRTYSEERPDTF 184 + F S +H P ++ + ++ R TF Sbjct: 106 MAADRAQARSVFRYISGLMHANPMLERLIVREDRESIELSNRAVIEVGTASFRTTRGYTF 165 Query: 185 VGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP 244 +D+++ I + L N I S+P G+ +E + + Sbjct: 166 AAVIADEVAFWRSDDSANPDSEIIAAVRPGLATLNGK--LIALSSPYARRGELWENYRRH 223 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIE 303 + ++PS E ++ E +F + D+++F+ ++E Sbjct: 224 YGKASPILVAQAPSRTMNPSLPERVVTEAMERDPASAAAEYLAEF-RTDVETFLQREVVE 282 Query: 304 EALNREPCPDPYA---PLIMGCDIAEEGGDN--TVVVLRRGPV-IEHLFDWSKTDLRTTN 357 A P PY D A G D + R G + + K Sbjct: 283 AATRPTPLELPYNKRVTYTAFVDPAGGGADEFTAAIGHREGERVVVDVLRARKGTPAEIV 342 Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELH 417 + + L++ YR I D G+ D G V Q + R+ ++ Sbjct: 343 AEYADLLKSYRITRAISDRY-AGSWPADEFSRHGITVE----QAAKPKSDLYRDMLASMN 397 Query: 418 VKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAK-STDYS 469 L L+ L +++E + +G + S D++ Sbjct: 398 SAR------VELPPDDRLMTQL-------------ISLERRTARGGRDSIDHA 431 >gi|51512091|gb|AAU05290.1| terminase large subunit [Enterobacteria phage T5] Length = 438 Score = 72.4 bits (176), Expect = 2e-10, Method: Composition-based stats. Identities = 55/329 (16%), Positives = 112/329 (34%), Gaps = 48/329 (14%) Query: 66 CLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N++ +P +S R +GK+ + A+ + +L P + V+ +A + + L W Sbjct: 45 IINALEDPRHRFVTACVS--RRVGKSFI-AYTLGFLKLLEPNVKVLVVAPNYS-LANIGW 100 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 +++ + ++ + ++ + S + D+ V Sbjct: 101 SQIR----------------GLIKKYGLQTERENAKDKEIELANGSLFKLASAAQADSAV 144 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIF-- 241 G II DEA+ DV L T N + S PR G +++ F Sbjct: 145 GRSYD---FIIFDEAA-ISDVGGDAFRVQLRPTLDKPNSKALFISTPRG--GNWFKEFYA 198 Query: 242 ---NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298 + L +W R D + E AR + + R E F + F Sbjct: 199 YGFDDTLPNWVSIHGTYRDNPRADLNDIEE--ARRTVSKNYFRQEYEADFSVFEGQIFDT 256 Query: 299 LNIIE-----EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRR------GPVIEHLFD 347 N + + + D ++G D+ D T V+ + + + Sbjct: 257 FNATDHVKDLKGMRHFFKDDEAFETLLGIDVGY--RDPTAVLTIKYHYDTDTYYVLEEYQ 314 Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 ++ I +++Y+ D I +D+ Sbjct: 315 QAEKTTAQHAAYIQHCIDRYKVDRIFVDS 343 >gi|307308946|ref|ZP_07588629.1| hypothetical protein SinmeBDRAFT_4513 [Sinorhizobium meliloti BL225C] gi|306900580|gb|EFN31193.1| hypothetical protein SinmeBDRAFT_4513 [Sinorhizobium meliloti BL225C] Length = 408 Score = 72.4 bits (176), Expect = 2e-10, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 71/202 (35%), Gaps = 24/202 (11%) Query: 88 IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN--KHWFEMQS 145 GKT + A V W + + V SE+ +K +W+ + + + + K F++ + Sbjct: 208 WGKTYVAAIAVWWSLVCFDDVKVTIFGPSESLIKNGMWSNLQALHARMASSFKDLFDVSA 267 Query: 146 LSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD 205 + R S + G H + D+A G + Sbjct: 268 TRVSRKTAAP------------SCFAEYRLVSADNASAARGIHAVNN-FVFVDDADGVSE 314 Query: 206 VINLGILGFLTERNANRFWI--MTSN--PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGI 261 V+ ++ + + N + M +N P+ + E+FN+ L + + Sbjct: 315 VVIAYLMNIMIDPNPKLCLLSTMFANETPKLETVTEAELFNEALSSLRAM-VSGEV--RT 371 Query: 262 DPSFHEGIIARYGLDSDVTRVE 283 DP + E I RY L++ Sbjct: 372 DPVWLEAI--RYQLENAEYLAR 391 >gi|331650684|ref|ZP_08351739.1| conserved hypothetical protein [Escherichia coli M605] gi|331040472|gb|EGI12647.1| conserved hypothetical protein [Escherichia coli M605] Length = 414 Score = 72.4 bits (176), Expect = 2e-10, Method: Composition-based stats. Identities = 44/325 (13%), Positives = 98/325 (30%), Gaps = 45/325 (13%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 89 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 143 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 + DEA+ + I L++ R I S+ ++ F + + F Sbjct: 144 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 197 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 R D ++ + +D+ V E+ + IP ++ ++ Sbjct: 198 FHWRDDPRKDEEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSEWVQATVDAHIK 255 Query: 312 --PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKY 367 P + D+A+EG D R G ++E++ +WS +D+ + K+ G E+ Sbjct: 256 LGIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQD 315 Query: 368 RPDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQ 400 + D + GA + L + V GQ Sbjct: 316 NLEEFRFDEDGLGAGVRGDARAINELRNVARRPSILATPFRGSGAVFDPDDEAVRGDNGQ 375 Query: 401 KRAVDLEFCRNRRTELHVKMADWLE 425 ++ +F N + + ++ + Sbjct: 376 AARLNKDFFANAKAQSWWRLRKLFQ 400 >gi|61806000|ref|YP_214360.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-SSM2] gi|61374509|gb|AAX44506.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-SSM2] gi|265525210|gb|ACY76007.1| terminase large subunit gp17 [Prochlorococcus phage P-SSM2] Length = 547 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 62/378 (16%), Positives = 124/378 (32%), Gaps = 51/378 (13%) Query: 88 IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147 GK+T +L ++V LAN + + L L + + + Sbjct: 82 TGKSTTCISYLLHYAVFNDNVNVAVLANKASTARDLL--------GRLQLAYENLPRWMQ 133 Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP--- 204 W L G ST + I DE + P Sbjct: 134 QGIISWNKGSLELENGSKISANSTSSSAVRGGSYN-----------VIFLDEFAFIPNHI 182 Query: 205 -DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEG 260 D + +T + I+ S PR ++ FY +++ K ++ + V G Sbjct: 183 ADDFFASVYPTITS-GQSTKVIIVSTPRGMN-HFYRMWHDSEKGKSEYVATDVHWSEVPG 240 Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFI----PLNIIEEA-------LNRE 309 D + E IA ++E +F +++ I N++ EA L+ Sbjct: 241 RDEEWKEQTIANTSEQQ--FKIEFECEFL-GSVNTLINPAKLRNLVYEAPKTRNAGLDIY 297 Query: 310 PCPDPYAPLIMGCDIAEE-GGDNTV-VVLRRGPVIEHLFDWSKTD---LRTTNNKISGLV 364 P I+ D+A G D + +V + + + N I + Sbjct: 298 ETPVKEHNYIITVDVARGLGNDYSAFIVFDTTEFPYKVVAKYRNNEIKPMLFPNIILDVA 357 Query: 365 EKYRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 + Y ++I+ N+ G + LE + + G+ + + ++T+L V+M Sbjct: 358 KGYNNAYLLIEVNDIGDQVASILQYDLEYENVLMASMRGRAGQIVGQGFSGKKTQLGVRM 417 Query: 421 ADWLEFASLINHSGLIQN 438 ++ N ++++ Sbjct: 418 TSAVKKLGCSNLKTMMED 435 >gi|255929035|ref|YP_003097347.1| DNA terminase packaging enzyme large subunit [Synechococcus phage S-RSM4] gi|255705321|emb|CAR63310.1| DNA terminase packaging enzyme large subunit [Synechococcus phage S-RSM4] Length = 550 Score = 72.1 bits (175), Expect = 2e-10, Method: Composition-based stats. Identities = 47/358 (13%), Positives = 103/358 (28%), Gaps = 59/358 (16%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 +Q E + + N P GK+T +L+ +++ Sbjct: 62 DFQKEILRDFHENRFNIAKLPRQ------------TGKSTTVVAYLLYYAIFYDSVNIGI 109 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 LAN + + L L + + + W + G ST Sbjct: 110 LANKASTARELL--------GRLQLAYENLPKWMQHGILVWNKGNVELENGSKILAASTS 161 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI----NLGILGFLTERNANRFWIMTS 228 + + DE + P+ + + +T + I+ S Sbjct: 162 ASAVRGMSFN-----------ILFLDEFAFVPNHVAEQFFASVYPTITS-GKSTKVIIIS 209 Query: 229 NPRRLSGKFYEIF---NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC 285 P ++ FY+++ + +D+ ++ V G D + E I E Sbjct: 210 TPNGMN-HFYKMWEDARRGKNDYVTNEVHWSQVPGRDAKWKEETIKN--TSPRQFAQEFE 266 Query: 286 GQFPQQDIDSFIPLNIIE-----------EALNREPCPDPYAPLIMGCDIAE--EGGDNT 332 F D+ I ++ L+ I+ D+A G + Sbjct: 267 CDFL-GSADTLISPAKLQNIPFHDPIQSNAGLDVYERVQKDHEYIITVDVARGIGGDYSA 325 Query: 333 VVVLRRGPVIEHLFDWSKTDLRT---TNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387 +V + + + + + I + ++Y ++++ N+ G L Sbjct: 326 FIVFDITTMPYKIVAKYRNNEIKPVLFPSVIFQVCKEYNNPYVLVEVNDIGDSIAATL 383 >gi|329849103|ref|ZP_08264131.1| phage terminase, large subunit, PBSX family [Asticcacaulis biprosthecum C19] gi|328844166|gb|EGF93735.1| phage terminase, large subunit, PBSX family [Asticcacaulis biprosthecum C19] Length = 430 Score = 71.3 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 69/435 (15%), Positives = 133/435 (30%), Gaps = 43/435 (9%) Query: 58 FMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE 117 +E + A+ + F+ A GRG K+ A ++ PG V+ + + Sbjct: 24 ILEPIPAYRFLTKKPLGSFRFRAA-YGGRGAAKSWEFANAAIYHSLNTPGARVVFVREIQ 82 Query: 118 TQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS 177 L + + V L + F + H +++L L + Sbjct: 83 GSLADSAFTLVRNRLEAYGLEGAFRQANGRFHHVENGAEILFLGL-------------WR 129 Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 +P+ I +EAS ++ + + W + NP + Sbjct: 130 GNKPEGIKSL--EGATLTIWEEASEGRQRSLDVLIPTVLRTPQSELWCLW-NPMLPTDPV 186 Query: 238 YEIFNKPLDDWKRF--QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDS 295 F ++ K +++ + + E + D G + ++ Sbjct: 187 DRFFRGDVEPQKTICRRVNWDSNPHFPEALREQMALDRKKDPLRAAWIWDGAYMPSAQNA 246 Query: 296 FIPLNIIEEAL--NREPCPDPYAPLIMGCDIAEEGGDNT--VVVLRRGPVIEHLFDWSKT 351 +++ A R+ + +++G D A GGD VV R G + D Sbjct: 247 LWTRELLDRAWVQGRDKVMEAVGRVVVGVDPAGGGGDEVGIVVAGRYGAEGYIVLDDRSV 306 Query: 352 DLRTT---NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEF 408 R+ ++ V+ Y D ++++ N G L V V + R V Sbjct: 307 AARSPEGWATEVLRAVDAYAADCVVVEKN-FGG----DLVASNLRVNGVHCRIREVTASR 361 Query: 409 CRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDY 468 + R E + + + L L + G KS D Sbjct: 362 GKQVRAEPIAALYEQHKVYHRRPFPALEGQLL-----QMTPNGYAV-------KGKSPDR 409 Query: 469 SDGLMYTFAENPPRS 483 D L++ E RS Sbjct: 410 LDALVWALTELSRRS 424 >gi|291336011|gb|ADD95601.1| large terminase protein [uncultured phage MedDCM-OCT-S09-C7] Length = 526 Score = 71.3 bits (173), Expect = 4e-10, Method: Composition-based stats. Identities = 58/355 (16%), Positives = 110/355 (30%), Gaps = 47/355 (13%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141 + A R GK+ + +LW + P ++V LAN + Sbjct: 80 VLASRQSGKSITSCAYLLWFLLFNPEVTVAVLANKG-----------------AIAREMI 122 Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCR-TYSEERPDTFVGHHNTYGMAIINDEA 200 L P++ L S ++ + + + G + DE Sbjct: 123 ARMVTMLESVPFFLQPGVKILNKGSIEFANDSKVVAAATSSSSIRGL---SINLLYLDEF 179 Query: 201 SGTPDV--INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD---WKRFQIDT 255 + D +T + I+TS + FY+I+ + D +K F I+ Sbjct: 180 AFVDDAETFYTATYPVVTS-GKDSKVIITSTANGVGNMFYKIYESAVHDQSEYKHFLINW 238 Query: 256 RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP----- 310 V G D + + IA S+ + G ++ I N + +++EP Sbjct: 239 FDVPGRDEEWKKETIAN---TSEAQFEQEYGNSFLGTGNTLINSNTLLGLMSKEPDWNKD 295 Query: 311 ------CPDPYAPLIMGCDI--AEEGGDNTVVVLRRGPVIEHLFDWSKTDL---RTTNNK 359 P I D+ +T ++ + ++ + Sbjct: 296 GVKVYEKPKEGHTYITTVDVSKGRGIDYSTFTIMDISVKPFRQVCTYRDNMISPMLFPDL 355 Query: 360 ISGLVEKYRPDAIIIDANNTGARTCDYLE-MLGYHVYRVLGQKRAVDLEFCRNRR 413 I+ + Y +II+ N G L + Y V G +A D+ +R Sbjct: 356 IAKYTKPYNESLVIIENNAEGGMVATQLHYDIEYPNVFVQGMSKAEDIGVTMTKR 410 >gi|229605025|ref|YP_002875724.1| hypothetical protein P087_gp56 [Lactococcus phage P087] gi|227826008|gb|ACP41732.1| hypothetical protein [Lactococcus phage P087] Length = 578 Score = 71.3 bits (173), Expect = 4e-10, Method: Composition-based stats. Identities = 71/460 (15%), Positives = 137/460 (29%), Gaps = 87/460 (18%) Query: 85 GRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144 G G GK+ +++ L + G + A + L + ++ E+ ++ P + Sbjct: 105 GTGFGKSFVSSQCNL--VRANRGELITAFAPNRE-LNSVIFKEMVSAVNHSPKLKKVLFE 161 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERP-DTFVGHHNTYGMAIINDEASGT 203 + S A G+ K ++ + + G H++ M DE + Sbjct: 162 AESKEEA--------LQRGVSQKRFAFPSGGFVDLTIAKNATGVHSSSYM----DEYALL 209 Query: 204 PDVINLGILG----FLTERNANRFWIMTSNPRRLSGKFYEIFNKPL--------DDWK-- 249 G ++ + TSNP ++ + ++ PL DW+ Sbjct: 210 TKEEYNLAEGRAYAYVDKDGKPGKIFKTSNPHIMNFSYDDMIRNPLPPHEAVLWGDWRLN 269 Query: 250 ----------RFQIDTRTVEGIDPSFHEGIIARYGLD--------SDVT------RVEVC 285 Q+D D Y LD S R+ Sbjct: 270 IGEGKFMELVYSQLDDEHKYLKDKFPLNREERDYLLDQAIQQVIWSPFFNDEDNLRILYL 329 Query: 286 GQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHL 345 +F +F ++ P + G D+A G D + L + Sbjct: 330 SEFGVNTESAFFTTT---PKIDDSPIDWDNSTFYAGNDVAIRGTDACIYALLEYNPNKSY 386 Query: 346 FDWSKTDL------------RTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYH 393 + + ++ + IDA+ G + L Sbjct: 387 SRIVAFNNVKPQLWIDHETPMEMAQNVIRQLKHDNARLLAIDASGVGEGQFNLLTTDDAE 446 Query: 394 ----VYRVLGQKRAV------DLEFCRNRRTELHVKMADWLEFASLINHSG----LIQNL 439 V V A + N+R+EL + ++++ +L S L + Sbjct: 447 TSCPVVPVRFGDGASKWRKDKNAVRSHNKRSELFLDFKEFIDTDTLRVTSEVWEFLQAEM 506 Query: 440 KSLKSFIVPNTGELAIESK---RVK-GAKSTDYSDGLMYT 475 +++ ++ IE K + + G KSTDY D M Sbjct: 507 QAVTKMSNDENKKIKIEPKDAIKKRLGGKSTDYLDSSMLA 546 >gi|116625333|ref|YP_827489.1| hypothetical protein Acid_6278 [Candidatus Solibacter usitatus Ellin6076] gi|116228495|gb|ABJ87204.1| hypothetical protein Acid_6278 [Candidatus Solibacter usitatus Ellin6076] Length = 260 Score = 70.9 bits (172), Expect = 5e-10, Method: Composition-based stats. Identities = 40/225 (17%), Positives = 72/225 (32%), Gaps = 25/225 (11%) Query: 88 IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147 GK+T+ A + T+ I ++ + Q T + V K + +M+ Sbjct: 58 WGKSTVTAARAVHEAVTKADSLTIAVSPTARQ--TGEF--VRKAEAFAGM---LKMKVKG 110 Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 207 + S + R +S ++ DEAS D + Sbjct: 111 DGSNEMSLAFPNGSRIVGLPGTEATVRGFSA-------------VALLLVDEASRVEDDL 157 Query: 208 NLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHE 267 + + L W+M S P G FYE + W+R + + E Sbjct: 158 YMAMRPMLAVSG-GTLWLM-STPWGKRGFFYEAWANGGPTWERVSVKAEDCPRFGAEYLE 215 Query: 268 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCP 312 G + R E C +F + + ++IE A + + P Sbjct: 216 EERRVMGER--IYRQEYCCEFGESSS-AVFDRDLIEAAFSDDFGP 257 >gi|86372240|gb|ABC95184.1| GP17-terminase [Stenotrophomonas phage Smp14] Length = 536 Score = 70.1 bits (170), Expect = 8e-10, Method: Composition-based stats. Identities = 48/326 (14%), Positives = 107/326 (32%), Gaps = 48/326 (14%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GKTT+ A ++LW + LAN Q + L ++ + WF + + Sbjct: 92 GKTTVVAAILLWYAIFNEEYRIAILANKGDQSREIL----ARLQLMYEELPWF----MQV 143 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI- 207 + W + + LG S+ ++ + + G + DE + + + Sbjct: 144 GVSVW--NKGNIKLGNRSEVFT------AATGGSSIRG---KSVNLMYLDEFAFVENDVD 192 Query: 208 -NLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTR---TVEGIDP 263 +T I+TS P ++ FY+I+ + + + D Sbjct: 193 FYTSTYPVVTS-GTKTKVIITSTPNGMN-LFYKIWTDSTNGKNNYVHNEAFWHDHPKRDQ 250 Query: 264 SFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC------------ 311 ++ + + E +F Q D+ + +E+ ++ Sbjct: 251 AWKDEQLRNMSERQ--FEQEFLCKF-QGSSDTLLSPAKLEQLTYQDHIRELGGNRDFKIY 307 Query: 312 --PDPYAPLIMGCDIAEE-GGDNTVV-VLRRGPVIEHLFDWSKTDLR---TTNNKISGLV 364 P A ++ D++E G D +V+ V ++++ + + + Sbjct: 308 EDPIKDASYVVTVDVSEGIGKDYSVISVFDTTEAPFRQVAMLRSNIIAPLILADLANRIG 367 Query: 365 EKYRPDAIIIDANNTGARTCDYLEML 390 Y +I++ N+ G L Sbjct: 368 HLYNQAVLIVECNSIGNTVVTALWED 393 >gi|30044056|ref|NP_835653.1| similar to terminase DNA packaging enzyme, large subunit [Rhodothermus phage RM378] Length = 508 Score = 70.1 bits (170), Expect = 9e-10, Method: Composition-based stats. Identities = 55/335 (16%), Positives = 100/335 (29%), Gaps = 54/335 (16%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 G T L M V+ AN E K L + + + L + Sbjct: 66 GVTWCAVAYALHQMIFNSNYKVLIAANKEATAKNVL--------ERIKFAYEQLPRFLQI 117 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD--V 206 W + S + ++ S + +I +EA+ + Sbjct: 118 KKRTWNKT--YIEFSNYSSARAVSSKSDSGR---------SESITLLIVEEAAFISNMEE 166 Query: 207 INLGILGFLTERNANRFWIMTSNPRRLS-GKFYE----IFNKPLDDWKRFQIDTRTVEGI 261 + + L N G +YE + ++K F I Sbjct: 167 LWASVQQTLATGGK-----CIVNSTYNGVGNWYERTIRAAKEGKSEFKYFGIKWSDHPER 221 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP---- 317 D + E L V E+ PQ ++ IP ++I E +P Y Sbjct: 222 DEKWFEEQKRL--LPPRVFAQEILCI-PQGSGENVIPFHLIREEEFIDPFVVKYGGDYWE 278 Query: 318 -------LIMGCDIA-EEGGDNTVVVLR------RGPVIEHLFDWS--KTDLRTTNNKIS 361 + D A G D + V ++ + IE + +++ KT L I Sbjct: 279 WYRKPGYYFISVDPASGRGEDRSAVGVQVLWVDPQTLTIEQVAEFASDKTSLPVMRQVIK 338 Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYR 396 + ++++P I I+ N G ++E + Sbjct: 339 QIYDEFKPQLIFIETNGIGMGLYQFMEAYTPSIVG 373 >gi|213029404|ref|ZP_03343851.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. 404ty] Length = 282 Score = 69.8 bits (169), Expect = 1e-09, Method: Composition-based stats. Identities = 35/193 (18%), Positives = 70/193 (36%), Gaps = 8/193 (4%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252 + +EA + + + + + ++ NP ++ + P +D + Sbjct: 75 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 132 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D + G D + I L+ IE A++ + Sbjct: 133 INYDENPFLSDTMLKVIDAARRRDPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 192 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368 P +G D+A+ G D V R G VI +W + +L + + + R Sbjct: 193 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKEDELLKSCQRTYQAAME-R 251 Query: 369 PDAIIIDANNTGA 381 I+ D+ GA Sbjct: 252 DADIVYDSIGVGA 264 >gi|331648285|ref|ZP_08349374.1| conserved hypothetical protein [Escherichia coli M605] gi|331042834|gb|EGI14975.1| conserved hypothetical protein [Escherichia coli M605] Length = 219 Score = 69.8 bits (169), Expect = 1e-09, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 72/205 (35%), Gaps = 51/205 (24%) Query: 322 CDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 D+A+EG D R G ++E++ +WS +D+ + K+ G E+ + D + Sbjct: 1 MDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDNLEEFRFDEDGL 60 Query: 380 GART------CDYLEMLGYH---------------------VYRVLGQKRAVDLEFCRNR 412 GA + L + V GQ ++ +F N Sbjct: 61 GAGVRGDARAINELRNVARRPSILATPFRGSGAVFDPDDEAVRGDNGQAARLNKDFFANA 120 Query: 413 RTELHVKMADWL--------EFASLINH------------SGLIQNLKSLKSFIVPNTGE 452 + + ++ E + LI L S ++ + G+ Sbjct: 121 KAQSWWRLRKLFQNTWRAVAEGMAYNPDEIISISSSMALKDKLIIEL-SQPTYSINGVGK 179 Query: 453 LAIESKRVKGAKSTDYSDGLMYTFA 477 + I+ K+ G +S + +D +M +A Sbjct: 180 IVID-KQPDGTRSPNLADSVMINYA 203 >gi|256819733|ref|YP_003141012.1| hypothetical protein Coch_0896 [Capnocytophaga ochracea DSM 7271] gi|256581316|gb|ACU92451.1| hypothetical protein Coch_0896 [Capnocytophaga ochracea DSM 7271] Length = 450 Score = 69.4 bits (168), Expect = 2e-09, Method: Composition-based stats. Identities = 43/295 (14%), Positives = 104/295 (35%), Gaps = 38/295 (12%) Query: 217 ERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTR---------TVEGIDPSFHE 267 E N ++T+NP + ++ + +K ++ R + + + + Sbjct: 162 EYNLKGKLLITANPSKNF-----LYKEFYTPYKEGTLNKRRAFIQALPYDNKMLPKEYIQ 216 Query: 268 GIIAR-YGLDSDVTRVEVC-GQFP-QQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDI 324 + G + + + G + D +S + I N + P + DI Sbjct: 217 NLENTLRGAE----KQRLLNGLWEYDDDPNSLCDYDKILAIFNNDQLPKESTTY-LTADI 271 Query: 325 AEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY---RPDAIIIDANNTGA 381 A G D V+ + +G + ++ + + I+ L KY + + I D + G Sbjct: 272 ARFGSDLCVIGVWQGWELIEVYTLATSATTEIQALINTLRMKYNIPKGNCIA-DEDGVGG 330 Query: 382 RTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQ---- 437 D ++G+ ++ +N +T+ K+A+ + + + + + Sbjct: 331 GVVDNTGIVGFKNNSTPFEENG-QPTNYKNLQTQCLYKLAERINSNGIYISAEVSERTKE 389 Query: 438 --NLKSLKSFIVPNTG-ELAIESK---RVKGAKSTDYSDGLMY-TFAENPPRSDM 485 + + G L++ +K + +S DY D L+ + + P+ Sbjct: 390 MIIEEIEQIKSDNKDGQRLSVINKDTVKQAIGRSPDYRDMLLMREYFDLKPKRIF 444 >gi|291334534|gb|ADD94186.1| hypothetical protein Syncc9605_0456 [uncultured phage MedDCM-OCT-S04-C1220] gi|291335526|gb|ADD95137.1| hypothetical protein Syncc9605_0456 [uncultured phage MedDCM-OCT-S04-C491] gi|291335665|gb|ADD95272.1| hypothetical protein Syncc9605_0456 [uncultured phage MedDCM-OCT-S04-C846] Length = 354 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 51/270 (18%), Positives = 100/270 (37%), Gaps = 38/270 (14%) Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE----RPDTFVGHHNTYGMAIINDEASG 202 L P W L I+ + ST+ +E R + G ++ DEA+ Sbjct: 12 KLVPKVWIRTKNETDLRIELINGSTIELKGTENAMALRGRSLSG--------VVLDEAAF 63 Query: 203 T-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF----NKPLDDWKRFQIDTRT 257 +V I L ++ + + S P + FY+++ + ++W+R+ T Sbjct: 64 MDAEVWFEVIRPALADKEG--WALFISTPDGTASWFYDLWCYVPDDETNEWQRWSYTTID 121 Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317 + E A+ LD+ R E F +++ + ++ +E +++E P Sbjct: 122 GGNVSKHEVEAARAQ--LDTRTFRQEFEASF--ENLTGLVAISFSDENISQEAKDISIQP 177 Query: 318 LIMGCDIAEEGGD--NTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIII- 374 L++G D D + + ++ G + + T TT + + +Y D II Sbjct: 178 LLLGVD---FNVDPMSGICAVKNGETLYVFDEVMLTGGATTWDFAEEVTRRYGVDRRIIA 234 Query: 375 --DANN-----TGARTCDY--LEMLGYHVY 395 D +G D+ L G+ V Sbjct: 235 CPDPTGGARKTSGVGVTDHAILRRSGFTVQ 264 >gi|326804661|ref|YP_004327532.1| Gp17 terminase subunit for DNA packaging, nuclease and ATPase [Salmonella phage Vi01] gi|301795311|emb|CBW38029.1| Gp17 terminase subunit for DNA packaging, nuclease and ATPase [Salmonella phage Vi01] Length = 736 Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 66/393 (16%), Positives = 121/393 (30%), Gaps = 66/393 (16%) Query: 91 TTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHP 150 TT+ A +LW + LAN E Q L + K + Sbjct: 269 TTVVAAFLLWYAMFHSDKEIAVLANKEKQAIEIL-DRIRK----------------AYQD 311 Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYS-EERPDTFVGHHNTYGMAIINDEASGTPDV--I 207 P++ G + + Y+ D+ G + DE + + Sbjct: 312 LPFFLQQGCEKFGSTLIEFENGSKIYAYATSSDSIRGR---SVSLLYVDEVAFIENDFEF 368 Query: 208 NLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD---WKRFQIDT---RTVEGI 261 + + +R I+TS P+ G FY+I K + F + V Sbjct: 369 WESTFPAIASADTSR-CILTSTPKGQRGLFYDIVTKADPRHPQYNDFHLTEVPWYKVPAY 427 Query: 262 --DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN---REP-----C 311 DP + AR G E +F + + S IP +++ + REP Sbjct: 428 TKDPDWETKQRARLG--DARFDQEFGIKF-RGSVGSLIPAKCLDKMTSKLYREPNEFTKI 484 Query: 312 PDPYAPLIMGCDIAEEG----GDNTVV-VLRRGPVIEHLFDWSKTDL---RTTNNKISGL 363 Y P + IA+ G GD +V+ +L + + + I+ + Sbjct: 485 YKEYDPQRIYFGIADTGKGVEGDYSVLTILDITEYPHVIAAKYRNNTIPPMMYAYTIADM 544 Query: 364 VEKYRPDAIIIDA-NNTGARTCDYLEMLGYHVYRVLG-----------QKRAVDLEFCRN 411 +Y ++++ N+ G + L + + R + N Sbjct: 545 CTEYGECPVLVETNNDVGGQVITILYQEIEYPEIIFTSTDNKGTGKRIGGRKPEPGINTN 604 Query: 412 R--RTELHVKMADWLE-FASLINHSGLIQNLKS 441 R R+ + +E +I I L + Sbjct: 605 RKVRSIGCANLKALIEKEMLVIEDQDTIDELST 637 >gi|282599341|ref|YP_003358653.1| Gp17 terminase DNA packaging enzyme large subunit [Shigella phage phiSboM-AG3] gi|226973647|gb|ACO94400.1| Gp17 terminase DNA packaging enzyme large subunit [Shigella phage phiSboM-AG3] Length = 736 Score = 68.6 bits (166), Expect = 3e-09, Method: Composition-based stats. Identities = 61/393 (15%), Positives = 124/393 (31%), Gaps = 66/393 (16%) Query: 91 TTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHP 150 TT+ A +LW + LAN E Q L + K + Sbjct: 269 TTVVAAFLLWYAMFHSDKEIAVLANKEKQAIEIL-DRIRK----------------AYQD 311 Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYS-EERPDTFVGHHNTYGMAIINDEASGTPDV--I 207 P++ G + + Y+ D+ G + DE + + Sbjct: 312 LPFFLQQGCEKFGSTLIEFENGSKIYAYATSSDSIRGR---SVSLLYVDEVAFIENDFEF 368 Query: 208 NLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD---WKRFQIDTRTVEGI--- 261 + + +R I+TS P+ G FY+I K + + F++ + Sbjct: 369 WESTFPAIASADTSR-CILTSTPKGQRGLFYDIVTKANPEHPQYNDFKLTEVPWYRVPTY 427 Query: 262 --DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--------EPC 311 DP++ A+ G E +F + + S IP +++ ++ Sbjct: 428 TKDPNWESKQRAKLG--DARFDQEFGIKF-RGSVGSLIPAKCLDKMTSKLYQEPNEFTKI 484 Query: 312 PDPYAPLIMGCDIAEEG----GDNTVV-VLRRGPVIEHLFDWSKTDL---RTTNNKISGL 363 Y P + IA+ G GD +V+ +L + + + I+ + Sbjct: 485 YHDYDPKRIYMGIADTGKGVEGDYSVLTILDITDYPHKIAAKYRNNTIPPMMYAYTIADM 544 Query: 364 VEKYRPDAIIIDA-NNTGARTCDYLEMLGYHVYRVLG-----------QKRAVDLEFCRN 411 EKY ++++ N+ G + L + + R + N Sbjct: 545 GEKYGTCPMLVETNNDVGGQVITILYQEIEYPEIIFTTTDAKGTGKRIGGRRPEPGINTN 604 Query: 412 R--RTELHVKMADWLE-FASLINHSGLIQNLKS 441 + R+ + +E +++ I L + Sbjct: 605 KKVRSNGCANLKALIEREMLVVDDQDTIDELST 637 >gi|291334627|gb|ADD94276.1| hypothetical protein Syncc9605_0456 [uncultured phage MedDCM-OCT-S04-C231] Length = 320 Score = 68.2 bits (165), Expect = 3e-09, Method: Composition-based stats. Identities = 40/218 (18%), Positives = 83/218 (38%), Gaps = 26/218 (11%) Query: 195 IINDEASGT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF----NKPLDDWK 249 ++ DEA+ +V I L ++ + + S P + FY+++ +W+ Sbjct: 56 VVLDEAAFMDAEVWFEVIRPALADKEG--WALFISTPDGTASWFYDLWCYVPEDETGEWQ 113 Query: 250 RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309 R+ T + E A+ LD+ R E F +++ + ++ +E +++E Sbjct: 114 RWSYTTIEGGNVSKHEVEAARAQ--LDNRTFRQEFEASF--ENLTGLVAISFSDENISQE 169 Query: 310 PCPDPYAPLIMGCDIAEEGGD--NTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY 367 PL++G D D + + ++ G + + T TT + + +Y Sbjct: 170 AKDISIQPLLLGVD---FNVDPMSGICAVKNGETLYVFDEIMLTGGATTWDFAEEVTRRY 226 Query: 368 RPDAIII---DANN-----TGARTCDY--LEMLGYHVY 395 D +I D +G D+ L G+ V Sbjct: 227 GVDRRVIACPDPTGGARKTSGVGVTDHAILRRSGFTVQ 264 >gi|297566322|ref|YP_003685294.1| hypothetical protein Mesil_1911 [Meiothermus silvanus DSM 9946] gi|296850771|gb|ADH63786.1| protein of unknown function DUF264 [Meiothermus silvanus DSM 9946] Length = 427 Score = 67.8 bits (164), Expect = 4e-09, Method: Composition-based stats. Identities = 68/386 (17%), Positives = 134/386 (34%), Gaps = 44/386 (11%) Query: 88 IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147 +GK+ + + P + L+ E Q SK L+ +H +Q ++ Sbjct: 32 VGKSFAASLEAVLDCVAHPRSLWVFLSRGERQ---------SKELAEKAQRHLEAIQVVA 82 Query: 148 -LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD- 205 ++ P+ ++ + + + PDT G+ ++ DE + D Sbjct: 83 EMYDEPFDAESTQTVIRLPNGSRIISLPA----NPDTARGYSGN----VLLDEFALHKDS 134 Query: 206 -VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP--LDDWKRFQIDTRTVEGID 262 I + +T R+ + S P+ GKFYEI+ D W R ++D Sbjct: 135 REIWGALYPTIT-RSKRYRLRVLSTPKGQQGKFYEIWQPEPGGDLWSRHRVDIYDAVQQG 193 Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP--YAPLIM 320 + + D + + E +F + +++P +I + + D L + Sbjct: 194 LEVDPEELRKGLKDPVLWQQEYLLEFVDEAS-AWLPYELITSCESSQARTDGALEGDLYL 252 Query: 321 GCDIAEEGGDNTVV--VLRRGPVI--EHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 G DI D +V+ R G V+ + +T T + L+ + R IDA Sbjct: 253 GMDIGRH-RDLSVIWVAERVGDVLWTRRVIWLERTPFATQREVLYSLLPQVRRAC--IDA 309 Query: 377 NNTGARTCDYLE-MLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL-INHSG 434 + G + + + G V V+ + + L V + E + I Sbjct: 310 SGLGMQLAEEAQSRFGSRVEPVMFTRAVKED---------LAVTLRRKFEDRLIRIPPDD 360 Query: 435 LIQNLKSLKSFIVPNTGELAIESKRV 460 I+ I + G + ++ R Sbjct: 361 RIRESLHAVRRITTSAGHIRFDADRD 386 >gi|300775654|ref|ZP_07085515.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] gi|300505681|gb|EFK36818.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] Length = 475 Score = 67.4 bits (163), Expect = 5e-09, Method: Composition-based stats. Identities = 46/284 (16%), Positives = 101/284 (35%), Gaps = 35/284 (12%) Query: 217 ERNANRFWIMTSNPRRLSGKFYEIFNKPLD------DWKRFQIDTRTVEGIDPSFHEGII 270 E +T NP++ Y F KP+ K Q + I + E + Sbjct: 179 EYGLKPKIFVTCNPKKN--WMYSYFYKPMKEGLLKLKQKFIQAFVQENPFITTDYIEQLE 236 Query: 271 ARYGLDSDVTRVEVC-GQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIMGCDIAEE 327 + R + G + + D+ L I + L+ + + + D+A Sbjct: 237 ST---TDKAKRERLLKGNW--EYDDNPYKLTIYDRILDLWKNDHIEKKGRKYITADVARF 291 Query: 328 GGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI--SGLVEKYRPDAIIIDANNTGARTCD 385 G D V + + + ++ + I + K I DA+ G D Sbjct: 292 GSDLATVGVWEDWDLIEVHEFEISKTTEIQACIQAMRIKHKIPKHNCIADADGVGGGVVD 351 Query: 386 YLEMLGYHVYRVLGQ---KRAVDLEFCRNRRTELHVKMADWLEFASLINHSG-------- 434 L+++G+ + ++ + +N +T+L V +A+ + + +N S Sbjct: 352 NLDIIGFVNNAKPFEENTGQSKNAPKYKNMQTQLLVYLAEKIINQNKMNISADISEKQKE 411 Query: 435 -LIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMY 474 + + L +++ +P+ + + K + +S DY D ++ Sbjct: 412 YIKEELDTIE--RIPDVDIVTLVDKTQIKQNIGRSPDYRDMILM 453 >gi|329954246|ref|ZP_08295340.1| phage terminase, large subunit, PBSX family [Bacteroides clarus YIT 12056] gi|328527952|gb|EGF54938.1| phage terminase, large subunit, PBSX family [Bacteroides clarus YIT 12056] Length = 438 Score = 67.1 bits (162), Expect = 6e-09, Method: Composition-based stats. Identities = 52/319 (16%), Positives = 109/319 (34%), Gaps = 45/319 (14%) Query: 196 INDEASGTPDVINLGILG----FLTE-RNANRFWIMTSNPRRLSGKFYEIFNKP------ 244 +EA + + L + N ++T NP++ Y+ F KP Sbjct: 126 WIEEAGQVNRLAFEVLQTRIGRHLNDVYNVPGKILITCNPKKN--WLYDKFYKPWKEHKL 183 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDIDSFIPLNIIE 303 D + Q + + + + VT+ + G + + P + + Sbjct: 184 KDGYAFVQALVQDNPFATEDYINTLKNT---NDKVTKERLYFGNWEYDND----PAVLCD 236 Query: 304 EALNREPCPDPY-APLIMGC---DIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK 359 + + + P+ + D+A +G D V G V D + ++ Sbjct: 237 YDAICDLFVNEHVQPVGLSTGSSDLAMKGRDRFVSGHWIGNVCYIRLDQEYSTGKSIEAD 296 Query: 360 ISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELH 417 + ++ ++ +I+D++ G YLE + G R ++ E N ++E Sbjct: 297 LKNMMIQWSIPRSMMIVDSDGLG----SYLESYLNGIKEFHGGNRPINPE-FDNLKSECA 351 Query: 418 VKMADWLEFASL------INHSGLIQNLKSLKSFIVP----NTGELAIESKRVKGAKSTD 467 K+A+ + + +I+ L LK + G ++ E + S D Sbjct: 352 FKLAELINNRQIRIICTEAQRERIIEELGVLKQDHIDADTRKKGIISKEKMKEILGHSPD 411 Query: 468 YSDGLMYT-FA--ENPPRS 483 Y D L+ F + P+ Sbjct: 412 YLDMLIMAMFFRIKPIPKR 430 >gi|167623253|ref|YP_001673547.1| hypothetical protein Shal_1320 [Shewanella halifaxensis HAW-EB4] gi|167353275|gb|ABZ75888.1| protein of unknown function DUF264 [Shewanella halifaxensis HAW-EB4] Length = 617 Score = 67.1 bits (162), Expect = 7e-09, Method: Composition-based stats. Identities = 40/249 (16%), Positives = 87/249 (34%), Gaps = 34/249 (13%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP-------- 317 E + Y + D + F D DS + +E+ + + P Sbjct: 380 IEELRDEY--NDDDFKNLFMCIFVD-DADSVFKFSDLEKCMVESARWQDHKPKEQRPFGN 436 Query: 318 --LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRP 369 + +G D + + T+VV+ ++G L W + ++I + +YR Sbjct: 437 REVWLGYDPSRTRDNATLVVIAPGEKKGEKFRVLEKHYWRGLNFSHHVSEIQKVYARYRV 496 Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL 429 I +D GA D + L + A + + + +T L +KM D +E + Sbjct: 497 TYIGVDTTGIGAGVFDSISTL--------FPREATAIHYSVSSKTRLVLKMIDVVESGRI 548 Query: 430 I---NHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSDMD 486 +H + + S++ G + ++ R ++D + + A ++ Sbjct: 549 EWDASHKDIAMSCLSIRKTTTDTGGAITFKASRDNVT---GHAD-VFFAIAHAVINEPLN 604 Query: 487 FGRCPSYQY 495 F + + Sbjct: 605 FAHKRTSSW 613 >gi|291337121|gb|ADD96636.1| hypothetical protein Syncc9605_0456 [uncultured organism MedDCM-OCT-S12-C92] Length = 354 Score = 66.7 bits (161), Expect = 9e-09, Method: Composition-based stats. Identities = 50/270 (18%), Positives = 99/270 (36%), Gaps = 38/270 (14%) Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE----RPDTFVGHHNTYGMAIINDEASG 202 L P W L I+ + ST+ +E R + G ++ DEA+ Sbjct: 12 KLVPKVWIRTKNETDLRIELINGSTIELKGTENAMALRGRSLSG--------VVLDEAAF 63 Query: 203 T-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF----NKPLDDWKRFQIDTRT 257 +V I L ++ + + S P + FY+++ + ++W+R+ T Sbjct: 64 MDAEVWFEVIRPALADKEG--WALFISTPDGTASWFYDLWCYVPDDETNEWQRWSYTTID 121 Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317 + E A+ LD+ R E F +++ + ++ ++ ++ E P Sbjct: 122 GGNVSKHEVEAARAQ--LDTRTFRQEFEASF--ENLTGLVAISFSDDNISTEAKDISIQP 177 Query: 318 LIMGCDIAEEGGD--NTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIII- 374 L++G D D + + ++ G + + T TT + + +Y D II Sbjct: 178 LLLGVD---FNVDPMSGICAVKNGETLYVFDEVMLTGGATTWDFAEEVTRRYGVDRRIIA 234 Query: 375 --DANN-----TGARTCDY--LEMLGYHVY 395 D +G D+ L G+ V Sbjct: 235 CPDPTGGARKTSGVGVTDHAILRRSGFTVQ 264 >gi|326784094|ref|YP_004324487.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage Syn1] gi|310004826|gb|ADO99217.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage Syn1] Length = 550 Score = 66.3 bits (160), Expect = 1e-08, Method: Composition-based stats. Identities = 49/323 (15%), Positives = 103/323 (31%), Gaps = 47/323 (14%) Query: 88 IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147 GK+T +L + ++V LAN + + L ++ + N + Q + Sbjct: 84 TGKSTTVVSYLLHYLIFNDSVNVGILANKASTARDLL----ARLATAYENLPKWIQQGVV 139 Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 207 + W + G ST + I DE + P+ I Sbjct: 140 V----WNKGNIELENGSKILAASTSASAVRGMSFN-----------IIFLDEFAFVPNHI 184 Query: 208 ----NLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEG 260 + +T + I+ S P+ ++ FY+++ +D+ ++ V G Sbjct: 185 ADSFFASVYPTITS-GKSTKVIIISTPQGMN-HFYKMWQDAVNGRNDYTYHEVHWSQVPG 242 Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC--------- 311 D + E I E +F +D+ I + ++ EP Sbjct: 243 RDAKWKEETIKNTSQRQ--FTQEFECEFL-GSVDTLISASKLKALAFDEPITRNKGLDIY 299 Query: 312 --PDPYAPLIMGCDIAE--EGGDNTVVVLRRGPVIEHLFDWSKTD---LRTTNNKISGLV 364 P ++ D++ G + +V V + + + N I+ + Sbjct: 300 EKPKDKNEYLLTVDVSRGIGGDYSAFIVYDITTVPYKIVGKYRNNEIKPMLFPNVINDVA 359 Query: 365 EKYRPDAIIIDANNTGARTCDYL 387 Y ++ + N+ G + L Sbjct: 360 RAYNNAWVLCEVNDVGDQVASIL 382 >gi|58532911|ref|YP_195134.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-PM2] gi|58331378|emb|CAF34164.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-PM2] Length = 548 Score = 66.3 bits (160), Expect = 1e-08, Method: Composition-based stats. Identities = 47/323 (14%), Positives = 104/323 (32%), Gaps = 47/323 (14%) Query: 88 IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147 GK+T +L + +++ LAN + + L ++ + N + Q + Sbjct: 84 TGKSTTVVSYLLHYLIFNDNVNIGILANKASTARDLL----ARLATAYENLPKWIQQGVV 139 Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 207 + W + G ST + I DE + P+ I Sbjct: 140 V----WNKGNIELENGSKILAASTSASAVRGMSFN-----------IIFLDEFAFVPNHI 184 Query: 208 ----NLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF---NKPLDDWKRFQIDTRTVEG 260 + +T + I+ S P+ ++ FY+++ + + ++ V G Sbjct: 185 ADSFFASVYPTITS-GKSTKVIIISTPQGMN-HFYKMWVDATNGRNGYTFHEVHWSQVPG 242 Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC--------- 311 D + E I E +F +D+ I + ++ + +P Sbjct: 243 RDEKWKEETIKNTSERQ--FTQEFECEFL-GSVDTLIAASKLKALVFNDPIKRNKGLDIY 299 Query: 312 --PDPYAPLIMGCDIAE--EGGDNTVVVLRRGPVIEHLFDWSKTD---LRTTNNKISGLV 364 P + +M D++ G + ++ V + + + N I+ L Sbjct: 300 EEPKEKSEYLMTVDVSRGIGGDYSAFIIFDITTVPYKVVGKYRNNEIKPMLFPNIINDLA 359 Query: 365 EKYRPDAIIIDANNTGARTCDYL 387 Y ++ + N+ G + L Sbjct: 360 RSYNNAWVLCEVNDIGDQVASIL 382 >gi|319775358|ref|YP_004137846.1| Terminase, ATPase subunit [Haemophilus influenzae F3047] gi|317449949|emb|CBY86161.1| Terminase, ATPase subunit [Haemophilus influenzae F3047] Length = 603 Score = 65.5 bits (158), Expect = 2e-08, Method: Composition-based stats. Identities = 55/375 (14%), Positives = 118/375 (31%), Gaps = 70/375 (18%) Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 K + +S ++ A +DV I + + + T+ T +H Sbjct: 210 ASKKQALQFRSYIVNYAKQTADVDLKGETIKLPNGAEL--TFLGTNSATAQSYHGN---- 263 Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247 + DE P DV+ G ++ + S P ++ Y + FN+ Sbjct: 264 LYFDEVFWVPKFDVMRKVASGMAAQKMYRQT--YFSTPTTIAHPAYAFFSGKAFNRNRAK 321 Query: 248 WKRFQIDT------------------------RTVEGIDPSFHEGIIARYGLDSDVTRVE 283 ++ +ID G + + +IA + Sbjct: 322 SEKIEIDISHENLKSGKLCADRQWKQIVSIYDAMEGGCNLFNIDDLIAENSKEE--FEQL 379 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV 333 QF + +F ++ ++ Y P + +G D A G + Sbjct: 380 FLCQFADDNSSAFKFSDLQLCQVDSLEEWHDYKPFYQRPFGNREVWLGYDPAFTGDRAAL 439 Query: 334 VVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387 V++ R + H + D T ++I + Y I+ID G+ + Sbjct: 440 VIVAPPKVERGDYRVLHKQTFHGMDYETQASRIKQFCDDYNVTRIVIDKTGMGSGVYQEV 499 Query: 388 EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL---INHSGLIQNLKSLKS 444 V + E+ + + E+ +K + ++ L + ++ + ++K Sbjct: 500 RKFYPMVQGL---------EYNADLKNEMVLKTQNLIQKRRLKFDSGDNDIVSSFMTVKK 550 Query: 445 FIVPNTGELAIESKR 459 + TG++ S R Sbjct: 551 -RITGTGKITYVSDR 564 >gi|320162476|ref|YP_004175701.1| hypothetical protein ANT_30750 [Anaerolinea thermophila UNI-1] gi|319996330|dbj|BAJ65101.1| hypothetical protein ANT_30750 [Anaerolinea thermophila UNI-1] Length = 506 Score = 64.7 bits (156), Expect = 3e-08, Method: Composition-based stats. Identities = 57/329 (17%), Positives = 94/329 (28%), Gaps = 43/329 (13%) Query: 120 LKTTLWAEVSKWLSLLPNKHWFEMQSL--SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS 177 L + AE+ K L + M+ L L+ P G + S Sbjct: 72 LFSGTSAEMVKASPTLRPQSLTAMRRLERVLNANPLTRGRWRRESGNTFRLGQARIHFLS 131 Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLT--ERNANRFWIMTSNPRRLSG 235 + VG T + + DEA L +T FW L G Sbjct: 132 AAPGASIVG--ATASLLLEVDEAQAVSIEKFDTELAPMTASTGAVRVFWGTAWTASTLLG 189 Query: 236 K---FYEIFNKPLDDWKRFQIDTRTVEGIDPSFH---EGIIARYGLDSDVTRVEVCGQFP 289 + + + F++ V P + E I + G + R + + Sbjct: 190 RELRLAQAEQARDGVRRVFRLTAAEVIADHPRYARTVERAIQQLGRNHPAVRTQYFSEEV 249 Query: 290 QQDIDSFIPLNIIEEALNREPCPDP---------------YAPLIMGC--DIAEEGGDNT 332 + P + P D AP+ + D A D++ Sbjct: 250 DAA-GTLFPEERLALLRGTHPWQDAPLPGRTYAFLLDVGGTAPVQLPLMDDYAGNRRDSS 308 Query: 333 VVVLRR-----------GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGA 381 +V+ HL W+ ++ L ++ P I+IDA GA Sbjct: 309 ALVIVEVEPPQDGRPAPRYRAVHLCQWTGVSQTRLFEQVLALARQWSPRRIVIDATGVGA 368 Query: 382 RTCDYLEML--GYHVYRVLGQKRAVDLEF 408 D+L+ G V V DL + Sbjct: 369 GLADFLDRALPGRVVRFVFSSASKSDLGY 397 >gi|163758712|ref|ZP_02165799.1| prophage MuMc02, terminase, ATPase subunit, putative [Hoeflea phototrophica DFL-43] gi|162284002|gb|EDQ34286.1| prophage MuMc02, terminase, ATPase subunit, putative [Hoeflea phototrophica DFL-43] Length = 460 Score = 64.4 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 49/304 (16%), Positives = 86/304 (28%), Gaps = 32/304 (10%) Query: 129 SKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHH 188 +K L H ++ Q D+ H S T PDT G Sbjct: 79 AKAYDLAIEAHEYDWQGQEGSYRAMEVDLPHGSK-----------ITALPANPDTARGFS 127 Query: 189 NTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246 + DE + D I + ++ A +TS P KFYE+ D Sbjct: 128 AN----VFLDEFAFHKDSGAIWKALFPVIS---AGWKLRITSTPNGKGNKFYELMTAEGD 180 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDID----SFIPLNII 302 W + ++D + D D E ++ + I Sbjct: 181 RWSKHEVDIYRAVADGLPRDIEELREGLADEDAWAQEYELKWLDEASAWLSYELISSVED 240 Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHLFDWSKTDLRTTNN 358 E A +P P +G DI D V+ + + + + + Sbjct: 241 ERA--GDPYLYQGGPCYVGRDIGRRN-DLHVIWVWELVGDVLWERERIEQKRATFASMDA 297 Query: 359 KISGLVEKYRPDAIIIDANNTGARTCDYLEML-GYHVYRVLGQKRAVDLEFCRNRRTELH 417 ++E+YR ID G + + + G + VL + + Sbjct: 298 AFDDVMERYRVVRACIDQTGMGEKVVEDAQTRHGSRIEGVLFTGPNKLVMATAGKEAFED 357 Query: 418 VKMA 421 ++ Sbjct: 358 RRVR 361 >gi|78212008|ref|YP_380787.1| hypothetical protein Syncc9605_0456 [Synechococcus sp. CC9605] gi|78196467|gb|ABB34232.1| conserved hypothetical protein [Synechococcus sp. CC9605] Length = 414 Score = 64.4 bits (155), Expect = 4e-08, Method: Composition-based stats. Identities = 50/254 (19%), Positives = 88/254 (34%), Gaps = 39/254 (15%) Query: 82 ISAGRGIGKTTLN-AWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140 +++GR GKT + WL+ + T G + LA + Q K W ++ Sbjct: 25 VNSGRRFGKTRMALTWLLEGALLT-SGSRMWFLAPTRVQAKQIAWRDLK----------- 72 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 + P W S V +L I+ ++ S + + + D+ G DE Sbjct: 73 ------EMVPGSWASQVRESTLTIELRNGSHI-QLAGADYADSLRGQRADR---FAIDEY 122 Query: 201 SGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD--WKRFQIDTR 256 D + L + + + I +S P E++ + W R+ + Sbjct: 123 CYIRDLQEMWQAALLPMLGTSDDGSVIFSSTPAGGGTFSAELWERAETAEGWARWNFPSV 182 Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC---PD 313 + P + E AR +D + R E G L + A N++ D Sbjct: 183 AGGWVKPEYVEQ--ARQTMDPSLWRQEFFGSIES-------LLGAVYPAFNQQNISDTVD 233 Query: 314 PYAPLIMGCDIAEE 327 PL++GCD Sbjct: 234 NGGPLLVGCDFNRS 247 >gi|319762771|ref|YP_004126708.1| prophage mumc02, terminase, atpase subunit, putative [Alicycliphilus denitrificans BC] gi|317117332|gb|ADU99820.1| prophage MuMc02, terminase, ATPase subunit, putative [Alicycliphilus denitrificans BC] Length = 454 Score = 64.0 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 46/241 (19%), Positives = 77/241 (31%), Gaps = 21/241 (8%) Query: 175 TYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRR 232 T PDT G ++ DE + D I + +++ S P Sbjct: 113 TALPANPDTARGFSAN----VLLDEFAFHQDSRAIWKALFPVISKPGLKLRV--ISTPNG 166 Query: 233 LSGKFYEIFNKPLDDWKRFQIDTR-TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQ 291 KFY++ D W R D V P E + G D D+ E ++ + Sbjct: 167 KGNKFYDLMTGADDGWSRHTTDIYQAVADGLPRNIEELRKGAG-DDDLWAQEFELKWLDE 225 Query: 292 DIDSFIPLNIIEEA---LNREPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEH 344 +++P +I +P P +G DIA D V+ + + Sbjct: 226 AS-AWLPFELITACEHEAAGKPEHYQGGPCFVGVDIASRN-DLFVIWVFELVGDVLWVRE 283 Query: 345 LFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG-ARTCDYLEMLGYH-VYRVLGQKR 402 + + + + + G+ +YR +D G D G V VL Sbjct: 284 IIERRRITFAEQDMLLDGVFRRYRVIRACMDQTGMGEKPVEDAQRRHGSSRVQGVLFTSS 343 Query: 403 A 403 A Sbjct: 344 A 344 >gi|146277344|ref|YP_001167503.1| hypothetical protein Rsph17025_1297 [Rhodobacter sphaeroides ATCC 17025] gi|146278140|ref|YP_001168299.1| hypothetical protein Rsph17025_2103 [Rhodobacter sphaeroides ATCC 17025] gi|145555585|gb|ABP70198.1| protein of unknown function DUF264 [Rhodobacter sphaeroides ATCC 17025] gi|145556381|gb|ABP70994.1| protein of unknown function DUF264 [Rhodobacter sphaeroides ATCC 17025] Length = 476 Score = 64.0 bits (154), Expect = 5e-08, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 69/239 (28%), Gaps = 21/239 (8%) Query: 175 TYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRR 232 T PDT G +I DE + I + +++ + + S P Sbjct: 133 TALPANPDTARGFSAN----VILDEFAFHAKSREIWAALFPVISKG--RQKLRVISTPNG 186 Query: 233 LSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD 292 KFYE+ W R +D ++ D D E ++ + Sbjct: 187 KGNKFYELMTAEGSVWSRHVVDIYEAVRQGLDRDVDMLRAGMADEDAWAQEYELKWLDEA 246 Query: 293 ----IDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEH 344 I E L +P P +G DIA D V+ + Sbjct: 247 SAWLDYDLISS--CESELAGKPEGYQGGPCFVGVDIAARN-DLFVIWVMELVGDVLWTRE 303 Query: 345 LFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG-ARTCDYLEMLG-YHVYRVLGQK 401 + + + + ++ +YR + +D G D G V VL Sbjct: 304 IIARRRISFAEQDALLDDVMRRYRVIRVQMDQTGMGEKPVEDAKRRHGQLRVEGVLFSA 362 >gi|171914351|ref|ZP_02929821.1| hypothetical protein VspiD_24270 [Verrucomicrobium spinosum DSM 4136] Length = 450 Score = 64.0 bits (154), Expect = 6e-08, Method: Composition-based stats. Identities = 61/349 (17%), Positives = 102/349 (29%), Gaps = 59/349 (16%) Query: 88 IGKTTLNAWLVLWLMSTRPGISVICLANSETQ-LKTTLWAEVSKWLSLLPNKHWFEMQSL 146 GK +A ++ R + + A SE Q L+T L W E L Sbjct: 31 TGKDFSSAAEIVRDCKLRDKTTWMIAAPSERQSLET-----------LAKCSEWSEAFDL 79 Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCR-TYSEERPDTFVGHHNTYGM---AIINDEASG 202 + D L ++ R RPDT G M A D Sbjct: 80 ASEGIREERDGPEALLKQGEIKFANGSRVIAVPGRPDTVRGFSANVLMTEFAFFED---- 135 Query: 203 TPDVINLGILGFLTE--RNANRFWIMTSNPRRLSGKFYEIFNKP---LDDWKRFQIDTRT 257 PD IL +T R + + + P K ++++ K W + ++ Sbjct: 136 -PDATWRAILPSITNPLRGGEKKVRLITTPNGQGNKAHDLWTKENSTKHKWSKHKVTIHD 194 Query: 258 VE----GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD----IDSFIPLNIIEEALNRE 309 +DP ++ D + E +F I EA + Sbjct: 195 AVAAGLPVDPEELRAMLD----DPEGWAQEYECEFLDAAGVLLSYELIGSCEAPEATTTQ 250 Query: 310 P----CPDPYAPLIMGCDIAEEGGDNTVV--VLRRGP--VIEHLFDWSKTDLRTTNNKIS 361 P P PL G D A + D +V+ + GP V + + +T ++ Sbjct: 251 PDAFWAARPQFPLYAGWDFARK-KDLSVLWTAQKIGPLLVTKEVLVMRG---MSTPKQVE 306 Query: 362 GLVEKYRPDA-IIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409 + + + + +D G D L + D Sbjct: 307 LVSHRLKNITRLCLDYTGAGVGAGDLLVE--------KFGEWNFDKHQF 347 >gi|227500282|ref|ZP_03930349.1| terminase [Anaerococcus tetradius ATCC 35098] gi|227217568|gb|EEI82880.1| terminase [Anaerococcus tetradius ATCC 35098] Length = 466 Score = 64.0 bits (154), Expect = 6e-08, Method: Composition-based stats. Identities = 55/364 (15%), Positives = 122/364 (33%), Gaps = 51/364 (14%) Query: 50 APRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGIS 109 +P WQ + ++ + A + + + + GKT + LW + G + Sbjct: 35 SPYPWQEKLIKDIFAVNDDGLWTHSKFGYAVPRRN----GKTEIVYMAELWFLM--DGKN 88 Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169 +I A+ + + + ++ K+L + + +S+ +++ + Sbjct: 89 IIHTAHRISTSHS-SFKKLKKYLEKMGLVDKVDFKSIKAK----GQEMIELIKTGGVIQF 143 Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229 T RT + + F ++ DEA + + +T+ + N +M Sbjct: 144 RT--RTETGGLGEGFD--------LLVIDEAQEYTEGQESALKYTVTDSD-NPMILMCGT 192 Query: 230 P------RRLSGKFYE---IFNKPLDDWKRFQIDTRTVE-------GIDPS-----FHEG 268 P + K+ + K + W + + T +PS Sbjct: 193 PPTLVSGGTVFSKYRDLILSGGKNHNGWAEWSVSEMTNPYDIDAWYKTNPSMGYKLRERA 252 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEE 327 + G D ++ G + + + S I L+ L P L +G + Sbjct: 253 VEEEIGPDETDFNIQRLGYWVKYNQKSVISKLDWDR--LKLTRLPSLVGKLHVGIKYGND 310 Query: 328 GGDNTVVVLRRGPVIEHLFD-WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY 386 G + + + + + +R N+ I ++K +P +++ID GA D Sbjct: 311 GRNVALSIAVKTLSNRIFIESIDCQSIRNGNDWIVDFLKKTKPISVVID----GASRQDI 366 Query: 387 LEML 390 LE Sbjct: 367 LEEQ 370 >gi|326782381|ref|YP_004322781.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-ShM2] gi|310003329|gb|ADO97726.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-ShM2] Length = 362 Score = 63.6 bits (153), Expect = 8e-08, Method: Composition-based stats. Identities = 46/287 (16%), Positives = 92/287 (32%), Gaps = 44/287 (15%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GK+T+ +LW + ++V LAN + L + N + Q + Sbjct: 85 GKSTIVTSYLLWYVIFNDNVNVAILANKAATSREML----QRLQRSYENLPKWLQQGIVQ 140 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP---- 204 W L G + S R +F I DE + P Sbjct: 141 ----WNRGSLELENGSKI---MAASTSSSAVRGMSFN--------VIFLDEFAFVPNHIA 185 Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEGI 261 D + ++ + I+ S P ++ FY++++ + +++ ++ V G Sbjct: 186 DEFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDSERKKNEYISTEVHWSEVPGR 243 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC---------- 311 D + IA +VE +F +D+ I + + + +P Sbjct: 244 DAKWKAQTIANTSEQQ--FKVEFECEFL-GSVDTLISPSKLRTMVYNDPLVQNKGLSIYE 300 Query: 312 -PDPYAPLIMGCDIAEE--GGDNTVVVLRRGPVIEHLFDWSKTDLRT 355 ++ D+A G + VV+ + L K + Sbjct: 301 HVQKDHNYVITVDVARGVSGDFSAFVVIDTTTIPYKLVAKYKNNTIK 347 >gi|171915351|ref|ZP_02930821.1| hypothetical protein VspiD_29290 [Verrucomicrobium spinosum DSM 4136] Length = 451 Score = 63.6 bits (153), Expect = 8e-08, Method: Composition-based stats. Identities = 61/348 (17%), Positives = 104/348 (29%), Gaps = 57/348 (16%) Query: 88 IGKTTLNAWLVLWLMSTRPGISVICLANSETQ-LKTTLWAEVSKWLSLLPNKHWFEMQSL 146 GK +A ++ R + + A SE Q L+T L W E L Sbjct: 32 TGKDFSSAAEIVRDCKLRDKTTWMIAAPSERQSLET-----------LAKCGEWSEAFDL 80 Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCR-TYSEERPDTFVGHHNTYGM---AIINDEASG 202 + D L ++ R RPDT G M A D Sbjct: 81 ASEGIREERDGPEALLKQGEIKFANGSRVIAVPGRPDTVRGFSANVLMTEFAFFED---- 136 Query: 203 TPDVINLGILGFLTE--RNANRFWIMTSNPRRLSGKFYEIFNKP---LDDWKRFQIDTRT 257 PD IL +T R + + + P K ++++ K W + ++ Sbjct: 137 -PDATWRAILPSITNPLRGGEKKVRLITTPNGQGNKAHDLWTKENSTKHKWSKHKVTIHD 195 Query: 258 VE----GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII---EEALNREP 310 +DP ++ D + E +F +P +I E A Sbjct: 196 AVAAGLPVDPEELRAMLD----DPEGWAQEYECEFLD-SAGVLLPYELIATCEAAEATTT 250 Query: 311 CPDPYA------PLIMGCDIAEEGGDNTVV--VLRRGPVIEHLFDWSKTDLRTTNNKISG 362 D + PL G D A + D +V+ + GP I+ + +T ++ Sbjct: 251 QADAFWNARQQFPLYAGWDFARK-KDLSVLWTAQKVGP-IKVTKEVLIMRGMSTPAQVEL 308 Query: 363 LVEKYRPDA-IIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409 + + + + +D G D L + D Sbjct: 309 VSHRLKHITRLCLDYTGAGVGAGDLLVE--------KFGEWNFDKHQF 348 >gi|68250195|ref|YP_249307.1| terminase, ATPase subunit [Haemophilus influenzae 86-028NP] gi|68058394|gb|AAX88647.1| terminase, ATPase subunit [Haemophilus influenzae 86-028NP] Length = 593 Score = 63.2 bits (152), Expect = 9e-08, Method: Composition-based stats. Identities = 54/375 (14%), Positives = 116/375 (30%), Gaps = 70/375 (18%) Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 K + +S ++ A +DV I + + + + T +H Sbjct: 200 ASKKQALQFRSYIVNYAKQTADVDLKGETIKLPNGAEL--IFLGTNSATAQSYHGN---- 253 Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247 + DE P DV+ G ++ + S P ++ Y + FN+ Sbjct: 254 LYFDEVFWVPKFDVMRKVASGMAAQKMYRQT--YFSTPTTIAHPAYAFFSGKAFNRNRAK 311 Query: 248 WKRFQIDT------------------------RTVEGIDPSFHEGIIARYGLDSDVTRVE 283 ++ +ID G + + +IA + Sbjct: 312 SEKIEIDISHENLKSGKLCADRQWKQIVSIYDAMESGCNLFNIDDLIAENSKEE--FEQL 369 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV 333 QF + +F ++ ++ Y P + +G D A G + Sbjct: 370 FLCQFADDNSSAFKFSDLQLCQVDSLEEWHDYKPFYQRPFGNREVWLGYDPAFTGDRAAL 429 Query: 334 VVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387 V++ + H + D T ++I + Y I+ID G+ + Sbjct: 430 VIVAPPKVEGGDYRVLHKQTFHGMDYETQASRIKQFCDDYNVTRIVIDKTGMGSGVYQEV 489 Query: 388 EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL---INHSGLIQNLKSLKS 444 A LE+ + + E+ +K + ++ L + ++ + ++K Sbjct: 490 R---------KFYPMAQGLEYNADLKNEMVLKTQNLIQKRRLKFDSGDNDIVSSFMTVKK 540 Query: 445 FIVPNTGELAIESKR 459 + TG++ S R Sbjct: 541 -RITGTGKITYVSDR 554 >gi|326783799|ref|YP_004324193.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-SSM7] gi|310003811|gb|ADO98206.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-SSM7] Length = 552 Score = 63.2 bits (152), Expect = 9e-08, Method: Composition-based stats. Identities = 61/407 (14%), Positives = 124/407 (30%), Gaps = 56/407 (13%) Query: 59 MEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET 118 M +N+ +N + K + GK+T +L +++ LAN Sbjct: 60 MYDFQEKLVNNFHNNRFNICKMPRQS----GKSTTVVSYLLHYAIFNDSVTIGILANKAQ 115 Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178 + L L + + + W + ST Sbjct: 116 TARDLL--------GRLQIAYENLPKWMQQGIIAWNKGSMELENKSKIIAASTSASAVRG 167 Query: 179 ERPDTFVGHHNTYGMAIINDE----ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLS 234 + I DE A+ D + ++ + I+ S PR ++ Sbjct: 168 MSFN-----------IIFLDEFAFVANHLADDFFSSVYPTISS-GKSTKVIIVSTPRGMN 215 Query: 235 GKFYEIFNK---PLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQ 291 FY +++ +++ + V G D ++ E I RVE +F Sbjct: 216 -HFYRLWHDAELGRNEYVTTDVHWSEVPGRDEAWKEQTIKN--TSEAQFRVEFECEFL-G 271 Query: 292 DIDSFIPLNIIEEALNREPC------------PDPYAPLIMGCDIAEEG--GDNTVVVLR 337 +D+ I + ++ + EP P + D+A + +V Sbjct: 272 SVDTLIAPSKLKTMVYDEPINTGKRGGEIYQNPIEKHNYSITVDVARGVEKDYSAFIVFD 331 Query: 338 RGPVIEHLFDWSKTDL---RTTNNKISGLVEKYRPDAIIIDANNTGARTCD----YLEML 390 + + + + I+ Y I+ + N+ G + LE Sbjct: 332 TTTFPYKVVAKYRNNTIKPMLFPSVIAEFARAYNNAFILCEVNDIGDQIASILFYDLEYE 391 Query: 391 GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQ 437 + V G+ V + + +L VKM+ ++ +N LI+ Sbjct: 392 NVLMTAVRGRAGQVLGQGFSGSKVQLGVKMSKTVKKIGALNLKTLIE 438 >gi|323146129|gb|ADX32368.1| putative terminase ATPase subunit [Cronobacter phage ESSI-2] Length = 639 Score = 63.2 bits (152), Expect = 1e-07, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 69/211 (32%), Gaps = 28/211 (13%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + E + RY + + F DS + Sbjct: 377 PDGQWRYVITMEDAIAGGFNLASIEKLRNRY--NPTTFNMLYMCVFVDSK-DSVFSYGDL 433 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346 E + P + G D A G + V++ + +F Sbjct: 434 EACAVETETWQDHKPDAPRPFGDREVWGGFDPARSGDFSCFVIVAPPLFAGEKFRVLRVF 493 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406 +W + R +I L +KY + +D G D ++ V AV + Sbjct: 494 NWKGMNFRWQAKQIEQLFKKYNFAYLGVDVTGIGQGVFDNIQHFALRV--------AVPI 545 Query: 407 EFCRNRRTELHVKMADWLEFASLINHSGLIQ 437 + RN + +L +K AD +E + L + Sbjct: 546 RYDRNTKNQLVLKAADVVESQRIEWDKELKE 576 >gi|318603823|emb|CBY25321.1| phage terminase, ATPase subunit [Yersinia enterocolitica subsp. palearctica Y11] Length = 257 Score = 63.2 bits (152), Expect = 1e-07, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 47/170 (27%), Gaps = 27/170 (15%) Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIMGCDIAEEG 328 R +F S P ++ + Y P+ MG D + G Sbjct: 31 FRNLFLCEFVDDKA-SVFPFEELQACMVDSLVEWEDFAPFAEQPFNYHPVWMGYDPSHTG 89 Query: 329 GDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGAR 382 VV+ G L W D I L EKY + I IDA G Sbjct: 90 DSAGCVVMAPPWVPGGKFRILERHQWKGMDFADQAESIKKLTEKYNVEYIGIDATGIGQG 149 Query: 383 TCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINH 432 + A ++ + +T + +K D + L Sbjct: 150 VYQLVR---------NFFPAAREIRYSAEVKTNMVLKAKDLITTGRLEYD 190 >gi|120599697|ref|YP_964271.1| hypothetical protein Sputw3181_2900 [Shewanella sp. W3-18-1] gi|120559790|gb|ABM25717.1| protein of unknown function DUF264 [Shewanella sp. W3-18-1] Length = 602 Score = 63.2 bits (152), Expect = 1e-07, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 71/208 (34%), Gaps = 32/208 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP-------- 317 + + Y + D F D DS + +E+ + Y P Sbjct: 365 IDELRDEY--NGDDFANLFMCIFVD-DADSVFKFSDLEKCMVEAARWQDYKPAAPRPFGN 421 Query: 318 --LIMGCDIAEEGGDNTVVVL-----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 + +G D + DN V+ + ++G L W + +I + KYR Sbjct: 422 REVWLGYDPSRT-RDNAVLAVVAPGEKKGEKFRVLERHRWRGMNFAHHVAEIQKIYAKYR 480 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 I +D GA D + L + A + + +T L +KM D +E Sbjct: 481 VTYIGVDTTGIGAGVFDSISTL--------YPREATAIHYSVGSKTRLVLKMIDVVEGGR 532 Query: 429 LINHSGLIQ---NLKSLKSFIVPNTGEL 453 + +GL + S++ + + G + Sbjct: 533 IEWDAGLKDIAMSFLSIRRTVTDSGGAI 560 >gi|225872083|ref|YP_002753538.1| putative bacteriophage portal protein [Acidobacterium capsulatum ATCC 51196] gi|225792593|gb|ACO32683.1| putative bacteriophage portal protein [Acidobacterium capsulatum ATCC 51196] Length = 507 Score = 62.8 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 66/363 (18%), Positives = 111/363 (30%), Gaps = 65/363 (17%) Query: 78 FKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137 FK A+ + R IG + A P + L+ S+ Q + + E + Sbjct: 46 FKIAVKSAR-IGFSFATALEAALDCLAHPNTTWTVLSASKAQ--SVEFIE-----TCHRL 97 Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS-EERPDTFVGHHNTYGMAII 196 + H WY ++ H ++ R + P T G+ I Sbjct: 98 IEVMTGTAELYHDEDWYDELGHIEAIQQRITFANGARIIALPANPRTARGYPGNA----I 153 Query: 197 NDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN------------ 242 DE + + I I + + R S P GKFY++ Sbjct: 154 LDEFAHHEESYAIWAAITRQVALGHKVRVL---STPNGEQGKFYDLCKELGLTDGVAPEN 210 Query: 243 --KPLDDWKRFQIDTR----TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF 296 K + W ID I+ +I D+D+ E F + ++ Sbjct: 211 NFKIVKGWSIHWIDAPMAIADGCPINMDEMRQLIQ----DADIVNQEFYCVFLKSG-GAW 265 Query: 297 IPLNIIEEALNREPCPD------PYAPLIMGCDIAEEGGDNT---------VVVLRRGPV 341 IPL++I+ A + + P L G D+ T V+V R Sbjct: 266 IPLDLIQRAESETATVEWPGGYAPRGRLFGGIDVGRFSNRTTFWVKEDLGDVLVTRMAMA 325 Query: 342 IEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML-GYHVYRVLGQ 400 I + + +L K++ + ID+ G D L L V V Sbjct: 326 IHEMPFPDQANLIAPWMKMTQV--------TAIDSTGMGIGLFDDLNKLCPGRVMGVNFA 377 Query: 401 KRA 403 + Sbjct: 378 GSS 380 >gi|198242430|ref|YP_002214959.1| hypothetical protein SeD_A1100 [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|193876434|gb|ACF24836.1| ORF11 [Salmonella enterica subsp. enterica serovar Dublin] gi|197936946|gb|ACH74279.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|326622711|gb|EGE29056.1| hypothetical protein SD3246_1075 [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 423 Score = 62.8 bits (151), Expect = 1e-07, Method: Composition-based stats. Identities = 64/372 (17%), Positives = 115/372 (30%), Gaps = 68/372 (18%) Query: 58 FMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTT-LNAWLVLWLMSTRPGISVICLANS 116 +E + H +P K I AGR GKTT L W + Sbjct: 6 VIEFLPFHAGQKKIYRSPAKRKV-IRAGRRFGKTTMLEQAGGNW---------------A 49 Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY 176 Q++ +A K L LP+ + + +D + +G + + Sbjct: 50 ARQMRVGWFAPSYKIL--LPSFKTIRDLLKPITISSSKTDSIIELIGGGLVEF------W 101 Query: 177 SEERPDTFVGHHNTYGMAIINDEAS----GTPDVINLGILGFLTERNANRFWIMTSNPRR 232 + + PD + +I DE S G D+ I L + + + +M P+ Sbjct: 102 TLDNPDAGR---SRKYHKVIIDEGSLVKKGMRDIWEQAIEPTLLDFDGDA--VMAGTPKG 156 Query: 233 --LSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290 FY+ N W+ T I+P+ II G V + E +F Sbjct: 157 VDDENFFYQACNDKSMGWEEHHAPTAANPTINPAALARIID--GRPPLVVQQEYNAEFVD 214 Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEG---GDNTV-------------- 333 +F L+ + E P + D A++G D + Sbjct: 215 WRGQNFFKLDWLLENGAPVDYPFSCDTVYGVVDCAQKGKLQNDGSACIWFALDNLPSPHL 274 Query: 334 ------VVLRRGPVIEHLF-DWSKTDLRTTNNKISGLVE-KYRPDAIIIDANNTGARTCD 385 ++ G ++ + W +S + + + I+ TG Sbjct: 275 IILDWDIIQIDGYFLKDVVPQWEGK-----AKHLSEICRARMGTTGLFIEDKATGITLLQ 329 Query: 386 YLEMLGYHVYRV 397 G++V+ V Sbjct: 330 QDANEGWNVHPV 341 >gi|223939800|ref|ZP_03631671.1| protein of unknown function DUF264 [bacterium Ellin514] gi|223891576|gb|EEF58066.1| protein of unknown function DUF264 [bacterium Ellin514] Length = 449 Score = 62.4 bits (150), Expect = 2e-07, Method: Composition-based stats. Identities = 49/345 (14%), Positives = 101/345 (29%), Gaps = 48/345 (13%) Query: 137 NKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS-EERPDTFVGHHNTYGMAI 195 K W ++ + H ++T R YS P+ G + Sbjct: 71 CKAWAQLLDVVAHDLGEIIFDREKKFSAYVLEFATKLRIYSLSSNPNALAGKRGH----V 126 Query: 196 INDEASGTPDV--INLGILGFLTERNA-NRFWIMTSNPRRLSGKFYEIFNKPLDD-WKRF 251 I DE + D + T +G ++I ++ W Sbjct: 127 ILDEFALHGDQRMLYRIAKPVTTWGGQLEIISTHRGVGTVFNGIIHDIHHRGNPMGWSHH 186 Query: 252 QIDTRTVEGIDPSFHEGIIARYGL--DSDVTRVEVCGQ-------------FPQQDIDSF 296 ++ + I+ E I + G + V + P + F Sbjct: 187 KVTLQEA--IEQGVVERINGKTGEAESREGYLARVRAECLDEEQWLQEYCCVPADESSVF 244 Query: 297 IPLNIIEEALNREPCPDPY-----APLIMGCDIAEEGGDNTVVVLRRGPVIEHL------ 345 I ++I+ + Y PL +G D+ + D +V+ G + + Sbjct: 245 IGYDLIDACEDDCLKDFEYLRKCENPLYLGFDVGRK-RDLSVI--DVGEKVGDVMWDRMR 301 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE-MLGYHVYRVLGQKRAV 404 + + ++ L+E + IDA G + + + G+ V V Sbjct: 302 IELAGKTFSEQEAELYRLLELPKLKRACIDATGLGMQLAERAKYRFGWKVEAVTFTGHVK 361 Query: 405 DLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPN 449 + E N R + + + L +L+ +K + + Sbjct: 362 E-ELAYNLR--MAFEDRR----VRITRDPLLRADLRGIKKEVTTS 399 >gi|223940405|ref|ZP_03632258.1| protein of unknown function DUF264 [bacterium Ellin514] gi|223890900|gb|EEF57408.1| protein of unknown function DUF264 [bacterium Ellin514] Length = 447 Score = 62.0 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 49/345 (14%), Positives = 100/345 (28%), Gaps = 48/345 (13%) Query: 137 NKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS-EERPDTFVGHHNTYGMAI 195 K W ++ + H ++T R YS P+ G + Sbjct: 71 CKAWAQLLDVVAHDLGEIIFDREKKFSAYVLEFATKLRIYSLSSNPNALAGKRGH----V 126 Query: 196 INDEASGTPDV--INLGILGFLTERNA-NRFWIMTSNPRRLSGKFYEIFNKPLDD-WKRF 251 I DE + D + T +G ++I + W Sbjct: 127 ILDEFALHGDQRMLYRIAKPVTTWGGQLEIISTHRGVGTVFNGIIHDIHQRGNPMGWSHH 186 Query: 252 QIDTRTVEGIDPSFHEGIIARYGL--DSDVTRVEVCGQ-------------FPQQDIDSF 296 ++ + I+ E I + G + V + P + F Sbjct: 187 KVTLQEA--IEQGVVERINEKTGEAESREGYLARVRAECLDEEQWLQEYCCVPADESSVF 244 Query: 297 IPLNIIEEALNREPCPDPY-----APLIMGCDIAEEGGDNTVVVLRRGPVIEHL------ 345 I ++I+ + Y PL +G D+ + D +V+ G + + Sbjct: 245 IGYDLIDACEDDCLKDFEYLRKCENPLYLGFDVGRK-RDLSVI--DVGEKVGDVMWDRMR 301 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE-MLGYHVYRVLGQKRAV 404 + + ++ L+E + IDA G + + + G+ V V Sbjct: 302 IELAGKTFSEQEAELYRLLELPKLKRACIDATGLGMQLAERAKYRFGWKVEAVTFTGHVK 361 Query: 405 DLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPN 449 + E N R + + + L +L+ +K + + Sbjct: 362 E-ELAYNLR--MAFEDRR----VRITRDPLLRADLRGIKKEVTTS 399 >gi|146310462|ref|YP_001175536.1| hypothetical protein Ent638_0800 [Enterobacter sp. 638] gi|145317338|gb|ABP59485.1| conserved hypothetical protein [Enterobacter sp. 638] Length = 445 Score = 62.0 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 43/290 (14%), Positives = 83/290 (28%), Gaps = 28/290 (9%) Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L F W + ++ + + + + Sbjct: 143 ALFWKARKFVETLPVEFRGSWDEKKHAPYMRVEFPDTGAVIKGEAGDNIGR-----GDRT 197 Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 DEA+ + I L++ R I S+ +S F + + F Sbjct: 198 TLYFVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMSNPFAQ--KRHSGKIPVFT 251 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 R+ D ++ + +D+ V E+ + IP ++ A++ Sbjct: 252 FHWRSDPRKDNEWYRKECEK--IDNPVIVAQELDLNYQASAEGILIPSEWVQAAVDAHIK 309 Query: 312 --PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS--GLVEKY 367 P + DIA+EG D R G +++ + +WS + + G + Y Sbjct: 310 LGIQPSGQRLGSMDIADEGKDKNGFSSRYGFLLQSVHEWSGEGSDIYASVVKSFGYCDDY 369 Query: 368 RPDAIIIDANNTGAR------TCDYLEML----GYHVYRVLGQKRAVDLE 407 D D + GA + L G D E Sbjct: 370 GLDEFRFDEDGLGAGARGDARVINELRQAEGRGTIAATPFRGSGSVFDPE 419 >gi|145636853|ref|ZP_01792518.1| terminase, ATPase subunit [Haemophilus influenzae PittHH] gi|145269934|gb|EDK09872.1| terminase, ATPase subunit [Haemophilus influenzae PittHH] Length = 593 Score = 61.7 bits (148), Expect = 3e-07, Method: Composition-based stats. Identities = 54/375 (14%), Positives = 116/375 (30%), Gaps = 70/375 (18%) Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 K + +S ++ A +DV I + + + + T +H Sbjct: 200 ASKKQALQFRSYIVNYAKQTADVDLKGETIKLPNGAEL--IFLGTNSATAQSYHGN---- 253 Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247 + DE P DV+ G ++ + S P ++ Y + FN+ Sbjct: 254 LYFDEVFWVPKFDVMRKVASGMAAQKMYRQT--YFSTPTTIAHPAYAFFSGKAFNRNRTK 311 Query: 248 WKRFQIDT------------------------RTVEGIDPSFHEGIIARYGLDSDVTRVE 283 ++ +ID G + + +IA + Sbjct: 312 SEKIEIDISHENLKSGKLCADRQWKQIVSIYDAMEGGCNLFNIDDLIAENSKEE--FEQL 369 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV 333 QF + +F ++ ++ Y P + +G D A G + Sbjct: 370 FLCQFADDNSSAFKFSDLQLCQVDSLEEWHDYKPFYQRPFGNREVWLGYDPAFTGDRAAL 429 Query: 334 VVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387 V++ + H + D T ++I + Y I+ID G+ + Sbjct: 430 VIVAPPKVEGGDYRVLHKQTFHGMDYETQASRIKQFCDDYNVTRIVIDKTGMGSGVYQEV 489 Query: 388 EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL---INHSGLIQNLKSLKS 444 A LE+ + + E+ +K + ++ L + ++ + ++K Sbjct: 490 R---------KFYPMAQGLEYNADLKNEMVLKTQNLIQKRRLKFDSGDNDIVSSFMTVKK 540 Query: 445 FIVPNTGELAIESKR 459 + TG++ S R Sbjct: 541 -RITGTGKITYVSDR 554 >gi|53802921|ref|YP_115325.1| prophage MuMc02, terminase, ATPase subunit [Methylococcus capsulatus str. Bath] gi|53756682|gb|AAU90973.1| putative prophage MuMc02, terminase, ATPase subunit [Methylococcus capsulatus str. Bath] Length = 443 Score = 61.7 bits (148), Expect = 3e-07, Method: Composition-based stats. Identities = 51/276 (18%), Positives = 80/276 (28%), Gaps = 25/276 (9%) Query: 180 RPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 PDT G + ++ DE + D I + ++ + S P KF Sbjct: 114 NPDTARGFTAS----VLLDEFAFHADSRKIWQALFPVVSRSDLKLRV--ISTPNGKGNKF 167 Query: 238 YEIFNKPLDDWKRFQIDTR-TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDID-- 294 Y++ W R D V P E + A G D D E Q+ + Sbjct: 168 YDLITGDHPVWSRHVTDIYQAVADGLPRDIEELKAGVG-DDDAWAQEYELQWLDEASAWL 226 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVV----VLRRGPVIEHLFDWSK 350 SF +N +E P P +G DIA D V+ + + + Sbjct: 227 SFELINSVEHDHAGIPEHYAGGPCFLGVDIAARN-DLFVIWVLEAVGDVYWTREILARRR 285 Query: 351 TDLRTTNNKISGLVEKYRPDAIIIDANNTG-ARTCDYLEMLGYH-VYRVLGQKRAVDLEF 408 + ++ +YR +D G D G V VL + Sbjct: 286 ISFAEQDALLADAFNRYRVIRCCMDQTGMGEKPVEDAQRRFGSSRVEGVLFTG--PNKLA 343 Query: 409 CRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 E + + L +L LK Sbjct: 344 LATTGKEAFEDRRIRIPEG----NQELRNDLHKLKK 375 >gi|145630909|ref|ZP_01786686.1| terminase, ATPase subunit [Haemophilus influenzae R3021] gi|144983569|gb|EDJ91037.1| terminase, ATPase subunit [Haemophilus influenzae R3021] Length = 593 Score = 61.7 bits (148), Expect = 3e-07, Method: Composition-based stats. Identities = 54/375 (14%), Positives = 116/375 (30%), Gaps = 70/375 (18%) Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 K + +S ++ A +DV I + + + + T +H Sbjct: 200 ASKKQALQFRSYIVNYAKQTADVDLKGETIKLPNGAEL--IFLGTNSATAQSYHGN---- 253 Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247 + DE P DV+ G ++ + S P ++ Y + FN+ Sbjct: 254 LYFDEVFWVPKFDVMRKVASGMAAQKMYRQT--YFSTPTTIAHPAYAFFSGKAFNRNRAK 311 Query: 248 WKRFQIDT------------------------RTVEGIDPSFHEGIIARYGLDSDVTRVE 283 ++ +ID G + + +IA + Sbjct: 312 SEKIEIDISHENLKSGKLCADRQWKQIVSIYDAMEGGCNLFNIDDLIAENSKEE--FEQL 369 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV 333 QF + +F ++ ++ Y P + +G D A G + Sbjct: 370 FLCQFADDNSSAFKFSDLQLCQVDSLEEWHDYKPFYQRPFGNREVWLGYDPAFTGDRAAL 429 Query: 334 VVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387 V++ + H + D T ++I + Y I+ID G+ + Sbjct: 430 VIVAPPKVEGGDYRVLHKQTFHGMDYETQASRIKQFCDDYNVTRIVIDKTGMGSGVYQEV 489 Query: 388 EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL---INHSGLIQNLKSLKS 444 A LE+ + + E+ +K + ++ L + ++ + ++K Sbjct: 490 R---------KFYPMAQGLEYNADLKNEMVLKTQNLIQKRRLKFDSGDNDIVSSFMTVKK 540 Query: 445 FIVPNTGELAIESKR 459 + TG++ S R Sbjct: 541 -RITGTGKITYVSDR 554 >gi|300723941|ref|YP_003713254.1| Terminase, ATPase subunit [Xenorhabdus nematophila ATCC 19061] gi|297630471|emb|CBJ91136.1| Terminase, ATPase subunit (GpP) [Xenorhabdus nematophila ATCC 19061] Length = 573 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 30/144 (20%), Positives = 54/144 (37%), Gaps = 23/144 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE-----------PCPDP 314 + + +Y D + + +F DI+S L +++ + P Sbjct: 333 IDRLRRQY--SPDEYQNLLMCEF-MDDIESIFSLQLMQGCMVDSWEIWHDVQPLMLRPYG 389 Query: 315 YAPLIMGCDIAEEG--GDNT---VVVLRR--GPVIEHL--FDWSKTDLRTTNNKISGLVE 365 Y P+ +G D A+ G GD+ V+ + G L W D R ++ I L E Sbjct: 390 YHPVWIGYDPAKGGENGDSAGCVVIAPPQVPGGKFRILERHQWRGMDFRAQSDAIRQLTE 449 Query: 366 KYRPDAIIIDANNTGARTCDYLEM 389 +Y + I ID+ G ++ Sbjct: 450 QYNVEYIGIDSTGIGHGVYQNVKE 473 >gi|120602517|ref|YP_966917.1| hypothetical protein Dvul_1472 [Desulfovibrio vulgaris DP4] gi|120562746|gb|ABM28490.1| protein of unknown function DUF264 [Desulfovibrio vulgaris DP4] Length = 599 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 57/197 (28%), Gaps = 28/197 (14%) Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-- 306 + G D + Y + R +F F L +E + Sbjct: 346 NIITLADAEAGGCDLFDVAQLKLEY--TPEEFRQLFGCEFIDDTQGVF-RLAQLEACMVD 402 Query: 307 --------NREPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTD 352 +P P P+ G D A G D + VL R G I + W Sbjct: 403 PADWQDVRQGDPHPVGNLPVWGGYDPARSGDDASFAVLLPDLRDGGGIRCIERHKWKGRS 462 Query: 353 LRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNR 412 +I L EKYR + ID G + ++ A + + Sbjct: 463 YLWQAERIRELAEKYRFAHLGIDTTGPGIGVFEQVQQ---------FCPVATPINYGVQS 513 Query: 413 RTELHVKMADWLEFASL 429 + L +K + +E L Sbjct: 514 KAMLVLKAREVIEEGRL 530 >gi|120603805|ref|YP_968205.1| hypothetical protein Dvul_2767 [Desulfovibrio vulgaris DP4] gi|120564034|gb|ABM29778.1| protein of unknown function DUF264 [Desulfovibrio vulgaris DP4] Length = 599 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 57/197 (28%), Gaps = 28/197 (14%) Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-- 306 + G D + Y + R +F F L +E + Sbjct: 346 NIITLADAEAGGCDLFDVAQLKLEY--TPEEFRQLFGCEFIDDTQGVF-RLAQLEACMVD 402 Query: 307 --------NREPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTD 352 +P P P+ G D A G D + VL R G I + W Sbjct: 403 PADWQDVRQGDPHPVGNLPVWGGYDPARSGDDASFAVLLPDLRDGGGIRCIERHKWKGRS 462 Query: 353 LRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNR 412 +I L EKYR + ID G + ++ A + + Sbjct: 463 YLWQAERIRELAEKYRFAHLGIDTTGPGIGVFEQVQQ---------FCPVATPINYGVQS 513 Query: 413 RTELHVKMADWLEFASL 429 + L +K + +E L Sbjct: 514 KAMLVLKAREVIEEGRL 530 >gi|302339289|ref|YP_003804495.1| hypothetical protein Spirs_2798 [Spirochaeta smaragdinae DSM 11293] gi|301636474|gb|ADK81901.1| conserved hypothetical protein [Spirochaeta smaragdinae DSM 11293] Length = 295 Score = 61.3 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 49/257 (19%), Positives = 85/257 (33%), Gaps = 45/257 (17%) Query: 85 GRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144 R GK+T+ A G +I ++ + Q K L +V +++L + + Sbjct: 53 CRQAGKSTVIAAKAAHKAKFFSGSLIILVSPALRQSKE-LMRKVEDFIALDKSFPPASEE 111 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204 L + SE+ G II DEAS P Sbjct: 112 DNQLTKE-------------FKNRSRIVALPGSEKTIRGLSGP-----TLIIIDEASRIP 153 Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDP- 263 D + I + + ++ + P G FY+ +++ W + ++ R + G P Sbjct: 154 DELYKAIRPMMAGADTE--LVLMTTPFGKRGVFYDAWSRSK-RWTKIEVVGRDILGRFPN 210 Query: 264 -------SFHEGIIARYGLDSDV--------------TRVEVCGQFPQQDIDSFIPLNII 302 +GI A Y V R E G+F IDS + + Sbjct: 211 EQVYAQLRRKDGIKACYSPRHSVEFLGEELEEMGEWWYRQEYGGEFMDP-IDSVFNMEDV 269 Query: 303 EEALNREPCPDPYAPLI 319 A+ + +AP+I Sbjct: 270 RAAIINDTPAISFAPII 286 >gi|273810556|ref|YP_003344937.1| gp2 [Sodalis phage SO-1] gi|258619841|gb|ACV84094.1| gp2 [Sodalis phage SO-1] Length = 461 Score = 61.3 bits (147), Expect = 4e-07, Method: Composition-based stats. Identities = 69/335 (20%), Positives = 120/335 (35%), Gaps = 46/335 (13%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 +G G GK+ + A V+ L++ PG I + L ++ E+ K + F Sbjct: 58 SGFGGGKSWVAARKVIQLLTLNPGHDGIVTEPTIPLLVKIMYPELEKAFDEAGFRWKFNK 117 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 Q D ++ L + K +C S E +G + + I+ DE T Sbjct: 118 Q-----------DKIYSVL-VKGKWTRVICE--SMENYTRLIGVNAAW---IVADEFDTT 160 Query: 204 PDVINLGILGFLTER---NANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR-FQIDTRTVE 259 + L L R R +++ S P Y+IF D KR + T Sbjct: 161 KQDVALAAYHKLLGRLRAGFVRQFVIVSTPEGYRAM-YQIFEVEKDSQKRLIRAKTTDNH 219 Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI 319 + F + + ++Y +++ + G F + + EE + E P LI Sbjct: 220 HLPADFIDTLRSQY--PANLIDAYLNGLFVNLTSGAVYKMFNREENASTEEVQ-PEDTLI 276 Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-------TDLRTTNNKISGLVEKYR---- 368 +G D VV +RR + E+ + DL T I + E+Y Sbjct: 277 IGMDFNVTKM-AAVVYVRRQRITENKEFLDEIHAVDEFVDLFDTPAMIEAIEERYPDHCA 335 Query: 369 PDAIII--DANN-----TGARTCD--YLEMLGYHV 394 +++ D++ A + D LE G+ V Sbjct: 336 AGRVVVYPDSSGKSRKTVNASSSDIAQLEDAGFEV 370 >gi|330874284|gb|EGH08433.1| hypothetical protein PSYMP_06646 [Pseudomonas syringae pv. morsprunorum str. M302280PT] Length = 684 Score = 61.3 bits (147), Expect = 4e-07, Method: Composition-based stats. Identities = 54/306 (17%), Positives = 86/306 (28%), Gaps = 56/306 (18%) Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT-YSEERPDTFVGHHN 189 +LS + + W+ L + + SK + T GHH Sbjct: 206 FLSASRAQSEIFRSYIIAFAQAWFGLELTGNPIVLSKDGKPWAELRFLSTNSSTAQGHHG 265 Query: 190 TYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFN 242 + DE D +N T + + S P +S + Y E F Sbjct: 266 H----VYVDEYFWIRDFEKLNTVASAMATHKKWRKT--YFSTPSAVSHQAYPFWQGEKFR 319 Query: 243 K----------------------PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDV 279 P W++ I G D E + Y D D Sbjct: 320 NSKRKNAKEPWPSDKQISAGALCPDGQWRKVITILDAIAGGCDLFDLEQLQLEY--DDDK 377 Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNR----------EPCPDPYAPLIMGCDIAEEGG 329 + +F +F L +E + +P P +P+ +G D + Sbjct: 378 FQQLFMCKFIDSSQSAF-SLADLERCYSDLSLWADFDPDDPRPYGNSPVWIGYDPSRTRD 436 Query: 330 DNTVVV----LRRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGART 383 D T VV L G L W + ++ L E++ I ID G Sbjct: 437 DATCVVIAPPLENGGKFRILEKHSWRGQSFKYQAEQVKKLTERFNVQHIGIDTTGIGYGV 496 Query: 384 CDYLEM 389 D + Sbjct: 497 FDLVRD 502 >gi|301386048|ref|ZP_07234466.1| hypothetical protein PsyrptM_25573 [Pseudomonas syringae pv. tomato Max13] gi|302060830|ref|ZP_07252371.1| hypothetical protein PsyrptK_12639 [Pseudomonas syringae pv. tomato K40] gi|302129770|ref|ZP_07255760.1| hypothetical protein PsyrptN_00140 [Pseudomonas syringae pv. tomato NCPPB 1108] Length = 684 Score = 61.3 bits (147), Expect = 4e-07, Method: Composition-based stats. Identities = 54/306 (17%), Positives = 86/306 (28%), Gaps = 56/306 (18%) Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT-YSEERPDTFVGHHN 189 +LS + + W+ L + + SK + T GHH Sbjct: 206 FLSASRAQSEIFRSYIIAFAQAWFGLELTGNPIVLSKDGKPWAELRFLSTNSSTAQGHHG 265 Query: 190 TYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFN 242 + DE D +N T + + S P +S + Y E F Sbjct: 266 H----VYVDEYFWIRDFEKLNTVASAMATHKKWRKT--YFSTPSAVSHQAYPFWQGEKFR 319 Query: 243 K----------------------PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDV 279 P W++ I G D E + Y D D Sbjct: 320 NSKRKAAKDPWPSDKQISAGALCPDGQWRKVITILDAIAGGCDLFDLEQLQLEY--DDDK 377 Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNR----------EPCPDPYAPLIMGCDIAEEGG 329 + +F +F L +E + +P P +P+ +G D + Sbjct: 378 FQQLFMCKFIDSSQSAF-SLADLERCYSDLSLWADFDPDDPRPYGNSPVWIGYDPSRTRD 436 Query: 330 DNTVVV----LRRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGART 383 D T VV L G L W + ++ L E++ I ID G Sbjct: 437 DATCVVIAPPLENGGKFRILEKHSWRGQSFKYQAEQVKKLTERFNVQHIGIDTTGIGYGV 496 Query: 384 CDYLEM 389 D + Sbjct: 497 FDLVRD 502 >gi|152973346|ref|YP_001337126.1| putative prophage large terminase protein [Klebsiella pneumoniae subsp. pneumoniae MGH 78578] gi|150958195|gb|ABR80225.1| putative prophage large terminase protein [Klebsiella pneumoniae subsp. pneumoniae MGH 78578] Length = 589 Score = 61.3 bits (147), Expect = 4e-07, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 58/207 (28%), Gaps = 30/207 (14%) Query: 244 PLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W++ I+ G E + +D R +F S P + Sbjct: 328 PDGQWRQIVTIEDALAGGCTLFNLEQLKRENSVDD--FRNLFMCEFVDDKA-SVFPFEDL 384 Query: 303 EEALNREPCPDPY-----------APLIMGCDIAEEGGDNTVVVL----RRGPVIEHL-- 345 + + P+ +G D + G VVL G L Sbjct: 385 QRCMVDSLEEWEDFAPFADNPFGSRPVWVGYDPSHSGDSAGCVVLAPPVVAGGKFRILER 444 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405 W D T I L EKY + I IDA G + A D Sbjct: 445 HQWKGMDFATQAESIRQLTEKYNVEYIGIDATGLGIGVFQLVR---------SFYPAARD 495 Query: 406 LEFCRNRRTELHVKMADWLEFASLINH 432 + + +T + +K D + L Sbjct: 496 IRYTPEMKTAMVLKAKDVIRRGCLEYD 522 >gi|296141561|ref|YP_003648804.1| terminase [Tsukamurella paurometabola DSM 20162] gi|296029695|gb|ADG80465.1| Terminase [Tsukamurella paurometabola DSM 20162] Length = 489 Score = 61.3 bits (147), Expect = 4e-07, Method: Composition-based stats. Identities = 77/407 (18%), Positives = 115/407 (28%), Gaps = 74/407 (18%) Query: 28 FSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRG 87 F F F KGT +G R WQ++ V +V P RG Sbjct: 27 FLAFADKFLR-VPKGTGAKGKLHLRDWQVDVARDVLDSGARTVGIMFP----------RG 75 Query: 88 IGKTTLNAWLVLWLMST-RPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSL 146 GKTTLNA + L+ T G +V +A E Q L+ + E+ Sbjct: 76 QGKTTLNAAIALYRFFTGGEGANVCVVAVDERQAG----------LAFSAARRMVELNEE 125 Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDV 206 + D L+ + + C S P G + + DEA Sbjct: 126 LSARCQIFKDRLY----LPTTDSVFQCLPAS---PTALEGL---DYVLALVDEAGVVNRD 175 Query: 207 INLGILGFLTERNANRFWIMTSNPRRLSG--------KFYEIFNKPLD-DWKRFQI---- 253 + + + + P ++ W+ F Sbjct: 176 VFEVVQLA-QGKREKSVLVAIGTPGPNLDDQVLLSLRDYHLEHPDDASLRWREFSAAGFE 234 Query: 254 -----DTRTVEGIDPSFHEGIIAR--------YGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 T E +P+ + + +S R QF SF+P Sbjct: 235 DHPVDCTHCWELANPALDDFLHRDALVALLPPKTRESTFRRAR-LCQFAADTEGSFLPAG 293 Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRR---GPVIEHLFDWSKTDLR--- 354 + E EP P A +++ D D T ++L P L W + Sbjct: 294 VWEGLSTGEPVP-LGAEVVIALD-GSFSDDTTALLLGTVAAAPHFHPLRVWERPADNDDW 351 Query: 355 -----TTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYR 396 N I Y+ II D RT LE G V Sbjct: 352 RVPVLEVENTIRQACRDYQVVEIIADPFRW-TRTLQVLEQEGLPVVE 397 >gi|114046227|ref|YP_736777.1| hypothetical protein Shewmr7_0720 [Shewanella sp. MR-7] gi|113887669|gb|ABI41720.1| protein of unknown function DUF264 [Shewanella sp. MR-7] Length = 602 Score = 61.3 bits (147), Expect = 4e-07, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 54/164 (32%), Gaps = 20/164 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ I+ G D + + Y + D F D DS + + Sbjct: 342 PDKQWRYVVTIEDALAGGCDLFDIDELREEY--NGDDFNNLFMCIFVD-DADSVFKFSDL 398 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD- 347 E+ + + P + +G D + + T+VV+ ++G L Sbjct: 399 EKCMVDAARWQDHKPAAPRPFGNREVWLGYDPSRTRDNATLVVVAPGEKKGEKFRVLEKH 458 Query: 348 -WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390 W + +I + KYR I +D GA D + L Sbjct: 459 YWRGMNFSHHVAEIQKIYAKYRVTYIGVDTTGIGAGVFDSISTL 502 >gi|291334706|gb|ADD94352.1| hypothetical protein Ddes_0719 [uncultured phage MedDCM-OCT-S04-C890] Length = 311 Score = 60.9 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 37/266 (13%), Positives = 81/266 (30%), Gaps = 32/266 (12%) Query: 107 GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166 +A + Q K+ W + ++ + +PN + E + P +L Sbjct: 6 NPRFAYIAPTFKQAKSIAWDYMKQFTAKIPNTKFNETELRVDLPNGSRITLLG------- 58 Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVIN-LGILGFLTERNANRFWI 225 E D G + + DE + + I L++R + + Sbjct: 59 -----------AENSDGLRGIYLDGC---VIDEYANIDGKLFAEIIRPALSDR--KGYCV 102 Query: 226 MTSNPRRLSGKFYEIFNKPLD--DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283 P ++ FY+++ DW ++ + +DP E G E Sbjct: 103 FIGTPAGMNNNFYDLYQHANGAEDWFNYKAKASDTKIVDPEELEKAKEVMGEKK--YLQE 160 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE---GGDNTVVVLRRGP 340 + + I + + PY P + A + ++++ ++ Sbjct: 161 FECDWIANIEGAIYGEEIAKIEDKNQIARVPYDP-TLPVSTAWDLGVADHSSIIFFQQKG 219 Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEK 366 + D+ + + I L EK Sbjct: 220 TGVQIIDYHEERGHGLPHYIQMLEEK 245 >gi|330830158|ref|YP_004393110.1| phage-related terminase [Aeromonas veronii B565] gi|328805294|gb|AEB50493.1| Phage-related terminase [Aeromonas veronii B565] Length = 588 Score = 60.9 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 34/210 (16%), Positives = 67/210 (31%), Gaps = 35/210 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-NREPCPDPYAP------- 317 + + + Y D R + +F S PL ++ + + + Y P Sbjct: 349 LDQLRSEYSEDE--YRNLLMCEFMDDTE-SLFPLATLQRCMVDSWLVWEDYKPHTLRPLA 405 Query: 318 ---LIMGCDIAEEGGDNTV--------VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEK 366 + +G D A+ G ++ +V + W D I + ++ Sbjct: 406 NRAVWIGYDPAKGGKGDSAGCAVLAPPLVPGGKFRVLERHRWQGMDFDAQAKSIRAICDR 465 Query: 367 YRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF 426 Y I ID G ++ +++ N + + +K D + Sbjct: 466 YNVAYIGIDTTGIGEGVYQLVKQ---------FYPAVTAIQYNPNVKMRMVMKAQDVMNK 516 Query: 427 ASLINHSG---LIQNLKSLKSFIVPNTGEL 453 L SG L Q S++ V +G+L Sbjct: 517 GRLEFDSGWTDLAQAFMSIRR-AVTQSGKL 545 >gi|330939345|gb|EGH42730.1| hypothetical protein PSYPI_10145 [Pseudomonas syringae pv. pisi str. 1704B] Length = 650 Score = 60.9 bits (146), Expect = 4e-07, Method: Composition-based stats. Identities = 54/306 (17%), Positives = 86/306 (28%), Gaps = 56/306 (18%) Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT-YSEERPDTFVGHHN 189 +LS + + W+ L + + SK + T GHH Sbjct: 206 FLSASRAQSEIFRSYIIAFAQAWFGLELTGNPIVLSKDGKPWAELRFLSTNSSTAQGHHG 265 Query: 190 TYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFN 242 + DE D +N T + + S P +S + Y E F Sbjct: 266 H----VYVDEYFWIRDFEKLNTVASAMATHKKWRKT--YFSTPSAVSHQAYPFWQGEKFR 319 Query: 243 K----------------------PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDV 279 P W++ I G D E + Y D D Sbjct: 320 NSKRKAAKDPWPSDKQISAGALCPDGQWRKVITILDAIAGGCDLFDLEQLQLEY--DDDK 377 Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNR----------EPCPDPYAPLIMGCDIAEEGG 329 + +F +F L +E + +P P +P+ +G D + Sbjct: 378 FQQLFMCKFIDSSQSAF-SLADLERCYSDLSLWADFDPDDPRPYGNSPVWIGYDPSRTRD 436 Query: 330 DNTVVV----LRRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGART 383 D T VV L G L W + ++ L E++ I ID G Sbjct: 437 DATCVVIAPPLENGGKFRILEKHSWRGQSFKYQAEQVKKLTERFNVQHIGIDTTGIGYGV 496 Query: 384 CDYLEM 389 D + Sbjct: 497 FDLVRD 502 >gi|330985172|gb|EGH83275.1| hypothetical protein PLA107_09108 [Pseudomonas syringae pv. lachrymans str. M301315] Length = 684 Score = 60.9 bits (146), Expect = 5e-07, Method: Composition-based stats. Identities = 54/306 (17%), Positives = 86/306 (28%), Gaps = 56/306 (18%) Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT-YSEERPDTFVGHHN 189 +LS + + W+ L + + SK + T GHH Sbjct: 206 FLSASRAQSEIFRSYIIAFAQAWFGLELTGNPIVLSKDGKPWAELRFLSTNSSTAQGHHG 265 Query: 190 TYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFN 242 + DE D +N T + + S P +S + Y E F Sbjct: 266 H----VYVDEYFWIRDFEKLNTVASAMATHKKWRKT--YFSTPSAVSHQAYPFWQGEKFR 319 Query: 243 K----------------------PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDV 279 P W++ I G D E + Y D D Sbjct: 320 NSKRKAAKDPWPSDKQISAGALCPDGQWRKVITILDAIAGGCDLFDLEQLQLEY--DDDK 377 Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNR----------EPCPDPYAPLIMGCDIAEEGG 329 + +F +F L +E + +P P +P+ +G D + Sbjct: 378 FQQLFMCKFIDSSQSAF-SLADLERCYSDLSLWADFDPDDPRPYGNSPVWIGYDPSRTRD 436 Query: 330 DNTVVV----LRRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGART 383 D T VV L G L W + ++ L E++ I ID G Sbjct: 437 DATCVVIAPPLENGGKFRILEKHSWRGQSFKYQAEQVKKLTERFNVQHIGIDTTGIGYGV 496 Query: 384 CDYLEM 389 D + Sbjct: 497 FDLVRD 502 >gi|331017153|gb|EGH97209.1| hypothetical protein PLA106_13994 [Pseudomonas syringae pv. lachrymans str. M302278PT] Length = 684 Score = 60.9 bits (146), Expect = 5e-07, Method: Composition-based stats. Identities = 54/306 (17%), Positives = 86/306 (28%), Gaps = 56/306 (18%) Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT-YSEERPDTFVGHHN 189 +LS + + W+ L + + SK + T GHH Sbjct: 206 FLSASRAQSEIFRSYIIAFAQAWFGLELTGNPIVLSKDGKPWAELRFLSTNSSTAQGHHG 265 Query: 190 TYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFN 242 + DE D +N T + + S P +S + Y E F Sbjct: 266 H----VYVDEYFWIRDFEKLNTVASAMATHKKWRKT--YFSTPSAVSHQAYPFWQGEKFR 319 Query: 243 K----------------------PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDV 279 P W++ I G D E + Y D D Sbjct: 320 NSKRKAAKDPWPSDKQISAGALCPDGQWRKVITILDAIAGGCDLFDLEQLQLEY--DDDK 377 Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNR----------EPCPDPYAPLIMGCDIAEEGG 329 + +F +F L +E + +P P +P+ +G D + Sbjct: 378 FQQLFMCKFIDSSQSAF-SLADLERCYSDLSLWADFDPDDPRPYGNSPVWIGYDPSRTRD 436 Query: 330 DNTVVV----LRRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGART 383 D T VV L G L W + ++ L E++ I ID G Sbjct: 437 DATCVVIAPPLENGGKFRILEKHSWRGQSFKYQAEQVKKLTERFNVQHIGIDTTGIGYGV 496 Query: 384 CDYLEM 389 D + Sbjct: 497 FDLVRD 502 >gi|190890121|ref|YP_001976663.1| hypothetical protein RHECIAT_CH0000492 [Rhizobium etli CIAT 652] gi|190695400|gb|ACE89485.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 465 Score = 60.9 bits (146), Expect = 5e-07, Method: Composition-based stats. Identities = 58/406 (14%), Positives = 128/406 (31%), Gaps = 61/406 (15%) Query: 85 GRGIGKTTLNAWLVLWLMSTRPG---------ISVICLANSETQLKTTLWAEVSKWLSLL 135 GR GK+ A + ++L +V+ +A Q + L V ++L Sbjct: 68 GRRGGKSFTMALIAVFLACFFDYRQYLAPGERATVLVIATDRRQARVIL-RYVR---AML 123 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 N + + D +S ++ R T+ Sbjct: 124 DNIPLLQAMVERDTADSFDLD--------NSTTIEVGTASFRSTRGYTYAAVLCDELAFW 175 Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT 255 D+A+ I I + N + S+P G ++ F + + Sbjct: 176 RTDDAAEPDYAILDAIRPGMASI-PNSMLLCASSPHARRGALWDAFKRFWGKDDAPLVWR 234 Query: 256 RTVEGIDPSFHEGIIAR-YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR---EPC 311 ++P+ + ++ R D E +F + DI+ F+ + ++E+ ++R E Sbjct: 235 AATREMNPTISQSVVDRALERDHASAMAEYGAEF-RSDIEQFVNIEVVEDCVSRGVYERA 293 Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-----TDLRTTNNKISGLVEK 366 P P D + D+ + + ++ D + + + + + K Sbjct: 294 PLPNIRYRAFVDPSGGSNDSMTLAIGHKEGERNILDCVRERKPPFSPESVVAEFADTLAK 353 Query: 367 YRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF 426 YR + + R +K+ + + R++L+ M L Sbjct: 354 YRVREV-------------EGDRYAGEWPREQFRKKGITYKIAEKPRSDLYRDMLPLLNS 400 Query: 427 A--SLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAK-STDYS 469 L++ L+ + +E + +G K S D++ Sbjct: 401 GVADLLDSDRLVTQIVG-------------LERRVSRGGKESIDHA 433 >gi|83943081|ref|ZP_00955541.1| hypothetical protein EE36_12908 [Sulfitobacter sp. EE-36] gi|83846089|gb|EAP83966.1| hypothetical protein EE36_12908 [Sulfitobacter sp. EE-36] Length = 259 Score = 60.9 bits (146), Expect = 5e-07, Method: Composition-based stats. Identities = 45/260 (17%), Positives = 75/260 (28%), Gaps = 40/260 (15%) Query: 49 SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108 P +WQ++ + + +N + +GR GK+T L G Sbjct: 30 GEPDAWQVDLLRS------DPRSNEADRMILAL--SGRQSGKSTTAGGLG--YDDFSRGK 79 Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLP-NKHWFEMQSLSLHPAPWYSDVLHCSLGIDSK 167 +VI A S Q T L+ + ++ + P L P + + D Sbjct: 80 TVILTAPSLRQ-STELFRRILEYKNTDPFCPPIVRQTQTELEAHPRHGGRIIVVPATDQ- 137 Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMT 227 R + + II DEA D E + Sbjct: 138 -----ARGMTAD--------------TIIADEACFLDDDALTAFFPMRKETG---RIFLL 175 Query: 228 SNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQ 287 S P G FYE + +R + + + E A + R E + Sbjct: 176 STPNMRQGYFYETWTSAKRV-RRITARSIDIPR-RKAQVEFDKAT--MSEATFRREHLCE 231 Query: 288 FPQQDIDSFIPLNIIEEALN 307 F + +E+A N Sbjct: 232 FI-GAGTPLVSWEALEKASN 250 >gi|332185581|ref|ZP_08387329.1| terminase-like family protein [Sphingomonas sp. S17] gi|332014559|gb|EGI56616.1| terminase-like family protein [Sphingomonas sp. S17] Length = 436 Score = 60.9 bits (146), Expect = 5e-07, Method: Composition-based stats. Identities = 68/409 (16%), Positives = 134/409 (32%), Gaps = 58/409 (14%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141 I AGRG GKT A V L PG + + + ++ + + + Sbjct: 60 IRAGRGFGKTRAGAEWVSALARDNPGARIALMGATLRDVERVM----------VRGESGL 109 Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVG--HHNTY--GMAIIN 197 + W + + ++ YS P+ G HH + + Sbjct: 110 LAVARKGEAPKWIGSLGQVHFTSGAIGFA-----YSAAAPEALRGPQHHAAWCDELGKWK 164 Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRT 257 EA G +++ LG + ++T+ PR + K + + RT Sbjct: 165 GEA-GWDNLMMTLRLG------EHPRVLVTTTPRATP-----LMRKVMALPDCVETIGRT 212 Query: 258 VE--GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315 + + SF + ++++YG D+ + R E+ G+ + +++ R Sbjct: 213 SDNAHLPDSFQDAMLSQYG-DTRLGRQELDGEMVDDREGALWTRALLDR--QRVKTVPAL 269 Query: 316 APLIMGCD-IAEEGGDNTVVV---LRRGPVIEHLFDWSKTDLRTT--NNKISGLVEKYRP 369 +++G D A GD +V L R L D S+ L +++G + R Sbjct: 270 DRVVVGVDPPATSSGDACGIVAVGLGRDGHGYVLEDASEAGLSPEGWAARVAGCARRNRA 329 Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL 429 D ++ + N G + + L V ++ + L+ + W Sbjct: 330 DRVVAERNQ-GGDMVESVLRLADPTLPVHLVYASIGKAARAEPVSFLYAQGRVW-HARGF 387 Query: 430 INHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478 + L ++ P S D +D L++ E Sbjct: 388 PALEDELCGLGVAGAYDGP--------------GHSPDRADALVWALTE 422 >gi|320172719|gb|EFW47954.1| Phage terminase, ATPase subunit [Shigella dysenteriae CDC 74-1112] Length = 590 Score = 60.9 bits (146), Expect = 5e-07, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 61/209 (29%), Gaps = 33/209 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGELA 454 + + ++ S + ++G +A Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGRIA 546 >gi|239502629|ref|ZP_04661939.1| hypothetical protein AbauAB_09982 [Acinetobacter baumannii AB900] Length = 414 Score = 60.9 bits (146), Expect = 5e-07, Method: Composition-based stats. Identities = 49/307 (15%), Positives = 98/307 (31%), Gaps = 38/307 (12%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141 + AGR GKT+L+ L++ S +P + +A + K +W ++ Sbjct: 26 VVAGRRWGKTSLSRTLII-SKSRKPRQRIWYVAPTYRMAKQIMWKDL------------- 71 Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201 + P W + H SL I+ + T+ + PD+ G ++ DE Sbjct: 72 ----IEAIPRKWVVKINHSSLSIELVN-GTLIELKGADDPDSLRGVGID---FLVLDEFQ 123 Query: 202 GTPDVINL-GILGFLTERNANRFWIMTSNPRRLSGKF------YEIFNKPLDDWKRFQID 254 + + L + + P+ + + + W+ +Q Sbjct: 124 DISEEAWTQCLRPTLASTGGHAIF--IGTPKAYNQLYTVYMQGQDPKKVKAGQWQSWQFP 181 Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP 314 T T I S E A S + E F + P + E + DP Sbjct: 182 TITSPFIPESEIEAARADMDEKS--FKQEFLASFETMSGRVYYPFDRKEH--VGKYPFDP 237 Query: 315 YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKY-RPDA 371 P+ +G D + ++ + + + + ++ +I +Y + Sbjct: 238 KLPIWIGMDFNIDPMSTVIMQPQPNGEVWVVDEIVQFGSNTEEICEEIERKYWRYMKQIV 297 Query: 372 IIIDANN 378 I D Sbjct: 298 IFPDPAG 304 >gi|291336431|gb|ADD95986.1| hypothetical protein Ddes_0719 [uncultured organism MedDCM-OCT-S04-C1073] Length = 311 Score = 60.9 bits (146), Expect = 5e-07, Method: Composition-based stats. Identities = 37/266 (13%), Positives = 81/266 (30%), Gaps = 32/266 (12%) Query: 107 GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166 +A + Q K+ W + ++ + +PN + E + P +L Sbjct: 6 NPRYAYIAPTFKQAKSIAWDYMKQFTAKIPNTKFNETELRVDLPNGSRITLLG------- 58 Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVIN-LGILGFLTERNANRFWI 225 E D G + + DE + + I L++R + + Sbjct: 59 -----------AENSDGLRGIYLDGC---VIDEYANIDGKLFAEIIRPALSDR--KGYCV 102 Query: 226 MTSNPRRLSGKFYEIFNKPLD--DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283 P ++ FY+++ DW ++ + +DP E G E Sbjct: 103 FIGTPAGMNNNFYDLYQHANGAEDWFNYKAKASDTKIVDPEELEKAKEVMGEKK--YLQE 160 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE---GGDNTVVVLRRGP 340 + + I + + PY P + A + ++++ ++ Sbjct: 161 FECDWIANIEGAIYGEEIAKIEDKNQIARVPYDP-TLPVSTAWDLGVADHSSIIFFQQKG 219 Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEK 366 + D+ + + I L EK Sbjct: 220 TGVQIIDYHEERGHGLPHYIQMLEEK 245 >gi|67920466|ref|ZP_00513986.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501] gi|67857950|gb|EAM53189.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501] Length = 244 Score = 60.5 bits (145), Expect = 6e-07, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 74/234 (31%), Gaps = 37/234 (15%) Query: 74 NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP-------------GISVICLANSETQL 120 +P+ F+ + GR GK+ L + P +V+ + Q Sbjct: 18 DPQKFQVLV-CGRRFGKSHLQVTKHVIDCLMFPKLMPGYNVKQQTMETAVLVGMPTLKQA 76 Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 + LW + K L P ++ D++ L ++ + + + Sbjct: 77 RKILWKPLVKTLENCPYVDKISRSDYTIRFKGNRPDIILAGLNDNAGDRARGLKLWR--- 133 Query: 181 PDTFVGHHNTYGMAIINDEASGT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYE 239 + DE P VI+ I+ + + + + T P+ + Y Sbjct: 134 --------------VCIDEVQDVRPSVIDAVIIPAMADT-PHSRALFTGTPKGKNNHLYN 178 Query: 240 IFN--KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQ 291 +F + DDWK + T T I E R L + E Q+ + Sbjct: 179 LFTMERDNDDWKSYNFPTWTNPLISKDEVERARKR--LSPRLFSQEFEAQWKES 230 >gi|294085818|ref|YP_003552578.1| hypothetical protein SAR116_2251 [Candidatus Puniceispirillum marinum IMCC1322] gi|292665393|gb|ADE40494.1| protein of unknown function DUF264 [Candidatus Puniceispirillum marinum IMCC1322] Length = 454 Score = 60.5 bits (145), Expect = 6e-07, Method: Composition-based stats. Identities = 74/401 (18%), Positives = 133/401 (33%), Gaps = 54/401 (13%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 AGRG GKT A + WL + + + + + + S LS+ PN Sbjct: 82 AGRGFGKTRAGAEWIRWLAQSGRARRIALVGETFDDARQVMVEGASGILSVCPN------ 135 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE-ASG 202 W T+ R YS + P+ G YG DE A Sbjct: 136 ---------WARPAWRAGQRTLIWPSGTIARCYSADDPEQLRGPEFDYG---WADEIAKW 183 Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE-GI 261 ++ L + I T+ P R ++ +D Q +R + Sbjct: 184 RYPSAWDNLMLAL-RIGKSPQCIATTTP-RPVRWLADL--AAAEDTVLVQGASRENAANL 239 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMG 321 P+F + R+G DS + R E+ G D+ N I P + +++G Sbjct: 240 SPAFMAAMHRRFG-DSYLARQELEGIMMSNLPDALWCRNDILRLHRPMPKRHRFIRIVIG 298 Query: 322 CDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK----ISGLVEKYRPDAIIIDAN 377 D A GGD T ++ H++ + L T ++ I + ++R D++I + N Sbjct: 299 VDPAMGGGDETGIITAGKDQDGHIWILADDSLHATPDRWAVQIQRVFRQWRADSVIAEIN 358 Query: 378 NTGARTCDYLEMLG--YHVYRVLG-QKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434 G+ L G V V + +++ E ++ +F +L + Sbjct: 359 QGGSLIRTLLAQAGCALPVREVRAMRSKSIRAEPVA--AAYARGDVSHAGQFGALED--- 413 Query: 435 LIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 + + + S D D +++ Sbjct: 414 ---QMCACVP--------------GQRQTPSPDRLDAMVWA 437 >gi|126173520|ref|YP_001049669.1| hypothetical protein Sbal_1282 [Shewanella baltica OS155] gi|125996725|gb|ABN60800.1| protein of unknown function DUF264 [Shewanella baltica OS155] Length = 602 Score = 60.5 bits (145), Expect = 6e-07, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 71/207 (34%), Gaps = 30/207 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP-------- 317 + + Y + D F D DS + +E+ + Y P Sbjct: 365 IDELRDEY--NGDDFANLFMCIFVD-DADSVFKFSDLEKCMVEAARWQDYKPAAPRPFGN 421 Query: 318 --LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRP 369 + +G D + + T+VV+ ++G L W + +I + KYR Sbjct: 422 REVWLGYDPSRTRDNATLVVVAPGEKKGEKFRVLEKHYWRGMNFSHHVAEIQKIYAKYRV 481 Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL 429 I +D GA D + L + A + + +T L +KM D +E + Sbjct: 482 TYIGVDTTGIGAGVFDSISTL--------YPREATAIHYSVGSKTRLVLKMIDVIEGGRI 533 Query: 430 I---NHSGLIQNLKSLKSFIVPNTGEL 453 H + + S++ + + G + Sbjct: 534 EWDAGHKDIAMSCLSIRRTVTDSGGAI 560 >gi|152985800|ref|YP_001350388.1| hypothetical protein PSPA7_5052 [Pseudomonas aeruginosa PA7] gi|152986886|ref|YP_001346099.1| hypothetical protein PSPA7_0704 [Pseudomonas aeruginosa PA7] gi|150960958|gb|ABR82983.1| conserved hypothetical protein, putative [Pseudomonas aeruginosa PA7] gi|150962044|gb|ABR84069.1| conserved hypothetical protein, putative [Pseudomonas aeruginosa PA7] Length = 682 Score = 60.5 bits (145), Expect = 7e-07, Method: Composition-based stats. Identities = 34/179 (18%), Positives = 53/179 (29%), Gaps = 20/179 (11%) Query: 244 PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W++ I G + E + Y D + +F +F L + Sbjct: 340 PDGQWRKVITIQDAIAGGCNLFDLERLQLEY--DEERFEQLFMCKFIDSTQAAF-ALADL 396 Query: 303 EEALNR----------EPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHLFD- 347 E + P P P+ +G D + D T VV L G L Sbjct: 397 ERCYSDLGLWTDYDPDSPRPFDNRPVWLGYDPSRTRDDATCVVVAPPLEPGGKFRILEKH 456 Query: 348 -WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405 W T +I L E++ I ID G D ++ + A + Sbjct: 457 SWRGTSFTHQAKQIEKLCERFNVQHIGIDITGVGYGVFDLVKDFFPRATPIHYSLEAKN 515 >gi|260582917|ref|ZP_05850701.1| terminase ATPase subunit [Haemophilus influenzae NT127] gi|260094017|gb|EEW77921.1| terminase ATPase subunit [Haemophilus influenzae NT127] Length = 593 Score = 60.1 bits (144), Expect = 8e-07, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 73/219 (33%), Gaps = 31/219 (14%) Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP-- 317 G + + +IA + QF + +F ++ ++ Y P Sbjct: 348 GCNLFNIDDLIAENSKEE--FEQLFLCQFADDNSSAFKFSDLQLCQVDSLEEWHDYKPFY 405 Query: 318 --------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGL 363 + +G D A G +V++ + H + D T ++I Sbjct: 406 QRPFGNREVWLGYDPAFTGDRAALVIVAPPKVEGGDYRVLHKQTFHGMDYETQASRIKQF 465 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 + Y I+ID G+ + A LE+ + + E+ +K + Sbjct: 466 CDDYNVTRIVIDKTGMGSGVYQEVR---------KFYPMAQGLEYNADLKNEMVLKTQNL 516 Query: 424 LEFASL---INHSGLIQNLKSLKSFIVPNTGELAIESKR 459 ++ L + ++ + ++K + TG++ S R Sbjct: 517 IQKRRLKFDSGDNDIVSSFMTVKK-RITGTGKITYVSDR 554 >gi|332654528|ref|ZP_08420271.1| phage terminase, large subunit, PBSX family [Ruminococcaceae bacterium D16] gi|332516492|gb|EGJ46098.1| phage terminase, large subunit, PBSX family [Ruminococcaceae bacterium D16] Length = 418 Score = 60.1 bits (144), Expect = 8e-07, Method: Composition-based stats. Identities = 55/347 (15%), Positives = 112/347 (32%), Gaps = 46/347 (13%) Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVL-WLMSTRPGISVICLANSETQLKTTLWA 126 + N + GA+ + GKT W MS + S ++ L + Sbjct: 21 SPFRNCQAIICDGAVRS----GKTLCTGLSFFCWAMSCYQDKTFALCGKSIPSVRRNLLS 76 Query: 127 EVSKWLSLL--PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTF 184 E+ L L + L++ S+ + G+D R+ + + T Sbjct: 77 ELLPILRQLGFSCRERASRNQLTVTM-GHRSNTFYLFGGLDE-------RSAALVQGITL 128 Query: 185 VGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP 244 G + DE + P + + + R W NP + FY+ + + Sbjct: 129 AGA--------LLDEVALMPRSFVEQVCARCSVEGS-RLWFSC-NPESPAHWFYQEWIQK 178 Query: 245 LDDWKRFQID--TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDS--FIPLN 300 ++ K ++ + P+ E + R V G++ + F + Sbjct: 179 AEEKKVLRLSFAMTDNPSLSPAMLERYRTMF--QGAFYRRFVLGEWVNAEGLVYDFFSQD 236 Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--------SKTD 352 ++ REP D P + CD + + R+ V L ++ + Sbjct: 237 LV-----REPPLDVSGPFYVSCDYGTVNPTSMGLWGRKNGVWYRLEEYYYNSRQARRQKT 291 Query: 353 LRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLG 399 + + + LV+ A+++D + A + L G V + Sbjct: 292 DQEYADDLGALVKGRPLGAVVVDPSA--ASFIEVLRRRGVPVRKANN 336 >gi|289628558|ref|ZP_06461512.1| hypothetical protein PsyrpaN_26063 [Pseudomonas syringae pv. aesculi str. NCPPB3681] gi|289648058|ref|ZP_06479401.1| hypothetical protein Psyrpa2_09957 [Pseudomonas syringae pv. aesculi str. 2250] gi|330870325|gb|EGH05034.1| hypothetical protein PSYAE_24348 [Pseudomonas syringae pv. aesculi str. 0893_23] Length = 684 Score = 60.1 bits (144), Expect = 8e-07, Method: Composition-based stats. Identities = 54/306 (17%), Positives = 86/306 (28%), Gaps = 56/306 (18%) Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT-YSEERPDTFVGHHN 189 +LS + + W+ L + + SK + T GHH Sbjct: 206 FLSASRAQSEIFRSYIIAFAQSWFGLELTGNPIVLSKDGKPWAELRFLSTNSSTAQGHHG 265 Query: 190 TYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFN 242 + DE D +N T + + S P +S + Y E F Sbjct: 266 H----VYVDEYFWIRDFEKLNTVASAMATHKKWRKT--YFSTPSAVSHQAYPFWQGEKFR 319 Query: 243 K----------------------PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDV 279 P W++ I G D E + Y D D Sbjct: 320 NSKRKAAKDPWPSDKQISAGALCPDGQWRKVITILDAIAGGCDLFDLEQLQLEY--DEDK 377 Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNR----------EPCPDPYAPLIMGCDIAEEGG 329 + +F +F L +E + +P P +P+ +G D + Sbjct: 378 FQQLFMCKFIDSSQSAF-SLADLERCYSDLSLWADFDPDDPRPYGNSPVWIGYDPSRTRD 436 Query: 330 DNTVVV----LRRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGART 383 D T VV L G L W + ++ L E++ I ID G Sbjct: 437 DATCVVIAPPLENGGKFRILEKHSWRGQSFKYQAEQVKKLTERFNVQHIGIDTTGIGYGV 496 Query: 384 CDYLEM 389 D + Sbjct: 497 FDLVRD 502 >gi|116751218|ref|YP_847905.1| hypothetical protein Sfum_3801 [Syntrophobacter fumaroxidans MPOB] gi|116700282|gb|ABK19470.1| conserved hypothetical protein [Syntrophobacter fumaroxidans MPOB] Length = 507 Score = 59.7 bits (143), Expect = 1e-06, Method: Composition-based stats. Identities = 62/394 (15%), Positives = 108/394 (27%), Gaps = 69/394 (17%) Query: 38 WGEKGTPLEGFSAPRSW--QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNA 95 WG+ S W Q+E + + ++ GR +GK+ + + Sbjct: 20 WGQAYLYNRDGSGRDYWPHQVEDLRCPAKNIIHLD--------------GRDVGKSIVLS 65 Query: 96 WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYS 155 L T G + A + L T + E+ L P+ M S++L Sbjct: 66 TDALHYAFTTRGGQGLIAAPHQGHLDTII-EEIEFQLDSNPD----LMNSIALTKYGKPK 120 Query: 156 DVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL 215 ++ + S + + D F H + DE + + + L Sbjct: 121 IHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLTERAWKALRQCL 177 Query: 216 TERNANRFWIMTSNPRRL-SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARY- 273 R + S P L +Y + D + F+ + ++ Y Sbjct: 178 KAGGTLRIY---STPNGLRDTTYYRL--TSSDQFHVFRWPSWLNPLWTEDREAELLEFYG 232 Query: 274 GLDSDVTRVEVCGQFPQQDIDSF-----------------IPLNII--------EEALNR 308 G DS + EV G+ + +F I + E A +R Sbjct: 233 GRDSSGWQHEVAGEHGKPSYGAFNVEQFNLCRQDLLEYQKIVITDSELRDCDTEEAAHDR 292 Query: 309 -----EPCPDPYAPLIMGCDIAEEGG-------DNTVVVLRRGPVIEHLFDWSKTDLRTT 356 P + G D+ T + R + Sbjct: 293 LEMLLNLTPRSGQFWVGG-DLGYTNDPTEIVVFQETEIGERTLLKMILRVHLEHVSYPHI 351 Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390 I+ L Y P I +D G L L Sbjct: 352 AQIIALLERYYTPAGIGVDNGGNGLAVVQELLTL 385 >gi|170748408|ref|YP_001754668.1| hypothetical protein Mrad2831_1990 [Methylobacterium radiotolerans JCM 2831] gi|170654930|gb|ACB23985.1| conserved hypothetical protein [Methylobacterium radiotolerans JCM 2831] Length = 478 Score = 59.7 bits (143), Expect = 1e-06, Method: Composition-based stats. Identities = 52/334 (15%), Positives = 106/334 (31%), Gaps = 35/334 (10%) Query: 154 YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILG 213 S+ + G+D + +T R +T G + ++ +I + Sbjct: 145 TSETIRLLSGVDIEVRPANYKTI---RGETLAGCLADEVAFWHLENSANPDTLILDAVRP 201 Query: 214 FLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI------DTRTVEGIDPSFHE 267 L + S+P G+ Y + + +DP+ + Sbjct: 202 GLATTGGP--LCVLSSPYARKGELYRTHQRDFGPSGDPAVLVLRAPSQTMNPSLDPAVVK 259 Query: 268 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN---REPCPDPYAPLIMGCDI 324 Y D E +F + D+++FI L ++ + E P P CD Sbjct: 260 ---RAYTRDPAAASAEYGAEF-RADVEAFISLEAVQACMAGDLLERAPAPGLTYQAFCDP 315 Query: 325 AEEGGDNTVVVLRRGPVIEHLFD-----WSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 + G D+ + + D + + + L++ Y ++ D Sbjct: 316 SGGGADSMTLAIGHAENGIAYLDAVREMYPGGSPEAVVSTFAELLQSYGLGSVTGDHY-A 374 Query: 380 GARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNL 439 G + + G R K + EF ++ + ++ + L L Sbjct: 375 GEWPKERFRVHGITYERSERSKSDIYREFLPVLNSQ---RCR-------MLPVAKLEAQL 424 Query: 440 KSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLM 473 SL+ TG+ I+ +VKGA D ++ + Sbjct: 425 VSLERRTTRGTGKDTIDHPQVKGAHD-DVANAVA 457 >gi|323699495|ref|ZP_08111407.1| protein of unknown function DUF264 [Desulfovibrio sp. ND132] gi|323459427|gb|EGB15292.1| protein of unknown function DUF264 [Desulfovibrio desulfuricans ND132] Length = 428 Score = 59.7 bits (143), Expect = 1e-06, Method: Composition-based stats. Identities = 55/334 (16%), Positives = 98/334 (29%), Gaps = 43/334 (12%) Query: 79 KGAISAGRG-IGKTTLN-AWLVLWLMSTR-PGISVICLANSETQLKTTLWAEVSKWLSLL 135 + A+ GKT L+ L+ TR +A Q KT +W E+ ++ Sbjct: 21 RFAVLVCHRRFGKTVLSVNRLINAARETRRDDWRGAYIAPLYRQAKTVVWDELKRY---- 76 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 + ++ + +D + S R + PD+ G + + Sbjct: 77 -CGFGLDGCTVKFNETELRADFDNGSR----------IRLFGANNPDSLRGMYLDG---V 122 Query: 196 INDEASGTPDVIN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL--DDWKRFQ 252 + DE + P + I L++R + PR + YEI+ K DW Sbjct: 123 VFDEVAQMPLRVWTEVIRPALSDRKGWAMF--IGTPRGKNA-LYEIWEKGKTDPDWLAAM 179 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPL---NIIEEALNRE 309 + E + + E F ++ + E + Sbjct: 180 YRASETGILPVEELEASARE--MSPEEYEQEFECSFTAAIRGAYFGQLLADADREGRMTD 237 Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVL---RRGPVIEHLFDWS--KTDLRTTNNKISGLV 364 DP P+ D+ D+T + R G + + L + Sbjct: 238 VPADPSMPVHTAWDLGM--SDSTSIWFVQARPGGTFAVIDYYEACGEGLDHYARILDDKG 295 Query: 365 EKYR----PDAIIIDANNTGARTCDYLEMLGYHV 394 KY P I + TG + LG Sbjct: 296 YKYGTHIAPHDIRVRELGTGKSRLETARSLGIRF 329 >gi|145639982|ref|ZP_01795581.1| terminase, ATPase subunit [Haemophilus influenzae PittII] gi|145270948|gb|EDK10866.1| terminase, ATPase subunit [Haemophilus influenzae PittII] gi|309751635|gb|ADO81619.1| Probable bacteriophage terminase, ATPase subunit [Haemophilus influenzae R2866] Length = 591 Score = 59.7 bits (143), Expect = 1e-06, Method: Composition-based stats. Identities = 55/373 (14%), Positives = 118/373 (31%), Gaps = 68/373 (18%) Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 K + +S ++ A +DV I + + + + T +H Sbjct: 200 ASKKQALQFRSYIVNYAKQTADVDLKGETIKLPNGAEL--IFLGTNSATAQSYHGN---- 253 Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247 + DE P DV+ G ++ + S P ++ Y + FNK Sbjct: 254 LYFDEVFWVPKFDVMRKVASGMAAQKMYRQT--YFSTPTTIAHPAYAFFSGKAFNKNRAK 311 Query: 248 -----------------------WKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283 WK+ I+ G + + +IA + Sbjct: 312 ADKVEIDISHENLRIGKLCADRQWKQIVTINDAMEGGCNLFNIDDLIAENSKEE--FEQL 369 Query: 284 VCGQFPQQD-------IDSFIPLNIIEEALNREP---CPDPYAPLIMGCDIAEEGGDNT- 332 QF + ++ +EE + +P P + +G D A G Sbjct: 370 FLCQFADDNTSAFKFADLQLCQVDSLEEWHDYKPFYQRPFGNREVWLGYDPAFTGDRAAL 429 Query: 333 -VVVLRR----GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387 ++ + + H + D ++I + Y I+ID G+ + Sbjct: 430 AIIAPPKVEGGDYRVLHWQTFHGMDYEAQASRIKSFCDDYNVTRIVIDKTGMGSGVFQEV 489 Query: 388 EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL-INHSGLIQNLKSLKSFI 446 + A+ L++ + + E+ +K + ++ L + + +I + ++K Sbjct: 490 K---------KFYPMAIGLDYNADLKNEMVLKTQNLIQKRRLKFDGNEIITSFMTVKK-R 539 Query: 447 VPNTGELAIESKR 459 + TG++ S R Sbjct: 540 ITGTGKITYVSDR 552 >gi|229845311|ref|ZP_04465443.1| terminase, ATPase subunit [Haemophilus influenzae 6P18H1] gi|229811764|gb|EEP47461.1| terminase, ATPase subunit [Haemophilus influenzae 6P18H1] Length = 593 Score = 59.7 bits (143), Expect = 1e-06, Method: Composition-based stats. Identities = 54/374 (14%), Positives = 114/374 (30%), Gaps = 70/374 (18%) Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 K + +S ++ A +DV I + + + + T +H Sbjct: 200 ASKKQALQFRSYIVNYAKQTADVDLKGETIKLPNGAEL--IFLGTNSATAQSYHGN---- 253 Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247 + DE P DV+ G ++ + S P ++ Y + FN+ Sbjct: 254 LYFDEVFWVPKFDVMRKVASGMAAQKMYRQT--YFSTPTTIAHPAYAFFSGKAFNRNRAK 311 Query: 248 WKRFQIDT------------------------RTVEGIDPSFHEGIIARYGLDSDVTRVE 283 ++ +ID G + + +IA + Sbjct: 312 SEKIEIDISHENLKSGKLCADRQWKQIVSIYDAMEGGCNLFNIDDLIAENSKEE--FEQL 369 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV 333 QF + +F ++ ++ Y P + +G D A G + Sbjct: 370 FLCQFADDNSSAFKFADLQLCQVDSLEEWHDYKPFYQRPFGNREVWLGYDPAFTGDRAAL 429 Query: 334 VVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387 V++ + H + D T ++I E Y I+ID G + Sbjct: 430 VIVAPPKVEGGDYRVLHKQTFHGMDYETQASRIKQFCEDYNVTRIVIDKTGMGTGVYQEV 489 Query: 388 EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL---INHSGLIQNLKSLKS 444 A LE+ + + E+ +K + ++ L + ++ + ++K Sbjct: 490 R---------KFYPMAQGLEYNADLKNEMVLKTQNLIQKRRLKFDSGDNDIVSSFMTVKK 540 Query: 445 FIVPNTGELAIESK 458 + TG++ S Sbjct: 541 -RITGTGKITYVSD 553 >gi|301155044|emb|CBW14507.1| terminase, atpase subunit [Haemophilus parainfluenzae T3T1] Length = 591 Score = 59.3 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 53/373 (14%), Positives = 116/373 (31%), Gaps = 68/373 (18%) Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 K + +S ++ A +DV I + + + + T +H Sbjct: 200 ASKKQALQFRSYIVNYAKQTADVDLKGETIKLPNGAEL--IFLGTNSATAQSYHGN---- 253 Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247 + DE P DV+ G ++ + S P ++ Y + FNK Sbjct: 254 LYFDEVFWVPKFDVMRKVASGMAAQKMYRQT--YFSTPTTIAHPAYAFFSGKAFNKNRAK 311 Query: 248 WKRFQIDT------------------------RTVEGIDPSFHEGIIARYGLDSDVTRVE 283 + +ID G + + +IA + Sbjct: 312 ADKVEIDISHENLKSGKLCADRQWKQIVSIYDAMEGGCNLFNIDDLIAENSKEE--FEQL 369 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNT- 332 QF + +F ++ ++ Y P + +G D A G Sbjct: 370 FLCQFADDNSSAFKFADLQLCQVDSLEEWHDYKPFYQRPFGNREVWLGYDPAFTGDRAAL 429 Query: 333 -VVVLRR----GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387 ++ + + H + D ++I + Y I+ID G+ + Sbjct: 430 AIIAPPKVEGGDYRVLHWQTFHGMDYEAQASRIKSFCDDYNVTRIVIDKTGMGSGVFQEV 489 Query: 388 EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL-INHSGLIQNLKSLKSFI 446 + A+ L++ + + E+ +K + ++ L + + +I + ++K Sbjct: 490 K---------KFYPMAIGLDYNADLKNEMVLKTQNLIQKRRLKFDGNEIITSFMTVKK-R 539 Query: 447 VPNTGELAIESKR 459 + TG++ S R Sbjct: 540 ITGTGKITYVSDR 552 >gi|163735142|ref|ZP_02142578.1| hypothetical protein RLO149_23000 [Roseobacter litoralis Och 149] gi|161391600|gb|EDQ15933.1| hypothetical protein RLO149_23000 [Roseobacter litoralis Och 149] Length = 267 Score = 59.3 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 43/265 (16%), Positives = 83/265 (31%), Gaps = 41/265 (15%) Query: 49 SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108 P WQ M + + + + + AG+ + K P Sbjct: 28 GPPDPWQRSLMNSTSDVIMVLASRRSGKSTTVGVMAGQELAK---------------PDH 72 Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168 VI L+ + Q L+A+++ F + ++L + L S Sbjct: 73 QVIILSPTLAQ-SQLLFAKIA-----------FTWEKMALPIETRRRTMTELHLKNGS-- 118 Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTS 228 S +C ++ + G+ G+ DEA+ PD + L+ N + + Sbjct: 119 -SVVCVPAGQD-GEGARGYGVKNGIL-AFDEAAFIPDKVFGA---TLSIAEDNAKTVFIT 172 Query: 229 NPRRLSGKFYEIF--NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCG 286 P SGK YE++ + + +R + + + + ++ DV G Sbjct: 173 TPGGKSGKAYEMWTNHDLYPEVERIRACSLDLPRMAKLVARQRKTLSKMEFDVEH----G 228 Query: 287 QFPQQDIDSFIPLNIIEEALNREPC 311 F + I A P Sbjct: 229 LQWMGRGTPFFDPDTIRAAYTDTPE 253 >gi|146313136|ref|YP_001178210.1| hypothetical protein Ent638_3501 [Enterobacter sp. 638] gi|145320012|gb|ABP62159.1| protein of unknown function DUF264 [Enterobacter sp. 638] Length = 589 Score = 59.3 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 38/229 (16%), Positives = 64/229 (27%), Gaps = 32/229 (13%) Query: 244 PLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W++ I+ G + + +D R +F S P + Sbjct: 328 PDGQWRQIVTIEDALAGGCTLFNLDQLKQE--NSADDFRNLFMCEFVDDKA-SVFPFEEL 384 Query: 303 EEALNREPCPDP-----------YAPLIMGCDIAEEGGDN--TVVV--LRRGPVIEHL-- 345 + + + P+ +G D + G V+ L G L Sbjct: 385 QRCMVDAMEEWEDFEQFADRPFNWRPVWIGYDPSHTGDSAGCAVLAPPLVAGGKFRILER 444 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405 W D I L EKY D I IDA G + A Sbjct: 445 HQWKGMDFAAQAEAIRSLTEKYTVDYIGIDATGIGQGVYQLVR---------SFFPAARA 495 Query: 406 LEFCRNRRTELHVKMADWLEFASLINHSGL--IQNLKSLKSFIVPNTGE 452 + + +T + +K D + L +G I + ++G Sbjct: 496 IRYTPEMKTAMVLKAKDTIRRGCLEYDAGATDITQSFMAIRKTMTSSGR 544 >gi|293417393|ref|ZP_06660017.1| terminase [Escherichia coli B185] gi|291430913|gb|EFF03909.1| terminase [Escherichia coli B185] Length = 590 Score = 59.3 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRELTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|258544092|ref|ZP_05704326.1| probable terminase (atpase subunit) related protein [Cardiobacterium hominis ATCC 15826] gi|258520720|gb|EEV89579.1| probable terminase (atpase subunit) related protein [Cardiobacterium hominis ATCC 15826] Length = 562 Score = 59.3 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 62/201 (30%), Gaps = 29/201 (14%) Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--- 307 I+ G D E + ++ + QF D DS + ++ + Sbjct: 306 ITIEDAINSGFDRVTMEKLRIKF--PPGQFENLLMCQFV-NDTDSIFKMAELQRCMVDAW 362 Query: 308 --------REPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD--WSKTDL 353 P P AP+ +G D + D ++VV+ G V + ++ D Sbjct: 363 TLWKDYTPLAPRPLDDAPVWIGYDPSRSQDDASLVVIAPPRVEGGVFRIVDKQSFNGLDF 422 Query: 354 RTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRR 413 KI Y I IDA G D + RA + + + Sbjct: 423 DGQAQKIREFCAIYNVANIAIDATGIGQAVYDLVRQ---------FYPRARKIIYTVEAK 473 Query: 414 TELHVKMADWLEFASLINHSG 434 E+ +K + L +G Sbjct: 474 NEMVLKAKQLIHHGRLQWDAG 494 >gi|300022629|ref|YP_003755240.1| hypothetical protein Hden_1105 [Hyphomicrobium denitrificans ATCC 51888] gi|299524450|gb|ADJ22919.1| protein of unknown function DUF264 [Hyphomicrobium denitrificans ATCC 51888] Length = 500 Score = 59.3 bits (142), Expect = 2e-06, Method: Composition-based stats. Identities = 71/420 (16%), Positives = 127/420 (30%), Gaps = 68/420 (16%) Query: 84 AGRGIGKTTLNA-WLVLWLMSTRPGISVIC-------LANSETQLKTTLWAEVSKWLSLL 135 GRG GKT A W+ PG A ++ + L V K L+ + Sbjct: 104 GGRGSGKTRAGAEWIRGLACGEEPGPRSAAGSRNASRRAPTKESPRIAL---VGKTLADV 160 Query: 136 PNKHWFEMQSL-SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 N L ++HPA V S + +S + + G A Sbjct: 161 RNVMIEGQSGLLAVHPARERP-VFEPSKRRLIWPNGAVAELFSADEAEALRG---PQFTA 216 Query: 195 IINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 DE + + + L +A R +T+ PR ++ + D Sbjct: 217 AWCDELAKWRNAEKAWDMLQFALRLGDAPR-ACVTTTPRAT-----KLLKSIIADEATVT 270 Query: 253 IDTRTVEG---IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309 ++ T + + P+F + RY S + R E+ G+ + D + IEEA R Sbjct: 271 VNLATADNALNLAPTFLAEMTRRY-AGSAIGRQELLGEIVEDASDGLWRRHWIEEA--RV 327 Query: 310 PCPDPYAPLIMGCDI---AEEGGDNTVVV-----LRRGPVIEHLFDWSKTDLRTTNNKIS 361 +++ D A D +V + + + Sbjct: 328 DAAPEMQRVVVAVDPPVTATAASDACGIVVAGLGVDKRAYVLADRTVQGRTPEIWARAAL 387 Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLEM--LGYHVYRVLGQKRAVDLEFCRNRRTE---- 415 + Y D ++ + N G L+ + V +V + R E Sbjct: 388 SAFDDYEADRMVAEVNQGGDLVVSVLQQFRQNFPVVKVRATRGKW-------VRAEPVAA 440 Query: 416 LHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 L+ + L L + + + G + +S D SD L++ Sbjct: 441 LYAEGRVA-HVGRL---DALEDQMCT-----FGSDGTVK--------GRSPDRSDALVWA 483 >gi|156564098|ref|YP_001429607.1| terminase large subunit [Bacillus phage 0305phi8-36] gi|154622795|gb|ABS83675.1| terminase large subunit [Bacillus phage 0305phi8-36] Length = 635 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 64/206 (31%), Gaps = 22/206 (10%) Query: 40 EKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVL 99 E+ L P+ W E ++ + + + + GR +GKT ++L Sbjct: 45 EELHYLAILDKPKFWAAETLKWFCRDYQEPMLQEMADSKRTVLRLGRRLGKTETMCIMIL 104 Query: 100 WLMSTRPGIS------VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPW 153 W T+P ++ +A E Q+ ++ +S+ + +S P Sbjct: 105 WHAFTQPNKGPNNQYDILIIAPYEEQV-DLIFKRLSQLID------------MSGDVNPS 151 Query: 154 YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILG 213 H L + + + S G I+ DE + I+ Sbjct: 152 RDIDKHIELPNGTVIHGITAGSKSGSGAANTRGQRAD---LIVLDEMDYMGESEITNIMN 208 Query: 214 FLTERNANRFWIMTSNPRRLSGKFYE 239 E I+ S P +Y+ Sbjct: 209 IRNEAPERIKMIVASTPSGRRDSYYK 234 >gi|322420465|ref|YP_004199688.1| hypothetical protein GM18_2968 [Geobacter sp. M18] gi|320126852|gb|ADW14412.1| hypothetical protein GM18_2968 [Geobacter sp. M18] Length = 507 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 62/394 (15%), Positives = 109/394 (27%), Gaps = 69/394 (17%) Query: 38 WGEKGTPLEGFSAPRSW--QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNA 95 WG+ S W Q+E + + ++ GR +GK+ + + Sbjct: 20 WGQAYLYNRDGSGRDYWPHQVEDLRCPAKNIIHLD--------------GRDVGKSIVLS 65 Query: 96 WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYS 155 L T G + A + L T + E+ L P+ M S++L + Sbjct: 66 TDALHYAFTTRGGQGLIAAPHQGHLDTII-EEIEFQLDTNPD----LMNSIALTKYGKPN 120 Query: 156 DVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL 215 ++ + S + + D F H + DE + + + L Sbjct: 121 IHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLTERAWKALRQCL 177 Query: 216 TERNANRFWIMTSNPRRL-SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARY- 273 R + S P L +Y + D + F+ + ++ Y Sbjct: 178 KAGGTLRIY---STPNGLRDTTYYRL--TSSDQFHVFRWPSWLNPLWTEDREAELLEFYG 232 Query: 274 GLDSDVTRVEVCGQFPQQDIDSF-----------------IPLNII--------EEALNR 308 G DS + EV G+ + +F I + E A +R Sbjct: 233 GRDSSGWQHEVAGEHGKPSYGAFNVEQFNLCRQDLLEYQKIVITDSELRDCDTEEAAHDR 292 Query: 309 -----EPCPDPYAPLIMGCDIAEEGG-------DNTVVVLRRGPVIEHLFDWSKTDLRTT 356 P + G D+ T + R + Sbjct: 293 LEMLLNLTPRSGQFWVGG-DLGYTNDPTEIVVFQETEIGERTLLKMILRVHLEHVSYPHI 351 Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390 I+ L Y P I +D G L L Sbjct: 352 AQIIALLERYYTPAGIGVDNGGNGLAVVQELLTL 385 >gi|161521371|ref|YP_001584798.1| hypothetical protein Bmul_4835 [Burkholderia multivorans ATCC 17616] gi|189352462|ref|YP_001948089.1| ATPase subunit of bacteriophage terminase [Burkholderia multivorans ATCC 17616] gi|327198040|ref|YP_004306409.1| gp42 [Burkholderia phage KS5] gi|160345421|gb|ABX18506.1| protein of unknown function DUF264 [Burkholderia multivorans ATCC 17616] gi|189336484|dbj|BAG45553.1| ATPase subunit of bacteriophage terminase [Burkholderia multivorans ATCC 17616] gi|310657174|gb|ADP02289.1| gp42 [Burkholderia phage KS5] Length = 588 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 27/138 (19%), Positives = 44/138 (31%), Gaps = 20/138 (14%) Query: 265 FHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-NREPCPDPYAP------ 317 + + Y + + QF + S PL +++ + + D + P Sbjct: 350 NLDRLRLEY--SPEEYANLLLCQFIDDSL-SVFPLTVLQPCMVDTWEVWDDFKPLYLRPF 406 Query: 318 ----LIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367 + +G D + G VV+ R G L F W D +I L +Y Sbjct: 407 GDEEVWIGYDPSHTGDSAGCVVIAPPKRPGGKFRVLERFQWHGLDFEAQAAQIEALTRRY 466 Query: 368 RPDAIIIDANNTGARTCD 385 R I ID G Sbjct: 467 RVTYIGIDTTGIGQGVYQ 484 >gi|168822445|ref|ZP_02834445.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|205341120|gb|EDZ27884.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] Length = 594 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 51/162 (31%), Gaps = 20/162 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + + + RY D+ + F DS + + Sbjct: 331 PDGQWRYIITLEDAIAGGFNLASIDKLRNRYNRDT--FNMLYMCVFVDSK-DSVFSFSHV 387 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV------VVLRRGPVIEHLF 346 E + + + G D A G +T +V + +F Sbjct: 388 ERCCVDPDIWEDHDENLPRPFGNREVWAGYDPARSGDTSTFVIIAPPIVAGEKFRVLRVF 447 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 W + + +I L +Y I ID G+ + ++ Sbjct: 448 HWQGMNWKWQAAQIKKLFGQYNMTYIGIDITGLGSGVFEDVQ 489 >gi|323943519|gb|EGB39636.1| terminase [Escherichia coli H120] Length = 367 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 128 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 184 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 185 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 244 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 245 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 294 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 295 CLEYDVSATDITSSFMAIRKTMTSSGR 321 >gi|78356952|ref|YP_388401.1| hypothetical protein Dde_1909 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219357|gb|ABB38706.1| hypothetical protein Dde_1909 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 507 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 61/394 (15%), Positives = 108/394 (27%), Gaps = 69/394 (17%) Query: 38 WGEKGTPLEGFSAPRSW--QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNA 95 WG+ S W Q+E + + ++ GR +GK+ + + Sbjct: 20 WGQAYLYNRDGSGRDYWPHQVEDLRCPAKNIIHLD--------------GRDVGKSIVLS 65 Query: 96 WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYS 155 L T G + A + L T + E+ L P+ M S++L Sbjct: 66 TDALHYAFTTRGGQGLIAAPHQGHLDTII-EEIEFQLDTNPD----LMNSIALTKYGKPK 120 Query: 156 DVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL 215 ++ + S + + D F H + DE + + + L Sbjct: 121 IHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLTERAWKALRQCL 177 Query: 216 TERNANRFWIMTSNPRRL-SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARY- 273 R + S P L +Y + + + F+ + ++ Y Sbjct: 178 KAGGTLRIY---STPNGLRDTTYYRL--TSSEQFHVFRWPSWLNPLWTEDREAELLEFYG 232 Query: 274 GLDSDVTRVEVCGQFPQQDIDSF-----------------IPLNII--------EEALNR 308 G DS + EV G+ + +F I + E A +R Sbjct: 233 GRDSSGWQHEVAGEHGKPSYGAFNVEQFNLCRQDLLEYQKIVITDSELRDCDTEEAAHDR 292 Query: 309 -----EPCPDPYAPLIMGCDIAEEGG-------DNTVVVLRRGPVIEHLFDWSKTDLRTT 356 P + G D+ T + R + Sbjct: 293 LEMLLNLTPRSGQFWVGG-DLGYTNDPTEIVVFQETEIGERTLLKMILRVHLEHVSYPHI 351 Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390 I+ L Y P I +D G L L Sbjct: 352 AQIIALLERYYTPAGIGVDNGGNGLAVVQELLTL 385 >gi|197251462|ref|YP_002147591.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Agona str. SL483] gi|197215165|gb|ACH52562.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Agona str. SL483] gi|312913681|dbj|BAJ37655.1| hypothetical protein STMDT12_C27120 [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] Length = 594 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 51/162 (31%), Gaps = 20/162 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + + + RY D+ + F DS + + Sbjct: 331 PDGQWRYIITLEDAIAGGFNLASIDKLRNRYNRDT--FNMLYMCVFVDSK-DSVFSFSHV 387 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV------VVLRRGPVIEHLF 346 E + + + G D A G +T +V + +F Sbjct: 388 ERCCVDPDIWEDHDENLPRPFGNREVWAGYDPARSGDTSTFVIIAPPIVAGEKFRVLRVF 447 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 W + + +I L +Y I ID G+ + ++ Sbjct: 448 HWQGMNWKWQAAQIKKLFGQYNMTYIGIDITGLGSGVFEDVQ 489 >gi|322832199|ref|YP_004212226.1| terminase, ATPase subunit [Rahnella sp. Y9602] gi|321167400|gb|ADW73099.1| terminase, ATPase subunit [Rahnella sp. Y9602] Length = 588 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 53/349 (15%), Positives = 104/349 (29%), Gaps = 67/349 (19%) Query: 195 IINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPL-- 245 + DE P+ + G ++ + S P L+ Y E+FNK Sbjct: 249 LYVDEIFWIPNFQKLRKVASGMASQEHLRTT--YFSTPSALTHGAYPFWSGELFNKGREN 306 Query: 246 ----------------------DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRV 282 W++ I+ G + + + + R Sbjct: 307 PNDRIELDIGHHSLAKGRLCEDGQWRQIVTIEDALAGGCNLFNIDTLKQENSAED--FRN 364 Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIMGCDIAEEGGDN 331 +F S P ++ + Y + +G D + G Sbjct: 365 LFMCEFVDDQ-TSVFPFAELQRCMVESAEEWQDFSPFAVRPFGYRAVWIGYDPSHTGDSA 423 Query: 332 --TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385 VV L G L W D I L ++Y + I +DA G Sbjct: 424 GCAVVAPPLVDGGKFRVLERHQWKGMDFAAQAKSIEELTKRYCVEYIGVDATGIGQGVFQ 483 Query: 386 YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI---NHSGLIQNLKSL 442 + A+++ + +T++ +K D + L NH + + ++ Sbjct: 484 LVRQ---------FFPAAMEIRYSPETKTKMVLKAKDTITSGRLEYDTNHKDITSSFMAI 534 Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDFGRCP 491 + + + E+ R + A D + +M+ N P + + G+ P Sbjct: 535 RKTMTASGSRSTYEASRSEEASHADVAWAIMHALL-NEPLTAANGGQSP 582 >gi|154247076|ref|YP_001418034.1| hypothetical protein Xaut_3147 [Xanthobacter autotrophicus Py2] gi|154161161|gb|ABS68377.1| protein of unknown function DUF264 [Xanthobacter autotrophicus Py2] Length = 416 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 68/415 (16%), Positives = 121/415 (29%), Gaps = 68/415 (16%) Query: 82 ISAGRGIGKTTLNA-WLVLWLM-----STRPGISVICLANSETQLKTTLWAEVSKWLSLL 135 + GRG GKT A W+ + + RP + +A + ++ + VS L++ Sbjct: 31 VLGGRGAGKTRAGAEWVRGLALGRPPFAGRPVGRIALVAETMADVREVMVEGVSGLLAVH 90 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 P + + + +S E P++ G A Sbjct: 91 PRAERPRWEPTR---------------RRLEWANGAVAQGFSAEDPESLRG---PQFAAA 132 Query: 196 INDEASGTPDVINLGILGFLTERNANRFW-----IMTSNPRRLSGKFYEIFNKPLDDWKR 250 DE + M + R + + P R Sbjct: 133 WLDELAK-----WKRAEATFDMLQFGLRLGAQPRQMVTTTPRPTALLRRLLADPSTAVTR 187 Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 + + PSF ++ RYG + R E+ G+ + D+ +E RE Sbjct: 188 AR-TADNAFHLAPSFLGQVLTRYGGT-RLGRQELDGELIEDRADALFSRPALEAL--REA 243 Query: 311 CPDPYAPLIMGCDI---AEEGGDNTVVV---LRRGPVIEHLFDWSKTDLRTT--NNKISG 362 P +++ D + G D +V + V+ L D S LR K Sbjct: 244 QVPPLTRIVVAVDPPASSRAGADACGIVCAGMDATGVVHVLADDSAAGLRPAQWAAKAVA 303 Query: 363 LVEKYRPDAIIIDANNTGARTCDYLEM--LGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 L ++ D I+ + N G + G V +V + R E + Sbjct: 304 LFRRFEADLIVAEVNQGGEMVRAVIAEVDDGVPVEQVRATRGKF-------LRAEPVAAL 356 Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 + L + G + S +S D D L++ Sbjct: 357 YEQGRVRHAGAFPALEDEMC-----DFGTDG---LSS-----GRSPDRLDALVWA 398 >gi|307315386|ref|ZP_07594955.1| protein of unknown function DUF264 [Escherichia coli W] gi|307315408|ref|ZP_07594975.1| protein of unknown function DUF264 [Escherichia coli W] gi|306905258|gb|EFN35804.1| protein of unknown function DUF264 [Escherichia coli W] gi|306905260|gb|EFN35805.1| protein of unknown function DUF264 [Escherichia coli W] Length = 385 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 58/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + D + +F S P ++ + Sbjct: 146 IEQLKRENSADD--FKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 202 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 203 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 262 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 263 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 312 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 313 CLEYDVSATDITSSFMAIRKTMTSSGR 339 >gi|289805729|ref|ZP_06536358.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. AG3] Length = 257 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 58/167 (34%), Gaps = 5/167 (2%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252 + +EA + + + + + ++ NP ++ + P +D + Sbjct: 82 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 139 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D + G D + I L+ IE A++ + Sbjct: 140 INYDENPFLSDTMLKVIDAARRRDPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 199 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTN 357 P +G D+A+ G D V R G VI +W + Sbjct: 200 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKEDELLK 246 >gi|213618708|ref|ZP_03372534.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. E98-2068] Length = 282 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 58/163 (35%), Gaps = 5/163 (3%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252 + +EA + + + + + ++ NP ++ + P +D + Sbjct: 122 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 179 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D + G D + I L+ IE A++ + Sbjct: 180 INYDENPFLSDTMLKVIDAARRRDPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 239 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDL 353 P +G D+A+ G D V R G VI +W + Sbjct: 240 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKED 282 >gi|323973818|gb|EGB68992.1| terminase [Escherichia coli TA007] Length = 589 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 35/196 (17%), Positives = 55/196 (28%), Gaps = 29/196 (14%) Query: 276 DSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIMGCDI 324 +D R +F S P ++ + + P+ +G D Sbjct: 359 SADDFRNLFMCEFVDDKA-SVFPFEELQRCMVDAMEEWEDFEPFADRPFNWRPVWIGYDP 417 Query: 325 AEEGGDN--TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378 + G V+ L G L W D I L EKY D I IDA Sbjct: 418 SHTGDSAGCAVLAPPLVAGGKFRILERHQWKGMDFAAQAEAIRALTEKYTVDYIGIDATG 477 Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGL--I 436 G + A + + +T + +K D + L +G I Sbjct: 478 IGQGVYQLVR---------SFFPAARAIRYTPEMKTAMVLKAKDTIRRGCLEYDAGATDI 528 Query: 437 QNLKSLKSFIVPNTGE 452 + N+G Sbjct: 529 TQSFMAIRKTMTNSGR 544 >gi|261340099|ref|ZP_05967957.1| terminase, ATPase subunit [Enterobacter cancerogenus ATCC 35316] gi|288318026|gb|EFC56964.1| terminase, ATPase subunit [Enterobacter cancerogenus ATCC 35316] Length = 589 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 55/196 (28%), Gaps = 29/196 (14%) Query: 276 DSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIMGCDI 324 +D R +F S P ++ + + P+ +G D Sbjct: 359 SADDFRNLFMCEFVDDKA-SVFPFEELQRCMVDAMEEWEDFEPFADRPFNWRPVWIGYDP 417 Query: 325 AEEGGDN--TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378 + G V+ L G L W D I L EKY D I IDA Sbjct: 418 SHTGDSAGCAVLAPPLVAGGKFRILERHQWKGMDFAAQAEAIRALTEKYTVDYIGIDATG 477 Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGL--I 436 G + A + + +T + +K D + L +G I Sbjct: 478 IGQGVYQLVR---------SFFPAARAIRYTPEMKTAMVLKAKDTIRRGCLEYDAGATDI 528 Query: 437 QNLKSLKSFIVPNTGE 452 + ++G Sbjct: 529 TQSFMAIRKTMTSSGR 544 >gi|218558996|ref|YP_002391909.1| Terminase, ATPase subunit (GpP) [Escherichia coli S88] gi|218365765|emb|CAR03503.1| Terminase, ATPase subunit (GpP) [Escherichia coli S88] Length = 600 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 361 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 417 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 418 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 477 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 478 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 527 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 528 CLEYDVSATDITSSFMAIRKTMTSSGR 554 >gi|212709268|ref|ZP_03317396.1| hypothetical protein PROVALCAL_00303 [Providencia alcalifaciens DSM 30120] gi|212688180|gb|EEB47708.1| hypothetical protein PROVALCAL_00303 [Providencia alcalifaciens DSM 30120] Length = 585 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 38/165 (23%), Positives = 58/165 (35%), Gaps = 24/165 (14%) Query: 246 DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 W++ ++ G D E + Y D + +F DI S L ++++ Sbjct: 324 GQWRQIVTVEDAIRGGCDLFEIEQLSLEY--SPDEFENLLMCEFVD-DIASIFNLQLMQK 380 Query: 305 ALNRE-----------PCPDPYAPLIMGCDIAE--EGGDN--TVVV---LRRGPVIEHLF 346 + P Y P+ +G D A+ + GD+ VVV LR G L Sbjct: 381 CMVDSWEVWNDVQPLMVRPYAYHPVWIGYDPAKGTQNGDSAGCVVVAPPLRAGDKFRILE 440 Query: 347 D--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 W D R N I L E+Y I ID+ G + Sbjct: 441 HHQWRGMDFRAQANAIKELTERYNVQYIGIDSTGIGHGVLQNVRD 485 >gi|194444881|ref|YP_002043300.1| hypothetical protein SNSL254_A4364 [Salmonella enterica subsp. enterica serovar Newport str. SL254] gi|194403544|gb|ACF63766.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Newport str. SL254] Length = 589 Score = 59.0 bits (141), Expect = 2e-06, Method: Composition-based stats. Identities = 37/230 (16%), Positives = 71/230 (30%), Gaps = 34/230 (14%) Query: 244 PLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W++ I+ +G + + +D R +F S P + Sbjct: 328 PDGQWRQIVTIEDALAKGCTLFNIDTLKRENSVDE--FRNLFMCEFVDDKA-SVFPFEEL 384 Query: 303 EEALNR-----------EPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL-- 345 + + P + P+ +G D + G VV+ G L Sbjct: 385 QRCMVDSLEKWEDYAPFADRPFGHRPVWIGYDPSLRGDSAGCVVIAPPVVAGGKFRILER 444 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405 W D I L +KY + I IDA G + A + Sbjct: 445 HQWKGMDFAQQAESIRELTQKYTVEYIGIDATGLGQGVFQLVR---------SFYPAARE 495 Query: 406 LEFCRNRRTELHVKMADWLEFASLINH---SGLIQNLKSLKSFIVPNTGE 452 + + +T + +K D + L + + Q+ S++ + ++G Sbjct: 496 IRYTPEMKTAMVLKAKDTIRRGCLEYDVSATDITQSFMSIRK-TMTSSGR 544 >gi|154248423|ref|YP_001419381.1| hypothetical protein Xaut_4503 [Xanthobacter autotrophicus Py2] gi|154162508|gb|ABS69724.1| protein of unknown function DUF264 [Xanthobacter autotrophicus Py2] Length = 457 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 42/240 (17%), Positives = 68/240 (28%), Gaps = 20/240 (8%) Query: 175 TYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRR 232 T PDT G + DE + D I + +++ TS P Sbjct: 113 TALPANPDTARGFSAN----VFLDEFAIHKDSKAIWGALFPVISKNGLRLRV--TSTPNG 166 Query: 233 LSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD 292 KFYEI + W R +D + D D+ E ++ + Sbjct: 167 KGNKFYEIMTAADEVWSRHVVDIYQAVADGLPRDIDELRAGLADDDLWAQEYELKWLDEA 226 Query: 293 ID----SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEH 344 I E A +P +G DI D V+ + Sbjct: 227 SAWLSYDLISSCEDERA--GDPALYQGGVCFVGRDIGRRQ-DLHVIWVWEQVGDVLWERE 283 Query: 345 LFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTC-DYLEMLGYHVYRVLGQKRA 403 + + ++ ++ +YR ID G + D G V VL + Sbjct: 284 RIEQKRATFAEMDDAFDDIMVRYRVGRACIDQTGMGEKVVEDAQRRWGSRVEGVLFTGPS 343 >gi|225220117|ref|YP_002720084.1| phage terminase large subunit [Enterobacteria phage SSL-2009a] gi|224986058|gb|ACN74622.1| phage terminase large subunit [Enterobacteria phage SSL-2009a] Length = 461 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 66/336 (19%), Positives = 117/336 (34%), Gaps = 48/336 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 +G G GK+ + A V+ L++ PG I + L ++ E+ K + F Sbjct: 58 SGFGGGKSWVAARKVIQLLTLNPGYDGIVTEPTIPLLVKIMYPELEKAFDEAGFRWKFNK 117 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 Q D ++ L + K +C S E +G + + I+ DE T Sbjct: 118 Q-----------DKIYNVL-VKGKWTRVICE--SMENYTRLIGVNAAW---IVADEFDTT 160 Query: 204 PDVINLGILGFLTER---NANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR-FQIDTRTVE 259 + + L R R +++ S P Y+IF KR + T Sbjct: 161 KQDVAMAAYHKLLGRLRAGFVRQFVIVSTPEGYRAM-YQIFEVEKGSQKRLIRAKTTDNH 219 Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI 319 + F + + ++Y +++ + G F + + E + E P LI Sbjct: 220 HLPADFIDTLRSQY--PANLIDAYLNGLFVNLTSGAVYKMFNREGNASTEE-VHPDDTLI 276 Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--------TDLRTTNNKISGLVEKYR--- 368 +G D VV +RR I ++ DL T I + E+Y Sbjct: 277 IGMDFNVTKM-AAVVYVRR-QRITENKEFRDEIHAVDEFVDLFDTPAMIEAIEERYPEHC 334 Query: 369 -PDAIII--DANN-----TGARTCD--YLEMLGYHV 394 +++ D++ A + D LE G+ V Sbjct: 335 AAGRVVVYPDSSGKSRKTVNASSSDIAQLEDAGFEV 370 >gi|326783087|ref|YP_004323484.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-HM2] gi|310005505|gb|ADO99893.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-HM2] Length = 560 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 72/414 (17%), Positives = 135/414 (32%), Gaps = 65/414 (15%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 +Q E +E H N P GK+T +L + ++V Sbjct: 60 DFQQELIESFHEHRFNIAKLPRQ------------TGKSTTCVSYLLHYILFNDNVNVGI 107 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 LAN + + L S+ + Q + ++ + L SK Sbjct: 108 LANKLSTARDLL----SRLQLAYEQLPLWIQQGIVVY------NKGSMELENGSK-ILAA 156 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI----NLGILGFLTERNANRFWIMTS 228 + S R +F I DE + P+ I + +T + I+ S Sbjct: 157 STSASAVRGMSFN--------IIFLDEFAFIPNHIAEQFFSSVYPTITS-GTSTKVIIIS 207 Query: 229 NPRRLSGKFYEIF---NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC 285 P ++ FY+++ K + + ++ V G D + E IA E Sbjct: 208 TPNGMN-HFYKLWVDAQKGRNGYAWNEVHWSKVPGRDAKWKEQTIANTSERQ--FTQEFD 264 Query: 286 GQFPQQDIDSFIPLNIIEE-----------ALNREPCPDPYAPLIMGCDIAEE--GGDNT 332 +F +D+ I + + +L+ P I+ D++ + Sbjct: 265 CEFL-GSVDTLITASKLRVLTYDDVMTTNGSLDIYEKPIDKHEYIITVDVSRGLAQDYSA 323 Query: 333 VVVLRRGPVIEHLF-DWSKTDLRTT--NNKISGLVEKYRPDAIIIDANNTGARTCD---- 385 VV+ L + D+R N I + Y ++ + N+ G Sbjct: 324 FVVIDITHAPWRLVAKYRDKDVRPMLFPNIIFNVATNYNKAYVLTEVNDIGEAVAGSLFY 383 Query: 386 YLEMLGYHVYRVLG-QKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQN 438 LE + + G + V F N+ T++ VKM+ ++ N LI++ Sbjct: 384 DLEYENTLMCAMRGRAGQIVGQGFSGNK-TQMGVKMSKTVKAQGCSNLKTLIED 436 >gi|322662586|gb|EFY58794.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 81038-01] Length = 280 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 49/143 (34%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + + P Sbjct: 40 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWTDFQSLALRPFG 96 Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ G VV+ G L W D R + I L + Sbjct: 97 WREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 156 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 157 QYNVTYIGIDSTGVGHGVYENVK 179 >gi|323183894|gb|EFZ69285.1| terminase, ATPase subunit [Escherichia coli 1357] Length = 590 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|320180747|gb|EFW55673.1| Phage terminase, ATPase subunit [Shigella boydii ATCC 9905] gi|323167352|gb|EFZ53060.1| terminase, ATPase subunit [Shigella sonnei 53G] Length = 590 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 60/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F + S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKV-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|117623093|ref|YP_852006.1| putative phage terminase [Escherichia coli APEC O1] gi|117624286|ref|YP_853199.1| Phage protein P [Escherichia coli APEC O1] gi|115512217|gb|ABJ00292.1| putative phage terminase [Escherichia coli APEC O1] gi|115513410|gb|ABJ01485.1| Phage protein P [Escherichia coli APEC O1] Length = 590 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|331675382|ref|ZP_08376132.1| terminase, ATPase subunit (GpP) [Escherichia coli TA280] gi|331067442|gb|EGI38847.1| terminase, ATPase subunit (GpP) [Escherichia coli TA280] Length = 590 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSTDDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|330967816|gb|EGH68076.1| hypothetical protein PSYAC_24858 [Pseudomonas syringae pv. actinidiae str. M302091] Length = 774 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 51/163 (31%), Gaps = 20/163 (12%) Query: 244 PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W++ I G D E + Y D D + +F +F L + Sbjct: 343 PDGQWRKVITILDAISGGCDLFDLEQLQLEY--DEDKFQQLFMCKFIDSSQSAF-SLADL 399 Query: 303 EEALNR----------EPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHLFD- 347 E + +P +P+ +G D + D T VV L G L Sbjct: 400 ERCYSDLSLWADFDPDDPRLYGNSPVWIGYDPSRTRDDATCVVIAPPLENGGKFRILEKH 459 Query: 348 -WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 W + ++ L E++ I ID G D + Sbjct: 460 SWRGQSFKYQAEQVKKLTERFNVQHIGIDTTGIGYGVFDLVRD 502 >gi|294634584|ref|ZP_06713119.1| terminase, ATPase subunit [Edwardsiella tarda ATCC 23685] gi|291092098|gb|EFE24659.1| terminase, ATPase subunit [Edwardsiella tarda ATCC 23685] Length = 588 Score = 58.6 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 33/134 (24%), Positives = 44/134 (32%), Gaps = 17/134 (12%) Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE----ALNREPCPDPYA------PLIMG 321 + +D + +F S P ++ AL +PYA P+ +G Sbjct: 355 KRENSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDALEAWTDVNPYADHPFDRPVWIG 413 Query: 322 CDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIID 375 D + G VVL G L W D T I L EKYR D I ID Sbjct: 414 YDPSHTGDSAGCVVLAPPAVPGGKFRMLERHQWKGMDFSTQAEAIRALTEKYRVDYIGID 473 Query: 376 ANNTGARTCDYLEM 389 A G + Sbjct: 474 ATGIGQGVFQLVRE 487 >gi|323936689|gb|EGB32974.1| terminase [Escherichia coli E1520] Length = 590 Score = 58.6 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFATNPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|210062534|gb|ACJ06274.1| probable terminase subunit [Photorhabdus luminescens] Length = 585 Score = 58.6 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 31/144 (21%), Positives = 53/144 (36%), Gaps = 23/144 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE-----------PCPDP 314 + + Y D + + +F DI+S L +++ + P Sbjct: 345 IDQLRLEY--SPDEYQNLLMCEF-MDDIESIFSLQLMQGCMVDSWEIWDDVQPLMLRPYG 401 Query: 315 YAPLIMGCDIAEEG--GDNT---VVVLRR--GPVIEHL--FDWSKTDLRTTNNKISGLVE 365 Y P+ +G D A+ G GD+ VV R G L W + R ++ I L E Sbjct: 402 YHPVWIGYDPAKGGENGDSAGCVVVAPPRVPGDKFRILERHQWRGMNFRAQSDAIKRLTE 461 Query: 366 KYRPDAIIIDANNTGARTCDYLEM 389 +Y + I ID+ G ++ Sbjct: 462 QYNVEYIGIDSTGVGHGVYQNVKE 485 >gi|324113792|gb|EGC07767.1| terminase [Escherichia fergusonii B253] Length = 590 Score = 58.6 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFATNPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|332088966|gb|EGI94078.1| terminase, ATPase subunit [Shigella boydii 5216-82] Length = 590 Score = 58.6 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|323961666|gb|EGB57270.1| terminase [Escherichia coli H489] Length = 590 Score = 58.6 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|294494147|gb|ADE92903.1| terminase, ATPase subunit [Escherichia coli IHE3034] gi|323951869|gb|EGB47743.1| terminase [Escherichia coli H252] Length = 590 Score = 58.6 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|254039145|ref|ZP_04873195.1| terminase [Escherichia sp. 1_1_43] gi|226838581|gb|EEH70610.1| terminase [Escherichia sp. 1_1_43] Length = 590 Score = 58.6 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|30065706|ref|NP_839851.1| gpP [Yersinia phage L-413C] gi|300947250|ref|ZP_07161455.1| conserved hypothetical protein [Escherichia coli MS 116-1] gi|301022960|ref|ZP_07186775.1| conserved hypothetical protein [Escherichia coli MS 69-1] gi|331678021|ref|ZP_08378696.1| terminase, ATPase subunit (GpP) [Escherichia coli H591] gi|30025900|gb|AAP04439.1| gpP [Yersinia phage L-413C] gi|33413700|gb|AAN28220.1| gpP [Enterobacteria phage WPhi] gi|300397301|gb|EFJ80839.1| conserved hypothetical protein [Escherichia coli MS 69-1] gi|300453115|gb|EFK16735.1| conserved hypothetical protein [Escherichia coli MS 116-1] gi|315061386|gb|ADT75713.1| terminase, ATPase subunit [Escherichia coli W] gi|315063221|gb|ADT77548.1| phage large terminase subunit [Escherichia coli W] gi|323380714|gb|ADX52982.1| phage large terminase subunit GpP [Escherichia coli KO11] gi|325499372|gb|EGC97231.1| Terminase, ATPase subunit (GpP) [Escherichia fergusonii ECD227] gi|331074481|gb|EGI45801.1| terminase, ATPase subunit (GpP) [Escherichia coli H591] Length = 590 Score = 58.6 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|323378035|gb|ADX50303.1| phage large terminase subunit GpP [Escherichia coli KO11] Length = 589 Score = 58.6 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 350 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 406 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 407 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 466 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 467 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 516 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 517 CLEYDVSATDITSSFMAIRKTMTSSGR 543 >gi|315296184|gb|EFU55492.1| conserved hypothetical protein [Escherichia coli MS 16-3] Length = 590 Score = 58.6 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|213646682|ref|ZP_03376735.1| Phage protein P [Salmonella enterica subsp. enterica serovar Typhi str. J185] Length = 590 Score = 58.6 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|9630329|ref|NP_046758.1| gpP [Enterobacteria phage P2] gi|168789033|ref|ZP_02814040.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC869] gi|188492656|ref|ZP_02999926.1| phage large terminase subunit GpP [Escherichia coli 53638] gi|261225041|ref|ZP_05939322.1| Terminase, ATPase subunit (GpP) [Escherichia coli O157:H7 str. FRIK2000] gi|261257612|ref|ZP_05950145.1| Terminase, ATPase subunit (GpP) [Escherichia coli O157:H7 str. FRIK966] gi|301048706|ref|ZP_07195715.1| conserved hypothetical protein [Escherichia coli MS 185-1] gi|139354|sp|P25479|VPP_BPP2 RecName: Full=Terminase, ATPase subunit; AltName: Full=GpP gi|3139088|gb|AAD03269.1| gpP [Enterobacteria phage P2] gi|188487855|gb|EDU62958.1| phage large terminase subunit GpP [Escherichia coli 53638] gi|189371250|gb|EDU89666.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC869] gi|300299452|gb|EFJ55837.1| conserved hypothetical protein [Escherichia coli MS 185-1] gi|324020535|gb|EGB89754.1| hypothetical protein HMPREF9542_00768 [Escherichia coli MS 117-3] Length = 590 Score = 58.6 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|320196848|gb|EFW71470.1| Phage terminase, ATPase subunit [Escherichia coli WV_060327] Length = 590 Score = 58.6 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|170683976|ref|YP_001746268.1| phage large terminase subunit GpP [Escherichia coli SMS-3-5] gi|170521694|gb|ACB19872.1| phage large terminase subunit GpP [Escherichia coli SMS-3-5] Length = 590 Score = 58.6 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|170769222|ref|ZP_02903675.1| phage large terminase subunit GpP [Escherichia albertii TW07627] gi|170121874|gb|EDS90805.1| phage large terminase subunit GpP [Escherichia albertii TW07627] Length = 590 Score = 58.6 bits (140), Expect = 3e-06, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 51/184 (27%), Gaps = 29/184 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRGC 518 Query: 429 LINH 432 L Sbjct: 519 LEYD 522 >gi|18466735|ref|NP_569542.1| hypothetical protein HCM2.0070c [Salmonella enterica subsp. enterica serovar Typhi str. CT18] gi|16506051|emb|CAD09937.1| hypothetical protein [Salmonella enterica subsp. enterica serovar Typhi str. CT18] Length = 418 Score = 58.2 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 50/335 (14%), Positives = 106/335 (31%), Gaps = 46/335 (13%) Query: 59 MEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET 118 + +V H +P FK + AGR GK+ L+ ++ + V +A + Sbjct: 7 LSLVQLHSGQMQVFQSPHRFKV-VCAGRRWGKSRLSISTIIRAAAKEKKQRVWYVAPTYQ 65 Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178 + LW ++ + L P W ++ I K+ S + Sbjct: 66 MARQILWDDLQEVL-----------------PRKWVRKKNDTTMTIVLKNGSEIALK-GA 107 Query: 179 ERPDTFVGHHNTYGMAIINDEASGT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 ++PDT G ++ DE PD + L+ ++ P+ +F Sbjct: 108 DKPDTLRGV---ALHFVVLDEFQDMKPDTWYKVLRPTLSS--TRGGALIIGTPKG-FSEF 161 Query: 238 YEIFN-------KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290 ++++ + WK +Q T + + E +D E F Sbjct: 162 HKLWTIGQNKDLQRKGQWKSWQFVTADSPFVPSAEIEAAKND--MDPKSFAQEYLASFEN 219 Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK 350 + P + + +P P+ +G D D V+ + L+ + Sbjct: 220 MSGRVYYPFD--RNVHVKPLQFNPKLPIWVGQD---FNIDPMSSVILQPQPNGELWAVDE 274 Query: 351 -----TDLRTTNNKISGLVEKYRPD-AIIIDANNT 379 ++ +++ +++ I D Sbjct: 275 VVLFSSNTAEVCDELERRFWRWKSQVTIFPDPAGA 309 >gi|16082806|ref|NP_395360.1| hypothetical protein YPMT1.24c [Yersinia pestis CO92] gi|31795361|ref|NP_857813.1| hypothetical protein Y1030 [Yersinia pestis KIM] gi|40787951|ref|NP_857660.2| hypothetical protein YPKMT021 [Yersinia pestis KIM] gi|45478613|ref|NP_995469.1| hypothetical protein YP_pMT025 [Yersinia pestis biovar Microtus str. 91001] gi|52788073|ref|YP_093901.1| hypothetical protein pG8786_021 [Yersinia pestis] gi|108793557|ref|YP_636707.1| hypothetical protein YPA_MT0025 [Yersinia pestis Antiqua] gi|108793757|ref|YP_636595.1| hypothetical protein YPN_MT0025 [Yersinia pestis Nepal516] gi|145597216|ref|YP_001154679.1| hypothetical protein YPDSF_4052 [Yersinia pestis Pestoides F] gi|149192775|ref|YP_001294006.1| hypothetical protein YPE_4292 [Yersinia pestis CA88-4125] gi|162417876|ref|YP_001604588.1| hypothetical protein YpAngola_0076 [Yersinia pestis Angola] gi|165939469|ref|ZP_02228016.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. IP275] gi|166214433|ref|ZP_02240468.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167402343|ref|ZP_02307808.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167422791|ref|ZP_02314544.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167466683|ref|ZP_02331387.1| hypothetical protein YpesF_02065 [Yersinia pestis FV-1] gi|229896952|ref|ZP_04512111.1| hypothetical protein YPS_4795 [Yersinia pestis Pestoides A] gi|229897756|ref|ZP_04512911.1| hypothetical protein YPH_4790 [Yersinia pestis biovar Orientalis str. PEXU2] gi|229900293|ref|ZP_04515428.1| hypothetical protein YPF_4819 [Yersinia pestis biovar Orientalis str. India 195] gi|229904817|ref|ZP_04519927.1| hypothetical protein YP516_4657 [Yersinia pestis Nepal516] gi|270491004|ref|ZP_06208077.1| phage terminase, large subunit, PBSX family [Yersinia pestis KIM D27] gi|294502015|ref|YP_003565752.1| hypothetical protein YPZ3_pMT0023 [Yersinia pestis Z176003] gi|3883031|gb|AAC82691.1| unknown [Yersinia pestis KIM 10] gi|5834709|emb|CAB55206.1| hypothetical protein YPMT1.24c [Yersinia pestis CO92] gi|45357266|gb|AAS58660.1| hypothetical protein YP_pMT025 [Yersinia pestis biovar Microtus str. 91001] gi|52538002|emb|CAG27427.1| hypothetical protein [Yersinia pestis] gi|108777821|gb|ABG20339.1| hypothetical protein YPN_MT0025 [Yersinia pestis Nepal516] gi|108782104|gb|ABG16161.1| hypothetical protein YPA_MT0025 [Yersinia pestis Antiqua] gi|145212984|gb|ABP42389.1| hypothetical protein YPDSF_4052 [Yersinia pestis Pestoides F] gi|148872433|gb|ABR14922.1| hypothetical protein YPMT1.24c [Yersinia pestis CA88-4125] gi|162350848|gb|ABX84797.1| conserved hypothetical protein [Yersinia pestis Angola] gi|165912657|gb|EDR31287.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. IP275] gi|166204381|gb|EDR48861.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. B42003004] gi|166958284|gb|EDR55305.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167048235|gb|EDR59643.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|229678132|gb|EEO74238.1| hypothetical protein YP516_4657 [Yersinia pestis Nepal516] gi|229686652|gb|EEO78733.1| hypothetical protein YPF_4819 [Yersinia pestis biovar Orientalis str. India 195] gi|229693337|gb|EEO83387.1| hypothetical protein YPH_4790 [Yersinia pestis biovar Orientalis str. PEXU2] gi|229699988|gb|EEO88028.1| hypothetical protein YPS_4795 [Yersinia pestis Pestoides A] gi|262363909|gb|ACY60628.1| hypothetical protein YPD4_pMT0023 [Yersinia pestis D106004] gi|262364065|gb|ACY64401.1| hypothetical protein YPD8_pMT0023 [Yersinia pestis D182038] gi|270334985|gb|EFA45763.1| phage terminase, large subunit, PBSX family [Yersinia pestis KIM D27] gi|294352486|gb|ADE66542.1| hypothetical protein YPZ3_pMT0023 [Yersinia pestis Z176003] gi|320017547|gb|ADW01117.1| hypothetical protein YPC_4788 [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 418 Score = 58.2 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 50/335 (14%), Positives = 106/335 (31%), Gaps = 46/335 (13%) Query: 59 MEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET 118 + +V H +P FK + AGR GK+ L+ ++ + V +A + Sbjct: 7 LSLVQLHSGQMQVFQSPHRFKV-VCAGRRWGKSRLSISTIIRAAAKEKKQRVWYVAPTYQ 65 Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178 + LW ++ + L P W ++ I K+ S + Sbjct: 66 MARQILWDDLQEVL-----------------PRKWVRKKNDTTMTIVLKNGSEIALK-GA 107 Query: 179 ERPDTFVGHHNTYGMAIINDEASGT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 ++PDT G ++ DE PD + L+ ++ P+ +F Sbjct: 108 DKPDTLRGV---ALHFVVLDEFQDMKPDTWYKVLRPTLSS--TRGGALIIGTPKG-FSEF 161 Query: 238 YEIFN-------KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290 ++++ + WK +Q T + + E +D E F Sbjct: 162 HKLWTIGQNKDLQRKGQWKSWQFVTADSPFVPSAEIEAAKND--MDPKSFAQEYLASFEN 219 Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK 350 + P + + +P P+ +G D D V+ + L+ + Sbjct: 220 MSGRVYYPFD--RNVHVKPLQFNPKLPIWVGQD---FNIDPMSSVILQPQPNGELWAVDE 274 Query: 351 -----TDLRTTNNKISGLVEKYRPD-AIIIDANNT 379 ++ +++ +++ I D Sbjct: 275 VVLFSSNTAEVCDELERRFWRWKSQVTIFPDPAGA 309 >gi|324115403|gb|EGC09352.1| terminase [Escherichia coli E1167] Length = 572 Score = 58.2 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 357 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 413 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 414 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 473 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 474 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 523 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 524 CLEYDVSATDITSSFMAIRKTMTSSGR 550 >gi|302343251|ref|YP_003807780.1| hypothetical protein Deba_1821 [Desulfarculus baarsii DSM 2075] gi|301639864|gb|ADK85186.1| conserved hypothetical protein [Desulfarculus baarsii DSM 2075] Length = 507 Score = 58.2 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 63/394 (15%), Positives = 109/394 (27%), Gaps = 69/394 (17%) Query: 38 WGEKGTPLEGFSAPRSW--QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNA 95 WG+ S W Q+E + + ++ GR +GK+ + + Sbjct: 20 WGQAYLYNRDGSGRDYWPHQVEDLRCPAKNIIHLD--------------GRDVGKSIVLS 65 Query: 96 WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYS 155 L T G + A + L T + E+ L P+ M S++L Sbjct: 66 TDALHYAFTTRGGQGLIAAPHQGHLDTII-EEIEFQLDTNPD----LMNSIALTKYGKPK 120 Query: 156 DVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL 215 ++ + S + + D F H + DE + + + L Sbjct: 121 IHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLTERAWKALRQCL 177 Query: 216 TERNANRFWIMTSNPRRL-SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARY- 273 R + S P L +Y + D + F+ + ++ Y Sbjct: 178 KAGGTLRIY---STPNGLRDTTYYRL--TSSDQFHVFRWPSWLNPLWTEDREAELLEFYG 232 Query: 274 GLDSDVTRVEVCGQFPQQDIDSF-----------------IPLNII--------EEALNR 308 G DS + EV G+ + +F I + E A +R Sbjct: 233 GRDSSGWQHEVAGEHGKPSYGAFNVEQFNLCRQDLLEYQKIVITDSEMRDCDTEEAAHDR 292 Query: 309 -----EPCPDPYAPLIMGCDIAEEGGDNTVVVL-------RRGPVIEHLFDWSKTDLRTT 356 P + G D+ +VV R + Sbjct: 293 LEMLLNLTPRSGQFWVGG-DLGYTNDPTEIVVFQEMEIGERTLLKMILRVHLEHVSYPHI 351 Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390 I+ L Y P I +D G L L Sbjct: 352 AQIIALLERYYTPAGIGVDNGGNGLAVVQELLTL 385 >gi|300715671|ref|YP_003740474.1| Terminase, ATPase [Erwinia billingiae Eb661] gi|299061507|emb|CAX58621.1| Terminase, ATPase subunit [Erwinia billingiae Eb661] Length = 588 Score = 58.2 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 46/144 (31%), Gaps = 23/144 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y + + +F D+ S PL ++ + P Sbjct: 348 IDQLRLEY--SPPEYQNLLMCEFID-DLASVFPLADLQACMVDSWEVWQDFEALALRPFG 404 Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ G VV+ G L W D R + I L Sbjct: 405 WREVWIGYDPAKGTQHGDSAGCVVIAPPSVPGGKFRILERHQWRGMDFRAQADAIKELTR 464 Query: 366 KYRPDAIIIDANNTGARTCDYLEM 389 +Y I ID+ G + ++M Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVKM 488 >gi|221633560|ref|YP_002522786.1| hypothetical protein trd_1584 [Thermomicrobium roseum DSM 5159] gi|221155562|gb|ACM04689.1| conserved hypothetical protein [Thermomicrobium roseum DSM 5159] Length = 489 Score = 58.2 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 53/352 (15%), Positives = 96/352 (27%), Gaps = 55/352 (15%) Query: 89 GKTTLNAWLVLWLMSTRP--GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSL 146 GK A + WL+ G V+ S +L + + + Sbjct: 65 GKDEALAQFLAWLLLRFHRRGGEVVVALPSWR-----------PQGALARERLLAVLAAP 113 Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDV 206 L + G + R S G T + ++ +EA Sbjct: 114 RLAALLAGLGLAPEVAGARVALGRAVVRYASAGPSANVRGL--TASLLLVANEAQDIAPD 171 Query: 207 INLGILGFLTERNANRFWIMTSNPRRLSG------KFYEIFNKPLDDWKRFQIDTRTVEG 260 + + + P ++ + + +++ TV Sbjct: 172 RWDSAFAPMA-ASTGAPALYLGTPWGSDSLLARELRYLTALERQDGQQRVWRVPWTTVAA 230 Query: 261 IDPSF---HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC---PDP 314 P++ +A+ G R E G P + P P P Sbjct: 231 ELPAYGDHVRERMAQLGAGHPFVRTEY-GLEELAGEGRLFPPERLALVRGDHPALLAPRP 289 Query: 315 YAPLIMGCDIAEEGGDN-------------------TVVVLRRGPVIEHLFDWS----KT 351 + D+A G D TVV + G + + W Sbjct: 290 GERYALTVDVA--GEDEASAGELRDDPGARRDATALTVVRVVPGTLPRYEAVWRARWVGA 347 Query: 352 DLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE-MLGYHVYRVLGQKR 402 + + L +R + +++DA+ GA +LE LG V RV+ R Sbjct: 348 RQVRQHEALVQLARAWRAERVVVDASGVGAGLAAFLEHALGERVRRVVFSPR 399 >gi|307826152|ref|ZP_07656363.1| protein of unknown function DUF264 [Methylobacter tundripaludum SV96] gi|307732791|gb|EFO03657.1| protein of unknown function DUF264 [Methylobacter tundripaludum SV96] Length = 598 Score = 58.2 bits (139), Expect = 3e-06, Method: Composition-based stats. Identities = 44/261 (16%), Positives = 78/261 (29%), Gaps = 52/261 (19%) Query: 189 NTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIF 241 + +G + DE PD + G + R S P +S + Y E + Sbjct: 261 SYHGHLYV-DECFWIPDFDKMWKVASGMAAHKKWRRTL--FSTPSAISHQAYPMWCGEKY 317 Query: 242 NKPLDDWKRFQIDT------------------------RTVEGIDPSFHEGIIARYGLDS 277 N+ D K+ + D +G D + + Y D Sbjct: 318 NQGKADDKKAEFDVSHAALKDGLMGADKIWRHMVTVVDAEAQGCDLFDIDELQDEYSKDD 377 Query: 278 DVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE---------PCPDPYAPLIMGCDIAEEG 328 +F D S L I+ RE P P P+ +G D + Sbjct: 378 --FANLFMCKFID-DAKSVFNLGIMMTCYAREDYTDYNDKAPRPYGNRPVAIGYDPSRTR 434 Query: 329 GDNT----VVVLRRG--PVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGAR 382 + + + LR G + D+ + + N+I +V+ + + ID G Sbjct: 435 DNASLAILAIPLRPGDKWRVLKTMDFHGQNFQYQANRIKEIVDSHNVQHVGIDVTGIGYG 494 Query: 383 TCDYLEMLGYHVYRVLGQKRA 403 + +E V + Sbjct: 495 LFELVEQFYRRVTPINYSNET 515 >gi|318064508|gb|ADV36483.1| phage terminase large subunit [Edwardsiella phage eiDWF] gi|318064606|gb|ADV36532.1| phage terminase large subunit [Edwardsiella phage eiMSLS] Length = 460 Score = 58.2 bits (139), Expect = 4e-06, Method: Composition-based stats. Identities = 63/346 (18%), Positives = 113/346 (32%), Gaps = 38/346 (10%) Query: 44 PLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMS 103 P++ R+W+++ + H +N+ ++ +G G GKT A + L Sbjct: 27 PVKKERKSRTWRIKTL----PHQRGLINDTTTKILGLC--SGFGGGKTWSAARKAVQLAI 80 Query: 104 TRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLG 163 PG I + L ++ E+ K L+ K F Q H Sbjct: 81 LNPGCDGIITEPTIPLLVKIMYPELEKALNEAGIKWKFNKQDKIYHC------------R 128 Query: 164 IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG---ILGFLTERNA 220 I + +C S E +G + + + D PD+ +LG L N Sbjct: 129 IAGQMTRIICD--SMENYTRLIGVNAAWCVCDEFDTTK--PDIAMEAYRKLLGRLRTGNV 184 Query: 221 NRFWIMTSNPRRLSGKFYEIFNKPLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDV 279 + I S P Y+IF DD KR + T + + + + A+Y ++ Sbjct: 185 RQMVI-VSTPEGFRAM-YQIFISEADDQKRLIKARTTDNHYLPQDYIDTLRAQY--PPEL 240 Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRG 339 + G+F + N N + + L++G D V V R Sbjct: 241 IEAYLNGEFVNLTGGAVY-RNFSRTLNNCDTVAEDDDTLMIGMDFNVGQMAGAVYVQRIA 299 Query: 340 PVIEHLFDWSKT----DLRTTNNKISGLVEKYRP---DAIIIDANN 378 +E + + D + I + I D++ Sbjct: 300 DGVEEMHLVDEFCGLLDTDAMIDAIKERYPDHHARGLIEIFPDSSG 345 >gi|318064394|gb|ADV36428.1| phage terminase large subunit [Edwardsiella phage eiAU] Length = 460 Score = 58.2 bits (139), Expect = 4e-06, Method: Composition-based stats. Identities = 63/346 (18%), Positives = 113/346 (32%), Gaps = 38/346 (10%) Query: 44 PLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMS 103 P++ R+W+++ + H +N+ ++ +G G GKT A + L Sbjct: 27 PVKKERKSRTWRIKTL----PHQRGLINDTTTKILGLC--SGFGGGKTWSAARKAVQLAI 80 Query: 104 TRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLG 163 PG I + L ++ E+ K L+ K F Q H Sbjct: 81 LNPGCDGIITEPTIPLLVKIMYPELEKALNEAGIKWKFNKQDKIYHC------------R 128 Query: 164 IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG---ILGFLTERNA 220 I + +C S E +G + + + D PD+ +LG L N Sbjct: 129 IAGQMTRIICD--SMENYTRLIGVNAAWCVCDEFDTTK--PDIAMEAYRKLLGRLRTGNV 184 Query: 221 NRFWIMTSNPRRLSGKFYEIFNKPLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDV 279 + I S P Y+IF DD KR + T + + + + A+Y ++ Sbjct: 185 RQMVI-VSTPEGFRAM-YQIFISEADDQKRLIKARTTDNHYLPQDYIDTLRAQY--PPEL 240 Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRG 339 + G+F + N N + + L++G D V V R Sbjct: 241 IEAYLNGEFVNLTGGAVY-RNFSRTLNNCDTVAEDDDTLMIGMDFNVGQMAGAVYVQRIA 299 Query: 340 PVIEHLFDWSKT----DLRTTNNKISGLVEKYRP---DAIIIDANN 378 +E + + D + I + I D++ Sbjct: 300 DGVEEMHLVDEFCGLLDTDAMIDAIKERYPDHHARGLIEIFPDSSG 345 >gi|293411885|ref|ZP_06654610.1| predicted protein [Escherichia coli B354] gi|220980013|emb|CAP72205.1| Hypothetical protein [Escherichia coli LF82] gi|291469440|gb|EFF11929.1| predicted protein [Escherichia coli B354] gi|323934319|gb|EGB30739.1| PBSX family protein phage terminase [Escherichia coli E1520] Length = 418 Score = 58.2 bits (139), Expect = 4e-06, Method: Composition-based stats. Identities = 48/335 (14%), Positives = 105/335 (31%), Gaps = 46/335 (13%) Query: 59 MEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET 118 + +V H +P FK + AGR GK+ L+ ++ + V +A + Sbjct: 7 LSLVQLHSGQMKVFQSPHRFKV-VCAGRRWGKSRLSISTIIRAAAKEKKQRVWYVAPTYQ 65 Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178 + LW ++ + L P W ++ I K+ S + Sbjct: 66 MARQILWDDLQEVL-----------------PRKWVRKKNDTTMTIVLKNGSEIALK-GA 107 Query: 179 ERPDTFVGHHNTYGMAIINDEASGT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 ++PDT G ++ DE D + L+ ++ P+ +F Sbjct: 108 DKPDTLRGV---ALHFVVLDEFQDMKADTWYKVLRPTLSS--TRGGALIIGTPKG-FSEF 161 Query: 238 YEIFN-------KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290 ++++ + WK +Q T + + E +D E F Sbjct: 162 HKLWTIGQNVELQRKGQWKSWQFVTADSPFVPTAEIEAAKND--MDPKSFAQEYLASFEN 219 Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK 350 + P + + +P P+ +G D D V+ + L+ + Sbjct: 220 MSGRVYYPFD--RNVHVKPLQFNPRLPIWVGQD---FNIDPMSSVILQPQPNGELWAIDE 274 Query: 351 -----TDLRTTNNKISGLVEKYRP-DAIIIDANNT 379 ++ +++ +++ + D Sbjct: 275 LVLFSSNTAEVCDELERRFWRWKSQITVFPDPAGA 309 >gi|322614428|gb|EFY11359.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 315996572] gi|322621507|gb|EFY18360.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 495297-1] gi|322624368|gb|EFY21201.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 495297-3] gi|322626565|gb|EFY23370.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 495297-4] gi|322633573|gb|EFY30315.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 515920-1] gi|322638384|gb|EFY35082.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 515920-2] gi|322647317|gb|EFY43813.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. NC_MB110209-0054] gi|322649287|gb|EFY45724.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. OH_2009072675] gi|322655993|gb|EFY52293.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. CASC_09SCPH15965] gi|322661388|gb|EFY57613.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 19N] gi|322666960|gb|EFY63135.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. MD_MDA09249507] gi|322671329|gb|EFY67452.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 414877] gi|322677664|gb|EFY73727.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 366867] gi|322681510|gb|EFY77540.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 413180] gi|322683910|gb|EFY79920.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 446600] gi|323195479|gb|EFZ80657.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 609458-1] gi|323200466|gb|EFZ85546.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 556150-1] gi|323203030|gb|EFZ88062.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 609460] gi|323205271|gb|EFZ90246.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 507440-20] gi|323210579|gb|EFZ95463.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 556152] gi|323218140|gb|EGA02852.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. MB101509-0077] gi|323221594|gb|EGA06007.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. MB102109-0047] gi|323227645|gb|EGA11800.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. MB110209-0055] gi|323230903|gb|EGA15021.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. MB111609-0052] gi|323234745|gb|EGA18831.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 2009083312] gi|323238784|gb|EGA22834.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 2009085258] gi|323241484|gb|EGA25515.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. 315731156] gi|323248370|gb|EGA32306.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2009159199] gi|323252865|gb|EGA36699.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008282] gi|323257014|gb|EGA40723.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008283] gi|323260513|gb|EGA44124.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008284] gi|323264430|gb|EGA47936.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008285] gi|323269565|gb|EGA53018.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008287] Length = 588 Score = 57.8 bits (138), Expect = 4e-06, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 49/143 (34%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWTDFQSLALRPFG 404 Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ G VV+ G L W D R + I L + Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|262194129|ref|YP_003265338.1| hypothetical protein Hoch_0830 [Haliangium ochraceum DSM 14365] gi|262077476|gb|ACY13445.1| protein of unknown function DUF264 [Haliangium ochraceum DSM 14365] Length = 503 Score = 57.8 bits (138), Expect = 4e-06, Method: Composition-based stats. Identities = 41/288 (14%), Positives = 85/288 (29%), Gaps = 52/288 (18%) Query: 227 TSNPRRLSGKFYEI----------FNKPLDDWKRFQIDTRTVEGIDPSF----HEGIIAR 272 S P G F+EI + W R + ++ E +A Sbjct: 208 CSTPLGRRGIFWEISTEELRKYPHHTRDEVPWWRCRFFCLDIDRAMREAPHMPTEERVAA 267 Query: 273 YGLDSDV----------TRVEVCGQFPQQDIDSFIPLNIIEEALNRE--------PCPDP 314 +G + V + E F + S+ P +I + + P+P Sbjct: 268 FGTQAIVQQLDSLPLEDFQQEFECSFVDESY-SYYPYELILPCTSEDLVPAGDFTDLPEP 326 Query: 315 YAPLIMGCDIAEEGGD-NTVVVLRRGPV--IEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371 ++ G D+ V G L + + + +++ Sbjct: 327 EGRIVAGFDVGRTRDRSELAVFEDTGGHFVCRLLRRYDQVPFAEQEADLRRFLDRVPVAR 386 Query: 372 IIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL---EFAS 428 + ID + G + L V + N E L + + Sbjct: 387 LSIDQSGIGMHLAENLARDYAQVVG----------DTFTNDNKERWATDLKILFQRKDIA 436 Query: 429 LINHSGLIQNLKSLKSFIVPNTGELAIESKRV-KGAKSTDYSDGLMYT 475 L L+ + S+K ++P+ G++ +++R +G + D + Sbjct: 437 LPRDRELVGQIHSIKRRVLPS-GKVGFDAERSTRGGHA-DRFWAIALA 482 >gi|213865314|ref|ZP_03387433.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Typhi str. M223] Length = 171 Score = 57.8 bits (138), Expect = 4e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 11 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 67 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 68 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 127 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 128 QYNVTYIGIDSTGVGHGVYENVK 150 >gi|253991767|ref|YP_003043123.1| putative phage terminase subunit [Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949] gi|211638542|emb|CAR67163.1| probable phage terminase subunit [Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949] gi|253783217|emb|CAQ86382.1| probable phage terminase subunit [Photorhabdus asymbiotica] Length = 585 Score = 57.8 bits (138), Expect = 4e-06, Method: Composition-based stats. Identities = 32/144 (22%), Positives = 54/144 (37%), Gaps = 23/144 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE-----------PCPDP 314 + + Y D + + +F DI+S L +++ + P Sbjct: 345 IDQLRLEY--SPDEYQNLLMCEF-MDDIESIFSLQLMQGCMVDSWEIWNDVQPLMLRPYG 401 Query: 315 YAPLIMGCDIAEEG--GDN--TVVV---LRRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 Y P+ +G D A+ G GD+ VVV L G L W + R ++ I L E Sbjct: 402 YNPVWIGYDPAKGGKNGDSAGCVVVAPPLVPGGKFRILERHQWRGMNFRAQSDAIKRLTE 461 Query: 366 KYRPDAIIIDANNTGARTCDYLEM 389 +Y + I ID+ G ++ Sbjct: 462 QYNVEYIGIDSTGVGHGVYQNVKE 485 >gi|322831306|ref|YP_004211333.1| terminase, ATPase subunit [Rahnella sp. Y9602] gi|321166507|gb|ADW72206.1| terminase, ATPase subunit [Rahnella sp. Y9602] Length = 596 Score = 57.8 bits (138), Expect = 4e-06, Method: Composition-based stats. Identities = 50/338 (14%), Positives = 98/338 (28%), Gaps = 66/338 (19%) Query: 195 IINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPL-- 245 + DE P+ + G ++ + S P L+ Y E+FNK Sbjct: 257 LYVDEIFWIPNFQKLRKVASGMASQEHLRTT--YFSTPSALTHGAYPFWSGELFNKGREN 314 Query: 246 ----------------------DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRV 282 W++ I+ G + + + + R Sbjct: 315 PNDRIELDIGHHALAKGRLCEDGQWRQIVTIEDALAGGCNLFNIDTLKQENSAED--FRN 372 Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIMGCDIAEEGGDN 331 +F S P ++ + Y + +G D + G Sbjct: 373 LFMCEFVDDQ-TSVFPFAELQRCMVESAEEWQDFSPFAMRPFGYRAVWIGYDPSHTGDSA 431 Query: 332 --TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385 VV L G L W D I L ++Y + I +DA G Sbjct: 432 GCAVVAPPLVDGGKFRVLERHQWKGMDFAAQAKSIEELTKRYCVEYIGVDATGIGQGVFQ 491 Query: 386 YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI---NHSGLIQNLKSL 442 + A+++ + +T++ +K D + L NH + + ++ Sbjct: 492 LVRQ---------FFPAAMEIRYSPETKTKMVLKAKDTITSGRLEYDTNHKDITSSFMAI 542 Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480 + + + E+ R + A D + +M+ P Sbjct: 543 RKTMTASGSRSTYEASRSEEASHADVAWAIMHALLNEP 580 >gi|312601717|gb|ADQ92391.1| terminase ATPase subunit [Salmonella phage RE-2010] gi|321223512|gb|EFX48577.1| Phage terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] Length = 572 Score = 57.8 bits (138), Expect = 4e-06, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 332 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 388 Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ G VV+ G L W D R + I L + Sbjct: 389 WREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 448 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 449 QYNVTYIGIDSTGVGHGVYENVK 471 >gi|298346517|ref|YP_003719204.1| phage terminase protein [Mobiluncus curtisii ATCC 43063] gi|298236578|gb|ADI67710.1| phage terminase protein [Mobiluncus curtisii ATCC 43063] Length = 470 Score = 57.8 bits (138), Expect = 4e-06, Method: Composition-based stats. Identities = 63/406 (15%), Positives = 117/406 (28%), Gaps = 63/406 (15%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQ +V + + R GKTTL L+ + PG V Sbjct: 32 PWQKLVADVAGERQAEHPERARYQTVVVTVP--RQSGKTTLIKALMAAVAQANPGCQVYY 89 Query: 113 LANSETQLKTTL--WAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170 A + K + W E++K L + + P + G + + Sbjct: 90 TAQTR---KDAVEKWGELAKQLRKD----------MGIAPDGKPRVKVLEGTGNERIVFR 136 Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP----DVINLGILGF------------ 214 P T G H ++ DEA D + Sbjct: 137 GTESMIMPFAP-TVEGIHGKTSPLVVVDEAWAFDQARGDDLMAAFNPVGLTIPHSQVWII 195 Query: 215 LTERNANRFWI---------MTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPS- 264 T + W+ ++P + F ++ + + + P+ Sbjct: 196 STAGDTRSEWLRSLVDKGRQAINDPGTTTAFFEWSADEEMAAAN---LRSDEALAFHPAI 252 Query: 265 ------FHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL 318 + +A+ D + R +P S + L E+ EP P + Sbjct: 253 GFTQELWKIQSLAQTEPDH-LYRRSYLNLWPTAAETSIVDLEAWEKLAEPEPASMPPD-V 310 Query: 319 IMGCDIAEEGGDNTVVVL-RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDAN 377 +G D+A T+ + G ++ SK I+ L E P A++ D + Sbjct: 311 AIGFDVATARTGATIYAAWQDGETVQIHRLVSKAGAAWVEKAIAHLQETLAPMAVVADDS 370 Query: 378 NTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 + L G +Y A+ + +E +++D Sbjct: 371 GDNRPIIEALRRNGKEIY-------ALRPREYASANSEFFARISDN 409 >gi|300088757|ref|YP_003759279.1| hypothetical protein Dehly_1680 [Dehalogenimonas lykanthroporepellens BL-DC-9] gi|299528490|gb|ADJ26958.1| conserved hypothetical protein [Dehalogenimonas lykanthroporepellens BL-DC-9] Length = 507 Score = 57.8 bits (138), Expect = 4e-06, Method: Composition-based stats. Identities = 62/394 (15%), Positives = 109/394 (27%), Gaps = 69/394 (17%) Query: 38 WGEKGTPLEGFSAPRSW--QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNA 95 WG+ S W Q+E + + ++ GR +GK+ + + Sbjct: 20 WGQAYLYNRDGSGRDYWPHQVEDLRCPAKNIIHLD--------------GRDVGKSIVLS 65 Query: 96 WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYS 155 L T G + A + L T + E+ L P+ M S++L Sbjct: 66 TDALHYAFTTRGGQGLIAAPHQGHLDTII-EEIEFQLDSNPD----LMNSIALTKYGKPK 120 Query: 156 DVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL 215 ++ + S + + D F H + DE + + + L Sbjct: 121 IHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLTERAWKALRQCL 177 Query: 216 TERNANRFWIMTSNPRRL-SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARY- 273 R + S P L +Y + + + F+ + ++ Y Sbjct: 178 KAGGTLRIY---STPNGLRDTTYYRL--TSSEQFHVFRWPSWLNPLWTEDREAELLEFYG 232 Query: 274 GLDSDVTRVEVCGQFPQQDIDSF-----------------IPLNII--------EEALNR 308 G DS + EV G+ + +F I + E A +R Sbjct: 233 GRDSSGWQHEVAGEHGKPSYGAFNVEQFNLCRQDLLEYQKIVITDSELRDCDTEEAAHDR 292 Query: 309 -----EPCPDPYAPLIMGCDIAEEGGDNTVVVL-------RRGPVIEHLFDWSKTDLRTT 356 P + G D+ +VV R + Sbjct: 293 LEMLLNLTPRSGQFWVGG-DLGYTNDPTEIVVFQEMEIGERTLLKMILRVHLEHVSYPHI 351 Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390 I+ L Y P I +D G L L Sbjct: 352 AQIIALLERYYTPAGIGVDNGGNGLAVVQELLTL 385 >gi|262194298|ref|YP_003265507.1| hypothetical protein Hoch_1017 [Haliangium ochraceum DSM 14365] gi|262077645|gb|ACY13614.1| protein of unknown function DUF264 [Haliangium ochraceum DSM 14365] Length = 478 Score = 57.8 bits (138), Expect = 4e-06, Method: Composition-based stats. Identities = 41/288 (14%), Positives = 85/288 (29%), Gaps = 52/288 (18%) Query: 227 TSNPRRLSGKFYEI----------FNKPLDDWKRFQIDTRTVEGIDPSF----HEGIIAR 272 S P G F+EI + W R + ++ E +A Sbjct: 183 CSTPLGRRGIFWEISTEELRKYPHHTRDEVPWWRCRFFCLDIDRAVREAPHMPTEERVAA 242 Query: 273 YGLDSDV----------TRVEVCGQFPQQDIDSFIPLNIIEEALNRE--------PCPDP 314 +G + V + E F + S+ P +I + + P+P Sbjct: 243 FGTQAIVQQLDSLALEDFQQEFECSFVDESY-SYYPYELILPCTSEDLVLAGDFTDLPEP 301 Query: 315 YAPLIMGCDIAEEGG-DNTVVVLRRGPV--IEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371 ++ G D+ V G L + + + +++ Sbjct: 302 EGRIVAGFDVGRTRDHSELAVFEDTGGHFVCRLLRRYDQVPFAEQEADLRRFLDRVPVAR 361 Query: 372 IIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL---EFAS 428 + ID + G + L V + N E L + + Sbjct: 362 LSIDQSGIGMHLAENLARDYAQVVG----------DTFTNDNKERWATDLKILFQRKDIA 411 Query: 429 LINHSGLIQNLKSLKSFIVPNTGELAIESKRV-KGAKSTDYSDGLMYT 475 L L+ + S+K ++P+ G++ +++R +G + D + Sbjct: 412 LPRDRELVGQIHSIKRRVLPS-GKVGFDAERSTRGGHA-DRFWAIALA 457 >gi|255103207|ref|ZP_05332184.1| hypothetical protein CdifQCD-6_20513 [Clostridium difficile QCD-63q42] Length = 582 Score = 57.8 bits (138), Expect = 4e-06, Method: Composition-based stats. Identities = 69/505 (13%), Positives = 144/505 (28%), Gaps = 118/505 (23%) Query: 47 GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106 + P + +++ + + + A RG+GK+ L + +P Sbjct: 31 YLANPHRFCMDYFGFNLHLFQQILIYMMMKSDQFVFIASRGLGKSWLLGVFCCVIAVLKP 90 Query: 107 GISVICLANSETQLKTTLWAEVS-----KWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCS 161 G V+ A + Q K + +++ K +L F++ + + W + Sbjct: 91 GTCVLIAAKRKKQAKLLITSKILGDLYLKSDTLKREIKSFQVNAQEVSIDFWNGSRIEAV 150 Query: 162 LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD-VINLGILGFLTERNA 220 + D R Y +I DE + +N ++ FLT Sbjct: 151 VSNDD------ARGYRAN--------------VLIVDEYRMVDEGTVNDVLVPFLTNPRQ 190 Query: 221 NRFWIMTSNPRR-----------LSGKFYEIFNKPLDDWKRFQI---------------D 254 NP+ LS +Y + + Sbjct: 191 PG---YLQNPKYRYMQEENKEIYLSSGWYSQHWSYKKFMETVKGMLSGEDMFACSIPFTC 247 Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFI----------------P 298 + + + + + +E CG F + D+F P Sbjct: 248 SLEHGLLTKKRILKEMKKESMSDASFMMEYCGVFYNESDDAFFKSSWVNPCRVLESMFYP 307 Query: 299 LNIIEEALNREPCPDPY-------APLIMGCDI--AEEGGDNTVVV------LRRGPV-- 341 + IE N++ Y I+G DI A ++ + G Sbjct: 308 PSDIEYLENKKKRDKKYHLNKIKGEIRIIGADIALARGVKNDNSIYTLMRMLPNEGTYKR 367 Query: 342 -IEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGART--------CDYLEMLGY 392 + H+ ++ + ++ L ++ D +I+D G D Y Sbjct: 368 CVVHIEAYNGMEAEKQAIRLKQLFSDFQADYMILDTQGIGTTVWSYIQKANYDSDRDEWY 427 Query: 393 HVYRVLGQKRAVDL-------------EFCRNRRTELHVKMADWLEFASL------INHS 433 Y + VD + + ++ + + D L +L I Sbjct: 428 DAYTCFNEDNTVDKSLAKKSLPVVYSMKAYADENHKMAMSLRDVLTNRTLELPISDIEAK 487 Query: 434 GLIQNLKSLKSFIVPNTGELAIESK 458 +I + +K+ + E +E+K Sbjct: 488 EMILEKEMIKADEIDKKAE--LEAK 510 >gi|197249763|ref|YP_002147654.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Agona str. SL483] gi|197213466|gb|ACH50863.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Agona str. SL483] Length = 588 Score = 57.8 bits (138), Expect = 5e-06, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 404 Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ G VV+ G L W D R + I L + Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|152982949|ref|YP_001353896.1| hypothetical protein mma_2206 [Janthinobacterium sp. Marseille] gi|151283026|gb|ABR91436.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille] Length = 436 Score = 57.8 bits (138), Expect = 5e-06, Method: Composition-based stats. Identities = 42/276 (15%), Positives = 84/276 (30%), Gaps = 35/276 (12%) Query: 82 ISAGRGIGKT-TLNAWLVLWLMSTRP-GISVICLANSETQLKTTLWAEVSKWLSLLPNKH 139 + A R GKT L+ ++ +A Q K+ W V ++ +++P Sbjct: 29 VVAHRRAGKTVACVNELIKAALTFHGNDGRFAYVAPFYRQAKSVAWDYVKRFSAVIPGIS 88 Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 E + +P + + + D G ++ DE Sbjct: 89 INESELRIDYPNGSR------------------IQLFGADNADALRGLFFDG---VVADE 127 Query: 200 ASGTPDVINL-GILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP--LDDWKRFQIDTR 256 + I L +R + ++ P+ + + EI+ +DW I Sbjct: 128 YGDWKPSVWGYVIRPALADRGG--WAVIIGTPKGRNQFW-EIYQHAGVNEDWLCLTIRAS 184 Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA 316 + P E + + L D R E+ F + I + + D Y Sbjct: 185 ESGLLPPKEIEAL--QLELTEDAWRQEMECDFDAALPGAIFGKEIWQAEQDGRVKDDLYD 242 Query: 317 P---LIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS 349 P + D+ D + + G + + +S Sbjct: 243 PELKVHAVLDLG-FTDDTAIWWFQVGKELRIIDCYS 277 >gi|331656886|ref|ZP_08357848.1| terminase, ATPase subunit [Escherichia coli TA206] gi|331055134|gb|EGI27143.1| terminase, ATPase subunit [Escherichia coli TA206] Length = 531 Score = 57.8 bits (138), Expect = 5e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 291 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 347 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 348 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 407 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 408 QYNVTYIGIDSTGVGHGVYENVK 430 >gi|78355964|ref|YP_387413.1| hypothetical protein Dde_0917 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78218369|gb|ABB37718.1| hypothetical protein Dde_0917 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 507 Score = 57.8 bits (138), Expect = 5e-06, Method: Composition-based stats. Identities = 62/394 (15%), Positives = 109/394 (27%), Gaps = 69/394 (17%) Query: 38 WGEKGTPLEGFSAPRSW--QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNA 95 WG+ S W Q+E + + ++ GR +GK+ + + Sbjct: 20 WGQAYLYNRDGSGRDYWPHQVEDLRCPAKNIIHLD--------------GRDVGKSIVLS 65 Query: 96 WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYS 155 L T G + A + L T + E+ L P+ M S++L Sbjct: 66 TDALHYAFTTRGGQGLVAAPHQGHLDTII-EEIEFQLDTNPD----LMNSIALTKYGKPK 120 Query: 156 DVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL 215 ++ + S + + D F H + DE + + + L Sbjct: 121 IHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLTERAWKALRQCL 177 Query: 216 TERNANRFWIMTSNPRRL-SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARY- 273 R + S P L +Y + + + F+ + ++ Y Sbjct: 178 KAGGTLRIY---STPNGLRDTTYYRL--TSSEQFHVFRWPSWLNPLWTEDREAELLEFYG 232 Query: 274 GLDSDVTRVEVCGQFPQQDIDSF-----------------IPLNII--------EEALNR 308 G DS + EV G+ + +F I + E A +R Sbjct: 233 GRDSSGWQHEVAGEHGKPSYGAFNVEQFNLCRQDLLEYQKIVITDSELRDCDTEEAAHDR 292 Query: 309 -----EPCPDPYAPLIMGCDIAEEGGDNTVVVL-------RRGPVIEHLFDWSKTDLRTT 356 P + G D+ +VV R + Sbjct: 293 LEMLLNLTPRSGQFWVGG-DLGYTNDPTEIVVFQEMEVGERTLLKMILRVHLEHVSYPHI 351 Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390 I+ L Y P I +D G L L Sbjct: 352 AQIIALLERYYTPAGIGVDNGGNGLAVVQELLTL 385 >gi|34335039|gb|AAQ65014.1| unknown [synthetic construct] gi|301159280|emb|CBW18795.1| probable terminase subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. SL1344] gi|323131065|gb|ADX18495.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Typhimurium str. 4/74] Length = 588 Score = 57.4 bits (137), Expect = 5e-06, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 404 Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ G VV+ G L W D R + I L + Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|200387487|ref|ZP_03214099.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Virchow str. SL491] gi|199604585|gb|EDZ03130.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Virchow str. SL491] Length = 588 Score = 57.4 bits (137), Expect = 5e-06, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 404 Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ G VV+ G L W D R + I L + Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|221196218|ref|ZP_03569265.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221202891|ref|ZP_03575910.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221176825|gb|EEE09253.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221182772|gb|EEE15172.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 424 Score = 57.4 bits (137), Expect = 5e-06, Method: Composition-based stats. Identities = 43/240 (17%), Positives = 67/240 (27%), Gaps = 35/240 (14%) Query: 67 LNSVNNPNPEVFKGAISAGRGIGKTTL-NAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 + E + I GR GKTTL W G+ V + Sbjct: 14 QAEIGRAFNESRRVVIRCGRRFGKTTLLERCASKWA---YNGLKVGWFGPTYK------- 63 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 L+L K ++ V+ + G + ++ D Sbjct: 64 ------LNLPTYKRILRTVQPVVYSKSKIDQVIELNSGGCIEFWTL---------QDEDA 108 Query: 186 GHHNTYGMAIINDEASGTPD---VINLGILGFLTERNANRFWIMTSNPRR--LSGKFYEI 240 G Y +I DE S P I + T + IM P+ FYE Sbjct: 109 GRSRFYD-RVIIDEGSLVPKGLRSIWEQAI-APTLLDRKGHAIMAGTPKGIDPENFFYEA 166 Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 W+ F T + +DP + Y + V + E F + +F Sbjct: 167 CTDKTLGWREFHAPTASNPMLDPEAVARLKDEY--PALVYQQEYLADFVDWNGAAFFSEE 224 >gi|16763092|ref|NP_458709.1| terminase subunit [Salmonella enterica subsp. enterica serovar Typhi str. CT18] gi|25315565|pir||AH1037 probable terminase chain [imported] - Salmonella enterica subsp. enterica serovar Typhi (strain CT18) gi|16505400|emb|CAD06749.1| probable terminase subunit [Salmonella enterica subsp. enterica serovar Typhi] Length = 588 Score = 57.4 bits (137), Expect = 5e-06, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 50/143 (34%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ V+ G L W D R + I L + Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVIAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|309797383|ref|ZP_07691776.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 145-7] gi|308119007|gb|EFO56269.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 145-7] Length = 418 Score = 57.4 bits (137), Expect = 5e-06, Method: Composition-based stats. Identities = 48/335 (14%), Positives = 105/335 (31%), Gaps = 46/335 (13%) Query: 59 MEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET 118 + +V H +P FK + AGR GK+ L+ ++ + V +A + Sbjct: 7 LSLVQLHSGQMKVFQSPHRFKV-VCAGRRWGKSRLSISTIIRAAAKEKKQRVWYVAPTYQ 65 Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178 + LW ++ + L P W ++ I K+ S + Sbjct: 66 MARQILWDDLQEVL-----------------PRKWVRKKNDTTMTIVLKNGSEIALK-GA 107 Query: 179 ERPDTFVGHHNTYGMAIINDEASGT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 ++PDT G ++ DE D + L+ ++ P+ +F Sbjct: 108 DKPDTLRGV---ALHFVVLDEFQDMKADTWYKVLRPTLSS--TRGGALIIGTPKG-FSEF 161 Query: 238 YEIFN-------KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290 ++++ + WK +Q T + + E +D E F Sbjct: 162 HKLWTIGQNVELQRKGQWKSWQFVTADSPFVPTAEIEAAKND--MDPKSFAQEYLASFEN 219 Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK 350 + P + + +P P+ +G D D V+ + L+ + Sbjct: 220 MSGRVYYPFD--RNVHVKPLQFNPRLPIWVGQD---FNIDPMSSVILQPQPNGELWAIDE 274 Query: 351 -----TDLRTTNNKISGLVEKYRPD-AIIIDANNT 379 ++ +++ +++ + D Sbjct: 275 LVLFSSNTAEVCDELERRFWRWKSQVTVFPDPAGA 309 >gi|163801735|ref|ZP_02195633.1| hypothetical protein 1103602000597_AND4_09782 [Vibrio sp. AND4] gi|159174652|gb|EDP59454.1| hypothetical protein AND4_09782 [Vibrio sp. AND4] Length = 546 Score = 57.4 bits (137), Expect = 5e-06, Method: Composition-based stats. Identities = 34/260 (13%), Positives = 71/260 (27%), Gaps = 55/260 (21%) Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247 + DE P D +N T +N + S P + + Y + + + D Sbjct: 211 VYVDEYFWIPKFDELNKLASAMATHKNWRKT--YFSTPSAKTHQAYTFWTGDQWRRGRDT 268 Query: 248 WKRFQIDT----RTVEGIDPSF--------------------HEGIIARYGLDSDVTRVE 283 + T R + P + + Y D Sbjct: 269 RANIEFPTFDEYRDGGRLCPDKQWRYVVTIEDAAAGGCELFDIDELRDEYSKDD--FDNL 326 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNT- 332 F S + +E+A+ + P + +G D + + Sbjct: 327 FMCIFVDGAS-SVFKFSALEKAMVDISRWQDFKPNDNDPFERREVWLGYDPSRTRDNACL 385 Query: 333 ------VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY 386 V+ + + + W + + ++S + E+Y + ID GA D Sbjct: 386 VVVAPPVIAIEK-FRVLEKHYWRGLNFQYQAQQVSKVFERYNVSYLGIDTTGIGAGVYDL 444 Query: 387 L-EMLGYHVYRVLGQKRAVD 405 L + + + + Sbjct: 445 LSKKHPRETVAIQYSNESKN 464 >gi|253689540|ref|YP_003018730.1| hypothetical protein PC1_3171 [Pectobacterium carotovorum subsp. carotovorum PC1] gi|251756118|gb|ACT14194.1| protein of unknown function DUF264 [Pectobacterium carotovorum subsp. carotovorum PC1] Length = 589 Score = 57.4 bits (137), Expect = 5e-06, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 62/205 (30%), Gaps = 30/205 (14%) Query: 246 DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 W++ ++ G + + ++ Y + + +F S P ++ Sbjct: 329 GQWRQIVTVEDALSGGCNLFDLDQLMLEY--SPAEYQNLLMCEFVDDKA-SVFPFEELQR 385 Query: 305 ALNREPCPDP-----------YAPLIMGCDIAEEGGDNTVVVLRR----GPVIEHL--FD 347 + Y P+ +G D + G VVL G L F Sbjct: 386 CMVDALEEWEDFNPYALRPFAYKPVWIGYDPSHTGDSAGCVVLAPPQAPGGKFRILERFQ 445 Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLE 407 W D + I L EKY + I IDA G + A +++ Sbjct: 446 WKGMDFAAQADAIKLLTEKYIVEYIGIDATGIGQGVYQLVRG---------FFPAAREIK 496 Query: 408 FCRNRRTELHVKMADWLEFASLINH 432 + +T + +K D + L Sbjct: 497 YSPEIKTAMVLKAKDTITSGRLEYD 521 >gi|213650797|ref|ZP_03380850.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Typhi str. J185] Length = 518 Score = 57.4 bits (137), Expect = 5e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 278 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 334 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 335 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 394 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 395 QYNVTYIGIDSTGVGHGVYENVK 417 >gi|309795387|ref|ZP_07689805.1| conserved hypothetical protein [Escherichia coli MS 145-7] gi|308121037|gb|EFO58299.1| conserved hypothetical protein [Escherichia coli MS 145-7] Length = 588 Score = 57.4 bits (137), Expect = 5e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|16766035|ref|NP_461650.1| terminase-like protein [Enterobacteria phage Fels-2] gi|169936048|ref|YP_001718747.1| P2 gpP-like protein [Enterobacteria phage Fels-2] gi|16421269|gb|AAL21609.1| Fels-2 prophage protein [Enterobacteria phage Fels-2] gi|312913743|dbj|BAJ37717.1| terminase-like protein [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] Length = 588 Score = 57.4 bits (137), Expect = 5e-06, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 404 Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ G VV+ G L W D R + I L + Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|323938219|gb|EGB34479.1| terminase [Escherichia coli E1520] Length = 588 Score = 57.4 bits (137), Expect = 5e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|291335343|gb|ADD94958.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C148] Length = 234 Score = 57.4 bits (137), Expect = 5e-06, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 73/187 (39%), Gaps = 23/187 (12%) Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE----RPDTFVGHHNTYGMAIINDEASG 202 L P PW L ++ + ST+ +E R + G ++ DEA+ Sbjct: 12 KLVPKPWIKTKNETDLKLELVNGSTIELKGTENAMALRGRSLSG--------VVLDEAAF 63 Query: 203 T-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF----NKPLDDWKRFQIDTRT 257 +V I L ++ + + S P + FY+++ + P ++WKR+ T Sbjct: 64 MDAEVWFEVIRPALADKQG--WALFISTPDGTASWFYDLWCYCEDDPTNEWKRWCYTTIE 121 Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317 + E A+ LD R E F +++ + ++ ++ ++ + P Sbjct: 122 GGNVPQEEVEAARAQ--LDPRTFRQEFEASF--ENLTGLVAISFSDDNISTDAKDISIQP 177 Query: 318 LIMGCDI 324 L++G D Sbjct: 178 LLLGVDF 184 >gi|198245759|ref|YP_002216726.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|197940275|gb|ACH77608.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|326624483|gb|EGE30828.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 588 Score = 57.4 bits (137), Expect = 5e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|170020778|ref|YP_001725732.1| hypothetical protein EcolC_2777 [Escherichia coli ATCC 8739] gi|169755706|gb|ACA78405.1| protein of unknown function DUF264 [Escherichia coli ATCC 8739] Length = 588 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|306812733|ref|ZP_07446926.1| Terminase, ATPase subunit (GpP) [Escherichia coli NC101] gi|305853496|gb|EFM53935.1| Terminase, ATPase subunit (GpP) [Escherichia coli NC101] Length = 588 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|300907199|ref|ZP_07124862.1| hypothetical protein HMPREF9536_05153 [Escherichia coli MS 84-1] gi|301303626|ref|ZP_07209748.1| hypothetical protein HMPREF9347_02221 [Escherichia coli MS 124-1] gi|300401074|gb|EFJ84612.1| hypothetical protein HMPREF9536_05153 [Escherichia coli MS 84-1] gi|300841125|gb|EFK68885.1| hypothetical protein HMPREF9347_02221 [Escherichia coli MS 124-1] gi|315257856|gb|EFU37824.1| conserved hypothetical protein [Escherichia coli MS 85-1] Length = 588 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWSDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|253774139|ref|YP_003036970.1| hypothetical protein ECBD_2764 [Escherichia coli 'BL21-Gold(DE3)pLysS AG'] gi|254160943|ref|YP_003044051.1| Terminase, ATPase subunit [Escherichia coli B str. REL606] gi|242376647|emb|CAQ31358.1| ybl37 [Escherichia coli BL21(DE3)] gi|253325183|gb|ACT29785.1| protein of unknown function DUF264 [Escherichia coli 'BL21-Gold(DE3)pLysS AG'] gi|253972844|gb|ACT38515.1| Terminase, ATPase subunit [Escherichia coli B str. REL606] gi|253977058|gb|ACT42728.1| Terminase, ATPase subunit [Escherichia coli BL21(DE3)] Length = 588 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEIWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|16762249|ref|NP_457866.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Typhi str. CT18] gi|29143738|ref|NP_807080.1| terminase ATPase subunit [Salmonella enterica subsp. enterica serovar Typhi str. Ty2] gi|215485952|ref|YP_002328383.1| predicted terminase, ATPase subunit [Escherichia coli O127:H6 str. E2348/69] gi|312969111|ref|ZP_07783318.1| terminase, ATPase subunit [Escherichia coli 2362-75] gi|25315563|pir||AB0927 terminase, ATPase chain [imported] - Salmonella enterica subsp. enterica serovar Typhi (strain CT18) gi|16504553|emb|CAD09436.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Typhi] gi|29139373|gb|AAO70940.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Typhi str. Ty2] gi|215264024|emb|CAS08365.1| predicted terminase, ATPase subunit [Escherichia coli O127:H6 str. E2348/69] gi|312286513|gb|EFR14426.1| terminase, ATPase subunit [Escherichia coli 2362-75] Length = 588 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|324112701|gb|EGC06677.1| terminase [Escherichia fergusonii B253] Length = 588 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|323953478|gb|EGB49344.1| terminase [Escherichia coli H252] Length = 588 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|157160343|ref|YP_001457661.1| terminase, ATPase subunit [Escherichia coli HS] gi|218559567|ref|YP_002392480.1| Terminase, ATPase subunit (GpP) [Escherichia coli S88] gi|256021061|ref|ZP_05434926.1| Terminase, ATPase subunit (GpP) [Shigella sp. D9] gi|300817075|ref|ZP_07097294.1| conserved hypothetical protein [Escherichia coli MS 107-1] gi|331662228|ref|ZP_08363151.1| terminase, ATPase subunit [Escherichia coli TA143] gi|331676606|ref|ZP_08377302.1| terminase, ATPase subunit [Escherichia coli H591] gi|332282288|ref|ZP_08394701.1| DNA-dependent ATPase terminase subunit [Shigella sp. D9] gi|157066023|gb|ABV05278.1| terminase, ATPase subunit [Escherichia coli HS] gi|218366336|emb|CAR04087.1| Terminase, ATPase subunit (GpP) [Escherichia coli S88] gi|300530427|gb|EFK51489.1| conserved hypothetical protein [Escherichia coli MS 107-1] gi|315615257|gb|EFU95893.1| terminase, ATPase subunit [Escherichia coli 3431] gi|323172219|gb|EFZ57857.1| terminase, ATPase subunit [Escherichia coli LT-68] gi|323190830|gb|EFZ76098.1| terminase, ATPase subunit [Escherichia coli RN587/1] gi|323942735|gb|EGB38900.1| terminase [Escherichia coli E482] gi|323946304|gb|EGB42336.1| terminase [Escherichia coli H120] gi|323963883|gb|EGB59377.1| terminase [Escherichia coli M863] gi|327252355|gb|EGE64027.1| terminase, ATPase subunit [Escherichia coli STEC_7v] gi|331060650|gb|EGI32614.1| terminase, ATPase subunit [Escherichia coli TA143] gi|331075295|gb|EGI46593.1| terminase, ATPase subunit [Escherichia coli H591] gi|332104640|gb|EGJ07986.1| DNA-dependent ATPase terminase subunit [Shigella sp. D9] Length = 588 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|312970940|ref|ZP_07785119.1| terminase, ATPase subunit [Escherichia coli 1827-70] gi|310336701|gb|EFQ01868.1| terminase, ATPase subunit [Escherichia coli 1827-70] Length = 588 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|307314499|ref|ZP_07594102.1| protein of unknown function DUF264 [Escherichia coli W] gi|306905922|gb|EFN36444.1| protein of unknown function DUF264 [Escherichia coli W] gi|315060102|gb|ADT74429.1| terminase, ATPase subunit [Escherichia coli W] gi|323379340|gb|ADX51608.1| terminase ATPase subunit [Escherichia coli KO11] gi|332342200|gb|AEE55534.1| phage terminase, ATPase subunit [Escherichia coli UMNK88] Length = 588 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|26246838|ref|NP_752878.1| terminase, ATPase subunit [Escherichia coli CFT073] gi|26107238|gb|AAN79421.1|AE016758_25 Terminase, ATPase subunit [Escherichia coli CFT073] Length = 588 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|300916285|ref|ZP_07133032.1| conserved hypothetical protein [Escherichia coli MS 115-1] gi|300416374|gb|EFJ99684.1| conserved hypothetical protein [Escherichia coli MS 115-1] Length = 588 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|271499312|ref|YP_003332337.1| hypothetical protein Dd586_0742 [Dickeya dadantii Ech586] gi|270342867|gb|ACZ75632.1| protein of unknown function DUF264 [Dickeya dadantii Ech586] Length = 591 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 54/162 (33%), Gaps = 20/162 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + E + RY +D+ + F + D+ + + Sbjct: 329 PDGQWRYVITMEDAIRGGFNLASLEKLRNRYNVDT--FNMLYMCVFVD-NKDAVFSFDDL 385 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV------VVLRRGPVIEHLF 346 E + P + G D A G +T + + + Sbjct: 386 ERCGVDPATWQDHDPTAPRPFGNREVWGGYDPARSGDLSTFVIVAPPIYEGEKFRVLLVV 445 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 +W + R N+I L ++Y I ID GA + ++ Sbjct: 446 NWHGMNFRYQANQIKKLFQRYHFTYIGIDVTGIGAGVFENIQ 487 >gi|222034345|emb|CAP77086.1| Terminase, ATPase subunit [Escherichia coli LF82] Length = 588 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|218690765|ref|YP_002398977.1| terminase, ATPase subunit (GpP) [Escherichia coli ED1a] gi|218428329|emb|CAR09255.2| Terminase, ATPase subunit (GpP) [Escherichia coli ED1a] Length = 588 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|300896792|ref|ZP_07115295.1| terminase, ATPase subunit family protein [Escherichia coli MS 198-1] gi|300359367|gb|EFJ75237.1| terminase, ATPase subunit family protein [Escherichia coli MS 198-1] Length = 391 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 51/184 (27%), Gaps = 29/184 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 218 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAAHPFG 274 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 275 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 334 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 335 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPELKTAMVLKAKDVIRRGC 385 Query: 429 LINH 432 L Sbjct: 386 LEYD 389 >gi|320199051|gb|EFW73648.1| Phage terminase, ATPase subunit [Escherichia coli EC4100B] Length = 588 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|304360765|ref|YP_003856886.1| gp8 [Mycobacterium phage Angelica] gi|302858349|gb|ADL71097.1| gp8 [Mycobacterium phage Angelica] Length = 473 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 69/389 (17%), Positives = 123/389 (31%), Gaps = 57/389 (14%) Query: 52 RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVI 111 WQ + ++V A S ++F +I R GKT +V PG +VI Sbjct: 43 DQWQDDLGKLVCAK--RSDGLYAADMFAMSIP--RQTGKTYFLGAIVFAFCKMNPGTTVI 98 Query: 112 CLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYST 171 A+ +T AE K + L + L++H G ++ ++ Sbjct: 99 WTAH-----RTRTAAETFKSMQALAKREQIAPHILNVH----------TGNGKEAVLFTN 143 Query: 172 MCRTYSEERPDTF-VGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230 R R F G +I DEA + ++ T + N + P Sbjct: 144 GSRILFGAREKGFGRGF--AKVDVLIFDEAQILSENAMDDMIPA-TNASPNGLILFAGTP 200 Query: 231 RRLS--GKFY-----EIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGI------------ 269 + + G+ + + N DD + D + ++ + Sbjct: 201 PKPTDPGEVFTNLRMDALNGESDDVAYVEISADENDDPDEESTWRKMNPSYPHRTSARAI 260 Query: 270 -IARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC----PDPYA-PLIMGCD 323 R L D R E G + + + + +I+ L R+ P+P A P +G D Sbjct: 261 RRMRKALSWDSFRREAMGIWDKISVHA----QVIKAGLWRDLADPLGPEPGAKPASLGVD 316 Query: 324 IAEEGGDNTVVVLRRGPVIEHLFD-WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGAR 382 ++ G + + H+ W+ TD I R ++ID + Sbjct: 317 MSHGGAISIGGCWLIDDELRHVEQVWAGTDTAAAVEFIVERAG--RRIPVVIDDASPAKA 374 Query: 383 TCDYLEMLGYHVYRVLGQKRAVDLEFCRN 411 L+ V A +N Sbjct: 375 LVPELKRRKVKVRITYAGDMAKACGLFKN 403 >gi|82543312|ref|YP_407259.1| terminase, ATPase subunit [Shigella boydii Sb227] gi|81244723|gb|ABB65431.1| terminase, ATPase subunit [Shigella boydii Sb227] gi|320185726|gb|EFW60482.1| Phage terminase, ATPase subunit [Shigella flexneri CDC 796-83] gi|332097052|gb|EGJ02035.1| terminase, ATPase subunit [Shigella boydii 3594-74] Length = 588 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|307129625|ref|YP_003881641.1| hypothetical protein Dda3937_02574 [Dickeya dadantii 3937] gi|306527154|gb|ADM97084.1| Possible phage protein [Dickeya dadantii 3937] Length = 591 Score = 57.4 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 57/162 (35%), Gaps = 20/162 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + E + RY +D+ + F + D+ + + Sbjct: 329 PDGQWRYVITMEDAIRGGFNLASLEKLRNRYNVDT--FNMLYMCVFVD-NKDAVFSFDDL 385 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVV----LRRGPVIEHLF-- 346 E + P + G D A G +T+V+ + G L Sbjct: 386 ERCGVDPATWQDHDPTAPRPFGNREVWGGYDPARSGDLSTLVIVAPPIYDGEKFRVLLVV 445 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 +W + R N+I L ++Y I ID GA + ++ Sbjct: 446 NWHGMNFRYQANQIKKLFQRYHFTYIGIDVTGIGAGVFENIQ 487 >gi|260599032|ref|YP_003211603.1| Terminase, ATPase subunit [Cronobacter turicensis z3032] gi|260218209|emb|CBA33092.1| Terminase, ATPase subunit [Cronobacter turicensis z3032] Length = 590 Score = 57.0 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 40/143 (27%), Gaps = 18/143 (12%) Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY-----------APLIM 320 + ++ R +F S P ++ + P+ + Sbjct: 355 KRENSAEDFRNLFMCEFVDDKA-SVFPFEELQRCMVDSLEEWEDFSPFAARPFGSRPVWI 413 Query: 321 GCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIII 374 G D + G VVL G L W D T I L EKY+ + I I Sbjct: 414 GYDPSHTGDSAGCVVLAPPVVSGGKFRILERHQWKGMDFATQAQAIRELTEKYQVEYIGI 473 Query: 375 DANNTGARTCDYLEMLGYHVYRV 397 DA G + + Sbjct: 474 DATGIGQGVFQLVRAFWPAAREI 496 >gi|291334416|gb|ADD94071.1| hypothetical protein GobsU_33659 [uncultured phage MedDCM-OCT-S04-C1035] gi|291334470|gb|ADD94124.1| hypothetical protein GobsU_33659 [uncultured phage MedDCM-OCT-S04-C1161] Length = 223 Score = 57.0 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 34/235 (14%), Positives = 72/235 (30%), Gaps = 31/235 (13%) Query: 107 GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166 +A + Q K+ W + ++ + +PN + E + P +L Sbjct: 6 NPRFAYIAPTFKQAKSIAWDYMKQFTAKIPNTKFNETELRVDLPNGSRITLLG------- 58 Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVIN-LGILGFLTERNANRFWI 225 E D G + + DE + + I L++R + + Sbjct: 59 -----------AENSDGLRGIYLDGC---VIDEYANIDGKLFAEIIRPALSDR--KGYCV 102 Query: 226 MTSNPRRLSGKFYEIFNKPLD--DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283 P ++ FY+++ DW ++ + +DP E G E Sbjct: 103 FIGTPAGMNNNFYDLYQHANGAEDWFNYKAKASDTKIVDPEELEKAKEVMGEKK--YLQE 160 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGG--DNTVVVL 336 + + I + + PY P + A + G D++ ++ Sbjct: 161 FECDWIANIEGAIYGEEIAKIEDKNQIARVPYDP-TLPVSTAWDLGVADHSSIIF 214 >gi|296103195|ref|YP_003613341.1| hypothetical protein ECL_02853 [Enterobacter cloacae subsp. cloacae ATCC 13047] gi|295057654|gb|ADF62392.1| hypothetical protein ECL_02853 [Enterobacter cloacae subsp. cloacae ATCC 13047] Length = 591 Score = 57.0 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 55/169 (32%), Gaps = 20/169 (11%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + E + RY ++ + F DS + + Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NTATFDMLYMCVFVDSK-DSVFSFSDL 384 Query: 303 EEA---LNREPCPDPYA-------PLIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346 E ++ DP A P+ G D A G + V++ + + Sbjct: 385 EACGVEMDTWQDHDPDAKRPFGDRPVWGGFDPARSGDLSCFVIVAPPMFAVEKFRVLKVI 444 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVY 395 W + R +I L ++Y + +D G D ++ V Sbjct: 445 YWKGMNFRYQAKQIEKLFDQYNFTYLGVDVTGIGQGVFDNIQHFAMKVV 493 >gi|310815629|ref|YP_003963593.1| Putative large terminase [Ketogulonicigenium vulgare Y25] gi|308754364|gb|ADO42293.1| Putative large terminase [Ketogulonicigenium vulgare Y25] Length = 427 Score = 57.0 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 68/424 (16%), Positives = 113/424 (26%), Gaps = 75/424 (17%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGIS---------VICLANSETQLKTTLWAEVSKWL 132 I GRG GKT A W+ S G V +A + Q + + Sbjct: 36 IMGGRGAGKTRAGA---EWVRSMVEGPRPDTPGRAKRVGLIAQTMDQAREVMV------- 85 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 F L P + R +S P+ G Sbjct: 86 --------FGDSGLMACCPPARRPEWIAGRAMLRWPNGAEARLFSAHDPEALRGPQFD-- 135 Query: 193 MAIINDEASG--TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250 AI DE + ++ L + R G F Sbjct: 136 -AIWADEVAKWRLAQEAWDMLVMGLRLGDDPR---ACLTTTPRGGPFLRKLLAQSGTVMT 191 Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 + P F + A + S + R E+ G + + P ++++ AL R+ Sbjct: 192 HAPTRANRANLAPGFVAAVEAMF-EGSHLGRQELDGLLVDEAEGTLWPQHLLDAALQRQA 250 Query: 311 CPDPYAPLIMGCDI---AEEGGDNTVVVLRRGPVIEHLFDWS----------KTDLRTTN 357 P +++ D G D +++ DW T Sbjct: 251 PP--LDRIVVAVDPPVTGHAGSDACGIIVAGVEQRGAPTDWRLWVIEDATVQGASPHTWA 308 Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELH 417 + ++ D ++ + N GA L L H+ RAV + R E Sbjct: 309 SAAIAAFHRHGADRLVAEVNQGGALVESVLRQLDPHI-----PYRAVRASKSKGARAE-- 361 Query: 418 VKMADWLEFASLINHSGLI---QNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMY 474 ++ E + GL + + G S D D L++ Sbjct: 362 -PVSTIYERGRACHLPGLALLEAQMSLMTLQGFTGKG-------------SPDRVDALVW 407 Query: 475 TFAE 478 E Sbjct: 408 AAHE 411 >gi|258545857|ref|ZP_05706091.1| probable terminase (atpase subunit) related protein [Cardiobacterium hominis ATCC 15826] gi|258518873|gb|EEV87732.1| probable terminase (atpase subunit) related protein [Cardiobacterium hominis ATCC 15826] Length = 595 Score = 57.0 bits (136), Expect = 6e-06, Method: Composition-based stats. Identities = 34/201 (16%), Positives = 60/201 (29%), Gaps = 29/201 (14%) Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 I+ G D E + ++ + QF DS + ++ + Sbjct: 339 ITIEDAINSGFDRVTLEKLRIKF--PPGQFENLLMCQFVNDG-DSIFKMAELQRCMVDAW 395 Query: 311 CPDPYA-----------PLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD--WSKTDL 353 P+ +G D + D ++VV+ G V + ++ D Sbjct: 396 TVWQDYTPLAARPLGDVPVWIGYDPSRSQDDASLVVIAPPQVEGGVFRIIDKQSFNGLDF 455 Query: 354 RTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRR 413 KI Y I IDA G D + V ++L A + Sbjct: 456 DAQARKIRDFCRMYNVVHIAIDATGIGQAVYDLVRQFFPRVRKILYSVEAKN-------- 507 Query: 414 TELHVKMADWLEFASLINHSG 434 E+ +K + A L +G Sbjct: 508 -EMVLKAKQLIAHARLQWDNG 527 >gi|291334530|gb|ADD94183.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201] gi|291334650|gb|ADD94297.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695] Length = 223 Score = 57.0 bits (136), Expect = 7e-06, Method: Composition-based stats. Identities = 32/240 (13%), Positives = 72/240 (30%), Gaps = 31/240 (12%) Query: 102 MSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCS 161 M +A + Q K+ W + ++ +P+ + E + P +L Sbjct: 1 MCPHKNPRFAYIAPTFKQAKSIAWDYMKQFTDKIPSTKFNETELRVDLPNGARITLLG-- 58 Query: 162 LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVIN-LGILGFLTERNA 220 E D G + + DE + + I L++R Sbjct: 59 ----------------AENSDGLRGIYLDGC---VIDEYANIDGKLFAEIIRPALSDR-- 97 Query: 221 NRFWIMTSNPRRLSGKFYEIFNKPLD--DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSD 278 + + P ++ FY+++ DW ++ + +D + G Sbjct: 98 KGYCVFIGTPAGMNNNFYDLYQHANGAEDWFNYKAKASETKIVDQEELDKAKEVMGEKK- 156 Query: 279 VTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGG--DNTVVVL 336 E + + I + ++ PY P + A + G D++ ++ Sbjct: 157 -YLQEFECDWIANIEGAIYGEEIAKLDDKKQLARVPYDP-TLPVSTAWDLGVADHSSIIF 214 >gi|51597451|ref|YP_071642.1| orf16-like phage protein [Yersinia pseudotuberculosis IP 32953] gi|51590733|emb|CAH22378.1| Possible [Haemophilus phage HP1] orf16-like phage protein [Yersinia pseudotuberculosis IP 32953] Length = 601 Score = 57.0 bits (136), Expect = 7e-06, Method: Composition-based stats. Identities = 22/140 (15%), Positives = 47/140 (33%), Gaps = 19/140 (13%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP-------- 317 E + +Y ++ + QF D+ + +E+ + P Sbjct: 354 IERLRNKY--NATAFAMLYMCQFVDSK-DAVFKFSELEKCAVDAGMWQDHDPKAARPFGN 410 Query: 318 --LIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369 + G D + G ++T V++ + ++ W + ++I L+ +Y Sbjct: 411 REVWGGFDPSRSGDNSTFVIVAPPLYDGERFRVLAVYYWQGLNFNYQADQIKQLMRRYNM 470 Query: 370 DAIIIDANNTGARTCDYLEM 389 I ID G D +E Sbjct: 471 TYIGIDITGIGRGVFDLVER 490 >gi|332560992|ref|ZP_08415310.1| hypothetical protein RSWS8N_18139 [Rhodobacter sphaeroides WS8N] gi|332274790|gb|EGJ20106.1| hypothetical protein RSWS8N_18139 [Rhodobacter sphaeroides WS8N] Length = 468 Score = 57.0 bits (136), Expect = 7e-06, Method: Composition-based stats. Identities = 38/237 (16%), Positives = 70/237 (29%), Gaps = 18/237 (7%) Query: 175 TYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRR 232 T PDT G +I DE + I + +++ S P Sbjct: 133 TALPANPDTARGFSAN----VILDEFAFHAKSREIWAALFPVISKGGQKLRV--ISTPNG 186 Query: 233 LSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD 292 KFYE+ W R +D ++ D D E ++ + Sbjct: 187 KGNKFYELMTAEGSVWSRHVVDIHEAVRQGLDRDIDMLRAGMADEDAWAQEYELKWLDEA 246 Query: 293 IDSFIPLNIIEEA---LNREPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHL 345 ++++ ++I P P +G DIA D V+ + + Sbjct: 247 -NAWLDYDLISACEHPAAGMPGLYMGGPCFVGVDIAARN-DLFVIWVLELVGDVLWTREV 304 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG-ARTCDYLEMLGYHVYRVLGQK 401 + + + ++ + ++R ID G D G V +L Sbjct: 305 IARRRVSFQEQDRLLAEVFRRFRVVRCRIDQTGMGEKPVEDAKRAHGDRVEGILFSA 361 >gi|188495109|ref|ZP_03002379.1| terminase [Escherichia coli 53638] gi|188490308|gb|EDU65411.1| terminase [Escherichia coli 53638] Length = 607 Score = 57.0 bits (136), Expect = 7e-06, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 54/173 (31%), Gaps = 20/173 (11%) Query: 244 PLDDWK-RFQIDTRTVEGID-PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF--IPL 299 P W+ ++ +G+ E + RY + +F F L Sbjct: 336 PDGIWRYVITMEDACAKGLSARVNIEKLRNRYSAT--AFAMLYMCEFTDSRDTVFKFSDL 393 Query: 300 NIIEEALNREPCPDPYA-------PLIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346 E DP A + G D + G ++T V++ + + ++ Sbjct: 394 EKCEVEFGIWQDFDPSALRPFGNREVWGGFDPSRTGDNSTFVIVAPPVEPKEKFRVLAVY 453 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVL 398 W + +I L+++YR I +D G D L V + Sbjct: 454 QWVGLNFTWQVKQIEELMKRYRFTHIGVDITGIGRGVYDQLVRSAPREVMGIN 506 >gi|149911893|ref|ZP_01900493.1| putative bacteriophage terminase, ATPase subunit [Moritella sp. PE36] gi|149805043|gb|EDM65069.1| putative bacteriophage terminase, ATPase subunit [Moritella sp. PE36] Length = 601 Score = 57.0 bits (136), Expect = 8e-06, Method: Composition-based stats. Identities = 56/350 (16%), Positives = 101/350 (28%), Gaps = 77/350 (22%) Query: 88 IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147 IG T A+ + + A S Q AE+ K + + F ++ Sbjct: 181 IGATFYFAFEAFYDAVVNGRNKIFISA-SRDQ------AEIFKANIIALCREQFGIE--- 230 Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD-- 205 L +P + + K ST RT D + DE P Sbjct: 231 LSGSPLTMRNKGKTTTLYFK--STNARTAQSASGD------------LYIDEVFWIPKFK 276 Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSF 265 + T ++ S P S + Y+++N W R E Sbjct: 277 ELRSLAQAMATHKDFRIT--YFSTPSVTSHEAYDLWN---GRWYRKTKACNDPEFAIDVS 331 Query: 266 HEGIIARYGLDSDVTRVEV------------------CGQFPQQDIDSFIPLNIIEEALN 307 H+ + D + R ++ ++ +++ D+ I++A + Sbjct: 332 HKTLKHGLLCDDGIWRQKLNVYDVVEQGFDRIDISMLENEYSKEEFDNLFMCKFIDDAHS 391 Query: 308 ----------------------REPCPDPYAPLIMGCDIAEEGGDNTVVVL------RRG 339 P P+++G D A +VVVL Sbjct: 392 AFSLKQLMACVGNSKKWTDFDPTWSRPYAMKPVVIGFDPARTRDIASVVVLSLPLGPDDK 451 Query: 340 PVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 + + S D T ++I L KY I +D G + ++ Sbjct: 452 FRLLESLNLSGNDFETMASEIKELTLKYHVVHIGVDTTGMGLGVFELIQK 501 >gi|329122644|ref|ZP_08251223.1| terminase [Haemophilus aegyptius ATCC 11116] gi|327472658|gb|EGF18087.1| terminase [Haemophilus aegyptius ATCC 11116] Length = 202 Score = 57.0 bits (136), Expect = 8e-06, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 56/158 (35%), Gaps = 19/158 (12%) Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLV 364 P + +G D A G +V++ + H + D T ++I Sbjct: 16 RPFGNREVWLGYDPAFTGDRAALVIVAPPKVEGGDYRVLHKQTFHGMDYETQASRIKQFC 75 Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL 424 + Y I+ID G+ + V + E+ + + E+ +K + + Sbjct: 76 DDYNVTRIVIDKTGMGSGVYQEVRKFYPMVQGL---------EYNADLKNEMVLKTQNLI 126 Query: 425 EFASL---INHSGLIQNLKSLKSFIVPNTGELAIESKR 459 + L + ++ + ++K + TG++ S R Sbjct: 127 QKRRLKFDSGDNDIVSSFMTVKK-RITGTGKITYVSDR 163 >gi|83954308|ref|ZP_00963028.1| terminase, large subunit, putative [Sulfitobacter sp. NAS-14.1] gi|83841345|gb|EAP80515.1| terminase, large subunit, putative [Sulfitobacter sp. NAS-14.1] Length = 408 Score = 57.0 bits (136), Expect = 8e-06, Method: Composition-based stats. Identities = 64/423 (15%), Positives = 114/423 (26%), Gaps = 73/423 (17%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGIS---------VICLANSETQLKTTLWAEVSKWL 132 I GRG GKT A W+ + G V + + Q++ + Sbjct: 16 IMGGRGAGKTRAGA---EWVRAQVEGSRPLDAGRCRRVALVGETIEQVREVM-------- 64 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 + S + W + + ++ P+ G Sbjct: 65 --IFGDSGILACSPADRRPDWEATRKRLVWPN-----GAVATVHTAHDPEGLRGPQFD-- 115 Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250 A DE + + + L + +T+ P R G + P Sbjct: 116 -AAWVDELAKWKKAEETWDQLQFAL-RLGEDPRACVTTTP-RNVGVLKNLLASPSTV-TT 171 Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 + SF E + ARY + + R E+ G + IE R+ Sbjct: 172 HAPTEANAANLAGSFLEEVRARY-RGTRLGRQELDGVLLADAEGALWTSERIEAGRVRDV 230 Query: 311 CPDPYAPLIMGCDIA---EEGGDNTVVVLRRGPVIEHLFDWS----------KTDLRTTN 357 +++G D A G D +V+ DW Sbjct: 231 PL--LDRIVVGLDPATTAGAGADECGIVVVGAQTQGPPQDWRAVVLADCTVQGATPSGWA 288 Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRRTE 415 +E+Y D ++ + N G + L + V V + V R E Sbjct: 289 RAAISAMEQYGADRLVAEVNQGGQMVAEVLRQVDPLVPVKSVHASRGKV-------ARAE 341 Query: 416 LHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 + + ++ L + + G +G S D D L++ Sbjct: 342 PVAALYEQGRVGHVVGLDALEDQMC-----RMTARGY--------EGGGSPDRVDALVWA 388 Query: 476 FAE 478 E Sbjct: 389 LHE 391 >gi|169344384|ref|ZP_02865357.1| phage terminase, large subunit, pbsx family [Clostridium perfringens C str. JGS1495] gi|169297509|gb|EDS79616.1| phage terminase, large subunit, pbsx family [Clostridium perfringens C str. JGS1495] Length = 415 Score = 57.0 bits (136), Expect = 8e-06, Method: Composition-based stats. Identities = 51/334 (15%), Positives = 107/334 (32%), Gaps = 37/334 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 G G GK+ +++ PG + + + LK +++A L W Sbjct: 31 GGGGSGKSHFVVQKMIYKYLKYPGRKCLVVRKVNSTLKESIFA-----LFRSVLSDWQIY 85 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 ++ ++ +K E+ + G + I+ +E + Sbjct: 86 DECKINKTDLTIELP-------NKSLFIFKGIDDPEKIKSIAGIDD-----IVVEECTEI 133 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK---PLDDWKRFQIDTRTVEG 260 + + L +N + NP S Y+ + K D + + Sbjct: 134 DEFDFDQLNLRLRSKNPYNQIHVMFNPVSKSNWVYKRWFKNGYDTKDTIVLHTTYKNNKF 193 Query: 261 IDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPC--PDPYAP 317 + + + ++ + D+ V R+ G+F +D I N EE+ + + + Sbjct: 194 LPKDYIDSLL-KLEKDNPVYFRIYALGEF--ATLDKLIYTNWKEESFDYKEILKNNRNTK 250 Query: 318 LIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTD-----LRTTNNKISGLVEKYRPDAI 372 I D V + + L+ + + KI L YR + I Sbjct: 251 AIFSLDFGYTNDPTAFVCSIIDKINKKLWIFDEFQEKGLLNDEIAEKIIDL--GYRKEVI 308 Query: 373 IIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406 + D+ ++ + L+ G RV G + D Sbjct: 309 VCDS--AEPKSIEELKRNGLS--RVKGAVKGRDS 338 >gi|166012063|ref|ZP_02232961.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. E1979001] gi|167427125|ref|ZP_02318878.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|2996304|gb|AAC13184.1| P-loop protein [Yersinia pestis KIM 10] gi|165988997|gb|EDR41298.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. E1979001] gi|167053876|gb|EDR63708.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis str. K1973002] Length = 402 Score = 57.0 bits (136), Expect = 8e-06, Method: Composition-based stats. Identities = 48/321 (14%), Positives = 102/321 (31%), Gaps = 46/321 (14%) Query: 73 PNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWL 132 +P FK + AGR GK+ L+ ++ + V +A + + LW ++ + L Sbjct: 5 QSPHRFKV-VCAGRRWGKSRLSISTIIRAAAKEKKQRVWYVAPTYQMARQILWDDLQEVL 63 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 P W ++ I K+ S + ++PDT G Sbjct: 64 -----------------PRKWVRKKNDTTMTIVLKNGSEIALK-GADKPDTLRGV---AL 102 Query: 193 MAIINDEASGT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-------KP 244 ++ DE PD + L+ ++ P+ +F++++ + Sbjct: 103 HFVVLDEFQDMKPDTWYKVLRPTLSS--TRGGALIIGTPKG-FSEFHKLWTIGQNKDLQR 159 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 WK +Q T + + E +D E F + P + Sbjct: 160 KGQWKSWQFVTADSPFVPSAEIEAAKND--MDPKSFAQEYLASFENMSGRVYYPFD--RN 215 Query: 305 ALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-----TDLRTTNNK 359 + +P P+ +G D D V+ + L+ + ++ ++ Sbjct: 216 VHVKPLQFNPKLPIWVGQD---FNIDPMSSVILQPQPNGELWAVDEVVLFSSNTAEVCDE 272 Query: 360 ISGLVEKYRPD-AIIIDANNT 379 + +++ I D Sbjct: 273 LERRFWRWKSQVTIFPDPAGA 293 >gi|323146172|gb|ADX32410.1| terminase ATPase subunit [Cronobacter phage ENT90] Length = 587 Score = 56.7 bits (135), Expect = 8e-06, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 53/176 (30%), Gaps = 27/176 (15%) Query: 276 DSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE-----------PCPDPYAPLIMGCDI 324 + + +F + S P ++ + P P Y P+ +G D Sbjct: 357 SPSEYQNLLMCEFVDDEA-SVFPFAELQTCMIDSLEEWSDFNPYLPRPFDYRPVWIGYDP 415 Query: 325 AEEGGDN--TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378 + G V+ L G L W D I L +KY + I +DA Sbjct: 416 SHTGDSAGCAVIAPPLVAGGKFRVLERHQWRGMDFAAQAKSIEDLTKKYTVEYIGVDATG 475 Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434 G + A ++ + +T + +K D + L +G Sbjct: 476 IGQGVFQLVRQ---------FYPAAREIRYSPEVKTAMVLKAKDTISSGRLEYDAG 522 >gi|119869106|ref|YP_939058.1| phage terminase [Mycobacterium sp. KMS] gi|119695195|gb|ABL92268.1| phage Terminase [Mycobacterium sp. KMS] Length = 489 Score = 56.7 bits (135), Expect = 8e-06, Method: Composition-based stats. Identities = 63/373 (16%), Positives = 108/373 (28%), Gaps = 71/373 (19%) Query: 41 KGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLW 100 KGT PR WQ++ + V +V P RG GKTTL+A ++L+ Sbjct: 41 KGTGAREVFRPREWQMDIVRDVLDSGARTVGLMMP----------RGQGKTTLSAAILLY 90 Query: 101 LMSTR-PGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLH 159 + TR G +V+ A E Q SL +Q + Y Sbjct: 91 IFFTRGEGANVVLFAVDERQ------------ASLAFRVAARMVQLSEDLSSRCYVYADK 138 Query: 160 CSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERN 219 L + Y M P + +A + DEA + + Sbjct: 139 LVLPLTDSTYQVM--------PASAAAAEGLDYVACLCDEAGVINRDVFEVAQLA-QGKR 189 Query: 220 ANRFWIMTSNPRRLSGK--------FYEIFNKPLDD-WKRFQIDTRTV---------EGI 261 I P + W+ F E Sbjct: 190 ERSVLIAIGTPGPDPNDQVLADLRAYAAEHPDDKSLVWREFSAAGFEDHGADCPHCWELA 249 Query: 262 DPSFHEGIIAR--------YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313 +P+ + + ++ R QF +F+P + E P P Sbjct: 250 NPALDDFLHRDALHALLPPKTREATFRRAR-LCQFSTDTDGAFLPAGVWEGLSTSSPVP- 307 Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRR---GPVIEHLFDWS-------KTDLRTTNNKISGL 363 P +++ D GD T +++ P + + W + + + I Sbjct: 308 PGVDVVLALD-GSYNGDTTALLVGTVSAEPHFDVVQVWDPKGDPDYRVPVAEVEDVIRRS 366 Query: 364 VEKYRPDAIIIDA 376 ++++ II D Sbjct: 367 AKEWQVVEIIADP 379 >gi|171316543|ref|ZP_02905759.1| protein of unknown function DUF264 [Burkholderia ambifaria MEX-5] gi|171098271|gb|EDT43077.1| protein of unknown function DUF264 [Burkholderia ambifaria MEX-5] Length = 583 Score = 56.7 bits (135), Expect = 9e-06, Method: Composition-based stats. Identities = 29/137 (21%), Positives = 43/137 (31%), Gaps = 20/137 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-NREPCPDPYAP------- 317 E + Y +D + QF + S PL ++ + + D + P Sbjct: 346 LERLKLEY--SADEYANLLLCQFIDDSL-SVFPLATLQTCMVDTWEVWDDFKPLYLRPFG 402 Query: 318 ---LIMGCDIAEEGGDNTVVVLRR----GPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 + +G D + G VVL G L F W D +I L +YR Sbjct: 403 DEEVWIGYDPSHTGDSAGCVVLAPPKYPGGKFRVLERFQWHGLDFEAQAAQIEALTRRYR 462 Query: 369 PDAIIIDANNTGARTCD 385 I ID G Sbjct: 463 VTYIGIDTTGIGQGVYQ 479 >gi|331646084|ref|ZP_08347187.1| terminase, ATPase subunit [Escherichia coli M605] gi|331044836|gb|EGI16963.1| terminase, ATPase subunit [Escherichia coli M605] Length = 588 Score = 56.7 bits (135), Expect = 9e-06, Method: Composition-based stats. Identities = 28/143 (19%), Positives = 49/143 (34%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVDDFA-SVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|213419227|ref|ZP_03352293.1| terminase subunit [Salmonella enterica subsp. enterica serovar Typhi str. E01-6750] Length = 442 Score = 56.7 bits (135), Expect = 9e-06, Method: Composition-based stats. Identities = 26/139 (18%), Positives = 47/139 (33%), Gaps = 23/139 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 282 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 338 Query: 315 YAPLIMGCDIAE--EGGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ V+ G L W D R + I L + Sbjct: 339 WREVWIGYDPAKGTQNGDSAGCVVIAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 398 Query: 366 KYRPDAIIIDANNTGARTC 384 +Y I ID+ G Sbjct: 399 QYNVTYIGIDSTGVGHGVY 417 >gi|120436787|ref|YP_862473.1| phage terminase large subunit [Gramella forsetii KT0803] gi|117578937|emb|CAL67406.1| phage terminase large subunit [Gramella forsetii KT0803] Length = 506 Score = 56.7 bits (135), Expect = 9e-06, Method: Composition-based stats. Identities = 35/210 (16%), Positives = 66/210 (31%), Gaps = 35/210 (16%) Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY--RPDA 371 + D+A G D V+ L G I L K+ + ++I + K+ Sbjct: 296 KAGEKYITVDVATSGKDKLVIWLWEGFRIRDLMKIDKSTGKDIIDEIKAMALKWGVPNRR 355 Query: 372 IIIDANNTGA--RTCDYLEMLGYHVYRVLG-QKRAVDLEFCRNRRTELHVKMADWLEFAS 428 I DAN GA D + + + D +N +T+ +V + +E Sbjct: 356 IAYDANGVGAFIGGADNAFIPNSIAFDSNNRPRETKDGRKFKNLKTQCYVLSGERVERNE 415 Query: 429 L----------INHSGLIQNLKSLKSFIVPNTGE--------LAIESKRVKG--AKSTDY 468 + + I+ + + + + E + K +STD Sbjct: 416 IWVMPQVANMMFDEKQTIRQRMLAERKAIKKQPKKDEEPQALIKKEEMKAKYLNGESTDL 475 Query: 469 SDGLMY----------TFAENPPRSDMDFG 488 D M T ++ ++FG Sbjct: 476 LDPFMMREIFELEPPITISKPTKPKGLNFG 505 >gi|331672362|ref|ZP_08373153.1| terminase, ATPase subunit [Escherichia coli TA280] gi|331070557|gb|EGI41921.1| terminase, ATPase subunit [Escherichia coli TA280] Length = 588 Score = 56.7 bits (135), Expect = 9e-06, Method: Composition-based stats. Identities = 30/143 (20%), Positives = 52/143 (36%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y LD + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEYSLDE--YQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|323190958|gb|EFZ76225.1| terminase, ATPase subunit [Escherichia coli RN587/1] Length = 591 Score = 56.7 bits (135), Expect = 9e-06, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 50/168 (29%), Gaps = 20/168 (11%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + E + RY + + F DS + + Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NDATFNMLYMCVFVDSK-DSVFSFSDL 384 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346 E + P + G D A G + V++ + + Sbjct: 385 EACGVEIDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIIAPPMLAVEKFRVLKVI 444 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHV 394 W + R +I L +KY + +D G D ++ V Sbjct: 445 YWKGMNFRYQAKQIEQLFKKYNFTYLGVDVTGIGQGVFDNIQHFAMRV 492 >gi|204929563|ref|ZP_03220637.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Javiana str. GA_MM04042433] gi|204321282|gb|EDZ06482.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Javiana str. GA_MM04042433] Length = 588 Score = 56.7 bits (135), Expect = 9e-06, Method: Composition-based stats. Identities = 28/143 (19%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L + Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|167647878|ref|YP_001685541.1| hypothetical protein Caul_3918 [Caulobacter sp. K31] gi|167350308|gb|ABZ73043.1| protein of unknown function DUF264 [Caulobacter sp. K31] Length = 439 Score = 56.7 bits (135), Expect = 9e-06, Method: Composition-based stats. Identities = 73/451 (16%), Positives = 133/451 (29%), Gaps = 67/451 (14%) Query: 37 PWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTL-NA 95 PW E ++W V +AH V+ +F GRG GKT + Sbjct: 33 PWTELAPWPVVQDGLKTW-----RVTEAHQKPPVDPWITWLFL----GGRGAGKTFAGAS 83 Query: 96 WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYS 155 W+ +PG ++ + + ++ + + + L W + Sbjct: 84 WIAN---QAKPGRNLALVGPTFHDVREVM----------IEGPSGIKSLYLPGDRPKWQA 130 Query: 156 DVLHCSLGIDSKHYSTMCRTYSEERPDTFVG--HHNTYGMAIINDEASGTPDVINL-GIL 212 + + +S E PD G H DE P +L Sbjct: 131 SRRRLEFRN-----GAIAQAFSAEDPDALRGPQFHAA-----WADEFCAWPKPAETLAML 180 Query: 213 GFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR 272 F + ++T+ P R + +P + + + + P+F + Sbjct: 181 RFGLRLGTDPRLVVTTTP-RPIRALRNLIAEPGAV-QTRAPTSANADHLAPAFLSTLRGL 238 Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDI-AEEGGDN 331 YG + E+ G + + A R P + +++ D A GD Sbjct: 239 YGGT-RLAAQELDGLIVEG-EGGLFRAEDL--ARCRGAPPAAFDRVVVAIDPPATATGDA 294 Query: 332 T--VVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387 VV R G L D + + ++ DA++ +AN G L Sbjct: 295 CGIVVCGRFGDRAFVLADRTAKGLSPNGWARRAVDAAVRFDADALVAEANQGGDMVRSVL 354 Query: 388 EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIV 447 V K +V L+ + + + + L S Sbjct: 355 -AQAAPPCPVKLVKASVGKRARAEPVAALYEQGRV-VHCGAFPALEEELMALGS------ 406 Query: 448 PNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478 G+L S D +D L++ +E Sbjct: 407 ---GDL---------GHSPDRADALVWALSE 425 >gi|99080642|ref|YP_612796.1| hypothetical protein TM1040_0801 [Ruegeria sp. TM1040] gi|99036922|gb|ABF63534.1| hypothetical protein TM1040_0801 [Ruegeria sp. TM1040] Length = 416 Score = 56.7 bits (135), Expect = 9e-06, Method: Composition-based stats. Identities = 51/307 (16%), Positives = 93/307 (30%), Gaps = 37/307 (12%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 G G GKT + + P A + ++ T W V H Sbjct: 27 GGFGSGKTYVGCLDLGLFAGQHPKTVQGYFAPTYRDIRDTFWPTV------DEAAHSLGF 80 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 + +D S + +T+CR S + P VG + DE Sbjct: 81 TTKVKS-----ADKEVEFYRGRSYYGTTICR--SMDDPGGIVGFKIARAL---VDE---I 127 Query: 204 PDVINLGILGFLTERNANRFWIM------TSNPRRLSG--KFYEIFNK-PLDDWKRFQID 254 + + A ++ G Y+ F + P ++ Q Sbjct: 128 DILSKDKAQAAWRKIIARMRLVLPGVVNGIGVTTTPEGFRFVYDSFKREPKSNYSMVQAS 187 Query: 255 TRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313 T E + P + ++ Y ++ + + G+F + + Sbjct: 188 TYENEAFLPPDYISTLLEDY--PEELIKAYLMGEFVNLTSGTVY-RSYDRLRHRSTQSIQ 244 Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII 373 P PL +G D G +VV ++RG + + T + I L ++Y + Sbjct: 245 PREPLHIGQDF-NVGNMASVVFVQRGEDWHAVDELQGLQ--DTPHLIEVLCDRYEGHHLT 301 Query: 374 I--DANN 378 I DA+ Sbjct: 302 IYPDASG 308 >gi|296104758|ref|YP_003614904.1| terminase, ATPase subunit [Enterobacter cloacae subsp. cloacae ATCC 13047] gi|295059217|gb|ADF63955.1| terminase, ATPase subunit [Enterobacter cloacae subsp. cloacae ATCC 13047] Length = 572 Score = 56.7 bits (135), Expect = 9e-06, Method: Composition-based stats. Identities = 28/143 (19%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 332 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWADFQALALRPFG 388 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L + Sbjct: 389 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 448 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 449 QYNVTYIGIDSTGVGHGVYENVK 471 >gi|146311014|ref|YP_001176088.1| hypothetical protein Ent638_1356 [Enterobacter sp. 638] gi|145317890|gb|ABP60037.1| protein of unknown function DUF264 [Enterobacter sp. 638] Length = 254 Score = 56.7 bits (135), Expect = 9e-06, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 14 IDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQVCMVDSWEVWSDFHALALRPFG 70 Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ G VV+ G L W D R + I L + Sbjct: 71 WREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 130 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 131 QYNVTYIGIDSTGVGHGVYENVK 153 >gi|55163155|emb|CAH61098.1| large terminase subunit [Yersinia enterocolitica subsp. palearctica Y11] Length = 202 Score = 56.7 bits (135), Expect = 1e-05, Method: Composition-based stats. Identities = 28/129 (21%), Positives = 40/129 (31%), Gaps = 15/129 (11%) Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGL 363 P Y P+ MG D + G VV+ G L W D I L Sbjct: 16 EQPFNYHPVWMGYDPSHTGDSAGCVVMAPPWVPGGKFRILERHQWKGMDFADQAESIKKL 75 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 EKY + I IDA G + A ++ + +T + +K D Sbjct: 76 TEKYNVEYIGIDATGIGQGVYQLVR---------NFFPAAREIRYSAEVKTNMVLKAKDL 126 Query: 424 LEFASLINH 432 + L Sbjct: 127 ITTGRLEYD 135 >gi|200389255|ref|ZP_03215867.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Virchow str. SL491] gi|199606353|gb|EDZ04898.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Virchow str. SL491] Length = 591 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 69/211 (32%), Gaps = 28/211 (13%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + E + RY ++ + F + DS + + Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NTATFNMLYMCVFVD-NKDSVFSFSDL 384 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346 E + P + G D A G + V++ + + Sbjct: 385 EACGVEVDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIVAPPMFAVEKFRVLKVI 444 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406 W + R +I L +KY + +D G D ++ V AV + Sbjct: 445 YWKGMNFRYQAKQIEQLFKKYNFTYLGVDVTGIGQGVFDNIQHFAMRV--------AVAI 496 Query: 407 EFCRNRRTELHVKMADWLEFASLINHSGLIQ 437 + N + +L +K AD +E + L + Sbjct: 497 RYDLNTKNQLVLKAADVVESQRIEWDKNLKE 527 >gi|194443211|ref|YP_002041983.1| hypothetical protein SNSL254_A2937 [Salmonella enterica subsp. enterica serovar Newport str. SL254] gi|194401874|gb|ACF62096.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Newport str. SL254] Length = 591 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 69/211 (32%), Gaps = 28/211 (13%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + E + RY ++ + F + DS + + Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NTATFNMLYMCVFVD-NKDSVFSFSDL 384 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346 E + P + G D A G + V++ + + Sbjct: 385 EACGVEVDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIVAPPMFAVEKFRVLKVI 444 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406 W + R +I L +KY + +D G D ++ V AV + Sbjct: 445 YWKGMNFRYQAKQIEQLFKKYNFTYLGVDVTGIGQGVFDNIQHFAMRV--------AVAI 496 Query: 407 EFCRNRRTELHVKMADWLEFASLINHSGLIQ 437 + N + +L +K AD +E + L + Sbjct: 497 RYDLNTKNQLVLKAADVVESQRIEWDKNLKE 527 >gi|324009700|gb|EGB78919.1| hypothetical protein HMPREF9532_00529 [Escherichia coli MS 57-2] Length = 588 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRIEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|168752369|ref|ZP_02777391.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4113] gi|168756331|ref|ZP_02781338.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4401] gi|168770046|ref|ZP_02795053.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4486] gi|168775976|ref|ZP_02800983.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4196] gi|168782400|ref|ZP_02807407.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4076] gi|168799001|ref|ZP_02824008.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC508] gi|195938306|ref|ZP_03083688.1| Phage protein P [Escherichia coli O157:H7 str. EC4024] gi|208807993|ref|ZP_03250330.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4206] gi|208814612|ref|ZP_03255941.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4045] gi|208819940|ref|ZP_03260260.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4042] gi|209400321|ref|YP_002271352.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4115] gi|254793895|ref|YP_003078732.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. TW14359] gi|187768594|gb|EDU32438.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4196] gi|188013771|gb|EDU51893.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4113] gi|189000116|gb|EDU69102.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4076] gi|189356443|gb|EDU74862.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4401] gi|189360957|gb|EDU79376.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4486] gi|189378450|gb|EDU96866.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC508] gi|208727794|gb|EDZ77395.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4206] gi|208735889|gb|EDZ84576.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4045] gi|208740063|gb|EDZ87745.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4042] gi|209161721|gb|ACI39154.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. EC4115] gi|254593295|gb|ACT72656.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str. TW14359] gi|326344901|gb|EGD68646.1| Phage terminase, ATPase subunit [Escherichia coli O157:H7 str. 1125] Length = 590 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + A D+ + +T + +K D + Sbjct: 468 VEYIGIDATGLGVGVFLLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517 Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452 + + ++ S + ++G Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544 >gi|213620832|ref|ZP_03373615.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. E98-2068] Length = 130 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 18/103 (17%), Positives = 34/103 (33%), Gaps = 22/103 (21%) Query: 404 VDLEFCRNRRTELHVKMADWLE-FASLINHSG--LIQNLKSL----------------KS 444 + +F N + + +AD + IN+ L+ L S+ Sbjct: 19 PNKDFFANLKAQAWWLVADRFRNTFNAINNGEQYLVDELISIDSRCPLLEKLKLELTTPH 78 Query: 445 FIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 79 RDFDRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 121 >gi|213027436|ref|ZP_03341883.1| Phage protein P [Salmonella enterica subsp. enterica serovar Typhi str. 404ty] Length = 222 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 39/140 (27%), Gaps = 20/140 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 81 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 137 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 138 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 197 Query: 369 PDAIIIDANNTGARTCDYLE 388 + I IDA G + Sbjct: 198 VEYIGIDATGLGVGVFQLVR 217 >gi|213026708|ref|ZP_03341155.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. 404ty] Length = 143 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 18/103 (17%), Positives = 34/103 (33%), Gaps = 22/103 (21%) Query: 404 VDLEFCRNRRTELHVKMADWLE-FASLINHSG--LIQNLKSL----------------KS 444 + +F N + + +AD + IN+ L+ L S+ Sbjct: 32 PNKDFFANLKAQAWWLVADRFRNTFNAINNGEQYLVDELISIDSRCPLLEKLKLELTTPH 91 Query: 445 FIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 92 RDFDRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 134 >gi|29144574|ref|NP_807916.1| terminase subunit [Salmonella enterica subsp. enterica serovar Typhi str. Ty2] gi|29140212|gb|AAO71776.1| probable terminase subunit [Salmonella enterica subsp. enterica serovar Typhi str. Ty2] Length = 588 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 26/139 (18%), Positives = 47/139 (33%), Gaps = 23/139 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ V+ G L W D R + I L + Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVIAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464 Query: 366 KYRPDAIIIDANNTGARTC 384 +Y I ID+ G Sbjct: 465 QYNVTYIGIDSTGVGHGVY 483 >gi|76788305|ref|YP_329267.1| prophage LambdaSa03, terminase, large subunit [Streptococcus agalactiae A909] gi|76563362|gb|ABA45946.1| prophage LambdaSa03, terminase, large subunit, putative [Streptococcus agalactiae A909] Length = 471 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 58/346 (16%), Positives = 114/346 (32%), Gaps = 47/346 (13%) Query: 54 WQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICL 113 WQ + + +N N + + AI R GKT + L LW + G+ ++ Sbjct: 44 WQENML--IPMMAINEDNLWVHQKYGYAIP--RRNGKTEVVYILELWAL--HKGLKILHT 97 Query: 114 ANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173 A+ + + + +V K+L + + + + + A + S G + + Sbjct: 98 AHRISTSHS-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKSSGSVIQFRT--- 150 Query: 174 RTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR- 232 RT + + F +I DEA + +T+ + N IM P Sbjct: 151 RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPTM 201 Query: 233 -LSGKFYEIFNKP-------LDDWKRFQIDTRTVEGIDPSF------------HEGIIAR 272 +G +E + K W + +D S+ I A Sbjct: 202 VSTGTVFESYRKECLKGDRRYSGWAEWSVDEMQPIHDVKSWYVANPSMGYHLNERKIEAE 261 Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN 331 G D ++ G +P + S I + L E P+ + L +G ++G + Sbjct: 262 LGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNNV 319 Query: 332 T-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 + + R + +R I ++ ++ID Sbjct: 320 SLSIAARASENKVFVEAIDCLSIRNGTQWIINFLKSADIAKVVIDG 365 >gi|194450112|ref|YP_002047236.1| hypothetical protein SeHA_C3493 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|194408416|gb|ACF68635.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] Length = 591 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 51/168 (30%), Gaps = 20/168 (11%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + E + RY ++ + F DS + + Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NTATFNMLYMCVFVDSK-DSVFSFSDL 384 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346 E + P + G D A G + V++ + + Sbjct: 385 EACGVEVDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIVAPPMFAVEKFRVLKVI 444 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHV 394 W + R +I L +KY + +D G D ++ V Sbjct: 445 YWKGMNFRYQAKQIEQLFKKYNFTYLGVDVTGIGQGVFDNIQHFAMRV 492 >gi|320177430|gb|EFW52430.1| Phage terminase, ATPase subunit [Shigella dysenteriae CDC 74-1112] Length = 588 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 50/143 (34%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQAYAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|317152051|ref|YP_004120099.1| hypothetical protein Daes_0328 [Desulfovibrio aespoeensis Aspo-2] gi|316942302|gb|ADU61353.1| protein of unknown function DUF264 [Desulfovibrio aespoeensis Aspo-2] Length = 428 Score = 56.3 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 50/333 (15%), Positives = 95/333 (28%), Gaps = 41/333 (12%) Query: 79 KGAISAGRG-IGKTTLNAWLVLWLMST--RPGISVICLANSETQLKTTLWAEVSKWLSLL 135 + ++ GKT L+ ++ T R +A Q KT +W E+ ++ Sbjct: 21 RFSVLVCHRRFGKTVLSVNRLIRAARTTSRTDWRGAYIAPLYKQAKTVVWDELKRY---- 76 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 + ++ + +D R + E PD+ G + Sbjct: 77 -CGLGLDGCTVKFNETELRADF----------DNGARIRLFGAENPDSLRGMYLDGA--- 122 Query: 196 INDEASGTPDVIN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL--DDWKRFQ 252 + DE + P + I L++R + PR + Y ++ DW Sbjct: 123 VFDEVAQMPHRVWTEVIRPALSDRMGWAMF--IGTPRGKNA-LYRLWQDARRDPDWFAAM 179 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCP 312 I+P + + E F ++ I E Sbjct: 180 YRASQTGIIEPGELAAAARE--MSPEEYEQEFECSFTAAIRGAYFSALIGEADKGGRITK 237 Query: 313 DPYAPLIMGCDIAEEGG--DNTVVVL---RRGPVIEHLFDWSKTDL------RTTNNKIS 361 P+ P + A + G D+T + R G + + + R + + Sbjct: 238 VPHDP-SLPVHTAWDLGMSDSTAIWFVQARPGNAFAIVDYYEASGEGLDHYARVLDERRY 296 Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLEMLGYHV 394 P I + TG + LG Sbjct: 297 AYGSHIAPHDIRVRELGTGKSRLEIARALGIRF 329 >gi|324115391|gb|EGC09344.1| terminase [Escherichia coli E1167] Length = 492 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 39/140 (27%), Gaps = 20/140 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + +D + +F S P ++ + Sbjct: 322 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 378 Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 P+ +G D + G VVL G L W D T I L EKY Sbjct: 379 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 438 Query: 369 PDAIIIDANNTGARTCDYLE 388 + I IDA G + Sbjct: 439 VEYIGIDATGLGVGVFQLVR 458 >gi|323185221|gb|EFZ70586.1| terminase, ATPase subunit [Escherichia coli 1357] Length = 588 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQAGMVDSWEVWTDFHALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ VVV G L W D R + I L E Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487 >gi|315655961|ref|ZP_07908859.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 51333] gi|315490025|gb|EFU79652.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 51333] Length = 460 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 73/432 (16%), Positives = 123/432 (28%), Gaps = 73/432 (16%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GKT A LV PG + +A E+ +++ + Sbjct: 62 TGRGWGKTRTAAELVRDWAK-NPGTQIAVVAKKESLVRSICF---------EHKTSGLLH 111 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE-ASG 202 A + + + K+ ST+ + E PD G DE A+ Sbjct: 112 VIPKSDQARFNASGGSGRFFLQLKNGSTI-YGFGAEVPDNLRGFAFDKA---WFDEFAAW 167 Query: 203 TPDVINLGILGFLTE-RNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGI 261 + R + ++ S + ++ +KP R + + Sbjct: 168 NKQTAQEVYDMMWYDLRESPSPQMVISTTPKPLKHVRDLVSKPGVVITRGH-TKDNLPNL 226 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMG 321 E + YG + R E+ G+ + + + + ++ + R P +++G Sbjct: 227 SAIALEKLERDYGKT-RLGRQELAGELIESIEGALWDVTMFQDPVFRPDTMPPLEDIVVG 285 Query: 322 CDIA---EEGGDNTVV---------------VLRRGPVIEHLFDWSKTDLRTTNNKISGL 363 D A EG D T L G ++E + R K L Sbjct: 286 VDPAVRSSEGADMTAFTVAARAEDAPGMFPDHLNHGYILEAIQGH--YTPRDAMAKAGEL 343 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEML--GYHVYRVLG-----QKRAVDLEFCRNRRTEL 416 KY ++++ANN G L+M+ G V + R Sbjct: 344 ARKYGASRVVLEANNGGEYLPTVLQMVAPGVPWKIVHAQQDKRGRAMPVATLYEQGRIHH 403 Query: 417 HVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSD----GL 472 + L + V TG G KS D D L Sbjct: 404 Y---------GGAEKFEDLESQM-------VTYTG--------AAGEKSPDLLDSMVWAL 439 Query: 473 MYTFAENPPRSD 484 F D Sbjct: 440 TELFLSPVGHGD 451 >gi|94970433|ref|YP_592481.1| hypothetical protein Acid345_3406 [Candidatus Koribacter versatilis Ellin345] gi|94552483|gb|ABF42407.1| protein of unknown function DUF264 [Candidatus Koribacter versatilis Ellin345] Length = 482 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 60/331 (18%), Positives = 109/331 (32%), Gaps = 54/331 (16%) Query: 180 RPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYE 239 PDT G+H ++ DE G + + + S P +GKF+E Sbjct: 123 NPDTVRGYHGD----VVLDE-FGFHRDAKKIYKAAIAIASRGYQLEVISTPNEQAGKFWE 177 Query: 240 IFNKP--------------LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC 285 I W +D T ++ + D D + E C Sbjct: 178 IAKAAGVPADGGSERTHWTKGVWSVHWLDIYTAVKEGCPIDVEVMRQACYDDDTWQQEYC 237 Query: 286 GQFPQQDIDSFIPLNIIEEALNR------EPCPDPYAPLIMGCDIAEEGGDNTVVVLR-- 337 F + +IP+ +I A ++ P L +G DI + D TV+ + Sbjct: 238 CVFLADAQN-YIPMELIIAAESQMASLDARPEDLAGRELYLGMDIGRK-KDRTVIWIDEK 295 Query: 338 --RGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVY 395 + + +T + + + R ID+ GA+ + LE Sbjct: 296 LGDVMITRAVETLERTPFAKQFEQAAAWMPYVRRGC--IDSTGIGAQIGEDLERK----- 348 Query: 396 RVLGQKRAVDLEF-CRNRRTELHVKMADWLE--FASLINHSGLIQNLKSLKSFIVPNTGE 452 G + +EF N+ T + LE A + + + + ++K + P TG Sbjct: 349 --FGAAKVEKVEFNIANKET-MAGLAKRKLEDRQARIPESPSIRRAINAVKRYTSP-TGH 404 Query: 453 LAIESKRVKGAKSTDYSD-----GLMYTFAE 478 ++ R + ++D L + AE Sbjct: 405 FRFDADRTEAG----HADEFWAFALCLSAAE 431 >gi|56414723|ref|YP_151798.1| phage gene [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150] gi|197363650|ref|YP_002143287.1| phage gene [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|56128980|gb|AAV78486.1| putative phage gene [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150] gi|197095127|emb|CAR60674.1| putative phage gene [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] Length = 591 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 52/168 (30%), Gaps = 20/168 (11%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + E + RY ++ + F + DS + + Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NTATFNMLYMCVFVD-NKDSVFSFSDL 384 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346 E + P + G D A G + V++ + + Sbjct: 385 EACGVEVDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIVAPPMFAVEKFRVLKVI 444 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHV 394 W + R +I L +KY + +D G D ++ V Sbjct: 445 YWKGMNFRYQAKQIEQLFKKYNFTYLGVDVTGIGQGVFDNIQHFAMRV 492 >gi|262279834|ref|ZP_06057619.1| conserved hypothetical protein [Acinetobacter calcoaceticus RUH2202] gi|262260185|gb|EEY78918.1| conserved hypothetical protein [Acinetobacter calcoaceticus RUH2202] Length = 416 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 54/322 (16%), Positives = 98/322 (30%), Gaps = 50/322 (15%) Query: 78 FKGAISAGRGIGKTTLNAW--------LVLWLMSTRPGISVICLANSETQLKTTLWAEVS 129 F+ A+ GR GKT L W +S + A + Q K W + Sbjct: 10 FRDAV-CGRRFGKTFLAKAEMRRAARLAAKWNVSVEDE--IWYAAPTFKQAKRVFWKRLK 66 Query: 130 KWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189 + + PA W + + + + + R + D G Sbjct: 67 QAI-----------------PASWRAGKPNETECSITLRSGHVIRVVGLDNYDDLRG--- 106 Query: 190 TYGMAIINDEASGTPDVIN-LGILGFLTE--------RNANRFWIMTSNPRRLSGKFYEI 240 + +I DE + + L+ + + P+ Y+ Sbjct: 107 SGLFFLIIDEWADCKWAAWEEVLRPMLSTCKYVVNGVQRVGGHVLRIGTPKG-FNHCYDT 165 Query: 241 F--NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298 F +P + +++G + E I+A+ +D E F F Sbjct: 166 FMDGQPGHEPDCKSFSYTSLQGGNIPESEIIVAKRKMDPKTFSQEYEASFESYQGVIFYC 225 Query: 299 LNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNN 358 N + A L +G D VV +RRG + + ++ +L T Sbjct: 226 FNRLLSA--STETVQANDVLHVGMDFNVTKM-AAVVYVRRGEQMHAVDEF--VNLFDTPA 280 Query: 359 KISGLVEKYRPDAIII--DANN 378 I + E+Y I + DA+ Sbjct: 281 MIEAIQERYPDHEIAVYPDASG 302 >gi|308187132|ref|YP_003931263.1| Terminase, ATPase subunit (GpP) [Pantoea vagans C9-1] gi|308057642|gb|ADO09814.1| Terminase, ATPase subunit (GpP) [Pantoea vagans C9-1] Length = 587 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 54/176 (30%), Gaps = 27/176 (15%) Query: 276 DSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE-----------PCPDPYAPLIMGCDI 324 + + +F + S P ++ + P P Y P+ +G D Sbjct: 357 SPAEYQNLLMCEFVDDEA-SVFPFAELQTCMIDSLEEWEDFNPYLPRPFAYRPVWIGYDP 415 Query: 325 AEEGGDN--TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378 + G V+ L G L W D I L +KY + I +DA Sbjct: 416 SHTGDSAGCAVIAPPLVAGGKFRVLERHQWRGMDFAAQAKSIEDLTKKYTVEYIGVDATG 475 Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434 G + A ++++ +T + +K D + L +G Sbjct: 476 IGQGVFQLVRQ---------FYPAAREIKYSPEVKTAMVLKAKDTISSGRLEYDAG 522 >gi|221214652|ref|ZP_03587622.1| putative ATPase subunit of terminase (gpP-like) [Burkholderia multivorans CGD1] gi|221165542|gb|EED98018.1| putative ATPase subunit of terminase (gpP-like) [Burkholderia multivorans CGD1] Length = 583 Score = 55.9 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 28/137 (20%), Positives = 44/137 (32%), Gaps = 20/137 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-NREPCPDPYAP------- 317 E + Y +D + QF + S PL+ ++ + + D + P Sbjct: 346 LERLKLEY--SADEYANLLLCQFIDDSL-SVFPLSALQPCMVDTWEVWDDFKPLYLRPFG 402 Query: 318 ---LIMGCDIAEEGGDNTVVVLRR----GPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 + +G D + G VV+ G L F W D +I L +YR Sbjct: 403 DEDVWIGYDPSHTGDSAGCVVVAPPKYPGGKFRVLERFQWHGLDFEAQAGQIEALTRRYR 462 Query: 369 PDAIIIDANNTGARTCD 385 I ID G Sbjct: 463 VTYIGIDTTGIGQGVYQ 479 >gi|257460901|ref|ZP_05626002.1| transposase family protein [Campylobacter gracilis RM3268] gi|257442232|gb|EEV17374.1| transposase family protein [Campylobacter gracilis RM3268] Length = 518 Score = 55.9 bits (133), Expect = 2e-05, Method: Composition-based stats. Identities = 60/372 (16%), Positives = 117/372 (31%), Gaps = 61/372 (16%) Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS----EERPDTFVGHHNTYGMAIIN 197 E + ++ + ++ L S DS++ ++ + T G I Sbjct: 171 EQARILMNYSQMWAKKLGVSFAKDSEYEKSLDNGATIRVMAHNFRTVQGFTGD----IWM 226 Query: 198 DEASGTPDV--INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDW--KRFQI 253 DE + P+ I + + + S P + F E+F L + R ++ Sbjct: 227 DEFAWYPNQKRIWHAFVPSI--GAVAGRLTILSTPFEENSFFAELFGDELKFYMFSRHRV 284 Query: 254 DT-RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP-- 310 D R + G E + A + D+D QF + + + +++I+ ++ Sbjct: 285 DIYRAMAGGLKFDLETMRALF--DADTWASAYECQFVDDES-ALLGIDLIKSCVSDFTPT 341 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVV-VLRRGPVIEHL---FDWSKTDLRTTNNKISGLVEK 366 P P+ G D+ + + V G I+ L +K N ++ + Sbjct: 342 LPPKNIPVFSGYDVGRTKDRSVHMGVYDAGEGIKRLCLYDVIAKASFEAQENLLTDFLRL 401 Query: 367 YRPDAIIIDANNTGARTCDYLE-MLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE 425 + ID G + L+ V V + N + + Sbjct: 402 NLLAYLKIDKTGIGMPVAERLKSRFTSRVSGVYFTASVKEA-LALNLKKHFED------K 454 Query: 426 FASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKS----TDY-----SD-----G 471 S+ N LI +L ++ KR G KS +D +D Sbjct: 455 SISIPNDPLLIADLHAI---------------KRKAGQKSFLYDSDRNEHGHADRFWALA 499 Query: 472 LMYTFAENPPRS 483 L ++ E Sbjct: 500 LALSYFEKVRER 511 >gi|152994622|ref|YP_001339457.1| hypothetical protein Mmwyl1_0587 [Marinomonas sp. MWYL1] gi|150835546|gb|ABR69522.1| protein of unknown function DUF264 [Marinomonas sp. MWYL1] Length = 601 Score = 55.5 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 39/267 (14%), Positives = 80/267 (29%), Gaps = 35/267 (13%) Query: 244 PLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W++ I+ G D E + +D + ++ D +S L + Sbjct: 345 PDKMWRQIVTIEDAEEGGCDLFDIEQLRDE--NSTDAFNNKYLCKWID-DANSVFTLAKL 401 Query: 303 EEALNREPCPDPYA----------PLIMGCDIAEEGGDNTV------VVLRRGPVIEHLF 346 + Y P+ +G D + + ++ + + Sbjct: 402 LSCMVDTETWTDYHKDAGQPFGNRPVAIGYDPSRTTDNASLALLSIPLGASDPWRLLKKD 461 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406 W + + +I EK+ I ID + G + +E V + Sbjct: 462 SWRGVNFQWQAARIKEEKEKHNVKHIGIDVSGIGRGVFELVEQFYRRVTPI--------- 512 Query: 407 EFCRNRRTELHVKMADWLEFAS---LINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGA 463 + +TEL +K D +E + Q + V + G++ + R Sbjct: 513 TYSVQTKTELVLKALDLIENGLFKFSAGDKEVAQAFMMITK-KVTDNGQITYVANRSNAT 571 Query: 464 KSTDYSDGLMYTFAENP--PRSDMDFG 488 D + +M+ F P P+ Sbjct: 572 GHADVAWAIMHAFNYEPIAPKRKTTVA 598 >gi|170731562|ref|YP_001763509.1| hypothetical protein Bcenmc03_0207 [Burkholderia cenocepacia MC0-3] gi|169814804|gb|ACA89387.1| protein of unknown function DUF264 [Burkholderia cenocepacia MC0-3] Length = 583 Score = 55.5 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 29/137 (21%), Positives = 43/137 (31%), Gaps = 20/137 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-NREPCPDPYAP------- 317 E + Y +D + QF + S PL ++ + + D + P Sbjct: 346 LERLKLEY--SADEYANLLLCQFIDDSL-SVFPLATLQTCMVDTWEVWDDFKPLYLRPFG 402 Query: 318 ---LIMGCDIAEEGGDNTVVVLRR----GPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 + +G D + G VVL G L F W D +I L +YR Sbjct: 403 DEEVWIGYDPSHTGDSAGCVVLAPPKYPGGKFRVLERFQWHGLDFEAQAAQIEALTTRYR 462 Query: 369 PDAIIIDANNTGARTCD 385 I ID G Sbjct: 463 VTYIGIDTTGIGQGVYQ 479 >gi|262193957|ref|YP_003265166.1| hypothetical protein Hoch_0641 [Haliangium ochraceum DSM 14365] gi|262077304|gb|ACY13273.1| protein of unknown function DUF264 [Haliangium ochraceum DSM 14365] Length = 478 Score = 55.5 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 43/287 (14%), Positives = 87/287 (30%), Gaps = 52/287 (18%) Query: 227 TSNPRRLSGKFYEI----------FNKPLDDWKRFQIDTRTVEGIDPSF----HEGIIAR 272 S P G F+EI + W R + ++ E +A Sbjct: 183 CSTPLGRRGIFWEISTEELRKYPHHTRDEVPWWRCRFFCIDIDRAMREAPHMPTEERVAA 242 Query: 273 YG-------LDS---DVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE--------PCPDP 314 +G LDS + + E F + S+ P +I + + P+P Sbjct: 243 FGTQAIAQQLDSLALEDFQQEFECSFVDESY-SYYPYELILPCTSEDLVLAGDFTDLPEP 301 Query: 315 YAPLIMGCDIAEEGG-DNTVVVLRRGPV--IEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371 ++ G D+ V +G L + + + +++ Sbjct: 302 EGRIVAGFDVGRTRDHSELAVFEDKGGHFVCRLLRRYDQVPFAEQEADLRRFLDRVPVAR 361 Query: 372 IIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL---EFAS 428 + ID + G + L V + N E L + Sbjct: 362 LSIDQSGIGMHLAENLARDYAQVVG----------DTFTNDNKERWATDLKILFQRKDIV 411 Query: 429 LINHSGLIQNLKSLKSFIVPNTGELAIESKRV-KGAKSTDYSDGLMY 474 L L+ + S+K ++P+ G++ +++R +G + D + Sbjct: 412 LPRDRELVGQIHSIKRRVLPS-GKVGFDAERSTRGGHA-DRFWAIAL 456 >gi|156976253|ref|YP_001447159.1| hypothetical protein VIBHAR_05025 [Vibrio harveyi ATCC BAA-1116] gi|156527847|gb|ABU72932.1| hypothetical protein VIBHAR_05025 [Vibrio harveyi ATCC BAA-1116] Length = 593 Score = 55.5 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 34/259 (13%), Positives = 70/259 (27%), Gaps = 53/259 (20%) Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247 + DE P D +N T +N + S P + + Y + + + D Sbjct: 258 VYVDEYFWIPKFDELNKLASAMATHKNWRKT--YFSTPSAKTHQAYTFWTGDQWRQGRDT 315 Query: 248 WKRFQIDT----RTVEGIDPSF--------------------HEGIIARYGLDSDVTRVE 283 + T R + P + + Y D Sbjct: 316 RANIEFPTFDDYRDGGRLCPDKQWRYVVTIEDAAAGGCELFDIDELRDEYSKDD--FDNL 373 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV 333 F S + +E+A+ + P + +G D + + + Sbjct: 374 FMCIFVDGAS-SVFKFSALEKAMVDISRWQDFKPNDNDPFDRREVWLGYDPSRTRDNACL 432 Query: 334 ------VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387 +V + W + + ++S + E+Y + ID GA D L Sbjct: 433 VVVAPPIVAIEKFRVLEKHYWRGLNFQYQAQQVSKVFERYNVSYLGIDTTGIGAGVYDLL 492 Query: 388 -EMLGYHVYRVLGQKRAVD 405 + + + + Sbjct: 493 SKKHPRETVAIQYSNESKN 511 >gi|261347084|ref|ZP_05974728.1| terminase, ATPase subunit [Providencia rustigianii DSM 4541] gi|282564814|gb|EFB70349.1| terminase, ATPase subunit [Providencia rustigianii DSM 4541] Length = 119 Score = 55.5 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 36/89 (40%), Gaps = 9/89 (10%) Query: 310 PCPDPYAPLIMGCDIAE--EGGDNT---VVV--LRRGPVIEHLFD--WSKTDLRTTNNKI 360 P Y P+ +G D A+ + GD+ V+ +R+G L W D R ++ I Sbjct: 21 IRPYAYHPVWIGYDPAKGTQNGDSAGCVVIAPPMRKGDKFRILEHHQWRGMDFRAQSDAI 80 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEM 389 L E+Y I ID+ G + Sbjct: 81 KELTERYNVQYIGIDSTGIGHGVLQNVRE 109 >gi|149909656|ref|ZP_01898309.1| putative bacteriophage terminase, ATPase subunit [Moritella sp. PE36] gi|149807360|gb|EDM67313.1| putative bacteriophage terminase, ATPase subunit [Moritella sp. PE36] Length = 601 Score = 55.5 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 58/350 (16%), Positives = 98/350 (28%), Gaps = 77/350 (22%) Query: 88 IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147 IG T A+ + + A S Q AE+ K + + F ++ Sbjct: 181 IGATFYFAFEAFYDAVVNGRNKIFISA-SRDQ------AEIFKANIIALCREQFGIE--- 230 Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD-- 205 L +P + + K ST RT D + DE P Sbjct: 231 LSGSPLTMRNKGKTTTLYFK--STNARTAQSASGD------------LYIDEVFWIPKFK 276 Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLS--------GKFY---EIFNKP---------- 244 + T ++ S P S G++Y + N P Sbjct: 277 ELRSLAQAMATHKDFRIT--YFSTPSVTSHEAYDLWNGRWYRKTKACNDPEFAIDVSRKT 334 Query: 245 -------LDDWKRFQIDTRTV--EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDS 295 D R +++ V +G D + Y + +F + Sbjct: 335 LKHGLLCDDGIWRQKLNVYDVVEQGFDRIDISMLENEYSKEE--FDNLFMCKFIDDAHSA 392 Query: 296 FIPLNIIEEALN----------REPCPDPYAPLIMGCDIAEEGGDNTVVVL------RRG 339 F L + + P P+++G D A +VVVL Sbjct: 393 F-SLKQLMACVGNSKKWTDFDPTWSRPYAMKPVVIGFDPARTRDIASVVVLSLPLGPDDK 451 Query: 340 PVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 + + S D T ++I L KY I +D G + ++ Sbjct: 452 FRLLESLNLSGNDFETMASEIKELTLKYHVVHIGVDTTGMGLGVFELIQK 501 >gi|317049635|ref|YP_004117283.1| hypothetical protein Pat9b_3434 [Pantoea sp. At-9b] gi|316951252|gb|ADU70727.1| protein of unknown function DUF264 [Pantoea sp. At-9b] Length = 588 Score = 55.5 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 50/144 (34%), Gaps = 23/144 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y + + +F D+ S PL ++++ + P Sbjct: 348 LDQLRMEY--SPPEYQNLLMCEFVD-DLASVFPLQLLQKCMVDSWEVWTDFEALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ V+ G L W D R I L + Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVIAPPAVPGGKFRILERHQWRGMDFRAQAESIRKLTQ 464 Query: 366 KYRPDAIIIDANNTGARTCDYLEM 389 +Y I ID+ G + ++M Sbjct: 465 QYNVTYIGIDSTGVGLGVYENVKM 488 >gi|293433090|ref|ZP_06661518.1| terminase [Escherichia coli B088] gi|291323909|gb|EFE63331.1| terminase [Escherichia coli B088] Length = 591 Score = 55.5 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 67/211 (31%), Gaps = 28/211 (13%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + E + RY ++ + F DS + + Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NTATFNMLYMCVFVDSK-DSVFSFSDL 384 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVV------VLRRGPVIEHLF 346 E + P + G D A G + V + + Sbjct: 385 EACGVEVDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIIAPPMYAAEKFRVLKVI 444 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406 +W + R +I L +KY + +D G D ++ V AV + Sbjct: 445 NWKGMNFRYQARQIELLFKKYNFTYLGVDVTGIGQGVFDNIQHFAMRV--------AVAI 496 Query: 407 EFCRNRRTELHVKMADWLEFASLINHSGLIQ 437 + N + +L +K AD +E + L + Sbjct: 497 RYDMNTKNQLVLKAADVVESQRIEWDKNLKE 527 >gi|326782137|ref|YP_004322538.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-HM1] gi|310004344|gb|ADO98737.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-HM1] Length = 560 Score = 55.5 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 68/414 (16%), Positives = 130/414 (31%), Gaps = 65/414 (15%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 +Q E +E + N P GK+T +L + ++V Sbjct: 60 DFQEELIESFHENRFNIAKLPRQ------------TGKSTTCVSYLLHYILFNDNVNVGI 107 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 LAN + + L + + Q + ++ + L SK Sbjct: 108 LANKLSTARDLL----GRLQLAYEQLPLWLQQGIVVY------NKGSMELENGSK-ILAA 156 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI----NLGILGFLTERNANRFWIMTS 228 + S R +F I DE + P+ I + +T + I+ S Sbjct: 157 STSASAVRGMSFN--------IIFLDEFAFIPNHIAEQFFSSVYPTITS-GTSTKVIIIS 207 Query: 229 NPRRLSGKFYEIF---NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC 285 P ++ FY+++ K + + ++ V G D + E IA E Sbjct: 208 TPNGMN-HFYKLWVDAQKGRNGYAWSEVHWSKVPGRDAKWKEQTIANTSERQ--FTQEFD 264 Query: 286 GQFPQQDIDSFIPLNIIEEALNREPC-----------PDPYAPLIMGCDIAEE--GGDNT 332 +F +D+ I + +P P I+ D++ + Sbjct: 265 CEFL-GSVDTLITAAKLRTLTYDDPLTTNGSLDVYENPVRDHDYIICVDVSRGLAQDYSA 323 Query: 333 VVVLRRGPVIEHLF-DWSKTDL--RTTNNKISGLVEKYRPDAIIIDANNTG----ARTCD 385 VV+ L + D+ N I + Y ++ + N+ G Sbjct: 324 FVVIDITHAPWRLVAKYRDHDVRPMVYPNIIFNVATNYNKAYVLTEVNDIGEAVSGSLFY 383 Query: 386 YLEMLGYHVYRVLG-QKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQN 438 LE + + G + V F N+ ++ VKM+ ++ N LI++ Sbjct: 384 DLEYENVLMCAMRGRAGQIVGQGFSGNK-VQMGVKMSKTVKAQGCSNLKTLIED 436 >gi|218704209|ref|YP_002411728.1| putative Terminase, ATPase subunit from bacteriophage origin [Escherichia coli UMN026] gi|218431306|emb|CAR12184.1| Putative Terminase, ATPase subunit from bacteriophage origin [Escherichia coli UMN026] Length = 591 Score = 55.5 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 67/211 (31%), Gaps = 28/211 (13%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + E + RY ++ + F DS + + Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NTATFNMLYMCVFVDSK-DSVFSFSDL 384 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVV------VLRRGPVIEHLF 346 E + P + G D A G + V + + Sbjct: 385 EACGVEVDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIIAPPMYAAEKFRVLKVI 444 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406 +W + R +I L +KY + +D G D ++ V AV + Sbjct: 445 NWKGMNFRYQARQIELLFKKYNFTYLGVDVTGIGQGVFDNIQHFAMRV--------AVAI 496 Query: 407 EFCRNRRTELHVKMADWLEFASLINHSGLIQ 437 + N + +L +K AD +E + L + Sbjct: 497 RYDMNTKNQLVLKAADVVESQRIEWDKNLKE 527 >gi|328912284|gb|AEB63880.1| SPBc2 prophage-derived uncharacterized protein yonF [Bacillus amyloliquefaciens LL3] Length = 589 Score = 55.5 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 62/431 (14%), Positives = 129/431 (29%), Gaps = 82/431 (19%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 A RG GKT L + PG ++ + ++ Q + + Sbjct: 84 ASRGQGKTWLTSVYCCVQAILFPGTKIVIASGTKGQAREVI-----------EKIDDLRK 132 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 +S +L ++ + S + S + G + +I DE Sbjct: 133 ESPNLRREIEDLKTSTNDAKVEFHNGSWIKIVASND------GARSKRANLLIVDEFRMV 186 Query: 204 P-DVINLGILGFLTERNANRFW-------IMTSNPR-RLSGKFYE---IFNKPLDDWKR- 250 ++I+ + FLT + ++ + N LS +Y+ FN+ + + Sbjct: 187 DFEIISKVLRKFLTAPRSPKYLEKEEYAHLKERNKEIYLSSCWYKVHWSFNRFITYYNAM 246 Query: 251 ------------FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298 +QI + +D +A D +E+ + + ++ Sbjct: 247 MKGSKYFVCGLPYQIAIKEG-LLDKDQVRDEMAEEDFDPIGWSMEMEALWFGESEKAYFK 305 Query: 299 LNIIEEALN-------------------REPCPDPYAPLIMGCDIAEEGG---DNTVVVL 336 IE+ + P ++ DIA G D +V + Sbjct: 306 FEDIEKNRKLASPLFPPDYYSLIKDSNFKYESKKPGEIRLVSNDIAGMAGKDNDASVYTV 365 Query: 337 RR--------GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 R I ++ T +I + E Y D I++D + G D L Sbjct: 366 FRLIPNSNGYDRHIVYMESIVGGHTGTQATRIRQIYEDYDCDYIVLDTQSIGLGVYDAL- 424 Query: 389 MLGYHVYRVLGQKRAVDLEFC---RNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSF 445 + ++RA + E + R + + I + + + ++ Sbjct: 425 -----CQPLYDKERAKEYEPFSCINDERMAERCTYQNAEKVIYSIKGNAQLNSEIAVLLK 479 Query: 446 IVPNTGELAIE 456 G++ I Sbjct: 480 DGFKRGKIKIP 490 >gi|326775607|ref|ZP_08234872.1| phage terminase, large subunit, PBSX family [Streptomyces cf. griseus XylebKG-1] gi|326655940|gb|EGE40786.1| phage terminase, large subunit, PBSX family [Streptomyces cf. griseus XylebKG-1] Length = 416 Score = 55.1 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 64/412 (15%), Positives = 127/412 (30%), Gaps = 49/412 (11%) Query: 79 KGAISAGR-GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137 + I++G GKT + L+ W+M + A S +L A ++K + + Sbjct: 23 RINIASGSIRAGKT--ISTLLRWIMY-------VATAPSGGEL-----AVIAKTTNTAAS 68 Query: 138 KHWFEMQSLSL-HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER-PDTFVGHHNTYGMAI 195 + +Q +L P + + ++ R + G + Sbjct: 69 NVFIPLQDPNLFGPLAQHVHYTRGAPTATILGRQVRVIGANDSRAEERLRGMTCAGAL-- 126 Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD----DWKRF 251 DEA+ P +LG ++ A ++NP + F D + + Sbjct: 127 -VDEATLVPQEFWTQLLGRMSVPGAK--LFASTNPGSPAHWLKRDFIDRRDELGIRYWHY 183 Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 +D + + I + R V G++ S + EE + Sbjct: 184 VLD--DNPSLGDDYKNSIKNEF--VGLWYRRFVLGEWIA-AEGSIFDM-WDEEKHVVDTL 237 Query: 312 PDPYAPLIMGCDIAEEGG-DNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370 P+ + +G D + T++ L R + +W + D R +++ + R Sbjct: 238 PEIAKWISVGVDYGQTNPFHATLLGLGRDRRLYAASEW-RYDGRQQRRQLTDIEYSERMR 296 Query: 371 AIIIDANNTGARTCDYLEMLGYHVYRVLGQKRA------VDLEFCRNRRTELHVKMADWL 424 + + G + V A + N + MA L Sbjct: 297 GWLSNVAGIG-----PVRPQFVTVDPSAASFSAQLRRDRLTPTPANNAVLDGIRTMASLL 351 Query: 425 EFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS-DGLMYT 475 L+ HS + + + + A E + K D+ D L Y Sbjct: 352 SAGKLVVHSSCKALIGEMPGYAWDDK---AAEKGEDRPIKVADHGVDALRYA 400 >gi|171779706|ref|ZP_02920662.1| hypothetical protein STRINF_01543 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] gi|171281808|gb|EDT47242.1| hypothetical protein STRINF_01543 [Streptococcus infantarius subsp. infantarius ATCC BAA-102] Length = 470 Score = 55.1 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 55/350 (15%), Positives = 101/350 (28%), Gaps = 53/350 (15%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQ ++ V + + + GKT + L LW + G++++ Sbjct: 42 PWQQNLLKSVMGIEEDGLWTHQKFGYSIPRRN----GKTEIIYMLELWGL--YHGLNILH 95 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 A+ + + + +V ++L + + S+ + + Sbjct: 96 TAHRISTSHS-SFEKVKRYLEKMGLEDGKSFNSIRA--------KGQERIELYETGGVVQ 146 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232 RT G ++ DEA + +T+ + N IM P Sbjct: 147 YRT-RTSNGGLGEGFD-----LLVIDEAQEYTTEQESALKYTVTDSD-NPMTIMCGTPPT 199 Query: 233 L------SGKFYEIFNKPLDD---WKRFQIDTRT---------VEGIDPSFH---EGIIA 271 K+ E W + + FH I A Sbjct: 200 PVSSGTVFTKYRETCLFGRGKYLGWAEWSVSEEKEIDDIDAWYNSNPSMGFHLNERKIEA 259 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330 G D V+ G +P + S I AL + P L +G + G D Sbjct: 260 ELGEDKLDHNVQRLGYWPTYNQKSAISETEW--NALKIDDLPQLQGQLFVGI---KYGQD 314 Query: 331 NT----VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 T V +R + +R N+ I +++ I+ID Sbjct: 315 GTNVALSVAVRTKDKHIFVETVDCQSVRNGNHWIINFLKQADIAQIVIDG 364 >gi|262042498|ref|ZP_06015656.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259040136|gb|EEW41249.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 587 Score = 55.1 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL ++ + P Sbjct: 347 LDQLRLEY--SPDEYQNLLMCEFID-DLASVFPLADLQACMVDSWEVWEDFQALALRPFG 403 Query: 315 YAPLIMGCDIAE--EGGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ V+ G L W D R I L + Sbjct: 404 WREVWIGYDPAKGTQNGDSAGCVVIAPPTVPGGKFRILERHQWRGMDFRAQAEAIRKLTQ 463 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 464 QYNVTYIGIDSTGVGHGVYENVK 486 >gi|9630181|ref|NP_046608.1| hypothetical protein SPBc2p056 [Bacillus phage SPBc2] gi|16079170|ref|NP_389994.1| prophage terminase, ATP subunit; phage SPbeta [Bacillus subtilis subsp. subtilis str. 168] gi|221310018|ref|ZP_03591865.1| hypothetical protein Bsubs1_11626 [Bacillus subtilis subsp. subtilis str. 168] gi|221314340|ref|ZP_03596145.1| hypothetical protein BsubsN3_11547 [Bacillus subtilis subsp. subtilis str. NCIB 3610] gi|221319262|ref|ZP_03600556.1| hypothetical protein BsubsJ_11473 [Bacillus subtilis subsp. subtilis str. JH642] gi|221323538|ref|ZP_03604832.1| hypothetical protein BsubsS_11602 [Bacillus subtilis subsp. subtilis str. SMY] gi|81342066|sp|O31952|YONF_BACSU RecName: Full=SPBc2 prophage-derived uncharacterized protein yonF gi|2634531|emb|CAB14029.1| putative prophage terminase, ATP subunit; phage SPbeta [Bacillus subtilis subsp. subtilis str. 168] gi|3025534|gb|AAC13029.1| unknown [Bacillus phage SPbeta] Length = 589 Score = 55.1 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 62/431 (14%), Positives = 129/431 (29%), Gaps = 82/431 (19%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 A RG GKT L + PG ++ + ++ Q + + Sbjct: 84 ASRGQGKTWLTSVYCCVQAILFPGTKIVIASGTKGQAREVI-----------EKIDDLRK 132 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 +S +L ++ + S + S + G + +I DE Sbjct: 133 ESPNLRREIEDLKTSTNDAKVEFHNGSWIKIVASND------GARSKRANLLIVDEFRMV 186 Query: 204 P-DVINLGILGFLTERNANRFW-------IMTSNPR-RLSGKFYE---IFNKPLDDWKR- 250 ++I+ + FLT + ++ + N LS +Y+ FN+ + + Sbjct: 187 DFEIISKVLRKFLTAPRSPKYLEKEEYAHLKERNKEIYLSSCWYKVHWSFNRFITYYNAM 246 Query: 251 ------------FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298 +QI + +D +A D +E+ + + ++ Sbjct: 247 MKGSKYFVCGLPYQIAIKEG-LLDKDQVRDEMAEEDFDPIGWSMEMEALWFGESEKAYFK 305 Query: 299 LNIIEEALN-------------------REPCPDPYAPLIMGCDIAEEGG---DNTVVVL 336 IE+ + P ++ DIA G D +V + Sbjct: 306 FEDIEKNRKLASPLFPPDYYSLIKDSNFKYEGKKPGEIRLVSNDIAGMAGKDNDASVYTV 365 Query: 337 RR--------GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 R I ++ T +I + E Y D I++D + G D L Sbjct: 366 FRLIPNSNGYDRHIVYMESIVGGHTGTQATRIRQIYEDYDCDYIVLDTQSIGLGVYDAL- 424 Query: 389 MLGYHVYRVLGQKRAVDLEFC---RNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSF 445 + ++RA + E + R + + I + + + ++ Sbjct: 425 -----CQPLYDKERAKEYEPFSCINDERMAARCTYQNAEKVIYSIKGNAQLNSEIAVLLK 479 Query: 446 IVPNTGELAIE 456 G++ I Sbjct: 480 DGFKRGKIKIP 490 >gi|190893406|ref|YP_001979948.1| hypothetical protein RHECIAT_CH0003832 [Rhizobium etli CIAT 652] gi|190698685|gb|ACE92770.1| hypothetical protein RHECIAT_CH0003832 [Rhizobium etli CIAT 652] Length = 443 Score = 55.1 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 60/384 (15%), Positives = 109/384 (28%), Gaps = 53/384 (13%) Query: 84 AGRGIGKTTLNAWLVLW---------LMSTRPGISVICLANSETQLKTTLWAEVSKWLSL 134 AGR GKT L + ++T ++ +A S Q K Sbjct: 34 AGRRSGKTRAAGTLAGYVATLVDHSAYLATSERATIPVMAASTVQ--------AQKAFQA 85 Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE-ERPDTFVGHHNTYGM 193 + S + + S+ + D + RT P + Sbjct: 86 CMVLEESSLLSKQIESS--NSETIKLKTRCDIEVRPANHRTIRGITSPLAIA-----DEV 138 Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI---FNKPLDDWKR 250 A + T I + L N + S+P G+ Y+ + P D Sbjct: 139 AFYFTDGQNTDSQILDAVRPSLQSGNHAGPLVCISSPYAKRGELYDAFKNHHGPNGDAHV 198 Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-- 308 ++E I Y + V E G F + D+ + L +E A + Sbjct: 199 IVAKGASLEFNSTIDPATIARAYKRNPTVADAEYGGNF-RSDVTNLFTLEAVEAATDLGV 257 Query: 309 -EPCPDPYAPLIMGCDIAEE-GGDNTVVVLRRGPVIEHLFDWSKT-----DLRTTNNKIS 361 E P D A G D + + + D + + T + Sbjct: 258 TERAPREGVQYFAHADPAGGSGADGFTLAIGHRENNVAVIDLIRERKSPYNPETVVADYA 317 Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421 L+ YR ++ DA + R +K ++ + +E + Sbjct: 318 DLLRLYRCATVVTDAYAS-------------EWNRAAWRKAGIEPKSAPMTASEFFAALV 364 Query: 422 DWLEFA--SLINHSGLIQNLKSLK 443 + +L++ L L L+ Sbjct: 365 PAVNSGQVALLDDETLKHQLVGLE 388 >gi|228904911|ref|ZP_04068965.1| hypothetical protein bthur0014_60350 [Bacillus thuringiensis IBL 4222] gi|228854925|gb|EEM99529.1| hypothetical protein bthur0014_60350 [Bacillus thuringiensis IBL 4222] Length = 954 Score = 55.1 bits (131), Expect = 3e-05, Method: Composition-based stats. Identities = 62/426 (14%), Positives = 117/426 (27%), Gaps = 83/426 (19%) Query: 92 TLNAWLVLWLMSTRPGISV------ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145 T+ A ++ + G + + + Q + ++ ++ + + N + Sbjct: 442 TMCAHMLWVAFTCNGGTRMAKGAACVVATPYDNQAR-LIFDQLK---TFIDNNPVLQESI 497 Query: 146 LSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD 205 S+ P+ V+ G + ++ R+ SE + G + + DE D Sbjct: 498 KSITKNPY---VIEFKNGSVIRLFTAGTRSGSE--GGSLRGQRADW---LYMDEVDYMGD 549 Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL-------------------- 245 I + E ++ S P G FY+ + Sbjct: 550 KDFESIFAIVNEAPDRIGCMIASTPTGRRGMFYKTCTQMKLNQDVKMNKNNVYDMRSYNR 609 Query: 246 ---DDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 + W F T P + + EV +F + + F + + Sbjct: 610 TLSEGWAEFYFPTMVNPEWGPKMERELRKLFSE--AAYEHEVLAEFGTEMVGVF-NKDYV 666 Query: 303 EEA----LNREPCPDPYAPLIMGCDIAEEGGDNTVVVL---------------------- 336 +EA N P P+ +G D + G +VV Sbjct: 667 DEASSIGYNYTTSPTHDGPIAIGIDWDKAGAATQIVVTQYNPFEVRRPRPELGETEPSFG 726 Query: 337 RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM-LGYHVY 395 R + + KI L Y P I DA G + L LG V Sbjct: 727 RFQIINRIEIPKGEFTYDIAVKKIIELDGVYNPFGIYADA-GAGEYQIELLRKTLGDKVK 785 Query: 396 RVLGQKRAV--DLE----FCRNRRTELHVKMADWLEFASL-----INHSGLIQNLKSLKS 444 RV + D + + + + LE L L + + + + Sbjct: 786 RVHLGSSQMVRDPHSREFEKKPLKAFIVDQTKLMLERGQLRIPHREKDETLARQMTNYQV 845 Query: 445 FIVPNT 450 Sbjct: 846 TRYSPK 851 >gi|109897022|ref|YP_660277.1| hypothetical protein Patl_0695 [Pseudoalteromonas atlantica T6c] gi|109699303|gb|ABG39223.1| protein of unknown function DUF264 [Pseudoalteromonas atlantica T6c] Length = 577 Score = 55.1 bits (131), Expect = 3e-05, Method: Composition-based stats. Identities = 38/250 (15%), Positives = 72/250 (28%), Gaps = 50/250 (20%) Query: 195 IINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247 + DE D I G + ++ R S P S + Y + +N+ D Sbjct: 241 LYVDEVFWMNDFNNIWHVAKGMASHKHWTRTL--ISTPSSKSHEAYTMWSGDRYNQGRKD 298 Query: 248 -----------------------WKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283 W+ F ++ +G + + D Sbjct: 299 EDKQELKVNHATLYGGHLAKDRIWRDFVTVEDAANDGCTLFDIDELKDE--NTPDEFDNL 356 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNRE---------PCPDPYAPLIMGCDIAEEGGDNTV- 333 +F F ++++ A +R+ P P + +G D A D T+ Sbjct: 357 YMCEFVDDSHSVFKLADLLKCATDRQHWKDYRERAPRPFLNRAVWLGYDPARSRDDATLA 416 Query: 334 -----VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 V+ + + DLR +I + +++ I ID G D ++ Sbjct: 417 IVAPPVLPGEKFRVLERIYFKGKDLREQAVEIQKICKRFNVVHIGIDVTGIGWGVFDLVK 476 Query: 389 MLGYHVYRVL 398 V Sbjct: 477 AFYPRVQGFH 486 >gi|322833247|ref|YP_004213274.1| hypothetical protein Rahaq_2540 [Rahnella sp. Y9602] gi|321168448|gb|ADW74147.1| hypothetical protein Rahaq_2540 [Rahnella sp. Y9602] Length = 595 Score = 55.1 bits (131), Expect = 3e-05, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 48/162 (29%), Gaps = 20/162 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + + + RY D+ + F DS + + Sbjct: 332 PDGQWRYVITMEDAVKGGFNRASIDKLRNRYNRDT--FNMLYMCVFVDSK-DSVFKFSDL 388 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV------VVLRRGPVIEHLF 346 E + P + G D A G +T + + LF Sbjct: 389 EICGVDVADWQDHDPNAERPFGNREVWGGFDPARSGDTSTFAIVAPPLYAVEKFRVLCLF 448 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 W + +I L KY I +D G + +E Sbjct: 449 HWKGMNFAYQAAQIKKLFGKYNMTYIGVDVTGIGRGVFELIE 490 >gi|320087122|emb|CBY96890.1| Terminase, ATPase subunit GpP [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] Length = 589 Score = 54.7 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 62/201 (30%), Gaps = 31/201 (15%) Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL----NREPCPDPYA-------PLIM 320 R + + +F S P ++ + P+A P+ + Sbjct: 355 RRENSDEDFKNLFMCEFVDDKA-SVFPFEELQRCMVDVMETWEDFAPFADHPFGSRPVWI 413 Query: 321 GCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIII 374 G D + G VVL G L W D I L EKY + I I Sbjct: 414 GYDPSHTGDSAGCVVLAPPVVSGGKFRMLERHQWKGMDFAAQAEAIRRLTEKYNVEYIGI 473 Query: 375 DANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434 DA G + A + + +T + +K D + L +G Sbjct: 474 DATGLGLGVFQLVR---------SFYPAARGIRYTPEMKTAMVLKAKDTIRRGCLEYDAG 524 Query: 435 ---LIQNLKSLKSFIVPNTGE 452 + Q+ S++ + ++G Sbjct: 525 ATDVTQSFMSIRK-TMTSSGR 544 >gi|238027334|ref|YP_002911565.1| phage terminase ATPase subunit [Burkholderia glumae BGR1] gi|237876528|gb|ACR28861.1| Phage terminase ATPase subunit [Burkholderia glumae BGR1] Length = 605 Score = 54.7 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 44/297 (14%), Positives = 75/297 (25%), Gaps = 57/297 (19%) Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLG 210 D + S G + T RT D + DE + +N Sbjct: 239 LTGDPMRLSNGAELIFLGTSSRTAQSYNGD------------LYFDEYFWVSNFATLNKV 286 Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFYEI---FNKPLDDWKRFQID------TRTVEGI 261 +G T + T + + FN+ D +R +ID + Sbjct: 287 AMGMATHSHLRMTHFSTPSTTTHEAYPFWTGAHFNRDRADDERVEIDISHTSLAPGRQCG 346 Query: 262 DPSFHEGIIARYGLDSDV----------------TRVEVCGQFPQQDIDSFIPLNIIEEA 305 D + + + A + S QF S +++ Sbjct: 347 DGQWRQIVTAEDAVASGFTKLDLEDLRSTNSPADFENLYMCQFVDDTS-SVFAFRLVQAC 405 Query: 306 LN-----------REPCPDPYAPLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDW 348 + P + + +G D A G VV+ + W Sbjct: 406 MVDSWDVWTDVKPLLDRPFGWKQVWIGYDPALTGDSAGCVVIAPPEQPNGKFRVLERHRW 465 Query: 349 SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405 D T KI L ++Y I ID G + V + + Sbjct: 466 KGIDFETQAEKIRELTQRYHVTYIAIDTTGIGHGVHQLVRQFFPRVVPINYSPEVKN 522 >gi|222149246|ref|YP_002550203.1| hypothetical protein Avi_3048 [Agrobacterium vitis S4] gi|221736230|gb|ACM37193.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 452 Score = 54.7 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 45/241 (18%), Positives = 71/241 (29%), Gaps = 22/241 (9%) Query: 175 TYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRR 232 T PDT G + DE + D I + +++ R TS P Sbjct: 114 TALPANPDTARGFSAN----VFLDEFAFHKDSQQIWRALFPVISKGWNIRV---TSTPNG 166 Query: 233 LSGKFYEIFNKPLDD-WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQ 291 S KFYE+ P+DD W R +D + D D E ++ + Sbjct: 167 KSNKFYELATGPIDDPWSRHVVDIYQAVRDGLPRDIEELRAGLADEDSWAQEFELKWLDE 226 Query: 292 DID----SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIE 343 I E A +P +G DI D V+ + Sbjct: 227 ASAWLSYDLISSCEDERA--GDPEGYQGNVCFVGRDIGRR-EDLHVIWVWEQIGDVLWER 283 Query: 344 HLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE-MLGYHVYRVLGQKR 402 + + + ++ +YR ID G + + + G V VL Sbjct: 284 ERIEQKRATFAEMDEAFDDVMTRYRVGRACIDQTGMGEKVTEDAQIRYGSRVEGVLFTGP 343 Query: 403 A 403 Sbjct: 344 N 344 >gi|88858953|ref|ZP_01133594.1| Mu-like prophage FluMu protein gp28 [Pseudoalteromonas tunicata D2] gi|88819179|gb|EAR28993.1| Mu-like prophage FluMu protein gp28 [Pseudoalteromonas tunicata D2] Length = 593 Score = 54.7 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 28/142 (19%), Positives = 43/142 (30%), Gaps = 19/142 (13%) Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF----I------PLNIIEEALN 307 G D + Y +D +F +F I + L Sbjct: 350 NSGFDRIDISVLENEY--STDEFNNLFMCKFIDDAHSAFNLKQIMDCVGDSTKWTDFNLE 407 Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHL--FDWSKTDLRTTNNKIS 361 P P+++G D A G +V V ++ G L D S D ++I Sbjct: 408 -WERPFALRPVVIGFDPARFGDKASVAVLSAPMKPGEKFRLLEAIDLSGNDFEAMASEIK 466 Query: 362 GLVEKYRPDAIIIDANNTGART 383 L +KY I +D G Sbjct: 467 LLTDKYNVQHIGVDTTGIGYGV 488 >gi|290474053|ref|YP_003466928.1| hypothetical protein XBJ1_0997 [Xenorhabdus bovienii SS-2004] gi|289173361|emb|CBJ80138.1| putative phage gene [Xenorhabdus bovienii SS-2004] Length = 594 Score = 54.7 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 34/239 (14%), Positives = 64/239 (26%), Gaps = 52/239 (21%) Query: 198 DEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP----------- 244 DE PD N T + S P + Y ++ Sbjct: 258 DEYFWIPDFKRFNEVASAMATHDHWRTT--YFSTPSAKTHPAYSLWTGDEWRGNDPKRKN 315 Query: 245 --LDDWKRFQIDTRTVE----------------GIDPSFHEGIIARYGLDSDVTRVEVCG 286 + + R G + + + + +Y DS + Sbjct: 316 VAFPAFDELRDGGRDCPDGQWRYVITLEDAIKGGFNLASIDRLRNKYNPDS--FNMLFMC 373 Query: 287 QFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL 336 F S + +++ + +AP + G D A G +T V++ Sbjct: 374 VFVDSGA-SVFTYSQVDKCGVDINLWEDHAPNASRPFGEREVWGGFDPARSGDTSTFVIV 432 Query: 337 ------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 + F W + + I L ++YR I ID G + ++ Sbjct: 433 APPMMANEAFRVLATFYWQGMNWKHQAKLIEELFKRYRFTHIGIDTTGIGHGVYEMVQD 491 >gi|320198795|gb|EFW73395.1| Phage terminase, ATPase subunit [Escherichia coli EC4100B] Length = 603 Score = 54.7 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 20/135 (14%), Positives = 42/135 (31%), Gaps = 17/135 (12%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP---------CPDPYA 316 E + +Y + QF F ++ ++R P Sbjct: 356 IERLRNKYSPT--AFAMLYMCQFVDSKDAVFKFSALVGCEVDRATWGDFDLTATRPFGNR 413 Query: 317 PLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370 + G D + G ++T V++ + ++ W + ++I L+ ++ Sbjct: 414 EVWAGFDPSRSGDNSTFVLIAPPIEDGERFRVLAVWQWQGFNFSWQADQIKQLMRRFNIT 473 Query: 371 AIIIDANNTGARTCD 385 I ID G D Sbjct: 474 YIGIDTTGIGKGVYD 488 >gi|193062794|ref|ZP_03043887.1| putative conserved hypothetical protein [Escherichia coli E22] gi|192931437|gb|EDV84038.1| putative conserved hypothetical protein [Escherichia coli E22] Length = 603 Score = 54.7 bits (130), Expect = 3e-05, Method: Composition-based stats. Identities = 20/135 (14%), Positives = 42/135 (31%), Gaps = 17/135 (12%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE---------PCPDPYA 316 E + +Y + QF F ++ ++R P Sbjct: 356 IERLRNKYSPT--AFAMLYMCQFVDSKDAVFKFSALVGCEVDRATWGDFDLTAARPFGNR 413 Query: 317 PLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370 + G D + G ++T V++ + ++ W + ++I L+ ++ Sbjct: 414 EVWAGFDPSRSGDNSTFVLIAPPIEDGERFRVLAVWQWQGFNFSWQADQIKQLMRRFNIT 473 Query: 371 AIIIDANNTGARTCD 385 I ID G D Sbjct: 474 YIGIDTTGIGKGVYD 488 >gi|89071120|ref|ZP_01158320.1| Putative large terminase [Oceanicola granulosus HTCC2516] gi|89043331|gb|EAR49553.1| Putative large terminase [Oceanicola granulosus HTCC2516] Length = 444 Score = 54.7 bits (130), Expect = 4e-05, Method: Composition-based stats. Identities = 61/418 (14%), Positives = 114/418 (27%), Gaps = 67/418 (16%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGIS---------VICLANSETQLKTTLWAEVSKWL 132 I GRG GKT A W+ + G V + + Q + + Sbjct: 58 ILGGRGAGKTRAGA---EWVRAQVEGPRATDPGRARRVALVGETIDQAREVMV------- 107 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 F L P + + +S P+ G Sbjct: 108 --------FGDSGLLACAPPDRRPEWIAGRRLLVWPNGAQAQLFSAHDPEALRGPQFD-- 157 Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250 A DE + + + L + R + R + + + + Sbjct: 158 -AAWVDELAKWKKAEEAWDMLQLALRLGDDPR--CCVTTTPRPTALMRALLERD-GTART 213 Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 + +F + RY S + R E+ G + + I A N + Sbjct: 214 HAPTEANAANLARAFLAEVRRRY-AGSPLGRQELDGVMLSEIEGALWSAGAI-AAANCDV 271 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTT-NNKIS 361 PD + +++ D + GGD +V+ L D S TT Sbjct: 272 VPDLH-RVVVAVDPSAGGGDVCGIVVAGACYDGGADNWRAWVLEDASVAGSSTTWARAAI 330 Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421 E+++ D I+ + N G L + G + L+ + Sbjct: 331 AAYERHQADRIVAEVNQGGDMVAAMLRQV-APTVPYKGVRAMRGKAARAEPVAALYEQGR 389 Query: 422 -DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478 + + L + + + +G S D +D L++ +E Sbjct: 390 VRHVRGLGALEDQ---MALMTHQGY---------------RGRGSPDRADALVWALSE 429 >gi|167553298|ref|ZP_02347048.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] gi|205322236|gb|EDZ10075.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] Length = 589 Score = 54.7 bits (130), Expect = 4e-05, Method: Composition-based stats. Identities = 50/310 (16%), Positives = 90/310 (29%), Gaps = 67/310 (21%) Query: 195 IINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPL-- 245 + DE P+ + G ++++ S P L+ Y E+FNK Sbjct: 250 LYVDEIFWIPNFQKLRKVASGMASQKHLRST--YFSTPSTLAHGAYPFWSGELFNKGRAS 307 Query: 246 ----------------------DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRV 282 W++ I+ G + + + + Sbjct: 308 AADRIEIDISHSALAGGLLCADGQWRQIVTIEDALASGCTLFDLDQLRRE--NSDEDFKN 365 Query: 283 EVCGQFPQQDIDSFIPLNIIEEAL----NREPCPDPYA-------PLIMGCDIAEEGGDN 331 +F S P ++ + P+A P+ +G D + G Sbjct: 366 LFMCEFVDDKA-SVFPFEELQRCMVDVMETWEDFTPFADHPFGSRPVWIGYDPSHTGDSA 424 Query: 332 TVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385 VVL G L W D I L EKY + I IDA G Sbjct: 425 GCVVLAPPVVSGGKFRMLERHQWKGMDFAAQAEGIRRLTEKYNVEYIGIDATGLGLGVFQ 484 Query: 386 YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG---LIQNLKSL 442 + A + + +T + +K D + L +G + Q+ S+ Sbjct: 485 LVR---------SFYPAARGIRYTPEMKTAMVLKAKDTIRRGCLEYDAGATDVTQSFMSI 535 Query: 443 KSFIVPNTGE 452 + + ++G Sbjct: 536 RK-TMTSSGR 544 >gi|83943173|ref|ZP_00955633.1| terminase, large subunit, putative [Sulfitobacter sp. EE-36] gi|83846181|gb|EAP84058.1| terminase, large subunit, putative [Sulfitobacter sp. EE-36] Length = 408 Score = 54.7 bits (130), Expect = 4e-05, Method: Composition-based stats. Identities = 64/423 (15%), Positives = 113/423 (26%), Gaps = 73/423 (17%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGIS---------VICLANSETQLKTTLWAEVSKWL 132 I GRG GKT A W+ + G V + + Q++ + Sbjct: 16 IMGGRGAGKTRAGA---EWVRAQVEGSRPLDAGRCRRVALVGETIEQVREVM-------- 64 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 + S + W + + ++ P+ G Sbjct: 65 --IFGDSGILACSPADRRPDWEATRKRLVWPN-----GAVATVHTAHDPEGLRGPQFD-- 115 Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250 A DE + + + L + +T+ P R G + P Sbjct: 116 -AAWVDELAKWKKAEETWDQLQFAL-RLGEDPRVCVTTTP-RNVGVLKNLLASPSTV-TT 171 Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 + SF E + ARY + + R E+ G + IE R+ Sbjct: 172 HAPTEANAANLAGSFLEEVRARY-RGTRLGRQELDGVLLADAEGALWTSERIEAGRVRDV 230 Query: 311 CPDPYAPLIMGCDIA---EEGGDNTVVVLRRGPVIEHLFDWS----------KTDLRTTN 357 +++G D A G D +V+ DW Sbjct: 231 PL--LDRIVVGLDPATTAGAGSDECGIVVVGAQTQGPPQDWRAVVMADCTVQGATPSGWA 288 Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRRTE 415 +E+Y D ++ + N G + L + V V + V R E Sbjct: 289 RAAISAMEQYGADRLVAEVNQGGQMVAEVLRQVDPLVPVKSVHASRGKV-------ARAE 341 Query: 416 LHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 + + + L + + + TG S D D L++ Sbjct: 342 PVAALYEQGRVGHVAGLDALEDQMCRMTARGYEATG-------------SPDRVDALVWA 388 Query: 476 FAE 478 E Sbjct: 389 LHE 391 >gi|224582696|ref|YP_002636494.1| hypothetical protein SPC_0883 [Salmonella enterica subsp. enterica serovar Paratyphi C strain RKS4594] gi|224467223|gb|ACN45053.1| putative phage protein [Salmonella enterica subsp. enterica serovar Paratyphi C strain RKS4594] Length = 591 Score = 54.7 bits (130), Expect = 4e-05, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 51/168 (30%), Gaps = 20/168 (11%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + E + RY ++ + F DS + + Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NTATFNMLYMCVFVDSK-DSVFSFSDL 384 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346 E + P + G D A G + V++ + + Sbjct: 385 EACGVEVDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIVAPPMFAVEKFRVLKVI 444 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHV 394 W + R +I L +KY + +D G D ++ V Sbjct: 445 YWKGMNFRYQAKQIEQLFKKYNFTYLGVDVTGIGQGIFDNIQHFAMRV 492 >gi|170023192|ref|YP_001719697.1| hypothetical protein YPK_0943 [Yersinia pseudotuberculosis YPIII] gi|169749726|gb|ACA67244.1| protein of unknown function DUF264 [Yersinia pseudotuberculosis YPIII] Length = 697 Score = 54.7 bits (130), Expect = 4e-05, Method: Composition-based stats. Identities = 21/157 (13%), Positives = 45/157 (28%), Gaps = 20/157 (12%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E + G F F + +E+ L + + Sbjct: 367 IEELREENGES--AFNQLYMCLFVDTGDCVF-RFDQLEKCLVTVSNWEDHDVNAARPFGN 423 Query: 317 -PLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369 + G D A G + V++ + H+ W + +I + +Y Sbjct: 424 REVWAGYDPARSGDTASFVLVAPPQADGEPFRVLHIETWHGFAFKYQVGRIKEYMARYNI 483 Query: 370 DAIIIDANNTGARTCDYLEML-GYHVYRVLGQKRAVD 405 I ID+ G C+ ++ V ++ + + Sbjct: 484 THIGIDSTGIGGPVCELVQEFARREVTQIHYSPESKN 520 >gi|317153313|ref|YP_004121361.1| hypothetical protein Daes_1602 [Desulfovibrio aespoeensis Aspo-2] gi|316943564|gb|ADU62615.1| hypothetical protein Daes_1602 [Desulfovibrio aespoeensis Aspo-2] Length = 507 Score = 54.7 bits (130), Expect = 4e-05, Method: Composition-based stats. Identities = 60/394 (15%), Positives = 110/394 (27%), Gaps = 69/394 (17%) Query: 38 WGEKGTPLEGFSAPRSW--QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNA 95 WG+ S W Q+E + + + ++ GR +GK+ + + Sbjct: 20 WGQAYLYNRDGSGRDYWPHQVEDLRCLARNIIHLD--------------GRDVGKSIVLS 65 Query: 96 WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYS 155 L T G + A + L + + E+ L P+ M S+++ Sbjct: 66 TDALHYAFTTRGGQGLIAAPHQGHLDSII-EEIEYQLDTNPD----LMNSIAVTKYGKPK 120 Query: 156 DVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL 215 ++ + S + + D+F H + DE + + + L Sbjct: 121 IHRKPYFRLEFTNGSVLYFRPAGAYGDSFRSLHVGR---VWVDEGAWLTERAWKALRQCL 177 Query: 216 TERNANRFWIMTSNPRRL-SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYG 274 R + S P L +Y + D + F+ + ++ YG Sbjct: 178 KTGGILRIY---STPNGLRDTTYYRL--TSSDQFHVFRWPSWLNPLWTEDRESELLEFYG 232 Query: 275 -LDSDVTRVEVCGQFPQQDIDSF-----------------IPLNIIE--------EALNR 308 DS + EV G+ + +F I + E A +R Sbjct: 233 GRDSSGWQHEVAGEHGKPSYGAFNVEQFNLCRQDLLEYQKIVITDSELRDCDTEEAAHDR 292 Query: 309 -----EPCPDPYAPLIMGCDIAEEGGDNTVVVL-------RRGPVIEHLFDWSKTDLRTT 356 P I G D+ ++V R + Sbjct: 293 MEMLLNLTPRSGQFWIGG-DLGYTNDPTEIIVFQEMEVGERTLLKMILRVHLEHVSYPHI 351 Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390 + L Y P I +D G L L Sbjct: 352 AQIFALLERYYTPAGIGVDNGGNGLAVVQELLTL 385 >gi|159044464|ref|YP_001533258.1| hypothetical protein Dshi_1915 [Dinoroseobacter shibae DFL 12] gi|157912224|gb|ABV93657.1| hypothetical protein Dshi_1915 [Dinoroseobacter shibae DFL 12] Length = 260 Score = 54.3 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 52/294 (17%), Positives = 89/294 (30%), Gaps = 62/294 (21%) Query: 35 FFPWGE-------KGTPLEGFSA--PRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAG 85 PW E + L + P WQ+E A+ G Sbjct: 6 LIPWAEDLERRLDPVSRLTHWMGHAPDPWQVEAFTTRATE--------------VALRVG 51 Query: 86 RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145 R GKT++ A + + P +C+A +E Q K + E+ + L Sbjct: 52 RQSGKTSVLAARAVEELHV-PESLTLCVAPAERQAK-IIAREIGRQLQRTS--------- 100 Query: 146 LSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERP-DTFVGHHNTYGMAIINDE----- 199 ++ + R + DT G +I DE Sbjct: 101 -----------LVINRPTQTELEIANGARVIALPSTSDTIRGF--PAVSCLIIDECAFLQ 147 Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQIDTRT 257 G + + +L LTE +S P + F +F KP D R + Sbjct: 148 GDGGGEDLISSVLPMLTEDGQ---VFFSSTPAGKNNYFARLFLDAKPGDGIHRIVVRGTD 204 Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 + + R L + R E+ + ++ L+IIE+A ++ Sbjct: 205 IPRLADKVERM---RRTLSATKFRQEILVEMLADGQ-AYFDLSIIEQATSKTEK 254 >gi|323527775|ref|YP_004229928.1| bacteriophage terminase ATPase subunit [Burkholderia sp. CCGE1001] gi|323384777|gb|ADX56868.1| bacteriophage terminase, ATPase subunit [Burkholderia sp. CCGE1001] Length = 588 Score = 54.3 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 47/170 (27%), Gaps = 21/170 (12%) Query: 247 DWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305 W++ ++ G + + + Y + + QF S L ++ Sbjct: 330 QWRQIVTVEDAARAGCNLFNLDELRREY--SDEEYANLLMCQFIDDTA-SIFTLANLQRC 386 Query: 306 LN-----------REPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGP--VIEHLFDW 348 + P + P+ +G D A G VV + G + W Sbjct: 387 MVDSWELWADYKPLAARPFAWHPVWVGYDPALSGDSAGCVVVAPPMVEGGPFRVLEKHQW 446 Query: 349 SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVL 398 D I + E+Y + ID G ++ V Sbjct: 447 RGLDFEAQAQSIKEITERYNVAYMAIDTTGIGQGVYQLVKQFYPRVVAFN 496 >gi|103487487|ref|YP_617048.1| hypothetical protein Sala_2004 [Sphingopyxis alaskensis RB2256] gi|98977564|gb|ABF53715.1| protein of unknown function DUF264 [Sphingopyxis alaskensis RB2256] Length = 436 Score = 54.3 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 66/407 (16%), Positives = 124/407 (30%), Gaps = 56/407 (13%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 AGRG GKT A V T PG + +A + L E Sbjct: 56 AGRGFGKTRTGAEWVRAFAETTPGARIALVA--------------ASLLEARQVMVEGES 101 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 L++ P + SL + + YS PD+ G + A DE + Sbjct: 102 GLLAIAPDHLRPEY-ESSLRRLTWPNGAVATLYSAVEPDSLRGPEHD---AAWCDEIAKW 157 Query: 204 P--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGI 261 P + ++ + A + T+ PR + + + Sbjct: 158 PKGEAAWDNLMLTM-RIGARPQVVATTTPRCV--PLVRRLIQERGVATTRGRTASNRRNL 214 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMG 321 + + A YG + R E+ G+ + D+ +IE +A +++G Sbjct: 215 SVQWLATMDAIYGGT-RLGRQELDGELLEDVEDALWTRALIERCRVDAGSIGKFARVVIG 273 Query: 322 CD-IAEEGGDNTVVV----LRRGPVIEHLFDWS-KTDLRTTNNKISGLVEKYRPDAIIID 375 D A GGD +V LR G + + + ++ ++ + ++ + Sbjct: 274 VDPPASAGGDACGIVVAALLRDGRLAVVEDASALRPLPGVWAQAVAAAAARWGAERVVAE 333 Query: 376 ANNTGARTCDYLEM--LGYHVYRVLGQKRAVDLEFCRNRRTE---LHVKMADWLEFASLI 430 +N G L + V + RR E L + + + Sbjct: 334 SNMGGDMVAAVLRQADMTLPVVAIHASVGKA-------RRAEPVALAYERGQVVHAGAFA 386 Query: 431 NHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 + + L+ + P +S D +D ++ A Sbjct: 387 DLEDQLCGLQMGGGYAGP--------------GRSPDRADACVWALA 419 >gi|254485756|ref|ZP_05098961.1| phage DNA Packaging Protein [Roseobacter sp. GAI101] gi|214042625|gb|EEB83263.1| phage DNA Packaging Protein [Roseobacter sp. GAI101] Length = 452 Score = 54.3 bits (129), Expect = 4e-05, Method: Composition-based stats. Identities = 62/426 (14%), Positives = 118/426 (27%), Gaps = 79/426 (18%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGIS---------VICLANSETQLKTTLWAEVSKWL 132 I GRG GKT A W+ S G V + + Q++ + Sbjct: 60 IMGGRGAGKTRAGA---EWVRSMVEGARPLDAGRCRRVALVGETIEQVREVM-------- 108 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 + S + W + + ++ P+ G Sbjct: 109 --IFGDSGILACSPADRRPDWEATRKRLVWPN-----GAVASVHTAHDPEGLRGPQFD-- 159 Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250 A DE + + + L + +T+ PR + ++ K L Sbjct: 160 -AAWVDELAKWKKAEETWDQLQFAL-RLGEDPRVCVTTTPRNV-----DVLKKLLASPST 212 Query: 251 FQIDTRT---VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 T + SF E + ARY + + R E+ G + ++E Sbjct: 213 VTTHAPTEANAANLAGSFLEEVRARY-RGTRLGRQELDGVLLADAEGALWTSEMLER--G 269 Query: 308 REPCPDPYAPLIMGCDIA---EEGGDNTVVVLRRGPVIEHLFDWS----------KTDLR 354 R + +++G D A G D +V+ +W Sbjct: 270 RIEKLPTFDRIVVGVDPATTAGAGSDECGIVVVGAQTQGAPQNWRAVVLADCTAQGATPS 329 Query: 355 TTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNR 412 +E+Y D ++++ N G + L + + V + V Sbjct: 330 GWARAAVSAMEQYGADRLVVETNQGGLMVGEVLRQIDPLVPLKSVHASRGKV-------A 382 Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGL 472 R E + + + L + + + +G S D D L Sbjct: 383 RAEPVAALYEQGRVGHVAGLVALEDQMCRMTARGFEGSG-------------SPDRVDAL 429 Query: 473 MYTFAE 478 ++ E Sbjct: 430 VWALHE 435 >gi|24112089|ref|NP_706599.1| putative bacteriophage protein [Shigella flexneri 2a str. 301] gi|30062202|ref|NP_836373.1| putative bacteriophage protein [Shigella flexneri 2a str. 2457T] gi|24050918|gb|AAN42306.1| putative bacteriophage protein [Shigella flexneri 2a str. 301] gi|30040447|gb|AAP16179.1| putative bacteriophage protein [Shigella flexneri 2a str. 2457T] gi|281600053|gb|ADA73037.1| putative bacteriophage protein [Shigella flexneri 2002017] gi|332768291|gb|EGJ98476.1| hypothetical protein SF293071_0835 [Shigella flexneri 2930-71] Length = 179 Score = 54.3 bits (129), Expect = 5e-05, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 49/150 (32%), Gaps = 27/150 (18%) Query: 295 SFIPLNIIEEALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SK 350 + I L+ IE A++ + +P +G D+A+ G D V R G V+ +W + Sbjct: 10 AIIKLSWIEAAVDAHKTLNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKE 69 Query: 351 TDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG------------YHVYRVL 398 +L + + + D I+ D+ GA + + R Sbjct: 70 DELLKSCQRTYQAALEREAD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFN 128 Query: 399 GQ----------KRAVDLEFCRNRRTELHV 418 + +F N + + Sbjct: 129 AGAGVHEPDDEYNGIPNKDFFANLKAQAWW 158 >gi|75758280|ref|ZP_00738405.1| Hypothetical protein RBTH_06375 [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|74494334|gb|EAO57425.1| Hypothetical protein RBTH_06375 [Bacillus thuringiensis serovar israelensis ATCC 35646] Length = 660 Score = 54.3 bits (129), Expect = 5e-05, Method: Composition-based stats. Identities = 62/426 (14%), Positives = 117/426 (27%), Gaps = 83/426 (19%) Query: 92 TLNAWLVLWLMSTRPGISV------ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145 T+ A ++ + G + + + Q + ++ ++ + + N + Sbjct: 148 TMCAHMLWVAFTCNGGTRMAKGAACVVATPYDNQAR-LIFDQLK---TFIDNNPVLQESI 203 Query: 146 LSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD 205 S+ P+ V+ G + ++ R+ SE + G + + DE D Sbjct: 204 KSITKNPY---VIEFKNGSVIRLFTAGTRSGSE--GGSLRGQRADW---LYMDEVDYMGD 255 Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL-------------------- 245 I + E ++ S P G FY+ + Sbjct: 256 KDFESIFAIVNEAPDRIGCMIASTPTGRRGMFYKTCTQMKLNQDVKMNKNNVYDMRSYNR 315 Query: 246 ---DDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 + W F T P + + EV +F + + F + + Sbjct: 316 TLSEGWAEFYFPTMVNPEWGPKMERELRKLFSE--AAYEHEVLAEFGTEMVGVF-NKDYV 372 Query: 303 EEA----LNREPCPDPYAPLIMGCDIAEEGGDNTVVVL---------------------- 336 +EA N P P+ +G D + G +VV Sbjct: 373 DEASSIGYNYTTSPTHDGPIAIGIDWDKAGAATQIVVTQYNPFEVRRPRPELGETEPSFG 432 Query: 337 RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM-LGYHVY 395 R + + KI L Y P I DA G + L LG V Sbjct: 433 RFQIINRIEIPKGEFTYDIAVKKIIELDGVYNPFGIYADA-GAGEYQIELLRKTLGDKVK 491 Query: 396 RVLGQKRAV--DLE----FCRNRRTELHVKMADWLEFASL-----INHSGLIQNLKSLKS 444 RV + D + + + + LE L L + + + + Sbjct: 492 RVHLGSSQMVRDPHSREFEKKPLKAFIVDQTKLMLERGQLRIPHREKDETLARQMTNYQV 551 Query: 445 FIVPNT 450 Sbjct: 552 TRYSPK 557 >gi|17975126|ref|NP_536648.1| putative terminase, ATPase subunit [Vibrio phage K139] gi|153820795|ref|ZP_01973462.1| terminase [Vibrio cholerae B33] gi|165970256|ref|YP_001650887.1| putative terminase ATPase subunit [Vibrio phage kappa] gi|229512054|ref|ZP_04401533.1| hypothetical protein VCE_003464 [Vibrio cholerae B33] gi|229519190|ref|ZP_04408633.1| hypothetical protein VCC_003218 [Vibrio cholerae RC9] gi|229607255|ref|YP_002877903.1| hypothetical protein VCD_002166 [Vibrio cholerae MJ-1236] gi|254849294|ref|ZP_05238644.1| terminase [Vibrio cholerae MO10] gi|17865408|gb|AAL47515.1|AF125163_21 orf16 [Vibrio phage K139] gi|126521587|gb|EAZ78810.1| terminase [Vibrio cholerae B33] gi|165292233|dbj|BAF98815.1| putative terminase ATPase subunit [Vibrio phage kappa] gi|229343879|gb|EEO08854.1| hypothetical protein VCC_003218 [Vibrio cholerae RC9] gi|229352019|gb|EEO16960.1| hypothetical protein VCE_003464 [Vibrio cholerae B33] gi|229369910|gb|ACQ60333.1| hypothetical protein VCD_002166 [Vibrio cholerae MJ-1236] gi|254844999|gb|EET23413.1| terminase [Vibrio cholerae MO10] Length = 605 Score = 54.3 bits (129), Expect = 5e-05, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 48/181 (26%), Gaps = 23/181 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ I+ G D E + Y F S N I Sbjct: 345 PDKQWRYVVTIEDAAKGGCDLFDIEELREEYSETD--FNNLFMCVFVDGAS-SIFEFNKI 401 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV-------VVLRRGPVIEHL 345 E + Y P + +G D + DN V +V + Sbjct: 402 ERCMVDSDIWQDYKPNAARPFGSREVWLGYDPSRT-RDNAVLMVVAPPIVAVEKFRVLEK 460 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAV 404 W + ++IS + E++ + ID GA D L + Sbjct: 461 HTWRGLSFQHQASEISKVFERFNVTYLGIDITGIGAGVHDLLVNKHPRETVAIHYSNENK 520 Query: 405 D 405 + Sbjct: 521 N 521 >gi|118475162|ref|YP_891824.1| hypothetical protein CFF8240_0649 [Campylobacter fetus subsp. fetus 82-40] gi|261886523|ref|ZP_06010562.1| hypothetical protein CfetvA_16765 [Campylobacter fetus subsp. venerealis str. Azul-94] gi|118414388|gb|ABK82808.1| hypothetical protein CFF8240_0649 [Campylobacter fetus subsp. fetus 82-40] Length = 523 Score = 54.3 bits (129), Expect = 5e-05, Method: Composition-based stats. Identities = 54/344 (15%), Positives = 108/344 (31%), Gaps = 38/344 (11%) Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYS----EERPDTFVGHHNTYGMAIINDEASGTP 204 + W + S DS+H + T G I DE + P Sbjct: 183 YMRHWAKEYG-ISFKKDSEHEVVLENGAYIKSFANNFRTVQGFAGD----IWMDEFAWYP 237 Query: 205 DV--INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP--LDDWKRFQIDTRTVEG 260 + I + + + S P F++I++ +KRF + Sbjct: 238 NPKRIWHAFVPSI--GAIKGRLTILSTPFEERSLFHQIYSDKTKFHMFKRFCVSIYKAIE 295 Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP---CPDPYAP 317 F + D+D QF + S + +++I+ ++ + P Sbjct: 296 DGLDFDLETMRDL-FDTDTWASAYECQFVDDES-SLLSISLIKSCVDNKAHYFTPKSSEC 353 Query: 318 LIMGCDIAEEGGDNTV--VVLRRGPVIEHLFDW-SKTDLRTTNNKISGLVEKYRPDAIII 374 + G D+ +T+ VVL G L D +K ++ ++ Y + I Sbjct: 354 IYAGYDVGRVSDRSTLAGVVLENGVYKTALMDILAKARFEEQKEHLTSFLKTYPISVLKI 413 Query: 375 DANNTGARTCDYL-EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE--FASLIN 431 D G + + + V V + E+ + + E + N Sbjct: 414 DKTGIGMNLAENMHDKFKSRVSGVWFSNTRKE---------EMALNLKKAFEDKLIKIPN 464 Query: 432 HSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 LI ++ ++K I + + ++KR + + D L Sbjct: 465 DPLLIADIHAIKRTIGAKSFKY--DAKRNEYGHA-DRFWALALA 505 >gi|83643297|ref|YP_431732.1| Mu-like prophage FluMu protein gp28 [Hahella chejuensis KCTC 2396] gi|83631340|gb|ABC27307.1| Mu-like prophage FluMu protein gp28 [Hahella chejuensis KCTC 2396] Length = 581 Score = 54.3 bits (129), Expect = 5e-05, Method: Composition-based stats. Identities = 72/391 (18%), Positives = 124/391 (31%), Gaps = 83/391 (21%) Query: 88 IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147 IG T AW T + A S +Q A++ K L + +F+ + Sbjct: 168 IGATYYFAWEAFQDAITSGDNQIFLSA-SRSQ------ADIFKAYILKFAREYFDTELKG 220 Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 207 + DV+ S G + + ST RT G+H + DE PD Sbjct: 221 V-------DVIPLSNGAELRFVSTNGRTA--------QGYHGH----LYIDEVFWIPD-- 259 Query: 208 NLGILGFLTERNANRFW--IMTSNPRRLSGKFYEIF----------NK------------ 243 + + A++ W S P S Y ++ +K Sbjct: 260 FDRLNKLASGMAAHKKWRKTYFSTPSVKSHGAYTLWSGERYNESRRHKVEFDLSRAALRE 319 Query: 244 ----PLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298 P W+ ++ +G D + + Y F + + F Sbjct: 320 GQLGPDKVWRNVVTVEDAANQGCDLFDIDELKQEY--TEAEFNNLFMCAFMEAGLSVFKL 377 Query: 299 LNIIEEAL----------NREPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEH 344 +++ A+ R+P P P+ +G D A G +TVVV + Sbjct: 378 DDLLSCAVCSSDVWPDFKPRQPRPFANYPVWLGYDPARTGDRSTVVVVAPPMHPAGKFRV 437 Query: 345 LFDWS-KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRA 403 L K N+I L+ +Y + ID G + ++ RA Sbjct: 438 LEKIQLKGAFSYQANRIKDLLGRYNVQFVGIDCTGPGLGVFEQVKA---------FYPRA 488 Query: 404 VDLEFCRNRRTELHVKMADWLEFASLINHSG 434 + + N +T L +K D +E A + + Sbjct: 489 TPIHYSLNAKTALVLKAMDVIENARIEWDAE 519 >gi|12276099|gb|AAG50261.1|AF311654_1 probable terminase [Phage GMSE-1] Length = 268 Score = 54.3 bits (129), Expect = 5e-05, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 64/202 (31%), Gaps = 27/202 (13%) Query: 267 EGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP--------- 317 E + ++Y + + QF DS +E + P Sbjct: 35 ERLRSKY--PARYFNMLYQCQFVDSG-DSVFSFGDLERCGVETVRWQDHQPNAARPFGNR 91 Query: 318 -LIMGCDIAEEGGDNTVVVLR----RGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPD 370 + G D A G +T V++ G L W + + ++I L +Y Sbjct: 92 EVWAGFDPARSGDTSTFVIMAPPQYEGERFRVLVTFYWQGMNWKYQASQIKALFARYHMT 151 Query: 371 AIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI 430 I ID G+ + + ++ V + + +T + +KM D +E + Sbjct: 152 HIGIDTTGIGSGV--------FEMVEAFAPRQTVAIRYGVETKTRMVLKMVDLVESKRIE 203 Query: 431 NHSGLIQNLKSLKSFIVPNTGE 452 + S S +T + Sbjct: 204 WDGEQKEIAASFLSIRRTSTAK 225 >gi|3337256|gb|AAC34148.1| terminase subunit [Enterobacteria phage 186] Length = 589 Score = 54.3 bits (129), Expect = 5e-05, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 60/201 (29%), Gaps = 31/201 (15%) Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY-----------APLIM 320 + + + +F S P ++ + P+ + Sbjct: 355 KRENSDEDFKNLFMCEFVDDKA-SVFPFEELQRCMVDVMEEWEDFAPFADHPFGSRPVWI 413 Query: 321 GCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIII 374 G D + G VVL G L W D + I L EKY + I I Sbjct: 414 GYDPSHTGDSAGCVVLAPPVVSGGKFRMLERHQWKGMDFAAQADGIRKLTEKYSVEYIGI 473 Query: 375 DANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434 DA G + A + + +T + +K D + L +G Sbjct: 474 DATGLGLGVFQLVR---------SFYPAARGIRYTPEMKTAMVLKAKDTIRRGCLEYDAG 524 Query: 435 ---LIQNLKSLKSFIVPNTGE 452 + Q+ S++ + ++G Sbjct: 525 ATDVTQSFMSIRK-TMTSSGR 544 >gi|83649379|ref|YP_437814.1| Mu-like prophage FluMu protein gp28 [Hahella chejuensis KCTC 2396] gi|83637422|gb|ABC33389.1| Mu-like prophage FluMu protein gp28 [Hahella chejuensis KCTC 2396] Length = 581 Score = 54.3 bits (129), Expect = 5e-05, Method: Composition-based stats. Identities = 72/391 (18%), Positives = 124/391 (31%), Gaps = 83/391 (21%) Query: 88 IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147 IG T AW T + A S +Q A++ K L + +F+ + Sbjct: 168 IGATYYFAWEAFQDAITSGDNQIFLSA-SRSQ------ADIFKAYILKFAREYFDTELKG 220 Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 207 + DV+ S G + + ST RT G+H + DE PD Sbjct: 221 V-------DVIPLSNGAELRFVSTNGRTA--------QGYHGH----LYIDEVFWIPD-- 259 Query: 208 NLGILGFLTERNANRFW--IMTSNPRRLSGKFYEIF----------NK------------ 243 + + A++ W S P S Y ++ +K Sbjct: 260 FDRLNKLASGMAAHKKWRKTYFSTPSVKSHGAYTLWSGERYNESRRHKVEFDLSRAALRE 319 Query: 244 ----PLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298 P W+ ++ +G D + + Y F + + F Sbjct: 320 GQLGPDKVWRNVVTVEDAANQGCDLFDIDELKQEY--TEAEFNNLFMCAFMEAGLSVFKL 377 Query: 299 LNIIEEAL----------NREPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEH 344 +++ A+ R+P P P+ +G D A G +TVVV + Sbjct: 378 DDLLSCAVCSGEVWPDFKPRQPRPFANYPVWLGYDPARTGDRSTVVVVAPPMHPAGKFRV 437 Query: 345 LFDWS-KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRA 403 L K N+I L+ +Y + ID G + ++ RA Sbjct: 438 LEKIQLKGAFSYQANRIKDLLGRYNVQFVGIDCTGPGLGVFEQVKA---------FYPRA 488 Query: 404 VDLEFCRNRRTELHVKMADWLEFASLINHSG 434 + + N +T L +K D +E A + + Sbjct: 489 TPIHYSLNAKTALVLKAMDVIENARIEWDAE 519 >gi|237745794|ref|ZP_04576274.1| conserved hypothetical protein [Oxalobacter formigenes HOxBLS] gi|229377145|gb|EEO27236.1| conserved hypothetical protein [Oxalobacter formigenes HOxBLS] Length = 585 Score = 54.3 bits (129), Expect = 5e-05, Method: Composition-based stats. Identities = 47/284 (16%), Positives = 77/284 (27%), Gaps = 62/284 (21%) Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA--SGTPDVINLG 210 D + S G T RT G+H DE + + +N Sbjct: 216 LKGDPIVLSNGAHLYFLGTNARTA--------QGYHGN----FYFDEFFWTFKFEELNKV 263 Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDDWKRFQIDTRTVEGIDPSF 265 G + + S P +S + Y + FNK ++ +ID + Sbjct: 264 ASGMALHKRWRKT--YFSTPSAMSHEAYPFWTGDAFNKRRRKEEQVRIDVSHKWLAEGRL 321 Query: 266 HEGIIAR-----------------------YGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 E I R Y D + F S PL+ + Sbjct: 322 CEDRIWRQIVTIEDAEKGGCDLFDIDELRNYEYSPDQFDNLLMCNFIDDTA-SVFPLSEL 380 Query: 303 EEALNR-----------EPCPDPYAPLIMGCDIAEEGGDNTVVVLRR------GPVIEHL 345 + + P P+ +G D + G VV+ I Sbjct: 381 QRCMVDSWEAWNDYKPFTARPFGNRPVWVGYDPSRSGDSAGCVVMAPPLTFPGKFRIIEK 440 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 + D +I + +KY + I IDA G + + Sbjct: 441 HQYRGMDFAAQAEQIRQITQKYNVEYIGIDATGMGLGVYEIVRQ 484 >gi|139473519|ref|YP_001128235.1| phage terminase [Streptococcus pyogenes str. Manfredo] gi|134271766|emb|CAM29999.1| putative phage terminase [Streptococcus pyogenes str. Manfredo] Length = 471 Score = 54.0 bits (128), Expect = 5e-05, Method: Composition-based stats. Identities = 53/347 (15%), Positives = 109/347 (31%), Gaps = 47/347 (13%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQ + + A + + + GKT + + LW + G+ ++ Sbjct: 43 PWQENMLIPIMAVDEDGLWVHQKYGYAIPRRN----GKTEVVYIVELWAL--HKGLKILH 96 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 A+ + + +V K+L + + + + + A + S G + + Sbjct: 97 TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKSSGAVIQFRT-- 150 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232 RT + + F +I DEA + +T+ + N IM P Sbjct: 151 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 200 Query: 233 --LSGKFYEIFNKP-------LDDWKRFQIDTRTVEGIDPSF------------HEGIIA 271 +G +E + K W + +D S+ I A Sbjct: 201 MVSTGTVFEAYRKDCLKGNKRYSGWAEWSVDEMQPIHDVKSWYIANPSMGFHLNERKIEA 260 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330 G D ++ G +P + S I + L E P+ + L +G ++G + Sbjct: 261 ELGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 318 Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 + + R + +R I ++ ++ID Sbjct: 319 VSLSIAARTSENKVFVETIDCLSVRNGTQWIINFLKSADIAKVVIDG 365 >gi|82776052|ref|YP_402399.1| putative bacteriophage protein [Shigella dysenteriae Sd197] gi|81240200|gb|ABB60910.1| putative bacteriophage protein [Shigella dysenteriae Sd197] Length = 272 Score = 54.0 bits (128), Expect = 6e-05, Method: Composition-based stats. Identities = 23/143 (16%), Positives = 51/143 (35%), Gaps = 5/143 (3%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-Q 252 + +EA + + + + + ++ NP ++ + F + + Sbjct: 131 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEGTLIRK 188 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310 I+ + + + I A D D + G D + I L+ IE A++ + Sbjct: 189 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKTL 248 Query: 311 CPDPYAPLIMGCDIAEEGGDNTV 333 +P +G D+A+ G D Sbjct: 249 NFEPSGRKRIGFDVADSGTDKCA 271 >gi|238920149|ref|YP_002933664.1| hypothetical protein NT01EI_2255 [Edwardsiella ictaluri 93-146] gi|238869718|gb|ACR69429.1| conserved hypothetical protein [Edwardsiella ictaluri 93-146] Length = 601 Score = 54.0 bits (128), Expect = 6e-05, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 53/161 (32%), Gaps = 18/161 (11%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF----IP 298 P W+ ++ +G + + E + RYG + F F + Sbjct: 334 PDKQWRYVVTMEDAIADGFNRADIEELRERYGE--NAFNRLYMCVFVDDKDSVFDFAKLV 391 Query: 299 LNIIEEALNREPCPDPYAP-----LIMGCDIAEEGGDNTVVVL------RRGPVIEHLFD 347 ++ + ++ PD AP + G D A G + T VV+ + Sbjct: 392 RCGVDPHIWQDFHPDEAAPLGNREVWGGFDPARSGDNATFVVIAVPLLAVERFRVLEKHH 451 Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 W + ++I + +Y I ID G + ++ Sbjct: 452 WRGLSFQWMADQIRTIKSRYNMTHIGIDVTGIGYGVYELVQ 492 >gi|168699883|ref|ZP_02732160.1| hypothetical protein GobsU_10183 [Gemmata obscuriglobus UQM 2246] Length = 205 Score = 54.0 bits (128), Expect = 6e-05, Method: Composition-based stats. Identities = 27/139 (19%), Positives = 48/139 (34%), Gaps = 17/139 (12%) Query: 179 ERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFY 238 + + VG ++ DE S D + + L + + S P G F+ Sbjct: 63 DSQEGVVGFSAPR--LVVIDEGSRVSDELYKSVRPMLAV--SKGQLLTLSTPFGNQGWFF 118 Query: 239 EIF----------NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF 288 +I+ +K + W+R + + I P F E A G + E +F Sbjct: 119 DIWDDSAEGLKRRSKLHEPWQRTAVPASQIPRITPEFLEDERAELGER--WFQQEYFLRF 176 Query: 289 PQQDIDSFIPLNIIEEALN 307 ID+ +I A + Sbjct: 177 LD-SIDAVFSQAVIHGARS 194 >gi|56414686|ref|YP_151761.1| terminase subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150] gi|197363613|ref|YP_002143250.1| terminase subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|56128943|gb|AAV78449.1| probable terminase subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150] gi|197095090|emb|CAR60636.1| probable terminase subunit [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] Length = 588 Score = 54.0 bits (128), Expect = 6e-05, Method: Composition-based stats. Identities = 26/139 (18%), Positives = 47/139 (33%), Gaps = 23/139 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 + + Y D + + +F D+ S PL+ ++ + P Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ V+ G L W D R + I L + Sbjct: 405 WREVWIGYDSAKGTQNGDSAGCVVIAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464 Query: 366 KYRPDAIIIDANNTGARTC 384 +Y I ID+ G Sbjct: 465 QYNVTYIGIDSTGVGHGVY 483 >gi|254286518|ref|ZP_04961475.1| terminase [Vibrio cholerae AM-19226] gi|150423467|gb|EDN15411.1| terminase [Vibrio cholerae AM-19226] Length = 605 Score = 54.0 bits (128), Expect = 6e-05, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 48/181 (26%), Gaps = 23/181 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ I+ G D + + Y F S N I Sbjct: 345 PDKQWRYVVTIEDAAKGGCDLFDIDELREEYSETD--FNNLFMCVFVDGAS-SIFEFNKI 401 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV-------VVLRRGPVIEHL 345 E + Y P + +G D + DN V +V + Sbjct: 402 ERCMVDSEIWQDYKPNAARPFGSREVWLGYDPSRT-RDNAVLMVVAPPIVAVEKFRVLEK 460 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAV 404 W + ++IS + E++ + ID GA D L + Sbjct: 461 HTWRGLSFQHQASEISKVFERFNVTYLGIDITGIGAGVHDLLVNKHPRETVAIHYSNENK 520 Query: 405 D 405 + Sbjct: 521 N 521 >gi|22536761|ref|NP_687612.1| hypothetical protein SAG0585 [Streptococcus agalactiae 2603V/R] gi|22533605|gb|AAM99484.1|AE014218_1 conserved hypothetical protein [Streptococcus agalactiae 2603V/R] Length = 471 Score = 54.0 bits (128), Expect = 6e-05, Method: Composition-based stats. Identities = 57/346 (16%), Positives = 114/346 (32%), Gaps = 47/346 (13%) Query: 54 WQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICL 113 WQ + + +N N + + AI R GKT + L LW + G+ ++ Sbjct: 44 WQENML--IPMMAINEDNLWVHQKYGYAIP--RRNGKTEVVYILELWAL--HKGLKILHT 97 Query: 114 ANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173 A+ + + + +V K+L + + + + + A + S G + + Sbjct: 98 AHRISTSHS-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKSSGSVIQFRT--- 150 Query: 174 RTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR- 232 RT + + F +I DEA + +T+ + N IM P Sbjct: 151 RTSNGGLGEGFD--------LLIIDEAQEYTAEQESALKYTVTDSD-NPMTIMCGTPPTM 201 Query: 233 -LSGKFYEIFNKP-------LDDWKRFQIDTRTVEGIDPSF------------HEGIIAR 272 +G +E + K W + +D S+ I A Sbjct: 202 VSTGTVFESYRKECLKGDRRYSGWAEWSVDEMQPIHDVKSWYVANPSMGYHLNERKIEAE 261 Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN 331 G D ++ G +P + S I + L E P+ + L +G ++G + Sbjct: 262 LGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNNV 319 Query: 332 T-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 + + R + +R I ++ +++D Sbjct: 320 SLSIAARASENKVFVEAIDCLSVRNGTQWIINFLKSADIAKVVVDG 365 >gi|261248365|emb|CBG26202.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] Length = 589 Score = 54.0 bits (128), Expect = 6e-05, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 62/201 (30%), Gaps = 31/201 (15%) Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL----NREPCPDPYA-------PLIM 320 R + + +F S P ++ + P+A P+ + Sbjct: 355 RRENSDEDFKNLFMCEFVDDKA-SVFPFEELQRCMVDVMETWEDFAPFADHPFGSRPVWI 413 Query: 321 GCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIII 374 G D + G VVL G L W D I L EKY + I I Sbjct: 414 GYDPSHTGDSAGCVVLAPPVVSGGKFRMLERHQWKGMDFAAQAEGIRRLTEKYNVEYIGI 473 Query: 375 DANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434 DA G + A + + +T + +K D + L +G Sbjct: 474 DATGLGLGVFQLVR---------SFYPAARGIRYTPEMKTAMVLKAKDTIRRGCLEYDAG 524 Query: 435 ---LIQNLKSLKSFIVPNTGE 452 + Q+ S++ + ++G Sbjct: 525 ATDVTQSFMSIRK-TMTSSGR 544 >gi|41057355|ref|NP_958058.1| gp3 [Enterobacteria phage PsP3] gi|37548561|gb|AAN08365.1| gp3 [Enterobacteria phage PsP3] Length = 589 Score = 54.0 bits (128), Expect = 6e-05, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 62/201 (30%), Gaps = 31/201 (15%) Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL----NREPCPDPYA-------PLIM 320 R + + +F S P ++ + P+A P+ + Sbjct: 355 RRENSDEDFKNLFMCEFVDDKA-SVFPFEELQRCMVDVMETWEDFAPFADHPFGSRPVWI 413 Query: 321 GCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIII 374 G D + G VVL G L W D I L EKY + I I Sbjct: 414 GYDPSHTGDSAGCVVLAPPVVSGGKFRMLERHQWKGMDFAAQAEGIRRLTEKYNVEYIGI 473 Query: 375 DANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434 DA G + A + + +T + +K D + L +G Sbjct: 474 DATGLGLGVFQLVR---------SFYPAARGIRYTPEMKTAMVLKAKDTIRRGCLEYDAG 524 Query: 435 ---LIQNLKSLKSFIVPNTGE 452 + Q+ S++ + ++G Sbjct: 525 ATDVTQSFMSIRK-TMTSSGR 544 >gi|167991605|ref|ZP_02572704.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|205329999|gb|EDZ16763.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] Length = 590 Score = 54.0 bits (128), Expect = 6e-05, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 62/201 (30%), Gaps = 31/201 (15%) Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL----NREPCPDPYA-------PLIM 320 R + + +F S P ++ + P+A P+ + Sbjct: 355 RRENSDEDFKNLFMCEFVDDKA-SVFPFEELQRCMVDVMETWEDFTPFADHPFGSRPVWI 413 Query: 321 GCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIII 374 G D + G VVL G L W D I L EKY + I I Sbjct: 414 GYDPSHTGDSAGCVVLAPPVVSGGKFRMLERHQWKGMDFAAQAEGIRRLTEKYNVEYIGI 473 Query: 375 DANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434 DA G + A + + +T + +K D + L +G Sbjct: 474 DATGLGLGVFQLVR---------SFYPAARGIRYTPEMKTAMVLKAKDTIRRGCLEYDAG 524 Query: 435 ---LIQNLKSLKSFIVPNTGE 452 + Q+ S++ + ++G Sbjct: 525 ATDVTQSFMSIRK-TMTSSGR 544 >gi|153816772|ref|ZP_01969439.1| terminase [Vibrio cholerae NCTC 8457] gi|126512575|gb|EAZ75169.1| terminase [Vibrio cholerae NCTC 8457] Length = 605 Score = 54.0 bits (128), Expect = 6e-05, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 48/181 (26%), Gaps = 23/181 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ I+ G D E + Y F S N I Sbjct: 345 PDRQWRYVVTIEDAAKCGCDLFDIEELREEYSETD--FNNLFMCVFVDGAS-SIFEFNKI 401 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV-------VVLRRGPVIEHL 345 E + Y P + +G D + DN V +V + Sbjct: 402 ERCMVDSEIWQDYKPNAARPFGSREVWLGYDPSRT-RDNAVLMVVAPPIVAVEKFRVLEK 460 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAV 404 W + ++IS + E++ + ID GA D L + Sbjct: 461 HTWRGLSFQHQASEISKVFERFNVTYLGIDITGIGAGVHDLLVNKHPRETVAIHYSNENK 520 Query: 405 D 405 + Sbjct: 521 N 521 >gi|168262802|ref|ZP_02684775.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066] gi|205348497|gb|EDZ35128.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066] Length = 591 Score = 54.0 bits (128), Expect = 6e-05, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 51/168 (30%), Gaps = 20/168 (11%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + + + RY ++ + F DS + + Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIDKLRNRY--NTATFNMLYMCVFVDSK-DSVFSFSDL 384 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346 E + P + G D A G + V++ + + Sbjct: 385 EACGVEVDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIVAPPMFAVEKFRVLKVI 444 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHV 394 W + R +I L +KY + +D G D ++ V Sbjct: 445 YWKGMNFRYQAKQIEQLFKKYNFTYLGVDVTGIGQGIFDNIQHFAMRV 492 >gi|218781804|ref|YP_002433122.1| hypothetical protein Dalk_3968 [Desulfatibacillum alkenivorans AK-01] gi|218763188|gb|ACL05654.1| protein of unknown function DUF264 [Desulfatibacillum alkenivorans AK-01] Length = 443 Score = 54.0 bits (128), Expect = 7e-05, Method: Composition-based stats. Identities = 49/334 (14%), Positives = 99/334 (29%), Gaps = 46/334 (13%) Query: 79 KGAISAGRG-IGKTTLNAWLVLWLMSTR---PGISVICLANSETQLKTTLWAEVSKWLSL 134 + ++ GKT + A L + + + P +A Q K+ +W + K+ Sbjct: 37 RFSVLVCHRRFGKT-VAAVNELIMKACQNPLPAPRYAYIAPLYKQAKSVVWDYLKKFAG- 94 Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 + + H D+ + + PD G + Sbjct: 95 -------AINGTTFHETELRCDLPN----------GARITLLGADNPDRLRGIYLDGA-- 135 Query: 195 IINDEASGTPDVIN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEI--FNKPLDDWKRF 251 + DE + P+ + I L++R + PR + FY++ F + DW Sbjct: 136 -VLDEMAQMPERVWGEIIRPALSDRLGWAMF--IGTPRGHNA-FYDLYQFARSDPDWFCA 191 Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 + + A+ + + E F + ++ +I +A Sbjct: 192 MYRASETGIVGRDELDA--AKKEMTPEQYEQEFECSFSAAIVGAYYGP-LIAQAEKEGRI 248 Query: 312 PDPYAPLIMGCDIAEEGG--DNTVVVLRR---GPVIEHLF--DWSKTDLRTTNNKISGLV 364 + A + G D+T V + G I + + + L + Sbjct: 249 VTLPVERALPVHTAWDLGMSDSTAVWFFQVSPGGEIRVVDYLEDAGQGLDYYVRALRERD 308 Query: 365 EKYR----PDAIIIDANNTGARTCDYLEMLGYHV 394 Y P I + TG + + LG Sbjct: 309 YLYGTHLAPHDIRVRELGTGKSRLESAKSLGVSF 342 >gi|163792602|ref|ZP_02186579.1| hypothetical protein BAL199_17183 [alpha proteobacterium BAL199] gi|159182307|gb|EDP66816.1| hypothetical protein BAL199_17183 [alpha proteobacterium BAL199] Length = 422 Score = 54.0 bits (128), Expect = 7e-05, Method: Composition-based stats. Identities = 74/421 (17%), Positives = 124/421 (29%), Gaps = 67/421 (15%) Query: 82 ISAGRGIGKTTLNA-WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140 I AGRG GKT A W+ S R + +A + + + Sbjct: 45 ILAGRGFGKTRTGAEWVRGLAESGR-ARRIALVAETAADARDVM---------------I 88 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE- 199 L APW S + + ++S + PD G A DE Sbjct: 89 EGESGLLACCAPWGRPKYEPSKRRVTWPNGAIATSFSADDPDQLRGPQFD---AAWADEI 145 Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE 259 A + ++ L A+ + T+ P + + P + TR Sbjct: 146 AKWRYEAAWDNLMLGL-RLGADPRCVATTTP-KPRAWLARLMADPG------TVVTRGAT 197 Query: 260 GID-----PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP 314 + P F + I+ARY + + R E+ G+F + + +IE A Sbjct: 198 RENAGNLAPGFLDQILARY-AGTRLGRQEIDGEFLTEIPGALWTRTLIEGARALPGAVPG 256 Query: 315 YAPLIMGCDIA---EEGGDNTVVVLRRGPVIEHLFDWSKTDLRTT----NNKISGLVEKY 367 A +I+ D A D T +V+ + R + + + ++ Sbjct: 257 LARIIVAVDPAVTSGSDSDETGIVVAGVDGEGRFWVLEDLSGRMSPDLWARRSADAYRRH 316 Query: 368 RPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFA 427 DA++ + N G L + + RAV + R E + + Sbjct: 317 HADAVVCEVNQGGDLVVATLRTVDGSL-----PVRAVRATRGKRLRAEPVAALYEQGRVR 371 Query: 428 SLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLM-----YTFAENPPR 482 L + G S D D L+ F P R Sbjct: 372 HAGPFPELEDQMAG-----FTGAS----------GDASPDRLDALVWALTDLAFDRPPAR 416 Query: 483 S 483 S Sbjct: 417 S 417 >gi|192289100|ref|YP_001989705.1| hypothetical protein Rpal_0670 [Rhodopseudomonas palustris TIE-1] gi|192282849|gb|ACE99229.1| protein of unknown function DUF264 [Rhodopseudomonas palustris TIE-1] Length = 441 Score = 53.6 bits (127), Expect = 8e-05, Method: Composition-based stats. Identities = 45/235 (19%), Positives = 75/235 (31%), Gaps = 19/235 (8%) Query: 175 TYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRR 232 T PDT G + + DE + D I + ++ + N T N Sbjct: 114 TALPANPDTARGFSSN----VFLDEFAFHKDSREIWKALFPVISAGH-NLRVTSTGN--G 166 Query: 233 LSGKFYEIFNKPLDDWKRFQIDTR-TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQ 291 KFYE+ D W R +D VE P + + A D D E ++ + Sbjct: 167 KDNKFYELATGKDDVWSRHFVDIYKAVEDGLPRNIDELKAGI-NDDDAWAQEYELKWLDE 225 Query: 292 DID--SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHL 345 S+ + E+ +P P +G DI D V+ + Sbjct: 226 ASAWLSYDLITACEDPRAGDPSGYRNNPCFVGRDIGRRN-DLHVIWVWELIGDVLWERER 284 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE-MLGYHVYRVLG 399 + + + ++ +YR ID G + + + G V VL Sbjct: 285 IEQKRATFAAMDAAFDDVMTRYRVARACIDQTGMGEKVVEDAQAKWGSVVEGVLF 339 >gi|282599774|ref|ZP_05971828.2| terminase, ATPase subunit [Providencia rustigianii DSM 4541] gi|282567779|gb|EFB73314.1| terminase, ATPase subunit [Providencia rustigianii DSM 4541] Length = 594 Score = 53.6 bits (127), Expect = 8e-05, Method: Composition-based stats. Identities = 40/263 (15%), Positives = 80/263 (30%), Gaps = 31/263 (11%) Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWK-RFQIDTRTVEGIDPS 264 G + R + L + P W+ ++ G + + Sbjct: 302 PFWTGDE--WRGSDPARKKVKFPQFDELRDGGRDC---PDGQWRYVITLEDAIKGGFNLA 356 Query: 265 FHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF---------IPLNIIEEALNREPCPDPY 315 E + +Y DS + F F + +++ E+ P P Sbjct: 357 SIERLRNKYNPDS--FNMLFMCVFVDSGASVFKYHQLDKCGVDVHLWEDHNPDAPRPFGD 414 Query: 316 APLIMGCDIAEEGGDNTVVVLR------RGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369 + G D A G +T ++ + +F W + + I L ++YR Sbjct: 415 REVWGGFDPARSGDTSTFAIVAPPMMAPEVFRVLAIFYWQGMNWKHQAKLIEDLTKRYRF 474 Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL 429 I ID G + ++ ++ + + + + +L +KM D + L Sbjct: 475 TYIGIDTTGIGHGVYEMVQD--------FAPRQTHSIHYSQQTKNQLVMKMIDVVSEERL 526 Query: 430 INHSGLIQNLKSLKSFIVPNTGE 452 + L S S TG+ Sbjct: 527 EWDEEQKEILASFLSIRHTTTGK 549 >gi|331650737|ref|ZP_08351769.1| conserved hypothetical protein [Escherichia coli M605] gi|331040418|gb|EGI12616.1| conserved hypothetical protein [Escherichia coli M605] Length = 158 Score = 53.6 bits (127), Expect = 8e-05, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 46/133 (34%), Gaps = 29/133 (21%) Query: 322 CDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYRPDAIIIDANNT 379 D+A+EG D R G ++E++ +WS +D+ + K+ G E+ + D + Sbjct: 1 MDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDNLEEFRFDEDGL 60 Query: 380 GART------CDYLEMLGYH---------------------VYRVLGQKRAVDLEFCRNR 412 GA + L V GQ ++ +F N Sbjct: 61 GAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQAARLNKDFFANA 120 Query: 413 RTELHVKMADWLE 425 + + ++ + Sbjct: 121 KAQSWWRLRKLFQ 133 >gi|168704532|ref|ZP_02736809.1| hypothetical protein GobsU_33659 [Gemmata obscuriglobus UQM 2246] Length = 209 Score = 53.6 bits (127), Expect = 9e-05, Method: Composition-based stats. Identities = 27/139 (19%), Positives = 48/139 (34%), Gaps = 17/139 (12%) Query: 179 ERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFY 238 + + VG ++ DE S D + + L + + S P G F+ Sbjct: 63 DSQEGVVGFSAPR--LVVIDEGSRVSDELYKSVRPMLAV--SKGQLLTLSTPFGNQGWFF 118 Query: 239 EIFN----------KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF 288 +I++ K + W+R + + I P F E A G + E +F Sbjct: 119 DIWDDSAEGLKRRAKLHEPWQRTAVPASQIPRITPEFLEDERAELGER--WFQQEYFLRF 176 Query: 289 PQQDIDSFIPLNIIEEALN 307 ID+ +I A + Sbjct: 177 LD-SIDAVFSQAVIHGARS 194 >gi|134295281|ref|YP_001119016.1| hypothetical protein Bcep1808_1170 [Burkholderia vietnamiensis G4] gi|134138438|gb|ABO54181.1| protein of unknown function DUF264 [Burkholderia vietnamiensis G4] Length = 458 Score = 53.2 bits (126), Expect = 9e-05, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 78/247 (31%), Gaps = 31/247 (12%) Query: 246 DDWKRFQID-TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 D+ FQI+ + + + + + G+ + + + G+F + IE+ Sbjct: 201 GDYAHFQINPGDNAQNLSADYLDTLK---GMSPRLQKRFLRGEFSDATPNQLFAEETIEK 257 Query: 305 ALNREPCPDPYAP-LIMGCDIAEEGG-DNT--------VVVLRRGPVIEHLFDWSKTDLR 354 + P P +++ D + G DN VV L L D + Sbjct: 258 WRHGTDQPLPDFVRVVVAVDPSGSGDVDNADNDAIGIIVVGLGTDGRAYVLDDCTVKAGP 317 Query: 355 TTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRT 414 T + +++ N G ++ R + V + R Sbjct: 318 ATWGSVVASAYDRHAGDVVVGETNYGGAMVQHV----VQTARARTPFKQVTASRGKAVRA 373 Query: 415 ELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMY 474 E + + + H G+ + L+ + G + G +S + +D L++ Sbjct: 374 EPFSSLYEQ----GKVRHVGIFRELED-ELTAFSTVGYI--------GERSPNRADALIW 420 Query: 475 TFAENPP 481 E P Sbjct: 421 ALTEIFP 427 >gi|161613293|ref|YP_001587258.1| hypothetical protein SPAB_01003 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] gi|161362657|gb|ABX66425.1| hypothetical protein SPAB_01003 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] Length = 443 Score = 53.2 bits (126), Expect = 9e-05, Method: Composition-based stats. Identities = 59/360 (16%), Positives = 105/360 (29%), Gaps = 65/360 (18%) Query: 81 AISAGRGIGKTTLNAWLVLWLMST---RPGISV-------ICLANSETQLKTTLWAEVSK 130 A+ GR GKT + + + + RPG+ + I A E + Sbjct: 27 AVRCGRRWGKTFMLSSAAVTYATAPFKRPGMDIELGGRVGIFTA------------EYRQ 74 Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT 190 + + + L + + + + G Sbjct: 75 YQEIYDKLEEILLP---LKKSFSRQEKRLLLKNGGKIDFWVT-------NDNKLAGRGRE 124 Query: 191 YGMAIINDEASGTPDV-----IN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP 244 Y + +I DEA+ T I I L + T + FY I + Sbjct: 125 YEIILI-DEAAFTKSPEMLREIWPKSIKPTLLTTKGRAYVFSTPDGVDEENFFYAICHDK 183 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 + T + + P E A D V R E +F S + E Sbjct: 184 NLGFIEHHAPTSSNPFVPPEELEKEEAN--NDPRVFRQEFLAEFVDWSAASLFDIRKWFE 241 Query: 305 ALNREPC---PDPYAPLIMGCDIAEEGG---DNTVVVL-----RRGPVIEHLFDWSKTDL 353 N++ P+ + D A +GG D T VV R G + DW + Sbjct: 242 GENQDQPVDYPEMCQAVFAVMDTAVKGGSEHDGTAVVYYAVDTRPGIQRLTILDWDVVQI 301 Query: 354 R---------TTNNKISGLVEKYRPD----AIIIDANNTGARTCDYLEMLGYHVYRVLGQ 400 + ++++ L + + I+ + G+ E LG+ V ++ Sbjct: 302 DGALLETWMPSVFDRLNELSGQCVAINGSLGVFIEDASMGSILLQKGESLGWPVNKIESA 361 >gi|238762068|ref|ZP_04623041.1| terminase, ATPase subunit [Yersinia kristensenii ATCC 33638] gi|238699796|gb|EEP92540.1| terminase, ATPase subunit [Yersinia kristensenii ATCC 33638] Length = 595 Score = 53.2 bits (126), Expect = 1e-04, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 52/163 (31%), Gaps = 20/163 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + E + +Y D+ + F DS +++ Sbjct: 333 PDGQWRYVITLEDAIDGGFNLANIERLRNKYNRDT--FNMLYMCVFVDSG-DSVFKFHML 389 Query: 303 EEALNREPCPDPYA----------PLIMGCDIAEEGGDNTVVVLR----RGPVIEHLFD- 347 E+ + + G D A G +T V++ G L Sbjct: 390 EKCGVDIEMWQDHDFSAPRPFGNREVWGGFDPARSGDTSTFVIIAPPQFEGERFRVLATF 449 Query: 348 -WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 W + N+I L ++Y I +D G + ++ Sbjct: 450 YWQGLNFNYQANQIKELFQRYNMTYIGVDITGIGNGVFELVQN 492 >gi|168822412|ref|ZP_02834412.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|205341123|gb|EDZ27887.1| putative conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] Length = 589 Score = 53.2 bits (126), Expect = 1e-04, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 62/201 (30%), Gaps = 31/201 (15%) Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL----NREPCPDPYA-------PLIM 320 R + + +F S P ++ + P+A P+ + Sbjct: 355 RRENSDEDFKNLFMCEFVDDKA-SVFPFEELQRCMVDVMETWEDFAPFADHPFGSRPVWI 413 Query: 321 GCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIII 374 G D + G VVL G L W D I L EKY + I I Sbjct: 414 GYDPSHTGDSAGCVVLAPPVVSGGKFRMLERHQWKGMDFAAQAEGIRKLTEKYNVEYIGI 473 Query: 375 DANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434 DA G + A + + +T + +K D + L +G Sbjct: 474 DATGLGLGVFQLVR---------SFYPAARGIRYTPEMKTAMVLKAKDTIRRGCLEYDAG 524 Query: 435 ---LIQNLKSLKSFIVPNTGE 452 + Q+ S++ + ++G Sbjct: 525 ATDVTQSFMSIRK-TMTSSGR 544 >gi|300310242|ref|YP_003774334.1| DNA-dependent ATPase terminase subunit [Herbaspirillum seropedicae SmR1] gi|300073027|gb|ADJ62426.1| DNA-dependent ATPase terminase subunit [Bacteriophage phi CTX] related protein [Herbaspirillum seropedicae SmR1] Length = 593 Score = 53.2 bits (126), Expect = 1e-04, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 36/132 (27%), Gaps = 18/132 (13%) Query: 270 IARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-----------EPCPDPYAPL 318 + + D + F S PL ++ + P + P+ Sbjct: 358 LRDFEYSPDQFDNLLMCNFIDDSA-SVFPLADLQRGMVDSWVDWDDYKPFTARPFGHRPV 416 Query: 319 IMGCDIAEEGGDN--TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAI 372 +G D + G +V+ L G L W D I + +Y I Sbjct: 417 WIGYDPSLTGDSAGCSVIAPPLIPGGNFRILERHQWRGKDFAEQAALIKEMCGRYNVQYI 476 Query: 373 IIDANNTGARTC 384 ID G Sbjct: 477 GIDTTGMGVGVY 488 >gi|51596097|ref|YP_070288.1| phage terminase subunit GpP [Yersinia pseudotuberculosis IP 32953] gi|51589379|emb|CAH21001.1| terminase subunit [Enterobacteria phage 186] gb|AAC3414 [Yersinia pseudotuberculosis IP 32953] Length = 590 Score = 53.2 bits (126), Expect = 1e-04, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 42/137 (30%), Gaps = 22/137 (16%) Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIM 320 YG + + +F S P ++ + + P+ + Sbjct: 355 EYGPSE--YQNLLMCEFVDDQA-SVFPFAELQACMVDSLEEWEDYNPYSLRPFGHRPVWI 411 Query: 321 GCDIAE-EGGDNT---VVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAI 372 G D +E GGD+ V+ + G L W D I L +KY + I Sbjct: 412 GYDPSEANGGDSAGCAVIAPPMVPGGKFRVLERHQWKGMDFEAQAKHIEELTQKYCVEYI 471 Query: 373 IIDANNTGARTCDYLEM 389 IDA G + Sbjct: 472 GIDATTVGQGVFQLVRQ 488 >gi|15675368|ref|NP_269542.1| hypothetical protein SPy_1460 [Streptococcus pyogenes M1 GAS] gi|13622552|gb|AAK34263.1| conserved hypothetical protein - phage associated [Streptococcus pyogenes M1 GAS] Length = 471 Score = 53.2 bits (126), Expect = 1e-04, Method: Composition-based stats. Identities = 53/347 (15%), Positives = 112/347 (32%), Gaps = 47/347 (13%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQ+ + + A N + + GKT + + LW + G+ ++ Sbjct: 43 PWQVNMLIPIMAIDENGLWVHQKYGYAIPRRN----GKTEVVYIVQLWAL--HKGLKILH 96 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 A+ + + +V K+L + + + + + A + + G + + Sbjct: 97 TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKASGAVIQFRT-- 150 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232 RT + + F +I DEA + +T+ + N IM P Sbjct: 151 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 200 Query: 233 --LSGKFYEIFNKP----------LDDWKRFQ------IDTRTVEGIDPSFH---EGIIA 271 +G +E + K +W + + + + FH I A Sbjct: 201 MVSTGTVFEAYRKDCLKGNKRYSGWAEWSVPEMVKINDVSSWYISNPSMGFHLNERKIEA 260 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330 G D ++ G +P + S I + L E P+ + L +G ++G + Sbjct: 261 ELGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 318 Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 + + R + +R I ++ ++ID Sbjct: 319 VSLSIAARTSENKVFVETIDCLSVRNGTQWIINFLKSADIAKVVIDG 365 >gi|261885730|ref|ZP_06009769.1| hypothetical protein CfetvA_11664 [Campylobacter fetus subsp. venerealis str. Azul-94] Length = 560 Score = 53.2 bits (126), Expect = 1e-04, Method: Composition-based stats. Identities = 43/307 (14%), Positives = 90/307 (29%), Gaps = 41/307 (13%) Query: 195 IINDEASGTPDV--INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 I DE + P+ I + + + S P F+E+++ + + Sbjct: 251 IWMDEFAWYPNPKKIWHAFVPSI--GAIKGRLTILSTPFEERSLFHELYSDESKYYMFKR 308 Query: 253 IDTRTVEGIDPSFHEGIIARYGL-DSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL---NR 308 I+ + L D+D QF + S + + +I+ + Sbjct: 309 FCVSIYSAIEDGLDFDLETMRNLFDADTWASAYECQFVDDES-SLLSIALIKSCVYDKAS 367 Query: 309 EPCPDPYAPLIMGCDIAEEGGDNT---VVVLRRGPVIEHLFDWSKTDLRTTNN------- 358 P + G DI +T VV+ S+ Sbjct: 368 YYTPKSNQVIYAGYDIGRVSDRSTLAGVVLEDVTNRARGQRSLSQGGRYIVAMMDVLAKA 427 Query: 359 -------KISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAVDLEFCR 410 ++ ++ Y + ID G + + + V V + Sbjct: 428 KFDEQKEHLTSFLKTYPLSVLKIDKTGIGMNLAENIHDKFRSRVSGVWFSNTRKE----- 482 Query: 411 NRRTELHVKMADWLE--FASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDY 468 E+ + + E S+ N LI ++ ++K I + + ++KR + + D Sbjct: 483 ----EMALNLKKAFEDKLISIPNDPLLIADIHAIKRTIGAKSFKY--DAKRNEYGHA-DR 535 Query: 469 SDGLMYT 475 L Sbjct: 536 FWALALA 542 >gi|304360860|ref|YP_003856980.1| gp8 [Mycobacterium phage CrimD] gi|302858609|gb|ADL71354.1| gp8 [Mycobacterium phage CrimD] Length = 473 Score = 53.2 bits (126), Expect = 1e-04, Method: Composition-based stats. Identities = 65/385 (16%), Positives = 118/385 (30%), Gaps = 49/385 (12%) Query: 52 RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVI 111 WQ + ++V A S ++F +I R GKT +V L PG +VI Sbjct: 43 DQWQDDLGKLVCAK--RSDGLYAADMFAMSIP--RQTGKTYFLGAIVFALCKMTPGTTVI 98 Query: 112 CLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYST 171 A+ +T AE K + L + L++H G ++ ++ Sbjct: 99 WTAH-----RTRTAAETFKSMQALAKREQIAPHILNVH----------TGNGKEAVLFTN 143 Query: 172 MCRTYSEERPDTF-VGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230 R R F G +I DEA + ++ T + N + P Sbjct: 144 GSRILFGAREKGFGRGF--AKVDVLIFDEAQILSENAMDDMVPA-TNASPNGLILFAGTP 200 Query: 231 RRLS--GKFY-----EIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGI------------ 269 + + G+ + + N DD + D + ++ + Sbjct: 201 PKPTDPGEVFTNLRLDAINGESDDVAYVEISADENDDPDEESTWRKMNPSYPHRTSARAI 260 Query: 270 -IARYGLDSDVTRVEVCGQFPQQDID-SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE 327 R L D R E G + + + I ++ + + P +G D++ Sbjct: 261 RRMRKALSWDSFRREAMGIWDKISVHAQVIKPSLWRDLADPLGPEPGAKPASLGVDMSHG 320 Query: 328 GGDNTVVVLRRGPVIEHLFD-WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY 386 G + + H+ W+ TD I R ++ID + Sbjct: 321 GAISIGGCWLIDDELRHVEQVWAGTDTAAAVEFIVERAG--RRIPVVIDDASPAKSLVPE 378 Query: 387 LEMLGYHVYRVLGQKRAVDLEFCRN 411 L+ V A +N Sbjct: 379 LKRRKVKVRITYAGDMAKACGLFKN 403 >gi|50914563|ref|YP_060535.1| Phage terminase [Streptococcus pyogenes MGAS10394] gi|50903637|gb|AAT87352.1| Phage terminase [Streptococcus pyogenes MGAS10394] Length = 476 Score = 52.8 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 53/347 (15%), Positives = 112/347 (32%), Gaps = 47/347 (13%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQ+ + + A N + + GKT + + LW + G+ ++ Sbjct: 48 PWQVNMLIPIMAIDENGLWVHQKYGYAIPRRN----GKTEVVYIVELWAL--HKGLKILH 101 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 A+ + + +V K+L + + + + + A + + G + + Sbjct: 102 TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKASGAVIQFRT-- 155 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232 RT + + F +I DEA + +T+ + N IM P Sbjct: 156 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 205 Query: 233 --LSGKFYEIFNKP----------LDDWKRFQ------IDTRTVEGIDPSFH---EGIIA 271 +G +E + K +W + + + + FH I A Sbjct: 206 MVSTGTVFEAYRKDCLKGNKRYSGWAEWSVPEMVKINDVSSWYISNPSMGFHLNERKIEA 265 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330 G D ++ G +P + S I + L E P+ + L +G ++G + Sbjct: 266 ELGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 323 Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 + + R + +R I ++ ++ID Sbjct: 324 VSLSIAARTSENKVFVETIDCLSVRNGTQWIINFLKSADIAKVVIDG 370 >gi|19746414|ref|NP_607550.1| hypothetical protein spyM18_1474 [Streptococcus pyogenes MGAS8232] gi|19748615|gb|AAL98049.1| conserved hypothetical phage protein [Streptococcus pyogenes MGAS8232] Length = 471 Score = 52.8 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 53/347 (15%), Positives = 112/347 (32%), Gaps = 47/347 (13%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQ+ + + A N + + GKT + + LW + G+ ++ Sbjct: 43 PWQVNMLIPIMAIDENGLWVHQKYGYAIPRRN----GKTEVVYIVELWAL--HKGLKILH 96 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 A+ + + +V K+L + + + + + A + + G + + Sbjct: 97 TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKASGAVIQFRT-- 150 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232 RT + + F +I DEA + +T+ + N IM P Sbjct: 151 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 200 Query: 233 --LSGKFYEIFNKP----------LDDWKRFQ------IDTRTVEGIDPSFH---EGIIA 271 +G +E + K +W + + + + FH I A Sbjct: 201 MVSTGTVFEAYRKDCLKGNKRYSGWAEWSVPEMVKINDVSSWYISNPSMGFHLNERKIEA 260 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330 G D ++ G +P + S I + L E P+ + L +G ++G + Sbjct: 261 ELGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 318 Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 + + R + +R I ++ ++ID Sbjct: 319 VSLSIAARTSENKVFVETIDCLSVRNGTQWIINFLKSADIAKVVIDG 365 >gi|21910651|ref|NP_664919.1| putative terminase - phage associated [Streptococcus pyogenes MGAS315] gi|28876285|ref|NP_795519.1| putative terminase [Streptococcus pyogenes phage 315.3] gi|28895661|ref|NP_802011.1| hypothetical protein SPs0749 [Streptococcus pyogenes SSI-1] gi|21904853|gb|AAM79722.1| putative terminase - phage-associated [Streptococcus pyogenes MGAS315] gi|28810910|dbj|BAC63844.1| conserved hypothetical protein (phage associated) [Streptococcus pyogenes SSI-1] Length = 471 Score = 52.8 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 53/347 (15%), Positives = 112/347 (32%), Gaps = 47/347 (13%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQ+ + + A N + + GKT + + LW + G+ ++ Sbjct: 43 PWQVNMLIPIMAIDENGLWVHQKYGYAIPRRN----GKTEVVYIVELWAL--HKGLKILH 96 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 A+ + + +V K+L + + + + + A + + G + + Sbjct: 97 TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKASGAVIQFRT-- 150 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232 RT + + F +I DEA + +T+ + N IM P Sbjct: 151 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 200 Query: 233 --LSGKFYEIFNKP----------LDDWKRFQ------IDTRTVEGIDPSFH---EGIIA 271 +G +E + K +W + + + + FH I A Sbjct: 201 MVSTGTVFEAYRKDCLKGNKRYSGWAEWSVPEMVKINDVSSWYISNPSMGFHLNERKIEA 260 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330 G D ++ G +P + S I + L E P+ + L +G ++G + Sbjct: 261 ELGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 318 Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 + + R + +R I ++ ++ID Sbjct: 319 VSLSIAARTSENKVFVETIDCLSVRNGTQWIINFLKSADIAKVVIDG 365 >gi|71911002|ref|YP_282552.1| phage terminase [Streptococcus pyogenes MGAS5005] gi|71853784|gb|AAZ51807.1| phage terminase [Streptococcus pyogenes MGAS5005] Length = 471 Score = 52.8 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 53/347 (15%), Positives = 112/347 (32%), Gaps = 47/347 (13%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQ+ + + A N + + GKT + + LW + G+ ++ Sbjct: 43 PWQVNMLIPIMAIDENGLWVHQKYGYAIPRRN----GKTEVVYIVELWAL--HKGLKILH 96 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 A+ + + +V K+L + + + + + A + + G + + Sbjct: 97 TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKASGAVIQFRT-- 150 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232 RT + + F +I DEA + +T+ + N IM P Sbjct: 151 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 200 Query: 233 --LSGKFYEIFNKP----------LDDWKRFQ------IDTRTVEGIDPSFH---EGIIA 271 +G +E + K +W + + + + FH I A Sbjct: 201 MVSTGTVFEAYRKDCLKGNKRYSGWAEWSVPEMVKINDVSSWYISNPSMGFHLNERKIEA 260 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330 G D ++ G +P + S I + L E P+ + L +G ++G + Sbjct: 261 ELGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 318 Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 + + R + +R I ++ ++ID Sbjct: 319 VSLSIAARTSENKVFVETIDCLSVRNGTQWIINFLKSADIAKVVIDG 365 >gi|148727151|ref|YP_001285645.1| putative terminase large subunit [Aeromonas phage phiO18P] gi|110349286|gb|ABG73174.1| putative terminase large subunit [Aeromonas phage phiO18P] Length = 604 Score = 52.8 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 46/139 (33%), Gaps = 19/139 (13%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP-------- 317 E + Y + V +F D S +E A + Y P Sbjct: 363 IEELKDEYPEE--VFDRLYMCRFID-DALSVFKFQDMERAGVDPTRWEDYKPGRPDPFGR 419 Query: 318 --LIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369 + MG D + + T+VV+ + W + + +I + +K+R Sbjct: 420 REVWMGYDPSRTRDNATLVVVAPPTVAGERFRVLEKHYWRGLNFQYQAQEIERIAKKFRV 479 Query: 370 DAIIIDANNTGARTCDYLE 388 + +D + GA D L+ Sbjct: 480 TYLGVDVSGIGAGVYDLLK 498 >gi|261251508|ref|ZP_05944082.1| terminase [Vibrio orientalis CIP 102891] gi|260938381|gb|EEX94369.1| terminase [Vibrio orientalis CIP 102891] Length = 594 Score = 52.8 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 49/180 (27%), Gaps = 21/180 (11%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ +G D E + Y + + F S N + Sbjct: 332 PDKQWRYVITMEDAVAQGFDLVDIEDLRDEY--SDNEFKNLFMCIFVDGAA-SIFEFNKV 388 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346 + + P + +G D + + +VV+ + Sbjct: 389 MRCMVDSKQWQDFDPKAKRPIGAREVWLGYDPSRTRDNACLVVVAPPALPGEKFRVLEKH 448 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAVD 405 W + + +I + + Y + ID GA D L + + + Sbjct: 449 YWKGLNFQYQAKQIGEVFKCYNVTYLGIDVTGIGAGVYDLLSKQHPREAVAIHYSNDNKN 508 >gi|295111846|emb|CBL28596.1| Mu-like prophage FluMu protein gp28 [Synergistetes bacterium SGP1] Length = 532 Score = 52.8 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 68/430 (15%), Positives = 134/430 (31%), Gaps = 62/430 (14%) Query: 78 FKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137 F+ + + + IG + L VL R ++ A+ + + KW L Sbjct: 143 FRVCLKSRQ-IGFSFLLGLEVLLGAIERGDNQIVISASQDQ--SDIVRNYAVKWCKDL-- 197 Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197 + D + + Y C P T G+ + Sbjct: 198 ------------DVDYLEDGGNIIFPGGAIAYFLPC------NPRTVQGYTGD----VYL 235 Query: 198 DEAS----GTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP--LDDWKRF 251 DE + G ++ + T + +TS P + F EI P + R Sbjct: 236 DEFAWHMRG--RLMWQAAVPAATTKGKR--LTVTSTPYTETDMFGEIVTNPDKYPRFSRH 291 Query: 252 QIDTRTVEGIDPSFHEGIIARYGL-DSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 + + I GL D+ +F ++ + + + L+ + Sbjct: 292 TVTIYDA--VKDGHQVDIEELRGLFDAITFAQAYECRFFADELC-LLQPDEVRAVLDDDC 348 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVI--------EHLFDWSKTDLRTTNNKISG 362 A + G DI D T +VL + H+ ++ + ++G Sbjct: 349 LRHVSAWVNGGVDIGRT-KDVTAIVLAEQLQVEAEKLVFVRHMETLARMAFDGQRSHMAG 407 Query: 363 LVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422 LVE ++ + +DA G + + ++ L + + F R ++ E+ + + Sbjct: 408 LVEGWKIRRLAMDATGIGMQLSEDMQRL--------YPGKVERVHFTREKKEEMALSVKK 459 Query: 423 WLEF--ASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480 E + N L+ L ++K G ++ R + K D L E Sbjct: 460 LFETQRIRIPNDRDLVMQLHAIKR-KPTEKG-FTYDADRNEQIKHADLFWALALAVKEFG 517 Query: 481 PRSDMDFGRC 490 R + R Sbjct: 518 GRRRVLTARN 527 >gi|15894418|ref|NP_347767.1| hypothetical protein CA_C1134 [Clostridium acetobutylicum ATCC 824] gi|15024053|gb|AAK79107.1|AE007629_13 Phage related protein, YonF B.subtilis homolog [Clostridium acetobutylicum ATCC 824] gi|325508547|gb|ADZ20183.1| Phage related protein [Clostridium acetobutylicum EA 2018] Length = 589 Score = 52.8 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 68/484 (14%), Positives = 151/484 (31%), Gaps = 87/484 (17%) Query: 54 WQLEFMEVVDAHCLNSVNNPNPE--VFKGAISAG-----------RGIGKTTLNAWLVLW 100 W +++++ + ++ +P K + R +GKT + ++ Sbjct: 45 WWRQYLDIFIENYFSTEKSPVRFYDFQKVIVRECGNCSIVRDTEARSMGKTFKMSRVLAG 104 Query: 101 LMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHC 160 L P ++ ++N+ Q ++ N +Q + + Sbjct: 105 LAILYPQNKILIVSNTVRQ-AILTVKYINDLGEENANFAREIIQPIKISKDG-------- 155 Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT-PDVINLGILGFL---- 215 + K+ S + + G I DE++ VI ++ L Sbjct: 156 -AKVKFKNGSEIEAMAMNKDGSNIRG---ERRKIIYIDESAWVMSSVIQSVLIPMLRYNR 211 Query: 216 -----------TERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPS 264 + + I TS+ S +Y+ F + + D + D + Sbjct: 212 KVVENNRLKGLAFEDFSSKLIETSSAYLKSCDYYQRFKETIQDIRDGYKDRFCCALSYKT 271 Query: 265 FHE--------GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA-----LNREPC 311 + + + V +E +F + S+ P ++ E + + Sbjct: 272 AVRCGIVEEDFVLEQKKKMPLSVWEMEWNSKFVGETEGSYFPYDLTEPCRDFSHVEIQQP 331 Query: 312 PDPYAPLIMGCDIAEEGG---DNTVVVL------RRGPVIEHLF---DWSKTDLRTTNNK 359 + + ++ D+A DN + + G +++ + L + Sbjct: 332 KNSMSRYVLSLDVATSEDKKADNACITVIKIVPKNDGTYEKYIVFIRTYHGYSLEMLAEQ 391 Query: 360 ISGLVEKYRPD-AIIIDANNTGARTCDYL-----EMLGYHV-------YRVLGQKRAVDL 406 + ++ +IIDAN G L + LG V ++A+++ Sbjct: 392 VRITCCRFPNIIKVIIDANAIGEGVVSLLNIPYVDDLGREYPPLIKDTIEVSDSRKAINI 451 Query: 407 ---EFCRNRRTE-LHVKMADWLEFASL---INHSGLIQNLKSLKSFIVPNTGELAIESKR 459 N++ E + V +LE SL I + + ++ K I +TG SK Sbjct: 452 ISAIKADNKKNENMAVHTLLFLENHSLHIPIPSVKIRRQIEEQKIIIKDDTGTKRKISKE 511 Query: 460 VKGA 463 G Sbjct: 512 EVGV 515 >gi|109302915|ref|YP_654730.1| hypothetical protein F108p19 [Pasteurella phage F108] gi|73918076|gb|AAZ93654.1| unknown [Pasteurella phage F108] Length = 603 Score = 52.8 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 30/183 (16%), Positives = 61/183 (33%), Gaps = 25/183 (13%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSF----IPLNIIEEALNREPCPDPYAP---- 317 + + +Y + F + ++ A ++ P+ P Sbjct: 367 IDALKQKYSKY--AFAQLFMCVWVDDADSIFNIKKLLKCGVDIAKWKDHNPNDARPFGAR 424 Query: 318 -LIMGCDIAEEGGDNTVVV------LRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370 + G D A G + V+ L+ + + W+ + +I L EKY Sbjct: 425 EVWGGYDPAHSGDGASFVIVAPPALLKEKYRVLARYQWNGLSYKYQAAQIKQLFEKYNMT 484 Query: 371 AIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI 430 I IDA G + ++ ++AV L + +TE+ +K+ D +E + Sbjct: 485 YIGIDATGVGYGVYEQVKE--------FAGRKAVPLVYNPESKTEMVLKVHDLVEHEQIE 536 Query: 431 NHS 433 Sbjct: 537 WDE 539 >gi|123441220|ref|YP_001005207.1| terminase, ATPase subunit [Yersinia enterocolitica subsp. enterocolitica 8081] gi|122088181|emb|CAL10969.1| terminase, ATPase subunit [Yersinia enterocolitica subsp. enterocolitica 8081] Length = 725 Score = 52.8 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 45/157 (28%), Gaps = 20/157 (12%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316 E I S QF F + +E+ L + + Sbjct: 378 LEEIREENAESS--FNQLYMCQFVDTGDCVF-RFDQLEKCLTNVSTWEDHDVNAMRPFGN 434 Query: 317 -PLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369 + G D A G + V++ + H+ W + +I + +Y Sbjct: 435 REVWAGYDPARTGDTASFVLVAPPQVDGEPFRVLHIETWHGFAFKYQVGRIKEYMARYNI 494 Query: 370 DAIIIDANNTGARTCDYLEML-GYHVYRVLGQKRAVD 405 I ID G C+ ++ V + + + + Sbjct: 495 THIGIDTTGIGGPVCEMVQDFARREVTPIRYSQESKN 531 >gi|330810733|ref|YP_004355195.1| phage terminase protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327378841|gb|AEA70191.1| Putative phage terminase protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 439 Score = 52.8 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 67/418 (16%), Positives = 122/418 (29%), Gaps = 75/418 (17%) Query: 78 FKGAISAGRGIGKTTL--------NAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS 129 F+ A+ GR GKT L W +S + A ++ Q + W + Sbjct: 32 FRDAV-CGRRFGKTFLGKAEMRRAAKLAAAWNVSVEDE--IWYAAPTQKQARRVFWRRLK 88 Query: 130 KWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189 + + P W + + + + + R E D G Sbjct: 89 QAI-----------------PKSWLVTKPNETDMLITLKSGHLLRCVGLENYDDLRG--- 128 Query: 190 TYGMAIINDEASGTPDVIN-LGILGFLT---------ERNANRFWIMTSNPRRLSGKFYE 239 + I+ DE + I L+ E + P+ Y+ Sbjct: 129 SGLFFILVDEWADCKYAAWEEVIRPMLSTCTYTLPNGEVRKGGHALRIGTPKG-FNHCYD 187 Query: 240 IF------NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI 293 F ++P + + + P E + +D R E F ++ Sbjct: 188 TFQDGKPGHEPDHRSWLYT--SLDGGNVPPEEIEAARRK--MDPRTFRQEYEASF--ENY 241 Query: 294 DSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDL 353 + EA L +G D VV + R + L ++S ++ Sbjct: 242 QGVVYYTFNREANRTSETIKRGEALHIGMDFNVMKM-AAVVHVIRDDLPLALSEFS--EV 298 Query: 354 RTTNNKISGLVEKYRPDAIIIDANNTGART---------CDYLEMLGYHVYRVLGQKRAV 404 R T I + ++ +I I + +G T L+ G+ V V AV Sbjct: 299 RDTPEMIEKIKLRFPDHSIAIYPDASGQNTSSKSASESDLSLLKKAGFTVI-VDSTNPAV 357 Query: 405 DLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKG 462 N + + E L+N + + L+ I + G E + G Sbjct: 358 KDR--VNAMCAMFAN--TYGEHRYLVNVDQCPKYTQCLERQIYTDKG----EPDKKAG 407 >gi|307591253|ref|YP_003900462.1| hypothetical protein Cyan7822_6211 [Cyanothece sp. PCC 7822] gi|306986818|gb|ADN18693.1| conserved hypothetical protein [Cyanothece sp. PCC 7822] Length = 474 Score = 52.8 bits (125), Expect = 1e-04, Method: Composition-based stats. Identities = 39/249 (15%), Positives = 81/249 (32%), Gaps = 40/249 (16%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRL 233 + P+ G + I+ DE++ + I + T + I+ S P Sbjct: 145 FRNSTPNGARGLESVSD--ILYDESAFVDEIEEIYKSSIPCTTVVGSEARIIILSTPNGQ 202 Query: 234 SGKFYEIFNKPLDD-----------------------------WKRFQIDTRTVEGIDPS 264 SG +++ + D + + + Sbjct: 203 SGWYWDKLSSNNGDRDILEICEQIRTEKIEPIQYWIDNNQWCKFIVHWLGHPKFSQQKET 262 Query: 265 FHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI--MGC 322 + I A++ L D+ E F ++ I+ + + DP + I G Sbjct: 263 YLRDIKAQFDLPEDIIEQEYNLSFTHSEV-IVFSSEIVRKNAIGQWENDPKSNCIYYFGI 321 Query: 323 DIAEEGGDNTVVVLRR----GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378 D + G D TV + R + ++ K +I+ L+++Y P + I+ N+ Sbjct: 322 DTSLLGNDYTVCTILREIDNRYHLVKMYRQRKKTHEYNIYQIAELIKQYNPIIVGIEVNS 381 Query: 379 TGARTCDYL 387 +G + L Sbjct: 382 SGQVYYEQL 390 >gi|147671611|ref|YP_001215893.1| terminase [Vibrio cholerae O395] gi|262167851|ref|ZP_06035552.1| terminase [Vibrio cholerae RC27] gi|146313994|gb|ABQ18534.1| terminase [Vibrio cholerae O395] gi|227014845|gb|ACP11054.1| putative terminase, ATPase subunit [Vibrio cholerae O395] gi|262023759|gb|EEY42459.1| terminase [Vibrio cholerae RC27] Length = 605 Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 48/181 (26%), Gaps = 23/181 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ I+ G D E + Y F S N I Sbjct: 345 PDRQWRYVVTIEDAAKGGCDLFDIEELREEYSETD--FNNLFMCVFVDGAS-SIFEFNKI 401 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV-------VVLRRGPVIEHL 345 E + + P + +G D + DN V +V + Sbjct: 402 ERCMVDSEIWQDFKPNAARPFGSREVWLGYDPSRT-RDNAVLMVVAPPIVAVEKFRVLEK 460 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAV 404 W + ++IS + E++ + ID GA D L + Sbjct: 461 HTWRGLSFQHQASEISKVFERFNVTYLGIDITGIGAGVHDLLVNKHPRETVAIHYSNENK 520 Query: 405 D 405 + Sbjct: 521 N 521 >gi|332522503|ref|ZP_08398755.1| putative phage terminase, large subunit [Streptococcus porcinus str. Jelinkova 176] gi|332313767|gb|EGJ26752.1| putative phage terminase, large subunit [Streptococcus porcinus str. Jelinkova 176] Length = 470 Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats. Identities = 51/345 (14%), Positives = 110/345 (31%), Gaps = 45/345 (13%) Query: 54 WQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICL 113 WQ + + A + + + GKT + L LW + G+ ++ Sbjct: 43 WQKNMLSPIMAIDEDGLWVHQKYGYAIPRRN----GKTEIVYILELWGL--HKGLKILHT 96 Query: 114 ANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173 A+ + + + ++ K+L + + + + + A + S G + + Sbjct: 97 AHRISTSHS-SFEKLKKYLEMS---GYVDGEDFISNKAKGQERIEFKSSGSVIQWRT--- 149 Query: 174 RTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR- 232 RT + + F ++ DEA + +T+ + N +M P Sbjct: 150 RTSNGGLGEGFD--------LLVIDEAQEYTSEQESALKYTVTDSD-NPMTVMCGTPPTM 200 Query: 233 -LSGKFYEIFNKP-------LDDWKRFQIDTRTV-------EGIDPS-----FHEGIIAR 272 +G +E + K W + + T +PS I A Sbjct: 201 VSTGTVFESYRKEVLKGAKKYSGWAEWSVSEMTKIDDVQSWYIANPSMGFHLNERKIEAE 260 Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNT 332 G D ++ G +P + S I + L E P+ L +G ++G + + Sbjct: 261 LGDDEIDHNIQRLGYWPTFNQKSVISEKEWGK-LKVEQTPELSGKLFVGIKFGQDGNNVS 319 Query: 333 -VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 + R + +R I ++ +++D Sbjct: 320 MSIAARTKENKIFVESIDCLSVRNGTQWIIDFLKSADIAKVVVDG 364 >gi|323940932|gb|EGB37119.1| hypothetical protein ERDG_02336 [Escherichia coli E482] Length = 443 Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats. Identities = 60/361 (16%), Positives = 103/361 (28%), Gaps = 65/361 (18%) Query: 80 GAISAGRGIGKTTLNAWLVLWLMST---RPGISV-------ICLANSETQLKTTLWAEVS 129 A+ GR GKT + + + ++ RPG+ + I A E Sbjct: 26 NAVRCGRRWGKTFMLSSAAVTYATSQFRRPGMDIELGGRVGIFTA------------EYR 73 Query: 130 KWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189 ++ + + L + + + + G Sbjct: 74 QYQEIYDKLEEILLP---LKKSFSRQEKRLLLKNGGKIDFWVT-------NDNKLAGRGR 123 Query: 190 TYGMAIINDEASGTPDV-----IN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK 243 Y + +I DEA+ T I I L + T + FY I + Sbjct: 124 EYEIILI-DEAAFTKSPEMLKEIWPKSIKPTLLTTKGRAYVFSTPDGVDEENFFYAICHN 182 Query: 244 PLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303 + T + + P E R D V R E +F S + Sbjct: 183 KDLGFHEHHAPTSSNPFVPPEELEK--ERQNNDPRVFRQEFLAEFVDWSAASLFDVRKWF 240 Query: 304 EALNREPC---PDPYAPLIMGCDIAEEGG---DNTVVVL-----RRGPVIEHLFDWS--K 350 E N++ P+ + D A +GG D T VV R G + DW + Sbjct: 241 EGENQDQPVDYPEMCQAVFAVMDTAVKGGTDHDGTAVVYYAVDTRPGIQRLTILDWDVVQ 300 Query: 351 TDLRTTNNKISGLVEKY-----------RPDAIIIDANNTGARTCDYLEMLGYHVYRVLG 399 D I + + + I+ + G+ E LG+ V ++ Sbjct: 301 IDGALLEEWIPSVFTRLNELSGQCVAVNGSLGVFIEDASMGSILLQKGESLGWPVNKIES 360 Query: 400 Q 400 Sbjct: 361 A 361 >gi|319645791|ref|ZP_08000021.1| YonF protein [Bacillus sp. BT1B_CT2] gi|317391541|gb|EFV72338.1| YonF protein [Bacillus sp. BT1B_CT2] Length = 589 Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats. Identities = 62/449 (13%), Positives = 127/449 (28%), Gaps = 108/449 (24%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 A RG GKT L + PG ++ + ++ Q + + Sbjct: 84 ASRGQGKTWLTSVYCCVQAILFPGTKIVIASGTKGQAREVI-----------EKIDDLRK 132 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 +S +L ++ + S + S + G + +I DE Sbjct: 133 ESPNLKREIEDLKTSTNDARVEFHNGSWIKIVASND------GARSKRANLLIVDEFRMV 186 Query: 204 P-DVINLGILGFLTERNANRFW-------IMTSNPR-RLSGKFYEIF------------- 241 ++I+ + FLT + ++ + N LS +Y++ Sbjct: 187 DFEIISKVLRKFLTAPRSPKYLEKEEYAHLKERNKEIYLSSCWYKVHWSYGRFVTYFNAM 246 Query: 242 ---NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298 +K +QI R +D + ++ D +E+ + + ++ Sbjct: 247 MKGSKYFVCGLPYQIAIREG-LLDKDQVKDEMSEEDFDPIGWSMEMEALWFGESEKAYFK 305 Query: 299 LNIIEEALN-------------------REPCPDPYAPLIMGCDIAEEGG---DNTVVVL 336 +E+ + P ++ DIA G D +V + Sbjct: 306 FEDLEKNRKLASPLFPPDYYDLIKDSNFKFENKKPGELRLISNDIAGMAGKDNDASVYTV 365 Query: 337 RR--------GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 R I ++ + +I L E Y D I++D + G D L Sbjct: 366 FRLIPNSNGYDRHIVYMESIVGGHTGSQATRIRQLFEDYACDYIVLDTQSIGLGVYDALC 425 Query: 389 MLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQ--NLKSLKSFI 446 R + + E S IN + + ++ + I Sbjct: 426 Q-----------------PLYDKERAKEY-------EPLSCINDEKMAERCTYQNAEKLI 461 Query: 447 VPNTGELAIESK---------RVKGAKST 466 G + S+ + + K Sbjct: 462 YSIKGNAQLNSEIAVLLKDGFKRRKIKIP 490 >gi|282599667|ref|ZP_05971423.2| terminase, ATPase subunit [Providencia rustigianii DSM 4541] gi|282568162|gb|EFB73697.1| terminase, ATPase subunit [Providencia rustigianii DSM 4541] Length = 574 Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats. Identities = 45/257 (17%), Positives = 79/257 (30%), Gaps = 42/257 (16%) Query: 241 FNKPLDDWKRFQ-IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI-----D 294 W++ I G++ E I D R +F + D Sbjct: 305 HYSGDSMWRQIVNIHDAIARGLNRVILEEIKDE--NPPDDFRNLYECEFVKTGERAFSYD 362 Query: 295 SFIPLNIIEEALNREPCPDPYAP-------LIMGCDIAEEGGDNTVVVLRR-------GP 340 + I + + PYAP + +G D G + + L G Sbjct: 363 ALINCGVDGYNSHIWSDWKPYAPRPLGNRPVWVGADPTGTGDNGDGLGLVIASPPAVSGG 422 Query: 341 VIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDAN-NTGARTCDYLEMLGYHVYRV 397 + +I + ++Y +I ID TGA + + V Sbjct: 423 KFRIIETVQLRGMAFEKQAEEIRRITQRYNVQSITIDGTGGTGAAVHELV---------V 473 Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG----LIQNLKSLKSFIVPNTGEL 453 A L + + + +KM + +G LI +L ++K +G + Sbjct: 474 KFFPAANLLNYSAPIKRMMIMKMQMLIRSGRFEYDAGLHKPLITSLMTIKKIQ-TPSGII 532 Query: 454 AIESKRVKGAKSTDYSD 470 ES RV+G D+ D Sbjct: 533 TYESSRVRGL---DHGD 546 >gi|182415227|ref|YP_001820293.1| hypothetical protein Oter_3416 [Opitutus terrae PB90-1] gi|177842441|gb|ACB76693.1| protein of unknown function DUF264 [Opitutus terrae PB90-1] Length = 521 Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats. Identities = 53/328 (16%), Positives = 96/328 (29%), Gaps = 66/328 (20%) Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPR-RLS 234 P T G +I DE + D I ++ F S+ Sbjct: 153 AANPRTARGFSGD----LILDEFAFHQDSRAIWEAAEPIISA--NPEFLCRISSTGNGRR 206 Query: 235 GKFYEIFNKPL---------DDWKRFQIDTRTV---EGIDPSFHEGIIARYGLDSDVTRV 282 FY++ + D WKR +I +V E I P + D Sbjct: 207 NMFYQLIAEGRIPYYRMRRSDAWKRGEIRIYSVVTGEEITPDQARAEAS----DKRAYDQ 262 Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL----------------IMGCDIAE 326 G+F + + + +I A RE + +G D+ Sbjct: 263 NYEGEFNDEAS-ALLTQELISAA-EREGIAIEHQEWSEATIERLRTKTIGDLFLGQDVGR 320 Query: 327 EGGDNTV--VVLRRGPVIEHLFDWSKTDLRTTN--NKISGLVEKYRPDAIIIDANNTGAR 382 + D +V V+ R G + ++R ++ + + + + ID G Sbjct: 321 K-RDFSVQTVIERIGSGYRVVAMLRMENMRLPAQQRELEKICKLPKFRSAEIDMTGLGLG 379 Query: 383 TCDYLEM--LGYHVYRVLGQKRAVDLEFCR-------------NRRTELHVKMADWLEFA 427 +Y + G +V V R TEL D + Sbjct: 380 LVEYAQEEPWGGNVRGVNFGSSEPISLKLRADGKKGETAPVTELMATELLGVFED--KRI 437 Query: 428 SLINHSGLIQNLKSLKSFIVPNTGELAI 455 + L +L+ + + P+ G ++I Sbjct: 438 EIPMDPELRDDLRKPEKLVSPS-GRVSI 464 >gi|238760573|ref|ZP_04621704.1| Terminase, ATPase subunit [Yersinia aldovae ATCC 35236] gi|238701192|gb|EEP93778.1| Terminase, ATPase subunit [Yersinia aldovae ATCC 35236] Length = 590 Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats. Identities = 27/137 (19%), Positives = 42/137 (30%), Gaps = 22/137 (16%) Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIM 320 YG + + +F S P ++ + Y P+ + Sbjct: 355 EYGPSE--YQNLLMCEFVDDQA-SVFPFKELQACMVDSLEEWEDYNPYSLRPFGYRPVWI 411 Query: 321 GCDIAE-EGGDNT---VVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAI 372 G D +E GGD+ V+ + G L W D I L +KY + I Sbjct: 412 GYDPSEANGGDSAGCAVIAPPMVPGGKFRVLERHQWKGMDFEAQAKHIEELTQKYCVEYI 471 Query: 373 IIDANNTGARTCDYLEM 389 IDA G + Sbjct: 472 GIDATTVGQGVFQLVRQ 488 >gi|186896884|ref|YP_001873996.1| hypothetical protein YPTS_3586 [Yersinia pseudotuberculosis PB1/+] gi|186699910|gb|ACC90539.1| protein of unknown function DUF264 [Yersinia pseudotuberculosis PB1/+] Length = 595 Score = 52.4 bits (124), Expect = 2e-04, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 52/163 (31%), Gaps = 20/163 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + E + +Y D+ + F DS +++ Sbjct: 333 PDGQWRYVITLENAIDGGFNLADIERLRNKYNRDT--FNMLYMCVFVDSG-DSVFKFHML 389 Query: 303 EEALNREPCPDPYA----------PLIMGCDIAEEGGDNTVVVLR----RGPVIEHLFD- 347 E+ + + G D A G +T V++ G L Sbjct: 390 EKCGVDIEMWQDHDFSAPRPFGNREVWGGFDPARSGDTSTFVIIAPPQFEGERFRVLATF 449 Query: 348 -WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 W + N+I L ++Y I +D G + ++ Sbjct: 450 YWQGLNFNYQANQIKELFQRYNMTYIGVDITGIGNGVFELVQN 492 >gi|226305996|ref|YP_002765956.1| hypothetical protein RER_25090 [Rhodococcus erythropolis PR4] gi|226185113|dbj|BAH33217.1| hypothetical protein RER_25090 [Rhodococcus erythropolis PR4] Length = 402 Score = 52.0 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 48/366 (13%), Positives = 96/366 (26%), Gaps = 71/366 (19%) Query: 65 HCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTL 124 H + FK + GR GKTT + PG V A + Q + + Sbjct: 5 HQSQRKIAESSSRFKV-LRCGRRFGKTTYAVEEMKGACLFEPGP-VAYFATTRDQARDIV 62 Query: 125 WAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTF 184 WAE+ +++ ++ L D + + R Sbjct: 63 WAEL--LENVIGTTNYVSHNEQRLEVTLRRPDGSLNRIRLFGWENIETARG--------- 111 Query: 185 VGHHNTYGMAIINDEAS---GTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-EI 240 ++ DE I L + + P+ + E Sbjct: 112 -----KKYSLVVLDELDSMRAFEKQWREIIRATLADYRGRALF--MGTPKGYKSLYRLEK 164 Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 +K +++ F + + + + + E+ ++ + + Sbjct: 165 LSKTNANYEVFHFTSFDNPFLSVEELDEMRGEMTVTQ--YAQEMLAEYHKME-------G 215 Query: 301 IIEEALNRE----PCPDPYAPLIMGCDIAEE----------GGDNTVVVLRRGPVIEHLF 346 +I E NR+ P + D G DN+ ++ + Sbjct: 216 LIYEEFNRDQHIKALPFTPERWALSIDFGYNHPFAAGIFAIGSDNS-------LHLDRMV 268 Query: 347 DWSKTDLRTTNNKISGLVEKYR--------PDAIIIDANNTGARTCDYLEMLGYHVYRVL 398 K N + L+ + D + ID LG + V+ Sbjct: 269 YKRKLSDEQRMNAVRDLIGDTKLDFQIGDSEDPLAIDT---------LNRQLGLKIQPVV 319 Query: 399 GQKRAV 404 +V Sbjct: 320 KGAGSV 325 >gi|94994695|ref|YP_602793.1| phage terminase [Streptococcus pyogenes MGAS10750] gi|94548203|gb|ABF38249.1| phage terminase [Streptococcus pyogenes MGAS10750] Length = 476 Score = 52.0 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 52/347 (14%), Positives = 111/347 (31%), Gaps = 47/347 (13%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQ + + A + + + GKT + + LW + G+ ++ Sbjct: 48 PWQENMLIPIMAVDEDGLWVHQKYGYAIPRRN----GKTEVVYIVELWAL--HKGLKILH 101 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 A+ + + +V K+L + + + + + A + + G + + Sbjct: 102 TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKASGAVIQFRT-- 155 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232 RT + + F +I DEA + +T+ + N IM P Sbjct: 156 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 205 Query: 233 --LSGKFYEIFNKP----------LDDWKRFQ------IDTRTVEGIDPSFH---EGIIA 271 +G +E + K +W + + + + FH I A Sbjct: 206 MVSTGTVFEAYRKDCLKGNKRYSGWAEWSVPEMVKINDVSSWYISNPSMGFHLNERKIEA 265 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330 G D ++ G +P + S I + L E P+ + L +G ++G + Sbjct: 266 ELGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 323 Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 + + R + +R I ++ ++ID Sbjct: 324 VSLSIAARTSENKVFVETIDCLSVRNGTQWIINFLKSADIAKVVIDG 370 >gi|320105341|ref|YP_004180931.1| hypothetical protein AciPR4_0096 [Terriglobus saanensis SP1PR4] gi|319923862|gb|ADV80937.1| hypothetical protein AciPR4_0096 [Terriglobus saanensis SP1PR4] Length = 484 Score = 52.0 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 57/342 (16%), Positives = 102/342 (29%), Gaps = 61/342 (17%) Query: 106 PGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGID 165 PG + +A++ + ++ V + LP+ S ++V + Sbjct: 84 PGTMTVLVAHTREATEQ-MFRIVQRMWENLPDDLREGPAKRS------RANVGQMAFPAL 136 Query: 166 SKHYSTMCRT-YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFW 224 + + + R + H + D AS G+ L Sbjct: 137 DSEFRVVSAGEQNAGRSMSIQNLHCSELSRWPGDAASTLA-----GLKAALAPGGE---M 188 Query: 225 IMTSNPRRLSGKFYEIF---------NKPLDDWKRFQIDTRTVEGIDPSFHEG-IIARYG 274 ++ S P G FY+ + L W VE D + E +++R G Sbjct: 189 VLESTPNGAYGCFYQEWMEAEAQRMARHFLPWWMEPTYLGARVEASDWTEEERALVSREG 248 Query: 275 LDSD----------VTRVEVCGQFPQQDI-------DSFIPLNIIEEALN---------- 307 L + R +F + + + F L+ IE L Sbjct: 249 LRPEQIGYRRELQRTYRGMARQEFAEDAVSCFRASGECFFELDAIEARLAELTPPLASRR 308 Query: 308 -----REPCPDPYAPLIMGCDIAEEGGD---NTVVVLRRGPVIEHLFDWSKTDLRTTNNK 359 P ++ D A G + V+ ++ + + R Sbjct: 309 SGSLLLWMPPVKGRRYLIASDPAGGGSEGDFAAAQVVDIDLGLQCAELRQRLNPRELAEV 368 Query: 360 ISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQK 401 + L +Y I+++ NN GA YLE G VY GQ Sbjct: 369 LIDLAREYNGALIVVERNNHGAGVLAYLEKRGVAVYEEGGQA 410 >gi|238765012|ref|ZP_04625949.1| Terminase, ATPase subunit [Yersinia kristensenii ATCC 33638] gi|238696781|gb|EEP89561.1| Terminase, ATPase subunit [Yersinia kristensenii ATCC 33638] Length = 587 Score = 52.0 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 61/207 (29%), Gaps = 32/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP----------- 314 + + Y + + +F S P ++ + Sbjct: 349 LDQLALEY--SPAEYQNLLMCEFVDDK-TSVFPFEELQGCMVDSLEEWDDFNPYAYRPFG 405 Query: 315 YAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 Y + +G D + G VVL G L W D T I L EKY Sbjct: 406 YRAVWLGYDPSHTGDSAGCVVLAPPLVPGGKFRILERHQWKGMDFATQAESIKTLTEKYC 465 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + ++ + +T L +K D + Sbjct: 466 VEYIGIDATGIGQGVYQLVRE---------FFPAVQEIRYSPEIKTALVLKAKDLITSGR 516 Query: 429 LINHSG---LIQNLKSLKSFIVPNTGE 452 L SG + Q+ +++ + + G Sbjct: 517 LEYDSGHTDITQSFMAIRKTMTASGGR 543 >gi|238788385|ref|ZP_04632179.1| Terminase, ATPase subunit [Yersinia frederiksenii ATCC 33641] gi|238723631|gb|EEQ15277.1| Terminase, ATPase subunit [Yersinia frederiksenii ATCC 33641] Length = 587 Score = 52.0 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 61/207 (29%), Gaps = 32/207 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP----------- 314 + + Y + + +F S P ++ + Sbjct: 349 LDQLALEY--SPAEYQNLLMCEFVDDK-TSVFPFEELQGCMVDSLEEWDDFNPYAYRPFG 405 Query: 315 YAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 Y + +G D + G VVL G L W D T I L EKY Sbjct: 406 YRAVWLGYDPSHTGDSAGCVVLAPPLVLGGKFRILERHQWKGMDFATQAESIKTLTEKYC 465 Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428 + I IDA G + ++ + +T L +K D + Sbjct: 466 VEYIGIDATGIGQGVYQLVRE---------FFPAVREIRYSPEIKTALVLKAKDLITSGR 516 Query: 429 LINHSG---LIQNLKSLKSFIVPNTGE 452 L SG + Q+ +++ + + G Sbjct: 517 LEYDSGHTDITQSFMAIRKTMTASGGR 543 >gi|225621691|ref|YP_002724049.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] gi|225547649|gb|ACN93626.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] Length = 450 Score = 52.0 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 61/332 (18%), Positives = 108/332 (32%), Gaps = 51/332 (15%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLM----STRP-GIS 109 Q E + +++H + V +F G I++ GKT L ++L++ + S + Sbjct: 49 QKEVLFDIESHDYSKV------IFSGGIAS----GKTFLASYLLVKKLIENKSFYEQDTN 98 Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169 + NS L T ++ K SL + + + L I Sbjct: 99 NFIIGNSIGLLMTNTVKQIEKICSL------LGIDYEKKKSGQSFCKIAGLKLNI----- 147 Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229 Y + D F I +EA+ L ++ L R I +N Sbjct: 148 ------YGGKNRDAFSKIRGGNSAIIYVNEATVIHKETLLEVIKRL--RKGKEIIIFDTN 199 Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQF 288 P + F + + D +K + T F + Y R V G++ Sbjct: 200 PESPAHYFKTDYIENTDVFKTYTFTTYDNPLNSADFIQTQEKLY-RRFPAYRARVLYGEW 258 Query: 289 PQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVLRRGPVIEHLFD 347 + F E N++ IM D A GGDNT + + E + Sbjct: 259 ILNESTLF-----NEMIFNQDYEFKSP---IMYIDPAFSVGGDNTAICVLERTF-EKFYA 309 Query: 348 WSKTDLRTTN-----NKISGLVEKYRPDAIII 374 + D + + I L+E + + + I Sbjct: 310 YIYQDQKPVSDSLVLASIQVLIENFNVNTVYI 341 >gi|306818204|ref|ZP_07451935.1| possible phage terminase protein [Mobiluncus mulieris ATCC 35239] gi|304649168|gb|EFM46462.1| possible phage terminase protein [Mobiluncus mulieris ATCC 35239] Length = 470 Score = 52.0 bits (123), Expect = 3e-04, Method: Composition-based stats. Identities = 58/389 (14%), Positives = 100/389 (25%), Gaps = 51/389 (13%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQ EV + N + ++ R GKTTL L+ + PG V Sbjct: 33 PWQKLVAEVACE--RQAANPERARYQRVIVTVPRQSGKTTLIKALMAAVAQANPGCKVYY 90 Query: 113 LANSETQLKTTL--WAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170 A + K + W E++K L + + ++ + ++ Sbjct: 91 TAQTR---KDAVEKWGELAKQLRKEMGTGPDGKPRVKVLEGTGNEKIVFQGTESVIQPFA 147 Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL--------------- 215 G H I DEA ++ L Sbjct: 148 PTP-----------GGLHGATSPLAIVDEAWAFDQAQGDDLMAALNPVGLTIPHSQVWII 196 Query: 216 -TERNANRFWI---------MTSNPRRLSGKF---YEIFNKPLDDWKRFQIDTRTVEGID 262 T + W+ TS+P + F + D + G Sbjct: 197 STAGDTRSQWLKSLVDDGRAATSDPGATTAFFEWSADEETADADLRGDAALSFHPALGYT 256 Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN-IIEEALNREPCPDPYAPLIMG 321 + + R +P S I L A P + Sbjct: 257 QELWKLKALGKDEKDHLYRRAYLNLWPTNAQTSIIDLETWDGLATEISETPT---GATIA 313 Query: 322 CDIAEEGGDNTVVVLRRGPVIEHLFD-WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380 D+A+ T+ + L SK I L + P A++ D + Sbjct: 314 FDVADGRTGATIYAAWQQDNSVCLHRLISKAGAAWIEKAIEHLQDTLAPAALVADDSGDN 373 Query: 381 ARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409 + L G +Y + ++ A Sbjct: 374 RPIIEGLRRAGATIYALKPKEYATANSEF 402 >gi|170023146|ref|YP_001719651.1| hypothetical protein YPK_0897 [Yersinia pseudotuberculosis YPIII] gi|169749680|gb|ACA67198.1| protein of unknown function DUF264 [Yersinia pseudotuberculosis YPIII] Length = 595 Score = 52.0 bits (123), Expect = 3e-04, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 52/163 (31%), Gaps = 20/163 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + E + +Y D+ + F DS +++ Sbjct: 333 PDGQWRYVITLEDAIDGGFNLADIERLRNKYNRDT--FNMLYMCVFVDSG-DSVFKFHML 389 Query: 303 EEALNREPCPDPYA----------PLIMGCDIAEEGGDNT--VVVLRR--GPVIEHLFD- 347 E+ + + G D A G +T ++ + G L Sbjct: 390 EKCGVDIEMWQDHDFSAPRPFGNREVWGGFDPARSGDTSTFAIIAPPQFEGERFRVLVTF 449 Query: 348 -WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 W + N+I L ++Y I +D G + ++ Sbjct: 450 YWQGLNFNYQANQIKELFQRYNMTYIGVDITGIGNGVFELVQN 492 >gi|226953564|ref|ZP_03824028.1| possible ATPase terminase subunit [Acinetobacter sp. ATCC 27244] gi|226835689|gb|EEH68072.1| possible ATPase terminase subunit [Acinetobacter sp. ATCC 27244] Length = 374 Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats. Identities = 64/357 (17%), Positives = 103/357 (28%), Gaps = 87/357 (24%) Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE---ASGTPDVINL 209 D + S G + + T RT G H + DE G D + Sbjct: 6 LTGDPIILSNGAELRFLGTNYRTA--------QGPHGN----LYFDEIFWTYGF-DELEK 52 Query: 210 GILGFLTERNANRFWIMTSNPRRLSGKFYE-----IFNKPLDDWKRFQID--------TR 256 G T + S P ++ + Y+ FNK ++F+ID R Sbjct: 53 VASGMATHDKWRKT--YFSTPSSITHEAYKFWTGTRFNKGKPKDQQFKIDLSHKALKHGR 110 Query: 257 TVE----------------GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 E G D E + Y + +F S PL+ Sbjct: 111 VCEDLMWRQIVTVEDAKEGGCDLFNIERLKFEYSPED--FANLFMCEFVDDGQ-SMFPLS 167 Query: 301 IIEEALNREP-----------CPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL 345 +++ + P P+ +G D A G + +VV+ G L Sbjct: 168 MLQICMVDTLEIWNDFKIWHNRPFSNKPVWIGYDPALTGDNAGLVVVAPPAVAGGKFRVL 227 Query: 346 FD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD-------YLEMLGYHVYR 396 + D +I + +Y I ID G + L Y Sbjct: 228 EKHQFKGDDFSEQAERIRAITLRYNVTYIGIDTTGMGYGVAELVRAFFPALTTFNYSP-E 286 Query: 397 VLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGEL 453 V Q L+ RN R E + L Q+L S+K + + ++ Sbjct: 287 VKSQLVYKTLDVIRNGRLE--------FDAG----DKDLAQSLMSIKKTLTSSQKQI 331 >gi|291618711|ref|YP_003521453.1| P [Pantoea ananatis LMG 20103] gi|291153741|gb|ADD78325.1| P [Pantoea ananatis LMG 20103] Length = 588 Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 E + RY + + + F D+ S L +++ + P Sbjct: 348 LEQLRTRYSPED--YQNLLMCVF-MDDLASVFQLAMLQRCMVDSWEVWDDFEALALRPFG 404 Query: 315 YAPLIMGCDIAEE--GGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ GD+ V+ G L W D R + I L + Sbjct: 405 WKEVWIGYDPAKGTKNGDSAGCVVIAPPAVPGGKFRILERHQWRGMDFRAQADAIKSLTQ 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGLGVYENVK 487 >gi|281356810|ref|ZP_06243301.1| protein of unknown function DUF264 [Victivallis vadensis ATCC BAA-548] gi|281316937|gb|EFB00960.1| protein of unknown function DUF264 [Victivallis vadensis ATCC BAA-548] Length = 417 Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats. Identities = 75/419 (17%), Positives = 128/419 (30%), Gaps = 72/419 (17%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICL---ANSETQLKTTLWAEVSKWLSLLPNKHW 140 G GKT + + +L G L AN+ Q S LP Sbjct: 29 GGSRSGKTFILVYAILVRALRAAGSRHAILRLHANTVRQ---------SIRFDTLPKVVK 79 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 L L + D L + + +EER D +G I +E Sbjct: 80 LAFPGLGLSESK--VDQLIRLPNGSELWFGGLD---TEERADKILG---KEFATIYFNEC 131 Query: 201 SGTPDVINLGILGFLTERNA-----NRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT 255 S ++ + LT NR W NP S Y +F + D + Sbjct: 132 S---ELGFEAVSTALTRLAQRTALKNRAWFDC-NPAGKSHWSYRLFIERRDPVSGLPLSF 187 Query: 256 RTVE---GIDPSFHEGIIARYGLDSDVT----RVEVC---GQFPQQDIDSFIPLNIIEEA 305 ++P+ + + L+ + R + G + + +IE Sbjct: 188 PDNYASMLLNPAENRENLPEGYLEETLAGLTERQRLRFQEGAWLDDLSGALWSTAMIER- 246 Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNT----VVVLRRG--PVIEHLFDWSKTDLRT-TNN 358 +R +++G D A G ++ +V RG L D S + Sbjct: 247 -SRVEAAPSLERIVIGVDPAVTSGKDSDETGIVTAGRGADGHYYVLADASCRERPAGWAA 305 Query: 359 KISGLVEKYRPDAIIIDANNTGARTCDYLE--MLGYHVYRVLGQKRAVDLEFCRNRRTEL 416 ++ ++R D ++ + NN G L L +V + + R E Sbjct: 306 RVRDEYRRFRADRVVAEVNNGGDLVETVLRSQELDLPFRQVRAMRGKI-------ARAE- 357 Query: 417 HVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 +A E ++H G + L+ + P TG S D D L++ Sbjct: 358 --PVAALYEQGK-VHHVGCFRELEEQMTSFTPQTG-----------TGSPDRLDALVWA 402 >gi|323937704|gb|EGB33972.1| terminase [Escherichia coli E1520] Length = 433 Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats. Identities = 70/417 (16%), Positives = 122/417 (29%), Gaps = 62/417 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140 AG G GKT + + + PGI+ A + Q++ + EV+ L + Sbjct: 25 AGFGSGKTWVGCGGICKGIWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 84 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 + + + + CR S E+P T VG + DE Sbjct: 85 GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 125 Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248 P I+ + + + R I + YE F K + + Sbjct: 126 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 185 Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 Q T E + + ++ Y ++ + + GQF + + N Sbjct: 186 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLRGQFTNLTSGTVYH-QFDRKLNN 242 Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL---- 363 E P P+ +G D V VLR G + D I Sbjct: 243 CEEVEQPGEPIYIGMDFNVGKMAGIVHVLRLGLPCAVTEIINAYDTPDMIRIIKERFWLY 302 Query: 364 ----VEKYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412 K R I DA+ + A T D L+ G++V V+ + + Sbjct: 303 DGNDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 360 Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS 469 + + + L SL+ + G E + G + + Sbjct: 361 NA-MFCNANGERRYKVNVKRCPLYAE--SLEQQVWDEKG----EPDKKSGNDHPNDA 410 >gi|301025610|ref|ZP_07189133.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 69-1] gi|300395930|gb|EFJ79468.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 69-1] Length = 435 Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats. Identities = 70/417 (16%), Positives = 122/417 (29%), Gaps = 62/417 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140 AG G GKT + + + PGI+ A + Q++ + EV+ L + Sbjct: 27 AGFGSGKTWVGCGGICKGIWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 86 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 + + + + CR S E+P T VG + DE Sbjct: 87 GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 127 Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248 P I+ + + + R I + YE F K + + Sbjct: 128 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 187 Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 Q T E + + ++ Y ++ + + GQF + + N Sbjct: 188 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLRGQFTNLTSGTVYH-QFDRKLNN 244 Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL---- 363 E P P+ +G D V VLR G + D I Sbjct: 245 CEEVEQPGEPIYIGMDFNVGKMAGIVHVLRLGLPCAVTEIINAYDTPDMIRIIKERFWLY 304 Query: 364 ----VEKYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412 K R I DA+ + A T D L+ G++V V+ + + Sbjct: 305 DGNDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 362 Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS 469 + + + L SL+ + G E + G + + Sbjct: 363 NA-MFCNANGERRYKVNVKRCPLYAE--SLEQQVWDEKG----EPDKKSGNDHPNDA 412 >gi|209918626|ref|YP_002292710.1| hypothetical protein ECSE_1435 [Escherichia coli SE11] gi|209911885|dbj|BAG76959.1| conserved hypothetical protein [Escherichia coli SE11] Length = 436 Score = 51.6 bits (122), Expect = 3e-04, Method: Composition-based stats. Identities = 70/417 (16%), Positives = 122/417 (29%), Gaps = 62/417 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140 AG G GKT + + + PGI+ A + Q++ + EV+ L + Sbjct: 28 AGFGSGKTWVGCGGICKGIWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 87 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 + + + + CR S E+P T VG + DE Sbjct: 88 GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 128 Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248 P I+ + + + R I + YE F K + + Sbjct: 129 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 188 Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 Q T E + + ++ Y ++ + + GQF + + N Sbjct: 189 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLRGQFTNLTSGTVYH-QFDRKLNN 245 Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL---- 363 E P P+ +G D V VLR G + D I Sbjct: 246 CEEVEQPGEPIYIGMDFNVGKMAGIVHVLRLGLPCAVTEIINAYDTPDMIRIIKERFWLY 305 Query: 364 ----VEKYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412 K R I DA+ + A T D L+ G++V V+ + + Sbjct: 306 DGNDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 363 Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS 469 + + + L SL+ + G E + G + + Sbjct: 364 NA-MFCNANGERRYKVNVKRCPLYAE--SLEQQVWDEKG----EPDKKSGNDHPNDA 413 >gi|318604142|emb|CBY25640.1| phage terminase, ATPase subunit [Yersinia enterocolitica subsp. palearctica Y11] gi|318605359|emb|CBY26857.1| phage terminase, ATPase subunit [Yersinia enterocolitica subsp. palearctica Y11] Length = 590 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 24/133 (18%), Positives = 40/133 (30%), Gaps = 20/133 (15%) Query: 276 DSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIMGCDI 324 + + +F S P ++ + + P+ +G D Sbjct: 357 SPAEYQNLLMCEFVDDQA-SVFPFAELQACMVDSLEEWEDYNPYSLRPFGHRPVWIGYDP 415 Query: 325 AE-EGGDNT---VVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 +E GGD+ V+ + G L W D I L +KY + I IDA Sbjct: 416 SEANGGDSAGCAVIAPPMVPGGKFRVLERHQWKGMDFEAQAKHIEELTQKYCVEYIGIDA 475 Query: 377 NNTGARTCDYLEM 389 G + Sbjct: 476 TTVGQGVFQLVRQ 488 >gi|254465926|ref|ZP_05079337.1| phage DNA Packaging Protein [Rhodobacterales bacterium Y4I] gi|206686834|gb|EDZ47316.1| phage DNA Packaging Protein [Rhodobacterales bacterium Y4I] Length = 428 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 65/433 (15%), Positives = 119/433 (27%), Gaps = 67/433 (15%) Query: 82 ISAGRGIGKTTLNA-WLVLWLMSTRPGI-----SVICLANSETQLKTTLWAEVSKWLSLL 135 I GRG GKT A W+ RP + +A + Q++ + + +L Sbjct: 36 ILGGRGAGKTRAGAEWVRSLAEGARPHDPGTARRIALVAETYDQVRDVM---IHGDSGIL 92 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 P S + +S P+ G A Sbjct: 93 ACSP------------PDRRPKWKASERKLIWPNGAEAQAFSAHDPEALRGPQFD---AA 137 Query: 196 INDE----ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF 251 DE G + L + +T+ P R + + P + Sbjct: 138 WADELAKWRKG--QESWDMLQFAL-RLGQDPRVCVTTTP-RNAPVLKRLLASPSTV-QTH 192 Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 + PSF E + +RY + + R E+ G + + EA E Sbjct: 193 AATEANRANLAPSFLEEVRSRY-AGTRLGRQELDGVMLSDIQGALWTTAALVEAQVAEAP 251 Query: 312 PDPYAPLIMGCDIA---EEGGDNTVVVLRRGPVIEHLFDWS----------KTDLRTTNN 358 P +++ D A + D +V+ DW Sbjct: 252 P--LDRVVVAVDPAVSAGKDSDACGIVVAGAVTRGKPQDWQAYVLADCTVQGVGPLAWAQ 309 Query: 359 KISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHV 418 + +++ + ++ + N GA L V RA+ ++ R E Sbjct: 310 AVIAARDRFGAERVVAEVNQGGALVESVLRQADPLV-----PFRALHARKGKSARAEPVA 364 Query: 419 KMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478 + + L L + + + +G S D D L++ E Sbjct: 365 ALYEQGRVRHLPGLGELEDQMC-------------QMTPQGYRGGGSPDRVDALVWALHE 411 Query: 479 NPPRSDMDFGRCP 491 + + R Sbjct: 412 LIIQPAANLRRPQ 424 >gi|294338167|emb|CBJ94203.1| Putative phage DNA packaging protein (terminase) [Campylobacter phage CPt10] Length = 731 Score = 51.3 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 58/365 (15%), Positives = 112/365 (30%), Gaps = 63/365 (17%) Query: 64 AHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTT 123 HC + + + + + K+T + + L + I++ +A Sbjct: 260 DHCYDLTLEHHHLYYTNGVLS-HNSSKSTTTSVKLAHLYCFKKDINIGIVA--------- 309 Query: 124 LWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE-ERPD 182 S + + + L P + + S + ++ D Sbjct: 310 --------YSGNSAREFLDKTKKMLIGLPIWMQPGTVTWNKGSIECENNIKILTDVPSSD 361 Query: 183 TFVGHHNTYGMAIINDEASGTPDVIN-LGILGFLTERNANRF--WIMTSNPRRLSGKFYE 239 F G T I+ DE + G L + F ++ S P+ + FY+ Sbjct: 362 AFRG---TSTNIIVVDECAYLDPAGWIDFTDGVLPSQAGLAFKKLVILSTPKGKN-HFYD 417 Query: 240 IFN--------------KPLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV 284 I+ + DW+ + + + F + I GL V Sbjct: 418 IWQGAGDTLETSINGFVRHRVDWRLVPRFKSDGTKYDPEEFKQQQIKTGGL--VVWNSAY 475 Query: 285 CGQFPQQDIDSFIPLNIIEEALNREPC---------------PDPYAPLIMGCDIAEEGG 329 +F + + IP I++ +EP P P +MG D A+EG Sbjct: 476 ECKF-EGSAMTLIPSEILDTYKPQEPIEVDNIKDSKILIYEEPIPGHKYVMGVDTAKEGA 534 Query: 330 DNT-VVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLVEKYRPDAIIIDAN-NTGARTCD 385 D T V + + +K D ++ ++ II++ N +G D Sbjct: 535 DFTGVQIFDITDLNFRQVASAKLKIDYMLLPELLNEYGLRFNQALIIVENNEGSGQVVAD 594 Query: 386 YLEML 390 L+ Sbjct: 595 ILKRD 599 >gi|224796473|ref|YP_002641230.1| phage terminase, large subunit, pbsx family [Borrelia spielmanii A14S] gi|224497687|gb|ACN53304.1| phage terminase, large subunit, pbsx family [Borrelia spielmanii A14S] Length = 450 Score = 51.3 bits (121), Expect = 4e-04, Method: Composition-based stats. Identities = 62/332 (18%), Positives = 110/332 (33%), Gaps = 51/332 (15%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLM----STRP-GIS 109 Q E + +++H + V +F G I++ GKT L ++L++ + S + Sbjct: 49 QKEVLFDIESHKYSKV------IFSGGIAS----GKTFLASYLLIKKLIENKSFYEQDTN 98 Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169 + NS + L T ++ K LL + + S + ++ D Sbjct: 99 NFIIGNSISLLMTNTIKQIEKICRLLGIDYQKKKSGQSFCKIAGFELNIYGGKNRD---- 154 Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229 F I +EA+ L +L L R I +N Sbjct: 155 -------------AFSKIRGGNSAIIYVNEATVIHKETLLEVLKRL--RKGKSIIIFDTN 199 Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQF 288 P + F + + D +K + T F E Y S + V G++ Sbjct: 200 PESPAHFFKTDYIENTDVFKTYNFTTYDNPLNSADFIETQEKLY-KHSPAYKARVLYGEW 258 Query: 289 PQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVLRRGPVIEHLFD 347 + F E N++ IM D A GGDNT + + E + Sbjct: 259 IVNESSLF-----NEMIFNQDYEFKSP---IMYIDPAFSVGGDNTAICVLERTF-EKFYA 309 Query: 348 WSKTDLRTTNN-----KISGLVEKYRPDAIII 374 + D + N+ I L+E + + + I Sbjct: 310 YIYQDQKPVNDSLMLNSIQVLIENFNVNTVYI 341 >gi|291336835|gb|ADD96368.1| phage terminase large subunit [uncultured organism MedDCM-OCT-S09-C20] Length = 454 Score = 51.3 bits (121), Expect = 4e-04, Method: Composition-based stats. Identities = 49/333 (14%), Positives = 104/333 (31%), Gaps = 32/333 (9%) Query: 40 EKGTPLEGFSAP---RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAW 96 E P+E P + + + + +++ + + + G G GK+ A Sbjct: 14 EPKRPVERAIDPGAADALRAKILADCLPAQREFLDDESHRIL--SYIGGFGSGKSFALAA 71 Query: 97 LVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSD 156 +++L PG +++ + ++T L + + W S P P Y Sbjct: 72 KLIFLGLRNPGGTLMACEPTFPMIRTVLVPAI-----DMALDQWDIEYSYRASPQPEY-- 124 Query: 157 VLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASG----TPDVINLGIL 212 S+ + + + C++ + + A + DE T +L Sbjct: 125 ----SINLPTGPVTIYCQSA-----ENYQRIRGQNICAAVWDECDTSPVDTAQKAGEMLL 175 Query: 213 GFLTERNANRFWIMTSNPRRLSGKFYEIF-NKPLDDWKRFQIDTRTVEGIDPSFHEGIIA 271 + N+ + S P Y F D + ++ T+ + F + Sbjct: 176 ARMRTGELNQLAVA-STPEG-FRWAYRTFVENDGPDKRLIRVRTQDNPHLPADFIPSLER 233 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN 331 Y S + + + G F S P + P + +G D+ G Sbjct: 234 NY--PSQLIQAYLEGHFVNLASCSLYP-EFDRSLNYCDTQPTENDTIWIGVDL-NVGNCV 289 Query: 332 TVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLV 364 T ++RRG + D + + + Sbjct: 290 TQHLVRRGDEFHFFAEKVYRDTQQIAQGLKEMY 322 >gi|299531659|ref|ZP_07045064.1| putative phage associated protein [Comamonas testosteroni S44] gi|298720375|gb|EFI61327.1| putative phage associated protein [Comamonas testosteroni S44] Length = 436 Score = 51.3 bits (121), Expect = 4e-04, Method: Composition-based stats. Identities = 61/340 (17%), Positives = 122/340 (35%), Gaps = 41/340 (12%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GK+ A ++L + ++RP V+C E+ K + ++ + Sbjct: 39 GGRGGGKSWTVAAVLLVMAASRPL-RVLCT------------REIQKSIKQSVHQ-LLKD 84 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHY-STMCRTYSEERPDTFVGHHNTYGMAIINDEASG 202 L+ ++ + GI+ + + ++++ + +F G + +EA G Sbjct: 85 VITRLNLHAFFEVLETEVRGINGSLFLFSGLQSHTVDSIKSFEGCD-----IVWVEEAHG 139 Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF-NKPLDDWKRFQIDTRTVEGI 261 ++ + + + + + NP + + Y+ F P D +I+ R Sbjct: 140 VSKKSWDTLIPTIRKEGSEIWLTL--NPDMETDETYQRFIATPSPDTWVVEINWRDNPWF 197 Query: 262 DPSFHEGII-ARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE---EALNREPCPDPYAP 317 E A+ + +D G+ + + + + R+ DP P Sbjct: 198 PRVLDEERRKAKRTMLADDYAHIWEGKARRVAAGAIYRHEMESVYLDNRARDVPYDPTLP 257 Query: 318 LIMGCDIAEEGGDNTVVVLRRGP-----VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI 372 + D+ D + L + +I H+ D +T L K+ L ++ D + Sbjct: 258 VHTVWDLGWN--DAMSIALVQRGPQDVRIIGHIEDSHRT-LDWYVAKLEKLPYRWGTDYL 314 Query: 373 IIDAN----NTGARTCDYLEMLGYHVYRVLGQKRAVDLEF 408 D TG T L LG V+ Q RA D+E Sbjct: 315 PHDGKTKNFQTGKSTEQLLRELGRR--SVMVQPRATDVEE 352 >gi|330503113|ref|YP_004379982.1| phage P2 terminase ATPase subunit, gpP-like protein [Pseudomonas mendocina NK-01] gi|328917399|gb|AEB58230.1| phage P2 terminase ATPase subunit, gpP-like protein [Pseudomonas mendocina NK-01] Length = 585 Score = 51.3 bits (121), Expect = 4e-04, Method: Composition-based stats. Identities = 43/238 (18%), Positives = 72/238 (30%), Gaps = 51/238 (21%) Query: 244 PLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 W++ I G D + + Y D + QF DS PL ++ Sbjct: 326 DDKVWRQIVTILDAEARGCDLFDLDELRLEY--DGPAFDNLLMCQFVDDG-DSIFPLTML 382 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL----RRGPVIEHL--F 346 + + + P + +G D AE G ++VL G L F Sbjct: 383 QPCMVESWDWPDFKPFAARPFGDRQVWLGYDPAENGDSAGLMVLAPPTEPGGKFRVLDRF 442 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406 + D KI L + Y I ID G + Sbjct: 443 QFRGMDFEAQAEKIRQLTQIYWVTYIGIDTTGMGTGVAQLVRQ----------------- 485 Query: 407 EFCRNRRTELH-------VKMADW--LEFASLINHSG---LIQNLKSLKSFIVPNTGE 452 F N RT + + M W + L +G + Q+L +++ + +G+ Sbjct: 486 -FFPNLRTFSYSPEVKTQLVMKAWDVVRKGRLEFDAGATDIAQSLMAIRK-TMTPSGK 541 >gi|294337972|emb|CBJ93810.1| putative phage DNA packaging protein (terminase) [Campylobacter phage CP220] Length = 744 Score = 51.3 bits (121), Expect = 4e-04, Method: Composition-based stats. Identities = 58/365 (15%), Positives = 112/365 (30%), Gaps = 63/365 (17%) Query: 64 AHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTT 123 HC + + + + + K+T + + L + I++ +A Sbjct: 260 DHCYDLTLEHHHLYYTNGVLS-HNSSKSTTTSVKLAHLYCFKKDINIGIVA--------- 309 Query: 124 LWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE-ERPD 182 S + + + L P + + S + ++ D Sbjct: 310 --------YSGNSAREFLDKTKKMLIGLPIWMQPGTVTWNKGSIECENNIKILTDVPSSD 361 Query: 183 TFVGHHNTYGMAIINDEASGTPDVIN-LGILGFLTERNANRF--WIMTSNPRRLSGKFYE 239 F G T I+ DE + G L + F ++ S P+ + FY+ Sbjct: 362 AFRG---TSTNIIVVDECAYLDPAGWIDFTDGVLPSQAGLAFKKLVILSTPKGKN-HFYD 417 Query: 240 IFN--------------KPLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV 284 I+ + DW+ + + + F + I GL V Sbjct: 418 IWQGAGDTLETSINGFVRHRVDWRLVPRFKSDGTKYDPEEFKQQQIKTGGL--VVWNSAY 475 Query: 285 CGQFPQQDIDSFIPLNIIEEALNREPC---------------PDPYAPLIMGCDIAEEGG 329 +F + + IP I++ +EP P P +MG D A+EG Sbjct: 476 ECKF-EGSAMTLIPSEILDTYKPQEPIEVDNIKDSKILIYEEPIPGHKYVMGVDTAKEGA 534 Query: 330 DNTVVVLRRGPVI---EHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDAN-NTGARTCD 385 D T V + + + K D ++ ++ II++ N +G D Sbjct: 535 DFTGVQIFDTTDLNFRQVASAKLKIDYMLLPELLNEYGLRFNQALIIVENNEGSGQVVAD 594 Query: 386 YLEML 390 L+ Sbjct: 595 ILKRD 599 >gi|186896569|ref|YP_001873681.1| hypothetical protein YPTS_3269 [Yersinia pseudotuberculosis PB1/+] gi|186699595|gb|ACC90224.1| protein of unknown function DUF264 [Yersinia pseudotuberculosis PB1/+] Length = 725 Score = 51.3 bits (121), Expect = 4e-04, Method: Composition-based stats. Identities = 20/144 (13%), Positives = 43/144 (29%), Gaps = 18/144 (12%) Query: 279 VTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA----------PLIMGCDIAEEG 328 QF F + +E+ L + + + G D A G Sbjct: 389 AFNQLYMCQFVDSGDCVF-KFDQLEKCLTNVSTWEDHDVNAMRPFGNREVWAGYDPARTG 447 Query: 329 GDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGAR 382 + V++ + H+ W + +I + +Y I ID G Sbjct: 448 DTASFVLVAPPQVDGEPFRVLHIETWHGFAFKYQVGRIKEYMTRYNITHIGIDTTGIGGP 507 Query: 383 TCDYLEML-GYHVYRVLGQKRAVD 405 C+ ++ V ++ + + + Sbjct: 508 VCEMVQDFARREVTQIHYSQESKN 531 >gi|331677171|ref|ZP_08377867.1| putative phage terminase [Escherichia coli H591] gi|331075860|gb|EGI47158.1| putative phage terminase [Escherichia coli H591] Length = 436 Score = 51.3 bits (121), Expect = 4e-04, Method: Composition-based stats. Identities = 70/417 (16%), Positives = 129/417 (30%), Gaps = 62/417 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140 AG G GKT + + + PGI+ A + Q++ + EV+ L + Sbjct: 28 AGFGSGKTWVGCGGICKGIWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 87 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 + + + + CR S E+P T VG + DE Sbjct: 88 GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 128 Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248 P I+ + + + R I + YE F K + + Sbjct: 129 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 188 Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 Q T E + + ++ Y ++ + + G+F + + N Sbjct: 189 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLLGRFTNLTSGTVYH-QFDRKLNN 245 Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGP---VIEHLFDWSKTDLRTTNNKISGLV 364 E P P+ +G D V VLR G V E + + D+ + L Sbjct: 246 CEEVEQPGEPIYIGMDFNVGKMAGIVHVLRLGLPCAVTEIINAYDTPDMIRLIKERFWLY 305 Query: 365 E-----KYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412 + K R I DA+ + A T D L+ G++V V+ + + Sbjct: 306 DGHDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 363 Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS 469 + + + + SL+ + + G E + G + + Sbjct: 364 NA-MFCNANGERRYKVNVKRCPVYAE--SLEQQVWDDKG----EPDKKSGNDHPNDA 413 >gi|300825021|ref|ZP_07105118.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 119-7] gi|300522485|gb|EFK43554.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 119-7] Length = 435 Score = 51.3 bits (121), Expect = 4e-04, Method: Composition-based stats. Identities = 70/417 (16%), Positives = 129/417 (30%), Gaps = 62/417 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140 AG G GKT + + + PGI+ A + Q++ + EV+ L + Sbjct: 27 AGFGSGKTWVGCGGICKGIWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 86 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 + + + + CR S E+P T VG + DE Sbjct: 87 GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 127 Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248 P I+ + + + R I + YE F K + + Sbjct: 128 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 187 Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 Q T E + + ++ Y ++ + + G+F + + N Sbjct: 188 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLLGRFTNLTSGTVYH-QFDRKLNN 244 Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGP---VIEHLFDWSKTDLRTTNNKISGLV 364 E P P+ +G D V VLR G V E + + D+ + L Sbjct: 245 CEEVEQPGEPIYIGMDFNVGKMAGIVHVLRLGLPCAVTEIINAYDTPDMIRLIKERFWLY 304 Query: 365 E-----KYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412 + K R I DA+ + A T D L+ G++V V+ + + Sbjct: 305 DGHDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 362 Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS 469 + + + + SL+ + + G E + G + + Sbjct: 363 NA-MFCNANGERRYKVNVKRCPVYAE--SLEQQVWDDKG----EPDKKSGNDHPNDA 412 >gi|193062487|ref|ZP_03043581.1| terminase [Escherichia coli E22] gi|192931609|gb|EDV84209.1| terminase [Escherichia coli E22] Length = 433 Score = 51.3 bits (121), Expect = 4e-04, Method: Composition-based stats. Identities = 70/417 (16%), Positives = 129/417 (30%), Gaps = 62/417 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140 AG G GKT + + + PGI+ A + Q++ + EV+ L + Sbjct: 25 AGFGSGKTWVGCGGICKGIWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 84 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 + + + + CR S E+P T VG + DE Sbjct: 85 GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 125 Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248 P I+ + + + R I + YE F K + + Sbjct: 126 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 185 Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 Q T E + + ++ Y ++ + + G+F + + N Sbjct: 186 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLLGRFTNLTSGTVYH-QFDRKLNN 242 Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGP---VIEHLFDWSKTDLRTTNNKISGLV 364 E P P+ +G D V VLR G V E + + D+ + L Sbjct: 243 CEEVEQPGEPIYIGMDFNVGKMAGIVHVLRLGLPCAVTEIINAYDTPDMIRLIKERFWLY 302 Query: 365 E-----KYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412 + K R I DA+ + A T D L+ G++V V+ + + Sbjct: 303 DGHDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 360 Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS 469 + + + + SL+ + + G E + G + + Sbjct: 361 NA-MFCNANGERRYKVNVKRCPVYAE--SLEQQVWDDKG----EPDKKSGNDHPNDA 410 >gi|251810445|ref|ZP_04824918.1| large terminase subunit [Staphylococcus epidermidis BCM-HMP0060] gi|251806049|gb|EES58706.1| large terminase subunit [Staphylococcus epidermidis BCM-HMP0060] Length = 420 Score = 50.9 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 45/338 (13%), Positives = 109/338 (32%), Gaps = 40/338 (11%) Query: 56 LEFMEVVDAHCLNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114 ++ E++ H + + GRG GK++ + ++ + R ++ + + Sbjct: 4 IKLSELLPKHFHSLWKATKDREKLNIVAKGGRGSGKSSDISIIIT-QLIMRYPMNAVVVR 62 Query: 115 NSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCR 174 ++ L T+++ ++ + H F+++ + + + R Sbjct: 63 KTDNTLATSVFEQIKWAIEEQKVSHLFKVKVS------------PMEITYVPRGNRIIFR 110 Query: 175 TYSEERPDTFVGHHNTY---GMAIINDEASG-TPDVINLGILGFL---TERNANRFWIMT 227 + P+ ++ + I + A T D + L + + + Sbjct: 111 GA--QNPERLKSLKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDDGLFYKFFFS 168 Query: 228 SNPRRLSGKF----YEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283 NP + + YE +P + + I F + + + R E Sbjct: 169 YNPPKRKQSWVNKKYETSFQPDNTFVHHS-TYLDNPFISKQFIQEAESAKERNEQRYRWE 227 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRG 339 G+ +P N ++ + D + + G D D V ++ Sbjct: 228 YMGEAIGS---GVVPFNNLQIETIPQEMIDGFDNIRNGLDFG-YADDPLAFVRWHYDKKK 283 Query: 340 PVIEHLFDWSKTDL--RTTNNKISGLVEKYRPDAIIID 375 VI + ++ + R N++ KY+ D I D Sbjct: 284 RVIYAIDEYYGVQISNRQYANEMWK--RKYQSDDIYAD 319 >gi|332768290|gb|EGJ98475.1| hypothetical protein SF293071_0834 [Shigella flexneri 2930-71] Length = 65 Score = 50.9 bits (120), Expect = 5e-04, Method: Composition-based stats. Identities = 11/55 (20%), Positives = 19/55 (34%), Gaps = 4/55 (7%) Query: 433 SGLIQNLKSLKSFIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 L L + G + +ESK+ + S + +D + FA D Sbjct: 3 EKLKLELTT-PHRDFDRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 56 >gi|330996205|ref|ZP_08320095.1| phage terminase, large subunit, PBSX family [Paraprevotella xylaniphila YIT 11841] gi|329573709|gb|EGG55300.1| phage terminase, large subunit, PBSX family [Paraprevotella xylaniphila YIT 11841] Length = 430 Score = 50.9 bits (120), Expect = 5e-04, Method: Composition-based stats. Identities = 59/349 (16%), Positives = 111/349 (31%), Gaps = 38/349 (10%) Query: 76 EVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLL 135 E F I+ GRG GK+ A T A+S + T+ VS LS++ Sbjct: 22 EHFIILITGGRGSGKSFNAA--TFIERLTFEQSRDRTFAHSILYCRYTM---VSANLSII 76 Query: 136 PN-KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 P + ++ S + SD+++ G +T S + H Sbjct: 77 PEIQEKIDIDGGSKYFKTTRSDIVNMFSGGRIMFRGI--KTSSGNQTAKLKSIHG--ITT 132 Query: 195 IINDEA-SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-- 251 + DEA T + I+ + ++ I+ NP + Y+ + + Sbjct: 133 FVCDEAEEWTNEQDFDKIMLSIRQKGIQNRIIIIMNPTDSNHFIYKKYIENTHKLVEIDG 192 Query: 252 ---QIDT------------RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF 296 QI T ++ + P F + + + + V G++ + Sbjct: 193 VPVQISTHPNVLHIHTTYFDNIDNLSPQFIKEVEQMKAENPEKYAHTVIGRWADVAEGA- 251 Query: 297 IPLNIIEEALNREPCPDPYAPLI-MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRT 355 I + + I +G D D T +V+ + D + Sbjct: 252 -----IYKKWGVVKSIPQWCKKIALGLDFG-FTHDETAIVMCGVMDNDLYIDEICYKTQM 305 Query: 356 TNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAV 404 I + Y+ +I D+ + R + G +Y V K +V Sbjct: 306 LTKDIIQTLRPYQGMKVIADSAD--PRLIQEIHNAGIRIYPVEKGKGSV 352 >gi|238025823|ref|YP_002910054.1| phage terminase, ATPase subunit [Burkholderia glumae BGR1] gi|237875017|gb|ACR27350.1| Phage terminase, ATPase subunit [Burkholderia glumae BGR1] Length = 589 Score = 50.9 bits (120), Expect = 5e-04, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 38/142 (26%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 E + RY +D + QF + S L ++ + P Sbjct: 350 LERLRRRY--SADAFANLLMCQFIDDSV-SVFKLAELQRCMVDSWEEWADDFSPLLLRPF 406 Query: 314 PYAPLIMGCDIAEEGGDN--TVVVLRR----GPVIEHLFDWSKTDLRTTNNKISGLVEKY 367 Y + +G D A G VV R + + D I + +Y Sbjct: 407 GYREVWVGYDPALTGDSAGLVVVAPPRVEGGTFRVLERHQFRGNDFEEQAAAIEQITRRY 466 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488 >gi|167567112|ref|ZP_02360028.1| phage terminase, ATPase subunit [Burkholderia oklahomensis EO147] Length = 589 Score = 50.9 bits (120), Expect = 5e-04, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + + QF + S L+ ++ + P Sbjct: 350 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367 + + +G D A G +VV+ G L + D I + ++Y Sbjct: 407 GHREVWVGYDPALTGDSAGLVVVAPPRVDGGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 467 HVGYIAIDTTGMGQGVYQLVRK 488 >gi|145298582|ref|YP_001141423.1| phage terminase ATPase subunit [Aeromonas salmonicida subsp. salmonicida A449] gi|145300715|ref|YP_001143556.1| phage terminase ATPase subunit [Aeromonas salmonicida subsp. salmonicida A449] gi|142851354|gb|ABO89675.1| phage terminase ATPase subunit [Aeromonas salmonicida subsp. salmonicida A449] gi|142853487|gb|ABO91808.1| phage terminase ATPase subunit [Aeromonas salmonicida subsp. salmonicida A449] Length = 606 Score = 50.9 bits (120), Expect = 5e-04, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 46/139 (33%), Gaps = 19/139 (13%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP-------- 317 E + Y +V +F D S +E A + Y P Sbjct: 365 IEELKDEY--PEEVFDRLYLCRFID-DALSVFKFQDMERAGVDPTRWEDYKPGRPDPFGR 421 Query: 318 --LIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369 + +G D + + T+VV+ + W + + +I+ + +K+R Sbjct: 422 REVWLGYDPSRTRDNATLVVVAPPTVAGERFRVLEKHYWRGLNFQYQAQEITRIAKKFRV 481 Query: 370 DAIIIDANNTGARTCDYLE 388 + +D + G D L+ Sbjct: 482 TYLGVDVSGIGVGVFDLLK 500 >gi|188532717|ref|YP_001906514.1| Terminase, ATPase subunit [Erwinia tasmaniensis Et1/99] gi|188027759|emb|CAO95616.1| Terminase, ATPase subunit [Erwinia tasmaniensis Et1/99] Length = 588 Score = 50.9 bits (120), Expect = 5e-04, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 50/143 (34%), Gaps = 23/143 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314 E + RY + + + F D+ S L ++++ + P Sbjct: 348 LEQLRTRYSPED--YQNLLMCVF-MDDLASVFQLAMLQKCMVDSWEVWDDFEALALRPFG 404 Query: 315 YAPLIMGCDIAE--EGGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365 + + +G D A+ + GD+ V+ G L W D R + I L + Sbjct: 405 WKEVWIGYDPAKGTQNGDSAGCVVIAPPAVPGGKFRILERHQWRGMDFRAQADAIKTLTQ 464 Query: 366 KYRPDAIIIDANNTGARTCDYLE 388 +Y I ID+ G + ++ Sbjct: 465 QYNVTYIGIDSTGVGLGVYENVK 487 >gi|219872451|ref|YP_002476937.1| phage terminase, large subunit, pbsx family [Borrelia garinii PBr] gi|219694305|gb|ACL34832.1| phage terminase, large subunit, pbsx family [Borrelia garinii PBr] Length = 450 Score = 50.9 bits (120), Expect = 5e-04, Method: Composition-based stats. Identities = 60/332 (18%), Positives = 108/332 (32%), Gaps = 51/332 (15%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLM----STRP-GIS 109 Q E + +++H + V +F G I++ GKT L ++L++ + S + Sbjct: 49 QKEVLFDIESHDYSKV------IFSGGIAS----GKTFLASYLLIKKLIENKSFYEKDTN 98 Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169 + NS L T ++ K + + + + L I Sbjct: 99 NFIIGNSIGLLMTNTIKQIEKICG------FLGIDYQKKKSGESFCKIAGLELNI----- 147 Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229 Y + D+F I +EA+ L ++ L R I +N Sbjct: 148 ------YGGKNRDSFSKIRGGNSAIIYVNEATVIHKETLLEVIKRL--RKGKAIIIFDTN 199 Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQF 288 P + F F + D +K + T F E Y + V G++ Sbjct: 200 PEGPTHFFKTDFIENKDVFKTYNFTTYDNPLNSADFIETQKKLY-KHLPAYKARVLYGEW 258 Query: 289 PQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVLRRGPVIEHLFD 347 + F E N++ IM D A GGDNT + + E + Sbjct: 259 ILNESTLF-----NEMIFNQDYEFKSP---IMYIDPAFSVGGDNTAICVLE-RAFEKFYA 309 Query: 348 WSKTDLRTTN-----NKISGLVEKYRPDAIII 374 + D + + I L+E + + + I Sbjct: 310 YIYQDQKPVSDSLMLGSIQVLIENFNVNTVYI 341 >gi|329888629|ref|ZP_08267227.1| phage DNA packaging protein [Brevundimonas diminuta ATCC 11568] gi|328847185|gb|EGF96747.1| phage DNA packaging protein [Brevundimonas diminuta ATCC 11568] Length = 411 Score = 50.9 bits (120), Expect = 5e-04, Method: Composition-based stats. Identities = 43/312 (13%), Positives = 85/312 (27%), Gaps = 38/312 (12%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP---RR 232 + +P+ I +EA +L + + W NP Sbjct: 106 WKGGKPEGIKSLEGAG--LTILEEAQEVRQASLDVLLPTILRTAISELW-AIWNPRLDTD 162 Query: 233 LSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD 292 F+ KP R +I+ + E + + D G + Sbjct: 163 PIDVFFRGPVKPKGAIVR-KINYDQNPHFPDALRELMELDFSKDKLRAAWIWLGGYMPSV 221 Query: 293 IDSFIPLNIIEEAL--NREPCPDPYAPLIMGCDIAEEGGDNTVVVL---RRGPVIEHLFD 347 + ++EA R + +++G D + G D +VV G +I Sbjct: 222 QGAIWNREGLDEAWREGRHAPEGSWGRVVVGVDPSGGGDDVGIVVAAEYGDGAIILEDAT 281 Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLE 407 T + V+++ D ++ + N G L G V+ V Sbjct: 282 CPATSPMAWATATAKAVDRWGADCVVAEKNFGGDMVESTLRAGGVKARVVM-----VTAS 336 Query: 408 FCRNRRTE----LHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGA 463 + R E L+ + + + + + G Sbjct: 337 RGKQVRAEPVAALYDQKR--IRHREQFPLMEAEMLMTTPAGYQ---------------GE 379 Query: 464 KSTDYSDGLMYT 475 S + D L++ Sbjct: 380 DSPNRMDALVWA 391 >gi|294010834|ref|YP_003544294.1| hypothetical protein SJA_C1-08480 [Sphingobium japonicum UT26S] gi|292674164|dbj|BAI95682.1| phage-related protein [Sphingobium japonicum UT26S] Length = 419 Score = 50.9 bits (120), Expect = 5e-04, Method: Composition-based stats. Identities = 64/428 (14%), Positives = 119/428 (27%), Gaps = 92/428 (21%) Query: 89 GKTTLNAWLVLWLMST--RPGISVICLANSETQLKTTLWAEVSKWL-------SLLPNKH 139 GK+ A L M G + + + + T +++LW + W+ S P + Sbjct: 17 GKS--IAILYYIFMRCLLYAGSTHLIVRRTRTACESSLWRQTLNWMLDHMADPSGAPLRE 74 Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 ++ S L ++ + + E R D +G T I +E Sbjct: 75 KVKLNSSDL--IAYFDNGSYIMFD-----------GLDENRLDKVLG---TEYQTIWMNE 118 Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF-------YEIFNKPLDDWKRFQ 252 S + G L +P ++ + DW Sbjct: 119 VSEFDWSDVQQLAGCLN-----------GSPTHNDNGLPIVRKMVFDCNPRFESDWDCKV 167 Query: 253 IDTRTVEGIDPSF--------------HEGIIARYGLDSDVTRVEVC-GQFPQQDIDSFI 297 + E +A Y TR G + Q+ ++ Sbjct: 168 FRDGQNPVNNQPLNDVQKYGKVKVQNVDEEYLAIYANADPRTRARYLDGDWSAQNDNAIF 227 Query: 298 PLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNT------VVVLRRGPVIEHLFDWSKT 351 L+ E +++G D A + + V + + K Sbjct: 228 DLDNFERNRRFGIFAKDLERIVIGVDPASKSKKESDLTGIIVAGMLKDEAYILADLTGKY 287 Query: 352 DLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML--GYHVYRVLGQKRAVDLEFC 409 K++ + Y+ D+II++ NN G + L V +V + + Sbjct: 288 TPEQVAQKVTEAFDTYQADSIIVETNNGGDWIENGLRQYAPNLPVKQVTASRGKLT---- 343 Query: 410 RNRRTE--LHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTD 467 R E + D + N S L + + AKS D Sbjct: 344 ---RAEPIALIYAQDKVHHVGH-NLSELETQM-----YEF---------GMERGAAKSPD 385 Query: 468 YSDGLMYT 475 D L++ Sbjct: 386 RMDALVWA 393 >gi|329735579|gb|EGG71866.1| phage terminase, large subunit, PBSX family [Staphylococcus epidermidis VCU028] Length = 420 Score = 50.9 bits (120), Expect = 5e-04, Method: Composition-based stats. Identities = 45/338 (13%), Positives = 109/338 (32%), Gaps = 40/338 (11%) Query: 56 LEFMEVVDAHCLNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114 ++ E++ H + + GRG GK++ + ++ + R ++ + + Sbjct: 4 IKLSELLPKHFHSLWKATKDRKKLNVVAKGGRGSGKSSDISIIIT-QLIMRYPMNAVVVR 62 Query: 115 NSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCR 174 ++ L T+++ ++ + H F+++ + + + R Sbjct: 63 KTDNTLATSVFEQIKWAIEEQKVSHLFKVKVS------------PMEITYVPRGNRIIFR 110 Query: 175 TYSEERPDTFVGHHNTY---GMAIINDEASG-TPDVINLGILGFL---TERNANRFWIMT 227 + P+ ++ + I + A T D + L + + + Sbjct: 111 GA--QNPERLKSLKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDDGLFYKFFFS 168 Query: 228 SNPRRLSGKF----YEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283 NP + + YE +P + + I F + + + R E Sbjct: 169 YNPPKRKQSWVNKKYETSFQPDNTFVHHS-TYLDNPFISKQFIQEAESTKERNELRYRWE 227 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRG 339 G+ +P N ++ + D + + G D D V ++ Sbjct: 228 YMGEAIGS---GVVPFNNLQIETIPQEMIDGFDNIRNGLDFG-YADDPLAFVRWHYDKKK 283 Query: 340 PVIEHLFDWSKTDL--RTTNNKISGLVEKYRPDAIIID 375 VI + ++ + R N++ KY+ D I D Sbjct: 284 RVIYAIDEYYGVQISNRQYANEMWK--RKYQSDDIYAD 319 >gi|251782540|ref|YP_002996842.1| phage terminase [Streptococcus dysgalactiae subsp. equisimilis GGS_124] gi|242391169|dbj|BAH81628.1| phage terminase [Streptococcus dysgalactiae subsp. equisimilis GGS_124] Length = 476 Score = 50.9 bits (120), Expect = 5e-04, Method: Composition-based stats. Identities = 53/347 (15%), Positives = 111/347 (31%), Gaps = 47/347 (13%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQ + + A + + + GKT + + LW + G+ ++ Sbjct: 48 PWQENMLIPIMAVDEDGLWVHQKYGYAIPRRN----GKTEVVYIVELWAL--HKGLKILH 101 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 A+ + + +V K+L + + + + + A + + G + + Sbjct: 102 TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKASGAVIQFRT-- 155 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232 RT + + F +I DEA + +T+ + N IM P Sbjct: 156 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 205 Query: 233 --LSGKFYEIFNKP-------LDDWKRFQI-------DTRTVEGIDPS-----FHEGIIA 271 +G +E + K W + + D + +PS I A Sbjct: 206 MVSTGTVFEAYRKDCLKGNKRYSGWAEWSVPEMVKINDVSSWYIANPSMGFHLNERKIEA 265 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330 G D ++ G +P + S I + L E P+ + L +G ++G + Sbjct: 266 ELGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 323 Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 + + R + +R I ++ ++ID Sbjct: 324 VSLSIAARTSENKVFVEVIDCLSVRNGTQWIINFLKSADIAKVVIDG 370 >gi|167584288|ref|ZP_02376676.1| bacteriophage terminase, ATPase subunit [Burkholderia ubonensis Bu] Length = 520 Score = 50.5 bits (119), Expect = 6e-04, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 54/224 (24%), Gaps = 47/224 (20%) Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDDWKRFQIDTRTVEG 260 +N +++ + S P +S + Y E +N+ +D Sbjct: 196 ELNKVAQAMASQKQWRKT--YFSTPSSISHQAYPFWSGEAYNRGRAKADHIHLDISHAAL 253 Query: 261 IDPSFHEGIIAR----------------------YGLDSDVTRVEVCGQFPQQDIDSFIP 298 E R +D QF S Sbjct: 254 SGGRLCEDRQWRQIVTIEDAAAMGCDLFDLDELRLENSADDFAQLYLCQFIDDSA-SIFK 312 Query: 299 LNIIEEALNRE-----------PCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIE 343 I+ + P + P+ +G D A G +V++ G Sbjct: 313 FADIQRCMIDSWEEWDDVEFLIQRPFGHRPVWLGYDPALSGDSAGLVIVAPPAVPGGKFR 372 Query: 344 HLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385 L W D I L E+Y + ID G Sbjct: 373 VLEKMQWRGMDFEAQAESIRQLTERYTVTYMAIDTTGIGQGVYQ 416 >gi|300922729|ref|ZP_07138820.1| hypothetical protein HMPREF9548_00965 [Escherichia coli MS 182-1] gi|300420946|gb|EFK04257.1| hypothetical protein HMPREF9548_00965 [Escherichia coli MS 182-1] Length = 181 Score = 50.5 bits (119), Expect = 6e-04, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 47/145 (32%), Gaps = 19/145 (13%) Query: 317 PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPD 370 P+ +G D + G VVL G L W D T I L EKY + Sbjct: 1 PVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYNVE 60 Query: 371 AIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI 430 I IDA G + A D+ + +T + +K D + + Sbjct: 61 YIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG-CL 110 Query: 431 NHSGLIQNLKS---LKSFIVPNTGE 452 + ++ S + ++G Sbjct: 111 EYDVSATDITSSFMAIRKTMTSSGR 135 >gi|282534188|gb|ADA82296.1| putative terminase [Escherichia phage K1H] gi|282535239|gb|ADA82445.1| putative terminase [Escherichia phage K1ind3] Length = 416 Score = 50.5 bits (119), Expect = 7e-04, Method: Composition-based stats. Identities = 65/413 (15%), Positives = 117/413 (28%), Gaps = 63/413 (15%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 G G GKT + +L M PG + + ++ + + + + Sbjct: 24 GGFGSGKTFVGCLDLLTFMLKHPGTRLGYFGPTYPAIRDIFYP------TFEEAANLLGL 77 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 L D + + +CR S + P + VG A + DE Sbjct: 78 DVLVKS-----GDKEVVVTRGKTVLGTVICR--SMDNPGSIVGF---KIAAAVVDELDVL 127 Query: 204 ----PDVINLGILG--FLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT-R 256 ++ I+ L +T+ P + + P + Q T Sbjct: 128 SREKAELAWNKIVARMRLVIPGVTNHISVTTTPEGFKFVYAKFKENPTPSYSMVQASTHE 187 Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-----EPC 311 + P + + Y + + + G+F S + A +R + Sbjct: 188 NARFLPPDYISSLTETY--PAQLINAYLNGEFVNLTSGS------VYYAYDRRKHRSKET 239 Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371 P L +G D + V V R+ D T I+ K + Sbjct: 240 IQPGDTLYIGQDFNVTKNASAVYVQRKDGWHAVAELKGLFDTPDTVRVITEKW-KSQGHR 298 Query: 372 III--DANNTGART-------CDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422 I++ DA+ +T L+ G+ V D N Sbjct: 299 IVVYPDASGKNRKTNSASISDIALLQQAGFDVRAKSANPPVKDRVLAVN----------T 348 Query: 423 WLEFASLINHSGLIQNL-KSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMY 474 LE L + L + K+L+ + G E + +D L Y Sbjct: 349 ALEKGKLWVNDHLCPEIAKTLEQQAYDDNG----EPAKDGIID--HMADALGY 395 >gi|84393331|ref|ZP_00992091.1| putative phage gene [Vibrio splendidus 12B01] gi|84376047|gb|EAP92935.1| putative phage gene [Vibrio splendidus 12B01] Length = 590 Score = 50.5 bits (119), Expect = 7e-04, Method: Composition-based stats. Identities = 19/156 (12%), Positives = 49/156 (31%), Gaps = 18/156 (11%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSF----IPLNIIEEALNREPCPDPYAP---- 317 + + Y + F + F + +++ A ++ P P Sbjct: 353 IDELREEYSQED--FNNLFMCMFVDGALSVFKFSDLEKGMVDAAHWQDFKPKNKQPFARR 410 Query: 318 -LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPD 370 + +G D + + +VV+ G L W + + ++I + ++Y+ Sbjct: 411 EVWLGYDPSRTRDNACLVVVAPPAVAGEKFRVLEKHYWKGLNFQYHVSEIDKVFQRYKVT 470 Query: 371 AIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAVD 405 I +D G D + + + + + Sbjct: 471 YIGVDTTGIGGGVWDLISKKYPREAHAIHYSNENKN 506 >gi|311993449|ref|YP_004010314.1| gp17 terminase DNA packaging enzyme large subunit [Acinetobacter phage Acj9] gi|295917406|gb|ADG60077.1| gp17 terminase DNA packaging enzyme large subunit [Acinetobacter phage Acj9] Length = 609 Score = 50.5 bits (119), Expect = 7e-04, Method: Composition-based stats. Identities = 62/365 (16%), Positives = 113/365 (30%), Gaps = 62/365 (16%) Query: 52 RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVI 111 R +Q + ++++ + ++ V K + +GKTT+ A + W ++ Sbjct: 141 RDYQKDMLKIMAENRMS--------VSKLSRQ----LGKTTVVAIFLAWFACFNKDKNIG 188 Query: 112 CLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYST 171 LA+ + + AEV L L W + G Y+ Sbjct: 189 ILAHKGS-----MSAEV---LDRTKQAIELLPDFLQPGIVEWNKGSIELDNGSAISAYAA 240 Query: 172 MCRTYSEERPDTFVGHHNTYGMAIINDEASGTP--DVINLGILGFLTERNANRFWIMTSN 229 PD G I DE + P + L I ++ + I+T+ Sbjct: 241 --------SPDAVRG---NSFSLIYIDECAFIPNFNDAWLAIQPVISS-GRHSKIIITTT 288 Query: 230 PRRLSGKFYEIFN----------KPLDDWKRFQIDTRTVEGI-DPSFHEGIIARYGLDSD 278 P ++ FY+I+ +W + + + D +H G + Sbjct: 289 PNGMN-HFYDIWTAAVEGISGFVPYESEWNAVKERLYDDKDVFDDGWHFSFTTIGGSSVE 347 Query: 279 VTRVEVCGQFPQQDIDSFI-----------PLNIIEEALNREPCPDPYAPLIMGCDIAEE 327 R E G F + P+N+ + + PDP I D AE Sbjct: 348 QFRQEHVGVFAGGQGTLSLRHETCNLKLETPINVGDSTFYKYKEPDPTRKYIATLDSAEG 407 Query: 328 -GGDNTVVVLRRGPVIE----HLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGAR 382 G D + + + + + + I + Y I I+ N+TG Sbjct: 408 RGQDYHCMNIIDVTDEQWEQVAVLHSNTISHLILPDIILKHLLDYNEAPIYIELNSTGVS 467 Query: 383 TCDYL 387 L Sbjct: 468 IAKTL 472 >gi|223935635|ref|ZP_03627551.1| protein of unknown function DUF264 [bacterium Ellin514] gi|223895643|gb|EEF62088.1| protein of unknown function DUF264 [bacterium Ellin514] Length = 437 Score = 50.5 bits (119), Expect = 7e-04, Method: Composition-based stats. Identities = 57/348 (16%), Positives = 99/348 (28%), Gaps = 54/348 (15%) Query: 137 NKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS-EERPDTFVGHHNTYGMAI 195 K W ++ A + T R YS P+ G + Sbjct: 71 CKEWARTMQIAEPDADEVVFDSKTDFSAHVLQFKTGLRIYSLSSNPNALAGKRGH----V 126 Query: 196 INDEASGTPDV--INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN--KPLDD---W 248 I DE + D + T + S R + F E+ K + W Sbjct: 127 ILDEFALHADQRLLYRIAKPVTTWGGQ---LEIISTHRGANSVFNEMIRGIKENGNKMGW 183 Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDS--DVTRVEVCGQ-------------FPQQDI 293 ++ I E I A+ G + + V + P + Sbjct: 184 SHHKVTLHDA--IAEGLVERINAKTGRNESREAYLARVESECLDQEQWLQEYCCVPADET 241 Query: 294 DSFIPLNIIEEALNREPCPDPY-----APLIMGCDIAEEGGDNTV--VVLRRGPVI--EH 344 +FI ++I + Y PL +G D+ + D TV V + G VI Sbjct: 242 SAFITYDMISGCEDDCLKDFNYLAECKNPLYLGVDVGRK-RDLTVMDVGEKIGDVIWDRL 300 Query: 345 LFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML-GYHVYRVLGQKRA 403 + ++ L+ + IDA G + + G+ V V+ Sbjct: 301 RIEMQGRTFAEQEFELERLLALPKLRRACIDATGIGMQLAERARERFGWKVEPVMFTAP- 359 Query: 404 VDLEFCRNRRTELHVKMADWLE--FASLINHSGLIQNLKSLKSFIVPN 449 + EL + E + L +L+ +K I + Sbjct: 360 --------MKEELAFPLRGAFEDRTLRIARDPQLRADLRGIKKEITTS 399 >gi|83950455|ref|ZP_00959188.1| Putative large terminase [Roseovarius nubinhibens ISM] gi|83838354|gb|EAP77650.1| Putative large terminase [Roseovarius nubinhibens ISM] Length = 434 Score = 50.5 bits (119), Expect = 7e-04, Method: Composition-based stats. Identities = 58/427 (13%), Positives = 117/427 (27%), Gaps = 75/427 (17%) Query: 81 AISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTT-LWAEVSK 130 AI GRG GKT A W+ + G + + + Q++ ++ E S Sbjct: 41 AIMGGRGAGKTRAGA---EWVRAAVEGATPGAPGRCRRIALVGETVDQVREVMIFGE-SG 96 Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT 190 L+ P E Q+ +S P+ G Sbjct: 97 ILACSPPDRRPEWQASR---------------RRLVWPNGAEAAVFSAHDPEALRGPQFD 141 Query: 191 YGMAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDW 248 DE + + L + + I + R G ++ P Sbjct: 142 GA---WLDEMAKWKKARATWDMLQFALRLGDDPQ--ICVTTTPRNVGVLKDVLAAPSTV- 195 Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308 + SF + ARY + + + R E+ G ++ + L ++ A R Sbjct: 196 VTQAPTEANRAHLAESFLAEVRARY-VGTRLGRQELDGILLEEAEGALWSLAALDAA--R 252 Query: 309 EPCPDPYAPLIMGCDI---AEEGGDNTVVVLRRGPVIEHLFDWS----------KTDLRT 355 + +++ D G D +++ + +W Sbjct: 253 VSKLPELSRIVVAVDPPVTGHAGSDECGIIVAGVDQSGPVQEWRAYVLADRSVSGMSPTG 312 Query: 356 TNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRR 413 +E++ D ++ + N G L + V + R Sbjct: 313 WAGAAIRAMEEFGADKMVAEVNQGGDLVETVLRQIDPLIPFRGVHAS-------RGKQAR 365 Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLM 473 E + + + L ++++ + G S D +D L+ Sbjct: 366 AEPVAALYEQGRVHHMAGLDRLEDQMRAMTTRGYEGRG-------------SPDRADALV 412 Query: 474 YTFAENP 480 + E Sbjct: 413 WALHELV 419 >gi|284921252|emb|CBG34318.1| phage protein [Escherichia coli 042] gi|323942326|gb|EGB38497.1| terminase [Escherichia coli E482] Length = 433 Score = 50.5 bits (119), Expect = 7e-04, Method: Composition-based stats. Identities = 69/417 (16%), Positives = 123/417 (29%), Gaps = 62/417 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140 AG G GKT + + + PGI+ A + Q++ + EV+ L + Sbjct: 25 AGFGSGKTWVGCGGICKGIWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 84 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 + + + + CR S E+P T VG + DE Sbjct: 85 GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 125 Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248 P I+ + + + R I + YE F K + + Sbjct: 126 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 185 Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 Q T E + + ++ Y ++ + + GQF + + N Sbjct: 186 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLRGQFTNLTSGTVYH-QFDRKLNN 242 Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL---- 363 E P P+ +G D V VLR G + D I Sbjct: 243 CEEVEQPGEPIYIGMDFNVGKMAGIVHVLRLGLPCAVTEIINAYDTPDMIRIIKERFWLY 302 Query: 364 ----VEKYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412 K R I DA+ + A T D L+ G++V V+ + + Sbjct: 303 DGNDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 360 Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS 469 + + + + SL+ + + G E + G + + Sbjct: 361 NA-MFCNANGERRYKVNVKRCPVYAE--SLEQQVWDDKG----EPDKKSGNDHPNDA 410 >gi|117623621|ref|YP_852534.1| putative phage terminase [Escherichia coli APEC O1] gi|331672908|ref|ZP_08373694.1| putative phage terminase [Escherichia coli TA280] gi|115512745|gb|ABJ00820.1| putative phage terminase [Escherichia coli APEC O1] gi|309701027|emb|CBJ00325.1| phage protein [Escherichia coli ETEC H10407] gi|331070129|gb|EGI41498.1| putative phage terminase [Escherichia coli TA280] Length = 436 Score = 50.5 bits (119), Expect = 7e-04, Method: Composition-based stats. Identities = 69/417 (16%), Positives = 123/417 (29%), Gaps = 62/417 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140 AG G GKT + + + PGI+ A + Q++ + EV+ L + Sbjct: 28 AGFGSGKTWVGCGGICKGIWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 87 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 + + + + CR S E+P T VG + DE Sbjct: 88 GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 128 Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248 P I+ + + + R I + YE F K + + Sbjct: 129 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 188 Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 Q T E + + ++ Y ++ + + GQF + + N Sbjct: 189 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLRGQFTNLTSGTVYH-QFDRKLNN 245 Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL---- 363 E P P+ +G D V VLR G + D I Sbjct: 246 CEEVEQPGEPIYIGMDFNVGKMAGIVHVLRLGLPCAVTEIINAYDTPDMIRIIKERFWLY 305 Query: 364 ----VEKYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412 K R I DA+ + A T D L+ G++V V+ + + Sbjct: 306 DGNDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 363 Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS 469 + + + + SL+ + + G E + G + + Sbjct: 364 NA-MFCNANGERRYKVNVKRCPVYAE--SLEQQVWDDKG----EPDKKSGNDHPNDA 413 >gi|319896520|ref|YP_004134713.1| phage terminase atpase subunit protein [Haemophilus influenzae F3031] gi|317432022|emb|CBY80370.1| Probable phage terminase ATPase subunit protein [Haemophilus influenzae F3031] Length = 556 Score = 50.5 bits (119), Expect = 7e-04, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 59/194 (30%), Gaps = 25/194 (12%) Query: 265 FHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN---------REPCPDPY 315 E + RY + F +++ ++ + P Sbjct: 367 NIEKLKQRYSKY--AFNQLYMCVWIDDADSIFTVHQLLKCGVDISKWKDFNPKADRPFGD 424 Query: 316 APLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369 + G D A G + V++ + + W N+I L EKY Sbjct: 425 REVWGGFDPAHSGDGASFVIIAPPALPSEKYRVLARYQWQGLSYVYQANQIRALYEKYNM 484 Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL 429 I IDA G + ++ ++ A + + +T + +K+ D +E + Sbjct: 485 TYIGIDATGVGYGVYELVKE--------FARRAATAIIYNPESKTGMVLKVHDLVEHGQI 536 Query: 430 INHSGLIQNLKSLK 443 + L + + Sbjct: 537 EWSEKELDILSTNQ 550 >gi|282533135|gb|ADA82244.1| putative terminase [Escherichia phage K1G] Length = 416 Score = 50.1 bits (118), Expect = 8e-04, Method: Composition-based stats. Identities = 65/413 (15%), Positives = 117/413 (28%), Gaps = 63/413 (15%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 G G GKT + +L M PG + + ++ + + + + Sbjct: 24 GGFGSGKTFVGCLDLLTFMLKHPGTRLGYFGPTYPAIRDIFYP------TFEEAANLLGL 77 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 L D + + +CR S + P + VG A + DE Sbjct: 78 DVLVKS-----GDKEVVVTRGKTVLGTVICR--SMDNPGSIVGF---KIAAAVVDELDVL 127 Query: 204 ----PDVINLGILG--FLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT-R 256 ++ I+ L +T+ P + + P + Q T Sbjct: 128 SREKAELAWNKIVARMRLVIPGVINHISVTTTPEGFKFVYAKFKENPTPSYSMVQASTHE 187 Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-----EPC 311 + P + + Y + + + G+F S + A +R + Sbjct: 188 NARFLPPDYISSLTETY--PAQLINAYLNGEFVNLTSGS------VYYAYDRRKHRSKEV 239 Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371 P L +G D + V V R+ D T I+ K + Sbjct: 240 IQPGDTLYIGQDFNVTKNASAVYVQRKDGWHAVAELKGLFDTPDTVRVITEKW-KSQGHR 298 Query: 372 III--DANNTGART-------CDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422 I++ DA+ +T L+ G+ V D N Sbjct: 299 IVVYPDASGKNRKTNSASISDIALLQQAGFDVRAKSANPPVKDRVLAVN----------T 348 Query: 423 WLEFASLINHSGLIQNL-KSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMY 474 LE L + L + K+L+ + G E + +D L Y Sbjct: 349 ALEKGKLWVNDHLCPEIAKTLEQQAYDDNG----EPAKDGIID--HMADALGY 395 >gi|282547341|gb|ADA82397.1| putative terminase [Escherichia phage K1ind2] Length = 416 Score = 50.1 bits (118), Expect = 8e-04, Method: Composition-based stats. Identities = 65/413 (15%), Positives = 117/413 (28%), Gaps = 63/413 (15%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 G G GKT + +L M PG + + ++ + + + + Sbjct: 24 GGFGSGKTFVGCLDLLTFMLKHPGTRLGYFGPTYPAIRDIFYP------TFEEAANLLGL 77 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 L D + + +CR S + P + VG A + DE Sbjct: 78 DVLVKS-----GDKEVVVTRGKTVIGTVICR--SMDNPGSIVGF---KIAAAVVDELDVL 127 Query: 204 ----PDVINLGILG--FLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT-R 256 ++ I+ L +T+ P + + P + Q T Sbjct: 128 SREKAELAWNKIVARMRLVIPGVINHISVTTTPEGFKFVYAKFKENPTPSYSMVQASTHE 187 Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-----EPC 311 + P + + Y + + + G+F S + A +R + Sbjct: 188 NARFLPPDYISSLTETY--PAQLINAYLNGEFVNLTSGS------VYYAYDRRKHRSKET 239 Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371 P L +G D + V V R+ D T I+ K + Sbjct: 240 IQPGDTLYIGQDFNVTKNASAVYVQRKDGWHAVAELKGLFDTPDTVRVITEKW-KSQGHR 298 Query: 372 III--DANNTGART-------CDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422 I++ DA+ +T L+ G+ V D N Sbjct: 299 IVVYPDASGKNRKTNSASISDIALLQQAGFDVRAKSANPPVKDRVLAVN----------T 348 Query: 423 WLEFASLINHSGLIQNL-KSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMY 474 LE L + L + K+L+ + G E + +D L Y Sbjct: 349 ALEKGKLWVNDHLCPEIAKTLEQQAYDDNG----EPAKDGIID--HMADALGY 395 >gi|332533822|ref|ZP_08409678.1| phage terminase, ATPase subunit [Pseudoalteromonas haloplanktis ANT/505] gi|332036753|gb|EGI73216.1| phage terminase, ATPase subunit [Pseudoalteromonas haloplanktis ANT/505] Length = 382 Score = 50.1 bits (118), Expect = 8e-04, Method: Composition-based stats. Identities = 19/80 (23%), Positives = 30/80 (37%), Gaps = 6/80 (7%) Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPV--IEHLFDWSKTDLRTTNNKISGL 363 P P+++G D A G +V + ++ G + D S D ++I L Sbjct: 198 ERPYGLKPVVIGFDPARFGDKASVAILSAPMKPGEKFLLLEAIDLSGNDFEAMASEIKLL 257 Query: 364 VEKYRPDAIIIDANNTGART 383 EKY I +D G Sbjct: 258 TEKYNVVHIGVDTTGIGYGV 277 >gi|220903520|ref|YP_002478832.1| hypothetical protein Ddes_0239 [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] gi|219867819|gb|ACL48154.1| protein of unknown function DUF264 [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] Length = 615 Score = 50.1 bits (118), Expect = 8e-04, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 50/155 (32%), Gaps = 18/155 (11%) Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN- 307 + +D G + + Y + R +F + F+ L ++E + Sbjct: 361 RIVTLDDAEAGGCNLFNRADLEQEYSPED--MRQLFGCEFIDDTLAVFL-LGLLEGCMED 417 Query: 308 --------REPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDL 353 R+ P A + G D + D + VVL + G I L W Sbjct: 418 PDGWGIDLRQARPVDNAGVWGGYDPSRTRDDASFVVLLPPQKAGDKIRTLERHTWKGKSY 477 Query: 354 RTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 +I L +KYR + ID G + + Sbjct: 478 LWQVGRIRELHDKYRFQHMGIDVTGPGQAVLENVR 512 >gi|294624257|ref|ZP_06702968.1| phage-related terminase [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] gi|292601451|gb|EFF45477.1| phage-related terminase [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] Length = 587 Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 25/142 (17%), Positives = 44/142 (30%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR------------EPCPD 313 + + Y D + +F S PL +++ + P P Sbjct: 349 IDELREEY--SPDAFANLLMCEFVDDGA-SIFPLAMLQPCMVDSWIEWGQDYKPFAPRPY 405 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367 + +G D AE G +VVL + G L + D +I + +Y Sbjct: 406 GDRAVWIGYDPAETGDTAGLVVLAPPQQPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 465 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G+ ++ Sbjct: 466 WVTYIGIDTTGMGSGVAQLVKQ 487 >gi|327198086|ref|YP_004306453.1| gp41 [Burkholderia phage KL3] gi|310657220|gb|ADP02334.1| gp41 [Burkholderia phage KL3] Length = 611 Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 38/142 (26%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 E + RY + + QF + S L ++ + P Sbjct: 372 LERLRRRYSAE--AFANLLMCQFIDDSV-SVFKLAELQRCMVDSWEEWADDFSPLLLRPF 428 Query: 314 PYAPLIMGCDIAEEGGDN--TVVVLRR----GPVIEHLFDWSKTDLRTTNNKISGLVEKY 367 Y + +G D A G VV R + + D I + ++Y Sbjct: 429 GYREVWVGYDPALTGDSAGLVVVAPPRVEGGTFRVLERHQFRGNDFEEQAAAIEQITQRY 488 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 489 NVGYIAIDTTGMGQGVYQLVRK 510 >gi|320535831|ref|ZP_08035911.1| conserved domain protein [Treponema phagedenis F0421] gi|320147321|gb|EFW38857.1| conserved domain protein [Treponema phagedenis F0421] Length = 488 Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 57/356 (16%), Positives = 101/356 (28%), Gaps = 92/356 (25%) Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSN-PRRLSGKFYEIF--NKPLDDWKRF 251 I+ DE + P+ I A + + P G+FY++F K + R+ Sbjct: 99 IVFDEMAIYPENKAEVIYTAGIPVTARGGCVEIGSTPLGKIGRFYDVFIDKKKYRTYNRY 158 Query: 252 QID-----------TRTVEGIDPSFHEGIIARYGLDSDV----------TRVEVCGQFPQ 290 I V E + RYG + + E F Sbjct: 159 TIPWWFSAALCTNVEEAVRNAPAMDTEERVYRYGTPPLIEAFEAMLLEDFQQEFECTFID 218 Query: 291 QDIDSFIPLNIIEE------ALNREPCP-------------------------------- 312 SFI L++I A +R Sbjct: 219 -SALSFITLDLIYANTPGMRAEDRTEEIRGGNIEDADIEDEKDLEIKIFRTSDELCAGYS 277 Query: 313 -DPYAPLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVE 365 + + L +G D+A D V+ + ++ V E K + + ++I ++ Sbjct: 278 REEHGALYLGYDVARY-RDAAVIYVLGVVDGKKKCVAEIEMKNKKFEYQR--DEIRKIMR 334 Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTE-LHVKMADWL 424 + ID G T + L+ + ++ E L + + L Sbjct: 335 QLPVVRGCIDRTGQGLDTTETLQKE--------FGESKLEGIDFTTPAKEVLAMGVRTGL 386 Query: 425 EFAS--LINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSD---GLMYT 475 E L N + + S+K I G +S R K ++D Sbjct: 387 EKREFLLPNDQKFRKQIHSIKR-IPSAGGSFRYDSTRDKDG----HADSFWAFALA 437 >gi|327198304|ref|YP_004306879.1| gp35 [Burkholderia phage KS14] gi|310657267|gb|ADP02380.1| gp35 [Burkholderia phage KS14] Length = 604 Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 42/142 (29%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + QF S PL ++ + P P Sbjct: 366 IDELRLEYSAQE--YANLLMCQFIDDTA-SIFPLAELQRCMVDSWEEWADDFKPLAPRPF 422 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD--WSKTDLRTTNNKISGLVEKY 367 + P+ +G D A G +VV+ G L + D I + ++Y Sbjct: 423 GFRPVWVGYDPALSGDSAGLVVVAPPAVPGGKFRVLHKCQFRGMDFEGQAEAIRQITQQY 482 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 + + ID G ++ Sbjct: 483 NVEYMSIDTTGIGQGVYQLVKQ 504 >gi|21243371|ref|NP_642953.1| phage-related terminase [Xanthomonas axonopodis pv. citri str. 306] gi|21108918|gb|AAM37489.1| phage-related terminase [Xanthomonas axonopodis pv. citri str. 306] Length = 594 Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 26/142 (18%), Positives = 45/142 (31%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-------NREPCPDPYAP- 317 E + Y D + F S PL +++ + ++ P P Sbjct: 356 IEELREEY--SPDAFANLLMCDFVDDGA-SIFPLAMLQPCMVDSWVEWGQDYKPFAARPY 412 Query: 318 ----LIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367 + +G D AE G +VVL + G L + D +I + +Y Sbjct: 413 GDRAVWIGYDPAETGDTAGLVVLAPPQQPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 472 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G+ ++ Sbjct: 473 WVTYIGIDTTGMGSGVAQLVKQ 494 >gi|317120885|ref|YP_004100888.1| hypothetical protein Tmar_0036 [Thermaerobacter marianensis DSM 12885] gi|315590865|gb|ADU50161.1| hypothetical protein Tmar_0036 [Thermaerobacter marianensis DSM 12885] Length = 410 Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 80/408 (19%), Positives = 129/408 (31%), Gaps = 56/408 (13%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141 I AGRG GKT A V + + + + ++ + S LS+ P Sbjct: 36 ILAGRGFGKTRTGAEWVREQVERHGRRRIAIVGRTAADVRDVMVEGESGILSISP----- 90 Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE-- 199 PW+ V S + + YS + PD G + A DE Sbjct: 91 ----------PWFRPVYEPSKRRLTWPNGAIATLYSADEPDLLRGPQHD---AAWADELA 137 Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE 259 A P+ + + G + T P +L ++ N P R Sbjct: 138 AWRRPEAWDNLMFGLRLGPDPRVVVTTTPRPVKL---IRDLLNDPTCVVTRGS-TYENAA 193 Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI 319 + P+F E II+RY + + R E+ G+ + I+E RE ++ Sbjct: 194 NLAPAFLEQIISRY-EGTRLGRQELYGEVLDDVPGALWQRKRIDELRVREAP--ELVRVV 250 Query: 320 MGCDIA---EEGGDNT-VVVLRRG--PVIEHLFDWS-KTDLRTTNNKISGLVEKYRPDAI 372 + D A EEG D T +VV RG L D S + + + D I Sbjct: 251 VAIDPAVTSEEGSDETGIVVAGRGVDGDAYVLADRSCRMSPDGWARRAVKAYYDFDGDRI 310 Query: 373 I--IDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI 430 + ++ L +AV + R E + + + + Sbjct: 311 VGEVNNGG-------DLVETVIRTVDPKVPYKAVRASRGKAVRAEPVAALYEQGKVHHVG 363 Query: 431 NHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478 L L I +GA S D +D L++ E Sbjct: 364 TFDHLEDQLC-------------QITPDGYQGAGSPDRADALVWALTE 398 >gi|238920988|ref|YP_002934503.1| hypothetical protein NT01EI_3118 [Edwardsiella ictaluri 93-146] gi|238870557|gb|ACR70268.1| conserved hypothetical protein [Edwardsiella ictaluri 93-146] Length = 595 Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 49/162 (30%), Gaps = 21/162 (12%) Query: 246 DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 W++ I+ +G D + + Y + QF S PL++++ Sbjct: 334 GQWRQIVTIEDAIRQGYDLFDIDQLRLEY--SPEEFANLFMCQFIDDTE-SVFPLSLLQG 390 Query: 305 AL-NREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD-- 347 + + D Y P + +G D A G VV+ G + Sbjct: 391 CMVDSWAVWDDYKPFALRPLGERSVWVGYDPALTGDSAGCVVVAPPVVEGGKFRVIEKHQ 450 Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 W D I + +Y I ID G ++ Sbjct: 451 WHGMDFAAQAENIRKITGRYNVTYIGIDVTGIGHGVHQLVKQ 492 >gi|251783038|ref|YP_002997341.1| terminase large subunit [Streptococcus dysgalactiae subsp. equisimilis GGS_124] gi|242391668|dbj|BAH82127.1| terminase large subunit [Streptococcus dysgalactiae subsp. equisimilis GGS_124] Length = 424 Score = 49.3 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 51/343 (14%), Positives = 101/343 (29%), Gaps = 50/343 (14%) Query: 57 EFMEVVDAHCLNSVNNP-NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLAN 115 + +++ V NP++ A GRG GK++ A+++ L+ P ++ +C+ Sbjct: 4 DLADIIPIGFRPVVQATWNPKILNIACKGGRGSGKSSNIAFIISRLIIQYP-VNAVCIRK 62 Query: 116 SETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT 175 ++ L+ +++ ++ KW + + +P + I + Sbjct: 63 TDNTLEQSVYEQI-KW----AISEQGLERYFKFNKSPLRITYIPRGNYIVFRG------- 110 Query: 176 YSEERPDTFVGHHNTYGMAII-----------NDEASGTPDVINLGILGFLTERNANRFW 224 + P+ ++ I DE + + G LG + Sbjct: 111 --AQNPERIKSLKDSRFPFAIGWIEELAEFKTEDEVKTITNSLLRGELG----DGLFYKF 164 Query: 225 IMTSNPRRLSGKF----YEIFNKPLDDWKRFQIDTR-TVEGIDPSFHEGIIARYGLDSDV 279 T NP + + YE +P + + T I F A Sbjct: 165 FYTYNPPKRKQSWVNKKYESQFQPKNTF--VHASTYKDNPFIAKEFIAEAEATRERSERR 222 Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRR- 338 R E G+ +P + + + + + D D V Sbjct: 223 YRWEYLGEAIGS---GVVPFDNLRFETIPDELYRSFDNIRNAVDFG-YATDPLAFVRWHY 278 Query: 339 -----GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 G K R N + + Y D I DA Sbjct: 279 DKKHNGIYAIDELYGQKISNRQLANWLKD--KSYSNDEIFADA 319 >gi|262273310|ref|ZP_06051125.1| terminase [Grimontia hollisae CIP 101886] gi|262222683|gb|EEY73993.1| terminase [Grimontia hollisae CIP 101886] Length = 594 Score = 49.3 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/251 (13%), Positives = 65/251 (25%), Gaps = 52/251 (20%) Query: 180 RPDTFVGHHNTYGMAIINDEASGTP--DVINLGILGFLTERNANRFWIMTSN-------P 230 T G H + DE P D +N T + + + T + P Sbjct: 245 NSKTAQGFHGH----VYVDEYFWIPKFDELNKLASAMATHKTWRKTYFSTPSSKTHQAYP 300 Query: 231 RRLSGKF--------------YEIFNK-----PLDDWK-RFQIDTRTVEGIDPSFHEGII 270 + +E F P W+ ++ G D + + Sbjct: 301 FWTGDTWRGNANTREHVEFPTFEDFRNGGALCPDKHWRYVVTLEDAAAGGCDLFDIDELR 360 Query: 271 ARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIM 320 Y + F + S + +E+ + + P + + Sbjct: 361 DEYSKND--FANLFMCVFVDGNA-SVFTFSKLEKCMVDASKWKDFKPDAARPYANQEVWL 417 Query: 321 GCDIAEEGGDNT--VVVLRRG----PVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIII 374 G D + + V+ + + W + + ++I +Y I I Sbjct: 418 GYDPSRTRDNACLVVIAPPQTHAEVFRVLEKHYWKGLNFQYQASQIDEAFHRYHVTYIGI 477 Query: 375 DANNTGARTCD 385 D G D Sbjct: 478 DTTGVGYGVWD 488 >gi|282547289|gb|ADA82346.1| putative terminase [Escherichia phage K1ind1] Length = 416 Score = 49.3 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 65/413 (15%), Positives = 117/413 (28%), Gaps = 63/413 (15%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 G G GKT + +L M PG + + ++ + + + + Sbjct: 24 GGFGSGKTFVGCLDLLTFMLKHPGTRLGYFGPTYPAIRDIFYP------TFEEAANLLGL 77 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 L D + + +CR S + P + VG A + DE Sbjct: 78 DVLVKS-----GDKEVVVTCGKTVLGTVICR--SMDNPGSIVGF---KIAAAVVDELDVL 127 Query: 204 ----PDVINLGILG--FLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT-R 256 ++ I+ L +T+ P + + P + Q T Sbjct: 128 SREKAELAWNKIVARMRLVIPGVTNHISVTTTPEGFKFVYAKFKENPTPSYSMVQASTHE 187 Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-----EPC 311 + P + + Y + + + G+F S + A +R + Sbjct: 188 NARFLPPDYISSLTETY--PAQLINAYLNGEFVNLTSGS------VYYAYDRRKHRSKEV 239 Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371 P L +G D + V V R+ D T I+ K + Sbjct: 240 IQPGDTLYIGQDFNVTKNASAVYVQRKDGWHAVAELKGLFDTPDTVRVITEKW-KSQGHR 298 Query: 372 III--DANNTGART-------CDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422 I++ DA+ +T L+ G+ V D N Sbjct: 299 IVVYPDASGKNRKTNSASISDIALLQQAGFDVRAKSANPPVKDRVLAVN----------T 348 Query: 423 WLEFASLINHSGLIQNL-KSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMY 474 LE L + L + K+L+ + G E + +D L Y Sbjct: 349 ALEKGKLWVNDHLCPEIAKTLEQQAYDDNG----EPAKDGIID--HMADALGY 395 >gi|221316874|ref|YP_002527821.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 72a] gi|226246930|ref|YP_002776267.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 29805] gi|221237339|gb|ACM10180.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 72a] gi|226201508|gb|ACO38105.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 29805] Length = 450 Score = 49.3 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 54/163 (33%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + ++ +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNVETFKTYNFTTYDNVFLSKGFIETQEKLY-KDIPAYKARVLLGEWLASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI E+ + P I D A GGDNT + + Sbjct: 265 IFTQINITEDYMFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|169796738|ref|YP_001714531.1| putative phage terminase [Acinetobacter baumannii AYE] gi|169149665|emb|CAM87555.1| conserved hypothetical protein; putative phage terminase [Acinetobacter baumannii AYE] Length = 437 Score = 49.3 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 58/349 (16%), Positives = 99/349 (28%), Gaps = 55/349 (15%) Query: 81 AISAGRGIGKTTLNAWLV---LWLMSTRPGISVIC--LANSETQLKTTLWAEVSKWLSLL 135 A AG G GKT + + W V A + Q++ + + Sbjct: 31 AFVAGFGSGKTWVGCSSLCNKAW-----EFPKVPLGYFAPTYPQIRDIFFPTI-----EE 80 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 W + + + Y T S E+P T VG + + Sbjct: 81 VAFDWGLKTKVY--------ETNKEVDIYYGRQYRTTIICRSMEKPATIVGFKIGHAL-- 130 Query: 196 INDE----ASGTPDVINLGILGFLTERNANRF-WIMTSNPRRLSGKFYEIFNKP------ 244 DE A I+ + + A+ I + YE F K Sbjct: 131 -IDELDVMAKVKAQQAWRKIIARMRYKQASLLNGIDVATTPEGFKFTYEQFVKEANKSEA 189 Query: 245 -LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 + Q T E + + + Y + + GQF + P + Sbjct: 190 KRKLYGMIQASTYDNEANLPDDYISSLYESY--PPQLISAYLRGQFVNLTSGAVYP-DFD 246 Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362 + + PL++G D V V+R G L + +R T Sbjct: 247 RVLNHTDEEIKQGEPLLIGMDFNVLKMAAVVYVIREG-KPRALDELVG--VRDTPTMCYL 303 Query: 363 LVEKYRPDAIIIDANNTGARTCDY---------LEMLGYHVYRVLGQKR 402 + E++ I + + +G T L+ G+ V V G Sbjct: 304 IKERFPDHDITVIPDASGQATSSKGFSESDHAILKKNGFKV-EVNGVNP 351 >gi|323137496|ref|ZP_08072573.1| hypothetical protein Met49242DRAFT_1961 [Methylocystis sp. ATCC 49242] gi|322397122|gb|EFX99646.1| hypothetical protein Met49242DRAFT_1961 [Methylocystis sp. ATCC 49242] Length = 323 Score = 49.3 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 58/284 (20%), Positives = 96/284 (33%), Gaps = 41/284 (14%) Query: 59 MEVVDAHCLNSVNNPNPEVFKG----AISAGRGIGKTTLNAWLVLWLMSTRPG------- 107 M +A SV + +P + A+ GR GK ++ + +V W + G Sbjct: 38 MTEAEADFFRSVADRDPPSRRARELWAV-CGRRAGKDSIASAIVTWSAAMFDGADRLRPG 96 Query: 108 --ISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGID 165 +CLA + Q + L ++ E++ L D L S G+D Sbjct: 97 ERALCLCLACDKDQARIVL---------SYVRAYFAELEPLRAMVTRETKDGLELSNGVD 147 Query: 166 SKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD-VINLGILGFLTERNANRFW 224 R V +A DE S +PD + + + Sbjct: 148 IYVGVNDFRAVRGRTILCAV----LDEIAYWRDENSASPDLELYRALKPGMATL-PEAML 202 Query: 225 IMTSNPRRLSGKFYEIFN----KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVT 280 I S+P R +G + + D +D + + +A D Sbjct: 203 IGISSPYRRAGLLHAKHRQAYGRDGDTLVIRAPSAVMNPTLDQAEIDQAMAE---DPAAA 259 Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP----YAPLIM 320 R E +F + DI F+ L++IE A++ P YAP IM Sbjct: 260 RAEWLAEF-RDDISGFLGLDLIEAAVDPTIVTRPPRGCYAPWIM 302 >gi|145639505|ref|ZP_01795109.1| putative phage gene [Haemophilus influenzae PittII] gi|145271296|gb|EDK11209.1| putative phage gene [Haemophilus influenzae PittII] Length = 629 Score = 49.3 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 56/180 (31%), Gaps = 25/180 (13%) Query: 265 FHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN---------REPCPDPY 315 E + RY + F +++ ++ + P Sbjct: 389 NIEKLKQRYSKY--AFNQLYMCVWIDDADSIFTVHQLLKCGVDISKWKDFNPKADRPFGD 446 Query: 316 APLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369 + G D A G + V++ + + W+ N+I L EKY Sbjct: 447 REVWGGFDPAHSGDGASFVIIAPPALPSEKYRVLARYQWNGLSYVYQANQIRALYEKYNM 506 Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL 429 I IDA G + ++ ++ A + + +T + +K+ D +E + Sbjct: 507 TYIGIDATGVGYGVYELVKE--------FARRAATAIIYNPESKTGMVLKVHDLVEHGQI 558 >gi|209694587|ref|YP_002262515.1| terminase, ATPase subunit [Aliivibrio salmonicida LFI1238] gi|208008538|emb|CAQ78711.1| terminase, ATPase subunit [Aliivibrio salmonicida LFI1238] Length = 590 Score = 49.3 bits (116), Expect = 0.002, Method: Composition-based stats. Identities = 21/179 (11%), Positives = 54/179 (30%), Gaps = 19/179 (10%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF----IP 298 W+ I+ G D + + Y +D F + F + Sbjct: 330 DDKQWRYVVTIEDAANGGCDLFDIDELREEYSVDD--FNNLFMCMFVDGSLSVFKFSDLE 387 Query: 299 LNIIEEA-----LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLR------RGPVIEHLFD 347 +++ A + P + + +G D + + +VV+ + Sbjct: 388 KGMVDAAHWQDFKPKNKQPFEHREVWLGYDPSRTRDNACLVVVAPPAVVVEKFRVLEKHY 447 Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAVD 405 W + + ++I + ++Y+ I +D G D + + + + + Sbjct: 448 WKGLNFQYHVSEIDKVFKRYKVTYIGVDTTGIGGGVWDLISKKYPREAHAIHYSNENKN 506 >gi|300922774|ref|ZP_07138861.1| conserved domain protein [Escherichia coli MS 182-1] gi|300420878|gb|EFK04189.1| conserved domain protein [Escherichia coli MS 182-1] Length = 199 Score = 49.3 bits (116), Expect = 0.002, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 49/177 (27%), Gaps = 55/177 (31%) Query: 352 DLRTTNNKISGLVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLE 407 D+ + + + + D + D + GA T + G + D + Sbjct: 2 DINEGADWATSMAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDED 61 Query: 408 F-----------------------CRNRRTELHVKMADWLEFA---------SLINH--- 432 RN+R + + +AD L + + Sbjct: 62 APYQAGAWADEVVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVAHGEYADPDDMLS 121 Query: 433 -----------SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474 L L ++ N G+L +E K+ G S + +D LM Sbjct: 122 FDKEAIGEKMLEKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 177 >gi|284008602|emb|CBA75192.1| phage terminase protein [Arsenophonus nasoniae] Length = 598 Score = 49.3 bits (116), Expect = 0.002, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 55/162 (33%), Gaps = 18/162 (11%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF------ 296 P W+ ++ G + + + + RY + D ++ F + F Sbjct: 336 PDGQWRYVITLEDAIKGGFNLASIDKLRQRY--NPDTFKMLYMCIFIEHGASVFKYDTLQ 393 Query: 297 ---IPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLR------RGPVIEHLFD 347 + +N+ E+ + P P + G D A G +T V++ I F Sbjct: 394 KCGVDVNLWEDHNPKAPRPFGEREVWGGYDPARSGDTSTFVIVAPPMMAPEVFRILATFY 453 Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 W R I L +KYR I ID G + ++ Sbjct: 454 WQGFSWRHQAKLIEDLTKKYRFTHIGIDTTGIGQSVYEMVQD 495 >gi|326536310|ref|YP_004300751.1| gp17 terminase DNA packaging enzyme large subunit [Acinetobacter phage 133] gi|299483391|gb|ADJ19485.1| gp17 terminase DNA packaging enzyme large subunit [Acinetobacter phage 133] Length = 606 Score = 49.3 bits (116), Expect = 0.002, Method: Composition-based stats. Identities = 51/328 (15%), Positives = 94/328 (28%), Gaps = 51/328 (15%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GKT A + + SV LA+ + + L+ L Sbjct: 161 GKTAAVAIFLAHYVCFNESKSVGILAHKGSMSEEVLFR--------TKQAIELLPDFLQP 212 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP--DV 206 W + G ++ PD G I DE + D Sbjct: 213 GIVEWNKRSIELDNGSSIGAFA--------SSPDAVRG---NSFSLIYIDETAFVQNWDD 261 Query: 207 INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF------NKPLDDWKRFQIDTRT-VE 259 L I ++ + IMT+ P ++ FY+++ ++ + + + Sbjct: 262 CWLAIQPVISS-GRHSKIIMTTTPNGMN-HFYDLWQGAINGTSGFRPYEATWVSVKDRLY 319 Query: 260 GIDPSFHEGII---ARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315 +F +G G S E G F ++ I + +E D + Sbjct: 320 NEADTFDDGWEFSARAIGSSSIEQFLQEHLGNF-AGGSNTLIDGTKLAVLFGQERIADQH 378 Query: 316 APL-----------IMGCDIAEE-GGDNTVVVLRRGPVIE----HLFDWSKTDLRTTNNK 359 + I D AE G D + + + + +K + Sbjct: 379 EFIEFKPPVAGRKYIATLDSAEGRGQDYHALHIIDVTDEQWEQAGVLHSNKISHLILADI 438 Query: 360 ISGLVEKYRPDAIIIDANNTGARTCDYL 387 I + +Y + I+ N+TG L Sbjct: 439 IFLYLTRYNEAPVYIELNSTGVSIAKTL 466 >gi|289661923|ref|ZP_06483504.1| phage-related terminase [Xanthomonas campestris pv. vasculorum NCPPB702] Length = 267 Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 25/142 (17%), Positives = 46/142 (32%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-------NREPCPDPYAP- 317 + + Y D + +F S PL +++ + ++ P P Sbjct: 29 IDELREEY--SPDAFANLLMCEFVDDGA-SIFPLAMLQPCMVDSWVEWGQDYKPFAARPY 85 Query: 318 ----LIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367 + +G D AE G +VVL + G L + D +I + +Y Sbjct: 86 GDRAVWIGYDPAETGDTAGLVVLAPPQQPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 145 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G+ ++ Sbjct: 146 WVTYIGIDTTGMGSGVAQLVKQ 167 >gi|188577619|ref|YP_001914548.1| phage terminase, ATPase subunit [Xanthomonas oryzae pv. oryzae PXO99A] gi|188522071|gb|ACD60016.1| phage terminase, ATPase subunit [Xanthomonas oryzae pv. oryzae PXO99A] Length = 533 Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 42/142 (29%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR------------EPCPD 313 + + Y D + F S PL +++ + P Sbjct: 295 IDELREEY--SPDAFANLLMCDFVDDGA-SIFPLAMLQPCMVDSWVEWGQDYKPFAVRPY 351 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367 + +G D AE G +VVL + G L + D +I + +Y Sbjct: 352 GDRAVWIGYDPAETGDTAGLVVLAPPQQPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 411 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G+ ++ Sbjct: 412 WVTYIGIDTTGMGSGVAQLVKQ 433 >gi|325926090|ref|ZP_08187451.1| hypothetical protein XPE_1415 [Xanthomonas perforans 91-118] gi|325928218|ref|ZP_08189424.1| hypothetical protein XPE_3475 [Xanthomonas perforans 91-118] gi|325541407|gb|EGD12943.1| hypothetical protein XPE_3475 [Xanthomonas perforans 91-118] gi|325543435|gb|EGD14857.1| hypothetical protein XPE_1415 [Xanthomonas perforans 91-118] Length = 587 Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 26/142 (18%), Positives = 45/142 (31%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-------NREPCPDPYAP- 317 + + Y D + F S PL +++ + +E P P Sbjct: 349 IDELREEY--SPDAFANLLMCDFVDDGA-SIFPLAMLQPCMVDSWVEWGQEYKPFAARPY 405 Query: 318 ----LIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367 + +G D AE G +VVL + G L + D +I + +Y Sbjct: 406 GDRAVWIGYDPAETGDTAGLVVLAPPQQPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 465 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G+ ++ Sbjct: 466 WVTYIGIDTTGMGSGVAQLVKQ 487 >gi|21232401|ref|NP_638318.1| phage-related terminase [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|21114179|gb|AAM42242.1| phage-related terminase [Xanthomonas campestris pv. campestris str. ATCC 33913] Length = 594 Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 25/142 (17%), Positives = 46/142 (32%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-------NREPCPDPYAP- 317 + + Y D + +F S PL +++ + ++ P P Sbjct: 356 IDELREEY--SPDAFANLLMCEFVDDGA-SIFPLAMLQPCMVDSWVEWGQDYKPFAARPY 412 Query: 318 ----LIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367 + +G D AE G +VVL + G L + D +I + +Y Sbjct: 413 GDRAVWIGYDPAETGDTAGLVVLAPPQQPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 472 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G+ ++ Sbjct: 473 WVTYIGIDTTGMGSGVAQLVKQ 494 >gi|254183934|ref|ZP_04890525.1| putative terminase, ATPase subunit [Burkholderia pseudomallei 1655] gi|184214466|gb|EDU11509.1| putative terminase, ATPase subunit [Burkholderia pseudomallei 1655] Length = 589 Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + + QF + S L+ ++ + P Sbjct: 350 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406 Query: 314 PYAPLIMGCDIAEEGGDNTVVV-----LRRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367 Y + +G D A G +VV + G + + D I + ++Y Sbjct: 407 GYREVWVGYDPALTGDSAGLVVVAPPRIDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488 >gi|160876026|ref|YP_001555342.1| hypothetical protein Sbal195_2916 [Shewanella baltica OS195] gi|160861548|gb|ABX50082.1| protein of unknown function DUF264 [Shewanella baltica OS195] Length = 589 Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 52/343 (15%), Positives = 107/343 (31%), Gaps = 42/343 (12%) Query: 163 GIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNA 220 + Y + + + + + H + I+ +S T D G L A Sbjct: 249 NLYLDEYFWIHKFQEFRKVASGMAIHAKWRQTYISTPSSITHDAYPFWTGTLFNRGRPKA 308 Query: 221 NRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDV 279 +R I S+ +G+ W++ +D +G + + + Y D Sbjct: 309 DRIEIDVSHSALANGR-----RCEDGQWRQVVTVDDAIRKGCNLFDPDTLHLEY--SPDE 361 Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNR-----------EPCPDPYAPLIMGCDIAEEG 328 + +F D S P+ +++ + P P + + +G D + G Sbjct: 362 YSNLLMCEFID-DTMSVFPMVMMQRCMVDSWEVWTDYKPFAPRPLAHREVWIGYDPNKGG 420 Query: 329 GDNTVVVLRR------GPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380 ++ + G + W+ D I + KY I ID G Sbjct: 421 KGDSAGCIVICPPAVPGGKFRVIEKHRWNGMDFEAQAKAIQDICNKYNVTFIGIDTTGLG 480 Query: 381 ARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG---LIQ 437 ++ V L ++++ +K D + L +G L Q Sbjct: 481 EAVYQLVKKFFPQVTPFLYNPV---------LKSQMVIKAYDVISKGRLEYDAGWTDLAQ 531 Query: 438 NLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480 S++ + + ++ ES R + D + M+ P Sbjct: 532 AFMSIRKTLTASGKQVTYESARSEEISHADIAWAAMHALYNEP 574 >gi|332288320|ref|YP_004419172.1| terminase-like family protein [Gallibacterium anatis UMN179] gi|330431216|gb|AEC16275.1| terminase-like family protein [Gallibacterium anatis UMN179] Length = 590 Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 43/248 (17%), Positives = 72/248 (29%), Gaps = 27/248 (10%) Query: 244 PLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF--IPLN 300 P W++ I +G + + + Y ++ QF + F I L Sbjct: 330 PDGQWRQIVTIYDAMAQGCNLFDVDALKLEYSVEE--FEQLFLCQFIDDNSSVFKFIDLQ 387 Query: 301 ---------IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLR----RGPVIE--HL 345 + P P+ +G D A G + V+ G H Sbjct: 388 KCGVDSLEVWSDFN-PLAKRPFADNPVWIGYDPAHTGDRAALAVVAPPAVEGGKYRLLHY 446 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405 D I ++ Y I ID G + Y + R L + Sbjct: 447 KTVHGMDFEQQAGLIKDYLQIYNVQKITIDRTGLGEGVYQLVRKF-YPLTRGLTYNVDLK 505 Query: 406 LEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKS 465 E L++ LEF S +I + ++K ++ S R K A Sbjct: 506 NEMVL---KTLNIIGKRRLEFDS--GDKEVINSFMTIKKQTTRTGQKITYISDRSKEASH 560 Query: 466 TDYSDGLM 473 D + +M Sbjct: 561 GDIAWAIM 568 >gi|134288710|ref|YP_001111154.1| gp4, phage terminase, ATPase subunit [Burkholderia phage phiE12-2] gi|134132095|gb|ABO60770.1| gp4, phage terminase, ATPase subunit [Burkholderia phage phiE12-2] Length = 601 Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + + QF + S L+ ++ + P Sbjct: 362 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 418 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367 Y + +G D A G +VV+ G + + D I + ++Y Sbjct: 419 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 478 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 479 NVGYIAIDTTGMGQGVYQLVRK 500 >gi|53722089|ref|YP_111074.1| bacteriophage terminase, ATPase subunit [Burkholderia pseudomallei K96243] gi|52212503|emb|CAH38529.1| putative bacteriophage terminase, ATPase subunit [Burkholderia pseudomallei K96243] Length = 601 Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + + QF + S L+ ++ + P Sbjct: 362 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 418 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367 Y + +G D A G +VV+ G + + D I + ++Y Sbjct: 419 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 478 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 479 NVGYIAIDTTGMGQGVYQLVRK 500 >gi|72537721|ref|YP_293751.1| phage terminase ATPase subunit [Burkholderia phage phi52237] gi|72398411|gb|AAZ72646.1| phage terminase ATPase subunit [Burkholderia phage phi52237] Length = 601 Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + + QF + S L+ ++ + P Sbjct: 362 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 418 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367 Y + +G D A G +VV+ G + + D I + ++Y Sbjct: 419 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 478 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 479 NVGYIAIDTTGMGQGVYQLVRK 500 >gi|53717814|ref|YP_106800.1| phage terminase, ATPase subunit [Burkholderia pseudomallei K96243] gi|52208228|emb|CAH34159.1| phage terminase, ATPase subunit [Burkholderia pseudomallei K96243] Length = 589 Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + + QF + S L+ ++ + P Sbjct: 350 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367 Y + +G D A G +VV+ G + + D I + ++Y Sbjct: 407 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488 >gi|167916806|ref|ZP_02503897.1| bacteriophage terminase, ATPase subunit [Burkholderia pseudomallei 112] Length = 589 Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + + QF + S L+ ++ + P Sbjct: 350 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367 Y + +G D A G +VV+ G + + D I + ++Y Sbjct: 407 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488 >gi|167619947|ref|ZP_02388578.1| phage terminase, ATPase subunit [Burkholderia thailandensis Bt4] Length = 589 Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + + QF + S L+ ++ + P Sbjct: 350 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367 Y + +G D A G +VV+ G + + D I + ++Y Sbjct: 407 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488 >gi|167821684|ref|ZP_02453364.1| phage terminase, ATPase subunit [Burkholderia pseudomallei 91] gi|254188172|ref|ZP_04894684.1| Putative ATPase subunit of terminase (gpP-like) [Burkholderia pseudomallei Pasteur 52237] gi|157935852|gb|EDO91522.1| Putative ATPase subunit of terminase (gpP-like) [Burkholderia pseudomallei Pasteur 52237] Length = 589 Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + + QF + S L+ ++ + P Sbjct: 350 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367 Y + +G D A G +VV+ G + + D I + ++Y Sbjct: 407 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488 >gi|308389159|gb|ADO31479.1| Terminase, ATPase subunit [Neisseria meningitidis alpha710] Length = 610 Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats. Identities = 36/182 (19%), Positives = 59/182 (32%), Gaps = 24/182 (13%) Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD--WSKTDLRTTNNKISGLV 364 P P+ +G D + + +VV G L TD + I + Sbjct: 428 RPAGNLPVWVGYDPSYTADASGLVVAVPPQNNGEPFYILETALIPGTDFESQAANIRKIT 487 Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLGYHVYR------VLGQKRAVDLEFCRNRRTELHV 418 E+Y I+IDAN GA D + V + G +N+R Sbjct: 488 ERYNVSKIVIDANGIGAAVFDLVRKFYPPVIGMTYTPDIKGMMVLKTQNLLKNKR----- 542 Query: 419 KMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478 + ++ L S++ + + + ES R K A D + M F + Sbjct: 543 ---IKWDAGNI----DLQMAFLSVRRSVTASGRNITYESVRSKTASHGDLAWAAMMLFYQ 595 Query: 479 NP 480 P Sbjct: 596 EP 597 >gi|17981830|ref|NP_536821.1| terminase [Haemophilus phage HP2] gi|13752203|gb|AAK37798.1| orf16 [Haemophilus phage HP2] gi|309750513|gb|ADO80497.1| probable terminase, ATPase subunit [Haemophilus influenzae R2866] Length = 607 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 56/180 (31%), Gaps = 25/180 (13%) Query: 265 FHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN---------REPCPDPY 315 E + RY + F +++ ++ + P Sbjct: 367 NIEKLKQRYSKY--AFNQLYMCVWIDDADSIFTVHQLLKCGVDISKWKDFNPKADRPFGD 424 Query: 316 APLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369 + G D A G + V++ + + W+ N+I L EKY Sbjct: 425 REVWGGFDPAHSGDGASFVIIAPPALPSEKYRVLARYQWNGLSYVYQANQIRALYEKYNM 484 Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL 429 I IDA G + ++ ++ A + + +T + +K+ D +E + Sbjct: 485 TYIGIDATGVGYGVYELVKE--------FARRAATAIIYNPESKTGMVLKVHDLVEHGQI 536 >gi|84623266|ref|YP_450638.1| phage-related terminase [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|84367206|dbj|BAE68364.1| phage-related terminase [Xanthomonas oryzae pv. oryzae MAFF 311018] Length = 594 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 42/142 (29%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR------------EPCPD 313 + + Y D + F S PL +++ + P Sbjct: 356 IDELREEY--SPDAFANLLMCDFVDDGA-SIFPLAMLQPCMVDSWVEWGQDYKPFAVRPY 412 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367 + +G D AE G +VVL + G L + D +I + +Y Sbjct: 413 GDRAVWIGYDPAETGDTAGLVVLAPPQQPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 472 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G+ ++ Sbjct: 473 WVTYIGIDTTGMGSGVAQLVKQ 494 >gi|163736656|ref|ZP_02144075.1| hypothetical protein RGBS107_16031 [Phaeobacter gallaeciensis BS107] gi|161390526|gb|EDQ14876.1| hypothetical protein RGBS107_16031 [Phaeobacter gallaeciensis BS107] Length = 430 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 65/416 (15%), Positives = 112/416 (26%), Gaps = 65/416 (15%) Query: 82 ISAGRGIGKTTLNA-WLVLWLMSTRPGI-----SVICLANSETQLKTTLWAEVSKWLSLL 135 I GRG GKT A W+ P + L + Q++ + S L+ Sbjct: 38 ILGGRGAGKTRAGAEWVRTLAEGATPLSAGRARRIALLGETYDQVRDVMVQGDSGLLACT 97 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 P + + +S P+ G A Sbjct: 98 PRD---------------RRPTWKATERRLIWPNGATAQAFSAHDPEALRGPQFD---AA 139 Query: 196 INDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 DE + + L + R + + R G ++ P + Sbjct: 140 WADELAKWKRGQDSWDMLQFALRLGDDPR--VCVTTTPRNVGVLRDLLASPSTV-QTHAA 196 Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA-LNREPCP 312 + SF + RY S + R E+ G Q + + A + + P Sbjct: 197 TEANRANLAASFIAEVRNRY-AGSRLGRQELDGILLQDVEGALWTNAGLVAAQIAKAPTL 255 Query: 313 DPYAPLIMGCDIAEEGG---DNTVVVLRRGPVIEHLFDW----------SKTDLRTTNNK 359 D +++ D A G D +V+ + DW T Sbjct: 256 DR---VVVAVDPAVSAGKRSDACGIVVVGATLQGPPQDWCAYVLADCTVQGVGPLTWAQA 312 Query: 360 ISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVK 419 ++Y D ++ + N GA L + V A+ + R E Sbjct: 313 AIDARDRYGADRVVAEVNQGGALVESLLRQIDPLV-----PFTALHASRGKGARAEPVAA 367 Query: 420 MADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 + + + L L + G L G S D D L++ Sbjct: 368 LYEQGRVRHVPGLGALEDQLC-----QMTPRGYL--------GQGSPDRLDALVWA 410 >gi|224542959|ref|ZP_03683498.1| hypothetical protein CATMIT_02153 [Catenibacterium mitsuokai DSM 15897] gi|224524097|gb|EEF93202.1| hypothetical protein CATMIT_02153 [Catenibacterium mitsuokai DSM 15897] Length = 479 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 56/385 (14%), Positives = 114/385 (29%), Gaps = 32/385 (8%) Query: 34 HFFPWGEK--GTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91 + P+ +E ++ +E+ E+ + ++ K S R GK+ Sbjct: 14 YIIPYKSTLGNEAIELYNNTTRNAMEWQEIQMMDIMAVDDDGQWVHIKYGYSIPRRNGKS 73 Query: 92 TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPA 151 + LW + G ++ A+ T W ++ + L + Sbjct: 74 EILVMRELWGLL--HGEKILHTAH-RTTTSHASWEKLKQMLDENDYTEVKRADKEKTYEK 130 Query: 152 PWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGI 211 + + I ++ +G +I DEA + + Sbjct: 131 SYTATAQFGLETIKILDDGGGSASFRTRSSKGGLG---EGFDLLIVDEAQEYTEDQQSAL 187 Query: 212 LGFLTERNANRFWIMTSNP--------------------RRLSGKFYEIFNKPLDDWKRF 251 +T N +M P + + E + + D K Sbjct: 188 QYVVTSSE-NPQTLMCGTPPTAVSSGTVFVNLRKECLSGGSDTSGWAEWSVEHMSDVKDR 246 Query: 252 QIDTRTVEGIDPSFHEGII-ARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 I T + + E + A D ++ G + Q + S I N AL E Sbjct: 247 DIWYETNPSLGQTLKERSVAAEDSSDEIDFNIQRFGLWLQYNQKSAISENE-WNALKVET 305 Query: 311 CPDPYAPLIMGCDIAEEGGDNT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369 P+ PL +G +G + + + ++ + +R N I + K Sbjct: 306 IPEFKGPLFVGIKYGHDGSNVSMSIAVKTKNDNILVDVIGCRPIRKGNGWIVDFLRKADI 365 Query: 370 DAIIIDANNTGARTCDYLEMLGYHV 394 A+ +D N + L+ G + Sbjct: 366 AAVTVDGANGQQMLINELKEAGIKL 390 >gi|293608730|ref|ZP_06691033.1| conserved hypothetical protein [Acinetobacter sp. SH024] gi|292829303|gb|EFF87665.1| conserved hypothetical protein [Acinetobacter sp. SH024] Length = 430 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 53/317 (16%), Positives = 88/317 (27%), Gaps = 45/317 (14%) Query: 81 AISAGRGIGKTTLNAWLV---LWLMSTRPGISVIC--LANSETQLKTTLWAEVSKWLSLL 135 A AG G GKT + + W V A + Q++ + + Sbjct: 24 AFVAGFGSGKTWVGCSSLCNKAW-----EFPKVPLGYFAPTYPQIRDIFFPTI-----EE 73 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 W + + + Y T S E+P T VG + + Sbjct: 74 VAFDWGLKTKVY--------ETNKEVDIYYGRQYRTTIICRSMEKPATIVGFKIGHAL-- 123 Query: 196 INDE----ASGTPDVINLGILGFLTERNANRF-WIMTSNPRRLSGKFYEIFNKP------ 244 DE A I+ + + A I + YE F K Sbjct: 124 -IDELDVMAKVKAQQAWRKIIARMRYKQAGLLNGIDVATTPEGFKFTYEQFVKEANKSEA 182 Query: 245 -LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 + Q T E + + + Y + + GQF + P + Sbjct: 183 KRKLYGMIQASTYDNEANLPDDYISSLYESY--PPQLISAYLRGQFVNLTSGAVYP-DFD 239 Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKIS 361 + + PL++G D V V+R G L + D T I+ Sbjct: 240 RVLNHTDEEIKKGEPLLIGMDFNVLKMAAVVYVIREG-KPRALDELVGVRDTPTMCQLIN 298 Query: 362 GLVEKYRPDAIIIDANN 378 + +I DA+ Sbjct: 299 ERFPDH-DITVIPDASG 314 >gi|260556008|ref|ZP_05828228.1| PBSX family phage terminase, large subunit [Acinetobacter baumannii ATCC 19606] gi|260410919|gb|EEX04217.1| PBSX family phage terminase, large subunit [Acinetobacter baumannii ATCC 19606] Length = 435 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 53/317 (16%), Positives = 88/317 (27%), Gaps = 45/317 (14%) Query: 81 AISAGRGIGKTTLNAWLV---LWLMSTRPGISVIC--LANSETQLKTTLWAEVSKWLSLL 135 A AG G GKT + + W V A + Q++ + + Sbjct: 29 AFVAGFGSGKTWVGCSSLCNKAW-----EFPKVPLGYFAPTYPQIRDIFFPTI-----EE 78 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 W + + + Y T S E+P T VG + + Sbjct: 79 VAFDWGLKTKVY--------ETNKEVDIYYGRQYRTTIICRSMEKPATIVGFKIGHAL-- 128 Query: 196 INDE----ASGTPDVINLGILGFLTERNANRF-WIMTSNPRRLSGKFYEIFNKP------ 244 DE A I+ + + A I + YE F K Sbjct: 129 -IDELDVMAKVKAQQAWRKIIARMRYKQAGLLNGIDVATTPEGFKFTYEQFVKEANKSEA 187 Query: 245 -LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 + Q T E + + + Y + + GQF + P + Sbjct: 188 KRKLYGMIQASTYDNEANLPDDYISSLYESY--PPQLISAYLRGQFVNLTSGAVYP-DFD 244 Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKIS 361 + + PL++G D V V+R G L + D T I+ Sbjct: 245 RVLNHTDEEIKKGEPLLIGMDFNVLKMAAVVYVIREG-KPRALDELVGVRDTPTMCQLIN 303 Query: 362 GLVEKYRPDAIIIDANN 378 + +I DA+ Sbjct: 304 ERFPDH-DITVIPDASG 319 >gi|94990333|ref|YP_598433.1| terminase large subunit [Streptococcus phage 10270.2] gi|94994256|ref|YP_602354.1| Terminase large subunit [Streptococcus phage 10750.2] gi|94543841|gb|ABF33889.1| Terminase large subunit [Streptococcus phage 10270.2] gi|94547764|gb|ABF37810.1| Terminase large subunit [Streptococcus phage 10750.2] Length = 432 Score = 48.6 bits (114), Expect = 0.003, Method: Composition-based stats. Identities = 41/290 (14%), Positives = 89/290 (30%), Gaps = 41/290 (14%) Query: 57 EFMEVVDAHCLNSVNNP-NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLAN 115 + +++ V NP++ A GRG GK++ A+++ L+ P ++ +C+ Sbjct: 12 DLADIIPIGFKPVVQATWNPQILNIACKGGRGSGKSSNIAFIISRLIIQYP-VNAVCIRK 70 Query: 116 SETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT 175 ++ L+ +++ ++ KW + + +P + I + Sbjct: 71 TDNTLEQSVYEQI-KW----AISEQGLERYFKFNKSPLRITYIPRGNYIVFRG------- 118 Query: 176 YSEERPDTFVGHHNTYGMAII-----------NDEASGTPDVINLGILGFLTERNANRFW 224 + P+ ++ I DE + + G LG + Sbjct: 119 --AQNPERIKSLKDSRFPFAIGWIEELAEFKTEDEVKTITNSLLRGELG----DGLFYKF 172 Query: 225 IMTSNPRRLSGKF----YEIFNKPLDDWKRFQIDTR-TVEGIDPSFHEGIIARYGLDSDV 279 T NP + + YE +P + + T I F A Sbjct: 173 FYTYNPPKRKQSWVNKKYESQFQPSNTF--VHASTYKDNPFIAKEFIAEAEATRERSERR 230 Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGG 329 R E G+ +P + + + + + G D Sbjct: 231 YRWEYLGEAIGS---GVVPFDNLRFERITDEQVADFDNIRNGIDYGYATD 277 >gi|190572396|ref|YP_001970241.1| putative phage terminase, ATPase subunit (gpp) [Stenotrophomonas maltophilia K279a] gi|190010318|emb|CAQ43926.1| putative phage terminase, ATPase subunit (gpp) [Stenotrophomonas maltophilia K279a] Length = 597 Score = 48.6 bits (114), Expect = 0.003, Method: Composition-based stats. Identities = 25/142 (17%), Positives = 39/142 (27%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPL------------NIIEEALNREPCPD 313 E + Y + + +F S PL +++ P Sbjct: 360 IEELRRDYSAEE--FANLLMCEFVDDSA-SIFPLTMLQPCQVDSWVEWVDDFKPLAIRPY 416 Query: 314 PYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367 + +G D AE G +VV L G L + D I + +Y Sbjct: 417 GDRAVWIGYDPAETGDSAGIVVVAPPLVPGGKFRVLERHQFKGMDFAAQAAFIQQVTLRY 476 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I IDA G + Sbjct: 477 WVTYIGIDATGMGTGVAQLVRQ 498 >gi|328552921|gb|AEB23413.1| putative helicase, ATP-dependent, intein-containing [Bacillus amyloliquefaciens TA208] Length = 1021 Score = 48.6 bits (114), Expect = 0.003, Method: Composition-based stats. Identities = 20/144 (13%), Positives = 43/144 (29%), Gaps = 33/144 (22%) Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP----------YAPLIMGCDIAE--- 326 R + + I ++ + +A ++G D+A Sbjct: 731 FRQNYLCDWIGASDGALINISKLIKARTITHPELSCPRDKNKNFLLHEYVIGVDVARSAA 790 Query: 327 EGGDNTVVV-----------LRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY-------- 367 E + T +V +R+ V+ + + + + + + + Y Sbjct: 791 ESNNKTAIVVLKIIRNSNNLIRQVQVVNIIEPPNGLSFKEQSIMVKRVFKNYGGNQDTSL 850 Query: 368 -RPDAIIIDANNTGARTCDYLEML 390 R A+I+D N G D L Sbjct: 851 SRVKAVIVDGNGVGGGLIDRLLED 874 >gi|167900122|ref|ZP_02487523.1| Putative ATPase subunit of terminase (gpP-like) protein [Burkholderia pseudomallei 7894] Length = 589 Score = 48.6 bits (114), Expect = 0.003, Method: Composition-based stats. Identities = 20/142 (14%), Positives = 38/142 (26%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + + F + S L ++ + P Sbjct: 350 IDELRREYSAEE--FANLLMCHFIDDSL-SVFKLAELQRCMVDSWEEWADDFSPLLLRPF 406 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367 + + +G D A G +VV+ G L + D I + ++Y Sbjct: 407 GHREVWVGYDPALTGDSAGLVVVAPPRVDGGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488 >gi|325921366|ref|ZP_08183223.1| hypothetical protein XGA_2215 [Xanthomonas gardneri ATCC 19865] gi|325548124|gb|EGD19121.1| hypothetical protein XGA_2215 [Xanthomonas gardneri ATCC 19865] Length = 594 Score = 48.6 bits (114), Expect = 0.003, Method: Composition-based stats. Identities = 25/142 (17%), Positives = 45/142 (31%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-------NREPCPDPYAP- 317 + + Y D + +F S PL +++ + ++ P P Sbjct: 356 IDELREEY--SPDAFANLLMCEFVDDGA-SIFPLAMLQPCMVDSWVEWGQDYKPFAARPY 412 Query: 318 ----LIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367 + +G D AE G +VVL G L + D +I + +Y Sbjct: 413 GDRAVWIGYDPAETGDTAGLVVLAPPQLPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 472 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G+ ++ Sbjct: 473 WVTYIGIDTTGMGSGVAQLVKQ 494 >gi|239590013|ref|YP_002941860.1| gp2 [Mycobacterium phage Angel] gi|238890545|gb|ACR77534.1| gp2 [Mycobacterium phage Angel] Length = 478 Score = 48.6 bits (114), Expect = 0.003, Method: Composition-based stats. Identities = 56/372 (15%), Positives = 107/372 (28%), Gaps = 54/372 (14%) Query: 54 WQLEFMEVVDAHCLNSVNNPNPEVFKGAISA-GRGIGKTTLNAWLVLWLMSTRPGISVIC 112 W + + + +++ ++ + R +GKT L +V L PG++VI Sbjct: 36 WTFDRWQDGLGRLILALDGTGLYAADTSVISIPRQVGKTYLIGCIVFALALLTPGLTVIW 95 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 A+ +T E + ++ GI + S + Sbjct: 96 TAH-----RTKTAKE-------TFGSMKAMCATPLVNAHVRNVSDARGDEGIYLHNGSRI 143 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232 + G ++ DEA D ++ + N ++T P R Sbjct: 144 LFGAR----ENGFGLGFAGVGILVLDEAQRLTDKAMDDLIPTMNTVE-NPLILLTGTPPR 198 Query: 233 LS-------------------GKFYEIFN-----KPLDDWKRFQIDTRTVEGIDPSFHEG 268 + G Y F+ P D + + + Sbjct: 199 PTDSGEVFTMLRQDALDGESEGTLYVEFSADEGAHPDDRAQLRKANPSYPHRTSERAIRR 258 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA----PLIMGCDI 324 + +S E G + D + ++ A R A P G D+ Sbjct: 259 MRKNLTEES--FLREAFGIW-----DKVVHRPVVTAARWRRLESTGPAAGVKPNGFGVDM 311 Query: 325 AEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY-RPDAIIIDANNTGART 383 + + V G W+ D I+ ++ R ++ID+ + A Sbjct: 312 SHSRMVSVNAVWLDGDQAHTEEVWAGDDTDAAVAWIADAWKRAGRRTVVVIDSESPAASL 371 Query: 384 CDYLEMLGYHVY 395 LE G +VY Sbjct: 372 VVDLENAGVNVY 383 >gi|9628620|ref|NP_043485.1| hypothetical protein HP1p21 [Haemophilus phage HP1] gi|1722793|sp|P51718|VPP_BPHP1 RecName: Full=Probable terminase, ATPase subunit; AltName: Full=ORF16 gi|1046243|gb|AAB09201.1| orf16 [Haemophilus phage HP1] Length = 607 Score = 48.6 bits (114), Expect = 0.003, Method: Composition-based stats. Identities = 29/180 (16%), Positives = 56/180 (31%), Gaps = 25/180 (13%) Query: 265 FHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNI----IEEALNREPCPDPYAP--- 317 E + RY + F + ++ A ++ P P Sbjct: 367 NIEKLKQRYSKY--AFNQLYMCIWIDDADSIFNVKQLLKCGVDIAKWKDFNPKADRPFGD 424 Query: 318 --LIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369 + G D A G + V++ + + W N+I L EKY Sbjct: 425 REVWGGFDPAHSGDGASFVIIAPPALPGEKYRMLARYQWHGLSYVYQANQIRALYEKYNM 484 Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL 429 I IDA G + ++ ++ A + + +T + +K+ D +E + Sbjct: 485 TYIGIDATGVGYGVYELVKE--------FARRAATAIIYNPESKTGMVLKVHDLVEHGQI 536 >gi|215484220|ref|YP_002326447.1| Terminase-like family protein [Acinetobacter baumannii AB307-0294] gi|213985731|gb|ACJ56030.1| Terminase-like family protein [Acinetobacter baumannii AB307-0294] Length = 413 Score = 48.6 bits (114), Expect = 0.003, Method: Composition-based stats. Identities = 53/317 (16%), Positives = 88/317 (27%), Gaps = 45/317 (14%) Query: 81 AISAGRGIGKTTLNAWLV---LWLMSTRPGISVIC--LANSETQLKTTLWAEVSKWLSLL 135 A AG G GKT + + W V A + Q++ + + Sbjct: 7 AFVAGFGSGKTWVGCSSLCNKAW-----EFPKVPLGYFAPTYPQIRDIFFPTI-----EE 56 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 W + + + Y T S E+P T VG + + Sbjct: 57 VAFDWGLKTKVY--------ETNKEVDIYYGRQYRTTIICRSMEKPATIVGFKIGHAL-- 106 Query: 196 INDE----ASGTPDVINLGILGFLTERNANRF-WIMTSNPRRLSGKFYEIFNKP------ 244 DE A I+ + + A I + YE F K Sbjct: 107 -IDELDVMAKVKAQQAWRKIIARMRYKQAGLLNGIDVATTPEGFKFTYEQFVKEANKSEA 165 Query: 245 -LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 + Q T E + + + Y + + GQF + P + Sbjct: 166 KRKLYGMIQASTYDNEANLPDDYISSLYESY--PPQLISAYLRGQFVNLTSGAVYP-DFD 222 Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKIS 361 + + PL++G D V V+R G L + D T I+ Sbjct: 223 RVLNHTDEEIKKGEPLLIGMDFNVLKMAAVVYVIREG-KPRALDELVGVRDTPTMCQLIN 281 Query: 362 GLVEKYRPDAIIIDANN 378 + +I DA+ Sbjct: 282 ERFPDH-DITVIPDASG 297 >gi|184157353|ref|YP_001845692.1| putative phage terminase [Acinetobacter baumannii ACICU] gi|301345227|ref|ZP_07225968.1| putative phage terminase [Acinetobacter baumannii AB056] gi|301595737|ref|ZP_07240745.1| putative phage terminase [Acinetobacter baumannii AB059] gi|332851175|ref|ZP_08433263.1| phage terminase, large subunit, PBSX family [Acinetobacter baumannii 6013150] gi|332869110|ref|ZP_08438600.1| phage terminase, large subunit, PBSX family [Acinetobacter baumannii 6013113] gi|332875310|ref|ZP_08443140.1| phage terminase, large subunit, PBSX family [Acinetobacter baumannii 6014059] gi|183208947|gb|ACC56345.1| putative phage terminase [Acinetobacter baumannii ACICU] gi|332730195|gb|EGJ61521.1| phage terminase, large subunit, PBSX family [Acinetobacter baumannii 6013150] gi|332732895|gb|EGJ64103.1| phage terminase, large subunit, PBSX family [Acinetobacter baumannii 6013113] gi|332736478|gb|EGJ67475.1| phage terminase, large subunit, PBSX family [Acinetobacter baumannii 6014059] Length = 413 Score = 48.6 bits (114), Expect = 0.003, Method: Composition-based stats. Identities = 53/317 (16%), Positives = 88/317 (27%), Gaps = 45/317 (14%) Query: 81 AISAGRGIGKTTLNAWLV---LWLMSTRPGISVIC--LANSETQLKTTLWAEVSKWLSLL 135 A AG G GKT + + W V A + Q++ + + Sbjct: 7 AFVAGFGSGKTWVGCSSLCNKAW-----EFPKVPLGYFAPTYPQIRDIFFPTI-----EE 56 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 W + + + Y T S E+P T VG + + Sbjct: 57 VAFDWGLKTKVY--------ETNKEVDIYYGRQYRTTIICRSMEKPATIVGFKIGHAL-- 106 Query: 196 INDE----ASGTPDVINLGILGFLTERNANRF-WIMTSNPRRLSGKFYEIFNKP------ 244 DE A I+ + + A I + YE F K Sbjct: 107 -IDELDVMAKVKAQQAWRKIIARMRYKQAGLLNGIDVATTPEGFKFTYEQFVKEANKSEA 165 Query: 245 -LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 + Q T E + + + Y + + GQF + P + Sbjct: 166 KRKLYGMIQASTYDNEANLPDDYISSLYESY--PPQLISAYLRGQFVNLTSGAVYP-DFD 222 Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKIS 361 + + PL++G D V V+R G L + D T I+ Sbjct: 223 RVLNHTDEEIKKGEPLLIGMDFNVLKMAAVVYVIREG-KPRALDELVGVRDTPTMCQLIN 281 Query: 362 GLVEKYRPDAIIIDANN 378 + +I DA+ Sbjct: 282 ERFPDH-DITVIPDASG 297 >gi|261494619|ref|ZP_05991100.1| terminase ATPase subunit [Mannheimia haemolytica serotype A2 str. OVINE] gi|261309731|gb|EEY10953.1| terminase ATPase subunit [Mannheimia haemolytica serotype A2 str. OVINE] Length = 612 Score = 48.6 bits (114), Expect = 0.003, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 38/132 (28%), Gaps = 18/132 (13%) Query: 276 DSDVTRVEVCGQFPQQDIDSF----IPLNIIEEA--------LNREPCPDPYAPLIMGCD 323 D +F + F + +++ + P + +G D Sbjct: 378 SPDEFEQLFMCEFIDDNQSVFKFTMMQRCLVDSMEVWRDYVFTDGYQRPFGNKEVWVGYD 437 Query: 324 IAEEGGDNTVVVL----RRGPVIEHLF--DWSKTDLRTTNNKISGLVEKYRPDAIIIDAN 377 + G + +VV+ G L + D +I + KY + ID Sbjct: 438 PSYTGDRSALVVIAPPKVDGGKFRLLEYRTFKGADFAEQAAEIVAICAKYNVTRLAIDTT 497 Query: 378 NTGARTCDYLEM 389 G + ++ Sbjct: 498 GLGVGVYEIVKK 509 >gi|213425656|ref|ZP_03358406.1| hypothetical protein SentesTyphi_08397 [Salmonella enterica subsp. enterica serovar Typhi str. E02-1180] Length = 195 Score = 48.6 bits (114), Expect = 0.003, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 34/87 (39%), Gaps = 9/87 (10%) Query: 311 CPDPYAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKIS 361 P + + +G D A+ + GD+ VVV G L W D R + I Sbjct: 8 RPFGWREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIK 67 Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLE 388 L E+Y I ID+ G + ++ Sbjct: 68 KLTEQYNVTYIGIDSTGVGHGVYENVK 94 >gi|109392289|ref|YP_655519.1| gp2 [Mycobacterium phage Halo] gi|189043089|ref|YP_001936030.1| hypothetical protein BPs1_2 [Mycobacterium phage BPs] gi|91980539|gb|ABE67259.1| terminase [Mycobacterium phage Halo] gi|171909204|gb|ACB58161.1| hypothetical protein BPs1_2 [Mycobacterium phage BPs] gi|255927846|gb|ACU41466.1| gp2 [Mycobacterium phage Hope] Length = 478 Score = 48.6 bits (114), Expect = 0.003, Method: Composition-based stats. Identities = 56/372 (15%), Positives = 107/372 (28%), Gaps = 54/372 (14%) Query: 54 WQLEFMEVVDAHCLNSVNNPNPEVFKGAISA-GRGIGKTTLNAWLVLWLMSTRPGISVIC 112 W + + + +++ ++ + R +GKT L +V L PG++VI Sbjct: 36 WTFDRWQDGLGRLILALDGTGLYAADTSVISIPRQVGKTYLIGCIVFALALLTPGLTVIW 95 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 A+ +T E + ++ GI + S + Sbjct: 96 TAH-----RTKTAKE-------TFGSMKAMCATPLVNAHVRNVSDARGDEGIYLHNGSRI 143 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232 + G ++ DEA D ++ + N ++T P R Sbjct: 144 LFGAR----ENGFGLGFAGVGILVLDEAQRLTDKAMDDLIPTMNTVE-NPLILLTGTPPR 198 Query: 233 LS-------------------GKFYEIFN-----KPLDDWKRFQIDTRTVEGIDPSFHEG 268 + G Y F+ P D + + + Sbjct: 199 PTDSGEVFTMLRQDALDGESEGTLYVEFSADEGAHPDDRAQLRKANPSYPHRTSERAIRR 258 Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA----PLIMGCDI 324 + +S E G + D + ++ A R A P G D+ Sbjct: 259 MRKNLTEES--FLREAFGIW-----DKVVHRPVVTAARWRRLESTGPAAGVKPNGFGVDM 311 Query: 325 AEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY-RPDAIIIDANNTGART 383 + + V G W+ D I+ ++ R ++ID+ + A Sbjct: 312 SHSRMVSVNAVWLDGDQAHTEEVWAGDDTDAAVAWIADAWKRAGRRTVVVIDSESPAASL 371 Query: 384 CDYLEMLGYHVY 395 LE G +VY Sbjct: 372 VVDLENAGVNVY 383 >gi|261492632|ref|ZP_05989185.1| terminase ATPase subunit [Mannheimia haemolytica serotype A2 str. BOVINE] gi|261311791|gb|EEY12941.1| terminase ATPase subunit [Mannheimia haemolytica serotype A2 str. BOVINE] Length = 612 Score = 48.6 bits (114), Expect = 0.003, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 38/132 (28%), Gaps = 18/132 (13%) Query: 276 DSDVTRVEVCGQFPQQDIDSF----IPLNIIEEA--------LNREPCPDPYAPLIMGCD 323 D +F + F + +++ + P + +G D Sbjct: 378 SPDEFEQLFMCEFIDDNQSVFKFTMMQRCLVDSMEVWRDYVFTDGYQRPFGNKEVWVGYD 437 Query: 324 IAEEGGDNTVVVL----RRGPVIEHLF--DWSKTDLRTTNNKISGLVEKYRPDAIIIDAN 377 + G + +VV+ G L + D +I + KY + ID Sbjct: 438 PSYTGDRSALVVIAPPKVDGGKFRLLEYRTFKGADFAEQAAEIVAICAKYNVTRLAIDTT 497 Query: 378 NTGARTCDYLEM 389 G + ++ Sbjct: 498 GLGVGVYEIVKK 509 >gi|33601198|ref|NP_888758.1| putative phage terminase [Bordetella bronchiseptica RB50] gi|33602480|ref|NP_890040.1| putative phage terminase [Bordetella bronchiseptica RB50] gi|33575633|emb|CAE32711.1| putative phage terminase [Bordetella bronchiseptica RB50] gi|33576919|emb|CAE33999.1| putative phage terminase [Bordetella bronchiseptica RB50] Length = 425 Score = 48.6 bits (114), Expect = 0.003, Method: Composition-based stats. Identities = 72/439 (16%), Positives = 127/439 (28%), Gaps = 62/439 (14%) Query: 75 PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKW 131 P F+ + AG G GKT + + P ++ A + Q++ + EV+ Sbjct: 15 PHKFRAFV-AGFGSGKTWVGGAGLCRHAWEFPRVNSGYFAPTYGQIRDIFYPTIEEVAHD 73 Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191 L + + + +CR S E+P VG Sbjct: 74 WGLAAKINESNKEVHLFAGRKYRGT--------------VICR--SMEKPGDIVGFKIGK 117 Query: 192 GMAIINDEASGTPDV----INLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIFNKPL 245 G+ DE I+ L T +T+ P Y+ F K + Sbjct: 118 GL---IDELDVMKTDKAALAWRKIIARLRHTAPGLINGVDVTTTPEG-FKFVYQQFVKQV 173 Query: 246 DD-------WKRFQIDTRTV-EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFI 297 + + Q T + + + + A Y + + GQF S Sbjct: 174 RERPDLVALYGLVQASTYENGKNLPEDYIPSLRASY--PPQLIAAYLRGQFTNLTSGSVY 231 Query: 298 PLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTN 357 P N + + +P+ L +G D TV V+R G + D Sbjct: 232 P-NFDRRLHHTDAAEEPHEELHIGMDFNVLNMTATVNVIRAGLPLTVGELTKVRDTPEMA 290 Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDY---------LEMLGYHVYRVLGQKRAVDLEF 408 + + + + I + +G T L G+ V RV + AV Sbjct: 291 RMLKERFKD-KGHGVTIYPDASGGNTSSKNASESDLSILRKAGFTV-RVNSRNPAVKDRI 348 Query: 409 CRNRRTELHVK-MADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTD 467 L+ + WL +N ++L+ G E + G + Sbjct: 349 NAVNGMLLNDEGARRWL-----VNTDRCPTLTEALEQQAYDKNG----EPDKSTGHDHPN 399 Query: 468 YSDGLMYTFAENPPRSDMD 486 + G + M Sbjct: 400 DAQGYFLVHRYPITPTGMS 418 >gi|289807324|ref|ZP_06537953.1| hypothetical protein Salmonellaentericaenterica_24067 [Salmonella enterica subsp. enterica serovar Typhi str. AG3] Length = 96 Score = 48.2 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 9/45 (20%), Positives = 16/45 (35%), Gaps = 3/45 (6%) Query: 443 KSFIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 43 PHRDFDRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 87 >gi|109289938|ref|YP_655470.1| terminase ATPase subunit [Mannheimia phage phiMHaA1] gi|90110544|gb|ABD90554.1| terminase ATPase subunit [Mannheimia phage phiMhaA1-PHL101] gi|90110594|gb|ABD90603.1| terminase ATPase subunit [Mannheimia phage phiMhaA1-BAA410] Length = 605 Score = 48.2 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 38/132 (28%), Gaps = 18/132 (13%) Query: 276 DSDVTRVEVCGQFPQQDIDSF----IPLNIIEEA--------LNREPCPDPYAPLIMGCD 323 D +F + F + +++ + P + +G D Sbjct: 371 SPDEFEQLFMCEFIDDNQSVFKFTMMQRCLVDSMEVWRDYVFTDGYQRPFGNKEVWVGYD 430 Query: 324 IAEEGGDNTVVVL----RRGPVIEHLF--DWSKTDLRTTNNKISGLVEKYRPDAIIIDAN 377 + G + +VV+ G L + D +I + KY + ID Sbjct: 431 PSYTGDRSALVVIAPPKVDGGKFRLLEYRTFKGADFAEQAAEIVAICAKYNVTRLAIDTT 490 Query: 378 NTGARTCDYLEM 389 G + ++ Sbjct: 491 GLGVGVYEIVKK 502 >gi|315268220|gb|ADT95073.1| terminase, ATPase subunit [Shewanella baltica OS678] Length = 589 Score = 48.2 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 52/343 (15%), Positives = 107/343 (31%), Gaps = 42/343 (12%) Query: 163 GIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNA 220 + Y + + + + + H + I+ +S T D G L A Sbjct: 249 NLYLDEYFWIHKFQEFRKVASGMAIHAKWRQTYISTPSSITHDAYPFWTGKLFNRGRPKA 308 Query: 221 NRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDV 279 +R I S+ +G+ W++ +D +G + + + Y D Sbjct: 309 DRIEIDVSHSALANGR-----RCEDGQWRQVVTVDDAIRKGCNLFDPDTLHLEY--SPDE 361 Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNR-----------EPCPDPYAPLIMGCDIAEEG 328 + +F D S P+ +++ + P P + + +G D + G Sbjct: 362 YSNLLMCEFID-DTMSVFPMVMMQRCMVDSWEVWTDYKPFAPRPLAHREVWIGYDPNKGG 420 Query: 329 GDNTVVVLRR------GPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380 ++ + G + W+ D I + KY I ID G Sbjct: 421 KGDSAGCIVICPPAVPGGKFRVIEKHRWNGMDFEAQAKAIQDICNKYNVTFIGIDTTGLG 480 Query: 381 ARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG---LIQ 437 ++ V L ++++ +K D + L +G L Q Sbjct: 481 EAVYQLVKKFFPQVTPFLYNPV---------LKSQMVIKAYDVISKGRLEYDAGWTDLAQ 531 Query: 438 NLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480 S++ + + ++ ES R + D + M+ P Sbjct: 532 AFMSIRKTLTASGKQVTYESARSEEISHADIAWAAMHALYNEP 574 >gi|254360872|ref|ZP_04977019.1| bacteriophage terminase large subunit [Mannheimia haemolytica PHL213] gi|153092346|gb|EDN73415.1| bacteriophage terminase large subunit [Mannheimia haemolytica PHL213] Length = 600 Score = 48.2 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 38/132 (28%), Gaps = 18/132 (13%) Query: 276 DSDVTRVEVCGQFPQQDIDSF----IPLNIIEEA--------LNREPCPDPYAPLIMGCD 323 D +F + F + +++ + P + +G D Sbjct: 366 SPDEFEQLFMCEFIDDNQSVFKFTMMQRCLVDSMEVWRDYVFTDGYQRPFGNKEVWVGYD 425 Query: 324 IAEEGGDNTVVVL----RRGPVIEHLF--DWSKTDLRTTNNKISGLVEKYRPDAIIIDAN 377 + G + +VV+ G L + D +I + KY + ID Sbjct: 426 PSYTGDRSALVVIAPPKVDGGKFRLLEYRTFKGADFAEQAAEIVAICAKYNVTRLAIDTT 485 Query: 378 NTGARTCDYLEM 389 G + ++ Sbjct: 486 GLGVGVYEIVKK 497 >gi|322412171|gb|EFY03079.1| phage terminase [Streptococcus dysgalactiae subsp. dysgalactiae ATCC 27957] Length = 471 Score = 48.2 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 51/347 (14%), Positives = 111/347 (31%), Gaps = 47/347 (13%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQ + + A + + + GKT + + LW + G+ ++ Sbjct: 43 PWQENMLIPIMAIDEDGLWVHQKYGYAIPRRN----GKTEVVYIVELWAL--HKGLKILH 96 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 A+ + + +V K+L + + + + + A + + G + + Sbjct: 97 TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKASGAVIQFRT-- 150 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232 RT + + F +I DEA + +T+ + N IM P Sbjct: 151 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 200 Query: 233 L--SGKFYEIFNKP----------LDDWKRFQ------IDTRTVEGIDPSFH---EGIIA 271 + +G +E + K +W + + + + FH I A Sbjct: 201 IVSTGTVFEAYRKDCLKGNKRYSGWAEWSVPEMVKINDVSSWYISNPSMGFHLNERKIEA 260 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330 G D ++ G + + S I + L E P+ + L +G ++G + Sbjct: 261 ELGEDEIDHNIQRLGYWSSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 318 Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 + + R + +R I ++ ++ID Sbjct: 319 VSLSIAARTSENKVFVEVIDCLSVRNGTQWIINFLKSADIAKVVIDG 365 >gi|194289059|ref|YP_002004966.1| bacteriophage p2 gpp capsid protein; terminase, atpase subunit [Cupriavidus taiwanensis LMG 19424] gi|193222894|emb|CAQ68899.1| bacteriophage P2 GPP capsid protein; Terminase, ATPase subunit [Cupriavidus taiwanensis LMG 19424] Length = 593 Score = 48.2 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 23/137 (16%), Positives = 38/137 (27%), Gaps = 21/137 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE-----------PCPDP 314 E + Y + F S PL+++ + P P Sbjct: 355 LEQLRREY--SDADFENLLMCGFIDDTA-SVFPLSMLMRCMVDSWEVWEDFRHWSPRPLG 411 Query: 315 YAPLIMGCDIAEEGGDNTVVVLRR-----GPVIEHLFD--WSKTDLRTTNNKISGLVEKY 367 + +G D GGD+ +V+ G L + D + + E+Y Sbjct: 412 NREVWVGYDPNGGGGDSAALVVVAPPLVPGGKFRVLEKHQFRGIDYEEQAAAVLKVCERY 471 Query: 368 RPDAIIIDANNTGARTC 384 I ID G Sbjct: 472 NVTYIGIDRTGVGDAVY 488 >gi|154488071|ref|ZP_02029188.1| hypothetical protein BIFADO_01641 [Bifidobacterium adolescentis L2-32] gi|154083544|gb|EDN82589.1| hypothetical protein BIFADO_01641 [Bifidobacterium adolescentis L2-32] Length = 477 Score = 48.2 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 59/376 (15%), Positives = 107/376 (28%), Gaps = 60/376 (15%) Query: 52 RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISA-GRGIGKTTLNAWLVLWLMSTRPGISV 110 WQ + +V A + + + A+ + R GKT W+ + + PG+ + Sbjct: 37 DPWQRQINRIVLAKSADGFWSA-----RNAVLSIPRQTGKTYDIGWVAIHRAARTPGMRI 91 Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLH---CSLGIDSK 167 + A + S++ + + D H + G + Sbjct: 92 VWTA---------------QHFSVIKDTFESLCAIVLRPEMSGLVDPDHGISLAAGKEEI 136 Query: 168 HYSTMCRT-YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIM 226 + R + G ++ DEA D +L R N I Sbjct: 137 RFRNGSRIFFRARERGALRGV--KKIALLVIDEAQHLSDSAMASMLPT-QNRAYNPQTIY 193 Query: 227 TSNPRRLSGKFYEIFNKPLDDWKRFQIDT---------RTVEGIDPSFHEGIIARY---- 273 P E F + D + + + R + +D Y Sbjct: 194 MGTPPGPRDNG-EAFTRLRDKARAGRTHSTLYVEFAADRDADPLDREQWRKANPSYPAHT 252 Query: 274 ----------GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCD 323 L D R E G + + + I EEA P + G D Sbjct: 253 SDESIANLWENLTGDDFRREALGIWDEHALSRAIDRRQWEEATIDARRP--GGVMSFGID 310 Query: 324 IAEEGGDNTV---VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEK--YRPDAIIIDANN 378 + + T+ + G L ++ T+ T L++K + A++ID + Sbjct: 311 MNPQRTRLTIGACMRYDDGTAHIELAEYRDTNHDGT-MWAVNLIDKVWEQTAALVIDGQS 369 Query: 379 TGARTCDYLEMLGYHV 394 L G V Sbjct: 370 PATALLPDLAEAGITV 385 >gi|264678567|ref|YP_003278474.1| hypothetical protein CtCNB1_2432 [Comamonas testosteroni CNB-2] gi|262209080|gb|ACY33178.1| hypothetical conserved protein [Comamonas testosteroni CNB-2] Length = 322 Score = 48.2 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 40/133 (30%), Gaps = 19/133 (14%) Query: 275 LDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR------------EPCPDPYAPLIMGC 322 DV QF D + PL++++ + P Y + +G Sbjct: 86 KSEDVFNNLYMCQFVD-DALAVFPLSVLQRCMVDSWDAWRKDFKAFAQRPFGYKRVWVGY 144 Query: 323 DIAEEGGDNTVVVL----RRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 D + G +VVL + G L+ D I + + Y + + ID Sbjct: 145 DPSLTGDKAALVVLAPPDKPGGKCRILYKVQLHGVDFEAQAAAIKKVCDSYSVEKMTIDI 204 Query: 377 NNTGARTCDYLEM 389 G + Sbjct: 205 TGLGNGVYQLVRK 217 >gi|56808979|ref|ZP_00366686.1| COG1783: Phage terminase large subunit [Streptococcus pyogenes M49 591] gi|71910836|ref|YP_282386.1| terminase large subunit [Streptococcus pyogenes MGAS5005] gi|71853618|gb|AAZ51641.1| terminase large subunit [Streptococcus pyogenes MGAS5005] Length = 424 Score = 48.2 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 37/287 (12%), Positives = 85/287 (29%), Gaps = 35/287 (12%) Query: 57 EFMEVVDAHCLNSVNNP-NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLAN 115 + +++ V NP++ A GRG GK++ A+++ L+ P ++ +C+ Sbjct: 4 DLADIIPIGFKPVVQATWNPQILNIACKGGRGSGKSSNIAFIISRLIIQYP-VNAVCIRK 62 Query: 116 SETQLKTTLWAEVSKWLSLLPNKHWFEMQ--------SLSLHPAPWYSDVLHCSLGIDSK 167 ++ L+ +++ ++ +S + +F+ + + + Sbjct: 63 TDNTLEQSVYEQIKWAISEQGLERYFKFNKSPLRITYIPRGNYIVFRGAQNPERIKSLKD 122 Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMT 227 + + DE + + G LG + T Sbjct: 123 SRFPFAIGW----IEELAEFKTE-------DEVKTITNSLLRGELG----DGLFYKFFYT 167 Query: 228 SNPRRLSGKF----YEIFNKPLDDWKRFQIDTR-TVEGIDPSFHEGIIARYGLDSDVTRV 282 NP + + YE +P + + T I F A R Sbjct: 168 YNPPKRKQSWVNKKYESQFQPSNTF--VHASTYKDNPFIAKEFIAEAEATRERSERRYRW 225 Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGG 329 E G+ +P + + + + + G D Sbjct: 226 EYLGEAIGS---GVVPFDNLRFERITDEQVADFDNIRNGIDYGYATD 269 >gi|225575978|ref|YP_002724813.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] gi|225576296|ref|YP_002725339.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] gi|225547342|gb|ACN93326.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] gi|225547454|gb|ACN93434.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] Length = 450 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 54/163 (33%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294 F + +K ++ T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNTATFKTYKFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NII++ + P I D A GGDNT + + Sbjct: 265 IFTQINIIQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299 >gi|17546658|ref|NP_520060.1| terminase (ATPase subunit) related protein [Ralstonia solanacearum GMI1000] gi|17428957|emb|CAD15641.1| probable terminase (atpase subunit) related protein [Ralstonia solanacearum GMI1000] Length = 506 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 22/137 (16%), Positives = 40/137 (29%), Gaps = 21/137 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-----------EPCPDP 314 + + Y + F + S PL+++ + P P Sbjct: 355 LDQLRLEYSE--PEFANLLMCAFIDDNA-SVFPLSMLMRGMVDSWEAWEDFRPFAPRPFG 411 Query: 315 YAPLIMGCDIAEEGGDNTVVVLRR-----GPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367 P+ +G D GGD+ +V+ G L + D I + E++ Sbjct: 412 NRPVWVGYDPNGGGGDSAALVVVAPPLVPGGKFRVLERHQFRGIDYEEQAGAIRRVCERF 471 Query: 368 RPDAIIIDANNTGARTC 384 + ID G Sbjct: 472 NVAYVGIDRTGIGDAVF 488 >gi|134288784|ref|YP_001111035.1| gp4, phage terminase, ATPase subunit [Burkholderia phage phiE202] gi|134131997|gb|ABO60745.1| gp4, phage terminase, ATPase subunit [Burkholderia phage phiE202] Length = 589 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 39/142 (27%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + + F + S L+ ++ + P Sbjct: 350 IDELRREYSAEE--FANLLMCHFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367 + + +G D A G +VV+ G + + D I + ++Y Sbjct: 407 GHREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488 >gi|224593667|ref|YP_002641021.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi CA-11.2a] gi|224554694|gb|ACN56072.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi CA-11.2a] Length = 450 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 54/163 (33%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K ++ T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYKFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|323188196|gb|EFZ73489.1| terminase, ATPase subunit [Escherichia coli RN587/1] Length = 594 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 49/162 (30%), Gaps = 21/162 (12%) Query: 246 DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 W++ ++ +G D E + + QF S PL++++ Sbjct: 333 GQWRQIVTVEDAINQGYDLFDLEQLRLE--NSPEEFANLFMCQFIDDTA-SVFPLSMLQG 389 Query: 305 AL-NREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD-- 347 + + D Y P + +G D A G VV+ G + Sbjct: 390 CMVDSWEVWDDYKPFALRPLGERSVWVGYDPALSGDSAGCVVVAPPVIEGGKFRVIEKHQ 449 Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 W D I + E+Y I ID G ++ Sbjct: 450 WHGMDFAAQAENIRKITERYNVTYIGIDVTGIGHGVHQLVKQ 491 >gi|221067857|ref|ZP_03543962.1| phage terminase, large subunit, PBSX family [Comamonas testosteroni KF-1] gi|220712880|gb|EED68248.1| phage terminase, large subunit, PBSX family [Comamonas testosteroni KF-1] Length = 434 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 56/330 (16%), Positives = 112/330 (33%), Gaps = 39/330 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GK+ A ++L + ++RP V+C E+ K + Sbjct: 39 GGRGGGKSWTVAAVLLVMAASRPL-RVLCT------------REIQK-SIKQSVHQLLKD 84 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHY-STMCRTYSEERPDTFVGHHNTYGMAIINDEASG 202 L+ ++ + GI+ + + ++++ + +F G + +EA G Sbjct: 85 VIARLNLHAFFEVLETEVRGINGSLFLFSGLQSHTVDSIKSFEGCD-----IVWVEEAHG 139 Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF-NKPLDDWKRFQIDTRTVEGI 261 ++ + + + + + NP + + Y+ F P D +I+ R Sbjct: 140 VSKKSWDTLIPTIRKEGSEIWLTL--NPDMETDETYQRFIATPCPDTWVVEINWRDNPWF 197 Query: 262 DPSFHEGII-ARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE---EALNREPCPDPYAP 317 E A+ + +D G+ + + + + R+ DP P Sbjct: 198 PRVLDEERRKAKRTMLADDYAHIWEGKARRVAAGAIYRHEMESVYLDNRARDVPYDPTLP 257 Query: 318 LIMGCDIAEEGGDNTVVVLRRGP-----VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI 372 + D+ D + L + +I H+ D +T L K+ L ++ D + Sbjct: 258 VHTVWDLGWN--DAMSIALVQRGPQDVRIIGHIEDSHRT-LDWYVAKLEKLPYRWGTDYL 314 Query: 373 IIDAN----NTGARTCDYLEMLGYHVYRVL 398 D TG T L LG V Sbjct: 315 PHDGKTRNFQTGKSTEQLLRELGRRSVMVQ 344 >gi|254192775|ref|ZP_04899210.1| putative terminase, ATPase subunit [Burkholderia pseudomallei S13] gi|254197102|ref|ZP_04903525.1| putative terminase, ATPase subunit [Burkholderia pseudomallei S13] gi|169649529|gb|EDS82222.1| putative terminase, ATPase subunit [Burkholderia pseudomallei S13] gi|169653844|gb|EDS86537.1| putative terminase, ATPase subunit [Burkholderia pseudomallei S13] Length = 601 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + + QF + S L+ ++ + P Sbjct: 362 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 418 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367 Y + +G D A G +VV+ G + + D I + ++Y Sbjct: 419 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQATAIEAITQRY 478 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 479 NVGYIAIDTTGMGQGVYQLVRK 500 >gi|167839678|ref|ZP_02466362.1| phage terminase, ATPase subunit [Burkholderia thailandensis MSMB43] Length = 589 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 37/138 (26%), Gaps = 21/138 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + + F + S L+ ++ + P Sbjct: 350 IDELRREYSAEE--FANLLMCHFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367 + + +G D A G +VV+ G L + D I + +Y Sbjct: 407 GHREVWVGYDPALTGDSAGLVVVAPPRVDGGAFRVLERHQFRGNDFEEQAAAIEAITRRY 466 Query: 368 RPDAIIIDANNTGARTCD 385 I ID G Sbjct: 467 NVGYIAIDTTGMGQGVYQ 484 >gi|82776058|ref|YP_402405.1| hypothetical protein SDY_0732 [Shigella dysenteriae Sd197] gi|33323489|gb|AAQ07461.1| HI1410 hypothetical protein-like protein [Shigella flexneri] gi|81240206|gb|ABB60916.1| hypothetical bacteriophage protein [Shigella dysenteriae Sd197] Length = 97 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 9/45 (20%), Positives = 16/45 (35%), Gaps = 3/45 (6%) Query: 443 KSFIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 44 PHRDFDRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 88 >gi|308071887|emb|CBW54808.1| putative DNA maturase B [Pantoea phage LIMElight] Length = 614 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 67/461 (14%), Positives = 132/461 (28%), Gaps = 89/461 (19%) Query: 2 SRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEV 61 R +P++ TE + + ++F F + G GF Q + + Sbjct: 25 PRTIPSDKRTELAMM-------LAITFKEFRDFAY----VGMRFLGFELTDM-QADIADY 72 Query: 62 VDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQ-- 119 + K ++A RG K+TL A +W + V+ L+ E Q Sbjct: 73 MQYG-----------PRKKMVAAQRGEAKSTLAALYSVWRLIQDQRCRVLILSGGEQQAS 121 Query: 120 -LKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178 + T + + W P W + S + + +HC L K S C + Sbjct: 122 EVATLVIRLIETW----PLLCWLKADSTRGDRTSYTAYDVHCDLKPLDKSPSVACIGVTA 177 Query: 179 ERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRF--------------W 224 + G PD I G Sbjct: 178 ----SLQGKRADLL----------IPDDIETTKNGMTQTEREKLLTVSKDFAAICTHGDT 223 Query: 225 IMTSNPRRLSGKFYEIFNKPL------DDWKRFQIDTRTVEGIDPSFHEGI-----IARY 273 + P+ + + + +++ R E + P HE I + Y Sbjct: 224 LYLGTPQTKDSIYKTLPARGFEVRVWPGRIPSLEMEERYGETLAPYIHELIAAGYSRSGY 283 Query: 274 GLDSDVTRVEVCGQFPQQD----IDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGG 329 G+D + + G++ + D F P + + D I D+ GG Sbjct: 284 GVDGTLGQSTDTGRYSEDDLIEKELDFGPEGFQLQYMLDTSLLDAMRTKIKLSDLLIHGG 343 Query: 330 DNTV-----VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN------ 378 D + + ++ + + + +Y+ ++ID Sbjct: 344 DTDTAPDRFMYAADRRNLVEEYEPIRGEKLYYPAGTGSEMLQYKHKLMVIDPAGCGGDEI 403 Query: 379 ---TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTEL 416 G Y+ + + V G +++ + E+ Sbjct: 404 SYAAGGAVSSYIHL--FSVGGFQGGVSTENIDKVIDLAIEM 442 >gi|254197041|ref|ZP_04903465.1| putative terminase, ATPase subunit [Burkholderia pseudomallei S13] gi|169653784|gb|EDS86477.1| putative terminase, ATPase subunit [Burkholderia pseudomallei S13] Length = 576 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + + QF + S L+ ++ + P Sbjct: 337 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 393 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367 Y + +G D A G +VV+ G + + D I + ++Y Sbjct: 394 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQATAIEAITQRY 453 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 454 NVGYIAIDTTGMGQGVYQLVRK 475 >gi|237720954|ref|ZP_04551435.1| phage terminase large subunit [Bacteroides sp. 2_2_4] gi|229449789|gb|EEO55580.1| phage terminase large subunit [Bacteroides sp. 2_2_4] Length = 450 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 44/242 (18%), Positives = 81/242 (33%), Gaps = 40/242 (16%) Query: 262 DPSFHEGIIARYGLDSDVTRVE-VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIM 320 DP++ ++ + SD R + G + + I EAL R + Sbjct: 201 DPTYLANLVNQ----SDEQRARDLDGNWKYKAAGDDIIKLTHMEALYRNSMQIGDGIRRV 256 Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY--RPDAIIIDANN 378 CD A EGGD+ V+ L G I +F K D + T + + ++E++ R + D N Sbjct: 257 SCDAAFEGGDSLVMWLWEGWHIRDIFV-CKLDSKKTVDTVKAMLEEWHVREECFTYDLNG 315 Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEF----CRNRRTELHVKMADWLEFASLINHSG 434 G G+ + + E N +++ A + + Sbjct: 316 LGQI------FKGFFPNAIPFNNKEAVEEKFKYIYANLKSQAAYLFAQKIINREISIEPT 369 Query: 435 LIQNLKSLKSFIVPNTGELA-IESKRVKG---------------------AKSTDYSDGL 472 L++ S K F ++ E K ++ S D+ + L Sbjct: 370 LLERKFSGKGFEKVPLRQILDKERKAIRKDEDSEEKGWTIIKKIIMKKLVGHSPDFIEAL 429 Query: 473 MY 474 + Sbjct: 430 LM 431 >gi|226941496|ref|YP_002796570.1| Terminase [Laribacter hongkongensis HLHK9] gi|226716423|gb|ACO75561.1| Terminase [Laribacter hongkongensis HLHK9] Length = 578 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 40/129 (31%), Gaps = 18/129 (13%) Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIM 320 R D R +F S PL+++ + P + + + Sbjct: 345 RLENSPDEFRQLFECEFIDDG-KSVFPLSMLHRCMVDSMEAWPDYNPFTLRPLGHREVWI 403 Query: 321 GCDIAEEGGDNTVVVLR-----RGP-VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIII 374 G D +E G +VV+ GP + + D I ++YR I I Sbjct: 404 GYDPSESGDSAAMVVVAPPAVPDGPFRLLECRQFRGLDYSAQAQAIKEATDRYRVTHIAI 463 Query: 375 DANNTGART 383 D G+ Sbjct: 464 DRTGLGSAV 472 >gi|58581337|ref|YP_200353.1| phage-related terminase [Xanthomonas oryzae pv. oryzae KACC10331] gi|58425931|gb|AAW74968.1| phage-related terminase [Xanthomonas oryzae pv. oryzae KACC10331] Length = 594 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 41/142 (28%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR------------EPCPD 313 + + Y D + F S L +++ + P Sbjct: 356 IDELREEY--SPDAFANLLMCDFVDDGA-SIFSLAMLQPCMVDSWVEWGQDYKPFAVRPY 412 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367 + +G D AE G +VVL + G L + D +I + +Y Sbjct: 413 GDRAVWIGYDPAETGDTAGLVVLAPPQQPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 472 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G+ ++ Sbjct: 473 WVTYIGIDTTGMGSGVAQLVKQ 494 >gi|154247555|ref|YP_001418513.1| hypothetical protein Xaut_3628 [Xanthobacter autotrophicus Py2] gi|154161640|gb|ABS68856.1| protein of unknown function DUF264 [Xanthobacter autotrophicus Py2] Length = 690 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 48/322 (14%), Positives = 96/322 (29%), Gaps = 63/322 (19%) Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNANRF--WIMTSNPRRLSGKFYEIFNKPLDDW 248 + + +E S P L L + + NP Y F + D Sbjct: 380 HNCTLYFNECSQIPYSSILVARTRLAQVVPGLMQRALYDLNPAGTGHWTYREFIEGRDPI 439 Query: 249 KRFQIDTRTV------------EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF 296 + + + P F + A G + + + Sbjct: 440 SGAPLSAPDNFQHMFLNPGDNAKNLSPEFLRSLEALPEKQRRRF---FDGMYVAEIDGAL 496 Query: 297 IPLNIIEEALNREPCPDPYA--PLIMGCD------------------IAEEGGDNTVVVL 336 L++IE + P + +++G D +A D T V+L Sbjct: 497 WTLDLIERCRSEPIAPGDHRLRRIVIGVDPSGAANKEDARSDEIGIVVAGMMDDGTAVIL 556 Query: 337 RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYR 396 G V + W K ++GL K+ D +I + N G +++ + Sbjct: 557 EDGTVRDGPSGWGKV--------VAGLYHKWGADRVIAERN-YGGAMVEFVILTADKSIP 607 Query: 397 VLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIE 456 V + ++ R E ++ E + H+G ++ + +G + Sbjct: 608 V----SVITASRGKHIRAE---PVSALYEQGK-VRHAGRFPEMED-QFTNFSTSGYM--- 655 Query: 457 SKRVKGAKSTDYSDGLMYTFAE 478 G +S D +D ++ E Sbjct: 656 -----GDRSPDRADAAVWALTE 672 >gi|226939350|ref|YP_002794423.1| Terminase [Laribacter hongkongensis HLHK9] gi|226714276|gb|ACO73414.1| Terminase [Laribacter hongkongensis HLHK9] Length = 578 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 25/129 (19%), Positives = 40/129 (31%), Gaps = 18/129 (13%) Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIM 320 R D R +F S PL+++ + P + + + Sbjct: 345 RLENSPDEFRQLFECEFIDDG-KSVFPLSMLHRCMVDSMEAWPDYNPFTLRPLGHREVWI 403 Query: 321 GCDIAEEGGDNTVVVLR-----RGP-VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIII 374 G D +E G +VV+ GP + + D I E+YR I I Sbjct: 404 GYDPSESGDSAAMVVVAPPAVPDGPFRLLECRQFRGLDYSAQAQAIKEATERYRVTHIAI 463 Query: 375 DANNTGART 383 D G+ Sbjct: 464 DRTGLGSAV 472 >gi|213427183|ref|ZP_03359933.1| terminase subunit [Salmonella enterica subsp. enterica serovar Typhi str. E02-1180] Length = 195 Score = 47.8 bits (112), Expect = 0.004, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 30/83 (36%), Gaps = 9/83 (10%) Query: 311 CPDPYAPLIMGCDIAE--EGGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKIS 361 P + + +G D A+ + GD+ V+ G L W D R + I Sbjct: 8 RPFGWREVWIGYDPAKGTQNGDSAGCVVIAPPTVPGGKFRILERHQWRGMDFRAQADAIK 67 Query: 362 GLVEKYRPDAIIIDANNTGARTC 384 L ++Y I ID+ G Sbjct: 68 KLTQQYNVTYIGIDSTGVGHGVY 90 >gi|251778523|ref|ZP_04821443.1| phage terminase, large subunit, pbsx family [Clostridium botulinum E1 str. 'BoNT E Beluga'] gi|243082838|gb|EES48728.1| phage terminase, large subunit, pbsx family [Clostridium botulinum E1 str. 'BoNT E Beluga'] Length = 448 Score = 47.8 bits (112), Expect = 0.005, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 61/190 (32%), Gaps = 10/190 (5%) Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI-FNKPLDDWKRFQI 253 I+ +E + + L +N + NP S YE+ F D+ + Sbjct: 142 IVVEECTEIDKQEFSQLGLRLRSKNGYNQIHVMFNPISKSNWVYEMWFQNGYDESDTMVL 201 Query: 254 DT--RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNI--IEEALNRE 309 T + + + + +I D R+ G+F +D I N ++ + Sbjct: 202 KTTYKDNKFLPYDYINALIKMKETDPVYYRIYALGEF--ASLDKLIYTNWEELDFDWRKL 259 Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPV---IEHLFDWSKTDLRTTNNKISGLVEK 366 PYA G D + + + V I ++ + L + + Sbjct: 260 MQQRPYAKACFGLDFGYVNDPSAFIAMIVDEVNKEIYIFDEFYEKGLLNDALALKIVKRG 319 Query: 367 YRPDAIIIDA 376 Y + I D+ Sbjct: 320 YGKEIIFADS 329 >gi|163746673|ref|ZP_02154030.1| hypothetical protein OIHEL45_14759 [Oceanibulbus indolifex HEL-45] gi|161379787|gb|EDQ04199.1| hypothetical protein OIHEL45_14759 [Oceanibulbus indolifex HEL-45] Length = 414 Score = 47.8 bits (112), Expect = 0.005, Method: Composition-based stats. Identities = 59/423 (13%), Positives = 109/423 (25%), Gaps = 73/423 (17%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGIS---------VICLANSETQLKTTLWAEVSKWL 132 I GRG GKT A W+ S G V + + Q++ + Sbjct: 22 IMGGRGAGKTRAGA---EWVRSKVEGSRPLDPGECSRVALVGETIEQVREVM-------- 70 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 F + P + + ++ P+ G Sbjct: 71 -------IFGDSGILACSPPDRRPDWEATRKRLVWPNGAIATVHTAHDPEGLRGPQFD-- 121 Query: 193 MAIINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250 A DE + + L +T+ PR + + + Sbjct: 122 -AAWVDELAKWKRGQEAWDQLQFAL-RLGERPQVCVTTTPRNVD--VLKALLQSPSTVTT 177 Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 + SF E + ARY + + R E+ G + ++E R Sbjct: 178 HAPTEANAANLAGSFLEEVRARY-RGTRLGRQELDGVLLADAEGALWTSALLEA--GRVQ 234 Query: 311 CPDPYAPLIMGCDIA---EEGGDNTVVVLRRGPVIEHLFDWS----------KTDLRTTN 357 +++G D A G D +V+ +W Sbjct: 235 VAPELDRIVVGLDPATTSGAGSDECGIVVVGAQTQGPPQEWRAVVLADCTVQGATPNGWA 294 Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRRTE 415 + +Y D ++ + N G + L + + V + V R E Sbjct: 295 QAAIAAMTRYGADRLVAEVNQGGQLVSEVLRQVDPLVSLKTVHAARGKV-------ARAE 347 Query: 416 LHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 + + + L L + + + G S D D L++ Sbjct: 348 PVAALYEQGRVSHLPGLDALEDQMCLMTARGYEGKG-------------SPDRVDALVWA 394 Query: 476 FAE 478 E Sbjct: 395 LHE 397 >gi|226940436|ref|YP_002795510.1| Terminase large subunit [Laribacter hongkongensis HLHK9] gi|226715363|gb|ACO74501.1| Terminase large subunit [Laribacter hongkongensis HLHK9] Length = 93 Score = 47.8 bits (112), Expect = 0.005, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 35/80 (43%), Gaps = 10/80 (12%) Query: 10 ETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNS 69 + + +L +L E + LH + WG LEG + PR+WQ E M + H N Sbjct: 3 DIDDELIELA--AECATDPLRWALHAYDWGR--GELEGVTGPRAWQREVMSDIGNHLKNP 58 Query: 70 VNNPNPEVFKGAISAGRGIG 89 + A AGRG+G Sbjct: 59 ATRFS------AFDAGRGLG 72 >gi|219723016|ref|YP_002474442.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|219692691|gb|ACL33908.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] Length = 450 Score = 47.8 bits (112), Expect = 0.005, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNMATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|226246851|ref|YP_002776184.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 29805] gi|226202003|gb|ACO38584.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 29805] Length = 450 Score = 47.8 bits (112), Expect = 0.005, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNMATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|83717940|ref|YP_439548.1| putative ATPase subunit of terminase (gpP-like) [Burkholderia thailandensis E264] gi|83651765|gb|ABC35829.1| Putative ATPase subunit of terminase (gpP-like) [Burkholderia thailandensis E264] Length = 601 Score = 47.4 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 20/142 (14%), Positives = 39/142 (27%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + + F + S L+ ++ + P Sbjct: 362 IDELRREYSAEE--FANLLMCHFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 418 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367 Y + +G D A G +VV+ G + + D I + ++Y Sbjct: 419 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 478 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 479 NVGYIAIDTTGMGQGVYQLVRK 500 >gi|257142677|ref|ZP_05590939.1| putative ATPase subunit of terminase (gpP-like) protein [Burkholderia thailandensis E264] Length = 589 Score = 47.4 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 20/142 (14%), Positives = 39/142 (27%), Gaps = 21/142 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 + + Y + + F + S L+ ++ + P Sbjct: 350 IDELRREYSAEE--FANLLMCHFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367 Y + +G D A G +VV+ G + + D I + ++Y Sbjct: 407 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466 Query: 368 RPDAIIIDANNTGARTCDYLEM 389 I ID G + Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488 >gi|269838926|ref|YP_003323618.1| hypothetical protein Tter_1890 [Thermobaculum terrenum ATCC BAA-798] gi|269790656|gb|ACZ42796.1| hypothetical protein Tter_1890 [Thermobaculum terrenum ATCC BAA-798] Length = 534 Score = 47.4 bits (111), Expect = 0.005, Method: Composition-based stats. Identities = 35/247 (14%), Positives = 71/247 (28%), Gaps = 33/247 (13%) Query: 175 TYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLS 234 T+ P+ NT + ++ +EA + + M + + Sbjct: 186 TFLSASPEASA-RGNTASLLLVANEAQDISPDRWDAVFDPMAASTNATTIFMGTVWTSRT 244 Query: 235 -----GKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGI---IARYGLDSDVTRVEVCG 286 ++ + + F + + V P++ E + IA+ G R E Sbjct: 245 LLARQMRYLRQLELEVGRRRVFMVPWQEVARHVPAYGERVQARIAQLGRHHPFVRTEY-C 303 Query: 287 QFPQQDIDSFIPLNIIEEALN---REPCPDPYAPLIMGCDIAEEG--------GDNTVVV 335 D F P + E R+ P P D+ E D+T + Sbjct: 304 LEELSDDGGFFPPAVTERMRGDHPRQLLPTPGRTYAALLDVGGEDLAAGPSPRRDSTALT 363 Query: 336 LRRG-----------PVIEHLFDWSKTDLRTTNNKISGLVEK-YRPDAIIIDANNTGART 383 + + + W+ ++ LV + +++DA GA Sbjct: 364 IVEVCHPEGADLQPVYRVMTRYVWTGVGQPELLPQVVHLVRDVWACRRLVVDATGLGAGL 423 Query: 384 CDYLEML 390 L + Sbjct: 424 ASALRRI 430 >gi|317490974|ref|ZP_07949410.1| terminase [Enterobacteriaceae bacterium 9_2_54FAA] gi|316920521|gb|EFV41844.1| terminase [Enterobacteriaceae bacterium 9_2_54FAA] Length = 590 Score = 47.4 bits (111), Expect = 0.006, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 57/182 (31%), Gaps = 23/182 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + E + RY + + F D+ + Sbjct: 328 PDGQWRYIITMEDAIRGGFNLADIERLRNRY--NDSTFAMLYMCVFVDSK-DAVFSFEDL 384 Query: 303 EEA-LNREP---------CPDPYAPLIMGCDIAEEGGDNT-------VVVLRRGPVIEHL 345 E ++R+ P + G D A G +T V+ + + + + Sbjct: 385 ERCGVDRDIWQDFDIKLKRPFGDREVWAGYDPARSGDLSTFAVLAPPVLAVEK-FRVLEI 443 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML-GYHVYRVLGQKRAV 404 +W R N+I L KY + ID G + ++ G + + Sbjct: 444 VNWHGMSFRWQANEIKKLFAKYNIRYLGIDVTGIGNAVFENIQHFAGRVAVPIRYSVKTK 503 Query: 405 DL 406 D Sbjct: 504 DE 505 >gi|30062201|ref|NP_836372.1| hypothetical protein S0695 [Shigella flexneri 2a str. 2457T] gi|309786465|ref|ZP_07681089.1| phage terminase large subunit domain protein [Shigella dysenteriae 1617] gi|30040446|gb|AAP16178.1| hypothetical bacteriophage protein [Shigella flexneri 2a str. 2457T] gi|308925653|gb|EFP71136.1| phage terminase large subunit domain protein [Shigella dysenteriae 1617] gi|313649746|gb|EFS14170.1| phage terminase large subunit domain protein [Shigella flexneri 2a str. 2457T] gi|332761021|gb|EGJ91309.1| phage terminase large subunit domain protein [Shigella flexneri 4343-70] gi|332761177|gb|EGJ91463.1| phage terminase large subunit domain protein [Shigella flexneri 2747-71] gi|332763392|gb|EGJ93632.1| phage terminase large subunit domain protein [Shigella flexneri K-671] gi|333008020|gb|EGK27496.1| phage terminase large subunit domain protein [Shigella flexneri K-218] gi|333021447|gb|EGK40697.1| phage terminase large subunit domain protein [Shigella flexneri K-304] Length = 77 Score = 47.4 bits (111), Expect = 0.006, Method: Composition-based stats. Identities = 9/45 (20%), Positives = 16/45 (35%), Gaps = 3/45 (6%) Query: 443 KSFIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484 G + +ESK+ + S + +D + FA D Sbjct: 24 PHRDFDRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 68 >gi|226315790|ref|YP_002776047.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 29805] gi|226201663|gb|ACO38256.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 29805] Length = 450 Score = 47.4 bits (111), Expect = 0.006, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 54/163 (33%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294 F + + +K + T + F E Y D + + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKPY-KDIPLYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|169794751|ref|YP_001712544.1| phage-related terminase, ATPase subunit (GPP-like) [Acinetobacter baumannii AYE] gi|169147678|emb|CAM85541.1| phage-related terminase, ATPase subunit (GPP-like) [Acinetobacter baumannii AYE] Length = 604 Score = 47.4 bits (111), Expect = 0.006, Method: Composition-based stats. Identities = 43/271 (15%), Positives = 84/271 (30%), Gaps = 53/271 (19%) Query: 246 DDWKRFQIDTRTVEGI--DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303 D R ++ + E D E +IA + +F S PL++I+ Sbjct: 342 DKMWRHIVNIQDAERQGCDLFDIEELIAE--NSPEEFANLYMCEFVDDG-HSVFPLSVIQ 398 Query: 304 EALN------------REPCPDPYAPLIMGCDIAEEGGDN--TVVVLRRGPVIE-HLFDW 348 + P P+ +G D AE G V+ + L + Sbjct: 399 PCMVDSWEVWSKDFKPLALRPFGNKPVWIGYDPAESGDSAGLVVIAPPEPDYPKFRLLEH 458 Query: 349 SKTDLRTTNNK---ISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405 + ++ I L KY I +D + G + Sbjct: 459 HQFKGMDFASQAQYIKKLTTKYNVKYIGLDKSGMGTGVAQLV------------------ 500 Query: 406 LEFCRNRRTELH-VKMADWL--EFASLINHS---------GLIQNLKSLKSFIVPNTGEL 453 L+F N T + V + L + +IN + ++ +++ + + ++ Sbjct: 501 LDFFPNLTTFNYSVDVKTQLVMKAMDVINKERFEFDAGSTDVAMSIMAIRKTLTASQRQM 560 Query: 454 AIESKRVKGAKSTDYSDGLMYTFAENPPRSD 484 E+ R + D + + + FA P D Sbjct: 561 TFEASRAENIGHADLAFAIFHAFANEPLTLD 591 >gi|239504148|ref|ZP_04663458.1| putative phage terminase [Acinetobacter baumannii AB900] Length = 413 Score = 47.4 bits (111), Expect = 0.006, Method: Composition-based stats. Identities = 53/317 (16%), Positives = 88/317 (27%), Gaps = 45/317 (14%) Query: 81 AISAGRGIGKTTLNAWLV---LWLMSTRPGISVIC--LANSETQLKTTLWAEVSKWLSLL 135 A AG G GKT + + W V A + Q++ + + Sbjct: 7 AFVAGFGSGKTWVGCSSLCNKAW-----EFPKVPLGYFAPTYPQIRDIFFPTI-----EE 56 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 W + + + Y T S E+P T VG + + Sbjct: 57 VAFDWGLKTKVY--------ETNKEVDIYYGRQYRTTIICRSMEKPATIVGFKIGHAL-- 106 Query: 196 INDE----ASGTPDVINLGILGFLTERNANRF-WIMTSNPRRLSGKFYEIFNKP------ 244 DE A I+ + + A I + YE F K Sbjct: 107 -IDELDVMAMTKAQQAWRKIIARMRFKQAGLLNGIDVATTPEGFKFTYEQFVKEANKSEA 165 Query: 245 -LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 + Q T E + + + Y + + GQF + P + Sbjct: 166 KRKLYGMIQASTYDNEANLPDDYISSLYESY--PPQLISAYLRGQFVNLTSGAVYP-DFD 222 Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKIS 361 + + PL++G D V V+R G L + D T I+ Sbjct: 223 RVLNHTDEEIKKGEPLLIGMDFNVLKMAAVVYVIREG-KPRALDELVGVRDTPTMCQLIN 281 Query: 362 GLVEKYRPDAIIIDANN 378 + +I DA+ Sbjct: 282 ERFPDH-DITVIPDASG 297 >gi|195942579|ref|ZP_03087961.1| hypothetical protein Bbur8_07059 [Borrelia burgdorferi 80a] Length = 450 Score = 47.4 bits (111), Expect = 0.006, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 54/163 (33%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHRQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K ++ T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYKFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|70727357|ref|YP_254273.1| putative phage terminase large subunit [Staphylococcus haemolyticus JCSC1435] gi|68448083|dbj|BAE05667.1| putative phage terminase large subunit [Staphylococcus haemolyticus JCSC1435] Length = 421 Score = 47.4 bits (111), Expect = 0.006, Method: Composition-based stats. Identities = 32/244 (13%), Positives = 81/244 (33%), Gaps = 28/244 (11%) Query: 56 LEFMEVVDAHCLNS-VNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114 L +++ H + NP++ GRG GK++ + ++ + R ++ + + Sbjct: 5 LNLSQLIPKHFHDLWRATKNPDILNVVAKGGRGSGKSSDISIIIT-QLIMRYPMNAVVVR 63 Query: 115 NSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCR 174 ++ L T+++ ++ + H F+++ + + + R Sbjct: 64 KTDNTLATSVFEQIKWAIEEQKVSHLFKIKVS------------PMEITFIPRGNRIIFR 111 Query: 175 TYSEERPDTFVGHHNTYG-MAIINDEASG---TPDVINLGILGFL---TERNANRFWIMT 227 + P+ ++ +I+ E G T D + L + + + Sbjct: 112 GA--QNPERLKSLKDSRFPFSIMWIEELGEFKTEDEVTTITNSMLRGELDEGLFYKFYFS 169 Query: 228 SNPRRLSGKF----YEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283 NP + + YE +P + + I F + + + R E Sbjct: 170 YNPAKRKQHWANKKYETSFQPDNTFVHHS-TYLNNPFISKQFIQEAESAKQRNELRYRWE 228 Query: 284 VCGQ 287 G+ Sbjct: 229 YLGE 232 >gi|84687555|ref|ZP_01015431.1| hypothetical protein 1099457000249_RB2654_04994 [Maritimibacter alkaliphilus HTCC2654] gi|84664464|gb|EAQ10952.1| hypothetical protein RB2654_04994 [Rhodobacterales bacterium HTCC2654] Length = 260 Score = 47.0 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 39/258 (15%), Positives = 78/258 (30%), Gaps = 39/258 (15%) Query: 49 SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108 P +WQ F+ +P V R IGK+ A L PG Sbjct: 28 GPPDNWQRRFL-----------TTASPFVMALC---SRRIGKSQTTAILAA-QTIGAPGR 72 Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168 +V+ L+ + Q L+ + + SL + + + G Sbjct: 73 TVLVLSPTLGQ-SQLLFKRI---------LEAWAAMSLPIEKTRLTQTTMELANG----- 117 Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTS 228 S + + + + G+ G +I +E + D + + + ++ + Sbjct: 118 -SVVACVPAGQDGSSARGYGVKDGGLLIYEEGAFLADAVYDATIPIREDGG---RILLIT 173 Query: 229 NPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF 288 P + G + + + + +I R+ E + RY + E ++ Sbjct: 174 TPGNVGGFAHTAWTENDEI---EKITARSTEIERMAEKVAFDRRY-MPPRQFATEHELRW 229 Query: 289 PQQDIDSFIPLNIIEEAL 306 D IE A Sbjct: 230 SSGG-DPLFASETIENAF 246 >gi|298247861|ref|ZP_06971666.1| conserved hypothetical protein [Ktedonobacter racemifer DSM 44963] gi|297550520|gb|EFH84386.1| conserved hypothetical protein [Ktedonobacter racemifer DSM 44963] Length = 499 Score = 47.0 bits (110), Expect = 0.006, Method: Composition-based stats. Identities = 64/381 (16%), Positives = 114/381 (29%), Gaps = 65/381 (17%) Query: 58 FMEVVDAHCLNSVNNPNPEVFKGAISAGRG----------IGKTTLNAWLVLWLMSTRPG 107 F V L + ++ GRG +GK L+A L +L+ Sbjct: 23 FAREVLGKPLYPYQELVGDAILESVLEGRGDTFTVMFARQMGKNQLSATLEAYLLFCMRE 82 Query: 108 ISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM-QSLSLHPAPW--YSDVLHCSLGI 164 S++ A + K ++ + + + + W Y + + Sbjct: 83 GSIVKAAPTY------------KPQTINSRQRLLSLLDNPLMRNRVWKHYGYTIGMAPRH 130 Query: 165 DSKHYSTMCRT--YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANR 222 + Y T R +S + VG T + + DEA L + Sbjct: 131 EQVPYQTGPRVMFFSAGPGASIVG--ATASLLLEIDEAQSIDPNKYDTDLRPMASTTNAT 188 Query: 223 FWIMTSNPRRLSGKFY-EIFN----KPLDDWKRFQIDTRTVEGID---PSFHEGIIARYG 274 + + + N + + F D RT+ I+ + E I R G Sbjct: 189 TVLYGTAWSEETLLARMRTHNLELERLDGRQRHFAYDWRTLAAINDHYKRYVESEIKRLG 248 Query: 275 LDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-----EPCPDP-YAPLIMGCDIAEEG 328 D R + P LN ++ +L R E PDP + G DIA E Sbjct: 249 EDHISIRTQYR-LLPILGSGYL--LNDLQFSLLRGQHTWESSPDPAEGFYVAGLDIAGEQ 305 Query: 329 G----------DNTVVVLRR---------GPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369 D+T++ + R I +DW+ +S ++ ++ Sbjct: 306 HARPGQPAGKHDSTILTIGRVTINDLGLPELRIVRHYDWTGMKYTDQYAAVSRILPEWNV 365 Query: 370 DAIIIDANNTGARTCDYLEML 390 ++D G L Sbjct: 366 RRTVVDKTGLGEGLASLLSTR 386 >gi|108799880|ref|YP_640077.1| phage terminase [Mycobacterium sp. MCS] gi|119868990|ref|YP_938942.1| phage terminase [Mycobacterium sp. KMS] gi|108770299|gb|ABG09021.1| phage Terminase [Mycobacterium sp. MCS] gi|119695079|gb|ABL92152.1| phage Terminase [Mycobacterium sp. KMS] Length = 489 Score = 47.0 bits (110), Expect = 0.007, Method: Composition-based stats. Identities = 60/363 (16%), Positives = 105/363 (28%), Gaps = 70/363 (19%) Query: 52 RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLW-LMSTRPGISV 110 R WQ E SV + +P RG GK+TL A L+ + G V Sbjct: 49 REWQQELA--------GSVLDADPRPRTAGWMLPRGQGKSTLLAAYGLYDFFTGDEGAVV 100 Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170 +A E Q ++ +++ + L ++ Q L I + Sbjct: 101 CVVAVDERQAG-IIFG-IARRMVELSDELASRCQVFK------------ERLYIPERDAH 146 Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230 C P G T + DEA G + +L + A I P Sbjct: 147 FHCLPA---EPKRLEGLDYTTALL---DEA-GVASRDSYEVLTLAQGKRAQSTLIAIGTP 199 Query: 231 RRLSG--------KFYEIFNKPLDD-WKRFQI---------DTRTVEGIDPSFHEGIIAR 272 + + W+ F T E +P+ + + Sbjct: 200 GPDPNNQVLADLRNYAADHPEDASLVWREFSAAGFEDHPVDCTHCWELANPALDDFLHRD 259 Query: 273 --------YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDI 324 ++ R QF +F+P + + P PD +++ D Sbjct: 260 ALYALLPPKTREATFRRAR-LCQFASDTDGAFLPQGVWDGLSTGRPIPD-GTEVVVALD- 316 Query: 325 AEEGGDNTVVVLRR---GPVIEHLFDWSKTDLR--------TTNNKISGLVEKYRPDAII 373 D T ++ P + + W +T+ ++I ++R II Sbjct: 317 GSFSDDTTALLAGTVSAEPHFDTIHVWQRTNGDDSYRVPVAEVEDEIRAACRRWRVAEII 376 Query: 374 IDA 376 D Sbjct: 377 ADP 379 >gi|237710644|ref|ZP_04541125.1| phage terminase large subunit [Bacteroides sp. 9_1_42FAA] gi|229455366|gb|EEO61087.1| phage terminase large subunit [Bacteroides sp. 9_1_42FAA] Length = 461 Score = 47.0 bits (110), Expect = 0.007, Method: Composition-based stats. Identities = 44/242 (18%), Positives = 81/242 (33%), Gaps = 40/242 (16%) Query: 262 DPSFHEGIIARYGLDSDVTRVE-VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIM 320 DP++ ++ + SD R + G + + I EAL R + Sbjct: 212 DPTYLANLVNQ----SDEQRARDLDGNWKYKAAGDDIIKLTHMEALYRNSMQIGDGIRRV 267 Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY--RPDAIIIDANN 378 CD A EGGD+ V+ L G I +F K D + T + + ++E++ R + D N Sbjct: 268 SCDAAFEGGDSLVMWLWEGWHIRDIFV-CKLDSKKTVDTVKAVLEEWHVREECFTYDLNG 326 Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEF----CRNRRTELHVKMADWLEFASLINHSG 434 G G+ + + E N +++ A + + Sbjct: 327 LGQI------FKGFFPNAIPFNNKEAVEEKFKYIYTNLKSQAAYLFAQKIINREISIEPT 380 Query: 435 LIQNLKSLKSFIVPNTGELA-IESKRVKG---------------------AKSTDYSDGL 472 L++ S K F ++ E K ++ S D+ + L Sbjct: 381 LLERKFSGKGFEKVPLRQILDKERKAIRKDEDSEEKGWTIIKKIIMKKLVGHSPDFIEAL 440 Query: 473 MY 474 + Sbjct: 441 LM 442 >gi|15668504|ref|NP_247302.1| hypothetical protein MJ_0330 [Methanocaldococcus jannaschii DSM 2661] gi|2833503|sp|Q57776|Y330_METJA RecName: Full=Uncharacterized protein MJ0330 gi|1591049|gb|AAB98318.1| hypothetical protein MJ_0330 [Methanocaldococcus jannaschii DSM 2661] Length = 549 Score = 47.0 bits (110), Expect = 0.007, Method: Composition-based stats. Identities = 33/242 (13%), Positives = 65/242 (26%), Gaps = 35/242 (14%) Query: 85 GRGIGKTTLNAWLVLWLMS--TRPG-----ISV--ICLANSETQLKTTLWAEVSKWLSLL 135 G+G GK + + L ++M + + +A ++ K + E W Sbjct: 93 GKGGGKDFMVSLLFNYMMFRACVEDYYEKFTRIDFVNVAPNDHLAKNVFFKEFKAWFLKC 152 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 + AP +G +S R S + + Sbjct: 153 KVWQMIGIDKKKRQKAPICVLETKAEIGDKITMHSGHSRATS---------FEGMNALCV 203 Query: 196 INDEASGTPDVINLGILGFLTERNANRF-------------WIMTSNPRRLSGKFYEIFN 242 + DE D + ++ W P Y ++ Sbjct: 204 VADE---ISDPDFKNAEQLFEQGLSSAKSRFKDKARVVAITWTRFPTPNPRDDVGYRLYL 260 Query: 243 KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 + + +T E E A+Y + + R + P+ + FI L + Sbjct: 261 DYKAVDEAYTFKGKTWEVNTRVSKEDFKAQYQKNPILARCMYECEPPELNA-YFISLEAL 319 Query: 303 EE 304 E Sbjct: 320 EA 321 >gi|297848822|ref|XP_002892292.1| hypothetical protein ARALYDRAFT_470549 [Arabidopsis lyrata subsp. lyrata] gi|297338134|gb|EFH68551.1| hypothetical protein ARALYDRAFT_470549 [Arabidopsis lyrata subsp. lyrata] Length = 1406 Score = 47.0 bits (110), Expect = 0.007, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 55/155 (35%), Gaps = 13/155 (8%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGAISAG-----R--GIGKTTLNAWLVLWLMSTRPG 107 Q E E + + ++ + F+ + G G GKT L + + P Sbjct: 823 QQEGFEFIWKNLAGTILLNELKDFENSDETGGCIMSHAPGTGKTRLTIIFLQAYLQCFPD 882 Query: 108 ISVICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166 + +A + L WA E KW +P + + + ++ + S Sbjct: 883 CKPVIIAPASLLL---TWAEEFKKWNISIPFHNLSSLDFTGKESSAALGLLMQKNATARS 939 Query: 167 KHYSTMCRTYSEERPDTFVG--HHNTYGMAIINDE 199 + M + YS + + +G ++ +A + DE Sbjct: 940 NNEIRMVKIYSWIKSKSILGISYNLYEKLAGVKDE 974 >gi|163742707|ref|ZP_02150092.1| terminase, large subunit, putative [Phaeobacter gallaeciensis 2.10] gi|161383962|gb|EDQ08346.1| terminase, large subunit, putative [Phaeobacter gallaeciensis 2.10] Length = 417 Score = 47.0 bits (110), Expect = 0.007, Method: Composition-based stats. Identities = 66/416 (15%), Positives = 112/416 (26%), Gaps = 65/416 (15%) Query: 82 ISAGRGIGKTTLNA-WLVLWLMSTRPGI-----SVICLANSETQLKTTLWAEVSKWLSLL 135 I GRG GKT A W+ P + L + Q++ + S L+ Sbjct: 25 ILGGRGAGKTRAGAEWVRTLAEGATPLSAGRARRIALLGETYDQVRDVMVQGDSGLLACT 84 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 P + + +S P+ G A Sbjct: 85 PRD---------------RRPTWKATERRLIWPNGATAQAFSAHDPEALRGPQFD---AA 126 Query: 196 INDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 DE + + L + R + + R G E+ P + Sbjct: 127 WADELAKWKRGQDSWDMLQFALRLGDDPR--VCVTTTPRNVGVLRELLASPSTV-QTHAA 183 Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA-LNREPCP 312 + SF + RY S + R E+ G Q + + A + + P Sbjct: 184 TEANRANLAASFLAEVRNRY-AGSRLGRQELDGILLQDIEGALWTNAGLVAAQIAKAPTL 242 Query: 313 DPYAPLIMGCDIAEEGG---DNTVVVLRRGPVIEHLFDW----------SKTDLRTTNNK 359 D +++ D A G D +V+ + DW T Sbjct: 243 DR---VVVAVDPAVSAGKHSDACGIVVVGATLQGPPQDWCAYVLADCTVQGVGPLTWAQA 299 Query: 360 ISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVK 419 ++Y D ++ + N GA L + V A+ + R E Sbjct: 300 AIDARDRYGADRVVAEVNQGGALVESLLRQIDPLV-----PFTALHASRGKGARAEPVAA 354 Query: 420 MADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 + + + L L + G L G S D D L++ Sbjct: 355 LYEQGRVRHVPGLGALEDQLC-----QMTPRGYL--------GQGSPDRLDALVWA 397 >gi|315180730|gb|ADT87644.1| terminase [Vibrio furnissii NCTC 11218] Length = 607 Score = 47.0 bits (110), Expect = 0.007, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 47/181 (25%), Gaps = 23/181 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ I+ G D E + Y F S N + Sbjct: 340 PDRQWRYVVTIEDAAKGGCDLFDIEELREEYSEHD--FNNLFMCIFVDGAS-SIFEFNKV 396 Query: 303 EEALNREPCPDPYA----------PLIMGCDIAEEGGDNTV-------VVLRRGPVIEHL 345 ++ + + + +G D + DN V +V + Sbjct: 397 QKCMVDAGIWQDFKASAKRPFGSREVWLGYDPSRT-RDNAVLMVVAPPIVAAEKFRVLEK 455 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAV 404 W + ++I + E++ + ID GA D L + Sbjct: 456 HTWRGLSFQHQASEIDKVFERFNVTYLGIDITGIGAGVYDLLSNKHPRETVAIHYSNENK 515 Query: 405 D 405 + Sbjct: 516 N 516 >gi|153213615|ref|ZP_01948888.1| terminase [Vibrio cholerae 1587] gi|124115814|gb|EAY34634.1| terminase [Vibrio cholerae 1587] Length = 606 Score = 47.0 bits (110), Expect = 0.007, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 48/181 (26%), Gaps = 23/181 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G D + + + +F S N I Sbjct: 346 PDKQWRYVITMEDAVKSGFDLADIDILREENSERD--FNNLFMCEFVDGAS-SIFEYNKI 402 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV-------VVLRRGPVIEHL 345 + + P + +G D + DN V +V + Sbjct: 403 LRCMVDIEIWQDFKPSSDRPFGSREVWLGYDPSRT-RDNAVLMVVAPPIVAAEKFRVLEK 461 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAV 404 W + ++IS + E++ + ID GA D L + Sbjct: 462 HTWRGLSFQHQASEISKVFERFNVTYLGIDITGIGAGVYDLLSNKHPRETVAIHYSNENK 521 Query: 405 D 405 + Sbjct: 522 N 522 >gi|160700654|ref|YP_001552334.1| gp5 [Mycobacterium phage Giles] gi|159136604|gb|ABW88400.1| gp5 [Mycobacterium phage Giles] Length = 544 Score = 47.0 bits (110), Expect = 0.007, Method: Composition-based stats. Identities = 74/440 (16%), Positives = 130/440 (29%), Gaps = 62/440 (14%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GKT L A +++ + G ++ A +K + + ++ + L Sbjct: 109 GKTQLIALRIIYGL-FFLGEKIVYTAQRWQTVKDVY----DRIVEIIKRRPSLL---RRL 160 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVIN 208 P P D + G + Y+T + VG T I DEA DV+ Sbjct: 161 KPMPGVPD-GYSEAGQHGEIYTTNGGSLDMGPRTKAVGRGQTKIDLAIFDEAYDIKDVLV 219 Query: 209 LGILG---FLTERNA---NRFWIMTSNP--RRLSGKFYEIFNKPLDDWKRFQIDTRTVEG 260 G+ G T + + + +P L+G K D + + Sbjct: 220 GGLTGAQKAATNPQTIYISTAAVASEHPDCGVLAGMRRNGQRKEPDLYAAEWCAPPGMAR 279 Query: 261 IDPSFHEGIIARYG---LDSDVTR--------VEVCGQFPQQDIDSFIPLNIIEEALNRE 309 DP +G + D+ R + + D D + N E Sbjct: 280 DDPEAWRLACPSFGITVRERDLAREYRMARANARLLAIY---DADYLGWGEWPPDPENTE 336 Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLR----------------RGPVIEHLFDWSKTDL 353 P DP + GD + + R G V + W ++ Sbjct: 337 PIIDPDWWEALTVLQPALVGDICIAIERTLDTRYWCIAAGQRTIDGRVHVEVGYWRAANI 396 Query: 354 RTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRR 413 + LVE + P AII+D + + G + K A+ + + Sbjct: 397 GVVAAALLELVELWNPAAIIVDDRSKAKPIVGVMFNQGIEIETASTPKLAMYTQGFIDA- 455 Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLM 473 V AD +I + + + G+L + K + + L Sbjct: 456 ----VNAADVTHIG-----QKIITDGIAGAAMRELPRGDLVFDEKESGAPVAPLKAIALA 506 Query: 474 YT-----FAENPPRSDMDFG 488 + AE P + D G Sbjct: 507 HGAVLEYAAEPKPAASPDTG 526 >gi|281416525|ref|YP_003347326.1| terminase large subunit [Enterococcus phage phiFL2A] gi|270209389|gb|ACZ63932.1| terminase large subunit [Enterococcus phage phiFL2A] gi|270209454|gb|ACZ63996.1| terminase large subunit [Enterococcus phage phiFL2B] Length = 430 Score = 47.0 bits (110), Expect = 0.008, Method: Composition-based stats. Identities = 51/354 (14%), Positives = 100/354 (28%), Gaps = 42/354 (11%) Query: 86 RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145 RG KTT A + LM P ++I L ++T + E+ ++ + + +F+ Sbjct: 52 RGSFKTTTLAIAIALLMVLFPNKNIIFLRKTDT---DVV--EIILQVAKVLSSKYFKTLV 106 Query: 146 LSLH--PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 +L+ + + + G H +I D+ Sbjct: 107 FALYGVELVLLKETTTEVDTNLKTSSRGTSQLLGMGIYASLTGKHAD---IVITDDIVNI 163 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI---FNKPLDDWKRF---QIDTRT 257 D ++ + N + G+F ++K K + D Sbjct: 164 KDRVSRA-----ERERTKLQYQELQNVKNRGGRFINTGTPWHKEDAISKMPNVKKFDCYE 218 Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317 ID + + + + + + F + +N + Sbjct: 219 TGLIDKEQRQAL--QQAMTPSLFAANYELKHIADSESLFTAPTYTDS-INLIYNGVAH-- 273 Query: 318 LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII 373 D A G D+T + + G +I + W K +I L + Y+ Sbjct: 274 ----VDAAYGGDDSTAFTIFKEQKDGTIIGYGRKWQKHVDDCLP-EILRLHQHYQAGTFH 328 Query: 374 IDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTEL--HVKMADWLE 425 + N L G V + T L + + WLE Sbjct: 329 TETNGDKGYLAKNLRECGQFVTEYHES-----MNKFIKISTYLRKYWHLIIWLE 377 >gi|291326278|ref|ZP_06123867.2| terminase, ATPase subunit [Providencia rettgeri DSM 1131] gi|291314958|gb|EFE55411.1| terminase, ATPase subunit [Providencia rettgeri DSM 1131] Length = 574 Score = 47.0 bits (110), Expect = 0.008, Method: Composition-based stats. Identities = 47/253 (18%), Positives = 79/253 (31%), Gaps = 48/253 (18%) Query: 248 WKRFQ-IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI-----DSFIPLNI 301 W++ I G++ E I D R +F + D+ I + Sbjct: 312 WRQIVNIHDAIARGLNRVNLEEIKDE--NPPDDFRNLYECEFVKTGERAFSYDALINCGV 369 Query: 302 IEEALNREPCPDPYAP-------LIMGCDIAEEGGDNTVVVLRR-------GPVIEHL-- 345 + P PYAP + +G D G + + L G + Sbjct: 370 DGYNSDVWPDWKPYAPRPLGNRPVWVGADPTGTGDNGDGLGLVVASPPAVSGGKFRIIET 429 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDAN-NTGARTCDYLEM-------LGYHVYRV 397 ++I + ++Y +I ID TGA + + L Y + Sbjct: 430 IQLRGMAFEKQADEIKRITQRYNVLSITIDGTGGTGAAVHELVVKFFPAANLLNYSA-PI 488 Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIES 457 L RN R E + H LI + ++K + +G + ES Sbjct: 489 KRMMIMKMLMLIRNGRFEYDAGL-----------HKPLITSFMTIKK-VQTQSGIITYES 536 Query: 458 KRVKGAKSTDYSD 470 RV+G D+ D Sbjct: 537 SRVRGL---DHGD 546 >gi|226329986|ref|ZP_03805504.1| hypothetical protein PROPEN_03899 [Proteus penneri ATCC 35198] gi|225200781|gb|EEG83135.1| hypothetical protein PROPEN_03899 [Proteus penneri ATCC 35198] Length = 584 Score = 46.6 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 47/140 (33%), Gaps = 23/140 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN-----------REPCPDP 314 E + Y D + F DI+S N+++ + P Sbjct: 344 LEQLKKEY--SPDEYNNLLMCHF-MDDIESLFNFNMMQNCMVDSWEVWDDIQPLALRPYG 400 Query: 315 YAPLIMGCDIAEEG--GDNT---VVVLRR--GPVIEHL--FDWSKTDLRTTNNKISGLVE 365 Y P+ +G D ++ G GD+ V+ + G L W D R + I + E Sbjct: 401 YDPVWVGYDPSKGGENGDSAGCVVIAPPKVPGGKFRILERHQWRGMDFRAQADAIKKITE 460 Query: 366 KYRPDAIIIDANNTGARTCD 385 ++ + + ID G Sbjct: 461 RFYVEYMGIDTTGLGHGVYQ 480 >gi|197285843|ref|YP_002151715.1| phage terminase, ATPase subunit [Proteus mirabilis HI4320] gi|194683330|emb|CAR44037.1| phage terminase, ATPase subunit [Proteus mirabilis HI4320] Length = 584 Score = 46.6 bits (109), Expect = 0.008, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 47/140 (33%), Gaps = 23/140 (16%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN-----------REPCPDP 314 E + Y D + F DI+S N+++ + P Sbjct: 344 LEQLKKEY--SPDEYNNLLMCHF-MDDIESLFNFNMMQNCMVDSWEVWDDIQPLALRPYG 400 Query: 315 YAPLIMGCDIAEEG--GDNT---VVVLRR--GPVIEHL--FDWSKTDLRTTNNKISGLVE 365 Y P+ +G D ++ G GD+ V+ + G L W D R + I + E Sbjct: 401 YDPVWVGYDPSKGGENGDSAGCVVIAPPKVPGGKFRILERHQWRGMDFRAQADAIKKITE 460 Query: 366 KYRPDAIIIDANNTGARTCD 385 ++ + + ID G Sbjct: 461 RFYVEYMGIDTTGLGHGVYQ 480 >gi|224020497|ref|YP_002601287.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 64b] gi|223929730|gb|ACN24438.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 64b] Length = 450 Score = 46.6 bits (109), Expect = 0.009, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYINNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYIFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|313649747|gb|EFS14171.1| phage terminase large subunit domain protein [Shigella flexneri 2a str. 2457T] gi|332761022|gb|EGJ91310.1| phage terminase large subunit domain protein [Shigella flexneri 4343-70] gi|332761328|gb|EGJ91614.1| phage terminase large subunit domain protein [Shigella flexneri 2747-71] gi|332763393|gb|EGJ93633.1| phage terminase large subunit domain protein [Shigella flexneri K-671] gi|333007918|gb|EGK27394.1| phage terminase large subunit domain protein [Shigella flexneri K-218] gi|333021518|gb|EGK40768.1| phage terminase large subunit domain protein [Shigella flexneri K-304] Length = 159 Score = 46.6 bits (109), Expect = 0.009, Method: Composition-based stats. Identities = 20/135 (14%), Positives = 40/135 (29%), Gaps = 25/135 (18%) Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVE 365 + +P +G D+A+ G D V R G V+ +W + +L + + Sbjct: 5 KTLNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAAL 64 Query: 366 KYRPDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRA 403 + D I+ D+ GA + + R Sbjct: 65 EREAD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGI 123 Query: 404 VDLEFCRNRRTELHV 418 + +F N + + Sbjct: 124 PNKDFFANLKAQAWW 138 >gi|84687436|ref|ZP_01015314.1| Putative large terminase [Maritimibacter alkaliphilus HTCC2654] gi|84664594|gb|EAQ11080.1| Putative large terminase [Rhodobacterales bacterium HTCC2654] Length = 426 Score = 46.6 bits (109), Expect = 0.009, Method: Composition-based stats. Identities = 67/437 (15%), Positives = 116/437 (26%), Gaps = 75/437 (17%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTTLWAEVSKWL 132 I GRG GKT A W+ + G V + + Q++ + Sbjct: 33 ILGGRGAGKTRAGA---EWVRAQVEGPAPLSPGRAGRVALIGETFDQVRDVMV------- 82 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 F + P + ++S P+ G Sbjct: 83 --------FGDSGIVACAPPDRRPAWEATKRRLVWPNGATATSFSASEPEGLRGPQFD-- 132 Query: 193 MAIINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250 A DE + D + L + R + T PR + I + L Sbjct: 133 -AAWADELAKWKKVDDAWDMLQFALRLGDHPRQVVTT-TPRDVP-----ILRRLLTLSST 185 Query: 251 FQIDTRTVEG---IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 T + SF E I ARYG + R E+ G +F ++E+ Sbjct: 186 VTTHAPTTANRANLAKSFLEEIEARYGGT-RLGRQELEGVLLDDREGAFWSTAMLEDC-- 242 Query: 308 REPCPDPYAPLIMGCDI---AEEGGDNTVVVLRRGPVIEHLFDWSK----------TDLR 354 R P P + +++ D G D +V+ W Sbjct: 243 RIDGPPPLSRIVVAVDPPVTGHAGSDECGIVVAGAVTEGAPGAWRAVVLEDASVKAAKPI 302 Query: 355 TTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRT 414 +E+Y D ++ + N G + + + G + A R Sbjct: 303 DWARAALDAMERYGADRLVAEVNQ-GGDLVETVIRQIDPLVPYRGVRAAKGKS----ARA 357 Query: 415 ELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMY 474 E + + + L L + + G S D D L++ Sbjct: 358 EPVAALYEQGRVSHLRGLGDLEDQMCLMTVQGFEGKG-------------SPDRVDALVW 404 Query: 475 TFAENPPRSDMDFGRCP 491 + + R Sbjct: 405 ALTDLVVEPGAKWRRPQ 421 >gi|66396341|ref|YP_240671.1| ORF008 [Staphylococcus phage 88] gi|66396415|ref|YP_240743.1| ORF009 [Staphylococcus phage 92] gi|62636756|gb|AAX91867.1| ORF008 [Staphylococcus phage 88] gi|62636829|gb|AAX91940.1| ORF009 [Staphylococcus phage 92] Length = 421 Score = 46.6 bits (109), Expect = 0.009, Method: Composition-based stats. Identities = 44/320 (13%), Positives = 101/320 (31%), Gaps = 35/320 (10%) Query: 72 NPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKW 131 + EV GRG GK++ + ++ + R ++ + + ++ L T+++ ++ Sbjct: 22 TKDKEVLNVVAKGGRGSGKSSDISIIIT-QLIMRYPMNAVVIRKTDNTLATSVFEQIKWA 80 Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191 + H F+++ + + + R + P+ ++ Sbjct: 81 IEEQKVSHLFKVKVS------------PMEITYIPRGNRIIFRGA--QNPERLKSLKDSR 126 Query: 192 ---GMAIINDEASGTPDVINLGILGFL----TERNANRFWIMTSNPRRLSGKF----YEI 240 +A I + A + I L + + + NP + + YE Sbjct: 127 FPFSIAWIEELAEFKTEDEVTTITNSLLRGELDEGLFYKFFFSYNPPKRKQSWVNKKYES 186 Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 + + + I F + + + R E G+ + F L Sbjct: 187 SFQADNTYVHHS-TYLNNPFISKQFIQEAESAKKRNEQRYRWEYMGEAIGSGVVPFNNLR 245 Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTT 356 IEE R+ D + + D D V ++ VI + ++ + Sbjct: 246 -IEEIPQRQY--DTFDNIRNAVDFG-YATDPLAFVRWHYDKKKRVIYAMDEYYGVQISNR 301 Query: 357 NNKISGLVEKYRPDAIIIDA 376 + Y+ D I D+ Sbjct: 302 EFANWLKKKGYQSDEIFADS 321 >gi|195942518|ref|ZP_03087900.1| hypothetical protein Bbur8_06704 [Borrelia burgdorferi 80a] gi|312149990|gb|ADQ30051.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi N40] Length = 450 Score = 46.6 bits (109), Expect = 0.009, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + ++ +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNVETFKTYNFTTYDNVFLSKGFIETQEKLY-KDIPAYKARVLLGEWLASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI E+ + P I D GGDNT + + Sbjct: 265 IFTQINITEDYMFTSP--------IAYLDPTFSVGGDNTALCV 299 >gi|222147998|ref|YP_002548955.1| large terminase [Agrobacterium vitis S4] gi|221734986|gb|ACM35949.1| large terminase [Agrobacterium vitis S4] Length = 459 Score = 46.6 bits (109), Expect = 0.009, Method: Composition-based stats. Identities = 63/403 (15%), Positives = 119/403 (29%), Gaps = 54/403 (13%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GKT A + I A ++ ++ L E + + Sbjct: 83 GGRGSGKTRAGA----------EWVHEIASAGEKSAVRIALVGETLGDAREVMVDGLSGI 132 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 ++ H P + S M + +S E P++ G DE + Sbjct: 133 ARIARHKRP----EVEISRRRLVWPNGAMAQMFSAEDPESLRG---PQFHYAWCDEIAKW 185 Query: 204 --PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGI 261 + + L + R I T+ P R + P R + Sbjct: 186 KHAEETFDMLQFSLRLGDDPRQVI-TTTP-RPVPILKRLLADPGTRLTRLSTFGNAC-NL 242 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMG 321 P F E + ARYG + R E+ G+ + D+ + +E+ R +P +++G Sbjct: 243 APGFIEALQARYGGT-RLGRQELDGELIEDREDALWRRDRLEQLTVR--LSEPLHRIVVG 299 Query: 322 CDIAEEGGDNTVVVL------RRGPVIEHLF-DWSKTDLRTTNNKISGLVEKYRPDAIII 374 D G +V + R G + + + + ++ D ++ Sbjct: 300 VDPPSGAGAQSVCGIIVAGLDRLGRAVVLADCSVTGESPASWATAVVRAFRRFEADRVVA 359 Query: 375 DANNTGARTCDYLEML--GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINH 432 + N G L+ + V V + R E + + Sbjct: 360 EVNQGGEMVGALLKSVDANLPVRMVRATRGKF-------LRAEPVAALYEQGRVFHAARF 412 Query: 433 SGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 + L + + N +S D D L++ Sbjct: 413 ADLEDQMCDFGPEGLSN-------------GQSPDRLDALVWA 442 >gi|312148837|gb|ADQ31485.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] Length = 450 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|312148805|gb|ADQ31454.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] Length = 450 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|312147637|gb|ADQ30298.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] Length = 450 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|312147604|gb|ADQ30266.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] Length = 450 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|224590670|ref|YP_002640676.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi WI91-23] gi|224553765|gb|ACN55167.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi WI91-23] Length = 450 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|224983785|ref|YP_002641105.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi WI91-23] gi|224553986|gb|ACN55383.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi WI91-23] Length = 450 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|195942842|ref|ZP_03088224.1| hypothetical protein Bbur8_08565 [Borrelia burgdorferi 80a] gi|312150044|gb|ADQ30103.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi N40] Length = 450 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|225622041|ref|YP_002724986.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 94a] gi|225546350|gb|ACN92359.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 94a] Length = 450 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|225576422|ref|YP_002725451.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 118a] gi|225547005|gb|ACN92996.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 118a] Length = 450 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|226322171|ref|ZP_03797692.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi Bol26] gi|226232426|gb|EEH31184.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi Bol26] Length = 450 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|226246703|ref|YP_002776000.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi Bol26] gi|226202392|gb|ACO38050.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi Bol26] Length = 450 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|224022662|ref|YP_002606275.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 64b] gi|224593632|ref|YP_002640950.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi CA-11.2a] gi|223929246|gb|ACN23964.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 64b] gi|224554688|gb|ACN56067.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi CA-11.2a] Length = 450 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|219723193|ref|YP_002474612.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|224591572|ref|YP_002640899.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi CA-11.2a] gi|219693035|gb|ACL34243.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|224554907|gb|ACN56281.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi CA-11.2a] Length = 450 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|11497124|ref|NP_051248.1| hypothetical protein BB_S45 [Borrelia burgdorferi B31] gi|223987739|ref|YP_002601211.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 64b] gi|6382145|gb|AAF07462.1|AE001576_21 conserved hypothetical protein [Borrelia burgdorferi B31] gi|223929452|gb|ACN24166.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 64b] Length = 450 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|221316998|ref|YP_002533177.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 72a] gi|221237630|gb|ACM10461.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 72a] Length = 450 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI + +A I D A GGDNT + + Sbjct: 265 IFTQINITD--------DYVFASPIAYLDPAFSVGGDNTALCV 299 >gi|238764966|ref|ZP_04625904.1| terminase, ATPase subunit [Yersinia kristensenii ATCC 33638] gi|238696825|gb|EEP89604.1| terminase, ATPase subunit [Yersinia kristensenii ATCC 33638] Length = 591 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 33/243 (13%), Positives = 64/243 (26%), Gaps = 46/243 (18%) Query: 195 IINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY---EIFNKPLDDWK 249 + DE P+ +N G T ++ + T + + G + + + K K Sbjct: 253 LYIDEYLWIPNFRRLNEVASGMATHKHWRITYFSTPSAKTHQGYPFWSGDEWRKGDTKRK 312 Query: 250 RFQIDT--------RTVE----------------GIDPSFHEGIIARYGLDSDVTRVEVC 285 + R G + + + RY + Sbjct: 313 DVVFPSFDEMRDGGRECPDGQWRYVVTLEDAIAGGFNLADINELRERYNES--AFNMLFM 370 Query: 286 GQFPQQDIDSF---------IPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL 336 F F + + E+ EP P + G D A + T VV+ Sbjct: 371 CVFVDDKESVFKFGDLMRCGVDIRTWEDFHPDEPMPFGNREVWGGFDPARSNDNATFVVV 430 Query: 337 R------RGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390 + W + +I + +Y I ID G + ++ Sbjct: 431 APPLVAAERFRVLEKHHWRSMSFQFMAERIRSIKARYNMTYIGIDVTGLGYGVFELVQGF 490 Query: 391 GYH 393 + Sbjct: 491 AHR 493 >gi|226246889|ref|YP_002776229.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi Bol26] gi|226202275|gb|ACO37943.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi Bol26] Length = 450 Score = 46.6 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYIFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|94263678|ref|ZP_01287487.1| hypothetical protein MldDRAFT_1386 [delta proteobacterium MLMS-1] gi|93455983|gb|EAT06138.1| hypothetical protein MldDRAFT_1386 [delta proteobacterium MLMS-1] Length = 457 Score = 46.6 bits (109), Expect = 0.011, Method: Composition-based stats. Identities = 71/409 (17%), Positives = 112/409 (27%), Gaps = 66/409 (16%) Query: 84 AGRGIGKTTLNAWLVLWLMS-----------TRPGISVIC--LANSETQLKTTLWAEVSK 130 AGR GKT A + ++L + PG + LA Q K L ++ Sbjct: 64 AGRRSGKTNATAGIAVYLATIGAAVDGLLDKLAPGERGVVALLAVDRQQAKVAL-RYIA- 121 Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT 190 + + + D + + S RT D Sbjct: 122 --GMFEASPVLAQMVVKRDAEALHLDNRISIEVSTNNYRSVRGRTLLAAVLDEVA----- 174 Query: 191 YGMAIINDEASGTPDV-INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD--- 246 D+ S PDV IL L + S+P G Y+ + K Sbjct: 175 ----FFRDDQSANPDVETYRAILPGLATTGG--LLVGISSPYAKRGLLYQKWRKHYGQDG 228 Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 D Q T P+ I D R E G F + D++ F+ ++E Sbjct: 229 DILVIQGATPDFNPTIPTSV--ITDAEADDPAAARAEWFGLF-RDDVEGFLTREVVEACT 285 Query: 307 NREPCPDPYAP---LIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW---SKTDLRTTNNKI 360 P PY D A G D + + + D + + Sbjct: 286 RPSPLVIPYNRENIYTAFADPAGGGRDEFCLAIGHQEGEVVVVDNLQARRGAPAKIVAEY 345 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 + L++ Y AI D G+ D G K L+ Sbjct: 346 ADLLKAYNVQAITADRY-AGSWPADEFARHGITCNPAANSKSVFYLDALA-----AFNSG 399 Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAK-STDY 468 L L+ L + +E + +G + S D+ Sbjct: 400 R-----LQLPPDDMLLNQLTA-------------LERRTARGGRDSIDH 430 >gi|327198525|ref|YP_004327112.1| phage terminase large subunit [Pseudoalteromonas phage H105/1] gi|304367920|gb|ADM26679.1| phage terminase large subunit [Pseudoalteromonas phage H105/1] Length = 414 Score = 46.3 bits (108), Expect = 0.011, Method: Composition-based stats. Identities = 65/413 (15%), Positives = 112/413 (27%), Gaps = 50/413 (12%) Query: 67 LNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA 126 N N ++ + G G GKT + +L P S ++ + Sbjct: 8 QNIFLNELNTKYRAYV-GGFGSGKTFVGCMDLLNFFGKHPRTRQGYFGTSYPSIRDIFYP 66 Query: 127 EVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVG 186 FE +L + + + Y S +RP+T VG Sbjct: 67 -------------TFEEAALMMGFTVDIKESNKEVHVYRNGFYYGTVICRSMDRPNTIVG 113 Query: 187 HHNTYGMAIINDEASGTPDV----INLGILGFL--TERNANRFWIMTSNPRRLSGKFYEI 240 + + DE P I+ L +T+ P + + Sbjct: 114 FKVSRAL---VDEIDTLPKDKATNAWNKIVARLRLKIDGVENGIGVTTTPEGFLFVYSKF 170 Query: 241 FNKPLDDWKRFQIDTRTV-EGIDPSFHEGIIARYGLD-SDVTRVEVCGQFPQQDIDSFIP 298 ++P + Q T E + + + + Y D + G+F S P Sbjct: 171 KDEPTKSYSMVQASTYENAEFLPDDYIDTLKETYPEGLIDAY---LMGKFVNLTAGSVYP 227 Query: 299 LNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNN 358 + + E P PLI+G D + V R G D Sbjct: 228 QYGRNKNNSFESIQ-PQDPLIVGMDFNVNDMAACIFVERDGIYHCVEELTKGRDTDYMAR 286 Query: 359 KISG-LVEKYRPDAIIIDANN-----TGART--CDYLEMLGYHVYRVLGQKRAVDLEFCR 410 + ++K + DA+ GA D L+ G V R + C Sbjct: 287 ILKERYLDKGHRVTVYPDASGKNTSSKGADKSDIDILKSYGLWVVAKDSNPRVRERVNCV 346 Query: 411 NRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGA 463 NR + + + + I+ G E + G Sbjct: 347 NRGFQ---DLKIMINSMRCPETAKCIEQQP------YDKNG----EPDKKSGL 386 >gi|87201130|ref|YP_498387.1| hypothetical protein Saro_3118 [Novosphingobium aromaticivorans DSM 12444] gi|87136811|gb|ABD27553.1| protein of unknown function DUF264 [Novosphingobium aromaticivorans DSM 12444] Length = 440 Score = 46.3 bits (108), Expect = 0.011, Method: Composition-based stats. Identities = 71/425 (16%), Positives = 130/425 (30%), Gaps = 55/425 (12%) Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127 S P + + AGRG GKT L A V + P + + S + ++ + Sbjct: 43 QSQQAPPSDWRVWLVMAGRGFGKTRLGAEWVRKIAEEDPEARIALVGASLHEARSVMVE- 101 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 L APW V S+ YS P++ G Sbjct: 102 --------------GESGLLSIDAPWRRPVFESSVRRLVWPNGAQAFLYSAGEPESLRGP 147 Query: 188 HNTYGMAIINDE------ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF 241 +++ DE S +L L + + T+ P R I Sbjct: 148 QHSHA---WCDEIAKWDNGSNRAMATWDNLLMGL-RLGRDPRLVATTTP-RPVPLVARIM 202 Query: 242 NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNI 301 ++ D + + F E + +G + + R E+ G+ + + + + Sbjct: 203 DEGDDVVVTRGSTFENQDNLPRRFVEAMRRTFGGTT-LGRQELLGEMIEDLVGALWSRAL 261 Query: 302 IEEALNREPCPDPYAPLIMGCD-IAEEGGDNTVVV---LRRGPVIEHLFD--WSKTDLRT 355 IE A RE +++G D A GD ++ + + L D + Sbjct: 262 IENA--REDAAPAMTRVVVGVDPPASAHGDACGIIVCGIGDDRIARVLADCSVEQASPER 319 Query: 356 TNNKISGLVEKYRPDAIIIDANNTGARTCDYLE--MLGYHVYRVLGQKRAVDLEFCRNRR 413 ++ + D ++ +AN G L + V + R Sbjct: 320 WARAVANAARAWSADRVVAEANQGGEMVAAVLRAAEASLPLRLVHASRGKA-------AR 372 Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLM 473 E L A + H+G+ L+ + + GE +S D +D + Sbjct: 373 AE----PVAALYEAGRVRHAGMFPQLED-ELCGLMPGGEYQGP------GRSPDRADACV 421 Query: 474 YTFAE 478 + E Sbjct: 422 WALTE 426 >gi|188026021|ref|ZP_02997754.1| hypothetical protein PROSTU_02527 [Providencia stuartii ATCC 25827] gi|188021298|gb|EDU59338.1| hypothetical protein PROSTU_02527 [Providencia stuartii ATCC 25827] Length = 264 Score = 46.3 bits (108), Expect = 0.011, Method: Composition-based stats. Identities = 47/253 (18%), Positives = 78/253 (30%), Gaps = 48/253 (18%) Query: 248 WKRFQ-IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI-----DSFIPLNI 301 W++ I G++ E I D R +F + D+ I + Sbjct: 2 WRQIVNIHDAIARGLNRVNLEEIKDE--NPPDDFRNLYECEFVKTGERAFSYDALINCGV 59 Query: 302 IEEALNREPCPDPYAP-------LIMGCDIAEEGGDNTVVVLRR-------GPVIEHL-- 345 + P PYAP + +G D G + + L G + Sbjct: 60 DGYNSDVWPDWKPYAPRPLGNRPVWVGADPTGTGDNGDGLGLVVASPPAVSGGKFRIIET 119 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDAN-NTGARTCDYLEM-------LGYHVYRV 397 +I + ++Y +I ID TGA + + L Y + Sbjct: 120 IQLRGMAFEKQAEEIKRITQRYNVQSITIDGTGGTGAAVHELVVKFFPAANLLNYSA-PL 178 Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIES 457 L RN R E + H LI + ++K + +G + ES Sbjct: 179 KRMMIMKMLMLIRNGRFEYDAGL-----------HKPLITSFMTIKK-VQTQSGIITYES 226 Query: 458 KRVKGAKSTDYSD 470 RV+G D+ D Sbjct: 227 SRVRGL---DHGD 236 >gi|186895208|ref|YP_001872320.1| hypothetical protein YPTS_1896 [Yersinia pseudotuberculosis PB1/+] gi|186698234|gb|ACC88863.1| protein of unknown function DUF264 [Yersinia pseudotuberculosis PB1/+] Length = 587 Score = 46.3 bits (108), Expect = 0.011, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 41/133 (30%), Gaps = 15/133 (11%) Query: 311 CPDPYAPLIMGCDIAEEGGDN--TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLV 364 P P+ +G D A G V+ + G L W D + I + Sbjct: 403 RPFGDRPVWIGYDPASTGDSAGCAVIAPPVVAGGKFRVLERHQWKGMDFADQASNIKKIT 462 Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL 424 E+Y I ID G + + + + + + +L K + + Sbjct: 463 ERYNVTYIGIDDTGLGRSVTQLVRQ----FFPAVNA-----IHYSLEMKADLIYKAKNII 513 Query: 425 EFASLINHSGLIQ 437 + L +G I Sbjct: 514 QGGRLEFDAGCID 526 >gi|238786939|ref|ZP_04630739.1| Terminase, ATPase subunit [Yersinia frederiksenii ATCC 33641] gi|238724727|gb|EEQ16367.1| Terminase, ATPase subunit [Yersinia frederiksenii ATCC 33641] Length = 587 Score = 46.3 bits (108), Expect = 0.012, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 41/133 (30%), Gaps = 15/133 (11%) Query: 311 CPDPYAPLIMGCDIAEEGGDN--TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLV 364 P P+ +G D A G V+ + G L W D + I + Sbjct: 403 RPFGDRPVWIGYDPASTGDSAGCAVIAPPVVAGGKFRVLERHQWKGMDFADQASNIKKIT 462 Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL 424 E+Y I ID G + + + + + + +L K + + Sbjct: 463 ERYNVTYIGIDDTGLGRSVTQLVRQ----FFPAVNA-----IHYSLEMKADLIYKAKNII 513 Query: 425 EFASLINHSGLIQ 437 + L +G I Sbjct: 514 QGGRLEFDAGCID 526 >gi|145708080|ref|YP_001165255.1| terminase [Ralstonia phage phiRSA1] gi|139003869|dbj|BAF52383.1| terminase [Ralstonia phage phiRSA1] Length = 593 Score = 46.3 bits (108), Expect = 0.012, Method: Composition-based stats. Identities = 18/82 (21%), Positives = 27/82 (32%), Gaps = 7/82 (8%) Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRR-----GPVIEHL--FDWSKTDLRTTNNKISG 362 P P P+ +G D GGD+ +V+ G L + D I Sbjct: 407 PRPFGNRPVWVGYDPNGGGGDSAALVVVAPPLVPGGKFRVLERHQFRGIDYEEQAGAIRR 466 Query: 363 LVEKYRPDAIIIDANNTGARTC 384 + E+Y + ID G Sbjct: 467 VAERYDVAYVGIDRTGIGDAVF 488 >gi|80159854|ref|YP_398598.1| conserved phage-related protein [Clostridium phage c-st] gi|78675444|dbj|BAE47866.1| conserved phage-related protein [Clostridium phage c-st] Length = 580 Score = 46.3 bits (108), Expect = 0.012, Method: Composition-based stats. Identities = 51/374 (13%), Positives = 106/374 (28%), Gaps = 50/374 (13%) Query: 46 EGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTR 105 + +E + + + + RG+GK+ L+A + Sbjct: 49 YYRKYIDKFCIEVLGLKLYLFQRLILRAMARNQYVMLICCRGLGKSWLSAVFFVASCILY 108 Query: 106 PGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGID 165 G+ + Q + + + K L + + P +D + Sbjct: 109 KGLKCGIASGQGQQARNVI---IQKVKGELAKNPSIAREI--VFPIKTGADDCVVNFRNG 163 Query: 166 SKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLT--------- 216 S+ + + + D G + ++ DE D + IL +T Sbjct: 164 SEIRAIV---LGRNQGD---GARSWRFHYLLVDECRLVSDKVINTILIPMTKTKRAVAIH 217 Query: 217 -ERNANRFWIMTSNPRRLSGKFYEIFN----KPLDDWKRFQIDTRTVE------GIDPSF 265 + I S+ + Y+ F K + + + D Sbjct: 218 HNKREKGKVIFISSAYLKTSDLYKRFKYFCDKMSSGANNYFVCSLDYRVGIEAGIFDQDD 277 Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA--LNREPCPDPYA---PLIM 320 + + + + + E G F +S+ P A L R P I+ Sbjct: 278 IDEERNKPDMTIEEFQYEYEGIFVGSSGESYFPYETTTPARVLGRGEITQPKKSKSEYII 337 Query: 321 GCDIAEEGGDNT------VVVLR---RGPVIEHLFDWSKTDLRTTNNKISGLVEKYR--- 368 D+A G ++ V+ L+ G ++ + + + + L E Y Sbjct: 338 THDVAISGASDSDNACTHVIKLKPKPNGTYVKEVVYTKTHNGISLPEQRDFLRELYHLKF 397 Query: 369 --PDAIIIDANNTG 380 I+ID G Sbjct: 398 PNAVKIVIDMRGNG 411 >gi|163849591|ref|YP_001637634.1| diguanylate cyclase [Methylobacterium extorquens PA1] gi|163661196|gb|ABY28563.1| diguanylate cyclase [Methylobacterium extorquens PA1] Length = 1428 Score = 46.3 bits (108), Expect = 0.012, Method: Composition-based stats. Identities = 36/229 (15%), Positives = 70/229 (30%), Gaps = 26/229 (11%) Query: 83 SAGRGI--GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140 S+G G G+ WL + P + Q TT W E+ + + Sbjct: 644 SSGWGTYTGQPESAYIGYGWLDTVHPD---------DRQRVTTTWREIFASQAAGSFEFR 694 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY-SEERPDTFVGHHNTYGMAIINDE 199 + + + L + G + T + S + + Y +A++ Sbjct: 695 ALCRDGAYRWTLTRAVPLKDASGQVQEWVGTDGDIHESRQASEAIRLQEERYRLAMLA-- 752 Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE 259 T D I LG T ++ + + + + W + +I E Sbjct: 753 ---TQDAIWDWDLGADTAEWSDGAYRLFG--------YDDAERADTGAWWKSKIHPDDRE 801 Query: 260 GIDPSFHEGIIARYGLDSDVTR-VEVCGQFPQQDIDSFIPLNIIEEALN 307 + S I ++ SD R G + + F+ + +AL Sbjct: 802 RVTTSIKHIIESQEHRWSDEYRFARADGSYAEVTDCGFVIRDTEGQALR 850 >gi|51596194|ref|YP_070385.1| gpP phage P2 terminase [Yersinia pseudotuberculosis IP 32953] gi|170024552|ref|YP_001721057.1| hypothetical protein YPK_2327 [Yersinia pseudotuberculosis YPIII] gi|51589476|emb|CAH21098.1| similar to gpP phage P2 TERMINASE [Yersinia pseudotuberculosis IP 32953] gi|169751086|gb|ACA68604.1| protein of unknown function DUF264 [Yersinia pseudotuberculosis YPIII] Length = 587 Score = 46.3 bits (108), Expect = 0.012, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 40/133 (30%), Gaps = 15/133 (11%) Query: 311 CPDPYAPLIMGCDIAEEGGDN--TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLV 364 P P+ +G D A G V+ + G L W D + I + Sbjct: 403 RPFGDRPVWIGYDPASTGDSAGCAVIAPPVVAGGKFRVLERHQWKGMDFADQASNIKKIT 462 Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL 424 E+Y I ID G + + + + + + +L K + + Sbjct: 463 ERYNVTYIGIDDTGLGRSVTQLVRQ----FFPAVNA-----IHYSLEMKADLIYKAKNII 513 Query: 425 EFASLINHSGLIQ 437 L +G I Sbjct: 514 HGGRLEFDAGCID 526 >gi|169634245|ref|YP_001707981.1| phage-related terminase, ATPase subunit (GPP-like) [Acinetobacter baumannii SDF] gi|169153037|emb|CAP02098.1| phage-related terminase, ATPase subunit (GPP-like) [Acinetobacter baumannii] Length = 594 Score = 46.3 bits (108), Expect = 0.012, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 51/175 (29%), Gaps = 26/175 (14%) Query: 238 YEIFNKPL----DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD 292 ++ + W++ I G D + + Y + + QF Sbjct: 322 HDALKRGRYCEDKIWRQIVTILDAENGGCDLFDIDELRFEYSAEE--FANLLMCQFIDDG 379 Query: 293 IDSFIPLNIIEEALNR------------EPCPDPYAPLIMGCDIAEEGGDNTVVVL---- 336 S PLNI++ + P P+ +G D AE G +VV+ Sbjct: 380 A-SIFPLNILQACMVDSWEAWADDYKPFHARPLASRPVWVGYDPAETGDSAGLVVVAPPS 438 Query: 337 --RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 I + D + +I + +Y I +D G + Sbjct: 439 VANGKFRILERHQFRGMDFKAQAEQIRQITLRYNVTYIGLDTTGMGTGVAQLVRQ 493 >gi|33594269|ref|NP_881913.1| putative phage terminase [Bordetella pertussis Tohama I] gi|33564344|emb|CAE43647.1| putative phage terminase [Bordetella pertussis Tohama I] gi|332383682|gb|AEE68529.1| putative phage terminase [Bordetella pertussis CS] Length = 425 Score = 46.3 bits (108), Expect = 0.012, Method: Composition-based stats. Identities = 70/440 (15%), Positives = 131/440 (29%), Gaps = 64/440 (14%) Query: 75 PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKW 131 P F+ + AG G GKT + + P ++ A + Q++ + EV+ Sbjct: 15 PHKFRAFV-AGFGSGKTWVGGAGLCRHAWEFPRVNSGYFAPTYGQIRDIFYPTIEEVAHD 73 Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191 L + + + +CR S E+P VG Sbjct: 74 WGLAAKINESNKEVHLFAGRKYRGT--------------VICR--SMEKPGDIVGFKIGK 117 Query: 192 GM-----AIINDEASGTPDVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIFNKP 244 G+ + D+A+ + I+ L T +T+ P Y+ F K Sbjct: 118 GLIDELDVMKADKAA----LAWRKIIARLRHTAPGLLNGVDVTTTPEG-FKFVYQQFVKQ 172 Query: 245 LDD-------WKRFQIDTRTV-EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF 296 + + + Q T + + + + A Y + + GQF S Sbjct: 173 VRERPELAALYGLVQASTYENGKNLPEDYIPSLRASY--PPQLIAAYLRGQFTNLTSGSV 230 Query: 297 IPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTT 356 N + + +P+ L +G D TV V+R G + D Sbjct: 231 YA-NFDRRLHHTDAAEEPHEELHIGMDFNVLNMTATVNVIRAGLPLTVGELTKVRDTPEM 289 Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDY---------LEMLGYHVYRVLGQKRAVDLE 407 + + + + I + +G T L G+ V RV + +V Sbjct: 290 ARMLKERFKD-KGHGVTIYPDASGGNTSSKNASESDLSILRKAGFTV-RVNSRNPSVKDR 347 Query: 408 FCRNRRTELHVK-MADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKST 466 L+ + WL +N ++L+ + G E + G Sbjct: 348 INAVNGMLLNDEGARRWL-----VNTDRCPTLTEALEQQVYDKNG----EPDKSTGHDHP 398 Query: 467 DYSDGLMYTFAENPPRSDMD 486 + + G + M Sbjct: 399 NDAQGYFLVHRYPITPTGMS 418 >gi|224984406|ref|YP_002641809.1| phage terminase, large subunit, pbsx family [Borrelia valaisiana VS116] gi|224497005|gb|ACN52640.1| phage terminase, large subunit, pbsx family [Borrelia valaisiana VS116] Length = 450 Score = 46.3 bits (108), Expect = 0.013, Method: Composition-based stats. Identities = 58/332 (17%), Positives = 105/332 (31%), Gaps = 51/332 (15%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMS-----TRPGIS 109 Q E + +++H + V +F G I++ GKT L ++L++ + + Sbjct: 49 QKEVLFDIESHDYSKV------IFSGGIAS----GKTFLASYLLIKKLIENKSLYERDTN 98 Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169 + NS L T ++ K + + + + L I Sbjct: 99 NFIIGNSIGLLMTNTIKQIEKICG------FLGIDYQKKKSGESFCKIAGLELNI----- 147 Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229 Y D F I +EA+ L ++ L R I +N Sbjct: 148 ------YGGRNRDAFSKIRGGNSAIIYVNEATVIHKETLLEVIKRL--RKGKSIIIFDTN 199 Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQF 288 P + F + + D +K + T F E Y + V G++ Sbjct: 200 PESPAHYFKTDYIENTDVFKTYNFTTYDNPLNSADFIETQEKLY-KHFPAYKARVLYGEW 258 Query: 289 PQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVLRRGPVIEHLFD 347 + F E N++ IM D A GGDNT + + E + Sbjct: 259 ILNESALF-----NEMIFNQDYEFKSP---IMYIDPAFSVGGDNTAICVLERTF-EKFYA 309 Query: 348 WSKTDLRTTN-----NKISGLVEKYRPDAIII 374 + D + + I L+E + + + I Sbjct: 310 YIYQDQKPVSDSLMLGSIQVLIENFNVNTVYI 341 >gi|293368016|ref|ZP_06614649.1| large terminase subunit [Staphylococcus epidermidis M23864:W2(grey)] gi|291317838|gb|EFE58251.1| large terminase subunit [Staphylococcus epidermidis M23864:W2(grey)] Length = 421 Score = 46.3 bits (108), Expect = 0.013, Method: Composition-based stats. Identities = 41/335 (12%), Positives = 103/335 (30%), Gaps = 40/335 (11%) Query: 60 EVVDAHCLNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET 118 E++ H + + GRG GK++ + ++ + R ++ + + ++ Sbjct: 9 ELLPKHFHSLWKATKDRKKLNVVAKGGRGSGKSSDISIIIT-QLIMRYPMNAVVVRKTDN 67 Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178 L T+++ ++ + H F+++ + + + R Sbjct: 68 TLATSVFEQIKWAIEEQKVSHLFKVKVS------------PMEITYVPRGNRIIFRGA-- 113 Query: 179 ERPDTFVGHHNTY---GMAIINDEASG-TPDVINLGILGFL---TERNANRFWIMTSNPR 231 + P+ ++ + I + A T D + L + + + NP Sbjct: 114 QNPERLKSLKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDDGLFYKFFFSYNPP 173 Query: 232 RLSGKF----YEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQ 287 + + YE +P + + I F + + + R E G+ Sbjct: 174 KRKQSWVNKKYETSFQPDNTFVHHS-TYLDNPFISKQFIQEAESTKERNELRYRWEYMGE 232 Query: 288 FPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIE 343 +P N ++ + + + D D V ++ +I Sbjct: 233 AIGS---GVVPFNNLQIEKIPDELYKSFDNIRNAVDFG-YATDPLAFVRWHYDKKKRIIY 288 Query: 344 HLFDWSKTDL--RTTNNKISGLVEKYRPDAIIIDA 376 + + + R N + Y+ D I D+ Sbjct: 289 AVDEHYGVQISNREFANWLKR--RGYQSDEIFADS 321 >gi|229589112|ref|YP_002871231.1| terminase ATPase subunit [Pseudomonas fluorescens SBW25] gi|229360978|emb|CAY47838.1| terminase, ATPase subunit [Pseudomonas fluorescens SBW25] Length = 585 Score = 46.3 bits (108), Expect = 0.013, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 51/161 (31%), Gaps = 22/161 (13%) Query: 248 WKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 W++ I G D + + Y D++ + + QF S PL +++ + Sbjct: 328 WRQIVTILDAEDRGCDLFDLDELRQEY--DAEAFQNLLMCQFIDDGA-SIFPLAMLQPCM 384 Query: 307 NR------------EPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHLFD--W 348 P + +G D AE G +VV + G L + Sbjct: 385 VDSWDLWAQDYKPFAARPFGDRQVWVGYDPAESGDSAGLVVIAPPMVPGGKFRVLEKHQF 444 Query: 349 SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 D I + ++Y I ID G+ ++ Sbjct: 445 RGMDFAAQAEAIRQVTKRYWVTYIGIDITGMGSGVAQLVKQ 485 >gi|219723069|ref|YP_002474484.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|219693000|gb|ACL34209.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|312147710|gb|ADQ30370.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] Length = 450 Score = 46.3 bits (108), Expect = 0.013, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHRQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYINNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|291517493|emb|CBK71109.1| Phage terminase large subunit [Bifidobacterium longum subsp. longum F8] Length = 477 Score = 46.3 bits (108), Expect = 0.014, Method: Composition-based stats. Identities = 57/376 (15%), Positives = 105/376 (27%), Gaps = 60/376 (15%) Query: 52 RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISA-GRGIGKTTLNAWLVLWLMSTRPGISV 110 WQ + ++ A + + + + + R GKT W+ + + PG+ + Sbjct: 37 DVWQRQINRIILAKSADGFWSA-----RNTVLSIPRQTGKTYDIGWVAIHRAARTPGMRI 91 Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLH---CSLGIDSK 167 + A + S++ + + D H + G + Sbjct: 92 VWTA---------------QHFSVIKDTFESLCAIVLRPEMSGLVDPDHGISLAAGKEEI 136 Query: 168 HYSTMCRT-YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIM 226 + R + G ++ DEA D +L R N I Sbjct: 137 RFRNGSRIFFRARERGALRGV--KKIALLVIDEAQHLSDSAMASMLPT-QNRAYNPQTIY 193 Query: 227 TSNPRRLSGKFYEIFNKPLDDWKRFQIDT---------RTVEGIDPSFHEGIIARY---- 273 P E F + D + + + R + +D Y Sbjct: 194 MGTPPGPRDNG-EAFTRLRDKTRAGRTHSTLYVEFAADRDADPLDREQWRKANPSYPAHT 252 Query: 274 ----------GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCD 323 L D R E G + + + I EEA P + G D Sbjct: 253 SDESIANLWENLTGDDFRREALGIWDEHALSRAIDRRQWEEATIDARRP--GGVMSFGID 310 Query: 324 IAEEGGDNTV---VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEK--YRPDAIIIDANN 378 + T+ + G L ++ T+ T L++K + A++ID + Sbjct: 311 MNPTRTRLTIGACMRYDDGTAHIELAEYRDTNHDGT-MWAVNLIDKVWEQTAALVIDGQS 369 Query: 379 TGARTCDYLEMLGYHV 394 L G V Sbjct: 370 PATALLPDLAEAGVTV 385 >gi|260427953|ref|ZP_05781932.1| phage DNA Packaging Protein [Citreicella sp. SE45] gi|260422445|gb|EEX15696.1| phage DNA Packaging Protein [Citreicella sp. SE45] Length = 409 Score = 46.3 bits (108), Expect = 0.014, Method: Composition-based stats. Identities = 75/424 (17%), Positives = 134/424 (31%), Gaps = 70/424 (16%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTT-LWAEVSKW 131 I GRG GKT A W+ + G V + + Q++ ++ E S Sbjct: 16 IMGGRGAGKTRAGA---EWVRACVEGAMPLSPGRCKRVALIGETMDQVREVMVFGE-SGI 71 Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191 ++ P E Q+ W + +S P+ G Sbjct: 72 MNCSPPDRRPEWQATR-RCLVWPN--------------GAEAMVFSAHDPEGLRGPQFD- 115 Query: 192 GMAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWK 249 A DE + + L + +T+ P R G E+ P Sbjct: 116 --AAWVDELAKWKKARETWDMLQFAL-RLGEHPQVCVTTTP-RNVGILKELLELPSTVVT 171 Query: 250 RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309 R + + SF E + ARYG +S + R E+ G + +++ A Sbjct: 172 RAK-TEANRANLAESFLEEVRARYG-NSRLARQELDGILVTDVDGALWTGEMLDRAQALA 229 Query: 310 PCPDPYAPLIMGCDI-AEEG--GDNTVVVLR--------RGPVIEHLFDWSKTDLRTTN- 357 P P + +++ D A +G D +V+ + L D S + T Sbjct: 230 P-PATFDRIVVAVDPPAGDGKASDACGIVVAGVVCEGPPQAWRAWVLEDASVQGVSPTGW 288 Query: 358 -NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTEL 416 + E+++ D ++ + N GA L + V R V + R E Sbjct: 289 AQAAAAAYERWQADRVVAEVNQGGAMVETVLRQVSPQV-----PLRKVHATRGKAARAEP 343 Query: 417 HVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTF 476 + + +GL + + ++ + G +G S D D L++ Sbjct: 344 VAALYEQGRVGHAAGLAGLEEQMG-----LMTSAGY--------QGQGSPDRVDALVWAL 390 Query: 477 AENP 480 E Sbjct: 391 TELV 394 >gi|111074104|ref|YP_709233.1| hypothetical protein BAPKO_4029 [Borrelia afzelii PKo] gi|110891215|gb|ABH02376.1| hypothetical protein BAPKO_4029 [Borrelia afzelii PKo] Length = 450 Score = 45.9 bits (107), Expect = 0.014, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNVATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F ++I ++ + P I D A GGDNT + + Sbjct: 265 IFTQIDITQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299 >gi|148807391|gb|ABR13464.1| predicted ATPase terminase subunit [Pseudomonas aeruginosa] Length = 593 Score = 45.9 bits (107), Expect = 0.014, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 50/161 (31%), Gaps = 22/161 (13%) Query: 248 WKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 W++ I G D E + Y D++ + + QF S PL +++ + Sbjct: 336 WRQIVTILDAEARGCDLFDIEELRLEY--DAEAFQNLLMCQFVDDGA-SIFPLTMLQPCM 392 Query: 307 NR------------EPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDW 348 P + +G D AE G +VV+ G L + Sbjct: 393 VDSWDLWSEDYKPFALRPFGDRQVWLGYDPAETGDTAGLVVVAPPAVPGGKFRVLERHQF 452 Query: 349 SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 D I + ++Y I +D G+ + Sbjct: 453 RGKDFAEQAEFIRKVTQRYWVTYIGVDTTGMGSGVAQLVRQ 493 >gi|170718356|ref|YP_001783582.1| hypothetical protein HSM_0231 [Haemophilus somnus 2336] gi|168826485|gb|ACA31856.1| protein of unknown function DUF264 [Haemophilus somnus 2336] Length = 595 Score = 45.9 bits (107), Expect = 0.015, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 45/139 (32%), Gaps = 17/139 (12%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN---------REPCPDPYA 316 + + +Y + + + + F +++ A+N P P Sbjct: 358 LDQLKQKY--SALAFKQLFECHWIDDEDSIFTISKLLKCAVNINKWADFQPDTPRPFGDR 415 Query: 317 PLIMGCDIAEEGGDNTVVV----LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPD 370 + G D A + V+ + G L + W R +I L ++Y Sbjct: 416 EVWGGYDPAHSSDGASFVIVAPPINEGEKFRVLARYQWFGLSYRWQAEQIKKLYQQYNFS 475 Query: 371 AIIIDANNTGARTCDYLEM 389 I IDAN G + ++ Sbjct: 476 YIGIDANGVGQGVFEMIQE 494 >gi|224591489|ref|YP_002640832.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi CA-11.2a] gi|224554623|gb|ACN56003.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi CA-11.2a] Length = 450 Score = 45.9 bits (107), Expect = 0.015, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKIDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|221641598|ref|YP_002527783.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 72a] gi|225622087|ref|YP_002725040.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 118a] gi|221237550|gb|ACM10383.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 72a] gi|225546885|gb|ACN92880.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 118a] Length = 450 Score = 45.9 bits (107), Expect = 0.015, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKIDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|295096346|emb|CBK85436.1| Terminase-like family [Enterobacter cloacae subsp. cloacae NCTC 9394] Length = 435 Score = 45.9 bits (107), Expect = 0.016, Method: Composition-based stats. Identities = 51/317 (16%), Positives = 99/317 (31%), Gaps = 42/317 (13%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 AG G GKT + + M P I+ A + Q++ + + + F+ Sbjct: 26 AGFGSGKTWVGCGGICKGMWEHPKINQGYFAPTYPQIRDIFYPTI--------EEVAFDW 77 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE---- 199 + + + + Y S E+P + VG M DE Sbjct: 78 GLSVIINEGNKEVHFY-----EGRRYRGTTICRSMEKPGSIVGFKIGNAM---VDELDVM 129 Query: 200 ASGTPDVINLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKP-------LDDWKRF 251 A+ I+ + + + R I + Y+ F K + Sbjct: 130 AAAKAQQAWRKIIARMRYKVDGLRNGIDVTTTPEGFKFVYQQFVKAVREKPELSALYGLI 189 Query: 252 QIDTRTV-EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 Q T + + P + +++ Y ++ + + G+F + + I + N Sbjct: 190 QASTFDNAKNLPPDYISSLLSSY--PDELIQAYLRGKFTNLNSGT-IYHTFNRKLNNCSD 246 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT-DLRTTNNKISGLVEKY-- 367 PL +G D G +V ++R + + + K D +I +Y Sbjct: 247 EIQDGDPLFIGMDF-NVGKMAAIVHVKRNGLPRAVRELVKVYDTPAMIKRIQEEFWRYED 305 Query: 368 ------RPDAIIIDANN 378 R I DA+ Sbjct: 306 GRYVKSREIYIYPDASG 322 >gi|226315871|ref|YP_002776346.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi Bol26] gi|226202080|gb|ACO37753.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi Bol26] Length = 450 Score = 45.9 bits (107), Expect = 0.016, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHRQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|148724503|ref|YP_001285469.1| DNA packaging protein [Cyanophage Syn5] gi|145588148|gb|ABP87967.1| DNA packaging protein [Synechococcus phage Syn5] Length = 574 Score = 45.9 bits (107), Expect = 0.016, Method: Composition-based stats. Identities = 58/398 (14%), Positives = 117/398 (29%), Gaps = 96/398 (24%) Query: 79 KGAISAGRGIGKTTLNAWLVLWLMSTRPGISV-ICLANSET--------Q--LKTTLWAE 127 + ISA RG+GK+ + A VLW++ P + + A+ E Q + W Sbjct: 47 RLQISAFRGVGKSWITAAFVLWVLFVDPDRKIMVISASKERADNFSIFCQKLILDIEW-- 104 Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER---PDTF 184 +S ++ W + S + PA + S+GI + + + P Sbjct: 105 LSHLRPRDSDQRWSRI-SFDVGPAKPHQAPSVKSVGITGQMTGSRAHLMVFDDVEVPANS 163 Query: 185 VGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI---- 240 + + E+ + + +A ++ P+ + ++ Sbjct: 164 ATDMQREKLLQLVSESESI----------LVPDDDARIMFL--GTPQSTFTIYRKLAERS 211 Query: 241 ---------FNKPLDDWK-----RFQIDTRTVEGIDPSFHEGIIARY---GLDSDVTRVE 283 + + L ++ + D + + +S + R Sbjct: 212 YRPFVWPARYPRDLSKYEGLLAPQLVADLEKDPELTWKPTDTRFNELNLMERESAMGRSN 271 Query: 284 VCGQF---PQQDIDSFIPL-----------NIIEEA---------LNREPCPD------- 313 QF PL EA + +E P Sbjct: 272 FMLQFMLDTSLSDAEKFPLKFQDLIVTPLGAECAEAYAWSADPRYMRKELNPVGLPGDRF 331 Query: 314 -----------PYAPLIMGCDIAEEGGDNTV-VVLRRGP---VIEHLFDWSKTDLRTTNN 358 PY+ I+ D + G D TV VVL + + + + T + Sbjct: 332 YGPMYIDEGIVPYSETIVSVDPSGRGTDETVAVVLSQANGYIFVRDMKAFRDGYSDETLS 391 Query: 359 KISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYR 396 I L ++Y+ +++++N G L Sbjct: 392 DIVRLGKRYKASKLLVESN-FGDGMITELFKRHISQMG 428 >gi|319761996|ref|YP_004125933.1| hypothetical protein Alide_1284 [Alicycliphilus denitrificans BC] gi|317116557|gb|ADU99045.1| hypothetical protein Alide_1284 [Alicycliphilus denitrificans BC] Length = 633 Score = 45.9 bits (107), Expect = 0.017, Method: Composition-based stats. Identities = 30/186 (16%), Positives = 55/186 (29%), Gaps = 25/186 (13%) Query: 231 RRLSGKFYEIFNKPLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFP 289 RLSG F W+ I G D + + Y + F Sbjct: 359 DRLSGG----FTGEDRVWRNIVTILDALAGGCDLFDLDELRLEY--SDAEFANLLMCGFV 412 Query: 290 QQDIDSFIPLNIIEEALNRE-----------PCPDPYAPLIMGCDIAEEGGDNTVVVL-- 336 S PL++++ + P + P+ +G D + G +VVL Sbjct: 413 DDSF-SVFPLSMLQACMVDSWELWADFKPFSQRPFGWMPVWVGYDPSHTGDSAGLVVLAP 471 Query: 337 --RRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGY 392 + G + L + D I + ++Y A+ +D G ++ Sbjct: 472 PAKPGGQLRVLHTQQFKGMDFEAQAKAIKEITQRYNVAAMTLDTTGIGQGVFQLVQKFYP 531 Query: 393 HVYRVL 398 + Sbjct: 532 AARGIN 537 >gi|270307731|ref|YP_003329789.1| hypothetical protein DhcVS_300 [Dehalococcoides sp. VS] gi|270153623|gb|ACZ61461.1| hypothetical protein DhcVS_300 [Dehalococcoides sp. VS] Length = 457 Score = 45.9 bits (107), Expect = 0.017, Method: Composition-based stats. Identities = 44/291 (15%), Positives = 85/291 (29%), Gaps = 45/291 (15%) Query: 154 YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILG 213 ++++ H G + S E VG NT + + DEA Sbjct: 87 FTEIYHTEGGYIIRLNQARAVFLSAEPSANVVG--NTAHLLLEVDEAQDVSKEKYTKEFK 144 Query: 214 FLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD------WKRFQIDTRTVEGIDPSFHE 267 + N I+ + EI + ++ + F+ D V +P++ Sbjct: 145 PM-GATTNVTTILYGTTWDNASLLEEIKRQNIEKEQKDGLKRHFRYDWEEVAAHNPAYLA 203 Query: 268 GIIARY---GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC---PDPYAPLIMG 321 ++ G + + + P ++ + PC P+ + G Sbjct: 204 YALSEKDRLGENHPLFLTQYR-LLPVSGGGGMFSTEQLDLLKSSHPCQIYPENGKVYVAG 262 Query: 322 CDIAEE-----GGDNTVVVLRRGPVIEHLFD----------------------WSKTDLR 354 D+A E G V LRR + + + W Sbjct: 263 LDLAGEDGQIDGDLPATVNLRRDSSVLTIAELDYTFAKAPCNLPQLKLVCHYSWQGARHA 322 Query: 355 TTNNKISGLVEK-YRPDAIIIDANNTGARTCDYLEM-LGYHVYRVLGQKRA 403 K+ L+ K ++ + +DA G +L LG + Q + Sbjct: 323 LLYEKLVELLGKVWKCRKVAVDATGLGQPVASFLRESLGSRILPFAFQPSS 373 >gi|221065290|ref|ZP_03541395.1| protein of unknown function DUF264 [Comamonas testosteroni KF-1] gi|220710313|gb|EED65681.1| protein of unknown function DUF264 [Comamonas testosteroni KF-1] Length = 632 Score = 45.9 bits (107), Expect = 0.017, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 38/138 (27%), Gaps = 21/138 (15%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313 ++ Y D +F F L +++ + P Sbjct: 393 LAELLEEY--PDDEFSNLFRCEFIDDSNSQF-TLQMMQACMVDSWEAWADDFKPLAARPF 449 Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLF--DWSKTDLRTTNNKISGLVEKY 367 + P+ +G D + G +VV+ G L + D I + E+Y Sbjct: 450 AWQPVWVGYDPSFTGDTAALVVIAPPKVPGGKFRLLHRQQFRGADFEAQAEYIRSITERY 509 Query: 368 RPDAIIIDANNTGARTCD 385 + ID G Sbjct: 510 NVTFMGIDTTGLGQGVYQ 527 >gi|206563738|ref|YP_002234501.1| putative phage terminase large subunit [Burkholderia cenocepacia J2315] gi|198039778|emb|CAR55749.1| putative phage terminase large subunit [Burkholderia cenocepacia J2315] Length = 436 Score = 45.9 bits (107), Expect = 0.018, Method: Composition-based stats. Identities = 52/347 (14%), Positives = 98/347 (28%), Gaps = 70/347 (20%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPD---VINLGILGFLTERNANRFWIMTSNPR- 231 +S E G ++ DE + T D I + + S P Sbjct: 99 WSLESGLFGRGREYD---LLLFDETAFTKDGTLEIYRDAISPVVATRPGFRMFSFSTPLV 155 Query: 232 -RLSGKFYEIFNKPLDDW------------KRFQIDTRTVEGIDPSFHEGIIARYGLDSD 278 LS FY + W K + +D + G + S Sbjct: 156 MDLSNFFYALHEHKDYKWNPKIGADNYERFKVHHRPSWCNPLVDDKWLRGEYRKRSALS- 214 Query: 279 VTRVEVCGQFPQQDIDSFIP---------------LNIIEEALNREPCPDPYAPLIMGC- 322 R E+ G+F S P +I+ A+ D A + +G Sbjct: 215 -WRQEIEGEFVDWSGISLFPNLNKPVDPHPRYDAVFAVIDTAMKSGIEHDGTACMWLGYS 273 Query: 323 DIAEEGGDNTVVVLRRGPVIEHLFDWSKTD----LRTTNNKISGLVEKYRPDAI-IIDAN 377 D+ G DN ++ + L + D + ++ + ++ + I+ Sbjct: 274 DV--FGPDNLHIL---DWEVTSLDASGQYDWLKRILNHGEALARHYKSHQGFTVAYIEDK 328 Query: 378 NTGARTCDYLEMLGYHVYRVLGQKRAVDLEF---------CRNRRTELHVKMADWLEFAS 428 +G + G V + + A+ + NR +H +EF + Sbjct: 329 QSGIVLLQQGKESGLPVEAINSKFTALGKDERMRICVDPVHANRVKFVHESFNKLVEFKN 388 Query: 429 LINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 + L Q L + ++ D +D Y+ Sbjct: 389 ESKNHALKQILSYR-------------IGDKEAYKRADDLADCFAYS 422 >gi|146310689|ref|YP_001175763.1| hypothetical protein Ent638_1030 [Enterobacter sp. 638] gi|145317565|gb|ABP59712.1| hypothetical protein Ent638_1030 [Enterobacter sp. 638] Length = 402 Score = 45.9 bits (107), Expect = 0.018, Method: Composition-based stats. Identities = 48/284 (16%), Positives = 84/284 (29%), Gaps = 32/284 (11%) Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 L + P I K+ + + + G ++ DEA+ T Sbjct: 42 DKLVEYLEPLIKSSSRSEKRILLKNGGKIDFWVTNDNKLAGRGREYD---LVLIDEAAFT 98 Query: 204 PDVINLGILGFL----TERNANRFWIMTSNPRRLS--GKFYEIFNKPLDDWKRFQIDTRT 257 L + T + S P + FY I K + T + Sbjct: 99 KSPEMLAEIWAKSIKPTLLTTKGRAYIFSTPDGVDEDNFFYAICRKKELGFFEHYAPTSS 158 Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317 + P E R + V R E +F D+ ++ E P+ Sbjct: 159 NPFVPPEELEK--ERLSCEPRVFRQEFLAEFVDWSADALFDVSKWLEDGKPVEFPEMCMA 216 Query: 318 LIMGCDIAEEGG---DNTVVVL-----RRGPVIEHLFDWSKTDLR---------TTNNKI 360 + D A +GG D T VV R G + DW + + +++ Sbjct: 217 VFAVMDTAVKGGIEHDGTAVVYYAIDTRPGRERLTILDWDVVQIDGALLEVWMPSVFDRL 276 Query: 361 SGL----VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQ 400 + L V + I+ + G+ E LG+ V ++ Sbjct: 277 NELSGLCVAVNGSLGVFIEDASMGSILLQKGESLGWQVNKIESA 320 >gi|145335142|ref|NP_172040.2| chr31 (chromatin remodeling 31); ATP binding / DNA binding / helicase/ nucleic acid binding [Arabidopsis thaliana] gi|332189724|gb|AEE27845.1| chromatin remodeling 31 [Arabidopsis thaliana] Length = 1410 Score = 45.9 bits (107), Expect = 0.018, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 55/155 (35%), Gaps = 13/155 (8%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGAISAG-----R--GIGKTTLNAWLVLWLMSTRPG 107 Q E E + + ++ + F+ + G G GKT L + + P Sbjct: 827 QQEGFEFIWKNLAGTIMLNELKDFENSDETGGCIMSHAPGTGKTRLTIIFLQAYLQCFPD 886 Query: 108 ISVICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166 + +A + L WA E KW +P + + + ++ + S Sbjct: 887 CKPVIIAPASLLL---TWAEEFKKWNISIPFHNLSSLDFTGKENSAALGLLMQKNATARS 943 Query: 167 KHYSTMCRTYSEERPDTFVG--HHNTYGMAIINDE 199 + M + YS + + +G ++ +A + DE Sbjct: 944 NNEIRMVKIYSWIKSKSILGISYNLYEKLAGVKDE 978 >gi|110740804|dbj|BAE98499.1| hypothetical protein [Arabidopsis thaliana] Length = 1410 Score = 45.9 bits (107), Expect = 0.018, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 55/155 (35%), Gaps = 13/155 (8%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGAISAG-----R--GIGKTTLNAWLVLWLMSTRPG 107 Q E E + + ++ + F+ + G G GKT L + + P Sbjct: 827 QQEGFEFIWKNLAGTIMLNELKDFENSDETGGCIMSHAPGTGKTRLTIIFLQAYLQCFPD 886 Query: 108 ISVICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166 + +A + L WA E KW +P + + + ++ + S Sbjct: 887 CKPVIIAPASLLL---TWAEEFKKWNISIPFHNLSSLDFTGKENSAALGLLMQKNATARS 943 Query: 167 KHYSTMCRTYSEERPDTFVG--HHNTYGMAIINDE 199 + M + YS + + +G ++ +A + DE Sbjct: 944 NNEIRMVKIYSWIKSKSILGISYNLYEKLAGVKDE 978 >gi|8778726|gb|AAF79734.1|AC005106_15 T25N20.14 [Arabidopsis thaliana] Length = 1465 Score = 45.9 bits (107), Expect = 0.018, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 55/155 (35%), Gaps = 13/155 (8%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGAISAG-----R--GIGKTTLNAWLVLWLMSTRPG 107 Q E E + + ++ + F+ + G G GKT L + + P Sbjct: 882 QQEGFEFIWKNLAGTIMLNELKDFENSDETGGCIMSHAPGTGKTRLTIIFLQAYLQCFPD 941 Query: 108 ISVICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166 + +A + L WA E KW +P + + + ++ + S Sbjct: 942 CKPVIIAPASLLL---TWAEEFKKWNISIPFHNLSSLDFTGKENSAALGLLMQKNATARS 998 Query: 167 KHYSTMCRTYSEERPDTFVG--HHNTYGMAIINDE 199 + M + YS + + +G ++ +A + DE Sbjct: 999 NNEIRMVKIYSWIKSKSILGISYNLYEKLAGVKDE 1033 >gi|319407675|emb|CBI81323.1| phage-related protein [Bartonella sp. 1-1C] Length = 442 Score = 45.9 bits (107), Expect = 0.018, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 63/193 (32%), Gaps = 9/193 (4%) Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNAN--RFWIMTSNPRRLSGKFYEIFN-KPLDD 247 + DEA D ++ L E +T NP R + + F + Sbjct: 122 RILLCWVDEAEPVTDAAWQILIPTLREEGKEWHSELWVTWNPCRENAAVEKRFRFTEDPN 181 Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 K +I+ R + A + + G++ Q ++ ++E Sbjct: 182 IKGVEINWRDNPKFPAKLNRDRKADLEQRPEQYQHIWEGEYLQAMQGAYYQKLLLEAEQE 241 Query: 308 REPCPDPYAPLI---MGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTNNKIS 361 P PLI + DI G D T + + + + D+ + + + I Sbjct: 242 GRITIVPRDPLIQVKIFWDIGGTGAKADATALWVAQFVGREIRVLDYYEAQGQPLSEHIG 301 Query: 362 GLVEKYRPDAIII 374 + +K A+++ Sbjct: 302 WVCQKGYEKALMV 314 >gi|224535035|ref|ZP_03675589.1| phage terminase, large subunit, pbsx family [Borrelia spielmanii A14S] gi|224513696|gb|EEF84036.1| phage terminase, large subunit, pbsx family [Borrelia spielmanii A14S] Length = 379 Score = 45.5 bits (106), Expect = 0.019, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299 >gi|216968428|ref|YP_002333693.1| phage terminase, large subunit, pbsx family [Borrelia afzelii ACA-1] gi|216752682|gb|ACJ73366.1| phage terminase, large subunit, pbsx family [Borrelia afzelii ACA-1] Length = 450 Score = 45.5 bits (106), Expect = 0.019, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299 >gi|99080898|ref|YP_613052.1| hypothetical protein TM1040_1057 [Ruegeria sp. TM1040] gi|99037178|gb|ABF63790.1| DNA packaging protein Gp17 (Terminase) [Ruegeria sp. TM1040] Length = 425 Score = 45.5 bits (106), Expect = 0.021, Method: Composition-based stats. Identities = 64/422 (15%), Positives = 115/422 (27%), Gaps = 77/422 (18%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTTLWAEVSKWL 132 I GRG GKT A W+ S G V + + Q++ + + Sbjct: 33 ILGGRGAGKTRAGA---EWVRSQVEGAGPFGVGSARRVALVGETYDQVRDVM---IHGDS 86 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 +L P + +S P+ G Sbjct: 87 GILACSP------------PDRRPEWRAGERRLLWPNGASAQAFSASDPEVLRGPQFD-- 132 Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250 A DE + + L +T+ PR + + + Sbjct: 133 -AAWVDELAKWRRAQEAWDMLQFAL-RLGTAPRVCVTTTPRNV--PLLKGLLQSPSTVTT 188 Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 + PSF + ARY S + R E+ G + +++ E R+ Sbjct: 189 HAPTEANSANLAPSFLSEVRARY-AGSRLARQELDGVLLADVDGALWSSDMLAEIQRRDT 247 Query: 311 CPDPYAPLIMGCDI---AEEGGDNTVVVL----RRGP----VIEHLFDWSKTDLRTTN-- 357 +++ D A +G D +++ +GP L D + L T Sbjct: 248 P--RLDRIVVAVDPSVSAHKGSDACGIIVAGAQTQGPISSWRAYVLADHTVQGLGPTGWA 305 Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRRTE 415 + Y+ D ++ + N GA L + V K R E Sbjct: 306 RAAIAARDAYKADRLVAEVNQGGALVGTVLRQVDPLVPFTPVHASKGKA-------ARAE 358 Query: 416 LHVKMADWLEFASLINHSGLIQN--LKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLM 473 + + L + L + + + +G S D D L+ Sbjct: 359 PVAALYEQGRVHHAPGLQELEEQMCLMTAQGY---------------RGDASPDRVDALV 403 Query: 474 YT 475 + Sbjct: 404 WA 405 >gi|11497347|ref|NP_051454.1| hypothetical protein BBN43 [Borrelia burgdorferi B31] gi|6382368|gb|AAF07680.1|AE001581_22 conserved hypothetical protein [Borrelia burgdorferi B31] Length = 450 Score = 45.5 bits (106), Expect = 0.021, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKIYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITDDYIFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|216997755|ref|YP_002333847.1| phage terminase, large subunit, pbsx family protein [Borrelia afzelii ACA-1] gi|216752400|gb|ACJ73182.1| phage terminase, large subunit, pbsx family protein [Borrelia afzelii ACA-1] Length = 450 Score = 45.5 bits (106), Expect = 0.021, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFAQINITQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299 >gi|216969097|ref|YP_002333737.1| PBSX family phage termninase large subunit [Borrelia afzelii ACA-1] gi|216753027|gb|ACJ73621.1| phage terminase, large subunit, PBSX family [Borrelia afzelii ACA-1] Length = 450 Score = 45.5 bits (106), Expect = 0.022, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYDFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299 >gi|187939507|gb|ACD38655.1| terminase ATPase subunit [Pseudomonas aeruginosa] Length = 593 Score = 45.1 bits (105), Expect = 0.025, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 50/161 (31%), Gaps = 22/161 (13%) Query: 248 WKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 W++ I G D + + Y D++ + + QF S PL +++ + Sbjct: 336 WRQIVTILDAEARGCDLFDIDELRLEY--DAEAFQNLLMCQFVDDGA-SIFPLTMLQPCM 392 Query: 307 NR------------EPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDW 348 P + +G D AE G +VV+ G L + Sbjct: 393 VDSWDLWSEDYKPFALRPFGDRQVWLGYDPAETGDTAGLVVVAPPAVPGGKFRVLERHQF 452 Query: 349 SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 D I + ++Y I +D G+ + Sbjct: 453 RGKDFAEQAEFIRKVTQRYWVTYIGVDTTGMGSGVAQLVRQ 493 >gi|17313220|ref|NP_490600.1| predicted DNA-dependent ATPase terminase subunit [Pseudomonas phage phiCTX] gi|4063774|dbj|BAA36228.1| unnamed protein product [Pseudomonas phage phiCTX] Length = 594 Score = 45.1 bits (105), Expect = 0.025, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 50/161 (31%), Gaps = 22/161 (13%) Query: 248 WKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 W++ I G D + + Y D++ + + QF S PL +++ + Sbjct: 336 WRQIVTILDAEARGCDLFDIDELRLEY--DAEAFQNLLMCQFVDDGA-SIFPLTMLQPCM 392 Query: 307 NR------------EPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDW 348 P + +G D AE G +VV+ G L + Sbjct: 393 VDSWDLWSEDYKPFALRPFGDRQVWLGYDPAETGDTAGLVVVAPPAVPGGKFRVLERHQF 452 Query: 349 SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 D I + ++Y I +D G+ + Sbjct: 453 RGKDFAEQAEFIRKVTQRYWVTYIGVDTTGMGSGVAQLVRQ 493 >gi|260556808|ref|ZP_05829026.1| P-loop protein [Acinetobacter baumannii ATCC 19606] gi|260410067|gb|EEX03367.1| P-loop protein [Acinetobacter baumannii ATCC 19606] Length = 437 Score = 45.1 bits (105), Expect = 0.025, Method: Composition-based stats. Identities = 53/332 (15%), Positives = 93/332 (28%), Gaps = 70/332 (21%) Query: 78 FKGAISAGRGIGKTTLNAW--------LVLWLMSTRPGISVICLANSETQLKTTLWAEVS 129 F+ A+ GR GKT L W +S + A + Q K W + Sbjct: 31 FRDAV-CGRRFGKTFLAKAEMRRAARLAQKWNVSVEDE--IWYAAPTFKQAKRVFWKRLK 87 Query: 130 KWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189 + + P W + + + + R + D G Sbjct: 88 QAI-----------------PPSWRFGKPNETECTITLKTGHVIRVVGLDNYDDLRG--- 127 Query: 190 TYGMAIINDEASGTPDVIN-LGILGFLTE--------RNANRFWIMTSNPRRLSGKFYEI 240 + +I DE + + L+ + + P+ + Y+ Sbjct: 128 SGLFFLIIDEWADCKWAAWEEVLRPMLSTCKYTVNGVQRVGGNVLRIGTPKGYN-HCYDT 186 Query: 241 F-------NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI 293 + W + + E +AR +D R E F Sbjct: 187 WMDGQNGREPDHKSWIYTSLQGGNIP-----ESEIDVARRKMDPKTFRQEYEASFETYQ- 240 Query: 294 DSFIPLNIIEEALNR-----EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW 348 +I R E L +G D + VV +R G + + ++ Sbjct: 241 ------GVIYYCFERTFNCTEKVVKEGDVLHIGMDFNVQKM-AAVVYVRDGEELYAVGEF 293 Query: 349 SKTDLRTTNNKISGLVEKYRPDAIII--DANN 378 DL T I + KY+ II+ DA+ Sbjct: 294 --KDLFDTPAMIEAIKAKYQDHEIIVYPDASG 323 >gi|194436023|ref|ZP_03068125.1| putative conserved hypothetical protein [Escherichia coli 101-1] gi|194424751|gb|EDX40736.1| putative conserved hypothetical protein [Escherichia coli 101-1] Length = 595 Score = 45.1 bits (105), Expect = 0.025, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 65/238 (27%), Gaps = 46/238 (19%) Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY---EIFNKPLDDWK 249 + DE P +N G T ++ + T + + G + + + K K Sbjct: 257 LYIDEYLWIPGFRRLNEVASGMATHKHWRITYFSTPSSKTHQGYPFWSGDEWRKGDPKRK 316 Query: 250 RFQIDT--------RTVE----------------GIDPSFHEGIIARYGLDSDVTRVEVC 285 + + R G + + + RY + Sbjct: 317 GVEFPSFDELRDGGRECPDGQWRYVVTLEDAIAGGFNLADINELRERYNET--AFNMLFM 374 Query: 286 GQFPQQDIDSF---------IPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL 336 F F + ++ E+ EP P + G D A G + T VVL Sbjct: 375 CVFVDDKESVFKFDDLVRCGVDVSTWEDFHPEEPMPFGNREVWGGFDPARSGDNATFVVL 434 Query: 337 R------RGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 + W + +I + +Y I ID G + ++ Sbjct: 435 APPLVSAERFRVLEKHHWRSMSFQFMAERIRSIKARYNMTFIGIDVTGLGYGVFELVQ 492 >gi|218964078|ref|YP_002455438.1| putative phage terminase, pbsx family protein [Borrelia afzelii ACA-1] gi|216752969|gb|ACJ73583.1| putative phage terminase, pbsx family protein [Borrelia afzelii ACA-1] Length = 450 Score = 45.1 bits (105), Expect = 0.025, Method: Composition-based stats. Identities = 55/289 (19%), Positives = 93/289 (32%), Gaps = 45/289 (15%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLM----STRP-GIS 109 Q E + +++H + V +F G I++ GKT L ++L++ + S + Sbjct: 49 QKEVLFDIESHTYSKV------IFSGGIAS----GKTFLASYLLIKKLIENKSFYEQDTN 98 Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169 + NS L T ++ K L + + + L I Sbjct: 99 NFIIGNSIGLLMTNTIKQIEKICGL------LGIDYQKKKSGQSFCKIAGLELNI----- 147 Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229 Y + D F I +EA+ L ++ L R I +N Sbjct: 148 ------YGGKNRDAFSKIRGGNSAIIYVNEATVIHKETLLEVMKRL--RKGKSIIIFDTN 199 Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQF 288 P + F + + D +K + T F E Y + V G++ Sbjct: 200 PESPAHYFKTDYIENTDVFKTYNFTTYDNPLNSADFIETQEKLY-KHFPAYKARVLYGEW 258 Query: 289 PQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 + F E N++ IM D A GGDNT V + Sbjct: 259 VLNESSLF-----NEMIFNQDYEFKSP---IMYIDPAFSVGGDNTAVCV 299 >gi|170769336|ref|ZP_02903789.1| conserved hypothetical protein [Escherichia albertii TW07627] gi|170121988|gb|EDS90919.1| conserved hypothetical protein [Escherichia albertii TW07627] Length = 595 Score = 45.1 bits (105), Expect = 0.025, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 65/238 (27%), Gaps = 46/238 (19%) Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY---EIFNKPLDDWK 249 + DE P +N G T ++ + T + + G + + + K K Sbjct: 257 LYIDEYLWIPGFRRLNEVASGMATHKHWRITYFSTPSSKTHQGYPFWSGDEWRKGDPKRK 316 Query: 250 RFQIDT--------RTVE----------------GIDPSFHEGIIARYGLDSDVTRVEVC 285 + + R G + + + RY + Sbjct: 317 GVEFPSFDELRDGGRECPDGQWRYVVTLEDAIAGGFNLADINELRERYNET--AFNMLFM 374 Query: 286 GQFPQQDIDSF---------IPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL 336 F F + ++ E+ EP P + G D A G + T VVL Sbjct: 375 CVFVDDKESVFKFDDLVRCGVDVSTWEDFHPEEPMPFGNREVWGGFDPARSGDNATFVVL 434 Query: 337 R------RGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 + W + +I + +Y I ID G + ++ Sbjct: 435 APPLVSAERFRVLEKHHWRSMSFQFMAERIRSIKARYNMTFIGIDVTGLGYGVFELVQ 492 >gi|117621599|ref|YP_853855.1| hypothetical protein BAPKO_2028 [Borrelia afzelii PKo] gi|110890985|gb|ABH02150.1| hypothetical protein BAPKO_2028 [Borrelia afzelii PKo] Length = 450 Score = 45.1 bits (105), Expect = 0.025, Method: Composition-based stats. Identities = 55/289 (19%), Positives = 93/289 (32%), Gaps = 45/289 (15%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLM----STRP-GIS 109 Q E + +++H + V +F G I++ GKT L ++L++ + S + Sbjct: 49 QKEVLFDIESHTYSKV------IFSGGIAS----GKTFLASYLLIKKLIENKSFYEQDTN 98 Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169 + NS L T ++ K L + + + L I Sbjct: 99 NFIIGNSIGLLMTNTIKQIEKICGL------LGIDYQKKKSGQSFCKIAGLELNI----- 147 Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229 Y + D F I +EA+ L ++ L R I +N Sbjct: 148 ------YGGKNRDAFSKIRGGNSAIIYVNEATVIHKETLLEVMKRL--RKGKSIIIFDTN 199 Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQF 288 P + F + + D +K + T F E Y + V G++ Sbjct: 200 PESPAHYFKTDYIENTDVFKTYNFTTYDNPLNSADFIETQEKLY-KHFPAYKARVLYGEW 258 Query: 289 PQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 + F E N++ IM D A GGDNT V + Sbjct: 259 VLNESSLF-----NEMIFNQDYEFKSP---IMYIDPAFSVGGDNTAVCV 299 >gi|224022826|ref|YP_002606317.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 64b] gi|223929278|gb|ACN23995.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 64b] Length = 450 Score = 45.1 bits (105), Expect = 0.026, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI + + P I D A GGDNT + + Sbjct: 265 IFTQINITNDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|224020463|ref|YP_002601168.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 64b] gi|223929158|gb|ACN23879.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 64b] Length = 450 Score = 45.1 bits (105), Expect = 0.026, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI + + P I D A GGDNT + + Sbjct: 265 IFTQINITNDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|219869985|ref|YP_002474251.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|219692877|gb|ACL34089.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] Length = 450 Score = 45.1 bits (105), Expect = 0.026, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI + + P I D A GGDNT + + Sbjct: 265 IFTQINITNDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|195942413|ref|ZP_03087795.1| hypothetical protein Bbur8_06149 [Borrelia burgdorferi 80a] gi|312201120|gb|ADQ44434.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 297] gi|312201339|gb|ADQ44646.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 297] Length = 450 Score = 45.1 bits (105), Expect = 0.026, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI + + P I D A GGDNT + + Sbjct: 265 IFTQINITNDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|11497152|ref|NP_051291.1| hypothetical protein BB_R45 [Borrelia burgdorferi B31] gi|6382173|gb|AAF07489.1|AE001577_3 conserved hypothetical protein [Borrelia burgdorferi B31] Length = 450 Score = 45.1 bits (105), Expect = 0.026, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI + + P I D A GGDNT + + Sbjct: 265 IFTQINITNDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|11497247|ref|NP_051377.1| hypothetical protein BB_O44 [Borrelia burgdorferi B31] gi|6382268|gb|AAF07582.1|AE001579_11 conserved hypothetical protein [Borrelia burgdorferi B31] Length = 450 Score = 45.1 bits (105), Expect = 0.026, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI + + P I D A GGDNT + + Sbjct: 265 IFTQINITNDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|331035425|gb|AEC52982.1| large terminase protein [Synechococcus phage S-CRM01] Length = 567 Score = 45.1 bits (105), Expect = 0.026, Method: Composition-based stats. Identities = 47/329 (14%), Positives = 95/329 (28%), Gaps = 56/329 (17%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GK+T ++ R I++ LAN + +K N + Q + Sbjct: 84 GKSTTVTAYLIHQAIFRDNINIAILANKRETAYELM----AKLQLSYENLPKWMQQGV-- 137 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP---- 204 W + G ST G ++ DE + P Sbjct: 138 --LGWNKGSIELENGSRITASSTSSSAVR--------GF---AYNIVMLDEFAFVPTNVA 184 Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF---NKPLDDWKRFQIDTRTVEGI 261 D + ++ + I+ S P ++ FY+++ K + + + V G Sbjct: 185 DDFFSSVYPTISS-GKSTKVIIVSTPCGMN-HFYKMWTDATKGRNSYNPIEAHWSEVPGR 242 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA----- 316 D F + IA L + E F + + I ++ + EP Sbjct: 243 DEKFKQETIANTSLSQ--WQQEFETDFI-GSVGTLINPAKLKSLVYDEPLLSSGGLDVYE 299 Query: 317 ---------------PLIMGCDIAEEG--GDNTVVVLRRGPVIEHLFDWSKTD---LRTT 356 ++ D++ + +V L + + Sbjct: 300 HPIMKDENDENSRDHEYMITVDVSRGMKLDYSAFLVFDITQYPHRLVAKYRNNEIKPMLF 359 Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCD 385 + I + +KY I+ + N+ G + Sbjct: 360 PDVIVPVAKKYNNAWILCEVNDIGDQVAS 388 >gi|45597419|ref|NP_996704.1| TerL [Lactococcus phage phiLC3] gi|45504639|gb|AAS66808.1| large subunit terminase [Lactococcus phage phiLC3] Length = 469 Score = 45.1 bits (105), Expect = 0.026, Method: Composition-based stats. Identities = 56/349 (16%), Positives = 103/349 (29%), Gaps = 51/349 (14%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQ ++ + A + + + GKT + L LW + G+S++ Sbjct: 41 PWQKNLLKEIMAIDEDGLWTHQKFGYSIPRRN----GKTEIVYILELWAL--EQGLSILH 94 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 A+ + ++ + K+L + +S+ + L + T Sbjct: 95 TAHRISTSHSSYEK-LKKYLEDSGYVEGEDFKSIKAK----GQERLELIESGGVIQFRT- 148 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232 RT S + F + DEA + +T+ + N IM P Sbjct: 149 -RTSSGGLGEGFD--------ILFIDEAQEYTTEQESALKYTVTDSD-NPMTIMCGTPPT 198 Query: 233 L------SGKFYE---IFNKPLDDWKRFQI-------DTRTVEGIDPS-----FHEGIIA 271 + + W + + D +PS I A Sbjct: 199 PVSSGTVFTNYRDNTLAGKAKYSGWAEWSVEDVKDIHDVEAWYNSNPSMGYHLNERKIEA 258 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN 331 G D V+ G +P+ + S I AL P L +G + G D Sbjct: 259 ELGEDKLDHNVQRLGYWPKYNQKSVISEQE-WNALKVNRLPVIKGKLFVGI---KYGNDG 314 Query: 332 TVVVLRRGPVIEH----LFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 V + + +R N I ++K + ++ID Sbjct: 315 ANVAMSIAVKTLSGKVFVETIDCQSIRNGNQWIINFLKKADVEKVVIDG 363 >gi|219723219|ref|YP_002474654.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|219692798|gb|ACL34012.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|312148753|gb|ADQ31404.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] Length = 450 Score = 45.1 bits (105), Expect = 0.027, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI + + P I D A GGDNT + + Sbjct: 265 IFTQINITADYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|119953680|ref|YP_950600.1| putative large terminase subunit [Staphylococcus phage CNPH82] gi|112361306|gb|ABI15678.1| putative large terminase subunit [Staphylococcus phage CNPH82] gi|329736010|gb|EGG72285.1| phage terminase, large subunit, PBSX family [Staphylococcus epidermidis VCU045] Length = 421 Score = 44.7 bits (104), Expect = 0.032, Method: Composition-based stats. Identities = 29/240 (12%), Positives = 76/240 (31%), Gaps = 28/240 (11%) Query: 60 EVVDAHCLNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET 118 E++ H + + GRG GK++ + ++ + R ++ + + ++ Sbjct: 9 ELLPKHFHSLWKATKDREKLNIVAKGGRGSGKSSDISIIIT-QLIMRYPMNAVVVRKTDN 67 Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178 L T+++ ++ + H F+++ + + + R Sbjct: 68 TLATSVFEQIKWAIEEQKVSHLFKVKVS------------PMEITYVPRGNRIIFRGA-- 113 Query: 179 ERPDTFVGHHNTY---GMAIINDEASG-TPDVINLGILGFL---TERNANRFWIMTSNPR 231 + P+ ++ + I + A T D + L + + + NP Sbjct: 114 QNPERLKSLKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDDGLFYKFFFSYNPP 173 Query: 232 RLSGKF----YEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQ 287 + + YE +P + + I F + + + R E G+ Sbjct: 174 KRKQSWVNKKYETSFQPDNTFVHHS-TYLDNPFISKQFIQEAESAKERNEQRYRWEYMGE 232 >gi|56560912|ref|YP_161331.1| hypothetical protein BGP046 [Borrelia garinii PBi] gi|52696553|gb|AAU85896.1| hypothetical protein BGP046 [Borrelia garinii PBi] Length = 450 Score = 44.7 bits (104), Expect = 0.032, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSTLIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y + + V G++ Sbjct: 206 YFKTDYIDNVATFKTYNFTTYDNVLLSKGFIETQEKLY-KEIPTYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299 >gi|56560985|ref|YP_161401.1| hypothetical protein BGP116 [Borrelia garinii PBi] gi|52696625|gb|AAU85966.1| hypothetical protein BGP116 [Borrelia garinii PBi] Length = 336 Score = 44.7 bits (104), Expect = 0.033, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 + RG GKT A + L + G + + + + ++ E+ + LS+ + +F + Sbjct: 26 SSRGTGKTYDIATVNLERKFSVDGGDTLAIRKKKNKTTQSIHKEILELLSIYSLRKFFNI 85 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196 + + ++R F G H+T + + Sbjct: 86 SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYTHFKDLW 124 Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254 +EA+ ++ + E+ + M+SNP S Y+ + N+ + Sbjct: 125 LEEANQFSSDDIEMLIPTMREQGGRIY--MSSNPVPKSHWLYKRYLANQDNPAVCIIKST 182 Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287 R ++ E + + Y + R+EV G+ Sbjct: 183 YRDNPFLNGGDVEAWLEKQKLAYHGNDIGFRIEVLGE 219 >gi|312984196|ref|ZP_07791542.1| putative phage terminase, large subunit [Lactobacillus crispatus CTV-05] gi|310894415|gb|EFQ43491.1| putative phage terminase, large subunit [Lactobacillus crispatus CTV-05] Length = 632 Score = 44.7 bits (104), Expect = 0.034, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 48/166 (28%), Gaps = 21/166 (12%) Query: 54 WQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAW----LVLWLMSTRPGIS 109 WQ + +++ + + ++ GRG GKT + VL Sbjct: 101 WQKFILAMING-WKDENDEKRFTDIHISV--GRGQGKTQIAGIQMCKAVLIDTLNYTNKD 157 Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169 + AN+ Q T L+ + K L + F + + ++ Sbjct: 158 FLVTANTSDQ-STKLFGYIKKMLEAVIKIEPFASLAKESGLDLQTNQIIEKRTNNKVWKI 216 Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP--DVINLGILG 213 S Y +T+ + I DE D I G Sbjct: 217 SYEADKYD-----------STHNVLAIYDETGALNTYDRITDITDG 251 >gi|224591529|ref|YP_002640858.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi WI91-23] gi|224554111|gb|ACN55505.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi WI91-23] Length = 450 Score = 44.7 bits (104), Expect = 0.034, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + + +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALVFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI + + P I D A GGDNT + + Sbjct: 265 IFTQINITNDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|219873383|ref|YP_002477648.1| phage terminase, large subunit, pbsx family [Borrelia garinii Far04] gi|219694616|gb|ACL35135.1| phage terminase, large subunit, pbsx family [Borrelia garinii Far04] Length = 267 Score = 44.7 bits (104), Expect = 0.034, Method: Composition-based stats. Identities = 43/248 (17%), Positives = 78/248 (31%), Gaps = 36/248 (14%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLM----STRP-GIS 109 Q E + +++H + V +F G I++ GKT L ++L++ + S + Sbjct: 49 QKEVLFDIESHDYSKV------IFSGGIAS----GKTFLASYLLIKKLIENKSFYEKDTN 98 Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169 + NS L T ++ K + + + + L I Sbjct: 99 NFIIGNSIGLLMTNTIKQIEKICG------FLGIDYQKKKSGESFCKIAGLELNI----- 147 Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229 Y + D+F I +EA+ L + L R I +N Sbjct: 148 ------YGGKNRDSFSKIRGGNSAIIYVNEATVIHKETLLEAIKRL--RKGKAIIIFDTN 199 Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQF 288 P + F F + D +K + T F E Y + V G++ Sbjct: 200 PESPTHFFKTDFIENKDVFKTYNFTTYDNPLNSADFIETQKKLY-KHLPAYKARVLYGEW 258 Query: 289 PQQDIDSF 296 + F Sbjct: 259 ILNESTLF 266 >gi|326387547|ref|ZP_08209153.1| hypothetical protein Y88_0459 [Novosphingobium nitrogenifigens DSM 19370] gi|326207593|gb|EGD58404.1| hypothetical protein Y88_0459 [Novosphingobium nitrogenifigens DSM 19370] Length = 656 Score = 44.7 bits (104), Expect = 0.035, Method: Composition-based stats. Identities = 39/251 (15%), Positives = 64/251 (25%), Gaps = 61/251 (24%) Query: 185 VGHHNTYGMAIINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY---- 238 G+H +I DE + + T + R S P L + Y Sbjct: 310 QGYHGD----VIVDECFWIYGFEELFKVASAMATHKQYTRTL--FSTPSTLDHEAYGMWS 363 Query: 239 -EIFNKPLDDWKRFQIDTRT------------------------VEGIDPSFHEGIIARY 273 + FN+ + +ID +G+D + + Sbjct: 364 GDRFNRRRAKADKVRIDIANEHLRDGSLGPDGVWRQVVTIFDAIAKGLDLVDVDELQREN 423 Query: 274 GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-NREPCPDPYAP---------LIMGCD 323 G+D F S P ++ + + Y P + +G D Sbjct: 424 GIDE--FDNLFRCIFLDDSQ-SMFPFALMRRCMVDAWEVWQDYQPYALRPYAGEVWLGYD 480 Query: 324 IAEEGG----DNTVVVLR-----RGPVIEHLFDWS--KTDLRTTNNKISGLVEKYRPDAI 372 D+ +V G L D + I + KYR I Sbjct: 481 PNASEDNPTSDDAALVAIAPPSAIGGKFRILEKKRLKGLDFAGQADAIREMAGKYRVTKI 540 Query: 373 IIDANNTGART 383 ID G Sbjct: 541 GIDTTGAGKAV 551 >gi|255652557|ref|ZP_05399459.1| hypothetical protein CdifQCD_20411 [Clostridium difficile QCD-37x79] Length = 591 Score = 44.7 bits (104), Expect = 0.036, Method: Composition-based stats. Identities = 47/366 (12%), Positives = 102/366 (27%), Gaps = 75/366 (20%) Query: 86 RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145 RG K+ + A P + A +++Q + + ++ K L E++ Sbjct: 71 RGFAKSWIAAVYACCRAVLYPNSKIGIAAFTKSQAELIIREKIEKELVKQSPMLAREIKK 130 Query: 146 LSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD 205 + + + H I++ + R + +I DE Sbjct: 131 IE-YNNKFSKVTFHNGSTIEAIVSNEQSRGFRFN--------------ILIVDEFRLVKK 175 Query: 206 VINLGILGFLTERNANRFWIMTS-------NPRR---LSGKFYEIFNKP--LDDWKRFQI 253 I IL + N + P + LS ++ + + + + Sbjct: 176 EIQDRILKPFLNVSRNLKFKKDGKYEDYPPEPNKELYLSSAWFRMHEAYDKFKLYVKDMV 235 Query: 254 DTRTV-------------EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFI--- 297 D R +D + + +D+ +E+ F ++ D+ Sbjct: 236 DGRDKFVLNCNYKLSLHHGILDKERADEMKRE--MDAVSWIMEMESLFFGENEDAIFKSS 293 Query: 298 -------------PLNIIEEALNREPCPD------PYAPLIMGCDIA---EEGGDNTV-- 333 P +E + I+ DIA + DN+V Sbjct: 294 YVNPCRTLKNPFYPPTDLEILSAKNGKVKCNLQKRKGELRIISADIAVAEGDNNDNSVYT 353 Query: 334 ---VVLRRGPVIE---HLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387 ++ + H+ + ++ L + D ++ID G L Sbjct: 354 CWRLLPEKDYYERMVVHIESHNGMKPDKQAIRLKQLFFDFEADFLVIDTQGVGQSVLSDL 413 Query: 388 EMLGYH 393 + Y Sbjct: 414 LRVNYD 419 >gi|225621943|ref|YP_002724616.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] gi|225547242|gb|ACN93227.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] Length = 450 Score = 44.7 bits (104), Expect = 0.037, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y + + V G++ Sbjct: 206 YFKTDYIDNVATFKTYNFTTYDNVLLSKGFIETQEKLY-KEIPTYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299 >gi|23455748|ref|NP_695057.1| putative terminase [Lactococcus phage r1t] gi|1353546|gb|AAB18704.1| ORF29 [Lactococcus phage r1t] Length = 469 Score = 44.7 bits (104), Expect = 0.038, Method: Composition-based stats. Identities = 57/349 (16%), Positives = 104/349 (29%), Gaps = 51/349 (14%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQ ++ V A + + + GKT + L LW + G+S++ Sbjct: 41 PWQKNLLKEVMAIDEDGLWTHQKFGYSIPRRN----GKTEIVYILELWSLV--QGLSILH 94 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 A+ + ++ + K+L + +S+ + L + T Sbjct: 95 TAHRISTSHSSYEK-LKKYLEDSGYVEGEDFKSIKAK----GQERLELIESGGVIQFRT- 148 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232 RT S + F ++ DEA + +T+ + N IM P Sbjct: 149 -RTSSGGLGEGFD--------ILVIDEAQEYTTEQESALKYTVTDSD-NPMTIMCGTPPT 198 Query: 233 L------SGKFYE---IFNKPLDDWKRFQI-------DTRTVEGIDPS-----FHEGIIA 271 + + W + + D +PS I A Sbjct: 199 PVSSGTVFTNYRDNTIAGKAKYSGWAEWSVEDVKDIHDVEAWYNSNPSMGYHLNERKIEA 258 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN 331 G D V+ G +P+ + S I AL P L +G + G D Sbjct: 259 ELGEDKLDHNVQRLGYWPKYNQKSVISEQE-WNALKVNRLPVIKGKLFVGI---KYGNDG 314 Query: 332 TVVVLRRGPVIEH----LFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 V + + +R N I ++K + ++ID Sbjct: 315 ANVAMSIAVKTLSGKVFVETIDCQSIRNGNQWIINFLKKADVEKVVIDG 363 >gi|269978258|ref|ZP_06185208.1| putative phage terminase, large subunit [Mobiluncus mulieris 28-1] gi|269933767|gb|EEZ90351.1| putative phage terminase, large subunit [Mobiluncus mulieris 28-1] Length = 477 Score = 44.7 bits (104), Expect = 0.038, Method: Composition-based stats. Identities = 39/237 (16%), Positives = 78/237 (32%), Gaps = 26/237 (10%) Query: 200 ASGTPDVINLGILG----FLTERNANRFWIM---TSNPRRLSGKFYEIFNKPLDDWKRFQ 252 ++G D + G +T NA + T + ++ ++ W+ Sbjct: 183 STGLADS--EVLEGLRTRAVTGENAGSLCYLEWSTKSWDEMTVSERSHWDDDRAKWRA-- 238 Query: 253 IDTRTVEGIDPSFHEGIIARY------GLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEA 305 D +P+F I A Y SD+ E G + + DS IP+++ + Sbjct: 239 -DPEVWREANPAFEIRISADYMQKELASEMSDIDFEREHLGIWERIGGDSLIPVDVWQSL 297 Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365 N + P L + + E + +R + L ++ L Sbjct: 298 ANEKSQPGENIVLALDVPPSREQAFIAMASIRDDGKTHLELVDTADGLAWITPRLQQLQR 357 Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422 KYRP+AI++DA + L+ ++ G + + + + Sbjct: 358 KYRPEAIVVDAQSAAGSLLPELKANRVRTLQISG-------RDYAKACGQFYDAVRE 407 >gi|322507236|gb|ADX02690.1| Putative phage terminase [Acinetobacter baumannii 1656-2] Length = 378 Score = 44.7 bits (104), Expect = 0.039, Method: Composition-based stats. Identities = 44/279 (15%), Positives = 77/279 (27%), Gaps = 35/279 (12%) Query: 114 ANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173 A + Q++ + + W + + + Y T Sbjct: 5 APTYPQIRDIFFPTI-----EEVAFDWGLKTKVY--------ETNKEVDIYYGRQYRTTI 51 Query: 174 RTYSEERPDTFVGHHNTYGMAIINDE----ASGTPDVINLGILGFLTERNANRF-WIMTS 228 S E+P T VG + + DE A I+ + + A I + Sbjct: 52 ICRSMEKPATIVGFKIGHAL---IDELDVMAKVKAQQAWRKIIARMRYKQAGLLNGIDVA 108 Query: 229 NPRRLSGKFYEIFNKP-------LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVT 280 YE F K + Q T E + + + Y + Sbjct: 109 TTPEGFKFTYEQFVKEANKSEAKRKLYGMIQASTYDNEANLPDDYISSLYESY--PPQLI 166 Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGP 340 + GQF + P + + + PL++G D V V+R G Sbjct: 167 SAYLRGQFVNLTSGAVYP-DFDRVLNHTDEEIKKGEPLLIGMDFNVLKMAAVVYVIREG- 224 Query: 341 VIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIIDANN 378 L + D T I+ + +I DA+ Sbjct: 225 KPRALDELVGVRDTPTMCQLINERFPDH-DITVIPDASG 262 >gi|239502405|ref|ZP_04661715.1| putative phage terminase [Acinetobacter baumannii AB900] Length = 427 Score = 44.7 bits (104), Expect = 0.039, Method: Composition-based stats. Identities = 54/326 (16%), Positives = 97/326 (29%), Gaps = 52/326 (15%) Query: 75 PEVFKGAISAGRGIGKTTLN-------AWLVLWLMSTRPGISVIC--LANSETQLKTTLW 125 P F+ + AG G GKT + +W V A + Q++ + Sbjct: 19 PNKFRAFV-AGFGSGKTWVGCSSLCDKSWS---------FPKVPLGYFAPTYPQIRDIFF 68 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 + W ++ + D+ + + Y + S E+P+T V Sbjct: 69 PTI-----DEVAFDWGL--KTKIYESNKEVDLYYG------RQYRSTIICRSMEKPNTIV 115 Query: 186 GHHNTYGMAIINDEASGT-PDVINLGILGFLTERNANRF-WIMTSNPRRLSGKFYEIFNK 243 G + + D + I+ + + A I + +E F K Sbjct: 116 GFKIGHALIDELDVMTKVKAQQAWRKIIARMRYKQAGLLNGIDVATTPEGFKFTHEQFVK 175 Query: 244 P-------LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDS 295 + Q T E + + + Y + + GQF + Sbjct: 176 EANLSDAKRALYGMIQASTYDNEVNLPDDYIASLFESY--PPQLISAYLKGQFVNLTSGA 233 Query: 296 FIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLR 354 P + + + P L++G D V V+R G L + D Sbjct: 234 VYP-DFDRTLNHTDEEIRPNEALLIGMDFNVLKMAAVVYVIRDG-KPRALDELVGVRDTP 291 Query: 355 TTNNKISGLVEKYRPD--AIIIDANN 378 T + L+EK+ II DA Sbjct: 292 TMADL---LIEKFPNHEMTIIPDAAG 314 >gi|169633422|ref|YP_001707158.1| putative phage terminase [Acinetobacter baumannii SDF] gi|169152214|emb|CAP01118.1| conserved hypothetical protein; Putative phage terminase [Acinetobacter baumannii] Length = 432 Score = 44.7 bits (104), Expect = 0.039, Method: Composition-based stats. Identities = 54/326 (16%), Positives = 97/326 (29%), Gaps = 52/326 (15%) Query: 75 PEVFKGAISAGRGIGKTTLN-------AWLVLWLMSTRPGISVIC--LANSETQLKTTLW 125 P F+ + AG G GKT + +W V A + Q++ + Sbjct: 24 PNKFRAFV-AGFGSGKTWVGCSSLCDKSWS---------FPKVPLGYFAPTYPQIRDIFF 73 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 + W ++ + D+ + + Y + S E+P+T V Sbjct: 74 PTI-----DEVAFDWGL--KTKIYESNKEVDLYYG------RQYRSTIICRSMEKPNTIV 120 Query: 186 GHHNTYGMAIINDEASGT-PDVINLGILGFLTERNANRF-WIMTSNPRRLSGKFYEIFNK 243 G + + D + I+ + + A I + +E F K Sbjct: 121 GFKIGHALIDELDVMTKVKAQQAWRKIIARMRYKQAGLLNGIDVATTPEGFKFTHEQFVK 180 Query: 244 P-------LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDS 295 + Q T E + + + Y + + GQF + Sbjct: 181 EANQSDAKRALYGMIQASTYDNEANLPDDYIASLFESY--PPQLISAYLKGQFVNLTSGA 238 Query: 296 FIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLR 354 P + + + P L++G D V V+R G L + D Sbjct: 239 VYP-DFDRTLNHTDEEIRPNEALLIGMDFNVLKMAAVVYVIRDG-KPRALDELVGVRDTP 296 Query: 355 TTNNKISGLVEKYRPD--AIIIDANN 378 T + L+EK+ II DA Sbjct: 297 TMADL---LIEKFPNHEMTIIPDAAG 319 >gi|117530337|ref|YP_851180.1| superfamily II DNA/RNA helicase [Microcystis phage Ma-LMM01] gi|117165949|dbj|BAF36257.1| superfamily II DNA/RNA helicase [Microcystis phage Ma-LMM01] Length = 483 Score = 44.7 bits (104), Expect = 0.041, Method: Composition-based stats. Identities = 31/177 (17%), Positives = 53/177 (29%), Gaps = 27/177 (15%) Query: 58 FMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE 117 ++ ++G + A G GK+ + A L+++ + + + Sbjct: 96 AIDARLRLDQQEAVAGILRGYRGYVRAATGYGKSAVIATLMMYF-----EARRLIVVPTV 150 Query: 118 TQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY 176 L AE V +W SL P D L+ + + Y Sbjct: 151 RLLYQM--AEDVQEWASLSPGLVGDG-NDDISTMTIATVDTLYERIKRGDRRY------- 200 Query: 177 SEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRL 233 + G + DEA + GI L+ NA MT+ P R Sbjct: 201 ----IEWLSGIE-----VAVFDEAHTYMNA--SGITTALSLVNARYKIGMTATPTRT 246 >gi|11497404|ref|NP_051512.1| hypothetical protein BB_Q50 [Borrelia burgdorferi B31] gi|6382425|gb|AAF07735.1|AE001584_32 conserved hypothetical protein [Borrelia burgdorferi B31] Length = 450 Score = 44.7 bits (104), Expect = 0.041, Method: Composition-based stats. Identities = 31/161 (19%), Positives = 51/161 (31%), Gaps = 13/161 (8%) Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 ++ F + I +EA+ +L L R I +NP F Sbjct: 150 GDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEHYF 207 Query: 238 YEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDIDSF 296 + + +K + T + F E Y D + V G++ F Sbjct: 208 KTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIF 266 Query: 297 IPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 +NI + + P I D A GGDNT + + Sbjct: 267 TQINITNDYVFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|313499430|gb|ADR60796.1| P-loop protein [Pseudomonas putida BIRD-1] Length = 374 Score = 44.3 bits (103), Expect = 0.041, Method: Composition-based stats. Identities = 45/329 (13%), Positives = 92/329 (27%), Gaps = 62/329 (18%) Query: 78 FKGAISAGRGIGKTTLNAW--------LVLWLMSTRPGISVICLANSETQLKTTLWAEVS 129 F+ A+ GR GKT L W +S + A + Q K W + Sbjct: 34 FRDAV-CGRRFGKTFLGKAEMRRAARLAAEWGVSVEDE--IWYGAPTFKQAKRVFWRRLK 90 Query: 130 KWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189 + + P W + + + + + R + D G Sbjct: 91 QAI-----------------PEAWRAARPNETECSITLKSGHIMRVVGLDNYDNLRG--- 130 Query: 190 TYGMAIINDEASGTPDVINLGILGFLTERNANRF-----------WIMTSNPRRLSGKFY 238 + ++ DE + +L + + P+ Y Sbjct: 131 SGLFFVLVDEWADCSWAAWEEVLRPMLSTCQYTIPQTGESRKGGHALRIGTPKG-FNHCY 189 Query: 239 EIFN-------KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQ 291 + + W+ + V ++ AR +D R E F + Sbjct: 190 DTYRDGQPGGEPDHKSWQYTSLQGGNVPAVELD-----AARRKMDPRTFRQEYEAGF--E 242 Query: 292 DIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT 351 + + + P + +G D V V+R G L ++ Sbjct: 243 NYAGVVYSTFDRAECHTSERIKPGEAIHIGMDFNVMKMAAVVYVVRDGL-PLALDEFH-- 299 Query: 352 DLRTTNNKISGLVEKYRPDAIII--DANN 378 +R T + I + ++ ++ + DA+ Sbjct: 300 SVRDTPDMIEKIKVRFSGHSVSVYPDASG 328 >gi|319404714|emb|CBI78316.1| phage-related protein [Bartonella rochalimae ATCC BAA-1498] Length = 442 Score = 44.3 bits (103), Expect = 0.042, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 63/193 (32%), Gaps = 9/193 (4%) Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNAN--RFWIMTSNPRRLSGKFYEIFN-KPLDD 247 + DEA D ++ L E +T NP R + + F + Sbjct: 122 RILLCWVDEAEPVTDAAWQILIPTLREEGKEWHSELWVTWNPCRENAAVEKRFRFTKDPN 181 Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 K +I+ R + A + + G++ Q ++ ++E Sbjct: 182 IKGVEINWRDNPKFPAKLNRDRQADLEQRPEQYQHIWEGEYLQAMQGAYYQKLLLEAEQE 241 Query: 308 REPCPDPYAPLI---MGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTNNKIS 361 P PLI + DI G D T + + + + D+ + + + I Sbjct: 242 GRITTVPRDPLIQVKIFWDIGGTGAKADATALWVAQFVGREIRVLDYYEAQGQPLSEHIG 301 Query: 362 GLVEKYRPDAIII 374 + +K A+++ Sbjct: 302 WVCQKGYEKALMV 314 >gi|306818632|ref|ZP_07452355.1| possible phage-related terminase [Mobiluncus mulieris ATCC 35239] gi|304648805|gb|EFM46107.1| possible phage-related terminase [Mobiluncus mulieris ATCC 35239] Length = 470 Score = 44.3 bits (103), Expect = 0.042, Method: Composition-based stats. Identities = 39/237 (16%), Positives = 78/237 (32%), Gaps = 26/237 (10%) Query: 200 ASGTPDVINLGILG----FLTERNANRFWIM---TSNPRRLSGKFYEIFNKPLDDWKRFQ 252 ++G D + G +T NA + T + ++ ++ W+ Sbjct: 176 STGLADS--EVLEGLRTRAVTGENAGSLCYLEWSTKSWDEMTVSERSHWDDDRAKWRA-- 231 Query: 253 IDTRTVEGIDPSFHEGIIARY------GLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEA 305 D +P+F I A Y SD+ E G + + DS IP+++ + Sbjct: 232 -DPEVWREANPAFEIRISADYMQKELASEMSDIDFEREHLGIWERIGGDSLIPVDVWQSL 290 Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365 N + P L + + E + +R + L ++ L Sbjct: 291 ANEKSQPGENIVLALDVPPSREQAFIAMASIRDDGKTHLELVDTADGLAWITPRLQQLQR 350 Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422 KYRP+AI++DA + L+ ++ G + + + + Sbjct: 351 KYRPEAIVVDAQSAAGSLLPELKANRVRTLQISG-------RDYAKACGQFYDAVRE 400 >gi|332290535|ref|YP_004421387.1| conserved hypothetical protein, Terminase-like family [Gallibacterium anatis UMN179] gi|330433431|gb|AEC18490.1| conserved hypothetical protein, Terminase-like family [Gallibacterium anatis UMN179] Length = 597 Score = 44.3 bits (103), Expect = 0.043, Method: Composition-based stats. Identities = 20/135 (14%), Positives = 40/135 (29%), Gaps = 17/135 (12%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN----REPCPDPYAP---- 317 E + +Y + ++ F +++ A + ++ PD P Sbjct: 361 LEALKRKY--NKAAFDQLFMCKWIDDADSIFNISQLLKCATDISKWQDFRPDSDRPLDNR 418 Query: 318 -LIMGCDIA--EEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370 + G D A +G V+ I W +I + ++Y Sbjct: 419 EVWCGYDPAKSYDGASFVVIAPPVLPGEKYRILERHQWHGLSYSYQAEQIKQIYQRYNVS 478 Query: 371 AIIIDANNTGARTCD 385 I ID + G + Sbjct: 479 YIGIDTSGVGVGVYE 493 >gi|307544941|ref|YP_003897420.1| cobalamin synthesis protein, P47K [Halomonas elongata DSM 2581] gi|307216965|emb|CBV42235.1| cobalamin synthesis protein, P47K [Halomonas elongata DSM 2581] Length = 399 Score = 44.3 bits (103), Expect = 0.043, Method: Composition-based stats. Identities = 29/136 (21%), Positives = 51/136 (37%), Gaps = 7/136 (5%) Query: 300 NIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHL--FDWSKTDLRTTN 357 ++I+ L +P + +A +I + G D T+ +IE L Sbjct: 81 SLIQHLLAHKPAGERWAVVIN--EFGRVGIDQTMFEAHDDLIIESLPGGCLCCQQAVVLR 138 Query: 358 NKISGLVEKYRPDAIIIDANNTG--ARTCDYLEMLGY-HVYRVLGQKRAVDLEFCRNRRT 414 + L+ ++RPD +II+ + G A D L G+ + G +D + R Sbjct: 139 ASLVRLLRRHRPDRLIIEPSGLGHPAGLLDLLRGEGFADALDIRGVVAVLDPRRLDDTRA 198 Query: 415 ELHVKMADWLEFASLI 430 H D L A + Sbjct: 199 MAHETFLDQLRMADAV 214 >gi|226949140|ref|YP_002804231.1| putative phage terminase, large subunit [Clostridium botulinum A2 str. Kyoto] gi|226841904|gb|ACO84570.1| putative phage terminase, large subunit [Clostridium botulinum A2 str. Kyoto] Length = 572 Score = 44.3 bits (103), Expect = 0.043, Method: Composition-based stats. Identities = 54/306 (17%), Positives = 97/306 (31%), Gaps = 34/306 (11%) Query: 57 EFMEVVDAHCLNSVNNPNP-EVFKGA-ISAGRGIGKTTLNAWLVLWL--MSTRPGISVIC 112 +F E + + V F+ + I GR GK+ LN L +L S + C Sbjct: 77 QFQEFILGSLIGWVTKDKEYRRFRSSYIQLGRQNGKSFLNGILGTYLGNFSGYKYGKIFC 136 Query: 113 LANSETQLKTTLWAEVSKW-LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYST 171 +A Q K +W E++K+ S F +Q ++ + +LG D+K Sbjct: 137 VATKHDQAK-IVWDEMNKFIQSDDDLGELFTVQEYKSTIICNLTNTVIKALGRDTKGLD- 194 Query: 172 MCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNAN---------- 221 R + H M + + G I ++ +T Sbjct: 195 GLRPLLTVIDEYHA--HKDNQMYKLME---GGQKKIKQSLISVITTAGFELESPCHKMYK 249 Query: 222 -RFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGI--DPSFHEGIIARY----- 273 I+ + S Y DD +Q + + D E +I Y Sbjct: 250 YCKQILEGTEKNESKFIYIAEMDEEDDLNNYQNWIKANPMLQYDREALENLIPVYKSAKA 309 Query: 274 --GLDSDVTRVEVCGQFPQQDIDSFI-PLNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330 D + + + + ++ + A N+ I+G D++ GGD Sbjct: 310 IGSKDWNDFLTKQLNMWVEFTETKYMNMTAWNKCASNKTLEDFRGQEFILGIDLS-SGGD 368 Query: 331 NTVVVL 336 T + Sbjct: 369 LTSICF 374 >gi|163850863|ref|YP_001638906.1| hypothetical protein Mext_1434 [Methylobacterium extorquens PA1] gi|163662468|gb|ABY29835.1| protein of unknown function DUF264 [Methylobacterium extorquens PA1] Length = 458 Score = 44.3 bits (103), Expect = 0.043, Method: Composition-based stats. Identities = 62/296 (20%), Positives = 98/296 (33%), Gaps = 43/296 (14%) Query: 56 LEFMEVVDAHCLNSVNNPNPEVFKG-AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114 L +E H P P + A+ GRG GKT A W+ G V Sbjct: 46 LRLLEADWLHLARHDQLPPPGNWTTWAVIGGRGSGKTRTGA---EWVRGLAHGDPVFTPE 102 Query: 115 NSET-QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173 E L +A+V + P+ + L P W G + Sbjct: 103 PVERIALVGETFADVRDVMIEGPSG-LLALPRLGGAPPVWQPSRRRVVFGN-----GAVA 156 Query: 174 RTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN-PRR 232 +S E PD+ G A+ +DE + + +F + PR Sbjct: 157 LAFSAEEPDSLRG---PQFGAVWSDEVAK-----WREAEAT---YDMIQFGLRLGTHPRG 205 Query: 233 LSGKFYEIFNKPLDDWKRFQIDTRTV----------EGIDPSFHEGIIARYGLDSDVTRV 282 L +P+ +R D RTV + + PSF E ++ RY + + R Sbjct: 206 LVT----TTPRPVPLIRRLLADPRTVVTRSRTADNAQNLAPSFLEEVVGRY-AGTRLRRQ 260 Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDI---AEEGGDNTVVV 335 E+ G+ + D+ + IE A R P + + D + G D +V Sbjct: 261 ELDGELIEDRPDALWTRDAIERA--RVSEAPPLQRIAVAIDPPASSRVGADACGIV 314 >gi|225626397|ref|YP_002727892.1| terminase large subunit [Enterococcus phage EFAP-1] gi|225346568|gb|ACN86334.1| terminase large subunit [Enterococcus phage EFAP-1] Length = 574 Score = 44.3 bits (103), Expect = 0.045, Method: Composition-based stats. Identities = 49/303 (16%), Positives = 97/303 (32%), Gaps = 52/303 (17%) Query: 68 NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLW-LMST-RPGIS--VICLANSETQLKTT 123 N K IS R GK+ L A + L+ + P S ++ AN++ Q Sbjct: 89 RKKKNKMRRFRKVYISLARKNGKSILVAGISLYEFLLGQYPNASRQIVAAANTKDQ---- 144 Query: 124 LWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDT 183 ++ N ++++L I+ + + S + D+ Sbjct: 145 --------AGIVFNMLKSQLKALRAVSDGTRKVTKVNKKDIEHLEDESTVKPLSSD-VDS 195 Query: 184 FVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFY----- 238 G G+ EA T + + +++ I+++ + L+G + Sbjct: 196 LDGLDVLCGVLDEYGEAKSTA--MIEVLESSQSQQLQGLILIISTTTKNLNGPMHSIEYP 253 Query: 239 ---EIFNKPLDD-------WKRFQIDTRTVE---------GIDPSFHEGI-------IAR 272 ++ N+ ++ W+ + E + HE + +A Sbjct: 254 FITKLLNEEVEADAYLALCWEMDSLSEVDDEANWIKSNPLFENAQLHETMYEHKVNSLAE 313 Query: 273 YGLDSDV--TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330 Y D+ + + Q DSFI E +P P+ +G D+A G Sbjct: 314 YKAKGDMSGWLTKEMNFWVQSSQDSFIDKESWEAVKQTKPYDIKGRPVYIGLDLARTGDM 373 Query: 331 NTV 333 V Sbjct: 374 TAV 376 >gi|328543446|ref|YP_004303555.1| endopeptidase Clp ATP-binding chain A [polymorphum gilvum SL003B-26A1] gi|326413190|gb|ADZ70253.1| Endopeptidase Clp ATP-binding chain A [Polymorphum gilvum SL003B-26A1] Length = 813 Score = 44.3 bits (103), Expect = 0.047, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 73/215 (33%), Gaps = 19/215 (8%) Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTR-VEVCGQFPQQDIDSFIPLNIIE 303 K ++ V + + I G DS++ R +++ + + + PL + + Sbjct: 171 KPKKKTDALEAYCVNLNEKATKGKIDPLIGRDSEIARTIQILCRRSKNN-----PLFVGD 225 Query: 304 EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL 363 + + + A I+ D+ E D T+ L G ++ + D ++ Sbjct: 226 PGVGKTAIAEGLARRIVKGDVPEVLKDATIFALDMGALL--AGTRYRGDFEERLKQVVKE 283 Query: 364 VEKYRPDAIIID----ANNTGARTCDYLEMLG-YHVYRVLGQKRAVDLEFCRNRRTELHV 418 +E+Y + ID GA + ++ G R + + R + Sbjct: 284 IEEYPGAVMFIDEIHTVIGAGATSGGAMDASNLLKPALASGAIRCIGSTTYKEYR-QFFE 342 Query: 419 KMADWLEFASLINHSG-----LIQNLKSLKSFIVP 448 K + I+ + I+ LK LK + Sbjct: 343 KDRALVRRFQKIDVNEPSVPDAIEILKGLKPYFED 377 >gi|293393565|ref|ZP_06637875.1| conserved hypothetical protein [Serratia odorifera DSM 4582] gi|291423900|gb|EFE97119.1| conserved hypothetical protein [Serratia odorifera DSM 4582] Length = 572 Score = 44.3 bits (103), Expect = 0.048, Method: Composition-based stats. Identities = 58/338 (17%), Positives = 99/338 (29%), Gaps = 62/338 (18%) Query: 195 IINDEASGTPDVIN--LGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247 + DEA + +N G T R S P + Y ++FNK Sbjct: 230 LYLDEAFWISNFLNLRKVAAGMATHEGLRRT--YFSTPSSEEHEAYQFWTGDLFNKSRRK 287 Query: 248 WKRFQIDTRTVEGIDPSFHEGIIAR---------------------YGLDSDV-TRVEVC 285 +R +ID + I R G +S Sbjct: 288 AERVEIDISHKALKNGRLGGDGIWRQIVTIEDAIKLGFNRVKIETIKGENSPEDYDNLYR 347 Query: 286 GQFPQQDIDSF-----IPLNIIEEALNREPCPDPYAP-------LIMGCDI--AEEGGDN 331 +F +F I + + P +P+AP + +G D GD+ Sbjct: 348 CRFVTVGERAFNYNAMIGCCVDGFNDDVWPDWNPFAPRPIGDRGVWIGYDPNGGSGNGDS 407 Query: 332 TVVVLRR-----GPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTC 384 +V+ G + + I GL E+Y I ID Sbjct: 408 AGLVVIVPPAVAGGKFRIIERVQLRGMEFEEQAKVIEGLTERYNVQHIAIDGTG---GFG 464 Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444 D + L V AV ++ + + +K + L +G++ ++S + Sbjct: 465 DAVWQL-----VVKFFPLAVKYQYSVQLKRAMVLKALMLVRAGRLELDAGMMDLIQSFMT 519 Query: 445 FIVPNTGE-LAIESKRVKGAKSTDYSDGLMYTFAENPP 481 G + S R +G+ D + T N P Sbjct: 520 VRKVQKGNVMTYVSDRKRGSNHGDLAWA-SMTALYNEP 556 >gi|56560881|ref|YP_161301.1| hypothetical protein BGP016 [Borrelia garinii PBi] gi|52696522|gb|AAU85866.1| hypothetical protein BGP016 [Borrelia garinii PBi] Length = 396 Score = 44.3 bits (103), Expect = 0.052, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 + RG GKT A + L + G + + + + ++ E+ + LS+ + +F + Sbjct: 26 SSRGTGKTYDIATVNLERKFSVDGGDTLAIRKKKNKTTQSIHKEILELLSIYGLRKFFNI 85 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196 + + ++R F G H+T + + Sbjct: 86 SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124 Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254 +EA+ ++ + E+ + M+SNP S Y+ + N+ + Sbjct: 125 LEEANQFSSDDIEMLIPTMREQGGRIY--MSSNPVPKSHWLYKRYLANQDNPAVCIIKST 182 Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287 R ++ E + + Y + R+EV G+ Sbjct: 183 YRDNPFLNGGDVEAWLEKQKLAYHGNDIGFRIEVLGE 219 >gi|213580952|ref|ZP_03362778.1| hypothetical protein SentesTyph_07004 [Salmonella enterica subsp. enterica serovar Typhi str. E98-0664] Length = 67 Score = 44.3 bits (103), Expect = 0.052, Method: Composition-based stats. Identities = 12/48 (25%), Positives = 18/48 (37%) Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 I W D R + I L E+Y I ID+ G + ++ Sbjct: 16 RILERHQWRGMDFRAQADAIKKLTEQYNVTYIGIDSTGVGHGVYENVK 63 >gi|213162920|ref|ZP_03348630.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Typhi str. E00-7866] Length = 113 Score = 44.3 bits (103), Expect = 0.052, Method: Composition-based stats. Identities = 12/48 (25%), Positives = 18/48 (37%) Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 I W D R + I L E+Y I ID+ G + ++ Sbjct: 16 RILERHQWRGMDFRAQADAIKKLTEQYNVTYIGIDSTGVGHGVYENVK 63 >gi|323139470|ref|ZP_08074518.1| hypothetical protein Met49242DRAFT_3906 [Methylocystis sp. ATCC 49242] gi|322395272|gb|EFX97825.1| hypothetical protein Met49242DRAFT_3906 [Methylocystis sp. ATCC 49242] Length = 439 Score = 44.3 bits (103), Expect = 0.052, Method: Composition-based stats. Identities = 65/412 (15%), Positives = 125/412 (30%), Gaps = 62/412 (15%) Query: 82 ISAGRGIGKTTLNA----WLVLW--LMSTRPGISVICLANSETQLKTTLWAEVSKWLSLL 135 I GRG GKT A L L TRP + + + ++ + VS L++ Sbjct: 54 ILGGRGAGKTRAGAEWVKGLALGRPHFCTRPVSRIALIGETAADVREVMIEGVSGLLAIH 113 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVG--HHNTYGM 193 + +S + + +S E P++ G H Sbjct: 114 GKRDRPRWESSR---------------RRLVWDSGVVAQAFSAEDPESLRGPQFHAA--- 155 Query: 194 AIINDEASG--TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF 251 DE + + L + R + T+ P R + ++ P R Sbjct: 156 --WCDELAKWRYARETWDMLQFGLRLGDWPRQLV-TTTP-RPTPLLKDLIAHPATVLTR- 210 Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 + + PSF E ++A+Y + + R E+ G+ ++ D+ ++IE +R Sbjct: 211 ALTRENAANLAPSFLESVVAQY-AGTRLGRQELDGEIVEERKDALWTRDLIEA--SRVAD 267 Query: 312 PDPYAPLIMGCD-IAEEGG--DNTVVVLRRGPVIEHLFDWSKTDLRTT-----NNKISGL 363 A +++ D A G DN ++ +F + + + L Sbjct: 268 APRLARIVVAVDPPASFGKRADNCGIIAAGADAGGAIFVLADSTISAARPAQWARAAIAL 327 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423 K D ++ + N G L V + +L+ + Sbjct: 328 YHKLSADVLVAEVNQGGEMVRAVLNEAD-PAAPVTMVRATRGKYLRAAPVAQLYEQGR-- 384 Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 ++ L + G +S D D L++ Sbjct: 385 VKHVGAFP--ALEDEMC-----DFGFDGLSC--------GRSPDRLDALVWA 421 >gi|323978427|gb|EGB73511.1| terminase [Escherichia coli TW10509] Length = 595 Score = 43.9 bits (102), Expect = 0.054, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 44/162 (27%), Gaps = 20/162 (12%) Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302 P W+ ++ G + + + RY + F S + + Sbjct: 334 PDGQWRYVVTLEDAIAGGFNLADINELRERYNET--AFNMLFMCVFVDDKE-SVFKFDDL 390 Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVLR------RGPVIEHLF 346 + + P + G D A G + T VVL + Sbjct: 391 VRCGVDVSTWEDFHPEDAMPFGNREVWGGFDPARSGDNATFVVLSPPLVAAERFRVLEKH 450 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 W + +I + +Y I ID G + ++ Sbjct: 451 HWRSMSFQFMAERIRSIKARYNMTFIGIDVTGLGYGVFELVQ 492 >gi|291085166|ref|ZP_06570961.1| terminase, ATPase subunit [Citrobacter youngae ATCC 29220] gi|291072161|gb|EFE10270.1| terminase, ATPase subunit [Citrobacter youngae ATCC 29220] Length = 106 Score = 43.9 bits (102), Expect = 0.055, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 28/84 (33%), Gaps = 6/84 (7%) Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLV 364 P + P +G D + G VL G L W D T I L Sbjct: 17 RPFNWRPAWIGYDPSHTGDSAGCAVLAPPLVAGGKFRILERHQWRGMDFATQAEAIRELT 76 Query: 365 EKYRPDAIIIDANNTGARTCDYLE 388 EKY + I IDA + G + Sbjct: 77 EKYCVEYIGIDATDIGQGVYQLVR 100 >gi|259418958|ref|ZP_05742875.1| phage DNA Packaging Protein [Silicibacter sp. TrichCH4B] gi|259345180|gb|EEW57034.1| phage DNA Packaging Protein [Silicibacter sp. TrichCH4B] Length = 478 Score = 43.9 bits (102), Expect = 0.056, Method: Composition-based stats. Identities = 59/423 (13%), Positives = 110/423 (26%), Gaps = 79/423 (18%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTTLWAEVSKWL 132 I GRG GKT A W+ S G + + + Q++ + Sbjct: 86 ILGGRGAGKTRAGA---EWVRSEVEGAEPFGIGRARRMALVGETYDQVRDVM-------- 134 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 + S W + + +S P+ G Sbjct: 135 --IHGDSGILACSPPDRRPEWRAGERRLVWPN-----GATAQAFSASDPEALRGPQFD-- 185 Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250 A DE + + L A R + + R ++ P Sbjct: 186 -AAWVDELAKWRRAQDAWDMLQFALRLGAAPR--VCVTTTPRNVPLLKQLLESPSTV-TT 241 Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 + P F + ARYG S + R E+ G + ++E+ R+ Sbjct: 242 HAPTEANRANLAPGFLTEVRARYG-GSRLARQELDGVMLADVDGALWTSGMLEQLQRRDR 300 Query: 311 CPDPYAPLIMGCDI---AEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG----- 362 P +++ D A +G D +++ + +W + Sbjct: 301 PP--LDRIVVAVDPSVSAHKGSDACGIIVAGAQTQGPISEWRAY---VLADHTVQGLGPT 355 Query: 363 --------LVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNR 412 + YR + ++ + N GA L + V K Sbjct: 356 GWARAAIAARDAYRAERLVAEVNQGGALVGTVLRQVDPLVPFTPVHASKGKA-------A 408 Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGL 472 R E + + L + + + + G S D D L Sbjct: 409 RAEPVAALYEQGRVHHAPGLQELEEQMCLMTAQGYRGDG-------------SPDRVDAL 455 Query: 473 MYT 475 ++ Sbjct: 456 VWA 458 >gi|254474412|ref|ZP_05087798.1| phage DNA Packaging Protein [Ruegeria sp. R11] gi|214028655|gb|EEB69490.1| phage DNA Packaging Protein [Ruegeria sp. R11] Length = 417 Score = 43.9 bits (102), Expect = 0.057, Method: Composition-based stats. Identities = 64/415 (15%), Positives = 113/415 (27%), Gaps = 63/415 (15%) Query: 82 ISAGRGIGKTTLNA-WLVLWLMSTRPGI-----SVICLANSETQLKTTLWAEVSKWLSLL 135 I GRG GKT A W+ + P + L + Q++ + S L+ Sbjct: 25 ILGGRGAGKTRAGAEWVRALAEGSTPLSAGRARRIALLGETYDQVRDVMVQGDSGILACT 84 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 P P + + +S P+ G A Sbjct: 85 P---------------PDRRPQWKATERRLIWPNGATAQAFSAHDPEALRGPQFD---AA 126 Query: 196 INDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 DE + + L + +T+ P R ++ P + Sbjct: 127 WADELAKWKRGQDSWDMLQFAL-RLGTDPRVCVTTTP-RNVSVLRDLLASPSTV-QTHAA 183 Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313 + SF E + RY S + R E+ G Q + + A R P Sbjct: 184 TEANRANLATSFIEEVRNRY-AGSRLGRQELDGVLLQDVEGALWCNAGLVGAQVRSAPP- 241 Query: 314 PYAPLIMGCDIA---EEGGDNTVVVLR--------RGPVIEHLFDWSKTDLRTT--NNKI 360 +++ D A + D +++ + L D + R + Sbjct: 242 -LDRVVVAVDPAVSAGKSSDACGILVVGAVLQGPPQDWRAYVLADCTVQGARPLVWAQAV 300 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 ++ D ++ + N GA L + V R + R E + Sbjct: 301 VDAAHRFDADRVVAEVNQGGALVESLLRQIDPLV-----PFRPRHAARSKGARAEPVAAL 355 Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 + L L + + G L G S D D L++ Sbjct: 356 YEQGRVRHLPGLGALEDQMC-----QMTPRGYL--------GQGSPDRLDALVWA 397 >gi|219723512|ref|YP_002476767.1| phage terminase, large subunit, pbsx family [Borrelia garinii PBr] gi|219694406|gb|ACL34930.1| phage terminase, large subunit, pbsx family [Borrelia garinii PBr] Length = 396 Score = 43.9 bits (102), Expect = 0.059, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 74/217 (34%), Gaps = 36/217 (16%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 + RG GKT A + L + G + + + + ++ E+ + LS + +F + Sbjct: 26 SSRGTGKTYDIATVNLERKFSADGGDTLAIRKKKNKTTQSIHKEILELLSRYNLRKFFNI 85 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196 + + ++R F G H+T + + Sbjct: 86 SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124 Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254 +EA+ ++ + E+ + M+SNP S Y+ + N+ + Sbjct: 125 LEEANQFSSDDIEMLIPTMREQGGRIY--MSSNPVPKSHWLYKRYLANQDNPAVCIIKST 182 Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287 R ++ E + + Y + R+EV G+ Sbjct: 183 YRDNPFLNGGDVEAWLEKQKLAYHGNDIGFRIEVLGE 219 >gi|212712878|ref|ZP_03321006.1| hypothetical protein PROVALCAL_03975 [Providencia alcalifaciens DSM 30120] gi|212684570|gb|EEB44098.1| hypothetical protein PROVALCAL_03975 [Providencia alcalifaciens DSM 30120] Length = 436 Score = 43.9 bits (102), Expect = 0.059, Method: Composition-based stats. Identities = 67/424 (15%), Positives = 127/424 (29%), Gaps = 58/424 (13%) Query: 75 PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSL 134 P FK + AG G GKT + + M P I+ A + Q++ + + Sbjct: 18 PHKFKAYV-AGFGSGKTWVGCGGICKGMWEFPKINQGYFAPTYPQIRDIFYPTI----EE 72 Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 + ++ + + + + + Y S E+P+T VG + Sbjct: 73 VALDWGLKVNIVESNKEVHF---------YEGRRYRGTVICRSMEKPETIVGFKIGNAL- 122 Query: 195 IINDE----ASGTPDVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIFNKPLDD- 247 DE S I+ + +T+ P Y+ F K + D Sbjct: 123 --IDELDVMKSDKAQKAWRKIIARMRYNVAGLRNGIDVTTTPEG-FKFVYQQFVKAVRDK 179 Query: 248 ------WKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 + Q T E + + +++ Y ++ + + GQF + I Sbjct: 180 PELSTLYGIVQASTFDNEKNLPADYIPSLMSSY--PPELIKAYLKGQFTNLTSGT-IYHT 236 Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGP---VIEHLFDWSKTDLRTTN 357 + N E P L +G D V VLR G V E + + D+ Sbjct: 237 FDRKLNNSEEEEQPGETLYIGMDFNVGKMAGIVHVLRLGLPHAVTEIINAYDTPDMVRII 296 Query: 358 NKISGLVE-----KYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVD 405 + L + K R I DA+ A + D L G+HV ++ Sbjct: 297 KERFWLYDGSDYKKVREIYIYPDASGDSRKSNNASSTDIAQLRQAGFHV--IVNDSNPPV 354 Query: 406 LEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKS 465 + + + + + + + + + E + G Sbjct: 355 KDRINSMNA-MFCNAKGERRYKVNVKRCPVYTESLEQQVWDPTSG-----EPDKKSGNDH 408 Query: 466 TDYS 469 + Sbjct: 409 PNDG 412 >gi|320160638|ref|YP_004173862.1| hypothetical protein ANT_12280 [Anaerolinea thermophila UNI-1] gi|319994491|dbj|BAJ63262.1| hypothetical protein ANT_12280 [Anaerolinea thermophila UNI-1] Length = 1068 Score = 43.9 bits (102), Expect = 0.062, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 65/204 (31%), Gaps = 17/204 (8%) Query: 43 TPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLM 102 T L+ F P S + C+ N P F + G GKT + L Sbjct: 539 TYLKQFENPTSDINRKRNEILHACIEKGENEKPGFFSLTVPT--GGGKTLASIAFALHHA 596 Query: 103 STRPGISVICLAN--SETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHC 160 +T +I + + + ++ E+ ++L + F+ A ++ + Sbjct: 597 ATHGLKRIIYVIPFTTIIEQNAQVFKEIFGEENVLEHHSNFDWNDGKREDADNRTNSILA 656 Query: 161 SLGIDSKHYSTMCRTYS---------EERPDTFVGHHNTYGMAIINDEASGTPD----VI 207 L + ++++ + + + HN II DEA P Sbjct: 657 KLKLAAENWDIPIVVTTNVQFFESLFDNKSSRCRKLHNIAKSVIIFDEAQMLPKEYIRPA 716 Query: 208 NLGILGFLTERNANRFWIMTSNPR 231 + +T A+ + + P Sbjct: 717 MAAVWELVTNYGASAVFCTATQPG 740 >gi|119967835|ref|YP_950664.1| putative large terminase subunit [Staphylococcus phage PH15] gi|112790059|gb|ABI21779.1| putative large terminase subunit [Staphylococcus phage PH15] Length = 446 Score = 43.9 bits (102), Expect = 0.063, Method: Composition-based stats. Identities = 41/365 (11%), Positives = 109/365 (29%), Gaps = 41/365 (11%) Query: 56 LEFMEVVDAHCLNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114 ++ E++ H + + GRG GK++ + ++ + R ++ + + Sbjct: 4 IKLSELLPKHFHSLWKATKDREKLNIVAKGGRGSGKSSDISIIIT-QLIMRYPMNAVVVR 62 Query: 115 NSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCR 174 ++ L T+++ ++ + H F+++ + + + R Sbjct: 63 KADNTLATSVFEQIKWAIEEQKVSHLFKVKVS------------PMEITYVPRGNRIIFR 110 Query: 175 TYSEERPDTFVGHHNTY---GMAIINDEASG-TPDVINLGILGFL---TERNANRFWIMT 227 + P+ ++ + I + A T D + L + + + Sbjct: 111 GA--QNPERLKSLKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDDGLFYKFFFS 168 Query: 228 SNPRRLSGKF----YEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283 NP + + YE +P + + I F + + + R E Sbjct: 169 YNPPKRKQSWVNKKYETSFQPDNTFVHHS-TYLDNPFISKQFIQEAESAKERNEQRYRWE 227 Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN--TVVVLRRGPV 341 G+ +P N ++ + + + D + V + G Sbjct: 228 YMGEAIGS---GVVPFNNLQIEKIPDELYKSFDNIRNAVDFGLTKTAPLHSDVYSKLGEH 284 Query: 342 IEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY-----LEMLGYHVYR 396 I + + + +K + +D + G + + L+ GY Sbjct: 285 ISGVRKKACATDPL--AFVRWHYDKKKRIIYAVDEH-YGVQISNREFANWLKRRGYQSDE 341 Query: 397 VLGQK 401 + Sbjct: 342 IYADS 346 >gi|219048282|ref|YP_002455497.1| PbsX family phage terminase large subunit [Borrelia afzelii ACA-1] gi|216752464|gb|ACJ73223.1| phage terminase, large subunit, pbsx family protein [Borrelia afzelii ACA-1] Length = 396 Score = 43.9 bits (102), Expect = 0.064, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 + RG GKT A + L + G + + + + ++ E+ + LS+ + +F + Sbjct: 26 SSRGTGKTYDIATVNLERKFSVDGGDTLAIRKKKNKTTQSIHKEILELLSIYGLRKFFNI 85 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196 + + ++R F G H+T + + Sbjct: 86 SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124 Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254 +EA+ ++ + E+ + M+SNP S Y+ + N+ + Sbjct: 125 LEEANQFSSDDIEMLIPTMREQGGRIY--MSSNPVPKSHWLYKRYLANEDNPAVCIIKST 182 Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287 R ++ E + + Y + R+EV G+ Sbjct: 183 YRDNPFLNGGDVEAWLEKQKLAYHGNDIGFRIEVLGE 219 >gi|289824955|ref|ZP_06544345.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Typhi str. E98-3139] Length = 98 Score = 43.9 bits (102), Expect = 0.066, Method: Composition-based stats. Identities = 12/48 (25%), Positives = 18/48 (37%) Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 I W D R + I L E+Y I ID+ G + ++ Sbjct: 2 RILERHQWRGMDFRAQADAIKKLTEQYNVTYIGIDSTGVGHGVYENVK 49 >gi|307294267|ref|ZP_07574111.1| hypothetical protein SphchDRAFT_1737 [Sphingobium chlorophenolicum L-1] gi|306880418|gb|EFN11635.1| hypothetical protein SphchDRAFT_1737 [Sphingobium chlorophenolicum L-1] Length = 438 Score = 43.9 bits (102), Expect = 0.067, Method: Composition-based stats. Identities = 65/403 (16%), Positives = 115/403 (28%), Gaps = 55/403 (13%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 AGRG GKT A V + P + + + + + + E Sbjct: 59 AGRGFGKTRAGAEWVRSVAEGDPAARIALVGATLGEARAVM----------------VEG 102 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHY-STMCRTYSEERPDTFVGHHNTYGMAIINDE--- 199 S L +PW++ + + ++ G ++G DE Sbjct: 103 ASGVLAVSPWWNRPAFLPALRKLVWRNGAVATLFGAAEAESLRGPQFSHG---WADEIAK 159 Query: 200 -ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTV 258 A G + ++G T P L E D Sbjct: 160 WAGGQA-AWDNLMMGMRLGIAPRVLATTTPRPVALVRGLVE--RNGSDVVVTRGRSADNA 216 Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL 318 + F + YG + R E+ G+ ++ + +++E A + Sbjct: 217 SHLADGFLAAMERNYGGT-RLGRQELDGELIEEVEGALWSRDLLERCRVAHVRGT-LARV 274 Query: 319 IMGCDI-AEEGGDNT---VVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYRPDAI 372 ++ D A GD VV L + D + ++ + D + Sbjct: 275 VVAVDPPASVHGDACGIVVVGLGGDGRAYVIADATVEGATPEGWARAVAAAALVHGADRV 334 Query: 373 IIDANNTGARTCDYLE--MLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI 430 + +ANN GA L G V V + V R E + + A Sbjct: 335 VAEANNGGAMVESVLRAAEAGLPVRLVHASRGKV-------ARAEPVAALYEAGRVAHRG 387 Query: 431 NHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLM 473 + L L L + G + +S D +D L+ Sbjct: 388 GFAELEDQLCGL----MLGGGYV-------GPGRSPDRADALV 419 >gi|158422729|ref|YP_001524021.1| hypothetical protein AZC_1105 [Azorhizobium caulinodans ORS 571] gi|158329618|dbj|BAF87103.1| conserved hypothetical protein [Azorhizobium caulinodans ORS 571] Length = 436 Score = 43.9 bits (102), Expect = 0.067, Method: Composition-based stats. Identities = 65/410 (15%), Positives = 123/410 (30%), Gaps = 58/410 (14%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPG------ISVICLANSETQLKTTLWAEVSKWLSLL 135 + GRG GKT A V L RP + +A + L+ + VS L++ Sbjct: 51 VLGGRGAGKTRAGAEWVRGLALGRPPFAPAPVGRIALVAETMGDLREVMVEGVSGLLAVH 110 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 P + + + +S E P++ G A Sbjct: 111 PAAERPRWEPTR---------------RRLVWPNGAVAQGFSAEDPESLRG---PQFEAA 152 Query: 196 INDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 DE + + + + L R M + R S + P R Sbjct: 153 WLDELAKWRRAEAVFDMLQFGLRLGAQPRQ--MVTTTPRPSALLRRLMADPSTVLSRAT- 209 Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313 + + P+F + ++ARYG + R E+ G+ + D+ + RE Sbjct: 210 TAQNAFHLAPAFLDTVLARYGGT-RLGRQELEGEIIEDRPDALWTRSA--LEAAREAAAP 266 Query: 314 PYAPLIMGCDI---AEEGGDNTVVVL--RRGPVIEHLF---DWSKTDLRTTNNKISGLVE 365 P A +++ D + G D ++ G + H+ + + L Sbjct: 267 PLARVVVALDPPASSRAGADACGIIAAGIDGEGLVHVLADATAAGLRPAQWAARAIDLWR 326 Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE 425 + DA++ + N G L + V V+ + L+ + + Sbjct: 327 THEADAVVAEVNQGGEMVRSVLAEVDASV-PVVSVRATRGKYLRAEPVAALYEQGR--VR 383 Query: 426 FASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 A L + + + +S D D L++ Sbjct: 384 HAGAFP--ALEDEMCDFGPEGLSS-------------GRSPDRLDALVWA 418 >gi|254776419|ref|ZP_05217935.1| phage terminase [Mycobacterium avium subsp. avium ATCC 25291] Length = 491 Score = 43.9 bits (102), Expect = 0.069, Method: Composition-based stats. Identities = 71/462 (15%), Positives = 132/462 (28%), Gaps = 81/462 (17%) Query: 52 RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGIS-V 110 R WQ+ L +P+P GAI RG+GKT + A L L+ + P + + Sbjct: 51 RPWQM--------GMLRPFLDPDPRPLVGAIMGPRGLGKTGIFAALGLYELFCGPDGNEI 102 Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170 +A E L + V + + K + Sbjct: 103 PIVAVDERMAGRLL--------------KPAAQMVELNDELAARAVVYRDRIEVPGKRST 148 Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG------------ILGFLTER 218 +R + T+ +A + DE LG T Sbjct: 149 LTALPAEAKRIEGL----GTWTLA-LADELGEIDPDTWSTLLLGAGKLDGAMALGIGTPP 203 Query: 219 NANRFWI------MTSNPRRLSGKFYEIFNKPLDDWKRFQIDT-RTVEGIDPSFHEGIIA 271 N + +NP + FYE D ++ + +E +P + + Sbjct: 204 NRETSVLTDLREACRANPDDRTMAFYEF---SADGFEHHPVSCVHCLELANPQLDDLLSR 260 Query: 272 RYG------LDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIA 325 R + Q + F+ + + P PD A +++ D Sbjct: 261 DRATALLKQTTEGEYRRKRLCQVVTTNESPFVDADTWDGLKAPHPVPD-GADVVIALD-G 318 Query: 326 EEGGDNTVVV---LRRGPVIEHLFDWS-------KTDLRTTNNKISGLVEKYRPDAIIID 375 D+T +V + + P + L W + + I +++R I D Sbjct: 319 SLKDDSTALVVGTVGKVPHFDRLDAWENPGDEAWRVPVLDVEQAIREAAKRWRVREIAFD 378 Query: 376 ANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGL 435 R+ L G + Q A + R+ + + L + L Sbjct: 379 PY-LFTRSAQILAAEGLPMVEFR-QSPARQTAATNDLRS---AAVNEQLTHSG---DEVL 430 Query: 436 IQNLKSLKSFIVPNTGELAIESKRVKGAKST--DYSDGLMYT 475 +++ + +A K + + D LM Sbjct: 431 RRHVLAATVLESDKGIRIA---KVNRSKHAPKIDLCTALMMA 469 >gi|296445591|ref|ZP_06887546.1| protein of unknown function DUF264 [Methylosinus trichosporium OB3b] gi|296256836|gb|EFH03908.1| protein of unknown function DUF264 [Methylosinus trichosporium OB3b] Length = 442 Score = 43.6 bits (101), Expect = 0.071, Method: Composition-based stats. Identities = 67/418 (16%), Positives = 126/418 (30%), Gaps = 75/418 (17%) Query: 82 ISAGRGIGKTTLNA-WLVLWL-----MSTRPGISVICLANSETQLKTTLWAEVSKWLSLL 135 I +GRG GKT A W+ +TRP + + + ++ + VS L+ Sbjct: 58 ILSGRGAGKTRAGAEWVKGIARGRPQFATRPLSPIALIGETAADVRDVMIEGVSGILAAH 117 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 +S + + +S E P++ G A Sbjct: 118 SRSERPLWESSRRRLTF---------------DNGVVAQAFSAEDPESLRG---PQFAAA 159 Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF---- 251 DE + + +F + + R + +P+ KR Sbjct: 160 WCDELAK-----WRYAEETW---DMLQFGLRLGDWPR---QLVTTTPRPMPLIKRLLTEN 208 Query: 252 -----QIDTR-TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305 + TR + PSF E ++++YG + R E+ G+ +Q D+ +++E A Sbjct: 209 GVAVTRAKTRANAANLAPSFLETVLSQYGGT-RLGRQELDGEIVEQRADALWTRDMLERA 267 Query: 306 LNREPCPDPYAPLIMGCD-IAEEGG--DNTVVVLRRGPVIEHLFDWSKTDLRTT-----N 357 R P P +++ D A G D +V G + + + Sbjct: 268 --RILAPPPLERIVVAIDPPASSGKRADRCGIVA-VGIAQNIVHVLADATVEAARPAQWA 324 Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELH 417 L K DA++ + N G + V + +L+ Sbjct: 325 RAAIALYHKLSADALVAEVNQ-GGEMVRAVIHEADPSVPVKEARATRGKYLRAAPAAQLY 383 Query: 418 VKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 + + L + G + S +S D D L++ Sbjct: 384 EQGRAR-HVGAFPA---LEDEMC-----DFGPDG---LSS-----GRSPDRLDALVWA 424 >gi|89054122|ref|YP_509573.1| hypothetical protein Jann_1631 [Jannaschia sp. CCS1] gi|88863671|gb|ABD54548.1| protein of unknown function DUF264 [Jannaschia sp. CCS1] Length = 483 Score = 43.6 bits (101), Expect = 0.071, Method: Composition-based stats. Identities = 62/424 (14%), Positives = 113/424 (26%), Gaps = 75/424 (17%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTTLWAEVSKWL 132 + GRG GKT A W+ S G V + + Q + Sbjct: 91 VLGGRGAGKTRAGA---EWVRSMVEGATPEAPGRAKRVALIGETYDQAMAVMVK------ 141 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 + S W + ++ + +S P+ G Sbjct: 142 ----GESGLIACSPPDRVPRWIAGERKLVWPNGAE-----AQVFSANDPEALRGPQFD-- 190 Query: 193 MAIINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250 DE + P + L + I+T+ P R ++ + Sbjct: 191 -LAWADELAKWPKAQETWDMLQFGL-RLGQHPQQIVTTTP-RNVNVLKDLLARD-GVAHT 246 Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 + SF + +RYG D+ + R E+ G ++ I+E R Sbjct: 247 HAPTEANSAYLADSFLTEVRSRYG-DTRLGRQELDGVLLDDVDNALWVRGAIDE--GRLT 303 Query: 311 CPDPYAPLIMGCDI---AEEGGDNTVVVLRRGPVIE-HLFDWS----------KTDLRTT 356 +I+ D G D +V+ G + W Sbjct: 304 DAPDVTRVIVAVDPPVTGHAGSDACGIVV-VGIIERGDPAQWRAVVLEDCSVQGVSPNQW 362 Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRRT 414 N ++ ++ + N G D + + ++ V V R Sbjct: 363 ANAAVAAYHRHGASRMVAEVNQGGVMVADTIRTVDPTINLRTVHASTGKV-------ARA 415 Query: 415 ELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMY 474 E + + + A L H+ L + + G S D D L++ Sbjct: 416 EPVAALYEQGKVAHLGTHAELEDEMCKMALTGYEGQG-------------SPDRVDALVW 462 Query: 475 TFAE 478 E Sbjct: 463 ALTE 466 >gi|312149784|gb|ADQ29854.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi N40] Length = 304 Score = 43.6 bits (101), Expect = 0.074, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEG-GDNTVVVL 336 F +NI ++ + P I D A GDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVRGDNTALCV 299 >gi|203288763|ref|YP_002223713.1| hypothetical protein BDU_2013 [Borrelia duttonii Ly] gi|201084613|gb|ACH94190.1| uncharacterized conserved protein [Borrelia duttonii Ly] Length = 398 Score = 43.6 bits (101), Expect = 0.074, Method: Composition-based stats. Identities = 36/244 (14%), Positives = 79/244 (32%), Gaps = 23/244 (9%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 + RG GKT A + L + G + + + + ++ E+ + LS + F + Sbjct: 27 SSRGTGKTYDIATVNLERKFAKDGGDTLAVRKKKNKTTQSIHKEILELLSRYNLRREFTI 86 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 + + G ++ + + + +EA+ Sbjct: 87 SKAKI-------ETKKLIYGRKRAFVFEGGHDTTDLKSYA-------HFKDLWLEEANQF 132 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQIDTRTVEGI 261 + ++ + ER + M+SNP S Y+ + N+ + R + Sbjct: 133 TESDIEKLIPTMRERGGRIY--MSSNPVPRSHWLYKRYIANEDNPSVCVIKSTYRDNPFL 190 Query: 262 DP----SFHEGIIARYGLDSDVTRVEVCG-QFPQQDIDSFIPLNIIEEALNREPCPDPYA 316 + ++ E Y + R+EV G +F I +E+L Y Sbjct: 191 NGGDVNAWLEKQKLAYHGNDIGFRIEVLGEEFEFGTARFIKEFTICDESLISRVQGSFYT 250 Query: 317 PLIM 320 + + Sbjct: 251 GIHI 254 >gi|195942758|ref|ZP_03088140.1| hypothetical protein Bbur8_08065 [Borrelia burgdorferi 80a] Length = 312 Score = 43.6 bits (101), Expect = 0.074, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEG-GDNTVVVL 336 F +NI ++ + P I D A GDNT + + Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVRGDNTALCV 299 >gi|66394679|ref|YP_240816.1| ORF009 [Staphylococcus phage X2] gi|62636903|gb|AAX92014.1| ORF009 [Staphylococcus phage X2] Length = 421 Score = 43.6 bits (101), Expect = 0.075, Method: Composition-based stats. Identities = 40/309 (12%), Positives = 97/309 (31%), Gaps = 35/309 (11%) Query: 83 SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFE 142 GRG GK++ + ++ + R ++ + + ++ L T+++ ++ + H F+ Sbjct: 33 KGGRGSGKSSDISIIIT-QLIMRYPMNAVVIRKTDNTLATSVFEQIKWAIEEQKVSHLFK 91 Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY---GMAIINDE 199 ++ + + + R + P+ ++ ++ I + Sbjct: 92 VKVS------------PMEITYIPRGNRIIFRGA--QNPERLKSLKDSRFPFSISWIEEL 137 Query: 200 ASGTPDVINLGILGFL----TERNANRFWIMTSNPRRLSGKF----YEIFNKPLDDWKRF 251 A + I L + + + NP + + YE + + + Sbjct: 138 AEFKTEDEVTTITNSLLRGELDEGLFYKFFFSYNPPKRKQSWVNKKYESSFQADNTYVHH 197 Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 I F + + + R E G+ + F L IEE R+ Sbjct: 198 S-TYLNNPFISKQFIQEAESAKKRNEQRYRWEYMGEAIGSGVVPFNNLR-IEEIPQRQY- 254 Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKY 367 D + + D D V ++ VI + + + + Y Sbjct: 255 -DTFDNIRNAVDFG-YATDPLAFVRWHYDKKKRVIYAMDEHYGVQISNREFANWLKKKGY 312 Query: 368 RPDAIIIDA 376 + D + D+ Sbjct: 313 QSDEVFADS 321 >gi|216996657|ref|YP_002333778.1| phage terminase, large subunit, PBSX family [Borrelia afzelii ACA-1] gi|216752579|gb|ACJ73283.1| phage terminase, large subunit, PBSX family [Borrelia afzelii ACA-1] Length = 450 Score = 43.6 bits (101), Expect = 0.076, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKIDYIDNVATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIAITDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299 >gi|265753755|ref|ZP_06089110.1| terminase [Bacteroides sp. 3_1_33FAA] gi|263235469|gb|EEZ20993.1| terminase [Bacteroides sp. 3_1_33FAA] Length = 521 Score = 43.6 bits (101), Expect = 0.082, Method: Composition-based stats. Identities = 34/240 (14%), Positives = 72/240 (30%), Gaps = 33/240 (13%) Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQF---PQQDIDSFIPLNIIEEALNREPCPDPYAPL 318 +P++ + A G + + + G F P+++ IP + N P + Sbjct: 244 NPNYIGSVAASGGK---MAQAIIEGNFNVDPEENEKIPIPSTSAQGVFNNNPAVN--GDK 298 Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378 + D+A+ G DN V + G + SK+ R + ++ I + Sbjct: 299 WITVDLADYGTDNLVALAWDGFHAYDILILSKSTPRENAMAVKTFAFEHGTAESHIIFDA 358 Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGL--- 435 T R + + L + ++++ +E +L L Sbjct: 359 TAGRYFNDYIPDAVPYISLNKPFGLYQLTAMTVKDM-CYIRLCKMIEEGNLTFDDKLAVQ 417 Query: 436 ----------------IQNLKSLKSFIVPNTGELAIESKRVK-----GAKSTDYSDGLMY 474 S+ F +G+ + +K+ +S D D Sbjct: 418 TYTHQNLKYKVTVENEFMEECSVVRFDDMQSGKKRLWNKKKMNQMLGKGRSMDLLDPCAM 477 >gi|319409256|emb|CBI82900.1| phage-related protein [Bartonella schoenbuchensis R1] Length = 441 Score = 43.6 bits (101), Expect = 0.083, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 66/194 (34%), Gaps = 11/194 (5%) Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNAN--RFWIMTSNPRRLSGKFYEIFNKPLDD- 247 + DEA + ++ L E N +T NP R + + F D Sbjct: 122 RILLCWVDEAEPVTETAWQTLIPTLREEGENWHCELWVTWNPLRENAPVEKRFRAVKDPH 181 Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF---IPLNIIEE 304 K +I+ R + A + + G++ Q ++ + L +E Sbjct: 182 IKGVEINWRDNPQFPDRLNRAREADFTQRPEQYNHIWEGEYLQAVQGAYYQKLLLEAEQE 241 Query: 305 ALNREPCPDPYAPLIMGCDIAEEG--GDNTVVVLRR--GPVIEHLFDWSKTDLRTTNNKI 360 DP + + DI G D T + + + G I L D+ + + + I Sbjct: 242 GRITHVPRDPLIQIKIFWDIGGTGAKADATALWVAQFIGREIRVL-DYYEAQGQPLSEHI 300 Query: 361 SGLVEKYRPDAIII 374 + ++ A+++ Sbjct: 301 GWMCQRGYDKALMV 314 >gi|317152167|ref|YP_004120215.1| hypothetical protein Daes_0447 [Desulfovibrio aespoeensis Aspo-2] gi|316942418|gb|ADU61469.1| hypothetical protein Daes_0447 [Desulfovibrio aespoeensis Aspo-2] Length = 590 Score = 43.6 bits (101), Expect = 0.083, Method: Composition-based stats. Identities = 46/293 (15%), Positives = 92/293 (31%), Gaps = 40/293 (13%) Query: 164 IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRF 223 + Y + + + + + H + + + TP + T N+ Sbjct: 255 VYIDEYFWITKFNELYKVASAMAAHKKWRITLF-----STPSAVTHEAYDLWTGDRFNKR 309 Query: 224 WIMTSNPRRLSGKFYEIFNK----PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSD 278 W +R +E + P W++ I G D E + +Y +D Sbjct: 310 WSR--QAKRKEFPSFEAMQRGVVCPDKVWRKVITIKDAEAGGCDLFDFEDLNLQY--STD 365 Query: 279 VTRVEVCGQFPQQDIDSFIPLNIIEEALNR----------EPCPDPYAPLIMGCDIAEEG 328 R +F D+ + L+ +E P P+ G D + Sbjct: 366 EFRNLFMCEFVD-DLQAVFRLHNLEACYGDMDEWTDFNPDAARPFGNLPVWGGYDPSRNR 424 Query: 329 GDNTVVVL----RRGPVIEHLFDWSKTDLRTT--NNKISGLVEKYRPDAIIIDANNTGAR 382 D + V+L + G + L + D T +I L +++ I ID G Sbjct: 425 DDASFVILAPPLQPGGMFRVLARYKWVDKSYTWQAQRIKELTQQFNFVHIGIDVTGPGIG 484 Query: 383 TCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGL 435 + ++ A+ + + +T L +K D +E + + L Sbjct: 485 VFESVQA---------FFPAAMPITYGVQTKTTLVLKAKDVIESGRIQWDASL 528 >gi|328857391|gb|EGG06508.1| hypothetical protein MELLADRAFT_36161 [Melampsora larici-populina 98AG31] Length = 824 Score = 43.6 bits (101), Expect = 0.084, Method: Composition-based stats. Identities = 29/151 (19%), Positives = 52/151 (34%), Gaps = 9/151 (5%) Query: 87 GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEMQS 145 G+GKT L+L G + +A + + W E+ K+ L W Sbjct: 237 GMGKTIQTISLILSDRKAGDGKQTLVIAPT---VAIIQWRNEIEKFTKGLKVNVWHGGNR 293 Query: 146 LSLHPAPWYSDVLHCSLGIDSKHYS---TMCRTYSEERPDTFVGHHNTYGMAIINDEASG 202 + D++ S + + + R + E R + + H+ + +I DEA Sbjct: 294 STDKKTMKSYDIVLTSYAVLESSFRRQNSGYRKFGELRKEASL-LHSIHWHRVILDEAHN 352 Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRL 233 D G E A W ++ P + Sbjct: 353 IKDRSCNTAKGAF-ELQATFKWCLSGTPLQN 382 >gi|160897385|ref|YP_001562967.1| PBSX family phage terminase large subunit [Delftia acidovorans SPH-1] gi|160362969|gb|ABX34582.1| phage terminase, large subunit, PBSX family [Delftia acidovorans SPH-1] Length = 433 Score = 43.6 bits (101), Expect = 0.086, Method: Composition-based stats. Identities = 41/295 (13%), Positives = 77/295 (26%), Gaps = 25/295 (8%) Query: 147 SLHPAPWYSDVLHCSLG-IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI-INDEASGTP 204 ++ PW + I S+ +R + + + DEA Sbjct: 78 AIEDEPWLAAYYDVGDKYIKSRDGRITFAFAGLDR--NIASIKSKGRLLLCWVDEAEPVT 135 Query: 205 DVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGI 261 D ++ L E N +T NP+R S + F K + + + Sbjct: 136 DEAWTTLIPTLREEGTDWNAELWVTWNPKRKSAPVEKRFKGSSDPRMKYVRCNWKDNPKF 195 Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII----EEALNREPCPDPYAP 317 + + G + ++ +I+ + + R P DP Sbjct: 196 PALLERVRLRDLAERPEQYAHIWEGDYATVIEGAYFASHIVKARQDNRIGRVPA-DPLMT 254 Query: 318 LIMGCDIAEEGGDNTVVVLRR----GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII 373 L DI G + G I L + + +++ + Y I Sbjct: 255 LRAFVDIGGTGARADAFAMWIAQFVGKEIRVLDYYEQVGQPLSSHLNWMREKGYDKAQIW 314 Query: 374 IDANNTGARTC------DYLEMLGYHVYRVLGQKRAVDLEFCRNRR---TELHVK 419 + + L GY V V Q + R + Sbjct: 315 LPHDGATQDKVHDVSYESALRQAGYTVTVVPNQGKGAAKARIEAGRRLFGSMWFN 369 >gi|319899324|ref|YP_004159421.1| hypothetical protein BARCL_1179 [Bartonella clarridgeiae 73] gi|319403292|emb|CBI76851.1| phage-related protein [Bartonella clarridgeiae 73] Length = 442 Score = 43.6 bits (101), Expect = 0.087, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 64/193 (33%), Gaps = 9/193 (4%) Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNAN--RFWIMTSNPRRLSGKFYEIFN-KPLDD 247 + DEA D ++ L E + +T NP R + + F + Sbjct: 122 RILLCWVDEAEPVTDAAWQILIPTLREEGKDWHSELWVTWNPCRENAAVEKRFRFTKDPN 181 Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 K +I+ R + A + + G++ Q ++ ++E Sbjct: 182 VKGVEINWRDNPKFPAKLNRDRKADLEQRPEQYQYIWEGEYLQAMQGAYYQKLLLEAEQE 241 Query: 308 REPCPDPYAPLI---MGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTNNKIS 361 P PLI + DI G D T + + + + D+ + + + I Sbjct: 242 GRITKVPRDPLIQIKIFWDIGGTGAKADATALWVAQFVGREIRVLDYYEAQGQPLSEHIG 301 Query: 362 GLVEKYRPDAIII 374 + +K A+++ Sbjct: 302 WICQKGYEKALMV 314 >gi|16127022|ref|NP_421586.1| hypothetical protein CC_2790 [Caulobacter crescentus CB15] gi|221235816|ref|YP_002518253.1| phage DNA packaging protein [Caulobacter crescentus NA1000] gi|13424390|gb|AAK24754.1| conserved hypothetical protein [Caulobacter crescentus CB15] gi|220964989|gb|ACL96345.1| phage DNA packaging protein [Caulobacter crescentus NA1000] Length = 567 Score = 43.6 bits (101), Expect = 0.087, Method: Composition-based stats. Identities = 62/405 (15%), Positives = 109/405 (26%), Gaps = 60/405 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 GRG GKT A + W P ++I + ++ + + + Sbjct: 199 GGRGAGKTFAGARWITWNALAYPSQALI--GPTLHDVREVM----------IEGPSGLKA 246 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVG--HHNTYGMAIINDEAS 201 + W + + +S E P++ G H DE Sbjct: 247 MGGPAYRPRWEASRRRLVWPN-----GAVAYAFSAEDPESLRGPQFHAA-----WADEFC 296 Query: 202 GTPDVINLGI---LGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTV 258 P G + T P R + + Sbjct: 297 AWPKPAETLAMLRFGLRLGEDPRLVVTTTPKPHRAL----KTLMAEPGVALTRAGTSANA 352 Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL 318 + P+F + + YG + E+ G + + A R P + Sbjct: 353 GNLAPAFLRTLASLYGGT-RLAAQELDGVVVE-TDGGLFRAEDL--ARCRAARPARLDRV 408 Query: 319 IMGCD-IAEEGGDNT--VVVLRRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAII 373 ++ D A GD VVV RR L D + + DA++ Sbjct: 409 VVAVDPPATATGDACGIVVVGRRDDRAFVLADETARGLSPAGWAGRAVAAARAWTADALV 468 Query: 374 IDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHS 433 +AN G L RV + ++ L+ + L + + Sbjct: 469 AEANQGGDMVRSVLAQAD-PPCRVKLVRASLGKRARAEPVAALYEQGRV-LHCGAFVALE 526 Query: 434 GLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478 + L S G+L S D +D L++ +E Sbjct: 527 EELMALGS---------GDLE---------HSPDRADALVWAVSE 553 >gi|282598984|ref|YP_003358901.1| gp17 terminase DNA packaging enzyme large subunit [Deftia phage phiW-14] gi|257219054|gb|ACV50069.1| gp17 terminase DNA packaging enzyme large subunit [Deftia phage phiW-14] Length = 585 Score = 43.6 bits (101), Expect = 0.091, Method: Composition-based stats. Identities = 50/286 (17%), Positives = 95/286 (33%), Gaps = 39/286 (13%) Query: 130 KWLSLLPNKHWFEMQSLSLH-----PAPWYSDVLHCSLGIDSKHYSTMCRTYS-EERPDT 183 K ++L NK ++ + + P++ + + M + +S PDT Sbjct: 139 KNWAVLANKSSAALEVMDRYRVMFQELPYFMQIGAVRFNLAEVELENMSKVFSGTSDPDT 198 Query: 184 FVGHHNTYGMAIINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF 241 G G+ DE++ T + L+ + + I+TS P G FY+I+ Sbjct: 199 VRGK-ALNGIYW--DESAFTARDEEFWTSTFPVLSSGDTS-KAILTSTPNGARGVFYKIW 254 Query: 242 NKPLDD-------WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDID 294 + D + R + D ++ E I + G + E F Sbjct: 255 KESEDPNSDVYNGFARLAVPWYRHPRRDEAWKELSIRKIGPTK--FKQEHELSFL-GSSG 311 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIM--GCD------IAEEGG----DNTV-VVLRRGPV 341 IP +E P + I D IA+ GG D +V ++ + Sbjct: 312 CLIPPMTLERMGFINPLREDEHLKIFVEPVDDHKYIGIADSGGGVGADYSVCTIIDVTEI 371 Query: 342 IEHLFDWSKTD---LRTTNNKISGLVEKYRPDAIII-DANNTGART 383 + + + +I L Y ++I + N+ G + Sbjct: 372 PYRVVAKYRNNEIAPIVFPYQIVSLCGLYNDCPVLIENNNDVGGQV 417 >gi|260433583|ref|ZP_05787554.1| putative phage terminase, large subunit [Silicibacter lacuscaerulensis ITI-1157] gi|260417411|gb|EEX10670.1| putative phage terminase, large subunit [Silicibacter lacuscaerulensis ITI-1157] Length = 504 Score = 43.2 bits (100), Expect = 0.092, Method: Composition-based stats. Identities = 52/298 (17%), Positives = 86/298 (28%), Gaps = 54/298 (18%) Query: 64 AHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVL------WLMSTRPGISVICLANSE 117 ++ P V ++ GRG GK+TL+A L L W + I +A Sbjct: 33 NKFIDGAYGPGINVGVLSV--GRGNGKSTLSAILALGELVGAWSDA---KEREILIAAKT 87 Query: 118 TQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS 177 Q W V LP + + D ++ + R S Sbjct: 88 QQQAQICWHYVVSLSKTLPEDVQAAITIRR---------QPRFEIQFDDENGPHILRAIS 138 Query: 178 EERPDTFVGHHNTYGMAIINDEASGTP----DVINLGILGFLTERNANRFWIMT--SNPR 231 + T I DE P D + +L L++R+ I T SN Sbjct: 139 ADGKSAL----GTSPTLAILDERGHWPLAQGDELEAALLTGLSKRDGKALIISTSASNDM 194 Query: 232 RLSGKF--------YEIFNKPLDDWKRFQIDTR--TVEGID----------PSFHEGIIA 271 + Y ++P + + G I Sbjct: 195 HPFSLWLDREAPGVYRQEHRPEPGLPADDVASLIIANPGTKYGIGPSLKRLKDDAALAIE 254 Query: 272 RYGLDSDVTRVEVCGQFPQQDI-DSFIPL-NIIEEALNREPCPDPYAPLIMGCDIAEE 327 R G R+ + Q+D D I L + ++ + P P ++G D+ Sbjct: 255 RGGSALSRFRLLSRNERVQEDNRDILISLDDWLK--CETDALPPKSGPCVIGLDLGGS 310 >gi|154489097|ref|ZP_02029946.1| hypothetical protein BIFADO_02409 [Bifidobacterium adolescentis L2-32] gi|154083234|gb|EDN82279.1| hypothetical protein BIFADO_02409 [Bifidobacterium adolescentis L2-32] Length = 1055 Score = 43.2 bits (100), Expect = 0.092, Method: Composition-based stats. Identities = 46/242 (19%), Positives = 75/242 (30%), Gaps = 33/242 (13%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 +S E+ + + L D W + F + P P+E S ++ Q M+ Sbjct: 194 LSEEIESQISESKPLTD-AWLKLYEEDFKKYA----PQRPNRKPIEKTSQSQTIQPNAMQ 248 Query: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 V +N + I + G GKT L+A+ V Q+ Sbjct: 249 V--EALMNLAQLRKQGESRAIIVSATGTGKTYLSAFDV-------------------RQV 287 Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 K +++ + K Q + P S D K+ +T S R Sbjct: 288 KPNRMLYIAQ-QEQILKKAEESFQKVLGCPKSELGLFSGGSKESDRKYVFATVQTMS--R 344 Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG-KFYE 239 P+T I+ DE + N MT+ P R G +E Sbjct: 345 PETLAQFDADEFDYILVDE---VHHAAAESYKRVIDHFQPNFMLGMTATPERTDGANIFE 401 Query: 240 IF 241 +F Sbjct: 402 LF 403 >gi|87307615|ref|ZP_01089759.1| hypothetical protein DSM3645_28877 [Blastopirellula marina DSM 3645] gi|87289785|gb|EAQ81675.1| hypothetical protein DSM3645_28877 [Blastopirellula marina DSM 3645] Length = 429 Score = 43.2 bits (100), Expect = 0.093, Method: Composition-based stats. Identities = 52/348 (14%), Positives = 93/348 (26%), Gaps = 63/348 (18%) Query: 51 PRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLW---------- 100 P Q E + H + GR GKT + Sbjct: 26 PLPHQREILRDRHRHKR--------------VICGRRWGKTGAGLIAAILGHGDPSGPGH 71 Query: 101 LMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHC 160 G ++ +A + Q K E + V Sbjct: 72 WKGMVDGGTLYWVAPTFAQSKKI------------------ERDIMLAFANSGLVYVKSE 113 Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVIN-LGILGFLTERN 219 S +T + G +I DE + + + L++R Sbjct: 114 GRIEHPSGGSITIKTAAAPVSLRGEGLDG-----MIGDEFAFVRKEVWSDALRPALSDRR 168 Query: 220 ANRFWIMTSNPRRLSGKFYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARYGLDS 277 ++ T P + + + D +K +Q T ID + + + G S Sbjct: 169 GWSMFLTT--PNGPN-WMKDQHDLDGVDPTYKSWQCPTSDNCLIDQAELDSALLDLGQAS 225 Query: 278 DVTRVEVCGQFPQQDIDSFIPL--NIIEEALNREPCPDPYAPLIMGCDIAEEGGDNT--- 332 E QF F L + + P ++G D ++ D + Sbjct: 226 --FDQEYRAQFVDVSGAEFSGLYFQTPKFWFDDWPPESEIRFRVIGLDPSKGKNDKSDYS 283 Query: 333 ---VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDAN 377 ++ L I D + D+R K L + P +I++ N Sbjct: 284 AFVMLALAGDGQIYVDADIERRDVRKIAEKAFELCALFEPTGMIVETN 331 >gi|307275425|ref|ZP_07556567.1| phage uncharacterized protein [Enterococcus faecalis TX2134] gi|306507813|gb|EFM76941.1| phage uncharacterized protein [Enterococcus faecalis TX2134] Length = 418 Score = 43.2 bits (100), Expect = 0.098, Method: Composition-based stats. Identities = 45/305 (14%), Positives = 89/305 (29%), Gaps = 35/305 (11%) Query: 86 RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145 RG KTT A + LM P ++I L ++T + E+ ++ + + +F+ Sbjct: 52 RGSFKTTTLAIAIALLMVLFPNKNIIFLRKTDT---DVV--EIILQVAKVLSSKYFKTLV 106 Query: 146 LSLH--PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 +L+ + + + G H +I D+ Sbjct: 107 FALYGVELVLLKETTTEIDTNLKTSSRGTSQLLGMGIYASLTGKHAD---IVITDDIVNI 163 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI---FNKPLDDWKRF---QIDTRT 257 D ++ + N + G+F ++K K + D Sbjct: 164 KDRVSRA-----EREKTKLQYQELQNVKNRGGRFINTGTPWHKEDAISKMPNVKKFDCYE 218 Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317 ID + + + + + + F I+ N + Sbjct: 219 TGLIDKEQRKAL--QQSMTPSLFAANYELKHIADSESLFTAPTYID-NTNLIYNGVAH-- 273 Query: 318 LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII 373 D A GGD+T + + G +I W K +I L + Y+ Sbjct: 274 ----IDAAYGGGDSTAFTIFKEQKDGTIIGFGKKWQKHVDDCLP-EILQLHQYYQAGTFY 328 Query: 374 IDANN 378 + N Sbjct: 329 TETNG 333 >gi|219053375|ref|YP_002455734.1| phage terminase, large subunit, pbsx family protein [Borrelia afzelii ACA-1] gi|226234324|ref|YP_002775459.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi Bol26] gi|216752668|gb|ACJ73353.1| phage terminase, large subunit, pbsx family protein [Borrelia afzelii ACA-1] gi|226202138|gb|ACO37810.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi Bol26] Length = 396 Score = 43.2 bits (100), Expect = 0.098, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 74/217 (34%), Gaps = 36/217 (16%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 + RG GKT A + L + G + + + + ++ E+ + LS + +F + Sbjct: 26 SSRGTGKTYDIATVNLERKFSVDGGDTLAIRKKKNKTTQSIHKEILELLSRYNLRKFFNI 85 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196 + + ++R F G H+T + + Sbjct: 86 SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124 Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254 +EA+ ++ + E+ + M+SNP S Y+ + N+ + Sbjct: 125 LEEANQFSADDIEMLIPTMREQGGRIY--MSSNPVPKSHWLYKRYLANEDNPAVCIIKST 182 Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287 R ++ E + + Y + R+EV G+ Sbjct: 183 YRDNPFLNGGDVEAWLEKQKLAYHGNDIGFRIEVLGE 219 >gi|57234875|ref|YP_181104.1| hypothetical protein DET0357 [Dehalococcoides ethenogenes 195] gi|57225323|gb|AAW40380.1| hypothetical protein DET0357 [Dehalococcoides ethenogenes 195] Length = 441 Score = 43.2 bits (100), Expect = 0.10, Method: Composition-based stats. Identities = 44/291 (15%), Positives = 83/291 (28%), Gaps = 45/291 (15%) Query: 154 YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILG 213 + D+ G + S E VG NT + + DEA Sbjct: 71 FGDIYQTEGGYIIRLNQARAVFLSAEPSANVVG--NTAHLLLEVDEAQDVNQEKYSKEFK 128 Query: 214 FLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD------WKRFQIDTRTVEGIDPSFHE 267 + N ++ S EI + ++ + F+ D V +P++ Sbjct: 129 PM-GATTNVTTVLYGTTWDSSSLLEEIKRQNIEKEHKDGLKRHFRYDWEEVAAHNPAYLA 187 Query: 268 GIIARY---GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC---PDPYAPLIMG 321 ++ G + + + P + PC P+ + G Sbjct: 188 YALSEKDRLGENHPLFLTQYR-LLPVSGGGGMFSSEQLGLLKGSHPCQLYPENGKVYVAG 246 Query: 322 CDIAEE-----GGDNTVVVLRRGPVIEHLFD----------------------WSKTDLR 354 D+A E T V LRR ++ + + W Sbjct: 247 LDLAGEDVQSAADLPTAVNLRRDSIVLTIAELDYTFAKAPFNLPQVRLVCHCSWQGARHA 306 Query: 355 TTNNKISGLVEK-YRPDAIIIDANNTGARTCDYLEM-LGYHVYRVLGQKRA 403 K+ L+ K ++ + +DA G +L LG + + Q + Sbjct: 307 LLYEKLVELLGKVWKCRKVAVDATGLGQPVASFLRESLGSRILPFVFQPSS 357 >gi|331662794|ref|ZP_08363717.1| putative phage terminase [Escherichia coli TA143] gi|331061216|gb|EGI33180.1| putative phage terminase [Escherichia coli TA143] Length = 407 Score = 43.2 bits (100), Expect = 0.11, Method: Composition-based stats. Identities = 63/362 (17%), Positives = 107/362 (29%), Gaps = 55/362 (15%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140 AG G GKT + + PGI+ A + Q++ + EV+ L + Sbjct: 28 AGFGSGKTWVGCGGICKGTWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 87 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 + + + + CR S E+P T VG + DE Sbjct: 88 GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 128 Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248 P I+ + + + R I + YE F K + + Sbjct: 129 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 188 Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 Q T E + + ++ Y ++ + + GQF + + N Sbjct: 189 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLRGQFTNLTSGTVYH-QFDRKLNN 245 Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL---- 363 E P P+ +G D + VLR G + D I Sbjct: 246 CEEVEQPGEPIYIGMDFNVGKMAGIIHVLRLGLPCAVTEIINAYDTPDMIRIIKERFWLY 305 Query: 364 ----VEKYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412 K R I DA+ + A T D L+ G++V V+ + + Sbjct: 306 DGNDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 363 Query: 413 RT 414 Sbjct: 364 NA 365 >gi|269836053|ref|YP_003318281.1| hypothetical protein Sthe_0020 [Sphaerobacter thermophilus DSM 20745] gi|269785316|gb|ACZ37459.1| conserved hypothetical protein [Sphaerobacter thermophilus DSM 20745] Length = 497 Score = 43.2 bits (100), Expect = 0.11, Method: Composition-based stats. Identities = 58/385 (15%), Positives = 100/385 (25%), Gaps = 72/385 (18%) Query: 50 APRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP--G 107 PR +Q E M V A + I R GK A L+ +L++ G Sbjct: 29 RPRRYQAEPMRAVAAAVVARARGDRSHPADFGIVFSRQAGKDEALAQLIAYLLTLFQRAG 88 Query: 108 ISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSK 167 S++ + + L ++ + G + Sbjct: 89 GSIVVALPT-----------LRPQGILARDRLIERLTCERARALGLRP---RVQDGTIVR 134 Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMT 227 C S G T + ++ +E + + M Sbjct: 135 LGRAACHFVSAGPQSNARGQ--TASLLLVANECQDIRPERWDSVFAPMAASTDAVTLSMG 192 Query: 228 SNPRRLS---------------GKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR 272 + + +F P W Q+ + V + E IA Sbjct: 193 TVWTADTLLARQMRHLAALEAEDGRRRLFRVP---W---QVVAQEVPSYGR-YVERQIAL 245 Query: 273 YGLDSDVTRVEVC--------GQFPQQDIDSF----------IPLNIIEEALNREPCPDP 314 G D R E G FP +P E AL + + Sbjct: 246 LGADHPFIRTEYELLELDGQGGLFPPSRQGQMQGDHPPLTRAVPGE--EYALLLDVAGEE 303 Query: 315 YAPLIMG--CDIAEEGGDNTVVVLR--------RGPVIEHLFDWSKTDLRTTNNKISGLV 364 + G D A + V+R R V+ W+ + ++ L Sbjct: 304 EESVDPGRAYDPAARRDSTALTVVRVVHQDARPRYEVVRRYL-WTGVKHTALHAQLVDLA 362 Query: 365 EK-YRPDAIIIDANNTGARTCDYLE 388 +R +++DA GA +L Sbjct: 363 RHVWRARYVVVDATGVGAGLASFLR 387 >gi|296393586|ref|YP_003658470.1| terminase [Segniliparus rotundus DSM 44985] gi|296180733|gb|ADG97639.1| Terminase [Segniliparus rotundus DSM 44985] Length = 498 Score = 42.8 bits (99), Expect = 0.12, Method: Composition-based stats. Identities = 70/388 (18%), Positives = 118/388 (30%), Gaps = 70/388 (18%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQ E + H L + P A R GK+ L L LW + + G+ V Sbjct: 45 PWQ----EWLLTHALEVRPDGLPRFRTVIALAARQNGKSLLMIVLALWRVYVKGGVVVGT 100 Query: 113 ---LANSETQLKTTLWAEV----------------------SKWLSLLPNKHWFEMQSLS 147 LANSE W E K L L + + Sbjct: 101 AQDLANSE-----KAWGEAVELAEGTPELASEVLHVDKTNGKKSLRLHSGAQYRIAAASR 155 Query: 148 LHPAPWYSDVLHCSLGIDSKH---YSTMCRTYSEERPDTF------VGHHNTYGMAII-- 196 + +D++ + + ++ + +T + RPD G H + +A + Sbjct: 156 RGARGFTADLILLDELREHQSFDSWAAVTKT-TMARPDAQVWCLSNAGDHLSVVLAHLRN 214 Query: 197 -----NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF 251 D G PD + + A + G K W + Sbjct: 215 IAHRQLDWPDGKPDHVEDQAP----DDEAEDDSVGIFEWSAPPG----CDPKDRHAWAQA 266 Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDS-DVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 V + + I + Y D V EV Q+P P + + Sbjct: 267 N-PALGVTITERA----IASAYATDPAPVFAAEVLCQWPLTVTPGPFPPGSWDSTRDDNS 321 Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS--GLVEKYR 368 +P ++G D+A +TV + G + + T R ++ ++ + + Sbjct: 322 TIATDSPRVVGLDMAWN--RSTVTLALAGRRDDGMAHVEITAQRAGSDWVAPWLAERREK 379 Query: 369 PDAIIIDANNTGA-RTCDYLEMLGYHVY 395 A+I+ AN A LE G V Sbjct: 380 IAAVIVQANGAPASSLVADLEAAGLPVI 407 >gi|114569469|ref|YP_756149.1| hypothetical protein Mmar10_0918 [Maricaulis maris MCS10] gi|114339931|gb|ABI65211.1| protein of unknown function DUF264 [Maricaulis maris MCS10] Length = 450 Score = 42.8 bits (99), Expect = 0.12, Method: Composition-based stats. Identities = 66/409 (16%), Positives = 113/409 (27%), Gaps = 59/409 (14%) Query: 84 AGRGIGKTTLNA-WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFE 142 GRG GKT A W+ + T + + + ++ + + + Sbjct: 67 GGRGAGKTRAGAEWVRHRALRTV--SRIALVGPTFNDVREVM----------IEGPSGLK 114 Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASG 202 ++ + + S+ Y+ +S E D G Y DE + Sbjct: 115 HLGSAMERPRYEASRKRLVFPSGSQAYA-----FSAEDADGLRGPQFDYA---WGDEFAA 166 Query: 203 TPDV---INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE 259 PD ++ +G T P + ++ Q Sbjct: 167 WPDPQRVLDTLRMGVRLGGAPRILLTTTPRPIPALKALVKAWDPRGPIRVTHQPTAANAA 226 Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI 319 + P F E + A YG S + R EV G + IE A ++ Sbjct: 227 NLAPGFVEALNAAYG-GSMLGRQEVEGLLIDDPDGALWTRPKIEAARLAAGQMPELDRIV 285 Query: 320 MGCDIAEEGG---DNT--VVVLRRGP------VIEHLFDWSKTDLRTTNNKISGLVEKYR 368 + D GG D VV G V+ + + + + Y Sbjct: 286 VALDPPATGGPRSDECGIVVAGAHGEGPARIAVVLADLSFGPALPADWAARAASAFDDYS 345 Query: 369 PDAIIIDANNTGARTCDYLEML--GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF 426 DA+I +AN G L+ G V V + R E + Sbjct: 346 ADALIAEANQGGEMVRSVLQAAAPGLPVRLVHASRGKR-------ARAEPVAALYAAGRV 398 Query: 427 ASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 L + + F P+ + S D D L++ Sbjct: 399 RHARPFPALEDQMCA---FGAPDGPK-----------SSPDRVDALVWA 433 >gi|308188181|ref|YP_003932312.1| Terminase, ATPase subunit (GpP) [Pantoea vagans C9-1] gi|308058691|gb|ADO10863.1| Terminase, ATPase subunit (GpP) [Pantoea vagans C9-1] Length = 190 Score = 42.8 bits (99), Expect = 0.12, Method: Composition-based stats. Identities = 18/81 (22%), Positives = 30/81 (37%), Gaps = 9/81 (11%) Query: 317 PLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367 + +G D A+ G VV+ G L W D R + I L ++Y Sbjct: 9 EVWIGYDPAKGTQNGDSAGCVVMAPPAVPGGKFRILERHQWRGMDFRAQADAIRTLTQQY 68 Query: 368 RPDAIIIDANNTGARTCDYLE 388 I ID+ + G + ++ Sbjct: 69 NVTYIGIDSTSVGLGVYENVK 89 >gi|58040880|ref|YP_192844.1| Phage DNA packaging protein [Gluconobacter oxydans 621H] gi|58003294|gb|AAW62188.1| Phage DNA Packaging Protein [Gluconobacter oxydans 621H] Length = 435 Score = 42.8 bits (99), Expect = 0.13, Method: Composition-based stats. Identities = 63/436 (14%), Positives = 119/436 (27%), Gaps = 87/436 (19%) Query: 83 SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFE 142 G GKT L V+ PG ++ KH Sbjct: 26 RGGSRSGKTFLLVRAVVIRAVKAPGSR------------HGIFR-----HRFNALKHTII 68 Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY--------SEERPDTFVGHHNTYGMA 194 + + D+ + D Y T+ S +R + +G Sbjct: 69 GDTFPKVMRLCFPDLPYTLNRTD--WYVTLPNGSEILFHGLDSSDRTEKILGL---EFAT 123 Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMT-SNPRRLSGKFYEIFNKPL-------- 245 + +EAS +L L ++ +NP S Y +F + + Sbjct: 124 VYMNEASQISYAARNMLLTRLAQKTCLSVKEYIDANPPTTSHWLYSLFEQKIEPKSGEPL 183 Query: 246 ---DDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDIDSFIPLN 300 DD+ QI+ + + P + + A + R G + + L+ Sbjct: 184 PYPDDYATMQINPDSNRANLSPEYLAQLEAL----PEKERQRFLFGNYQTAIDGALWTLD 239 Query: 301 IIEEALN-----REPCPDPYAPLIMGCDIAE------EGGDNTVVVL----RRGPVIEHL 345 I R +++ D + D + + R G Sbjct: 240 RIRRLAQVTNETRAAVLADMRRIVVSVDPSGCSGNEDYKSDEIGISVCGIDRDGNGHVFA 299 Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405 + ++ + D I+ + N GA R + V Sbjct: 300 DLTCRAGPAGWAKVAIDAMDLWGADRIVAEKNFGGAMV-----EQTIRSVRATAPVKLVT 354 Query: 406 LEFCRNRRTELHVKMADWLEFASLINH---SGLIQNLKSLKSFIVPNTGELAIESKRVKG 462 + R E +A E + +H L + L +G +G Sbjct: 355 ASRGKTARAE---PIAALYEQGKVFHHGRFPDLEEQLC-----QFSASGF--------QG 398 Query: 463 AKSTDYSDGLMYTFAE 478 A+S D +D +++ +E Sbjct: 399 ARSPDRADSMVWGLSE 414 >gi|29374972|ref|NP_814125.1| hypothetical protein EF0333 [Enterococcus faecalis V583] gi|29342430|gb|AAO80196.1| conserved hypothetical protein TIGR01630 [Enterococcus faecalis V583] Length = 418 Score = 42.8 bits (99), Expect = 0.13, Method: Composition-based stats. Identities = 45/305 (14%), Positives = 89/305 (29%), Gaps = 35/305 (11%) Query: 86 RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145 RG KTT A + LM P ++I L ++T + E+ ++ + + +F+ Sbjct: 52 RGSFKTTTLAIAIALLMVLFPNKNIIFLRKTDT---DVV--EIILQVAKVLSSKYFKTLV 106 Query: 146 LSLH--PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 +L+ + + + G H +I D+ Sbjct: 107 FALYGVELVLLKETTTEIDTNLKTSTRGTSQLLGMGIYASLTGKHAD---IVITDDIVNI 163 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI---FNKPLDDWKRF---QIDTRT 257 D ++ + N + G+F ++K K + D Sbjct: 164 KDRVSRA-----EREKTKLQYQELQNVKNREGRFINTGTPWHKEDAISKMPNVKKFDCYE 218 Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317 ID + + + + + + F I+ N + Sbjct: 219 TGLIDKEQRKAL--QQSMTPSLFAANYELKHIADSESLFTAPTYID-NTNLIYNGVAH-- 273 Query: 318 LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII 373 D A GGD+T + + G +I W K +I L + Y+ Sbjct: 274 ----IDAAYGGGDSTAFTIFKEQKDGTIIGFGKKWQKHVDDCLP-EILQLHQYYQAGTFY 328 Query: 374 IDANN 378 + N Sbjct: 329 TETNG 333 >gi|59712621|ref|YP_205397.1| terminase, ATPase subunit [Vibrio fischeri ES114] gi|59480722|gb|AAW86509.1| terminase, ATPase subunit [Vibrio fischeri ES114] Length = 588 Score = 42.8 bits (99), Expect = 0.14, Method: Composition-based stats. Identities = 29/243 (11%), Positives = 64/243 (26%), Gaps = 35/243 (14%) Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 ID G + + +Y D V + F SF + + Sbjct: 332 ITIDDAIEGGATFFNMDKLRRKY-PDKSVFNNLLRCVFLDDAS-SFFSIKSLLACKTDTD 389 Query: 311 CPDPYA----------PLIMGCDIAEEG-----GDNTVVV----LRRGPVIEHLFDWS-- 349 +++G D G D ++V + +G + Sbjct: 390 NWKDVDLESLHPVGRREVLVGYDPRGGGQGEGADDAGLIVSLKPIIKGGAFRFIERVRLK 449 Query: 350 KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409 + I + +KY + ID G+ + + L++ Sbjct: 450 GSSYEDQAAAIEAICKKYNVVYLAIDVGGVGSAVAELVR---------KFYPGLTTLDYS 500 Query: 410 RNRRTELHVKMADWLEFASLINH---SGLIQNLKSLKSFIVPNTGELAIESKRVKGAKST 466 + + K + + L L+ + ++ + ++ S R K Sbjct: 501 PEMKRMMAYKAREIINAGRLQFDNEWDDLVHSFLMIRQHTTKMSNQITFVSARNKVGSHA 560 Query: 467 DYS 469 D + Sbjct: 561 DLA 563 >gi|327198111|ref|YP_004306641.1| terminase large subunit [Enterococcus phage EFRM31] gi|297179206|gb|ADI23907.1| terminase large subunit [Enterococcus phage EFRM31] Length = 574 Score = 42.8 bits (99), Expect = 0.14, Method: Composition-based stats. Identities = 48/295 (16%), Positives = 92/295 (31%), Gaps = 52/295 (17%) Query: 76 EVFKGAISAGRGIGKTTLNAWLVLW-LMST-RPGIS--VICLANSETQLKTTLWAEVSKW 131 K IS R GK+ L A + L+ + P S ++ AN++ Q Sbjct: 97 RFRKVYISLARKNGKSILVAGISLYEFLLGQYPQASRQIVAAANTKDQ------------ 144 Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191 ++ N ++++L I+ + + S + D+ G Sbjct: 145 AGIVFNMLKSQLKALRAVSDGTRKVTKVNKKDIEHLEDESTVKPLSSDA-DSLDGLDVLC 203 Query: 192 GMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD---- 247 G+ EA T + + +++ I+++ + L+G + I + Sbjct: 204 GVLDEYGEAKSTA--MIEVLESSQSQQLQGLILIISTTTKNLNGPMHSIEYPFITKLLNE 261 Query: 248 -----------WKRFQIDTRTVE---------GIDPSFHEGI-------IARYGLDSDV- 279 W+ + E + HE + +A Y D+ Sbjct: 262 EVEADAYLALCWEMDSLSEVDDEANWIKSNPLFENAQLHETMYEHKVNSLAEYKAKGDMS 321 Query: 280 -TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTV 333 + + Q DSFI E +P P+ +G D+A G V Sbjct: 322 GWLTKEMNFWVQSSQDSFIDKESWEAVKQTQPYDIKGRPVYIGLDLARTGDMTAV 376 >gi|238694889|ref|YP_002922083.1| Dda DNA helicase [Enterobacteria phage JSE] gi|220029025|gb|ACL77960.1| Dda DNA helicase [Enterobacteria phage JSE] Length = 463 Score = 42.8 bits (99), Expect = 0.14, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 53/177 (29%), Gaps = 34/177 (19%) Query: 57 EFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANS 116 + + H + + + + G GKT + ++V R G++ + LA Sbjct: 8 DMLTDGQKHAFDVLMKRIEQKKHTTVRGAAGTGKTAMMKFIVQ--EMVRRGVTGVVLATP 65 Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY 176 Q K L V + + LH L ++ +Y Sbjct: 66 THQAKKVLSKAVGR-----------------------QAFTLHALLRLNPTNYEDTQVFE 102 Query: 177 SEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRL 233 ++ P II DEAS + + + N I +P +L Sbjct: 103 QKDTPKL------DDVQIIIVDEASMVDKKLFDIL---MKSINGRIVIIAVGDPHQL 150 >gi|157311312|ref|YP_001469355.1| Dda DNA helicase [Enterobacteria phage Phi1] gi|149380516|gb|ABR24521.1| Dda DNA helicase [Enterobacteria phage Phi1] Length = 463 Score = 42.8 bits (99), Expect = 0.14, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 53/177 (29%), Gaps = 34/177 (19%) Query: 57 EFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANS 116 + + H + + + + G GKT + ++V R G++ + LA Sbjct: 8 DMLTDGQKHAFDVLMKRIEQKKHTTVRGAAGTGKTAMMKFIVQ--EMVRRGVTGVVLATP 65 Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY 176 Q K L V + + LH L ++ +Y Sbjct: 66 THQAKKVLSKAVGR-----------------------QAFTLHALLRLNPTNYEDTQVFE 102 Query: 177 SEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRL 233 ++ P II DEAS + + + N I +P +L Sbjct: 103 QKDTPKL------DDVQIIIVDEASMVDKKLFDIL---MKSINGRIVIIAVGDPHQL 150 >gi|49474625|ref|YP_032667.1| phage related protein [Bartonella quintana str. Toulouse] gi|49240129|emb|CAF26575.1| phage related protein [Bartonella quintana str. Toulouse] Length = 402 Score = 42.8 bits (99), Expect = 0.14, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 63/191 (32%), Gaps = 11/191 (5%) Query: 194 AIINDEASGTPDVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFNK-PLDDWKR 250 DEA + ++ L E N +T NP R + + F + K Sbjct: 86 LCWVDEAEPVTETAWQTLIPTLREEGKDWNAELWVTWNPCRENAPVEKRFRNVENPNIKG 145 Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 +I+ R + A + G++ Q ++ ++E L Sbjct: 146 AEINWRDNPLFPQKLNRDRKADLQQRPENYNHIWEGEYLQSVQGAYYQKALLEAELEGRI 205 Query: 311 CPDPYAP---LIMGCDIAEEG--GDNTVVVLRR--GPVIEHLFDWSKTDLRTTNNKISGL 363 P P + + DI G D T + + + G I L D+ + + + + Sbjct: 206 TNVPRDPLMQIKIFWDIGGTGAKADATALWVAQFIGREIRVL-DYYEAQGQPLAEHVGWV 264 Query: 364 VEKYRPDAIII 374 ++ A+++ Sbjct: 265 FQRGYEKALMV 275 >gi|294011207|ref|YP_003544667.1| hypothetical protein SJA_C1-12210 [Sphingobium japonicum UT26S] gi|292674537|dbj|BAI96055.1| conserved hypothetical protein [Sphingobium japonicum UT26S] Length = 437 Score = 42.8 bits (99), Expect = 0.14, Method: Composition-based stats. Identities = 64/406 (15%), Positives = 114/406 (28%), Gaps = 61/406 (15%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 AGRG GKT A V + + P + + + + + + E Sbjct: 58 AGRGFGKTRAGAEWVRSVAESDPKARIALVGATLGEARAVM----------------VEG 101 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKH-YSTMCRTYSEERPDTFVGHHNTYGMAIINDE--- 199 S L APW++ + + Y ++ G ++G DE Sbjct: 102 ASGILAVAPWWNRPVFAPALRKLVWPNGAVATLYGAAEAESLRGPQFSHG---WADEIAK 158 Query: 200 -ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTV 258 A G + ++G T P L + D V Sbjct: 159 WAGGQA-AWDNLMMGMRLGGAPRVLATTTPRPVPLVRGL--VARAGGDVVVTRGRTADNV 215 Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL 318 + F + YG + R E+ G+ ++ + +++E + Sbjct: 216 AHLADGFLAAMERSYGGT-RLGRQELDGELIEEVEGALWSRDLLERCRVAHVRG-GLTRV 273 Query: 319 IMGCD-IAEEGGDNT---VVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYRPDAI 372 ++ D A GD VV L + D + SG + D + Sbjct: 274 VVAVDPPASAHGDACGIVVVGLGEDRRAYVIADATVEGATPEGWARAASGAALVHGADRV 333 Query: 373 IIDANNTGARTCDYLE--MLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW---LEFA 427 + +ANN GA L V V + V R E + + + Sbjct: 334 VAEANNGGAMVESVLRAAEAALPVRLVHASRGKV-------ARAEPVAALYEAGRVVHRG 386 Query: 428 SLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLM 473 + L ++ P +S D +D L+ Sbjct: 387 GFAELEDQLCGLMLGGGYVGP--------------GRSPDRADALV 418 >gi|315181719|gb|ADT88632.1| terminase, ATPase subunit [Vibrio furnissii NCTC 11218] Length = 574 Score = 42.8 bits (99), Expect = 0.14, Method: Composition-based stats. Identities = 36/244 (14%), Positives = 72/244 (29%), Gaps = 37/244 (15%) Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 ID +G E + +Y D V + F F ++ A + Sbjct: 318 ITIDDAIEKGATFFNMEKLRRKY-PDKTVFDNLLRCVFLDDSASIFALKALL--ACKTDS 374 Query: 311 -----------CPDPYAPLIMGCDI----AEEGGDNTVVVL-----RRGPVIEHLFDWS- 349 P A +++G D EG D+ +V+ R+G V + Sbjct: 375 SLWKDVDHNKARPAGNAEVLVGYDPRGGGQGEGSDDAGLVVALKPKRKGGVFRLIERARL 434 Query: 350 -KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEF 408 + I + EKY + ID + G+ + + + + Sbjct: 435 KGSSYEQQALAIKAMTEKYNVVHLAIDVSGVGSAVAELVRKFYPSLIELDYSPEV----- 489 Query: 409 CRNRRTELHVKMADWLEFASLINH---SGLIQNLKSLKSFIVPNTGELAIESKRVKGAKS 465 +R + K + + L L+ + ++ + ++ S R K Sbjct: 490 ---KRM-MVYKAREIINDGRLQFDGEWDDLVHSFLMIRQQTTKASNQVTFISNRSKVGSH 545 Query: 466 TDYS 469 D + Sbjct: 546 ADLA 549 >gi|260769184|ref|ZP_05878117.1| terminase ATPase subunit [Vibrio furnissii CIP 102972] gi|260614522|gb|EEX39708.1| terminase ATPase subunit [Vibrio furnissii CIP 102972] Length = 574 Score = 42.8 bits (99), Expect = 0.15, Method: Composition-based stats. Identities = 36/244 (14%), Positives = 72/244 (29%), Gaps = 37/244 (15%) Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 ID +G E + +Y D V + F F ++ A + Sbjct: 318 ITIDDAIEKGATFFNMEKLRRKY-PDKTVFDNLLRCVFLDDSASIFALKALL--ACKTDS 374 Query: 311 -----------CPDPYAPLIMGCDI----AEEGGDNTVVVL-----RRGPVIEHLFDWS- 349 P A +++G D EG D+ +V+ R+G V + Sbjct: 375 SLWKDVDHNKARPAGNAEVLVGYDPRGGGQGEGSDDAGLVVALKPKRKGGVFRLIERARL 434 Query: 350 -KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEF 408 + I + EKY + ID + G+ + + + + Sbjct: 435 KGSSYEQQALAIKAMTEKYNVVHLAIDVSGVGSAVAELVRKFYPSLIELDYSPEV----- 489 Query: 409 CRNRRTELHVKMADWLEFASLINH---SGLIQNLKSLKSFIVPNTGELAIESKRVKGAKS 465 +R + K + + L L+ + ++ + ++ S R K Sbjct: 490 ---KRM-MVYKAREIINDGRLQFDGEWDDLVHSFLMIRQQTTKASNQVTFISNRSKVGSH 545 Query: 466 TDYS 469 D + Sbjct: 546 ADLA 549 >gi|240137990|ref|YP_002962462.1| hypothetical protein MexAM1_META1p1321 [Methylobacterium extorquens AM1] gi|240007959|gb|ACS39185.1| conserved hypothetical protein [Methylobacterium extorquens AM1] Length = 421 Score = 42.8 bits (99), Expect = 0.15, Method: Composition-based stats. Identities = 63/296 (21%), Positives = 100/296 (33%), Gaps = 43/296 (14%) Query: 56 LEFMEVVDAHCLNSVNNPNPEVFKG-AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114 L +E H P P + A+ GRG GKT A W+ G V Sbjct: 9 LRLLEADWLHLARHDQLPPPGNWTTWAVIGGRGSGKTRTGA---EWVRGLAQGDPVFTPE 65 Query: 115 NSET-QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173 E L +A+V + P+ + L P W G + Sbjct: 66 PVERIALVGETFADVRDVMIEGPSG-LLALPRLGGAPPVWQPSRRRVMFGN-----GAVA 119 Query: 174 RTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN-PRR 232 +S E PD+ G A+ +DE + T + +F + PR Sbjct: 120 LAFSAEEPDSLRG---PQFGAVWSDEVAK-----WREAE---TTYDMIQFGLRLGTHPRG 168 Query: 233 LSGKFYEIFNKPLDDWKRFQIDTRTV----------EGIDPSFHEGIIARYGLDSDVTRV 282 L +P+ +R D RTV + + PSF E ++ RY + + R Sbjct: 169 LVT----TTPRPVPLIQRLLADPRTVVTRSRTADNAQNLAPSFLEEVVGRY-AGTRLGRQ 223 Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDI---AEEGGDNTVVV 335 E+ G+ + D+ + IE A E P + + + D + G D +V Sbjct: 224 ELDGELIEDRPDALWTRDSIERARVFEAPPLQH--IAVAIDPPASSGVGADACGIV 277 >gi|213865421|ref|ZP_03387540.1| probable terminase subunit [Salmonella enterica subsp. enterica serovar Typhi str. M223] Length = 85 Score = 42.8 bits (99), Expect = 0.15, Method: Composition-based stats. Identities = 11/44 (25%), Positives = 15/44 (34%) Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTC 384 I W D R + I L ++Y I ID+ G Sbjct: 17 RILERHQWRGMDFRAQADAIKKLTQQYNVTYIGIDSTGVGHGVY 60 >gi|213586958|ref|ZP_03368784.1| probable terminase subunit [Salmonella enterica subsp. enterica serovar Typhi str. E98-0664] Length = 67 Score = 42.8 bits (99), Expect = 0.15, Method: Composition-based stats. Identities = 11/44 (25%), Positives = 15/44 (34%) Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTC 384 I W D R + I L ++Y I ID+ G Sbjct: 16 RILERHQWRGMDFRAQADAIKKLTQQYNVTYIGIDSTGVGHGVY 59 >gi|213162921|ref|ZP_03348631.1| probable terminase subunit [Salmonella enterica subsp. enterica serovar Typhi str. E00-7866] Length = 113 Score = 42.8 bits (99), Expect = 0.15, Method: Composition-based stats. Identities = 11/44 (25%), Positives = 15/44 (34%) Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTC 384 I W D R + I L ++Y I ID+ G Sbjct: 16 RILERHQWRGMDFRAQADAIKKLTQQYNVTYIGIDSTGVGHGVY 59 >gi|291484314|dbj|BAI85389.1| hypothetical protein BSNT_02825 [Bacillus subtilis subsp. natto BEST195] Length = 577 Score = 42.8 bits (99), Expect = 0.15, Method: Composition-based stats. Identities = 43/278 (15%), Positives = 88/278 (31%), Gaps = 43/278 (15%) Query: 89 GKTTLNAWLVLWLMSTRPGIS----VICLANSETQLKTTLWAEVSKWLSLLPNKHWF--E 142 GK+ L A L L+ + + ANS Q KT + +S L + +K F + Sbjct: 112 GKSVLVAGLSLYELIYGEAPKFDRQIYATANSRGQAKTV-FKMISMQLKKIRSKSKFMRK 170 Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKH--------------YSTMCRTYSEERPDTFVGHH 188 + + + D Y T T E ++ G Sbjct: 171 WTKIIQNEIRYLKDDCVIMPLSRDTDNLDSLNVLIGILDEYHTASNTKMMEVLESSQGQQ 230 Query: 189 NTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD- 247 + + II+ +G + G + + + S + F ++ + ++ Sbjct: 231 DQGLILIIS--TAGFK------LNGPMYSQEYPYVDDILSGRKENENYFAIVYEQDDEEE 282 Query: 248 ------WKRFQIDTRTVEGIDPSFHEGIIARYGL-----DSDVTRVEVCGQFPQQDIDSF 296 W + VEG+ + + + D + T V+ + +SF Sbjct: 283 IYDESTWIKSN-PLLEVEGLQKKILKNLRKKLKEALDKDDLNGTLVKNFNIWQSASSESF 341 Query: 297 IPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVV 334 I N ++ P+ +G D++ D + + Sbjct: 342 INGNDWKKRGVDVAPDITGKPVYIGIDLSRT-DDLSAL 378 >gi|253583367|ref|ZP_04860565.1| helicase [Fusobacterium varium ATCC 27725] gi|251833939|gb|EES62502.1| helicase [Fusobacterium varium ATCC 27725] Length = 1624 Score = 42.4 bits (98), Expect = 0.16, Method: Composition-based stats. Identities = 35/287 (12%), Positives = 81/287 (28%), Gaps = 24/287 (8%) Query: 179 ERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFY 238 P F G + D G + G G + + N R+ + S Sbjct: 1262 GNPSHFQGDERDVVFLSMVDSNDGVGPLAMKG-EG-IEDSNKKRYNVAVS---------- 1309 Query: 239 EIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298 W +D + + Y D +E + +++ DS Sbjct: 1310 ---RAKDQLWIVHSLDMAN--DLKKGDIRRGLLEYSEDPKAFMIE---ESVKKNSDSVFE 1361 Query: 299 LNIIEEALNREPCPDPYAPL-IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLR--T 355 + + R + D+ + V + G + K D+ Sbjct: 1362 EEVAKYLYARGYNIIQQWEVGAYRIDMVAFFENKRVAIECDGERWHSTEEQVKQDIERQD 1421 Query: 356 TNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTE 415 + + R + +T + LE G + + + + E N+ Sbjct: 1422 ILERCGWDFIRIRGSRYFRNPEDTMKEVLEKLEKKGIYPEKTKSENYEIREEELLNKIKS 1481 Query: 416 LHVKMADWLEFASLINHSGLIQNLKSLKSFIVP-NTGELAIESKRVK 461 ++ + + I + + + +++ + L IE+ ++K Sbjct: 1482 RSFEIMELWKEQGNIEEIEITKEVNNIEDKEIKIPELVLKIENSKIK 1528 >gi|325848842|ref|ZP_08170352.1| putative phage terminase, large subunit [Anaerococcus hydrogenalis ACS-025-V-Sch4] gi|325480486|gb|EGC83548.1| putative phage terminase, large subunit [Anaerococcus hydrogenalis ACS-025-V-Sch4] Length = 462 Score = 42.4 bits (98), Expect = 0.16, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 67/205 (32%), Gaps = 24/205 (11%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRL--SGKFYEIFNK------PL 245 +I DEA D + +T N IM P SG + F K P Sbjct: 154 LLIIDEAQEYTDDQESALKYTVTSS-KNPQTIMCGTPPTPISSGMVFVNFRKQCLTSRPN 212 Query: 246 DDWKRFQI--------DTRTVEGIDPSF-----HEGIIARYGLDSDVTRVEVCGQFPQQD 292 + + D+ +PS I G D ++ G + + Sbjct: 213 NAYWAEWSVPEMSDIHDSELWYKTNPSLGTIFTERSIEDEIGSDETDFNIQRLGLWISYN 272 Query: 293 IDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGP-VIEHLFDWSKT 351 S I + L + P + +G +G + ++ V + + + Sbjct: 273 QKSAI-TEKEWQRLKLKSLPILTGEMHVGIKFGNDGTNVSLAVACKTLSKMIFIEAIDCQ 331 Query: 352 DLRTTNNKISGLVEKYRPDAIIIDA 376 ++R +N I + K +P +++ID Sbjct: 332 NVRNGDNWIIDFLVKTKPKSVVIDG 356 >gi|254560550|ref|YP_003067645.1| hypothetical protein METDI2093 [Methylobacterium extorquens DM4] gi|254267828|emb|CAX23679.1| conserved hypothetical protein [Methylobacterium extorquens DM4] Length = 421 Score = 42.4 bits (98), Expect = 0.16, Method: Composition-based stats. Identities = 62/296 (20%), Positives = 98/296 (33%), Gaps = 43/296 (14%) Query: 56 LEFMEVVDAHCLNSVNNPNPEVFKG-AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114 L +E H P P + A+ GRG GKT A W+ G V Sbjct: 9 LRLLEADWLHLARHDQLPPPGNWTTWAVIGGRGSGKTRTGA---EWVRGLAYGDPVFSPE 65 Query: 115 NSET-QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173 E L +A+V + P+ + L P W G + Sbjct: 66 PVERIALVGETFADVRDVMIEGPSG-LLALPRLGGAPPVWQPSRRRVVFGN-----GAVA 119 Query: 174 RTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN-PRR 232 +S E PD+ G A+ +DE + + +F + PR Sbjct: 120 LAFSAEEPDSLRG---PQFGAVWSDEVAK-----WREAEAT---YDMIQFGLRLGTHPRG 168 Query: 233 LSGKFYEIFNKPLDDWKRFQIDTRTV----------EGIDPSFHEGIIARYGLDSDVTRV 282 L +P+ +R D RTV + + PSF E ++ RY + + R Sbjct: 169 LVT----TTPRPVPLIRRLLADPRTVVTRSRTADNAQNLAPSFLEEVVGRY-AGTRLGRQ 223 Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDI---AEEGGDNTVVV 335 E+ G+ + D+ + IE A R P + + D + G D +V Sbjct: 224 ELDGELIEDRPDALWTRDSIERA--RVSEVPPLQRIAVAIDPPASSRVGADACGIV 277 >gi|224797098|ref|YP_002642985.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi CA-11.2a] gi|224554508|gb|ACN55891.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi CA-11.2a] Length = 396 Score = 42.4 bits (98), Expect = 0.16, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 + RG GKT A + L + G + + + + ++ E+ + LS+ + +F + Sbjct: 26 SSRGTGKTYDIATVNLERKFSADGGDTLAIRKKKNKTTQSIHKEILELLSIYNLRKFFNI 85 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196 + + ++R F G H+T + + Sbjct: 86 SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124 Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254 +EA+ ++ + E+ + M+SNP S Y+ + N+ + Sbjct: 125 LEEANQFSSDDIEMLIPTMREQGGRIY--MSSNPVPKSHWLYKRYLSNQDNPAVCIIKST 182 Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287 R ++ + + + Y + R+EV G+ Sbjct: 183 YRDNPFLNGGDVQAWLEKQKLAYHGNDIGFRIEVLGE 219 >gi|168029927|ref|XP_001767476.1| predicted protein [Physcomitrella patens subsp. patens] gi|162681372|gb|EDQ67800.1| predicted protein [Physcomitrella patens subsp. patens] Length = 1075 Score = 42.4 bits (98), Expect = 0.16, Method: Composition-based stats. Identities = 22/134 (16%), Positives = 46/134 (34%), Gaps = 10/134 (7%) Query: 80 GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKH 139 A++A RG GK+ + ++ A S LKT L+ + K + K Sbjct: 279 VALTAARGRGKSAALGVAIA-GAVAFGYSNIFVTAPSPENLKT-LFEFIFKGFDAMEYKE 336 Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + + + + ++ ++ + + E+ ++ DE Sbjct: 337 HIDYDLVESTNSAFNKAIVRVNIFRQHRQTIQYIQPKDHEKLAQAE--------LLVIDE 388 Query: 200 ASGTPDVINLGILG 213 A+ P I +LG Sbjct: 389 AAAIPLPIVKALLG 402 >gi|320590344|gb|EFX02787.1| dead deah box DNA helicase [Grosmannia clavigera kw1407] Length = 2423 Score = 42.4 bits (98), Expect = 0.17, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 52/166 (31%), Gaps = 25/166 (15%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W RPG V+ +A L E + W L Sbjct: 1194 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAP-----MKALVRERIKDWGRRLAGPAGLR 1248 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA- 200 + L+ P + + + + + + R++ G+ + II DE Sbjct: 1249 LVELTGDNTPDTRTIGEADVIVTTPEKWDGISRSWQT------RGYVRKVSLVII-DEIH 1301 Query: 201 --SGTPDVINLGI------LGFLTERNANRFWI--MTSNPRRLSGK 236 +G I I +G T + + +N L+ Sbjct: 1302 LLAGDRGPILEIIVSRMNYIGAATGSSVRLLGMSTACANATDLASW 1347 >gi|66395738|ref|YP_240074.1| ORF009 [Staphylococcus phage 37] gi|62636161|gb|AAX91272.1| ORF009 [Staphylococcus phage 37] Length = 419 Score = 42.4 bits (98), Expect = 0.17, Method: Composition-based stats. Identities = 53/375 (14%), Positives = 122/375 (32%), Gaps = 43/375 (11%) Query: 57 EFMEVVDAHCLNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLAN 115 + E++ H + + + + GRG GK++ A +++ L+ R ++ + L Sbjct: 4 KLSELIPEHFHSLWHAAKDKGKLNIVAKGGRGSGKSSDIAIIIV-LLIMRYPVNALILRK 62 Query: 116 SETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT 175 + L +++ ++ ++++ H F+++ + + + R Sbjct: 63 IDNTLALSVFEQIKWAINVMGVSHLFKIKVS------------PMEITYVPRGNKMVFRG 110 Query: 176 YSEERPDTFVGHHN---TYGMAIINDEASGTPDVINLGILGFL----TERNANRFWIMTS 228 + P+ + Y +A I + A + I L + + T Sbjct: 111 A--QNPERIKSLKDAQFPYAIAWIEELAEFKTEDEVTTITNSLLRGELDNGLFYKFFYTY 168 Query: 229 NPRRLSGKF----YEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV 284 NP + + YE +P + + I F E A ++ R E Sbjct: 169 NPPKRKQSWVNKKYESSFQPDNTFVHHS-TYLNNPFIAKEFIEEAKAAKAINELRYRWEY 227 Query: 285 CGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGP 340 G+ +P N + + D + + D D V ++ Sbjct: 228 LGEAIGS---GVVPFNNLRIETIPKEQFDTFDNIRNAVDFG-YATDPLAFVRWHYDKKKR 283 Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQ 400 +I + + + + Y+ D I D+ ++ L+ + + R+ G Sbjct: 284 IIYAVDEHYGVQISNREFANWLKKKGYQSDEIYADS--AEPKSIAELKQE-HSIRRIKGV 340 Query: 401 KRAVDL----EFCRN 411 K+ D E N Sbjct: 341 KKGPDSVEHGEQWLN 355 >gi|13242438|ref|NP_077457.1| DNA packaging terminase subunit 1 [Cercopithecine herpesvirus 9] gi|11036590|gb|AAG27219.1|AF275348_40 unknown [Cercopithecine herpesvirus 9] Length = 745 Score = 42.4 bits (98), Expect = 0.17, Method: Composition-based stats. Identities = 33/152 (21%), Positives = 49/152 (32%), Gaps = 22/152 (14%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GKT L+ LMST GI V A + K + FE L Sbjct: 271 GKTWFIVSLIALLMSTFRGIKVGYTA------------HIRK-----ATEPVFEEIKARL 313 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTM---CRTYSEERPDTFVGHHNTYGMAIINDEASGTPD 205 W+ + +S +S C T G + DEA+ Sbjct: 314 --EQWFGTERIEHVKGESITFSFSDGCCSTAVFSSSHNTNGIRGQTFNLLFVDEANFIRP 371 Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 I+GFL + N ++ ++N + S F Sbjct: 372 DAVQTIVGFLNQTNCKIIFVSSTNTGKASTSF 403 >gi|226315677|ref|YP_002775693.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 29805] gi|226202054|gb|ACO38634.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 29805] Length = 396 Score = 42.4 bits (98), Expect = 0.18, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 + RG GKT A + L + G + + + + ++ E+ + LS+ + +F + Sbjct: 26 SSRGTGKTYDIATVNLERKFSVDGGDTLAIRKKKNKTTQSIHKEILELLSIHNLRKFFNI 85 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196 + + ++R F G H+T + + Sbjct: 86 SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124 Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254 +EA+ ++ + E+ + M+SNP S Y+ + N+ + Sbjct: 125 LEEANQFSADDIEMLIPTMREQGGRIY--MSSNPVPKSHWLYKRYLSNQDNPAVCIIKST 182 Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287 R ++ + + + Y + R+EV G+ Sbjct: 183 YRDNPFLNGGDVQAWLEKQRLAYHGNDIGFRIEVLGE 219 >gi|260431843|ref|ZP_05785814.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] gi|260415671|gb|EEX08930.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] Length = 176 Score = 42.4 bits (98), Expect = 0.18, Method: Composition-based stats. Identities = 26/127 (20%), Positives = 39/127 (30%), Gaps = 9/127 (7%) Query: 182 DTFVGHHNTYGMAIINDEASGT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 + G II DEA PD F + T N R SG FYE Sbjct: 49 ENARGETAD---LIIGDEACFIQPDEALTAFFPMRRSTG-RIFLLSTPNGTR-SGYFYET 103 Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 + + +R + + D R + R E ++ S + N Sbjct: 104 WESDANV-RRIRARSMDTTREDRLAQIEFDRR-TMSDATFRREHLCEWVGAGE-SLLSWN 160 Query: 301 IIEEALN 307 +E A+ Sbjct: 161 TLERAMQ 167 >gi|289432252|ref|YP_003462125.1| hypothetical protein DehalGT_0302 [Dehalococcoides sp. GT] gi|288945972|gb|ADC73669.1| conserved hypothetical protein [Dehalococcoides sp. GT] Length = 420 Score = 42.4 bits (98), Expect = 0.18, Method: Composition-based stats. Identities = 49/295 (16%), Positives = 86/295 (29%), Gaps = 59/295 (20%) Query: 154 YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILG 213 ++D+ H G + S E + VG NT + + DEA Sbjct: 87 FTDIYHTEGGYIIRLNQARAVFLSAEPSASVVG--NTAHLLLEVDEAQDVNKEKY----- 139 Query: 214 FLTERNANRFWIMTSNPRRLSGKFYEIFN-------------KPLDDWKRFQIDTRTVEG 260 + T+ L G ++ F+ + + F+ D V Sbjct: 140 ---SKEFKPMGATTNVTTVLYGTTWDSFSLLEEIKEQNIEKEQKDGLKRHFRYDWEAVAA 196 Query: 261 IDPSFHE---GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC---PDP 314 +P++ R G + + + P ++ PC P+ Sbjct: 197 HNPAYLAYALSEKERLGENHPLFLTQYR-LLPVSGGGGMFSNEQLDLLKGNHPCQVYPEK 255 Query: 315 YAPLIMGCDIAEE-----GGDNTVVVLRRGPVIEHL----------------------FD 347 + G D+A E G T V LRR + + + Sbjct: 256 GKVYVAGLDLAGEDSQTGGISPTTVNLRRDSSVLTIAQLDYTFAKAPYNLPQVRLVCHYS 315 Query: 348 WSKTDLRTTNNKISGLVEK-YRPDAIIIDANNTGARTCDYLEM-LGYHVYRVLGQ 400 W T K+ L+ K ++ + +DA G +L LG + V Q Sbjct: 316 WQGTRHALLYEKLVELLGKVWKCRKVAVDATGLGQPVASFLRESLGSRILPVPFQ 370 >gi|298290710|ref|YP_003692649.1| hypothetical protein Snov_0699 [Starkeya novella DSM 506] gi|296927221|gb|ADH88030.1| protein of unknown function DUF264 [Starkeya novella DSM 506] Length = 428 Score = 42.4 bits (98), Expect = 0.18, Method: Composition-based stats. Identities = 65/414 (15%), Positives = 116/414 (28%), Gaps = 65/414 (15%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGI---SVICLANSETQLKTTLWAEVSKWLSLLPNK 138 + GRG GKT A V L R G + +A S L+ + VS L++ P Sbjct: 42 VLGGRGAGKTRAGAEWVRALAFGRAGPPAGRIALVAESLGDLREVMVEGVSGLLAVHPRG 101 Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198 + + + +S + P++ G A D Sbjct: 102 ERPTWEPTR---------------KRLEWPNGAVAQGFSADDPESLRGPQFD---AAWCD 143 Query: 199 EASGTPDVINLGILGFLTERNANRFW-----IMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 E + M + R + + P R Sbjct: 144 ELAK-----WRYAQAAFDNLQFGLRLGARPRQMVTTTPRPTTLLRALLADPRTAVTRM-G 197 Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE--PC 311 + P F E ++ RY + + R E+ G+ + D+ +IE Sbjct: 198 TAENAAHLAPHFLETVVGRY-AGTRLGRQELDGELIEDRPDALWSRALIEAGREAAAPEM 256 Query: 312 PDPYAPLIMGCDI---AEEGGDNTVVV---LRRGPVIEHLFDWSKTDLRTT--NNKISGL 363 +++ D + + D +V + R ++ L D S L T + GL Sbjct: 257 VRQMERIVVAVDPPASSRKHADACGLVAAGIDRDGLVHVLADESAQGLTPTGWGGRAVGL 316 Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421 + D ++++ N G L + V V + R E + Sbjct: 317 FHRLEADRVVVEVNQGGEMVKSILAGIDPSVPVREVRATRGKW-------LRAEPVAALY 369 Query: 422 DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 + L L + G +S D D L++ Sbjct: 370 EQGRVRHAGAFPALEDELC-----DFGSDGL--------SNGRSPDRLDALVWA 410 >gi|225552551|ref|ZP_03773490.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] gi|225370879|gb|EEH00310.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] Length = 450 Score = 42.4 bits (98), Expect = 0.19, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I ++A+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNKATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNVATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI + + P I D A GGDNT + + Sbjct: 265 IFTQINITQNYVFTSP--------IAYLDPAFSIGGDNTALCV 299 >gi|224534955|ref|ZP_03675523.1| conserved hypothetical protein [Borrelia spielmanii A14S] gi|224513774|gb|EEF84100.1| conserved hypothetical protein [Borrelia spielmanii A14S] Length = 285 Score = 42.4 bits (98), Expect = 0.19, Method: Composition-based stats. Identities = 21/122 (17%), Positives = 36/122 (29%), Gaps = 4/122 (3%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294 F + + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDS 264 Query: 295 SF 296 F Sbjct: 265 IF 266 >gi|23335598|ref|ZP_00120832.1| hypothetical protein Blon03000707 [Bifidobacterium longum DJO10A] gi|189440021|ref|YP_001955102.1| phage terminase large subunit [Bifidobacterium longum DJO10A] gi|189428456|gb|ACD98604.1| Phage terminase large subunit [Bifidobacterium longum DJO10A] Length = 477 Score = 42.4 bits (98), Expect = 0.19, Method: Composition-based stats. Identities = 55/376 (14%), Positives = 105/376 (27%), Gaps = 60/376 (15%) Query: 52 RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISA-GRGIGKTTLNAWLVLWLMSTRPGISV 110 WQ + ++ A + + + + + R GKT W+ + + PG+ + Sbjct: 37 DVWQRQINRIILAKSADGFWSA-----RNTVLSIPRQTGKTYDIGWVAIHRAARTPGMRI 91 Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLH---CSLGIDSK 167 + A + S++ + + D H + G + Sbjct: 92 VWTA---------------QHFSVIKDTFESLCAIVLRPEMSGLVDPDHGISLAAGKEEI 136 Query: 168 HYSTMCRT-YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIM 226 + R + G ++ DEA D +L R N I Sbjct: 137 RFRNGSRIFFRARERGALRGV--KKIALLVIDEAQHLSDSAMASMLPT-QNRAWNPQTIY 193 Query: 227 TSNPRRLSGKFYEIFNKPLDDWKRFQIDT---------RTVEGIDPSFHEGIIARY---- 273 P E F + D + + + R + +D Y Sbjct: 194 MGTPPGPRDNG-EAFTRLRDKARAGRTHSTLYVEFTADRDADPLDRQQWRKANPSYPSHT 252 Query: 274 ----------GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCD 323 L D R E G + + + I EEA P + G D Sbjct: 253 SDESIANLWENLTGDDFRREALGIWDEHALSRAIDRRQWEEATI--ERRRPGGVMSFGID 310 Query: 324 IAEEGGDNTV---VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEK--YRPDAIIIDANN 378 + + T+ + L ++ T+ T L++K + +++ID + Sbjct: 311 MNPQRTRLTIGACMRYDDNTAHIELAEYRDTNQDGT-MWAVNLIDKVWEQTASLVIDGQS 369 Query: 379 TGARTCDYLEMLGYHV 394 L G V Sbjct: 370 PATALLPDLAQAGVTV 385 >gi|254471818|ref|ZP_05085219.1| phage DNA Packaging Protein [Pseudovibrio sp. JE062] gi|211959020|gb|EEA94219.1| phage DNA Packaging Protein [Pseudovibrio sp. JE062] Length = 428 Score = 42.4 bits (98), Expect = 0.20, Method: Composition-based stats. Identities = 55/322 (17%), Positives = 104/322 (32%), Gaps = 35/322 (10%) Query: 166 SKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG-ILGFLTERNANRFW 224 + R +S E P+ G A DEA + +L F Sbjct: 119 EWPNGAIARAFSSEDPEALRGPQFD---AAWCDEAGKWSNATETFDMLQFGLRLGTQPQQ 175 Query: 225 IMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV 284 ++T+ P+ S ++ + + +F + + RYG + R E+ Sbjct: 176 LVTTTPK--STPLLKMLLQDQRVVVTKAGTKSNAAFLAEAFLQQMAERYGGT-RLGRQEL 232 Query: 285 CGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGG---DNT-VVVLRRGP 340 G+ + D+ E +NR +++ D G D ++ Sbjct: 233 DGELIEDREDALFARKWFE--MNRVRHVPELKRIVVAIDPPATSGKSADACGIIAAGITE 290 Query: 341 VIE--HLFDWSKTDLRTTN--NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYR 396 E L D + LR ++ L + D ++ + N G + +E + V Sbjct: 291 AAELFVLRDRTAQGLRPAAWADQAIRLYHELEADCLLAEVNQGGEMVREVIEGVDASV-- 348 Query: 397 VLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIE 456 ++V + RR E +A E ++H G+ L+ + + G Sbjct: 349 ---PVKSVHATRSKRRRAE---PVALLYEQGR-VHHCGVFPELED-ELADFGSGGL---- 396 Query: 457 SKRVKGAKSTDYSDGLMYTFAE 478 KS D D L++ E Sbjct: 397 ----SNGKSPDRLDALVWAITE 414 >gi|330507947|ref|YP_004384375.1| phage terminase, large subunit, PBSx family [Methanosaeta concilii GP-6] gi|328928755|gb|AEB68557.1| phage terminase, large subunit, PBSx family [Methanosaeta concilii GP-6] Length = 422 Score = 42.0 bits (97), Expect = 0.21, Method: Composition-based stats. Identities = 19/152 (12%), Positives = 46/152 (30%), Gaps = 9/152 (5%) Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 + ++ + DE + P+ +L L+++ A ++ NP Sbjct: 101 ADNTSSYKKIEGESLLRAYVDEGTTIPENFTNMLLSRLSDKGA-CLYLTC-NPETPRNYI 158 Query: 238 YEIF--NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDS 295 Y + + + K ++ + + + Y + + G + + Sbjct: 159 YRNWIARQDELNIKVWKFTLDDNPYLPLEYKRDLEKEYPKGTVFYDRFILGNWVAAEGRV 218 Query: 296 FIPLNIIEEALNREPCPDPYAP--LIMGCDIA 325 F ++ E P P L +G D Sbjct: 219 FGLFA---RGMHCEVPPATLRPKELRIGADYG 247 >gi|121606179|ref|YP_983508.1| hypothetical protein Pnap_3289 [Polaromonas naphthalenivorans CJ2] gi|120595148|gb|ABM38587.1| protein of unknown function DUF264 [Polaromonas naphthalenivorans CJ2] Length = 596 Score = 42.0 bits (97), Expect = 0.21, Method: Composition-based stats. Identities = 21/137 (15%), Positives = 39/137 (28%), Gaps = 20/137 (14%) Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-NREPCPDPYAP------- 317 + + Y ++ + F S P+ +++ + + + Y P Sbjct: 359 IDELRLEY--STEEFENLLMCGFIDDTQ-SVFPMAELQKCMVDSWVDWEDYKPFTARPYG 415 Query: 318 ---LIMGCDIAEEGGDNTVVVLRR----GPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368 + +G D + G VVL G L W D I + ++ Sbjct: 416 YRAVWVGYDPSHTGDTAGCVVLAAPLTPGGKFRVLERHQWRGLDFEAQAEAIRQITLRFN 475 Query: 369 PDAIIIDANNTGARTCD 385 I ID G Sbjct: 476 VQHIGIDTTGLGQGVYQ 492 >gi|301092109|ref|XP_002896227.1| N-acetyltransferase 10 [Phytophthora infestans T30-4] gi|262094857|gb|EEY52909.1| N-acetyltransferase 10 [Phytophthora infestans T30-4] Length = 1102 Score = 42.0 bits (97), Expect = 0.22, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 57/163 (34%), Gaps = 17/163 (10%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGAIS--AGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 Q ++ A L V + + + ++ AGRG GK+ + ++ Sbjct: 254 QARTLDQAKA-ILTFVEAVSEKTLRSTVALTAGRGRGKSAALGMSLA-GAVAYGYSNIFV 311 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 A S LKT + V K L K + + + + V+ ++ + + Sbjct: 312 TAPSPENLKTV-FEFVFKGFDALKYKEHLDYEIVQSTNPEFNHAVVRVNIFREHR----- 365 Query: 173 CRTYSEERPDTFVGHHNT--YGMAIINDEASGTPDVINLGILG 213 +T +P HH + DEA+ P + +LG Sbjct: 366 -QTIQYIQPT----HHEKLAQAELVAIDEAAAIPLPVVKNLLG 403 >gi|94497317|ref|ZP_01303888.1| hypothetical protein SKA58_07183 [Sphingomonas sp. SKA58] gi|94423180|gb|EAT08210.1| hypothetical protein SKA58_07183 [Sphingomonas sp. SKA58] Length = 437 Score = 42.0 bits (97), Expect = 0.22, Method: Composition-based stats. Identities = 71/412 (17%), Positives = 130/412 (31%), Gaps = 63/412 (15%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 AGRG GKT A V + P + + S + ++ + E Sbjct: 58 AGRGFGKTRAGAEWVRGIAEADPAARIALVGASLGEARSVM----------------VEG 101 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKH-YSTMCRTYSEERPDTFVGHHNTYGMAIINDE--- 199 +S L AP ++ + + + P+ G ++G DE Sbjct: 102 ESGLLAIAPHWARPAYAPALRRLTWPNGAVAMLFGAADPEGLRGPQFSHG---WADEIAK 158 Query: 200 -ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTV 258 ASG + ++G R+ T P L + + DD + T Sbjct: 159 WASGEA-AWHNLMMGMRLGRDPRVLVTTTPRPVPL---VRSLVARDGDDVVVTRGRTADN 214 Query: 259 E-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317 E + P F + A YG + R E+ G+ ++ + +IE+ P Sbjct: 215 EANLAPGFVAAMTAGYGGT-RLGRQELDGELIEEVEGALWTRALIEQCRV-VHVPGVLTR 272 Query: 318 LIMGCD-IAEEGGDNTVVV---LRRGPVIEHLFDWSKTDLRTT--NNKISGLVEKYRPDA 371 +++ D A GGD +V + + D S + R ++ + D Sbjct: 273 VVVAVDPPASVGGDACGIVVAGMGGDGRAYVIADASVSGARPEGWARAVAAAAMVHGADR 332 Query: 372 IIIDANNTGARTCDYLE--MLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF--- 426 ++ +ANN GA L V V + R E + + Sbjct: 333 VVAEANNGGAMVESVLRAAEKTLPVKLVHASRGKA-------ARAEPVAALYEAGRVAHR 385 Query: 427 ASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478 + + L + ++ P +S D +D L++ +E Sbjct: 386 GAFPELEDEMCGLLAGGGYVGP--------------GRSPDRADALVWAMSE 423 >gi|323352542|gb|EGA85041.1| Kre33p [Saccharomyces cerevisiae VL3] Length = 966 Score = 42.0 bits (97), Expect = 0.22, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 50/136 (36%), Gaps = 10/136 (7%) Query: 78 FKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137 F A++AGRG GK+ + +S ++ + S LKT L+ + K L Sbjct: 187 FTVALTAGRGRGKSAALGISIAAAVS-HGYSNIFVTSPSPENLKT-LFEFIFKGFDALGY 244 Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197 + + + + ++ + D H T+ ++ ++ Sbjct: 245 QEHIDYDIIQSTNPDFNKAIVRVDIKRD--HRQTIQYIVPQDHQVLGQAE------LVVI 296 Query: 198 DEASGTPDVINLGILG 213 DEA+ P I +LG Sbjct: 297 DEAAAIPLPIVKNLLG 312 >gi|323335941|gb|EGA77219.1| Kre33p [Saccharomyces cerevisiae Vin13] Length = 961 Score = 42.0 bits (97), Expect = 0.22, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 50/136 (36%), Gaps = 10/136 (7%) Query: 78 FKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137 F A++AGRG GK+ + +S ++ + S LKT L+ + K L Sbjct: 187 FTVALTAGRGRGKSAALGISIAAAVS-HGYSNIFVTSPSPENLKT-LFEFIFKGFDALGY 244 Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197 + + + + ++ + D H T+ ++ ++ Sbjct: 245 QEHIDYDIIQSTNPDFNKAIVRVDIKRD--HRQTIQYIVPQDHQVLGQAE------LVVI 296 Query: 198 DEASGTPDVINLGILG 213 DEA+ P I +LG Sbjct: 297 DEAAAIPLPIVKNLLG 312 >gi|190409119|gb|EDV12384.1| hypothetical protein SCRG_03266 [Saccharomyces cerevisiae RM11-1a] Length = 1056 Score = 42.0 bits (97), Expect = 0.22, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 50/136 (36%), Gaps = 10/136 (7%) Query: 78 FKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137 F A++AGRG GK+ + +S ++ + S LKT L+ + K L Sbjct: 277 FTVALTAGRGRGKSAALGISIAAAVS-HGYSNIFVTSPSPENLKT-LFEFIFKGFDALGY 334 Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197 + + + + ++ + D H T+ ++ ++ Sbjct: 335 QEHIDYDIIQSTNPDFNKAIVRVDIKRD--HRQTIQYIVPQDHQVLGQAE------LVVI 386 Query: 198 DEASGTPDVINLGILG 213 DEA+ P I +LG Sbjct: 387 DEAAAIPLPIVKNLLG 402 >gi|151944405|gb|EDN62683.1| killer toxin resistant protein [Saccharomyces cerevisiae YJM789] gi|207341763|gb|EDZ69729.1| YNL132Wp-like protein [Saccharomyces cerevisiae AWRI1631] gi|256273837|gb|EEU08759.1| Kre33p [Saccharomyces cerevisiae JAY291] gi|259149229|emb|CAY82471.1| Kre33p [Saccharomyces cerevisiae EC1118] Length = 1056 Score = 42.0 bits (97), Expect = 0.22, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 50/136 (36%), Gaps = 10/136 (7%) Query: 78 FKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137 F A++AGRG GK+ + +S ++ + S LKT L+ + K L Sbjct: 277 FTVALTAGRGRGKSAALGISIAAAVS-HGYSNIFVTSPSPENLKT-LFEFIFKGFDALGY 334 Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197 + + + + ++ + D H T+ ++ ++ Sbjct: 335 QEHIDYDIIQSTNPDFNKAIVRVDIKRD--HRQTIQYIVPQDHQVLGQAE------LVVI 386 Query: 198 DEASGTPDVINLGILG 213 DEA+ P I +LG Sbjct: 387 DEAAAIPLPIVKNLLG 402 >gi|6324197|ref|NP_014267.1| Kre33p [Saccharomyces cerevisiae S288c] gi|1730777|sp|P53914|KRE33_YEAST RecName: Full=UPF0202 protein KRE33; AltName: Full=Killer toxin-resistance protein 33 gi|854505|emb|CAA86893.1| orf16 [Saccharomyces cerevisiae] gi|1302072|emb|CAA96014.1| unnamed protein product [Saccharomyces cerevisiae] gi|285814522|tpg|DAA10416.1| TPA: Kre33p [Saccharomyces cerevisiae S288c] Length = 1056 Score = 42.0 bits (97), Expect = 0.22, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 50/136 (36%), Gaps = 10/136 (7%) Query: 78 FKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137 F A++AGRG GK+ + +S ++ + S LKT L+ + K L Sbjct: 277 FTVALTAGRGRGKSAALGISIAAAVS-HGYSNIFVTSPSPENLKT-LFEFIFKGFDALGY 334 Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197 + + + + ++ + D H T+ ++ ++ Sbjct: 335 QEHIDYDIIQSTNPDFNKAIVRVDIKRD--HRQTIQYIVPQDHQVLGQAE------LVVI 386 Query: 198 DEASGTPDVINLGILG 213 DEA+ P I +LG Sbjct: 387 DEAAAIPLPIVKNLLG 402 >gi|260890025|ref|ZP_05901288.1| 3-isopropylmalate dehydratase, small subunit [Leptotrichia hofstadii F0254] gi|260860631|gb|EEX75131.1| 3-isopropylmalate dehydratase, small subunit [Leptotrichia hofstadii F0254] Length = 191 Score = 42.0 bits (97), Expect = 0.22, Method: Composition-based stats. Identities = 28/135 (20%), Positives = 51/135 (37%), Gaps = 6/135 (4%) Query: 339 GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY-----LEMLGYH 393 G +W + RT N + +Y+ I+I +N G + L+ G+H Sbjct: 37 GFGQYVFDEWRYNEDRTDNMDFNLNKPEYKTGTILITGDNFGCGSSREHAAWALQDYGFH 96 Query: 394 VYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGEL 453 V G + + N + + AD LE A L + ++ +L++ K Sbjct: 97 VIVAGGYSGIFYMNWLNNGHLPITLPEADRLELAKLPGDAKVVVDLENNKLTANRKDYFF 156 Query: 454 AI-ESKRVKGAKSTD 467 + ES + + K D Sbjct: 157 ELEESWKQRLLKGLD 171 >gi|167462274|ref|ZP_02327363.1| hypothetical protein Plarl_06915 [Paenibacillus larvae subsp. larvae BRL-230010] gi|322382817|ref|ZP_08056660.1| phage-related terminase-like protein large subunit [Paenibacillus larvae subsp. larvae B-3650] gi|321153200|gb|EFX45647.1| phage-related terminase-like protein large subunit [Paenibacillus larvae subsp. larvae B-3650] Length = 423 Score = 42.0 bits (97), Expect = 0.23, Method: Composition-based stats. Identities = 46/302 (15%), Positives = 97/302 (32%), Gaps = 42/302 (13%) Query: 179 ERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFY 238 ++P HN + +E S +LG L + ++++NP + Sbjct: 105 DKPAKLKSIHNVS--IVWIEECSEVKYEGFKELLGRLRHPALDLHMLLSTNPVGEDNWTF 162 Query: 239 EIFNKP----------LDDWKRFQI----------DTRTVEGIDPSFHEGIIARYGLDSD 278 + F K D +++ I + S+ + D D Sbjct: 163 KHFFKDELKNHIVLEDTDLYEKRTIVKNDTFYHHSTAEDNLFLPKSYVAQLDELKAYDPD 222 Query: 279 VTRVEVCGQFPQQDIDSF-----IPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTV 333 + R+ G+F + + EA++R P G D E N V Sbjct: 223 LYRIAREGRFGVNGVRVLPQFEVASHEEVIEAISRIRKPIE----RTGMDFGFEDSYNAV 278 Query: 334 VVLRRGPVIEHL-FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGY 392 V L + L W + T+++ + ++++ +I A++ +T Y G+ Sbjct: 279 VRLAVDHEQKILYIYWEYYKNQMTDDRTAEALQEFARTKELIKADSAEPKTIRYFRQKGF 338 Query: 393 HVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGE 452 ++ + R + K+ + + I+ LK L ++ G Sbjct: 339 NMRPAKKFPGS---------RLQYTKKIKRFKKIICSEKCPNTIRELKYL-TYKTDKNGR 388 Query: 453 LA 454 + Sbjct: 389 IL 390 >gi|149190524|ref|ZP_01868794.1| terminase, ATPase subunit [Vibrio shilonii AK1] gi|148835648|gb|EDL52615.1| terminase, ATPase subunit [Vibrio shilonii AK1] Length = 584 Score = 42.0 bits (97), Expect = 0.23, Method: Composition-based stats. Identities = 37/243 (15%), Positives = 75/243 (30%), Gaps = 34/243 (13%) Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 +D +G D F+ + R +V + +F SF L + Sbjct: 327 ITVDDAIAKGGDKLFNMAKLKRKYPVKEVFDNLLRCKFLDDS-TSFFALKALLACKTDTE 385 Query: 311 ----------CPDPYAPLIMGCDI----AEEGGDNT--VVVLR---RGPVIEHLFDWS-- 349 P +++G D EG D+ VV L+ +G V + Sbjct: 386 NWKDVDHNKARPVGNEEVLVGYDPRGGGTGEGSDDAGLVVALKPKTKGGVFRAIEKVRLK 445 Query: 350 KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409 + I G+ EKY + +D G+ + + + + E Sbjct: 446 GSSYEQQAETIRGITEKYNVVYLAMDTGGVGSAVAELVRKFYPALVELN-----YSPEM- 499 Query: 410 RNRRTELHVKMADWLEFASLI---NHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKST 466 +R + K + + + + L+ + ++ +G++ S R K Sbjct: 500 --KRM-MAYKAREIINNGRFLFDDDWDDLVHSFLMIRQQTTDRSGQVTFVSNRSKIGSHA 556 Query: 467 DYS 469 D + Sbjct: 557 DLA 559 >gi|262047916|ref|ZP_06020862.1| terminase large subunit [Lactobacillus crispatus MV-3A-US] gi|260571794|gb|EEX28369.1| terminase large subunit [Lactobacillus crispatus MV-3A-US] Length = 644 Score = 42.0 bits (97), Expect = 0.25, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 45/156 (28%), Gaps = 19/156 (12%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAW----LVLWLMSTRPGI 108 WQ + +++ ++ ++ GRG GKT + VL Sbjct: 112 DWQKFILAMING-WKDANGERRYTDIHISV--GRGQGKTQIAGIQMCKAVLIDTLNFTNK 168 Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168 + AN+ Q T L+ V K L + F + + ++ Sbjct: 169 DFLITANTSDQ-STKLFGYVKKMLEAVIKIEPFASIAKESGLDLQTNQIIEKETNNKVWK 227 Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204 S Y +T+ + I DE Sbjct: 228 ISYEADKYD-----------STHNVLAIYDETGALD 252 >gi|66395973|ref|YP_240307.1| ORF008 [Staphylococcus phage ROSA] gi|62636393|gb|AAX91504.1| ORF008 [Staphylococcus phage ROSA] Length = 421 Score = 42.0 bits (97), Expect = 0.25, Method: Composition-based stats. Identities = 38/309 (12%), Positives = 94/309 (30%), Gaps = 35/309 (11%) Query: 83 SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFE 142 GRG GK++ + ++ + R ++ + + ++ L T+++ ++ + H F+ Sbjct: 33 KGGRGSGKSSDISIIIT-QLIMRYPMNAVVIRKTDNTLATSVFEQIKWAIEEQKVTHLFK 91 Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY---GMAIINDE 199 ++ + + + R + P+ ++ +A I + Sbjct: 92 VKVS------------PMEITYIPRGNRIIFRGA--QNPERLKSLKDSRFPFSIAWIEEL 137 Query: 200 ASGTPDVINLGILGFL----TERNANRFWIMTSNPRRLSGKF----YEIFNKPLDDWKRF 251 A + I L + + + NP + + YE + + + Sbjct: 138 AEFKTEDEVTTITNSLLRGELDEGLFYKFFFSYNPPKRKQSWVNKKYESSFQADNTFVHH 197 Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 I F + + + R E G+ +P N + + Sbjct: 198 S-TYLNNPFISKQFIQEAESAKKRNEQRYRWEYMGEAIGS---GVVPFNNLRIEEIPQGQ 253 Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKY 367 D + + D D V ++ VI + + + + Y Sbjct: 254 YDTFDNIRNAVDFG-YATDPLAFVRWHYDKKKRVIYAMDEHYGVQISNREFANWLKKKGY 312 Query: 368 RPDAIIIDA 376 + D I D+ Sbjct: 313 QSDEIFADS 321 >gi|78043214|ref|YP_360500.1| prophage LambdaCh01, PBSX family terminase large subunit [Carboxydothermus hydrogenoformans Z-2901] gi|77995329|gb|ABB14228.1| prophage LambdaCh01, terminase, large subunit, PBSX family [Carboxydothermus hydrogenoformans Z-2901] Length = 420 Score = 42.0 bits (97), Expect = 0.25, Method: Composition-based stats. Identities = 54/339 (15%), Positives = 100/339 (29%), Gaps = 42/339 (12%) Query: 83 SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFE 142 GRG GK++ + ++ M P + + L + LK +++ ++ + L +++ Sbjct: 32 KGGRGSGKSSFASIEIILGMMKDPNANAVVLRKVKETLKDSVFEQLIWAIEKLKVSDYWD 91 Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAII----ND 198 + P + I + + S + + + I D Sbjct: 92 -----IKHNPMEMTYIPTGQKILFRGADKPKKIRSTKV--------SKGYIKFIWYEEVD 138 Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSN-PRRLSGKFYEIFNKPLDDWKRFQIDTRT 257 E +G ++ I L T N P R++ E D K T Sbjct: 139 EFNGMEEI--RIINQSLMRGGEQFVVFYTYNPPNRVNAWVNEEILIERPDRKVHHSTYLT 196 Query: 258 VEG--IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF----IPLNIIEE--ALNRE 309 V + F ++ R E G+ + F I +E A +R Sbjct: 197 VPREWLGEQFLIEAEHLKRINERAYRHEYLGEITGTGGEIFSNITIRKITDDEIKAFDRI 256 Query: 310 PCPDPYAPLIMGCDIAEEGG---DNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEK 366 + D D T RR I + R I + Sbjct: 257 RRGIDWG---YAVDPVHYTVCHYDRT----RRRLFIFYEIHQVGLSNRRLAELIKEENKL 309 Query: 367 YRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405 P I A++ ++ L+ G VY +V+ Sbjct: 310 NSP----ITADSAEPKSIAELKSYGLKVYGAKKGPGSVE 344 >gi|253682970|ref|ZP_04863757.1| hypothetical protein CLG_B2294 [Clostridium phage D-1873] gi|253560896|gb|EES90358.1| hypothetical protein CLG_B2294 [Clostridium phage D-1873] Length = 611 Score = 42.0 bits (97), Expect = 0.25, Method: Composition-based stats. Identities = 29/135 (21%), Positives = 50/135 (37%), Gaps = 30/135 (22%) Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCP-------DPYAPLIMGCDIAEEGG---DNT 332 E + + +F L+ I + C I+ D+A +GG D + Sbjct: 310 EYRSIWIKFSDKAFFKLDDINNCRVIKHCELEADFKNHKDDFYIISYDVARQGGTANDAS 369 Query: 333 VVVLRR------GPVIEHLFD-WSKTDLRTTNNK-------------ISGLVEKYRPDAI 372 + + R G +++ +S D NN + LVEKY+ A+ Sbjct: 370 IATIFRCTPRTDGSYFKNVVAMYSCEDKNKNNNNVNSIMHFKNQCIMLKRLVEKYQAKAL 429 Query: 373 IIDANNTGARTCDYL 387 ++D N G+ DYL Sbjct: 430 LVDINGIGSGLLDYL 444 >gi|320532097|ref|ZP_08032978.1| hypothetical protein HMPREF9057_00846 [Actinomyces sp. oral taxon 171 str. F0337] gi|320135702|gb|EFW27769.1| hypothetical protein HMPREF9057_00846 [Actinomyces sp. oral taxon 171 str. F0337] Length = 370 Score = 42.0 bits (97), Expect = 0.25, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 53/175 (30%), Gaps = 12/175 (6%) Query: 60 EVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQ 119 V++ + + P I+ RG+GKT L S R V+ + Sbjct: 21 RVIEEFLESLDDGPGAPGLLELITGARGVGKTV---MLTALGDSARERGWVVIDETAREG 77 Query: 120 LKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE 179 L L AE ++ LS L K + SLSL + R ++ Sbjct: 78 LMDRLAAEFTRQLSQLAGKERSRLTSLSLSTPLGGGSATLEHAPTPEPSWRQKARALTQW 137 Query: 180 RPDTFVGHHNTYGMAIINDEASGTPDVINLGILGF---LTERNANRFWIMTSNPR 231 + G + + DE P + L A +M P+ Sbjct: 138 LAEHGTG------LLLTIDEVHAIPREELRALSAEVQHLIREGAPIGLLMAGLPK 186 >gi|171681273|ref|XP_001905580.1| hypothetical protein [Podospora anserina S mat+] gi|170940595|emb|CAP65823.1| unnamed protein product [Podospora anserina S mat+] Length = 1721 Score = 42.0 bits (97), Expect = 0.26, Method: Composition-based stats. Identities = 26/129 (20%), Positives = 42/129 (32%), Gaps = 18/129 (13%) Query: 87 GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSL 146 G GKT ++L L S P ++ A + + L ++LSL P + + Sbjct: 1332 GTGKTETILSIILSLQSHFPDSRILLTAPTHNAVDNVL----RRYLSLNPTHPPLRISTE 1387 Query: 147 SLHPAP-WYSDVLHCSLGIDSKHYSTMCRT-------------YSEERPDTFVGHHNTYG 192 +P L GI+ + T +S + N Sbjct: 1388 IRKVSPDVTPYTLDAMAGIELNTLHSRAETTKAKKRVKAAKIVFSTCIGSSLGLLRNEMF 1447 Query: 193 MAIINDEAS 201 +I DEAS Sbjct: 1448 DIVIIDEAS 1456 >gi|323487253|ref|ZP_08092556.1| hypothetical protein HMPREF9474_04307 [Clostridium symbiosum WAL-14163] gi|323399479|gb|EGA91874.1| hypothetical protein HMPREF9474_04307 [Clostridium symbiosum WAL-14163] Length = 550 Score = 42.0 bits (97), Expect = 0.26, Method: Composition-based stats. Identities = 17/84 (20%), Positives = 33/84 (39%), Gaps = 10/84 (11%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKG-AISAGRGIGKTTLNAWLVLW--LMSTRPGIS 109 +WQ + ++ V+ +F+ I GR GKT + ++ + + G Sbjct: 71 TWQKSTVSIM----FGIVDEAGIRIFREFLIVIGRKNGKTLFASGIIAYCLFLDGEYGAK 126 Query: 110 VICLANSETQ---LKTTLWAEVSK 130 V C+A Q + + W + K Sbjct: 127 VFCVAPKLDQADLVYQSFWQTIQK 150 >gi|326772022|ref|ZP_08231307.1| conserved hypothetical protein [Actinomyces viscosus C505] gi|326638155|gb|EGE39056.1| conserved hypothetical protein [Actinomyces viscosus C505] Length = 370 Score = 41.6 bits (96), Expect = 0.27, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 53/175 (30%), Gaps = 12/175 (6%) Query: 60 EVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQ 119 V++ + + P I+ RG+GKT L S R V+ + Sbjct: 21 RVIEEFLESLDDGPGAPGLLELITGARGVGKTV---MLTALGDSARERGWVVVDETAREG 77 Query: 120 LKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE 179 L L AE ++ LS L K + SLSL + R ++ Sbjct: 78 LMDRLAAEFTRQLSQLAGKERSRLTSLSLSTPLGGGSATLEHAPTPEPSWRQKARALTQW 137 Query: 180 RPDTFVGHHNTYGMAIINDEASGTPDVINLGILGF---LTERNANRFWIMTSNPR 231 + G + + DE P + L A +M P+ Sbjct: 138 LAEHGTG------LLLTIDEVHAIPREELRALSAEVQHLIREGAPIGLLMAGLPK 186 >gi|118380585|ref|XP_001023456.1| Type III restriction enzyme, res subunit family protein [Tetrahymena thermophila] gi|89305223|gb|EAS03211.1| Type III restriction enzyme, res subunit family protein [Tetrahymena thermophila SB210] Length = 1858 Score = 41.6 bits (96), Expect = 0.27, Method: Composition-based stats. Identities = 23/121 (19%), Positives = 43/121 (35%), Gaps = 11/121 (9%) Query: 80 GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKH 139 ++ G+GKT + A ++L P + LA + + + E L+ Sbjct: 154 TLVALPTGLGKTFIAATVILNYYLWFPKGKIFFLAPTRPLVNQQM--ECLSQFELINKND 211 Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 FEM +P P DV + + + +T + + + +I DE Sbjct: 212 IFEMTGN--YPIPKRRDVY-----LRKRIFFCTPQTLENDLIE--QRYDGYNLSLVIFDE 262 Query: 200 A 200 A Sbjct: 263 A 263 >gi|281491541|ref|YP_003353521.1| phage terminase [Lactococcus lactis subsp. lactis KF147] gi|281375259|gb|ADA64772.1| Phage protein, terminase [Lactococcus lactis subsp. lactis KF147] Length = 469 Score = 41.6 bits (96), Expect = 0.27, Method: Composition-based stats. Identities = 57/350 (16%), Positives = 105/350 (30%), Gaps = 53/350 (15%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 WQ ++ V A + + + GKT + L LW + G+S++ Sbjct: 41 PWQKNLLKEVMAIDEDGLWTHQKFGYSIPRRN----GKTEIVYILELWSL--EQGLSILH 94 Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172 A+ + ++ + K+L + +S+ + L + T Sbjct: 95 TAHRISTSHSSYEK-LKKYLEDSGYVEGEDFKSIKAK----GQERLELIESGGVIQFRT- 148 Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232 RT S + F ++ DEA + +T+ + N IM P Sbjct: 149 -RTSSGGLGEGFD--------ILVIDEAQEYTTEQESALKYTVTDSD-NPMTIMCGTPPT 198 Query: 233 L------SGKFYE---IFNKPLDDWKRFQI-------DTRTVEGIDPS-----FHEGIIA 271 + + W + + D +PS I A Sbjct: 199 PISSGTVFTNYRDNTLAGKAKYSGWAEWSVEDVKDIHDVEAWYNSNPSMGYHLNERKIEA 258 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330 G D V+ G +P+ + S I +NR P L +G + G D Sbjct: 259 ELGEDKLDHNVQRLGYWPKYNQKSVISEQEWNVLKVNRLPVIK--GKLFVGI---KYGND 313 Query: 331 NTVVVLRRGPVIEH----LFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376 V + + +R N I ++K + ++ID Sbjct: 314 GANVAMSIAVKTLSGKVFVETIDCQSIRNGNQWIINFLKKADVEKVVIDG 363 >gi|29826542|ref|NP_821176.1| hypothetical protein SAV_2 [Streptomyces avermitilis MA-4680] gi|29603638|dbj|BAC67711.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 77 Score = 41.6 bits (96), Expect = 0.27, Method: Composition-based stats. Identities = 10/47 (21%), Positives = 20/47 (42%), Gaps = 3/47 (6%) Query: 74 NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 P+ +G I + G GKT++ A ++ P ++ + L Sbjct: 2 PPQGARGTIVSATGSGKTSMAAAST---LNCFPEGRILVTVPTLDLL 45 >gi|66396048|ref|YP_240381.1| ORF008 [Staphylococcus phage 71] gi|62636467|gb|AAX91578.1| ORF008 [Staphylococcus phage 71] Length = 421 Score = 41.6 bits (96), Expect = 0.27, Method: Composition-based stats. Identities = 38/309 (12%), Positives = 94/309 (30%), Gaps = 35/309 (11%) Query: 83 SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFE 142 GRG GK++ + ++ + R ++ + + ++ L T+++ ++ + H F+ Sbjct: 33 KGGRGSGKSSDISIIIT-QLIMRYPMNAVVIRKTDNTLATSVFEQIKWAIEEQKVSHLFK 91 Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY---GMAIINDE 199 ++ + + + R + P+ ++ +A I + Sbjct: 92 VKVS------------PMEITYIPRGNRIIFRGA--QNPERLKSLKDSRFPFSVAWIEEL 137 Query: 200 ASGTPDVINLGILGFL----TERNANRFWIMTSNPRRLSGKF----YEIFNKPLDDWKRF 251 A + I L + + + NP + + YE + + + Sbjct: 138 AEFKTEDEVTTITNSLLRGELDEGLFYKFFFSYNPPKRKQSWVNKKYESSFQADNTFVHH 197 Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311 I F + + + R E G+ +P N + + Sbjct: 198 S-TYLNNPFISKQFIQEAESAKKRNEQRYRWEYMGEAIGS---GVVPFNNLRIEEIPQGQ 253 Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKY 367 D + + D D V ++ VI + + + + Y Sbjct: 254 YDTFDNIRNAVDFG-YATDPLAFVRWHYDKKKRVIYAMDEHYGVQISNREFANWLKKKGY 312 Query: 368 RPDAIIIDA 376 + D I D+ Sbjct: 313 QSDEIFADS 321 >gi|326385269|ref|ZP_08206932.1| putative phage terminase protein [Gordonia neofelifaecis NRRL B-59395] gi|326196012|gb|EGD53223.1| putative phage terminase protein [Gordonia neofelifaecis NRRL B-59395] Length = 439 Score = 41.6 bits (96), Expect = 0.28, Method: Composition-based stats. Identities = 58/384 (15%), Positives = 115/384 (29%), Gaps = 49/384 (12%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTR-PGISVI 111 WQ+ +++ + V + GKT L A ++L + G V Sbjct: 11 PWQILAADLIGECDASGRLIHPLVVVTVPRQS----GKTALLAAVMLHRLIMLGEGGRVW 66 Query: 112 CLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYST 171 A + + + +W E+ + + D LG ++ Sbjct: 67 YTAQTGIKAREQMW-EMMDAIDRSALGPL-------IKSKRGAGDTSMELLGTGARA--- 115 Query: 172 MCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLT---ERNANRFWIMTS 228 PD+ G+ + + DEA + G++G +T N I+ S Sbjct: 116 ---KMHPPTPDSLHGNQSDLN---VIDEAWFFDEPQAHGLMGAITPTQSTRPNAQTIIIS 169 Query: 229 NPRRLSGKF-YEIFNKPLDDWKRFQIDTRTVEGIDPS----------------FHEGIIA 271 + +++ + D +D +G+ P + A Sbjct: 170 TAGTAESVWFHDLVARGHDGALCL-VDYGVADGVTPDDYPAIAAAHPAIGHTQKAAILPA 228 Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN 331 S + G + +P I++ A P P A ++ GC ++ E D Sbjct: 229 AREQLSSGEFLRAYGNVRTRTESRLLPAEIVDAATTTTPLPATGA-VVFGCALSFERDDA 287 Query: 332 TVV---VLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 +V G + L T + + L +++ + I D + Sbjct: 288 AIVACMAADDGTPVVELVARF-TSAEGVAARCAELTDRHGGH-VAIAPAGPAGSIADDAD 345 Query: 389 MLGYHVYRVLGQKRAVDLEFCRNR 412 LG V R + + +R Sbjct: 346 RLGATVTRYADAELSSSTADFLDR 369 >gi|224586458|ref|YP_002640348.1| phage terminase, large subunit, pbsx family [Borrelia valaisiana VS116] gi|224497449|gb|ACN53076.1| phage terminase, large subunit, pbsx family [Borrelia valaisiana VS116] Length = 359 Score = 41.6 bits (96), Expect = 0.28, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 57 YGGDKASDFERFIGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 114 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294 F + + +K + T + F E Y + + V G + Sbjct: 115 YFKTDYIDNVATFKTYNFTTYDNVLLSKVFIETQEKLY-KEIPTYKARVLLGAWIASTDS 173 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NII++ + P I D A GGDNT + + Sbjct: 174 IFTQINIIQDYVFTSP--------IAYLDPAFSIGGDNTALCV 208 >gi|297618941|ref|YP_003707046.1| hypothetical protein Mvol_0413 [Methanococcus voltae A3] gi|297377918|gb|ADI36073.1| hypothetical protein Mvol_0413 [Methanococcus voltae A3] Length = 576 Score = 41.6 bits (96), Expect = 0.28, Method: Composition-based stats. Identities = 43/265 (16%), Positives = 80/265 (30%), Gaps = 53/265 (20%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWL-----VLWLMSTRPGIS 109 Q E ++ + + L+ + G+G GK + + L + +++ P Sbjct: 93 QAEILKKMKKNYLS------------TVLVGKGGGKDFMTSLLFNDELIDLILTDIPYTR 140 Query: 110 V--ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSK 167 V I +A + + E +W S K W ++ P + + + Sbjct: 141 VDFINIAPNADLAHNVFFREFKQWFSR--CKLWKLFKNSEKSPIKINNTFIKIGDLVKI- 197 Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL--------TERN 219 T R +F G T I+ DE D + T Sbjct: 198 -------TSGHSRSASFEG---TNPKCIVIDE---ISDENFMNAEKIFYQAKSSVQTRWG 244 Query: 220 ANRFWIMTS-------NPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR 272 + I+ S NP G Y+I+++ L F T E Sbjct: 245 KDGKVILISWTRFPTPNPLDDIG--YKIYSENLGIDDVFSFKGATWEVNSHRSKFDFEDD 302 Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFI 297 Y + + + + P+ + FI Sbjct: 303 YKRNGVLAKKMYECKPPELS-NYFI 326 >gi|34365522|tpg|DAA01288.1| TPA_exp: replicase/helicase/endonuclease [Danio rerio] Length = 3007 Score = 41.6 bits (96), Expect = 0.29, Method: Composition-based stats. Identities = 28/123 (22%), Positives = 44/123 (35%), Gaps = 10/123 (8%) Query: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 M +L E E+ + DL K++ + + L + QL Sbjct: 2248 MKDKLQQVEEHEEHIPDLASEANQKVAHLEKKNNIM---CRRDGLALIRSLNDTQLSIFY 2304 Query: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTL------NAWLVLWLMSTRPG-ISVICL 113 + CL+ V NP I+ G G GK+ L A +L + P ISV+ Sbjct: 2305 EIRQWCLDKVMGKNPSPVHLFITGGAGTGKSHLIKAIQYEAMRILSTVCRHPDNISVLLT 2364 Query: 114 ANS 116 A + Sbjct: 2365 APT 2367 >gi|85709622|ref|ZP_01040687.1| Phage DNA Packaging Protein [Erythrobacter sp. NAP1] gi|85688332|gb|EAQ28336.1| Phage DNA Packaging Protein [Erythrobacter sp. NAP1] Length = 441 Score = 41.6 bits (96), Expect = 0.29, Method: Composition-based stats. Identities = 72/413 (17%), Positives = 126/413 (30%), Gaps = 56/413 (13%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141 I AGRG GKT A V + + + +++S + + + S L+ P Sbjct: 55 IMAGRGFGKTRAGAEWVRSIAESHSEARIALVSSSLAEARAVMVEGESGLLACSP----- 109 Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE-- 199 P SL YS P+ G ++ DE Sbjct: 110 ----------PDRRPEFEPSLRRVRFPNGAEAHLYSAGEPEALRGPQFSHA---WCDEVG 156 Query: 200 ----ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT 255 + +L L + R + T+ PR + + + + T Sbjct: 157 KWPISHSRATRAWDNLLMGLRLGDDPRIAV-TTTPRAVPLVQRLLKQETSQATAVTRGST 215 Query: 256 RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315 P+ IA S + R E+ G+ + + +++E++ E P + Sbjct: 216 YDNSANLPARFLEAIADEFAGSQLGRQEIEGELIEDIEGALWSRSLLEQSKE-EAGPPGF 274 Query: 316 APLIMGCD-IAEEGGDNT---VVVLRRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRP 369 +++G D GD V L L D ++ ++ +R Sbjct: 275 RRIVIGVDPPTSSTGDECGIVVAALGEDNKAWVLADCSVARAQPEQWARAVAEAAHHWRS 334 Query: 370 DAIIIDANNTGARTCDYLE--MLGYHVYRVLGQKRAVDLEFCRNRRTE--LHVKMADWLE 425 D II +AN G L G V V + V R E + +D + Sbjct: 335 DRIIAEANQGGEMVESVLRAADAGLPVKLVHASRGKV-------ARAEPVAALYASDRVR 387 Query: 426 FASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478 A N L + + I + +S D D L++ +E Sbjct: 388 HAG--NFPQLQDQMCG-----------MLIGGEYAGPGRSPDRLDALVWALSE 427 >gi|300173892|ref|YP_003773058.1| phage terminase large subunit [Leuconostoc gasicomitatum LMG 18811] gi|299888271|emb|CBL92239.1| phage terminase, large subunit, pbsx family [Leuconostoc gasicomitatum LMG 18811] Length = 427 Score = 41.6 bits (96), Expect = 0.30, Method: Composition-based stats. Identities = 58/331 (17%), Positives = 104/331 (31%), Gaps = 26/331 (7%) Query: 74 NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICL---ANSETQLKTTLWAEVSK 130 N + A RG GK+ A V+ + T+P ++ + L AN+ Q + + + K Sbjct: 21 NSKARYIAYKGSRGSGKSEGVATKVILDIVTKPYVNWLVLRRYANTNRQ---STFTLLQK 77 Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT 190 + + F+ SL + K S + Sbjct: 78 VANRMGVGSLFQFNG-SLPEITFKPTGQKILFRGADKPLSITSISVETGNLCRLW-VEEA 135 Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWK- 249 Y M + E S + ++ + G + + + ++T NP + F Sbjct: 136 YQMEL---EESF--ETVDESMRGVIDDPDGFYQTVLTFNPWNERHWLKKRFFDEDTRVNN 190 Query: 250 --RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 + +D + ++ + RV V G++ + I N I E + Sbjct: 191 SLAITTTYKDNPFLDVDYVNRLLEMKKRNPRRARVAVDGEW--GVAEGLIYENTIVEKFD 248 Query: 308 -REPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISG 362 RE + ++ G D G D T + + I L + K + T Sbjct: 249 IREVLKGSH--IVRGMDWG-YGPDPTTFIEYAINTKTKDIYILKEMYKQHMLTDEIFKWL 305 Query: 363 LVEKYRPDAIIIDANNTGARTCDYLEMLGYH 393 V Y+ I D N G R L G Sbjct: 306 YVHGYQQGDIRADYANGGDRMIQELRNKGIR 336 >gi|238581544|ref|XP_002389644.1| hypothetical protein MPER_11197 [Moniliophthora perniciosa FA553] gi|215452133|gb|EEB90574.1| hypothetical protein MPER_11197 [Moniliophthora perniciosa FA553] Length = 633 Score = 41.6 bits (96), Expect = 0.31, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 52/159 (32%), Gaps = 18/159 (11%) Query: 87 GIGKTTLNAWLVLWLMSTRPGISVICLANS-----------ETQLKTTLWAEVSKWLSLL 135 G GKT +L L+S P ++ A S + ++ L+ + Sbjct: 480 GTGKTVTAVEAILQLLSANPNARILACAPSNSAADLIAMRLRSLGESGLFRAYAPSRDRE 539 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 H + + +S L + K + + T +G + I Sbjct: 540 QVPHEL-LPFTYQNATGHFSVPLLSRM----KRFRAVVTTCVSANIIAGIGIPRGHYTHI 594 Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLS 234 DEA + ++ T + N +++ +P++L Sbjct: 595 FVDEAGQATEP--EVMIAIKTMADMNTNVVLSGDPKQLG 631 >gi|156933807|ref|YP_001437723.1| hypothetical protein ESA_01633 [Cronobacter sakazakii ATCC BAA-894] gi|156532061|gb|ABU76887.1| hypothetical protein ESA_01633 [Cronobacter sakazakii ATCC BAA-894] Length = 575 Score = 41.6 bits (96), Expect = 0.31, Method: Composition-based stats. Identities = 56/367 (15%), Positives = 95/367 (25%), Gaps = 56/367 (15%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP---GIS 109 WQ V S + + R GK+ L A V M G Sbjct: 84 DWQKFCFCVSFGWVRKSDGLRRFQEIYIEVP--RKNGKS-LIAASVGIYMFCADDEHGAE 140 Query: 110 VICLANSETQLKTTLWAE---VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166 V C A +E Q E V K +L P + G Sbjct: 141 VYCGATTEKQAFKVFEPERQMVQKLPALRKRFSIKPWAKKMTRPDGSVFAPIVGDPGDGD 200 Query: 167 KHYSTMCRTYSEERPDTF-------VGHHNTYGMAII----NDEASGTPDV---INLGIL 212 + Y E D G II D AS D + + Sbjct: 201 SPSCAIIDEYHEHATDALYTTMTTGQGAREQPLTLIITTAGYDIASPCYDKRSQVVEILE 260 Query: 213 GFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGI----DPSFHEG 268 G T+ + + + DDW + + + P F Sbjct: 261 GIRTDGANETIFGIIYTLDKD------------DDWTSEEAIRKANPNLGVSLKPEFLRA 308 Query: 269 IIARYGLDSDVTRVEVCGQ----FPQQDIDSFIPLNIIEEA-LNREPCPDPYAPLIMGCD 323 + ++ + + + E A + P +G D Sbjct: 309 K-QELAKTTPSQTNKILTKHFNLWVSSKAAFYNMQRWQEAADPSLTLADFEGEPCYLGID 367 Query: 324 IAEEGGDNTV--VVLRRGPVIEHLFD-----WSKTDLRTTNN-KISGLVEKYR---PDAI 372 +A + N V V +R ++H + W D + + ++ E+Y+ + Sbjct: 368 LASKLDLNAVVPVFMREIDGLKHFYCIGAQFWVPEDTVYSTDPQLKRTAERYQSFVNQGV 427 Query: 373 IIDANNT 379 +I + Sbjct: 428 LIPTDGA 434 >gi|195942183|ref|ZP_03087565.1| hypothetical protein Bbur8_04905 [Borrelia burgdorferi 80a] gi|219786709|ref|YP_002477434.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|219692709|gb|ACL33925.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|312148688|gb|ADQ31340.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] gi|312148897|gb|ADQ31544.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] gi|312201269|gb|ADQ44578.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 297] Length = 396 Score = 41.6 bits (96), Expect = 0.31, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 + RG GKT A + L + G + + + + ++ E+ + LS+ + +F + Sbjct: 26 SSRGTGKTYDIATVNLERKFSADGGDTLAIRKKKNKTTQSIHKEILELLSIYNLRKFFNI 85 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196 + + ++R F G H+T + + Sbjct: 86 SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124 Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254 +EA+ ++ + E+ + M+SNP S Y+ + N+ + Sbjct: 125 LEEANQFSADDIEMLIPTMREQGGRIY--MSSNPVPKSHWLYKRYLSNQDNPAVCIIKST 182 Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287 R ++ + + + Y + R+EV G+ Sbjct: 183 YRDNPFLNGGDVQAWLEKQRLAYHGNDIGFRIEVLGE 219 >gi|324504396|gb|ADY41899.1| ATP-dependent RNA helicase DDX20 [Ascaris suum] Length = 937 Score = 41.6 bits (96), Expect = 0.32, Method: Composition-based stats. Identities = 29/209 (13%), Positives = 61/209 (29%), Gaps = 19/209 (9%) Query: 74 NPEVFKGAISAGRGIGKTTLNAWLVLWLM-STRPGISVICLANSETQLKTTLWAEVSKWL 132 F + A G GKT + A + L + + R V+ +A + E++ + Sbjct: 53 GLMGFDMLVQAKSGTGKTLVFALMALEGLNAQRRQPQVMIIAPT---------REIAMQI 103 Query: 133 SLLPNK---HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189 ++ + + D+ G+ T R D +H Sbjct: 104 AVTVRRLAPPVIHVGVFVGGGRSVADDIKEIRKGVHIA-VGTTGRLCQLVNDDLLPTNH- 161 Query: 190 TYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWK 249 + DEA + + FL N + + G E + + Sbjct: 162 --VHLFVLDEADKLMEENFQKDINFLFSSLPNNKQMAVFSATYP-GDLDETLARYMKKAH 218 Query: 250 RFQIDTRTVEGID-PSFHEGIIARYGLDS 277 +++ V+ + + + G S Sbjct: 219 LIRLNAEDVQLLGIKQYVAMSYSEDGPTS 247 >gi|225621767|ref|YP_002724125.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] gi|225547658|gb|ACN93635.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] Length = 450 Score = 41.6 bits (96), Expect = 0.33, Method: Composition-based stats. Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDNPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294 F + +K + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNTATFKTYNFTTYDNVLLGKGFIEPQEKLY-KDIPTYKARVLLGEWIASIDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITQDYVFSSP--------IAYLDPAFSVGGDNTALCV 299 >gi|283786098|ref|YP_003365963.1| ATP-dependent acetyltransferase [Citrobacter rodentium ICC168] gi|282949552|emb|CBG89168.1| putative ATP-dependent acetyltransferase [Citrobacter rodentium ICC168] Length = 669 Score = 41.6 bits (96), Expect = 0.33, Method: Composition-based stats. Identities = 25/112 (22%), Positives = 38/112 (33%), Gaps = 17/112 (15%) Query: 18 LMWSD-EIKLSFSNFVLHFFP----------WGEKGT-PLEGFSAPRSWQLEFMEVVDAH 65 L WSD + NFV HF W + + + F +WQ E + Sbjct: 121 LRWSDCPQPIPTPNFVRHFCRVLLADGQTLCWRQPQSLSVTHFPGRPAWQSATGEPLPEQ 180 Query: 66 CLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE 117 A++A RG GK+ L L+ +S I A ++ Sbjct: 181 SAILAQLRQMPERIAAVTAARGRGKSALAGQLIA-HLSG----QAIVTAPTK 227 >gi|310005737|gb|ADP00124.1| DNA maturase beta subunit [Cyanophage NATL1A-7] Length = 577 Score = 41.6 bits (96), Expect = 0.35, Method: Composition-based stats. Identities = 25/124 (20%), Positives = 41/124 (33%), Gaps = 11/124 (8%) Query: 297 IPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL---RRGPVIEH-LFDWSKTD 352 +P + + + Y I D + G D T + G + H + + Sbjct: 324 LPGDYFYSPMQLQGEWSKYTETICSVDPSGRGSDETAAAYLSQKNGFIYLHEMRAYRDGY 383 Query: 353 LRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCR-N 411 T I +KY ++I+ N G L K+A+D+E R N Sbjct: 384 TDNTLLNILRGCQKYGVTKLVIETN-FGDGIVAELFKKHLQ-----NTKQAIDIEEVRAN 437 Query: 412 RRTE 415 R E Sbjct: 438 VRKE 441 >gi|260433350|ref|ZP_05787321.1| phage DNA Packaging Protein [Silicibacter lacuscaerulensis ITI-1157] gi|260417178|gb|EEX10437.1| phage DNA Packaging Protein [Silicibacter lacuscaerulensis ITI-1157] Length = 427 Score = 41.2 bits (95), Expect = 0.36, Method: Composition-based stats. Identities = 55/423 (13%), Positives = 109/423 (25%), Gaps = 69/423 (16%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTTLWAEVSKWL 132 I GRG GKT A W+ S G V + + Q++ + Sbjct: 35 IMGGRGAGKTRAGA---EWVRSMVEGAKPFDEGEARRVALVGETFDQVRDVM-------- 83 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 F + P S + +S P+ G Sbjct: 84 -------IFGDSGIMQCSPPDRRPQWKASERKLVWPNGAEAQAFSAHDPEGLRGPQFD-- 134 Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250 A DE + + L +T+ P R ++ P Sbjct: 135 -AAWVDELAKWKKAGETWDMLQFAL-RLGERPRVCVTTTP-RNVKVLKDLLAAPSTVM-T 190 Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 + SF + + ARY + + R E+ G + ++++ A R Sbjct: 191 HAPTEANRANLAESFLQEVRARY-AGTRLGRQELDGVLLADAEGALWTGSMLDGA--RVG 247 Query: 311 CPDPYAPLIMGCDIA---EEGGDNTVVVLRRGPVIEHLFDWS----------KTDLRTTN 357 +++ D A G D +V+ + DW Sbjct: 248 AVPELDRVVVALDPAVTGGSGADACGIVVVGAQLQGPPEDWRAYVLADRTVQGVGPAGWA 307 Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELH 417 ++++ + ++ + N G + + + + + + L+ Sbjct: 308 RAAIDAMDEFGAERLVAEVNQ-GGQLVEEVVRQVDPLVPFRAVRASRGKVARAEPVAALY 366 Query: 418 VKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 + A L + + + G S D D L++ Sbjct: 367 EQGRV-FHVAGLDALEEQMCQMTARGFE----------------GQGSPDRVDALVWALH 409 Query: 478 ENP 480 E Sbjct: 410 ELV 412 >gi|255945291|ref|XP_002563413.1| Pc20g09170 [Penicillium chrysogenum Wisconsin 54-1255] gi|211588148|emb|CAP86246.1| Pc20g09170 [Penicillium chrysogenum Wisconsin 54-1255] Length = 944 Score = 41.2 bits (95), Expect = 0.36, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 63/190 (33%), Gaps = 22/190 (11%) Query: 87 GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW-AEVSKWLS---LLPNKHWFE 142 G+GKT + A ++ +P +++ + + W +E+ ++ + H + Sbjct: 365 GMGKT-IQAVSLIMSDFPQPDPTLVIVPP----VALMQWVSEIKEYTDGKLKVLVYHNSD 419 Query: 143 MQSLSLHPAPWYSDVLHCS-----LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197 + L PA + I K R + + D+ H + ++ Sbjct: 420 AKVKRLTPAEIRKYDVIMISYASLESIYRKQEKGFSRGETMVKADSV--IHAVHYHRLVL 477 Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLS-GKFYEIFN----KPLDDWKRFQ 252 DEA AN W ++ P + G+F+ + KP + Q Sbjct: 478 DEAHSIKSRTTGVARACFALE-ANYKWCLSGTPVQNRIGEFFSLLRFLQVKPFACYFCKQ 536 Query: 253 IDTRTVEGID 262 D ++ Sbjct: 537 CDCEQLQWTS 546 >gi|319406198|emb|CBI79835.1| phage-related protein [Bartonella sp. AR 15-3] Length = 442 Score = 41.2 bits (95), Expect = 0.37, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 61/193 (31%), Gaps = 9/193 (4%) Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNAN--RFWIMTSNPRRLSGKFYEIFN-KPLDD 247 + DEA D ++ L E +T NP R + + F + Sbjct: 122 RILLCWVDEAEPVTDAAWQVLIPTLREEGKEWHSELWVTWNPCRENAAVEKRFRFTKDPN 181 Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF---IPLNIIEE 304 K +I+ R + A + + G++ ++ + L +E Sbjct: 182 IKGVEINWRDNPKFPAKLNRDRTADLEQRPEQYQHIWEGEYLLAMQGAYYQKLLLEAEQE 241 Query: 305 ALNREPCPDPYAPLIMGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTNNKIS 361 DP + + DI G D T + + + + D+ + + + I Sbjct: 242 GRITTVPRDPLIQVKIFWDIGGTGAKADATALWVAQFVGREIRVLDYYEAQGQPLSEHIG 301 Query: 362 GLVEKYRPDAIII 374 + K A+++ Sbjct: 302 WICHKGYEKALMV 314 >gi|182438394|ref|YP_001826113.1| hypothetical protein SGR_4601 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178466910|dbj|BAG21430.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 609 Score = 41.2 bits (95), Expect = 0.37, Method: Composition-based stats. Identities = 31/125 (24%), Positives = 42/125 (33%), Gaps = 24/125 (19%) Query: 5 LPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDA 64 +P +PE E + +F PWG G R+WQ ME Sbjct: 7 VPESPEPETVTTTTASH-HLSPAFPGRA----PWGTAG-------KLRAWQQGAME---- 50 Query: 65 HCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTL 124 V + A G GKTT L WL+ + +A +E LK Sbjct: 51 ---RYVQEQPRDFLAVATP---GAGKTTFALTLASWLLHHHVVQQITVVAPTEH-LKKQ- 102 Query: 125 WAEVS 129 WAE + Sbjct: 103 WAEAA 107 >gi|73748202|ref|YP_307441.1| hypothetical protein cbdb_A296 [Dehalococcoides sp. CBDB1] gi|73659918|emb|CAI82525.1| hypothetical protein cbdbA296 [Dehalococcoides sp. CBDB1] Length = 405 Score = 41.2 bits (95), Expect = 0.38, Method: Composition-based stats. Identities = 49/295 (16%), Positives = 86/295 (29%), Gaps = 59/295 (20%) Query: 154 YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILG 213 ++D+ H G + S E + VG NT + + DEA Sbjct: 72 FTDIYHTEGGYIIRLNQARAVFLSAEPSASVVG--NTAHLLLEVDEAQDVNKEKY----- 124 Query: 214 FLTERNANRFWIMTSNPRRLSGKFYEIFN-------------KPLDDWKRFQIDTRTVEG 260 + T+ L G ++ F+ + + F+ D V Sbjct: 125 ---SKEFKPMGATTNVTTVLYGTTWDSFSLLEEIKEQNIEKEQKDGLKRHFRYDWEAVAA 181 Query: 261 IDPSFHE---GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC---PDP 314 +P++ R G + + + P ++ PC P+ Sbjct: 182 HNPTYLAYALSEKERLGKNHPLFLAQYR-LLPVSGGGGMFSNEQLDLLKGNHPCQVYPEK 240 Query: 315 YAPLIMGCDIAEE-----GGDNTVVVLRRGPVIEHL----------------------FD 347 + G D+A E G T V LRR + + + Sbjct: 241 GKVYVAGLDLAGEDSQTGGISPTTVNLRRDSSVLTIAQLDYTFAKAPYNLPQVRLVCHYS 300 Query: 348 WSKTDLRTTNNKISGLVEK-YRPDAIIIDANNTGARTCDYLEM-LGYHVYRVLGQ 400 W T K+ L+ K ++ + +DA G +L LG + V Q Sbjct: 301 WQGTRHALLYEKLVELLGKVWKCRKVAVDATGLGQPVASFLRESLGSRILPVPFQ 355 >gi|149913871|ref|ZP_01902403.1| hypothetical protein RAZWK3B_17748 [Roseobacter sp. AzwK-3b] gi|149812155|gb|EDM71986.1| hypothetical protein RAZWK3B_17748 [Roseobacter sp. AzwK-3b] Length = 419 Score = 41.2 bits (95), Expect = 0.38, Method: Composition-based stats. Identities = 68/428 (15%), Positives = 125/428 (29%), Gaps = 83/428 (19%) Query: 82 ISAGRGIGKTTLNA-WLVLWLMSTRPGISVIC-----LANSETQLKTT-LWAEVSKWLSL 134 I GRG GKT A W+ + RP +C + + Q++ ++ E S ++ Sbjct: 27 ILGGRGAGKTRAGAEWVRAQVEGARPLSEGLCRRMALVGETIDQVREVMIFGE-SGIMAC 85 Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 P + Q+ + + +S P+ G Sbjct: 86 SPPDRRPDWQATR---------------KRLVWPNGAVAQAFSAHEPEALRGPQFDGA-- 128 Query: 195 IINDEA---SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF 251 DE D ++ G + + + R G ++ L+ Sbjct: 129 -WVDEMAKWKKARDTWDMLQFGLRLGDHPQ---VCITTTPRNVGVLKDL----LEQKSTV 180 Query: 252 QIDTRTVEG---IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308 T + SF E + ARY + + R E+ G + + + IE R Sbjct: 181 VTSAPTEANRAFLAQSFLEEVRARY-AGTRLGRQELDGVLLSEAEGALWTNSGIEAC--R 237 Query: 309 EPCPDPYAPLIMGCDI---AEEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTT- 356 +++ D G D +V+ + L D S R Sbjct: 238 VDNLPELDRIVVAIDPPVTGRAGSDECGIVVAGAVTRGPVQDWRAYVLADCSVGAARPLS 297 Query: 357 -NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRR 413 N +E + D ++ + N G + + V V ++ V R Sbjct: 298 WANAAISAMEHWGADRLVAEVNQGGDMVAQVIRQVDPLVPVKSVHARRGKVT-------R 350 Query: 414 TELHVKMADWLEFASLINHSG---LIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSD 470 E +A E + + G L + ++ + G S D D Sbjct: 351 AE---PVAALYEQGRVHHLRGLGTLEDQMCAMTARGFEGKG-------------SPDRVD 394 Query: 471 GLMYTFAE 478 L++ E Sbjct: 395 ALVWALTE 402 >gi|160898677|ref|YP_001564259.1| hypothetical protein Daci_3236 [Delftia acidovorans SPH-1] gi|160364261|gb|ABX35874.1| protein of unknown function DUF264 [Delftia acidovorans SPH-1] Length = 428 Score = 41.2 bits (95), Expect = 0.39, Method: Composition-based stats. Identities = 49/320 (15%), Positives = 92/320 (28%), Gaps = 38/320 (11%) Query: 75 PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSL 134 P F+ + AG G GKT + + P I+ A S Q++ + + Sbjct: 19 PHKFRAFV-AGFGSGKTWVGCSGLSAHAWEFPRINAGYFAPSYPQIRDIFFPTI------ 71 Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 H + +++ + + Y T S +RP++ VG + Sbjct: 72 EEVAHDWGLRTEI-------RESNKEVHLYSGRQYRTTVICRSMDRPESIVGFKIGQAL- 123 Query: 195 IINDE----ASGTPDVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIFNKPLDD- 247 DE A + I+ + +T+ P ++ F K + D Sbjct: 124 --VDELDVMAKQKAEQAWRKIIARMRYNVDGLKNGVDVTTTPEG-FKFTHQQFVKAVQDK 180 Query: 248 ------WKRFQIDT-RTVEGIDPSFHEGIIARYGLD-SDVTRVEVCGQFPQQDIDSFIPL 299 + Q T + + + + Y D + G F S P Sbjct: 181 PELAKLYGLIQASTFENAKNLPADYIPSLFDSYPKQLIDAY---LRGLFVNLTSGSVYP- 236 Query: 300 NIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK 359 + + + P+++G D VLR G + D Sbjct: 237 DFDRKLNHSFESLQEGEPVMLGMDFNRLHMAAVAYVLRDGWPVAVDEITDGRDTPYMARL 296 Query: 360 ISGLV-EKYRPDAIIIDANN 378 +K + DA+ Sbjct: 297 FRERYQDKGHAVTVYPDASG 316 >gi|329928970|ref|ZP_08282780.1| Tex-like protein N-terminal domain protein [Paenibacillus sp. HGF5] gi|328937222|gb|EGG33649.1| Tex-like protein N-terminal domain protein [Paenibacillus sp. HGF5] Length = 731 Score = 41.2 bits (95), Expect = 0.40, Method: Composition-based stats. Identities = 22/107 (20%), Positives = 35/107 (32%), Gaps = 9/107 (8%) Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVI 342 EV G+ ++ + I + + P ++G D A G VV G ++ Sbjct: 293 EVRGELTEKGENQAISIF-AGNLRSLLLQPPVKGRCVLGVDPAYRTGCKLAVVDDTGKLL 351 Query: 343 EHLFDWSK---TDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY 386 E + R K L+ KY I+I G T Sbjct: 352 EVAVTYPTPPANKKREAAAKFKELIAKYGIKLIVI-----GNGTASR 393 >gi|219872329|ref|YP_002476730.1| phage terminase, large subunit, pbsx family [Borrelia garinii PBr] gi|219694371|gb|ACL34896.1| phage terminase, large subunit, pbsx family [Borrelia garinii PBr] Length = 396 Score = 41.2 bits (95), Expect = 0.41, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 74/217 (34%), Gaps = 36/217 (16%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 + RG GKT A + L G + + + + ++ E+ + L++ + +F + Sbjct: 26 SSRGTGKTYDIATVNLERKFNPDGGDTLAIRKKKNKTTQSIHKEICELLNIYNLRKFFNI 85 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196 + + ++R F G H+T + + Sbjct: 86 SKSKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124 Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254 +EA+ ++ + E+ + M+SNP S Y+ + N+ + Sbjct: 125 LEEANQFTSEDIEMLIPTMREQGGRVY--MSSNPVPKSHWLYKRYLSNEDNPAVCIIKST 182 Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287 R ++ + + + Y + R+EV G+ Sbjct: 183 YRDNPFLNGGNVQAWLEKQKLAYHGNDIGFRIEVLGE 219 >gi|330015975|ref|ZP_08308363.1| putative ATPase subunit of terminase [Klebsiella sp. MS 92-3] gi|328529845|gb|EGF56736.1| putative ATPase subunit of terminase [Klebsiella sp. MS 92-3] Length = 575 Score = 41.2 bits (95), Expect = 0.41, Method: Composition-based stats. Identities = 37/269 (13%), Positives = 75/269 (27%), Gaps = 36/269 (13%) Query: 237 FYEIFNK---PLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI 293 + + + P W++ + + + + R D +F + Sbjct: 304 WKKTHSGVLYPDKTWRQI-VTIQDAINNGWDYTDIDEIRDENSPDEFENLYMCEFVKDGE 362 Query: 294 DSFIPLNIIEEALNREPCPDPYAP----------LIMGCDI--AEEGGDN-----TVVVL 336 +F ++ + + P + +G D GD TV L Sbjct: 363 SAFNLSQLLGCGADGYDDWPDWKPFASRPMGQREVWLGYDANGGSGNGDAGALSVTVPPL 422 Query: 337 RRGPVIE--HLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHV 394 G L + I E+Y I ID G + + Sbjct: 423 VAGGRFRTVELKQLRGLEFEQQAAVIKEAAERYNVTHIAIDGQGVGEAV--------WQI 474 Query: 395 YRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINH---SGLIQNLKSLKSFIVPNTG 451 + ++R L +KM + GL++ +++ +V G Sbjct: 475 VKNWFPAAICYQMSLSSKRA-LVLKMLQVIRAGRWEYDRSEQGLVRAFNAVRK-VVTPGG 532 Query: 452 ELAIESKRVKGAKSTDYSDGLMYTFAENP 480 + E+ R +G D + M + P Sbjct: 533 FITYETDRSRGVSHGDMAWATMLSIINEP 561 >gi|224796986|ref|YP_002642738.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi WI91-23] gi|224553700|gb|ACN55104.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi WI91-23] gi|312149848|gb|ADQ29915.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi N40] Length = 396 Score = 41.2 bits (95), Expect = 0.41, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 + RG GKT A + L + G + + + + ++ E+ + LS+ + +F + Sbjct: 26 SSRGTGKTYDIATVNLERKFSADGGDTLAIRKKKNKTTQSIHKEILELLSIYNLRKFFNI 85 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196 + + ++R F G H+T + + Sbjct: 86 SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124 Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254 +EA+ ++ + E+ + M+SNP S Y+ + N+ + Sbjct: 125 LEEANQFSADDIEMLVPTMREQGGRIY--MSSNPVPKSHWLYKRYLSNQDNPAVCIIKST 182 Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287 R ++ + + + Y + R+EV G+ Sbjct: 183 YRDNPFLNGGDVQAWLEKQRLAYHGNDIGFRIEVLGE 219 >gi|226234361|ref|YP_002775493.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 29805] gi|226201889|gb|ACO38473.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 29805] Length = 396 Score = 41.2 bits (95), Expect = 0.41, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 + RG GKT A + L + G + + + + ++ E+ + LS+ + +F + Sbjct: 26 SSRGTGKTYDIATVNLERKFSADGGDTLAIRKKKNKTTQSIHKEILELLSIYNLRKFFNI 85 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196 + + ++R F G H+T + + Sbjct: 86 SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124 Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254 +EA+ ++ + E+ + M+SNP S Y+ + N+ + Sbjct: 125 LEEANQFSADDIEMLVPTMREQGGRIY--MSSNPVPKSHWLYKRYLSNQDNPAVCIIKST 182 Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287 R ++ + + + Y + R+EV G+ Sbjct: 183 YRDNPFLNGGDVQAWLEKQRLAYHGNDIGFRIEVLGE 219 >gi|11496682|ref|NP_045481.1| hypothetical protein BBG21 [Borrelia burgdorferi B31] gi|218868779|ref|YP_002455248.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi ZS7] gi|224796961|ref|YP_002642637.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi WI91-23] gi|224985496|ref|YP_002642672.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 64b] gi|225548803|ref|YP_002724009.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 118a] gi|2690026|gb|AAC66069.1| predicted coding region BBG21 [Borrelia burgdorferi B31] gi|218165273|gb|ACK75330.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi ZS7] gi|223929545|gb|ACN24257.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 64b] gi|224554186|gb|ACN55578.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi WI91-23] gi|225546810|gb|ACN92808.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 118a] Length = 396 Score = 41.2 bits (95), Expect = 0.41, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143 + RG GKT A + L + G + + + + ++ E+ + LS+ + +F + Sbjct: 26 SSRGTGKTYDIATVNLERKFSADGGDTLAIRKKKNKTTQSIHKEILELLSIYNLRKFFNI 85 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196 + + ++R F G H+T + + Sbjct: 86 SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124 Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254 +EA+ ++ + E+ + M+SNP S Y+ + N+ + Sbjct: 125 LEEANQFSADDIEMLVPTMREQGGRIY--MSSNPVPKSHWLYKRYLSNQDNPAVCIIKST 182 Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287 R ++ + + + Y + R+EV G+ Sbjct: 183 YRDNPFLNGGDVQAWLEKQRLAYHGNDIGFRIEVLGE 219 >gi|257088841|ref|ZP_05583202.1| predicted protein [Enterococcus faecalis CH188] gi|256997653|gb|EEU84173.1| predicted protein [Enterococcus faecalis CH188] gi|315160590|gb|EFU04607.1| phage uncharacterized protein [Enterococcus faecalis TX0645] gi|315579436|gb|EFU91627.1| phage uncharacterized protein [Enterococcus faecalis TX0630] Length = 418 Score = 41.2 bits (95), Expect = 0.42, Method: Composition-based stats. Identities = 46/323 (14%), Positives = 95/323 (29%), Gaps = 35/323 (10%) Query: 86 RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145 RG KTT A + LM P ++I L ++T + E+ ++ + + +F+ Sbjct: 52 RGSFKTTTLAIAIALLMVLFPNKNIIFLRKTDT---DVV--EIILQVAKVLSSKYFKTLV 106 Query: 146 LSLH--PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 +L+ + + + G H +I D+ Sbjct: 107 FALYNVELVLLKETTTEIDTNLKTSSRGTSQLLGMGIYASLTGKHAD---IVITDDIVNI 163 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI---FNKPLDDWKRF---QIDTRT 257 D ++ + N + +G+F ++K K + D Sbjct: 164 KDRVSRA-----EREKTKLQYQELQNVKNRAGRFINTGTPWHKEDAISKMPNVKKFDCYE 218 Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317 ID + + + + + + F + N + Sbjct: 219 TGLIDKEQRKAL--QQSMTPSLFAANYELKHIADSESLFTAPTYTD-NTNLIYNGVAH-- 273 Query: 318 LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII 373 D A G D+T + + G +I W K + +I L + Y+ Sbjct: 274 ----IDAAYGGDDSTAFTIFKEQKDGTIIGFGKKWQKH-VDDCIPEILQLHQHYQAGTFY 328 Query: 374 IDANNTGARTCDYLEMLGYHVYR 396 + N +L G +V + Sbjct: 329 NETNGDKGYLAKHLIERGQYVQK 351 >gi|158300801|ref|XP_320633.4| AGAP011893-PA [Anopheles gambiae str. PEST] gi|157013336|gb|EAA00145.5| AGAP011893-PA [Anopheles gambiae str. PEST] Length = 607 Score = 41.2 bits (95), Expect = 0.42, Method: Composition-based stats. Identities = 46/286 (16%), Positives = 86/286 (30%), Gaps = 34/286 (11%) Query: 3 RELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVV 62 R L + + L ++ + +S +F FP E P W + Sbjct: 151 RSLKIEFPLNRLQYKLEYTALVHMSRLDFSSILFPKIESAKPTTPAKTFD-WFQSCI--- 206 Query: 63 DAHCLNSVNNPNPEVFKGAISAGR------GIGKTT--LNAWLVLWLMSTRPGISVICLA 114 A V + A A G GKT + A L +W M RP ++ A Sbjct: 207 -AENEQQTQAIKNIVNRTAYPAPYILFGPPGTGKTCTIVEAVLQIWKM--RPKSRILVTA 263 Query: 115 NS--------ETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166 S + LK ++ ++ S + M + + + + D Sbjct: 264 TSNYACNELAKRLLKYVTVNDLFRYFSQTSQRDINGMDLKVVQVSNMHYGIYETPAMQDF 323 Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFW-- 224 + T +G + I DE ++ L +G + N Sbjct: 324 VQTRILVCTVMTSGRLLQLGVDRSMYDYIFIDECGSCRELSALVPIGCVGTDTTNNRLQA 383 Query: 225 --IMTSNPRRLSGKFYEIFNKPLDD-----W--KRFQIDTRTVEGI 261 ++ +P +L +FY+ + D W + R + + Sbjct: 384 SVVLAGDPLQLGPQFYDAELRAKGDPTITHWAVNWHHLPNRKLPML 429 >gi|86138748|ref|ZP_01057320.1| terminase, large subunit, putative [Roseobacter sp. MED193] gi|85824395|gb|EAQ44598.1| terminase, large subunit, putative [Roseobacter sp. MED193] Length = 417 Score = 41.2 bits (95), Expect = 0.42, Method: Composition-based stats. Identities = 64/428 (14%), Positives = 117/428 (27%), Gaps = 89/428 (20%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTTLWAEVSKWL 132 I GRG GKT A W+ + G + + + Q++ + Sbjct: 25 ILGGRGAGKTRAGA---EWIRTQVEGATPLGPGRGRRLALIGETYDQVRDVM-------- 73 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 + S S W + M + +S P+ G Sbjct: 74 --ILGDSGILACSPSDRRPQWKAGERKLIWAN-----GAMAQAFSAHDPEALRGPQFDTA 126 Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250 DE + + L + R + + R + ++ P K Sbjct: 127 ---WADELAKWRRAREAWDMLQFSLRLGDDPR--VCVTTTPRNAALLRQLLASPSTV-KS 180 Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE----EAL 306 + PSF + ARY S + R E+ G I L+ +E A Sbjct: 181 HAATEANRANLAPSFLSEVRARY-AGSRLGRQELDG----------ILLSDVEGAIWRAA 229 Query: 307 NREPCPDPYAP----LIMGCDIA---EEGGDNTVVVL--------RRGPVIEHLFDWS-- 349 P AP +++ D A +G D +++ L D + Sbjct: 230 QLAELQVPTAPALDRIVVAVDPAVSSGKGSDACGIIVAGACLQGPVETWRAYVLADRTVQ 289 Query: 350 KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLE 407 + +++ D ++ + N GA + L + V + V Sbjct: 290 GVGPLAWAKAVIAAHQEFAADRVVAEVNQGGALVENLLRQIDPLVGFQPVHASRGKV--- 346 Query: 408 FCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTD 467 R E + + L + L + + + G S D Sbjct: 347 ----VRAEPVAALYEQHRVHHLPGLAELEEQMCQMSQQGFQGQG-------------SPD 389 Query: 468 YSDGLMYT 475 D L++ Sbjct: 390 RVDALVWA 397 >gi|256023437|ref|ZP_05437302.1| predicted type I site-specific deoxyribonuclease, HsdR family protein [Escherichia sp. 4_1_40B] Length = 1031 Score = 41.2 bits (95), Expect = 0.43, Method: Composition-based stats. Identities = 49/327 (14%), Positives = 99/327 (30%), Gaps = 44/327 (13%) Query: 86 RGIGKTTLNAWLVLWLMSTRPGISV-ICLANSE--TQLKTTLWAEVSKWLSLLPNKHWFE 142 +G GK+ WL W+ P V I +E Q+++ V++ + + Sbjct: 278 QGSGKSLTMVWLAKWIRENVPNSRVLIVTDRTELDEQIESVFMG-VNE--DIYRTSSGND 334 Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY--SEERPDTFVGHHNTYGMAIINDE- 199 + + HP PW L G S+ +E + + + DE Sbjct: 335 LIATLNHPNPWLICSLVHKFGRRSEAEDNAATDAFITELQQSLTKTFRAKGDLFVFVDEC 394 Query: 200 ------------ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD 247 + PD + +G G + + + + G + + Sbjct: 395 HRTQSGKLHNAMTAILPDALFIGFTGTPLMKKDKKKSV------EVFGPYIHTYKFDEAV 448 Query: 248 WKRFQIDTR------TVEGIDPSFHEGIIARYGLD-SDVTRVEVCGQFPQQDIDSFIPLN 300 +D R + S++ R ++ ++ + Sbjct: 449 ADGVVLDLRYEARDIDQYLTSEKKVDDWFEAKTRGLSNLARTQLKQKWGSMQ-KLLSSKS 507 Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360 +E+ +N P +M +G N ++V + + + K+ Sbjct: 508 RLEQIVNDILLDMDTRPRLM------DGRGNAMLVC--SSIYQACKVYEMFSQTELAGKV 559 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYL 387 + +V YRPDA I TG + L Sbjct: 560 A-IVTSYRPDAASIKGEETGEGLTEKL 585 >gi|327400267|ref|YP_004341106.1| hypothetical protein Arcve_0358 [Archaeoglobus veneficus SNP6] gi|327315775|gb|AEA46391.1| protein of unknown function DUF699 ATPase [Archaeoglobus veneficus SNP6] Length = 807 Score = 41.2 bits (95), Expect = 0.45, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 57/166 (34%), Gaps = 35/166 (21%) Query: 60 EVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMS------TRPGISVICL 113 +V + + E I+A RG GKT + + +L+S RP + ++ + Sbjct: 255 QVRVLQLFETFFDREKERKAVVITADRGRGKTAVLGIVTPYLISRMHRVLKRP-VRIMVV 313 Query: 114 ANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS----LSLHPAPWYSDVLHCSLGIDSKHY 169 A + ++T + + K L K++ +S ++ + + + K Y Sbjct: 314 APTPQAVQTY-FRFLKKALVRQGMKNYKVKESNGLITVINSKFARVEYVVPRRAMIEKDY 372 Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP-DVINLGILGF 214 + II DEA+G V+ G Sbjct: 373 AD----------------------IIIVDEAAGIDVPVLWQITEGA 396 >gi|148241989|ref|YP_001227146.1| hypothetical protein SynRCC307_0890 [Synechococcus sp. RCC307] gi|147850299|emb|CAK27793.1| Hypothetical protein SynRCC307_0890 [Synechococcus sp. RCC307] Length = 98 Score = 41.2 bits (95), Expect = 0.45, Method: Composition-based stats. Identities = 18/48 (37%), Positives = 24/48 (50%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS 129 + +GR GKT L + L T+PG V LA S Q K WA++ Sbjct: 25 VFSGRRFGKTRLMLTAGVELCLTKPGAKVFHLAPSRKQAKDIAWADLK 72 >gi|331238525|ref|XP_003331917.1| DNA repair protein RAD5 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] gi|309310907|gb|EFP87498.1| DNA repair protein RAD5 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] Length = 1036 Score = 41.2 bits (95), Expect = 0.45, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 63/206 (30%), Gaps = 16/206 (7%) Query: 38 WGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAG-----RGIGKTT 92 WG+ G +E ++ Q + +E+ + G S G G+GKT Sbjct: 395 WGDLGQKVEVVQPSKAEQPDGLELTLLPFQLEGLYWMKKQETGPWSGGVLADEMGMGKTI 454 Query: 93 LNAWLVLWLMSTRPGIS--VICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEMQSLSLH 149 L+L PG + +A + + W E+ K+ L W + Sbjct: 455 QTIALIL--SDRVPGHRKQTLVIAPT---VAIMQWRNEIEKFAKGLTVNVWHGGNRSNAQ 509 Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVG--HHNTYGMAIINDEASGTPDVI 207 DV+ S + + + + H +I DEA D Sbjct: 510 EEMENFDVVLTSFAVLESAFRRQNSGFRRKGQIIKESSLLHQINWHRVILDEAHNIKDRS 569 Query: 208 NLGILGFLTERNANRFWIMTSNPRRL 233 G E A W ++ P + Sbjct: 570 CNTAKGAF-ELKATYRWCLSGTPLQN 594 >gi|315618351|gb|EFU98939.1| type I site-specific deoxyribonuclease, HsdR family protein [Escherichia coli 3431] Length = 1028 Score = 40.9 bits (94), Expect = 0.47, Method: Composition-based stats. Identities = 49/327 (14%), Positives = 99/327 (30%), Gaps = 44/327 (13%) Query: 86 RGIGKTTLNAWLVLWLMSTRPGISV-ICLANSE--TQLKTTLWAEVSKWLSLLPNKHWFE 142 +G GK+ WL W+ P V I +E Q+++ V++ + + Sbjct: 275 QGSGKSLTMVWLAKWIRENVPNSRVLIVTDRTELDEQIESVFMG-VNE--DIYRTSSGND 331 Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY--SEERPDTFVGHHNTYGMAIINDE- 199 + + HP PW L G S+ +E + + + DE Sbjct: 332 LIATLNHPNPWLICSLVHKFGRRSEAEDNAATDAFITELQQSLTKTFRAKGDLFVFVDEC 391 Query: 200 ------------ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD 247 + PD + +G G + + + + G + + Sbjct: 392 HRTQSGKLHNAMTAILPDALFIGFTGTPLMKKDKKKSV------EVFGPYIHTYKFDEAV 445 Query: 248 WKRFQIDTR------TVEGIDPSFHEGIIARYGLD-SDVTRVEVCGQFPQQDIDSFIPLN 300 +D R + S++ R ++ ++ + Sbjct: 446 ADGVVLDLRYEARDIDQYLTSEKKVDDWFEAKTRGLSNLARTQLKQKWGSMQ-KLLSSKS 504 Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360 +E+ +N P +M +G N ++V + + + K+ Sbjct: 505 RLEQIVNDILLDMDTRPRLM------DGRGNAMLVC--SSIYQACKVYEMFSQTELAGKV 556 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYL 387 + +V YRPDA I TG + L Sbjct: 557 A-IVTSYRPDAASIKGEETGEGLTEKL 582 >gi|290954633|ref|YP_003485815.1| helicase-like protein [Streptomyces scabiei 87.22] gi|290963375|ref|YP_003494557.1| helicase-like protein [Streptomyces scabiei 87.22] gi|260644159|emb|CBG67232.1| putative helicase-like protein [Streptomyces scabiei 87.22] gi|260652901|emb|CBG76036.1| putative helicase-like protein [Streptomyces scabiei 87.22] Length = 889 Score = 40.9 bits (94), Expect = 0.47, Method: Composition-based stats. Identities = 10/54 (18%), Positives = 22/54 (40%), Gaps = 3/54 (5%) Query: 67 LNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 ++ ++ P+ +G I + G GKT + A + P ++ + L Sbjct: 23 FSARSSVPPQGARGTIVSATGSGKTIMAAASA---LECFPEGRILVTVPTLDLL 73 >gi|326779045|ref|ZP_08238310.1| type III restriction protein res subunit [Streptomyces cf. griseus XylebKG-1] gi|326659378|gb|EGE44224.1| type III restriction protein res subunit [Streptomyces cf. griseus XylebKG-1] Length = 609 Score = 40.9 bits (94), Expect = 0.48, Method: Composition-based stats. Identities = 31/124 (25%), Positives = 41/124 (33%), Gaps = 24/124 (19%) Query: 6 PTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAH 65 P +PE E + +F PWG G R+WQ ME Sbjct: 8 PESPEPETVTTTTASH-HLSPAFPGRA----PWGTAG-------KLRAWQQGAME----- 50 Query: 66 CLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 V + A G GKTT L WL+ + +A +E LK W Sbjct: 51 --RYVQEQPRDFLAVATP---GAGKTTFALTLASWLLHHHVVQQITVVAPTEH-LKKQ-W 103 Query: 126 AEVS 129 AE + Sbjct: 104 AEAA 107 >gi|156847104|ref|XP_001646437.1| hypothetical protein Kpol_1048p9 [Vanderwaltozyma polyspora DSM 70294] gi|156117114|gb|EDO18579.1| hypothetical protein Kpol_1048p9 [Vanderwaltozyma polyspora DSM 70294] Length = 1055 Score = 40.9 bits (94), Expect = 0.48, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 48/134 (35%), Gaps = 10/134 (7%) Query: 80 GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKH 139 ++AGRG GK+ + +S ++ + S LKT L+ + K L + Sbjct: 279 VTLTAGRGRGKSAALGISIAAAVS-HGYSNIFVTSPSPENLKT-LFEFIFKAFDALGYQE 336 Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + + + ++ + D H T+ ++ ++ DE Sbjct: 337 HIDYDIIQSTNPQFNKAIVRVDIKRD--HRQTIQYIMPQDHQVLGQAE------LVVIDE 388 Query: 200 ASGTPDVINLGILG 213 A+ P I +LG Sbjct: 389 AAAIPLPIVKKLLG 402 >gi|49476071|ref|YP_034112.1| phage related protein [Bartonella henselae str. Houston-1] gi|49238879|emb|CAF28172.1| phage related protein [Bartonella henselae str. Houston-1] Length = 402 Score = 40.9 bits (94), Expect = 0.49, Method: Composition-based stats. Identities = 35/257 (13%), Positives = 79/257 (30%), Gaps = 13/257 (5%) Query: 129 SKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLG-IDSKHYSTMCRTYSEERPDTFVGH 187 ++ N+ E ++ P+ D I SK + +R Sbjct: 21 ARQFQNSLNESSLEEIKRAIESYPFLQDYYEIGDKYIKSKDGRIVYVFAGLDR--NIASI 78 Query: 188 HNTYGMAI-INDEASGTPDVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFNK- 243 + + + DEA + ++ L E N +T NP R + + F Sbjct: 79 KSMGRVFLCWVDEAEPVTETAWQTLIPTLREEGNDWNAELWVTWNPCRENAPVEKRFRNV 138 Query: 244 PLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303 K +I R + A + G++ Q ++ ++E Sbjct: 139 NNPHIKGAEITWRDNPQFPEKLNRDRKADLEQRPEHYNHIWEGEYLQTVEGAYYQKALLE 198 Query: 304 EALNREPCPDPYAP---LIMGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTN 357 + P P + + DI G D T + + + + D+ + + + Sbjct: 199 ASREGRITTVPRDPLMQIRIFWDIGGTGAKADATALWVAQFVGREIRVLDYYEAQGQPLS 258 Query: 358 NKISGLVEKYRPDAIII 374 + + ++ A+++ Sbjct: 259 EHVGWVFQRGYDKALMV 275 >gi|6467533|gb|AAF13179.1|AF181080_1 putative gene transfer agent large terminase [Rhodobacter capsulatus] Length = 393 Score = 40.9 bits (94), Expect = 0.50, Method: Composition-based stats. Identities = 66/419 (15%), Positives = 120/419 (28%), Gaps = 68/419 (16%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTTLWAEVSKWLSL 134 GRG GKT A W+ G V + + Q++ + Sbjct: 2 GGRGAGKTRAGA---EWVRMQVEGAGPADAGPAHRVALVGETFDQVRDVM---------- 48 Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 F + P + + YS + P+ G A Sbjct: 49 -----IFGESGILACSPPDRRPEWEATKRRLVWANGATAQAYSAQEPEALRGPQFD---A 100 Query: 195 IINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 DE + + + L + ++T+ P R G I N P Sbjct: 101 AWVDELAKWRRAEETWDMLQFAL-RLGKHPQQVITTTP-RNVGVLKAILNNPSTV-VTHA 157 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCP 312 + SF + ARY + + R E+ G + + +E R P Sbjct: 158 PTEANRAYLAESFLAEVQARY-AGTRLGRQELEGVLLEDVEGALWTTAQLEGL--RLASP 214 Query: 313 DPYAPLIMGCDIA---EEGGDNTVVVL--------RRGPVIEHLFDWS-KTDLRTTNNKI 360 +++ D A G D +V+ + L D S + Sbjct: 215 PAMDRVVVALDPAVTGGAGSDECGIVVAGAVTRGPVQDWRAFVLEDASVRGRPTDWARAA 274 Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420 +E++ + ++ + N G L + V +A+ ++ R E + Sbjct: 275 IAAMERWGAEKLVAEVNQGGEMVESVLRQIDPLV-----PFKALRASRGKSARAE---PV 326 Query: 421 ADWLEFASLINH-SGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478 A E + + G + L+ + + G G S D D L++ E Sbjct: 327 AALYEQGRVKHCRDGRLGALED-QMCRMTVRGY--------AGKGSPDRVDALVWAMTE 376 >gi|148548588|ref|YP_001268690.1| hypothetical protein Pput_3380 [Pseudomonas putida F1] gi|148512646|gb|ABQ79506.1| protein of unknown function DUF264 [Pseudomonas putida F1] Length = 433 Score = 40.9 bits (94), Expect = 0.51, Method: Composition-based stats. Identities = 55/303 (18%), Positives = 93/303 (30%), Gaps = 43/303 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVS-KWLSLLPNKH 139 AG G GKT + + + P I A + Q++ + EV+ W + K Sbjct: 23 AGFGSGKTWVGCAALCKHVWEWPRIDSGYFAPTYPQIRDIFFPTIEEVAFDWGLKVKTKE 82 Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 SD +T+CR S E+P T VG + + DE Sbjct: 83 ---------------SDKEVEFYSGGQYRSTTICR--SMEKPQTIVGFKIGHAL---VDE 122 Query: 200 ASGTP----DVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIFNKPL-------D 246 P + I+ + +T+ P Y+ F K L Sbjct: 123 LDVLPALKAEHAWRKIIARMRYNVPGLKNGVDVTTTPEG-FKFVYQQFVKQLREKPALQG 181 Query: 247 DWKRFQIDTRTVEG-IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305 + Q T E + P + ++ Y + + + GQF + S I + Sbjct: 182 MYGLVQASTFDNELNLPPDYIPSLMESY--PAQLILAYLNGQFVNLNAGS-IYHAYDRKL 238 Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD-WSKTDLRTTNNKISGLV 364 + +P PL +G D V V R + + D +I Sbjct: 239 NSCFDTVEPGEPLFIGMDFNVGKMAAIVHVKRPDGKPRAVDELIDGFDTPDMIRRIKERY 298 Query: 365 EKY 367 ++ Sbjct: 299 WRH 301 >gi|51557524|ref|YP_068358.1| DNA packaging terminase subunit 1 [Suid herpesvirus 1] gi|40253983|tpg|DAA02178.1| TPA_exp: UL15 protein [Suid herpesvirus 1] Length = 735 Score = 40.9 bits (94), Expect = 0.52, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 49/153 (32%), Gaps = 24/153 (15%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS----KWLSLLPNKHWFEMQ 144 GKT L+ ++T GI V A+ + + E+ +W H Sbjct: 277 GKTWFLVPLIALALATFRGIRVGYTAHIRKATEPV-FEEIHARLRRWCRDARVDHVKGEN 335 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204 P S ++ S + G + + + DEA+ Sbjct: 336 ITVTFPDGARSTIVF----------------ASSHNTNGIRGQ--DFNLLFV-DEANFIR 376 Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 ILGF+ + + ++ ++N + S F Sbjct: 377 PDAVQTILGFMNQASCKIIFVSSTNTGKASTSF 409 >gi|28395422|gb|AAO38880.1| UL15 [Suid herpesvirus 1] Length = 753 Score = 40.9 bits (94), Expect = 0.52, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 49/153 (32%), Gaps = 24/153 (15%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS----KWLSLLPNKHWFEMQ 144 GKT L+ ++T GI V A+ + + E+ +W H Sbjct: 293 GKTWFLVPLIALALATFRGIRVGYTAHIRKATEPV-FEEIHARLRRWCRDARVDHVKGEN 351 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204 P S ++ S + G + + + DEA+ Sbjct: 352 ITVTFPDGARSTIVF----------------ASSHNTNGIRGQ--DFNLLFV-DEANFIR 392 Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 ILGF+ + + ++ ++N + S F Sbjct: 393 PDAVQTILGFMNQASCKIIFVSSTNTGKASTSF 425 >gi|330989588|gb|EGH87691.1| hypothetical protein PLA107_31509 [Pseudomonas syringae pv. lachrymans str. M301315] Length = 433 Score = 40.9 bits (94), Expect = 0.53, Method: Composition-based stats. Identities = 54/303 (17%), Positives = 90/303 (29%), Gaps = 43/303 (14%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVS-KWLSLLPNKH 139 AG G GKT + + + P I+ A + Q++ + EV+ W + K Sbjct: 23 AGFGSGKTWVGCAGICKHVWEWPRINSGYFAPTYPQIRDIFFPTIEEVAFDWGLKVKTKE 82 Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 SD +T+CR S E+P T VG + + DE Sbjct: 83 ---------------SDKEVEFYSGGQYRSTTICR--SMEKPQTIVGFKIGHAL---VDE 122 Query: 200 ASGTP----DVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIFNKPL-------D 246 P + I+ + +T+ P Y+ F K L Sbjct: 123 LDVLPALKAEHAWRKIIARMRYNAPGLKNGVDVTTTPEG-FKFVYQQFVKQLREKPGMQG 181 Query: 247 DWKRFQIDTRTVEG-IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305 + Q T E + P + ++ Y + + GQF + S I + Sbjct: 182 MYGLVQASTFDNELNLPPDYIPSLMESY--PPQLILAYLNGQFVNLNAGS-IYHAYDRKL 238 Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW-SKTDLRTTNNKISGLV 364 PL +G D V R + ++ D +I Sbjct: 239 NGCFDSVQDGEPLFIGMDFNVGKMAAITHVKRADGKPRAVDEFIDGFDTPDMIRRIKERY 298 Query: 365 EKY 367 +Y Sbjct: 299 WRY 301 >gi|307940746|gb|ADN95987.1| polyprotein [Chionodraco hamatus] Length = 2968 Score = 40.9 bits (94), Expect = 0.53, Method: Composition-based stats. Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 7/69 (10%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWL------VLWLMSTRPGI 108 Q+ + CL+ ++ NP+ F I+ G G GK+ L L +L + P Sbjct: 2284 QMSIFYQIRQWCLDKISGKNPDPFHVFITGGAGTGKSHLIKALQYETTRLLSPLCDHPDS 2343 Query: 109 S-VICLANS 116 V+ A + Sbjct: 2344 VCVLLTAPT 2352 >gi|294677220|ref|YP_003577835.1| terminase-like family protein [Rhodobacter capsulatus SB 1003] gi|294476040|gb|ADE85428.1| terminase-like family protein [Rhodobacter capsulatus SB 1003] Length = 455 Score = 40.9 bits (94), Expect = 0.53, Method: Composition-based stats. Identities = 67/421 (15%), Positives = 121/421 (28%), Gaps = 68/421 (16%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTTLWAEVSKWL 132 I GRG GKT A W+ G V + + Q++ + Sbjct: 62 IMGGRGAGKTRAGA---EWVRMQVEGAGPADAGPAHRVALVGETFDQVRDVM-------- 110 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192 F + P + + YS + P+ G Sbjct: 111 -------IFGESGILACSPPDRRPEWEATKRRLVWANGATAQAYSAQEPEALRGPQFD-- 161 Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250 A DE + + + L + ++T+ P R G I N P Sbjct: 162 -AAWVDELAKWRRAEETWDMLQFAL-RLGKHPQQVITTTP-RNVGVLKAILNNPSTV-VT 217 Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 + SF + ARY + + R E+ G + + +E R Sbjct: 218 HAPTEANRAYLAESFLAEVQARY-AGTRLGRQELEGVLLEDVEGALWTTAQLEGL--RLA 274 Query: 311 CPDPYAPLIMGCDIA---EEGGDNTVVVL--------RRGPVIEHLFDWS-KTDLRTTNN 358 P +++ D A G D +V+ + L D S + Sbjct: 275 SPPAMDRVVVALDPAVTGGAGSDECGIVVAGAVTRGPVQDWRAFVLEDASVRGRPTDWAR 334 Query: 359 KISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHV 418 +E++ + ++ + N G L + V +A+ ++ R E Sbjct: 335 AAIAAMERWGAEKLVAEVNQGGEMVESVLRQIDPLV-----PFKALRASRGKSARAE--- 386 Query: 419 KMADWLEFASLINH-SGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477 +A E + + G + L+ + + G G S D D L++ Sbjct: 387 PVAALYEQGRVKHCRDGRLGALED-QMCRMTVRGY--------AGKGSPDRVDALVWAMT 437 Query: 478 E 478 E Sbjct: 438 E 438 >gi|225683146|gb|EEH21430.1| activating signal cointegrator 1 complex subunit 3 [Paracoccidioides brasiliensis Pb03] Length = 2011 Score = 40.9 bits (94), Expect = 0.54, Method: Composition-based stats. Identities = 24/118 (20%), Positives = 40/118 (33%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W RPG V+ +A L E V W L + Sbjct: 1163 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAP-----MKALVRERVHDWKRRLTVPMGLK 1217 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + I + + + R++ G+ + II DE Sbjct: 1218 LVELTGDNTPDTKTIRDSDIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1268 >gi|219846951|ref|YP_002333526.2| DNA packaging terminase subunit 1 [Equid herpesvirus 9] gi|226423816|dbj|BAH02470.2| DNA packaging protein [Equid herpesvirus 9] Length = 734 Score = 40.9 bits (94), Expect = 0.55, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 52/153 (33%), Gaps = 24/153 (15%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLAN----SETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144 GKT L+ ++T GI + A+ +E + + A + +W P H Sbjct: 264 GKTWFLVPLIALALATFKGIKIGYTAHIRKATEP-VFDEIGARLRQWFGNSPVDHVKGEN 322 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204 P S ++ S + G + + + DEA+ Sbjct: 323 ISFSFPDGSKSTIVF----------------ASSHNTNGIRGQ--DFNLLFV-DEANFIR 363 Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 I+GFL + N ++ ++N + S F Sbjct: 364 PEAVQTIIGFLNQTNCKIIFVSSTNTGKASTSF 396 >gi|38640180|ref|NP_944136.1| Dda DNA helicase [Aeromonas phage Aeh1] gi|33414865|gb|AAQ17908.1| Dda DNA helicase [Aeromonas phage Aeh1] Length = 454 Score = 40.9 bits (94), Expect = 0.55, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 50/156 (32%), Gaps = 28/156 (17%) Query: 58 FMEVVDAHCLNSVNNPNPEVFK-GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANS 116 +++ C + + K IS G GK+ L L+ L+ G + C A + Sbjct: 8 LAKIILTDCQKTAIDAVLTDKKHITISGPAGSGKSFLTKILIQKLLDLNSGAVITC-APT 66 Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY 176 Q K L + + + +H L I Y + R + Sbjct: 67 -HQAKIVL-----------------------SKMSGFTASTIHSVLKIHPDTYEDV-REF 101 Query: 177 SEERPDTFVGHHNTYGMAIINDEASGTPDVINLGIL 212 + + D +I DEAS + + +L Sbjct: 102 KQSKSDK-AKEDLKAVRYLIVDEASMVDNDLFEILL 136 >gi|9629774|ref|NP_045262.1| DNA packaging terminase subunit 1 [Equid herpesvirus 4] gi|2605992|gb|AAC59564.1| 47/44 [Equid herpesvirus 4] Length = 734 Score = 40.9 bits (94), Expect = 0.55, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 52/153 (33%), Gaps = 24/153 (15%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLAN----SETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144 GKT L+ ++T GI + A+ +E + + A + +W P H Sbjct: 264 GKTWFLVPLIALALATFKGIKIGYTAHIRKATEP-VFDEIGARLRQWFGNSPVDHVKGEN 322 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204 P S ++ S + G + + + DEA+ Sbjct: 323 ISFSFPDGSKSTIVF----------------ASSHNTNGIRGQ--DFNLLFV-DEANFIR 363 Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 I+GFL + N ++ ++N + S F Sbjct: 364 PEAVQTIIGFLNQTNCKIIFVSSTNTGKASTSF 396 >gi|50313286|ref|YP_053090.1| DNA packaging terminase subunit 1 [Equid herpesvirus 1] gi|139648|sp|P28969|TRM3_EHV1B RecName: Full=Tripartite terminase subunit UL15 homolog; AltName: Full=DNA-packaging protein 44; AltName: Full=Terminase large subunit gi|59798996|sp|P84396|TRM3_EHV1V RecName: Full=Tripartite terminase subunit UL15 homolog; AltName: Full=DNA-packaging protein 44; AltName: Full=Terminase large subunit gi|42795172|gb|AAS45929.1| putative terminase [Equid herpesvirus 1] gi|49617029|gb|AAT67302.1| DNA packaging protein [Equid herpesvirus 1] Length = 734 Score = 40.9 bits (94), Expect = 0.55, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 52/153 (33%), Gaps = 24/153 (15%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLAN----SETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144 GKT L+ ++T GI + A+ +E + + A + +W P H Sbjct: 264 GKTWFLVPLIALALATFKGIKIGYTAHIRKATEP-VFDEIGARLRQWFGNSPVDHVKGEN 322 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204 P S ++ S + G + + + DEA+ Sbjct: 323 ISFSFPDGSKSTIVF----------------ASSHNTNGIRGQ--DFNLLFV-DEANFIR 363 Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 I+GFL + N ++ ++N + S F Sbjct: 364 PEAVQTIIGFLNQTNCKIIFVSSTNTGKASTSF 396 >gi|116196286|ref|XP_001223955.1| hypothetical protein CHGG_04741 [Chaetomium globosum CBS 148.51] gi|88180654|gb|EAQ88122.1| hypothetical protein CHGG_04741 [Chaetomium globosum CBS 148.51] Length = 2013 Score = 40.9 bits (94), Expect = 0.56, Method: Composition-based stats. Identities = 28/152 (18%), Positives = 48/152 (31%), Gaps = 19/152 (12%) Query: 55 QLEFMEVVDAHCLNSVNNPN-----PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGIS 109 LE + H N + + + G GKT + W RPG Sbjct: 1135 ALEEIYAQRFHFFNPMQTQLFHTLYHRPANVLLGSPTGSGKTVAAELAMWWAFRERPGSK 1194 Query: 110 VICLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI-DSK 167 V+ +A L E V W + L ++ L+ P + + I + Sbjct: 1195 VVYIAP-----MKALVRERVKDWGARLAKPLGLKLVELTGDNTPDTRTIQDADIIITTPE 1249 Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + + R++ G+ + II DE Sbjct: 1250 KWDGISRSWQT------RGYVRKVSLVII-DE 1274 >gi|317499861|ref|ZP_07958099.1| pbsx family Phage terminase [Lachnospiraceae bacterium 8_1_57FAA] gi|316898763|gb|EFV20796.1| pbsx family Phage terminase [Lachnospiraceae bacterium 8_1_57FAA] Length = 428 Score = 40.9 bits (94), Expect = 0.57, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 64/207 (30%), Gaps = 28/207 (13%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD------ 247 + +E S ILG L + I+++NP Y+ F + Sbjct: 121 IVWIEECSEVKYAGFKEILGRLRHPTLSNHIILSTNPVSKGNWCYKYFFQDKKKKVFVLD 180 Query: 248 ----------------WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQ 291 + +D + + E + D D+ RV G+F Sbjct: 181 DEKLYKERTVVVGNTYYHHSTVD--DNFFVPKEYVEQLDDLQTHDPDLYRVARQGRFGVN 238 Query: 292 DIDSFIPLNIIEEA--LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS 349 F P ++E A + +E G D N + + + L+ + Sbjct: 239 GSLVF-PQFVVEPANQVEKEIKAIRTPLEKNGMDFGFVTSYNAALRMIVDHDEKILYIYR 297 Query: 350 K-TDLRTTNNKISGLVEKYRPDAIIID 375 + T+ +I+ ++ ++ I D Sbjct: 298 EYYSRNKTDPEIAEDMKDWKDIVIKAD 324 >gi|189913376|ref|YP_001964605.1| ATP-dependent RNA helicase, DEAD-box family (DeaD) [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167781444|gb|ABZ99741.1| ATP-dependent RNA helicase, DEAD-box family (DeaD) [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 534 Score = 40.9 bits (94), Expect = 0.57, Method: Composition-based stats. Identities = 42/270 (15%), Positives = 73/270 (27%), Gaps = 41/270 (15%) Query: 21 SDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKG 80 E+ F +F L P +G GF +P Q + + +V Sbjct: 10 DTEVGNDFQSFGLR--PEILQGITEAGFESPSPIQKQAIPLVLEGKDLIAQAQT------ 61 Query: 81 AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET---QLKTTLWAEVSKWLSLLPN 137 G GKT L + G+ V+ L + Q+ L+ Sbjct: 62 ------GTGKTAAYGLPCLNRIKVEDGMQVLVLTPTRELALQVSDELFK----------L 105 Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197 +++ +++ YS + +T R + + +I Sbjct: 106 GKHLGIKTTTIYGGSSYSKQITQVAKGAQVAVATPGRLLDLLKGKELKNFKPSM---VIL 162 Query: 198 DEAS-----GTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 DEA G D I T+R F P + Y+ + Sbjct: 163 DEADEMLDMGFMDDIESIFNLLPTKRQTLLFSATMPEPIKKLASKYQTHP------AHVK 216 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRV 282 I + II + V R+ Sbjct: 217 IAATEKSSKNIEQVYYIIDEAEREIAVVRI 246 >gi|189913047|ref|YP_001964936.1| ATP-dependent RNA helicase (superfamily II) [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167777723|gb|ABZ96023.1| ATP-dependent RNA helicase (superfamily II) [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] Length = 529 Score = 40.9 bits (94), Expect = 0.57, Method: Composition-based stats. Identities = 42/270 (15%), Positives = 73/270 (27%), Gaps = 41/270 (15%) Query: 21 SDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKG 80 E+ F +F L P +G GF +P Q + + +V Sbjct: 5 DTEVGNDFQSFGLR--PEILQGITEAGFESPSPIQKQAIPLVLEGKDLIAQAQT------ 56 Query: 81 AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET---QLKTTLWAEVSKWLSLLPN 137 G GKT L + G+ V+ L + Q+ L+ Sbjct: 57 ------GTGKTAAYGLPCLNRIKVEDGMQVLVLTPTRELALQVSDELFK----------L 100 Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197 +++ +++ YS + +T R + + +I Sbjct: 101 GKHLGIKTTTIYGGSSYSKQITQVAKGAQVAVATPGRLLDLLKGKELKNFKPSM---VIL 157 Query: 198 DEAS-----GTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252 DEA G D I T+R F P + Y+ + Sbjct: 158 DEADEMLDMGFMDDIESIFNLLPTKRQTLLFSATMPEPIKKLASKYQTHP------AHVK 211 Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRV 282 I + II + V R+ Sbjct: 212 IAATEKSSKNIEQVYYIIDEAEREIAVVRI 241 >gi|315649164|ref|ZP_07902254.1| Tex-like protein protein-like protein [Paenibacillus vortex V453] gi|315275383|gb|EFU38741.1| Tex-like protein protein-like protein [Paenibacillus vortex V453] Length = 737 Score = 40.9 bits (94), Expect = 0.58, Method: Composition-based stats. Identities = 21/107 (19%), Positives = 35/107 (32%), Gaps = 9/107 (8%) Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVI 342 EV G+ ++ + I + + P ++G D A G VV G ++ Sbjct: 300 EVRGELTEKGENQAISIF-AGNLRSLLLQPPVKGRCVLGVDPAYRTGCKLAVVDDTGKLL 358 Query: 343 EHLFDWSK---TDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY 386 E + + K L+ KY I+I G T Sbjct: 359 EVAVTYPTPPANKRQEAAAKFKQLIAKYGIKLIVI-----GNGTASR 400 >gi|116625332|ref|YP_827488.1| hypothetical protein Acid_6277 [Candidatus Solibacter usitatus Ellin6076] gi|116228494|gb|ABJ87203.1| hypothetical protein Acid_6277 [Candidatus Solibacter usitatus Ellin6076] Length = 212 Score = 40.5 bits (93), Expect = 0.60, Method: Composition-based stats. Identities = 23/140 (16%), Positives = 45/140 (32%), Gaps = 13/140 (9%) Query: 323 DIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGAR 382 D A T + LR V + K+ + P +++DA GA Sbjct: 33 DPATYEFRKT-ITLRLRHVERIPLATEYVQVVERVAKVMRKLGAQGPAHLVVDATGVGAP 91 Query: 383 TCDYLEMLG-----YHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE------FASLIN 431 + L G + V G + R + +L V + E L+ Sbjct: 92 VVELLRRAGMGCRLWPVSITGGPAEGYGDGYYRVPKRDLVVGLQVMFEQGALEIAGGLVE 151 Query: 432 HSGLIQNLKSLKSFIVPNTG 451 + L++ + ++ + + G Sbjct: 152 RAALVKEMTDMRV-KMTSRG 170 >gi|261409036|ref|YP_003245277.1| Tex-like protein [Paenibacillus sp. Y412MC10] gi|261285499|gb|ACX67470.1| Tex-like protein protein-like protein [Paenibacillus sp. Y412MC10] Length = 740 Score = 40.5 bits (93), Expect = 0.61, Method: Composition-based stats. Identities = 22/107 (20%), Positives = 35/107 (32%), Gaps = 9/107 (8%) Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVI 342 EV G+ ++ + I + + P ++G D A G VV G ++ Sbjct: 300 EVRGELTEKGENQAISIF-AGNLRSLLLQPPVKGRRVLGVDPAYRTGCKLAVVDDTGKLL 358 Query: 343 EHLFDWSK---TDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY 386 E + R K L+ KY I+I G T Sbjct: 359 EVAVTYPTPPANKRREAAAKFKELIAKYGIKLIVI-----GNGTASR 400 >gi|72021085|ref|XP_793570.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] gi|115928806|ref|XP_001188414.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] Length = 1117 Score = 40.5 bits (93), Expect = 0.61, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 39/129 (30%), Gaps = 11/129 (8%) Query: 82 ISAGRGIGKTTLNAWLVLWLM-STRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140 + A G GKT + + + L + T P V+ LA + E++ + Sbjct: 8 VQAKSGTGKTCVFSVIALEGIDLTNPSTQVLILAPT---------REIAVQIQDTIRAIG 58 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 EM+ L H + + H + ++ + + DEA Sbjct: 59 CEMEGLRSHVFIGGTLFGPDRQKLKKCHIAVGTPG-RIKQLIEYEVLKTGTIRLFVLDEA 117 Query: 201 SGTPDVINL 209 D Sbjct: 118 DKLLDDTFQ 126 >gi|209694357|ref|YP_002262285.1| putative bacteriophage terminase [Aliivibrio salmonicida LFI1238] gi|208008308|emb|CAQ78458.1| putative bacteriophage terminase [Aliivibrio salmonicida LFI1238] Length = 598 Score = 40.5 bits (93), Expect = 0.62, Method: Composition-based stats. Identities = 34/246 (13%), Positives = 73/246 (29%), Gaps = 41/246 (16%) Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 +D G + + +Y + S + + F F I+ L + Sbjct: 342 ITVDDAIKGGATFFNMDKLRRKYPIKS-IFDNVLRCVFLDDSASFF----NIKALLACKT 396 Query: 311 CPDPYAPLIMG-CDIAEE----------------GGDNTVVV-----LRRGPVIEHLFDW 348 + + MG C A + G D+ +V L++G V L Sbjct: 397 DTSKWKTIDMGKCRPAGDLEVLVGYDPRGGGQADGSDDAGLVISLKPLKKGGVFRFLERI 456 Query: 349 S--KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406 + I G+ EKY + +D + G+ + + + V Sbjct: 457 RLKGSSYEEQAKAIEGITEKYHVVHLEMDTSGVGSAVAELVRKFYPSLKEVNYSPEV--- 513 Query: 407 EFCRNRRTELHVKMADWLEFASLINH---SGLIQNLKSLKSFIVPNTGELAIESKRVKGA 463 +R + K + + L ++ + ++ + ++ + S R K Sbjct: 514 -----KRM-MAYKAREIINAGRLQFDDSWDDVVHSFLMIRQHTTKASNQITMISTRTKRG 567 Query: 464 KSTDYS 469 D + Sbjct: 568 SHADLA 573 >gi|19114536|ref|NP_593624.1| ATP-dependent 3' to 5' DNA helicase (predicted) [Schizosaccharomyces pombe 972h-] gi|74698622|sp|Q9HE09|MFH2_SCHPO RecName: Full=Putative ATP-dependent RNA helicase mfh2; AltName: Full=FancM homolog protein 2 gi|12038920|emb|CAC19734.1| ATP-dependent 3' to 5' DNA helicase (predicted) [Schizosaccharomyces pombe] Length = 783 Score = 40.5 bits (93), Expect = 0.62, Method: Composition-based stats. Identities = 18/115 (15%), Positives = 36/115 (31%), Gaps = 13/115 (11%) Query: 87 GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSL 146 G+GKT + A ++L P +I LA ++ L + + M Sbjct: 134 GLGKTFIAAVVMLNYFRWFPESKIIFLAPTKPLL----------LQQRVACSNVAGMSPG 183 Query: 147 SLHPAPWYSDVLHCSLGIDSKH-YSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 + ++K + +T + + + +I DEA Sbjct: 184 ATAELNGEVSPDRRLFEYNTKRVFFMTPQTLQNDLKEHL--LDAKSIICLIFDEA 236 >gi|329936128|ref|ZP_08285927.1| helicase-like protein [Streptomyces griseoaurantiacus M045] gi|329304446|gb|EGG48325.1| helicase-like protein [Streptomyces griseoaurantiacus M045] Length = 1056 Score = 40.5 bits (93), Expect = 0.62, Method: Composition-based stats. Identities = 13/66 (19%), Positives = 21/66 (31%), Gaps = 3/66 (4%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114 Q N+ + + +G I + G GKT A + PG V+ Sbjct: 18 QKARFREWAGSPFNTRSPVPEQGSRGTIVSATGSGKTITAAACA---LECFPGARVLVTV 74 Query: 115 NSETQL 120 + L Sbjct: 75 PTLDLL 80 >gi|147668985|ref|YP_001213803.1| hypothetical protein DehaBAV1_0339 [Dehalococcoides sp. BAV1] gi|146269933|gb|ABQ16925.1| hypothetical protein DehaBAV1_0339 [Dehalococcoides sp. BAV1] Length = 457 Score = 40.5 bits (93), Expect = 0.63, Method: Composition-based stats. Identities = 39/251 (15%), Positives = 69/251 (27%), Gaps = 53/251 (21%) Query: 249 KRFQIDTRTVEGIDPSFHE---GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305 + F+ D V +P++ R G + + + P ++ Sbjct: 185 RHFRYDWEAVAAHNPAYLAYALSEKERLGENHPLFLTQYR-LLPVSGGGGMFSNEQLDLL 243 Query: 306 LNREPC---PDPYAPLIMGCDIAEE-----GGDNTVVVLRRGPVIEHLFD---------- 347 PC P+ + G D+A E G T V LRR + + + Sbjct: 244 KGNHPCQIYPEKGKVYVAGLDLAGEDSQTGGISPTTVNLRRDSSVLTIAELDYTFAKAPY 303 Query: 348 ------------WSKTDLRTTNNKISGLVEK-YRPDAIIIDANNTGARTCDYLEM-LGYH 393 W T K+ L+ K ++ + +DA G +L LG Sbjct: 304 NLPQVRLVCHYSWQGTRHALLYEKLVELLGKVWKCRKVAVDATGLGQPVASFLRESLGSR 363 Query: 394 VYRVLGQKRAVDLEFCRNR-------RTELHVKMAD------WLEFASLINHSGLIQNLK 440 + Q A N R +++ W E Q + Sbjct: 364 ILPFAFQ-PASKSRLGFNLLSAVNSGRLKMYAANGSSEYTLFWQEMGLARADYRQSQQMN 422 Query: 441 SLKSFIVPNTG 451 ++ G Sbjct: 423 ---FYVETTRG 430 >gi|325849110|ref|ZP_08170602.1| phage terminase, large subunit, PBSX family [Anaerococcus hydrogenalis ACS-025-V-Sch4] gi|325480355|gb|EGC83418.1| phage terminase, large subunit, PBSX family [Anaerococcus hydrogenalis ACS-025-V-Sch4] Length = 439 Score = 40.5 bits (93), Expect = 0.64, Method: Composition-based stats. Identities = 19/173 (10%), Positives = 45/173 (26%), Gaps = 14/173 (8%) Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK----------PLDDWKRFQID 254 D I + FW + NP + Y + Sbjct: 149 DSIKEAFNRTAAAKRRKFFWDL--NPSSPNHFIYSDHIDKYQNMIDEGIDFGGYNYKHFT 206 Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP 314 I + I +Y +S + ++ G + + I + + P Sbjct: 207 IDDNINISDQRKKEIKLQYDPNSVWYKRDILGLRVVAEGLIYKQFADIPDNYLIKEKPHE 266 Query: 315 YAPLIMGCDIAEEGGDNTVVV--LRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365 + +G D + + + RG + + + + + L++ Sbjct: 267 LQLIQIGVDFGGNNSKHAFICTGISRGFKKVYALRSERLEPDKPTDLYNQLID 319 >gi|193671687|ref|XP_001946103.1| PREDICTED: SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily A containing DEAD/H box 1-like [Acyrthosiphon pisum] Length = 848 Score = 40.5 bits (93), Expect = 0.67, Method: Composition-based stats. Identities = 23/153 (15%), Positives = 48/153 (31%), Gaps = 15/153 (9%) Query: 87 GIGKTTLNAWLVLWLMS----TRPGISVICLANSETQLKTTLWAEVSKW-LSLLPNKHWF 141 G+GKT + L + T P + + + + T + E +W +++ K+ Sbjct: 337 GLGKT-VQVIAFLAHLKETGRTHPDLPQLIVVPAST--LDNWYQEFKRWCPTMIVEKYHG 393 Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201 M W G + + P+ I+ DEA Sbjct: 394 SMDERRYMRTKW------IRKGFGDVDVILTTYSCAANSPEEKKLFKTKEFHYIVYDEAH 447 Query: 202 GTPDVINLGILGFLTERNANRFWIMTSNPRRLS 234 ++ + + N N ++T P + + Sbjct: 448 KLKNMTSQTFE-VFSNFNGNYKILLTGTPLQNN 479 >gi|22855048|ref|NP_690654.1| hypothetical protein SPP1p003 [Bacillus phage SPP1] gi|1729903|sp|P54308|TERL_BPSPP RecName: Full=Large terminase protein; AltName: Full=DNA-packaging protein G2P; AltName: Full=Terminase large subunit gi|15466|emb|CAA39537.1| terminase [Bacillus phage SPP1] gi|2764840|emb|CAA66573.1| unnamed protein product [Bacillus phage SPP1] Length = 422 Score = 40.5 bits (93), Expect = 0.69, Method: Composition-based stats. Identities = 68/418 (16%), Positives = 136/418 (32%), Gaps = 48/418 (11%) Query: 75 PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC--LANSETQLKTTLWAEVSKWL 132 + K + GRG K+T A ++ LM P ++ + N+ Q +++ ++ + + Sbjct: 24 AQHLKYVLKGGRGSAKSTHIAMWIILLMMMMPITFLVIRRVYNTVEQ---SVFEQLKEAI 80 Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDT-FVGHHNTY 191 +L H + + +P + I + + + S + G Sbjct: 81 DMLEVGHLW-----KVSKSPLRLTYIPRGNSIIFRGGDDVQKIKSIKASKFPVAGMWIEE 135 Query: 192 GMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN-PRRLSGKFYEIFNKPLDDWKR 250 +E I +L + + N P+R ++FN Sbjct: 136 LAEFKTEEEVSV---IEKSVLRAELPPGCRYIFFYSYNPPKRKQSWVNKVFNSSFLPANT 192 Query: 251 FQIDT--RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308 F + + +F E + R E G+ + F L IEE + Sbjct: 193 FVDHSTYLQNPFLSKAFIEEAEEVKRRNELKYRHEYLGEALGSGVVPFENLQ-IEEGIIT 251 Query: 309 EPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLV 364 + + + G D G D V +R I + + D + + + + V Sbjct: 252 DAEVARFDNIRQGLDFG-YGPDPLAFVRWHYDKRKNRIYAIDEL--VDHKVSLKRTADFV 308 Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL 424 K + ++ I A+++ R+ D L+ L + + R+ G K+ D R WL Sbjct: 309 RKNKYESARIIADSSEPRSIDALK-LEHGINRIEGAKKGPDSVEHGER----------WL 357 Query: 425 EFASLINHSGL-----IQNLKSLKSFIVPNTGEL-AIESKRVKGAKSTDYSDGLMYTF 476 + I L + +++ N + +E K D Y F Sbjct: 358 DELDAIVIDPLRTPNIAREFENIDYQTDKNGDPIPRLEDKDNHTI------DATRYAF 409 >gi|281416465|ref|YP_003347385.1| terminase large subunit [Enterococcus phage phiFL4A] gi|270209641|gb|ACZ64180.1| terminase large subunit [Enterococcus phage phiFL4A] Length = 418 Score = 40.5 bits (93), Expect = 0.70, Method: Composition-based stats. Identities = 46/323 (14%), Positives = 95/323 (29%), Gaps = 35/323 (10%) Query: 86 RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145 RG KTT A + LM P ++I L ++T + E+ ++ + + +F+ Sbjct: 52 RGSFKTTTLAIAIALLMVLFPNKNIIFLRKTDT---DVV--EIILQVAKVLSSKYFKTLV 106 Query: 146 LSLH--PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 +L+ + + + G H +I D+ Sbjct: 107 FALYGVELVLLKETTTEIDTNLKTSSRGTSQLLGMGIYASLTGKHAD---IVITDDIVNI 163 Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI---FNKPLDDWKRF---QIDTRT 257 D ++ + N + G+F ++K K + D Sbjct: 164 KDRVSRA-----EREKTKLQYQELQNVKNRGGRFINTGTPWHKEDAISKMPNVKKFDCYE 218 Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317 ID + + + + + + F + N + Sbjct: 219 TGLIDKEQRKAL--QQAMTPSLFAANYELKHIADSESLFTAPTYTD-NTNLIYNGVAH-- 273 Query: 318 LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII 373 D A G D+T + + G +I + W K + +I L + Y+ Sbjct: 274 ----IDAAYGGDDSTAFTIFKEQKDGTLIGYGKKWQKH-VDDCIPEILQLHQHYQAGTFY 328 Query: 374 IDANNTGARTCDYLEMLGYHVYR 396 + N +L G +V + Sbjct: 329 NETNGDKGYLAKHLIERGQYVQK 351 >gi|331229057|ref|XP_003327195.1| DNA repair protein rad16 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] gi|309306185|gb|EFP82776.1| DNA repair protein rad16 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] Length = 968 Score = 40.5 bits (93), Expect = 0.72, Method: Composition-based stats. Identities = 28/152 (18%), Positives = 46/152 (30%), Gaps = 11/152 (7%) Query: 87 GIGKTTLNAWLVLWLMSTRPGIS--VICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEM 143 G+GKT L+L PG + +A + + W E+ K+ L W Sbjct: 398 GMGKTIQTIALIL--SDRVPGHRKQTLVIAPT---VAIMQWRNEIEKFAKGLTVNVWHGG 452 Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVG--HHNTYGMAIINDEAS 201 + DV+ S + + + + H +I DEA Sbjct: 453 NRSNAQEEMENFDVVLTSFAVLESAFRRQNSGFRRKGQIIKESSLLHQINWHRVILDEAH 512 Query: 202 GTPDVINLGILGFLTERNANRFWIMTSNPRRL 233 D G E A W ++ P + Sbjct: 513 NIKDRSCNTAKGAF-ELKATYRWCLSGTPLQN 543 >gi|289976633|gb|ADD21678.1| DNA maturase B [Caulobacter phage Cd1] Length = 602 Score = 40.5 bits (93), Expect = 0.72, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 62/219 (28%), Gaps = 29/219 (13%) Query: 35 FFPWGEKGTPLEGFSAPRSWQLEFMEVVDA------HCLNSVNNPNPEVFKGAISAGRGI 88 W + G + + + + E + H + P + A RG Sbjct: 11 LLRWEQVGLLQRHYESFHDFLDDAFEHLGFSASWVQHDIGGFLAHGPNSLM--VQAQRGQ 68 Query: 89 GKTTLNAWLVLWLMSTRPGISVICL------ANSETQLKTTLWAEVSKWLSLLPNKHWFE 142 KTT+ A +W + P V+ L AN + L + K L + Sbjct: 69 AKTTITAAFAVWTLIHNPKARVLILSAGGTQANEISTL-------IVKLLLTMDELECLR 121 Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASG 202 + + + +H +L K S C + G +A + A Sbjct: 122 PDASNGDRTSVEAFDIHYTLKGVDKSPSVACSGITG----NLQGKRADLLIADDIESAKN 177 Query: 203 TPDVINLGILGFL----TERNANRFWIMTSNPRRLSGKF 237 + + + L T I P+ + + Sbjct: 178 SATAMMREFIMNLTRDFTSICTEGRIIYLGTPQSMDSIY 216 >gi|330791351|ref|XP_003283757.1| hypothetical protein DICPUDRAFT_147464 [Dictyostelium purpureum] gi|325086380|gb|EGC39771.1| hypothetical protein DICPUDRAFT_147464 [Dictyostelium purpureum] Length = 1580 Score = 40.5 bits (93), Expect = 0.75, Method: Composition-based stats. Identities = 29/154 (18%), Positives = 44/154 (28%), Gaps = 16/154 (10%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTR-----PGISVIC--LANSETQLK---TTLWAEVSKW 131 + G GKT A +VL M + P I N+ L +L E S Sbjct: 1094 VRGPPGTGKTHFLALIVLIFMESYKRLGKPFRIAITSFTHNAIDNLLIRIASLKKEYSTS 1153 Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191 + N F+ Q+ L + +H+ ++S D Sbjct: 1154 VGQDINFPLFKKQTKLSEDLKLNKIQLFDKKEFEREHFCVGATSWSLSNMD------YEN 1207 Query: 192 GMAIINDEASGTPDVINLGILGFLTERNANRFWI 225 +I DEAS I L + Sbjct: 1208 FDLLIIDEASQLSSYIGAIPFSRLNKDTGRVIVC 1241 >gi|153955889|ref|YP_001396654.1| hypothetical protein CKL_3280 [Clostridium kluyveri DSM 555] gi|219856242|ref|YP_002473364.1| hypothetical protein CKR_2899 [Clostridium kluyveri NBRC 12016] gi|146348747|gb|EDK35283.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555] gi|219569966|dbj|BAH07950.1| hypothetical protein [Clostridium kluyveri NBRC 12016] Length = 450 Score = 40.5 bits (93), Expect = 0.75, Method: Composition-based stats. Identities = 44/284 (15%), Positives = 94/284 (33%), Gaps = 36/284 (12%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 + +E S +LG L + + I+++NP Y+ F K + K F + Sbjct: 119 IVWIEECSEVKYEGFKELLGRLRHPSLSLHMILSTNPVSKDNWTYKHFFK-NEKKKTFIL 177 Query: 254 D---------------------TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD 292 D + S+ E + D D+ R+ G+F + Sbjct: 178 DDEELYKKRIVVRNNTYYHHSLADDNLFLPKSYIEQLEELKTYDIDLYRIARKGRF-GIN 236 Query: 293 IDSFIPLNIIEEALN-REPCPDPYAPL-IMGCDIAEEGGDNTVVVLRRGPVIEHL-FDWS 349 +P + + P+ +G D E N ++ L + L W Sbjct: 237 GRRVLPQFEARPHYEVLQAIGNIKNPIKRVGFDFGFEDSYNALLRLAVDDKEKILYIYWE 296 Query: 350 KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409 + T+++ + + +++ +I A+ +T Y G+++ R + Sbjct: 297 YYKNQMTDDRTAIEIAEFKSTQELIRADGAEPKTIKYFNQQGFNIRRAKKFPGSRLQNTK 356 Query: 410 RNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGEL 453 + +R + IN +++L + V GE+ Sbjct: 357 KVKR------FKKIICSEDCINTVDELKDLT----YAVDKNGEI 390 >gi|195498547|ref|XP_002096570.1| GE25739 [Drosophila yakuba] gi|194182671|gb|EDW96282.1| GE25739 [Drosophila yakuba] Length = 1495 Score = 40.1 bits (92), Expect = 0.80, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 62/200 (31%), Gaps = 24/200 (12%) Query: 16 FDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPR--SWQLEFMEVVDAH-CLNSVNN 72 D+ W D+ + + +H E+ +G P + +V H + N Sbjct: 1 MDVNWIDDDEDLVAALAMHEEQKTEEADGADGHPRPELSDESCDGFDVATGHNWIYPNNL 60 Query: 73 PNPEVFKGAISAG----------RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKT 122 P + + + G+GKT + A L+ P ++ +A + + Sbjct: 61 PLRSYQQTIVQSALFKNTLVVLPTGLGKTFIAAVLMFNFYRWYPKGKIVFMAPTRPLVSQ 120 Query: 123 TLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE--R 180 ++ ++P +Q P P +++ + + + + Sbjct: 121 ----QIHASQKIMPFPSADTVQLTGQLPRPKRAELWGSK-----RVFFATPQVVHSDMLE 171 Query: 181 PDTFVGHHNTYGMAIINDEA 200 D I+ DEA Sbjct: 172 TDGGSTFPFESIKLIVVDEA 191 >gi|310792137|gb|EFQ27664.1| Sec63 Brl domain-containing protein [Glomerella graminicola M1.001] Length = 1974 Score = 40.1 bits (92), Expect = 0.82, Method: Composition-based stats. Identities = 24/118 (20%), Positives = 41/118 (34%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W RPG V+ +A L E V W + L + Sbjct: 1153 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAP-----MKALVRERVKDWGARLARPLGLK 1207 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + I + + + R++ G+ + II DE Sbjct: 1208 LVELTGDNTPDTRTIKDADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1258 >gi|226288385|gb|EEH43897.1| activating signal cointegrator 1 complex subunit 3 [Paracoccidioides brasiliensis Pb18] Length = 2011 Score = 40.1 bits (92), Expect = 0.92, Method: Composition-based stats. Identities = 24/118 (20%), Positives = 40/118 (33%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W RPG V+ +A L E V W L + Sbjct: 1163 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAP-----MKALVRERVHDWKRRLTVPMGLK 1217 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + I + + + R++ G+ + II DE Sbjct: 1218 LVELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1268 >gi|295672069|ref|XP_002796581.1| activating signal cointegrator 1 complex subunit 3 [Paracoccidioides brasiliensis Pb01] gi|226283561|gb|EEH39127.1| activating signal cointegrator 1 complex subunit 3 [Paracoccidioides brasiliensis Pb01] Length = 2012 Score = 40.1 bits (92), Expect = 0.92, Method: Composition-based stats. Identities = 24/118 (20%), Positives = 40/118 (33%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W RPG V+ +A L E V W L + Sbjct: 1163 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAP-----MKALVRERVHDWKRRLTVPMGLK 1217 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + I + + + R++ G+ + II DE Sbjct: 1218 LVELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1268 >gi|213402789|ref|XP_002172167.1| antiviral helicase SLH1 [Schizosaccharomyces japonicus yFS275] gi|212000214|gb|EEB05874.1| antiviral helicase SLH1 [Schizosaccharomyces japonicus yFS275] Length = 1949 Score = 40.1 bits (92), Expect = 0.92, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 48/155 (30%), Gaps = 22/155 (14%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGA--------ISAGRGIGKTTLNAWLVLWLMSTRP 106 Q +E + A + N + F I A G GKT W P Sbjct: 1125 QNPVLEEICAKRFSFFNAVQSQFFHTVYHTPTNVFIGAPTGSGKTMAAELATWWAFREHP 1184 Query: 107 GISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI- 164 G V+ +A L E + W + L M L+ +P ++ + I Sbjct: 1185 GSKVVYIAP-----MKALVKERLKDWGARLVEPMHINMIELTGDTSPDSKTIMGADIIIT 1239 Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + + + R + + + +I DE Sbjct: 1240 TPEKWDGITRNWRTRK-------YVQNVSLVIIDE 1267 >gi|209544598|ref|YP_002276827.1| hypothetical protein Gdia_2467 [Gluconacetobacter diazotrophicus PAl 5] gi|209532275|gb|ACI52212.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 491 Score = 40.1 bits (92), Expect = 0.96, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 67/199 (33%), Gaps = 26/199 (13%) Query: 87 GIGKTTLNAW-LVLWLMSTRPGISVI------CLANSETQLKTTLWAEVSKWLSLLPNKH 139 G GK++ W +VL + PG + + NS QL+ T V +W + Sbjct: 32 GSGKSSGCVWEMVLRGLKQAPGPDGVRRSRWAVIRNSYRQLEDTTIRTVHQWFPPMQFGR 91 Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 W P+ + + D K + +RPD + +E Sbjct: 92 W--------KPSEHSYTINRLAAQGDEKPAEIELLFRALDRPDQVGNLLSLELTGAWINE 143 Query: 200 ASGTPDVINLGILGFL----TERNANRFW---IMTSNPRRLSGKFYEIF----NKPLDDW 248 A P + + G + +R+ W IM +NP ++Y+ F + + Sbjct: 144 AREVPWAVIEAVQGRVGRYPAKRDGGATWSGIIMDTNPPDAESEWYKFFEEKDHTDAVEA 203 Query: 249 KRFQIDTRTVEGIDPSFHE 267 I TVE F + Sbjct: 204 IAQVIPGMTVERYARIFKQ 222 >gi|329945026|ref|ZP_08292976.1| hypothetical protein HMPREF9056_00859 [Actinomyces sp. oral taxon 170 str. F0386] gi|328529487|gb|EGF56391.1| hypothetical protein HMPREF9056_00859 [Actinomyces sp. oral taxon 170 str. F0386] Length = 370 Score = 40.1 bits (92), Expect = 0.98, Method: Composition-based stats. Identities = 32/175 (18%), Positives = 52/175 (29%), Gaps = 12/175 (6%) Query: 60 EVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQ 119 V++ + + P I+ RG+GKT L S R V+ + Sbjct: 21 RVIEEFLESLDDGPGAPGLLELITGARGVGKTV---MLTALGDSARERGWVVVDETAREG 77 Query: 120 LKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE 179 L L E ++ LS L K + SLSL + R ++ Sbjct: 78 LMDRLATEFTRQLSQLAGKERSRLTSLSLSTPLGGGSATLEHAPAPEPSWRQKARALTQW 137 Query: 180 RPDTFVGHHNTYGMAIINDEASGTPDVINLGILGF---LTERNANRFWIMTSNPR 231 + G + + DE P + L A +M P+ Sbjct: 138 LAEHGTG------LLLTIDEVHAIPREELRALSAEVQHLIREGAPIGLLMAGLPK 186 >gi|302422104|ref|XP_003008882.1| DEAD/DEAH box helicase [Verticillium albo-atrum VaMs.102] gi|261352028|gb|EEY14456.1| DEAD/DEAH box helicase [Verticillium albo-atrum VaMs.102] Length = 1801 Score = 39.7 bits (91), Expect = 1.0, Method: Composition-based stats. Identities = 57/353 (16%), Positives = 108/353 (30%), Gaps = 53/353 (15%) Query: 21 SDEIKLSFSNFVL-HFFPWGEKGTPLEGFSA----PRSWQLEFMEVVDAHCLNSVNNPNP 75 I L+ + F L H P+ E+ + P +WQ + ++ +DA+ V P Sbjct: 719 DLAIPLNLTEFQLEHSGPYMERNFDSKPDDRVTFDPDAWQRKVLDTIDANNSLMVVAPTS 778 Query: 76 EVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLL 135 GKT ++ + + ++ ++ +A ++ + AEV+ S Sbjct: 779 ------------AGKTFISFYAMKKILQANDDDVLVYVAPTKALVNQIA-AEVAARYSKS 825 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS-EERPDTFVGHHNTYGMA 194 + + ++ + L TM S +RP + Sbjct: 826 YTREGKSVWAIHTRDYRVNNPTGCQVLVTVPHVLQTMLLAPSNSDRPSAWA----RRVKR 881 Query: 195 IINDEASGTPDV----INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250 II DE + +L A I S +G+F++ W Sbjct: 882 IIFDEVHCIGQAEDGIVWEQLLLL-----APCPIIALSATVGNAGEFHD--------W-- 926 Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF-----IPLNIIEEA 305 V + ++ SD+ R V Q +D +P+ I++ Sbjct: 927 -----LAVSQAQKGYKMELVVHNARYSDL-RKFVYCPPKQLKMDVLAKQDQLPIPGIDQG 980 Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNN 358 R P P+ DI D+ + R + D +T Sbjct: 981 EERNPRFFFTHPIAALLDINRGSLDDVSLEPRDCWTLWKCMDKHQTTDFPVAK 1033 >gi|85702762|ref|ZP_01033866.1| Putative large terminase [Roseovarius sp. 217] gi|85671690|gb|EAQ26547.1| Putative large terminase [Roseovarius sp. 217] Length = 419 Score = 39.7 bits (91), Expect = 1.0, Method: Composition-based stats. Identities = 43/270 (15%), Positives = 82/270 (30%), Gaps = 43/270 (15%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGIS---------VICLANSETQLKTT-LWAEVSKW 131 I GRG GKT A W+ + G + + + Q++ ++ E S Sbjct: 27 IMGGRGAGKTRAGA---EWVRAQVEGSRPLDEGRCKRIALVGETIDQVREVMVFGE-SGI 82 Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191 ++ P + Q+ + + YS P+ G Sbjct: 83 MACSPPDRRPDWQATR---------------KRLIWPNGAVAQAYSAHDPEALRGPQFDG 127 Query: 192 GMAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWK 249 DE + + L +A R + + R G +I P Sbjct: 128 A---WVDELAKWKRARETWDMLQFGLRLGDAPR--VCVTTTPRNVGVLKDIVAVPSTV-V 181 Query: 250 RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309 + SF + + ARY + + R E+ G + D+ ++E A R Sbjct: 182 TSAPTEANRAYLAESFLDEVRARY-AGTRLGRQELDGLLIDEAEDALWTPAMLEAA--RV 238 Query: 310 PCPDPYAPLIMGCDI---AEEGGDNTVVVL 336 + +++ D G D +++ Sbjct: 239 ESLPEFDRVVVAVDPPVTGHAGSDECGIIM 268 >gi|83310928|ref|YP_421192.1| protein-tyrosine-phosphatase [Magnetospirillum magneticum AMB-1] gi|82945769|dbj|BAE50633.1| Protein-tyrosine-phosphatase [Magnetospirillum magneticum AMB-1] Length = 152 Score = 39.7 bits (91), Expect = 1.1, Method: Composition-based stats. Identities = 32/155 (20%), Positives = 56/155 (36%), Gaps = 16/155 (10%) Query: 215 LTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYG 274 +TE N + T N R I + W+ + +R V ++P E ++A G Sbjct: 1 MTESTINVLVLCTGNSARSVLGEALINHLGGAKWRAYSAGSRPVGRVNPLSLE-VLAEKG 59 Query: 275 LDSDVTRVEVCGQFPQQDI-DSFIPLNIIEEALNREPCPDPYAP--LIMGC-DIAEEGGD 330 L + R + +F D + + + + A P P L MG D A+ Sbjct: 60 LPTAGYRSKSWDEFAAADAPRMDLVITVCDNAAGEVCPVWPGHPSKLHMGFPDPADA--- 116 Query: 331 NTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365 +G E L ++ K K+ L++ Sbjct: 117 -------KGSHEEQLAEFRKVYAMIEA-KVRRLIQ 143 >gi|288554856|ref|YP_003426791.1| Tex transcription access, protein (S1 RNA binding) [Bacillus pseudofirmus OF4] gi|288546016|gb|ADC49899.1| Tex transcription access, protein (S1 RNA binding) [Bacillus pseudofirmus OF4] Length = 726 Score = 39.7 bits (91), Expect = 1.1, Method: Composition-based stats. Identities = 24/144 (16%), Positives = 52/144 (36%), Gaps = 12/144 (8%) Query: 246 DDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305 + D+ E + + +G ++ + R E+ + + + I + E Sbjct: 255 KRYMSRAGDSPAAEYVKLAIQDGYKRL--IEPSIER-EIRNELTAKAEEQAIHIF-SENL 310 Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW---SKTDLRTTNNKISG 362 N P +++G D A G VV G V++ + + ++ K+ Sbjct: 311 RNLLLQPPIKDKVVLGVDPAYRTGCKLAVVDGTGKVLDIGVVYPTPPRNEVEKAAAKVKQ 370 Query: 363 LVEKYRPDAIIIDANNTGARTCDY 386 LV++++ + I I G T Sbjct: 371 LVKEHKVEMIAI-----GNGTASR 389 >gi|289167314|ref|YP_003445583.1| terminase large subunit [Streptococcus mitis B6] gi|288906881|emb|CBJ21715.1| terminase large subunit [Streptococcus mitis B6] Length = 418 Score = 39.7 bits (91), Expect = 1.2, Method: Composition-based stats. Identities = 45/353 (12%), Positives = 104/353 (29%), Gaps = 42/353 (11%) Query: 74 NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133 +P++ GRG GK++ ++ + R ++ +C+ ++ L+ +++ ++ +S Sbjct: 22 DPKILHVVEKGGRGSGKSSDLGHTII-QLIMRYPVNAVCIRKTDNTLEQSVYEQLKWAIS 80 Query: 134 LLPNKHWFEMQ--------SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 H F++ + + + + + Sbjct: 81 EQGVSHLFKINKSPLKITYIPRGNYIIFRGAQDPERIKSLKDSRFPFAIGW----IEELA 136 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF----YEIF 241 DE + + G L + + + NP + + YE Sbjct: 137 EFKTE-------DEVKTITNSLLRGEL----DDGLFYKFFYSYNPPKRKQSWVNKKYESV 185 Query: 242 NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNI 301 +P + I +F E A R E G+ + P Sbjct: 186 IQP-PNTHVHHSTYLDNPYISQAFIEEAEATRERSEKRYRWEYLGEAIGSGVA---PFEN 241 Query: 302 IEEALNREPCPDPYAPLIMGCDIAEEGGDNTVV---VLRRGPVIEHLFDWSKT--DLRTT 356 + + + + G D V ++ VI + + R Sbjct: 242 LVFRKITDEEIARFDNIRQGNDFGYANDPLAFVRWHYDKKKRVIYAIDEIYGVKISNREL 301 Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409 +I + Y+ I D+ ++ D L++ ++ V G K+ D Sbjct: 302 AERIRE--KGYQSQMITCDS--AEPKSIDELKLQ-LNIPLVQGAKKGPDSREY 349 >gi|269986940|gb|EEZ93216.1| type III restriction protein res subunit [Candidatus Parvarchaeum acidiphilum ARMAN-4] Length = 508 Score = 39.7 bits (91), Expect = 1.2, Method: Composition-based stats. Identities = 12/60 (20%), Positives = 25/60 (41%) Query: 58 FMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE 117 F E ++ + + + G+GKT ++A L + + P V+ LA ++ Sbjct: 4 FKEEIENREYQTKIFETAKTGNTLVVLPTGLGKTIISAMLANYRLEKYPSSKVLFLAPTK 63 >gi|302412431|ref|XP_003004048.1| ATP-dependent DNA helicase MER3 [Verticillium albo-atrum VaMs.102] gi|261356624|gb|EEY19052.1| ATP-dependent DNA helicase MER3 [Verticillium albo-atrum VaMs.102] Length = 709 Score = 39.7 bits (91), Expect = 1.3, Method: Composition-based stats. Identities = 24/118 (20%), Positives = 41/118 (34%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W RPG V+ +A L E V W + L + Sbjct: 279 SPTGSGKTVAAELAMWWAFKERPGSKVVYIAP-----MKALVRERVKDWGARLAKPLGLK 333 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + I + + + R++ G+ + II DE Sbjct: 334 LVELTGDNTPDTRTIKDADVIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 384 >gi|188580687|ref|YP_001924132.1| hypothetical protein Mpop_1430 [Methylobacterium populi BJ001] gi|179344185|gb|ACB79597.1| protein of unknown function DUF264 [Methylobacterium populi BJ001] Length = 421 Score = 39.7 bits (91), Expect = 1.3, Method: Composition-based stats. Identities = 61/357 (17%), Positives = 108/357 (30%), Gaps = 52/357 (14%) Query: 56 LEFMEVVDAHCLNSVNNPNPEVFKG-AISAGRGIGKTTLNA-WLVLWLMSTRPGISVICL 113 L +E H P P + A+ GRG GKT A W+ + Sbjct: 9 LRLLEADWLHRARHDQLPPPGDWTTWAVIGGRGSGKTRTGAEWV---RGLAHGDP--VFT 63 Query: 114 ANSETQLKTT--LWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYST 171 A + ++ +A+V + P+ + L P W G Sbjct: 64 AEAVGRIALVGETFADVRDVMIEGPSG-LLALPRLGGPPPVWQPSRRRVMFGN-----GA 117 Query: 172 MCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTS-NP 230 + +S E PD+ G A +DE + + +F + +P Sbjct: 118 VALAFSAEEPDSLRG---PQFGAAWSDEVAK-----WREAEAA---YDMIQFGLRLGAHP 166 Query: 231 RRLSGKFYEIFNKPLDDWKRFQIDTRTV----------EGIDPSFHEGIIARYGLDSDVT 280 R L +P+ +R D RTV + + P F E ++ RY + + Sbjct: 167 RGLVT----TTPRPVPLIRRLLADPRTVVTRSRTADNAQNLAPRFLEEVVGRY-AGTRIG 221 Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDI---AEEGGDNT----- 332 R E+ G+ + D+ + IE R P + + D + G D Sbjct: 222 RQELDGELIEDRPDALWTRDGIER--TRIHAAPPLQRIAVAVDPPASSRAGADACGIVAA 279 Query: 333 VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389 + + + L + + D ++ + N G L Sbjct: 280 GIAADGTAYVLADATLERAAPAAWAQGALALYHRLKADVLVAEVNQGGEMVVAVLAE 336 >gi|15618661|ref|NP_224947.1| exodeoxyribonuclease V, Alpha [Chlamydophila pneumoniae CWL029] gi|4377059|gb|AAD18890.1| Exodeoxyribonuclease V, Alpha [Chlamydophila pneumoniae CWL029] Length = 493 Score = 39.7 bits (91), Expect = 1.3, Method: Composition-based stats. Identities = 41/212 (19%), Positives = 71/212 (33%), Gaps = 28/212 (13%) Query: 62 VDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE---T 118 + + N + N + +S G G GKT L A L+L L+ +P + + ++ + + Sbjct: 132 ILSEEQNFIFNKITQGCFSIVSGGPGTGKTFLAAQLILSLVKQQPKLRIAIVSPTGKATS 191 Query: 119 QLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS 177 ++ L + + L+ H F + + L +D T YS Sbjct: 192 HIRQILMKYNIFDDMVLMQTVHHFLQEY------AYRRYNSIDVLLVDEGSMVTFDLLYS 245 Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP-RRLSGK 236 + G+ + T +I LG L I NP + L G Sbjct: 246 LVQT--LQGYEKDKKLY--------TSSLIILGDTNQL-----PPIGIGVGNPLQDLIGY 290 Query: 237 FYEI--FNKPLDDWKRFQIDTRTVEGIDPSFH 266 F+E F K K +D T + Sbjct: 291 FHENTFFLKTSHRAKTGVVDQLTQSVLRGEMI 322 >gi|15836285|ref|NP_300809.1| exodeoxyribonuclease V, alpha [Chlamydophila pneumoniae J138] gi|16752288|ref|NP_445657.1| exodeoxyribonuclease V, alpha subunit, putative [Chlamydophila pneumoniae AR39] gi|33242111|ref|NP_877052.1| exonuclease V alpha-subunit [Chlamydophila pneumoniae TW-183] gi|7190033|gb|AAF38887.1| exodeoxyribonuclease V, alpha subunit, putative [Chlamydophila pneumoniae AR39] gi|8979125|dbj|BAA98960.1| exodeoxyribonuclease V, alpha [Chlamydophila pneumoniae J138] gi|33236621|gb|AAP98709.1| exonuclease V alpha-subunit [Chlamydophila pneumoniae TW-183] Length = 493 Score = 39.7 bits (91), Expect = 1.3, Method: Composition-based stats. Identities = 41/212 (19%), Positives = 71/212 (33%), Gaps = 28/212 (13%) Query: 62 VDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE---T 118 + + N + N + +S G G GKT L A L+L L+ +P + + ++ + + Sbjct: 132 ILSEEQNFIFNKITQGCFSIVSGGPGTGKTFLAAQLILSLVKQQPKLRIAIVSPTGKATS 191 Query: 119 QLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS 177 ++ L + + L+ H F + + L +D T YS Sbjct: 192 HIRQILMKYNIFDDMVLMQTVHHFLQEY------AYRRYNSIDVLLVDEGSMVTFDLLYS 245 Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP-RRLSGK 236 + G+ + T +I LG L I NP + L G Sbjct: 246 LVQT--LQGYEKDKKLY--------TSSLIILGDTNQL-----PPIGIGVGNPLQDLIGY 290 Query: 237 FYEI--FNKPLDDWKRFQIDTRTVEGIDPSFH 266 F+E F K K +D T + Sbjct: 291 FHENTFFLKTSHRAKTGVVDQLTQSVLRGEMI 322 >gi|226945807|ref|YP_002800880.1| phage P2 terminase ATPase subunit, gpP-like protein [Azotobacter vinelandii DJ] gi|226720734|gb|ACO79905.1| Phage P2 terminase ATPase subunit, gpP-like protein [Azotobacter vinelandii DJ] Length = 585 Score = 39.7 bits (91), Expect = 1.3, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 50/162 (30%), Gaps = 22/162 (13%) Query: 246 DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPL----- 299 W++ I G D E + Y +++ + +F S PL Sbjct: 326 KIWRQIVTILDAERRGCDLFDLEELRFEY--NAEQFANLLMCEFVDDGA-SIFPLAMLQP 382 Query: 300 ----NIIEEALNREP---CPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHL--F 346 + +E A + +P P + +G D AE G +VV L G L Sbjct: 383 CQVDSWVEWAEDFKPFAARPFGDRQVWVGYDPAETGDSAGLVVVAPPLVPGGKFRVLERH 442 Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 + D I + +Y I +D G + Sbjct: 443 QFRGMDFAAQAEFIRQVTRRYWVTYIGLDTTGMGTGVAQLVR 484 >gi|213423446|ref|ZP_03356429.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Typhi str. E01-6750] Length = 72 Score = 39.7 bits (91), Expect = 1.3, Method: Composition-based stats. Identities = 11/48 (22%), Positives = 17/48 (35%) Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 I W D R + L E+Y I ID+ G + ++ Sbjct: 10 RILERHQWRGMDFRAQADANKKLTEQYNVTYIGIDSTGVGHGVYENVK 57 >gi|209964492|ref|YP_002297407.1| EAL domain proteni [Rhodospirillum centenum SW] gi|209957958|gb|ACI98594.1| EAL domain proteni [Rhodospirillum centenum SW] Length = 587 Score = 39.7 bits (91), Expect = 1.3, Method: Composition-based stats. Identities = 16/81 (19%), Positives = 28/81 (34%), Gaps = 2/81 (2%) Query: 304 EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL 363 A + A L + D EG V + + + + + S+ D ++ Sbjct: 82 AATFKRLVGTAAAKLFLNVDPRLEGAVPLVTAIGQRYGVPIVHEISELDTTAVGERLEAA 141 Query: 364 VEKYRP--DAIIIDANNTGAR 382 VE+YR I +D G Sbjct: 142 VEQYRRRDIGIALDDFGVGFG 162 >gi|213027809|ref|ZP_03342256.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica serovar Typhi str. 404ty] Length = 141 Score = 39.7 bits (91), Expect = 1.3, Method: Composition-based stats. Identities = 10/40 (25%), Positives = 16/40 (40%) Query: 349 SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 D R + I L E+Y I ID+ G + ++ Sbjct: 1 RGMDFRAQADAIKKLTEQYNVTYIGIDSTGVGHGVYENVK 40 >gi|54025903|ref|YP_120145.1| putative phage terminase [Nocardia farcinica IFM 10152] gi|54017411|dbj|BAD58781.1| putative phage terminase [Nocardia farcinica IFM 10152] Length = 436 Score = 39.7 bits (91), Expect = 1.3, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 49/171 (28%), Gaps = 9/171 (5%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG----KFYEIFNKPLDDWK 249 + DEA+ P+ + L+ A + T+NP F + + Sbjct: 125 LAMVDEATLLPENFWTQLGARLSVPGAK--LLATTNPDNPQHYLKVNFIDRAGERGMRLC 182 Query: 250 RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309 + G+D + + A + GQ+ D + + + + Sbjct: 183 AWDFTLDDNPGLDDEYVASLKAE--NQGLFYLRNILGQWVAADGAVYDCYDPAKHLVKWS 240 Query: 310 PCPDPYAPLIMGCDIAEEGGDNTV-VVLRRGPVIEHLFDWSKTDLRTTNNK 359 P+ + +G D V + L V+ + +W Sbjct: 241 ELPEMQFYVGVGVDHGTTNPTAAVLIGLGADNVLYAVDEWRYAPSNKEARW 291 >gi|227496997|ref|ZP_03927248.1| phage Terminase [Actinomyces urogenitalis DSM 15434] gi|226833491|gb|EEH65874.1| phage Terminase [Actinomyces urogenitalis DSM 15434] Length = 480 Score = 39.7 bits (91), Expect = 1.3, Method: Composition-based stats. Identities = 35/246 (14%), Positives = 69/246 (28%), Gaps = 25/246 (10%) Query: 194 AIINDEASGTPDVINLGILGF-LTERNANRFWIMTSNPRRLSGKFYEIFNKPL------- 245 I+ DEA D + AN I T P E+F + Sbjct: 169 IIVLDEAQDLTDEALEALRSTNAAGPQANPQIIYTGTPPSPKNDG-EVFTRFRSGALSGT 227 Query: 246 ------DDWKRFQ----IDTRTVEGIDPSFHEGIIARYGLD------SDVTRVEVCGQFP 289 +W D T+ +P++ + A+ D + E G + Sbjct: 228 TASTCWHEWSAAPDADLDDETTIAQANPAYQIRLSAKTVADEREDISEEGFARERLGMWD 287 Query: 290 QQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS 349 + + I + + L + G V R+ + Sbjct: 288 EVSTSAVIDQATWLRCADMASQVNDRLALAVDVQPDRTSGSVAVAGQRKDGRWHIEVIDN 347 Query: 350 KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409 + ++ +++G+ + R ++ID A + L+ G V + A Sbjct: 348 RNNVGWILQRVAGIWARQRIRTVVIDRRGPAASLIEPLQQKGIKVTTTDAAQMAASCGAF 407 Query: 410 RNRRTE 415 + E Sbjct: 408 YDAVME 413 >gi|125552219|gb|EAY97928.1| hypothetical protein OsI_19844 [Oryza sativa Indica Group] Length = 1367 Score = 39.7 bits (91), Expect = 1.3, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 31/92 (33%), Gaps = 15/92 (16%) Query: 55 QLEFMEVVDAHCLNSV-NNPNPEVFKGAISAG----R--GIGKTTLNAWLVLWLMSTRPG 107 Q E E + + + + N K + G G GKT L + M P Sbjct: 782 QREAFEFMWTNLVGDIRLNEIKHGAKPDVVGGCVICHAPGTGKTRLAIVFIQTYMKVFPD 841 Query: 108 ISVICLANSETQLKTTLWA---EVSKWLSLLP 136 + +A + L+A E KW +P Sbjct: 842 CRPVIIAP-----RGMLFAWEQEFKKWNVNVP 868 >gi|269302541|gb|ACZ32641.1| putative exodeoxyribonuclease V, alpha subunit [Chlamydophila pneumoniae LPCoLN] Length = 493 Score = 39.3 bits (90), Expect = 1.4, Method: Composition-based stats. Identities = 41/212 (19%), Positives = 71/212 (33%), Gaps = 28/212 (13%) Query: 62 VDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE---T 118 + + N + N + +S G G GKT L A L+L L+ +P + + ++ + + Sbjct: 132 ILSEEQNFIFNKITQGCFSIVSGGPGTGKTFLAAQLILSLVKQQPKLRIAIVSPTGKATS 191 Query: 119 QLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS 177 ++ L + + L+ H F + + L +D T YS Sbjct: 192 HIRQILMKYNIFDDMVLMQTVHHFLQEY------AYRRYNSIDVLLVDEGSMVTFDLLYS 245 Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP-RRLSGK 236 + G+ + T +I LG L I NP + L G Sbjct: 246 LVQT--LQGYEKDKKLY--------TSSLIILGDTNQL-----PPIGIGVGNPLQDLIGY 290 Query: 237 FYEI--FNKPLDDWKRFQIDTRTVEGIDPSFH 266 F+E F K K +D T + Sbjct: 291 FHENTFFLKTSHRAKTGAVDQLTQSVLRGEMI 322 >gi|222631484|gb|EEE63616.1| hypothetical protein OsJ_18433 [Oryza sativa Japonica Group] Length = 1364 Score = 39.3 bits (90), Expect = 1.4, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 31/92 (33%), Gaps = 15/92 (16%) Query: 55 QLEFMEVVDAHCLNSV-NNPNPEVFKGAISAG----R--GIGKTTLNAWLVLWLMSTRPG 107 Q E E + + + + N K + G G GKT L + M P Sbjct: 779 QREAFEFMWTNLVGDIRLNEIKHGAKPDVVGGCVICHAPGTGKTRLAIVFIQTYMKVFPD 838 Query: 108 ISVICLANSETQLKTTLWA---EVSKWLSLLP 136 + +A + L+A E KW +P Sbjct: 839 CRPVIIAP-----RGMLFAWEQEFKKWNVNVP 865 >gi|300922509|ref|ZP_07138621.1| phage terminase large subunit [Escherichia coli MS 182-1] gi|300421167|gb|EFK04478.1| phage terminase large subunit [Escherichia coli MS 182-1] Length = 240 Score = 39.3 bits (90), Expect = 1.4, Method: Composition-based stats. Identities = 29/243 (11%), Positives = 69/243 (28%), Gaps = 22/243 (9%) Query: 67 LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 +N + P E + + GRG GK+ W + ++ A ++ Sbjct: 4 INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 E+ +S + + + A + + + + + + Sbjct: 51 RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110 Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244 G +EA ++ + + + W+ NP+ + Y+ F P Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163 Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304 DD ++ + + + R G+ + I +E Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 305 ALN 307 A + Sbjct: 224 ATD 226 >gi|293401139|ref|ZP_06645283.1| SNF2 domain protein [Erysipelotrichaceae bacterium 5_2_54FAA] gi|291305265|gb|EFE46510.1| SNF2 domain protein [Erysipelotrichaceae bacterium 5_2_54FAA] Length = 447 Score = 39.3 bits (90), Expect = 1.4, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 54/160 (33%), Gaps = 30/160 (18%) Query: 50 APRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT--TLNAWLVLWLMSTRPG 107 +P ++Q ++ ++ H + +V G+GKT L A L S Sbjct: 4 SPHNYQSYAIDYIETHPVAAVLLDM------------GLGKTVIFLTAIADLLFDS-FEA 50 Query: 108 ISVICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166 ++ +A + W E+SKW L + + ++ A + + ++ Sbjct: 51 HRILVVAPLR--VARDTWPAEISKWQHLKHLTYAVAVGTVKERKAALSAGADITIINREN 108 Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDV 206 + G+ Y M II DE S + Sbjct: 109 LGWLIDS-----------SGYEFDYDMVII-DELSSFKNH 136 >gi|227499654|ref|ZP_03929757.1| PbsX family phage terminase, large subunit [Anaerococcus tetradius ATCC 35098] gi|227218251|gb|EEI83510.1| PbsX family phage terminase, large subunit [Anaerococcus tetradius ATCC 35098] Length = 439 Score = 39.3 bits (90), Expect = 1.4, Method: Composition-based stats. Identities = 22/184 (11%), Positives = 45/184 (24%), Gaps = 36/184 (19%) Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK----------PLDDWKRFQID 254 D I + FW + NP + Y + Sbjct: 149 DSIKEAFNRTAAAKRRKFFWDL--NPSSPNHFIYADHIDKYQNMIDEGIDFGGYNYKHFT 206 Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCG-----------QFPQQDIDSFIPLNIIE 303 I + I +Y +S + ++ G QF D I Sbjct: 207 IDDNINISDQRKKEIKLQYDPNSVWYKRDILGLRVVAEGLIYKQFADNPDDYLI------ 260 Query: 304 EALNREPCPDPYAPLIMGCDIAEEGGDNTVVV--LRRGPVIEHLFDWSKTDLRTTNNKIS 361 + P + +G D + + + RG + + + + + Sbjct: 261 -----KEKPHELQMIQIGVDFGGNNSKHAFICCGISRGFKKVYALRSERLEPDKPTDLYN 315 Query: 362 GLVE 365 L++ Sbjct: 316 QLID 319 >gi|170764163|ref|ZP_02633320.2| phage terminase, large subunit, pbsx family [Clostridium perfringens E str. JGS1987] gi|170661287|gb|EDT13970.1| phage terminase, large subunit, pbsx family [Clostridium perfringens E str. JGS1987] Length = 441 Score = 39.3 bits (90), Expect = 1.5, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 51/189 (26%), Gaps = 33/189 (17%) Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD----------WKRFQI 253 PD I + FW + NP + Y+ + + + Sbjct: 145 PDSIKEAFNRTIAAHKRKVFWDL--NPDNPNAFIYKDYIDNYKSKYENGELKGGYNYYHF 202 Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313 I E I ++Y +S + ++ G+ + +I P Sbjct: 203 TIDDNINISDERKEEIKSQYDKNSIWYQRDILGKR-------CVAEGLIYRRFANNPNSY 255 Query: 314 PYAP--------LIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365 +++G D G + + F + K + ++ ++ Sbjct: 256 RAEESDVSNLMKIVIGVDFGGTGSGHAFIA------SAITFGYKKVIILSSERHFGDDID 309 Query: 366 KYRPDAIII 374 + I I Sbjct: 310 SEKLGKIFI 318 >gi|284018161|sp|A3GH78|MPH1_PICST RecName: Full=ATP-dependent DNA helicase MPH1 Length = 1050 Score = 39.3 bits (90), Expect = 1.5, Method: Composition-based stats. Identities = 10/44 (22%), Positives = 23/44 (52%), Gaps = 4/44 (9%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE----TQLK 121 ++ G+GKT + + ++L + P +I +A ++ Q+K Sbjct: 106 VALPTGLGKTFIASTVMLNFLRWFPESKMIFVAPTKPLVAQQIK 149 >gi|121602586|ref|YP_988560.1| PBSX family phage terminase large subunit [Bartonella bacilliformis KC583] gi|120614763|gb|ABM45364.1| putative phage terminase, large subunit, PBSX family [Bartonella bacilliformis KC583] Length = 402 Score = 39.3 bits (90), Expect = 1.5, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 65/194 (33%), Gaps = 11/194 (5%) Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNAN--RFWIMTSNPRRLSGKFYEIFN-KPLDD 247 + DEA + ++ L E + +T NP R + + F + Sbjct: 83 RILLCWVDEAEPVTETAWQTLIPTLREEGQDWHSELWVTWNPLRENAPVEKRFRLTKDPN 142 Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF---IPLNIIEE 304 K +I+ R + +A + G + Q ++ + L+ +E Sbjct: 143 IKGVEINWRDNPQFPDKLNRDRLADLHQRPEQYGHIWEGDYLQAVQGAYYQKLLLDAEQE 202 Query: 305 ALNREPCPDPYAPLIMGCDIAEEG--GDNTVVVLRR--GPVIEHLFDWSKTDLRTTNNKI 360 DP + + DI G D T + + + G I L D+ + + + I Sbjct: 203 GRIAHVSRDPLIQIKIFWDIGGTGAKADATALWVAQFIGREIRIL-DYYEAQGQPLSEHI 261 Query: 361 SGLVEKYRPDAIII 374 + + A+++ Sbjct: 262 GWICHRGYDKALMV 275 >gi|328870919|gb|EGG19291.1| DEAD/DEAH box helicase [Dictyostelium fasciculatum] Length = 2224 Score = 39.3 bits (90), Expect = 1.6, Method: Composition-based stats. Identities = 42/288 (14%), Positives = 80/288 (27%), Gaps = 38/288 (13%) Query: 66 CLNSVNNPNPEVFKGA-----ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET-- 118 N V ++A GKT VL + P + +A E+ Sbjct: 1386 YFNPVQTQVFSSLYTTDENVFVAAPANTGKTVCAELAVLRTLINNPEARCVYIAPVESMV 1445 Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178 +++ WA K+ +++ + S ++ + ++ + + R + + Sbjct: 1446 TVRSRDWAY--KFGQKFGKVSVLTGDAVTDNKILEASRIIVTT----AERWDILSRKWRQ 1499 Query: 179 ERPDTFVGHHNTYGMAIINDE----ASGTPDVINLGILGFL----TERNANRFWIMTSNP 230 + I DE SG +L + T+ + +I S+P Sbjct: 1500 KNSRV------QSVSLFIVDELQMIGSGESGSTMEIVLSRMRYIATQTGSPIRFIGLSSP 1553 Query: 231 RRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290 + + L +W T D E I G D + Sbjct: 1554 VANA--------RDLAEWMGATPATMFNFHPDVRPVEMEIQMQGFDYPNFQERQMAM--- 1602 Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRR 338 + ++ A P A M DI + RR Sbjct: 1603 TKPALYAVSHMDRTAQTLVYVPTRKAARQMAADIILFVDSEDDMNTRR 1650 >gi|308198038|ref|XP_001387028.2| predicted protein [Scheffersomyces stipitis CBS 6054] gi|149389001|gb|EAZ63005.2| predicted protein [Pichia stipitis CBS 6054] Length = 941 Score = 39.3 bits (90), Expect = 1.6, Method: Composition-based stats. Identities = 10/44 (22%), Positives = 23/44 (52%), Gaps = 4/44 (9%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE----TQLK 121 ++ G+GKT + + ++L + P +I +A ++ Q+K Sbjct: 42 VALPTGLGKTFIASTVMLNFLRWFPESKMIFVAPTKPLVAQQIK 85 >gi|220915119|ref|YP_002490424.1| hypothetical protein Mnod_7767 [Methylobacterium nodulans ORS 2060] gi|219952973|gb|ACL63358.1| hypothetical protein Mnod_7767 [Methylobacterium nodulans ORS 2060] Length = 846 Score = 39.3 bits (90), Expect = 1.6, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 67/215 (31%), Gaps = 36/215 (16%) Query: 124 LWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDT 183 L+A + WL+ + + + + P S + + S ERP+ Sbjct: 513 LFAWLMNWLAHAAQRPHEKPGTAPIFKGPQGS--GKTTFTNLLRAIFHPAHVVSAERPEA 570 Query: 184 FVGHHNTYG---MAIINDEASGTPDV-INLGILGFLTERNANR--------------FWI 225 +G HN + + ++ DEA D N + +T+ ++ Sbjct: 571 LLGKHNAHLREALFVMADEAVFAGDPAANNRLKAMVTDATLTIEPKGIDAVSVPSFHRFV 630 Query: 226 MTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC 285 MTSN + + W F + V + ++ + A ++ R + Sbjct: 631 MTSNEDHVIRAEADA-----RRWAVFDVSGEQVGNV--AYFRELYAVLKPETPEVRAFLR 683 Query: 286 GQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIM 320 + I E A+ R P I+ Sbjct: 684 ---------DLAVMEIDEAAVRRAPTTSALVGQIV 709 >gi|158318502|ref|YP_001511010.1| helicase domain-containing protein [Frankia sp. EAN1pec] gi|158113907|gb|ABW16104.1| helicase domain protein [Frankia sp. EAN1pec] Length = 1143 Score = 39.3 bits (90), Expect = 1.6, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 47/136 (34%), Gaps = 5/136 (3%) Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169 + +A+ KT + E+ +L + + +L + W + +L D+ Y Sbjct: 273 GVIIADEVGLGKTYIAGELLHEAVILNRQKALVVAPATLRDSTWKPFLRETNLPADTVSY 332 Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGF---LTERNANRFWIM 226 + R H +I DEA + LT + R ++ Sbjct: 333 EELTRGMPAAGQQGAALQHPDAYALVIVDEAHALRSLGTQRAEAMRLLLTGKVPKRLVLL 392 Query: 227 TSNPRRLSGKFYEIFN 242 T+ P S Y+++N Sbjct: 393 TATPVNNS--LYDLYN 406 >gi|111184763|gb|ABH08471.1| putative terminase ATPase subunit [Human herpesvirus 3] gi|157965750|gb|ABW06896.1| DNA packaging protein [Human herpesvirus 3] Length = 743 Score = 39.3 bits (90), Expect = 1.6, Method: Composition-based stats. Identities = 32/155 (20%), Positives = 54/155 (34%), Gaps = 28/155 (18%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GKT L+ +M+T GI V A + K + + + Sbjct: 268 GKTWFLVPLIALVMATFRGIKVGYTA------------HIRKATEPV-------FEGIKS 308 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM------AIINDEASG 202 W+ + +S +S +YS F HNT G+ + DEA+ Sbjct: 309 RLEQWFGANYVDHVKGESITFSFTDGSYSTAV---FASSHNTNGIRGQDFNLLFVDEANF 365 Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 I+GFL + N ++ ++N + S F Sbjct: 366 IRPDAVQTIVGFLNQTNCKIIFVSSTNTGKASTSF 400 >gi|83721852|emb|CAI44887.1| putative terminase ATPase subunit [Human herpesvirus 3] gi|94481989|gb|ABF21689.1| putative ATPase subunit of terminase [Human herpesvirus 3] gi|94482063|gb|ABF21762.1| putative ATPase subunit of terminase [Human herpesvirus 3] gi|94482137|gb|ABF21835.1| putative ATPase subunit of terminase [Human herpesvirus 3] gi|116489977|gb|ABJ98890.1| ORF45/42 [Human herpesvirus 3] Length = 747 Score = 39.3 bits (90), Expect = 1.6, Method: Composition-based stats. Identities = 32/155 (20%), Positives = 54/155 (34%), Gaps = 28/155 (18%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GKT L+ +M+T GI V A + K + + + Sbjct: 272 GKTWFLVPLIALVMATFRGIKVGYTA------------HIRKATEPV-------FEGIKS 312 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM------AIINDEASG 202 W+ + +S +S +YS F HNT G+ + DEA+ Sbjct: 313 RLEQWFGANYVDHVKGESITFSFTDGSYSTAV---FASSHNTNGIRGQDFNLLFVDEANF 369 Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 I+GFL + N ++ ++N + S F Sbjct: 370 IRPDAVQTIVGFLNQTNCKIIFVSSTNTGKASTSF 404 >gi|9625919|ref|NP_040165.1| DNA packaging terminase subunit 1 [Human herpesvirus 3] gi|139650|sp|P09294|TRM3_VZVD RecName: Full=Tripartite terminase subunit UL15 homolog; AltName: Full=DNA-packaging protein 45; AltName: Full=Terminase large subunit; Contains: RecName: Full=Gene 42 protein gi|5869808|emb|CAB55553.1| putative ATPase subunit of terminase [Human herpesvirus 3 strain Dumas] gi|46981453|gb|AAT07724.1| DNA packaging protein [Human herpesvirus 3] gi|46981524|gb|AAT07800.1| DNA packaging protein [Human herpesvirus 3] gi|94481841|gb|ABF21543.1| putative ATPase subunit of terminase [Human herpesvirus 3] gi|94481915|gb|ABF21616.1| putative ATPase subunit of terminase [Human herpesvirus 3] gi|94482211|gb|ABF21908.1| putative ATPase subunit of terminase [Human herpesvirus 3] gi|94482285|gb|ABF21981.1| putative ATPase subunit of terminase [Human herpesvirus 3] gi|94482359|gb|ABF22054.1| putative ATPase subunit of terminase [Human herpesvirus 3] gi|94482433|gb|ABF22127.1| putative ATPase subunit of terminase [Human herpesvirus 3] gi|94482507|gb|ABF22200.1| putative ATPase subunit of terminase [Human herpesvirus 3] gi|94482581|gb|ABF22273.1| putative ATPase subunit of terminase [Human herpesvirus 3] Length = 747 Score = 39.3 bits (90), Expect = 1.6, Method: Composition-based stats. Identities = 32/155 (20%), Positives = 54/155 (34%), Gaps = 28/155 (18%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148 GKT L+ +M+T GI V A + K + + + Sbjct: 272 GKTWFLVPLIALVMATFRGIKVGYTA------------HIRKATEPV-------FEGIKS 312 Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM------AIINDEASG 202 W+ + +S +S +YS F HNT G+ + DEA+ Sbjct: 313 RLEQWFGANYVDHVKGESITFSFTDGSYSTAV---FASSHNTNGIRGQDFNLLFVDEANF 369 Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 I+GFL + N ++ ++N + S F Sbjct: 370 IRPDAVQTIVGFLNQTNCKIIFVSSTNTGKASTSF 404 >gi|56692599|ref|YP_164067.1| large terminase subunit [Pseudomonas phage B3] gi|33338625|gb|AAQ13949.1|AF232233_31 large terminase subunit [Pseudomonas phage B3] Length = 486 Score = 39.3 bits (90), Expect = 1.7, Method: Composition-based stats. Identities = 27/139 (19%), Positives = 49/139 (35%), Gaps = 7/139 (5%) Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-- 314 ++G+D + + I D + + E P D +F+ ++I A + Sbjct: 221 EIQGMDEAQYFDFIRAGCADEESFQQEYMCN-PADDDVAFLEYDLIASAEYPQTANWQQP 279 Query: 315 -YAPLIMGCDIAEEGGDNTVVVLRR--GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371 L G DI + D TV+ + G V+ ++R + + R + Sbjct: 280 EGGRLFAGVDIGRK-KDLTVLWILELLGDVLYTRHVERLQNMRKSAQEAILWPWFQRCER 338 Query: 372 IIIDANNTGARTCDYLEML 390 I IDA G D + Sbjct: 339 ICIDATGLGIGWADDAQDQ 357 >gi|262172263|ref|ZP_06039941.1| ATP-dependent RNA helicase SrmB [Vibrio mimicus MB-451] gi|261893339|gb|EEY39325.1| ATP-dependent RNA helicase SrmB [Vibrio mimicus MB-451] Length = 416 Score = 39.3 bits (90), Expect = 1.7, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 68/237 (28%), Gaps = 34/237 (14%) Query: 47 GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106 GFS P Q E + + SA G GKT A L + P Sbjct: 22 GFSRPTQVQAEAI------------PQALDGRDVLASAPTGTGKTAAFAIPALQYLLDFP 69 Query: 107 -----GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCS 161 ++ L + AE ++ L+ + F + + ++D+L + Sbjct: 70 RRKAGPARILILTPTRELAMQV--AEQAQALAKNTRLNIFTITGGVQYQE--HADILATT 125 Query: 162 LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNAN 221 I + +I DEA D+ + L+ Sbjct: 126 QDI------VVATPGRLLEYIDAERFDCRAIEWLILDEADRMLDMGFGPTVDRLSAECRW 179 Query: 222 RFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSD 278 R + + L G+ E F L ID P I+++ +D Sbjct: 180 RKQTLLFSAT-LEGRGVEGFTADL-LKNPAHIDAE-----PPRRERKKISQWYHRAD 229 >gi|159482689|ref|XP_001699400.1| predicted protein [Chlamydomonas reinhardtii] gi|158272851|gb|EDO98646.1| predicted protein [Chlamydomonas reinhardtii] Length = 231 Score = 39.3 bits (90), Expect = 1.7, Method: Composition-based stats. Identities = 12/70 (17%), Positives = 26/70 (37%), Gaps = 2/70 (2%) Query: 52 RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVI 111 R+W + V + V + G+GKT + A ++L P ++ Sbjct: 7 RTWLYPTDQEVREYQFRMV--RGALFANTLVCLPTGLGKTLIAAVVILNFYRWFPDGKLV 64 Query: 112 CLANSETQLK 121 A ++ ++ Sbjct: 65 FTAPTKPLVE 74 >gi|75763594|ref|ZP_00743293.1| Stage V sporulation protein AA [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|74488924|gb|EAO52441.1| Stage V sporulation protein AA [Bacillus thuringiensis serovar israelensis ATCC 35646] Length = 206 Score = 38.9 bits (89), Expect = 1.8, Method: Composition-based stats. Identities = 16/86 (18%), Positives = 37/86 (43%), Gaps = 2/86 (2%) Query: 300 NIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK 359 I + P + +G D+A+ GD++VV L + ++ + KT + K Sbjct: 3 QTIYIKMRNRLKVSPTYEVKLG-DVAQLAGDSSVVELLQNEIVYKITAHDKTHVVIDVMK 61 Query: 360 ISGLVEKYRPDAIIIDANNTGARTCD 385 + ++++ + + I+ +G D Sbjct: 62 VIEIIQQ-KASHVQINLLGSGQTLVD 86 >gi|262163925|ref|ZP_06031664.1| ATP-dependent RNA helicase SrmB [Vibrio mimicus VM223] gi|262027453|gb|EEY46119.1| ATP-dependent RNA helicase SrmB [Vibrio mimicus VM223] Length = 416 Score = 38.9 bits (89), Expect = 1.8, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 68/237 (28%), Gaps = 34/237 (14%) Query: 47 GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106 GFS P Q E + + SA G GKT A L + P Sbjct: 22 GFSRPTQVQAEAI------------PQALDGRDVLASAPTGTGKTAAFAIPALQYLLDFP 69 Query: 107 -----GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCS 161 ++ L + AE ++ L+ + F + + ++D+L + Sbjct: 70 RRKAGPARILILTPTRELAMQV--AEQAQALAKNTRLNIFTITGGVQYQE--HADILATT 125 Query: 162 LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNAN 221 I + +I DEA D+ + L+ Sbjct: 126 QDI------VVATPGRLLEYIDAERFDCRAIEWLILDEADRMLDMGFGPTVDRLSAECRW 179 Query: 222 RFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSD 278 R + + L G+ E F L ID P I+++ +D Sbjct: 180 RKQTLLFSAT-LEGRGVEGFTADL-LKNPAHIDAE-----PPRRERKKISQWYHRAD 229 >gi|258620330|ref|ZP_05715368.1| Superfamily II DNA and RNA helicase [Vibrio mimicus VM573] gi|258624701|ref|ZP_05719635.1| Superfamily II DNA and RNA helicase [Vibrio mimicus VM603] gi|258582988|gb|EEW07803.1| Superfamily II DNA and RNA helicase [Vibrio mimicus VM603] gi|258587209|gb|EEW11920.1| Superfamily II DNA and RNA helicase [Vibrio mimicus VM573] Length = 416 Score = 38.9 bits (89), Expect = 1.8, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 68/237 (28%), Gaps = 34/237 (14%) Query: 47 GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106 GFS P Q E + + SA G GKT A L + P Sbjct: 22 GFSRPTQVQAEAI------------PQALDGRDVLASAPTGTGKTAAFAIPALQYLLDFP 69 Query: 107 -----GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCS 161 ++ L + AE ++ L+ + F + + ++D+L + Sbjct: 70 RRKAGPARILILTPTRELAMQV--AEQAQALAKNTRLNIFTITGGVQYQE--HADILATT 125 Query: 162 LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNAN 221 I + +I DEA D+ + L+ Sbjct: 126 QDI------VVATPGRLLEYIDAERFDCRAIEWLILDEADRMLDMGFGPTVDRLSAECRW 179 Query: 222 RFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSD 278 R + + L G+ E F L ID P I+++ +D Sbjct: 180 RKQTLLFSAT-LEGRGVEGFTADL-LKNPAHIDAE-----PPRRERKKISQWYHRAD 229 >gi|206580893|ref|YP_002240749.1| type I site-specific deoxyribonuclease, HsdR family [Klebsiella pneumoniae 342] gi|206569951|gb|ACI11727.1| type I site-specific deoxyribonuclease, HsdR family [Klebsiella pneumoniae 342] Length = 1031 Score = 38.9 bits (89), Expect = 1.9, Method: Composition-based stats. Identities = 50/330 (15%), Positives = 101/330 (30%), Gaps = 50/330 (15%) Query: 86 RGIGKTTLNAWLVLWLMSTRPGISV-ICLANSE--TQLKTTLWA---EVSKWLSLLPNKH 139 +G GK+ WL W+ P V I +E Q+++ E+ Sbjct: 278 QGSGKSLTMVWLAKWIRENVPNSRVLIVTDRTELDEQIESVFMGVDEEI------YRTSS 331 Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT--YSEERPDTFVGHHNTYGMAIIN 197 ++ + HP PW L G S+ T +E + + + Sbjct: 332 GNDLIATLNHPNPWLICSLVHKFGRRSEAEDTAATDDFITELQQSLTKTFRAKGDLFVFV 391 Query: 198 DE-------------ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP 244 DE + P+ + +G G + + + + G + + Sbjct: 392 DECHRTQSGKLHNAMTAILPEALFIGFTGTPLMKKDKKKSV------EVFGPYIHTYKFD 445 Query: 245 LDDWKRFQIDTR------TVEGIDPSFHEGIIARYGLD-SDVTRVEVCGQFPQQDIDSFI 297 +D R + S++ + ++ ++ Sbjct: 446 EAVADGVVLDLRYEARDIDQYLTSEKKVDDWFEAKTRGLSNLAKTQLKQKWGSMQ-KLLS 504 Query: 298 PLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTN 357 + +E+ +N P +M + G+ +V + +S+TD Sbjct: 505 SKSRLEQIVNDILLDMDTRPRLM-----DGRGNAMLVCSSVYQACKVYEMFSQTD---LA 556 Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYL 387 K++ +V +RPDA I TGA + L Sbjct: 557 GKVA-IVTSFRPDAASIKGEETGAGLTEKL 585 >gi|310722509|ref|YP_003969332.1| Dda DNA helicase [Aeromonas phage phiAS5] gi|306021352|gb|ADM79886.1| Dda DNA helicase [Aeromonas phage phiAS5] Length = 454 Score = 38.9 bits (89), Expect = 1.9, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 46/147 (31%), Gaps = 27/147 (18%) Query: 66 CLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125 S++ + IS G GK+ L L+ + + VI A + Q K L Sbjct: 17 QKRSIDAVLNDRSHITISGPAGSGKSFLTKILIK-KLIEKNNGGVILSAPT-HQAKIVL- 73 Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185 + + + +H + I Y + R + + + D Sbjct: 74 ----------------------SKMSGYTASTIHSIMKIHPDTYEDV-REFKQSKSDK-A 109 Query: 186 GHHNTYGMAIINDEASGTPDVINLGIL 212 +I DEAS + + IL Sbjct: 110 KKDLNEVRYLIVDEASMVDNDLFEIIL 136 >gi|145603324|ref|XP_369340.2| hypothetical protein MGG_06124 [Magnaporthe oryzae 70-15] gi|145011578|gb|EDJ96234.1| hypothetical protein MGG_06124 [Magnaporthe oryzae 70-15] Length = 1998 Score = 38.9 bits (89), Expect = 1.9, Method: Composition-based stats. Identities = 24/118 (20%), Positives = 41/118 (34%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W RPG V+ +A L E V W + L + Sbjct: 1176 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAP-----MKALVRERVKDWGARLAQPMGLK 1230 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + I + + + R++ G+ + II DE Sbjct: 1231 LVELTGDNTPDTRTIKDADVIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1281 >gi|209885731|ref|YP_002289588.1| phage DNA Packaging Protein [Oligotropha carboxidovorans OM5] gi|209873927|gb|ACI93723.1| phage DNA Packaging Protein [Oligotropha carboxidovorans OM5] Length = 434 Score = 38.9 bits (89), Expect = 1.9, Method: Composition-based stats. Identities = 68/419 (16%), Positives = 129/419 (30%), Gaps = 80/419 (19%) Query: 84 AGRGIGKTTLNA------WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137 GRG GKT A L L ++ +P + + +E ++ + VS L++ Sbjct: 52 GGRGAGKTRAGAEWIRAQALGLAPLAQQPAGRIALVGETEHDVREVMIEGVSGLLAVHRR 111 Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197 W + +S E P++ G Sbjct: 112 DE----------RPMWQPSRRRLEWKN-----GAVAHAFSAEDPESLRG---PQFACAWA 153 Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTS-NPRRLSGKFYEIFNKPLDDWKRFQIDTR 256 DE + + +F + PR+L +P KR D Sbjct: 154 DELAK-----WRYAEAAF---DMLQFGLRLGAQPRQLIT----TTPRPTALIKRLLNDES 201 Query: 257 TVE----------GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306 V + P+F + ++ARY + + R E+ G+ ++ D+ +IE Sbjct: 202 CVTTRAATRSNALHLAPTFLQSVMARY-AGTRLGRQELDGELIEERPDALWSRGLIETC- 259 Query: 307 NREPCPDPYAPLIMGCD-IAEEGGDNTV-------VVLRRGPVIEHLFDWSKTDLRTTNN 358 R P +++ D A G V G + ++ Sbjct: 260 -RISEAPPLQRIVVAVDPPATSGKRADACGIVAAGVAADNGLYVLADETLTQAAPAAWAA 318 Query: 359 KISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRRTEL 416 + L + DA++++ N G + + V V + R E Sbjct: 319 RAVALWRRLEADALVVEVNQGGEMVRAVIAQVDPSVPVQPVRALRGKW-------LRAE- 370 Query: 417 HVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475 +A E + H+G+ L+ + +G + S +S D+ D L++ Sbjct: 371 --PIATLYEQGR-VRHAGVFAALED-EMCDFATSG---LSS-----GRSPDHLDALVWA 417 >gi|221117267|ref|XP_002154001.1| PREDICTED: similar to yeast Swi2/Snf2-Like family member (ssl-1), partial [Hydra magnipapillata] Length = 2164 Score = 38.9 bits (89), Expect = 1.9, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 48/153 (31%), Gaps = 22/153 (14%) Query: 87 GIGKTTLNAWLVLWLMSTRPGISVI--CLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144 G+GKT + +L ++ G + + L L E+ KW +F Q Sbjct: 657 GLGKT-IQTIALLAHLACEEGCWGPHLIIVPTSVMLNWEL--ELKKWCPGFKILTYFGTQ 713 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204 + G + +C T + II DEA Sbjct: 714 ----------KERKIKRAGWCKPNAFHVCITSYKLVIQDHQAFKRRKWKYIILDEAQNIK 763 Query: 205 D---VINLGILGFLTERNANRFWIMTSNPRRLS 234 + +L F N++R ++T P + S Sbjct: 764 NFKSQRWQTLLNF----NSHRRLLLTGTPLQNS 792 >gi|163868971|ref|YP_001610200.1| hypothetical protein Btr_1983 [Bartonella tribocorum CIP 105476] gi|161018647|emb|CAK02205.1| phage-related protein [Bartonella tribocorum CIP 105476] Length = 453 Score = 38.9 bits (89), Expect = 1.9, Method: Composition-based stats. Identities = 39/303 (12%), Positives = 88/303 (29%), Gaps = 27/303 (8%) Query: 84 AGRGIGKT---TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140 GRG GKT L + +V + +I A + N+ Sbjct: 39 GGRGSGKTRSFALMSAVVGYRHGMAGERGIILCA---------------RQFQNSLNESS 83 Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 E ++ P+ D + ++ + DEA Sbjct: 84 LEEIKRAIEAYPFLQDYYEIGDKYIKSKDGRIAYVFAGLDRNIASIKSMGRVFLCWVDEA 143 Query: 201 SGTPDVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFNK-PLDDWKRFQIDTRT 257 + ++ L E N +T NP + + F + K +I+ R Sbjct: 144 EPVTETAWQTLIPTLREEGDDWNAELWVTWNPYHENAPVEKRFRNVDNPNIKGVEINWRD 203 Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317 + ++ + G + Q ++ ++E + P P Sbjct: 204 NPKFPEKLNRDRLSDLQQRPEQYNHIWEGGYLQAVQGAYYQKCLLEAEMEGRITTVPRDP 263 Query: 318 ---LIMGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371 + + DI G D T + + + + D+ + + + + + ++ A Sbjct: 264 LMQVRIFWDIGGTGAKADATALWVAQFVGREIRVLDYYEAQGQPLSEHVGWVFQRGYEKA 323 Query: 372 III 374 +++ Sbjct: 324 LMV 326 >gi|255729652|ref|XP_002549751.1| conserved hypothetical protein [Candida tropicalis MYA-3404] gi|240132820|gb|EER32377.1| conserved hypothetical protein [Candida tropicalis MYA-3404] Length = 1162 Score = 38.9 bits (89), Expect = 2.0, Method: Composition-based stats. Identities = 10/46 (21%), Positives = 22/46 (47%), Gaps = 4/46 (8%) Query: 80 GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE----TQLK 121 ++ G+GKT + + ++L + P +I +A + Q+K Sbjct: 107 VLVALPTGLGKTFIASTVMLNFLRWFPNSKIIFMAPTRPLVAQQIK 152 >gi|330911327|gb|EGH39837.1| phage terminase, large subunit [Escherichia coli AA86] Length = 555 Score = 38.9 bits (89), Expect = 2.0, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 49/152 (32%), Gaps = 24/152 (15%) Query: 178 EERPDTFVGHHNTYGMAIINDEASGTP--DVINLGILGFLTERNANRFWIMTSNP-RRLS 234 RP G ++ DEA+ D + + LT A I T N L Sbjct: 159 SSRPSNLRGLQGD----VVIDEAAFHESLDELLKAAM-ALTMWGARVRIISTHNGVDNLF 213 Query: 235 GKFYEIFNKPLDDWKRFQIDTRTV--------------EGIDPSFHEGIIARYGLDSDVT 280 ++ + + D+ +I + P + ++ Sbjct: 214 NQYIQEAREGRKDYSVHRITLDDAIADGLYRRICYVTGQEWSPESEQKWRDDLYKNAPTR 273 Query: 281 RV--EVCGQFPQQDIDSFIPLNIIEEALNREP 310 E G P++ ++IP +IE A++R+ Sbjct: 274 EDADEEYGCIPKKSGGAYIPHALIEMAMSRDI 305 >gi|171690334|ref|XP_001910092.1| hypothetical protein [Podospora anserina S mat+] gi|170945115|emb|CAP71226.1| unnamed protein product [Podospora anserina S mat+] Length = 1993 Score = 38.9 bits (89), Expect = 2.0, Method: Composition-based stats. Identities = 22/118 (18%), Positives = 38/118 (32%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W PG V+ +A L E V W L Sbjct: 1164 SPTGSGKTVAAELAMWWAFREHPGSKVVYIAP-----MKALVRERVKDWGDRLAKPLGLR 1218 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + I + + + R++ G+ + +I DE Sbjct: 1219 LVELTGDNTPDTRTIQDADIIITTPEKWDGISRSWQT------RGYVRKVSLVVI-DE 1269 >gi|297519140|ref|ZP_06937526.1| Terminase, ATPase subunit [Escherichia coli OP50] Length = 159 Score = 38.9 bits (89), Expect = 2.2, Method: Composition-based stats. Identities = 11/48 (22%), Positives = 17/48 (35%) Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388 I W D R + I L E+Y I ID+ + ++ Sbjct: 11 RILERHQWRGMDFRAQADAIKKLTEQYNVTYIGIDSTGVDHGVYENVK 58 >gi|212545286|ref|XP_002152797.1| DEAD/DEAH box helicase, putative [Penicillium marneffei ATCC 18224] gi|210065766|gb|EEA19860.1| DEAD/DEAH box helicase, putative [Penicillium marneffei ATCC 18224] Length = 2022 Score = 38.9 bits (89), Expect = 2.2, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 49/155 (31%), Gaps = 22/155 (14%) Query: 55 QLEFMEVVDAHCLNSVNNPNPEVFKGA--------ISAGRGIGKTTLNAWLVLWLMSTRP 106 Q +E + N ++F + + G GKT + W RP Sbjct: 1125 QNPILEEIYGQRFQFFNPMQTQLFHTLYHTSANVLLGSPTGSGKTVACELAMWWAFRERP 1184 Query: 107 GISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI- 164 G V+ +A L E V W + ++ L+ P + + I Sbjct: 1185 GSKVVYIAP-----MKALVRERVQDWRKRITTAMGLKLVELTGDNTPDTRTIRDADIIIT 1239 Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + + + R++ G+ + II DE Sbjct: 1240 TPEKWDGISRSWQT------RGYVRQVSLVII-DE 1267 >gi|255933656|ref|XP_002558207.1| Pc12g14010 [Penicillium chrysogenum Wisconsin 54-1255] gi|211582826|emb|CAP81028.1| Pc12g14010 [Penicillium chrysogenum Wisconsin 54-1255] Length = 2009 Score = 38.9 bits (89), Expect = 2.2, Method: Composition-based stats. Identities = 21/118 (17%), Positives = 38/118 (32%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W +PG V+ +A L E V W L + + Sbjct: 1166 SPTGSGKTVAAELAMWWAFREKPGSKVVYIAP-----MKALVRERVQDWRKRLTRQMGLK 1220 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + I + + + R++ +I DE Sbjct: 1221 LVELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQTRDYVR-------KVSLVIIDE 1271 >gi|92113525|ref|YP_573453.1| hypothetical protein Csal_1399 [Chromohalobacter salexigens DSM 3043] gi|91796615|gb|ABE58754.1| protein of unknown function DUF264 [Chromohalobacter salexigens DSM 3043] Length = 594 Score = 38.5 bits (88), Expect = 2.3, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 44/164 (26%), Gaps = 22/164 (13%) Query: 242 NKPLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 P W++ I+ G D + + Y + + +F +F + Sbjct: 326 RGPDGQWRQIVTIEDAIAGGCDLFDLDQLRLEY--SDEEFANLLMCEFVDDSQSAFPMMT 383 Query: 301 IIEEALNREPCPDPYAP----------LIMGCDIAEEGGDN-----TVVVL--RRGPVIE 343 + ++ + P + +G D A + D V+ R Sbjct: 384 MQRCMVDSWDIWRDWKPFAARPFGDKPVWLGYDPAGDNLDGDGAGLVVLAPAKNRNDRHR 443 Query: 344 HLFDWS--KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385 L D I + +Y I ID N G Sbjct: 444 ILEKHRIKGQDYEEQAGFIEQVTRRYNVQFIGIDINGMGEAVAQ 487 >gi|159164912|gb|ABV80241.2| mutant required to maintain repression 1 [Zea mays] Length = 1435 Score = 38.5 bits (88), Expect = 2.3, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 29/87 (33%), Gaps = 15/87 (17%) Query: 55 QLEFMEVVDAHCLNSV-NNPNPEVFKGAISAG----R--GIGKTTLNAWLVLWLMSTRPG 107 Q E E + + + + + K + G G GKT L + M P Sbjct: 852 QREAFEFMWTNLVGDIRLDEIKHGAKPDVVGGCVICHAPGTGKTRLAIVFIQTYMKVFPD 911 Query: 108 ISVICLANSETQLKTTLWA---EVSKW 131 + +A + L+A E KW Sbjct: 912 CRPVIIAP-----RGMLFAWDEEFKKW 933 >gi|159164911|gb|ABV80240.2| mutant required to maintain repression 1 [Zea mays] Length = 1435 Score = 38.5 bits (88), Expect = 2.3, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 29/87 (33%), Gaps = 15/87 (17%) Query: 55 QLEFMEVVDAHCLNSV-NNPNPEVFKGAISAG----R--GIGKTTLNAWLVLWLMSTRPG 107 Q E E + + + + + K + G G GKT L + M P Sbjct: 852 QREAFEFMWTNLVGDIRLDEIKHGAKPDVVGGCVICHAPGTGKTRLAIVFIQTYMKVFPD 911 Query: 108 ISVICLANSETQLKTTLWA---EVSKW 131 + +A + L+A E KW Sbjct: 912 CRPVIIAP-----RGMLFAWDEEFKKW 933 >gi|332701845|ref|ZP_08421933.1| hypothetical protein Desaf_0686 [Desulfovibrio africanus str. Walvis Bay] gi|332551994|gb|EGJ49038.1| hypothetical protein Desaf_0686 [Desulfovibrio africanus str. Walvis Bay] Length = 554 Score = 38.5 bits (88), Expect = 2.3, Method: Composition-based stats. Identities = 51/283 (18%), Positives = 87/283 (30%), Gaps = 68/283 (24%) Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------REPCPDPY 315 ++E I YG V R E+ P+ IP IEEA+ R D + Sbjct: 257 KRKWYERIRNSYGPRVAVMREELDAI-PRDGGGQAIPGVWIEEAMREARPILRIALDDDF 315 Query: 316 A--------------------PLIMGCDIAEE---------GGDNTVVV--------LRR 338 A PL+ D A E D +VV +RR Sbjct: 316 AKLPEDSRRVWGSEWIDRHLKPLLARLDPAREHVFGQDFARHRDFSVVAPLEIGQTLIRR 375 Query: 339 GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY-LEMLGYHVYRV 397 P + + + + + ++R A+ DA +GA +Y + G+ Sbjct: 376 APFLLEMHNVPTRQQEQILWALIAALPRFRGGAM--DATGSGATLAEYTADKFGHERIHQ 433 Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFA--SLINHSGLIQNLKSLKSFIVPNTGELAI 455 + +A E K+ D E L + + +L++L+ G + + Sbjct: 434 VMLSQAWYREHMP--------KLVDAFETGMIDLPRDADIESDLRALEEI----DGIIKL 481 Query: 456 ESKRVK------GAKSTDYSDGLMYT-FAENPPRSDMDFGRCP 491 R + + D + FA D+ P Sbjct: 482 PDIRKQDLKDAELKRHGDSAIAFALGWFASQGTAEAFDYRPVP 524 >gi|255683197|ref|YP_003084405.1| UL15 [Duck enteritis virus] gi|254840012|gb|ACT83557.1| UL15 [Anatid herpesvirus 1] Length = 739 Score = 38.5 bits (88), Expect = 2.3, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 54/153 (35%), Gaps = 24/153 (15%) Query: 89 GKTTLNAWLVLWLMSTRPGISVICLAN----SETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144 GKT L+ ++ GI + A+ +E + + A + +W +H Sbjct: 265 GKTWFIVPLIALALTKFRGIKIGYTAHIRKATEP-VFDEIDARIRRWFGNGRVEHI---- 319 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204 + + S SK T S ++ G + + + DEA+ Sbjct: 320 ---------KGETISFSFQDGSKSTVTFA---SSHNTNSLRGQ--DFNLLFV-DEANFIR 364 Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237 I+GFL + N ++ ++N + S F Sbjct: 365 SDAVQTIVGFLNQTNCKIIFVSSTNTGKSSTSF 397 >gi|159164914|gb|ABV80243.2| mutant required to maintain repression 1 [Zea mays] Length = 1435 Score = 38.5 bits (88), Expect = 2.3, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 29/87 (33%), Gaps = 15/87 (17%) Query: 55 QLEFMEVVDAHCLNSV-NNPNPEVFKGAISAG----R--GIGKTTLNAWLVLWLMSTRPG 107 Q E E + + + + + K + G G GKT L + M P Sbjct: 852 QREAFEFMWTNLVGDIRLDEIKHGAKPDVVGGCVICHAPGTGKTRLAIVFIQTYMKVFPD 911 Query: 108 ISVICLANSETQLKTTLWA---EVSKW 131 + +A + L+A E KW Sbjct: 912 CRPVIIAP-----RGMLFAWDEEFKKW 933 >gi|159164908|gb|ABV80237.2| required to maintain repression 1 [Zea mays] gi|159164910|gb|ABV80239.2| required to maintain repression 1 [Zea mays] Length = 1435 Score = 38.5 bits (88), Expect = 2.3, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 29/87 (33%), Gaps = 15/87 (17%) Query: 55 QLEFMEVVDAHCLNSV-NNPNPEVFKGAISAG----R--GIGKTTLNAWLVLWLMSTRPG 107 Q E E + + + + + K + G G GKT L + M P Sbjct: 852 QREAFEFMWTNLVGDIRLDEIKHGAKPDVVGGCVICHAPGTGKTRLAIVFIQTYMKVFPD 911 Query: 108 ISVICLANSETQLKTTLWA---EVSKW 131 + +A + L+A E KW Sbjct: 912 CRPVIIAP-----RGMLFAWDEEFKKW 933 >gi|67524049|ref|XP_660086.1| hypothetical protein AN2482.2 [Aspergillus nidulans FGSC A4] gi|40744644|gb|EAA63800.1| hypothetical protein AN2482.2 [Aspergillus nidulans FGSC A4] gi|259487904|tpe|CBF86944.1| TPA: DEAD/DEAH box helicase, putative (AFU_orthologue; AFUA_4G03070) [Aspergillus nidulans FGSC A4] Length = 2015 Score = 38.5 bits (88), Expect = 2.3, Method: Composition-based stats. Identities = 26/151 (17%), Positives = 44/151 (29%), Gaps = 19/151 (12%) Query: 56 LEFMEVVDAHCLNSVNNPNPEVFKGAIS-----AGRGIGKTTLNAWLVLWLMSTRPGISV 110 LE + N + V + + G GKT + W RPG V Sbjct: 1122 LEELYGQRFQYFNPMQTQLFHVLYHTAANVLLGSPTGSGKTVAAELAMWWAFRERPGSKV 1181 Query: 111 ICLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI-DSKH 168 + +A L E V W L ++ L+ P + + I + Sbjct: 1182 VYIAP-----MKALVRERVMDWGRRLTAPMGLKLVELTGDNTPDTRTIRDADIIITTPEK 1236 Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + + R++ +I DE Sbjct: 1237 WDGISRSWQTRDYVR-------KVSLVIIDE 1260 >gi|159164909|gb|ABV80238.2| required to maintain repression 1 [Zea mays] Length = 1435 Score = 38.5 bits (88), Expect = 2.4, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 29/87 (33%), Gaps = 15/87 (17%) Query: 55 QLEFMEVVDAHCLNSV-NNPNPEVFKGAISAG----R--GIGKTTLNAWLVLWLMSTRPG 107 Q E E + + + + + K + G G GKT L + M P Sbjct: 852 QREAFEFMWTNLVGDIRLDEIKHGAKPDVVGGCVICHAPGTGKTRLAIVFIQTYMKVFPD 911 Query: 108 ISVICLANSETQLKTTLWA---EVSKW 131 + +A + L+A E KW Sbjct: 912 CRPVIIAP-----RGMLFAWDEEFKKW 933 >gi|52079727|ref|YP_078518.1| putative phage terminase (large subunit) [Bacillus licheniformis ATCC 14580] gi|52785096|ref|YP_090925.1| XtmB [Bacillus licheniformis ATCC 14580] gi|319646468|ref|ZP_08000697.1| XtmB protein [Bacillus sp. BT1B_CT2] gi|52002938|gb|AAU22880.1| putative phage terminase (large subunit) [Bacillus licheniformis ATCC 14580] gi|52347598|gb|AAU40232.1| XtmB [Bacillus licheniformis ATCC 14580] gi|317391056|gb|EFV71854.1| XtmB protein [Bacillus sp. BT1B_CT2] Length = 432 Score = 38.5 bits (88), Expect = 2.4, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 69/207 (33%), Gaps = 24/207 (11%) Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253 I +E S ++G L + ++T+NP S Y F K + KRF + Sbjct: 118 LIWIEECSEVKYEGFKELIGRLRHPYHRLYMMLTTNPVSQSNWTYRHFFKDERN-KRFIL 176 Query: 254 D---------------------TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD 292 D + S+ + + D D+ R+ G+F Sbjct: 177 DDEVLYKKRVAVVGDTYYHHSTADDNLFLPKSYLKQLDDMKAYDPDLYRIARKGRFGVNG 236 Query: 293 IDSFIPLNIIE-EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHL-FDWSK 350 ++E E + R+ G D E N VV P ++L W Sbjct: 237 TRVLPQFEVMEHEEVMRQISAISNPLKRTGMDFGFEESYNAVVRAAVDPDKKYLYIYWEY 296 Query: 351 TDLRTTNNKISGLVEKYRPDAIIIDAN 377 + T++K + + ++ +I A+ Sbjct: 297 YKNKMTDDKTAEELHEFAVAKELIKAD 323 >gi|288931818|ref|YP_003435878.1| hypothetical protein Ferp_1452 [Ferroglobus placidus DSM 10642] gi|288894066|gb|ADC65603.1| protein of unknown function DUF699 ATPase putative [Ferroglobus placidus DSM 10642] Length = 763 Score = 38.5 bits (88), Expect = 2.5, Method: Composition-based stats. Identities = 31/157 (19%), Positives = 59/157 (37%), Gaps = 27/157 (17%) Query: 65 HCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMS------TRPGISVICLANSET 118 + + E I+A RG GKT + + +L+S RP + ++ +A + Sbjct: 218 EAFETFFDRKREKKAVVITANRGRGKTAVLGIVTPYLISRMNRVLKRP-VRILVVAPTPY 276 Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178 ++T + + K L K + E +S +D++ ++ + R Sbjct: 277 AVQTY-FKFLKKALVRQGMKEFKEKRS---------NDLVTVINSKWARVEYAVPRRAMV 326 Query: 179 ERPDTFVGHHNTYGMAIINDEASGTP-DVINLGILGF 214 E+ Y II DEA+G V+ + G Sbjct: 327 EK---------DYADIIIVDEAAGIDVPVLWKIVEGA 354 >gi|240850562|ref|YP_002971962.1| phage terminase, large subunit [Bartonella grahamii as4aup] gi|240267685|gb|ACS51273.1| phage terminase, large subunit [Bartonella grahamii as4aup] Length = 441 Score = 38.5 bits (88), Expect = 2.5, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 56/182 (30%), Gaps = 9/182 (4%) Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFN-KPLDD 247 + DEA + ++ L E +T NP R + + F + Sbjct: 122 RILLCWVDEAEPVTETAWQTLIPTLREEGEGWRAELWVTWNPLRENAPVEKRFRFSDNEA 181 Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII---EE 304 KR +I+ +E + + + G + + ++ ++ +E Sbjct: 182 IKRVEINWSDNPKFPKILNEARLDDLRNRPETYKHIWEGDYLKAVQGAYYQKEMLAAEQE 241 Query: 305 ALNREPCPDPYAPLIMGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTNNKIS 361 DP + DI G D T + + + + ++ + + + I Sbjct: 242 GRIGRVARDPLMQIRAFWDIGGTGAKADATAIWIAQFVGREIRVLNYYEAQGQPLSEHIG 301 Query: 362 GL 363 L Sbjct: 302 WL 303 >gi|156543626|ref|XP_001604556.1| PREDICTED: hypothetical protein [Nasonia vitripennis] Length = 990 Score = 38.5 bits (88), Expect = 2.5, Method: Composition-based stats. Identities = 20/134 (14%), Positives = 42/134 (31%), Gaps = 15/134 (11%) Query: 77 VFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI-SVICLANSETQLKTTLWAEVSKWLSLL 135 F + A G GKT + + L ++ + VI LA + E++ + + Sbjct: 61 GFDLIVRAKSGTGKTAVFGIIALEMIDIKISSVQVIILAPT---------REIAIQIKEV 111 Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195 E++ L + + + + H + + D + Sbjct: 112 IASLGCEIKGLKVESFIGGVAMDIDRKKLSNCHIAIGAPGRVKHLIDKGY-LKMDHVRLF 170 Query: 196 INDEASGTPDVINL 209 + DEA D + Sbjct: 171 VLDEA----DKLME 180 >gi|242087829|ref|XP_002439747.1| hypothetical protein SORBIDRAFT_09g019410 [Sorghum bicolor] gi|241945032|gb|EES18177.1| hypothetical protein SORBIDRAFT_09g019410 [Sorghum bicolor] Length = 1535 Score = 38.5 bits (88), Expect = 2.7, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 29/87 (33%), Gaps = 15/87 (17%) Query: 55 QLEFMEVVDAHCLNSV-NNPNPEVFKGAISAG----R--GIGKTTLNAWLVLWLMSTRPG 107 Q E E + + + + + K + G G GKT L + M P Sbjct: 952 QREAFEFMWTNLVGGIRLDELKHGAKPDVVGGCVICHAPGTGKTRLAIVFIQTYMKVFPD 1011 Query: 108 ISVICLANSETQLKTTLWA---EVSKW 131 + +A + L+A E KW Sbjct: 1012 CRPVIIAP-----RGMLFAWDEEFKKW 1033 >gi|46949065|gb|AAT07420.1| UL89 DNA packaging protein [Macacine herpesvirus 3] Length = 671 Score = 38.5 bits (88), Expect = 2.7, Method: Composition-based stats. Identities = 16/77 (20%), Positives = 28/77 (36%), Gaps = 3/77 (3%) Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220 + ID K + S ++ G ++ DEA + ILGFL + Sbjct: 273 VISIDHKGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKEKAFNTILGFLAQNTT 329 Query: 221 NRFWIMTSNPRRLSGKF 237 +I ++N + F Sbjct: 330 KIIFISSTNTTSDATCF 346 >gi|261212229|ref|ZP_05926515.1| ATP-dependent RNA helicase SrmB [Vibrio sp. RC341] gi|260838837|gb|EEX65488.1| ATP-dependent RNA helicase SrmB [Vibrio sp. RC341] Length = 421 Score = 38.5 bits (88), Expect = 2.9, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 69/237 (29%), Gaps = 34/237 (14%) Query: 47 GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106 GFS P Q E + + SA G GKT A L + P Sbjct: 22 GFSRPTQVQAEAI------------PQALDGRDVLASAPTGTGKTAAFAIPALQYLLDFP 69 Query: 107 -----GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCS 161 ++ L + AE ++ L+ + F + + ++D+L + Sbjct: 70 RRKAGPARILILTPTRELAMQV--AEQAQALAKNTRLNIFTITGGVQYQE--HADILATT 125 Query: 162 LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNAN 221 I + +I DEA D+ + L+ Sbjct: 126 QDI------VVATPGRLLEYIDAERFDCRAIEWLILDEADRMLDMGFGPTVDRLSTECRW 179 Query: 222 RFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSD 278 R + + L G+ E F L D V+ P I+++ +D Sbjct: 180 RKQTLLFSAT-LEGRGVEGFTADLLK------DPAHVDAEPPRRERKKISQWYHRAD 229 >gi|240851102|ref|YP_002972504.1| phage terminase large subunit [Bartonella grahamii as4aup] gi|240268225|gb|ACS51813.1| phage terminase large subunit [Bartonella grahamii as4aup] Length = 453 Score = 38.5 bits (88), Expect = 2.9, Method: Composition-based stats. Identities = 46/306 (15%), Positives = 94/306 (30%), Gaps = 33/306 (10%) Query: 84 AGRGIGKT---TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140 GRG GKT L + +V + +I A + N+ Sbjct: 39 GGRGSGKTRSFALMSAVVGYRHGMAGERGIILCA---------------RQFQNSLNESS 83 Query: 141 FEMQSLSLHPAPWYSDVLHCSLG-IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI-IND 198 E ++ P+ D I SK + +R + + + D Sbjct: 84 LEEIKRAIESYPFLQDYYDIGDKYIKSKDGRIVYVFAGLDR--NIASIKSMGRVFLCWVD 141 Query: 199 EASGTPDVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFNK-PLDDWKRFQIDT 255 EA + ++ L E N +T NP + + F K +I+ Sbjct: 142 EAEPVTETAWQTLIPTLREEGKDWNAELWVTWNPCYENAPVEKRFRNVDNPHIKGAEINW 201 Query: 256 RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315 R + + + G++ Q ++ ++E + P Sbjct: 202 RDNPQFPEKLNRDRMDDLQQRPEQYNHIWEGEYLQAVQGAYYQKCLLEAEMEGRITTVPR 261 Query: 316 AP---LIMGCDIAEEG--GDNTVVVLRR--GPVIEHLFDWSKTDLRTTNNKISGLVEKYR 368 P + + DI G D T + + + G I L D+ + + + + + ++ Sbjct: 262 DPLMQVRIFWDIGGTGAKADATALWVAQFIGREIRVL-DYYEAQGQPLSEHVGWVFQRGY 320 Query: 369 PDAIII 374 A+++ Sbjct: 321 EKALMV 326 >gi|299532092|ref|ZP_07045486.1| mu-like prophage Flumu protein gp28 [Comamonas testosteroni S44] gi|298719754|gb|EFI60717.1| mu-like prophage Flumu protein gp28 [Comamonas testosteroni S44] Length = 470 Score = 38.2 bits (87), Expect = 3.1, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 68/240 (28%), Gaps = 39/240 (16%) Query: 268 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA-------LNREPCPDPYAPLIM 320 + D++ E P D F+ +I R L Sbjct: 232 DFVKNGAADAESFDQEYMCI-PADDDSKFLEYGLITACEYLGGTDWKRGLQGPFQGRLFC 290 Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYR----PDAIIIDA 376 G DI + D TV+ + + +F + K + D + ID+ Sbjct: 291 GVDIGRK-KDLTVLWVV--EQLGDVFYTRHVETMEKMRKSDQEKILWPWFAICDRVCIDS 347 Query: 377 NNTGARTCDYLEML--GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434 G D + + + V + + + K+ + Sbjct: 348 TGLGIGWTDDAQDKFGEHRIEGVSFTGQVKEALAYPLKGAMEDRKIR-------IPEDPK 400 Query: 435 LIQNLKSLKSFIVPNTG--ELAIESKRVKGAKSTD-YSD---GLMYTF-AENPPRSDMDF 487 + +L+ ++ + + G ES + D ++D L A N P + ++ Sbjct: 401 IRADLRKVQK-VTTSAGNIRFVAES-------TPDGHADRFWALALALQAGNSPAAPFEY 452 >gi|262401641|ref|ZP_06078207.1| ATP-dependent RNA helicase SrmB [Vibrio sp. RC586] gi|262352058|gb|EEZ01188.1| ATP-dependent RNA helicase SrmB [Vibrio sp. RC586] Length = 416 Score = 38.2 bits (87), Expect = 3.3, Method: Composition-based stats. Identities = 34/222 (15%), Positives = 62/222 (27%), Gaps = 22/222 (9%) Query: 62 VDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP-----GISVICLANS 116 + SA G GKT A L + P ++ L + Sbjct: 25 RPTQVQAEAIPQALDGRDVLASAPTGTGKTAAFAIPALQYLLDFPRRKPGPARILILTPT 84 Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY 176 AE ++ L+ + F + + ++D+L + I + Sbjct: 85 RELAMQV--AEQAQALAKNTRLNIFTITGGVQYQE--HADILATTQDI------VVATPG 134 Query: 177 SEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK 236 +I DEA D+ + L+ R + + L G+ Sbjct: 135 RLLEYIDAERFDCRAIEWLILDEADRMLDMGFGPTVDRLSTECRWRKQTLLFSAT-LEGR 193 Query: 237 FYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSD 278 E F L D V+ P I+++ +D Sbjct: 194 GVEGFTADLLK------DPAHVDAEPPRRERKKISQWYHRAD 229 >gi|91199577|emb|CAI77931.1| putative helicase [Streptomyces ambofaciens ATCC 23877] gi|96771624|emb|CAI78205.1| putative helicase [Streptomyces ambofaciens ATCC 23877] gi|117164172|emb|CAJ87711.1| putative helicase [Streptomyces ambofaciens ATCC 23877] gi|126347284|emb|CAJ88989.1| putative helicase [Streptomyces ambofaciens ATCC 23877] Length = 886 Score = 38.2 bits (87), Expect = 3.3, Method: Composition-based stats. Identities = 14/70 (20%), Positives = 28/70 (40%), Gaps = 4/70 (5%) Query: 52 RSWQLEFMEVVDAHC-LNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISV 110 R Q+E + + ++ ++ PE +G I + G GKT + A + G + Sbjct: 5 REHQVEQKQSIREWVGFSARSSVPPEGMRGTIVSATGSGKTIMAAASA---LECFAGGRI 61 Query: 111 ICLANSETQL 120 + + L Sbjct: 62 LVTVPTLDLL 71 >gi|302381364|ref|YP_003817187.1| hypothetical protein Bresu_0249 [Brevundimonas subvibrioides ATCC 15264] gi|302191992|gb|ADK99563.1| conserved hypothetical protein [Brevundimonas subvibrioides ATCC 15264] Length = 556 Score = 38.2 bits (87), Expect = 3.4, Method: Composition-based stats. Identities = 41/239 (17%), Positives = 75/239 (31%), Gaps = 23/239 (9%) Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASG---------TPDVINLGILGFLTERNA 220 + + +S E P+ G A DE L +L Sbjct: 235 GAVAQAFSAEDPEALRG---PQFAAAWADEFCAWPRGGRGGRGGPGATLALLRMGLRLGE 291 Query: 221 NRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVT 280 ++T+ P + G ++ +P + + + F EG+ YG Sbjct: 292 RPRLVVTTTP-KPIGALRDLRAEP-GLVQTHAATRDNADHLAAGFVEGLERLYGGTRKAA 349 Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNT--VVVLR 337 E+ G+ +Q S ++ A R + +++ D GG+ VV R Sbjct: 350 -QELEGRVVEQ-EGSLFTAEMMGRA--RGVLEGSFDRIVVAIDPTTTAGGNACGIVVAGR 405 Query: 338 RGPVIEHLFDWS--KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHV 394 G L D S + E++ A++++ N G L+ G V Sbjct: 406 VGDRAHVLADRSVAGLGPDGWARRAVRAAEEFGAVALVVEVNQGGEMVRAVLKTAGCSV 464 >gi|312881427|ref|ZP_07741222.1| DNA-dependent helicase II [Vibrio caribbenthicus ATCC BAA-2122] gi|309370909|gb|EFP98366.1| DNA-dependent helicase II [Vibrio caribbenthicus ATCC BAA-2122] Length = 724 Score = 38.2 bits (87), Expect = 3.5, Method: Composition-based stats. Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 20/206 (9%) Query: 239 EIFNKP-----LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI 293 + FN P L ++ +Q +D + D+ R +F + Sbjct: 161 DTFNDPVTQTYLQLYRAYQEACDRAGLVDFAEILLRAQELLRDNKHIRQHYQTRFKHILV 220 Query: 294 DSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDL 353 D F N I+ A R +I+ G D+ + RG +E++ K + Sbjct: 221 DEFQDTNNIQYAWLRLMAGPDTHVMIV-------GDDDQSIYGWRGAKVENIE---KFTV 270 Query: 354 RTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRR 413 + L + YR I+DA+N A + E +G ++ + + N Sbjct: 271 EFPSVNTIRLEQNYRSTKTILDASN--ALIANNTERMGKELWTDGSAGEPISVYSAYNEL 328 Query: 414 TELHV---KMADWLEFASLINHSGLI 436 E K+ +W E ++ + ++ Sbjct: 329 DEARFAVSKIKEWQEKGGVLTDTAML 354 >gi|302539315|ref|ZP_07291657.1| TtrA [Streptomyces sp. C] gi|302448210|gb|EFL20026.1| TtrA [Streptomyces sp. C] Length = 888 Score = 38.2 bits (87), Expect = 3.5, Method: Composition-based stats. Identities = 12/52 (23%), Positives = 20/52 (38%), Gaps = 3/52 (5%) Query: 69 SVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 S + P+ +G I + G GKT A + PG ++ + L Sbjct: 23 SRSPVPPQGTRGTIVSATGSGKTITAAAGA---LECFPGGRILVTVPTLDLL 71 >gi|307727814|ref|YP_003911027.1| SNF2-related protein [Burkholderia sp. CCGE1003] gi|307588339|gb|ADN61736.1| SNF2-related protein [Burkholderia sp. CCGE1003] Length = 1227 Score = 38.2 bits (87), Expect = 3.5, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 51/203 (25%), Gaps = 25/203 (12%) Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187 V W F + L +G +T + +++ G Sbjct: 739 VHNWREEARR---FAPELKVLVLNGPQRKERFEQIGEHELILTTYALLWRDQKV--LAG- 792 Query: 188 HNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD 247 H + +I DEA + + +A +T P N + Sbjct: 793 HEYH--LLILDEAQYVKNATTKAAQ-AIRGLSARHRLCLTGTPLE---------NHLGEL 840 Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 W +F G F + D R + + + F+ +E Sbjct: 841 WSQFDFLLPGFLGTQKDFTRRWRNPIEKNHDGVRRSLLARRIRP----FMLRRRKDEVAK 896 Query: 308 REPCPDPYAPLIMGCDIAEEGGD 330 P ++ D+ D Sbjct: 897 ELPAKT---TIVCSVDLEGAQRD 916 >gi|164659175|ref|XP_001730712.1| hypothetical protein MGL_2166 [Malassezia globosa CBS 7966] gi|159104609|gb|EDP43498.1| hypothetical protein MGL_2166 [Malassezia globosa CBS 7966] Length = 838 Score = 38.2 bits (87), Expect = 3.5, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 48/152 (31%), Gaps = 15/152 (9%) Query: 87 GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEMQS 145 G+GKT ++ L+ P + +A + L+ W E+ K+ L W Q Sbjct: 244 GMGKT----IQMISLLVADPKRPSLVVAPTVAILQ---WRNEMQKYAPGLRVVVWHGAQR 296 Query: 146 LSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE----RPDTFVGHHNTYGMAIINDEAS 201 DV+ S + + + R + H II DEA Sbjct: 297 SRDRDTLSTVDVVLTSYAVLESTFRRDRYGVTRNGRHVREQSL--LHAMKWRRIILDEAH 354 Query: 202 GTPDVINLGILGFLTERNANRFWIMTSNPRRL 233 + + ++ W ++ P + Sbjct: 355 HIKERTSNTARSAFA-LQSDFKWCLSGTPLQN 385 >gi|289619624|emb|CBI53907.1| unnamed protein product [Sordaria macrospora] Length = 2051 Score = 38.2 bits (87), Expect = 3.5, Method: Composition-based stats. Identities = 29/152 (19%), Positives = 50/152 (32%), Gaps = 22/152 (14%) Query: 58 FMEVVDAHCLNSVNNPNPEVFKGA--------ISAGRGIGKTTLNAWLVLWLMSTRPGIS 109 +E + A N +VF + + G GKT + W RPG Sbjct: 1170 ALEEIYAQRFQYFNPMQTQVFHTLYHTPANVLLGSPTGSGKTVACELAMWWAFRERPGSK 1229 Query: 110 VICLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI-DSK 167 V+ +A L E V W + L ++ L+ P + + I + Sbjct: 1230 VVYIAP-----MKALVRERVKDWGARLAKPLGLKLVELTGDNTPDTRTIQDADIIITTPE 1284 Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + + R++ G+ + II DE Sbjct: 1285 KWDGISRSWQT------RGYVRKVSLVII-DE 1309 >gi|183600815|ref|ZP_02962308.1| hypothetical protein PROSTU_04416 [Providencia stuartii ATCC 25827] gi|188019600|gb|EDU57640.1| hypothetical protein PROSTU_04416 [Providencia stuartii ATCC 25827] Length = 413 Score = 38.2 bits (87), Expect = 3.5, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 64/200 (32%), Gaps = 21/200 (10%) Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH-HN-------TYGMAIIND 198 ++ PW SD K+ T CR+ S F G HN + D Sbjct: 79 AIRSVPWLSDFYELG----EKYIRTKCRSVSYV----FAGLRHNLDSIKSKARILIAWVD 130 Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-PLDDWKRFQIDTRT 257 EA ++ + + E + W+ T NP + + F K P D+ +++ Sbjct: 131 EAESVSEIAWTKLAPTVREAGSE-IWV-TWNPEKDGSATDKRFRKEPPDNAIIVEMNYDD 188 Query: 258 VEGIDPSFHEGIIARYGL-DSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA 316 E ++ D + G + + + + ++ + Sbjct: 189 NPWFPSVLEEERLSDQSRLDPNTYAWIWEGAYLENSDKQVLANKYVVQSFP-DDLWKQAD 247 Query: 317 PLIMGCDIAEEGGDNTVVVL 336 L+ G D NT++ + Sbjct: 248 RLLFGADFGFAKDPNTLIRM 267 >gi|71004784|ref|XP_757058.1| hypothetical protein UM00911.1 [Ustilago maydis 521] gi|46096862|gb|EAK82095.1| hypothetical protein UM00911.1 [Ustilago maydis 521] Length = 1490 Score = 38.2 bits (87), Expect = 3.6, Method: Composition-based stats. Identities = 12/39 (30%), Positives = 19/39 (48%), Gaps = 3/39 (7%) Query: 87 GIGKTTLNAWLVLWLMSTRPGISVICLANSE---TQLKT 122 G+GKT + A ++L P ++ LA + Q KT Sbjct: 304 GLGKTFIAAVVILNFFRWYPDGKILFLAPTRPLVDQQKT 342 >gi|319943331|ref|ZP_08017613.1| hypothetical protein HMPREF0551_0459 [Lautropia mirabilis ATCC 51599] gi|319743146|gb|EFV95551.1| hypothetical protein HMPREF0551_0459 [Lautropia mirabilis ATCC 51599] Length = 220 Score = 38.2 bits (87), Expect = 3.7, Method: Composition-based stats. Identities = 28/117 (23%), Positives = 37/117 (31%), Gaps = 14/117 (11%) Query: 52 RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVI 111 +WQ H + AI G GKTTL A L S PG V+ Sbjct: 30 TAWQASPRTAFVDHLMARAGTHAGRPAIIAIDGRSGSGKTTLTAALA----SVVPGAQVL 85 Query: 112 CLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLH--PAPWYSDVLHCSLGID 165 L +W E + +W L + +L P PW S+ I Sbjct: 86 -------HLDDLIWNEPLYQWDQQLVAALSELHTTGALDLIPHPWREHGREGSIRIT 135 >gi|49475449|ref|YP_033490.1| terminase large subunit protein [Bartonella henselae str. Houston-1] gi|49475495|ref|YP_033536.1| terminase large subunit protein [Bartonella henselae str. Houston-1] gi|49238255|emb|CAF27468.1| Terminase large subunit protein [Bartonella henselae str. Houston-1] gi|49238301|emb|CAF27516.1| Terminase large subunit protein [Bartonella henselae str. Houston-1] Length = 340 Score = 38.2 bits (87), Expect = 3.8, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 54/186 (29%), Gaps = 17/186 (9%) Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFN-KPLDD 247 + DEA + ++ L E +T NP R + F + Sbjct: 83 RILLCWVDEAEPVTETAWQTLIPTLREEGEGWRAELWVTWNPLRDNAPVERRFRFSNNEA 142 Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE--- 304 KR +I+ +E + + + G + ++ ++ Sbjct: 143 IKRVEINWSDNPKFPKILNEARLDDLKNRPETYKHIWEGAYLTAIQGAYYQKEMLAAEQE 202 Query: 305 ----ALNREPCPDPYAPLIMGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTN 357 + R+P A DI G D T + + + + D+ + + + Sbjct: 203 GRIGRVARDPLMQMRAFW----DIGGTGAKADATAIWIAQFVGREIRVLDYYEAQGQPLS 258 Query: 358 NKISGL 363 I L Sbjct: 259 EHIGWL 264 >gi|327355898|gb|EGE84755.1| activating signal cointegrator 1 complex subunit 3 [Ajellomyces dermatitidis ATCC 18188] Length = 2024 Score = 37.8 bits (86), Expect = 3.9, Method: Composition-based stats. Identities = 27/152 (17%), Positives = 49/152 (32%), Gaps = 22/152 (14%) Query: 58 FMEVVDAHCLNSVNNPNPEVFKGA--------ISAGRGIGKTTLNAWLVLWLMSTRPGIS 109 +E + A N ++F + + G GKT + W +PG Sbjct: 1131 ILEEIYAQRFQFFNPMQTQIFHTLYHTPANVLLGSPTGSGKTVAAELAMWWAFREKPGSK 1190 Query: 110 VICLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI-DSK 167 V+ +A L E V W L ++ L+ P + + I + Sbjct: 1191 VVYIAP-----MKALVRERVHDWRRRLTAPMGLKLVELTGDNTPDTRTIRDADIIITTPE 1245 Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + + R++ G+ + II DE Sbjct: 1246 KWDGISRSWQT------RGYVRQVSLVII-DE 1270 >gi|239609198|gb|EEQ86185.1| activating signal cointegrator 1 complex subunit 3 [Ajellomyces dermatitidis ER-3] Length = 2024 Score = 37.8 bits (86), Expect = 3.9, Method: Composition-based stats. Identities = 27/152 (17%), Positives = 49/152 (32%), Gaps = 22/152 (14%) Query: 58 FMEVVDAHCLNSVNNPNPEVFKGA--------ISAGRGIGKTTLNAWLVLWLMSTRPGIS 109 +E + A N ++F + + G GKT + W +PG Sbjct: 1131 ILEEIYAQRFQFFNPMQTQIFHTLYHTPANVLLGSPTGSGKTVAAELAMWWAFREKPGSK 1190 Query: 110 VICLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI-DSK 167 V+ +A L E V W L ++ L+ P + + I + Sbjct: 1191 VVYIAP-----MKALVRERVHDWRRRLTAPMGLKLVELTGDNTPDTRTIRDADIIITTPE 1245 Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + + R++ G+ + II DE Sbjct: 1246 KWDGISRSWQT------RGYVRQVSLVII-DE 1270 >gi|261189015|ref|XP_002620920.1| activating signal cointegrator 1 complex subunit 3 [Ajellomyces dermatitidis SLH14081] gi|239591924|gb|EEQ74505.1| activating signal cointegrator 1 complex subunit 3 [Ajellomyces dermatitidis SLH14081] Length = 2024 Score = 37.8 bits (86), Expect = 3.9, Method: Composition-based stats. Identities = 27/152 (17%), Positives = 49/152 (32%), Gaps = 22/152 (14%) Query: 58 FMEVVDAHCLNSVNNPNPEVFKGA--------ISAGRGIGKTTLNAWLVLWLMSTRPGIS 109 +E + A N ++F + + G GKT + W +PG Sbjct: 1131 ILEEIYAQRFQFFNPMQTQIFHTLYHTPANVLLGSPTGSGKTVAAELAMWWAFREKPGSK 1190 Query: 110 VICLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI-DSK 167 V+ +A L E V W L ++ L+ P + + I + Sbjct: 1191 VVYIAP-----MKALVRERVHDWRRRLTAPMGLKLVELTGDNTPDTRTIRDADIIITTPE 1245 Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + + R++ G+ + II DE Sbjct: 1246 KWDGISRSWQT------RGYVRQVSLVII-DE 1270 >gi|85105138|ref|XP_961895.1| activating signal cointegrator 1 complex subunit 3 [Neurospora crassa OR74A] gi|28923479|gb|EAA32659.1| activating signal cointegrator 1 complex subunit 3 [Neurospora crassa OR74A] Length = 2066 Score = 37.8 bits (86), Expect = 3.9, Method: Composition-based stats. Identities = 24/118 (20%), Positives = 41/118 (34%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W RPG V+ +A L E V W + L + Sbjct: 1199 SPTGSGKTVACELAMWWAFRERPGSKVVYIAP-----MKALVRERVKDWGARLAKPLGLK 1253 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + I + + + R++ G+ + II DE Sbjct: 1254 LVELTGDNTPDTRTIQDADIIITTPEKWDGISRSWQT------RGYVRKVSLVII-DE 1304 >gi|18640523|ref|NP_570364.1| DNA packaging protein A [Synechococcus phage P60] gi|18478753|gb|AAL73302.1| DNA packaging protein A [Synechococcus phage P60] Length = 308 Score = 37.8 bits (86), Expect = 4.0, Method: Composition-based stats. Identities = 18/91 (19%), Positives = 32/91 (35%), Gaps = 5/91 (5%) Query: 314 PYAPLIMGCDIAEEGGDNTV-VVLRRGP---VIEHLFDWSKTDLRTTNNKISGLVEKYRP 369 Y I+ D + G D TV VVL + + L + T + I L ++Y+ Sbjct: 77 DYDETIVSVDPSGRGTDETVAVVLSQANGYVFVRDLKAYRDGYSDATLSDIVRLGKRYKA 136 Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQ 400 +++++N G L Sbjct: 137 SRLLVESN-FGDGMVCELFNRHIQQMGAGFS 166 >gi|213406229|ref|XP_002173886.1| ATP-dependent 3' to 5' DNA helicase [Schizosaccharomyces japonicus yFS275] gi|212001933|gb|EEB07593.1| ATP-dependent 3' to 5' DNA helicase [Schizosaccharomyces japonicus yFS275] Length = 812 Score = 37.8 bits (86), Expect = 4.2, Method: Composition-based stats. Identities = 20/116 (17%), Positives = 42/116 (36%), Gaps = 15/116 (12%) Query: 87 GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSL 146 G+GKT + A +++ P ++ LA ++ L + A + L+ +P E+ Sbjct: 156 GLGKTFIAAVVMMNYYRWFPQSNIAFLAPTKPLLYQQMQACIH--LTGIPESSIVELNGE 213 Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI--INDEA 200 L L D + + +T + + + + + DEA Sbjct: 214 V-------KPELRKQLFRDKRVFFVTPQTLNND----IQTEVCDPRLFVCLVFDEA 258 >gi|302894383|ref|XP_003046072.1| predicted protein [Nectria haematococca mpVI 77-13-4] gi|256726999|gb|EEU40359.1| predicted protein [Nectria haematococca mpVI 77-13-4] Length = 1970 Score = 37.8 bits (86), Expect = 4.3, Method: Composition-based stats. Identities = 23/118 (19%), Positives = 40/118 (33%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W RP V+ +A L E V W + L + Sbjct: 1158 SPTGSGKTVAAELAMWWAFRERPKSKVVYIAP-----MKALVRERVKDWGARLARPLGLK 1212 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + I + + + R++ G+ + II DE Sbjct: 1213 LVELTGDNTPDTRTIQDADVIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1263 >gi|194899456|ref|XP_001979275.1| GG24642 [Drosophila erecta] gi|190650978|gb|EDV48233.1| GG24642 [Drosophila erecta] Length = 1450 Score = 37.8 bits (86), Expect = 4.6, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 60/198 (30%), Gaps = 20/198 (10%) Query: 16 FDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPR--SWQLEFMEVVDAH-CLNSVNN 72 D+ W D+ + +H E+ EG P E ++ H + N Sbjct: 1 MDVNWIDDDDDLVAALAMHEEQKTEEADGTEGHPQPELSDEACEGFDMAAGHNWIYPNNL 60 Query: 73 PNPEVFKGAISAG----------RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKT 122 P + + + G+GKT + A L+ P ++ +A + + Sbjct: 61 PLRSYQQTIVQSALFKNTLVVLPTGLGKTFIAAVLMYNFYRWYPKGKIVFMAPTRPLVSQ 120 Query: 123 TLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPD 182 ++ ++P +Q P P +++ + + + Sbjct: 121 ----QIHASQKIMPFPSADTVQLTGQLPRPKRAELWDSKRVFFATPQVVHSDMLTADGGS 176 Query: 183 TFVGHHNTYGMAIINDEA 200 F I+ DEA Sbjct: 177 NFP---FGSIKLIVVDEA 191 >gi|296083594|emb|CBI23583.3| unnamed protein product [Vitis vinifera] Length = 1287 Score = 37.8 bits (86), Expect = 4.6, Method: Composition-based stats. Identities = 28/148 (18%), Positives = 49/148 (33%), Gaps = 10/148 (6%) Query: 87 GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSL 146 G+GKT + L+L RPG +S L A +S+W L E S+ Sbjct: 634 GLGKTVMTIALIL----ARPGRR-----SSGGTLIVCPMALLSQWKDELETHSKPESISI 684 Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDV 206 +H ++ D + T + + + H ++ DEA Sbjct: 685 FIHYGGDRTNDPKVISEHDVVLTTYGVLTSAYKNDENSSIFHRVEWYRVVLDEAHTIKSS 744 Query: 207 INLGILGFLTERNANRFWIMTSNPRRLS 234 L ++ W +T P + + Sbjct: 745 KTLSAQAAFALP-SHCRWCLTGTPLQNN 771 >gi|242815191|ref|XP_002486521.1| DEAD/DEAH box helicase, putative [Talaromyces stipitatus ATCC 10500] gi|218714860|gb|EED14283.1| DEAD/DEAH box helicase, putative [Talaromyces stipitatus ATCC 10500] Length = 2030 Score = 37.8 bits (86), Expect = 4.8, Method: Composition-based stats. Identities = 24/118 (20%), Positives = 40/118 (33%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W RPG V+ +A L E V W L + Sbjct: 1164 SPTGSGKTVACELAMWWAFRERPGSKVVYIAP-----MKALVRERVQDWRKRLTAAMGLK 1218 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + I + + + R++ G+ + II DE Sbjct: 1219 LVELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1269 >gi|238502669|ref|XP_002382568.1| DEAD/DEAH box helicase, putative [Aspergillus flavus NRRL3357] gi|220691378|gb|EED47726.1| DEAD/DEAH box helicase, putative [Aspergillus flavus NRRL3357] Length = 1997 Score = 37.8 bits (86), Expect = 5.0, Method: Composition-based stats. Identities = 21/118 (17%), Positives = 37/118 (31%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W +PG V+ +A L E V W L + Sbjct: 1163 SPTGSGKTVAAELAMWWAFREKPGSKVVYIAP-----MKALVRERVHDWKKRLTGPMGLK 1217 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + I + + + R++ +I DE Sbjct: 1218 LVELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQTRDYVR-------KVSLVIIDE 1268 >gi|169775993|ref|XP_001822463.1| helicase mug81 [Aspergillus oryzae RIB40] gi|83771198|dbj|BAE61330.1| unnamed protein product [Aspergillus oryzae] Length = 1998 Score = 37.8 bits (86), Expect = 5.0, Method: Composition-based stats. Identities = 21/118 (17%), Positives = 37/118 (31%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W +PG V+ +A L E V W L + Sbjct: 1163 SPTGSGKTVAAELAMWWAFREKPGSKVVYIAP-----MKALVRERVHDWKKRLTGPMGLK 1217 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + I + + + R++ +I DE Sbjct: 1218 LVELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQTRDYVR-------KVSLVIIDE 1268 >gi|20026680|ref|NP_612722.1| DNA packaging terminase subunit 1 [Panine herpesvirus 2] gi|19881108|gb|AAM00728.1|AF480884_80 DNA packaging protein UL89 [Panine herpesvirus 2] Length = 672 Score = 37.4 bits (85), Expect = 5.0, Method: Composition-based stats. Identities = 15/85 (17%), Positives = 29/85 (34%), Gaps = 3/85 (3%) Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGIL 212 + + + ID + + S ++ G ++ DEA IL Sbjct: 265 YLVENKDNVISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTIL 321 Query: 213 GFLTERNANRFWIMTSNPRRLSGKF 237 GFL + +I ++N + F Sbjct: 322 GFLAQNTTKIIFISSTNTTSDATCF 346 >gi|73695754|gb|AAZ80628.1| rhUL89 [Macacine herpesvirus 3] Length = 671 Score = 37.4 bits (85), Expect = 5.1, Method: Composition-based stats. Identities = 16/77 (20%), Positives = 27/77 (35%), Gaps = 3/77 (3%) Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220 + ID K + S ++ G ++ DEA ILGFL + Sbjct: 273 VISIDHKGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 329 Query: 221 NRFWIMTSNPRRLSGKF 237 +I ++N + F Sbjct: 330 KIIFISSTNTTSDATCF 346 >gi|222615621|gb|EEE51753.1| hypothetical protein OsJ_33185 [Oryza sativa Japonica Group] Length = 726 Score = 37.4 bits (85), Expect = 5.2, Method: Composition-based stats. Identities = 7/38 (18%), Positives = 16/38 (42%) Query: 80 GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE 117 ++ G+GKT + A ++ P ++ A + Sbjct: 264 TLVALPTGLGKTFIAAVVMYNYFRWFPEGKIVFTAPTR 301 >gi|218185362|gb|EEC67789.1| hypothetical protein OsI_35346 [Oryza sativa Indica Group] Length = 648 Score = 37.4 bits (85), Expect = 5.2, Method: Composition-based stats. Identities = 7/38 (18%), Positives = 16/38 (42%) Query: 80 GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE 117 ++ G+GKT + A ++ P ++ A + Sbjct: 186 TLVALPTGLGKTFIAAVVMYNYFRWFPEGKIVFTAPTR 223 >gi|62734194|gb|AAX96303.1| Similar to probable ATP-dependent RNA helicase - fission yeast (Schizosaccharomyces pombe) [Oryza sativa Japonica Group] gi|77548994|gb|ABA91791.1| Type III restriction enzyme, res subunit family protein, expressed [Oryza sativa Japonica Group] Length = 1488 Score = 37.4 bits (85), Expect = 5.2, Method: Composition-based stats. Identities = 7/38 (18%), Positives = 16/38 (42%) Query: 80 GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE 117 ++ G+GKT + A ++ P ++ A + Sbjct: 264 TLVALPTGLGKTFIAAVVMYNYFRWFPEGKIVFTAPTR 301 >gi|320035817|gb|EFW17757.1| DEAD/DEAH box helicase [Coccidioides posadasii str. Silveira] Length = 1970 Score = 37.4 bits (85), Expect = 5.2, Method: Composition-based stats. Identities = 23/118 (19%), Positives = 41/118 (34%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W +PG V+ +A L E V W L + Sbjct: 1155 SPTGSGKTVAAELAMWWAFREKPGSKVVYIAP-----MKALVRERVQDWRRRLAMPLGLK 1209 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + + I + + + R++ G+ + II DE Sbjct: 1210 LVELTGDNTPDTRTIRNADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1260 >gi|303321375|ref|XP_003070682.1| activating signal cointegrator 1 complex subunit, putative [Coccidioides posadasii C735 delta SOWgp] gi|240110378|gb|EER28537.1| activating signal cointegrator 1 complex subunit, putative [Coccidioides posadasii C735 delta SOWgp] Length = 1970 Score = 37.4 bits (85), Expect = 5.2, Method: Composition-based stats. Identities = 23/118 (19%), Positives = 41/118 (34%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W +PG V+ +A L E V W L + Sbjct: 1155 SPTGSGKTVAAELAMWWAFREKPGSKVVYIAP-----MKALVRERVQDWRRRLAMPLGLK 1209 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + + I + + + R++ G+ + II DE Sbjct: 1210 LVELTGDNTPDTRTIRNADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1260 >gi|119180556|ref|XP_001241737.1| hypothetical protein CIMG_08900 [Coccidioides immitis RS] Length = 1970 Score = 37.4 bits (85), Expect = 5.2, Method: Composition-based stats. Identities = 23/118 (19%), Positives = 41/118 (34%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W +PG V+ +A L E V W L + Sbjct: 1155 SPTGSGKTVAAELAMWWAFREKPGSKVVYIAP-----MKALVRERVQDWRRRLAMPLGLK 1209 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + + I + + + R++ G+ + II DE Sbjct: 1210 LVELTGDNTPDTRTIRNADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1260 >gi|171058461|ref|YP_001790810.1| exodeoxyribonuclease V subunit alpha [Leptothrix cholodnii SP-6] gi|170775906|gb|ACB34045.1| exodeoxyribonuclease V, alpha subunit [Leptothrix cholodnii SP-6] Length = 739 Score = 37.4 bits (85), Expect = 5.3, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 50/162 (30%), Gaps = 28/162 (17%) Query: 48 FSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPG 107 F P + S I+ G G GKT A L+ +M+ P Sbjct: 235 FGGPPA-------PDRFDWQRSACAIALRGRLALITGGPGTGKTYTVARLLALVMAVHPQ 287 Query: 108 I---SVICLANSET---QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCS 161 + A + +LK ++ + + + + LP + + L + +L Sbjct: 288 PQALRIALAAPTGKAAARLKQSIDSALQQLAAALPGALDWGLLQQRLSQSLTLHKLLGAR 347 Query: 162 LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203 D++ + R H ++ DEAS Sbjct: 348 P--DTRRFGRDAR-------------HPLEVDLLVVDEASMV 374 >gi|331086511|ref|ZP_08335590.1| hypothetical protein HMPREF0987_01893 [Lachnospiraceae bacterium 9_1_43BFAA] gi|330410569|gb|EGG89997.1| hypothetical protein HMPREF0987_01893 [Lachnospiraceae bacterium 9_1_43BFAA] Length = 649 Score = 37.4 bits (85), Expect = 5.3, Method: Composition-based stats. Identities = 44/297 (14%), Positives = 86/297 (28%), Gaps = 58/297 (19%) Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQ---SLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173 + L W EV +W+ + K + ++P + C Sbjct: 344 QGHLFAKSWNEVERWVEAIIEKGDPIQKERVERVIYPERFEHSFEEMMFTRKE------C 397 Query: 174 RTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRL 233 R + G + + G D+ GI +T + P Sbjct: 398 RLSIYHLDENGTGR---DQLFV------GMEDLQEKGI--TITADQYRCVYSSLYLPNED 446 Query: 234 SGKFYEIFN-KPLDDWKRFQIDTRTVEGID------------------PSFHEGIIARYG 274 Y IFN P D+K + V ++ P F E G Sbjct: 447 MNAVYSIFNDDPPADYKAHSLSVSDVVIMNQNGDMKAYFVDRFGFQELPDFVEERKKILG 506 Query: 275 LDSDVTRVEVC-------------GQFP-QQDIDSFIPLNIIEEALNREPCPDPYAPLIM 320 ++SD+ + ++ +FP ++ + L EA + P + Sbjct: 507 MESDIQKKDILEQTSCISFYAAECSEFPVLGEVHHDLTLPEALEAYEKIPAERMNGLKSV 566 Query: 321 GCDIAEEGG-----DNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI 372 G ++ E G D V + +++ + + + L K + +P + Sbjct: 567 GFNLQEGGDYDGMMDLMVAGRSQREILDSIPFYRENKLVQEALKRVEQYIEEKPLNV 623 >gi|295688413|ref|YP_003592106.1| hypothetical protein Cseg_0983 [Caulobacter segnis ATCC 21756] gi|295430316|gb|ADG09488.1| protein of unknown function DUF264 [Caulobacter segnis ATCC 21756] Length = 445 Score = 37.4 bits (85), Expect = 5.9, Method: Composition-based stats. Identities = 71/442 (16%), Positives = 127/442 (28%), Gaps = 82/442 (18%) Query: 56 LEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMS-TRPGISVICLA 114 L + + + H L + +F GRG GKT A WL+ G + + Sbjct: 53 LRTLRIREDHQLPPPDPWVTWLFL----GGRGAGKTYAGAA---WLIEQATAGARLALVG 105 Query: 115 NSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCR 174 + ++ + + + SL W + Sbjct: 106 PTFHDVREVM----------IEGPSGLKALSLPDEHPRWEASRRRLVWPN-----GATAY 150 Query: 175 TYSEERPDTFVG--HHNTYGMAIINDEASGTPDVINL-GILGFLTERNANRFWIMTSNPR 231 +S E PD+ G H DE P + +L F A+ ++T+ P Sbjct: 151 AFSAEDPDSLRGPQFHAA-----WADEFCAWPKPGDTLAMLRFGLRLGADPRLVVTTTP- 204 Query: 232 RLSGKFYEIFNKPLDDWKRFQID------TRTVEGIDPSFHEGIIARYGLDSDVTRVEVC 285 + + + + P+F + + YG + E+ Sbjct: 205 -------KPHRALKVLMAEPGVSLTRAGTSANAGNLAPAFLRTLESLYGGT-RLAAQELD 256 Query: 286 GQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDI-AEEGGDNT--VVVLRRGPVI 342 G + + P +++ D A GGD VVV RR Sbjct: 257 GVIVE-TDGGLFRAEDLARCRA--ARPARLDRVVVAVDPPATAGGDACGIVVVGRRDDRA 313 Query: 343 EHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG----YHVYR 396 L D + + DA++ +AN G L + R Sbjct: 314 FVLADETARGLSPAGWAARAVAAARAWSADALVAEANQGGDMVRSVLAQADPPCRVKLVR 373 Query: 397 VLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIE 456 KRA R E +A E +++ + + + +G+L Sbjct: 374 ASVGKRA---------RAE---PVAALYEQGRVLHCGSFVALE---EELMALGSGDLE-- 416 Query: 457 SKRVKGAKSTDYSDGLMYTFAE 478 S D +D L++ +E Sbjct: 417 -------HSPDRADALVWAVSE 431 >gi|94694803|gb|ABF47048.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|94694815|gb|ABF47054.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|94694825|gb|ABF47059.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|94694853|gb|ABF47073.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|94694857|gb|ABF47075.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|222354519|gb|ACM48067.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] gi|239909445|gb|ACS32392.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] Length = 674 Score = 37.4 bits (85), Expect = 6.0, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%) Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220 + ID + + S ++ G ++ DEA ILGFL + Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331 Query: 221 NRFWIMTSNPRRLSGKF 237 +I ++N + F Sbjct: 332 KIIFISSTNTTSDATCF 348 >gi|94694837|gb|ABF47065.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] Length = 674 Score = 37.4 bits (85), Expect = 6.0, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%) Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220 + ID + + S ++ G ++ DEA ILGFL + Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331 Query: 221 NRFWIMTSNPRRLSGKF 237 +I ++N + F Sbjct: 332 KIIFISSTNTTSDATCF 348 >gi|94694821|gb|ABF47057.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|94694847|gb|ABF47070.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|219879683|gb|ACL51158.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] Length = 674 Score = 37.4 bits (85), Expect = 6.0, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%) Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220 + ID + + S ++ G ++ DEA ILGFL + Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331 Query: 221 NRFWIMTSNPRRLSGKF 237 +I ++N + F Sbjct: 332 KIIFISSTNTTSDATCF 348 >gi|94694819|gb|ABF47056.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|94694829|gb|ABF47061.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] Length = 674 Score = 37.4 bits (85), Expect = 6.0, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%) Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220 + ID + + S ++ G ++ DEA ILGFL + Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331 Query: 221 NRFWIMTSNPRRLSGKF 237 +I ++N + F Sbjct: 332 KIIFISSTNTTSDATCF 348 >gi|94694807|gb|ABF47050.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|94694809|gb|ABF47051.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|94694823|gb|ABF47058.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|94694839|gb|ABF47066.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|242345696|gb|ACS92014.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] gi|256557083|gb|ACU83739.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] Length = 674 Score = 37.4 bits (85), Expect = 6.0, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%) Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220 + ID + + S ++ G ++ DEA ILGFL + Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331 Query: 221 NRFWIMTSNPRRLSGKF 237 +I ++N + F Sbjct: 332 KIIFISSTNTTSDATCF 348 >gi|94694817|gb|ABF47055.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] Length = 674 Score = 37.4 bits (85), Expect = 6.0, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%) Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220 + ID + + S ++ G ++ DEA ILGFL + Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331 Query: 221 NRFWIMTSNPRRLSGKF 237 +I ++N + F Sbjct: 332 KIIFISSTNTTSDATCF 348 >gi|94694827|gb|ABF47060.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] Length = 674 Score = 37.4 bits (85), Expect = 6.0, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%) Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220 + ID + + S ++ G ++ DEA ILGFL + Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331 Query: 221 NRFWIMTSNPRRLSGKF 237 +I ++N + F Sbjct: 332 KIIFISSTNTTSDATCF 348 >gi|94694841|gb|ABF47067.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] Length = 674 Score = 37.4 bits (85), Expect = 6.0, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%) Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220 + ID + + S ++ G ++ DEA ILGFL + Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331 Query: 221 NRFWIMTSNPRRLSGKF 237 +I ++N + F Sbjct: 332 KIIFISSTNTTSDATCF 348 >gi|94694813|gb|ABF47053.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|254770949|gb|ACT81760.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] Length = 674 Score = 37.4 bits (85), Expect = 6.0, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%) Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220 + ID + + S ++ G ++ DEA ILGFL + Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331 Query: 221 NRFWIMTSNPRRLSGKF 237 +I ++N + F Sbjct: 332 KIIFISSTNTTSDATCF 348 >gi|94694843|gb|ABF47068.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] Length = 674 Score = 37.4 bits (85), Expect = 6.0, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%) Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220 + ID + + S ++ G ++ DEA ILGFL + Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331 Query: 221 NRFWIMTSNPRRLSGKF 237 +I ++N + F Sbjct: 332 KIIFISSTNTTSDATCF 348 >gi|52139262|ref|YP_081537.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] gi|39842097|gb|AAR31641.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] gi|44903295|gb|AAS48974.1| UL89 [Human herpesvirus 5] gi|94694805|gb|ABF47049.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|94694811|gb|ABF47052.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|94694845|gb|ABF47069.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|94694849|gb|ABF47071.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|94694851|gb|ABF47072.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|94694855|gb|ABF47074.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] gi|157780097|gb|ABV71611.1| UL89 [Human herpesvirus 5] gi|242345862|gb|ACS92179.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] gi|242554146|gb|ACS93421.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] gi|254771115|gb|ACT81925.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] gi|270311455|gb|ACZ72832.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] gi|270355676|gb|ACZ79836.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] gi|270355841|gb|ACZ80000.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] gi|270356007|gb|ACZ80165.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] gi|270356173|gb|ACZ80330.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] gi|290564434|gb|ADD39135.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] gi|294488421|gb|ADE88081.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] gi|317160580|gb|ADV04406.1| DNA packaging terminase subunit 1 [Human herpesvirus 5] Length = 674 Score = 37.4 bits (85), Expect = 6.0, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%) Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220 + ID + + S ++ G ++ DEA ILGFL + Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331 Query: 221 NRFWIMTSNPRRLSGKF 237 +I ++N + F Sbjct: 332 KIIFISSTNTTSDATCF 348 >gi|262043664|ref|ZP_06016773.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039002|gb|EEW40164.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 464 Score = 37.4 bits (85), Expect = 6.4, Method: Composition-based stats. Identities = 31/252 (12%), Positives = 72/252 (28%), Gaps = 39/252 (15%) Query: 115 NSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCR 174 Q++ +W V+ + ++ +L + Sbjct: 65 PQANQVRKAIWKAVN------------PRTGRLRIDEAFPHELRRKTLDNEMMIEFINGS 112 Query: 175 TYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLS 234 T+ D + + + I+ E + + + L + F++ S PR + Sbjct: 113 TWQAVGSDNYGALIGSGHVGIVFSEWALSNPSAWAFLRPILADNGGWAFFV--STPRGKN 170 Query: 235 GKFYEIFN---KPLDDWKRFQIDTRTVEGIDPS----FHEGIIARYGLDS--DVTRVEVC 285 FY++F K D+W + I P + A G + + E Sbjct: 171 -HFYKMFQGGLKDPDNWFCDHLSADITLHIPPETLAQELREMQAERGEEEGQALFNQEYM 229 Query: 286 GQFPQQDIDSFIPLNIIEEALNREPCPDPYAP-------LIMGCDIAEEGGDNTVVVLRR 338 + ++ ++ + P+ P +G GD T + + Sbjct: 230 CDWNAAIPGAYYSSILVGLEKGGQIGNVPWDPQYEVYTSWDLGI------GDATAIWFYQ 283 Query: 339 --GPVIEHLFDW 348 G + + + Sbjct: 284 FIGKEVRVIDYY 295 >gi|321265233|ref|XP_003197333.1| member of the DEAH family of helicases; Mph1p [Cryptococcus gattii WM276] gi|317463812|gb|ADV25546.1| Member of the DEAH family of helicases, putative; Mph1p [Cryptococcus gattii WM276] Length = 1517 Score = 37.0 bits (84), Expect = 6.6, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 55/156 (35%), Gaps = 12/156 (7%) Query: 80 GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKH 139 ++ G+GKT + ++L P ++ LA + + + E + +P++ Sbjct: 299 TLVALPTGLGKTFVAGVVMLNFYRWFPTGKIVFLAPTRPLVAQQI--EACQLSCGIPSRD 356 Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 M + L + + + +T + + V + ++ DE Sbjct: 357 AAVMTGEGG------ARKGRERLWEEKRVFYCTPQTLDNDLKNGAVDP--QDIVLVVLDE 408 Query: 200 A-SGTPDVINLGILGFLTERNAN-RFWIMTSNPRRL 233 A T + I+ +LT + R +T+ P Sbjct: 409 AHKATGNYAYTTIVAYLTAHHPYFRVLALTATPGAD 444 >gi|119716507|ref|YP_923472.1| type III restriction enzyme, res subunit [Nocardioides sp. JS614] gi|119537168|gb|ABL81785.1| type III restriction enzyme, res subunit [Nocardioides sp. JS614] Length = 558 Score = 37.0 bits (84), Expect = 6.7, Method: Composition-based stats. Identities = 20/105 (19%), Positives = 35/105 (33%), Gaps = 20/105 (19%) Query: 38 WGEKGTPLEGFSAP---------RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGI 88 W P R +QLE E + N V G+ Sbjct: 117 WAGPTLEAIYAKMPARVPSKYELRPYQLEAAERIQQDL--EDTNRALLVLAT------GL 168 Query: 89 GKTTLNAWLVLWLMSTRPGISVICLANSE---TQLKTTLWAEVSK 130 GKT + ++ + + P ++ +A+ + QL+ LW + K Sbjct: 169 GKTVVGGEVIRRHLESHPDARILVVAHMKELVEQLEKALWRHLDK 213 >gi|170086129|ref|XP_001874288.1| predicted protein [Laccaria bicolor S238N-H82] gi|164651840|gb|EDR16080.1| predicted protein [Laccaria bicolor S238N-H82] Length = 1307 Score = 37.0 bits (84), Expect = 6.9, Method: Composition-based stats. Identities = 9/47 (19%), Positives = 21/47 (44%) Query: 80 GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA 126 ++ G+GKT + ++L P V+ +A ++ + + A Sbjct: 224 TLVALPTGLGKTFIAGAVMLNFYRWFPEGKVVFVAPTKPLVAQQIMA 270 >gi|260811155|ref|XP_002600288.1| hypothetical protein BRAFLDRAFT_118278 [Branchiostoma floridae] gi|229285574|gb|EEN56300.1| hypothetical protein BRAFLDRAFT_118278 [Branchiostoma floridae] Length = 275 Score = 37.0 bits (84), Expect = 6.9, Method: Composition-based stats. Identities = 19/129 (14%), Positives = 39/129 (30%), Gaps = 18/129 (13%) Query: 188 HNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD 247 H+ + DE + G+L L E +N +I + +L P+ Sbjct: 48 HSDRQHLFVFDEMETVYPSLAEGLLSLLEEDTSNTMFIFIWSTEKL----------PMGR 97 Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307 + QI I + +++++V + +F A Sbjct: 98 YLLQQIS--------KGRSRESIREEEIQDLLSQLQVDTDSSEPSSTNFASTGKTFAATV 149 Query: 308 REPCPDPYA 316 ++P P Sbjct: 150 KKPSNIPEG 158 >gi|71018359|ref|XP_759410.1| hypothetical protein UM03263.1 [Ustilago maydis 521] gi|46098957|gb|EAK84190.1| hypothetical protein UM03263.1 [Ustilago maydis 521] Length = 1054 Score = 37.0 bits (84), Expect = 6.9, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 47/151 (31%), Gaps = 12/151 (7%) Query: 87 GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA-EVSKWLS-LLPNKHWFEMQ 144 G+GKT ++ LM + + +A + + W E+ ++ L W Sbjct: 468 GMGKT----IQMISLMLSDRKKPCLVVAPT---VAIMQWRNEIEQYTEPKLKVLMWHGAN 520 Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERP--DTFVGHHNTYGMAIINDEASG 202 +DV+ S + + + + H + II DEA Sbjct: 521 RTQDLKELKAADVVLTSYAVLESSFRKQESGFRRKNEILKERSALHAVHWRRIILDEAHN 580 Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRL 233 + G + + W ++ P + Sbjct: 581 IKERSTNTAKGAFALQG-DFRWCLSGTPLQN 610 >gi|94694835|gb|ABF47064.1| DNA cleavage and packaging protein large subunit [Human herpesvirus 5] Length = 674 Score = 37.0 bits (84), Expect = 7.0, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%) Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220 + ID + + S ++ G ++ DEA ILGFL + Sbjct: 275 VISIDHRGPKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331 Query: 221 NRFWIMTSNPRRLSGKF 237 +I ++N + F Sbjct: 332 KIIFISSTNTTSDATCF 348 >gi|49475696|ref|YP_033737.1| Phage related protein [Bartonella henselae str. Houston-1] gi|49238503|emb|CAF27734.1| Phage related protein [Bartonella henselae str. Houston-1] Length = 441 Score = 37.0 bits (84), Expect = 7.1, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 54/186 (29%), Gaps = 17/186 (9%) Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFN-KPLDD 247 + DEA + ++ L E +T NP R + F + Sbjct: 122 RILLCWVDEAEPVTETAWQTLIPTLREEGEGWRAELWVTWNPLRDNAPVERRFRFSNNEA 181 Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE--- 304 KR +I+ +E + + + G + ++ ++ Sbjct: 182 IKRVEINWSDNPKFPKILNEARLDDLKNRPETYKHIWEGAYLTAVQGAYYQKEMLAAEQE 241 Query: 305 ----ALNREPCPDPYAPLIMGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTN 357 + R+P A DI G D T + + + + D+ + + + Sbjct: 242 GRIGRVARDPLMQMRAFW----DIGGTGAKADATAIWIAQFVGREIRVLDYYEAQGQPLS 297 Query: 358 NKISGL 363 I L Sbjct: 298 EHIGWL 303 >gi|29826538|ref|NP_828844.1| putative helicase [Streptomyces avermitilis MA-4680] gi|29611336|dbj|BAC75379.1| putative helicase [Streptomyces avermitilis MA-4680] Length = 885 Score = 37.0 bits (84), Expect = 7.6, Method: Composition-based stats. Identities = 10/50 (20%), Positives = 19/50 (38%), Gaps = 3/50 (6%) Query: 71 NNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 ++ P+ +G I + G GKT A + G ++ + L Sbjct: 27 SSVPPQGARGTIVSATGSGKTFTAAACA---LECFSGGRILVTVPTLDLL 73 >gi|312880761|ref|ZP_07740561.1| UvrD/REP helicase [Aminomonas paucivorans DSM 12260] gi|310784052|gb|EFQ24450.1| UvrD/REP helicase [Aminomonas paucivorans DSM 12260] Length = 1200 Score = 37.0 bits (84), Expect = 7.8, Method: Composition-based stats. Identities = 29/231 (12%), Positives = 60/231 (25%), Gaps = 33/231 (14%) Query: 53 SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112 W + + V +P V + AG G GKT + WL+++ P V Sbjct: 11 PWMERLLGDLRPEQRQGVISPRSLVV---VQAGAGTGKTHTLSSRFAWLLASDPTCRV-- 65 Query: 113 LANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYST 171 Q+ T + E ++ + + L P L + + Y + Sbjct: 66 -----EQILTLTFTEKAAREMRDRIRCRLLQW----LEAEPEKLGHLRDAAARIDEGYIS 116 Query: 172 MCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDV-----INLGILGFLTERNANRFWIM 226 ++ G+ + D S + + G + F + Sbjct: 117 TLHAFALRVI-------RESGLVLDLDPESRIASPCGEGALFEEMEGAFDRLDPAWFLRL 169 Query: 227 TSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDS 277 +P + + D ++ +G Sbjct: 170 LEDP------WRDRCQDLFGDPAFPRLVNALSPRRLAELVREAAELHGSRD 214 >gi|321472411|gb|EFX83381.1| hypothetical protein DAPPUDRAFT_48010 [Daphnia pulex] Length = 657 Score = 37.0 bits (84), Expect = 7.9, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 52/168 (30%), Gaps = 29/168 (17%) Query: 36 FPWGEKGTPLEGFSAP-RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLN 94 F G T + P R +Q + +E H ++ G+GKT + Sbjct: 93 FDMGAGDTWFYPTNKPVRKYQRDIVETCLFH-------------NTLVTLPTGLGKTFIA 139 Query: 95 AWLVLWLMSTRPGISVICLANSETQLKTTLWA--EVSKWLSLLPNKHWFEMQSLSLHPAP 152 A ++ P +I +A ++ + + A E+ L L S + Sbjct: 140 AVVMYNFFRWYPRGKIIFMAPTKPLVAQQIQACYEIMG-LPLDSTSEMTGAMSPADRKTQ 198 Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200 W + + + + + + + ++ DEA Sbjct: 199 WREKRV----------FFLTPQILTND--ISRAAFPASEIKCLVLDEA 234 >gi|319408093|emb|CBI81746.1| phage related protein [Bartonella schoenbuchensis R1] gi|319408856|emb|CBI82513.1| phage related protein [Bartonella schoenbuchensis R1] Length = 444 Score = 37.0 bits (84), Expect = 8.2, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 52/182 (28%), Gaps = 9/182 (4%) Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFNKPLDD- 247 + DEA + ++ L E +T NP R + F D Sbjct: 125 RILLCWVDEAEPVTETAWQTLIPTLREEGEGWRAELWVTWNPLRENAPVERRFRFTKDQN 184 Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII---EE 304 K +++ + + G + + ++ ++ +E Sbjct: 185 IKGVEVNWSDNPLFPQKLQRVRLDDLQNRPESYNHIWEGDYLKAVQGAYFQKEMLAAEQE 244 Query: 305 ALNREPCPDPYAPLIMGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTNNKIS 361 DP P+ DI G D T + + + + D+ + + + I Sbjct: 245 GRVGRVARDPLMPIRAFWDIGGTGAKADATAIWIAQFVGREIRVLDYYEAQGQPLSEHIG 304 Query: 362 GL 363 L Sbjct: 305 WL 306 >gi|322707444|gb|EFY99022.1| activating signal cointegrator 1 complex subunit 3 [Metarhizium anisopliae ARSEF 23] Length = 1969 Score = 37.0 bits (84), Expect = 8.3, Method: Composition-based stats. Identities = 23/118 (19%), Positives = 39/118 (33%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W RP V+ +A L E V W L + Sbjct: 1159 SPTGSGKTVAAELAMWWAFRERPKSKVVYIAP-----MKALVRERVKDWGKRLAQPLGLK 1213 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + I + + + R++ G+ + II DE Sbjct: 1214 IVELTGDNTPDTRTIKDADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1264 >gi|322695748|gb|EFY87551.1| activating signal cointegrator 1 complex subunit 3 [Metarhizium acridum CQMa 102] Length = 1950 Score = 37.0 bits (84), Expect = 8.3, Method: Composition-based stats. Identities = 23/118 (19%), Positives = 39/118 (33%), Gaps = 14/118 (11%) Query: 84 AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142 + G GKT + W RP V+ +A L E V W L + Sbjct: 1140 SPTGSGKTVAAELAMWWAFRERPKSKVVYIAP-----MKALVRERVKDWGKRLAQPLGLK 1194 Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199 + L+ P + + I + + + R++ G+ + II DE Sbjct: 1195 IVELTGDNTPDTRTIKDADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1245 >gi|224586602|ref|YP_002640499.1| phage terminase, large subunit, pbsx family [Borrelia valaisiana VS116] gi|224497136|gb|ACN52769.1| phage terminase, large subunit, pbsx family [Borrelia valaisiana VS116] Length = 450 Score = 36.6 bits (83), Expect = 9.0, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%) Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235 Y ++ F + I +EA+ +L L R I +NP Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDNPEH 205 Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294 F + + + + T + F E Y D + V G++ Sbjct: 206 YFKTDYIDNIHTFTTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASIDS 264 Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336 F +NI ++ + P I D A GGDNT + + Sbjct: 265 IFTQINITQDYVFSSP--------IAYLDPAFSVGGDNTALCV 299 >gi|71065561|ref|YP_264288.1| PBSX family phage terminase large subunit [Psychrobacter arcticus 273-4] gi|71038546|gb|AAZ18854.1| phage terminase, large subunit, PBSX family [Psychrobacter arcticus 273-4] Length = 421 Score = 36.6 bits (83), Expect = 9.0, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 55/195 (28%), Gaps = 11/195 (5%) Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS--EERPDTFVGHHNTY 191 N+ E ++ W +D + ++ D+ G + Sbjct: 72 NSLNESSLEEIKQAIKSVSWLNDYYEIGEKYIRTKNRRVAYAFTGLRHNLDSIKGK--SR 129 Query: 192 GMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF 251 + DEA + +L + E ++ WI T NP + F + ++ Sbjct: 130 ILLAWVDEAENVSEAAWRKLLPTVREDDSE-VWI-TWNPENKGSATDKRFRQVEHEFI-V 186 Query: 252 QIDTRTVEGIDPSFH-EGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310 +++ E + + LD R G + + + R P Sbjct: 187 EMNHNDNPFFPDVLEQERLNDQENLDDATYRWIWEGAYLEASDAQIFNGKFVVREFERHP 246 Query: 311 CPDPYAPLIMGCDIA 325 + P G D Sbjct: 247 TWN--GPYN-GLDFG 258 >gi|308476267|ref|XP_003100350.1| hypothetical protein CRE_22485 [Caenorhabditis remanei] gi|308265092|gb|EFP09045.1| hypothetical protein CRE_22485 [Caenorhabditis remanei] Length = 1870 Score = 36.6 bits (83), Expect = 9.3, Method: Composition-based stats. Identities = 27/144 (18%), Positives = 53/144 (36%), Gaps = 15/144 (10%) Query: 57 EFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANS 116 ++ + A S+ + + A G GKT + L+ PG+ V+ +A Sbjct: 1031 DYFNPIQAQVFYSLYKTDKSAL---VGAPTGSGKTLCAELAMFRLLQDHPGMKVVYIAP- 1086 Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRT 175 LK+ + V W N + + +S P ++ S+ I + + + R+ Sbjct: 1087 ---LKSLVRERVDDWKQKFENGMGYRVVEVSGDVTPDPQELQASSILITTPEKWDGISRS 1143 Query: 176 YSEERPDTFVGHHNTYGMAIINDE 199 ++ VG I+ DE Sbjct: 1144 WATREYVRRVG-------LIVLDE 1160 >gi|312116003|ref|YP_004013599.1| hypothetical protein Rvan_3315 [Rhodomicrobium vannielii ATCC 17100] gi|311221132|gb|ADP72500.1| hypothetical protein Rvan_3315 [Rhodomicrobium vannielii ATCC 17100] Length = 466 Score = 36.6 bits (83), Expect = 9.4, Method: Composition-based stats. Identities = 72/420 (17%), Positives = 129/420 (30%), Gaps = 57/420 (13%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141 I AGRG GKT A W+ + G A + + L AE + + + Sbjct: 61 ILAGRGAGKTRTGA---EWVRACVCGP-TPLSAGRYS--RFALVAETAADARDVIVEGPS 114 Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201 + L++HP + S + + Y+ PD G + A DE + Sbjct: 115 GL--LAIHPRGFRP-KFEPSKRRLTWPNGAVAMLYNATEPDQLRGPQHD---AAWCDELA 168 Query: 202 G--TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE 259 + L + R I+T+ P R E K + T Sbjct: 169 KWRYARETWDMLQFGLRLGHDPRQ-IVTTTP-RPIAIIREFLGKEGHGVVLTRGSTYDNR 226 Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE-ALNREPCPDPYAPL 318 + I R + + R E+ + + +++++ + R + + Sbjct: 227 ANLAQNYFNTIVRSYEGTRLGRQEINAELLDDVAGALWTRSLLDQHRIARGTPLPRFDRV 286 Query: 319 IMGCDIAE---EGGDNT------VVVLRRGPVIEHLFDW-SKTDLRTTNNKISGLVEKYR 368 ++G D A GD T V L L D ++ K + Y Sbjct: 287 VVGIDPAARPSGAGDKTSETGIVVCGLGEDGRGYVLDDLSNRQGPMGWAQKAVAGFDLYE 346 Query: 369 PDAIIIDANNTGARTCDYLEML--GYHVYRVLGQKRAVDLEFCRNRRTE----LHVKMAD 422 DA++++ N GA L + G + V + R E L+ + Sbjct: 347 ADALVVEINQGGAMVETVLRAVRGGLPIRAVRATRGKT-------VRAEPIAALYAQGR- 398 Query: 423 WLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPR 482 + + L + F + G + D D L++ A+ PR Sbjct: 399 -VSHVGALP--TLEDQMVQFTPFGIEGDG-------------AADRVDALVWALADLFPR 442 >gi|302756859|ref|XP_002961853.1| hypothetical protein SELMODRAFT_140315 [Selaginella moellendorffii] gi|300170512|gb|EFJ37113.1| hypothetical protein SELMODRAFT_140315 [Selaginella moellendorffii] Length = 1015 Score = 36.6 bits (83), Expect = 9.6, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 64/211 (30%), Gaps = 24/211 (11%) Query: 80 GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKH 139 A++A RG GK+ + ++ A S LKT L+ V K L K Sbjct: 279 VALTASRGRGKSAALGLAIA-GAVAFGYSNIFVTAPSPENLKT-LFEFVCKGFDALEYKE 336 Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT--YGMAIIN 197 + + + V+ ++ + +T +P H +I Sbjct: 337 HIDYDLVQSTNPAFNKAVVRVNIFRQHR------QTIQYIQPQD----HAKLAQAELLII 386 Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRT 257 DEA+ P + +LG M S G + K + + Q + + Sbjct: 387 DEAAAIPLPMVKALLG-------PYLVFMCSTVNGYEGTGRSLSLKLIQQLRS-QGKSES 438 Query: 258 VEGIDPSFHEGIIARYGLDSDV--TRVEVCG 286 + RYG + E+ Sbjct: 439 APSVFREVELAEPIRYGAGDPIEGWLHELLC 469 >gi|302798078|ref|XP_002980799.1| hypothetical protein SELMODRAFT_113365 [Selaginella moellendorffii] gi|300151338|gb|EFJ17984.1| hypothetical protein SELMODRAFT_113365 [Selaginella moellendorffii] Length = 1015 Score = 36.6 bits (83), Expect = 9.6, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 64/211 (30%), Gaps = 24/211 (11%) Query: 80 GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKH 139 A++A RG GK+ + ++ A S LKT L+ V K L K Sbjct: 279 VALTASRGRGKSAALGLAIA-GAVAFGYSNIFVTAPSPENLKT-LFEFVCKGFDALEYKE 336 Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT--YGMAIIN 197 + + + V+ ++ + +T +P H +I Sbjct: 337 HIDYDLVQSTNPAFNKAVVRVNIFRQHR------QTIQYIQPQD----HAKLAQAELLII 386 Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRT 257 DEA+ P + +LG M S G + K + + Q + + Sbjct: 387 DEAAAIPLPMVKALLG-------PYLVFMCSTVNGYEGTGRSLSLKLIQQLRS-QGKSES 438 Query: 258 VEGIDPSFHEGIIARYGLDSDV--TRVEVCG 286 + RYG + E+ Sbjct: 439 APSVFREVELAEPIRYGAGDPIEGWLHELLC 469 >gi|281204972|gb|EFA79166.1| hypothetical protein PPL_07991 [Polysphondylium pallidum PN500] Length = 1587 Score = 36.6 bits (83), Expect = 9.6, Method: Composition-based stats. Identities = 42/197 (21%), Positives = 63/197 (31%), Gaps = 21/197 (10%) Query: 82 ISAGRGIGKTTLNAWLVLWLMSTRPGI-----SVICLANSETQLKTTLW--AEVSKWLSL 134 + G GKT A +VL +M T + A++ T + L AE+ K Sbjct: 1071 VVGPPGTGKTHFLALMVLIIMETLIRAEKKSYIIAITAHTHTAIDNLLVRIAELKKEYES 1130 Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194 S H + D KH + S +G M Sbjct: 1131 FAGNALNFQIVKKESSKLSESLTSHNIVKYDKKHKFNLMCIGSTCWGLNTLGL--DLDML 1188 Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQ 252 II DEAS P + LG ++ +P++L F K L Sbjct: 1189 II-DEASQLPSPL--AALGLNAVNLEKSRVVVVGDPKQLGPVLKASFIVRKDLSV----- 1240 Query: 253 IDTRTVEGIDPSFHEGI 269 + +E +P FH+ I Sbjct: 1241 --SDKLEHQEPKFHKSI 1255 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.308 0.130 0.353 Lambda K H 0.267 0.0399 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 9,114,867,950 Number of Sequences: 14124377 Number of extensions: 360134319 Number of successful extensions: 985041 Number of sequences better than 10.0: 1255 Number of HSP's better than 10.0 without gapping: 457 Number of HSP's successfully gapped in prelim test: 935 Number of HSP's that attempted gapping in prelim test: 982625 Number of HSP's gapped (non-prelim): 1549 length of query: 511 length of database: 4,842,793,630 effective HSP length: 144 effective length of query: 367 effective length of database: 2,808,883,342 effective search space: 1030860186514 effective search space used: 1030860186514 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 83 (36.6 bits)