BLASTP 2.2.22 [Sep-27-2009]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.


Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= gi|254781215|ref|YP_003065628.1| putative phage terminase,
large subunit [Candidatus Liberibacter asiaticus str. psy62]
         (511 letters)

Database: nr 
           14,124,377 sequences; 4,842,793,630 total letters

Searching..................................................done


Results from round 1


>gi|254781215|ref|YP_003065628.1| putative phage terminase, large subunit [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|254040892|gb|ACT57688.1| putative phage terminase, large subunit [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|317120680|gb|ADV02503.1| putative phage terminase large subunit [Liberibacter phage SC1]
 gi|317120824|gb|ADV02645.1| putative phage terminase large subunit [Candidatus Liberibacter
           asiaticus]
          Length = 511

 Score = 1066 bits (2757), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 511/511 (100%), Positives = 511/511 (100%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60
           MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME
Sbjct: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60

Query: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
           VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL
Sbjct: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120

Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180
           KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER
Sbjct: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180

Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240
           PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI
Sbjct: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240

Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
           FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN
Sbjct: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300

Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360
           IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI
Sbjct: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420
           SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM
Sbjct: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420

Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480
           ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP
Sbjct: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480

Query: 481 PRSDMDFGRCPSYQYEGVDLLIERRFEYDSR 511
           PRSDMDFGRCPSYQYEGVDLLIERRFEYDSR
Sbjct: 481 PRSDMDFGRCPSYQYEGVDLLIERRFEYDSR 511


>gi|315121940|ref|YP_004062429.1| putative phage terminase, large subunit [Candidatus Liberibacter
           solanacearum CLso-ZC1]
 gi|313495342|gb|ADR51941.1| putative phage terminase, large subunit [Candidatus Liberibacter
           solanacearum CLso-ZC1]
          Length = 509

 Score =  796 bits (2056), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/508 (74%), Positives = 428/508 (84%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60
           M+RELPT  E EQ+L +LM+SD+IKLSF+NFVL  FPW E  T L  FS PR WQL+FME
Sbjct: 1   MTRELPTKIEHEQELMELMFSDDIKLSFTNFVLRLFPWSEANTSLANFSRPRRWQLDFME 60

Query: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
            VD  CL +V+NP+P++FKGA+SAGRGIGKTTLNAW++LWL+STRPG+S++CLANSETQL
Sbjct: 61  AVDTDCLFNVDNPDPKIFKGAVSAGRGIGKTTLNAWMMLWLISTRPGMSILCLANSETQL 120

Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180
           K+TLWAEVSKWLS+LPNKHWFEMQSLSLHPA WY++ L  + GIDSKHY+  CRTYSEER
Sbjct: 121 KSTLWAEVSKWLSMLPNKHWFEMQSLSLHPAVWYAEALEKNFGIDSKHYTITCRTYSEER 180

Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240
           PDTFVGHHNTYGMAI NDEASGTPDVIN  ILGF TE NANRFW+MTSNPRRL G FY+I
Sbjct: 181 PDTFVGHHNTYGMAIFNDEASGTPDVINTSILGFFTENNANRFWVMTSNPRRLKGWFYDI 240

Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
           FN PL+DW+RFQIDTRTVEGIDPSFHEGII+RYGLDSDVTRVEV GQFPQQDI+SFIP  
Sbjct: 241 FNVPLEDWQRFQIDTRTVEGIDPSFHEGIISRYGLDSDVTRVEVLGQFPQQDINSFIPFY 300

Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360
            IEEALNREP  DPYAPLIMGCDIA EGGDNTVVVLRRG  IEH+FDWS   +  ++ KI
Sbjct: 301 RIEEALNREPIKDPYAPLIMGCDIAGEGGDNTVVVLRRGTNIEHIFDWSGLAVNASSRKI 360

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420
             L+ KY+PDA+++DAN  G +T  YL   GY V+   GQ RA D E  RNRRTELHVKM
Sbjct: 361 EELINKYKPDAVVVDANGIGVQTYYYLADEGYSVHAEKGQNRADDHESYRNRRTELHVKM 420

Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480
           A+WLE AS+ NHSGLIQNLKSL+SFI PNTG+LA+ESKRVKGA STDYSD L YTFA +P
Sbjct: 421 AEWLELASIPNHSGLIQNLKSLESFIEPNTGKLALESKRVKGAVSTDYSDALAYTFAVSP 480

Query: 481 PRSDMDFGRCPSYQYEGVDLLIERRFEY 508
            RSDM+FGRC SYQYE  +LL++RRF Y
Sbjct: 481 ARSDMNFGRCRSYQYEADELLVDRRFSY 508


>gi|315122902|ref|YP_004063391.1| putative phage terminase, large subunit [Candidatus Liberibacter
           solanacearum CLso-ZC1]
 gi|313496304|gb|ADR52903.1| putative phage terminase, large subunit [Candidatus Liberibacter
           solanacearum CLso-ZC1]
          Length = 509

 Score =  790 bits (2041), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/508 (73%), Positives = 428/508 (84%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60
           M+RELPT  E EQ+L +LM+SD+IKLSF+NFVL  FPW E  T L  FS PR WQL+FME
Sbjct: 1   MTRELPTKIEHEQELMELMFSDDIKLSFTNFVLRLFPWSEANTSLANFSRPRRWQLDFME 60

Query: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
            VD  CL +V+NP+P++FKGA+SAGRGIGKTTLNAW++LWL+STRPG+S++CLANSETQL
Sbjct: 61  AVDTDCLFNVDNPDPKIFKGAVSAGRGIGKTTLNAWMMLWLISTRPGMSILCLANSETQL 120

Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180
           K+TLWAEVSKWLS+LPNKHWFEMQSLSLHPA WY++ L  + GIDSKHY+  CRTYSEER
Sbjct: 121 KSTLWAEVSKWLSMLPNKHWFEMQSLSLHPAVWYAEALEKNFGIDSKHYTITCRTYSEER 180

Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240
           PDTFVGHHNTYGMAI NDEASGTPDVIN  ILGF TE NANRFW+MTSNPRRL+G FY+I
Sbjct: 181 PDTFVGHHNTYGMAIFNDEASGTPDVINTSILGFFTENNANRFWVMTSNPRRLNGWFYDI 240

Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
           FN PL+DW+RFQIDTRTVEGIDP+FHE IIARYGLDSDVTRVEV GQFPQQDI+SFIP  
Sbjct: 241 FNVPLEDWQRFQIDTRTVEGIDPNFHENIIARYGLDSDVTRVEVLGQFPQQDINSFIPFY 300

Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360
            IEEALNREP  DPYAPL+MGCDIA EGGDNTVVVLRRG  IEH+FDWS   +  ++ KI
Sbjct: 301 RIEEALNREPIKDPYAPLVMGCDIAGEGGDNTVVVLRRGTNIEHIFDWSGLAVNVSSRKI 360

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420
             L+ KY+PDA+++DAN  G +T  YL   GY V+   GQ RA D E  RNRRTELHVKM
Sbjct: 361 EELINKYKPDAVVVDANGIGVQTYYYLADEGYSVHPEKGQNRADDHESYRNRRTELHVKM 420

Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480
           A+WLE AS+ +HSGLIQNLKSL+SFI PNTG+LA+ESKRVKGA STDYSD L YTFA +P
Sbjct: 421 AEWLELASIPHHSGLIQNLKSLESFIEPNTGKLALESKRVKGAVSTDYSDALAYTFAVSP 480

Query: 481 PRSDMDFGRCPSYQYEGVDLLIERRFEY 508
            RSDM+FGRC SYQYE  +LL++RRF Y
Sbjct: 481 ARSDMNFGRCRSYQYEADELLVDRRFSY 508


>gi|317120722|gb|ADV02544.1| putative phage terminase large subunit [Liberibacter phage SC2]
 gi|317120783|gb|ADV02604.1| putative phage terminase large subunit [Candidatus Liberibacter
           asiaticus]
          Length = 516

 Score =  778 bits (2009), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 396/512 (77%), Positives = 415/512 (81%), Gaps = 19/512 (3%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60
           MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME
Sbjct: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60

Query: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
           VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL
Sbjct: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120

Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180
           KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER
Sbjct: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180

Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240
           PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI
Sbjct: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240

Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
           FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP  
Sbjct: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPQQ 300

Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360
            I EAL R   PDPYAPLIMGCDIA EG D TVVVLRRG +IE +FDWS   +  TN KI
Sbjct: 301 YIVEALERVAIPDPYAPLIMGCDIAGEGEDKTVVVLRRGNIIERIFDWSGELIEVTNRKI 360

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDY-LEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVK 419
           S L+ +Y PDAI+ID N  G     Y L M    V  +LGQ+R+ + E   N R EL+  
Sbjct: 361 SSLINRYNPDAIVIDGNGIGGTVVSYLLNMHHISVEVILGQRRSTEPEQYHNLRAELYDL 420

Query: 420 MADWLEFASLI--NHSGLIQNLKSLKSFIVPNTGELAIESKRVK----GAKSTDYSDGLM 473
           M   +     +  +   LI  LKS+KS I    G L IE KR      G +S D+ D L 
Sbjct: 421 MRSAITGGLQLPDDCPDLINELKSIKS-ISDTLGRLLIEKKRQGRSEFGVRSPDFVDALC 479

Query: 474 YTFAENPPRSDMDFGRCPSYQ------YEGVD 499
           YTFA +PPR D      P YQ      YE +D
Sbjct: 480 YTFAVDPPRKD-----NPLYQGQDISEYEALD 506


>gi|254781187|ref|YP_003065600.1| putative phage terminase, large subunit [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|254040864|gb|ACT57660.1| putative phage terminase, large subunit [Candidatus Liberibacter
           asiaticus str. psy62]
          Length = 367

 Score =  545 bits (1403), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 252/359 (70%), Positives = 299/359 (83%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60
           M R + T+ + EQ+L +++   E  LSF NFV+ FFPWG KG PLE FS P  WQLEFME
Sbjct: 1   MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60

Query: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
            VD HC ++VNN NP +FK AISAGRGIGKTTLNAW++LWL+STRPG+S+IC+ANSETQL
Sbjct: 61  AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120

Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180
           K TLWAEVSKWLS+LP++HWFEMQSLSLHP+ WY+++L  S+GIDSKHY+  CRTYSEER
Sbjct: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180

Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240
           PDTFVG HNT+GMA+ NDEASGTPD+IN  ILGF TE N NRFWIMTSN RRL+G FY+I
Sbjct: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240

Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
           FN PL+DWKR+QIDTRTVEGID  FHEGII+RYGLDSDV R+E+ GQFPQQ++++FIP N
Sbjct: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300

Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK 359
            IEEA++RE   D YAPLIMGCDIA EGGD TVVV RRG +IEH+FDWS   ++ TN +
Sbjct: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359


>gi|302120432|gb|ADK92426.1| putative phage terminase large subunit [Candidatus Liberibacter
           asiaticus]
          Length = 255

 Score =  529 bits (1362), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 250/255 (98%), Positives = 254/255 (99%)

Query: 88  IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147
           IGKTTLNAWLVLWLMS RPG+S+ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS
Sbjct: 1   IGKTTLNAWLVLWLMSIRPGMSIICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 60

Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 207
           LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI
Sbjct: 61  LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 120

Query: 208 NLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHE 267
           NLGILGFLTE+NANRFWIMTSNPRRLSGKFYEIFN+PLDDWKRFQIDTRTVEGIDPSFHE
Sbjct: 121 NLGILGFLTEQNANRFWIMTSNPRRLSGKFYEIFNRPLDDWKRFQIDTRTVEGIDPSFHE 180

Query: 268 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE 327
           GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE
Sbjct: 181 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE 240

Query: 328 GGDNTVVVLRRGPVI 342
           GGDNTVVVLRRGPVI
Sbjct: 241 GGDNTVVVLRRGPVI 255


>gi|303328395|ref|ZP_07358832.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
 gi|302861389|gb|EFL84326.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
          Length = 500

 Score =  206 bits (525), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 147/469 (31%), Positives = 213/469 (45%), Gaps = 38/469 (8%)

Query: 30  NFVLHFFPWGEKGTPLEGF-SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGI 88
            FVL  FPWG  G  L  +   P  WQ E +  +      S       V + A+S+G G+
Sbjct: 31  GFVLFAFPWG--GGALADYPDGPDVWQREILRGMGEQL--STGASAASVIREAVSSGHGV 86

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GK+ L AW++LW MST      +  AN+E QLK   WAE++KW  L    +WF+  + +L
Sbjct: 87  GKSALVAWIILWAMSTFSDTRGVVTANTENQLKGKTWAELAKWHRLCLCGYWFDCTATAL 146

Query: 149 ------HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT-YGMAIINDEAS 201
                 H   W  D++                 +SE   + F G HN    + +I DEAS
Sbjct: 147 ISTQAGHEKTWRVDMV----------------AWSERNTEAFAGLHNKGRRVLLIFDEAS 190

Query: 202 GTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGI 261
             PD I     G LT+ +    W    NP R +G+F E F +    W   ++D+RT    
Sbjct: 191 AIPDAIWEVSEGALTDADTEIIWCCFGNPTRNTGRFRECFGRYAHRWNTRRVDSRTAAMT 250

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY--APLI 319
           D +     +  YG DSD  RV V G+FP+     FI  +I+ EA  R   PD Y  AP I
Sbjct: 251 DKNQLAQWVEDYGEDSDFVRVRVRGEFPRAGDRQFISSDIVHEARGRSLKPDQYSFAPRI 310

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
           +G D+A  G D +V+  R+G        +   D  T    ++    ++  D I +D    
Sbjct: 311 LGVDVARSGSDQSVITRRQGLACLEQRKFRGLDTVTLAGIVAEECREWGADKIFVDGIGV 370

Query: 380 GARTCDYLEM---LGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL-EFASLINHSGL 435
           GA   D L     LG+ V   +    A+  E   NRR E+   M  WL E  ++ + + L
Sbjct: 371 GAGVVDALRQVYGLGHLVVDAVAGATALQPERFLNRRAEMWTAMRKWLAEGGAVPDDAEL 430

Query: 436 IQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPP 481
            + L  L+ + V  +G+L +ESK   + +G  S D +D L  TF    P
Sbjct: 431 AEQLCGLE-YAVTVSGKLKLESKDDMKARGLTSPDCADALALTFYAPVP 478


>gi|268589373|ref|ZP_06123594.1| conserved hypothetical protein [Providencia rettgeri DSM 1131]
 gi|291315400|gb|EFE55853.1| conserved hypothetical protein [Providencia rettgeri DSM 1131]
          Length = 493

 Score =  204 bits (520), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 144/462 (31%), Positives = 216/462 (46%), Gaps = 36/462 (7%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
           ++ L+ FPWGE GT LE  + PR WQ E +  +  H  N      P   + A ++G GIG
Sbjct: 24  SYALYAFPWGEAGTELENANGPRQWQAEALNEIGEHLRNPETRHQP--LQLARASGHGIG 81

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL- 148
           K+   + ++ W M T     V+  AN+E QL+T  W E++KW  L   K WF     ++ 
Sbjct: 82  KSAFISMIIKWGMDTCEDCKVVVTANTENQLRTKTWPEIAKWQRLSITKDWFTYTKTAIY 141

Query: 149 -----HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI-INDEASG 202
                H   W +D +                 +SE   + F G HN     I I DEAS 
Sbjct: 142 SNDPNHANAWRADAV----------------PWSENNTEAFAGLHNQGKRIILIFDEASN 185

Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGID 262
             D++     G LT+ N    WI   NP R +G+F E F K    WK  QID+RTVEG +
Sbjct: 186 IADLVWEVAEGALTDENTEIIWIAFGNPTRNTGRFRECFRKFKHRWKTKQIDSRTVEGTN 245

Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIM 320
               E  I  YG+D D  +V V G FP      FIP  + + A+ R        +AP+I+
Sbjct: 246 KEQIEKWIQDYGVDDDFVKVRVRGIFPSTSEKQFIPTGLTDAAMKRTVTQAEVSHAPIII 305

Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT-DLRTTNNKISGLVEKYRPDAIIID-ANN 378
           G D A  G D+ V+ LR+G   + L+  SKT D      +I+   ++Y  DA+ ID    
Sbjct: 306 GVDPAYSGDDDAVIYLRQGLHSKCLWTGSKTIDDVIMAKRIADFEDQYGADAVHIDFGYG 365

Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQN 438
           TG ++        + + +  G      +   RN+R E++  +  WL+    I+   + ++
Sbjct: 366 TGIQSVGMNWGRNWQLVQFNGASTDPQM---RNKRGEMYNNVKSWLKIGGAIDDQEVAED 422

Query: 439 LKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           L S   + V  +G++ +ESK   + +  +S    D L  TFA
Sbjct: 423 L-STPEYKVELSGKILLESKDDIKKRIGRSPGKGDALALTFA 463


>gi|167032754|ref|YP_001667985.1| putative phage terminase large subunit [Pseudomonas putida GB-1]
 gi|166859242|gb|ABY97649.1| putative phage terminase, large subunit [Pseudomonas putida GB-1]
          Length = 499

 Score =  202 bits (513), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 142/465 (30%), Positives = 222/465 (47%), Gaps = 27/465 (5%)

Query: 27  SFSN----FVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAI 82
           SFS+    +VL+ FPWGE G  L   + PR WQ E +E +    L +      EV + A+
Sbjct: 20  SFSDDPLGYVLYAFPWGEAGGELANKTGPRKWQREVLESI-GEQLRAGAKDRGEVIREAV 78

Query: 83  SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFE 142
           ++G GIGK+ L +W++ W + T      +  AN+E+QL+T  W EV+KW  L    HWF+
Sbjct: 79  ASGHGIGKSALVSWVIKWALDTEVDTRGVVTANTESQLRTKTWPEVAKWNRLSITAHWFK 138

Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT-YGMAIINDEAS 201
           +   +L      +D  H       K++      +S+   + F G HN    + +I DEAS
Sbjct: 139 LTGTALIS----TDPDH------EKNWRIDAVPWSDTNTEAFAGLHNEGKRILLIFDEAS 188

Query: 202 GTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGI 261
              D++     G LT+ +    W    NP R SG+F E F K    W+  Q+D+RTV+G 
Sbjct: 189 AIADLVWEVAEGALTDADTEIIWAAFGNPTRNSGRFRECFTKFKHRWRHRQVDSRTVDGT 248

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMG 321
           + +     IA YG DSD  R+ V G FP+      IP + + EA+ R+        L+ G
Sbjct: 249 NKTQIAKWIADYGEDSDFVRIRVRGMFPRASDLQLIPTDWVAEAMRRDGVYGLDDALVCG 308

Query: 322 CDIAEEGGDNTVVVLRRGPVIEHL--FDWSKTDLRTTN---NKISGLVEKYRPDAIIIDA 376
            DIA  G DN V+  RRG   + +       ++ R T     K+  LV ++RPDA+ +D+
Sbjct: 309 IDIARGGMDNNVIRFRRGMDAKSIKPIKIPGSETRNTTPFIAKVCTLVVEHRPDAVFVDS 368

Query: 377 NNTGARTCDYLEML--GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434
              G    D L  L  G  +  V    +A D  +  N RT +  +M + ++    I    
Sbjct: 369 TGVGGPVADQLRRLLPGVMIIDVNFASQAPDRHYA-NMRTYIWWRMREAIKLGLAIESDT 427

Query: 435 LIQNLKSLKSFIVPNTGELAIESKRVKGAK---STDYSDGLMYTF 476
            ++   +   +   ++ ++A+E K+    +   S D  D L  TF
Sbjct: 428 ELETELTSPEYDHNSSDQIALEKKKDIKKRLGISPDDGDALALTF 472


>gi|212710820|ref|ZP_03318948.1| hypothetical protein PROVALCAL_01888 [Providencia alcalifaciens DSM
           30120]
 gi|212686517|gb|EEB46045.1| hypothetical protein PROVALCAL_01888 [Providencia alcalifaciens DSM
           30120]
          Length = 493

 Score =  202 bits (513), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 142/462 (30%), Positives = 214/462 (46%), Gaps = 36/462 (7%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
           ++ L+ FPWGE GT LE  S PR WQ E +  +  H  N      P   + A ++G GIG
Sbjct: 24  SYALYAFPWGEAGTELENASGPRQWQAEALNEIGEHLRNPETRHQP--LQLARASGHGIG 81

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL- 148
           K+   + ++ W M T     V+  AN+E QL+T  W E++KW  L   K WF     ++ 
Sbjct: 82  KSAFISMIIKWGMDTCEDCKVVVTANTENQLRTKTWPEIAKWQRLSITKDWFTCTKTAIY 141

Query: 149 -----HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI-INDEASG 202
                H   W +D +                 +SE   + F G HN     I + DEAS 
Sbjct: 142 SNDPNHANAWRADAV----------------PWSENNTEAFAGLHNQGKRIILVFDEASN 185

Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGID 262
             D++     G LT+ N    WI   NP R +G+F E F K    WK  QID+RTVEG +
Sbjct: 186 IADLVWEVAEGALTDENTEIIWIAFGNPTRNTGRFRECFRKFKHRWKTKQIDSRTVEGTN 245

Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIM 320
               E  I  YG+D D  +V V G FP      FIP  + + A+ R        +AP+I+
Sbjct: 246 KEQIEKWIQDYGVDDDFVKVRVRGIFPSTSEKQFIPTGLTDAAMKRTVTQAEVSHAPIIL 305

Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT-DLRTTNNKISGLVEKYRPDAIIID-ANN 378
           G D A  G D+ V+ LR+G   + L+  SKT D      +I+   ++Y  DA+ ID    
Sbjct: 306 GVDPAYSGDDDAVIYLRQGLHSKCLWTGSKTIDDVIMAKRIADYEDQYGADAVHIDFGYG 365

Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQN 438
           TG ++        + +    G      ++   N+R E++  +  WL+    I+   +  +
Sbjct: 366 TGIQSVGMNWGRNWQLVSFNGASTDPQMQ---NKRGEMYNNVKSWLKIGGAIDDQEVADD 422

Query: 439 LKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           L S   + V  +G++ +E K   + +  +S +  D L  TFA
Sbjct: 423 L-STPEYKVQLSGKILLEKKEDIKKRIGRSPNKGDALALTFA 463


>gi|290968649|ref|ZP_06560187.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str.
           28L]
 gi|290781302|gb|EFD93892.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str.
           28L]
          Length = 487

 Score =  201 bits (511), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 143/463 (30%), Positives = 230/463 (49%), Gaps = 45/463 (9%)

Query: 31  FVLHFFPWGEKGTPLEGFSAPRSWQLEFM-EVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
           FV   F W  +   L+G   P++WQ++ + EV +   L++         + A ++G GIG
Sbjct: 22  FVYFAFDWDSE--ELKG-QNPQTWQIKTLKEVGEGLSLSTA-------LQHATASGHGIG 71

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL- 148
           K+ L AWL+LW +STRP    +  AN+ TQL+T  WAE+SKW  L   K +F + S ++ 
Sbjct: 72  KSALVAWLILWAISTRPDTRGVVTANTATQLETKTWAELSKWYHLFRGKKFFTLTSTAIF 131

Query: 149 -----HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASG 202
                H   W  D +  S+                +R ++F G HN    + +I DEAS 
Sbjct: 132 CRQEGHERTWRIDAIPWSV----------------DRTESFAGLHNQGNRLLLIFDEASA 175

Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGID 262
             + I     G LT+++    W++  NP R +G+F++ F+K    W   +ID+RTV+  +
Sbjct: 176 IDNKIWEVAEGALTDKDTEILWLVFGNPTRSTGRFFDCFHKYKKSWITQKIDSRTVDISN 235

Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP---YAPLI 319
            +  +  I  YG+DSD  +V V G+FP      FI   I+  A  R P       +AP I
Sbjct: 236 KTQLQKWIQTYGIDSDFVKVRVLGEFPDTSDTQFISTAIVRTAWERRPLRTAEYDFAPCI 295

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLR-TTNNKISGLVEKYRPDAIIIDANN 378
           +G D A  GGD+TV+ LR+G   E L ++ + D       +++   +KY  DA+ ID   
Sbjct: 296 IGMDPAWTGGDSTVIFLRQGFFSEKLAEYKQNDNDGVMAARLAEFEDKYHADAVFID-KG 354

Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINH-SGLIQ 437
            G     +   +G   +R++        +   N+R E+   M +WL+   +I    GLI+
Sbjct: 355 YGTGIYSFGVTMGRQ-WRLVSFAEKSGAQAYANKRAEMWGNMKEWLQEGGVIPQVDGLIE 413

Query: 438 NLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
            L + ++FI    GE+ +E K   + +G +S + +D L  TFA
Sbjct: 414 ELTAPQAFINAR-GEIQLEKKEDMKKRGIESPNMADALALTFA 455


>gi|323156136|gb|EFZ42295.1| terminase large subunit [Escherichia coli EPECa14]
          Length = 491

 Score =  196 bits (497), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 138/456 (30%), Positives = 217/456 (47%), Gaps = 24/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE+GT L   + PR WQ +    +  H  N      P +   A+++G GIG
Sbjct: 25  GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPATRYQPLML--ALASGHGIG 82

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++
Sbjct: 83  KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208
                +D+ H       K +      +SE   + F G HN    + ++ DEAS   D++ 
Sbjct: 143 S----NDLGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    + 
Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326
            +  YG DSD  ++ V G FP      FIP  + +EA+ R        YAP+I+G D A 
Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAYAPVIIGVDPAY 312

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384
            G D+ V+ LR+G   + L+  +K TD      +I+   ++Y+ DA+ ID    TG ++ 
Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
              +  G     V     + D +   N+R E+      WL    +++      +L S   
Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           + V   G++ IE K   + +  +S    D L+ TFA
Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464


>gi|304398406|ref|ZP_07380280.1| terminase, large subunit [Pantoea sp. aB]
 gi|304354272|gb|EFM18645.1| terminase, large subunit [Pantoea sp. aB]
          Length = 490

 Score =  193 bits (490), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 136/456 (29%), Positives = 215/456 (47%), Gaps = 23/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE+GT L     PR WQ +  + + AH  N      P +   A  +G GIG
Sbjct: 24  GYALYAFPWGEEGTDLAYSKGPRQWQEDAFKQIGAHLQNPDTRHQPLMIGRA--SGHGIG 81

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + LV W M T     V+  AN+E QL+T  W E++KW  L   + WF   + +++
Sbjct: 82  KSAFISMLVKWGMDTCEDCKVVVTANTENQLRTKTWPEIAKWQRLSITQDWFTCTATAIY 141

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI-INDEASGTPDVIN 208
                +D  H      +K +      +SE   + F G HN     I I DEAS   D++ 
Sbjct: 142 S----NDPSH------AKSWRADAIPWSENNTEAFAGLHNERKRIILIFDEASNIADLVW 191

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ N    W+   NP R +G+F E F K    WK  QID+R+VEG +    + 
Sbjct: 192 EVAEGALTDENTEIIWVAFGNPTRNTGRFRECFRKLRHRWKTAQIDSRSVEGTNKEQIQK 251

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD--PYAPLIMGCDIAE 326
            +  YG DSD  +V V G FP      FIP  + + A+ R   P    +A  ++G D A 
Sbjct: 252 WVDDYGEDSDFVKVRVRGLFPSASEAQFIPTGLTDAAVGRVITPGQVAHAATVIGVDPAH 311

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI-SGLVEKYRPDAIIID-ANNTGARTC 384
           +GGD  V+ LR+G   + L ++ +T       KI +   ++YR DA+ ID    TG ++ 
Sbjct: 312 QGGDPAVIYLRQGLHTKKLGEYQRTTDDVLFAKIVASFEDEYRADAVFIDYGYGTGLKSV 371

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
                  + + +  G   + D +   N+R E++  +  WL+    ++   + + L + + 
Sbjct: 372 GDNWGRNWQLIQFGGG--STDPQMA-NKRGEMYNAVKTWLKDGGQLDSQQVAEELSAAEY 428

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
            +      + +E K   + +  KS + +D L  TFA
Sbjct: 429 KVRLKDSRIVLEDKTSIKERLGKSPNDADALALTFA 464


>gi|320175050|gb|EFW50163.1| terminase B protein, putative [Shigella dysenteriae CDC 74-1112]
          Length = 480

 Score =  191 bits (485), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 137/456 (30%), Positives = 216/456 (47%), Gaps = 24/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE+GT L   + PR WQ +    +  H  N      P +   A ++G GIG
Sbjct: 14  GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 71

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++
Sbjct: 72  KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 131

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208
                +D  H       K +      +SE   + F G HN    + ++ DEAS   D++ 
Sbjct: 132 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 181

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    + 
Sbjct: 182 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRTVEGTNKQQLQK 241

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326
            +  YG DSD  ++ V G FP      FIP  + +EA+ R        +AP+I+G D A 
Sbjct: 242 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 301

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384
            G D+ V+ LR+G   + L+  +K TD      +I+   ++Y+ DA+ ID    TG ++ 
Sbjct: 302 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 361

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
              +  G     V     + D +   N+R E+ +    WL    +++      +L S   
Sbjct: 362 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFISCKTWLRLGGMLDDQETADDL-SAAE 417

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           + V   G++ IE K   + +  +S    D L+ TFA
Sbjct: 418 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 453


>gi|332344357|gb|AEE57691.1| terminase, large subunit [Escherichia coli UMNK88]
          Length = 491

 Score =  191 bits (485), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 137/456 (30%), Positives = 215/456 (47%), Gaps = 24/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE+GT L   + PR WQ +    +  H  N      P +   A ++G GIG
Sbjct: 25  GYALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQNPATRHQPLML--ARASGHGIG 82

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++
Sbjct: 83  KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208
                +D  H       K +      +SE   + F G HN    + ++ DEAS   D++ 
Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    + 
Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326
            +  YG DSD  ++ V G FP      FIP  + +EA+ R        +AP+I+G D A 
Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384
            G D+ V+ LR+G   + L+  +K TD      +I+   ++Y+ DA+ ID    TG ++ 
Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
              +  G     V     + D +   N+R E+      WL    +++      +L S   
Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           + V   G++ IE K   + +  +S    D L+ TFA
Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464


>gi|327252187|gb|EGE63859.1| terminase large subunit [Escherichia coli STEC_7v]
          Length = 491

 Score =  191 bits (485), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 137/456 (30%), Positives = 215/456 (47%), Gaps = 24/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE+GT L   + PR WQ +    +  H  N      P +   A ++G GIG
Sbjct: 25  GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPATRYQPLML--ARASGHGIG 82

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++
Sbjct: 83  KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208
                +D  H       K +      +SE   + F G HN    + ++ DEAS   D++ 
Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    + 
Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326
            +  YG DSD  ++ V G FP      FIP  + +EA+ R        +AP+I+G D A 
Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384
            G D+ V+ LR+G   + L+  +K TD      +I+   ++Y+ DA+ ID    TG ++ 
Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
              +  G     V     + D +   N+R E+      WL    +++      +L S   
Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           + V   G++ IE K   + +  +S    D L+ TFA
Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464


>gi|324008564|gb|EGB77783.1| hypothetical protein HMPREF9532_01752 [Escherichia coli MS 57-2]
          Length = 491

 Score =  191 bits (484), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 137/456 (30%), Positives = 216/456 (47%), Gaps = 24/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE+GT L   + PR WQ +    +  H  N      P +   A ++G GIG
Sbjct: 25  GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++
Sbjct: 83  KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208
                +D+ H       K +      +SE   + F G HN    + ++ DEAS   D++ 
Sbjct: 143 S----NDLGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    + 
Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326
            +  YG DSD  ++ V G FP      FIP  + +EA+ R        +AP+I+G D A 
Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384
            G D+ V+ LR+G   + L+  +K TD      +I+   ++Y+ DA+ ID    TG ++ 
Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
              +  G     V     + D +   N+R E+      WL    +++      +L S   
Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           + V   G++ IE K   + +  +S    D L+ TFA
Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464


>gi|300898423|ref|ZP_07116764.1| conserved hypothetical protein [Escherichia coli MS 198-1]
 gi|300357890|gb|EFJ73760.1| conserved hypothetical protein [Escherichia coli MS 198-1]
          Length = 491

 Score =  190 bits (482), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 137/456 (30%), Positives = 215/456 (47%), Gaps = 24/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE+GT L   + PR WQ +    +  H  N      P +   A ++G GIG
Sbjct: 25  GYALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++
Sbjct: 83  KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208
                +D  H       K +      +SE   + F G HN    + ++ DEAS   D++ 
Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    + 
Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326
            +  YG DSD  ++ V G FP      FIP  + +EA+ R        +AP+I+G D A 
Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384
            G D+ V+ LR+G   + L+  +K TD      +I+   ++Y+ DA+ ID    TG ++ 
Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
              +  G     V     + D +   N+R E+      WL    +++      +L S   
Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           + V   G++ IE K   + +  +S    D L+ TFA
Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464


>gi|309702815|emb|CBJ02146.1| putative terminase, large subunit [Escherichia coli ETEC H10407]
          Length = 493

 Score =  189 bits (481), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 133/472 (28%), Positives = 225/472 (47%), Gaps = 23/472 (4%)

Query: 31  FVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGK 90
           + L+ FPWGE+GT L   + PR WQ +    +  H  N      P +   A ++G GIGK
Sbjct: 26  YALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQNPATRHQPIML--ARASGHGIGK 83

Query: 91  TTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHP 150
           +   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++ 
Sbjct: 84  SAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKEWFTCTATAMYS 143

Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVINL 209
               +D  H       K +      +SE   + F G HN    + ++ DEAS   D++  
Sbjct: 144 ----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVWE 193

Query: 210 GILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGI 269
              G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    +  
Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRTVEGTNKEQLQKW 253

Query: 270 IARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAEE 327
           +  YG DSD  +V V G FP    + FIP  + + A+ R   P    +A +++G D + +
Sbjct: 254 VDDYGEDSDFVKVRVRGIFPDASENQFIPSGLTQPAVGRVITPAQVQHAAVVLGVDPSHQ 313

Query: 328 GGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTCD 385
           G D  V+ LR+G   + L +W + TD       I+   ++Y+ DA+ ID    TG ++  
Sbjct: 314 GKDPAVIYLRQGLHCKKLGEWQRTTDDVLFAKVIADFEDQYQADAVFIDYGYGTGLKSVG 373

Query: 386 YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSF 445
             +  G +   ++      D E   N+R E++    D L+  + ++   L   L + +  
Sbjct: 374 --DNWGRNWTLIMFGSGTADPEMG-NKRGEMYKSARDALKLGAQLDSQELADELSAPEYK 430

Query: 446 I-VPNTGELAIESKRVKG--AKSTDYSDGLMYTFAENPPRSDMDFGRCPSYQ 494
           + + ++ ++  +   VK    +S + +D  + T+A    +   ++G+  S Q
Sbjct: 431 VRLKDSRKILQDKDEVKELLGRSPNNADAYVLTYAAPVTKKQFNYGQQQSQQ 482


>gi|298381721|ref|ZP_06991320.1| terminase large subunit protein [Escherichia coli FVEC1302]
 gi|301019339|ref|ZP_07183525.1| conserved hypothetical protein [Escherichia coli MS 196-1]
 gi|298279163|gb|EFI20677.1| terminase large subunit protein [Escherichia coli FVEC1302]
 gi|299882256|gb|EFI90467.1| conserved hypothetical protein [Escherichia coli MS 196-1]
 gi|323948690|gb|EGB44595.1| hypothetical protein ERKG_04913 [Escherichia coli H252]
          Length = 491

 Score =  189 bits (481), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 137/456 (30%), Positives = 215/456 (47%), Gaps = 24/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE+GT L   + PR WQ +    +  H  N      P +   A ++G GIG
Sbjct: 25  GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++
Sbjct: 83  KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208
                +D  H       K +      +SE   + F G HN    + ++ DEAS   D++ 
Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    + 
Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326
            +  YG DSD  ++ V G FP      FIP  + +EA+ R        +AP+I+G D A 
Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384
            G D+ V+ LR+G   + L+  +K TD      +I+   ++Y+ DA+ ID    TG ++ 
Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
              +  G     V     + D +   N+R E+      WL    +++      +L S   
Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           + V   G++ IE K   + +  +S    D L+ TFA
Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464


>gi|218700994|ref|YP_002408623.1| putative phage terminase, large subunit [Escherichia coli IAI39]
 gi|218370980|emb|CAR18807.1| putative phage terminase, large subunit [Escherichia coli IAI39]
          Length = 491

 Score =  189 bits (481), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 137/456 (30%), Positives = 215/456 (47%), Gaps = 24/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE+GT L   + PR WQ +    +  H  N      P +   A ++G GIG
Sbjct: 25  GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++
Sbjct: 83  KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208
                +D  H       K +      +SE   + F G HN    + ++ DEAS   D++ 
Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    + 
Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326
            +  YG DSD  ++ V G FP      FIP  + +EA+ R        +AP+I+G D A 
Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384
            G D+ V+ LR+G   + L+  +K TD      +I+   ++Y+ DA+ ID    TG ++ 
Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
              +  G     V     + D +   N+R E+      WL    +++      +L S   
Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           + V   G++ IE K   + +  +S    D L+ TFA
Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464


>gi|294491573|gb|ADE90329.1| putative phage terminase, large subunit [Escherichia coli IHE3034]
          Length = 491

 Score =  189 bits (481), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 137/456 (30%), Positives = 215/456 (47%), Gaps = 24/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE+GT L   + PR WQ +    +  H  N      P +   A ++G GIG
Sbjct: 25  GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++
Sbjct: 83  KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208
                +D  H       K +      +SE   + F G HN    + ++ DEAS   D++ 
Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    + 
Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326
            +  YG DSD  ++ V G FP      FIP  + +EA+ R        +AP+I+G D A 
Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384
            G D+ V+ LR+G   + L+  +K TD      +I+   ++Y+ DA+ ID    TG ++ 
Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
              +  G     V     + D +   N+R E+      WL    +++      +L S   
Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-STAE 428

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           + V   G++ IE K   + +  +S    D L+ TFA
Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464


>gi|301046412|ref|ZP_07193572.1| conserved hypothetical protein [Escherichia coli MS 185-1]
 gi|300301638|gb|EFJ58023.1| conserved hypothetical protein [Escherichia coli MS 185-1]
          Length = 491

 Score =  189 bits (480), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 137/456 (30%), Positives = 214/456 (46%), Gaps = 24/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE GT L   + PR WQ +    +  H  N      P +   A ++G GIG
Sbjct: 25  GYALYAFPWGEDGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++
Sbjct: 83  KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208
                +D  H       K +      +SE   + F G HN    + ++ DEAS   D++ 
Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    + 
Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326
            +  YG DSD  ++ V G FP      FIP  + +EA+ R        +AP+I+G D A 
Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384
            G D+ V+ LR+G   + L+  +K TD      +I+   ++Y+ DA+ ID    TG ++ 
Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
              +  G     V     + D +   N+R E+      WL    +++      +L S   
Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           + V   G++ IE K   + +  +S    D L+ TFA
Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464


>gi|330007152|ref|ZP_08305894.1| hypothetical protein HMPREF9538_03583 [Klebsiella sp. MS 92-3]
 gi|328535499|gb|EGF61959.1| hypothetical protein HMPREF9538_03583 [Klebsiella sp. MS 92-3]
          Length = 495

 Score =  189 bits (480), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 138/456 (30%), Positives = 212/456 (46%), Gaps = 24/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE GT L   S PR WQ +    +  H  N      P +   A  +G GIG
Sbjct: 29  GYALYAFPWGEDGTELAHASGPRQWQADAFREIGEHLQNPATRHQPLMISRA--SGHGIG 86

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++
Sbjct: 87  KSAFISMLINWAMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKEWFTCTATAMY 146

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208
                +D  H       K +      +SE   + F G HN    + ++ DEAS   D++ 
Sbjct: 147 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIVVVFDEASNIADLVW 196

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    + 
Sbjct: 197 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRTVEGTNKQQLQK 256

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326
            +  YG DSD  +V V G FP      FIP  + +EA+ R        +AP I+G D A 
Sbjct: 257 WVDDYGEDSDFVKVRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPRIIGVDPAY 316

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384
            G D+ V+ LR+G   + L+  +K TD      +I+   ++Y+ DA+ ID    TG ++ 
Sbjct: 317 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 376

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
              +  G     V     + D +   N+R E+      WL+    ++      +L S   
Sbjct: 377 G--DGWGRTWQLVPFGGASADPQML-NKRGEMFNACKTWLKLGGALDDQETADDL-SAAE 432

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           + V   G++ +E K   + +  +S    D L+ TFA
Sbjct: 433 YKVRVDGKIVMEPKEDIKERLGRSPGKGDALLLTFA 468


>gi|215487825|ref|YP_002330256.1| predicted terminase, large subunit [Escherichia coli O127:H6 str.
           E2348/69]
 gi|215265897|emb|CAS10306.1| predicted terminase, large subunit [Escherichia coli O127:H6 str.
           E2348/69]
          Length = 493

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 133/472 (28%), Positives = 224/472 (47%), Gaps = 23/472 (4%)

Query: 31  FVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGK 90
           + L+ FPWGE GT L   + PR WQ +    +  H  N      P +   A ++G GIGK
Sbjct: 26  YALYAFPWGEDGTELAHATGPRKWQADAFREIRDHLQNPATRHQPLML--ARASGHGIGK 83

Query: 91  TTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHP 150
           +   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++ 
Sbjct: 84  SAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKEWFTCTATAMYS 143

Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVINL 209
               +D  H       K +      +SE   + F G HN    + ++ DEAS   D++  
Sbjct: 144 ----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVWE 193

Query: 210 GILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGI 269
              G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    +  
Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRTVEGTNKQQLQKW 253

Query: 270 IARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAEE 327
           +  YG DSD  +V V G FP    + FIP  + + A+ R   P    +A +++G D + +
Sbjct: 254 VDDYGEDSDFVKVRVRGIFPDASENQFIPSGLTQPAVGRVITPAQVQHAAVVLGVDPSHQ 313

Query: 328 GGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK-ISGLVEKYRPDAIIID-ANNTGARTCD 385
           G D  V+ LR+G   + L +W +T       K I+   ++Y+ DA+ ID    TG ++  
Sbjct: 314 GKDPAVIYLRQGLHCKKLGEWQRTTDDVLFAKIIADFEDQYQADAVFIDYGYGTGLKSVG 373

Query: 386 YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSF 445
             +  G +   +       D E   N+R E++    D L+  + ++   L   L + +  
Sbjct: 374 --DNWGRNWTLIQFGSGTADPEMG-NKRGEMYKSARDALKLGAQLDSQNLADELSAPEYK 430

Query: 446 I-VPNTGELAIESKRVKG--AKSTDYSDGLMYTFAENPPRSDMDFGRCPSYQ 494
           + + ++ ++  + + VK    +S + +D  + T+A    +   ++G+  S Q
Sbjct: 431 VRLKDSRKILQDKEEVKELLGRSPNDADAYVLTYAAPVTKKQFNYGQQQSQQ 482


>gi|331648179|ref|ZP_08349269.1| conserved hypothetical protein [Escherichia coli M605]
 gi|331043039|gb|EGI15179.1| conserved hypothetical protein [Escherichia coli M605]
          Length = 491

 Score =  189 bits (479), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 137/456 (30%), Positives = 215/456 (47%), Gaps = 24/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE+GT L   + PR WQ +    +  H  N      P +   A ++G GIG
Sbjct: 25  GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++
Sbjct: 83  KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208
                +D  H       K +      +SE   + F G HN    + ++ DEAS   D++ 
Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    + 
Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326
            +  YG DSD  ++ V G FP      FIP  + +EA+ R        +AP+I+G D A 
Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384
            G D+ V+ LR+G   + L+  +K TD      +I+   ++Y+ DA+ ID    TG ++ 
Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
              +  G     V     + D +   N+R E+      WL    +++      +L S   
Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNACKIWLRLGGMLDDQETADDL-SAAE 428

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           + V   G++ IE K   + +  +S    D L+ TFA
Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464


>gi|117624715|ref|YP_853628.1| putative phage terminase, large subunit [Escherichia coli APEC O1]
 gi|115513839|gb|ABJ01914.1| putative phage terminase, large subunit [Escherichia coli APEC O1]
          Length = 491

 Score =  188 bits (478), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 136/456 (29%), Positives = 215/456 (47%), Gaps = 24/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE+GT L   + PR WQ +    +  H  N      P +   A ++G GIG
Sbjct: 25  GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++
Sbjct: 83  KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208
                +D  H       K +      +SE   + F G HN    + ++ DEAS   D++ 
Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    + 
Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326
            +  YG DSD  ++ V G FP      FIP  + +EA+ R        ++P+I+G D A 
Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHSPVIIGVDPAY 312

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384
            G D+ V+ LR+G   + L+  +K TD      +I+   ++Y+ DA+ ID    TG ++ 
Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
              +  G     V     + D +   N+R E+      WL    +++      +L S   
Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-SAAE 428

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           + V   G++ IE K   + +  +S    D L+ TFA
Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464


>gi|30387381|ref|NP_848210.1| terminase large subunit [Enterobacteria phage epsilon15]
 gi|30266036|gb|AAO06065.1| terminase large subunit [Salmonella phage epsilon15]
          Length = 491

 Score =  187 bits (474), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 136/456 (29%), Positives = 214/456 (46%), Gaps = 24/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE GT L   + PR WQ +    +  H  N      P +   A ++G GIG
Sbjct: 25  GYALYAFPWGEDGTELAHATGPRKWQADAFREIRDHLQNPATRHQPLML--ARASGHGIG 82

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++
Sbjct: 83  KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKEWFTCTATAMY 142

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208
                +D  H       K +      +SE   + F G HN    + ++ DEAS   D++ 
Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    + 
Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRTVEGTNKQQLQK 252

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326
            +  YG +SD  +V V G FP      FIP  + +EA+ R        +AP+I+G D A 
Sbjct: 253 WVDDYGEESDFVKVRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384
            G D+ V+ LR+G   + L+  +K TD      +I+   ++Y+ DA+ ID    TG ++ 
Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
                  + +    G   + D +   N+R E+      WL+    ++      +L S   
Sbjct: 373 GDGWGRTWQLIPFGGG--STDPQML-NKRGEMFNSCKTWLKLGGALDDQETADDL-SAAE 428

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           + V   G++ IE K   + +  +S    D L+ TFA
Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464


>gi|89152423|ref|YP_512256.1| putative terminase large subunit [Escherichia phage phiV10]
 gi|74055446|gb|AAZ95895.1| putative terminase large subunit [Escherichia phage phiV10]
          Length = 491

 Score =  187 bits (474), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 136/456 (29%), Positives = 214/456 (46%), Gaps = 24/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE+GT L   + PR WQ +    +  H  N      P +   A ++G GIG
Sbjct: 25  GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++
Sbjct: 83  KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208
                +D  H       K +      +SE   + F G HN    + ++ DEAS   D++ 
Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    + 
Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326
            +  YG  SD  ++ V G FP      FIP  + +EA+ R        +AP+I+G D A 
Sbjct: 253 WVDDYGEGSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384
            G D+ V+ LR+G   + L+  +K TD      +I+   ++Y+ DA+ ID    TG ++ 
Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVFIDFGYGTGLKSI 372

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
              +  G     V     + D +   N+R E+      WL    +++      +L S   
Sbjct: 373 G--DGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTWLRLGGMLDDQETADDL-STAE 428

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           + V   G++ IE K   + +  +S    D L+ TFA
Sbjct: 429 YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFA 464


>gi|262043569|ref|ZP_06016682.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
 gi|259039103|gb|EEW40261.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
          Length = 491

 Score =  187 bits (474), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 137/456 (30%), Positives = 212/456 (46%), Gaps = 24/456 (5%)

Query: 30  NFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
            + L+ FPWGE GT L   + PR WQ +    +  H  N      P +   A ++G GIG
Sbjct: 25  GYALYAFPWGEDGTELAHATGPRKWQADAFREIRDHLQNPATRHQPLML--ARASGHGIG 82

Query: 90  KTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLH 149
           K+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L   K WF   + +++
Sbjct: 83  KSAFISMLINWAMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKEWFTCTATAMY 142

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVIN 208
                +D  H       K +      +SE   + F G HN    + ++ DEAS   D++ 
Sbjct: 143 S----NDPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRIVVVFDEASNIADLVW 192

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEG 268
               G LT+ +    W+   NP R +G+F E F K    WK  QID+RTVEG +    + 
Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRTVEGTNKQQLQK 252

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAE 326
            +  YG DSD  +V V G FP      FIP  + +EA+ R        +AP I+G D A 
Sbjct: 253 WVDDYGEDSDFVKVRVRGIFPDASELQFIPTGLTDEAMKRVVTAVQVAHAPRIIGVDPAY 312

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTC 384
            G D+ V+ LR+G   + L+  +K TD      +I+   ++Y  DA+ ID    TG ++ 
Sbjct: 313 SGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYLADAVFIDFGYGTGLKSI 372

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
              +  G     V     + D +   N+R E+      WL+    ++      +L S   
Sbjct: 373 G--DGWGRTWQLVPFGGASADPQML-NKRGEMFNACKTWLKLGGALDDQETADDL-SAAE 428

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           + V   G++ +E K   + +  +S    D L+ TFA
Sbjct: 429 YKVRVDGKIVMEPKEDIKERLGRSPGKGDALLLTFA 464


>gi|227355862|ref|ZP_03840255.1| phage terminase, large subunit [Proteus mirabilis ATCC 29906]
 gi|227164181|gb|EEI49078.1| phage terminase, large subunit [Proteus mirabilis ATCC 29906]
          Length = 494

 Score =  183 bits (464), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 143/501 (28%), Positives = 223/501 (44%), Gaps = 39/501 (7%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60
           MS  L  +PE EQ + D+       L ++ +    FPWGE G  LE ++ PR WQ E + 
Sbjct: 1   MSEALQKSPE-EQLIEDIASFTHDPLGYAYYA---FPWGEAGGELEEYNGPRQWQAEALN 56

Query: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
            +  H  N      P +   A ++G GIGK+   + ++ W M T     V+  AN+E QL
Sbjct: 57  EIGEHLRNPKTRHQPLLL--ARASGHGIGKSAFISMIIKWGMDTCEDCKVVVTANTENQL 114

Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSL------HPAPWYSDVLHCSLGIDSKHYSTMCR 174
           +T  W E++KW  L    +WF     ++      H   W +D +                
Sbjct: 115 RTKTWPEIAKWQRLSLTNNWFTCTKTAIYSNDPNHANAWRADAV---------------- 158

Query: 175 TYSEERPDTFVGHHNTYGMAI-INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRL 233
            +SE   + F G HN     I + DEAS   D++     G LT+      WI   NP R 
Sbjct: 159 PWSENNTEAFAGLHNKGKRIILVFDEASNIADLVWEVAEGALTDEGTEIIWIAFGNPTRN 218

Query: 234 SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI 293
           +G+F E F K    W   QID+RTVEG +    +     YG DSD  +V V G FP    
Sbjct: 219 TGRFRECFRKFKHRWNTKQIDSRTVEGSNKEQIKNWEEDYGEDSDFFKVRVRGVFPSASE 278

Query: 294 DSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLF-DWSK 350
             FIP  + +EA+ R        +AP+I+G D A  G D+ V+ LR+G   + L+  +  
Sbjct: 279 LQFIPTGLTDEAMKRIVTQAEVAHAPVIIGVDPAYSGIDDAVIYLRQGLFSKCLWTGFKT 338

Query: 351 TDLRTTNNKISGLVEKYRPDAIIID-ANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409
           TD      +I+   ++Y+ DA+ ID    TG  +   +      V+R++    A      
Sbjct: 339 TDDVVMAKRIADFEDQYKADAVHIDFGYGTGIHS---IGTSWGRVWRLVKFGGASTDPQM 395

Query: 410 RNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKST 466
            N+R E++  +  WL+    I+      +L   +  +     ++ +E K   + +  +S 
Sbjct: 396 LNKRGEMYNSVKTWLKIGGAIDDQETADDLSCGEYKVRVIDSKIVLEDKTEIKKRLGRSP 455

Query: 467 DYSDGLMYTFAENPPRSDMDF 487
              D L  TFA    + D ++
Sbjct: 456 GKGDALALTFAYPVTKIDRNY 476


>gi|282848875|ref|ZP_06258265.1| conserved hypothetical protein [Veillonella parvula ATCC 17745]
 gi|282581380|gb|EFB86773.1| conserved hypothetical protein [Veillonella parvula ATCC 17745]
          Length = 483

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 140/475 (29%), Positives = 217/475 (45%), Gaps = 62/475 (13%)

Query: 31  FVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGK 90
           FV   +PWGE GTPLE    P  WQ++ ++ +        +       + A+++G GIGK
Sbjct: 21  FVYFAYPWGEPGTPLENMEGPDEWQIQILKDIGEQLKKGKDLQT--AIQEAVASGHGIGK 78

Query: 91  TTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHP 150
           + L +WL+ + +ST      +  AN+E QL+T  W E+SKW ++   K  F   + ++  
Sbjct: 79  SALISWLIHFAISTHENTRGVVTANTEGQLRTKTWPELSKWHNMFIAKDLFTYTATAIFS 138

Query: 151 APWYSDVLHCSLGIDSKHYSTMCRT----YSEERPDTFVGHHNTYG-MAIINDEASGTPD 205
           +               K Y    R     +S+  P++F G HN    + ++ DEAS   D
Sbjct: 139 S--------------DKDYEKTWRIDAIPWSKNSPESFAGLHNQGNRILVLFDEASAIDD 184

Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSF 265
           VI     G LT+ N    W    NP R SG+F E F K    W  +QID+RTV+  + + 
Sbjct: 185 VIWEVTEGALTDANTEIIWCAFGNPTRNSGRFRECFRKYRKFWNTYQIDSRTVKISNKTK 244

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCD 323
            E  +  YG DSD  +V V G FP      FI   I ++A  +  +P    + P+I+G D
Sbjct: 245 IEEWLEAYGEDSDFFKVRVRGVFPSASDLQFISTEIADKAQKQVYKPGQFEHLPVIIGVD 304

Query: 324 IAEEGGDNTVVVLRRGPVIEHLF-------DWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
            A  G D+  +V+R+G  ++ L        DW    L      I+   ++Y+ DA+ ID 
Sbjct: 305 PAWTGSDSLEIVMRQGYYMKSLASIPKNDDDWRMAQL------IAQFEDEYKADAVFIDM 358

Query: 377 N-NTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCR--------NRRTELHVKMADWLEFA 427
              TG           Y + + LG+K  + +EF          N R  +  +M +WL   
Sbjct: 359 GYGTGI----------YSIGKQLGRKWRL-IEFGGKSNDPVYLNMRAYMWGQMKEWLREG 407

Query: 428 SLI--NHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
             I  N   L  ++   ++ I  N G + +ESK   + +G  S +  D L  TFA
Sbjct: 408 GSIPPNDQALYDDIVGPEAIIDKN-GRIQLESKKDMKDRGLPSPNKGDALALTFA 461


>gi|54302246|ref|YP_132239.1| terminase large subunit [Photobacterium profundum SS9]
 gi|46915667|emb|CAG22439.1| hypothetical protein PBPRB0566 [Photobacterium profundum SS9]
          Length = 513

 Score =  164 bits (414), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 125/446 (28%), Positives = 202/446 (45%), Gaps = 27/446 (6%)

Query: 37  PWGEKGTPLEGFSAPRSWQLE----FMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTT 92
           PW  K   + G   P +W  E      EV+  +  N V+    + F  +IS+G GIGK+ 
Sbjct: 48  PWASKYDSVYG---PDAWFCEMCDQLQEVIRKNDFNGVDPV--DAFLYSISSGHGIGKSC 102

Query: 93  LNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAP 152
            ++WL+ ++MSTRP    +  +N+  QL+T  W E+ KW   L NKHWF   +   +   
Sbjct: 103 ASSWLIHFVMSTRPNSKGVVTSNTSEQLRTKTWGELGKWTKKLINKHWFVYNNGKGNMNF 162

Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-IINDEASGTPDVINLGI 211
           ++ D         ++ +    +T  EE  ++F G H        + DEAS  PD I    
Sbjct: 163 YHKDY--------AETWRVDAQTCREENSESFAGLHCASSTPWYLFDEASAVPDKIWEVA 214

Query: 212 LGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIA 271
            G LT+     FW +  NP R SG+F E + +    W R QID+ TV+  +        +
Sbjct: 215 EGGLTD--GEPFWFVFGNPTRNSGRFRECWRRFRQRWNRKQIDSSTVQVTNKKKISEWES 272

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN 331
            YG DSD  RV V G FP    +  I   ++E A++R     P +P +M  D+A  GGDN
Sbjct: 273 DYGEDSDFYRVRVKGVFPSASSNQKISGALLEAAMSRTAHVIPGSPRVMSLDVARGGGDN 332

Query: 332 TVVVLRRG--PVIEHLFDWSKTDLRTTNNKISGLVE---KYRPDAIIIDANNTGARTCDY 386
            V   R G    +        ++ R +    +  V+   +++PDA  ID    G    D 
Sbjct: 333 CVFRFRHGLNGGVRKKVTLPGSEYRDSMKLAAMAVQLCSEFKPDAFFIDETGVGGPVGDR 392

Query: 387 LEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINH-SGLIQNLKSLKSF 445
           +  LG++   +    +A D  +  N R  ++ +  +WL+    +++  GL+  + +++  
Sbjct: 393 IRQLGFNCIGINFASKAPDPHYA-NMRAYMYHQWGEWLKAGGSLHYDEGLLTEVGAIEYT 451

Query: 446 IVPNTGELAIESKRVKGAKSTDYSDG 471
                 E+ I    +K A      DG
Sbjct: 452 HDRKDREILIPKDVIKKAIGISTDDG 477


>gi|332981151|ref|YP_004462592.1| hypothetical protein Mahau_0567 [Mahella australiensis 50-1 BON]
 gi|332698829|gb|AEE95770.1| hypothetical protein Mahau_0567 [Mahella australiensis 50-1 BON]
          Length = 461

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 140/452 (30%), Positives = 206/452 (45%), Gaps = 58/452 (12%)

Query: 49  SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108
           + P  WQ E ++ +           NP V   A+ +G G+GKT L AW +LW + TRP  
Sbjct: 25  AEPDDWQAETLQAL---------ADNPRV---AVRSGHGVGKTALEAWALLWFLFTRPYP 72

Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPN-KHWFEMQSLSL----HPAPWYSDVLHCSLG 163
            + C A +  QL   LWAE SKWL   P  K +FE Q   +    +P  W++        
Sbjct: 73  KIPCTAPTREQLHDILWAEASKWLERAPALKPYFEWQKTRIVQKQYPGRWFA-------- 124

Query: 164 IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRF 223
                     RT +  +P+   G H  + + II DEASG  D I   I G LT  +A   
Sbjct: 125 --------TARTSN--KPENMAGFHEEHLLFII-DEASGIADNIFETIEGALTTSDAK-- 171

Query: 224 WIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283
            +M  NP + SG F++ F K    +   ++     + +   + E +  +Y  DSDV RV 
Sbjct: 172 LLMCGNPTKNSGVFHDAFFKDRSLYWTRKVSCLDSQRVTLEYAERLKRKYHEDSDVYRVR 231

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIE 343
           V G+FP+ + D+FI L+I+E A  R+  PD    L +G D+A  G D TV+  R G  + 
Sbjct: 232 VLGEFPKAEPDTFISLDIVEAATMRDVEPD--GVLEIGVDVARFGDDETVLAARAGLKLV 289

Query: 344 HLFDWSKTDLRTTNNKISGLVEKY-----RPDAII-IDANNTGARTCDYL------EMLG 391
           +L  ++K D  TT      L +       +P   I ID +  G    D        E L 
Sbjct: 290 YLKAYTKQDTMTTAGYAIALAKDLMKECGKPKCTIKIDDDGVGGGVTDRCREVVREEKLY 349

Query: 392 YHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL--EFASLINHSGLIQNLKSLKSFIVPN 449
             V          D E   N  TE    + D L  E A LIN   LI  L + K + + +
Sbjct: 350 IDVIDCHNGGAPEDKEHYENWGTEAWAYLRDLLQDEQAELINDEDLIGQLTTRK-YRITS 408

Query: 450 TGELAIESK---RVKGAKSTDYSDGLMYTFAE 478
            G++A+ESK   + +G  S D +D ++  +A+
Sbjct: 409 KGKIALESKDEMKRRGLMSPDRADAVVLAYAK 440


>gi|228968731|ref|ZP_04129698.1| hypothetical protein bthur0004_54930 [Bacillus thuringiensis
           serovar sotto str. T04001]
 gi|228790961|gb|EEM38595.1| hypothetical protein bthur0004_54930 [Bacillus thuringiensis
           serovar sotto str. T04001]
          Length = 459

 Score =  157 bits (398), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 137/492 (27%), Positives = 223/492 (45%), Gaps = 77/492 (15%)

Query: 14  KLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNP 73
           ++ D+ W D +  +F+  +L F+P                WQ + +       ++   +P
Sbjct: 2   EIIDVYWDDPV--AFAEDMLGFYP--------------DEWQRKVL-------MDLAQSP 38

Query: 74  NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133
                K ++ +G+G+GKT L + +V+W +  RP   VIC A ++ QL T LWAE++KWL 
Sbjct: 39  -----KVSVRSGQGVGKTGLESVVVIWFLCCRPNPKVICTAPTKEQLFTVLWAEIAKWLE 93

Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193
               K+  +     ++            +G + + ++T  RT +  +P+   G H  Y M
Sbjct: 94  GSAVKNLLKWTKTRVY-----------MIGSEERWFAT-ARTAT--KPENMQGFHEDY-M 138

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
             + DEASG  D I   ILG L+   A     +  NP R SG FY+  N+  D +K  ++
Sbjct: 139 LFVCDEASGIADPIMEAILGTLS--GAENKLFLCGNPTRTSGVFYDSHNRDRDLYKIHKV 196

Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313
            +           E +  +YG  SDV RV V G+FP+ + D+FIPL I+E+A + +  P 
Sbjct: 197 SSLDSPRTSKDNIEVLKKKYGEGSDVWRVRVLGEFPKAEADAFIPLEIVEQAASCKVEPT 256

Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY------ 367
               L +G D+A  G D TV+  R G  +  L +  K D   T   +  L ++Y      
Sbjct: 257 GET-LDLGVDVARFGDDETVIAPRIGNKVFKLLNHYKQDTMETAGHVLKLAKEYMAKYKQ 315

Query: 368 --RPDAIIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVK 419
             R D I +D +  G    D L      E L + VY V+   + +D E   N  TE    
Sbjct: 316 LKRVD-IKVDDSGVGGGVTDRLKEVIKSERLPFKVYPVVNNGKPLDDEHYDNAGTEGWAV 374

Query: 420 MADWLE------------FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAK 464
           + D LE               + N   +I    S K + + + G++A+E K   + +G +
Sbjct: 375 VRDLLEENMKAFIQGEEPTMEIPNDEKMISQFSSRK-YRITSRGKIALERKEEMKKRGLQ 433

Query: 465 STDYSDGLMYTF 476
           S D +D ++  F
Sbjct: 434 SPDRADAIVLAF 445


>gi|228911519|ref|ZP_04075310.1| hypothetical protein bthur0013_56490 [Bacillus thuringiensis IBL
           200]
 gi|228848128|gb|EEM92991.1| hypothetical protein bthur0013_56490 [Bacillus thuringiensis IBL
           200]
          Length = 459

 Score =  156 bits (394), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 136/492 (27%), Positives = 222/492 (45%), Gaps = 77/492 (15%)

Query: 14  KLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNP 73
           ++ D+ W D +  +F+  +L F+P                WQ + +       ++   +P
Sbjct: 2   EIIDVYWDDPV--AFAEDMLGFYP--------------DEWQRKVL-------MDLAQSP 38

Query: 74  NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133
                K ++ +G+G+GKT L + +V+W +  RP   VIC A ++ QL T LWAE++KWL 
Sbjct: 39  -----KVSVRSGQGVGKTGLESVVVIWFLCCRPNPKVICTAPTKEQLFTVLWAEIAKWLE 93

Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193
               K+  +     ++            +G + + ++T  RT +  +P+   G H  Y M
Sbjct: 94  GSAVKNLLKWTKTRVY-----------MIGSEERWFAT-ARTAT--KPENMQGFHEDY-M 138

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
             + DEASG  D I   ILG L+   A     +  NP R SG FY+  N+  D +K  ++
Sbjct: 139 LFVCDEASGIADPIMEAILGTLS--GAENKLFLCGNPTRTSGVFYDSHNRDRDLYKIHKV 196

Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313
            +           E +  +YG  SDV RV V G+FP+ + D+FIPL I+E+A + +  P 
Sbjct: 197 SSLDSPRTSKDNIEVLKKKYGEGSDVWRVRVLGEFPKAEADAFIPLEIVEQAASCKVEPT 256

Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY------ 367
               L +G D+A  G D TV+  R G  +  L +  K D   T   +  L ++Y      
Sbjct: 257 GET-LDLGVDVARFGDDETVIAPRIGNKVFKLLNHYKQDTMETAGHVLKLAKEYMAKYKQ 315

Query: 368 --RPDAIIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVK 419
             R D I +D +  G    D L      E L + VY V+   + +D E   N   E    
Sbjct: 316 LKRVD-IKVDDSGVGGGVTDRLKEVIKSERLPFKVYPVVNNGKPLDDEHYDNAGAEGWAV 374

Query: 420 MADWLE------------FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAK 464
           + D LE               + N   +I    S K + + + G++A+E K   + +G +
Sbjct: 375 VRDLLEENMKAFIQGEEPTMEIPNDEKMISQFSSRK-YRITSRGKIALERKEEMKKRGLQ 433

Query: 465 STDYSDGLMYTF 476
           S D +D ++  F
Sbjct: 434 SPDRADAIVLAF 445


>gi|332976102|gb|EGK12970.1| hypothetical protein HMPREF9374_1123 [Desmospora sp. 8437]
          Length = 462

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 129/427 (30%), Positives = 197/427 (46%), Gaps = 55/427 (12%)

Query: 81  AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140
           A+ AG G+GKT   AW VLW + TRP   + C A ++ QL   LW E++KWL        
Sbjct: 51  AVRAGHGVGKTATEAWAVLWFLLTRPFPKIPCTAPTKPQLMDVLWPEIAKWL-------- 102

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
             M +  L P   +          + + ++T  RT +  +P+   G H  + + +I DEA
Sbjct: 103 --MNAPELAPYVEWQKTRVVMKQYEERWFAT-ARTSN--KPENMAGFHEEHLLFVI-DEA 156

Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEG 260
           SG  + I   I G LT   A    +M  NP R +G FY+ F++  D +  ++I     + 
Sbjct: 157 SGVDNAIFETIDGALT--TAGSKLVMFGNPTRTNGVFYDAFHQDRDLYWTYKISCLDSKM 214

Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIM 320
               +   +  +YG DSD+ RV V G+FPQ D DSFIPL ++E+A  R+        L +
Sbjct: 215 ASKDYARNMARKYGEDSDIYRVRVQGEFPQGDPDSFIPLELVEDARVRDLEWIDEDELHI 274

Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG--------LVEKYRPDAI 372
           G D+A  G D TV+  R GPV    F   +   RT   +  G        L+E++R D  
Sbjct: 275 GVDVARFGSDETVLAARIGPVA---FRLDRYGGRTPTTETVGRVLALARELMEEHRRDYA 331

Query: 373 IIDANNT--GARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELH--VKMADW----- 423
           ++  ++T  G    D L+ +      V  +   +D+  C N  T  H      DW     
Sbjct: 332 VVKVDDTGVGGGVTDQLQEI------VAEEGLNIDVIPCNNGATPEHDPDHYHDWGTESW 385

Query: 424 ---------LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDG 471
                     E A  I+   LI  L + K  +  + G++ +ESK   + +G +S D +D 
Sbjct: 386 GTLLDRFKAGEIALKIDDEDLIGQLTTRKKEMT-SKGKIKLESKEKMKKRGQRSPDRADA 444

Query: 472 LMYTFAE 478
           L+  FAE
Sbjct: 445 LVLAFAE 451


>gi|209901239|ref|YP_002290878.1| putative terminase B [Clostridium phage phiCD27]
 gi|199612120|gb|ACH91293.1| putative terminase B [Clostridium phage phiCD27]
          Length = 469

 Score =  153 bits (386), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 134/437 (30%), Positives = 202/437 (46%), Gaps = 62/437 (14%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138
           K +I +G+G+GKT L +   +W +STRP   V+  A +  QL   LWAE++KWLS    +
Sbjct: 44  KVSIRSGQGVGKTGLESIATVWYLSTRPFPKVVATAPTRQQLYDVLWAEIAKWLSNSKVE 103

Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198
              E          W    ++   G + + ++T  RT    +P+   G H  Y M  + D
Sbjct: 104 KLLE----------WTKTKVYMK-GFEERWWAT-ARTAV--KPENMQGFHEDY-MLFVVD 148

Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT--- 255
           EASG  D I   ILG L+   A    ++  NP R SG FY+  N+  D +K F++ +   
Sbjct: 149 EASGVADPIMEAILGTLS--GAENKLLLCGNPTRTSGTFYDSHNRDRDLYKTFKVSSLDS 206

Query: 256 -RT----VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
            RT    +E +   +HEG        SD  RV V G+FP+ + DS I L  +E +  RE 
Sbjct: 207 PRTSKDNIEMLKRKYHEG--------SDPWRVRVLGEFPKGESDSLISLEAVETSTIREV 258

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYR-- 368
                  L +G DIA  G D T++  R G  +  L  +SK D   T   I   V+K++  
Sbjct: 259 NISNDYILNIGADIARYGDDETIIAPRIGGKVFDLLTYSKKDTMETVGNILRAVDKFKNM 318

Query: 369 -----PDAIIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDL--------EFC 409
                   I  D +  GA   D L      E L Y V  +     A++         E  
Sbjct: 319 YHQINRVKIKTDDDGLGAGVTDRLKEVIRHERLKYEVIPIQNGSSAIEKDKYYNKASEMW 378

Query: 410 RNRRTELHVKMADWLE----FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKG 462
            N R EL   ++ +++       L N   LI+ L + K + V + G++ IESK   + + 
Sbjct: 379 DNMREELDANLSSFIQNKEAIIQLPNDDKLIKQLSNRK-YTVDSKGKIQIESKKEMKKRI 437

Query: 463 AKSTDYSDGLMYTFAEN 479
            +S D +D ++Y+FAEN
Sbjct: 438 GESPDRADAVIYSFAEN 454


>gi|150016512|ref|YP_001308766.1| hypothetical protein Cbei_1636 [Clostridium beijerinckii NCIMB
           8052]
 gi|149902977|gb|ABR33810.1| conserved hypothetical protein [Clostridium beijerinckii NCIMB
           8052]
          Length = 470

 Score =  151 bits (382), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 132/438 (30%), Positives = 203/438 (46%), Gaps = 63/438 (14%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138
           K ++ +G+G+GKT L + +V W + TRP   VI  A +  QL   LWAE+SKWL+    +
Sbjct: 44  KVSVRSGQGVGKTGLESIVVTWYLCTRPFPKVIATAPTRQQLYDVLWAEISKWLASSKIE 103

Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198
           +  E     ++   +            S+ +    +T +  RP+   G H  Y M  + D
Sbjct: 104 NLLEWTKTKIYMKGY------------SERWWATAKTAT--RPENMQGFHEDY-MLFVVD 148

Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT--- 255
           EASG  D I   ILG LT    N+  +M  NP R SG FY+  N+  D +K F++ +   
Sbjct: 149 EASGVADPIMEAILGTLTGYE-NKL-LMCGNPTRTSGTFYDSHNRDRDLYKTFKVSSLES 206

Query: 256 -RT----VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA-LNRE 309
            RT    +E +   +HEG        SDV RV V G+FP+ + DS I L   E A + + 
Sbjct: 207 PRTSKDNIEMLKRKYHEG--------SDVWRVRVEGEFPKGESDSLISLEYAETATITKI 258

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369
                   L +G DIA  G D +V+  R G  +  L  ++K D   T   I    +K++ 
Sbjct: 259 NNIHNNFTLHIGADIARFGNDESVIAPRIGNKVFDLLTYTKKDTMETTGNILRATDKFKN 318

Query: 370 D-------AIIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDLEFCRNRRTEL 416
           +        I +D +  G    D L      E LGY V  +    +A D E   ++  E+
Sbjct: 319 EYKHINKVKIRVDDDGLGGGVTDRLREVIRQEGLGYEVMPIKNGSKANDEEHYSDKSAEM 378

Query: 417 HVKMADWLE--FASLI----------NHSGLIQNLKSLKSFIVPNTGELAIESK---RVK 461
              M D LE  F + +          N+  LI+ L + K F + + G + +E K   + +
Sbjct: 379 WGNMRDILEENFTNFVQGKEPTIELPNNDKLIKQLSNRK-FRIDSKGRIDLEKKEEMKKR 437

Query: 462 GAKSTDYSDGLMYTFAEN 479
             +S D +D ++Y+FAEN
Sbjct: 438 IGESPDLADAVIYSFAEN 455


>gi|150390341|ref|YP_001320390.1| hypothetical protein Amet_2579 [Alkaliphilus metalliredigens QYMF]
 gi|149950203|gb|ABR48731.1| conserved hypothetical protein [Alkaliphilus metalliredigens QYMF]
          Length = 469

 Score =  147 bits (370), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 127/428 (29%), Positives = 195/428 (45%), Gaps = 44/428 (10%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138
           K ++ +G+G+GKT L +  + W + TRP   VI  A +  QL   LWAE+SKWLS     
Sbjct: 44  KVSVRSGQGVGKTGLESIAITWYLCTRPFPKVIATAPTRQQLYDVLWAEISKWLS----- 98

Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198
                +S       W    ++ + G + + ++T  RT    RP+   G H  Y M  + D
Sbjct: 99  -----KSKVDKLLRWTKTKIYMN-GFEERWWAT-ARTAV--RPENMQGFHEDY-MLFVVD 148

Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTV 258
           EASG  D I   ILG LT    N+  ++  NP + SG FY+  N+  D +K  ++ +   
Sbjct: 149 EASGVADPIMEAILGTLTGYE-NKL-LLCGNPTKTSGTFYDSHNRDRDTYKSHKVSSMDS 206

Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL 318
                   E +  +YG DSDV RV V G FP+ + DS I L + E+A            L
Sbjct: 207 PRTSKENIEMLKKKYGADSDVFRVRVLGDFPKGEADSLISLEVTEQAAETVVDISNAYTL 266

Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD-------A 371
            +G DIA  G D T++  R G  +  L  +SK D   T   I   V++ +          
Sbjct: 267 NIGADIARFGDDKTIIAPRIGNRVLDLQQYSKKDTMETAGNILRTVDRLKTQHLQINKIV 326

Query: 372 IIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE 425
           I ID +  G    D L      + LGY +  +    +A D E   N+  E+   + + L+
Sbjct: 327 IKIDDDGLGGGVTDRLREINRQQSLGYIIVPIKNGSKADDPEHYYNKAAEMWDNIRELLD 386

Query: 426 ---FASLINHSGLIQNLK--------SLKSFIVPNTGELAIESK---RVKGAKSTDYSDG 471
                 L    G+IQ  K        S + + V + G + +ESK   + +  +S D +D 
Sbjct: 387 ENLSKFLQGEPGVIQLPKDDILIKQLSNRKYKVDSKGRIELESKDEMKRRIGESPDRADA 446

Query: 472 LMYTFAEN 479
           ++Y+FA +
Sbjct: 447 VIYSFASD 454


>gi|153810665|ref|ZP_01963333.1| hypothetical protein RUMOBE_01049 [Ruminococcus obeum ATCC 29174]
 gi|149833061|gb|EDM88143.1| hypothetical protein RUMOBE_01049 [Ruminococcus obeum ATCC 29174]
          Length = 469

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 96/284 (33%), Positives = 142/284 (50%), Gaps = 17/284 (5%)

Query: 81  AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140
           ++ +G GIGK+ + AW V+W M T P   + C A ++ QL   LWAE+SKW     N   
Sbjct: 44  SVRSGHGIGKSAVEAWSVIWFMCTHPYPKIPCTAPTQHQLFDILWAEISKWKR---NNKT 100

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
            + + +      W  + L+  +   ++ +  + RT S   PD   G H  + + II DEA
Sbjct: 101 LDSELI------WTKEKLY--MKGHAEEWFAVARTAST--PDALQGFHAEHMLYII-DEA 149

Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEG 260
           SG  D I   +LG L+   A    +M  NP +LSG FY+  NK  + +  F ID R    
Sbjct: 150 SGVEDKIFEPVLGALSTPGAK--LLMCGNPTQLSGFFYDSHNKNREQYSTFHIDGRNSTR 207

Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI- 319
           +   F + II  YG DSDV RV V G FP  + D +IPL ++E+++  E  P  +  +I 
Sbjct: 208 VSQEFVQTIINMYGEDSDVFRVRVAGDFPLAEDDIYIPLPLVEKSIATEYFPRRHPQIIH 267

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL 363
           +GCD+A  G D TV+  R    ++        D   T + I  L
Sbjct: 268 IGCDVARFGTDKTVIGYRTDEKVQFFKKRVGQDTMKTADDIVSL 311


>gi|228950291|ref|ZP_04112468.1| hypothetical protein bthur0007_63570 [Bacillus thuringiensis
           serovar monterrey BGSC 4AJ1]
 gi|228809453|gb|EEM55897.1| hypothetical protein bthur0007_63570 [Bacillus thuringiensis
           serovar monterrey BGSC 4AJ1]
          Length = 495

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 126/470 (26%), Positives = 200/470 (42%), Gaps = 73/470 (15%)

Query: 51  PRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISV 110
           P  WQ E +  +  H   SV             +G+G+GKT + +W+ +W +  RP   +
Sbjct: 41  PDPWQKEVLNDIANHSHVSVR------------SGQGVGKTAMESWICIWFLCCRPYPKI 88

Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170
           IC A ++ QL   LWAE++KWL+    K   +          W    ++   G + + ++
Sbjct: 89  ICTAPTKQQLYDVLWAEIAKWLNSSQVKDLLK----------WTKTKIYMK-GFEDRWFA 137

Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230
           T     +  RP+   G H  Y M  I DEASG  D I   ILG L+      F  M  NP
Sbjct: 138 T---AKTATRPENMQGFHEDY-MLFIADEASGIADDIMEAILGTLSGSENKLF--MCGNP 191

Query: 231 RRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290
            + SG F++  NK    +K  ++ +           E +  +YG  SDV RV V G+FP+
Sbjct: 192 TKTSGVFFDSHNKDRALYKSHKVSSADSPRTSKKNIEMLKKKYGEGSDVYRVRVEGEFPR 251

Query: 291 QDIDSFIPLNIIEEALNREP------------------CPDPYAPLIMGCDIAEEGGDNT 332
            + D+FI L   E A  RE                    PD  A + +GCD+A  G D T
Sbjct: 252 GEADAFISLETAEAARMREVYKVEVIENEEEESTVKEIIPDT-AVVEIGCDVARFGSDET 310

Query: 333 VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY--------RPDAIIIDANNTGARTC 384
           ++  RRG  +  L    + D    +  +    +KY        +   I ID    G    
Sbjct: 311 IIATRRGWKVLPLQVHHQRDTMYVSGLLVQEAKKYFSWCERTGKRIPIRIDDTGVGGGVT 370

Query: 385 DYL-EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA--------DWLEFASLINHSGL 435
           D L E++  + Y +      + + F      E    ++        + LEF +L +   L
Sbjct: 371 DRLKEVVAENDYPI----DVIPINFASKGNAEYACIVSVMYGHFKDNCLEFVALPDDEDL 426

Query: 436 IQNLKSLKSFIVPNTGELAIESKRV---KGAKSTDYSDGLMYTFAENPPR 482
           I  L S++ + + + G + IE K+    +G KS D ++ ++  FA   P+
Sbjct: 427 IAQL-SVRKYQINSDGRIKIEPKKAMKDRGLKSPDRAEAVVMAFAPFYPK 475


>gi|257883493|ref|ZP_05663146.1| conserved hypothetical protein [Enterococcus faecium 1,231,502]
 gi|294614775|ref|ZP_06694675.1| hypothetical protein EfmE1636_0865 [Enterococcus faecium E1636]
 gi|294622490|ref|ZP_06701512.1| conserved hypothetical protein [Enterococcus faecium U0317]
 gi|257819151|gb|EEV46479.1| conserved hypothetical protein [Enterococcus faecium 1,231,502]
 gi|291592387|gb|EFF23996.1| hypothetical protein EfmE1636_0865 [Enterococcus faecium E1636]
 gi|291598037|gb|EFF29147.1| conserved hypothetical protein [Enterococcus faecium U0317]
          Length = 471

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 131/478 (27%), Positives = 218/478 (45%), Gaps = 49/478 (10%)

Query: 34  HFFPWGEKGTPLEGF-SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTT 92
            F P+ + G+ ++ +   P ++  + + +       +V N   E  K ++ +G+G+GKT 
Sbjct: 4   EFIPFADIGSAIDYYYDKPVAFCQDILHLNPDEWQENVLNDLAEFSKVSVRSGQGVGKTA 63

Query: 93  LNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAP 152
           L A  +LW ++ RP   VI  A +  QL   LWAEV+KWL+           SL  +   
Sbjct: 64  LEAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLN----------DSLIKNLLK 113

Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGIL 212
           W    ++  +G DS+ +    RT +  +P+   G H  + M I+ DEASG  D I   IL
Sbjct: 114 WTKTKIYM-VG-DSERWFATARTAT--KPENMQGFHEDH-MLIVVDEASGVSDPIMEAIL 168

Query: 213 GFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR 272
           G L+    +   +M  NP  + G FY+  N   D ++  ++ +   +  +    E I+ +
Sbjct: 169 GTLS--GFDNKLLMCGNPNNIEGVFYDSHNSDRDKYRVHKVSSYDSKRTNKDNIEMILKK 226

Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE---PCPDPYAPLIMGCDIAEEGG 329
           YG +SDV RV + G+FP+  +DSFI L  +E A  ++      +      +G D+A  G 
Sbjct: 227 YGKESDVARVRIFGEFPKGALDSFISLETVELATEKQISDSLVNKTTVAHIGVDVARYGD 286

Query: 330 DNTVVVLRRGPVIEHLFDWSK-TDLRTTN---NKISGLVEKY-RPDAIIIDANNT--GAR 382
           D+T++  R          +SK + + TT    N    L+ +Y   D ++I  ++T  G  
Sbjct: 287 DSTILFPRIATRALEYEKYSKRSTMETTGYVINMAKNLMSQYPSIDKVMIKVDDTGVGGG 346

Query: 383 TCDYLEML---GYHVYRVLGQKRAVDLE--FCRNRRTELHVKMADWLE------------ 425
             D LE L    ++ + V G       E  F  N  T+L   + + LE            
Sbjct: 347 VTDRLEELIEDKHYPFEVFGVNNGSTSEDDFYDNLGTQLWGNIKEMLEENMTANLNGEQP 406

Query: 426 FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
              L + S LI+ L S + F + +   + +ESK   + +   S D +D L   F E P
Sbjct: 407 VIELPSDSSLIKEL-STRKFKMTSRSRIRLESKDDMKKRNIGSPDIADALALAFYEPP 463


>gi|282598712|ref|YP_003358792.1| putative phage terminase B protein [Enterococcus phage phiEf11]
 gi|300860603|ref|ZP_07106690.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11]
 gi|307292389|ref|ZP_07572245.1| hypothetical protein HMPREF9509_02682 [Enterococcus faecalis
           TX0411]
 gi|258598082|gb|ACV83339.1| putative phage terminase B protein [Enterococcus phage phiEf11]
 gi|300849642|gb|EFK77392.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11]
 gi|306496518|gb|EFM66079.1| hypothetical protein HMPREF9509_02682 [Enterococcus faecalis
           TX0411]
 gi|315146097|gb|EFT90113.1| conserved hypothetical protein [Enterococcus faecalis TX2141]
          Length = 484

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 123/431 (28%), Positives = 198/431 (45%), Gaps = 50/431 (11%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138
           K ++ +G+G+GKT L A  +LW ++ RP   VI  A +  QL   LWAEV+KWL+     
Sbjct: 50  KVSVRSGQGVGKTALEAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLN----- 104

Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198
                 SL      W    ++  +G DS+ +    RT +  +P+   G H  + M I+ D
Sbjct: 105 -----NSLIKDLLKWTKTKIYM-VG-DSERWFATARTAT--KPENMQGFHEDH-MLIVVD 154

Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTV 258
           EASG  D I   ILG L+    +   +M  NP  + G FY+  N   D ++  ++ +   
Sbjct: 155 EASGVADPIMEAILGTLS--GFDNKLLMCGNPNNIEGVFYDSHNTDRDKYRTHKVSSYDS 212

Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL 318
           +  +    + +I +YG +SDV RV + G+FP+  +DSFI L I+E A +          +
Sbjct: 213 KRTNKENIQMLIDKYGENSDVARVRIYGEFPKGALDSFISLEIVEFAKDINISDSELKHV 272

Query: 319 I---MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA---- 371
               +G D+A  G D+T+V  R G        +SK D   T  ++    ++   D     
Sbjct: 273 REGHIGVDVARFGDDSTIVFPRIGAKALPFEKYSKQDTMQTTGRVLKAAKRMMDDYPTIK 332

Query: 372 ---IIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422
              I +D    G    D L      E L Y V  V   + + D ++  N+ T++   + +
Sbjct: 333 KVFIKVDDTGVGGGVTDRLKEVISDEKLPYEVIPVNNGESSTD-DYYANKGTQIWGDVKE 391

Query: 423 WLE--FASLINHSG----------LIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTD 467
            LE   ++ IN  G          LI+ L S + F + + G++ +ESK   + +   S D
Sbjct: 392 LLEQNISNSINGQGPTIELPDNANLIKEL-STRKFKMTSNGKIRLESKEDMKKRNVGSPD 450

Query: 468 YSDGLMYTFAE 478
            +D L   F E
Sbjct: 451 IADALTLAFYE 461


>gi|261208032|ref|ZP_05922709.1| conserved hypothetical protein [Enterococcus faecium TC 6]
 gi|289567088|ref|ZP_06447483.1| conserved hypothetical protein [Enterococcus faecium D344SRF]
 gi|260077749|gb|EEW65463.1| conserved hypothetical protein [Enterococcus faecium TC 6]
 gi|289161103|gb|EFD09008.1| conserved hypothetical protein [Enterococcus faecium D344SRF]
          Length = 471

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 131/478 (27%), Positives = 217/478 (45%), Gaps = 49/478 (10%)

Query: 34  HFFPWGEKGTPLEGF-SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTT 92
            F P+ + G  ++ +   P ++  + + +       +V N   E  K ++ +G+G+GKT 
Sbjct: 4   EFIPFADIGAAIDYYYDKPVAFCQDILHLNPDEWQENVLNDLAEFSKVSVRSGQGVGKTA 63

Query: 93  LNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAP 152
           L A  +LW ++ RP   VI  A +  QL   LWAEV+KWL+           SL  +   
Sbjct: 64  LEAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLN----------DSLIKNLLK 113

Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGIL 212
           W    ++  +G DS+ +    RT +  +P+   G H  + M I+ DEASG  D I   IL
Sbjct: 114 WTKTKIYM-VG-DSERWFATARTAT--KPENMQGFHEDH-MLIVVDEASGVSDPIMEAIL 168

Query: 213 GFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR 272
           G L+    +   +M  NP  + G FY+  N   D ++  ++ +   +  +    E I+ +
Sbjct: 169 GTLS--GFDNKLLMCGNPNNIEGVFYDSHNSDRDKYRVHKVSSYDSKRTNKDNIEMILKK 226

Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE---PCPDPYAPLIMGCDIAEEGG 329
           YG +SDV RV + G+FP+  +DSFI L  +E A  ++      +      +G D+A  G 
Sbjct: 227 YGKESDVARVRIFGEFPKGALDSFISLETVELATEKQISDSLVNKTTVAHIGVDVARYGD 286

Query: 330 DNTVVVLRRGPVIEHLFDWSK-TDLRTTN---NKISGLVEKY-RPDAIIIDANNT--GAR 382
           D+T++  R          +SK + + TT    N    L+ +Y   D ++I  ++T  G  
Sbjct: 287 DSTILFPRIATRALEYEKYSKRSTMETTGYVINMAKNLMSQYPSIDKVMIKVDDTGVGGG 346

Query: 383 TCDYLEML---GYHVYRVLGQKRAVDLE--FCRNRRTELHVKMADWLE------------ 425
             D LE L    ++ + V G       E  F  N  T+L   + + LE            
Sbjct: 347 VTDRLEELIEDKHYPFEVFGVNNGSTSEDDFYDNLGTQLWGNIKEMLEENMTANLNGEQP 406

Query: 426 FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
              L + S LI+ L S + F + +   + +ESK   + +   S D +D L   F E P
Sbjct: 407 VIELPSDSSLIKEL-STRKFKMTSRSRIRLESKDDMKKRNIGSPDIADALALAFYEPP 463


>gi|289578588|ref|YP_003477215.1| hypothetical protein Thit_1395 [Thermoanaerobacter italicus Ab9]
 gi|289528301|gb|ADD02653.1| conserved hypothetical protein [Thermoanaerobacter italicus Ab9]
          Length = 460

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 118/416 (28%), Positives = 183/416 (43%), Gaps = 45/416 (10%)

Query: 81  AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140
           A+ A  G+GKT + AW+ LW + T     VI  A +  Q++  LW E+            
Sbjct: 49  AVRACHGVGKTKVAAWVALWFLYTHHNSKVITTAPTWHQVENLLWREIH----------- 97

Query: 141 FEMQSLSLHPA---PWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197
                 + H A   P    VL   + +  + ++    T   ++P+ F G H  + + I+ 
Sbjct: 98  ------AAHAASRIPLGGKVLQTQIELGEQWFALGLST---DKPERFQGFHAEHILLIV- 147

Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI---D 254
           DEASG          GFLT   A    ++  NP +LSG+FY  F  PL  + +  I   D
Sbjct: 148 DEASGVEQYTFDAAEGFLTSIGAK--LLLIGNPTQLSGEFYNAFRSPL--YHKIHISAFD 203

Query: 255 TRTVEG--------IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
           +  ++         + P + E    ++G DS +    V G+FP+Q  D+ IPL  IE A 
Sbjct: 204 SPNLKAGKIVRPYLVTPEWVEDKRLKWGEDSPLWYSRVLGEFPEQGNDTLIPLAWIEAAQ 263

Query: 307 NREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEK 366
            R    +   P+ +G D+A  G D TV++LRRG   E ++     D      K+    +K
Sbjct: 264 QRWHMTEAGEPVEIGADVARYGTDTTVIMLRRGDKAEIVYQLRGQDTMEVTGKVIDAFKK 323

Query: 367 YRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF 426
              + I ID    GA   D L+  GY V  +   + A D     N+R E +  + +  + 
Sbjct: 324 TGANVIKIDVVGIGAGVVDRLKEQGYPVQGLNVGESATDKGRFVNKRAEWYWALRERFQE 383

Query: 427 ASLI--NHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
            ++       L   L SLK +   + G + IESK   R +G  S D +D LM  F+
Sbjct: 384 GTIAIPPDDELASQLASLK-YKFDSRGRIQIESKEELRRRGLPSPDKADALMLAFS 438


>gi|323486060|ref|ZP_08091391.1| hypothetical protein HMPREF9474_03142 [Clostridium symbiosum
           WAL-14163]
 gi|323400627|gb|EGA92994.1| hypothetical protein HMPREF9474_03142 [Clostridium symbiosum
           WAL-14163]
          Length = 476

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 125/430 (29%), Positives = 188/430 (43%), Gaps = 58/430 (13%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138
           K AI +G+G+GKT + A  +LW +   P   ++  A ++ QL   LW+EVSKW+S     
Sbjct: 52  KVAIKSGQGVGKTGMEAVALLWFLCCYPYPRIVATAPTKQQLHDVLWSEVSKWMS----- 106

Query: 139 HWFEMQSLSLHPAPWYSDVL-----HCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193
                       +P  SD+L     +  +  + K +  + RT +  +P+   G H    M
Sbjct: 107 -----------KSPLLSDILKWTKTYIYMVGNEKRWFAVARTAT--KPENMQGFHED-NM 152

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
             I DEASG  D I   ILG L+   AN   +M  NP R SG FY+ FN     ++   +
Sbjct: 153 LFIVDEASGVADPIMEAILGTLS--GANNKLLMCGNPTRTSGTFYDAFNVDRSIYRCHTV 210

Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE-PCP 312
            +   +  +    E +I +YG DS+V  V V G+FP+Q+ D FI L+I+E     + P  
Sbjct: 211 SSADSKRTNKQNIESLIRKYGKDSNVVLVRVFGEFPKQEDDVFIALSIVEHCCMLDLPDD 270

Query: 313 DPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE------- 365
            P   +  G D+A  G D TV+    G  I     +    L TT  KI  L         
Sbjct: 271 VPIKRISFGVDVARYGSDETVIAKNVGGRITLPVSFRGQSLMTTVGKIVQLYRQAITEFP 330

Query: 366 KYRPDAII-IDANNTGARTCDYLEML-----------------GYHVYRVLGQKRAVDLE 407
           +YR    I ID    G    D LE +                 G      LG  +    +
Sbjct: 331 RYRGKIYINIDDCGLGGGVTDRLEEVKQEEKLTRMVIVPVNAAGKVPEETLGDGKQKACD 390

Query: 408 FCRNRRTEL--HVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKG 462
              N  T L   VK A  +E  SL N + L+    + + + + + G++ +ESK   + +G
Sbjct: 391 IYDNMTTYLWGTVKDALMMEEVSLENDNELVAQF-TCRKYRLTSRGKMLLESKEEMKKRG 449

Query: 463 AKSTDYSDGL 472
             S D +D +
Sbjct: 450 IDSPDRADAV 459


>gi|319956916|ref|YP_004168179.1| hypothetical protein Nitsa_1177 [Nitratifractor salsuginis DSM
           16511]
 gi|319419320|gb|ADV46430.1| hypothetical protein Nitsa_1177 [Nitratifractor salsuginis DSM
           16511]
          Length = 462

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 121/426 (28%), Positives = 197/426 (46%), Gaps = 39/426 (9%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138
           K +I +G G GKTTL AW+VLW    R    +   A +  QL   L  E+ KW   +P +
Sbjct: 45  KISIRSGHGTGKTTLLAWIVLWWGLGREDAKIPMTAPTGHQLYDLLMPEIRKWREKMPVQ 104

Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198
           +  E++            V    +   + +++ + RT  +++P+   G H T  +A I D
Sbjct: 105 YQNEVE------------VKTEKIDFANGNFA-VPRTARKDQPEALQGFHAT-NLAFIID 150

Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTV 258
           EASG P VI     G +T    +   IM +NP R  G FY+  +K    W+ FQ +    
Sbjct: 151 EASGIPQVIFEVAEGAMT--GESTLVIMAANPTRTEGYFYDSHHKNRWQWECFQFNAEES 208

Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL 318
           E +   + E    +YG DSDV RV + G+FP+Q  ++   L  +++A  RE   D  A  
Sbjct: 209 ENVSKEWIEEKKRQYGEDSDVYRVRIKGEFPRQSSNAVFSLQEVDDATTREIVDDSGAE- 267

Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY-----RPDAII 373
           + G D+A+ G D +V+  R+G   +H  + +     T  +    L+ +Y     +P  I 
Sbjct: 268 VWGLDVADFGDDKSVLAKRKG---KHFHEITARSGLTLPDLAGWLIYEYNQAKRKPAVIF 324

Query: 374 IDANNTGARTCDYLEMLGYH-VYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINH 432
           +DA   G+         G   V  V G   A + E   N+R E +  + D LE   + + 
Sbjct: 325 VDAIGIGSSLPAVCFEKGLDIVIGVKGSNSASNSEKYHNKRAEWYYNLKDLLEDGKIPDD 384

Query: 433 SGLIQNLKSLKSFIVPNTGELA-IESKRVKG--AKSTDYSDG-------LMYTFAENP-- 480
             L+  L + K + + +TG++  +E K +K    +S D +D        ++Y   EN   
Sbjct: 385 DELVGELMAQK-YQISSTGKIQLVEKKEIKKELGRSPDKADACALTCERMIYVEEENDDI 443

Query: 481 PRSDMD 486
           P +DM+
Sbjct: 444 PEADME 449


>gi|160940775|ref|ZP_02088117.1| hypothetical protein CLOBOL_05669 [Clostridium bolteae ATCC
           BAA-613]
 gi|158436295|gb|EDP14062.1| hypothetical protein CLOBOL_05669 [Clostridium bolteae ATCC
           BAA-613]
          Length = 484

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 89/265 (33%), Positives = 133/265 (50%), Gaps = 39/265 (14%)

Query: 81  AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWL-------- 132
           ++ +G GIGK+ + AW V+W M TRP   + C A +E QL   LWAE+SKW+        
Sbjct: 44  SVRSGHGIGKSAVEAWSVIWYMCTRPFPKIPCTAPTEHQLMDVLWAEISKWMRNNPALRD 103

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
            L+  K    MQ    HP  W++                + RT +   P+   G H  + 
Sbjct: 104 DLIWTKEKLYMQG---HPEEWFA----------------VPRTATN--PEALQGFHAEHV 142

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
           + II DEASG  D +   +LG +T  +A    +M  NP RL+G FY+  ++  + +    
Sbjct: 143 LYII-DEASGVSDKVFEPVLGAMTGEDAK--LLMMGNPTRLAGFFYDSHHRNREQYSAIH 199

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCP 312
           +D R  + +  +F + II  +G DSDV RV V GQFP+   DS I +   EEA N +   
Sbjct: 200 VDGRDSQHVSRTFVQKIIDMFGEDSDVFRVRVAGQFPKSTPDSLIAMEWCEEAANLQ--- 256

Query: 313 DPYAP---LIMGCDIAEEGGDNTVV 334
             YAP   + +G D+A  G D++ +
Sbjct: 257 -VYAPGGQIDIGVDVARYGDDSSAL 280


>gi|266623290|ref|ZP_06116225.1| putative terminase B protein [Clostridium hathewayi DSM 13479]
 gi|288864932|gb|EFC97230.1| putative terminase B protein [Clostridium hathewayi DSM 13479]
          Length = 484

 Score =  126 bits (317), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 84/270 (31%), Positives = 137/270 (50%), Gaps = 31/270 (11%)

Query: 81  AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKH- 139
           ++ +G G+GK+ + +W V+W + TRP   + C A ++ QL   LWAE+SKWL   P    
Sbjct: 44  SVRSGHGVGKSAVESWSVIWFLCTRPFPKIPCTAPTQHQLYDILWAEISKWLRNNPELKN 103

Query: 140 ---WFEMQS-LSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
              W + +  ++ +P  W++                + RT +   P+   G H  + + I
Sbjct: 104 DIIWTQQRVYMNGYPEEWFA----------------VPRTATN--PEALQGFHAEHVLYI 145

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT 255
           I DEASG  D +   +LG +T  +A    +M  NP RLSG F++  +K   ++    ID 
Sbjct: 146 I-DEASGVSDKVFEPVLGAMTGEDAK--LLMMGNPTRLSGFFFDSHHKSRSEYSAMHIDG 202

Query: 256 RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315
           R  + ++  F + II  +G+DSDV RV V GQFP+   DS I ++  E A   +P     
Sbjct: 203 RDSQHVNQKFVQKIINMFGMDSDVFRVRVAGQFPKSTPDSLIMMDWCEAATQLKP-ETVR 261

Query: 316 APLIMGCDIAEEGGDNTVVVLRRGPVIEHL 345
             + +G D+A  G D++ +     PVI+ +
Sbjct: 262 NRVDIGVDVARYGDDSSALY----PVIDKV 287


>gi|308069786|ref|YP_003871391.1| hypothetical protein PPE_03030 [Paenibacillus polymyxa E681]
 gi|305859065|gb|ADM70853.1| Conserved hypothetical protein [Paenibacillus polymyxa E681]
          Length = 452

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 134/463 (28%), Positives = 202/463 (43%), Gaps = 72/463 (15%)

Query: 51  PRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISV 110
           P  WQ   +       ++  NNP     + ++ +G+G+GKT L A   LW +S  P   V
Sbjct: 6   PDDWQASTL-------MDLANNP-----RVSVRSGQGVGKTGLEAATALWFLSCFPYPKV 53

Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170
           IC A +  QL   LWAE++KW S  P         +      W    ++     + + ++
Sbjct: 54  ICTAPTRQQLHDVLWAEINKWQSKSP---------VLKRILKWTKTKIYMK-NYEERWFA 103

Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230
           T  RT +  +P+   G H  Y M  I DEASG  D I   ILG L+    N+  +M  NP
Sbjct: 104 T-ARTAT--KPENMQGLHEDY-MLFIVDEASGVADPIMEAILGTLSGE-FNKI-LMCGNP 157

Query: 231 RRLSGKFYEIFNKPLDDWKRFQIDTRTVEGID-PSFHEGIIA----RYGLDSDVTRVEVC 285
            + SG FY+  NK   D+K     TR V  +D P   +  IA    +YG  SDV RV V 
Sbjct: 158 TKTSGVFYDSHNKDRADYK-----TRKVSCLDSPRTSKDNIAMLKRKYGEGSDVWRVRVE 212

Query: 286 GQFPQQDIDSFIPLNIIEEA---LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVI 342
           G+FP+   D+FI L + E A   +  EP  D    L +G D+A  G D T +    GP I
Sbjct: 213 GEFPRGGSDTFISLEVAEFAAKEVKLEPTGD---MLTIGVDVARFGDDETSMFAGIGPRI 269

Query: 343 EHLFDWSKTDLRTTNNKISGLVEKYRPD-------AIIIDANNTGARTCDYL------EM 389
                  K     T   +  L ++ +          I +D +  G    D L      E 
Sbjct: 270 VGEHHHFKKGTMVTAGWVINLAKELQVAHPYLNRIRIRVDDSGVGGGVTDRLSEIVAEEG 329

Query: 390 LGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE--FASLINHSGLIQNLK------- 440
           L Y +  +     ++D E   N  TE+   + + LE   ++ +N    I  L        
Sbjct: 330 LPYEIIPINNGSSSLD-EHYGNLVTEMWASIKEQLEQNMSNFMNGDSSILQLPDDDVLIT 388

Query: 441 --SLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAE 478
             + + + + + G++ +ESK   + +G KS D +D  + TF E
Sbjct: 389 QLTARKWNMTSKGKMLLESKKDMKKRGLKSPDRADAFVLTFGE 431


>gi|255282256|ref|ZP_05346811.1| conserved hypothetical protein [Bryantella formatexigens DSM 14469]
 gi|255267204|gb|EET60409.1| conserved hypothetical protein [Bryantella formatexigens DSM 14469]
          Length = 506

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 81/257 (31%), Positives = 124/257 (48%), Gaps = 19/257 (7%)

Query: 81  AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140
           A+ +G+G+GKT + A  VLW +S      V+  A +  QL   LW+E++KW    P    
Sbjct: 68  AVKSGQGVGKTGIEAVAVLWFLSCFRYARVVATAPTRQQLHDVLWSEIAKWQERSP---- 123

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
                L      W    ++   G + K +  + RT +  +P+   G H    M  I DEA
Sbjct: 124 -----LLKAILRWTKTYVYVK-GYE-KRWFAVARTAT--KPENMQGFHED-NMLFIVDEA 173

Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEG 260
           SG  D I   +LG L+    N   +M  NP R +G FY+ F K    +    + +     
Sbjct: 174 SGVADPIMEAVLGTLS--GGNNKLLMCGNPTRTTGTFYDAFTKDRSIFACHTVSSLDSSR 231

Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE---PCPDPYAP 317
            D +  + +I +YG DS++ RV V G FP+QD D FI   +I++  +R+   P     A 
Sbjct: 232 TDKNNIDALIRKYGEDSNLVRVRVKGLFPKQDDDVFISQELIDQCTSRQYELPESRGMAQ 291

Query: 318 LIMGCDIAEEGGDNTVV 334
           +I+G D+A  G D TV+
Sbjct: 292 VILGVDVARYGNDETVI 308


>gi|307308936|ref|ZP_07588619.1| hypothetical protein SinmeBDRAFT_4503 [Sinorhizobium meliloti
           BL225C]
 gi|306900570|gb|EFN31183.1| hypothetical protein SinmeBDRAFT_4503 [Sinorhizobium meliloti
           BL225C]
          Length = 472

 Score =  117 bits (293), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 114/427 (26%), Positives = 188/427 (44%), Gaps = 46/427 (10%)

Query: 76  EVFKG----AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKW 131
           E FK      +    G GKT ++A  + W +     + V   A SE+ +K+ +W E    
Sbjct: 42  EAFKNNQTITVKGSSGWGKTFISAISLWWSLIVFDPVKVTIFAPSESTIKSGIWNE---- 97

Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSL-GIDSKHYSTMC----RTYSEERPDTFVG 186
                      +Q L  + AP + ++   S   I  K     C    R  S++      G
Sbjct: 98  -----------LQVLYSNMAPLFRELFEVSATKIFRKSRGETCWAEYRLVSKDNIAAARG 146

Query: 187 HHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP-- 244
            H+   + +I DEASG  DVI  G L  +         ++ SNP + SG F++ +  P  
Sbjct: 147 FHSKNNI-VIADEASGIEDVIFTGALLNVLNDGPGAKVVLVSNPDKASGFFFKTWRDPEL 205

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303
             DW +     R      P   E     YG + S      V G+FP  D+D  I    ++
Sbjct: 206 SKDWIKVHGSIRDKPNYTPGEEERFARLYGGVTSRDYLTLVEGEFPLSDVDGLISREFLD 265

Query: 304 EAL-NREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           EA+ N++  P+P AP+I G D A  G D +V+ +R   V+    +W+  +      ++  
Sbjct: 266 EAVTNKDAIPNPKAPIIWGLDPAGAGKDKSVLAIRHDNVLRGFEEWAGLEPVALALRVKE 325

Query: 363 LV----EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQ---KRAVDLEFCRNRRTE 415
           L     +K RP  I +D N  GA   D L+     VY+ +     KR  D  + R  R +
Sbjct: 326 LYLKTSKKDRPAVIAVDGNGLGAGVYDALKHFKIPVYKCMFAEVPKRNPD-RYTR-VRDQ 383

Query: 416 LHVKMADWLEFA--SLINHSGLIQNLKSLKSFIVPNTGELAIESKRV---KGAKSTDYSD 470
           +  +M +W+     S+ NH  LI++L ++ ++   ++ ++ IE K+    +  +S DY+D
Sbjct: 384 IWFEMREWIHTGDVSIPNHKKLIEDL-AIPTY--EDSPKIKIEDKKSLKKRLGRSPDYAD 440

Query: 471 GLMYTFA 477
            L  TF+
Sbjct: 441 ALALTFS 447


>gi|253578914|ref|ZP_04856185.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA]
 gi|251849857|gb|EES77816.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39BFAA]
          Length = 473

 Score =  117 bits (292), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 82/264 (31%), Positives = 134/264 (50%), Gaps = 22/264 (8%)

Query: 74  NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133
           NP+V   +I +G+G+GKT L A + LW ++  P   ++  A ++ QL   LW+E+SKW+S
Sbjct: 32  NPKV---SIKSGQGVGKTGLEAAVFLWFVTCFPHPRIVATAPTKQQLHDVLWSEISKWMS 88

Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193
                   E+ S+ L     Y  ++      + K +  + RT +  +P+   G H    M
Sbjct: 89  K------SELLSILLKWTKTYVYMVG-----EEKRWFGVARTAT--KPENMQGFHED-NM 134

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
             I DEASG  D I   ILG L+   AN   ++  NP + SG FY+   +    +K   +
Sbjct: 135 LFIVDEASGVADPIMEAILGTLS--GANNKLLLCGNPTKTSGTFYDSHTRDRALYKCHTV 192

Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR---EP 310
            +      +    + ++ +YG DS+V RV V G+FP Q+ D FIPL++IE+  ++     
Sbjct: 193 SSMDSTRTNKENIDSLVRKYGWDSNVVRVRVRGEFPNQEDDVFIPLSLIEQCSSKLLELD 252

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVV 334
             D    + +G D+A  G D T++
Sbjct: 253 DADGMQFVSLGVDVARFGDDETII 276


>gi|167767949|ref|ZP_02440002.1| hypothetical protein CLOSS21_02492 [Clostridium sp. SS2/1]
 gi|167710278|gb|EDS20857.1| hypothetical protein CLOSS21_02492 [Clostridium sp. SS2/1]
 gi|291560988|emb|CBL39788.1| hypothetical protein CL2_30180 [butyrate-producing bacterium SSC/2]
          Length = 473

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 95/319 (29%), Positives = 141/319 (44%), Gaps = 24/319 (7%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138
           K  I +G+G+GKT   A  +LW +S      V+  A +  QL   LWAEVSKW S  P  
Sbjct: 49  KVTIKSGQGVGKTGFEAATLLWFLSCFENARVVATAPTLHQLNDVLWAEVSKWQSKSP-- 106

Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198
                    L     ++      +G   + Y+ + RT +   P+   G H    M  I D
Sbjct: 107 --------LLKEILQWTKTKISMIGSKERWYA-VARTAT--TPENMQGFHED-NMLFIVD 154

Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTV 258
           EASG  D I   ILG LT   +N   ++  NP + SG FY+        +    +++   
Sbjct: 155 EASGVADPIMEAILGTLT--GSNNKLLLCGNPTKASGTFYDSHTSDRKLYYCITVNSAES 212

Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL 318
           +  +    + +I +YG +S+V RV V G FP+QD D ++PL ++E ++  E  P P    
Sbjct: 213 KRTNKDNIDSLIRKYGEESNVVRVRVKGLFPKQDDDVYMPLEMLEASIILEEIP-PADIC 271

Query: 319 IMGCDIAEEGGDNTVVVLRRG-----PVIEHLFDWSKT--DLRTTNNKISGLVEKYRPDA 371
            +G D+A  G D+TV+            I H  D  KT  D+      I    +  +   
Sbjct: 272 TLGVDVARFGDDDTVIARNMNNKITLEKIRHGQDLMKTVGDVVVECRNIKEKFKYKKTIY 331

Query: 372 IIIDANNTGARTCDYLEML 390
           +IID    G    D L  L
Sbjct: 332 VIIDDTGLGGGVTDRLNEL 350


>gi|332980681|ref|YP_004462122.1| hypothetical protein Mahau_0077 [Mahella australiensis 50-1 BON]
 gi|332698359|gb|AEE95300.1| hypothetical protein Mahau_0077 [Mahella australiensis 50-1 BON]
          Length = 486

 Score =  106 bits (265), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 113/442 (25%), Positives = 178/442 (40%), Gaps = 64/442 (14%)

Query: 79  KGAISAGRGIGKTTLNAWLVLW-LMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137
           + A+ +  G GK+ +   ++LW L S  P I V+  A +  Q++  +W EV         
Sbjct: 46  RTAVRSCHGAGKSFIAGQVILWFLYSFYPSI-VLSTAPTWRQVEKLIWKEVRA------- 97

Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197
                  S      P   ++L     I            S   PD F G H    + ++ 
Sbjct: 98  -------SYRRSKVPLGGNLLPKRPEIQIIQDEWYAVGLSTNEPDRFQGFHEE-NILVVV 149

Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRT 257
           DEA+G P+ I   I G LT  +A    ++  NP  + G FY  F  P   W+   I   T
Sbjct: 150 DEAAGVPEEIFEAIEGVLTSEHAR--LLLLGNPTSVGGTFYNAFRTP--GWENISISAFT 205

Query: 258 VEG-----------------------------IDPSFHEGIIARYGLDSDVTRVEVCGQF 288
                                           I P++      R+G +S   +  V GQF
Sbjct: 206 TPNFTAFGITEDDIINKTWESKITNSLPNPKLITPAWVADKYRRWGPNSPAYQARVLGQF 265

Query: 289 PQQDIDSFIPLNIIEEALNR-EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347
           P +  D+ IPL  IE A+ R E  P+   P+ +G D+A  G D TV+  RRG  +  L  
Sbjct: 266 PSEGEDTLIPLAWIEAAMARWEDTPE-GEPIEIGVDVARFGSDKTVIAARRGQKVLPLNV 324

Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLE 407
           ++K D   T   I  +  K       +D    GA   D L+  G+ V  +   + A D E
Sbjct: 325 YAKQDTMETVGCIIMVHRKIGASKTKVDVIGVGAGVVDRLKEQGHPVIGINVAEAATDTE 384

Query: 408 FCRNRRTELHVKMADWLEFASLIN--------HSGLIQNLKSLKSFIVPNTGELAIESK- 458
              N R+EL   M + L+    +N           L+ +L  +K + + + G + +ESK 
Sbjct: 385 KFANLRSELWWNMRELLDPNQRLNPEPIALPPDDELLADLSGVK-YKIDSRGRIQVESKE 443

Query: 459 --RVKGAKSTDYSDGLMYTFAE 478
             + +  +S D +D ++  FA+
Sbjct: 444 DMKKRLGRSPDRADAVVLAFAK 465


>gi|315122636|ref|YP_004063125.1| putative phage terminase, large subunit [Candidatus Liberibacter
           solanacearum CLso-ZC1]
 gi|313496038|gb|ADR52637.1| putative phage terminase, large subunit [Candidatus Liberibacter
           solanacearum CLso-ZC1]
          Length = 301

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/170 (35%), Positives = 90/170 (52%), Gaps = 8/170 (4%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQ----L 56
           M+     N E +  L   + S  I  +   F  + + WGE+GTPL     PR+WQ    L
Sbjct: 1   MNATFQPNIEYDTALLQNVLSPAIAGNPLAFTKYMYRWGEEGTPLANCKGPRAWQTEVFL 60

Query: 57  EFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANS 116
           E  E ++ +          +VFK AI++ RGIGKT L AW+  W +STR G +V+  ANS
Sbjct: 61  ELAEFIEKNKEAKRLGKPLQVFKLAIASARGIGKTALVAWITYWFLSTRIGCTVVISANS 120

Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQS----LSLHPAPWYSDVLHCSL 162
           + Q KTT +AE+ +W SL  N H+FE       L+   +PW ++ +  +L
Sbjct: 121 DDQCKTTSFAEIRRWHSLAKNAHFFEANIAEALLAGGCSPWQAEPVAKTL 170


>gi|83593922|ref|YP_427674.1| hypothetical protein Rru_A2590 [Rhodospirillum rubrum ATCC 11170]
 gi|83576836|gb|ABC23387.1| hypothetical protein Rru_A2590 [Rhodospirillum rubrum ATCC 11170]
          Length = 505

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 121/468 (25%), Positives = 183/468 (39%), Gaps = 72/468 (15%)

Query: 75  PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSL 134
           P   K  + AG G+GKTT  A  + W +         C A + +QL+  LW+E+++    
Sbjct: 34  PAGAKVTVRAGHGVGKTTATAAAIWWHLECFDYSKTPCTAPTASQLEQILWSELAR---- 89

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGI------DSKHYSTMCRTYSEERPDTFVGHH 188
           L  +     Q   L PA    + L    G         + +  + RT   ++PD   G H
Sbjct: 90  LRRRADARAQGTGL-PAALRLEALFAVSGRAIADRGTPREWFVVARTARRDQPDALQGFH 148

Query: 189 ----------------NTYGMAI--INDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230
                            + G A+  + +EASG PD +     G L+   A    +M  NP
Sbjct: 149 ASDIDLEAGAGPRLSAKSGGAALMFVIEEASGVPDAVFEVAEGALSSPGAR--LLMVGNP 206

Query: 231 RRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290
            R +G F     +    +   ++       +DP +  G++ +YG +S+V RV   G FP+
Sbjct: 207 TRNTGFFARSHKRDRASFTALRLRCADSPLVDPGYRAGLVRKYGAESNVVRVRADGAFPR 266

Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAP---LIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347
           QD D  I L   E AL R P P   A      +G D+A  G D TV +LR GPV+  +  
Sbjct: 267 QDDDVLIALETAEAALAR-PLPARMATEDERRLGVDVARFGDDRTVFLLRIGPVVGAIEV 325

Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG----YHVYRVLGQKRA 403
            +  D      +   L E +R   I +D    GA   D L   G             +RA
Sbjct: 326 TAGRDTMAVAGRARRLAEIWRAGRIYVDEIGVGAGVVDRLREDGAPVVAVNVAASAPERA 385

Query: 404 VDLEFCRNRRTELHVKMADWLE-----------------FASLINHSG----------LI 436
              E  R  R  L + +  WL                   A L++  G          L 
Sbjct: 386 AGEERGRLLRDHLWLMVRGWLRDEAPVFAGPGGGPASGSAAGLLSGMGSCLVPGVDADLA 445

Query: 437 QNLK---SLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAE 478
           Q+L    +   +    +G + +ESK   + +G +S D +D L  TF E
Sbjct: 446 QDLAGELATPRYAFDGSGRVVVESKDAMKRRGLRSPDLADALALTFHE 493


>gi|262316909|emb|CBA18135.1| putative terminase B [Paenibacillus phage phiBP]
          Length = 248

 Score = 87.4 bits (215), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 64/208 (30%), Positives = 96/208 (46%), Gaps = 16/208 (7%)

Query: 81  AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140
           ++ +G+G+GKT L A + LW +   P   V+C A +  QL   LWAE+SKW S  P    
Sbjct: 57  SVRSGQGVGKTALEAAISLWFLCCFPFPRVVCTAPTRQQLNDVLWAEISKWQSQSP---- 112

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
                +      W    ++     + + ++T  RT +  +P+   G H  Y M  I DEA
Sbjct: 113 -----ILKRILKWTKTKIYMK-NYEERWFAT-ARTAT--KPENMQGFHEDY-MLFIVDEA 162

Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEG 260
           SG  D I   I G L+      F  M  NP + SG F++  N+    ++  ++       
Sbjct: 163 SGVDDRIMAAIFGTLSGDYNKLF--MCGNPTKTSGFFFDSHNRDRAIYRTHRVSCLDSPR 220

Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQF 288
                 E + A+YG  SDV RV V G+F
Sbjct: 221 TSKENIEMLKAKYGEGSDVWRVRVLGEF 248


>gi|48697461|ref|YP_024846.1| Pas60 [Actinoplanes phage phiAsp2]
 gi|47679679|gb|AAT36808.1| Pas60 [Actinoplanes phage phiAsp2]
          Length = 492

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 92/361 (25%), Positives = 149/361 (41%), Gaps = 37/361 (10%)

Query: 50  APRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGIS 109
           +P +W  + ++V  A     + +  P   + A+    G+GK+   A LV W  +TR  + 
Sbjct: 22  SPTAWAADCLDVRLAGYQGEILDAVPRERRVAVRGPHGLGKSFSGAILVNWFATTRDLMG 81

Query: 110 ----VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGID 165
               +I  A++   L+  LW E+ KW           +  ++L  AP+        L + 
Sbjct: 82  KDWKIITTASAWRHLEVYLWPEIHKWAG--------RINFVALGRAPYNPRTELLDLRLK 133

Query: 166 SKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA----N 221
             H +      +  +P+   G H    + ++ DEA   P      I G  +        N
Sbjct: 134 LTHGAATA--VASNQPERIEGAHAEELLYLL-DEAKIVPPATWDSIEGAFSNAGVDVADN 190

Query: 222 RFWIMTSNPRRLSGKFYEIFNKP--LDDW--KRFQIDTRTVEG-IDPSFHEGIIARYGLD 276
            +    S P   SG+FY+I  +    +DW  +   ++     G I  ++ +   +++G D
Sbjct: 191 AYAFAMSTPGAPSGRFYDIHRRAPGYEDWWTRHVTLEEAIASGRISRAWADQRRSQWGSD 250

Query: 277 SDVTRVEVCGQFPQQDIDSFIPLNIIEEAL------NREPCPDPYAPLIMGCDIAEEGGD 330
           S V    V G+F   D DS IPL  +E A+      +R+  P P  PL  G D+   GGD
Sbjct: 251 SAVFHNRVLGEFHASDEDSVIPLAWLEAAIERWHEWDRQGRPSPGGPLWTGVDVG-RGGD 309

Query: 331 NTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390
            TV+  R G  +       +T+ R       GL++  R    IID    GA   D L  L
Sbjct: 310 ETVLAARDGWAVT-----LETNRRRDTMATVGLIQA-REGRAIIDVIGLGAGVFDRLREL 363

Query: 391 G 391
           G
Sbjct: 364 G 364


>gi|228924410|ref|ZP_04087639.1| hypothetical protein bthur0011_53510 [Bacillus thuringiensis
           serovar huazhongensis BGSC 4BD1]
 gi|228835241|gb|EEM80653.1| hypothetical protein bthur0011_53510 [Bacillus thuringiensis
           serovar huazhongensis BGSC 4BD1]
          Length = 293

 Score = 87.0 bits (214), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 79/280 (28%), Positives = 125/280 (44%), Gaps = 32/280 (11%)

Query: 226 MTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC 285
           +  NP R SG FY+  N+  D +K  ++ +           E +  +YG  SDV RV V 
Sbjct: 3   LCGNPTRTSGVFYDSHNRDRDLYKIHKVSSLDSPRTSKDNIEVLKKKYGEGSDVWRVRVL 62

Query: 286 GQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHL 345
           G+FP+ + D+FIPL I+E+A + +  P     L +G D+A  G D TV+  R G  +  L
Sbjct: 63  GEFPKAEADAFIPLEIVEQAASCKVEPTGET-LDLGVDVARFGDDETVIAPRIGNKVFKL 121

Query: 346 FDWSKTDLRTTNNKISGLVEKY--------RPDAIIIDANNTGARTCDYL------EMLG 391
            +  K D   T   +  L ++Y        R D I +D +  G    D L      E L 
Sbjct: 122 LNHYKQDTMETAGHVLKLAKEYMAKYKQLKRVD-IKVDDSGVGGGVTDRLKEVIKSERLP 180

Query: 392 YHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE------------FASLINHSGLIQNL 439
           + VY V+   + +D E   N   E    + D LE               + N   +I   
Sbjct: 181 FKVYPVVNNGKPLDDEHYDNAGAEGWAVVRDLLEENMKAFIQGEEPTMEIPNDEKMISQF 240

Query: 440 KSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTF 476
            S K + + + G++A+E K   + +G +S D +D ++  F
Sbjct: 241 SSRK-YRITSRGKIALERKEEMKKRGLQSPDRADAIVLAF 279


>gi|292670767|ref|ZP_06604193.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541]
 gi|292647388|gb|EFF65360.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541]
          Length = 442

 Score = 84.0 bits (206), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 58/202 (28%), Positives = 94/202 (46%), Gaps = 5/202 (2%)

Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY--APLIMGCDIAEEGGDNTVVVLRR 338
           R E+   F     D  IP++++  A NR    D     P+I+G D+A  G D TV+ +R+
Sbjct: 214 RQELLCDFTASASDVVIPIDLVTAAANRLLKDDDVLGQPVILGVDVARFGDDRTVLCVRQ 273

Query: 339 GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVL 398
           G  ++ +  ++      T +++   + ++ P A  IDA   GA   D L  L Y V  V 
Sbjct: 274 GLWLKEVRTFTGLSTMETASRVIDCINQHHPHATFIDAGAMGAGVIDRLRQLRYQVSEVN 333

Query: 399 GQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK 458
             + A+D     N R E++ K   WLE    I  +  ++   S   +    TG + +E K
Sbjct: 334 FGEMAMDAARYANIRAEMYFKCRAWLEAGGAIPQNAELKTELSTVEYKFNPTGRIILEPK 393

Query: 459 ---RVKGAKSTDYSDGLMYTFA 477
              + +  KS D +DG + TFA
Sbjct: 394 DKLKERTGKSPDLADGFVLTFA 415


>gi|315649222|ref|ZP_07902312.1| hypothetical protein PVOR_28644 [Paenibacillus vortex V453]
 gi|315275441|gb|EFU38799.1| hypothetical protein PVOR_28644 [Paenibacillus vortex V453]
          Length = 189

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 66/223 (29%), Positives = 99/223 (44%), Gaps = 45/223 (20%)

Query: 15  LFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPN 74
           L DL W D +  +F+  ++ F               P  WQ + M       ++    P 
Sbjct: 11  LLDLYWDDPV--AFAEDMMGF--------------DPDDWQCDVM-------MDVTQFP- 46

Query: 75  PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSL 134
               + ++ +G+G+GKT L A LV+W +  RP   V+C A ++ QL   LW EVSKWL  
Sbjct: 47  ----RTSVRSGQGVGKTGLEAALVIWFLCCRPNPKVVCTAPTKQQLHDVLWTEVSKWLE- 101

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
                     S+  +   W    ++  +G + + ++T     +  +P+   G H  Y M 
Sbjct: 102 ---------NSMVKNLLKWTKTKVY-MIGHEQRWFAT---ARTANKPENMQGFHEDY-ML 147

Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
            I DEASG  D I   ILG L+   A    +M  NP R SG F
Sbjct: 148 FIVDEASGVSDPIMEAILGTLS--GAENKLLMCGNPTRTSGVF 188


>gi|257459276|ref|ZP_05624390.1| phosphatase, Ppx/GppA family [Campylobacter gracilis RM3268]
 gi|257443289|gb|EEV18418.1| phosphatase, Ppx/GppA family [Campylobacter gracilis RM3268]
          Length = 431

 Score = 77.0 bits (188), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 66/256 (25%), Positives = 113/256 (44%), Gaps = 10/256 (3%)

Query: 236 KFYEIFNKPLDD---WKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTRVEVCGQFPQQ 291
           KF+++  + + +   W+ FQ  +     +     + ++A  G  DSDV R E+ G+F   
Sbjct: 161 KFFDLAQRGMRNEKGWRNFQFSSYDNPLLQKEEIDRLVAELGGADSDVARQEIFGEFLDT 220

Query: 292 DIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT 351
             +S   L  IE A  ++   D  AP+I   D+A EG D +V+  R+G  +E L  +   
Sbjct: 221 TSNSVFSLAAIEAAFRKQRYFDAGAPVIWALDVAREGDDESVLCKRQGDSVEPLKPYRIA 280

Query: 352 DLRTTNNKISGLVEK--YRPDAIIIDANNTGARTCDYLEMLGYH--VYRVLGQKRAVDLE 407
                  +I G  E+   +P AI ID    GA   D L  LG    V    G  +A D  
Sbjct: 281 STSELAREIYGEYERTDLKPHAIYIDTIGVGAGVFDTLCDLGLRGIVREAKGSFKASDER 340

Query: 408 FCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKG--AKS 465
              N+R E++  + + L   ++     L + L+++  +       L +  + +K    +S
Sbjct: 341 KYANKRAEMYFNLREKLPLLAIAPDEELKRQLQTIAFYFDKKERYLLMPKEGIKKEYGRS 400

Query: 466 TDYSDGLMYTFAENPP 481
            D +D L  +F +  P
Sbjct: 401 PDRADALAMSFFDLCP 416


>gi|226940459|ref|YP_002795533.1| Terminase large subunit [Laribacter hongkongensis HLHK9]
 gi|226715386|gb|ACO74524.1| Terminase large subunit [Laribacter hongkongensis HLHK9]
          Length = 272

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 103/243 (42%), Gaps = 8/243 (3%)

Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
           W   QID+RTVEG +          YG +SD  +V V G FP      FI    +  A  
Sbjct: 14  WVARQIDSRTVEGTNKEQIAKWAEDYGEESDFFKVRVRGMFPSMSARQFISETDVSAAYG 73

Query: 308 REPCPD--PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365
           R   P+   YAP I+  D A EG D  V+ LR+G     L   +K D      ++    E
Sbjct: 74  RALRPEQYQYAPKILTVDPAWEGDDEFVIGLRQGLSFRVLHTMAKNDNDLVAAQVIARYE 133

Query: 366 KYR-PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL 424
                DA+ +DA   G       + +G     V     ++D   C N+R E+     DWL
Sbjct: 134 DEEGADAVFVDA-GFGTGIVSAGKSMGRDWTLVWFAGNSMDAG-CLNKRAEMWRDARDWL 191

Query: 425 EFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPP 481
           +    I    ++++       +    G++ IESK   + +G  S + +D L+ +FA    
Sbjct: 192 KSGGAIPDDPVLRDELQAPEIVPRLDGKIQIESKKEMKARGVPSPNRADALILSFAYPVT 251

Query: 482 RSD 484
           R D
Sbjct: 252 RRD 254


>gi|154175204|ref|YP_001409090.1| Ppx/GppA family phosphatase [Campylobacter curvus 525.92]
 gi|112803006|gb|EAU00350.1| phosphatase, Ppx/GppA family [Campylobacter curvus 525.92]
          Length = 433

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 70/258 (27%), Positives = 121/258 (46%), Gaps = 24/258 (9%)

Query: 236 KFYEIFNKPL---DDWKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTRVEVCGQFPQQ 291
           +F+++ ++ +    DW  FQI +     +     + +IA  G +DSDV + E+ G+F   
Sbjct: 161 RFFDLASRGMRNEKDWVNFQISSFENPLLRKEEIDELIAELGGVDSDVVKQEIYGEFLDT 220

Query: 292 DIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--- 348
             ++  PL+ IE A  +    +P A  I G D+A +G D +V+ +R G  +++L  +   
Sbjct: 221 TTNALFPLSQIEAAFGKVRAYEPNAVQIWGLDVARDGDDESVLCVREGYHVKNLEGFRIA 280

Query: 349 SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EM-LGYHVYRVLGQKRAVDL 406
           S T+L     +   + EK +P+AI ID+   GA T D L E  LG          +A + 
Sbjct: 281 STTELAREIYRRYEMSEK-KPEAIFIDSVGVGAGTFDRLCEFGLGAICREAKASYKATNE 339

Query: 407 EFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSL--------KSFIVPNTGELAIESK 458
               N+R E++  + +     ++  H  L + L+ +        +  I+P       E K
Sbjct: 340 AKFANKRAEMYFALKEKFHLLTMNAHEKLKKQLQMIEFQYDRKERYLILPKD-----ELK 394

Query: 459 RVKGAKSTDYSDGLMYTF 476
           +  G  S DY+D L  TF
Sbjct: 395 KEYGT-SPDYADALALTF 411


>gi|119386463|ref|YP_917518.1| PBSX family phage terminase large subunit [Paracoccus denitrificans
           PD1222]
 gi|119377058|gb|ABL71822.1| phage terminase, large subunit, PBSX family [Paracoccus
           denitrificans PD1222]
          Length = 441

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 60/206 (29%), Positives = 92/206 (44%), Gaps = 19/206 (9%)

Query: 286 GQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRG------ 339
           G +  +    FI   ++ EA+ R+P       L++G D+A  G D +V+  RRG      
Sbjct: 214 GDYEAESDMQFIGGGLVREAMARQPFSQIGDELVLGVDVARFGDDRSVIWARRGRDAQTE 273

Query: 340 -PVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRV- 397
            P+I         D      ++   +++  PD + ID    G    D    +GY V  V 
Sbjct: 274 LPIIMK-----GADTMAVAARVMAEIDRLHPDGVFIDEGGVGGGVIDRCRQMGYSVVGVN 328

Query: 398 LGQK--RAVD-LEFCRNRRTELHVKMADWLEFASLINHS-GLIQNLKS-LKSFIVPNTGE 452
            G K  RA++ +  CRN+R ++   M +WL     I  S  L  +L   L SF V N  E
Sbjct: 329 FGGKADRAIEGVPKCRNKRAQMWATMREWLRSGGCIPDSRDLEMDLTGPLYSFDVNNAIE 388

Query: 453 LAIESK-RVKGAKSTDYSDGLMYTFA 477
           +  +S  + +G  S D +D L  TFA
Sbjct: 389 IEKKSDMKKRGVSSPDEADALALTFA 414


>gi|56266666|gb|AAV84947.1| DNA pacase B subunit [Enterobacteria phage D6]
          Length = 502

 Score = 74.7 bits (182), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 87/345 (25%), Positives = 144/345 (41%), Gaps = 44/345 (12%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138
           +  +++G G GK++L A L+L  M   P   VI +AN   Q+KT ++  V ++ +    +
Sbjct: 56  RTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQYWANAVKR 115

Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198
           H +      L    +Y        GI    +  +C+ Y     +   G H  + + +I D
Sbjct: 116 HGWLQTYFVLSDTMFYE---RSRKGI----WEVLCKGYRLGNEEALAGEHAAH-LLLILD 167

Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF-------NKPLDDWKRF 251
           EASG  D     + G LTE + NR  +M S P R SG FY+         + P   W   
Sbjct: 168 EASGISDKAIGVMTGALTEED-NRM-LMLSQPTRPSGYFYDSHHSQAKTPDNPKGIWTAI 225

Query: 252 QIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
            +++     + P F +  +  Y G DS    V+V GQFP++     +  +  + A  R+ 
Sbjct: 226 VLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECDRAARRKV 285

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI---SGLV--- 364
             +     +   D+   G D +V+ + +  V  H     +   R  N K+   SG +   
Sbjct: 286 LLEKNWGWVATADVG-NGRDKSVLNICK--VSGH-----RDKRRVVNFKVMEMSGTMDPL 337

Query: 365 ------------EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRV 397
                       EKY    I +DA+  G+ TC  L   G +  R+
Sbjct: 338 AFADFIYNECTPEKYPNITIAVDADGFGSDTCAQLVRRGANPVRI 382


>gi|303257560|ref|ZP_07343572.1| putative terminase B protein [Burkholderiales bacterium 1_1_47]
 gi|302859530|gb|EFL82609.1| putative terminase B protein [Burkholderiales bacterium 1_1_47]
          Length = 330

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 90/202 (44%), Gaps = 6/202 (2%)

Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY--APLIMGCDIAEEGGDNTVVVLRR 338
           R E    F     +  IP++ I  A N+      Y  APLI G D+A  G D +V+  RR
Sbjct: 95  RQEFLCDFSAAQDNGLIPIDDIRAAANKFYRESEYMGAPLIYGIDVARFGSDASVIFKRR 154

Query: 339 GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVL 398
           G V        K D     ++I+  + K +PDA+ ID +  G    D L  + + V  V 
Sbjct: 155 GLVAFEPIVIRKFDNMALADRIAVEMAKEKPDAVFID-SGAGQGVIDRLRQMRFDVVEVP 213

Query: 399 GQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK 458
              +A+D E   NRR E+   MA W++    I    ++Q      ++     G   +E+K
Sbjct: 214 FGAQAIDKEQFANRRMEMWWHMAQWIKQGGAIPPDPVLQGDLGAPTYGYTPKGPKILEAK 273

Query: 459 ---RVKGAKSTDYSDGLMYTFA 477
              + +  +S D +D L  TFA
Sbjct: 274 DKLKERIGRSPDLADALALTFA 295


>gi|216906085|ref|YP_002333619.1| terminase [Abalone shriveling syndrome-associated virus]
 gi|216263178|gb|ACJ72002.1| terminase [Abalone shriveling syndrome-associated virus]
          Length = 507

 Score = 73.9 bits (180), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 109/450 (24%), Positives = 182/450 (40%), Gaps = 46/450 (10%)

Query: 54  WQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICL 113
           WQLE ++ + A      ++    V   A+S G G GKT L+  L +W     PG     L
Sbjct: 51  WQLEIVDYI-AKFFRKNSDEKHFVCAIAVSGGNGTGKTKLSKALNIWRFCCHPGSRQFIL 109

Query: 114 ANSETQLK----TTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169
            NSE Q K    T L   +SK LS +       ++S + + +P  +D        D    
Sbjct: 110 TNSERQTKRTGFTMLVRRISKLLSCIA-----ALESSAYYYSPAVADKPEVRTN-DMWDV 163

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229
           + + ++ +E       G H+   M    DE++   D +   +    T+     F   T N
Sbjct: 164 TYLLQSSTEA---ALSGLHHPM-MTFSFDESTYFNDHVWQALENMWTQGQVLCF--CTGN 217

Query: 230 PRRLSGKFY-EIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR-------YGLDSDVTR 281
           P   +  ++  +FNK L       + TR V  ++        AR       YG       
Sbjct: 218 PSHDNNNYFARLFNKSLHKKDSLWL-TRCVSLLELPLKYRNDARARYIEEHYGKTHPRYI 276

Query: 282 VEVCGQFPQQDIDSFIPLNIIEEALNREPCPD-PYAPLIMGCD--IAEEGGDNTVVVLRR 338
             V GQFP+++  +   +  I EA+ RE   +  + P+IMG D  I+   G  + + +R 
Sbjct: 277 ASVLGQFPKKNTCNPFDITAISEAMEREVREEFIHHPVIMGIDVSISANNGSASAICVRE 336

Query: 339 GPVIEHLFDW--SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-----EMLG 391
           G  +  L ++    T+ R    K+  L+++ +P  +++DAN  G    + L     E   
Sbjct: 337 GTAVRVLREYRCHYTEFRI---KLLELLQEIKPTIVVVDANGVGFGLYEELHRTLPETSN 393

Query: 392 YHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL--EFASLINHSGLIQNLKSLKSFIVPN 449
             VY V     A       ++ +EL  K ++W   E  S+  +   +  L SL       
Sbjct: 394 VRVYGVRAHAEAFLKSEYADKMSELAKKSSEWFNNELVSIPKNYQFLNALTSLS--FADA 451

Query: 450 TGELAIESKRVKGAK---STDYSDGLMYTF 476
           +G++ +  K     K   S D +D    TF
Sbjct: 452 SGKIKLIGKTDAKKKVDLSMDMADAFFLTF 481


>gi|323179619|gb|EFZ65182.1| terminase B protein [Escherichia coli 1180]
          Length = 453

 Score = 73.6 bits (179), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 85/345 (24%), Positives = 142/345 (41%), Gaps = 44/345 (12%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138
           +  +++G G GK++L A L+L  M   P   VI +AN   Q+KT ++  V ++ +    +
Sbjct: 7   RTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQYWANAVKR 66

Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198
           H +      L    +Y        GI    +  +C+ Y     +   G H  + + +I D
Sbjct: 67  HGWLQTYFVLSDTMFYE---RSRKGI----WEVLCKGYRLGNEEALAGEHAAH-LLLILD 118

Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF-------NKPLDDWKRF 251
           EASG  D     + G LTE + NR  +M S P R SG FY+         + P   W   
Sbjct: 119 EASGISDKAIGVMTGALTEED-NRM-LMLSQPTRPSGYFYDSHHSQAKTPDNPKGIWTAI 176

Query: 252 QIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
            +++     + P F +  +  Y G DS    V+V GQFP++     +  +  + A  R+ 
Sbjct: 177 VLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECDRAARRKV 236

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL------- 363
             +     +   D+   G D +V+ + +  V  H     +   R  N K+  +       
Sbjct: 237 LLEKNWGWVATADVG-NGRDKSVLNICK--VSGH-----RDKRRVVNFKVMEMPGTMDPL 288

Query: 364 -----------VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRV 397
                       EKY    I +DA+  G+ TC  L   G +  R+
Sbjct: 289 AFADFIYNECTPEKYPNITIAVDADGFGSDTCAQLVRRGANPVRI 333


>gi|323948959|gb|EGB44853.1| terminase B protein [Escherichia coli H252]
          Length = 502

 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 62/221 (28%), Positives = 100/221 (45%), Gaps = 18/221 (8%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138
           +  +++G G GK++L A L+L  M   P   VI +AN   Q+KT ++  V ++ +    +
Sbjct: 56  RTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQYWANAVKR 115

Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198
           H +      L    +Y        GI    +  +C+ Y     +   G H  + + +I D
Sbjct: 116 HGWLQTYFVLSDTMFYE---RSRKGI----WEVLCKGYRLGNEEALAGEHAAH-LLLILD 167

Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF-------NKPLDDWKRF 251
           EASG  D     + G LTE + NR  +M S P R SG FY+         + P   W   
Sbjct: 168 EASGISDKAIGVMTGALTEED-NRM-LMLSQPTRPSGYFYDSHHSRAKTPDNPKGIWTAI 225

Query: 252 QIDTRTVEGIDPSF-HEGIIARYGLDSDVTRVEVCGQFPQQ 291
            +++     + P F  E ++   G DS    V+V GQFP++
Sbjct: 226 VLNSEESPFVTPQFIKEKLLEYGGRDSIEYMVKVLGQFPRE 266


>gi|322656964|gb|EFY53248.1| DNA packaging protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. CASC_09SCPH15965]
          Length = 411

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 62/221 (28%), Positives = 100/221 (45%), Gaps = 18/221 (8%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138
           +  +++G G GK++L A L+L  M   P   VI +AN   Q+KT ++  V ++ +    +
Sbjct: 56  RTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQYWANAVKR 115

Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198
           H +      L    +Y        GI    +  +C+ Y     +   G H  + + +I D
Sbjct: 116 HGWLQTYFVLSDTMFYE---RSRKGI----WEVLCKGYRLGNEEALAGEHAAH-LLLILD 167

Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF-------NKPLDDWKRF 251
           EASG  D     + G LTE + NR  +M S P R SG FY+         + P   W   
Sbjct: 168 EASGISDKAIGVMTGALTEED-NRM-LMLSQPTRPSGYFYDSHHSQAKTPDNPKGIWTAI 225

Query: 252 QIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQ 291
            +++     + P F +  +  Y G DS    V+V GQFP++
Sbjct: 226 VLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPRE 266


>gi|269119479|ref|YP_003307656.1| hypothetical protein Sterm_0853 [Sebaldella termitidis ATCC 33386]
 gi|268613357|gb|ACZ07725.1| hypothetical protein Sterm_0853 [Sebaldella termitidis ATCC 33386]
          Length = 499

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 107/476 (22%), Positives = 189/476 (39%), Gaps = 81/476 (17%)

Query: 58  FMEVVDAHCLNSVNNPNPEVF----KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICL 113
           F ++++ H L+       + F    + ++ AG   GK++L   L  + + TRP   VI  
Sbjct: 22  FKDILNFHFLSEDQTRVLQAFNEYRRLSVPAGHSTGKSSLAGGLTTYWLITRPKSRVIVT 81

Query: 114 ANSETQLKTTLWAEVSK--------WLSLLP-------------NKHWFEMQSLSLHPAP 152
           A +  QLKT  WAEV+K         L+L                + WF +   +  P  
Sbjct: 82  APTYRQLKTIYWAEVNKIYNRSKLKQLNLFEINDKIMRINDKDLKREWFALPVTASTPEG 141

Query: 153 WYS---------DVLHCSLGI----DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
                       + +   LGI    D +    + +    E+    +   +   + ++ DE
Sbjct: 142 MQGQHGDKTEVIEQIMKHLGIEEIGDDETIEIVSQILRGEKQIEGLTKEDKEKLLVMVDE 201

Query: 200 ASGTPDVI----------NLGILGFLTERNANRFWIMTSNPRRLSGKFYEI----FNKPL 245
           +SG  + I           L + G +T +N   F+    NP+    KFY++    +N P 
Sbjct: 202 SSGVKNEIFEVLEGTDYDKLVLFGNMT-KNTGYFYESVYNPK---SKFYKVTMSSYNSPF 257

Query: 246 DDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305
              K+ QI            H+ +   YG DS+V RV + G+ P  + +S    N I+ A
Sbjct: 258 --MKKEQI------------HD-LEETYGPDSNVVRVRLKGEAPDGNENSIFSSNKIDSA 302

Query: 306 LNREPCPDPYAPLIMGCDIAE-EGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLV 364
             R      Y  + +G D+ +  GGD++ +  ++   +    D     L     +I    
Sbjct: 303 FQRSLSLSEYETIKLGVDVGKGSGGDSSTIYEKKDNRVRKKLDRKDFTLPDVKREIIQYC 362

Query: 365 EKYRPDAIIIDANNTGART-----CDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVK 419
            K R   II + + TG  T      +  E+    V  +    +A + +   N+RTE++ +
Sbjct: 363 YKNRDKLIIANIDGTGLGTGLVQELEEGEIENLVVNDIQFAGKAKNKKEFNNKRTEMYFE 422

Query: 420 MADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK-RVKG--AKSTDYSDGL 472
           ++  L+   L     L + L  ++ +   N G   + SK ++K     S D SD L
Sbjct: 423 LSRNLDKLDLEEDQELKREL-LIQIYEFDNNGRFKLISKDKIKEMLGHSPDKSDAL 477


>gi|153951273|ref|YP_001397540.1| putative terminase B protein [Campylobacter jejuni subsp. doylei
           269.97]
 gi|153951467|ref|YP_001398214.1| putative terminase B protein [Campylobacter jejuni subsp. doylei
           269.97]
 gi|152938719|gb|ABS43460.1| putative terminase B protein [Campylobacter jejuni subsp. doylei
           269.97]
 gi|152938913|gb|ABS43654.1| putative terminase B protein [Campylobacter jejuni subsp. doylei
           269.97]
          Length = 430

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 70/256 (27%), Positives = 107/256 (41%), Gaps = 20/256 (7%)

Query: 237 FYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARY-----GLDSDVTRVEVCGQFP 289
           FYE+  K L D  WK FQ  +      +P   E  I        G  SDV R E+ G+F 
Sbjct: 164 FYELCRKELSDKNWKHFQFSSYD----NPFLKEEQIKELIEEVGGESSDVVRQEIYGEFI 219

Query: 290 QQDIDSFIPLNIIEEALNREP--CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347
                    L+ IE A+++            I G D+A  G D +V+  R+G VI+ L  
Sbjct: 220 DSSSAELFSLSGIENAMSKNSFSTQKMQGENIWGLDVARYGDDKSVLAKRKGFVIDELKK 279

Query: 348 WSKTDLRTTNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405
           +S+       NKI    ++   +P  I ID    G    D L   G  V+       A  
Sbjct: 280 YSQLGTIELANKILAEYKQSEEKPKGIFIDTCGLGVGVYDVLLNYGLPVFEANSANSATS 339

Query: 406 LEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKG 462
            ++  N+R +++   A  L+   L+    L  +++ ++ +   + G L I SK   +   
Sbjct: 340 NQYL-NKRAQMYFTFAKNLKHMELVKDEELKNDMRRIE-YEYSDKGLLKIVSKEQLKKNY 397

Query: 463 AKSTDYSDGLMYTFAE 478
            KS D SD +  TF E
Sbjct: 398 GKSPDLSDAVALTFFE 413


>gi|304399103|ref|ZP_07380971.1| DNA packaging protein [Pantoea sp. aB]
 gi|304353343|gb|EFM17722.1| DNA packaging protein [Pantoea sp. aB]
          Length = 503

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 71/275 (25%), Positives = 115/275 (41%), Gaps = 37/275 (13%)

Query: 45  LEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMST 104
           +E F    +WQ E         +NSV     +     +++G G GK++L A ++L  M  
Sbjct: 32  VELFGMIPTWQQE-------EIMNSVQETGSQT---TVTSGHGTGKSSLTAMMLLIYMIM 81

Query: 105 RPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI 164
            P   VI +AN   Q+KT ++  V  + +    +H +     +L    +Y        GI
Sbjct: 82  YPDARVIIVANKIGQVKTGVFKYVKTYWANAARRHPWLQNYFTLTDTMFYE---KSRKGI 138

Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFW 224
               +  +C+ Y     +   G H  + + I+ DEASG  D     + G LTE + NR  
Sbjct: 139 ----WEVLCKGYRLGNEEALAGEHAAHILLIL-DEASGISDKAIAIMRGALTEED-NRM- 191

Query: 225 IMTSNPRRLSGKFYEIF-------NKPLDDWKRFQIDTRTVEGIDPSF-HEGIIARYGLD 276
           +M S P R SG FY+         + P   W    +++     +   F  E ++   G D
Sbjct: 192 LMMSQPTRPSGYFYDSHHSLARHPDNPNGFWNAIVLNSEEAPHVTLKFIREKLVEYGGRD 251

Query: 277 SDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
           S    V+V G+FP+         N+    L R+ C
Sbjct: 252 SLEYMVKVLGRFPR---------NVSGYLLGRDEC 277


>gi|283956317|ref|ZP_06373797.1| terminase B protein, putative [Campylobacter jejuni subsp. jejuni
           1336]
 gi|283792037|gb|EFC30826.1| terminase B protein, putative [Campylobacter jejuni subsp. jejuni
           1336]
          Length = 430

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 66/256 (25%), Positives = 107/256 (41%), Gaps = 20/256 (7%)

Query: 237 FYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARY-----GLDSDVTRVEVCGQFP 289
           FYE+  K L D  WK FQ  +      +P   E  I        G DS+V + E+ G+F 
Sbjct: 164 FYELCRKELSDKNWKHFQFSSYD----NPFLKEEQIKELIEEVGGEDSEVVKQEIYGEFI 219

Query: 290 QQDIDSFIPLNIIEEALNREP--CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347
                    L  IE A+++            I G D+A  G D +V+  R+G +++ +  
Sbjct: 220 DSSSAELFALTEIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSVLAKRKGFIVDEIKK 279

Query: 348 WSKTDLRTTNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405
           +S+       N+I     +   +P  I ID    G    D L   G  V+       A  
Sbjct: 280 YSQLGTMELANRILAEYNQSEDKPKGIFIDTCGLGVGVYDVLLNYGLPVFEANSANSATS 339

Query: 406 LEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKG 462
            E+  N+R +++   A  L+   L+    L ++++ ++ +   + G L I SK   +   
Sbjct: 340 NEYL-NKRAQMYFTFAKNLKHMELVKDEELKKDMRMIE-YEYSDKGLLKIVSKEQLKKNY 397

Query: 463 AKSTDYSDGLMYTFAE 478
            KS D SD +  TF E
Sbjct: 398 GKSPDVSDAVALTFFE 413


>gi|212703250|ref|ZP_03311378.1| hypothetical protein DESPIG_01292 [Desulfovibrio piger ATCC 29098]
 gi|212673294|gb|EEB33777.1| hypothetical protein DESPIG_01292 [Desulfovibrio piger ATCC 29098]
          Length = 330

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 59/216 (27%), Positives = 94/216 (43%), Gaps = 12/216 (5%)

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--PLIMGCDIAEEGG 329
           R  L  +  R E+   F     D  IPL  + EA  R+   D     P+I+G D+A  G 
Sbjct: 79  RRELSDNAFRQEMLCDFTASSDDILIPLPDVLEAEARQLAWDDVGGMPVILGVDVARFGA 138

Query: 330 DNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
           D++V+V R+G  ++        D     ++++  + + RP A+ IDA   G    D L  
Sbjct: 139 DSSVIVRRQGLKVDGPVVMRGLDNMQLADRVAAAIMENRPHAVFIDAGQ-GQGVIDRLRQ 197

Query: 390 LGYHVYRV-LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG----LIQNLKSLKS 444
           LG+ V  V  G K   +  F  NRR+E+   +  WL+    +   G     ++   S   
Sbjct: 198 LGHEVIEVPFGGKPLQEGRFA-NRRSEMWYGLRQWLKSGGKLPDEGDDVPRLRAELSAPL 256

Query: 445 FIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
           +     G + +E K   + +   S D +D L  TFA
Sbjct: 257 YWYDAAGRMVLEPKDKIKERLGASPDIADALALTFA 292


>gi|315929403|gb|EFV08605.1| phosphatase, Ppx/GppA family [Campylobacter jejuni subsp. jejuni
           305]
          Length = 430

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 67/256 (26%), Positives = 104/256 (40%), Gaps = 20/256 (7%)

Query: 237 FYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARY-----GLDSDVTRVEVCGQFP 289
           FYE+  K L D  WK FQ  +      +P   E  I        G  S+V + E+ G+F 
Sbjct: 164 FYELCRKELSDKNWKHFQFSSYD----NPFLKEEQIKELIEEVGGEGSEVVKQEIYGEFI 219

Query: 290 QQDIDSFIPLNIIEEALNREP--CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347
                    L+ IE A+++            I G D+A  G D + +  R+G VI  +  
Sbjct: 220 DSSSAELFSLSEIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSALAKRKGFVIYEIKK 279

Query: 348 WSKTDLRTTNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405
           +S+       NKI     +   +P  I ID    G    D L   G  V+       A  
Sbjct: 280 YSQLGTIELANKILAEYNQSEDKPKGIFIDTCGLGVGVYDVLLNYGLPVFEANSANSATS 339

Query: 406 LEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKG 462
            E+  N+R +++   A  L+   L     L ++++ ++ +   + G L I SK   +   
Sbjct: 340 NEYL-NKRAQMYFTFAKNLKHMELFKDEELKKDMRMIE-YEYSDKGLLKIVSKEYLKKNY 397

Query: 463 AKSTDYSDGLMYTFAE 478
            KS D SD +  TF E
Sbjct: 398 GKSPDVSDAVALTFFE 413


>gi|57237579|ref|YP_178593.1| terminase B protein, putative [Campylobacter jejuni RM1221]
 gi|57166383|gb|AAW35162.1| terminase B protein, putative [Campylobacter jejuni RM1221]
          Length = 430

 Score = 63.2 bits (152), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 64/252 (25%), Positives = 105/252 (41%), Gaps = 12/252 (4%)

Query: 237 FYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARYGLD-SDVTRVEVCGQFPQQDI 293
           FYE+  K L D  WK FQ  +     +     + +I   G + S+V + E+ G+F     
Sbjct: 164 FYELCRKELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGEGSEVVKQEIYGEFIDSSS 223

Query: 294 DSFIPLNIIEEALNREP--CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT 351
                L+ IE A+++            I G D+A  G D + +  R+G VI  +  +S+ 
Sbjct: 224 AELFSLSEIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSALAKRKGFVIYEIKKYSQL 283

Query: 352 DLRTTNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409
                 NKI     +   +P  I ID    G    D L   G  V+       A   E+ 
Sbjct: 284 GTIELANKILAEYNQSEDKPKGIFIDTCGLGVGVYDVLLNYGLPVFEANSANSATSNEYL 343

Query: 410 RNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKST 466
            N+R +++      L+   L+    L ++++ ++ +   + G L I SK   +    KS 
Sbjct: 344 -NKRAQMYFTFTKNLKHMELVKDEELKKDMRMIE-YEYSDKGLLKIVSKEQLKKNYGKSP 401

Query: 467 DYSDGLMYTFAE 478
           D SD +  TF E
Sbjct: 402 DVSDAVALTFFE 413


>gi|168467778|ref|ZP_02701615.1| DNA pacase B subunit [Salmonella enterica subsp. enterica serovar
           Newport str. SL317]
 gi|195629119|gb|EDX48493.1| DNA pacase B subunit [Salmonella enterica subsp. enterica serovar
           Newport str. SL317]
          Length = 494

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 88/381 (23%), Positives = 154/381 (40%), Gaps = 59/381 (15%)

Query: 48  FSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPG 107
           F    +WQ +         + SV  P     K ++S+G G GK+ + + +++  +   PG
Sbjct: 30  FGKTPTWQQD-------QIIESVQEPGS---KTSVSSGHGTGKSDMTSIMIMLFIIMFPG 79

Query: 108 ISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSK 167
              I +AN   Q+ T ++    K+L +    +W    S +    PW ++    +   D+ 
Sbjct: 80  ARAIIVANKIQQVMTGIF----KYLKI----NW----STATSRFPWLAEYFVLT---DTS 124

Query: 168 HYSTMCRTYSEERPDTF--------VGHHNTYGMAIINDEASGTPDVINLGILGFLTERN 219
            Y    +      P  F         G H  + + II DEASG  D     + G LT ++
Sbjct: 125 FYEITSKGVWTVVPKGFRLGNEEALAGEHADHLLYII-DEASGVSDKAFGIMTGALTGKD 183

Query: 220 ANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDPSFHEGIIAR 272
            NR  ++ S P R SG FY+  +K       P   +    +++     + P F +  +A 
Sbjct: 184 -NRI-LLLSQPTRPSGYFYDTHHKLAKRPGNPNGIYTAITLNSEESPLVTPEFIKMKLAE 241

Query: 273 Y-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIA-EEGGD 330
           Y G DS +  ++V G FP+      +  + +E A  R+         I   D+A   G D
Sbjct: 242 YGGRDSPMYLIKVRGLFPKTQDGFLLGRDEVERASRRKVKIAKGWGWIACVDVAGGTGRD 301

Query: 331 NTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISGLV--EKYRPDAIIIDANNTG 380
            +V+ +        +R  +   + ++S         KI+     ++Y    I+ID +  G
Sbjct: 302 KSVINIMMVSGERNKRRIIGYRIIEYSDVTETQLAAKINAECSPDRYPNITIVIDGDGLG 361

Query: 381 ARTCDYLEMLGYHVYRVLGQK 401
             T D L    Y  Y +  Q+
Sbjct: 362 KSTADLL----YDNYGITAQR 378


>gi|282598783|ref|YP_003359102.1| putative large subunit terminase [Clavibacter phage CMP1]
 gi|262212571|gb|ACY35907.1| putative large subunit terminase [Clavibacter phage CMP1]
          Length = 872

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 84/339 (24%), Positives = 132/339 (38%), Gaps = 49/339 (14%)

Query: 183 TFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN 242
           +F G H+ + +A++ DEA G P+ + +G     T  +A    I   NP + +  F+E F 
Sbjct: 511 SFQGIHDGH-VAVVLDEAGGLPEDLYIGANAVTTNFHARILAI--GNPDKRNTPFHERFT 567

Query: 243 --KPLDDWKRFQI---DTRTVEGI----DPSFHE-----------GIIARYGLDSDVTRV 282
             +    W RF I   DT    G     DP+  E            +  R      V   
Sbjct: 568 DTEKFSSWNRFTIGAEDTPNFTGEKIYEDPAKDEDVKKHLVQVSWAVEMRKSARPSVVAA 627

Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVI 342
           +V G FP+ D  +F   ++I    + E  P+      MG DI+ +G D +V  +  G  I
Sbjct: 628 KVDGNFPESDDTTFFDQSVINRGYSTEIEPESTDFKYMGVDISYQGEDQSVAYINHGGQI 687

Query: 343 EHLFDWSKTD--------LRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--- 391
               +W++ D        +R  N      V++ R     ID   TGA     L+ML    
Sbjct: 688 RIADEWNRFDGAEHIESAIRIHNKACQEGVQEVR-----IDMAGTGAGVYSNLKMLDQFK 742

Query: 392 ---YHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL---INHSGLIQNLKSLKSF 445
              Y +  V G  R  +     N R   + +    L    +   I    L + ++ L+  
Sbjct: 743 DKPYVLIGVNGANRTPNSNRWLNARAWHYDQFRTGLITGKIDITITDVDLKKEME-LQPS 801

Query: 446 IVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPP 481
              N G+L I  K   R  G  S D+ D  +Y+  +  P
Sbjct: 802 TFTNRGQLQITRKDDMRKMGISSPDHLDAAIYSAIDTTP 840


>gi|148653111|ref|YP_001280204.1| hypothetical protein PsycPRwf_1309 [Psychrobacter sp. PRwf-1]
 gi|148572195|gb|ABQ94254.1| hypothetical protein PsycPRwf_1309 [Psychrobacter sp. PRwf-1]
          Length = 520

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 94/445 (21%), Positives = 178/445 (40%), Gaps = 60/445 (13%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138
           + ++++G G GK+     + LW +   P   ++  A    QL+T +W E++  L  L N 
Sbjct: 57  RTSVASGHGTGKSRSAGIIALWHLLFYPESVMLFTAPQIGQLRTVVWKEINICLQRLRNN 116

Query: 139 HWFEMQSLSLHPAPWYSD-VLHCSLGIDSKHYS----TMCRTYSEERPDTFVGHHNTYGM 193
                         W +D V+  +  I  K +        +T  + +P    G H  + M
Sbjct: 117 ----------KALGWLADYVVVLAEKIYIKGFKDTWFVFAKTAPKHQPTNIAGQHGDHYM 166

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL----DDWK 249
            +  DEA G  D +    +G LT  N NR  ++TS P + +G FY+  +K        W 
Sbjct: 167 -VWADEACGIDDAVMEVAIGALTHEN-NRA-VLTSQPAKNTGFFYDTHHKLSHHNGGKWT 223

Query: 250 RFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308
             + +      +        + +YG  +S    + + G+FP+     ++      E + +
Sbjct: 224 ALEFNGEMSPIVSKDKLIEALYQYGSRNSPGYLIRIRGKFPELK-GEYLLTRTDYENMKQ 282

Query: 309 EPC----PDPYAPLIMGCDIAEEGGDNTVVV--------LRRGPVIEH-------LFDWS 349
           +PC     D +  +I+  D+  + G ++ V+        + +G +  H       LF  +
Sbjct: 283 QPCVIEEGDKWG-IIVAVDVGGDVGRDSSVISVMQVVDKMIKGRIERHVHLLDIPLFS-N 340

Query: 350 KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409
           + ++ T   KI+ ++  Y    ++ID    G      L+  G +   V       +    
Sbjct: 341 RANINTLKAKINDVMSDYPGATLVIDPLGAGMGLTQSLKADGVYFDEVHWGSPCFNNTLK 400

Query: 410 R---NRRTELHVKMADWLE---FASLINHSGLIQNLKSLKS------FIVPNTGELAIES 457
           R   N+R+  +V MA  +E   F+       + Q + +L+       +         + S
Sbjct: 401 RYYMNKRSHAYVSMAKAVEKGYFSVSDKVKKMYQVMTNLEEQMTRLPYYFDEKARWCMMS 460

Query: 458 KR---VKGAKSTDYSDGLMYTFAEN 479
           K+    KG KS D +D + + F EN
Sbjct: 461 KKDMLKKGIKSPDIADTIAFGFMEN 485


>gi|226227228|ref|YP_002761334.1| hypothetical protein GAU_1822 [Gemmatimonas aurantiaca T-27]
 gi|226090419|dbj|BAH38864.1| hypothetical protein [Gemmatimonas aurantiaca T-27]
          Length = 549

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 107/462 (23%), Positives = 179/462 (38%), Gaps = 67/462 (14%)

Query: 81  AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSK-WLSLLPNKH 139
           A+++G G GKT L A L+LW ++  P      +A    Q +  +W EV++ W        
Sbjct: 71  AVASGTGTGKTFLEAVLLLWWIAVEPDSIATTVATKADQQEKGIWREVARHWPRFQACFP 130

Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
             E+ +L +   PW  D    + GI      T      EE      G H    + I+ DE
Sbjct: 131 EAELTTLRIRMEPWRGDAWG-AWGI------TAAPKAGEESSSAVQGLHAKR-LLILVDE 182

Query: 200 ASGTPDVINLGILGFLT-ERNANRFWIMTSNPRRLS---GKFYEIFNKPLDDWKRFQID- 254
             G P  +   ++   T E N    +    NP   +   G+F E   K +   +   +D 
Sbjct: 183 TPGVPQPVMTALVNTATGEENVIAAF---GNPDYQADPLGQFAE--TKRVTAIRISALDH 237

Query: 255 ---TRTVEGIDPSFHEGIIA----RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
                 VE I  +     IA    +YG++S V +  V G  P+Q   + I L     A +
Sbjct: 238 PNVVLGVERIPGAATRLSIATREDKYGVESGVYQSRVRGIAPEQSASALIHLAWCVAAAD 297

Query: 308 REPCPDPYA----PLIMGCDIAE-EGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           R       A    P  +G D+A+ E GD   V + +G  +  +   +  +      ++  
Sbjct: 298 RAESVQHAALALGPKALGVDVAQSENGDKAAVAMGQGARLLSVIAKACPNATKLGAEVWQ 357

Query: 363 LV--EKYRPDAIIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVD--------- 405
           L+  E   P+ + +D    GA T ++L      E  G  V R  G  +A++         
Sbjct: 358 LMRDEGIVPEYVGVDPIGVGAATVNHLDGECEKENAGRSVVRCSGGAKAMEASSRAADGS 417

Query: 406 -LEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNT-------GELAIES 457
            +E+  +     +++   W +    + + GLI   +  + F    T       G + +ES
Sbjct: 418 AMEWLADANKFKNLRAQMWWQLREDLRN-GLIALPRDRELFRELTTVQFDEDGGIVTLES 476

Query: 458 K---RVKGAKSTDYSDGLMY-------TFAENPPRSDMDFGR 489
           K   R +  +S D +D ++Y       T    PP    D  R
Sbjct: 477 KDDIRKRLGRSPDRADAVVYWNWVRPRTRVNQPPPEGFDVAR 518


>gi|299769795|ref|YP_003731821.1| hypothetical protein AOLE_07785 [Acinetobacter sp. DR1]
 gi|298699883|gb|ADI90448.1| hypothetical protein AOLE_07785 [Acinetobacter sp. DR1]
          Length = 668

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 101/432 (23%), Positives = 163/432 (37%), Gaps = 61/432 (14%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GKT     + LW +       ++  A    QLK  +W E+S             +  L  
Sbjct: 211 GKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLARLKQ 259

Query: 149 HPAPWYSD-VLHCSLGIDSKHYS----TMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
            P  W +D V + S  +  K Y        +T  + +P    G+H    M  + DEASG 
Sbjct: 260 GPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEASGV 318

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVE 259
            D +     G LT  + NR  +MTS P R +G FYE  +K        W     +     
Sbjct: 319 DDAVLDVAFGALTHED-NRA-VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESP 376

Query: 260 GIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA-P 317
            +     E    +YG   D   ++ V G+FP    +  I     EE        D +   
Sbjct: 377 LVSKQSLEEQRQKYGSRDDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFG 436

Query: 318 LIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKISGL 363
            I+  D+    G D++V+V+             RR  V++     ++ D+     KI+ L
Sbjct: 437 YIITVDVGGGVGRDDSVIVISKVWGEAQWGERARRVEVVDIPLCKNRDDILELFAKINEL 496

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA-D 422
           + +Y    +++D N  G     YL+  G     V    +     F  + R E   K +  
Sbjct: 497 LLQYPNANLVVDDNGAGKGLGQYLKKQGIFYVPVYWGSQC----FSNDNRKEFTNKRSLA 552

Query: 423 WLEFASLINHSGLIQNLKSLKSFI--------VPNTGE-------LAIESKRVKGAKSTD 467
           ++ FA  +  SG  + +K+ K ++        +P   +       L+ +  R  G KS D
Sbjct: 553 YVGFARAVA-SGRFK-MKTKKHYVKIKDQLIHIPYRFDDFARYKILSKDEMRRMGIKSPD 610

Query: 468 YSDGLMYTFAEN 479
             D   + F EN
Sbjct: 611 LGDAFAFLFLEN 622


>gi|323516996|gb|ADX91377.1| hypothetical protein ABTW07_0941 [Acinetobacter baumannii
           TCDC-AB0715]
 gi|323518424|gb|ADX92805.1| hypothetical protein ABTW07_2381 [Acinetobacter baumannii
           TCDC-AB0715]
          Length = 663

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 78/328 (23%), Positives = 125/328 (38%), Gaps = 39/328 (11%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GKT     + LW +       ++  A    QLK  +W E+S             +  L  
Sbjct: 211 GKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLARLKQ 259

Query: 149 HPAPWYSD-VLHCSLGIDSKHYS----TMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
            P  W +D V + S  +  K Y        +T  + +P    G+H    M  + DEASG 
Sbjct: 260 GPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEASGV 318

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVE 259
            D +     G LT  + NR  +MTS P R +G FYE  +K        W     +     
Sbjct: 319 DDAVLDVAFGALTHED-NRA-VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESP 376

Query: 260 GIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA-P 317
            +     E    +YG   D   ++ V G+FP    +  I     EE        D +   
Sbjct: 377 LVSKQSLEEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFG 436

Query: 318 LIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKISGL 363
            ++  D+    G D++V+V+             RR  V++     ++ D+     KI+ L
Sbjct: 437 YVITVDVGGGVGRDDSVIVVSKVWGESQWGERARRVEVVDIPLCKNRDDILELFAKINEL 496

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLG 391
           + +Y    +++D N  G     YL+  G
Sbjct: 497 LLQYPNANLVVDDNGAGKGLGQYLKKQG 524


>gi|256392042|ref|YP_003113606.1| hypothetical protein Caci_2856 [Catenulispora acidiphila DSM 44928]
 gi|256358268|gb|ACU71765.1| conserved hypothetical protein [Catenulispora acidiphila DSM 44928]
          Length = 484

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 93/443 (20%), Positives = 161/443 (36%), Gaps = 77/443 (17%)

Query: 81  AISAGRGIGKTTLNAWLVLWLMSTRP--GISVICLANSETQLKTTLWAEVSKWLSL---- 134
           A+ +  G GK+ + + L  W + T P     V+  A +  Q+K  LWAE++K  +     
Sbjct: 58  AVQSCHGTGKSFVASRLTAWWLDTHPPGEAFVVTTAPTGDQVKAILWAEINKAFAKAEAR 117

Query: 135 ---LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191
              LP +         ++   W  D    + G          R  S+  P  F G H  Y
Sbjct: 118 GTPLPGR---------INETDWKYDKFLVAFG----------RKPSDYNPHAFQGIHAKY 158

Query: 192 GMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF 251
            + I+ DEA G         L   T  +     I   NP      F ++     D W   
Sbjct: 159 VLVIL-DEACGISKQFWTAALAIATGVHCRILAI--GNPDDPGSHFAQVCKS--DRWNMI 213

Query: 252 QIDTR-----TVEGIDPSFHEGIIAR---------YGLDSDVTRVEVCGQFPQQDIDSFI 297
           +I  R     T E +     + ++++         +G +S +   +V  +FP    D  +
Sbjct: 214 KIAARDTPNFTGEEVPDDLADMLVSQAYVLDMAEEFGPESPIYLSKVDAEFPSDASDGVV 273

Query: 298 PLNIIEEALNREP----CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDL 353
            L+ +  A  REP     PD   P+ +G D+   GGD T +  RRG      +   + D 
Sbjct: 274 RLSKL-MACTREPVHPYAPDRLVPVELGVDLG-AGGDETCIRERRGIAAGREWRNREKDS 331

Query: 354 RTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM---LGYHVYRVLGQKRAVDLEFCR 410
               + I   + +     + +D+   G      L+     G H   V+G   +   E   
Sbjct: 332 EKVVDHIVRAIRETGATKVKVDSIGIGWGIVGSLQARRKQGLHTAEVVGVNVS---EAST 388

Query: 411 NRRTELHVKMADWLEFASLINHSG--------------LIQNLKSLKSFIVPNTGELAIE 456
                  ++   W E    ++  G              L+  L + K + +  +G + +E
Sbjct: 389 QPEKYARLRSQIWWEVGRKLSEDGGWDLSQLDTTDRDRLVSQLTAPK-YDLDASGRIVVE 447

Query: 457 SK---RVKGAKSTDYSDGLMYTF 476
            K   + +  +S D +D L+  F
Sbjct: 448 KKEETKKRIGRSPDNADALLLAF 470


>gi|312964323|ref|ZP_07778627.1| terminase B protein [Escherichia coli 2362-75]
 gi|331655801|ref|ZP_08356790.1| terminase B protein (PACase B protein) (DNA packaging B protein)
           [Escherichia coli M718]
 gi|312291036|gb|EFR18910.1| terminase B protein [Escherichia coli 2362-75]
 gi|323186470|gb|EFZ71817.1| terminase B protein [Escherichia coli 1357]
 gi|323969205|gb|EGB64507.1| terminase B protein [Escherichia coli TA007]
 gi|325495624|gb|EGC93488.1| DNA pacase B subunit [Escherichia fergusonii ECD227]
 gi|331046575|gb|EGI18664.1| terminase B protein (PACase B protein) (DNA packaging B protein)
           [Escherichia coli M718]
          Length = 494

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 79/340 (23%), Positives = 139/340 (40%), Gaps = 32/340 (9%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS-KWLSLLPN 137
           K ++S+G G GK+ + + +++  +   PG   I +AN   Q+ T ++  +   W +    
Sbjct: 51  KTSVSSGHGTGKSDMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSR 110

Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197
             W       L    +Y         +  K +    R  SEE      G H  + + II 
Sbjct: 111 FPWLA-DYFVLTETAFYEVTGKGVWTVVPKGF----RLGSEE---ALAGEHADHLLYII- 161

Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKR 250
           DEASG  D     I G LT ++ NR  ++ S P R SG FY+  +K       P   +  
Sbjct: 162 DEASGVSDRAFGIITGALTGQD-NRI-LLLSQPTRPSGYFYDTHHKLAKRPGNPDGVYTA 219

Query: 251 FQIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309
             +++     + P+F +  +A Y G D+ +  ++V G FP+      +  + +E A  R+
Sbjct: 220 ITLNSEESPLVTPAFIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRK 279

Query: 310 PCPDPYAPLIMGCDIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKI 360
                    +   D+A   G D +V+ +        +R  +   + +++         KI
Sbjct: 280 VKIAKGWGWLACVDVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKI 339

Query: 361 SGLV--EKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRV 397
                 E++    I ID +  G  T D + E  G  V R+
Sbjct: 340 FAECNPERFPNITIAIDGDGLGKATADLMYEYYGITVQRI 379


>gi|332974843|gb|EGK11758.1| hypothetical protein HMPREF9373_1714 [Psychrobacter sp. 1501(2011)]
          Length = 520

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 96/466 (20%), Positives = 183/466 (39%), Gaps = 67/466 (14%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNK 138
           + ++++G G GK+     + LW +   P   ++  A    QL+T +W E++  L  L N 
Sbjct: 57  RTSVASGHGTGKSRSAGIIALWHLLFYPESVMLFTAPQIGQLRTVVWKEINICLQRLRNN 116

Query: 139 HWFEMQSLSLHPAPWYSD-VLHCSLGIDSKHYS----TMCRTYSEERPDTFVGHHNTYGM 193
                         W +D V+  +  I  K +        +T  + +P    G H  + M
Sbjct: 117 ----------KALGWLADYVVVLAEKIYIKGFKDTWFVFAKTAPKHQPTNIAGQHGDHYM 166

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL----DDWK 249
            +  DEA G  D +    +G LT  N NR  ++TS P + +G FY+  +K        W 
Sbjct: 167 -VWADEACGIDDAVMEVAIGALTHEN-NRA-VLTSQPAKNTGFFYDTHHKLSHYNGGKWI 223

Query: 250 RFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308
             + +      +        + +YG  +S    + + G+FP+     ++      E +  
Sbjct: 224 ALEFNGEMSPIVSKEKLIEALYQYGSRNSPGYLIRIRGKFPELK-GEYLLTRTDYENMKA 282

Query: 309 EPC----PDPYAPLIMGCDIAEEGGDNTVVV--------LRRGPVIEH-------LFDWS 349
            PC     D +  +I+  D+  + G ++ V+        + +G +  H       LF  +
Sbjct: 283 HPCVIKEGDKWG-IIVTVDVGGDVGRDSSVISVLQVVDKMVKGRIERHVHLLDIPLFS-N 340

Query: 350 KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409
           + ++ T   KI+ ++  Y    ++ID    G      ++  G +   V       +    
Sbjct: 341 RANINTLKAKINDVMSDYPGATLVIDPLGAGMGLTQSVKADGVYFDEVHWGSPCFNNTLK 400

Query: 410 R---NRRTELHVKMADWLE---FASLINHSGLIQNLKSLKS------FIVPNTGELAIES 457
           R   N+R+  +V MA  +E   F+       + Q + +L+       +         + S
Sbjct: 401 RYYMNKRSHAYVSMAKAVEKGYFSVSDKIKKMYQVITNLEEQMTRLPYYFDEKARWCMMS 460

Query: 458 KR---VKGAKSTDYSDGLMYTFAENPPRSDMDFGRCPSYQYEGVDL 500
           K+    KG KS D +D + + F EN           P+  YE +++
Sbjct: 461 KKDMLKKGIKSPDIADTIAFGFMEN-------ISYAPAESYEDLNI 499


>gi|260871239|ref|YP_003238019.1| DNA packaging protein [Escherichia coli O111:H- str. 11128]
 gi|257767818|dbj|BAI39311.1| DNA packaging protein [Escherichia coli O111:H- str. 11128]
          Length = 494

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 77/338 (22%), Positives = 140/338 (41%), Gaps = 32/338 (9%)

Query: 81  AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEV-SKWLSLLPNKH 139
           ++++G G GK+ + + + +  +   PG  VI +AN   Q+   ++  + S W + +    
Sbjct: 53  SVTSGHGTGKSDMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFP 112

Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
           W   +   L    ++         I  K     CR+ +EE      G H  + + II DE
Sbjct: 113 WLS-KYFILTETSFFEVTGKGVWTILIKS----CRSGNEE---ALAGEHADHLLYII-DE 163

Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQ 252
           ASG  D     I G LT ++ NR  ++ S P R SG FY+  ++       P   +    
Sbjct: 164 ASGVSDKAFSVITGALTGKD-NRI-LLLSQPTRPSGYFYDSHHRLAIRPGNPDGLFTAII 221

Query: 253 IDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
           +++     +D  F    +A Y G D+ +  ++V G+FP+      +  + +E A  R+  
Sbjct: 222 LNSEESPLVDAKFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVK 281

Query: 312 PDPYAPLIMGCDIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISG 362
                  +   D+A   G D +V+ +        +R  +   + +++         KI  
Sbjct: 282 IAKGWGWVACVDVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKIFA 341

Query: 363 LV--EKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRV 397
               E++    I ID +  G  T D + E  G  V R+
Sbjct: 342 ECNPERFPNITIAIDGDGLGKSTADLMYERYGITVQRI 379


>gi|56266643|gb|AAV84926.1| DNA pacase B subunit [Enterobacteria phage phiW39]
          Length = 494

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 79/340 (23%), Positives = 139/340 (40%), Gaps = 32/340 (9%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS-KWLSLLPN 137
           K ++S+G G GK+ + + +++  +   PG   I +AN   Q+ T ++  +   W +    
Sbjct: 51  KTSVSSGHGTGKSDMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSR 110

Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197
             W       L    +Y         +  K +    R  SEE      G H  + + II 
Sbjct: 111 FPWLA-DYFVLTETAFYEITGKGVWTVVPKGF----RLGSEE---ALAGEHADHLLYII- 161

Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKR 250
           DEASG  D     I G LT ++ NR  ++ S P R SG FY+  +K       P   +  
Sbjct: 162 DEASGVSDRAFGIITGALTGQD-NRI-LLLSQPTRPSGYFYDTHHKLAKRPGNPDGVYTA 219

Query: 251 FQIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309
             +++     + P+F +  +A Y G D+ +  ++V G FP+      +  + +E A  R+
Sbjct: 220 ITLNSEESPLVTPAFIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRK 279

Query: 310 PCPDPYAPLIMGCDIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKI 360
                    +   D+A   G D +V+ +        +R  +   + +++         KI
Sbjct: 280 VKIAKGWGWLACVDVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKI 339

Query: 361 SGLV--EKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRV 397
                 E++    I ID +  G  T D + E  G  V R+
Sbjct: 340 FAECNPERFPNITIAIDGDGLGKATADLMYEYYGITVQRI 379


>gi|324111095|gb|EGC05081.1| terminase B protein [Escherichia fergusonii B253]
          Length = 494

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 79/340 (23%), Positives = 139/340 (40%), Gaps = 32/340 (9%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS-KWLSLLPN 137
           K ++S+G G GK+ + + +++  +   PG   I +AN   Q+ T ++  +   W +    
Sbjct: 51  KTSVSSGHGTGKSDMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSR 110

Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197
             W       L    +Y         +  K +    R  SEE      G H  + + II 
Sbjct: 111 FPWLA-DYFVLTETAFYEVTGKGVWTVVPKGF----RLGSEE---ALAGEHADHLLYII- 161

Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKR 250
           DEASG  D     I G LT ++ NR  ++ S P R SG FY+  +K       P   +  
Sbjct: 162 DEASGVSDRAFGIITGALTGQD-NRI-LLLSQPTRPSGYFYDTHHKLAKRPGNPDGVYTA 219

Query: 251 FQIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309
             +++     + P+F +  +A Y G D+ +  ++V G FP+      +  + +E A  R+
Sbjct: 220 ITLNSEESPLVTPAFIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRK 279

Query: 310 PCPDPYAPLIMGCDIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKI 360
                    +   D+A   G D +V+ +        +R  +   + +++         KI
Sbjct: 280 VKIAKGWGWLACVDVAGGTGRDKSVINIMMVSGQRNKRRVINYRILEYTDVTETQLAAKI 339

Query: 361 SGLV--EKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRV 397
                 E++    I ID +  G  T D + E  G  V R+
Sbjct: 340 FAECNPERFPNITIAIDGDGLGKATADLMYEYYGITVQRI 379


>gi|213156231|ref|YP_002318651.1| phage terminase [Acinetobacter baumannii AB0057]
 gi|301346399|ref|ZP_07227140.1| phage terminase [Acinetobacter baumannii AB056]
 gi|301594275|ref|ZP_07239283.1| phage terminase [Acinetobacter baumannii AB059]
 gi|213055391|gb|ACJ40293.1| phage terminase [Acinetobacter baumannii AB0057]
          Length = 663

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 77/328 (23%), Positives = 125/328 (38%), Gaps = 39/328 (11%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GKT     + LW +       ++  A    QLK  +W E+S             +  L  
Sbjct: 211 GKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLARLKQ 259

Query: 149 HPAPWYSD-VLHCSLGIDSKHYS----TMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
            P  W +D V + S  +  K Y        +T  + +P    G+H    M  + DEASG 
Sbjct: 260 GPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEASGV 318

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVE 259
            D +     G LT  + NR  +MTS P R +G FYE  +K        W     +     
Sbjct: 319 DDAVLDVAFGALTHED-NRA-VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESP 376

Query: 260 GIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA-P 317
            +     +    +YG   D   ++ V G+FP    +  I     EE        D +   
Sbjct: 377 LVSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFG 436

Query: 318 LIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKISGL 363
            ++  D+    G D++V+V+             RR  V++     ++ D+     KI+ L
Sbjct: 437 YVITVDVGGGVGRDDSVIVVSKVWGEAQWGERARRVEVVDIPLCKNRDDILELFAKINEL 496

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLG 391
           + +Y    +++D N  G     YL+  G
Sbjct: 497 LLQYPNANLVVDDNGAGKGLGQYLKKQG 524


>gi|260551382|ref|ZP_05825582.1| phage terminase [Acinetobacter sp. RUH2624]
 gi|260405545|gb|EEW99037.1| phage terminase [Acinetobacter sp. RUH2624]
          Length = 663

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 77/328 (23%), Positives = 125/328 (38%), Gaps = 39/328 (11%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GKT     + LW +       ++  A    QLK  +W E+S             +  L  
Sbjct: 211 GKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLARLKQ 259

Query: 149 HPAPWYSD-VLHCSLGIDSKHYS----TMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
            P  W +D V + S  +  K Y        +T  + +P    G+H    M  + DEASG 
Sbjct: 260 GPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEASGV 318

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVE 259
            D +     G LT  + NR  +MTS P R +G FYE  +K        W     +     
Sbjct: 319 DDAVLDVAFGALTHED-NRA-VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESP 376

Query: 260 GIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA-P 317
            +     +    +YG   D   ++ V G+FP    +  I     EE        D +   
Sbjct: 377 LVSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFG 436

Query: 318 LIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKISGL 363
            ++  D+    G D++V+V+             RR  V++     ++ D+     KI+ L
Sbjct: 437 YVITVDVGGGVGRDDSVIVVSKVWGEAQWGERARRVEVVDIPLCKNRDDILELFAKINEL 496

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLG 391
           + +Y    +++D N  G     YL+  G
Sbjct: 497 LLQYPNANLVVDDNGAGKGLGQYLKKQG 524


>gi|332852816|ref|ZP_08434408.1| intein splicing region-containing protein [Acinetobacter baumannii
           6013150]
 gi|332871045|ref|ZP_08439658.1| intein splicing region-containing protein [Acinetobacter baumannii
           6013113]
 gi|332729027|gb|EGJ60377.1| intein splicing region-containing protein [Acinetobacter baumannii
           6013150]
 gi|332731805|gb|EGJ63085.1| intein splicing region-containing protein [Acinetobacter baumannii
           6013113]
          Length = 663

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 77/328 (23%), Positives = 125/328 (38%), Gaps = 39/328 (11%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GKT     + LW +       ++  A    QLK  +W E+S             +  L  
Sbjct: 211 GKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLARLKQ 259

Query: 149 HPAPWYSD-VLHCSLGIDSKHYS----TMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
            P  W +D V + S  +  K Y        +T  + +P    G+H    M  + DEASG 
Sbjct: 260 GPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEASGV 318

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVE 259
            D +     G LT  + NR  +MTS P R +G FYE  +K        W     +     
Sbjct: 319 DDAVLDVAFGALTHED-NRA-VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESP 376

Query: 260 GIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA-P 317
            +     +    +YG   D   ++ V G+FP    +  I     EE        D +   
Sbjct: 377 LVSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFG 436

Query: 318 LIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKISGL 363
            ++  D+    G D++V+V+             RR  V++     ++ D+     KI+ L
Sbjct: 437 YVITVDVGGGVGRDDSVIVVSKVWGEAQWGERARRVEVVDIPLCKNRDDILELFAKINEL 496

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLG 391
           + +Y    +++D N  G     YL+  G
Sbjct: 497 LLQYPNANLVVDDNGAGKGLGQYLKKQG 524


>gi|226940437|ref|YP_002795511.1| Terminase large subunit [Laribacter hongkongensis HLHK9]
 gi|226715364|gb|ACO74502.1| Terminase large subunit [Laribacter hongkongensis HLHK9]
          Length = 133

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 40/126 (31%), Positives = 53/126 (42%), Gaps = 23/126 (18%)

Query: 114 ANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL------HPAPWYSDVLHCSLGIDSK 167
           AN++TQL+T    EV KW  L    HWF+ QS S+      H   W +D +         
Sbjct: 4   ANTDTQLRTKTSPEVGKWQRLSITSHWFDPQSASIAARDKEHAKTWRADFV--------- 54

Query: 168 HYSTMCRTYSEERPDTFVGHHNT-YGMAIINDEASGTPDVINLGILGFLTERNANRFWIM 226
                   +SE   + F G HN    + +I DEAS   D +     G LT+      WI 
Sbjct: 55  -------PWSEHNTEAFAGLHNKGKRIVLIFDEASAIADKVWEVAEGALTDEETEIIWIA 107

Query: 227 TSNPRR 232
             NP R
Sbjct: 108 FGNPTR 113


>gi|184158505|ref|YP_001846844.1| hypothetical protein ACICU_02185 [Acinetobacter baumannii ACICU]
 gi|183210099|gb|ACC57497.1| hypothetical protein ACICU_02185 [Acinetobacter baumannii ACICU]
          Length = 663

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 77/328 (23%), Positives = 125/328 (38%), Gaps = 39/328 (11%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GKT     + LW +       ++  A    QLK  +W E+S             +  L  
Sbjct: 211 GKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLARLKQ 259

Query: 149 HPAPWYSD-VLHCSLGIDSKHYS----TMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
            P  W +D V + S  +  K Y        +T  + +P    G+H    M  + DEASG 
Sbjct: 260 GPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEASGV 318

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVE 259
            D +     G LT  + NR  +MTS P R +G FYE  +K        W     +     
Sbjct: 319 DDAVLDVAFGALTHED-NRA-VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESP 376

Query: 260 GIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA-P 317
            +     +    +YG   D   ++ V G+FP    +  I     EE        D +   
Sbjct: 377 LVSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFG 436

Query: 318 LIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKISGL 363
            ++  D+    G D++V+V+             RR  V++     ++ D+     KI+ L
Sbjct: 437 YVITVDVGGGVGRDDSVIVVSKVWGESQWGERARRVEVVDIPLCKNRDDILELFAKINEL 496

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLG 391
           + +Y    +++D N  G     YL+  G
Sbjct: 497 LLQYPNANLVVDDNGAGKGLGQYLKKQG 524


>gi|46401730|ref|YP_006576.1| PacB [Enterobacteria phage P1]
 gi|301646767|ref|ZP_07246623.1| putative terminase B protein [Escherichia coli MS 146-1]
 gi|129547|sp|P27753|TERL_BPP1 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging
           protein B; AltName: Full=PACase B protein; AltName:
           Full=Terminase B protein; AltName: Full=Terminase large
           subunit
 gi|68597607|sp|Q5XLR0|TERL_BPP7 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging
           protein B; AltName: Full=PACase B protein; AltName:
           Full=Terminase B protein; AltName: Full=Terminase large
           subunit
 gi|33323612|gb|AAQ07582.1|AF503408_106 PacB [Enterobacteria phage P7]
 gi|215636|gb|AAA21724.1| pacB [Enterobacteria phage P1]
 gi|33338757|gb|AAQ14080.1| PacB [Enterobacteria phage P1]
 gi|33338866|gb|AAQ14188.1| PacB [Enterobacteria phage P1]
 gi|54112354|gb|AAV28854.1| PacB [Enterobacteria phage P7]
 gi|301075042|gb|EFK89848.1| putative terminase B protein [Escherichia coli MS 146-1]
          Length = 494

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 78/343 (22%), Positives = 141/343 (41%), Gaps = 32/343 (9%)

Query: 81  AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEV-SKWLSLLPNKH 139
           ++++G G GK+ + + + +  +   PG  VI +AN   Q+   ++  + S W + +    
Sbjct: 53  SVTSGHGTGKSDMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFP 112

Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
           W   +   L    ++         I  K     CR  +EE      G H  + + II DE
Sbjct: 113 WLS-KYFILTETSFFEVTGKGVWTILIKS----CRPGNEE---ALAGEHADHLLYII-DE 163

Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQ 252
           ASG  D     I G LT ++ NR  ++ S P R SG FY+  ++       P   +    
Sbjct: 164 ASGVSDKAFSVITGALTGKD-NRI-LLLSQPTRPSGYFYDSHHRLAIRPGNPDGLFTAII 221

Query: 253 IDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
           +++     +D  F    +A Y G D+ +  ++V G+FP+      +  + +E A  R+  
Sbjct: 222 LNSEESPLVDAKFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVK 281

Query: 312 PDPYAPLIMGCDIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISG 362
                  +   D+A   G D +V+ +        +R  +   + +++         KI  
Sbjct: 282 IAKGWGWVACVDVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKIFA 341

Query: 363 LV--EKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKR 402
               E++    I ID +  G  T D + E  G  V R+   K+
Sbjct: 342 ECNPERFPNITIAIDGDGLGKSTADLMYERYGITVQRIRWGKK 384


>gi|225155389|ref|ZP_03723881.1| hypothetical protein ObacDRAFT_9437 [Opitutaceae bacterium TAV2]
 gi|224803845|gb|EEG22076.1| hypothetical protein ObacDRAFT_9437 [Opitutaceae bacterium TAV2]
          Length = 479

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 56/227 (24%), Positives = 94/227 (41%), Gaps = 13/227 (5%)

Query: 176 YSEERPDTFVGHHNTYG--MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRL 233
           ++ +R   F G H   G  + II DEA    D I +       +R      +  S+   L
Sbjct: 129 FATDRGGRFEGFHAYPGRPLLIILDEAKSIADDIFVA-----ADRCQPTMLLYISSWGGL 183

Query: 234 SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI 293
            G+F++ F++  D + +FQ        I P F E + A+YG DSD+ R  + GQ P+ + 
Sbjct: 184 FGRFHDAFSQ--DRFAQFQAGIADCPHITPEFIEAMRAQYGEDSDIYRSMILGQRPKGNE 241

Query: 294 DSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW-SKTD 352
             F+   +  E     P         + CD AE   D  V+  R G  +  +  W    +
Sbjct: 242 TGFVVPFVDYERCESNPPVWQEGTKQVFCDFAET-SDECVIAKRDGNRLSIVDAWIPDGN 300

Query: 353 LRTTNNKISGLVEKYRPDAIII--DANNTGARTCDYLEMLGYHVYRV 397
                ++  G + + + +  +I  DA+ TG      L + G  +  V
Sbjct: 301 TAGITDRFEGHLRRLQNEGFVIRGDADGTGHGYITALSLRGIKISGV 347


>gi|331649955|ref|ZP_08351031.1| terminase B protein (PACase B protein) (DNA packaging B protein)
           [Escherichia coli M605]
 gi|331041212|gb|EGI13366.1| terminase B protein (PACase B protein) (DNA packaging B protein)
           [Escherichia coli M605]
          Length = 494

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 77/338 (22%), Positives = 139/338 (41%), Gaps = 32/338 (9%)

Query: 81  AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEV-SKWLSLLPNKH 139
           ++++G G GK+ + + + +  +   PG  VI +AN   Q+   ++  + S W + +    
Sbjct: 53  SVTSGHGTGKSDMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFP 112

Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
           W   +   L    ++         I  K     CR  +EE      G H  + + II DE
Sbjct: 113 WLS-KYFILTETSFFEVTGKGVWTILIKS----CRPGNEE---ALAGEHADHLLYII-DE 163

Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQ 252
           ASG  D     I G LT ++ NR  ++ S P R SG FY+  ++       P   +    
Sbjct: 164 ASGVSDKAFSVITGALTGKD-NRI-LLLSQPTRPSGYFYDSHHRLAIRPGNPDGLFTAII 221

Query: 253 IDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
           +++     +D  F    +A Y G D+ +  ++V G+FP+      +  + +E A  R+  
Sbjct: 222 LNSEESPLVDAKFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVK 281

Query: 312 PDPYAPLIMGCDIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISG 362
                  +   D+A   G D +V+ +        +R  +   + +++         KI  
Sbjct: 282 IAKGWGWVACVDVAGGTGRDKSVINIMMVSGQRNKRRVINYRMQEYTDVTETQLAAKIFA 341

Query: 363 LV--EKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRV 397
               E++    I ID +  G  T D + E  G  V R+
Sbjct: 342 ECNPERFPNITIAIDGDGLGKSTADLMYERYGITVQRI 379


>gi|261381054|ref|ZP_05985627.1| phage terminase, large subunit, PBSX family [Neisseria subflava
           NJ9703]
 gi|284796087|gb|EFC51434.1| phage terminase, large subunit, PBSX family [Neisseria subflava
           NJ9703]
          Length = 450

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 45/194 (23%), Positives = 89/194 (45%), Gaps = 35/194 (18%)

Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378
           I+G D+A+EG D +  +LR G V+  + +W   D+  + +K+    ++ + D I+ D+  
Sbjct: 241 ILGFDVADEGDDASATILRHGSVVIDMDEWRGQDVIYSADKVYLYGQEAKADKIVYDSIG 300

Query: 379 TGA-------RTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW-------- 423
            GA       R    ++ +G++    + +  A   +  +N+    ++K   W        
Sbjct: 301 VGAGVKAQFRRKTGKVQTIGFNAGGSVFKPEARYTDDKKNKDMFSNIKAQAWWMVRERFY 360

Query: 424 -----LEFA------SLINHSGLIQNLKSLKSFI------VPNTGELAIESKR---VKGA 463
                +EF        LI+ SG +++L+ LK+ +        N G + +ESK+    +G 
Sbjct: 361 KTWRAIEFGDTYPIDELISISGSLKDLEYLKAELSRPRVDYDNNGRVKVESKKDMAKRGI 420

Query: 464 KSTDYSDGLMYTFA 477
            S + +D L+  FA
Sbjct: 421 PSPNRADALIMAFA 434


>gi|320103661|ref|YP_004179252.1| hypothetical protein Isop_2123 [Isosphaera pallida ATCC 43644]
 gi|319750943|gb|ADV62703.1| hypothetical protein Isop_2123 [Isosphaera pallida ATCC 43644]
          Length = 553

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 77/349 (22%), Positives = 128/349 (36%), Gaps = 39/349 (11%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141
           ++ G  +GK+ L A L LW + T PG  V+  A S+  L T L+ E+ K L+    +   
Sbjct: 68  VATGNAVGKSYLAAGLTLWWLYTHPGSLVVATAPSQGLLGTVLFRELQKALA-ASRRRGL 126

Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201
            +  + +         L    G         C   +    +   G H+   M ++ DEAS
Sbjct: 127 GLPGMVVGSDRGTPFSLRVGPGRRLAAEGWGCLGIATRGVERLAGRHHADLMVVV-DEAS 185

Query: 202 GT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQID------ 254
           G  P+         LT  N  + ++   NP      F+++  + L +     I       
Sbjct: 186 GVQPEAWE-----ALTSLNPRKLFV-CGNPLTPGTVFHKLHQRGLTEASDPSIPDHARGV 239

Query: 255 --------------TRTVEGI-DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPL 299
                          R+  G+ D  F      ++G  S +    V G FP   + + I  
Sbjct: 240 ALTIPSTASPDINLERSPRGLADRGFIREAERQWGRGSPLWLSHVEGVFPTVAVHALIEP 299

Query: 300 NIIEEALNREPCP---DPYAPLIMGCDIAEE-GGDNTVVVLRRGPVIEHLFDWSKTDLRT 355
             +++A + E      +P    ++GCD+A   G D T +V+R    I  L    +     
Sbjct: 300 GWLDQAASLERSQTYENPPGQPVLGCDLAAGVGADRTAIVVRDEGGIRELIASDRLAPDE 359

Query: 356 TNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLG---YHVYRVLG 399
               I+ L  K+   P+ I+ D    GA     L   G    H   + G
Sbjct: 360 AATLIASLARKHLIAPERILYDGAGLGAELTTRLARQGPGFVHARAIFG 408


>gi|298387330|ref|ZP_06996883.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|298259999|gb|EFI02870.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 500

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 62/222 (27%), Positives = 93/222 (41%), Gaps = 25/222 (11%)

Query: 277 SDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP---LIMGCDIAEEGGDNTV 333
           +D+ R++V G FP+   D  IP   IE A  R     PY P     +G D+A  G DN+V
Sbjct: 264 NDLFRIKVRGMFPKVAEDVLIPYEWIEIANKRWQENHPYRPRKSCKLGVDVAGMGRDNSV 323

Query: 334 VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYR---PDAIIIDANNTGARTCDYLEML 390
              R G  +   FD  ++  + ++  + G    Y+    D I ID    GA     L   
Sbjct: 324 FCPRYGNYVSQ-FDVFQSAGKASHMHVVGKALSYKRTDRDIIFIDTIGEGAGVYSRLVEQ 382

Query: 391 G----YHVYRVLGQKRAVDL--EFC-RNRRTELHVKMADWLE----FASLINHSGLIQNL 439
           G    + V    G K   D+  E+   N R  L+  + DWL+    F  ++         
Sbjct: 383 GIRNIFSVKNSQGAKGLHDITGEYSFANMRAYLYWALRDWLDPKNNFFPMLPPCDQFTEE 442

Query: 440 KSLKSFIVPNTGELAIE-----SKRVKGAKSTDYSDGLMYTF 476
            +   +   + G++ IE      KR+K  +S DY D L  TF
Sbjct: 443 ATETKWKFRSDGKILIEPKEEIKKRIK--RSPDYMDALSETF 482


>gi|186682890|ref|YP_001866086.1| hypothetical protein Npun_R2589 [Nostoc punctiforme PCC 73102]
 gi|186465342|gb|ACC81143.1| hypothetical protein Npun_R2589 [Nostoc punctiforme PCC 73102]
          Length = 543

 Score = 50.8 bits (120), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 88/374 (23%), Positives = 143/374 (38%), Gaps = 85/374 (22%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141
           + A  G GK+ + + LV++ +    G++ I  A SE Q+K  LWAE+ K   L   K   
Sbjct: 64  VKAAHGTGKSFIASLLVIYFLFCVGGVA-ITTAPSEDQVKWILWAELRKIHGLHKTKLGG 122

Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201
               + L     +S+ ++ + GI S+ YS           ++F G H    + +I DEA 
Sbjct: 123 RCDIMQL----LFSETVY-AFGITSRDYSE----------NSFQGQHRQKQL-LIEDEAD 166

Query: 202 GTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI------------FNKPLDDWK 249
           G    I+ G +  LT   ++   +   NP     +F +             F+ P   W 
Sbjct: 167 GITPQIDNGFIACLT--GSDNRGLRIGNPVDPQSQFAKTCKLDKRCLTVSAFSHPNVSWA 224

Query: 250 RFQIDTRTVEGIDPSFHEGIIARYG--------------------LDSD-VTRV------ 282
            +++    V  + P   E II   G                    +  D + RV      
Sbjct: 225 -YELCADGVYRLKPEVAEHIINEDGEIKPQQEWPPEFPRDRIPGAISIDWIERVRREKFE 283

Query: 283 -------EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY-------APLIMGCDIAEEG 328
                   V G++ +   D  I L ++++A +       Y        P  +G D+  +G
Sbjct: 284 TSAYWKGRVMGEYAEDAADGIILLTLLKQARSLYDQNPQYWDAIAKRYPWRLGLDVG-DG 342

Query: 329 GDNTVVVLRRGPVI-EHLFDWSKTDLRTTN-------NKISGLVEKYRPDAIIIDANNTG 380
           GD   + L RGPV+ E     +K DL  T        ++I  L   Y   +I +D    G
Sbjct: 343 GDPHALALLRGPVLYEVQIHPTKGDLLDTERAADIAASQIKLLGTGY---SIAVDNTGVG 399

Query: 381 ARTCDYLEMLGYHV 394
           A T   L+  GY  
Sbjct: 400 AGTLAKLKKTGYQA 413


>gi|320091491|gb|ADW08983.1| terminase-like protein [Clavibacter phage CN77]
          Length = 414

 Score = 50.1 bits (118), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 55/236 (23%), Positives = 91/236 (38%), Gaps = 47/236 (19%)

Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP--LDDWKRFQI-- 253
           DEA G P  +  G    +T +++    +   NP     +F+ IF  P  +D+W  F I  
Sbjct: 51  DEAGGVPPELFTGAEAVMTGQDSK--IVAIGNPDSRGTEFHRIFTVPALMDEWNTFTISA 108

Query: 254 -DTRTVEG--------------------IDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQ 291
            D  TV G                    +D   H+  + + G   D     +V G+FP +
Sbjct: 109 YDLPTVTGEVVYPDHPEKQERMLKGLTSLDWIQHKERVWKVGGKPDGRFLAKVLGEFPGE 168

Query: 292 DIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD---- 347
             ++F P   I+   N      P   +IMG D+A  G D++VV   +G  +  LF     
Sbjct: 169 TDNAFFPQEAIDRG-NDTTIDKPEKGIIMGVDLARMGDDDSVVYTNQGGRV-RLFKGQVR 226

Query: 348 -------------WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390
                        WSK +   +  ++  +  +     + +D++  G    D LE L
Sbjct: 227 YSDREGTKTTTGVWSKENTVASARRVHAIAMQIGAKQVRLDSSGIGGAVFDELEQL 282


>gi|134287454|ref|YP_001109621.1| hypothetical protein Bcep1808_7700 [Burkholderia vietnamiensis G4]
 gi|134131876|gb|ABO60570.1| hypothetical protein Bcep1808_7700 [Burkholderia vietnamiensis G4]
          Length = 509

 Score = 50.1 bits (118), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 79/363 (21%), Positives = 147/363 (40%), Gaps = 54/363 (14%)

Query: 65  HCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTL 124
           H +   ++ + +  + ++S+G G GKT+  A + LW +      + I  A   + +   +
Sbjct: 40  HQIQMFDSVSKQGSRTSVSSGHGTGKTSGFAIIALWHLLCYYLSNTILTAPKISTVSDGV 99

Query: 125 WAEVSKWLSLLPNK------HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178
           W E +   + + N        +F ++S  ++   +              ++  + ++   
Sbjct: 100 WKEFADLSTKISNGPQSWIWEYFVIESERVYVRGY------------KLNWFVIAKSAPR 147

Query: 179 ERPDTFVGHHNTYGMAIINDEASGTPDVINLGIL-GFLTERNANRFWIMTSNPRRLSGKF 237
             P+   G H  + +  + DEASG PD  N G++ G LT+   NR   + S P R SG F
Sbjct: 148 GSPENLAGAHRDW-LLWLADEASGIPD-DNFGVITGSLTDER-NRM-CLASQPTRSSGFF 203

Query: 238 YEIFN----KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLD--SDVTRVEVCGQFPQQ 291
           YE  +         W     ++       P      IA   L    +  +++V G+FP+ 
Sbjct: 204 YETHHALSRAEGGPWNNLVFNSE----FSPIVSAKFIAEKKLQYTEEEYQIKVQGRFPEN 259

Query: 292 DIDSFIPLNIIEEALNREPC-PDPYAPLIMGCDIAEEG-GDNTVV----VLRRGPVIEHL 345
                +    IE  + R    PD +   ++  D+   G  D TV+    V+ RG   E+ 
Sbjct: 260 SSKYLVGPQAIEACVGRTVIKPDEHWGWLLPVDVGGGGWRDETVMPALHVIGRG---EYG 316

Query: 346 FDWSKTDLRTT--------NNKISGLV---EKYRPDAI-IIDANNTGARTCDYLEMLGYH 393
            D  +  L +           ++ G++    + R +A  +IDA   G   C  L++ G+ 
Sbjct: 317 MDARRAQLISVPLHSNTQDPAQLHGVIVHAARERSNATAMIDAGGMGLIVCKQLDLDGFS 376

Query: 394 VYR 396
            YR
Sbjct: 377 QYR 379


>gi|284162607|ref|YP_003401230.1| hypothetical protein Arcpr_1511 [Archaeoglobus profundus DSM 5631]
 gi|284012604|gb|ADB58557.1| protein of unknown function DUF264 [Archaeoglobus profundus DSM
           5631]
          Length = 435

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 83/347 (23%), Positives = 135/347 (38%), Gaps = 53/347 (15%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141
           + AGR  GKT   A   ++   T PG     +A S  Q    ++ ++ ++LS    K   
Sbjct: 44  VVAGRRFGKTECMAVSAIYYALTNPGSIQFVIAPSYDQ-SNIMFGQIVQFLS----KSIL 98

Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201
                 ++  P++    H     DS     +    S  +P+   GH       II DEA+
Sbjct: 99  GCMIRRIYKTPFH----HIIFKNDS-----VIHARSASKPEFLRGHK---AHRIILDEAA 146

Query: 202 GTP-DVINLGILGFLTERNANRFWIMTSNPRRLSGK--FYEIFNK----PLDDWKRFQID 254
             P DVI+  I   L + N +  WI    P    GK  FY+ + K       D+  ++  
Sbjct: 147 FIPDDVISNIIEPMLADYNGS--WIKIGTP---FGKNHFYDTYLKGQSPDFPDYSSYRFP 201

Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFP------------QQDIDSFIPLNII 302
           +     I   F E     YG +S + R E   +F             Q+++D+ I L   
Sbjct: 202 STVNPHISHEFIEKKKREYGENSIIFRTEYLAEFVEDQNAVFRWADIQKNVDNSIELIDS 261

Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
            E ++++         ++GCD+A+      +VVL        L  + + + R     I  
Sbjct: 262 AENVSKQ--------YVIGCDLAKYQDYTVIVVLDVTEKPYKLVHFERFNRRPYAEVIMR 313

Query: 363 LVEKYRP---DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406
           L E YR      ++ID+   G    + L+ +G   Y V   K  V L
Sbjct: 314 LKELYRRFNYAKVLIDSTGVGDPVLEDLQDVGAEGY-VFTPKSKVQL 359


>gi|159897183|ref|YP_001543430.1| hypothetical protein Haur_0654 [Herpetosiphon aurantiacus ATCC
           23779]
 gi|159890222|gb|ABX03302.1| conserved hypothetical protein [Herpetosiphon aurantiacus ATCC
           23779]
          Length = 472

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 93/394 (23%), Positives = 145/394 (36%), Gaps = 82/394 (20%)

Query: 78  FKGAISAGRGIGKTTLNAWLV-LWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLP 136
           ++  + A   +GKT L   LV  W  S  PG+ V+  A ++ Q++  LW EV   +    
Sbjct: 36  YRTLVKACHKVGKTHLGGGLVNWWYDSFDPGL-VLTTAPTDRQVRDLLWKEVR--MQRRG 92

Query: 137 NKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAII 196
              +   +S  L   P               H++     ++ +  D+F GHH+ + + I 
Sbjct: 93  RAGFTGPKSPRLESTP--------------DHFA---HGFTAKDGDSFQGHHSPHTLFIF 135

Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNP---------RRLSGKFYEI------- 240
            DEA G   V          E  A   W+   NP           LSG ++ I       
Sbjct: 136 -DEAVGVASVFWETAESMFNEGGA---WLAIFNPTDTSSQAYAEELSGGWHVISMSVLEH 191

Query: 241 ---------FNKPLDDWKRF-QIDT------RTVEGIDPSFHEGIIAR--YGLDSDVTRV 282
                       P     R  ++DT      R +   +P     I  R  +     +   
Sbjct: 192 PNILAELQGLPPPFPSAIRLSRVDTLLKKWCRALSPEEPKRATDIHWRDAWYRPGPIAEA 251

Query: 283 EVCGQFPQQ---DIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRG 339
            + G++P Q   ++ S     + E  L     P    P  +GCD+A  G D T + +RRG
Sbjct: 252 RLLGRWPSQATNNVWSDGAFQVAESLL----LPASDEPCELGCDVARYGDDFTEIHVRRG 307

Query: 340 P---VIEHLFDWSKTDLRTTNNKISGLVEKY--------RPDAIIIDANNTGARTCDYLE 388
                 E    WS  +   T  ++  L  +Y        R  A+ ID +  G    D  +
Sbjct: 308 GHSLYHEAANGWSTVE---TAGRLKQLANEYGRRCGVDGRAVAVKIDDDGIGGGVVDLAD 364

Query: 389 MLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422
             GY    V G + A D E   NRR+EL   +A+
Sbjct: 365 --GYTFLGVSGARTAYDPEKYPNRRSELWFSVAE 396


>gi|241763591|ref|ZP_04761642.1| phage terminase large subunit [Acidovorax delafieldii 2AN]
 gi|241367184|gb|EER61538.1| phage terminase large subunit [Acidovorax delafieldii 2AN]
          Length = 521

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 54/209 (25%), Positives = 88/209 (42%), Gaps = 21/209 (10%)

Query: 295 SFIPLNIIEEALNR-EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS---K 350
             IP   ++ A  R +P  D     ++G D A  G D T V  R     + L        
Sbjct: 290 QLIPTEWVKAAQARWQPRQDKGPMTVLGLDPARGGTDKTSVARRHDCWFDVLISEPGIVT 349

Query: 351 TDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC- 409
            D  TT    + LV    P  I +DA   G+   D+++ LG  VY V+G +R+  ++   
Sbjct: 350 KDGPTTAAFTAPLVRNGAP--IAVDAIGIGSSALDFIQGLGLLVYAVVGSERSDHMDKAG 407

Query: 410 ----RNRRTELHVKMADWLEFA-----SLINHSGLIQNLKSLKSFIVPNTGELAIESK-- 458
               RNRR E++ ++ + L+       +L     L+ +L +++  +V      AI+ +  
Sbjct: 408 TMRFRNRRAEMYWRLREALDPTAEQPIALPPDQELLGDLTAVRYKVVTMGQGAAIQIRDK 467

Query: 459 ---RVKGAKSTDYSDGLMYTFAENPPRSD 484
              R    +S D  D +  TF E  P  D
Sbjct: 468 DEIREALGRSPDKGDSVAMTFCEGIPLLD 496


>gi|161789175|ref|YP_001595730.1| PacB [Vibrio sp. 0908]
 gi|161761461|gb|ABX77106.1| PacB [Vibrio sp. 0908]
          Length = 572

 Score = 47.8 bits (112), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 43/172 (25%), Positives = 77/172 (44%), Gaps = 12/172 (6%)

Query: 67  LNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA 126
           +  +N   P   + ++++G G GK+ L A L L  + T P    +  ANS  Q+   +++
Sbjct: 50  IEVINALTPVGARVSVASGHGTGKSHLTAALCLHFIITHPESLCMLTANSLDQVTNVVFS 109

Query: 127 EVSK-WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            + + W+ +   + W E Q   +    +Y+       G+    +    +T S+   +   
Sbjct: 110 YIKRCWVKICQRQPWLE-QYFVITAKSFYAKGYK---GV----WQIFGKTCSKGNEEGLA 161

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
           G H    M ++ DEASG  D     + G LTE N N+  ++ S   R +G F
Sbjct: 162 GQHRRDYMVVV-DEASGVSDRAFEVLRGALTEDN-NKM-LLISQFTRPTGHF 210


>gi|260580755|ref|ZP_05848581.1| phage terminase large subunit [Haemophilus influenzae RdAW]
 gi|260092572|gb|EEW76509.1| phage terminase large subunit [Haemophilus influenzae RdAW]
          Length = 447

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 50/203 (24%), Positives = 87/203 (42%), Gaps = 35/203 (17%)

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
           +G D+A+EG D+       G V+  +  W   D+  + N+ +    K++ D II D+   
Sbjct: 245 VGFDVADEGADSNANAFVHGSVVLDIEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGV 304

Query: 380 GA-------RTCDYLEMLGYHV---------YRVLGQK--------RAVDLEFCRNR--R 413
           GA       R    L++ G++            + G+K        +A      R+R  +
Sbjct: 305 GAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRDRFYK 364

Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFI------VPNTGELAIESK---RVKGAK 464
           T   VK  D      LI+ S  I+ L+ LK+ +        N G + +ESK   + +G  
Sbjct: 365 TYRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIP 424

Query: 465 STDYSDGLMYTFAENPPRSDMDF 487
           S + +D L+  +A   P+S +D 
Sbjct: 425 SPNMADALVMCYAPTKPKSLLDL 447


>gi|16273317|ref|NP_439561.1| terminase large subunit-like protein [Haemophilus influenzae Rd
           KW20]
 gi|1175785|sp|P44184|Y1410_HAEIN RecName: Full=Uncharacterized protein HI_1410
 gi|1574247|gb|AAC23058.1| predicted coding region HI1410 [Haemophilus influenzae Rd KW20]
          Length = 394

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 50/203 (24%), Positives = 87/203 (42%), Gaps = 35/203 (17%)

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
           +G D+A+EG D+       G V+  +  W   D+  + N+ +    K++ D II D+   
Sbjct: 192 VGFDVADEGADSNANAFVHGSVVLDIEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGV 251

Query: 380 GA-------RTCDYLEMLGYHV---------YRVLGQK--------RAVDLEFCRNR--R 413
           GA       R    L++ G++            + G+K        +A      R+R  +
Sbjct: 252 GAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRDRFYK 311

Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFI------VPNTGELAIESK---RVKGAK 464
           T   VK  D      LI+ S  I+ L+ LK+ +        N G + +ESK   + +G  
Sbjct: 312 TYRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIP 371

Query: 465 STDYSDGLMYTFAENPPRSDMDF 487
           S + +D L+  +A   P+S +D 
Sbjct: 372 SPNMADALVMCYAPTKPKSLLDL 394


>gi|85058727|ref|YP_454429.1| phage terminase large subunit [Sodalis glossinidius str.
           'morsitans']
 gi|84779247|dbj|BAE74024.1| phage terminase large subunit [Sodalis glossinidius str.
           'morsitans']
          Length = 456

 Score = 47.0 bits (110), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 37/69 (53%)

Query: 313 DPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI 372
           +P     +G D+A+EG D+  ++L  G V+ HL  W+K D+  + +++    E    D I
Sbjct: 234 EPAGKKRIGFDVADEGEDSNALILSHGSVVMHLETWNKGDVIQSADRVKNYAESVIADEI 293

Query: 373 IIDANNTGA 381
           I D+   GA
Sbjct: 294 IFDSIGVGA 302


>gi|282880015|ref|ZP_06288737.1| hypothetical protein HMPREF9019_0946 [Prevotella timonensis CRIS
           5C-B1]
 gi|281306129|gb|EFA98167.1| hypothetical protein HMPREF9019_0946 [Prevotella timonensis CRIS
           5C-B1]
          Length = 459

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 100/230 (43%), Gaps = 33/230 (14%)

Query: 277 SDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-------EPCPDPYAPLIMGCDIAEEGG 329
           +D+ R++V G FP+   D+ IP   +E A +R       +  P  YA +  G D+A  G 
Sbjct: 221 NDLFRIKVLGLFPKASEDTLIPFEWLELAHDRWKKLNAEDFVPRKYARV--GIDVAGMGR 278

Query: 330 DNTVVVLRRG---PVIEHLFDWSKTD-LRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385
           D++  VLR G   P I+      K D ++     +  LVEK     ++ID    GA    
Sbjct: 279 DSSCFVLRYGNYVPEIKIHQSGGKADHMKVAGEAVQWLVEK--NTKVMIDTIGEGAGVYS 336

Query: 386 YLEMLGY-HVYRVL---GQKRAVDL----EFCRNRRTELHVKMADWLEFASLINHS---- 433
            L  LGY + Y      G K   D+    EF  N R   +  + DWL   +  N +    
Sbjct: 337 RLLELGYDNAYSCKFSEGTKGLHDITGQYEFA-NMRAYCYWAVRDWLNPKNGFNPALPPC 395

Query: 434 -GLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479
             L   L  +  +   ++G + IE K   + +  +S D +D L+ TF  N
Sbjct: 396 DELDAELTEVH-WSFQSSGSIIIEPKENIKSRLKRSPDRADALISTFYPN 444


>gi|68250076|ref|YP_249188.1| phage terminase large subunit [Haemophilus influenzae 86-028NP]
 gi|68058275|gb|AAX88528.1| predicted phage terminase large subunit [Haemophilus influenzae
           86-028NP]
          Length = 447

 Score = 45.8 bits (107), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 50/203 (24%), Positives = 87/203 (42%), Gaps = 35/203 (17%)

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
           +G D+A+EG D+       G V+  +  W   D+  + N+ +    K++ D II D+   
Sbjct: 245 VGFDVADEGADSNDNAFVHGSVVLDIEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGV 304

Query: 380 GA-------RTCDYLEMLGYHV---------YRVLGQK--------RAVDLEFCRNR--R 413
           GA       R    L++ G++            + G+K        +A      R+R  +
Sbjct: 305 GAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRDRFYK 364

Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFI------VPNTGELAIESK---RVKGAK 464
           T   VK  D      LI+ S  I+ L+ LK+ +        N G + +ESK   + +G  
Sbjct: 365 TYRAVKHGDVYPDDELISLSSNIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIP 424

Query: 465 STDYSDGLMYTFAENPPRSDMDF 487
           S + +D L+  +A   P+S +D 
Sbjct: 425 SPNMADALVMCYATTKPKSLLDL 447


>gi|319776448|ref|YP_004138936.1| phage terminase large subunit [Haemophilus influenzae F3047]
 gi|319897217|ref|YP_004135412.1| phage terminase large subunit [Haemophilus influenzae F3031]
 gi|329123931|ref|ZP_08252483.1| phage terminase large subunit [Haemophilus aegyptius ATCC 11116]
 gi|317432721|emb|CBY81084.1| predicted phage terminase large subunit [Haemophilus influenzae
           F3031]
 gi|317451039|emb|CBY87270.1| predicted phage terminase large subunit [Haemophilus influenzae
           F3047]
 gi|327468126|gb|EGF13613.1| phage terminase large subunit [Haemophilus aegyptius ATCC 11116]
          Length = 447

 Score = 45.8 bits (107), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 49/203 (24%), Positives = 85/203 (41%), Gaps = 35/203 (17%)

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
           +G D+A+EG D        G V+  +  W   D+  + N+ +    K++ D II D+   
Sbjct: 245 VGFDVADEGADANANAFVHGSVVLGVEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGV 304

Query: 380 GA-------RTCDYLEMLGYHV--------YRVLGQKRAVDLE---------FCRNR--R 413
           GA       R    L++ G++            +  K+  D+            R+R  +
Sbjct: 305 GAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKDKKNQDMFSNIKAQSWWALRDRFYK 364

Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFI------VPNTGELAIESK---RVKGAK 464
           T   VK  D      LI+ S  I+ L+ LK+ +        N G + +ESK   + +G  
Sbjct: 365 TYRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIP 424

Query: 465 STDYSDGLMYTFAENPPRSDMDF 487
           S + +D L+  +A   P+S +D 
Sbjct: 425 SPNMADALVMCYAPTKPKSLLDL 447


>gi|145629503|ref|ZP_01785301.1| predicted phage terminase large subunit [Haemophilus influenzae
           22.1-21]
 gi|145641440|ref|ZP_01797019.1| predicted phage terminase large subunit [Haemophilus influenzae
           R3021]
 gi|144978346|gb|EDJ88110.1| predicted phage terminase large subunit [Haemophilus influenzae
           22.1-21]
 gi|145273983|gb|EDK13850.1| predicted phage terminase large subunit [Haemophilus influenzae
           22.4-21]
 gi|309750959|gb|ADO80943.1| Probable bacteriophage terminase, large subunit [Haemophilus
           influenzae R2866]
          Length = 447

 Score = 45.4 bits (106), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 49/203 (24%), Positives = 85/203 (41%), Gaps = 35/203 (17%)

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
           +G D+A+EG D        G V+  +  W   D+  + N+ +    K++ D II D+   
Sbjct: 245 VGFDVADEGADANANAFVHGSVVLGVEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGV 304

Query: 380 GA-------RTCDYLEMLGYHV--------YRVLGQKRAVDLE---------FCRNR--R 413
           GA       R    L++ G++            +  K+  D+            R+R  +
Sbjct: 305 GAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKDKKNQDMFSNIKAQSWWALRDRFYK 364

Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFI------VPNTGELAIESK---RVKGAK 464
           T   VK  D      LI+ S  I+ L+ LK+ +        N G + +ESK   + +G  
Sbjct: 365 TYRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIP 424

Query: 465 STDYSDGLMYTFAENPPRSDMDF 487
           S + +D L+  +A   P+S +D 
Sbjct: 425 SPNMADALVMCYAPTKPKSLLDL 447


>gi|145638997|ref|ZP_01794605.1| terminase large subunit-like protein [Haemophilus influenzae
           PittII]
 gi|145271969|gb|EDK11878.1| terminase large subunit-like protein [Haemophilus influenzae
           PittII]
          Length = 379

 Score = 45.4 bits (106), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 49/203 (24%), Positives = 85/203 (41%), Gaps = 35/203 (17%)

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
           +G D+A+EG D        G V+  +  W   D+  + N+ +    K++ D II D+   
Sbjct: 177 VGFDVADEGADANANAFVHGSVVLGVEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGV 236

Query: 380 GA-------RTCDYLEMLGYHV--------YRVLGQKRAVDLE---------FCRNR--R 413
           GA       R    L++ G++            +  K+  D+            R+R  +
Sbjct: 237 GAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKDKKNQDMFSNIKAQSWWALRDRFYK 296

Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFI------VPNTGELAIESK---RVKGAK 464
           T   VK  D      LI+ S  I+ L+ LK+ +        N G + +ESK   + +G  
Sbjct: 297 TYRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIP 356

Query: 465 STDYSDGLMYTFAENPPRSDMDF 487
           S + +D L+  +A   P+S +D 
Sbjct: 357 SPNMADALVMCYAPTKPKSLLDL 379


>gi|189460514|ref|ZP_03009299.1| hypothetical protein BACCOP_01155 [Bacteroides coprocola DSM 17136]
 gi|189432758|gb|EDV01743.1| hypothetical protein BACCOP_01155 [Bacteroides coprocola DSM 17136]
          Length = 556

 Score = 44.7 bits (104), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 62/235 (26%), Positives = 89/235 (37%), Gaps = 43/235 (18%)

Query: 278 DVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL-----IMGCDIAEEGGDNT 332
           D+ R +V G FP+ D D+ IP   +EEA  R        PL     I+G D+A  G D T
Sbjct: 309 DLFRKKVLGLFPKVDEDTLIPRQWLEEAHERWKQAKGREPLRADLNILGVDVAGMGRDAT 368

Query: 333 VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI----IIDANNTGAR------ 382
             VLRR   +   FD   +     + K++G +   R   I     ID    GA       
Sbjct: 369 CYVLRRDNWVAS-FDTHNSGGVADHMKVAGKIMVARRQNIGLYVSIDTIGEGAGVYSRCV 427

Query: 383 ----------TCDYLEML----GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL---- 424
                     +C Y E      G  +  + GQ +        N R  L   + DWL    
Sbjct: 428 ELEDEPHYILSCKYSESAKTPNGRELSDITGQNKFF------NMRAYLFWAVRDWLNPRN 481

Query: 425 EFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTF 476
              +++          +   F V + G+L IE K   + +  +S D  D L  TF
Sbjct: 482 NTGAMLPPDDKFDEEATEIKFSVKSNGKLYIEPKEDIKERLGRSPDKFDALANTF 536


>gi|53793591|ref|YP_112491.1| terminase large subunit [Flavobacterium phage 11b]
 gi|53748181|emb|CAH56642.1| terminase large subunit [Flavobacterium phage 11b]
          Length = 432

 Score = 44.3 bits (103), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 44/176 (25%), Positives = 84/176 (47%), Gaps = 21/176 (11%)

Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP--DA 371
           P+  + +  DIA  G D  V+ +  G  +  +F  +K+ +      + GL  K++     
Sbjct: 248 PFGEMYISADIARFGSDKMVICVWSGFRVVEIFSMAKSSITEIAEAVRGLSIKHKVPLSN 307

Query: 372 IIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLE----FCRNRRTELHVKMADWLEFA 427
           +I D +  G    D L   G+     +   RA++++      +N +T+ + K+A+ ++  
Sbjct: 308 VICDEDGVGGGVVDVLGCTGF-----INNSRAMEVDNQVVQYQNLKTQCYYKLAEVIQSN 362

Query: 428 SLINHS-------GLIQNLKSLKSFIVPNTGELAIESK-RVKGA--KSTDYSDGLM 473
           +L  HS        + + L+ +K   + + G+L + SK +VK A  +S DYSD LM
Sbjct: 363 NLYIHSEDATVNDEITKELEQVKRDKIDSDGKLQLISKDKVKQAIGRSPDYSDALM 418


>gi|301170180|emb|CBW29784.1| predicted phage terminase large subunit [Haemophilus influenzae
           10810]
          Length = 447

 Score = 44.3 bits (103), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 49/203 (24%), Positives = 86/203 (42%), Gaps = 35/203 (17%)

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
           +G D+A+EG D+       G V+  +  W    +  + N+ +    K++ D II D+   
Sbjct: 245 VGFDVADEGADSNANAFVHGSVVLDIEVWKNGYVIDSANRTNQSAVKFKADLIIFDSIGV 304

Query: 380 GA-------RTCDYLEMLGYHV---------YRVLGQK--------RAVDLEFCRNR--R 413
           GA       R    L++ G++            + G+K        +A      R+R  +
Sbjct: 305 GAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRDRFYK 364

Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFI------VPNTGELAIESK---RVKGAK 464
           T   VK  D      LI+ S  I+ L+ LK+ +        N G + +ESK   + +G  
Sbjct: 365 TYRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIP 424

Query: 465 STDYSDGLMYTFAENPPRSDMDF 487
           S + +D L+  +A   P+S +D 
Sbjct: 425 SPNMADALVMCYAPTKPKSLLDL 447


>gi|329119006|ref|ZP_08247700.1| phage terminase large subunit [Neisseria bacilliformis ATCC
           BAA-1200]
 gi|327464879|gb|EGF11170.1| phage terminase large subunit [Neisseria bacilliformis ATCC
           BAA-1200]
          Length = 449

 Score = 42.7 bits (99), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 28/112 (25%), Positives = 50/112 (44%), Gaps = 7/112 (6%)

Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378
           I+G D+A+EG D    VLR G V+  +  W   D+  + +K+    ++   D I+ D   
Sbjct: 240 ILGFDVADEGDDANATVLRHGSVVTDMQQWRGQDVIYSADKVYLYAQEQNVDRIVYDNIG 299

Query: 379 TGA-------RTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            GA       R    ++ LG++    + +  A   +  +NR    ++K   W
Sbjct: 300 VGAGVKAQFRRKNGKVQTLGFNAGGAVYKPDAKYTDDKKNRDMFANIKAQAW 351


>gi|254781186|ref|YP_003065599.1| hypothetical protein CLIBASIA_05465 [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|254040863|gb|ACT57659.1| hypothetical protein CLIBASIA_05465 [Candidatus Liberibacter
           asiaticus str. psy62]
          Length = 45

 Score = 42.4 bits (98), Expect = 0.21,   Method: Composition-based stats.
 Identities = 19/43 (44%), Positives = 29/43 (67%), Gaps = 1/43 (2%)

Query: 363 LVEKYRPDAIIIDANNTGARTCDYLEMLGYH-VYRVLGQKRAV 404
           +  +Y PDAI++ AN  GA T +YLE L Y  + ++LGQ+ +V
Sbjct: 1   MAHQYNPDAIVLYANGIGAVTANYLENLNYSPIEKILGQRSSV 43


>gi|153806881|ref|ZP_01959549.1| hypothetical protein BACCAC_01156 [Bacteroides caccae ATCC 43185]
 gi|149131558|gb|EDM22764.1| hypothetical protein BACCAC_01156 [Bacteroides caccae ATCC 43185]
          Length = 513

 Score = 42.0 bits (97), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 67/234 (28%), Positives = 98/234 (41%), Gaps = 47/234 (20%)

Query: 277 SDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR---EPCPDPYAP---LIMGCDIAEEGGD 330
           +D+ RV+V G FP+   D  IP   IE A NR   E     + P     +G D+A  G D
Sbjct: 275 NDLFRVKVLGMFPKVSEDVLIPYEWIEIA-NRNWQELQASGFIPAKSCKLGVDVAGMGRD 333

Query: 331 NTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP--------DAI---------I 373
           N+V+  R G  +   FD  ++  R  +  + G+   Y          D I         +
Sbjct: 334 NSVLCPRYGNYVPQ-FDVHQSAGRADHMHVVGMTIPYLKKKGAKAFIDTIGEGAGVYSRL 392

Query: 374 IDANNTGARTCDYLEML-GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE-----FA 427
           ++   T A +C Y E   G H   + G+      EF  N R  L+  + DWL       A
Sbjct: 393 LEEEFTNAFSCKYSEGTDGLH--DITGE-----YEFA-NMRAYLYWALRDWLNPKNGFGA 444

Query: 428 SLINHSGLIQNLKSLKSFIVPNTGELAIE-----SKRVKGAKSTDYSDGLMYTF 476
           +L     L++     K   + N G++ IE      KR+K  +S DY D L  TF
Sbjct: 445 ALPPCDQLMEEATETKWKFLSN-GKVIIEPKEDVKKRIK--RSPDYMDALANTF 495


>gi|309379923|emb|CBX21334.1| unnamed protein product [Neisseria lactamica Y92-1009]
          Length = 449

 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 28/112 (25%), Positives = 50/112 (44%), Gaps = 7/112 (6%)

Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378
           I+G D+A+EG D    VLR G V+  +  W   D+  + +K+    ++   D I+ D   
Sbjct: 240 ILGFDVADEGDDANATVLRHGSVVTDMRQWRGQDVIYSADKVYLYAQEQDIDRIVYDNIG 299

Query: 379 TGA-------RTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            GA       R    ++ LG++    + +  A   +  +NR    ++K   W
Sbjct: 300 VGAGVKAQFRRKRGKVQTLGFNAGGAVYKPDAKYTDDKKNRDMFANIKAQAW 351


>gi|303243859|ref|ZP_07330199.1| protein of unknown function DUF264 [Methanothermococcus okinawensis
           IH1]
 gi|302485795|gb|EFL48719.1| protein of unknown function DUF264 [Methanothermococcus okinawensis
           IH1]
          Length = 445

 Score = 40.0 bits (92), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 71/328 (21%), Positives = 126/328 (38%), Gaps = 45/328 (13%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141
           ++AGR  GK+ L A+L+++L ST+       +A      +  ++ E+ K++      +  
Sbjct: 56  VAAGRRFGKSKLMAFLLIFLCSTQKNKKYAVIAPFYANAR-IIFRELKKYIE---KSNVL 111

Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201
                 +  +P+ +        ID +         S + P +  G   +Y + I+++ A 
Sbjct: 112 SRLVKRMVESPYMAIEFKTGCTIDFR---------SADNPTSIRGE--SYHLVILDEAAF 160

Query: 202 GTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR---FQIDTRTV 258
              DV+   I   L + +A    I T N       FYE F    +   R   F+  T T 
Sbjct: 161 IKDDVVKYVIKPLLLDYDAPLIEISTPNGH---NHFYESFLMGKNKQNRHISFRFPTWTN 217

Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQF------------PQQDIDSFIPLNIIEEAL 306
             +  +  E I    G DS V + E C +F             QQ ID  I L    E+ 
Sbjct: 218 PFLPKNAIEEIKQEVGEDSPVWKQEYCAEFIDNNEAVFNWEYIQQCIDGTIKLLKSGESG 277

Query: 307 NREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDL---RTTNNKISGL 363
           ++          +MG D+A+      + VL        L  + + +L       +K+  L
Sbjct: 278 HQ---------YVMGVDLAKFEDYTVITVLDVSVKPYKLVYFERFNLMPYSFVADKVKEL 328

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLG 391
            + +    + +DA   GA   + +E L 
Sbjct: 329 YQLFNKPQVCMDATGPGAAVVEQVESLN 356


>gi|310641214|ref|YP_003945972.1| malate dehydrogenase, nad-dependent [Paenibacillus polymyxa SC2]
 gi|309246164|gb|ADO55731.1| malate dehydrogenase, NAD-dependent [Paenibacillus polymyxa SC2]
          Length = 313

 Score = 40.0 bits (92), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 53/114 (46%), Gaps = 8/114 (7%)

Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLV----EKYRPDAIII 374
           IMG    E+  D+ +V++  G  I      S+ DL  TN  I   V    +KY PD+I+I
Sbjct: 64  IMGTSNYEDAADSDIVIITAG--IARKPGMSRDDLVNTNAGIVKSVCENVKKYAPDSIVI 121

Query: 375 DANN-TGARTCDYLEMLGYHVYRVLGQKRAVD-LEFCRNRRTELHVKMADWLEF 426
             +N   A T    + L +   RV+GQ   +D   +C     EL+V + D   F
Sbjct: 122 ILSNPVDAMTYTAYQTLDFPKNRVIGQSGVLDTARYCTFIAQELNVSVEDVRGF 175


>gi|226940436|ref|YP_002795510.1| Terminase large subunit [Laribacter hongkongensis HLHK9]
 gi|226715363|gb|ACO74501.1| Terminase large subunit [Laribacter hongkongensis HLHK9]
          Length = 93

 Score = 39.7 bits (91), Expect = 1.2,   Method: Composition-based stats.
 Identities = 22/59 (37%), Positives = 28/59 (47%), Gaps = 8/59 (13%)

Query: 31 FVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIG 89
          + LH + WG     LEG + PR+WQ E M  +  H        NP     A  AGRG+G
Sbjct: 22 WALHAYDWGRG--ELEGVTGPRAWQREVMSDIGNHL------KNPATRFSAFDAGRGLG 72


>gi|325295250|ref|YP_004281764.1| mutual gliding protein A [Desulfurobacterium thermolithotrophum DSM
           11699]
 gi|325065698|gb|ADY73705.1| mutual gliding protein A [Desulfurobacterium thermolithotrophum DSM
           11699]
          Length = 193

 Score = 39.7 bits (91), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/66 (31%), Positives = 38/66 (57%), Gaps = 2/66 (3%)

Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNT 332
           YG+D  +  + +  Q+ ++D+ + +P+ I+++ LNR  CPD  A  I G  + E   + T
Sbjct: 127 YGID--IKEIPLVFQYNKRDLPNVLPIEILKKDLNRWKCPDFEAIAIKGIGVLETFKEIT 184

Query: 333 VVVLRR 338
             VLR+
Sbjct: 185 KQVLRK 190


>gi|308068360|ref|YP_003869965.1| Malate dehydrogenase (Vegetative protein 69) [Paenibacillus
           polymyxa E681]
 gi|305857639|gb|ADM69427.1| Malate dehydrogenase (Vegetative protein 69) [Paenibacillus
           polymyxa E681]
          Length = 313

 Score = 38.9 bits (89), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 53/114 (46%), Gaps = 8/114 (7%)

Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLV----EKYRPDAIII 374
           I G    E+  ++ +V++  G  I      S+ DL  TN  I   V    +KY PD+I+I
Sbjct: 64  ITGTSNYEDAANSDIVIITAG--IARKPGMSRDDLVNTNAGIVKSVCENVKKYAPDSIVI 121

Query: 375 DANN-TGARTCDYLEMLGYHVYRVLGQKRAVD-LEFCRNRRTELHVKMADWLEF 426
             +N   A T    + LG+   RV+GQ   +D   +C     EL+V + D   F
Sbjct: 122 ILSNPVDAMTYTAYQTLGFPKNRVIGQSGVLDTARYCTFIAQELNVSVEDVRGF 175


>gi|22074007|gb|AAL05293.1| replication-associated protein [Tomato yellow leaf curl virus -
           Gezira]
          Length = 359

 Score = 38.5 bits (88), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 38/150 (25%), Positives = 61/150 (40%), Gaps = 23/150 (15%)

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF-PQQDIDSFIPLNIIEEA 305
           DW +FQID R+  G   S ++   A     S    + V  +  P+  I  F  LN   + 
Sbjct: 112 DWGQFQIDGRSARGGQQSANDAYAAAINSGSKAEALRVLRELAPRDYILQFHNLNSNLDR 171

Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365
           + +EP P PY+   +     +              V E L  W       + N +S    
Sbjct: 172 IFQEP-PAPYSSPFLSSSFNQ--------------VPEELEVW------VSENVMSSAAR 210

Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVY 395
            +RP++III+ ++   +T  +   LG H Y
Sbjct: 211 PWRPNSIIIEGDSRTGKTM-WARSLGPHNY 239


>gi|148826888|ref|YP_001291641.1| phage terminase large subunit [Haemophilus influenzae PittGG]
 gi|148718130|gb|ABQ99257.1| predicted phage terminase large subunit [Haemophilus influenzae
           PittGG]
          Length = 366

 Score = 38.1 bits (87), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 19/71 (26%), Positives = 34/71 (47%)

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
           +G D+A+EG D+       G V+  +  W   D+  + N+ +    K++ D II D+   
Sbjct: 245 VGFDVADEGADSNANAFVHGSVVLDIEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGV 304

Query: 380 GARTCDYLEML 390
           GA    + + L
Sbjct: 305 GAGVKAHFKRL 315


>gi|2497856|sp|Q59202|MDH_BACIS RecName: Full=Malate dehydrogenase
 gi|963019|emb|CAA62129.1| malate dehydrogenase [Bacillus israeli]
          Length = 312

 Score = 37.7 bits (86), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 53/114 (46%), Gaps = 8/114 (7%)

Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI----SGLVEKYRPDAIII 374
           I+G    EE  D+ +VV+  G  I      S+ DL  TN K+    +  V KY P++III
Sbjct: 64  IIGTSNYEETADSDIVVITAG--IARKPGMSRDDLVQTNQKVMKSVTKEVVKYSPNSIII 121

Query: 375 DANN-TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRN-RRTELHVKMADWLEF 426
              N   A T    +  G+  +RV+GQ   +D    R     EL++ + D   F
Sbjct: 122 VLTNPVDAMTYTVYKESGFPKHRVIGQSGVLDTARFRTFVAQELNLSVKDITGF 175


>gi|40737892|gb|AAR89439.1| replication associated protein C1 [Tomato yellow leaf curl Mali
           virus]
          Length = 359

 Score = 37.7 bits (86), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 37/150 (24%), Positives = 59/150 (39%), Gaps = 23/150 (15%)

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF-PQQDIDSFIPLNIIEEA 305
           DW  FQID R+  G   S ++   A     S    + V  +  P+  +  F  LN   + 
Sbjct: 112 DWGEFQIDGRSARGGQQSANDAYAAALNSGSKSEALRVIKELAPKDYVLQFHNLNSNLDR 171

Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365
           + +EP P PY    +     +              V E L  W       + N +S    
Sbjct: 172 IFQEP-PAPYISPFLSSSFNQ--------------VPEELEVW------VSENVMSSAAR 210

Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVY 395
            +RPD+I+I+ ++   +T  +   LG H Y
Sbjct: 211 PWRPDSIVIEGDSRTGKTM-WARSLGPHNY 239


>gi|219965987|emb|CAR82110.1| replication associated protein (Rep) [Tomato yellow leaf curl Mali
           virus]
          Length = 359

 Score = 37.7 bits (86), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 37/150 (24%), Positives = 59/150 (39%), Gaps = 23/150 (15%)

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF-PQQDIDSFIPLNIIEEA 305
           DW  FQID R+  G   S ++   A     S    + V  +  P+  +  F  LN   + 
Sbjct: 112 DWGEFQIDGRSARGGQQSANDAYAAAINAGSKSEALRVIRELAPKDYVLQFHNLNSNLDR 171

Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365
           + +EP P PY    +     +              V E L  W       + N +S    
Sbjct: 172 IFQEP-PAPYISPFLSSSFNQ--------------VPEELEIW------VSENVMSSAAR 210

Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVY 395
            +RPD+I+I+ ++   +T  +   LG H Y
Sbjct: 211 PWRPDSIVIEGDSRTGKTM-WARSLGPHNY 239


>gi|219965994|emb|CAR82116.1| replication associated protein (Rep) [Tomato yellow leaf curl Mali
           virus]
          Length = 359

 Score = 37.4 bits (85), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 37/150 (24%), Positives = 59/150 (39%), Gaps = 23/150 (15%)

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF-PQQDIDSFIPLNIIEEA 305
           DW  FQID R+  G   S ++   A     S    + V  +  P+  +  F  LN   + 
Sbjct: 112 DWGEFQIDGRSARGGQQSANDAYAAAINAGSKSEALRVIRELAPKDYVLQFHNLNSNLDR 171

Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365
           + +EP P PY    +     +              V E L  W       + N +S    
Sbjct: 172 IFQEP-PAPYISPFLSSSFNQ--------------VPEELEIW------VSENVMSSAAR 210

Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVY 395
            +RPD+I+I+ ++   +T  +   LG H Y
Sbjct: 211 PWRPDSIVIEGDSRTGKTM-WARSLGPHNY 239


>gi|260945527|ref|XP_002617061.1| hypothetical protein CLUG_02505 [Clavispora lusitaniae ATCC 42720]
 gi|238848915|gb|EEQ38379.1| hypothetical protein CLUG_02505 [Clavispora lusitaniae ATCC 42720]
          Length = 348

 Score = 37.4 bits (85), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 25/84 (29%), Positives = 37/84 (44%), Gaps = 3/84 (3%)

Query: 382 RTCDYLEMLGYHVYRVLGQKRA---VDLEFCRNRRTELHVKMADWLEFASLINHSGLIQN 438
           RT +YLE  G  V       RA   V   FCR            W E A++++HS  +  
Sbjct: 190 RTMEYLETQGVLVSTFNDDGRANIEVPSFFCRESGVRSPYSFTSWKEIAAVVHHSNNLMQ 249

Query: 439 LKSLKSFIVPNTGELAIESKRVKG 462
           L+S     +P   E+A+ S+ + G
Sbjct: 250 LQSGNLLCIPPPAEIALSSELMSG 273


Searching..................................................done


Results from round 2




>gi|254781215|ref|YP_003065628.1| putative phage terminase, large subunit [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|254040892|gb|ACT57688.1| putative phage terminase, large subunit [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|317120680|gb|ADV02503.1| putative phage terminase large subunit [Liberibacter phage SC1]
 gi|317120824|gb|ADV02645.1| putative phage terminase large subunit [Candidatus Liberibacter
           asiaticus]
          Length = 511

 Score =  727 bits (1876), Expect = 0.0,   Method: Composition-based stats.
 Identities = 511/511 (100%), Positives = 511/511 (100%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60
           MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME
Sbjct: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60

Query: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
           VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL
Sbjct: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120

Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180
           KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER
Sbjct: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180

Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240
           PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI
Sbjct: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240

Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
           FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN
Sbjct: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300

Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360
           IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI
Sbjct: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420
           SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM
Sbjct: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420

Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480
           ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP
Sbjct: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480

Query: 481 PRSDMDFGRCPSYQYEGVDLLIERRFEYDSR 511
           PRSDMDFGRCPSYQYEGVDLLIERRFEYDSR
Sbjct: 481 PRSDMDFGRCPSYQYEGVDLLIERRFEYDSR 511


>gi|315122902|ref|YP_004063391.1| putative phage terminase, large subunit [Candidatus Liberibacter
           solanacearum CLso-ZC1]
 gi|313496304|gb|ADR52903.1| putative phage terminase, large subunit [Candidatus Liberibacter
           solanacearum CLso-ZC1]
          Length = 509

 Score =  647 bits (1669), Expect = 0.0,   Method: Composition-based stats.
 Identities = 373/508 (73%), Positives = 428/508 (84%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60
           M+RELPT  E EQ+L +LM+SD+IKLSF+NFVL  FPW E  T L  FS PR WQL+FME
Sbjct: 1   MTRELPTKIEHEQELMELMFSDDIKLSFTNFVLRLFPWSEANTSLANFSRPRRWQLDFME 60

Query: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
            VD  CL +V+NP+P++FKGA+SAGRGIGKTTLNAW++LWL+STRPG+S++CLANSETQL
Sbjct: 61  AVDTDCLFNVDNPDPKIFKGAVSAGRGIGKTTLNAWMMLWLISTRPGMSILCLANSETQL 120

Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180
           K+TLWAEVSKWLS+LPNKHWFEMQSLSLHPA WY++ L  + GIDSKHY+  CRTYSEER
Sbjct: 121 KSTLWAEVSKWLSMLPNKHWFEMQSLSLHPAVWYAEALEKNFGIDSKHYTITCRTYSEER 180

Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240
           PDTFVGHHNTYGMAI NDEASGTPDVIN  ILGF TE NANRFW+MTSNPRRL+G FY+I
Sbjct: 181 PDTFVGHHNTYGMAIFNDEASGTPDVINTSILGFFTENNANRFWVMTSNPRRLNGWFYDI 240

Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
           FN PL+DW+RFQIDTRTVEGIDP+FHE IIARYGLDSDVTRVEV GQFPQQDI+SFIP  
Sbjct: 241 FNVPLEDWQRFQIDTRTVEGIDPNFHENIIARYGLDSDVTRVEVLGQFPQQDINSFIPFY 300

Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360
            IEEALNREP  DPYAPL+MGCDIA EGGDNTVVVLRRG  IEH+FDWS   +  ++ KI
Sbjct: 301 RIEEALNREPIKDPYAPLVMGCDIAGEGGDNTVVVLRRGTNIEHIFDWSGLAVNVSSRKI 360

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420
             L+ KY+PDA+++DAN  G +T  YL   GY V+   GQ RA D E  RNRRTELHVKM
Sbjct: 361 EELINKYKPDAVVVDANGIGVQTYYYLADEGYSVHPEKGQNRADDHESYRNRRTELHVKM 420

Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480
           A+WLE AS+ +HSGLIQNLKSL+SFI PNTG+LA+ESKRVKGA STDYSD L YTFA +P
Sbjct: 421 AEWLELASIPHHSGLIQNLKSLESFIEPNTGKLALESKRVKGAVSTDYSDALAYTFAVSP 480

Query: 481 PRSDMDFGRCPSYQYEGVDLLIERRFEY 508
            RSDM+FGRC SYQYE  +LL++RRF Y
Sbjct: 481 ARSDMNFGRCRSYQYEADELLVDRRFSY 508


>gi|315121940|ref|YP_004062429.1| putative phage terminase, large subunit [Candidatus Liberibacter
           solanacearum CLso-ZC1]
 gi|313495342|gb|ADR51941.1| putative phage terminase, large subunit [Candidatus Liberibacter
           solanacearum CLso-ZC1]
          Length = 509

 Score =  644 bits (1662), Expect = 0.0,   Method: Composition-based stats.
 Identities = 376/508 (74%), Positives = 428/508 (84%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60
           M+RELPT  E EQ+L +LM+SD+IKLSF+NFVL  FPW E  T L  FS PR WQL+FME
Sbjct: 1   MTRELPTKIEHEQELMELMFSDDIKLSFTNFVLRLFPWSEANTSLANFSRPRRWQLDFME 60

Query: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
            VD  CL +V+NP+P++FKGA+SAGRGIGKTTLNAW++LWL+STRPG+S++CLANSETQL
Sbjct: 61  AVDTDCLFNVDNPDPKIFKGAVSAGRGIGKTTLNAWMMLWLISTRPGMSILCLANSETQL 120

Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180
           K+TLWAEVSKWLS+LPNKHWFEMQSLSLHPA WY++ L  + GIDSKHY+  CRTYSEER
Sbjct: 121 KSTLWAEVSKWLSMLPNKHWFEMQSLSLHPAVWYAEALEKNFGIDSKHYTITCRTYSEER 180

Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240
           PDTFVGHHNTYGMAI NDEASGTPDVIN  ILGF TE NANRFW+MTSNPRRL G FY+I
Sbjct: 181 PDTFVGHHNTYGMAIFNDEASGTPDVINTSILGFFTENNANRFWVMTSNPRRLKGWFYDI 240

Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
           FN PL+DW+RFQIDTRTVEGIDPSFHEGII+RYGLDSDVTRVEV GQFPQQDI+SFIP  
Sbjct: 241 FNVPLEDWQRFQIDTRTVEGIDPSFHEGIISRYGLDSDVTRVEVLGQFPQQDINSFIPFY 300

Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360
            IEEALNREP  DPYAPLIMGCDIA EGGDNTVVVLRRG  IEH+FDWS   +  ++ KI
Sbjct: 301 RIEEALNREPIKDPYAPLIMGCDIAGEGGDNTVVVLRRGTNIEHIFDWSGLAVNASSRKI 360

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420
             L+ KY+PDA+++DAN  G +T  YL   GY V+   GQ RA D E  RNRRTELHVKM
Sbjct: 361 EELINKYKPDAVVVDANGIGVQTYYYLADEGYSVHAEKGQNRADDHESYRNRRTELHVKM 420

Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480
           A+WLE AS+ NHSGLIQNLKSL+SFI PNTG+LA+ESKRVKGA STDYSD L YTFA +P
Sbjct: 421 AEWLELASIPNHSGLIQNLKSLESFIEPNTGKLALESKRVKGAVSTDYSDALAYTFAVSP 480

Query: 481 PRSDMDFGRCPSYQYEGVDLLIERRFEY 508
            RSDM+FGRC SYQYE  +LL++RRF Y
Sbjct: 481 ARSDMNFGRCRSYQYEADELLVDRRFSY 508


>gi|317120722|gb|ADV02544.1| putative phage terminase large subunit [Liberibacter phage SC2]
 gi|317120783|gb|ADV02604.1| putative phage terminase large subunit [Candidatus Liberibacter
           asiaticus]
          Length = 516

 Score =  621 bits (1601), Expect = e-176,   Method: Composition-based stats.
 Identities = 392/507 (77%), Positives = 414/507 (81%), Gaps = 9/507 (1%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60
           MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME
Sbjct: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60

Query: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
           VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL
Sbjct: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120

Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180
           KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER
Sbjct: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180

Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240
           PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI
Sbjct: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240

Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
           FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP  
Sbjct: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPQQ 300

Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360
            I EAL R   PDPYAPLIMGCDIA EG D TVVVLRRG +IE +FDWS   +  TN KI
Sbjct: 301 YIVEALERVAIPDPYAPLIMGCDIAGEGEDKTVVVLRRGNIIERIFDWSGELIEVTNRKI 360

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGY-HVYRVLGQKRAVDLEFCRNRRTELHVK 419
           S L+ +Y PDAI+ID N  G     YL  + +  V  +LGQ+R+ + E   N R EL+  
Sbjct: 361 SSLINRYNPDAIVIDGNGIGGTVVSYLLNMHHISVEVILGQRRSTEPEQYHNLRAELYDL 420

Query: 420 MADWLEFASLINHS--GLIQNLKSLKSFIVPNTGELAIESKRVK----GAKSTDYSDGLM 473
           M   +     +      LI  LKS+KS I    G L IE KR      G +S D+ D L 
Sbjct: 421 MRSAITGGLQLPDDCPDLINELKSIKS-ISDTLGRLLIEKKRQGRSEFGVRSPDFVDALC 479

Query: 474 YTFAENPPRSDMDFGRCPS-YQYEGVD 499
           YTFA +PPR D    +     +YE +D
Sbjct: 480 YTFAVDPPRKDNPLYQGQDISEYEALD 506


>gi|227355862|ref|ZP_03840255.1| phage terminase, large subunit [Proteus mirabilis ATCC 29906]
 gi|227164181|gb|EEI49078.1| phage terminase, large subunit [Proteus mirabilis ATCC 29906]
          Length = 494

 Score =  495 bits (1275), Expect = e-138,   Method: Composition-based stats.
 Identities = 138/495 (27%), Positives = 217/495 (43%), Gaps = 25/495 (5%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60
           MS  L  +PE EQ + D+       L    +  + FPWGE G  LE ++ PR WQ E + 
Sbjct: 1   MSEALQKSPE-EQLIEDIASFTHDPL---GYAYYAFPWGEAGGELEEYNGPRQWQAEALN 56

Query: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
            +  H  N      P +   A ++G GIGK+   + ++ W M T     V+  AN+E QL
Sbjct: 57  EIGEHLRNPKTRHQPLLL--ARASGHGIGKSAFISMIIKWGMDTCEDCKVVVTANTENQL 114

Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180
           +T  W E++KW  L    +WF     +++                +  +      +SE  
Sbjct: 115 RTKTWPEIAKWQRLSLTNNWFTCTKTAIYSND----------PNHANAWRADAVPWSENN 164

Query: 181 PDTFVGHHNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYE 239
            + F G HN    + ++ DEAS   D++     G LT+      WI   NP R +G+F E
Sbjct: 165 TEAFAGLHNKGKRIILVFDEASNIADLVWEVAEGALTDEGTEIIWIAFGNPTRNTGRFRE 224

Query: 240 IFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPL 299
            F K    W   QID+RTVEG +    +     YG DSD  +V V G FP      FIP 
Sbjct: 225 CFRKFKHRWNTKQIDSRTVEGSNKEQIKNWEEDYGEDSDFFKVRVRGVFPSASELQFIPT 284

Query: 300 NIIEEALNR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD-WSKTDLRTT 356
            + +EA+ R        +AP+I+G D A  G D+ V+ LR+G   + L+  +  TD    
Sbjct: 285 GLTDEAMKRIVTQAEVAHAPVIIGVDPAYSGIDDAVIYLRQGLFSKCLWTGFKTTDDVVM 344

Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTEL 416
             +I+   ++Y+ DA+ ID    G          G     V     + D +   N+R E+
Sbjct: 345 AKRIADFEDQYKADAVHID-FGYGTGIHSIGTSWGRVWRLVKFGGASTDPQML-NKRGEM 402

Query: 417 HVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLM 473
           +  +  WL+    I+      +L   +  +     ++ +E K   + +  +S    D L 
Sbjct: 403 YNSVKTWLKIGGAIDDQETADDLSCGEYKVRVIDSKIVLEDKTEIKKRLGRSPGKGDALA 462

Query: 474 YTFAENPPRSDMDFG 488
            TFA    + D ++ 
Sbjct: 463 LTFAYPVTKIDRNYS 477


>gi|268589373|ref|ZP_06123594.1| conserved hypothetical protein [Providencia rettgeri DSM 1131]
 gi|291315400|gb|EFE55853.1| conserved hypothetical protein [Providencia rettgeri DSM 1131]
          Length = 493

 Score =  492 bits (1266), Expect = e-137,   Method: Composition-based stats.
 Identities = 147/486 (30%), Positives = 226/486 (46%), Gaps = 26/486 (5%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           +PE EQ + D+       LS   + L+ FPWGE GT LE  + PR WQ E +  +  H  
Sbjct: 6   SPE-EQLINDIGMFTHDPLS---YALYAFPWGEAGTELENANGPRQWQAEALNEIGEHLR 61

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
           N      P   + A ++G GIGK+   + ++ W M T     V+  AN+E QL+T  W E
Sbjct: 62  NPETRHQP--LQLARASGHGIGKSAFISMIIKWGMDTCEDCKVVVTANTENQLRTKTWPE 119

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           ++KW  L   K WF     +++                +  +      +SE   + F G 
Sbjct: 120 IAKWQRLSITKDWFTYTKTAIYSND----------PNHANAWRADAVPWSENNTEAFAGL 169

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + +I DEAS   D++     G LT+ N    WI   NP R +G+F E F K   
Sbjct: 170 HNQGKRIILIFDEASNIADLVWEVAEGALTDENTEIIWIAFGNPTRNTGRFRECFRKFKH 229

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            WK  QID+RTVEG +    E  I  YG+D D  +V V G FP      FIP  + + A+
Sbjct: 230 RWKTKQIDSRTVEGTNKEQIEKWIQDYGVDDDFVKVRVRGIFPSTSEKQFIPTGLTDAAM 289

Query: 307 NREPCPDP--YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT-DLRTTNNKISGL 363
            R        +AP+I+G D A  G D+ V+ LR+G   + L+  SKT D      +I+  
Sbjct: 290 KRTVTQAEVSHAPIIIGVDPAYSGDDDAVIYLRQGLHSKCLWTGSKTIDDVIMAKRIADF 349

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            ++Y  DA+ ID    G          G +   V     + D +  RN+R E++  +  W
Sbjct: 350 EDQYGADAVHID-FGYGTGIQSVGMNWGRNWQLVQFNGASTDPQM-RNKRGEMYNNVKSW 407

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
           L+    I+   + ++L + + + V  +G++ +ESK   + +  +S    D L  TFA   
Sbjct: 408 LKIGGAIDDQEVAEDLSTPE-YKVELSGKILLESKDDIKKRIGRSPGKGDALALTFAYPV 466

Query: 481 PRSDMD 486
            + + +
Sbjct: 467 TKKERN 472


>gi|212710820|ref|ZP_03318948.1| hypothetical protein PROVALCAL_01888 [Providencia alcalifaciens DSM
           30120]
 gi|212686517|gb|EEB46045.1| hypothetical protein PROVALCAL_01888 [Providencia alcalifaciens DSM
           30120]
          Length = 493

 Score =  491 bits (1263), Expect = e-136,   Method: Composition-based stats.
 Identities = 144/492 (29%), Positives = 223/492 (45%), Gaps = 28/492 (5%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60
           M   +      EQ + D+       LS   + L+ FPWGE GT LE  S PR WQ E + 
Sbjct: 1   MIETMSPE---EQLINDIGMFTHDPLS---YALYAFPWGEAGTELENASGPRQWQAEALN 54

Query: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
            +  H  N      P   + A ++G GIGK+   + ++ W M T     V+  AN+E QL
Sbjct: 55  EIGEHLRNPETRHQP--LQLARASGHGIGKSAFISMIIKWGMDTCEDCKVVVTANTENQL 112

Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180
           +T  W E++KW  L   K WF     +++                +  +      +SE  
Sbjct: 113 RTKTWPEIAKWQRLSITKDWFTCTKTAIYSND----------PNHANAWRADAVPWSENN 162

Query: 181 PDTFVGHHNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYE 239
            + F G HN    + ++ DEAS   D++     G LT+ N    WI   NP R +G+F E
Sbjct: 163 TEAFAGLHNQGKRIILVFDEASNIADLVWEVAEGALTDENTEIIWIAFGNPTRNTGRFRE 222

Query: 240 IFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPL 299
            F K    WK  QID+RTVEG +    E  I  YG+D D  +V V G FP      FIP 
Sbjct: 223 CFRKFKHRWKTKQIDSRTVEGTNKEQIEKWIQDYGVDDDFVKVRVRGIFPSTSEKQFIPT 282

Query: 300 NIIEEALNREPCPDP--YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT-DLRTT 356
            + + A+ R        +AP+I+G D A  G D+ V+ LR+G   + L+  SKT D    
Sbjct: 283 GLTDAAMKRTVTQAEVSHAPIILGVDPAYSGDDDAVIYLRQGLHSKCLWTGSKTIDDVIM 342

Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTEL 416
             +I+   ++Y  DA+ ID    G          G +   V     + D +  +N+R E+
Sbjct: 343 AKRIADYEDQYGADAVHID-FGYGTGIQSVGMNWGRNWQLVSFNGASTDPQM-QNKRGEM 400

Query: 417 HVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLM 473
           +  +  WL+    I+   +  +L + + + V  +G++ +E K   + +  +S +  D L 
Sbjct: 401 YNNVKSWLKIGGAIDDQEVADDLSTPE-YKVQLSGKILLEKKEDIKKRIGRSPNKGDALA 459

Query: 474 YTFAENPPRSDM 485
            TFA    + + 
Sbjct: 460 LTFAYPVTKKER 471


>gi|323156136|gb|EFZ42295.1| terminase large subunit [Escherichia coli EPECa14]
          Length = 491

 Score =  483 bits (1244), Expect = e-134,   Method: Composition-based stats.
 Identities = 142/493 (28%), Positives = 224/493 (45%), Gaps = 26/493 (5%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           +PE EQ + D+       L    + L+ FPWGE+GT L   + PR WQ +    +  H  
Sbjct: 7   SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
           N      P +   A+++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E
Sbjct: 63  NPATRYQPLML--ALASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           + KW +L   K WF   + +++      D          K +      +SE   + F G 
Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSNDLGHD----------KRWRADAIPWSEHNTEAFAGL 170

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K   
Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            WK  QID+RTVEG +    +  +  YG DSD  ++ V G FP      FIP  + +EA+
Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290

Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363
            R        YAP+I+G D A  G D+ V+ LR+G   + L+  +K TD      +I+  
Sbjct: 291 KRVVTAAQVAYAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            ++Y+ DA+ ID    G       +  G     V     + D +   N+R E+      W
Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
           L    +++      +L + + + V   G++ IE K   + +  +S    D L+ TFA   
Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467

Query: 481 PRSDMDFGRCPSY 493
            +     G+    
Sbjct: 468 SKRLRLPGQQNQQ 480


>gi|324008564|gb|EGB77783.1| hypothetical protein HMPREF9532_01752 [Escherichia coli MS 57-2]
          Length = 491

 Score =  483 bits (1244), Expect = e-134,   Method: Composition-based stats.
 Identities = 142/498 (28%), Positives = 226/498 (45%), Gaps = 27/498 (5%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           +PE EQ + D+       L    + L+ FPWGE+GT L   + PR WQ +    +  H  
Sbjct: 7   SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
           N      P +   A ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E
Sbjct: 63  NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           + KW +L   K WF   + +++      D          K +      +SE   + F G 
Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSNDLGHD----------KRWRADAIPWSEHNTEAFAGL 170

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K   
Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            WK  QID+RTVEG +    +  +  YG DSD  ++ V G FP      FIP  + +EA+
Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290

Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363
            R        +AP+I+G D A  G D+ V+ LR+G   + L+  +K TD      +I+  
Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            ++Y+ DA+ ID    G       +  G     V     + D +   N+R E+      W
Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
           L    +++      +L + + + V   G++ IE K   + +  +S    D L+ TFA   
Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467

Query: 481 PRSDMDFGRCPSYQYEGV 498
            +  ++     S Q   +
Sbjct: 468 SKR-INIPGQQSQQGRAI 484


>gi|327252187|gb|EGE63859.1| terminase large subunit [Escherichia coli STEC_7v]
          Length = 491

 Score =  482 bits (1240), Expect = e-134,   Method: Composition-based stats.
 Identities = 142/493 (28%), Positives = 224/493 (45%), Gaps = 26/493 (5%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           +PE EQ + D+       L    + L+ FPWGE+GT L   + PR WQ +    +  H  
Sbjct: 7   SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
           N      P +   A ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E
Sbjct: 63  NPATRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           + KW +L   K WF   + +++      D  H       K +      +SE   + F G 
Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K   
Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            WK  QID+RTVEG +    +  +  YG DSD  ++ V G FP      FIP  + +EA+
Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290

Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363
            R        +AP+I+G D A  G D+ V+ LR+G   + L+  +K TD      +I+  
Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            ++Y+ DA+ ID    G       +  G     V     + D +   N+R E+      W
Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
           L    +++      +L + + + V   G++ IE K   + +  +S    D L+ TFA   
Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467

Query: 481 PRSDMDFGRCPSY 493
            +     G+    
Sbjct: 468 SKRLRIPGQQNQQ 480


>gi|332344357|gb|AEE57691.1| terminase, large subunit [Escherichia coli UMNK88]
          Length = 491

 Score =  481 bits (1239), Expect = e-134,   Method: Composition-based stats.
 Identities = 142/493 (28%), Positives = 224/493 (45%), Gaps = 26/493 (5%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           +PE EQ + D+       L    + L+ FPWGE+GT L   + PR WQ +    +  H  
Sbjct: 7   SPE-EQLVEDIASFTYDPL---GYALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQ 62

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
           N      P +   A ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E
Sbjct: 63  NPATRHQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           + KW +L   K WF   + +++      D  H       K +      +SE   + F G 
Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K   
Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            WK  QID+RTVEG +    +  +  YG DSD  ++ V G FP      FIP  + +EA+
Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290

Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363
            R        +AP+I+G D A  G D+ V+ LR+G   + L+  +K TD      +I+  
Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            ++Y+ DA+ ID    G       +  G     V     + D +   N+R E+      W
Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
           L    +++      +L + + + V   G++ IE K   + +  +S    D L+ TFA   
Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467

Query: 481 PRSDMDFGRCPSY 493
            +     G+    
Sbjct: 468 SKRLRLPGQQNQQ 480


>gi|294491573|gb|ADE90329.1| putative phage terminase, large subunit [Escherichia coli IHE3034]
          Length = 491

 Score =  481 bits (1239), Expect = e-134,   Method: Composition-based stats.
 Identities = 142/493 (28%), Positives = 224/493 (45%), Gaps = 26/493 (5%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           +PE EQ + D+       L    + L+ FPWGE+GT L   + PR WQ +    +  H  
Sbjct: 7   SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
           N      P +   A ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E
Sbjct: 63  NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           + KW +L   K WF   + +++      D  H       K +      +SE   + F G 
Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K   
Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            WK  QID+RTVEG +    +  +  YG DSD  ++ V G FP      FIP  + +EA+
Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290

Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363
            R        +AP+I+G D A  G D+ V+ LR+G   + L+  +K TD      +I+  
Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            ++Y+ DA+ ID    G       +  G     V     + D +   N+R E+      W
Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
           L    +++      +L + + + V   G++ IE K   + +  +S    D L+ TFA   
Sbjct: 409 LRLGGMLDDQETADDLSTAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467

Query: 481 PRSDMDFGRCPSY 493
            +     G+    
Sbjct: 468 SKRLRIPGQQNQQ 480


>gi|330007152|ref|ZP_08305894.1| hypothetical protein HMPREF9538_03583 [Klebsiella sp. MS 92-3]
 gi|328535499|gb|EGF61959.1| hypothetical protein HMPREF9538_03583 [Klebsiella sp. MS 92-3]
          Length = 495

 Score =  481 bits (1238), Expect = e-133,   Method: Composition-based stats.
 Identities = 140/485 (28%), Positives = 218/485 (44%), Gaps = 25/485 (5%)

Query: 6   PTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAH 65
           P     EQ + D+       L    + L+ FPWGE GT L   S PR WQ +    +  H
Sbjct: 8   PEEQLKEQLIDDIASFTHDPL---GYALYAFPWGEDGTELAHASGPRQWQADAFREIGEH 64

Query: 66  CLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
             N      P +   + ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W
Sbjct: 65  LQNPATRHQPLM--ISRASGHGIGKSAFISMLINWAMSTCEDCKVVVTANTDNQLRTKTW 122

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+ KW +L   K WF   + +++      D  H       K +      +SE   + F 
Sbjct: 123 PEIIKWSNLAITKEWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFA 172

Query: 186 GHHNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP 244
           G HN    + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K 
Sbjct: 173 GLHNERKRIVVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKY 232

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
              WK  QID+RTVEG +    +  +  YG DSD  +V V G FP      FIP  + +E
Sbjct: 233 KHRWKCAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKVRVRGIFPDASELQFIPTGLTDE 292

Query: 305 ALNR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKIS 361
           A+ R        +AP I+G D A  G D+ V+ LR+G   + L+  +K TD      +I+
Sbjct: 293 AMKRVVTAAQVAHAPRIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIA 352

Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421
              ++Y+ DA+ ID    G       +  G     V     + D +   N+R E+     
Sbjct: 353 DFEDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASADPQML-NKRGEMFNACK 410

Query: 422 DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAE 478
            WL+    ++      +L + + + V   G++ +E K   + +  +S    D L+ TFA 
Sbjct: 411 TWLKLGGALDDQETADDLSAAE-YKVRVDGKIVMEPKEDIKERLGRSPGKGDALLLTFAY 469

Query: 479 NPPRS 483
              + 
Sbjct: 470 PVTKR 474


>gi|218700994|ref|YP_002408623.1| putative phage terminase, large subunit [Escherichia coli IAI39]
 gi|218370980|emb|CAR18807.1| putative phage terminase, large subunit [Escherichia coli IAI39]
          Length = 491

 Score =  481 bits (1237), Expect = e-133,   Method: Composition-based stats.
 Identities = 143/498 (28%), Positives = 227/498 (45%), Gaps = 27/498 (5%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           +PE EQ + D+       L    + L+ FPWGE+GT L   + PR WQ +    +  H  
Sbjct: 7   SPE-EQLIDDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
           N      P +   A ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E
Sbjct: 63  NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           + KW +L   K WF   + +++      D  H       K +      +SE   + F G 
Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K   
Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            WK  QID+RTVEG +    +  +  YG DSD  ++ V G FP      FIP  + +EA+
Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290

Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363
            R        +AP+I+G D A  G D+ V+ LR+G   + L+  +K TD      +I+  
Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            ++Y+ DA+ ID    G       +  G     V     + D +   N+R E+      W
Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
           L    +++      +L + + + V   G++ IE K   + +  +S    D L+ TFA   
Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467

Query: 481 PRSDMDFGRCPSYQYEGV 498
            +  ++     S Q   +
Sbjct: 468 SKR-INIPGQQSQQGRAI 484


>gi|309702815|emb|CBJ02146.1| putative terminase, large subunit [Escherichia coli ETEC H10407]
          Length = 493

 Score =  481 bits (1237), Expect = e-133,   Method: Composition-based stats.
 Identities = 137/498 (27%), Positives = 228/498 (45%), Gaps = 25/498 (5%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           +PE EQ + D+       L    + L+ FPWGE+GT L   + PR WQ +    +  H  
Sbjct: 7   SPE-EQLVEDIAGFTYDPL---GYALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQ 62

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
           N      P +   A ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E
Sbjct: 63  NPATRHQPIML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           + KW +L   K WF   + +++      D  H       K +      +SE   + F G 
Sbjct: 121 IIKWSNLAITKEWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K   
Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            WK  QID+RTVEG +    +  +  YG DSD  +V V G FP    + FIP  + + A+
Sbjct: 231 RWKCAQIDSRTVEGTNKEQLQKWVDDYGEDSDFVKVRVRGIFPDASENQFIPSGLTQPAV 290

Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK-ISGL 363
            R   P    +A +++G D + +G D  V+ LR+G   + L +W +T       K I+  
Sbjct: 291 GRVITPAQVQHAAVVLGVDPSHQGKDPAVIYLRQGLHCKKLGEWQRTTDDVLFAKVIADF 350

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            ++Y+ DA+ ID    G       +  G +   ++      D E   N+R E++    D 
Sbjct: 351 EDQYQADAVFID-YGYGTGLKSVGDNWGRNWTLIMFGSGTADPEM-GNKRGEMYKSARDA 408

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
           L+  + ++   L   L + +  +        ++ K   +    +S + +D  + T+A   
Sbjct: 409 LKLGAQLDSQELADELSAPEYKVRLKDSRKILQDKDEVKELLGRSPNNADAYVLTYAAPV 468

Query: 481 PRSDMDFGRCPSYQYEGV 498
            +   ++G+  S Q + +
Sbjct: 469 TKKQFNYGQQQSQQGKAL 486


>gi|298381721|ref|ZP_06991320.1| terminase large subunit protein [Escherichia coli FVEC1302]
 gi|301019339|ref|ZP_07183525.1| conserved hypothetical protein [Escherichia coli MS 196-1]
 gi|298279163|gb|EFI20677.1| terminase large subunit protein [Escherichia coli FVEC1302]
 gi|299882256|gb|EFI90467.1| conserved hypothetical protein [Escherichia coli MS 196-1]
 gi|323948690|gb|EGB44595.1| hypothetical protein ERKG_04913 [Escherichia coli H252]
          Length = 491

 Score =  481 bits (1237), Expect = e-133,   Method: Composition-based stats.
 Identities = 142/493 (28%), Positives = 224/493 (45%), Gaps = 26/493 (5%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           +PE EQ + D+       L    + L+ FPWGE+GT L   + PR WQ +    +  H  
Sbjct: 7   SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
           N      P +   A ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E
Sbjct: 63  NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           + KW +L   K WF   + +++      D  H       K +      +SE   + F G 
Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K   
Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            WK  QID+RTVEG +    +  +  YG DSD  ++ V G FP      FIP  + +EA+
Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290

Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363
            R        +AP+I+G D A  G D+ V+ LR+G   + L+  +K TD      +I+  
Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            ++Y+ DA+ ID    G       +  G     V     + D +   N+R E+      W
Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
           L    +++      +L + + + V   G++ IE K   + +  +S    D L+ TFA   
Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467

Query: 481 PRSDMDFGRCPSY 493
            +     G+    
Sbjct: 468 SKRLRIPGQQNQQ 480


>gi|300898423|ref|ZP_07116764.1| conserved hypothetical protein [Escherichia coli MS 198-1]
 gi|300357890|gb|EFJ73760.1| conserved hypothetical protein [Escherichia coli MS 198-1]
          Length = 491

 Score =  480 bits (1236), Expect = e-133,   Method: Composition-based stats.
 Identities = 142/493 (28%), Positives = 224/493 (45%), Gaps = 26/493 (5%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           +PE EQ + D+       L    + L+ FPWGE+GT L   + PR WQ +    +  H  
Sbjct: 7   SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQ 62

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
           N      P +   A ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E
Sbjct: 63  NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           + KW +L   K WF   + +++      D  H       K +      +SE   + F G 
Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K   
Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            WK  QID+RTVEG +    +  +  YG DSD  ++ V G FP      FIP  + +EA+
Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290

Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363
            R        +AP+I+G D A  G D+ V+ LR+G   + L+  +K TD      +I+  
Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            ++Y+ DA+ ID    G       +  G     V     + D +   N+R E+      W
Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
           L    +++      +L + + + V   G++ IE K   + +  +S    D L+ TFA   
Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467

Query: 481 PRSDMDFGRCPSY 493
            +     G+    
Sbjct: 468 SKRLRIPGQQNQQ 480


>gi|117624715|ref|YP_853628.1| putative phage terminase, large subunit [Escherichia coli APEC O1]
 gi|115513839|gb|ABJ01914.1| putative phage terminase, large subunit [Escherichia coli APEC O1]
          Length = 491

 Score =  480 bits (1235), Expect = e-133,   Method: Composition-based stats.
 Identities = 141/493 (28%), Positives = 224/493 (45%), Gaps = 26/493 (5%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           +PE EQ + D+       L    + L+ FPWGE+GT L   + PR WQ +    +  H  
Sbjct: 7   SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
           N      P +   A ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E
Sbjct: 63  NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           + KW +L   K WF   + +++      D  H       K +      +SE   + F G 
Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K   
Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            WK  QID+RTVEG +    +  +  YG DSD  ++ V G FP      FIP  + +EA+
Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290

Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363
            R        ++P+I+G D A  G D+ V+ LR+G   + L+  +K TD      +I+  
Sbjct: 291 KRVVTAAQVAHSPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            ++Y+ DA+ ID    G       +  G     V     + D +   N+R E+      W
Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
           L    +++      +L + + + V   G++ IE K   + +  +S    D L+ TFA   
Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467

Query: 481 PRSDMDFGRCPSY 493
            +     G+    
Sbjct: 468 SKRLRIPGQQNQQ 480


>gi|89152423|ref|YP_512256.1| putative terminase large subunit [Escherichia phage phiV10]
 gi|74055446|gb|AAZ95895.1| putative terminase large subunit [Escherichia phage phiV10]
          Length = 491

 Score =  480 bits (1235), Expect = e-133,   Method: Composition-based stats.
 Identities = 141/493 (28%), Positives = 223/493 (45%), Gaps = 26/493 (5%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           +PE EQ + D+       L    + L+ FPWGE+GT L   + PR WQ +    +  H  
Sbjct: 7   SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
           N      P +   A ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E
Sbjct: 63  NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           + KW +L   K WF   + +++      D  H       K +      +SE   + F G 
Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K   
Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            WK  QID+RTVEG +    +  +  YG  SD  ++ V G FP      FIP  + +EA+
Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEGSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290

Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363
            R        +AP+I+G D A  G D+ V+ LR+G   + L+  +K TD      +I+  
Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            ++Y+ DA+ ID    G       +  G     V     + D +   N+R E+      W
Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
           L    +++      +L + + + V   G++ IE K   + +  +S    D L+ TFA   
Sbjct: 409 LRLGGMLDDQETADDLSTAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467

Query: 481 PRSDMDFGRCPSY 493
            +     G+    
Sbjct: 468 SKRLRIPGQQNQQ 480


>gi|331648179|ref|ZP_08349269.1| conserved hypothetical protein [Escherichia coli M605]
 gi|331043039|gb|EGI15179.1| conserved hypothetical protein [Escherichia coli M605]
          Length = 491

 Score =  479 bits (1233), Expect = e-133,   Method: Composition-based stats.
 Identities = 142/493 (28%), Positives = 224/493 (45%), Gaps = 26/493 (5%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           +PE EQ + D+       L    + L+ FPWGE+GT L   + PR WQ +    +  H  
Sbjct: 7   SPE-EQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQ 62

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
           N      P +   A ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E
Sbjct: 63  NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           + KW +L   K WF   + +++      D  H       K +      +SE   + F G 
Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K   
Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            WK  QID+RTVEG +    +  +  YG DSD  ++ V G FP      FIP  + +EA+
Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290

Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363
            R        +AP+I+G D A  G D+ V+ LR+G   + L+  +K TD      +I+  
Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            ++Y+ DA+ ID    G       +  G     V     + D +   N+R E+      W
Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNACKIW 408

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
           L    +++      +L + + + V   G++ IE K   + +  +S    D L+ TFA   
Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467

Query: 481 PRSDMDFGRCPSY 493
            +     G+    
Sbjct: 468 SKRLRIPGQQNQQ 480


>gi|30387381|ref|NP_848210.1| terminase large subunit [Enterobacteria phage epsilon15]
 gi|30266036|gb|AAO06065.1| terminase large subunit [Salmonella phage epsilon15]
          Length = 491

 Score =  479 bits (1233), Expect = e-133,   Method: Composition-based stats.
 Identities = 141/494 (28%), Positives = 223/494 (45%), Gaps = 26/494 (5%)

Query: 12  EQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVN 71
           EQ + D+       L    + L+ FPWGE GT L   + PR WQ +    +  H  N   
Sbjct: 10  EQLVEDIASFTYDPL---GYALYAFPWGEDGTELAHATGPRKWQADAFREIRDHLQNPAT 66

Query: 72  NPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKW 131
              P +   A ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E+ KW
Sbjct: 67  RHQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKW 124

Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191
            +L   K WF   + +++      D  H       K +      +SE   + F G HN  
Sbjct: 125 SNLAITKEWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGLHNER 174

Query: 192 G-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250
             + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K    WK 
Sbjct: 175 KRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKC 234

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-- 308
            QID+RTVEG +    +  +  YG +SD  +V V G FP      FIP  + +EA+ R  
Sbjct: 235 AQIDSRTVEGTNKQQLQKWVDDYGEESDFVKVRVRGIFPDASELQFIPTGLTDEAMKRVV 294

Query: 309 EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKY 367
                 +AP+I+G D A  G D+ V+ LR+G   + L+  +K TD      +I+   ++Y
Sbjct: 295 TAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQY 354

Query: 368 RPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFA 427
           + DA+ ID    G       +  G     +     + D +   N+R E+      WL+  
Sbjct: 355 QADAVFID-FGYGTGLKSIGDGWGRTWQLIPFGGGSTDPQML-NKRGEMFNSCKTWLKLG 412

Query: 428 SLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPRSD 484
             ++      +L + + + V   G++ IE K   + +  +S    D L+ TFA    +  
Sbjct: 413 GALDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPVTKH- 470

Query: 485 MDFGRCPSYQYEGV 498
           +      S Q + V
Sbjct: 471 LRIPGQESQQGKAV 484


>gi|301046412|ref|ZP_07193572.1| conserved hypothetical protein [Escherichia coli MS 185-1]
 gi|300301638|gb|EFJ58023.1| conserved hypothetical protein [Escherichia coli MS 185-1]
          Length = 491

 Score =  479 bits (1233), Expect = e-133,   Method: Composition-based stats.
 Identities = 142/493 (28%), Positives = 223/493 (45%), Gaps = 26/493 (5%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           +PE EQ + D+       L    + L+ FPWGE GT L   + PR WQ +    +  H  
Sbjct: 7   SPE-EQLVEDIASFTYDPL---GYALYAFPWGEDGTELAHATGPRQWQADAFREIRDHLQ 62

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
           N      P +   A ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E
Sbjct: 63  NPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           + KW +L   K WF   + +++      D  H       K +      +SE   + F G 
Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K   
Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            WK  QID+RTVEG +    +  +  YG DSD  ++ V G FP      FIP  + +EA+
Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290

Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363
            R        +AP+I+G D A  G D+ V+ LR+G   + L+  +K TD      +I+  
Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            ++Y+ DA+ ID    G       +  G     V     + D +   N+R E+      W
Sbjct: 351 EDQYQADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFNSCKTW 408

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
           L    +++      +L + + + V   G++ IE K   + +  +S    D L+ TFA   
Sbjct: 409 LRLGGMLDDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV 467

Query: 481 PRSDMDFGRCPSY 493
            +     G+    
Sbjct: 468 SKRLRLPGQQNQQ 480


>gi|215487825|ref|YP_002330256.1| predicted terminase, large subunit [Escherichia coli O127:H6 str.
           E2348/69]
 gi|215265897|emb|CAS10306.1| predicted terminase, large subunit [Escherichia coli O127:H6 str.
           E2348/69]
          Length = 493

 Score =  479 bits (1232), Expect = e-133,   Method: Composition-based stats.
 Identities = 137/498 (27%), Positives = 226/498 (45%), Gaps = 25/498 (5%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           +PE EQ + D+       L    + L+ FPWGE GT L   + PR WQ +    +  H  
Sbjct: 7   SPE-EQLVEDIASFTYDPL---GYALYAFPWGEDGTELAHATGPRKWQADAFREIRDHLQ 62

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
           N      P +   A ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E
Sbjct: 63  NPATRHQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           + KW +L   K WF   + +++      D  H       K +      +SE   + F G 
Sbjct: 121 IIKWSNLAITKEWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K   
Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            WK  QID+RTVEG +    +  +  YG DSD  +V V G FP    + FIP  + + A+
Sbjct: 231 RWKCAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKVRVRGIFPDASENQFIPSGLTQPAV 290

Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK-ISGL 363
            R   P    +A +++G D + +G D  V+ LR+G   + L +W +T       K I+  
Sbjct: 291 GRVITPAQVQHAAVVLGVDPSHQGKDPAVIYLRQGLHCKKLGEWQRTTDDVLFAKIIADF 350

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            ++Y+ DA+ ID    G       +  G +   +       D E   N+R E++    D 
Sbjct: 351 EDQYQADAVFID-YGYGTGLKSVGDNWGRNWTLIQFGSGTADPEM-GNKRGEMYKSARDA 408

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
           L+  + ++   L   L + +  +        ++ K   +    +S + +D  + T+A   
Sbjct: 409 LKLGAQLDSQNLADELSAPEYKVRLKDSRKILQDKEEVKELLGRSPNDADAYVLTYAAPV 468

Query: 481 PRSDMDFGRCPSYQYEGV 498
            +   ++G+  S Q + +
Sbjct: 469 TKKQFNYGQQQSQQGKAL 486


>gi|262043569|ref|ZP_06016682.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
 gi|259039103|gb|EEW40261.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
          Length = 491

 Score =  478 bits (1230), Expect = e-132,   Method: Composition-based stats.
 Identities = 141/483 (29%), Positives = 219/483 (45%), Gaps = 26/483 (5%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           +PE EQ + D+       L    + L+ FPWGE GT L   + PR WQ +    +  H  
Sbjct: 7   SPE-EQLIDDIASFTHDPL---GYALYAFPWGEDGTELAHATGPRKWQADAFREIRDHLQ 62

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
           N      P +   A ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E
Sbjct: 63  NPATRHQPLML--ARASGHGIGKSAFISMLINWAMSTCEDCKVVVTANTDNQLRTKTWPE 120

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           + KW +L   K WF   + +++      D  H       K +      +SE   + F G 
Sbjct: 121 IIKWSNLAITKEWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGL 170

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K   
Sbjct: 171 HNERKRIVVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            WK  QID+RTVEG +    +  +  YG DSD  +V V G FP      FIP  + +EA+
Sbjct: 231 RWKCAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKVRVRGIFPDASELQFIPTGLTDEAM 290

Query: 307 NR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGL 363
            R        +AP I+G D A  G D+ V+ LR+G   + L+  +K TD      +I+  
Sbjct: 291 KRVVTAVQVAHAPRIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            ++Y  DA+ ID    G       +  G     V     + D +   N+R E+      W
Sbjct: 351 EDQYLADAVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASADPQML-NKRGEMFNACKTW 408

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
           L+    ++      +L + + + V   G++ +E K   + +  +S    D L+ TFA   
Sbjct: 409 LKLGGALDDQETADDLSAAE-YKVRVDGKIVMEPKEDIKERLGRSPGKGDALLLTFAYPV 467

Query: 481 PRS 483
            + 
Sbjct: 468 TKR 470


>gi|320175050|gb|EFW50163.1| terminase B protein, putative [Shigella dysenteriae CDC 74-1112]
          Length = 480

 Score =  476 bits (1225), Expect = e-132,   Method: Composition-based stats.
 Identities = 138/486 (28%), Positives = 220/486 (45%), Gaps = 25/486 (5%)

Query: 15  LFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPN 74
           + D+       L    + L+ FPWGE+GT L   + PR WQ +    +  H  N      
Sbjct: 2   IEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQ 58

Query: 75  PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSL 134
           P +   A ++G GIGK+   + L+ W MST     V+  AN++ QL+T  W E+ KW +L
Sbjct: 59  PLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNL 116

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG-M 193
              K WF   + +++      D  H       K +      +SE   + F G HN    +
Sbjct: 117 AITKDWFTCTATAMYSN----DPGH------DKRWRADAIPWSEHNTEAFAGLHNERKRI 166

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
            ++ DEAS   D++     G LT+ +    W+   NP R +G+F E F K    WK  QI
Sbjct: 167 IVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQI 226

Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPC 311
           D+RTVEG +    +  +  YG DSD  ++ V G FP      FIP  + +EA+ R     
Sbjct: 227 DSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAA 286

Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPD 370
              +AP+I+G D A  G D+ V+ LR+G   + L+  +K TD      +I+   ++Y+ D
Sbjct: 287 QVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQAD 346

Query: 371 AIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI 430
           A+ ID    G       +  G     V     + D +   N+R E+ +    WL    ++
Sbjct: 347 AVFID-FGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML-NKRGEMFISCKTWLRLGGML 404

Query: 431 NHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPRSDMDF 487
           +      +L + + + V   G++ IE K   + +  +S    D L+ TFA    +     
Sbjct: 405 DDQETADDLSAAE-YKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPVSKRLRIP 463

Query: 488 GRCPSY 493
           G+    
Sbjct: 464 GQQNQQ 469


>gi|304398406|ref|ZP_07380280.1| terminase, large subunit [Pantoea sp. aB]
 gi|304354272|gb|EFM18645.1| terminase, large subunit [Pantoea sp. aB]
          Length = 490

 Score =  474 bits (1220), Expect = e-131,   Method: Composition-based stats.
 Identities = 135/485 (27%), Positives = 215/485 (44%), Gaps = 24/485 (4%)

Query: 13  QKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNN 72
           Q + D+            + L+ FPWGE+GT L     PR WQ +  + + AH  N    
Sbjct: 10  QLIEDIGAFTHDPF---GYALYAFPWGEEGTDLAYSKGPRQWQEDAFKQIGAHLQNPDTR 66

Query: 73  PNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWL 132
             P +   A  +G GIGK+   + LV W M T     V+  AN+E QL+T  W E++KW 
Sbjct: 67  HQPLMIGRA--SGHGIGKSAFISMLVKWGMDTCEDCKVVVTANTENQLRTKTWPEIAKWQ 124

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
            L   + WF   + +++                +K +      +SE   + F G HN   
Sbjct: 125 RLSITQDWFTCTATAIYSND----------PSHAKSWRADAIPWSENNTEAFAGLHNERK 174

Query: 193 -MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF 251
            + +I DEAS   D++     G LT+ N    W+   NP R +G+F E F K    WK  
Sbjct: 175 RIILIFDEASNIADLVWEVAEGALTDENTEIIWVAFGNPTRNTGRFRECFRKLRHRWKTA 234

Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
           QID+R+VEG +    +  +  YG DSD  +V V G FP      FIP  + + A+ R   
Sbjct: 235 QIDSRSVEGTNKEQIQKWVDDYGEDSDFVKVRVRGLFPSASEAQFIPTGLTDAAVGRVIT 294

Query: 312 PDP--YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG-LVEKYR 368
           P    +A  ++G D A +GGD  V+ LR+G   + L ++ +T       KI     ++YR
Sbjct: 295 PGQVAHAATVIGVDPAHQGGDPAVIYLRQGLHTKKLGEYQRTTDDVLFAKIVASFEDEYR 354

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            DA+ ID    G       +  G +   +     + D +   N+R E++  +  WL+   
Sbjct: 355 ADAVFID-YGYGTGLKSVGDNWGRNWQLIQFGGGSTDPQM-ANKRGEMYNAVKTWLKDGG 412

Query: 429 LINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPRSDM 485
            ++   + + L + +  +      + +E K   + +  KS + +D L  TFA    +   
Sbjct: 413 QLDSQQVAEELSAAEYKVRLKDSRIVLEDKTSIKERLGKSPNDADALALTFAFPVVKKLH 472

Query: 486 DFGRC 490
             G  
Sbjct: 473 YVGSN 477


>gi|303328395|ref|ZP_07358832.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
 gi|302861389|gb|EFL84326.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3]
          Length = 500

 Score =  468 bits (1204), Expect = e-129,   Method: Composition-based stats.
 Identities = 144/465 (30%), Positives = 206/465 (44%), Gaps = 26/465 (5%)

Query: 28  FSNFVLHFFPWGEKGTPLEGF-SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGR 86
              FVL  FPWG  G  L  +   P  WQ E +  +      S       V + A+S+G 
Sbjct: 29  PLGFVLFAFPWG--GGALADYPDGPDVWQREILRGMGEQL--STGASAASVIREAVSSGH 84

Query: 87  GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSL 146
           G+GK+ L AW++LW MST      +  AN+E QLK   WAE++KW  L    +WF+  + 
Sbjct: 85  GVGKSALVAWIILWAMSTFSDTRGVVTANTENQLKGKTWAELAKWHRLCLCGYWFDCTAT 144

Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT-YGMAIINDEASGTPD 205
           +L                  K +      +SE   + F G HN    + +I DEAS  PD
Sbjct: 145 ALIST----------QAGHEKTWRVDMVAWSERNTEAFAGLHNKGRRVLLIFDEASAIPD 194

Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSF 265
            I     G LT+ +    W    NP R +G+F E F +    W   ++D+RT    D + 
Sbjct: 195 AIWEVSEGALTDADTEIIWCCFGNPTRNTGRFRECFGRYAHRWNTRRVDSRTAAMTDKNQ 254

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY--APLIMGCD 323
               +  YG DSD  RV V G+FP+     FI  +I+ EA  R   PD Y  AP I+G D
Sbjct: 255 LAQWVEDYGEDSDFVRVRVRGEFPRAGDRQFISSDIVHEARGRSLKPDQYSFAPRILGVD 314

Query: 324 IAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGART 383
           +A  G D +V+  R+G        +   D  T    ++    ++  D I +D    GA  
Sbjct: 315 VARSGSDQSVITRRQGLACLEQRKFRGLDTVTLAGIVAEECREWGADKIFVDGIGVGAGV 374

Query: 384 CDYLEM---LGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHS-GLIQNL 439
            D L     LG+ V   +    A+  E   NRR E+   M  WL     +     L + L
Sbjct: 375 VDALRQVYGLGHLVVDAVAGATALQPERFLNRRAEMWTAMRKWLAEGGAVPDDAELAEQL 434

Query: 440 KSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPP 481
             L+ + V  +G+L +ESK   + +G  S D +D L  TF    P
Sbjct: 435 CGLE-YAVTVSGKLKLESKDDMKARGLTSPDCADALALTFYAPVP 478


>gi|167032754|ref|YP_001667985.1| putative phage terminase large subunit [Pseudomonas putida GB-1]
 gi|166859242|gb|ABY97649.1| putative phage terminase, large subunit [Pseudomonas putida GB-1]
          Length = 499

 Score =  461 bits (1187), Expect = e-127,   Method: Composition-based stats.
 Identities = 143/491 (29%), Positives = 225/491 (45%), Gaps = 27/491 (5%)

Query: 8   NPETEQKLF-DLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHC 66
             + EQ+L  D+    +  L    +VL+ FPWGE G  L   + PR WQ E +E +    
Sbjct: 7   EIDYEQELANDIASFSDDPL---GYVLYAFPWGEAGGELANKTGPRKWQREVLESIGEQL 63

Query: 67  LNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA 126
                +   EV + A+++G GIGK+ L +W++ W + T      +  AN+E+QL+T  W 
Sbjct: 64  RAGAKDRG-EVIREAVASGHGIGKSALVSWVIKWALDTEVDTRGVVTANTESQLRTKTWP 122

Query: 127 EVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVG 186
           EV+KW  L    HWF++   +L       D  H       K++      +S+   + F G
Sbjct: 123 EVAKWNRLSITAHWFKLTGTALIST----DPDH------EKNWRIDAVPWSDTNTEAFAG 172

Query: 187 HHNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL 245
            HN    + +I DEAS   D++     G LT+ +    W    NP R SG+F E F K  
Sbjct: 173 LHNEGKRILLIFDEASAIADLVWEVAEGALTDADTEIIWAAFGNPTRNSGRFRECFTKFK 232

Query: 246 DDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305
             W+  Q+D+RTV+G + +     IA YG DSD  R+ V G FP+      IP + + EA
Sbjct: 233 HRWRHRQVDSRTVDGTNKTQIAKWIADYGEDSDFVRIRVRGMFPRASDLQLIPTDWVAEA 292

Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIE--HLFDWSKTDLRTTN---NKI 360
           + R+        L+ G DIA  G DN V+  RRG   +         ++ R T     K+
Sbjct: 293 MRRDGVYGLDDALVCGIDIARGGMDNNVIRFRRGMDAKSIKPIKIPGSETRNTTPFIAKV 352

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEML--GYHVYRVLGQKRAVDLEFCRNRRTELHV 418
             LV ++RPDA+ +D+   G    D L  L  G  +  V    +A D     N RT +  
Sbjct: 353 CTLVVEHRPDAVFVDSTGVGGPVADQLRRLLPGVMIIDVNFASQAPD-RHYANMRTYIWW 411

Query: 419 KMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYT 475
           +M + ++    I     ++   +   +   ++ ++A+E K   + +   S D  D L  T
Sbjct: 412 RMREAIKLGLAIESDTELETELTSPEYDHNSSDQIALEKKKDIKKRLGISPDDGDALALT 471

Query: 476 FAENPPRSDMD 486
           F     ++   
Sbjct: 472 FTMPVMKAQYQ 482


>gi|228911519|ref|ZP_04075310.1| hypothetical protein bthur0013_56490 [Bacillus thuringiensis IBL
           200]
 gi|228848128|gb|EEM92991.1| hypothetical protein bthur0013_56490 [Bacillus thuringiensis IBL
           200]
          Length = 459

 Score =  454 bits (1168), Expect = e-125,   Method: Composition-based stats.
 Identities = 132/494 (26%), Positives = 216/494 (43%), Gaps = 75/494 (15%)

Query: 14  KLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNP 73
           ++ D+ W D +  +F+  +L F+              P  WQ + +       ++   +P
Sbjct: 2   EIIDVYWDDPV--AFAEDMLGFY--------------PDEWQRKVL-------MDLAQSP 38

Query: 74  NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133
                K ++ +G+G+GKT L + +V+W +  RP   VIC A ++ QL T LWAE++KWL 
Sbjct: 39  -----KVSVRSGQGVGKTGLESVVVIWFLCCRPNPKVICTAPTKEQLFTVLWAEIAKWLE 93

Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193
               K+  +     ++                 + +    RT +  +P+   G H  Y M
Sbjct: 94  GSAVKNLLKWTKTRVYMIG------------SEERWFATARTAT--KPENMQGFHEDY-M 138

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
             + DEASG  D I   ILG L+   A     +  NP R SG FY+  N+  D +K  ++
Sbjct: 139 LFVCDEASGIADPIMEAILGTLS--GAENKLFLCGNPTRTSGVFYDSHNRDRDLYKIHKV 196

Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313
            +           E +  +YG  SDV RV V G+FP+ + D+FIPL I+E+A + +  P 
Sbjct: 197 SSLDSPRTSKDNIEVLKKKYGEGSDVWRVRVLGEFPKAEADAFIPLEIVEQAASCKVEPT 256

Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA-- 371
               L +G D+A  G D TV+  R G  +  L +  K D   T   +  L ++Y      
Sbjct: 257 -GETLDLGVDVARFGDDETVIAPRIGNKVFKLLNHYKQDTMETAGHVLKLAKEYMAKYKQ 315

Query: 372 -----IIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420
                I +D +  G    D L      E L + VY V+   + +D E   N   E    +
Sbjct: 316 LKRVDIKVDDSGVGGGVTDRLKEVIKSERLPFKVYPVVNNGKPLDDEHYDNAGAEGWAVV 375

Query: 421 ADWLE------------FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKS 465
            D LE               + N   +I    S K + + + G++A+E K   + +G +S
Sbjct: 376 RDLLEENMKAFIQGEEPTMEIPNDEKMISQFSSRK-YRITSRGKIALERKEEMKKRGLQS 434

Query: 466 TDYSDGLMYTFAEN 479
            D +D ++  F + 
Sbjct: 435 PDRADAIVLAFYKP 448


>gi|228968731|ref|ZP_04129698.1| hypothetical protein bthur0004_54930 [Bacillus thuringiensis
           serovar sotto str. T04001]
 gi|228790961|gb|EEM38595.1| hypothetical protein bthur0004_54930 [Bacillus thuringiensis
           serovar sotto str. T04001]
          Length = 459

 Score =  454 bits (1168), Expect = e-125,   Method: Composition-based stats.
 Identities = 133/494 (26%), Positives = 217/494 (43%), Gaps = 75/494 (15%)

Query: 14  KLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNP 73
           ++ D+ W D +  +F+  +L F+              P  WQ + +       ++   +P
Sbjct: 2   EIIDVYWDDPV--AFAEDMLGFY--------------PDEWQRKVL-------MDLAQSP 38

Query: 74  NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133
                K ++ +G+G+GKT L + +V+W +  RP   VIC A ++ QL T LWAE++KWL 
Sbjct: 39  -----KVSVRSGQGVGKTGLESVVVIWFLCCRPNPKVICTAPTKEQLFTVLWAEIAKWLE 93

Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193
               K+  +     ++                 + +    RT +  +P+   G H  Y M
Sbjct: 94  GSAVKNLLKWTKTRVYMIG------------SEERWFATARTAT--KPENMQGFHEDY-M 138

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
             + DEASG  D I   ILG L+   A     +  NP R SG FY+  N+  D +K  ++
Sbjct: 139 LFVCDEASGIADPIMEAILGTLS--GAENKLFLCGNPTRTSGVFYDSHNRDRDLYKIHKV 196

Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313
            +           E +  +YG  SDV RV V G+FP+ + D+FIPL I+E+A + +  P 
Sbjct: 197 SSLDSPRTSKDNIEVLKKKYGEGSDVWRVRVLGEFPKAEADAFIPLEIVEQAASCKVEPT 256

Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA-- 371
               L +G D+A  G D TV+  R G  +  L +  K D   T   +  L ++Y      
Sbjct: 257 -GETLDLGVDVARFGDDETVIAPRIGNKVFKLLNHYKQDTMETAGHVLKLAKEYMAKYKQ 315

Query: 372 -----IIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420
                I +D +  G    D L      E L + VY V+   + +D E   N  TE    +
Sbjct: 316 LKRVDIKVDDSGVGGGVTDRLKEVIKSERLPFKVYPVVNNGKPLDDEHYDNAGTEGWAVV 375

Query: 421 ADWLE------------FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKS 465
            D LE               + N   +I    S K + + + G++A+E K   + +G +S
Sbjct: 376 RDLLEENMKAFIQGEEPTMEIPNDEKMISQFSSRK-YRITSRGKIALERKEEMKKRGLQS 434

Query: 466 TDYSDGLMYTFAEN 479
            D +D ++  F + 
Sbjct: 435 PDRADAIVLAFYKP 448


>gi|254781187|ref|YP_003065600.1| putative phage terminase, large subunit [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|254040864|gb|ACT57660.1| putative phage terminase, large subunit [Candidatus Liberibacter
           asiaticus str. psy62]
          Length = 367

 Score =  453 bits (1165), Expect = e-125,   Method: Composition-based stats.
 Identities = 252/359 (70%), Positives = 299/359 (83%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60
           M R + T+ + EQ+L +++   E  LSF NFV+ FFPWG KG PLE FS P  WQLEFME
Sbjct: 1   MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60

Query: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
            VD HC ++VNN NP +FK AISAGRGIGKTTLNAW++LWL+STRPG+S+IC+ANSETQL
Sbjct: 61  AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120

Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180
           K TLWAEVSKWLS+LP++HWFEMQSLSLHP+ WY+++L  S+GIDSKHY+  CRTYSEER
Sbjct: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180

Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240
           PDTFVG HNT+GMA+ NDEASGTPD+IN  ILGF TE N NRFWIMTSN RRL+G FY+I
Sbjct: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240

Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
           FN PL+DWKR+QIDTRTVEGID  FHEGII+RYGLDSDV R+E+ GQFPQQ++++FIP N
Sbjct: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300

Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK 359
            IEEA++RE   D YAPLIMGCDIA EGGD TVVV RRG +IEH+FDWS   ++ TN +
Sbjct: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359


>gi|150390341|ref|YP_001320390.1| hypothetical protein Amet_2579 [Alkaliphilus metalliredigens QYMF]
 gi|149950203|gb|ABR48731.1| conserved hypothetical protein [Alkaliphilus metalliredigens QYMF]
          Length = 469

 Score =  451 bits (1161), Expect = e-124,   Method: Composition-based stats.
 Identities = 131/494 (26%), Positives = 202/494 (40%), Gaps = 74/494 (14%)

Query: 14  KLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNP 73
            L D  W + +   F+  +L F+              P  WQ + +  +  H        
Sbjct: 7   ALLDNYWDNPVW--FAEDMLGFY--------------PDPWQAKVLMDLAQH-------- 42

Query: 74  NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133
                K ++ +G+G+GKT L +  + W + TRP   VI  A +  QL   LWAE+SKWLS
Sbjct: 43  ----PKVSVRSGQGVGKTGLESIAITWYLCTRPFPKVIATAPTRQQLYDVLWAEISKWLS 98

Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193
                         ++   +             + +    RT    RP+   G H  Y M
Sbjct: 99  KSKVDKLLRWTKTKIYMNGF------------EERWWATARTAV--RPENMQGFHEDY-M 143

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
             + DEASG  D I   ILG LT        ++  NP + SG FY+  N+  D +K  ++
Sbjct: 144 LFVVDEASGVADPIMEAILGTLTGY--ENKLLLCGNPTKTSGTFYDSHNRDRDTYKSHKV 201

Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313
            +           E +  +YG DSDV RV V G FP+ + DS I L + E+A        
Sbjct: 202 SSMDSPRTSKENIEMLKKKYGADSDVFRVRVLGDFPKGEADSLISLEVTEQAAETVVDIS 261

Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP---- 369
               L +G DIA  G D T++  R G  +  L  +SK D   T   I   V++ +     
Sbjct: 262 NAYTLNIGADIARFGDDKTIIAPRIGNRVLDLQQYSKKDTMETAGNILRTVDRLKTQHLQ 321

Query: 370 ---DAIIIDANNTGARTCDYLEM------LGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420
                I ID +  G    D L        LGY +  +    +A D E   N+  E+   +
Sbjct: 322 INKIVIKIDDDGLGGGVTDRLREINRQQSLGYIIVPIKNGSKADDPEHYYNKAAEMWDNI 381

Query: 421 ADWLEF------------ASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKS 465
            + L+               L     LI+ L + K + V + G + +ESK   + +  +S
Sbjct: 382 RELLDENLSKFLQGEPGVIQLPKDDILIKQLSNRK-YKVDSKGRIELESKDEMKRRIGES 440

Query: 466 TDYSDGLMYTFAEN 479
            D +D ++Y+FA +
Sbjct: 441 PDRADAVIYSFASD 454


>gi|282848875|ref|ZP_06258265.1| conserved hypothetical protein [Veillonella parvula ATCC 17745]
 gi|282581380|gb|EFB86773.1| conserved hypothetical protein [Veillonella parvula ATCC 17745]
          Length = 483

 Score =  448 bits (1153), Expect = e-124,   Method: Composition-based stats.
 Identities = 134/483 (27%), Positives = 217/483 (44%), Gaps = 27/483 (5%)

Query: 10  ETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNS 69
           + ++ +  L       L+F   V   +PWGE GTPLE    P  WQ++ ++ +       
Sbjct: 3   KHDELIEALGALTHDPLAF---VYFAYPWGEPGTPLENMEGPDEWQIQILKDIGEQLKKG 59

Query: 70  VNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS 129
            +       + A+++G GIGK+ L +WL+ + +ST      +  AN+E QL+T  W E+S
Sbjct: 60  KDLQT--AIQEAVASGHGIGKSALISWLIHFAISTHENTRGVVTANTEGQLRTKTWPELS 117

Query: 130 KWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189
           KW ++   K  F   + ++  +               K +      +S+  P++F G HN
Sbjct: 118 KWHNMFIAKDLFTYTATAIFSSD----------KDYEKTWRIDAIPWSKNSPESFAGLHN 167

Query: 190 TYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDW 248
               + ++ DEAS   DVI     G LT+ N    W    NP R SG+F E F K    W
Sbjct: 168 QGNRILVLFDEASAIDDVIWEVTEGALTDANTEIIWCAFGNPTRNSGRFRECFRKYRKFW 227

Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308
             +QID+RTV+  + +  E  +  YG DSD  +V V G FP      FI   I ++A  +
Sbjct: 228 NTYQIDSRTVKISNKTKIEEWLEAYGEDSDFFKVRVRGVFPSASDLQFISTEIADKAQKQ 287

Query: 309 EPCPDP--YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLR-TTNNKISGLVE 365
              P    + P+I+G D A  G D+  +V+R+G  ++ L    K D        I+   +
Sbjct: 288 VYKPGQFEHLPVIIGVDPAWTGSDSLEIVMRQGYYMKSLASIPKNDDDWRMAQLIAQFED 347

Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE 425
           +Y+ DA+ ID    G       + LG     +    ++ D     N R  +  +M +WL 
Sbjct: 348 EYKADAVFIDM-GYGTGIYSIGKQLGRKWRLIEFGGKSNDP-VYLNMRAYMWGQMKEWLR 405

Query: 426 FASLI--NHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
               I  N   L  ++   ++ I+   G + +ESK   + +G  S +  D L  TFA   
Sbjct: 406 EGGSIPPNDQALYDDIVGPEA-IIDKNGRIQLESKKDMKDRGLPSPNKGDALALTFAARV 464

Query: 481 PRS 483
            + 
Sbjct: 465 VKK 467


>gi|150016512|ref|YP_001308766.1| hypothetical protein Cbei_1636 [Clostridium beijerinckii NCIMB
           8052]
 gi|149902977|gb|ABR33810.1| conserved hypothetical protein [Clostridium beijerinckii NCIMB
           8052]
          Length = 470

 Score =  438 bits (1125), Expect = e-120,   Method: Composition-based stats.
 Identities = 129/462 (27%), Positives = 201/462 (43%), Gaps = 47/462 (10%)

Query: 47  GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106
            +  P  +  + M        + V     +  K ++ +G+G+GKT L + +V W + TRP
Sbjct: 12  YWDNPVWFAEDMMNFHADKWQSEVLMALAQSPKVSVRSGQGVGKTGLESIVVTWYLCTRP 71

Query: 107 GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166
              VI  A +  QL   LWAE+SKWL+    ++  E     ++   +            S
Sbjct: 72  FPKVIATAPTRQQLYDVLWAEISKWLASSKIENLLEWTKTKIYMKGY------------S 119

Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIM 226
           + +    +T +  RP+   G H  Y M  + DEASG  D I   ILG LT        +M
Sbjct: 120 ERWWATAKTAT--RPENMQGFHEDY-MLFVVDEASGVADPIMEAILGTLTGY--ENKLLM 174

Query: 227 TSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCG 286
             NP R SG FY+  N+  D +K F++ +           E +  +Y   SDV RV V G
Sbjct: 175 CGNPTRTSGTFYDSHNRDRDLYKTFKVSSLESPRTSKDNIEMLKRKYHEGSDVWRVRVEG 234

Query: 287 QFPQQDIDSFIPLNIIEEA-LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHL 345
           +FP+ + DS I L   E A + +         L +G DIA  G D +V+  R G  +  L
Sbjct: 235 EFPKGESDSLISLEYAETATITKINNIHNNFTLHIGADIARFGNDESVIAPRIGNKVFDL 294

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPD-------AIIIDANNTGARTCDYLEM------LGY 392
             ++K D   T   I    +K++ +        I +D +  G    D L        LGY
Sbjct: 295 LTYTKKDTMETTGNILRATDKFKNEYKHINKVKIRVDDDGLGGGVTDRLREVIRQEGLGY 354

Query: 393 HVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE------------FASLINHSGLIQNLK 440
            V  +    +A D E   ++  E+   M D LE               L N+  LI+ L 
Sbjct: 355 EVMPIKNGSKANDEEHYSDKSAEMWGNMRDILEENFTNFVQGKEPTIELPNNDKLIKQLS 414

Query: 441 SLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479
           + K F + + G + +E K   + +  +S D +D ++Y+FAEN
Sbjct: 415 NRK-FRIDSKGRIDLEKKEEMKKRIGESPDLADAVIYSFAEN 455


>gi|209901239|ref|YP_002290878.1| putative terminase B [Clostridium phage phiCD27]
 gi|199612120|gb|ACH91293.1| putative terminase B [Clostridium phage phiCD27]
          Length = 469

 Score =  436 bits (1121), Expect = e-120,   Method: Composition-based stats.
 Identities = 134/493 (27%), Positives = 210/493 (42%), Gaps = 74/493 (15%)

Query: 15  LFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPN 74
           L D  W + +   F+  +L+F                  WQ + +  +            
Sbjct: 8   LLDCYWDNPVW--FAEDMLNF--------------KADKWQSDVLMAL------------ 39

Query: 75  PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSL 134
            +  K +I +G+G+GKT L +   +W +STRP   V+  A +  QL   LWAE++KWLS 
Sbjct: 40  AQTPKVSIRSGQGVGKTGLESIATVWYLSTRPFPKVVATAPTRQQLYDVLWAEIAKWLSN 99

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
              +   E     ++   +             + +    RT    +P+   G H  Y M 
Sbjct: 100 SKVEKLLEWTKTKVYMKGF------------EERWWATARTAV--KPENMQGFHEDY-ML 144

Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQID 254
            + DEASG  D I   ILG L+   A    ++  NP R SG FY+  N+  D +K F++ 
Sbjct: 145 FVVDEASGVADPIMEAILGTLS--GAENKLLLCGNPTRTSGTFYDSHNRDRDLYKTFKVS 202

Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP 314
           +           E +  +Y   SD  RV V G+FP+ + DS I L  +E +  RE     
Sbjct: 203 SLDSPRTSKDNIEMLKRKYHEGSDPWRVRVLGEFPKGESDSLISLEAVETSTIREVNISN 262

Query: 315 YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP----- 369
              L +G DIA  G D T++  R G  +  L  +SK D   T   I   V+K++      
Sbjct: 263 DYILNIGADIARYGDDETIIAPRIGGKVFDLLTYSKKDTMETVGNILRAVDKFKNMYHQI 322

Query: 370 --DAIIIDANNTGARTCDYL------EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421
               I  D +  GA   D L      E L Y V  +     A++ +   N+ +E+   M 
Sbjct: 323 NRVKIKTDDDGLGAGVTDRLKEVIRHERLKYEVIPIQNGSSAIEKDKYYNKASEMWDNMR 382

Query: 422 DWLE------------FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKST 466
           + L+               L N   LI+ L + K + V + G++ IESK   + +  +S 
Sbjct: 383 EELDANLSSFIQNKEAIIQLPNDDKLIKQLSNRK-YTVDSKGKIQIESKKEMKKRIGESP 441

Query: 467 DYSDGLMYTFAEN 479
           D +D ++Y+FAEN
Sbjct: 442 DRADAVIYSFAEN 454


>gi|290968649|ref|ZP_06560187.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str.
           28L]
 gi|290781302|gb|EFD93892.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str.
           28L]
          Length = 487

 Score =  434 bits (1117), Expect = e-119,   Method: Composition-based stats.
 Identities = 142/486 (29%), Positives = 232/486 (47%), Gaps = 37/486 (7%)

Query: 8   NPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCL 67
           + E  Q L  L            FV   F W  +   L+G   P++WQ++ ++ V     
Sbjct: 5   DIELLQALGSLASDP------VAFVYFAFDWDSE--ELKG-QNPQTWQIKTLKEVGEGL- 54

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
                      + A ++G GIGK+ L AWL+LW +STRP    +  AN+ TQL+T  WAE
Sbjct: 55  -----SLSTALQHATASGHGIGKSALVAWLILWAISTRPDTRGVVTANTATQLETKTWAE 109

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           +SKW  L   K +F + S ++           C      + +      +S +R ++F G 
Sbjct: 110 LSKWYHLFRGKKFFTLTSTAI----------FCRQEGHERTWRIDAIPWSVDRTESFAGL 159

Query: 188 HNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
           HN    + +I DEAS   + I     G LT+++    W++  NP R +G+F++ F+K   
Sbjct: 160 HNQGNRLLLIFDEASAIDNKIWEVAEGALTDKDTEILWLVFGNPTRSTGRFFDCFHKYKK 219

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            W   +ID+RTV+  + +  +  I  YG+DSD  +V V G+FP      FI   I+  A 
Sbjct: 220 SWITQKIDSRTVDISNKTQLQKWIQTYGIDSDFVKVRVLGEFPDTSDTQFISTAIVRTAW 279

Query: 307 NREP---CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLR-TTNNKISG 362
            R P       +AP I+G D A  GGD+TV+ LR+G   E L ++ + D       +++ 
Sbjct: 280 ERRPLRTAEYDFAPCIIGMDPAWTGGDSTVIFLRQGFFSEKLAEYKQNDNDGVMAARLAE 339

Query: 363 LVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422
             +KY  DA+ ID    G     +   +G     V   +++   +   N+R E+   M +
Sbjct: 340 FEDKYHADAVFID-KGYGTGIYSFGVTMGRQWRLVSFAEKS-GAQAYANKRAEMWGNMKE 397

Query: 423 WLEFASLINH-SGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAE 478
           WL+   +I    GLI+ L + ++F +   GE+ +E K   + +G +S + +D L  TFA 
Sbjct: 398 WLQEGGVIPQVDGLIEELTAPQAF-INARGEIQLEKKEDMKKRGIESPNMADALALTFAY 456

Query: 479 NPPRSD 484
              + +
Sbjct: 457 PVLQRN 462


>gi|282598712|ref|YP_003358792.1| putative phage terminase B protein [Enterococcus phage phiEf11]
 gi|300860603|ref|ZP_07106690.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11]
 gi|307292389|ref|ZP_07572245.1| hypothetical protein HMPREF9509_02682 [Enterococcus faecalis
           TX0411]
 gi|258598082|gb|ACV83339.1| putative phage terminase B protein [Enterococcus phage phiEf11]
 gi|300849642|gb|EFK77392.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11]
 gi|306496518|gb|EFM66079.1| hypothetical protein HMPREF9509_02682 [Enterococcus faecalis
           TX0411]
 gi|315146097|gb|EFT90113.1| conserved hypothetical protein [Enterococcus faecalis TX2141]
          Length = 484

 Score =  433 bits (1113), Expect = e-119,   Method: Composition-based stats.
 Identities = 122/490 (24%), Positives = 216/490 (44%), Gaps = 51/490 (10%)

Query: 33  LHFFPWGEKGTPLEGF-SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91
             F P+ + G  ++ +   P ++  + + +      + V +   +  K ++ +G+G+GKT
Sbjct: 3   KEFIPFADIGAAIDYYYDKPVAFCQDILHLDPDEWQDKVLDDLAKFPKVSVRSGQGVGKT 62

Query: 92  TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPA 151
            L A  +LW ++ RP   VI  A +  QL   LWAEV+KWL+    K   +     ++  
Sbjct: 63  ALEAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLNNSLIKDLLKWTKTKIYMV 122

Query: 152 PWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGI 211
                        DS+ +    RT +  +P+   G H  + M I+ DEASG  D I   I
Sbjct: 123 G------------DSERWFATARTAT--KPENMQGFHEDH-MLIVVDEASGVADPIMEAI 167

Query: 212 LGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIA 271
           LG L+  +     +M  NP  + G FY+  N   D ++  ++ +   +  +    + +I 
Sbjct: 168 LGTLSGFD--NKLLMCGNPNNIEGVFYDSHNTDRDKYRTHKVSSYDSKRTNKENIQMLID 225

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL---IMGCDIAEEG 328
           +YG +SDV RV + G+FP+  +DSFI L I+E A +          +    +G D+A  G
Sbjct: 226 KYGENSDVARVRIYGEFPKGALDSFISLEIVEFAKDINISDSELKHVREGHIGVDVARFG 285

Query: 329 GDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI----SGLVEKYRPDA---IIIDANNTGA 381
            D+T+V  R G        +SK D   T  ++      +++ Y       I +D    G 
Sbjct: 286 DDSTIVFPRIGAKALPFEKYSKQDTMQTTGRVLKAAKRMMDDYPTIKKVFIKVDDTGVGG 345

Query: 382 RTCDYLEM------LGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE---------- 425
              D L+       L Y V  V   + + D ++  N+ T++   + + LE          
Sbjct: 346 GVTDRLKEVISDEKLPYEVIPVNNGESSTD-DYYANKGTQIWGDVKELLEQNISNSINGQ 404

Query: 426 --FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
                L +++ LI+ L + K F + + G++ +ESK   + +   S D +D L   F E  
Sbjct: 405 GPTIELPDNANLIKELSTRK-FKMTSNGKIRLESKEDMKKRNVGSPDIADALTLAFYEPF 463

Query: 481 PRSDMDFGRC 490
               ++  + 
Sbjct: 464 RPEPINVKKA 473


>gi|257883493|ref|ZP_05663146.1| conserved hypothetical protein [Enterococcus faecium 1,231,502]
 gi|294614775|ref|ZP_06694675.1| hypothetical protein EfmE1636_0865 [Enterococcus faecium E1636]
 gi|294622490|ref|ZP_06701512.1| conserved hypothetical protein [Enterococcus faecium U0317]
 gi|257819151|gb|EEV46479.1| conserved hypothetical protein [Enterococcus faecium 1,231,502]
 gi|291592387|gb|EFF23996.1| hypothetical protein EfmE1636_0865 [Enterococcus faecium E1636]
 gi|291598037|gb|EFF29147.1| conserved hypothetical protein [Enterococcus faecium U0317]
          Length = 471

 Score =  429 bits (1104), Expect = e-118,   Method: Composition-based stats.
 Identities = 125/484 (25%), Positives = 208/484 (42%), Gaps = 51/484 (10%)

Query: 34  HFFPWGEKGTPLEGF-SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTT 92
            F P+ + G+ ++ +   P ++  + + +       +V N   E  K ++ +G+G+GKT 
Sbjct: 4   EFIPFADIGSAIDYYYDKPVAFCQDILHLNPDEWQENVLNDLAEFSKVSVRSGQGVGKTA 63

Query: 93  LNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAP 152
           L A  +LW ++ RP   VI  A +  QL   LWAEV+KWL+    K+  +     ++   
Sbjct: 64  LEAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLNDSLIKNLLKWTKTKIYMVG 123

Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGIL 212
                       DS+ +    RT +  +P+   G H  + M I+ DEASG  D I   IL
Sbjct: 124 ------------DSERWFATARTAT--KPENMQGFHEDH-MLIVVDEASGVSDPIMEAIL 168

Query: 213 GFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR 272
           G L+  +     +M  NP  + G FY+  N   D ++  ++ +   +  +    E I+ +
Sbjct: 169 GTLSGFD--NKLLMCGNPNNIEGVFYDSHNSDRDKYRVHKVSSYDSKRTNKDNIEMILKK 226

Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD---PYAPLIMGCDIAEEGG 329
           YG +SDV RV + G+FP+  +DSFI L  +E A  ++             +G D+A  G 
Sbjct: 227 YGKESDVARVRIFGEFPKGALDSFISLETVELATEKQISDSLVNKTTVAHIGVDVARYGD 286

Query: 330 DNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS----GLVEKYRPD---AIIIDANNTGAR 382
           D+T++  R          +SK     T   +      L+ +Y       I +D    G  
Sbjct: 287 DSTILFPRIATRALEYEKYSKRSTMETTGYVINMAKNLMSQYPSIDKVMIKVDDTGVGGG 346

Query: 383 TCDYLEML------GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF---------- 426
             D LE L       + V+ V     + D +F  N  T+L   + + LE           
Sbjct: 347 VTDRLEELIEDKHYPFEVFGVNNGSTSED-DFYDNLGTQLWGNIKEMLEENMTANLNGEQ 405

Query: 427 --ASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPP 481
               L + S LI+ L + K F + +   + +ESK   + +   S D +D L   F E P 
Sbjct: 406 PVIELPSDSSLIKELSTRK-FKMTSRSRIRLESKDDMKKRNIGSPDIADALALAFYEPPS 464

Query: 482 RSDM 485
               
Sbjct: 465 HYQF 468


>gi|261208032|ref|ZP_05922709.1| conserved hypothetical protein [Enterococcus faecium TC 6]
 gi|289567088|ref|ZP_06447483.1| conserved hypothetical protein [Enterococcus faecium D344SRF]
 gi|260077749|gb|EEW65463.1| conserved hypothetical protein [Enterococcus faecium TC 6]
 gi|289161103|gb|EFD09008.1| conserved hypothetical protein [Enterococcus faecium D344SRF]
          Length = 471

 Score =  429 bits (1102), Expect = e-118,   Method: Composition-based stats.
 Identities = 125/484 (25%), Positives = 207/484 (42%), Gaps = 51/484 (10%)

Query: 34  HFFPWGEKGTPLEGF-SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTT 92
            F P+ + G  ++ +   P ++  + + +       +V N   E  K ++ +G+G+GKT 
Sbjct: 4   EFIPFADIGAAIDYYYDKPVAFCQDILHLNPDEWQENVLNDLAEFSKVSVRSGQGVGKTA 63

Query: 93  LNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAP 152
           L A  +LW ++ RP   VI  A +  QL   LWAEV+KWL+    K+  +     ++   
Sbjct: 64  LEAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLNDSLIKNLLKWTKTKIYMVG 123

Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGIL 212
                       DS+ +    RT +  +P+   G H  + M I+ DEASG  D I   IL
Sbjct: 124 ------------DSERWFATARTAT--KPENMQGFHEDH-MLIVVDEASGVSDPIMEAIL 168

Query: 213 GFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR 272
           G L+  +     +M  NP  + G FY+  N   D ++  ++ +   +  +    E I+ +
Sbjct: 169 GTLSGFD--NKLLMCGNPNNIEGVFYDSHNSDRDKYRVHKVSSYDSKRTNKDNIEMILKK 226

Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD---PYAPLIMGCDIAEEGG 329
           YG +SDV RV + G+FP+  +DSFI L  +E A  ++             +G D+A  G 
Sbjct: 227 YGKESDVARVRIFGEFPKGALDSFISLETVELATEKQISDSLVNKTTVAHIGVDVARYGD 286

Query: 330 DNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS----GLVEKYRPD---AIIIDANNTGAR 382
           D+T++  R          +SK     T   +      L+ +Y       I +D    G  
Sbjct: 287 DSTILFPRIATRALEYEKYSKRSTMETTGYVINMAKNLMSQYPSIDKVMIKVDDTGVGGG 346

Query: 383 TCDYLEML------GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF---------- 426
             D LE L       + V+ V     + D +F  N  T+L   + + LE           
Sbjct: 347 VTDRLEELIEDKHYPFEVFGVNNGSTSED-DFYDNLGTQLWGNIKEMLEENMTANLNGEQ 405

Query: 427 --ASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPP 481
               L + S LI+ L + K F + +   + +ESK   + +   S D +D L   F E P 
Sbjct: 406 PVIELPSDSSLIKELSTRK-FKMTSRSRIRLESKDDMKKRNIGSPDIADALALAFYEPPS 464

Query: 482 RSDM 485
               
Sbjct: 465 HYQF 468


>gi|228950291|ref|ZP_04112468.1| hypothetical protein bthur0007_63570 [Bacillus thuringiensis
           serovar monterrey BGSC 4AJ1]
 gi|228809453|gb|EEM55897.1| hypothetical protein bthur0007_63570 [Bacillus thuringiensis
           serovar monterrey BGSC 4AJ1]
          Length = 495

 Score =  427 bits (1099), Expect = e-117,   Method: Composition-based stats.
 Identities = 121/505 (23%), Positives = 200/505 (39%), Gaps = 83/505 (16%)

Query: 13  QKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNN 72
            +L ++   D +  +F   +L                 P  WQ E +  +  H       
Sbjct: 19  TQLLEIYVDDPV--AFVEDILEV--------------EPDPWQKEVLNDIANHSH----- 57

Query: 73  PNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWL 132
                   ++ +G+G+GKT + +W+ +W +  RP   +IC A ++ QL   LWAE++KWL
Sbjct: 58  -------VSVRSGQGVGKTAMESWICIWFLCCRPYPKIICTAPTKQQLYDVLWAEIAKWL 110

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +    K   +     ++   +               +    +T +  RP+   G H  Y 
Sbjct: 111 NSSQVKDLLKWTKTKIYMKGF------------EDRWFATAKTAT--RPENMQGFHEDY- 155

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
           M  I DEASG  D I   ILG L+   +     M  NP + SG F++  NK    +K  +
Sbjct: 156 MLFIADEASGIADDIMEAILGTLS--GSENKLFMCGNPTKTSGVFFDSHNKDRALYKSHK 213

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP-- 310
           + +           E +  +YG  SDV RV V G+FP+ + D+FI L   E A  RE   
Sbjct: 214 VSSADSPRTSKKNIEMLKKKYGEGSDVYRVRVEGEFPRGEADAFISLETAEAARMREVYK 273

Query: 311 ---------------CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRT 355
                               A + +GCD+A  G D T++  RRG  +  L    + D   
Sbjct: 274 VEVIENEEEESTVKEIIPDTAVVEIGCDVARFGSDETIIATRRGWKVLPLQVHHQRDTMY 333

Query: 356 TNNKISGLVEKY--------RPDAIIIDANNTGARTCDYLEM------LGYHVYRVLGQK 401
            +  +    +KY        +   I ID    G    D L+           V  +    
Sbjct: 334 VSGLLVQEAKKYFSWCERTGKRIPIRIDDTGVGGGVTDRLKEVVAENDYPIDVIPINFAS 393

Query: 402 RAVDLEFCRNRRTELHVKMAD-WLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK-- 458
           +           + ++    D  LEF +L +   LI  L S++ + + + G + IE K  
Sbjct: 394 K--GNAEYACIVSVMYGHFKDNCLEFVALPDDEDLIAQL-SVRKYQINSDGRIKIEPKKA 450

Query: 459 -RVKGAKSTDYSDGLMYTFAENPPR 482
            + +G KS D ++ ++  FA   P+
Sbjct: 451 MKDRGLKSPDRAEAVVMAFAPFYPK 475


>gi|332981151|ref|YP_004462592.1| hypothetical protein Mahau_0567 [Mahella australiensis 50-1 BON]
 gi|332698829|gb|AEE95770.1| hypothetical protein Mahau_0567 [Mahella australiensis 50-1 BON]
          Length = 461

 Score =  424 bits (1090), Expect = e-116,   Method: Composition-based stats.
 Identities = 133/448 (29%), Positives = 198/448 (44%), Gaps = 50/448 (11%)

Query: 49  SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108
           + P  WQ E ++ +  +             + A+ +G G+GKT L AW +LW + TRP  
Sbjct: 25  AEPDDWQAETLQALADN------------PRVAVRSGHGVGKTALEAWALLWFLFTRPYP 72

Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPN-KHWFEMQSLSLHPAPWYSDVLHCSLGIDSK 167
            + C A +  QL   LWAE SKWL   P  K +FE Q   +                   
Sbjct: 73  KIPCTAPTREQLHDILWAEASKWLERAPALKPYFEWQKTRI------------VQKQYPG 120

Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMT 227
            +    RT    +P+   G H  + +  I DEASG  D I   I G LT  +A    +M 
Sbjct: 121 RWFATART--SNKPENMAGFHEEH-LLFIIDEASGIADNIFETIEGALTTSDAK--LLMC 175

Query: 228 SNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQ 287
            NP + SG F++ F K    +   ++     + +   + E +  +Y  DSDV RV V G+
Sbjct: 176 GNPTKNSGVFHDAFFKDRSLYWTRKVSCLDSQRVTLEYAERLKRKYHEDSDVYRVRVLGE 235

Query: 288 FPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347
           FP+ + D+FI L+I+E A  R+  PD    L +G D+A  G D TV+  R G  + +L  
Sbjct: 236 FPKAEPDTFISLDIVEAATMRDVEPD--GVLEIGVDVARFGDDETVLAARAGLKLVYLKA 293

Query: 348 WSKTDLRTTNNKISGLVEKY-----RPD-AIIIDANNTGARTCDYLEM------LGYHVY 395
           ++K D  TT      L +       +P   I ID +  G    D          L   V 
Sbjct: 294 YTKQDTMTTAGYAIALAKDLMKECGKPKCTIKIDDDGVGGGVTDRCREVVREEKLYIDVI 353

Query: 396 RVLGQKRAVDLEFCRNRRTELHVKMADWL--EFASLINHSGLIQNLKSLKSFIVPNTGEL 453
                    D E   N  TE    + D L  E A LIN   LI  L + K + + + G++
Sbjct: 354 DCHNGGAPEDKEHYENWGTEAWAYLRDLLQDEQAELINDEDLIGQLTTRK-YRITSKGKI 412

Query: 454 AIESK---RVKGAKSTDYSDGLMYTFAE 478
           A+ESK   + +G  S D +D ++  +A+
Sbjct: 413 ALESKDEMKRRGLMSPDRADAVVLAYAK 440


>gi|308069786|ref|YP_003871391.1| hypothetical protein PPE_03030 [Paenibacillus polymyxa E681]
 gi|305859065|gb|ADM70853.1| Conserved hypothetical protein [Paenibacillus polymyxa E681]
          Length = 452

 Score =  416 bits (1070), Expect = e-114,   Method: Composition-based stats.
 Identities = 124/456 (27%), Positives = 186/456 (40%), Gaps = 58/456 (12%)

Query: 51  PRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISV 110
           P  WQ   +       ++  NNP     + ++ +G+G+GKT L A   LW +S  P   V
Sbjct: 6   PDDWQASTL-------MDLANNP-----RVSVRSGQGVGKTGLEAATALWFLSCFPYPKV 53

Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170
           IC A +  QL   LWAE++KW S  P         +      W    ++       + + 
Sbjct: 54  ICTAPTRQQLHDVLWAEINKWQSKSP---------VLKRILKWTKTKIYM--KNYEERWF 102

Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230
              RT +  +P+   G H  Y M  I DEASG  D I   ILG L+        +M  NP
Sbjct: 103 ATARTAT--KPENMQGLHEDY-MLFIVDEASGVADPIMEAILGTLSGE--FNKILMCGNP 157

Query: 231 RRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290
            + SG FY+  NK   D+K  ++               +  +YG  SDV RV V G+FP+
Sbjct: 158 TKTSGVFYDSHNKDRADYKTRKVSCLDSPRTSKDNIAMLKRKYGEGSDVWRVRVEGEFPR 217

Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK 350
              D+FI L + E A            L +G D+A  G D T +    GP I       K
Sbjct: 218 GGSDTFISLEVAEFAAKEVKLEPTGDMLTIGVDVARFGDDETSMFAGIGPRIVGEHHHFK 277

Query: 351 TDLRTTNNKISGLVEKYR-------PDAIIIDANNTGARTCDYL------EMLGYHVYRV 397
                T   +  L ++ +          I +D +  G    D L      E L Y +  +
Sbjct: 278 KGTMVTAGWVINLAKELQVAHPYLNRIRIRVDDSGVGGGVTDRLSEIVAEEGLPYEIIPI 337

Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLE------------FASLINHSGLIQNLKSLKSF 445
                ++D E   N  TE+   + + LE               L +   LI  L + K +
Sbjct: 338 NNGSSSLD-EHYGNLVTEMWASIKEQLEQNMSNFMNGDSSILQLPDDDVLITQLTARK-W 395

Query: 446 IVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAE 478
            + + G++ +ESK   + +G KS D +D  + TF E
Sbjct: 396 NMTSKGKMLLESKKDMKKRGLKSPDRADAFVLTFGE 431


>gi|54302246|ref|YP_132239.1| terminase large subunit [Photobacterium profundum SS9]
 gi|46915667|emb|CAG22439.1| hypothetical protein PBPRB0566 [Photobacterium profundum SS9]
          Length = 513

 Score =  414 bits (1064), Expect = e-113,   Method: Composition-based stats.
 Identities = 132/515 (25%), Positives = 213/515 (41%), Gaps = 42/515 (8%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEK------------GTPLEGF 48
           M+++   N E  Q   D+    +  L    FV++ +PW                +  +  
Sbjct: 1   MAKKEEINYEH-QLAIDIGGFYDDPL---GFVMYAYPWDTDPDLQIVKLPEPWASKYDSV 56

Query: 49  SAPRSWQLE----FMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMST 104
             P +W  E      EV+  +  N V+    + F  +IS+G GIGK+  ++WL+ ++MST
Sbjct: 57  YGPDAWFCEMCDQLQEVIRKNDFNGVDPV--DAFLYSISSGHGIGKSCASSWLIHFVMST 114

Query: 105 RPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI 164
           RP    +  +N+  QL+T  W E+ KW   L NKHWF   +   +   ++ D        
Sbjct: 115 RPNSKGVVTSNTSEQLRTKTWGELGKWTKKLINKHWFVYNNGKGNMNFYHKDY------- 167

Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAI-INDEASGTPDVINLGILGFLTERNANRF 223
            ++ +    +T  EE  ++F G H        + DEAS  PD I     G LT+     F
Sbjct: 168 -AETWRVDAQTCREENSESFAGLHCASSTPWYLFDEASAVPDKIWEVAEGGLTDGEP--F 224

Query: 224 WIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283
           W +  NP R SG+F E + +    W R QID+ TV+  +        + YG DSD  RV 
Sbjct: 225 WFVFGNPTRNSGRFRECWRRFRQRWNRKQIDSSTVQVTNKKKISEWESDYGEDSDFYRVR 284

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRG--PV 341
           V G FP    +  I   ++E A++R     P +P +M  D+A  GGDN V   R G    
Sbjct: 285 VKGVFPSASSNQKISGALLEAAMSRTAHVIPGSPRVMSLDVARGGGDNCVFRFRHGLNGG 344

Query: 342 IEHLFDWSKT---DLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVL 398
           +        +   D          L  +++PDA  ID    G    D +  LG++   + 
Sbjct: 345 VRKKVTLPGSEYRDSMKLAAMAVQLCSEFKPDAFFIDETGVGGPVGDRIRQLGFNCIGIN 404

Query: 399 GQKRAVDLEFCRNRRTELHVKMADWLEFASLIN-HSGLIQNLKSLKSFIVPNTGELAIES 457
              +A D     N R  ++ +  +WL+    ++   GL+  + +++        E+ I  
Sbjct: 405 FASKAPDP-HYANMRAYMYHQWGEWLKAGGSLHYDEGLLTEVGAIEYTHDRKDREILIPK 463

Query: 458 K--RVKGAKSTDYSDGLMYTFAENPPRSDMDFGRC 490
              +     STD  D      A         +   
Sbjct: 464 DVIKKAIGISTDDGDACALLHAYPVAPRQQGYNSA 498


>gi|323486060|ref|ZP_08091391.1| hypothetical protein HMPREF9474_03142 [Clostridium symbiosum
           WAL-14163]
 gi|323400627|gb|EGA92994.1| hypothetical protein HMPREF9474_03142 [Clostridium symbiosum
           WAL-14163]
          Length = 476

 Score =  409 bits (1051), Expect = e-112,   Method: Composition-based stats.
 Identities = 125/473 (26%), Positives = 193/473 (40%), Gaps = 51/473 (10%)

Query: 47  GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106
               P  +  E +                E  K AI +G+G+GKT + A  +LW +   P
Sbjct: 20  YRKNPVLFAQEVLLFEPDDWQKQALMDLAESPKVAIKSGQGVGKTGMEAVALLWFLCCYP 79

Query: 107 GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166
              ++  A ++ QL   LW+EVSKW+S  P         L      W    ++       
Sbjct: 80  YPRIVATAPTKQQLHDVLWSEVSKWMSKSP---------LLSDILKWTKTYIYMVGN--E 128

Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIM 226
           K +  + RT +  +P+   G H    M  I DEASG  D I   ILG L+   AN   +M
Sbjct: 129 KRWFAVARTAT--KPENMQGFHED-NMLFIVDEASGVADPIMEAILGTLS--GANNKLLM 183

Query: 227 TSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCG 286
             NP R SG FY+ FN     ++   + +   +  +    E +I +YG DS+V  V V G
Sbjct: 184 CGNPTRTSGTFYDAFNVDRSIYRCHTVSSADSKRTNKQNIESLIRKYGKDSNVVLVRVFG 243

Query: 287 QFPQQDIDSFIPLNIIEEALNREPCPD-PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHL 345
           +FP+Q+ D FI L+I+E     +   D P   +  G D+A  G D TV+    G  I   
Sbjct: 244 EFPKQEDDVFIALSIVEHCCMLDLPDDVPIKRISFGVDVARYGSDETVIAKNVGGRITLP 303

Query: 346 FDWSKTDLRTTNNKISGLVEK-------YRPD-AIIIDANNTGARTCDYLEMLG------ 391
             +    L TT  KI  L  +       YR    I ID    G    D LE +       
Sbjct: 304 VSFRGQSLMTTVGKIVQLYRQAITEFPRYRGKIYINIDDCGLGGGVTDRLEEVKQEEKLT 363

Query: 392 -YHVYRVLGQKRAVDL----------EFCRNRRTELHVKMADWL--EFASLINHSGLIQN 438
              +  V    +  +           +   N  T L   + D L  E  SL N + L+  
Sbjct: 364 RMVIVPVNAAGKVPEETLGDGKQKACDIYDNMTTYLWGTVKDALMMEEVSLENDNELVAQ 423

Query: 439 LKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPRSDMDFG 488
             + + + + + G++ +ESK   + +G  S D +D +  +  +   +   + G
Sbjct: 424 F-TCRKYRLTSRGKMLLESKEEMKKRGIDSPDRADAVALSCYQ---KKTFNIG 472


>gi|332976102|gb|EGK12970.1| hypothetical protein HMPREF9374_1123 [Desmospora sp. 8437]
          Length = 462

 Score =  406 bits (1043), Expect = e-111,   Method: Composition-based stats.
 Identities = 122/459 (26%), Positives = 195/459 (42%), Gaps = 39/459 (8%)

Query: 47  GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106
               P  +  E ++       +       +  + A+ AG G+GKT   AW VLW + TRP
Sbjct: 17  YIRKPGLFVREVLKAEPDEWQDIALQALADNQRVAVRAGHGVGKTATEAWAVLWFLLTRP 76

Query: 107 GISVICLANSETQLKTTLWAEVSKWLSLLPN-KHWFEMQSLSLHPAPWYSDVLHCSLGID 165
              + C A ++ QL   LW E++KWL   P    + E Q                 +   
Sbjct: 77  FPKIPCTAPTKPQLMDVLWPEIAKWLMNAPELAPYVEWQKTR------------VVMKQY 124

Query: 166 SKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWI 225
            + +    RT    +P+   G H  + +  + DEASG  + I   I G LT   +    +
Sbjct: 125 EERWFATARTS--NKPENMAGFHEEH-LLFVIDEASGVDNAIFETIDGALTTAGSK--LV 179

Query: 226 MTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC 285
           M  NP R +G FY+ F++  D +  ++I     +     +   +  +YG DSD+ RV V 
Sbjct: 180 MFGNPTRTNGVFYDAFHQDRDLYWTYKISCLDSKMASKDYARNMARKYGEDSDIYRVRVQ 239

Query: 286 GQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHL 345
           G+FPQ D DSFIPL ++E+A  R+        L +G D+A  G D TV+  R GPV   L
Sbjct: 240 GEFPQGDPDSFIPLELVEDARVRDLEWIDEDELHIGVDVARFGSDETVLAARIGPVAFRL 299

Query: 346 FDWSKTD-LRTTNNKIS----GLVEKYRPDA--IIIDANNTGARTCDYLEM------LGY 392
             +        T  ++      L+E++R D   + +D    G    D L+       L  
Sbjct: 300 DRYGGRTPTTETVGRVLALARELMEEHRRDYAVVKVDDTGVGGGVTDQLQEIVAEEGLNI 359

Query: 393 HVYRVLGQKRA-VDLEFCRNRRTELHVKMADWLEFASL---INHSGLIQNLKSLKSFIVP 448
            V           D +   +  TE    + D  +   +   I+   LI  L + K   + 
Sbjct: 360 DVIPCNNGATPEHDPDHYHDWGTESWGTLLDRFKAGEIALKIDDEDLIGQLTTRKK-EMT 418

Query: 449 NTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPRSD 484
           + G++ +ESK   + +G +S D +D L+  FAE    + 
Sbjct: 419 SKGKIKLESKEKMKKRGQRSPDRADALVLAFAEAATETG 457


>gi|289578588|ref|YP_003477215.1| hypothetical protein Thit_1395 [Thermoanaerobacter italicus Ab9]
 gi|289528301|gb|ADD02653.1| conserved hypothetical protein [Thermoanaerobacter italicus Ab9]
          Length = 460

 Score =  398 bits (1023), Expect = e-109,   Method: Composition-based stats.
 Identities = 123/464 (26%), Positives = 188/464 (40%), Gaps = 57/464 (12%)

Query: 52  RSW--QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGIS 109
             W  Q E ++ V  H             + A+ A  G+GKT + AW+ LW + T     
Sbjct: 30  DPWEKQEEILKAVRDHK------------RVAVRACHGVGKTKVAAWVALWFLYTHHNSK 77

Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169
           VI  A +  Q++  LW E+    +                  P    VL   + +  + +
Sbjct: 78  VITTAPTWHQVENLLWREIHAAHAASR--------------IPLGGKVLQTQIELGEQWF 123

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229
                  S ++P+ F G H  + + I+ DEASG          GFLT   A    ++  N
Sbjct: 124 ---ALGLSTDKPERFQGFHAEHILLIV-DEASGVEQYTFDAAEGFLTSIGAK--LLLIGN 177

Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVE-----------GIDPSFHEGIIARYGLDSD 278
           P +LSG+FY  F  PL  + +  I                  + P + E    ++G DS 
Sbjct: 178 PTQLSGEFYNAFRSPL--YHKIHISAFDSPNLKAGKIVRPYLVTPEWVEDKRLKWGEDSP 235

Query: 279 VTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRR 338
           +    V G+FP+Q  D+ IPL  IE A  R    +   P+ +G D+A  G D TV++LRR
Sbjct: 236 LWYSRVLGEFPEQGNDTLIPLAWIEAAQQRWHMTEAGEPVEIGADVARYGTDTTVIMLRR 295

Query: 339 GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVL 398
           G   E ++     D      K+    +K   + I ID    GA   D L+  GY V  + 
Sbjct: 296 GDKAEIVYQLRGQDTMEVTGKVIDAFKKTGANVIKIDVVGIGAGVVDRLKEQGYPVQGLN 355

Query: 399 GQKRAVDLEFCRNRRTELHVKMADWLEFA--SLINHSGLIQNLKSLKSFIVPNTGELAIE 456
             + A D     N+R E +  + +  +    ++     L   L SLK +   + G + IE
Sbjct: 356 VGESATDKGRFVNKRAEWYWALRERFQEGTIAIPPDDELASQLASLK-YKFDSRGRIQIE 414

Query: 457 SK---RVKGAKSTDYSDGLMYTF----AENPPRSDMDFGRCPSY 493
           SK   R +G  S D +D LM  F     +       D  R  S+
Sbjct: 415 SKEELRRRGLPSPDKADALMLAFSSTGMKPVDEKIKDIFRRASF 458


>gi|255282256|ref|ZP_05346811.1| conserved hypothetical protein [Bryantella formatexigens DSM 14469]
 gi|255267204|gb|EET60409.1| conserved hypothetical protein [Bryantella formatexigens DSM 14469]
          Length = 506

 Score =  388 bits (996), Expect = e-105,   Method: Composition-based stats.
 Identities = 111/460 (24%), Positives = 184/460 (40%), Gaps = 48/460 (10%)

Query: 47  GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106
               P  +  E ++               E  + A+ +G+G+GKT + A  VLW +S   
Sbjct: 34  YRKDPVLYAREVLQFEPDEWQRDALMDLAEESRVAVKSGQGVGKTGIEAVAVLWFLSCFR 93

Query: 107 GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166
              V+  A +  QL   LW+E++KW    P         L      W    ++       
Sbjct: 94  YARVVATAPTRQQLHDVLWSEIAKWQERSP---------LLKAILRWTKTYVYV--KGYE 142

Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIM 226
           K +  + RT +  +P+   G H    M  I DEASG  D I   +LG L+    N   +M
Sbjct: 143 KRWFAVARTAT--KPENMQGFHED-NMLFIVDEASGVADPIMEAVLGTLS--GGNNKLLM 197

Query: 227 TSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCG 286
             NP R +G FY+ F K    +    + +      D +  + +I +YG DS++ RV V G
Sbjct: 198 CGNPTRTTGTFYDAFTKDRSIFACHTVSSLDSSRTDKNNIDALIRKYGEDSNLVRVRVKG 257

Query: 287 QFPQQDIDSFIPLNIIEEALNRE---PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIE 343
            FP+QD D FI   +I++  +R+   P     A +I+G D+A  G D TV+       I+
Sbjct: 258 LFPKQDDDVFISQELIDQCTSRQYELPESRGMAQVILGVDVARYGNDETVIYRNFKGRIK 317

Query: 344 HLFDWSKTDLRTTNNKISGLV----EKYRP----DAIIIDANNTGARTCDYLEMLGYH-- 393
            + +    +L  T   I        + Y        I ID    G    D L  +     
Sbjct: 318 MVRNRRGQNLMATAGDIVREYRHIVDGYPGFDGKIYINIDDTGLGGGVTDRLREVKKEQK 377

Query: 394 -----VYRVLGQKR--------AVDLEFCRNRRTELHVKMADWLEFASLI--NHSGLIQN 438
                +  +   ++            E+  N  T +   + + LE   ++  + +  +  
Sbjct: 378 LTRMVIIPINAAEKIETDTKAGKEAAEYYNNLTTHMWAAVRELLEKREIVIEDDAETVAQ 437

Query: 439 LKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYT 475
           L   K + V + G++ IE K   + +G  S D +D L  +
Sbjct: 438 LSMRK-YTVASNGKIEIEPKKEMKKRGLDSPDRADALTLS 476


>gi|167767949|ref|ZP_02440002.1| hypothetical protein CLOSS21_02492 [Clostridium sp. SS2/1]
 gi|167710278|gb|EDS20857.1| hypothetical protein CLOSS21_02492 [Clostridium sp. SS2/1]
 gi|291560988|emb|CBL39788.1| hypothetical protein CL2_30180 [butyrate-producing bacterium SSC/2]
          Length = 473

 Score =  384 bits (987), Expect = e-104,   Method: Composition-based stats.
 Identities = 121/488 (24%), Positives = 193/488 (39%), Gaps = 68/488 (13%)

Query: 11  TEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSV 70
            +  +  +    +  + F   VL F+              P  WQ E    +  +     
Sbjct: 7   HDFLVESIPLWQQNPVQFFEEVLFFY--------------PDEWQKEAAFALRDN----- 47

Query: 71  NNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSK 130
                   K  I +G+G+GKT   A  +LW +S      V+  A +  QL   LWAEVSK
Sbjct: 48  -------SKVTIKSGQGVGKTGFEAATLLWFLSCFENARVVATAPTLHQLNDVLWAEVSK 100

Query: 131 WLSLLPN-KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189
           W S  P  K   +     +                  + +  + RT +   P+   G H 
Sbjct: 101 WQSKSPLLKEILQWTKTKISMIG------------SKERWYAVARTATT--PENMQGFHE 146

Query: 190 TYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWK 249
              M  I DEASG  D I   ILG LT   +N   ++  NP + SG FY+        + 
Sbjct: 147 D-NMLFIVDEASGVADPIMEAILGTLT--GSNNKLLLCGNPTKASGTFYDSHTSDRKLYY 203

Query: 250 RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309
              +++   +  +    + +I +YG +S+V RV V G FP+QD D ++PL ++E ++  E
Sbjct: 204 CITVNSAESKRTNKDNIDSLIRKYGEESNVVRVRVKGLFPKQDDDVYMPLEMLEASIILE 263

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE---- 365
             P P     +G D+A  G D+TV+       I         DL  T   +         
Sbjct: 264 EIP-PADICTLGVDVARFGDDDTVIARNMNNKITLEKIRHGQDLMKTVGDVVVECRNIKE 322

Query: 366 --KY-RPDAIIIDANNTGARTCDYLEML-------GYHVYRVLGQKRAVDL---EFCRNR 412
             KY +   +IID    G    D L  L       G  +  V       D    E   + 
Sbjct: 323 KFKYKKTIYVIIDDTGLGGGVTDRLNELKSEGKLSGVVIVPVNFSAAVPDKKAAEKYHDI 382

Query: 413 RTELHVKMADWLE--FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTD 467
            +     + D LE   A L N + LI  L S + + + ++G++ +ESK   + +  +S D
Sbjct: 383 TSYAWSILRDMLEEKEAVLPNDTELIAQL-SARKYDLSSSGKIRLESKKAMKERIGESPD 441

Query: 468 YSDGLMYT 475
            +D ++ +
Sbjct: 442 RADAVVLS 449


>gi|332980681|ref|YP_004462122.1| hypothetical protein Mahau_0077 [Mahella australiensis 50-1 BON]
 gi|332698359|gb|AEE95300.1| hypothetical protein Mahau_0077 [Mahella australiensis 50-1 BON]
          Length = 486

 Score =  384 bits (987), Expect = e-104,   Method: Composition-based stats.
 Identities = 110/470 (23%), Positives = 182/470 (38%), Gaps = 60/470 (12%)

Query: 49  SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108
           + P  + +E +          + +   +  + A+ +  G GK+ +   ++LW + +    
Sbjct: 16  NDPVWFVIEILGTRPWKKQIDIISAVRDNPRTAVRSCHGAGKSFIAGQVILWFLYSFYPS 75

Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168
            V+  A +  Q++  +W EV                S      P   ++L     I    
Sbjct: 76  IVLSTAPTWRQVEKLIWKEVR--------------ASYRRSKVPLGGNLLPKRPEIQIIQ 121

Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTS 228
                   S   PD F G H    + ++ DEA+G P+ I   I G LT  +A    ++  
Sbjct: 122 DEWYAVGLSTNEPDRFQGFHEE-NILVVVDEAAGVPEEIFEAIEGVLTSEHAR--LLLLG 178

Query: 229 NPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE----------------------------- 259
           NP  + G FY  F  P   W+   I   T                               
Sbjct: 179 NPTSVGGTFYNAFRTPG--WENISISAFTTPNFTAFGITEDDIINKTWESKITNSLPNPK 236

Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI 319
            I P++      R+G +S   +  V GQFP +  D+ IPL  IE A+ R        P+ 
Sbjct: 237 LITPAWVADKYRRWGPNSPAYQARVLGQFPSEGEDTLIPLAWIEAAMARWEDTPEGEPIE 296

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
           +G D+A  G D TV+  RRG  +  L  ++K D   T   I  +  K       +D    
Sbjct: 297 IGVDVARFGSDKTVIAARRGQKVLPLNVYAKQDTMETVGCIIMVHRKIGASKTKVDVIGV 356

Query: 380 GARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL--------EFASLIN 431
           GA   D L+  G+ V  +   + A D E   N R+EL   M + L        E  +L  
Sbjct: 357 GAGVVDRLKEQGHPVIGINVAEAATDTEKFANLRSELWWNMRELLDPNQRLNPEPIALPP 416

Query: 432 HSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAE 478
              L+ +L  +K + + + G + +ESK   + +  +S D +D ++  FA+
Sbjct: 417 DDELLADLSGVK-YKIDSRGRIQVESKEDMKKRLGRSPDRADAVVLAFAK 465


>gi|319956916|ref|YP_004168179.1| hypothetical protein Nitsa_1177 [Nitratifractor salsuginis DSM
           16511]
 gi|319419320|gb|ADV46430.1| hypothetical protein Nitsa_1177 [Nitratifractor salsuginis DSM
           16511]
          Length = 462

 Score =  377 bits (969), Expect = e-102,   Method: Composition-based stats.
 Identities = 113/419 (26%), Positives = 185/419 (44%), Gaps = 26/419 (6%)

Query: 64  AHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTT 123
              + ++   +    K +I +G G GKTTL AW+VLW    R    +   A +  QL   
Sbjct: 30  KQQMKAIRAIDQGKKKISIRSGHGTGKTTLLAWIVLWWGLGREDAKIPMTAPTGHQLYDL 89

Query: 124 LWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY-STMCRTYSEERPD 182
           L  E+ KW   +P +              + ++V   +  ID  +    + RT  +++P+
Sbjct: 90  LMPEIRKWREKMPVQ--------------YQNEVEVKTEKIDFANGNFAVPRTARKDQPE 135

Query: 183 TFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN 242
              G H T  +A I DEASG P VI     G +T    +   IM +NP R  G FY+  +
Sbjct: 136 ALQGFHAT-NLAFIIDEASGIPQVIFEVAEGAMTGE--STLVIMAANPTRTEGYFYDSHH 192

Query: 243 KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           K    W+ FQ +    E +   + E    +YG DSDV RV + G+FP+Q  ++   L  +
Sbjct: 193 KNRWQWECFQFNAEESENVSKEWIEEKKRQYGEDSDVYRVRIKGEFPRQSSNAVFSLQEV 252

Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           ++A  RE   D  A +  G D+A+ G D +V+  R+G     +   S   L      +  
Sbjct: 253 DDATTREIVDDSGAEV-WGLDVADFGDDKSVLAKRKGKHFHEITARSGLTLPDLAGWLIY 311

Query: 363 LVEKY--RPDAIIIDANNTGARTCDYLEMLGYH-VYRVLGQKRAVDLEFCRNRRTELHVK 419
              +   +P  I +DA   G+         G   V  V G   A + E   N+R E +  
Sbjct: 312 EYNQAKRKPAVIFVDAIGIGSSLPAVCFEKGLDIVIGVKGSNSASNSEKYHNKRAEWYYN 371

Query: 420 MADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAI-ESK--RVKGAKSTDYSDGLMYT 475
           + D LE   + +   L+  L + + + + +TG++ + E K  + +  +S D +D    T
Sbjct: 372 LKDLLEDGKIPDDDELVGELMA-QKYQISSTGKIQLVEKKEIKKELGRSPDKADACALT 429


>gi|253578914|ref|ZP_04856185.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA]
 gi|251849857|gb|EES77816.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39BFAA]
          Length = 473

 Score =  375 bits (963), Expect = e-102,   Method: Composition-based stats.
 Identities = 109/451 (24%), Positives = 180/451 (39%), Gaps = 48/451 (10%)

Query: 49  SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108
             P  +  E +                   K +I +G+G+GKT L A + LW ++  P  
Sbjct: 4   DDPVMFFREVLNFEPDEWQAQAARDLAANPKVSIKSGQGVGKTGLEAAVFLWFVTCFPHP 63

Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168
            ++  A ++ QL   LW+E+SKW+S                   W    ++     + K 
Sbjct: 64  RIVATAPTKQQLHDVLWSEISKWMSKSELLSIL---------LKWTKTYVYMV--GEEKR 112

Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTS 228
           +  + RT +  +P+   G H    M  I DEASG  D I   ILG L+   AN   ++  
Sbjct: 113 WFGVARTAT--KPENMQGFHED-NMLFIVDEASGVADPIMEAILGTLS--GANNKLLLCG 167

Query: 229 NPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF 288
           NP + SG FY+   +    +K   + +      +    + ++ +YG DS+V RV V G+F
Sbjct: 168 NPTKTSGTFYDSHTRDRALYKCHTVSSMDSTRTNKENIDSLVRKYGWDSNVVRVRVRGEF 227

Query: 289 PQQDIDSFIPLNIIEEALNREPCPDPYAP---LIMGCDIAEEGGDNTVVVLRRGPVIEHL 345
           P Q+ D FIPL++IE+  ++    D       + +G D+A  G D T++        + +
Sbjct: 228 PNQEDDVFIPLSLIEQCSSKLLELDDADGMQFVSLGVDVARFGDDETIIYRNYHGHCKIV 287

Query: 346 FDWSKTDLRTTNNKISGLVEK-YRPD-------AIIIDANNTGARTCDYLEM-------L 390
            +    +L  T   I    +K YR          + ID    G    D L+         
Sbjct: 288 RNRRGQNLMATVGDIVQEFKKIYREHPTYESKVYVQIDDTGLGGGVTDRLKEVRKEQKLY 347

Query: 391 GYHVYRVLGQKR--------AVDLEFCRNRRTELHVKMADWL--EFASLINHSGLIQNLK 440
              V  +   ++            E   N  T +   M D L  +   + +    I  L 
Sbjct: 348 KMQVIPINAAEKIETDTAAGKDAAERYNNLTTAMWASMRDLLDNKQIVIEDDEQTIGQLS 407

Query: 441 SLKSFIVPNTGELAIESK---RVKGAKSTDY 468
           S K + + + G+L IE K   + +G  S D 
Sbjct: 408 SRK-YTMASNGKLEIEPKKEMKKRGLDSPDR 437


>gi|160940775|ref|ZP_02088117.1| hypothetical protein CLOBOL_05669 [Clostridium bolteae ATCC
           BAA-613]
 gi|158436295|gb|EDP14062.1| hypothetical protein CLOBOL_05669 [Clostridium bolteae ATCC
           BAA-613]
          Length = 484

 Score =  370 bits (951), Expect = e-100,   Method: Composition-based stats.
 Identities = 113/487 (23%), Positives = 185/487 (37%), Gaps = 70/487 (14%)

Query: 45  LEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMST 104
           L     P  +  + +          +     +    ++ +G GIGK+ + AW V+W M T
Sbjct: 8   LFYADNPIYFVEDVIRAKPDEKQRDILRSLRDYPMTSVRSGHGIGKSAVEAWSVIWYMCT 67

Query: 105 RPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI 164
           RP   + C A +E QL   LWAE+SKW+   P                W  + L+     
Sbjct: 68  RPFPKIPCTAPTEHQLMDVLWAEISKWMRNNPALRD---------DLIWTKEKLYMQ--G 116

Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFW 224
             + +  + RT +   P+   G H  + + II DEASG  D +   +LG +T  +A    
Sbjct: 117 HPEEWFAVPRTAT--NPEALQGFHAEHVLYII-DEASGVSDKVFEPVLGAMTGEDAK--L 171

Query: 225 IMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV 284
           +M  NP RL+G FY+  ++  + +    +D R  + +  +F + II  +G DSDV RV V
Sbjct: 172 LMMGNPTRLAGFFYDSHHRNREQYSAIHVDGRDSQHVSRTFVQKIIDMFGEDSDVFRVRV 231

Query: 285 CGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPV--- 341
            GQFP+   DS I +   EEA N +    P   + +G D+A  G D++ +          
Sbjct: 232 AGQFPKSTPDSLIAMEWCEEAANLQV-YAPGGQIDIGVDVARYGDDSSALYPLIDKKQSL 290

Query: 342 IEHLFDWSKTDLRT--TNNKISGLVEKYRPDAI--IIDANNTGARTCD------------ 385
              L+  ++T          I      Y   AI   +D +  G    D            
Sbjct: 291 PYELYHHNRTTEIAGYVVIMIKQFAMDYPDAAIRVKVDCDGLGVGVYDNLYDQRDQIIDA 350

Query: 386 ----YLEMLGYH-------------------VYRVLGQKRA-----VDLEFCRNRRTELH 417
                    G +                   +               D     N    + 
Sbjct: 351 IWYDRCRRAGINPEDGNQWNECQNVPKLDLEIIECHFGGSGGKVDDNDPVEYSNSTGLMW 410

Query: 418 VKMADWLEFASL--INHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGL 472
            K+  +L+   L   +   L+  L + + ++V   G+L +E K   + +G  S D +D L
Sbjct: 411 GKVRKYLQEGKLQLPDDDTLVSQLCNRR-YLVNKDGKLELERKESMKKRGLTSPDIADAL 469

Query: 473 MYTFAEN 479
                E 
Sbjct: 470 ALALYEP 476


>gi|302120432|gb|ADK92426.1| putative phage terminase large subunit [Candidatus Liberibacter
           asiaticus]
          Length = 255

 Score =  365 bits (936), Expect = 1e-98,   Method: Composition-based stats.
 Identities = 250/255 (98%), Positives = 254/255 (99%)

Query: 88  IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147
           IGKTTLNAWLVLWLMS RPG+S+ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS
Sbjct: 1   IGKTTLNAWLVLWLMSIRPGMSIICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 60

Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 207
           LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI
Sbjct: 61  LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 120

Query: 208 NLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHE 267
           NLGILGFLTE+NANRFWIMTSNPRRLSGKFYEIFN+PLDDWKRFQIDTRTVEGIDPSFHE
Sbjct: 121 NLGILGFLTEQNANRFWIMTSNPRRLSGKFYEIFNRPLDDWKRFQIDTRTVEGIDPSFHE 180

Query: 268 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE 327
           GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE
Sbjct: 181 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE 240

Query: 328 GGDNTVVVLRRGPVI 342
           GGDNTVVVLRRGPVI
Sbjct: 241 GGDNTVVVLRRGPVI 255


>gi|266623290|ref|ZP_06116225.1| putative terminase B protein [Clostridium hathewayi DSM 13479]
 gi|288864932|gb|EFC97230.1| putative terminase B protein [Clostridium hathewayi DSM 13479]
          Length = 484

 Score =  364 bits (935), Expect = 1e-98,   Method: Composition-based stats.
 Identities = 107/488 (21%), Positives = 191/488 (39%), Gaps = 72/488 (14%)

Query: 45  LEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMST 104
           L     P  +  + + V        +     +    ++ +G G+GK+ + +W V+W + T
Sbjct: 8   LFYADEPIYFVEDIIRVTPDQKQRDILRSLRDYPMTSVRSGHGVGKSAVESWSVIWFLCT 67

Query: 105 RPGISVICLANSETQLKTTLWAEVSKWLSLLPN-KHWFEMQSLSLHPAPWYSDVLHCSLG 163
           RP   + C A ++ QL   LWAE+SKWL   P  K+        ++   +          
Sbjct: 68  RPFPKIPCTAPTQHQLYDILWAEISKWLRNNPELKNDIIWTQQRVYMNGY---------- 117

Query: 164 IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRF 223
              + +  + RT +   P+   G H  + + II DEASG  D +   +LG +T  +A   
Sbjct: 118 --PEEWFAVPRTAT--NPEALQGFHAEHVLYII-DEASGVSDKVFEPVLGAMTGEDAK-- 170

Query: 224 WIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283
            +M  NP RLSG F++  +K   ++    ID R  + ++  F + II  +G+DSDV RV 
Sbjct: 171 LLMMGNPTRLSGFFFDSHHKSRSEYSAMHIDGRDSQHVNQKFVQKIINMFGMDSDVFRVR 230

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIE 343
           V GQFP+   DS I ++  E A   +P       + +G D+A  G D++ +      V  
Sbjct: 231 VAGQFPKSTPDSLIMMDWCEAATQLKPETVRN-RVDIGVDVARYGDDSSALYPVIDKVQS 289

Query: 344 HLFD-WSKTDLRTTNNKISGLVEKYRPD------AIIIDANNTGARTCDYLE-------- 388
             ++ +        +  +  ++++Y  +       + +D +  G    D L         
Sbjct: 290 DGYELYHHNRTTEISGYVVQMIKRYAVECLDAVIRVKVDCDGLGVGVYDNLYDLTDQIID 349

Query: 389 ---------------------------MLGYHVYRVLGQKRA-----VDLEFCRNRRTEL 416
                                       L   +               D     N    +
Sbjct: 350 EVWRDRCRREGLDPDNGNQWNECQRIPQLDLEIVECHFGAAGGKIDEDDPVEYSNSTGLM 409

Query: 417 HVKMADWLEFAS--LINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDG 471
             K+   L+  +  + +   LI  L + + +IV   G+L +E K   + +G  S D +D 
Sbjct: 410 WGKIRKLLQTGALQIPDDDALISQLSNRR-YIVNKDGKLELERKEAMKKRGLPSPDIADA 468

Query: 472 LMYTFAEN 479
           L     + 
Sbjct: 469 LALALYDP 476


>gi|153810665|ref|ZP_01963333.1| hypothetical protein RUMOBE_01049 [Ruminococcus obeum ATCC 29174]
 gi|149833061|gb|EDM88143.1| hypothetical protein RUMOBE_01049 [Ruminococcus obeum ATCC 29174]
          Length = 469

 Score =  347 bits (889), Expect = 3e-93,   Method: Composition-based stats.
 Identities = 123/465 (26%), Positives = 195/465 (41%), Gaps = 53/465 (11%)

Query: 45  LEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMST 104
           L   + P  +  + ++         +     E    ++ +G GIGK+ + AW V+W M T
Sbjct: 8   LYYANHPVEFVQDILKADPDPEQKKILRSLVENQMTSVRSGHGIGKSAVEAWSVIWFMCT 67

Query: 105 RPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI 164
            P   + C A ++ QL   LWAE+SKW                     W  + L+     
Sbjct: 68  HPYPKIPCTAPTQHQLFDILWAEISKWKRN---------NKTLDSELIWTKEKLYM--KG 116

Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFW 224
            ++ +  + RT S   PD   G H  + M  I DEASG  D I   +LG L+   A    
Sbjct: 117 HAEEWFAVARTAST--PDALQGFHAEH-MLYIIDEASGVEDKIFEPVLGALSTPGAK--L 171

Query: 225 IMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV 284
           +M  NP +LSG FY+  NK  + +  F ID R    +   F + II  YG DSDV RV V
Sbjct: 172 LMCGNPTQLSGFFYDSHNKNREQYSTFHIDGRNSTRVSQEFVQTIINMYGEDSDVFRVRV 231

Query: 285 CGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI-MGCDIAEEGGDNTVVVLRRGPVIE 343
            G FP  + D +IPL ++E+++  E  P  +  +I +GCD+A  G D TV+  R    ++
Sbjct: 232 AGDFPLAEDDIYIPLPLVEKSIATEYFPRRHPQIIHIGCDVARFGTDKTVIGYRTDEKVQ 291

Query: 344 HLFDWSKTDLRTTNNKISG----LVEKYR-------PDAIIIDANNTGARTCDYLEMLGY 392
                   D   T + I      LV +Y        P  I ID    G    D L  +  
Sbjct: 292 FFKKRVGQDTMKTADDIVSLGMLLVYQYGLKPDIDEPIPIKIDDGGVGGGVVDRLRQIKR 351

Query: 393 H---------VYRVLGQKRAVDLEFCRNRRTELHVKMADWLE-----------FASLINH 432
           +         VY V   ++ +  +F  +  T +   +   L+              L + 
Sbjct: 352 NNPERFWWMEVYPVKFGQK-IRHKFFDDSTTYMMSVLKKLLQPFDDNGLPKDVEIILPDD 410

Query: 433 SGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMY 474
             L+  +   K + +    ++ +ESK   + +G +S D +D ++ 
Sbjct: 411 DALVAQISGRK-YEMTENSKIRVESKKVMKARGVQSPDEADCILL 454


>gi|148653111|ref|YP_001280204.1| hypothetical protein PsycPRwf_1309 [Psychrobacter sp. PRwf-1]
 gi|148572195|gb|ABQ94254.1| hypothetical protein PsycPRwf_1309 [Psychrobacter sp. PRwf-1]
          Length = 520

 Score =  340 bits (871), Expect = 4e-91,   Method: Composition-based stats.
 Identities = 91/490 (18%), Positives = 181/490 (36%), Gaps = 73/490 (14%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
           +WQ E +            +      + ++++G G GK+     + LW +   P   ++ 
Sbjct: 41  TWQQELL----------FKSIVVPGSRTSVASGHGTGKSRSAGIIALWHLLFYPESVMLF 90

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHC-----SLGIDSK 167
            A    QL+T +W E++  L  L N               W +D +        +     
Sbjct: 91  TAPQIGQLRTVVWKEINICLQRLRNNK----------ALGWLADYVVVLAEKIYIKGFKD 140

Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMT 227
            +    +T  + +P    G H  + M +  DEA G  D +    +G LT  N     ++T
Sbjct: 141 TWFVFAKTAPKHQPTNIAGQHGDHYM-VWADEACGIDDAVMEVAIGALTHENNRA--VLT 197

Query: 228 SNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTRV 282
           S P + +G FY+  +K        W   + +      +        + +YG  +S    +
Sbjct: 198 SQPAKNTGFFYDTHHKLSHHNGGKWTALEFNGEMSPIVSKDKLIEALYQYGSRNSPGYLI 257

Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY---APLIMGCDIAEE-GGDNTVVV--- 335
            + G+FP+     ++      E + ++PC         +I+  D+  + G D++V+    
Sbjct: 258 RIRGKFPELK-GEYLLTRTDYENMKQQPCVIEEGDKWGIIVAVDVGGDVGRDSSVISVMQ 316

Query: 336 ----LRRGPVIEHLFDW------SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385
               + +G +  H+         ++ ++ T   KI+ ++  Y    ++ID    G     
Sbjct: 317 VVDKMIKGRIERHVHLLDIPLFSNRANINTLKAKINDVMSDYPGATLVIDPLGAGMGLTQ 376

Query: 386 YLEMLGYHVYRVLGQKRAVD---LEFCRNRRTELHVKMADWLEFA---SLINHSGLIQNL 439
            L+  G +   V       +     +  N+R+  +V MA  +E            + Q +
Sbjct: 377 SLKADGVYFDEVHWGSPCFNNTLKRYYMNKRSHAYVSMAKAVEKGYFSVSDKVKKMYQVM 436

Query: 440 KSLKS------FIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSDMDFGRC 490
            +L+       +         + SK+    KG KS D +D + + F EN           
Sbjct: 437 TNLEEQMTRLPYYFDEKARWCMMSKKDMLKKGIKSPDIADTIAFGFMEN-------ISYA 489

Query: 491 PSYQYEGVDL 500
           P   YE +++
Sbjct: 490 PVESYEDLNI 499


>gi|332974843|gb|EGK11758.1| hypothetical protein HMPREF9373_1714 [Psychrobacter sp. 1501(2011)]
          Length = 520

 Score =  337 bits (865), Expect = 2e-90,   Method: Composition-based stats.
 Identities = 90/490 (18%), Positives = 180/490 (36%), Gaps = 73/490 (14%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
           +WQ E +            +      + ++++G G GK+     + LW +   P   ++ 
Sbjct: 41  TWQQELL----------FKSIVVPGSRTSVASGHGTGKSRSAGIIALWHLLFYPESVMLF 90

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHC-----SLGIDSK 167
            A    QL+T +W E++  L  L N               W +D +        +     
Sbjct: 91  TAPQIGQLRTVVWKEINICLQRLRNNK----------ALGWLADYVVVLAEKIYIKGFKD 140

Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMT 227
            +    +T  + +P    G H  + M +  DEA G  D +    +G LT  N     ++T
Sbjct: 141 TWFVFAKTAPKHQPTNIAGQHGDHYM-VWADEACGIDDAVMEVAIGALTHENNRA--VLT 197

Query: 228 SNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTRV 282
           S P + +G FY+  +K        W   + +      +        + +YG  +S    +
Sbjct: 198 SQPAKNTGFFYDTHHKLSHYNGGKWIALEFNGEMSPIVSKEKLIEALYQYGSRNSPGYLI 257

Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY---APLIMGCDIAEE-GGDNTVVV--- 335
            + G+FP+     ++      E +   PC         +I+  D+  + G D++V+    
Sbjct: 258 RIRGKFPELK-GEYLLTRTDYENMKAHPCVIKEGDKWGIIVTVDVGGDVGRDSSVISVLQ 316

Query: 336 ----LRRGPVIEHLFDW------SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385
               + +G +  H+         ++ ++ T   KI+ ++  Y    ++ID    G     
Sbjct: 317 VVDKMVKGRIERHVHLLDIPLFSNRANINTLKAKINDVMSDYPGATLVIDPLGAGMGLTQ 376

Query: 386 YLEMLGYHVYRVLGQKRAVD---LEFCRNRRTELHVKMADWLEFA---SLINHSGLIQNL 439
            ++  G +   V       +     +  N+R+  +V MA  +E            + Q +
Sbjct: 377 SVKADGVYFDEVHWGSPCFNNTLKRYYMNKRSHAYVSMAKAVEKGYFSVSDKIKKMYQVI 436

Query: 440 KSLKS------FIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSDMDFGRC 490
            +L+       +         + SK+    KG KS D +D + + F EN           
Sbjct: 437 TNLEEQMTRLPYYFDEKARWCMMSKKDMLKKGIKSPDIADTIAFGFMEN-------ISYA 489

Query: 491 PSYQYEGVDL 500
           P+  YE +++
Sbjct: 490 PAESYEDLNI 499


>gi|83593922|ref|YP_427674.1| hypothetical protein Rru_A2590 [Rhodospirillum rubrum ATCC 11170]
 gi|83576836|gb|ABC23387.1| hypothetical protein Rru_A2590 [Rhodospirillum rubrum ATCC 11170]
          Length = 505

 Score =  332 bits (850), Expect = 1e-88,   Method: Composition-based stats.
 Identities = 111/463 (23%), Positives = 176/463 (38%), Gaps = 62/463 (13%)

Query: 75  PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSL 134
           P   K  + AG G+GKTT  A  + W +         C A + +QL+  LW+E+++    
Sbjct: 34  PAGAKVTVRAGHGVGKTTATAAAIWWHLECFDYSKTPCTAPTASQLEQILWSELARLRRR 93

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLH-CSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY-- 191
              +        +L     ++      +     + +  + RT   ++PD   G H +   
Sbjct: 94  ADARAQGTGLPAALRLEALFAVSGRAIADRGTPREWFVVARTARRDQPDALQGFHASDID 153

Query: 192 ----------------GMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
                            +  + +EASG PD +     G L+   A    +M  NP R +G
Sbjct: 154 LEAGAGPRLSAKSGGAALMFVIEEASGVPDAVFEVAEGALSSPGAR--LLMVGNPTRNTG 211

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDS 295
            F     +    +   ++       +DP +  G++ +YG +S+V RV   G FP+QD D 
Sbjct: 212 FFARSHKRDRASFTALRLRCADSPLVDPGYRAGLVRKYGAESNVVRVRADGAFPRQDDDV 271

Query: 296 FIPLNIIEEALNREPCPDPYAP---LIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTD 352
            I L   E AL R P P   A      +G D+A  G D TV +LR GPV+  +   +  D
Sbjct: 272 LIALETAEAALAR-PLPARMATEDERRLGVDVARFGDDRTVFLLRIGPVVGAIEVTAGRD 330

Query: 353 LRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG----YHVYRVLGQKRAVDLEF 408
                 +   L E +R   I +D    GA   D L   G             +RA   E 
Sbjct: 331 TMAVAGRARRLAEIWRAGRIYVDEIGVGAGVVDRLREDGAPVVAVNVAASAPERAAGEER 390

Query: 409 CRNRRTELHVKMADWLE------------------------FASLI---NHSGLIQNLK- 440
            R  R  L + +  WL                           S +     + L Q+L  
Sbjct: 391 GRLLRDHLWLMVRGWLRDEAPVFAGPGGGPASGSAAGLLSGMGSCLVPGVDADLAQDLAG 450

Query: 441 --SLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAE 478
             +   +    +G + +ESK   + +G +S D +D L  TF E
Sbjct: 451 ELATPRYAFDGSGRVVVESKDAMKRRGLRSPDLADALALTFHE 493


>gi|269119479|ref|YP_003307656.1| hypothetical protein Sterm_0853 [Sebaldella termitidis ATCC 33386]
 gi|268613357|gb|ACZ07725.1| hypothetical protein Sterm_0853 [Sebaldella termitidis ATCC 33386]
          Length = 499

 Score =  323 bits (829), Expect = 3e-86,   Method: Composition-based stats.
 Identities = 99/472 (20%), Positives = 177/472 (37%), Gaps = 53/472 (11%)

Query: 58  FMEVVDAHCLNSVNNPNPEVF----KGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICL 113
           F ++++ H L+       + F    + ++ AG   GK++L   L  + + TRP   VI  
Sbjct: 22  FKDILNFHFLSEDQTRVLQAFNEYRRLSVPAGHSTGKSSLAGGLTTYWLITRPKSRVIVT 81

Query: 114 ANSETQLKTTLWAEVSKWLSLLP---------------------NKHWFEMQSLSLHPAP 152
           A +  QLKT  WAEV+K  +                         + WF +   +  P  
Sbjct: 82  APTYRQLKTIYWAEVNKIYNRSKLKQLNLFEINDKIMRINDKDLKREWFALPVTASTPEG 141

Query: 153 WYS---------DVLHCSLGI----DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
                       + +   LGI    D +    + +    E+    +   +   + ++ DE
Sbjct: 142 MQGQHGDKTEVIEQIMKHLGIEEIGDDETIEIVSQILRGEKQIEGLTKEDKEKLLVMVDE 201

Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE 259
           +SG  + I   + G  T+ +     ++  N  + +G FYE    P   + +  + +    
Sbjct: 202 SSGVKNEIFEVLEG--TDYD---KLVLFGNMTKNTGYFYESVYNPKSKFYKVTMSSYNSP 256

Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI 319
            +       +   YG DS+V RV + G+ P  + +S    N I+ A  R      Y  + 
Sbjct: 257 FMKKEQIHDLEETYGPDSNVVRVRLKGEAPDGNENSIFSSNKIDSAFQRSLSLSEYETIK 316

Query: 320 MGCDIA-EEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII--IDA 376
           +G D+    GGD++ +  ++   +    D     L     +I     K R   II  ID 
Sbjct: 317 LGVDVGKGSGGDSSTIYEKKDNRVRKKLDRKDFTLPDVKREIIQYCYKNRDKLIIANIDG 376

Query: 377 NNTGARTCDYLEM---LGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHS 433
              G      LE        V  +    +A + +   N+RTE++ +++  L+   L    
Sbjct: 377 TGLGTGLVQELEEGEIENLVVNDIQFAGKAKNKKEFNNKRTEMYFELSRNLDKLDLEEDQ 436

Query: 434 GLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPR 482
            L + L  ++ +   N G   + SK   +     S D SD L     E   R
Sbjct: 437 ELKRELL-IQIYEFDNNGRFKLISKDKIKEMLGHSPDKSDALALCNYEAETR 487


>gi|312964323|ref|ZP_07778627.1| terminase B protein [Escherichia coli 2362-75]
 gi|331655801|ref|ZP_08356790.1| terminase B protein (PACase B protein) (DNA packaging B protein)
           [Escherichia coli M718]
 gi|312291036|gb|EFR18910.1| terminase B protein [Escherichia coli 2362-75]
 gi|323186470|gb|EFZ71817.1| terminase B protein [Escherichia coli 1357]
 gi|323969205|gb|EGB64507.1| terminase B protein [Escherichia coli TA007]
 gi|325495624|gb|EGC93488.1| DNA pacase B subunit [Escherichia fergusonii ECD227]
 gi|331046575|gb|EGI18664.1| terminase B protein (PACase B protein) (DNA packaging B protein)
           [Escherichia coli M718]
          Length = 494

 Score =  315 bits (807), Expect = 1e-83,   Method: Composition-based stats.
 Identities = 92/483 (19%), Positives = 182/483 (37%), Gaps = 54/483 (11%)

Query: 32  VLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91
            L+ + W      L  F    +WQ + +          + +   +  K ++S+G G GK+
Sbjct: 16  ALYRYDWIAAADVL--FGKTPTWQQDLI----------IESVQEQGSKTSVSSGHGTGKS 63

Query: 92  TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS-KWLSLLPNKHWFEMQSLSLHP 150
            + + +++  +   PG   I +AN   Q+ T ++  +   W +      W       L  
Sbjct: 64  DMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSRFPWLA-DYFVLTE 122

Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG 210
             +Y              ++ + + +     +   G H  + + II DEASG  D     
Sbjct: 123 TAFYEVTGKGV-------WTVVPKGFRLGSEEALAGEHADHLLYII-DEASGVSDRAFGI 174

Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDP 263
           I G LT ++     +  S P R SG FY+  +K       P   +    +++     + P
Sbjct: 175 ITGALTGQDNRILLL--SQPTRPSGYFYDTHHKLAKRPGNPDGVYTAITLNSEESPLVTP 232

Query: 264 SFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGC 322
           +F +  +A Y G D+ +  ++V G FP+      +  + +E A  R+         +   
Sbjct: 233 AFIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRKVKIAKGWGWLACV 292

Query: 323 DIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISGLV--EKYRPDA 371
           D+A   G D +V+ +        +R  +   + +++         KI      E++    
Sbjct: 293 DVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKIFAECNPERFPNIT 352

Query: 372 IIIDANNTGARTCDYL-EMLGYHVYRVLGQKR---AVDLEFCRNRRTELHVKMADWLEFA 427
           I ID +  G  T D + E  G  V R+   K+     D     ++R   +V+ A+ ++  
Sbjct: 353 IAIDGDGLGKATADLMYEYYGITVQRIRWGKKMHSREDKSLYFDKRAYANVQAAEAVKSG 412

Query: 428 --SLINHSGLIQNLKSLKSFIVPNTGELAIES----KRVKGAKSTDYSDGLMYTFAENPP 481
              L   +  I+    +    + + G+  + S    K+     S D+ D   +    +  
Sbjct: 413 RMRLDKGNETIEEASKIPV-GINSAGQWKVMSKEDMKKKLNLHSPDHWDTYCFAMLADYV 471

Query: 482 RSD 484
             D
Sbjct: 472 PQD 474


>gi|324111095|gb|EGC05081.1| terminase B protein [Escherichia fergusonii B253]
          Length = 494

 Score =  315 bits (806), Expect = 1e-83,   Method: Composition-based stats.
 Identities = 92/483 (19%), Positives = 182/483 (37%), Gaps = 54/483 (11%)

Query: 32  VLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91
            L+ + W      L  F    +WQ + +          + +   +  K ++S+G G GK+
Sbjct: 16  ALYRYDWIAAADVL--FGKTPTWQQDLI----------IESVQEQGSKTSVSSGHGTGKS 63

Query: 92  TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS-KWLSLLPNKHWFEMQSLSLHP 150
            + + +++  +   PG   I +AN   Q+ T ++  +   W +      W       L  
Sbjct: 64  DMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSRFPWLA-DYFVLTE 122

Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG 210
             +Y              ++ + + +     +   G H  + + II DEASG  D     
Sbjct: 123 TAFYEVTGKGV-------WTVVPKGFRLGSEEALAGEHADHLLYII-DEASGVSDRAFGI 174

Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDP 263
           I G LT ++     +  S P R SG FY+  +K       P   +    +++     + P
Sbjct: 175 ITGALTGQDNRILLL--SQPTRPSGYFYDTHHKLAKRPGNPDGVYTAITLNSEESPLVTP 232

Query: 264 SFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGC 322
           +F +  +A Y G D+ +  ++V G FP+      +  + +E A  R+         +   
Sbjct: 233 AFIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRKVKIAKGWGWLACV 292

Query: 323 DIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISGLV--EKYRPDA 371
           D+A   G D +V+ +        +R  +   + +++         KI      E++    
Sbjct: 293 DVAGGTGRDKSVINIMMVSGQRNKRRVINYRILEYTDVTETQLAAKIFAECNPERFPNIT 352

Query: 372 IIIDANNTGARTCDYL-EMLGYHVYRVLGQKR---AVDLEFCRNRRTELHVKMADWLEFA 427
           I ID +  G  T D + E  G  V R+   K+     D     ++R   +V+ A+ ++  
Sbjct: 353 IAIDGDGLGKATADLMYEYYGITVQRIRWGKKMHSREDKSLYFDKRAYANVQAAEAVKSG 412

Query: 428 --SLINHSGLIQNLKSLKSFIVPNTGELAIES----KRVKGAKSTDYSDGLMYTFAENPP 481
              L   +  I+    +    + + G+  + S    K+     S D+ D   +    +  
Sbjct: 413 RMRLDKGNETIEEASKIPV-GINSAGQWKVMSKEDMKKKLNLHSPDHWDTYCFAMLADYV 471

Query: 482 RSD 484
             D
Sbjct: 472 PQD 474


>gi|56266643|gb|AAV84926.1| DNA pacase B subunit [Enterobacteria phage phiW39]
          Length = 494

 Score =  314 bits (805), Expect = 2e-83,   Method: Composition-based stats.
 Identities = 92/483 (19%), Positives = 182/483 (37%), Gaps = 54/483 (11%)

Query: 32  VLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91
            L+ + W      L  F    +WQ + +          + +   +  K ++S+G G GK+
Sbjct: 16  ALYRYDWIAAADVL--FGKTPTWQQDLI----------IESVQEQGSKTSVSSGHGTGKS 63

Query: 92  TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS-KWLSLLPNKHWFEMQSLSLHP 150
            + + +++  +   PG   I +AN   Q+ T ++  +   W +      W       L  
Sbjct: 64  DMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSRFPWLA-DYFVLTE 122

Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG 210
             +Y              ++ + + +     +   G H  + + II DEASG  D     
Sbjct: 123 TAFYEITGKGV-------WTVVPKGFRLGSEEALAGEHADHLLYII-DEASGVSDRAFGI 174

Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDP 263
           I G LT ++     +  S P R SG FY+  +K       P   +    +++     + P
Sbjct: 175 ITGALTGQDNRILLL--SQPTRPSGYFYDTHHKLAKRPGNPDGVYTAITLNSEESPLVTP 232

Query: 264 SFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGC 322
           +F +  +A Y G D+ +  ++V G FP+      +  + +E A  R+         +   
Sbjct: 233 AFIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRKVKIAKGWGWLACV 292

Query: 323 DIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISGLV--EKYRPDA 371
           D+A   G D +V+ +        +R  +   + +++         KI      E++    
Sbjct: 293 DVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKIFAECNPERFPNIT 352

Query: 372 IIIDANNTGARTCDYL-EMLGYHVYRVLGQKR---AVDLEFCRNRRTELHVKMADWLEFA 427
           I ID +  G  T D + E  G  V R+   K+     D     ++R   +V+ A+ ++  
Sbjct: 353 IAIDGDGLGKATADLMYEYYGITVQRIRWGKKMHSREDKSLYFDKRAYANVQAAEAVKSG 412

Query: 428 --SLINHSGLIQNLKSLKSFIVPNTGELAIES----KRVKGAKSTDYSDGLMYTFAENPP 481
              L   +  I+    +    + + G+  + S    K+     S D+ D   +    +  
Sbjct: 413 RMRLDKGNETIEEASKIPV-GINSAGQWKVMSKEDMKKKLNLHSPDHWDTYCFAMLADYV 471

Query: 482 RSD 484
             D
Sbjct: 472 PQD 474


>gi|168467778|ref|ZP_02701615.1| DNA pacase B subunit [Salmonella enterica subsp. enterica serovar
           Newport str. SL317]
 gi|195629119|gb|EDX48493.1| DNA pacase B subunit [Salmonella enterica subsp. enterica serovar
           Newport str. SL317]
          Length = 494

 Score =  313 bits (803), Expect = 4e-83,   Method: Composition-based stats.
 Identities = 94/482 (19%), Positives = 181/482 (37%), Gaps = 52/482 (10%)

Query: 32  VLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91
             + + W      +  F    +WQ +         + SV  P     K ++S+G G GK+
Sbjct: 16  AQYRYDWIAAADVM--FGKTPTWQQD-------QIIESVQEPGS---KTSVSSGHGTGKS 63

Query: 92  TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS-KWLSLLPNKHWFEMQSLSLHP 150
            + + +++  +   PG   I +AN   Q+ T ++  +   W +      W   +   L  
Sbjct: 64  DMTSIMIMLFIIMFPGARAIIVANKIQQVMTGIFKYLKINWSTATSRFPWLA-EYFVLTD 122

Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG 210
             +Y              ++ + + +     +   G H  + + II DEASG  D     
Sbjct: 123 TSFYEITSKGV-------WTVVPKGFRLGNEEALAGEHADHLLYII-DEASGVSDKAFGI 174

Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDP 263
           + G LT ++     +  S P R SG FY+  +K       P   +    +++     + P
Sbjct: 175 MTGALTGKDNRILLL--SQPTRPSGYFYDTHHKLAKRPGNPNGIYTAITLNSEESPLVTP 232

Query: 264 SFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGC 322
            F +  +A Y G DS +  ++V G FP+      +  + +E A  R+         I   
Sbjct: 233 EFIKMKLAEYGGRDSPMYLIKVRGLFPKTQDGFLLGRDEVERASRRKVKIAKGWGWIACV 292

Query: 323 DIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISGLV--EKYRPDA 371
           D+A   G D +V+ +        +R  +   + ++S         KI+     ++Y    
Sbjct: 293 DVAGGTGRDKSVINIMMVSGERNKRRIIGYRIIEYSDVTETQLAAKINAECSPDRYPNIT 352

Query: 372 IIIDANNTGARTCDYL-EMLGYHVYRVLGQKR---AVDLEFCRNRRTELHVKMADWLEFA 427
           I+ID +  G  T D L +  G    R+   K+     D     ++R   +V+ A+ ++  
Sbjct: 353 IVIDGDGLGKSTADLLYDNYGITAQRIRWGKKMHSREDRSLYFDQRAYANVQAAEAVKSG 412

Query: 428 SLINHSGLIQ-NLKSLKSFIVPNTGELAIES----KRVKGAKSTDYSDGLMYTFAENPPR 482
            +    G       S     + + G+  + S    K+    +S D+ D   +    N   
Sbjct: 413 RMRLDKGDATIEEASKIPVGINSAGQWKVMSKEDMKKKLNLRSPDHWDTYCFGMLANYVP 472

Query: 483 SD 484
            +
Sbjct: 473 QN 474


>gi|56266666|gb|AAV84947.1| DNA pacase B subunit [Enterobacteria phage D6]
          Length = 502

 Score =  309 bits (792), Expect = 5e-82,   Method: Composition-based stats.
 Identities = 101/444 (22%), Positives = 176/444 (39%), Gaps = 38/444 (8%)

Query: 72  NPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKW 131
           +      +  +++G G GK++L A L+L  M   P   VI +AN   Q+KT ++  V ++
Sbjct: 49  SVQETGSRTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQY 108

Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191
            +    +H +      L    +Y              +  +C+ Y     +   G H  +
Sbjct: 109 WANAVKRHGWLQTYFVLSDTMFYERSRKGI-------WEVLCKGYRLGNEEALAGEHAAH 161

Query: 192 GMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------P 244
            + +I DEASG  D     + G LTE +     +M S P R SG FY+  +        P
Sbjct: 162 -LLLILDEASGISDKAIGVMTGALTEEDNR--MLMLSQPTRPSGYFYDSHHSQAKTPDNP 218

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303
              W    +++     + P F +  +  Y G DS    V+V GQFP++     +  +  +
Sbjct: 219 KGIWTAIVLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECD 278

Query: 304 EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL--------RRGPVIEHLFDWSKT-DLR 354
            A  R+   +     +   D+   G D +V+ +        +R  V   + + S T D  
Sbjct: 279 RAARRKVLLEKNWGWVATADVG-NGRDKSVLNICKVSGHRDKRRVVNFKVMEMSGTMDPL 337

Query: 355 TTNNKISGLV--EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKR---AVDLEFC 409
              + I      EKY    I +DA+  G+ TC  L   G +  R+   K      D E  
Sbjct: 338 AFADFIYNECTPEKYPNITIAVDADGFGSDTCAQLVRRGANPVRIRWGKPMFANKDRERF 397

Query: 410 RNRRTELHVKMADWLEFASLINHSGLIQNLKSLK-SFIVPNTGELAIESKR----VKGAK 464
            N+R   ++   D ++   +   S      ++ K  F++   G++A+  K         K
Sbjct: 398 VNQRAYANIMARDAIKSGRMRIDSDPKTAEQASKIPFLLNEEGKMAMMRKEHMRQKLNIK 457

Query: 465 STDYSDGLMYTFAENPPRSDMDFG 488
           S D  D   +T   +   ++ D G
Sbjct: 458 SPDRWDTYCFTMLVDYVPANEDIG 481


>gi|323179619|gb|EFZ65182.1| terminase B protein [Escherichia coli 1180]
          Length = 453

 Score =  309 bits (791), Expect = 8e-82,   Method: Composition-based stats.
 Identities = 100/442 (22%), Positives = 174/442 (39%), Gaps = 38/442 (8%)

Query: 74  NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133
                +  +++G G GK++L A L+L  M   P   VI +AN   Q+KT ++  V ++ +
Sbjct: 2   QETGSRTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQYWA 61

Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193
               +H +      L    +Y              +  +C+ Y     +   G H  + +
Sbjct: 62  NAVKRHGWLQTYFVLSDTMFYERSRKGI-------WEVLCKGYRLGNEEALAGEHAAH-L 113

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLD 246
            +I DEASG  D     + G LTE +     +M S P R SG FY+  +        P  
Sbjct: 114 LLILDEASGISDKAIGVMTGALTEEDNR--MLMLSQPTRPSGYFYDSHHSQAKTPDNPKG 171

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305
            W    +++     + P F +  +  Y G DS    V+V GQFP++     +  +  + A
Sbjct: 172 IWTAIVLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECDRA 231

Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVL--------RRGPVIEHLFDWSKT-DLRTT 356
             R+   +     +   D+   G D +V+ +        +R  V   + +   T D    
Sbjct: 232 ARRKVLLEKNWGWVATADVG-NGRDKSVLNICKVSGHRDKRRVVNFKVMEMPGTMDPLAF 290

Query: 357 NNKISGLV--EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKR---AVDLEFCRN 411
            + I      EKY    I +DA+  G+ TC  L   G +  R+   K      D E   N
Sbjct: 291 ADFIYNECTPEKYPNITIAVDADGFGSDTCAQLVRRGANPVRIRWGKPMFANKDRERFVN 350

Query: 412 RRTELHVKMADWLEFASLINHSGLIQNLKSLK-SFIVPNTGELAIESKR----VKGAKST 466
           +R   ++   D ++   +   S      ++ K  F++   G++A+  K         KS 
Sbjct: 351 QRAYANIMARDAIKSGRMRIDSDPKTAEQASKIPFLLNEEGKMAMMRKEHMRQKLNIKSP 410

Query: 467 DYSDGLMYTFAENPPRSDMDFG 488
           D  D   +T   +   ++ D G
Sbjct: 411 DRWDTYCFTMLVDYVPANEDIG 432


>gi|304399103|ref|ZP_07380971.1| DNA packaging protein [Pantoea sp. aB]
 gi|304353343|gb|EFM17722.1| DNA packaging protein [Pantoea sp. aB]
          Length = 503

 Score =  308 bits (790), Expect = 1e-81,   Method: Composition-based stats.
 Identities = 107/503 (21%), Positives = 189/503 (37%), Gaps = 57/503 (11%)

Query: 34  HFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTL 93
           + + W      +E F    +WQ E         +NSV     +     +++G G GK++L
Sbjct: 23  YRYNWALA--VVELFGMIPTWQQE-------EIMNSVQETGSQ---TTVTSGHGTGKSSL 70

Query: 94  NAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPW 153
            A ++L  M   P   VI +AN   Q+KT ++  V  + +    +H +     +L    +
Sbjct: 71  TAMMLLIYMIMYPDARVIIVANKIGQVKTGVFKYVKTYWANAARRHPWLQNYFTLTDTMF 130

Query: 154 YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILG 213
           Y              +  +C+ Y     +   G H  + + I+ DEASG  D     + G
Sbjct: 131 YEKSRKGI-------WEVLCKGYRLGNEEALAGEHAAHILLIL-DEASGISDKAIAIMRG 182

Query: 214 FLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDPSFH 266
            LTE +     +M S P R SG FY+  +        P   W    +++     +   F 
Sbjct: 183 ALTEEDNR--MLMMSQPTRPSGYFYDSHHSLARHPDNPNGFWNAIVLNSEEAPHVTLKFI 240

Query: 267 EGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIA 325
              +  Y G DS    V+V G+FP+      +  +  + A  R+   +     +   D+ 
Sbjct: 241 REKLVEYGGRDSLEYMVKVLGRFPRNVSGYLLGRDECDRAARRKVYLEKGWGWVATADVG 300

Query: 326 EEGGDNTVVVLR--------RGPVIEHLFDWSKT-DLRTTNNKISGLV--EKYRPDAIII 374
             G D +++ +         R  V   L +   T D  +  + I+     E+Y    I +
Sbjct: 301 -NGRDKSILNICKVSGYGDARRVVSFKLLEMPGTMDPISFGDYIANECTQERYPGITIAV 359

Query: 375 DANNTGARTCDYLEMLGYHVYRVLGQKRAVDL---EFCRNRRTELHVKMADWLEFASL-I 430
           D +  G+ T   LE  G +   +   +        E  +N+R   ++  AD +    + I
Sbjct: 360 DGDGVGSGTLKQLERRGVNAISIRWGQPPFSKKVRERFKNQRAWSNIMAADAIRSGRMRI 419

Query: 431 NHSGLIQNLKSLKSFIVPNTGELAIESK----RVKGAKSTDYSDGLMYTF------AENP 480
           + S       S   + +   G + +  K    +    KS D  D   + F      AE  
Sbjct: 420 DMSQHTAEQASKIPYFMDEMGRIMMVPKPQMRQKLNIKSPDRWDTYCFIFLIGYRPAEAE 479

Query: 481 PRSDM-DFGRCPSYQYEGVDLLI 502
              DM DF +    +   +D L+
Sbjct: 480 LSEDMADFTQSKLDELSELDALL 502


>gi|323948959|gb|EGB44853.1| terminase B protein [Escherichia coli H252]
          Length = 502

 Score =  307 bits (787), Expect = 3e-81,   Method: Composition-based stats.
 Identities = 99/444 (22%), Positives = 175/444 (39%), Gaps = 38/444 (8%)

Query: 72  NPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKW 131
           +      +  +++G G GK++L A L+L  M   P   VI +AN   Q+KT ++  V ++
Sbjct: 49  SVQETGSRTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQY 108

Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191
            +    +H +      L    +Y              +  +C+ Y     +   G H  +
Sbjct: 109 WANAVKRHGWLQTYFVLSDTMFYERSRKGI-------WEVLCKGYRLGNEEALAGEHAAH 161

Query: 192 GMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------P 244
            + +I DEASG  D     + G LTE +     +M S P R SG FY+  +        P
Sbjct: 162 -LLLILDEASGISDKAIGVMTGALTEEDNR--MLMLSQPTRPSGYFYDSHHSRAKTPDNP 218

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303
              W    +++     + P F +  +  Y G DS    V+V GQFP++     +  +  +
Sbjct: 219 KGIWTAIVLNSEESPFVTPQFIKEKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECD 278

Query: 304 EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL--------RRGPVIEHLFDWSKT-DLR 354
            +  R+   +     +   D+   G D +V+ +        +R  V   + +   T D  
Sbjct: 279 RSARRKVLLEKNWGWVATADVG-NGRDKSVLNICKVSGHRDKRRVVNFKVMEMPGTMDPL 337

Query: 355 TTNNKISGLV--EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKR---AVDLEFC 409
              + I      EKY    I +DA+  G+ TC  L   G +  R+   K      D E  
Sbjct: 338 AFADFIYNECTPEKYPNITIAVDADGFGSDTCAQLVRRGANPVRIRWGKPMFANKDRERF 397

Query: 410 RNRRTELHVKMADWLEFASLINHSGLIQNLKSLK-SFIVPNTGELAIESKR----VKGAK 464
            N+R   ++   D ++   +   S      ++ K  F++   G++A+  K         K
Sbjct: 398 VNQRAYANIMARDAIKSGRMRIDSDPKTAEQASKIPFLLNEEGKMAMMRKEHMRQKLNIK 457

Query: 465 STDYSDGLMYTFAENPPRSDMDFG 488
           S D  D   +T   +   ++ D G
Sbjct: 458 SPDRWDTYCFTMLVDYVPANEDIG 481


>gi|307308936|ref|ZP_07588619.1| hypothetical protein SinmeBDRAFT_4503 [Sinorhizobium meliloti
           BL225C]
 gi|306900570|gb|EFN31183.1| hypothetical protein SinmeBDRAFT_4503 [Sinorhizobium meliloti
           BL225C]
          Length = 472

 Score =  307 bits (785), Expect = 4e-81,   Method: Composition-based stats.
 Identities = 109/439 (24%), Positives = 190/439 (43%), Gaps = 38/439 (8%)

Query: 80  GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKH 139
             +    G GKT ++A  + W +     + V   A SE+ +K+ +W E            
Sbjct: 50  ITVKGSSGWGKTFISAISLWWSLIVFDPVKVTIFAPSESTIKSGIWNE------------ 97

Query: 140 WFEMQSLSLHPAPWYSDVLHCS-LGIDSKHYSTMC----RTYSEERPDTFVGHHNTYGMA 194
              +Q L  + AP + ++   S   I  K     C    R  S++      G H+   + 
Sbjct: 98  ---LQVLYSNMAPLFRELFEVSATKIFRKSRGETCWAEYRLVSKDNIAAARGFHSKNNI- 153

Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP--LDDWKRFQ 252
           +I DEASG  DVI  G L  +         ++ SNP + SG F++ +  P    DW +  
Sbjct: 154 VIADEASGIEDVIFTGALLNVLNDGPGAKVVLVSNPDKASGFFFKTWRDPELSKDWIKVH 213

Query: 253 IDTRTVEGIDPSFHEGIIARYG-LDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-NREP 310
              R      P   E     YG + S      V G+FP  D+D  I    ++EA+ N++ 
Sbjct: 214 GSIRDKPNYTPGEEERFARLYGGVTSRDYLTLVEGEFPLSDVDGLISREFLDEAVTNKDA 273

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLV----EK 366
            P+P AP+I G D A  G D +V+ +R   V+    +W+  +      ++  L     +K
Sbjct: 274 IPNPKAPIIWGLDPAGAGKDKSVLAIRHDNVLRGFEEWAGLEPVALALRVKELYLKTSKK 333

Query: 367 YRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQK-RAVDLEFCRNRRTELHVKMADWLE 425
            RP  I +D N  GA   D L+     VY+ +  +    + +     R ++  +M +W+ 
Sbjct: 334 DRPAVIAVDGNGLGAGVYDALKHFKIPVYKCMFAEVPKRNPDRYTRVRDQIWFEMREWIH 393

Query: 426 FA--SLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
               S+ NH  LI++L ++ ++   ++ ++ IE K   + +  +S DY+D L  TF+ + 
Sbjct: 394 TGDVSIPNHKKLIEDL-AIPTYE--DSPKIKIEDKKSLKKRLGRSPDYADALALTFSVSH 450

Query: 481 PRSDMDFGRCPSYQYEGVD 499
            R    +      +Y+ + 
Sbjct: 451 TRYASKYQWDKPIEYDNLS 469


>gi|260871239|ref|YP_003238019.1| DNA packaging protein [Escherichia coli O111:H- str. 11128]
 gi|257767818|dbj|BAI39311.1| DNA packaging protein [Escherichia coli O111:H- str. 11128]
          Length = 494

 Score =  306 bits (784), Expect = 5e-81,   Method: Composition-based stats.
 Identities = 90/482 (18%), Positives = 177/482 (36%), Gaps = 52/482 (10%)

Query: 32  VLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91
            L+ + W      L  F    +WQ +         + S           ++++G G GK+
Sbjct: 16  ALYRYDWIAAADVL--FGKTPTWQQD-------EIIESTQQDGSW---TSVTSGHGTGKS 63

Query: 92  TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEV-SKWLSLLPNKHWFEMQSLSLHP 150
            + + + +  +   PG  VI +AN   Q+   ++  + S W + +    W   +   L  
Sbjct: 64  DMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFPWLS-KYFILTE 122

Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG 210
             ++              ++ + ++      +   G H  + + II DEASG  D     
Sbjct: 123 TSFFEVTGKGV-------WTILIKSCRSGNEEALAGEHADHLLYII-DEASGVSDKAFSV 174

Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDP 263
           I G LT ++     +  S P R SG FY+  ++       P   +    +++     +D 
Sbjct: 175 ITGALTGKDNRILLL--SQPTRPSGYFYDSHHRLAIRPGNPDGLFTAIILNSEESPLVDA 232

Query: 264 SFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGC 322
            F    +A Y G D+ +  ++V G+FP+      +  + +E A  R+         +   
Sbjct: 233 KFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVKIAKGWGWVACV 292

Query: 323 DIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISGLV--EKYRPDA 371
           D+A   G D +V+ +        +R  +   + +++         KI      E++    
Sbjct: 293 DVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKIFAECNPERFPNIT 352

Query: 372 IIIDANNTGARTCDYL-EMLGYHVYRVLGQKR---AVDLEFCRNRRTELHVKMADWLEFA 427
           I ID +  G  T D + E  G  V R+   K+     D     + R   +++ A+ ++  
Sbjct: 353 IAIDGDGLGKSTADLMYERYGITVQRIRWGKKMHSREDKSLYFDMRAFANIQAAEAVKSG 412

Query: 428 SLINHSGLIQ-NLKSLKSFIVPNTGELAIES----KRVKGAKSTDYSDGLMYTFAENPPR 482
            +    G       S     + + G+  + S    K+     S D+ D   +    N   
Sbjct: 413 RMRLDKGAATIEEASKIPVGINSAGQWKVMSKEDMKKKLNLHSPDHWDTYCFAMLANYVP 472

Query: 483 SD 484
            D
Sbjct: 473 QD 474


>gi|46401730|ref|YP_006576.1| PacB [Enterobacteria phage P1]
 gi|301646767|ref|ZP_07246623.1| putative terminase B protein [Escherichia coli MS 146-1]
 gi|129547|sp|P27753|TERL_BPP1 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging
           protein B; AltName: Full=PACase B protein; AltName:
           Full=Terminase B protein; AltName: Full=Terminase large
           subunit
 gi|68597607|sp|Q5XLR0|TERL_BPP7 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging
           protein B; AltName: Full=PACase B protein; AltName:
           Full=Terminase B protein; AltName: Full=Terminase large
           subunit
 gi|33323612|gb|AAQ07582.1|AF503408_106 PacB [Enterobacteria phage P7]
 gi|215636|gb|AAA21724.1| pacB [Enterobacteria phage P1]
 gi|33338757|gb|AAQ14080.1| PacB [Enterobacteria phage P1]
 gi|33338866|gb|AAQ14188.1| PacB [Enterobacteria phage P1]
 gi|54112354|gb|AAV28854.1| PacB [Enterobacteria phage P7]
 gi|301075042|gb|EFK89848.1| putative terminase B protein [Escherichia coli MS 146-1]
          Length = 494

 Score =  306 bits (783), Expect = 7e-81,   Method: Composition-based stats.
 Identities = 90/482 (18%), Positives = 177/482 (36%), Gaps = 52/482 (10%)

Query: 32  VLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91
            L+ + W      L  F    +WQ +         + S           ++++G G GK+
Sbjct: 16  ALYRYDWIAAADVL--FGKTPTWQQD-------EIIESTQQDGSW---TSVTSGHGTGKS 63

Query: 92  TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEV-SKWLSLLPNKHWFEMQSLSLHP 150
            + + + +  +   PG  VI +AN   Q+   ++  + S W + +    W   +   L  
Sbjct: 64  DMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFPWLS-KYFILTE 122

Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG 210
             ++              ++ + ++      +   G H  + + II DEASG  D     
Sbjct: 123 TSFFEVTGKGV-------WTILIKSCRPGNEEALAGEHADHLLYII-DEASGVSDKAFSV 174

Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDP 263
           I G LT ++     +  S P R SG FY+  ++       P   +    +++     +D 
Sbjct: 175 ITGALTGKDNRILLL--SQPTRPSGYFYDSHHRLAIRPGNPDGLFTAIILNSEESPLVDA 232

Query: 264 SFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGC 322
            F    +A Y G D+ +  ++V G+FP+      +  + +E A  R+         +   
Sbjct: 233 KFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVKIAKGWGWVACV 292

Query: 323 DIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISGLV--EKYRPDA 371
           D+A   G D +V+ +        +R  +   + +++         KI      E++    
Sbjct: 293 DVAGGTGRDKSVINIMMVSGQRNKRRVINYRMLEYTDVTETQLAAKIFAECNPERFPNIT 352

Query: 372 IIIDANNTGARTCDYL-EMLGYHVYRVLGQKR---AVDLEFCRNRRTELHVKMADWLEFA 427
           I ID +  G  T D + E  G  V R+   K+     D     + R   +++ A+ ++  
Sbjct: 353 IAIDGDGLGKSTADLMYERYGITVQRIRWGKKMHSREDKSLYFDMRAFANIQAAEAVKSG 412

Query: 428 SLINHSGLIQ-NLKSLKSFIVPNTGELAIES----KRVKGAKSTDYSDGLMYTFAENPPR 482
            +    G       S     + + G+  + S    K+     S D+ D   +    N   
Sbjct: 413 RMRLDKGAATIEEASKIPVGINSAGQWKVMSKEDMKKKLNLHSPDHWDTYCFAMLANYVP 472

Query: 483 SD 484
            D
Sbjct: 473 QD 474


>gi|331649955|ref|ZP_08351031.1| terminase B protein (PACase B protein) (DNA packaging B protein)
           [Escherichia coli M605]
 gi|331041212|gb|EGI13366.1| terminase B protein (PACase B protein) (DNA packaging B protein)
           [Escherichia coli M605]
          Length = 494

 Score =  305 bits (782), Expect = 9e-81,   Method: Composition-based stats.
 Identities = 90/482 (18%), Positives = 177/482 (36%), Gaps = 52/482 (10%)

Query: 32  VLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91
            L+ + W      L  F    +WQ +         + S           ++++G G GK+
Sbjct: 16  ALYRYDWIAAADVL--FGKTPTWQQD-------EIIESTQQDGSW---TSVTSGHGTGKS 63

Query: 92  TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEV-SKWLSLLPNKHWFEMQSLSLHP 150
            + + + +  +   PG  VI +AN   Q+   ++  + S W + +    W   +   L  
Sbjct: 64  DMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFPWLS-KYFILTE 122

Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG 210
             ++              ++ + ++      +   G H  + + II DEASG  D     
Sbjct: 123 TSFFEVTGKGV-------WTILIKSCRPGNEEALAGEHADHLLYII-DEASGVSDKAFSV 174

Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------PLDDWKRFQIDTRTVEGIDP 263
           I G LT ++     +  S P R SG FY+  ++       P   +    +++     +D 
Sbjct: 175 ITGALTGKDNRILLL--SQPTRPSGYFYDSHHRLAIRPGNPDGLFTAIILNSEESPLVDA 232

Query: 264 SFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGC 322
            F    +A Y G D+ +  ++V G+FP+      +  + +E A  R+         +   
Sbjct: 233 KFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVKIAKGWGWVACV 292

Query: 323 DIA-EEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTTNNKISGLV--EKYRPDA 371
           D+A   G D +V+ +        +R  +   + +++         KI      E++    
Sbjct: 293 DVAGGTGRDKSVINIMMVSGQRNKRRVINYRMQEYTDVTETQLAAKIFAECNPERFPNIT 352

Query: 372 IIIDANNTGARTCDYL-EMLGYHVYRVLGQKR---AVDLEFCRNRRTELHVKMADWLEFA 427
           I ID +  G  T D + E  G  V R+   K+     D     + R   +++ A+ ++  
Sbjct: 353 IAIDGDGLGKSTADLMYERYGITVQRIRWGKKMHSREDKSLYFDMRAFANIQAAEAVKSG 412

Query: 428 SLINHSGLIQ-NLKSLKSFIVPNTGELAIES----KRVKGAKSTDYSDGLMYTFAENPPR 482
            +    G       S     + + G+  + S    K+     S D+ D   +    N   
Sbjct: 413 RMRLDKGAATIEEASKIPVGINSAGQWKVMSKEDMKKKLNLHSPDHWDTYCFAMLANYVP 472

Query: 483 SD 484
            D
Sbjct: 473 QD 474


>gi|48697461|ref|YP_024846.1| Pas60 [Actinoplanes phage phiAsp2]
 gi|47679679|gb|AAT36808.1| Pas60 [Actinoplanes phage phiAsp2]
          Length = 492

 Score =  304 bits (779), Expect = 2e-80,   Method: Composition-based stats.
 Identities = 105/461 (22%), Positives = 173/461 (37%), Gaps = 53/461 (11%)

Query: 49  SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108
            +P +W  + ++V  A     + +  P   + A+    G+GK+   A LV W  +TR  +
Sbjct: 21  DSPTAWAADCLDVRLAGYQGEILDAVPRERRVAVRGPHGLGKSFSGAILVNWFATTRDLM 80

Query: 109 ----SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI 164
                +I  A++   L+  LW E+ KW           +  ++L  AP+        L +
Sbjct: 81  GKDWKIITTASAWRHLEVYLWPEIHKWAG--------RINFVALGRAPYNPRTELLDLRL 132

Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA---- 220
              H +      +  +P+   G H    +  + DEA   P      I G  +        
Sbjct: 133 KLTHGAATA--VASNQPERIEGAHAEE-LLYLLDEAKIVPPATWDSIEGAFSNAGVDVAD 189

Query: 221 NRFWIMTSNPRRLSGKFYEIFNK--PLDDWKRFQIDTRT---VEGIDPSFHEGIIARYGL 275
           N +    S P   SG+FY+I  +    +DW    +          I  ++ +   +++G 
Sbjct: 190 NAYAFAMSTPGAPSGRFYDIHRRAPGYEDWWTRHVTLEEAIASGRISRAWADQRRSQWGS 249

Query: 276 DSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP------CPDPYAPLIMGCDIAEEGG 329
           DS V    V G+F   D DS IPL  +E A+ R         P P  PL  G D+   GG
Sbjct: 250 DSAVFHNRVLGEFHASDEDSVIPLAWLEAAIERWHEWDRQGRPSPGGPLWTGVDVGR-GG 308

Query: 330 DNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
           D TV+  R G  +  L    + D   T   I     + R    IID    GA   D L  
Sbjct: 309 DETVLAARDGWAVT-LETNRRRDTMATVGLI-----QAREGRAIIDVIGLGAGVFDRLRE 362

Query: 390 LGYHVYRVLGQKRAVDLEF-----CRNRRTELHVKMADWLE-----FASLINHSGLIQNL 439
           LG       G   A   +        N R+  +  + + L+       +L     +I +L
Sbjct: 363 LGTRPLAYTGSAGATVRDRSGKFGFTNTRSAAYWNLRELLDPAFDPVLALPPDDLMISDL 422

Query: 440 KSLKSFIVPNT--GELAIESKRV---KGAKSTDYSDGLMYT 475
            +   + V      ++ +E K     +  +S D  D +  +
Sbjct: 423 TT-PHWEVTTGVPPKIKVEPKDKVVERLGRSPDRGDAIAMS 462


>gi|323516996|gb|ADX91377.1| hypothetical protein ABTW07_0941 [Acinetobacter baumannii
           TCDC-AB0715]
 gi|323518424|gb|ADX92805.1| hypothetical protein ABTW07_2381 [Acinetobacter baumannii
           TCDC-AB0715]
          Length = 663

 Score =  297 bits (761), Expect = 2e-78,   Method: Composition-based stats.
 Identities = 89/431 (20%), Positives = 154/431 (35%), Gaps = 51/431 (11%)

Query: 86  RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145
              GKT     + LW +       ++  A    QLK  +W E+S             +  
Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLAR 256

Query: 146 LSLHPAPWYSDVL-----HCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
           L   P  W +D +        +    + +    +T  + +P    G+H    M  + DEA
Sbjct: 257 LKQGPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEA 315

Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD----WKRFQIDTR 256
           SG  D +     G LT  +     +MTS P R +G FYE  +K        W     +  
Sbjct: 316 SGVDDAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGE 373

Query: 257 TVEGIDPSFHEGIIARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315
               +     E    +YG   D   ++ V G+FP    +  I     EE        D +
Sbjct: 374 ESPLVSKQSLEEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDH 433

Query: 316 A-PLIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKI 360
               ++  D+    G D++V+V+             RR  V++     ++ D+     KI
Sbjct: 434 QFGYVITVDVGGGVGRDDSVIVVSKVWGESQWGERARRVEVVDIPLCKNRDDILELFAKI 493

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAV---DLEFCRNRRTELH 417
           + L+ +Y    +++D N  G     YL+  G     V    +     + +   N+R+  +
Sbjct: 494 NELLLQYPNANLVVDDNGAGKGLGQYLKKQGIFYVPVYWGSQCFSNDNRKEFTNKRSLAY 553

Query: 418 VKMADWLEFAS-----LINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYS 469
           V +A  +           ++  +   L  +  +   +     I SK   +  G KS D  
Sbjct: 554 VGLARAIASGRFKIKTKKHNVKIKDQLIHVP-YRFDDFARYKILSKDEMKRMGIKSPDIG 612

Query: 470 DGLMYTFAENP 480
           D   + F EN 
Sbjct: 613 DAFAFLFLENV 623


>gi|299769795|ref|YP_003731821.1| hypothetical protein AOLE_07785 [Acinetobacter sp. DR1]
 gi|298699883|gb|ADI90448.1| hypothetical protein AOLE_07785 [Acinetobacter sp. DR1]
          Length = 668

 Score =  297 bits (759), Expect = 4e-78,   Method: Composition-based stats.
 Identities = 91/431 (21%), Positives = 153/431 (35%), Gaps = 51/431 (11%)

Query: 86  RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145
              GKT     + LW +       ++  A    QLK  +W E+S             +  
Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLAR 256

Query: 146 LSLHPAPWYSDVL-----HCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
           L   P  W +D +        +    + +    +T  + +P    G+H    M  + DEA
Sbjct: 257 LKQGPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEA 315

Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD----WKRFQIDTR 256
           SG  D +     G LT  +     +MTS P R +G FYE  +K        W     +  
Sbjct: 316 SGVDDAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGE 373

Query: 257 TVEGIDPSFHEGIIARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315
               +     E    +YG   D   ++ V G+FP    +  I     EE        D +
Sbjct: 374 ESPLVSKQSLEEQRQKYGSRDDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDH 433

Query: 316 A-PLIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKI 360
               I+  D+    G D++V+V+             RR  V++     ++ D+     KI
Sbjct: 434 QFGYIITVDVGGGVGRDDSVIVISKVWGEAQWGERARRVEVVDIPLCKNRDDILELFAKI 493

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAV---DLEFCRNRRTELH 417
           + L+ +Y    +++D N  G     YL+  G     V    +     + +   N+R+  +
Sbjct: 494 NELLLQYPNANLVVDDNGAGKGLGQYLKKQGIFYVPVYWGSQCFSNDNRKEFTNKRSLAY 553

Query: 418 VKMADWLEFAS-----LINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYS 469
           V  A  +           ++  +   L  +  +   +     I SK   R  G KS D  
Sbjct: 554 VGFARAVASGRFKMKTKKHYVKIKDQLIHIP-YRFDDFARYKILSKDEMRRMGIKSPDLG 612

Query: 470 DGLMYTFAENP 480
           D   + F EN 
Sbjct: 613 DAFAFLFLENV 623


>gi|256392042|ref|YP_003113606.1| hypothetical protein Caci_2856 [Catenulispora acidiphila DSM 44928]
 gi|256358268|gb|ACU71765.1| conserved hypothetical protein [Catenulispora acidiphila DSM 44928]
          Length = 484

 Score =  295 bits (755), Expect = 1e-77,   Method: Composition-based stats.
 Identities = 88/479 (18%), Positives = 164/479 (34%), Gaps = 58/479 (12%)

Query: 47  GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106
             + P  W  + +          +     +    A+ +  G GK+ + + L  W + T P
Sbjct: 24  YLADPARWVDDKLGEYLWSRQVDIATSVRDQRLTAVQSCHGTGKSFVASRLTAWWLDTHP 83

Query: 107 --GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI 164
                V+  A +  Q+K  LWAE++K  +    +         ++   W  D    + G 
Sbjct: 84  PGEAFVVTTAPTGDQVKAILWAEINKAFAKAEARG--TPLPGRINETDWKYDKFLVAFG- 140

Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFW 224
                    R  S+  P  F G H  Y + +I DEA G         L   T  +     
Sbjct: 141 ---------RKPSDYNPHAFQGIHAKYVL-VILDEACGISKQFWTAALAIATGVHCRI-- 188

Query: 225 IMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE--------------GIDPSFHEGII 270
           +   NP      F ++       W   +I  R                  +  ++   + 
Sbjct: 189 LAIGNPDDPGSHFAQVCKSDR--WNMIKIAARDTPNFTGEEVPDDLADMLVSQAYVLDMA 246

Query: 271 ARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP----CPDPYAPLIMGCDIAE 326
             +G +S +   +V  +FP    D  + L+ +  A  REP     PD   P+ +G D+  
Sbjct: 247 EEFGPESPIYLSKVDAEFPSDASDGVVRLSKL-MACTREPVHPYAPDRLVPVELGVDLGA 305

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY 386
            GGD T +  RRG      +   + D     + I   + +     + +D+   G      
Sbjct: 306 -GGDETCIRERRGIAAGREWRNREKDSEKVVDHIVRAIRETGATKVKVDSIGIGWGIVGS 364

Query: 387 LEMLG------YHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL-EFAS-------LINH 432
           L+           V  V   + +   E     R+++  ++   L E            + 
Sbjct: 365 LQARRKQGLHTAEVVGVNVSEASTQPEKYARLRSQIWWEVGRKLSEDGGWDLSQLDTTDR 424

Query: 433 SGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN-PPRSDMDF 487
             L+  L + K + +  +G + +E K   + +  +S D +D L+  F     P+  +  
Sbjct: 425 DRLVSQLTAPK-YDLDASGRIVVEKKEETKKRIGRSPDNADALLLAFYTPSVPKPGIRV 482


>gi|184158505|ref|YP_001846844.1| hypothetical protein ACICU_02185 [Acinetobacter baumannii ACICU]
 gi|183210099|gb|ACC57497.1| hypothetical protein ACICU_02185 [Acinetobacter baumannii ACICU]
          Length = 663

 Score =  295 bits (755), Expect = 1e-77,   Method: Composition-based stats.
 Identities = 87/431 (20%), Positives = 153/431 (35%), Gaps = 51/431 (11%)

Query: 86  RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145
              GKT     + LW +       ++  A    QLK  +W E+S             +  
Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLAR 256

Query: 146 LSLHPAPWYSDVL-----HCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
           L   P  W +D +        +    + +    +T  + +P    G+H    M  + DEA
Sbjct: 257 LKQGPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEA 315

Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD----WKRFQIDTR 256
           SG  D +     G LT  +     +MTS P R +G FYE  +K        W     +  
Sbjct: 316 SGVDDAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGE 373

Query: 257 TVEGIDPSFHEGIIARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315
               +     +    +YG   D   ++ V G+FP    +  I     EE        D +
Sbjct: 374 ESPLVSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDH 433

Query: 316 A-PLIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKI 360
               ++  D+    G D++V+V+             RR  V++     ++ D+     KI
Sbjct: 434 QFGYVITVDVGGGVGRDDSVIVVSKVWGESQWGERARRVEVVDIPLCKNRDDILELFAKI 493

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAV---DLEFCRNRRTELH 417
           + L+ +Y    +++D N  G     YL+  G     V    +     + +   N+R+  +
Sbjct: 494 NELLLQYPNANLVVDDNGAGKGLGQYLKKQGIFYVPVYWGSQCFSNDNRKEFTNKRSLAY 553

Query: 418 VKMADWLEFAS-----LINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYS 469
           V +   +           ++  +   L  +  +   +     I SK   +  G KS D  
Sbjct: 554 VGLQRAIASGRFKIKTKKHNVKIKDQLIHVP-YRFDDFARYKILSKDEMKRMGIKSPDIG 612

Query: 470 DGLMYTFAENP 480
           D   + F EN 
Sbjct: 613 DAFAFLFLENV 623


>gi|213156231|ref|YP_002318651.1| phage terminase [Acinetobacter baumannii AB0057]
 gi|301346399|ref|ZP_07227140.1| phage terminase [Acinetobacter baumannii AB056]
 gi|301594275|ref|ZP_07239283.1| phage terminase [Acinetobacter baumannii AB059]
 gi|213055391|gb|ACJ40293.1| phage terminase [Acinetobacter baumannii AB0057]
          Length = 663

 Score =  295 bits (754), Expect = 1e-77,   Method: Composition-based stats.
 Identities = 88/431 (20%), Positives = 156/431 (36%), Gaps = 51/431 (11%)

Query: 86  RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145
              GKT     + LW +       ++  A    QLK  +W E+S             +  
Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLAR 256

Query: 146 LSLHPAPWYSDVL-----HCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
           L   P  W +D +        +    + +    +T  + +P    G+H    M  + DEA
Sbjct: 257 LKQGPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEA 315

Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD----WKRFQIDTR 256
           SG  D +     G LT  +     +MTS P R +G FYE  +K        W     +  
Sbjct: 316 SGVDDAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGE 373

Query: 257 TVEGIDPSFHEGIIARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315
               +     +    +YG   D   ++ V G+FP    +  I     EE        D +
Sbjct: 374 ESPLVSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDH 433

Query: 316 A-PLIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKI 360
               ++  D+    G D++V+V+             RR  V++     ++ D+     KI
Sbjct: 434 QFGYVITVDVGGGVGRDDSVIVVSKVWGEAQWGERARRVEVVDIPLCKNRDDILELFAKI 493

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAV---DLEFCRNRRTELH 417
           + L+ +Y    +++D N  G     YL+  G     V    +     + +   N+R+  +
Sbjct: 494 NELLLQYPNANLVVDDNGAGKGLGQYLKKQGIFYVPVYWGSQCFSNDNRKEFTNKRSLAY 553

Query: 418 VKMADWL-----EFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYS 469
           V +A  +     +  +  ++  +   L  +  +   +     I SK   +  G KS D  
Sbjct: 554 VGLARAIANGRFKIKTKKHNVKIKDQLIHVP-YRFDDFARYKILSKDEMKRMGIKSPDIG 612

Query: 470 DGLMYTFAENP 480
           D   + F EN 
Sbjct: 613 DAFAFLFLENV 623


>gi|332852816|ref|ZP_08434408.1| intein splicing region-containing protein [Acinetobacter baumannii
           6013150]
 gi|332871045|ref|ZP_08439658.1| intein splicing region-containing protein [Acinetobacter baumannii
           6013113]
 gi|332729027|gb|EGJ60377.1| intein splicing region-containing protein [Acinetobacter baumannii
           6013150]
 gi|332731805|gb|EGJ63085.1| intein splicing region-containing protein [Acinetobacter baumannii
           6013113]
          Length = 663

 Score =  295 bits (754), Expect = 2e-77,   Method: Composition-based stats.
 Identities = 88/431 (20%), Positives = 154/431 (35%), Gaps = 51/431 (11%)

Query: 86  RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145
              GKT     + LW +       ++  A    QLK  +W E+S             +  
Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLAR 256

Query: 146 LSLHPAPWYSDVL-----HCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
           L   P  W +D +        +    + +    +T  + +P    G+H    M  + DEA
Sbjct: 257 LKQGPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEA 315

Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD----WKRFQIDTR 256
           SG  D +     G LT  +     +MTS P R +G FYE  +K        W     +  
Sbjct: 316 SGVDDAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGE 373

Query: 257 TVEGIDPSFHEGIIARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315
               +     +    +YG   D   ++ V G+FP    +  I     EE        D +
Sbjct: 374 ESPLVSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDH 433

Query: 316 A-PLIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKI 360
               ++  D+    G D++V+V+             RR  V++     ++ D+     KI
Sbjct: 434 QFGYVITVDVGGGVGRDDSVIVVSKVWGEAQWGERARRVEVVDIPLCKNRDDILELFAKI 493

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAV---DLEFCRNRRTELH 417
           + L+ +Y    +++D N  G     YL+  G     V    +     + +   N+R+  +
Sbjct: 494 NELLLQYPNANLVVDDNGAGKGLGQYLKKQGIFYVPVYWGSQCFSNDNRKEFTNKRSLAY 553

Query: 418 VKMADWLEFAS-----LINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYS 469
           V +A  +           ++  +   L  +  +   +     I SK   +  G KS D  
Sbjct: 554 VGLARAIASGRFKIKTKKHNVKIKDQLIHVP-YRFDDFARYKILSKDEMKRMGIKSPDIG 612

Query: 470 DGLMYTFAENP 480
           D   + F EN 
Sbjct: 613 DAFAFLFLENV 623


>gi|260551382|ref|ZP_05825582.1| phage terminase [Acinetobacter sp. RUH2624]
 gi|260405545|gb|EEW99037.1| phage terminase [Acinetobacter sp. RUH2624]
          Length = 663

 Score =  295 bits (754), Expect = 2e-77,   Method: Composition-based stats.
 Identities = 88/431 (20%), Positives = 154/431 (35%), Gaps = 51/431 (11%)

Query: 86  RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145
              GKT     + LW +       ++  A    QLK  +W E+S             +  
Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEIS-----------INLAR 256

Query: 146 LSLHPAPWYSDVL-----HCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
           L   P  W +D +        +    + +    +T  + +P    G+H    M  + DEA
Sbjct: 257 LKQGPLAWLADYVGYQSELVYIKGYKEKWYVFAKTAPKHQPTNLAGNHGDNYMVWV-DEA 315

Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD----WKRFQIDTR 256
           SG  D +     G LT  +     +MTS P R +G FYE  +K        W     +  
Sbjct: 316 SGVDDAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGE 373

Query: 257 TVEGIDPSFHEGIIARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315
               +     +    +YG   D   ++ V G+FP    +  I     EE        D +
Sbjct: 374 ESPLVSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDH 433

Query: 316 A-PLIMGCDIAEE-GGDNTVVVL-------------RRGPVIEHLFDWSKTDLRTTNNKI 360
               ++  D+    G D++V+V+             RR  V++     ++ D+     KI
Sbjct: 434 QFGYVITVDVGGGVGRDDSVIVVSKVWGEAQWGERARRVEVVDIPLCKNRDDILELFAKI 493

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAV---DLEFCRNRRTELH 417
           + L+ +Y    +++D N  G     YL+  G     V    +     + +   N+R+  +
Sbjct: 494 NELLLQYPNANLVVDDNGAGKGLGQYLKKQGIFYVPVYWGSQCFSNDNRKEFTNKRSLAY 553

Query: 418 VKMADWLEFAS-----LINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYS 469
           V +A  +           ++  +   L  +  +   +     I SK   +  G KS D  
Sbjct: 554 VGLARAIASGRFKIKTKKHNVKIKDQLIHVP-YRFDDFARYKILSKDEMKRMGIKSPDIG 612

Query: 470 DGLMYTFAENP 480
           D   + F EN 
Sbjct: 613 DAFAFLFLENV 623


>gi|216906085|ref|YP_002333619.1| terminase [Abalone shriveling syndrome-associated virus]
 gi|216263178|gb|ACJ72002.1| terminase [Abalone shriveling syndrome-associated virus]
          Length = 507

 Score =  284 bits (727), Expect = 2e-74,   Method: Composition-based stats.
 Identities = 109/470 (23%), Positives = 187/470 (39%), Gaps = 45/470 (9%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQLE ++ + A      ++    V   A+S G G GKT L+  L +W     PG     
Sbjct: 50  DWQLEIVDYI-AKFFRKNSDEKHFVCAIAVSGGNGTGKTKLSKALNIWRFCCHPGSRQFI 108

Query: 113 LANSETQLK----TTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168
           L NSE Q K    T L   +SK LS +       ++S + + +P  +D        D   
Sbjct: 109 LTNSERQTKRTGFTMLVRRISKLLSCIA-----ALESSAYYYSPAVADKPEVRTN-DMWD 162

Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTS 228
            + + ++ +E       G H+   M    DE++   D +   +    T+     F   T 
Sbjct: 163 VTYLLQSSTEA---ALSGLHHPM-MTFSFDESTYFNDHVWQALENMWTQGQVLCF--CTG 216

Query: 229 NPRRLSGKFY-EIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR-------YGLDSDVT 280
           NP   +  ++  +FNK L       + TR V  ++        AR       YG      
Sbjct: 217 NPSHDNNNYFARLFNKSLHKKDSLWL-TRCVSLLELPLKYRNDARARYIEEHYGKTHPRY 275

Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD-PYAPLIMGCDI--AEEGGDNTVVVLR 337
              V GQFP+++  +   +  I EA+ RE   +  + P+IMG D+  +   G  + + +R
Sbjct: 276 IASVLGQFPKKNTCNPFDITAISEAMEREVREEFIHHPVIMGIDVSISANNGSASAICVR 335

Query: 338 RGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-----EMLGY 392
            G  +  L ++          K+  L+++ +P  +++DAN  G    + L     E    
Sbjct: 336 EGTAVRVLREYRCH-YTEFRIKLLELLQEIKPTIVVVDANGVGFGLYEELHRTLPETSNV 394

Query: 393 HVYRVLGQKRAVDLEFCRNRRTELHVKMADWL--EFASLINHSGLIQNLKSLKSFIVPNT 450
            VY V     A       ++ +EL  K ++W   E  S+  +   +  L SL       +
Sbjct: 395 RVYGVRAHAEAFLKSEYADKMSELAKKSSEWFNNELVSIPKNYQFLNALTSLS--FADAS 452

Query: 451 GELAIESK---RVKGAKSTDYSDGLMYTFAENPPRSDMDFGRCPSYQYEG 497
           G++ +  K   + K   S D +D    TF +     +MD+ +     Y  
Sbjct: 453 GKIKLIGKTDAKKKVDLSMDMADAFFLTFLDGV---EMDWAQGVKDNYLD 499


>gi|134287454|ref|YP_001109621.1| hypothetical protein Bcep1808_7700 [Burkholderia vietnamiensis G4]
 gi|134131876|gb|ABO60570.1| hypothetical protein Bcep1808_7700 [Burkholderia vietnamiensis G4]
          Length = 509

 Score =  274 bits (701), Expect = 2e-71,   Method: Composition-based stats.
 Identities = 84/457 (18%), Positives = 157/457 (34%), Gaps = 49/457 (10%)

Query: 59  MEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET 118
           ++    H +   ++ + +  + ++S+G G GKT+  A + LW +      + I  A   +
Sbjct: 34  LKAPTHHQIQMFDSVSKQGSRTSVSSGHGTGKTSGFAIIALWHLLCYYLSNTILTAPKIS 93

Query: 119 QLKTTLWAEVSKWLSLLPNKHW-FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS 177
            +   +W E +   + + N    +  +   +       +     +     ++  + ++  
Sbjct: 94  TVSDGVWKEFADLSTKISNGPQSWIWEYFVI-------ESERVYVRGYKLNWFVIAKSAP 146

Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
              P+   G H  + +  + DEASG PD     I G LT+        + S P R SG F
Sbjct: 147 RGSPENLAGAHRDW-LLWLADEASGIPDDNFGVITGSLTDE--RNRMCLASQPTRSSGFF 203

Query: 238 YEIFN----KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI 293
           YE  +         W     ++     +   F      +Y  +    +++V G+FP+   
Sbjct: 204 YETHHALSRAEGGPWNNLVFNSEFSPIVSAKFIAEKKLQYTEEE--YQIKVQGRFPENSS 261

Query: 294 DSFIPLNIIEEALNREPC-PDPYAPLIMGCDIAEEG-GDNTVV----VLRRGPV------ 341
              +    IE  + R    PD +   ++  D+   G  D TV+    V+ RG        
Sbjct: 262 KYLVGPQAIEACVGRTVIKPDEHWGWLLPVDVGGGGWRDETVMPALHVIGRGEYGMDARR 321

Query: 342 ---IEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYR-V 397
              I      +  D    +  I     +      +IDA   G   C  L++ G+  YR V
Sbjct: 322 AQLISVPLHSNTQDPAQLHGVIVHAARERSNATAMIDAGGMGLIVCKQLDLDGFSQYRKV 381

Query: 398 LGQKRAVDLEF---CRNRRTELHVKMADWLEFA--SLINH------SGLIQNLKSLKSFI 446
                    E+     N+R +     A  +      +           L++    +  F 
Sbjct: 382 NWGNPNFAKEYKDRYVNQRAQACCGFARAITEGRFGINPDVPKSFVKKLVKQGSRIPYFW 441

Query: 447 VPNTGELAIESKRVK----GAKSTDYSDGLMYTFAEN 479
                   I  K          S D  D L + F E+
Sbjct: 442 -DEKARRQIMKKEDMREKENLPSPDVFDALSFAFLED 477


>gi|228924410|ref|ZP_04087639.1| hypothetical protein bthur0011_53510 [Bacillus thuringiensis
           serovar huazhongensis BGSC 4BD1]
 gi|228835241|gb|EEM80653.1| hypothetical protein bthur0011_53510 [Bacillus thuringiensis
           serovar huazhongensis BGSC 4BD1]
          Length = 293

 Score =  274 bits (701), Expect = 2e-71,   Method: Composition-based stats.
 Identities = 77/283 (27%), Positives = 124/283 (43%), Gaps = 30/283 (10%)

Query: 225 IMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV 284
            +  NP R SG FY+  N+  D +K  ++ +           E +  +YG  SDV RV V
Sbjct: 2   FLCGNPTRTSGVFYDSHNRDRDLYKIHKVSSLDSPRTSKDNIEVLKKKYGEGSDVWRVRV 61

Query: 285 CGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEH 344
            G+FP+ + D+FIPL I+E+A + +  P     L +G D+A  G D TV+  R G  +  
Sbjct: 62  LGEFPKAEADAFIPLEIVEQAASCKVEPT-GETLDLGVDVARFGDDETVIAPRIGNKVFK 120

Query: 345 LFDWSKTDLRTTNNKISGLVEKYRPDA-------IIIDANNTGARTCDYL------EMLG 391
           L +  K D   T   +  L ++Y           I +D +  G    D L      E L 
Sbjct: 121 LLNHYKQDTMETAGHVLKLAKEYMAKYKQLKRVDIKVDDSGVGGGVTDRLKEVIKSERLP 180

Query: 392 YHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE------------FASLINHSGLIQNL 439
           + VY V+   + +D E   N   E    + D LE               + N   +I   
Sbjct: 181 FKVYPVVNNGKPLDDEHYDNAGAEGWAVVRDLLEENMKAFIQGEEPTMEIPNDEKMISQF 240

Query: 440 KSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479
            S K + + + G++A+E K   + +G +S D +D ++  F + 
Sbjct: 241 SSRK-YRITSRGKIALERKEEMKKRGLQSPDRADAIVLAFYKP 282


>gi|226227228|ref|YP_002761334.1| hypothetical protein GAU_1822 [Gemmatimonas aurantiaca T-27]
 gi|226090419|dbj|BAH38864.1| hypothetical protein [Gemmatimonas aurantiaca T-27]
          Length = 549

 Score =  273 bits (697), Expect = 6e-71,   Method: Composition-based stats.
 Identities = 106/544 (19%), Positives = 177/544 (32%), Gaps = 91/544 (16%)

Query: 13  QKLFDLMWSDEIKLSFSNFVLHFFP----WGEKGTPLEGFSAPRSWQLEFMEVVDAHCLN 68
             + D        L ++   L        W      L        W            L 
Sbjct: 10  SLVIDHSAYRHDPLGWAEVALGVSRETLLW-----SLFDAYGTHEW------DGTPDPLA 58

Query: 69  SVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEV 128
           +V     +    A+++G G GKT L A L+LW ++  P      +A    Q +  +W EV
Sbjct: 59  TVLEAIAKNQWVAVASGTGTGKTFLEAVLLLWWIAVEPDSIATTVATKADQQEKGIWREV 118

Query: 129 SK-WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           ++ W          E+ +L +   PW  D              T      EE      G 
Sbjct: 119 ARHWPRFQACFPEAELTTLRIRMEPWRGDAWGA-------WGITAAPKAGEESSSAVQGL 171

Query: 188 HNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPR---RLSGKFYEIFNKP 244
           H    + I+ DE  G P  +   ++   T            NP       G+F E     
Sbjct: 172 HAKR-LLILVDETPGVPQPVMTALVNTATGE--ENVIAAFGNPDYQADPLGQFAET---- 224

Query: 245 LDDWKRFQIDTRTVEGI-----------DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI 293
                  +I       +                     +YG++S V +  V G  P+Q  
Sbjct: 225 -KRVTAIRISALDHPNVVLGVERIPGAATRLSIATREDKYGVESGVYQSRVRGIAPEQSA 283

Query: 294 DSFIPLNIIEEALNREPCPDPYA----PLIMGCDIAE-EGGDNTVVVLRRGPVIEHLFDW 348
            + I L     A +R       A    P  +G D+A+ E GD   V + +G  +  +   
Sbjct: 284 SALIHLAWCVAAADRAESVQHAALALGPKALGVDVAQSENGDKAAVAMGQGARLLSVIAK 343

Query: 349 SKTDLRTTNNKISGLVEKYR--PDAIIIDANNTGARTCDYL------EMLGYHVYRVLGQ 400
           +  +      ++  L+      P+ + +D    GA T ++L      E  G  V R  G 
Sbjct: 344 ACPNATKLGAEVWQLMRDEGIVPEYVGVDPIGVGAATVNHLDGECEKENAGRSVVRCSGG 403

Query: 401 KRAV----------------DLEFCRNRRTELHVKMADWLEFA--SLINHSGLIQNLKSL 442
            +A+                D    +N R ++  ++ + L     +L     L + L ++
Sbjct: 404 AKAMEASSRAADGSAMEWLADANKFKNLRAQMWWQLREDLRNGLIALPRDRELFRELTTV 463

Query: 443 KSFIVPNTGELA-IESK---RVKGAKSTDYSDGLMY-------TFAENPPRSDMDFGRCP 491
           +       G +  +ESK   R +  +S D +D ++Y       T    PP    D  R P
Sbjct: 464 Q---FDEDGGIVTLESKDDIRKRLGRSPDRADAVVYWNWVRPRTRVNQPPPEGFDVAR-P 519

Query: 492 SYQY 495
              Y
Sbjct: 520 IRNY 523


>gi|159897183|ref|YP_001543430.1| hypothetical protein Haur_0654 [Herpetosiphon aurantiacus ATCC
           23779]
 gi|159890222|gb|ABX03302.1| conserved hypothetical protein [Herpetosiphon aurantiacus ATCC
           23779]
          Length = 472

 Score =  258 bits (660), Expect = 1e-66,   Method: Composition-based stats.
 Identities = 100/490 (20%), Positives = 169/490 (34%), Gaps = 82/490 (16%)

Query: 45  LEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEV-FKGAISAGRGIGKTTLNAWLVLWLMS 103
           L     P ++  E +  V       +        ++  + A   +GKT L   LV W   
Sbjct: 2   LPYAHDPVAYAREVLGEVWWTKQELIARSLLTPPYRTLVKACHKVGKTHLGGGLVNWWYD 61

Query: 104 TRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLG 163
           +     V+  A ++ Q++  LW EV   +       +   +S  L   P +         
Sbjct: 62  SFDPGLVLTTAPTDRQVRDLLWKEVR--MQRRGRAGFTGPKSPRLESTPDH--------- 110

Query: 164 IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRF 223
                       ++ +  D+F GHH+ + +  I DEA G   V          E  A   
Sbjct: 111 --------FAHGFTAKDGDSFQGHHSPHTL-FIFDEAVGVASVFWETAESMFNEGGA--- 158

Query: 224 WIMTSNPR---------RLSGKFYEI----------------FNKPLDDWKRF-QIDT-- 255
           W+   NP           LSG ++ I                   P     R  ++DT  
Sbjct: 159 WLAIFNPTDTSSQAYAEELSGGWHVISMSVLEHPNILAELQGLPPPFPSAIRLSRVDTLL 218

Query: 256 ----RTVEGIDPSFHEGIIAR--YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309
               R +   +P     I  R  +     +    + G++P Q  ++       + A +  
Sbjct: 219 KKWCRALSPEEPKRATDIHWRDAWYRPGPIAEARLLGRWPSQATNNVWSDGAFQVAESL- 277

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY-- 367
             P    P  +GCD+A  G D T + +RRG    +    +      T  ++  L  +Y  
Sbjct: 278 LLPASDEPCELGCDVARYGDDFTEIHVRRGGHSLYHEAANGWSTVETAGRLKQLANEYGR 337

Query: 368 ------RPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421
                 R  A+ ID +  G    D     GY    V G + A D E   NRR+EL   +A
Sbjct: 338 RCGVDGRAVAVKIDDDGIGGGVVDL--ADGYTFLGVSGARTAYDPEKYPNRRSELWFSVA 395

Query: 422 D-----WLEFASLINHSGLIQNLK---SLKSFIVPNTGELAIESK---RVKGAKSTDYSD 470
           +      L F +L   +   + L+      ++   + G   +E K   + +  +S D  D
Sbjct: 396 ERAMEQRLSFVAL--DAETRRELRRQAMAPTWKQDSQGRRVVEPKADTKKRIKRSPDGMD 453

Query: 471 GLMYTFAENP 480
            +   +A  P
Sbjct: 454 AVNLAYAPAP 463


>gi|322656964|gb|EFY53248.1| DNA packaging protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. CASC_09SCPH15965]
          Length = 411

 Score =  255 bits (652), Expect = 1e-65,   Method: Composition-based stats.
 Identities = 77/327 (23%), Positives = 132/327 (40%), Gaps = 30/327 (9%)

Query: 72  NPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKW 131
           +      +  +++G G GK++L A L+L  M   P   VI +AN   Q+KT ++  V ++
Sbjct: 49  SVQETGSRTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQY 108

Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191
            +    +H +      L    +Y        GI    +  +C+ Y     +   G H  +
Sbjct: 109 WANAVKRHGWLQTYFVLSDTMFYE---RSRKGI----WEVLCKGYRLGNEEALAGEHAAH 161

Query: 192 GMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-------P 244
            + +I DEASG  D     + G LTE +     +M S P R SG FY+  +        P
Sbjct: 162 -LLLILDEASGISDKAIGVMTGALTEEDNR--MLMLSQPTRPSGYFYDSHHSQAKTPDNP 218

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303
              W    +++     + P F +  +  Y G DS    V+V GQFP++     +  +  +
Sbjct: 219 KGIWTAIVLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECD 278

Query: 304 EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL--------RRGPVIEHLFDWSKT-DLR 354
            A  R+   +     +   D+   G D +V+ +        +R  V   + +   T D  
Sbjct: 279 RAARRKVLLEKNWGWVATADVG-NGRDKSVLNICKVSGHRDKRRVVNFKVMEMPGTMDPL 337

Query: 355 TTNNKISGLV--EKYRPDAIIIDANNT 379
              + I      EKY    I +DA+  
Sbjct: 338 AFADFIYNECTPEKYPNITIAVDADGL 364


>gi|262316909|emb|CBA18135.1| putative terminase B [Paenibacillus phage phiBP]
          Length = 248

 Score =  252 bits (644), Expect = 9e-65,   Method: Composition-based stats.
 Identities = 66/242 (27%), Positives = 104/242 (42%), Gaps = 16/242 (6%)

Query: 47  GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106
              +P+++  E +         SV++   +    ++ +G+G+GKT L A + LW +   P
Sbjct: 23  YRKSPKTFFKEILNFSPDKWQESVSDDIAKYRFVSVRSGQGVGKTALEAAISLWFLCCFP 82

Query: 107 GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166
              V+C A +  QL   LWAE+SKW S  P         +      W    ++       
Sbjct: 83  FPRVVCTAPTRQQLNDVLWAEISKWQSQSP---------ILKRILKWTKTKIYM--KNYE 131

Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIM 226
           + +    RT +  +P+   G H  Y M  I DEASG  D I   I G L+         M
Sbjct: 132 ERWFATARTAT--KPENMQGFHEDY-MLFIVDEASGVDDRIMAAIFGTLSGDY--NKLFM 186

Query: 227 TSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCG 286
             NP + SG F++  N+    ++  ++             E + A+YG  SDV RV V G
Sbjct: 187 CGNPTKTSGFFFDSHNRDRAIYRTHRVSCLDSPRTSKENIEMLKAKYGEGSDVWRVRVLG 246

Query: 287 QF 288
           +F
Sbjct: 247 EF 248


>gi|111222161|ref|YP_712955.1| hypothetical protein FRAAL2741 [Frankia alni ACN14a]
 gi|111149693|emb|CAJ61385.1| hypothetical protein FRAAL2741 [Frankia alni ACN14a]
          Length = 535

 Score =  247 bits (631), Expect = 3e-63,   Method: Composition-based stats.
 Identities = 92/467 (19%), Positives = 151/467 (32%), Gaps = 59/467 (12%)

Query: 47  GFSAPRSWQLEFMEVV-DAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTR 105
               P  W  + +  V        + N      K A+ +    GK+ + A  V   + T 
Sbjct: 52  YRDEPVRWARDRLGGVHLWSKQQEIINALRVHRKVAVPSCHDAGKSFVAAAAVAHWLDTH 111

Query: 106 PG--ISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLG 163
           P      I  A +  Q++  LW E+ +   L+            ++   W  D    + G
Sbjct: 112 PPGSAFAITTAPTFPQVRAILWREIRRLSRLM------NPPLGRVNQTEWLIDDDLVAFG 165

Query: 164 IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRF 223
                     R  ++     F G H  Y + ++ DEA G P  + +      T  NA   
Sbjct: 166 ----------RKPADHDEGGFQGIHAQYPL-VVLDEAGGIPQQLWIAADSIATNENARI- 213

Query: 224 WIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE--------------GIDPSFHEGI 269
            +   NP   +  F ++    L  W    I                     +  ++ E  
Sbjct: 214 -LAIGNPDDPTSYFAQVC--ELPSWHVITIPAAETPAFTGEQIPDDLRQALLSRAWAEEK 270

Query: 270 IARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA---PLIMGCDIAE 326
              +G D+ V   +V  QFP+      I  + + +       P P +   P+ +G D+  
Sbjct: 271 RREWGEDNPVYISKVLAQFPKDVAWKVIKASDVAKRRIGRDEPWPASKLRPVCLGVDVG- 329

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY 386
           EG D TVV  RRG      +     +       I   V       + IDA   G      
Sbjct: 330 EGRDWTVVRERRGVQAGREWQARTPEPEQAVKLIGQAVLITGAKTVNIDAGGPGWGIAAA 389

Query: 387 LEML-------GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL------EFASLINHS 433
           L          G  V  +    ++ + E   N R EL   +   L      + + + N  
Sbjct: 390 LRGWLKQHKVRGVAVNPIRFGAKSREPEKYLNMRAELWWGVGRLLSEQGGWDLSVMENAD 449

Query: 434 GLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFA 477
                L     +       + IESK   R +  +S D +D L+  FA
Sbjct: 450 DTTAQLLD-PIWREGAGDRIVIESKEELRKRTGRSPDNADALLLAFA 495


>gi|161789175|ref|YP_001595730.1| PacB [Vibrio sp. 0908]
 gi|161761461|gb|ABX77106.1| PacB [Vibrio sp. 0908]
          Length = 572

 Score =  246 bits (628), Expect = 7e-63,   Method: Composition-based stats.
 Identities = 81/438 (18%), Positives = 155/438 (35%), Gaps = 38/438 (8%)

Query: 64  AHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTT 123
              +  +N   P   + ++++G G GK+ L A L L  + T P    +  ANS  Q+   
Sbjct: 47  FQQIEVINALTPVGARVSVASGHGTGKSHLTAALCLHFIITHPESLCMLTANSLDQVTNV 106

Query: 124 LWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDT 183
           +++ + +    +  +  +  Q   +    +Y+             +    +T S+   + 
Sbjct: 107 VFSYIKRCWVKICQRQPWLEQYFVITAKSFYA-------KGYKGVWQIFGKTCSKGNEEG 159

Query: 184 FVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK 243
             G H    M ++ DEASG  D     + G LTE N     ++ S   R +G F +   +
Sbjct: 160 LAGQHRRDYM-VVVDEASGVSDRAFEVLRGALTEDN--NKMLLISQFTRPTGHFADSQME 216

Query: 244 --PLDDWKRFQIDTRTVEGIDPSFHEGIIARY-GLDSDVTRVEVCGQFPQQDIDSFIPLN 300
                 +    +++     ++  F       Y G+ S    + V G  P       I  +
Sbjct: 217 LAEQGLYTAITLNSEMSPFVNLKFIREKRIEYGGVTSPEYGIRVLGVCPDDASGFLISRS 276

Query: 301 IIEEALNREPCPDPYAPLIMGCDIA-EEGGDNTVVVL---------RRGPVIEHLFDWSK 350
           ++++              +   D+A  EG D++V+ +         R+  V++ +   + 
Sbjct: 277 LVDKGFEAVIEFADEWGWVAVADVAGGEGRDSSVLKIGKVCGFGSERQVEVVKAIEAPAD 336

Query: 351 TDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD---LE 407
            D       I      Y   ++ IDA+  G  T    E LG +V R+   +        +
Sbjct: 337 MDGVQFARFIHQETAGYTNISVGIDADGYGLTTAQECEKLGVNVTRIHWGRPPHANSVKQ 396

Query: 408 FCRNRRTELHVKMADWLEFASLINH--------SGLIQNLKSLKSFIVPNTGELAIESKR 459
                +    V + + L    L  H          L +    +  +     G   I SK+
Sbjct: 397 RFPKEKDFACVMVKEALGTGRLKLHRGETKQFEKKLQKQFVKIP-YEFDELGRWRIFSKK 455

Query: 460 V---KGAKSTDYSDGLMY 474
               +G KS D  D   +
Sbjct: 456 QLRSEGIKSPDIFDATAF 473


>gi|257459276|ref|ZP_05624390.1| phosphatase, Ppx/GppA family [Campylobacter gracilis RM3268]
 gi|257443289|gb|EEV18418.1| phosphatase, Ppx/GppA family [Campylobacter gracilis RM3268]
          Length = 431

 Score =  245 bits (625), Expect = 2e-62,   Method: Composition-based stats.
 Identities = 76/318 (23%), Positives = 131/318 (41%), Gaps = 18/318 (5%)

Query: 177 SEERPDTFVGHHNTYGMAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLS 234
           S ERP+   G        +I +EA        +    +  +   N N    +   P+  +
Sbjct: 104 SAERPENIEGFGYD---TVILNEAGIILKDPYLWDNAISPMLLDNPNSRAFIGGVPKGKN 160

Query: 235 GKFYEIFNKPL---DDWKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTRVEVCGQFPQ 290
            KF+++  + +     W+ FQ  +     +     + ++A  G  DSDV R E+ G+F  
Sbjct: 161 -KFFDLAQRGMRNEKGWRNFQFSSYDNPLLQKEEIDRLVAELGGADSDVARQEIFGEFLD 219

Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK 350
              +S   L  IE A  ++   D  AP+I   D+A EG D +V+  R+G  +E L  +  
Sbjct: 220 TTSNSVFSLAAIEAAFRKQRYFDAGAPVIWALDVAREGDDESVLCKRQGDSVEPLKPYRI 279

Query: 351 TDLRTTNNKISGLVEK--YRPDAIIIDANNTGARTCDYLEMLGYH--VYRVLGQKRAVDL 406
                   +I G  E+   +P AI ID    GA   D L  LG    V    G  +A D 
Sbjct: 280 ASTSELAREIYGEYERTDLKPHAIYIDTIGVGAGVFDTLCDLGLRGIVREAKGSFKASDE 339

Query: 407 EFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGA 463
               N+R E++  + + L   ++     L + L+++  +         +  K   + +  
Sbjct: 340 RKYANKRAEMYFNLREKLPLLAIAPDEELKRQLQTIAFY-FDKKERYLLMPKEGIKKEYG 398

Query: 464 KSTDYSDGLMYTFAENPP 481
           +S D +D L  +F +  P
Sbjct: 399 RSPDRADALAMSFFDLCP 416


>gi|292670767|ref|ZP_06604193.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541]
 gi|292647388|gb|EFF65360.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541]
          Length = 442

 Score =  243 bits (621), Expect = 5e-62,   Method: Composition-based stats.
 Identities = 80/376 (21%), Positives = 147/376 (39%), Gaps = 28/376 (7%)

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
           +A    Q K   W  +  + + +P +         ++ +  Y ++        ++     
Sbjct: 63  VAPYRNQAKRVAWEYLKYYTNPIPGR--------VVNESELYIEL----PTRHARSPGAR 110

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINL-GILGFLTERNANRFWIMTSNPR 231
                 + PD   G +      +I DE +     +    I   L +R    + +    P+
Sbjct: 111 LYIIGADHPDALRGIYLDG---VILDEYADIKPELWGGVIRPALADRQG--WAVFIGTPK 165

Query: 232 RLSGKFYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFP 289
             + +FYE++        W      T     +     + + A+  +     R E+   F 
Sbjct: 166 GQN-QFYEMYQHAEKSAGWYSCIYRTDETGVLPAEELKDMQAQ--MTEMEIRQELLCDFT 222

Query: 290 QQDIDSFIPLNIIEEALNREPCPDP--YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347
               D  IP++++  A NR    D     P+I+G D+A  G D TV+ +R+G  ++ +  
Sbjct: 223 ASASDVVIPIDLVTAAANRLLKDDDVLGQPVILGVDVARFGDDRTVLCVRQGLWLKEVRT 282

Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLE 407
           ++      T +++   + ++ P A  IDA   GA   D L  L Y V  V   + A+D  
Sbjct: 283 FTGLSTMETASRVIDCINQHHPHATFIDAGAMGAGVIDRLRQLRYQVSEVNFGEMAMDAA 342

Query: 408 FCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAK 464
              N R E++ K   WLE    I  +  ++   S   +    TG + +E K   + +  K
Sbjct: 343 RYANIRAEMYFKCRAWLEAGGAIPQNAELKTELSTVEYKFNPTGRIILEPKDKLKERTGK 402

Query: 465 STDYSDGLMYTFAENP 480
           S D +DG + TFA   
Sbjct: 403 SPDLADGFVLTFARPV 418


>gi|298387330|ref|ZP_06996883.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
 gi|298259999|gb|EFI02870.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
          Length = 500

 Score =  242 bits (618), Expect = 8e-62,   Method: Composition-based stats.
 Identities = 93/491 (18%), Positives = 160/491 (32%), Gaps = 88/491 (17%)

Query: 53  SWQL---EFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP--- 106
            W     + +         +V     +    A+++G   GK  + A   L  M   P   
Sbjct: 15  DWCAFASDVLRANLDEEQKAVLRSVQKNPMTALASGTSRGKDFVAACAALCFMYLTPEWD 74

Query: 107 -------GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLH 159
                     +   A S+ Q++  +  EV +                          ++ 
Sbjct: 75  DDGNLIRNTKIALSAPSQRQVENIMTPEVRRLFRNAGILP---------------GRLVA 119

Query: 160 CSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERN 219
             +  D + Y         +  + + G H    M +I  EASG  + I   I G L    
Sbjct: 120 NDIRTDYEEYFLTGFKADNKNQEVWSGFHAANVMFVIT-EASGVSETIFSAIEGNL---Q 175

Query: 220 ANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDP-----------SFHEG 268
            N   ++  NP   +G            + +F++D+     +              + E 
Sbjct: 176 GNSRLLLVFNPNITTGYAANAMKSDR--FAKFRLDSLNATNVTAKREIIPGQVNYEWVED 233

Query: 269 IIARY----------------------GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            +  +                         +D+ R++V G FP+   D  IP   IE A 
Sbjct: 234 KVKHWCTPITKEEYNEGEGDFLFENNLYRPNDLFRIKVRGMFPKVAEDVLIPYEWIEIAN 293

Query: 307 NREPCPDPYAP---LIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL 363
            R     PY P     +G D+A  G DN+V   R G  +     + ++  + ++  + G 
Sbjct: 294 KRWQENHPYRPRKSCKLGVDVAGMGRDNSVFCPRYGNYVSQFDVF-QSAGKASHMHVVGK 352

Query: 364 VEKYR---PDAIIIDANNTGARTCDYLEMLG----YHVYRVLGQKRAVDLE---FCRNRR 413
              Y+    D I ID    GA     L   G    + V    G K   D+       N R
Sbjct: 353 ALSYKRTDRDIIFIDTIGEGAGVYSRLVEQGIRNIFSVKNSQGAKGLHDITGEYSFANMR 412

Query: 414 TELHVKMADWLE----FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKST 466
             L+  + DWL+    F  ++          +   +   + G++ IE K   + +  +S 
Sbjct: 413 AYLYWALRDWLDPKNNFFPMLPPCDQFTEEATETKWKFRSDGKILIEPKEEIKKRIKRSP 472

Query: 467 DYSDGLMYTFA 477
           DY D L  TF 
Sbjct: 473 DYMDALSETFY 483


>gi|225155389|ref|ZP_03723881.1| hypothetical protein ObacDRAFT_9437 [Opitutaceae bacterium TAV2]
 gi|224803845|gb|EEG22076.1| hypothetical protein ObacDRAFT_9437 [Opitutaceae bacterium TAV2]
          Length = 479

 Score =  241 bits (616), Expect = 1e-61,   Method: Composition-based stats.
 Identities = 92/451 (20%), Positives = 166/451 (36%), Gaps = 48/451 (10%)

Query: 42  GTPLEGFSA--PRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTT-LNAWLV 98
           GTP        P ++ +  +++        +            +   G GKT+ +   L 
Sbjct: 12  GTPAPHAEKLNPITFAVAVLKLRIYSWQAKIMASVWSGKPTVAATPNGAGKTSVIIVALA 71

Query: 99  LWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVL 158
           L L+   PG +V+  + +   +   ++A                  SL++H A + +   
Sbjct: 72  LTLLHEFPGATVVLTSATYRAVCDQIFA------------------SLAVHQAKFSAWKW 113

Query: 159 HCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG--MAIINDEASGTPDVINLGILGFLT 216
           + +   D +    +   ++ +R   F G H   G  + II DEA    D I +       
Sbjct: 114 NDTEINDGQGGRII--GFATDRGGRFEGFHAYPGRPLLIILDEAKSIADDIFVAA----- 166

Query: 217 ERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLD 276
           +R      +  S+   L G+F++ F++    + +FQ        I P F E + A+YG D
Sbjct: 167 DRCQPTMLLYISSWGGLFGRFHDAFSQDR--FAQFQAGIADCPHITPEFIEAMRAQYGED 224

Query: 277 SDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL 336
           SD+ R  + GQ P+ +   F+   +  E     P         + CD AE   D  V+  
Sbjct: 225 SDIYRSMILGQRPKGNETGFVVPFVDYERCESNPPVWQEGTKQVFCDFAET-SDECVIAK 283

Query: 337 RRGPVIEHLFDW-SKTDLRTTNNKISGLVEKYRPDAIII--DANNTGARTCDYLEMLGYH 393
           R G  +  +  W    +     ++  G + + + +  +I  DA+ TG      L + G  
Sbjct: 284 RDGNRLSIVDAWIPDGNTAGITDRFEGHLRRLQNEGFVIRGDADGTGHGYITALSLRGIK 343

Query: 394 VYRVLGQKRAVDLEFCRNRRTELHVKMADWLE--FASLINHSGLIQNLKSLKSFIV---- 447
           +  V      +D  +  N   E     A  ++  F  L +   L + L S +        
Sbjct: 344 ISGVKNNDAPMDNHYF-NLAAEHWWTFAKKVKSNFWILPHDEVLKRQLCSREEVYRKVGD 402

Query: 448 -----PNTGELAIESKRVKGAKSTDYSDGLM 473
                   G L +  K     KS D +D L+
Sbjct: 403 KKVYGREDGRLQLMPKSRLSTKSPDRADALV 433


>gi|283956317|ref|ZP_06373797.1| terminase B protein, putative [Campylobacter jejuni subsp. jejuni
           1336]
 gi|283792037|gb|EFC30826.1| terminase B protein, putative [Campylobacter jejuni subsp. jejuni
           1336]
          Length = 430

 Score =  241 bits (615), Expect = 2e-61,   Method: Composition-based stats.
 Identities = 74/324 (22%), Positives = 127/324 (39%), Gaps = 21/324 (6%)

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGT-----PDVINLGILGFLTERNANRFW 224
             +    S ER +   G        +I +EA         + +    +  +   N     
Sbjct: 96  GAVLHMRSAERSENIEGFGYD---LVILNEAGIILKGSKGEYLWYNAIRPMLLDNPKSRA 152

Query: 225 IMTSNPRRLSGKFYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTR 281
           I+   P+  +  FYE+  K L D  WK FQ  +     +     + +I   G  DS+V +
Sbjct: 153 IIGGVPKGKN-LFYELCRKELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGEDSEVVK 211

Query: 282 VEVCGQFPQQDIDSFIPLNIIEEALNREP--CPDPYAPLIMGCDIAEEGGDNTVVVLRRG 339
            E+ G+F          L  IE A+++            I G D+A  G D +V+  R+G
Sbjct: 212 QEIYGEFIDSSSAELFALTEIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSVLAKRKG 271

Query: 340 PVIEHLFDWSKTDLRTTNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRV 397
            +++ +  +S+       N+I     +   +P  I ID    G    D L   G  V+  
Sbjct: 272 FIVDEIKKYSQLGTMELANRILAEYNQSEDKPKGIFIDTCGLGVGVYDVLLNYGLPVFEA 331

Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIES 457
                A   E   N+R +++   A  L+   L+    L ++++ ++ +   + G L I S
Sbjct: 332 NSANSATSNE-YLNKRAQMYFTFAKNLKHMELVKDEELKKDMRMIE-YEYSDKGLLKIVS 389

Query: 458 K---RVKGAKSTDYSDGLMYTFAE 478
           K   +    KS D SD +  TF E
Sbjct: 390 KEQLKKNYGKSPDVSDAVALTFFE 413


>gi|153951273|ref|YP_001397540.1| putative terminase B protein [Campylobacter jejuni subsp. doylei
           269.97]
 gi|153951467|ref|YP_001398214.1| putative terminase B protein [Campylobacter jejuni subsp. doylei
           269.97]
 gi|152938719|gb|ABS43460.1| putative terminase B protein [Campylobacter jejuni subsp. doylei
           269.97]
 gi|152938913|gb|ABS43654.1| putative terminase B protein [Campylobacter jejuni subsp. doylei
           269.97]
          Length = 430

 Score =  241 bits (614), Expect = 3e-61,   Method: Composition-based stats.
 Identities = 80/325 (24%), Positives = 126/325 (38%), Gaps = 23/325 (7%)

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDV------INLGILGFLTERNANRF 223
             +    S ER +   G        +I +EA                I   L + N    
Sbjct: 96  GAVLHMRSAERSENIEGFAYD---LVILNEAGIILKDSKGGYLWYNSIRPMLLD-NPKSR 151

Query: 224 WIMTSNPRRLSGKFYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVT 280
            I+   P+  +  FYE+  K L D  WK FQ  +     +     + +I   G   SDV 
Sbjct: 152 AIIGGVPKGKN-LFYELCRKELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGESSDVV 210

Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREP--CPDPYAPLIMGCDIAEEGGDNTVVVLRR 338
           R E+ G+F          L+ IE A+++            I G D+A  G D +V+  R+
Sbjct: 211 RQEIYGEFIDSSSAELFSLSGIENAMSKNSFSTQKMQGENIWGLDVARYGDDKSVLAKRK 270

Query: 339 GPVIEHLFDWSKTDLRTTNNKISGLVE--KYRPDAIIIDANNTGARTCDYLEMLGYHVYR 396
           G VI+ L  +S+       NKI    +  + +P  I ID    G    D L   G  V+ 
Sbjct: 271 GFVIDELKKYSQLGTIELANKILAEYKQSEEKPKGIFIDTCGLGVGVYDVLLNYGLPVFE 330

Query: 397 VLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIE 456
                 A   +   N+R +++   A  L+   L+    L  +++ ++ +   + G L I 
Sbjct: 331 ANSANSATSNQ-YLNKRAQMYFTFAKNLKHMELVKDEELKNDMRRIE-YEYSDKGLLKIV 388

Query: 457 SK---RVKGAKSTDYSDGLMYTFAE 478
           SK   +    KS D SD +  TF E
Sbjct: 389 SKEQLKKNYGKSPDLSDAVALTFFE 413


>gi|226940459|ref|YP_002795533.1| Terminase large subunit [Laribacter hongkongensis HLHK9]
 gi|226715386|gb|ACO74524.1| Terminase large subunit [Laribacter hongkongensis HLHK9]
          Length = 272

 Score =  237 bits (605), Expect = 3e-60,   Method: Composition-based stats.
 Identities = 73/265 (27%), Positives = 113/265 (42%), Gaps = 9/265 (3%)

Query: 239 EIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298
           +   +    W   QID+RTVEG +          YG +SD  +V V G FP      FI 
Sbjct: 5   KCGRRFRHRWVARQIDSRTVEGTNKEQIAKWAEDYGEESDFFKVRVRGMFPSMSARQFIS 64

Query: 299 LNIIEEALNREPCPD--PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTT 356
              +  A  R   P+   YAP I+  D A EG D  V+ LR+G     L   +K D    
Sbjct: 65  ETDVSAAYGRALRPEQYQYAPKILTVDPAWEGDDEFVIGLRQGLSFRVLHTMAKNDNDLV 124

Query: 357 NNK-ISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTE 415
             + I+   ++   DA+ +DA   G       + +G     V     ++D   C N+R E
Sbjct: 125 AAQVIARYEDEEGADAVFVDA-GFGTGIVSAGKSMGRDWTLVWFAGNSMDAG-CLNKRAE 182

Query: 416 LHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRV---KGAKSTDYSDGL 472
           +     DWL+    I    ++++       +    G++ IESK+    +G  S + +D L
Sbjct: 183 MWRDARDWLKSGGAIPDDPVLRDELQAPEIVPRLDGKIQIESKKEMKARGVPSPNRADAL 242

Query: 473 MYTFAENPPRSD-MDFGRCPSYQYE 496
           + +FA    R D +D  R  S + E
Sbjct: 243 ILSFAYPVTRRDPLDALRNHSERRE 267


>gi|57237579|ref|YP_178593.1| terminase B protein, putative [Campylobacter jejuni RM1221]
 gi|57166383|gb|AAW35162.1| terminase B protein, putative [Campylobacter jejuni RM1221]
          Length = 430

 Score =  237 bits (604), Expect = 4e-60,   Method: Composition-based stats.
 Identities = 74/324 (22%), Positives = 124/324 (38%), Gaps = 21/324 (6%)

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGT-----PDVINLGILGFLTERNANRFW 224
             +    S ER +   G        +I +EA         + +    +  +   N     
Sbjct: 96  GAVLHMRSAERSENIEGFGYD---LVILNEAGIILKGSKGEYLWYNAIRPMLLDNPKSRA 152

Query: 225 IMTSNPRRLSGKFYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTR 281
           I+   P+  +  FYE+  K L D  WK FQ  +     +     + +I   G   S+V +
Sbjct: 153 IIGGVPKGKN-LFYELCRKELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGEGSEVVK 211

Query: 282 VEVCGQFPQQDIDSFIPLNIIEEALNREP--CPDPYAPLIMGCDIAEEGGDNTVVVLRRG 339
            E+ G+F          L+ IE A+++            I G D+A  G D + +  R+G
Sbjct: 212 QEIYGEFIDSSSAELFSLSEIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSALAKRKG 271

Query: 340 PVIEHLFDWSKTDLRTTNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRV 397
            VI  +  +S+       NKI     +   +P  I ID    G    D L   G  V+  
Sbjct: 272 FVIYEIKKYSQLGTIELANKILAEYNQSEDKPKGIFIDTCGLGVGVYDVLLNYGLPVFEA 331

Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIES 457
                A   E   N+R +++      L+   L+    L ++++ ++ +   + G L I S
Sbjct: 332 NSANSATSNE-YLNKRAQMYFTFTKNLKHMELVKDEELKKDMRMIE-YEYSDKGLLKIVS 389

Query: 458 K---RVKGAKSTDYSDGLMYTFAE 478
           K   +    KS D SD +  TF E
Sbjct: 390 KEQLKKNYGKSPDVSDAVALTFFE 413


>gi|315929403|gb|EFV08605.1| phosphatase, Ppx/GppA family [Campylobacter jejuni subsp. jejuni
           305]
          Length = 430

 Score =  236 bits (603), Expect = 5e-60,   Method: Composition-based stats.
 Identities = 75/324 (23%), Positives = 124/324 (38%), Gaps = 21/324 (6%)

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGT-----PDVINLGILGFLTERNANRFW 224
             +    S ER +   G        +I +EA         + +    +  +   N     
Sbjct: 96  GAVLHMRSAERSENIEGFGYD---LVILNEAGIILKGSKGEYLWYNAIRPMLLDNPKSRA 152

Query: 225 IMTSNPRRLSGKFYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTR 281
           I+   P+  +  FYE+  K L D  WK FQ  +     +     + +I   G   S+V +
Sbjct: 153 IIGGVPKGKN-LFYELCRKELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGEGSEVVK 211

Query: 282 VEVCGQFPQQDIDSFIPLNIIEEALNREP--CPDPYAPLIMGCDIAEEGGDNTVVVLRRG 339
            E+ G+F          L+ IE A+++            I G D+A  G D + +  R+G
Sbjct: 212 QEIYGEFIDSSSAELFSLSEIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSALAKRKG 271

Query: 340 PVIEHLFDWSKTDLRTTNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRV 397
            VI  +  +S+       NKI     +   +P  I ID    G    D L   G  V+  
Sbjct: 272 FVIYEIKKYSQLGTIELANKILAEYNQSEDKPKGIFIDTCGLGVGVYDVLLNYGLPVFEA 331

Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIES 457
                A   E   N+R +++   A  L+   L     L ++++ ++ +   + G L I S
Sbjct: 332 NSANSATSNE-YLNKRAQMYFTFAKNLKHMELFKDEELKKDMRMIE-YEYSDKGLLKIVS 389

Query: 458 K---RVKGAKSTDYSDGLMYTFAE 478
           K   +    KS D SD +  TF E
Sbjct: 390 KEYLKKNYGKSPDVSDAVALTFFE 413


>gi|189460514|ref|ZP_03009299.1| hypothetical protein BACCOP_01155 [Bacteroides coprocola DSM 17136]
 gi|189432758|gb|EDV01743.1| hypothetical protein BACCOP_01155 [Bacteroides coprocola DSM 17136]
          Length = 556

 Score =  231 bits (590), Expect = 1e-58,   Method: Composition-based stats.
 Identities = 90/510 (17%), Positives = 161/510 (31%), Gaps = 93/510 (18%)

Query: 56  LEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP--------- 106
            E + V        + +      + ++++G   GK  + A   +  +   P         
Sbjct: 57  REALGVTLDKEQQEILSSVQYNRRTSVASGTARGKDFVAACAAICFLYLTPRWRKNSLGE 116

Query: 107 -----GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCS 161
                   V   A ++ Q+K  +  E+S+  +    +    +  L+ +     +D     
Sbjct: 117 IELVENTKVALTAPTDRQVKNIMMPEISRLFNRAKARGVELIGKLNAYDIRTNND----- 171

Query: 162 LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNAN 221
                  +        E   + + G H  + M ++  EA+G  D     I G L     +
Sbjct: 172 ------EWFLTGFKADEHNHEAWSGFHAVHTMFVVT-EATGIGDDTFAAIEGNL---QGD 221

Query: 222 RFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHE-------------- 267
              ++  NP +  G   +   +  D W ++++++ T   I                    
Sbjct: 222 SRILLVFNPNKTVGYAAKS--QKGDRWHKYRLNSLTAPNIASKKIIIPGQVDYDWVLDKL 279

Query: 268 -------------------GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308
                                  ++    D+ R +V G FP+ D D+ IP   +EEA  R
Sbjct: 280 ENWCEKISPDEIISEMDDFEFEGQWYRPEDLFRKKVLGLFPKVDEDTLIPRQWLEEAHER 339

Query: 309 EPCPDPYAPL-----IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK---TDLRTTNNKI 360
                   PL     I+G D+A  G D T  VLRR   +      +     D      KI
Sbjct: 340 WKQAKGREPLRADLNILGVDVAGMGRDATCYVLRRDNWVASFDTHNSGGVADHMKVAGKI 399

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDY---LEMLGYHVYRVLGQKRAVDL----------- 406
                +     + ID    GA        LE   +++      + A              
Sbjct: 400 MVARRQNIGLYVSIDTIGEGAGVYSRCVELEDEPHYILSCKYSESAKTPNGRELSDITGQ 459

Query: 407 EFCRNRRTELHVKMADWL----EFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---R 459
               N R  L   + DWL       +++          +   F V + G+L IE K   +
Sbjct: 460 NKFFNMRAYLFWAVRDWLNPRNNTGAMLPPDDKFDEEATEIKFSVKSNGKLYIEPKEDIK 519

Query: 460 VKGAKSTDYSDGLMYTFAENPPRSDMDFGR 489
            +  +S D  D L  TF        ++  R
Sbjct: 520 ERLGRSPDKFDALANTFYPVRYAKPINVNR 549


>gi|154175204|ref|YP_001409090.1| Ppx/GppA family phosphatase [Campylobacter curvus 525.92]
 gi|112803006|gb|EAU00350.1| phosphatase, Ppx/GppA family [Campylobacter curvus 525.92]
          Length = 433

 Score =  229 bits (584), Expect = 8e-58,   Method: Composition-based stats.
 Identities = 89/458 (19%), Positives = 164/458 (35%), Gaps = 56/458 (12%)

Query: 52  RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVI 111
             WQ E      A                 I  GR  G T   A   +  +    G  ++
Sbjct: 11  TDWQREVFFKNKAKF-------------TTIEKGRRSGFTKGMANACIEWLI--EGKKIL 55

Query: 112 ----CLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSK 167
                 AN +   +     E+ +  + +   H                  L    G    
Sbjct: 56  WVDTVTANLQRYFERYFVPELKQLPADMWKFH-------------AQDKKLTVGEGYLDM 102

Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT--PDVINLGILGFLTERNANRFWI 225
                    S ERP+   G        +I +EA        +    +  +     N    
Sbjct: 103 R--------SAERPENIEGFGYD---VVILNEAGIILKNSYLWDNAIRPMLLDYPNSRAF 151

Query: 226 MTSNPRRLSGKFYEIFNKPL---DDWKRFQIDTRTVEGIDPSFHEGIIARYG-LDSDVTR 281
           +   P+  + +F+++ ++ +    DW  FQI +     +     + +IA  G +DSDV +
Sbjct: 152 IGGVPKGKN-RFFDLASRGMRNEKDWVNFQISSFENPLLRKEEIDELIAELGGVDSDVVK 210

Query: 282 VEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPV 341
            E+ G+F     ++  PL+ IE A  +    +P A  I G D+A +G D +V+ +R G  
Sbjct: 211 QEIYGEFLDTTTNALFPLSQIEAAFGKVRAYEPNAVQIWGLDVARDGDDESVLCVREGYH 270

Query: 342 IEHLFDWSKTDLRTTNNKISG--LVEKYRPDAIIIDANNTGARTCDYLEM--LGYHVYRV 397
           +++L  +          +I     + + +P+AI ID+   GA T D L    LG      
Sbjct: 271 VKNLEGFRIASTTELAREIYRRYEMSEKKPEAIFIDSVGVGAGTFDRLCEFGLGAICREA 330

Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIES 457
               +A +     N+R E++  + +     ++  H  L + L+ ++         L +  
Sbjct: 331 KASYKATNEAKFANKRAEMYFALKEKFHLLTMNAHEKLKKQLQMIEFQYDRKERYLILPK 390

Query: 458 K--RVKGAKSTDYSDGLMYTFAENPPRSDMDFGRCPSY 493
              + +   S DY+D L  TF ++   +     +   Y
Sbjct: 391 DELKKEYGTSPDYADALALTFFDDVMSARRTEEKRQRY 428


>gi|153806881|ref|ZP_01959549.1| hypothetical protein BACCAC_01156 [Bacteroides caccae ATCC 43185]
 gi|149131558|gb|EDM22764.1| hypothetical protein BACCAC_01156 [Bacteroides caccae ATCC 43185]
          Length = 513

 Score =  229 bits (584), Expect = 8e-58,   Method: Composition-based stats.
 Identities = 85/492 (17%), Positives = 150/492 (30%), Gaps = 92/492 (18%)

Query: 56  LEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP--------- 106
            + +         ++          A+++G   GK  + A   L  M   P         
Sbjct: 27  RDALCARLDREQQAIIESVQHNPMTAVASGTARGKDFVAACASLCFMYLTPRFNEKGVLV 86

Query: 107 -GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGID 165
               V   A +  Q+K  +  E+ + +     K  F               ++   +  D
Sbjct: 87  GNTKVAMTAPTGRQVKNIMTPEIRRLIRAARTKFPFCCP----------GRLVADDIRTD 136

Query: 166 SKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWI 225
            + +        +   +++ G H    M +I  EASG  +++   I G L     N   +
Sbjct: 137 YEEWFLTGFKADDNATESWSGFHAANTMFVIT-EASGISEIVYNAIEGNL---QGNSRML 192

Query: 226 MTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHE------------------ 267
           +  NP   +G            + +F++ +   E +                        
Sbjct: 193 IVFNPNITTGYAARAMKSDR--FAKFRLSSLNAENVVKKQIVIPGQVDYEWVKDKVINWC 250

Query: 268 ---------------GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCP 312
                              +    +D+ RV+V G FP+   D  IP   IE A       
Sbjct: 251 SPIQQTDFNEGEGDFNWEGKLYRPNDLFRVKVLGMFPKVSEDVLIPYEWIEIANRNWQEL 310

Query: 313 D-----PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL---V 364
                 P     +G D+A  G DN+V+  R G  +   FD  ++  R  +  + G+    
Sbjct: 311 QASGFIPAKSCKLGVDVAGMGRDNSVLCPRYGNYV-PQFDVHQSAGRADHMHVVGMTIPY 369

Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLE------------FCRNR 412
            K +     ID    GA     L                   E               N 
Sbjct: 370 LKKKGAKAFIDTIGEGAGVYSRLLEE-----EFTNAFSCKYSEGTDGLHDITGEYEFANM 424

Query: 413 RTELHVKMADWL----EFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKS 465
           R  L+  + DWL     F + +     +    +   +   + G++ IE K   + +  +S
Sbjct: 425 RAYLYWALRDWLNPKNGFGAALPPCDQLMEEATETKWKFLSNGKVIIEPKEDVKKRIKRS 484

Query: 466 TDYSDGLMYTFA 477
            DY D L  TF 
Sbjct: 485 PDYMDALANTFY 496


>gi|282598783|ref|YP_003359102.1| putative large subunit terminase [Clavibacter phage CMP1]
 gi|262212571|gb|ACY35907.1| putative large subunit terminase [Clavibacter phage CMP1]
          Length = 872

 Score =  229 bits (584), Expect = 8e-58,   Method: Composition-based stats.
 Identities = 88/428 (20%), Positives = 150/428 (35%), Gaps = 48/428 (11%)

Query: 91  TTLNAWLVLWLMSTRP--GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           T L   LV W +S  P    SV+  A    Q+   ++  +    +L   +   +     +
Sbjct: 424 TRLAGDLVTWFVSVFPPEETSVMVSAPIREQIDVMMFRYLRDNYNLAIERE--QPLIGEI 481

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVIN 208
              P++         +         R        +F G H+ + +A++ DEA G P+ + 
Sbjct: 482 TKWPYWQVGAPLDKKLVMPK-----RPADGNLISSFQGIHDGH-VAVVLDEAGGLPEDLY 535

Query: 209 LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN--KPLDDWKRFQIDTRTVEGIDPSFH 266
           +G     T  +A    +   NP + +  F+E F   +    W RF I             
Sbjct: 536 IGANAVTTNFHARI--LAIGNPDKRNTPFHERFTDTEKFSSWNRFTIGAEDTPNFTGEKI 593

Query: 267 EG------------------IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308
                               +  R      V   +V G FP+ D  +F   ++I    + 
Sbjct: 594 YEDPAKDEDVKKHLVQVSWAVEMRKSARPSVVAAKVDGNFPESDDTTFFDQSVINRGYST 653

Query: 309 EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDL---RTTNNKISGLVE 365
           E  P+      MG DI+ +G D +V  +  G  I    +W++ D      +  +I     
Sbjct: 654 EIEPESTDFKYMGVDISYQGEDQSVAYINHGGQIRIADEWNRFDGAEHIESAIRIHNKAC 713

Query: 366 KYRPDAIIIDANNTGARTCDYLEML------GYHVYRVLGQKRAVDLEFCRNRRTELHVK 419
           +     + ID   TGA     L+ML       Y +  V G  R  +     N R   + +
Sbjct: 714 QEGVQEVRIDMAGTGAGVYSNLKMLDQFKDKPYVLIGVNGANRTPNSNRWLNARAWHYDQ 773

Query: 420 MADWLEFASL---INHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLM 473
               L    +   I    L + ++ L+     N G+L I  K   R  G  S D+ D  +
Sbjct: 774 FRTGLITGKIDITITDVDLKKEME-LQPSTFTNRGQLQITRKDDMRKMGISSPDHLDAAI 832

Query: 474 YTFAENPP 481
           Y+  +  P
Sbjct: 833 YSAIDTTP 840


>gi|303257560|ref|ZP_07343572.1| putative terminase B protein [Burkholderiales bacterium 1_1_47]
 gi|302859530|gb|EFL82609.1| putative terminase B protein [Burkholderiales bacterium 1_1_47]
          Length = 330

 Score =  228 bits (582), Expect = 1e-57,   Method: Composition-based stats.
 Identities = 72/301 (23%), Positives = 118/301 (39%), Gaps = 17/301 (5%)

Query: 195 IINDEASGTPDVIN-LGILGFLTERNANRFWIMTSNPRR--LSGKFYE----IFNKPLDD 247
           ++ DE +     +    I   L +R     +     P+   L  + Y+    + +K   D
Sbjct: 6   VVIDEVAQIKPTLWGEVIRPALADRKGWAAF--IGTPKGINLFSQLYDQALNLMSKGDPD 63

Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
           W            ID      +  +  +  +  R E    F     +  IP++ I  A N
Sbjct: 64  WIAMLYSVEQTHVIDEKELAAL--KVEMSENEFRQEFLCDFSAAQDNGLIPIDDIRAAAN 121

Query: 308 REPCPDPY--APLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365
           +      Y  APLI G D+A  G D +V+  RRG V        K D     ++I+  + 
Sbjct: 122 KFYRESEYMGAPLIYGIDVARFGSDASVIFKRRGLVAFEPIVIRKFDNMALADRIAVEMA 181

Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE 425
           K +PDA+ ID+   G    D L  + + V  V    +A+D E   NRR E+   MA W++
Sbjct: 182 KEKPDAVFIDS-GAGQGVIDRLRQMRFDVVEVPFGAQAIDKEQFANRRMEMWWHMAQWIK 240

Query: 426 FASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPR 482
               I    ++Q      ++     G   +E+K   + +  +S D +D L  TFA     
Sbjct: 241 QGGAIPPDPVLQGDLGAPTYGYTPKGPKILEAKDKLKERIGRSPDLADALALTFAAPVAP 300

Query: 483 S 483
            
Sbjct: 301 K 301


>gi|282880015|ref|ZP_06288737.1| hypothetical protein HMPREF9019_0946 [Prevotella timonensis CRIS
           5C-B1]
 gi|281306129|gb|EFA98167.1| hypothetical protein HMPREF9019_0946 [Prevotella timonensis CRIS
           5C-B1]
          Length = 459

 Score =  225 bits (573), Expect = 1e-56,   Method: Composition-based stats.
 Identities = 81/466 (17%), Positives = 156/466 (33%), Gaps = 87/466 (18%)

Query: 80  GAISAGRGIGKTTLNAWLVLWLMSTRP----------GISVICLANSETQLKTTLWAEVS 129
            A+++G   GK  + A   +  M   P             +   A +  Q    +  EV+
Sbjct: 2   VAVASGTSRGKDFVAACAAMCFMYLTPRWNINHRLIQNTKIAMTAPTGRQCINIMIPEVA 61

Query: 130 KWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189
           +                          +L   +  ++  +       S++  + + G H 
Sbjct: 62  RLFRNASVLP---------------GRMLSDGIRTNNAEWFLTAFKASDDNTEAWSGFHA 106

Query: 190 TYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWK 249
              M ++  EASG  +     I G L     N   ++  NP   +G   +        +K
Sbjct: 107 VNTMFVVT-EASGVSETTFNAIEGNL---QGNSRLLLVFNPNVTTGYAAKAMKSSR--FK 160

Query: 250 RFQIDTRTVEGI-----------DPSFHEGIIARY----------------------GLD 276
           +F++++   E +           D  + +  +  +                         
Sbjct: 161 KFRLNSLNAENVIKKKNVIPGQVDYEWVKDKVHNWCELIQKEDFNNGEGDFMFEDSFYRP 220

Query: 277 SDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL-----IMGCDIAEEGGDN 331
           +D+ R++V G FP+   D+ IP   +E A +R    +    +      +G D+A  G D+
Sbjct: 221 NDLFRIKVLGLFPKASEDTLIPFEWLELAHDRWKKLNAEDFVPRKYARVGIDVAGMGRDS 280

Query: 332 TVVVLRRGPVIEHLFDWS---KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
           +  VLR G  +  +       K D      +    + + +   ++ID    GA     L 
Sbjct: 281 SCFVLRYGNYVPEIKIHQSGGKADHMKVAGEAVQWLVE-KNTKVMIDTIGEGAGVYSRLL 339

Query: 389 MLGY-HVYRVLGQKRAVDLE------FCRNRRTELHVKMADWL----EFASLINHSGLIQ 437
            LGY + Y     +    L          N R   +  + DWL     F   +     + 
Sbjct: 340 ELGYDNAYSCKFSEGTKGLHDITGQYEFANMRAYCYWAVRDWLNPKNGFNPALPPCDELD 399

Query: 438 NLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENP 480
              +   +   ++G + IE K   + +  +S D +D L+ TF  N 
Sbjct: 400 AELTEVHWSFQSSGSIIIEPKENIKSRLKRSPDRADALISTFYPNT 445


>gi|212703250|ref|ZP_03311378.1| hypothetical protein DESPIG_01292 [Desulfovibrio piger ATCC 29098]
 gi|212673294|gb|EEB33777.1| hypothetical protein DESPIG_01292 [Desulfovibrio piger ATCC 29098]
          Length = 330

 Score =  225 bits (573), Expect = 1e-56,   Method: Composition-based stats.
 Identities = 64/301 (21%), Positives = 116/301 (38%), Gaps = 23/301 (7%)

Query: 197 NDEASGTPDVIN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD-------W 248
            DE +     +    +   L +R  +   +    P+  +  F E++ + +         W
Sbjct: 1   MDEVAQMKPEVWGEVVQPALADRRGSA--VFIGTPKG-ANLFAELYQRGMAAQAQGDAAW 57

Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308
                   + + +     E +     L  +  R E+   F     D  IPL  + EA  R
Sbjct: 58  CALSYPVTSTDVLPAEDVERLRRE--LSDNAFRQEMLCDFTASSDDILIPLPDVLEAEAR 115

Query: 309 EPCPDPYA--PLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEK 366
           +   D     P+I+G D+A  G D++V+V R+G  ++        D     ++++  + +
Sbjct: 116 QLAWDDVGGMPVILGVDVARFGADSSVIVRRQGLKVDGPVVMRGLDNMQLADRVAAAIME 175

Query: 367 YRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF 426
            RP A+ IDA   G    D L  LG+ V  V    + +      NRR+E+   +  WL+ 
Sbjct: 176 NRPHAVFIDA-GQGQGVIDRLRQLGHEVIEVPFGGKPLQEGRFANRRSEMWYGLRQWLKS 234

Query: 427 ASLINHSG----LIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479
              +   G     ++   S   +     G + +E K   + +   S D +D L  TFA  
Sbjct: 235 GGKLPDEGDDVPRLRAELSAPLYWYDAAGRMVLEPKDKIKERLGASPDIADALALTFAAP 294

Query: 480 P 480
            
Sbjct: 295 V 295


>gi|320103661|ref|YP_004179252.1| hypothetical protein Isop_2123 [Isosphaera pallida ATCC 43644]
 gi|319750943|gb|ADV62703.1| hypothetical protein Isop_2123 [Isosphaera pallida ATCC 43644]
          Length = 553

 Score =  220 bits (560), Expect = 4e-55,   Method: Composition-based stats.
 Identities = 81/407 (19%), Positives = 133/407 (32%), Gaps = 49/407 (12%)

Query: 49  SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108
             P  W                           ++ G  +GK+ L A L LW + T PG 
Sbjct: 45  GRPDYW----------EGQRRAALALTRARSVVVATGNAVGKSYLAAGLTLWWLYTHPGS 94

Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168
            V+  A S+  L T L+ E+ K L+    +    +  + +         L    G     
Sbjct: 95  LVVATAPSQGLLGTVLFRELQKALA-ASRRRGLGLPGMVVGSDRGTPFSLRVGPGRRLAA 153

Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTS 228
               C   +    +   G H+   M ++ DEASG            LT  N  + ++   
Sbjct: 154 EGWGCLGIATRGVERLAGRHHADLM-VVVDEASGVQPEAWE----ALTSLNPRKLFV-CG 207

Query: 229 NPRRLSGKFYEIFNKPLDDWK-----------RFQIDTRTVEGI----------DPSFHE 267
           NP      F+++  + L +                I +     I          D  F  
Sbjct: 208 NPLTPGTVFHKLHQRGLTEASDPSIPDHARGVALTIPSTASPDINLERSPRGLADRGFIR 267

Query: 268 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCP---DPYAPLIMGCDI 324
               ++G  S +    V G FP   + + I    +++A + E      +P    ++GCD+
Sbjct: 268 EAERQWGRGSPLWLSHVEGVFPTVAVHALIEPGWLDQAASLERSQTYENPPGQPVLGCDL 327

Query: 325 AEE-GGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY--RPDAIIIDANNTGA 381
           A   G D T +V+R    I  L    +         I+ L  K+   P+ I+ D    GA
Sbjct: 328 AAGVGADRTAIVVRDEGGIRELIASDRLAPDEAATLIASLARKHLIAPERILYDGAGLGA 387

Query: 382 RTCDYLEMLG---YHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE 425
                L   G    H   + G   A       N R     ++   L+
Sbjct: 388 ELTTRLARQGPGFVHARAIFGA--ASGGAGFLNHRAWCGWRLRQRLD 432



 Score = 42.4 bits (98), Expect = 0.17,   Method: Composition-based stats.
 Identities = 15/48 (31%), Positives = 28/48 (58%), Gaps = 5/48 (10%)

Query: 435 LIQNLKSLKSFIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAEN 479
           L + L++L+  +V    +LA+E KR    +  +S D +D L+ TF+ +
Sbjct: 508 LREELEALRYRLVGT--KLALEDKRETRRRLGRSPDLADALLITFSVD 553


>gi|186682890|ref|YP_001866086.1| hypothetical protein Npun_R2589 [Nostoc punctiforme PCC 73102]
 gi|186465342|gb|ACC81143.1| hypothetical protein Npun_R2589 [Nostoc punctiforme PCC 73102]
          Length = 543

 Score =  216 bits (551), Expect = 5e-54,   Method: Composition-based stats.
 Identities = 98/512 (19%), Positives = 176/512 (34%), Gaps = 104/512 (20%)

Query: 46  EGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTR 105
           +    P  +    + +   +    +     +     + A  G GK+ + + LV++ +   
Sbjct: 28  QYADDPVGFFKNELGIELTNEQTIIAESVRDRPITNVKAAHGTGKSFIASLLVIYFLFCV 87

Query: 106 PGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGID 165
            G+  I  A SE Q+K  LWAE+ K   L   K       + L     +S+ ++      
Sbjct: 88  GGV-AITTAPSEDQVKWILWAELRKIHGLHKTKLGGRCDIMQL----LFSETVYA----- 137

Query: 166 SKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWI 225
              +    R YSE    +F G H    +  I DEA G    I+ G +  LT   ++   +
Sbjct: 138 ---FGITSRDYSEN---SFQGQHRQKQLL-IEDEADGITPQIDNGFIACLT--GSDNRGL 188

Query: 226 MTSNPRRLSGKFYEI------------FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARY 273
              NP     +F +             F+ P   W  +++    V  + P   E II   
Sbjct: 189 RIGNPVDPQSQFAKTCKLDKRCLTVSAFSHPNVSW-AYELCADGVYRLKPEVAEHIINED 247

Query: 274 G----------------------------------LDSDVTRVEVCGQFPQQDIDSFIPL 299
           G                                    S   +  V G++ +   D  I L
Sbjct: 248 GEIKPQQEWPPEFPRDRIPGAISIDWIERVRREKFETSAYWKGRVMGEYAEDAADGIILL 307

Query: 300 NIIEEALNREPCPDPYA-------PLIMGCDIAEEGGDNTVVVLRRGPVIEHL-FDWSKT 351
            ++++A +       Y        P  +G D+  +GGD   + L RGPV+  +    +K 
Sbjct: 308 TLLKQARSLYDQNPQYWDAIAKRYPWRLGLDVG-DGGDPHALALLRGPVLYEVQIHPTKG 366

Query: 352 DLRTT-------NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQ---- 400
           DL  T        ++I  L   Y   +I +D    GA T   L+  GY            
Sbjct: 367 DLLDTERAADIAASQIKLLGTGY---SIAVDNTGVGAGTLAKLKKTGYQALPCRFGDVPS 423

Query: 401 -----KRAVDLEFCRNRRTELHVKMADWLEFASL-----INHSGLIQNLKSLKSFIVPNT 450
                ++    +   N + EL+ +  + L    +      N   + Q+L + + +     
Sbjct: 424 YKKKKQKEEPKQKFTNLKAELYWQFRELLMGGRIAIAPLENEEYVFQDLTATR-YSTNTK 482

Query: 451 GELAIESK---RVKGAKSTDYSDGLMYTFAEN 479
            E+  E K   + +  +S D S+ ++      
Sbjct: 483 DEIFCEPKDKTKSRLGRSPD-SEAVIIALTNP 513


>gi|294789575|ref|ZP_06754810.1| putative terminase B protein [Simonsiella muelleri ATCC 29453]
 gi|294482512|gb|EFG30204.1| putative terminase B protein [Simonsiella muelleri ATCC 29453]
          Length = 516

 Score =  215 bits (548), Expect = 1e-53,   Method: Composition-based stats.
 Identities = 78/450 (17%), Positives = 147/450 (32%), Gaps = 63/450 (14%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRP----------GISVICLANSETQLKTTLWAEV 128
           K ++ +G G GKT     + LW +   P          G +    A +  Q+   +W E+
Sbjct: 49  KVSVVSGTGTGKTMSFGRIALWHLLCFPVAKYDGKIEIGSNTYIGAPAIKQVGDGVWKEI 108

Query: 129 SKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI-----DSKHYSTMCRTYSEERPDT 183
           +  +  +                 W ++ +               +        + +  +
Sbjct: 109 TDAVQAMRAN----------RATAWLAEYIVVQAERVYIIDYKATWFITKFAMQQGQSVS 158

Query: 184 FVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK 243
             G H  Y + II DEA+G  D     I G  T+       ++ S   +  G FYE  +K
Sbjct: 159 IAGKHRFYQL-IIIDEAAGVSDEHYEVINGTQTQGGNRT--LLASQGVKQGGFFYETHHK 215

Query: 244 ----PLDDWKRFQIDTRTVEGIDPSFHEGIIAR-YGLDSDVTRVEVCGQFPQQDIDSFIP 298
                  +W      +     +   + E +  +  G ++   RV V G+F + + ++ + 
Sbjct: 216 LNKENGGNWTALCFSSENSPFVTTEWLENVALQAGGKNTTEYRVRVLGKFAENEHENLLT 275

Query: 299 LNIIEEALNREPCPDPYAP--LIMGCDIAEE--------------GGDNTVVVLRRGPVI 342
              IE  ++  P  +   P   ++  D+                 G D+     RR    
Sbjct: 276 RAQIEPRIDTLPIIEKGEPFGWLLLVDVGAGEYRDDSVCIAAKVIGDDDFGENARRVEYE 335

Query: 343 EHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKR 402
            +    +  ++      I     +     I++DA   G   C  LE  G+ V R+     
Sbjct: 336 ANPIITNTKNIHEFRGLIVEKAAQLSNVRILVDAGGIGLELCKMLENDGFDVERINWGNP 395

Query: 403 AV---DLEFCRNRRTELHVKMADWLEFASLINH-------SGLIQNLKSLKS-FIVPNTG 451
                  E   N+R    V+  D +    ++            +     +   F    T 
Sbjct: 396 CFKRAYKERFFNQRACAMVRWRDAIRQGRVLFPKMENGLREKFLMQASRIPYGFTDTGTA 455

Query: 452 ELAIESK---RVKGAKSTDYSDGLMYTFAE 478
              I  K   R +G KS D +D + + F +
Sbjct: 456 RYQIAQKAEMRKRGIKSPDIADAMSFAFLD 485


>gi|315649222|ref|ZP_07902312.1| hypothetical protein PVOR_28644 [Paenibacillus vortex V453]
 gi|315275441|gb|EFU38799.1| hypothetical protein PVOR_28644 [Paenibacillus vortex V453]
          Length = 189

 Score =  211 bits (538), Expect = 2e-52,   Method: Composition-based stats.
 Identities = 65/225 (28%), Positives = 93/225 (41%), Gaps = 45/225 (20%)

Query: 13  QKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNN 72
             L DL W D +  +F+  ++ F               P  WQ + M  V          
Sbjct: 9   TDLLDLYWDDPV--AFAEDMMGF--------------DPDDWQCDVMMDVT--------- 43

Query: 73  PNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWL 132
              +  + ++ +G+G+GKT L A LV+W +  RP   V+C A ++ QL   LW EVSKWL
Sbjct: 44  ---QFPRTSVRSGQGVGKTGLEAALVIWFLCCRPNPKVVCTAPTKQQLHDVLWTEVSKWL 100

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
                K+  +     ++                 + +    RT    +P+   G H  Y 
Sbjct: 101 ENSMVKNLLKWTKTKVYMIG------------HEQRWFATARTA--NKPENMQGFHEDY- 145

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
           M  I DEASG  D I   ILG L+   A    +M  NP R SG F
Sbjct: 146 MLFIVDEASGVSDPIMEAILGTLS--GAENKLLMCGNPTRTSGVF 188


>gi|119386463|ref|YP_917518.1| PBSX family phage terminase large subunit [Paracoccus denitrificans
           PD1222]
 gi|119377058|gb|ABL71822.1| phage terminase, large subunit, PBSX family [Paracoccus
           denitrificans PD1222]
          Length = 441

 Score =  208 bits (529), Expect = 2e-51,   Method: Composition-based stats.
 Identities = 88/424 (20%), Positives = 153/424 (36%), Gaps = 30/424 (7%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GK+   A  ++    T PG+S ICL + +  L  +++  + +  + L        
Sbjct: 26  GGRGSGKSWDRAMHMIVRHLTEPGLSSICLRDVQKSLDQSVFKLLVETAARLGVAEAIR- 84

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
                   P  SD +  + G     ++ M   ++ E   +  G           +EA+  
Sbjct: 85  --------PVESDRIIRTPGNGIIAFNGMNE-FNAENIKSLEGFD-----IAWWEEAATA 130

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI---DTRTVEG 260
                  +   L +  +  ++  T NPR  S     +  +         +   + R    
Sbjct: 131 GQGPLDMLRPTLRKPGSQIWF--TYNPRLRSDPVDVMMRQDARFADSRTVVEANWRDNPF 188

Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIM 320
             P   E  +     D    R    G +  +    FI   ++ EA+ R+P       L++
Sbjct: 189 RGPELEEERLLDLAGDEARYRHIWEGDYEAESDMQFIGGGLVREAMARQPFSQIGDELVL 248

Query: 321 GCDIAEEGGDNTVVVLRRGPVI--EHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378
           G D+A  G D +V+  RRG     E        D      ++   +++  PD + ID   
Sbjct: 249 GVDVARFGDDRSVIWARRGRDAQTELPIIMKGADTMAVAARVMAEIDRLHPDGVFIDEGG 308

Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAV----DLEFCRNRRTELHVKMADWLEFASLINHSG 434
            G    D    +GY V  V    +A      +  CRN+R ++   M +WL     I  S 
Sbjct: 309 VGGGVIDRCRQMGYSVVGVNFGGKADRAIEGVPKCRNKRAQMWATMREWLRSGGCIPDSR 368

Query: 435 LIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN-PPRSDMDFGRC 490
            ++   +   +       + IE K   + +G  S D +D L  TFA    PRS       
Sbjct: 369 DLEMDLTGPLYSFDVNNAIEIEKKSDMKKRGVSSPDEADALALTFAYPVVPRSIQRQQEA 428

Query: 491 PSYQ 494
            + +
Sbjct: 429 RAQE 432


>gi|284162607|ref|YP_003401230.1| hypothetical protein Arcpr_1511 [Archaeoglobus profundus DSM 5631]
 gi|284012604|gb|ADB58557.1| protein of unknown function DUF264 [Archaeoglobus profundus DSM
           5631]
          Length = 435

 Score =  205 bits (522), Expect = 1e-50,   Method: Composition-based stats.
 Identities = 91/449 (20%), Positives = 162/449 (36%), Gaps = 68/449 (15%)

Query: 49  SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108
           S P ++   F++         +     +     + AGR  GKT   A   ++   T PG 
Sbjct: 13  SDPVTFAKVFLDWGAHPAQAQILRDRHQF--ITVVAGRRFGKTECMAVSAIYYALTNPGS 70

Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168
               +A S  Q    ++ ++ ++LS              ++  P++    H     DS  
Sbjct: 71  IQFVIAPSYDQ-SNIMFGQIVQFLSKSI----LGCMIRRIYKTPFH----HIIFKNDS-- 119

Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDV-INLGILGFLTERNANRFWIMT 227
              +    S  +P+   GH       II DEA+  PD  I+  I   L + N +  WI  
Sbjct: 120 ---VIHARSASKPEFLRGHKA---HRIILDEAAFIPDDVISNIIEPMLADYNGS--WIKI 171

Query: 228 SNPRRLSGKFYEIFNK----PLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283
             P   +  FY+ + K       D+  ++  +     I   F E     YG +S + R E
Sbjct: 172 GTPFGKN-HFYDTYLKGQSPDFPDYSSYRFPSTVNPHISHEFIEKKKREYGENSIIFRTE 230

Query: 284 VCGQFPQQ------------DIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN 331
              +F +             ++D+ I L    E ++++         ++GCD+A+     
Sbjct: 231 YLAEFVEDQNAVFRWADIQKNVDNSIELIDSAENVSKQ--------YVIGCDLAKYQDYT 282

Query: 332 TVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP---DAIIIDANNTGARTCDYLE 388
            +VVL        L  + + + R     I  L E YR      ++ID+   G    + L+
Sbjct: 283 VIVVLDVTEKPYKLVHFERFNRRPYAEVIMRLKELYRRFNYAKVLIDSTGVGDPVLEDLQ 342

Query: 389 MLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL--INHSGLIQNLKSLKSFI 446
            +G   Y V   K  V L            ++   LE   +       L++ L+  + + 
Sbjct: 343 DVGAEGY-VFTPKSKVQLIQ----------RLQAALENGEIRYPYIEELVKELQFFE-YQ 390

Query: 447 VPNTGELAIESKRVKGAKSTDYSDGLMYT 475
           +  TG + +E    +     DY   L   
Sbjct: 391 LTRTG-IKME---ARQGFHDDYVIALALA 415


>gi|168704975|ref|ZP_02737252.1| hypothetical protein GobsU_35915 [Gemmata obscuriglobus UQM 2246]
          Length = 519

 Score =  186 bits (473), Expect = 5e-45,   Method: Composition-based stats.
 Identities = 84/507 (16%), Positives = 153/507 (30%), Gaps = 94/507 (18%)

Query: 46  EGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEV-FKGAISAGRGIGKTTLNAWLVLWLMST 104
           +  + P  +  + ++V        +     +  ++  + A   +GK+ L   LV W   T
Sbjct: 29  KYRTDPAGYARDILKVKWWAKQVEIAEALCKPPYRVLVKASHSVGKSHLAGGLVNWWYDT 88

Query: 105 RPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI 164
           R     +  A ++ Q+K  LW EV +     P     +M  L   P  +           
Sbjct: 89  RFPGVCLTTAPTDRQVKDVLWKEVRRQRRKRPGFVGPKMPRLESDPTHF----------- 137

Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFW 224
                      ++     +F G H    + +I DEA G               + A   W
Sbjct: 138 --------AHGFTARDATSFQGQHEA-SILLIFDEAVGIDGDFWEAAESMC--QGAEYGW 186

Query: 225 IMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGID-------------------PSF 265
           +   NP   + + Y +  +    W    I       I                       
Sbjct: 187 LAIFNPTDTTSRAY-LEEQAGSRWTVIDIPATEHPNIAAELVARPPEYPSAVRLNWLRDR 245

Query: 266 HEGIIAR---------------------YGLDSDVTRVEVCGQFPQQDIDSFIPLNI--I 302
            E    R                     +     +    +  ++P      +       +
Sbjct: 246 LEQWAERIEPGDATPTDIQFPNPDGSPQWWRPGPLADARLLARWPASGCGVWSDPVWRSV 305

Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           E A   +P P+ + P  +GCD+A  G D T + +R G V  H    +  D + T  ++  
Sbjct: 306 ERAAP-DPVPERWLP-QIGCDVARFGEDWTELHVRCGNVSLHHEAHNGWDTKRTTERLKQ 363

Query: 363 LVEKYRPDAIIIDANNT---------------GARTCDYLEMLGYHVYRVLGQKRAVDLE 407
           +  ++   A  +                    G       +  G++   V     A D E
Sbjct: 364 MCGEWAQWATQLRDRGADPIDPRRIPVKVDDDGVGGGVTDQRGGFNFQAVSSASNANDKE 423

Query: 408 FCRNRRTELHVKMADWLEFAS-----LINH--SGLIQNLKSLKSFIVPNTGELAIESK-- 458
              NRR+EL   +AD  +        L  H    L +      ++ +   G   +E K  
Sbjct: 424 AYPNRRSELWFTVADRAKRGELFLSNLPAHVRQELKRQ-AMAPTYKLDAAGRRVVEPKED 482

Query: 459 -RVKGAKSTDYSDGLMYTFAENPPRSD 484
            + +  +S D  D +   + E   R  
Sbjct: 483 TKERIGRSPDGMDAVNLAYYEPSGRGG 509


>gi|320091491|gb|ADW08983.1| terminase-like protein [Clavibacter phage CN77]
          Length = 414

 Score =  184 bits (468), Expect = 2e-44,   Method: Composition-based stats.
 Identities = 73/393 (18%), Positives = 137/393 (34%), Gaps = 60/393 (15%)

Query: 157 VLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLT 216
                 G  ++  +   R   ++   TF G        +  DEA G P  +  G    +T
Sbjct: 11  KYKKMDGSGNEAIAFGKRPTDQDIVSTFQGT-RKLRTFVALDEAGGVPPELFTGAEAVMT 69

Query: 217 ERNANRFWIMTSNPRRLSGKFYEIFNKP--LDDWKRFQIDTRTVEGIDPS---------- 264
            +++    +   NP     +F+ IF  P  +D+W  F I    +  +             
Sbjct: 70  GQDSKI--VAIGNPDSRGTEFHRIFTVPALMDEWNTFTISAYDLPTVTGEVVYPDHPEKQ 127

Query: 265 -------------FHEGIIARYGLDSD-VTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
                         H+  + + G   D     +V G+FP +  ++F P   I+   N   
Sbjct: 128 ERMLKGLTSLDWIQHKERVWKVGGKPDGRFLAKVLGEFPGETDNAFFPQEAIDRG-NDTT 186

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD----------------WSKTDLR 354
              P   +IMG D+A  G D++VV   +G  +                     WSK +  
Sbjct: 187 IDKPEKGIIMGVDLARMGDDDSVVYTNQGGRVRLFKGQVRYSDREGTKTTTGVWSKENTV 246

Query: 355 TTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG------YHVYRVLGQKRAVDLEF 408
            +  ++  +  +     + +D++  G    D LE L       Y +  +     + +   
Sbjct: 247 ASARRVHAIAMQIGAKQVRLDSSGIGGAVFDELEQLEEFDGKCYTLVGINNANSSSNNMR 306

Query: 409 CRNRRTELHVKMADWLEFA--SLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAK-- 464
             N R E H  + D L      L     ++++   + ++ +   G + I  K    ++  
Sbjct: 307 WANIRAENHDNLRDMLIKGYLDLDPEDTMLRDELLVITYKLNLRGAVQITPKDEMKSELN 366

Query: 465 -STDYSDGLMYTFAENPPRSDMDFGRCPSYQYE 496
            S D  D ++Y+ A+     D   G  P  + E
Sbjct: 367 GSPDRLDAVIYSLADLDHIVD---GPQPGERIE 396


>gi|315122636|ref|YP_004063125.1| putative phage terminase, large subunit [Candidatus Liberibacter
           solanacearum CLso-ZC1]
 gi|313496038|gb|ADR52637.1| putative phage terminase, large subunit [Candidatus Liberibacter
           solanacearum CLso-ZC1]
          Length = 301

 Score =  170 bits (430), Expect = 5e-40,   Method: Composition-based stats.
 Identities = 61/170 (35%), Positives = 90/170 (52%), Gaps = 8/170 (4%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQ----L 56
           M+     N E +  L   + S  I  +   F  + + WGE+GTPL     PR+WQ    L
Sbjct: 1   MNATFQPNIEYDTALLQNVLSPAIAGNPLAFTKYMYRWGEEGTPLANCKGPRAWQTEVFL 60

Query: 57  EFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANS 116
           E  E ++ +          +VFK AI++ RGIGKT L AW+  W +STR G +V+  ANS
Sbjct: 61  ELAEFIEKNKEAKRLGKPLQVFKLAIASARGIGKTALVAWITYWFLSTRIGCTVVISANS 120

Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQS----LSLHPAPWYSDVLHCSL 162
           + Q KTT +AE+ +W SL  N H+FE       L+   +PW ++ +  +L
Sbjct: 121 DDQCKTTSFAEIRRWHSLAKNAHFFEANIAEALLAGGCSPWQAEPVAKTL 170


>gi|261381054|ref|ZP_05985627.1| phage terminase, large subunit, PBSX family [Neisseria subflava
           NJ9703]
 gi|284796087|gb|EFC51434.1| phage terminase, large subunit, PBSX family [Neisseria subflava
           NJ9703]
          Length = 450

 Score =  161 bits (408), Expect = 2e-37,   Method: Composition-based stats.
 Identities = 57/320 (17%), Positives = 116/320 (36%), Gaps = 40/320 (12%)

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQID 254
             +EA    D     ++  + +  +  +   T NP+ +    Y+ F   P DD     ++
Sbjct: 117 WIEEAENVSDESWNILIPTIRKAGSEIWL--TWNPKNILDPTYQRFVVNPPDDMVDIVVN 174

Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC--P 312
                 +         +    D D+ R    G+       S I    I+ A++       
Sbjct: 175 YTDNIYLPEVLRLEAESCKARDYDLYRHIWLGEPVADSELSVIKPKWIDAAIDSHIKLGF 234

Query: 313 DPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI 372
           +     I+G D+A+EG D +  +LR G V+  + +W   D+  + +K+    ++ + D I
Sbjct: 235 EATGQRILGFDVADEGDDASATILRHGSVVIDMDEWRGQDVIYSADKVYLYGQEAKADKI 294

Query: 373 IIDANNTGART-------CDYLEMLGYHVYRVLGQKRA------VDLEFCRNRRTELHVK 419
           + D+   GA            ++ +G++    + +  A       + +   N + +    
Sbjct: 295 VYDSIGVGAGVKAQFRRKTGKVQTIGFNAGGSVFKPEARYTDDKKNKDMFSNIKAQAWWM 354

Query: 420 MAD-------WLEFASLINHSGLI------------QNLKSLKSFIVPNTGELAIESKR- 459
           + +        +EF        LI            +   S       N G + +ESK+ 
Sbjct: 355 VRERFYKTWRAIEFGDTYPIDELISISGSLKDLEYLKAELSRPRVDYDNNGRVKVESKKD 414

Query: 460 --VKGAKSTDYSDGLMYTFA 477
              +G  S + +D L+  FA
Sbjct: 415 MAKRGIPSPNRADALIMAFA 434


>gi|329122215|ref|ZP_08250807.1| phage terminase large subunit [Haemophilus aegyptius ATCC 11116]
 gi|327474100|gb|EGF19511.1| phage terminase large subunit [Haemophilus aegyptius ATCC 11116]
          Length = 452

 Score =  160 bits (404), Expect = 7e-37,   Method: Composition-based stats.
 Identities = 65/440 (14%), Positives = 143/440 (32%), Gaps = 63/440 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GK+   A  ++    T+P I V+C              E+ K +S    +   + 
Sbjct: 27  GGRGSGKSFSIARALVLRAYTQP-IRVLCC------------REIQKSISDSVIQMLAD- 72

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
           Q   L    ++       +G +   ++      +     +  G        +  +E    
Sbjct: 73  QIEMLGLQAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGID 262
                  ++  + E  +    I++ NP+ +    Y+ F   P +  K   ++ +      
Sbjct: 128 SKESWDVLIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185

Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIM 320
               E +      D ++ R    G+         I    I+ A++  ++         I+
Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGEPVADSDKVIIKPLWIDAAVDAHKKLGFVAAGRKII 245

Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380
           G D+A+EG D        G V+  + +W   D+  + ++      ++  + I+ D+   G
Sbjct: 246 GFDVADEGSDANANAFVHGSVVLRMDEWRGEDVIGSADRTRLNALEFGANEIVYDSIGVG 305

Query: 381 ART---CDYLEMLGYHVYRVLGQK-----------RAVDLEFCRNRRTELHVKMA----- 421
           A        L+     +                     + +   N + +   ++      
Sbjct: 306 AGVKAHYHRLDDKSIRINGFNAGGAVFEPDVEYVYGKTNRDMFANIKAQAWWRLRDRFYK 365

Query: 422 --------------DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAK 464
                         + +  +S I     ++   +         G + +ESK   + +G  
Sbjct: 366 TYRAITYEEQYPVDEMISLSSDIRDLEYLKAELARPYVDYDGNGRVKVESKKDMKKRGIP 425

Query: 465 STDYSDGLMYTFAENPPRSD 484
           S + +D L+  FA   P+ D
Sbjct: 426 SPNKADALVMCFA---PKED 442


>gi|229844502|ref|ZP_04464642.1| predicted phage terminase large subunit [Haemophilus influenzae
           6P18H1]
 gi|229812751|gb|EEP48440.1| predicted phage terminase large subunit [Haemophilus influenzae
           6P18H1]
          Length = 452

 Score =  158 bits (400), Expect = 2e-36,   Method: Composition-based stats.
 Identities = 65/440 (14%), Positives = 144/440 (32%), Gaps = 63/440 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GK+   A  ++    T+P I V+C              E+ K +S    +   + 
Sbjct: 27  GGRGSGKSFSIARALVLRAYTQP-IRVLCC------------REIQKSISDSVIQMLAD- 72

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
           Q   L    ++       +G +   ++      +     +  G        +  +E    
Sbjct: 73  QIEMLGLQAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGID 262
                  ++  + E  +    I++ NP+ +    Y+ F   P +  K   ++ +      
Sbjct: 128 SKESWDVLIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185

Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIM 320
               E ++     D ++ R    G+         I    I+ A++  ++         I+
Sbjct: 186 KELMEDMVQMRERDYELYRHVYEGEPVADSDKVIIKPLWIDAAVDAHKKLGFVAAGRKII 245

Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380
           G D+A+EG D        G V+  + +W   D+  + ++      ++  + I+ D+   G
Sbjct: 246 GFDVADEGSDANANAFVHGSVVLRMDEWHGEDVIGSADRTRLNALEFGTNEIVYDSIGVG 305

Query: 381 ART---CDYLEMLGYHVYRVLGQK-----------RAVDLEFCRNRRTELHVKMA----- 421
           A        L+     +                     + +   N + +   ++      
Sbjct: 306 AGVKAHYHRLDDKSIRINGFNAGGAVFEPDAEYVYGKTNRDMFANIKAQAWWRLRDRFYK 365

Query: 422 --------------DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAK 464
                         + +  +S I     ++   +         G + +ESK   + +G  
Sbjct: 366 TYRAITYEEQYPVDEMISLSSDIRDLEYLKAELARPYVDYDGNGRVKVESKKDMKKRGIP 425

Query: 465 STDYSDGLMYTFAENPPRSD 484
           S + +D L+  FA   P+ D
Sbjct: 426 SPNKADALVMCFA---PKED 442


>gi|260580755|ref|ZP_05848581.1| phage terminase large subunit [Haemophilus influenzae RdAW]
 gi|260092572|gb|EEW76509.1| phage terminase large subunit [Haemophilus influenzae RdAW]
          Length = 447

 Score =  153 bits (386), Expect = 7e-35,   Method: Composition-based stats.
 Identities = 72/442 (16%), Positives = 145/442 (32%), Gaps = 59/442 (13%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GK+   A  ++      P + V+C              E+ K +S    +   + 
Sbjct: 27  GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQMLAD- 72

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
           Q   L    ++       +G +   ++      +     +  G        +  +E    
Sbjct: 73  QIEMLSLQAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGID 262
                  ++  + E  +    I++ NP+ +    Y+ F   P +  K   ++ +      
Sbjct: 128 SKESWDILIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185

Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIM 320
               E +      D ++ R    G+       + I    IE A++   +          +
Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGEPVADSDLAIIKPVWIEYAVDAHLKLGFTAKGMKKV 245

Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380
           G D+A+EG D+       G V+  +  W   D+  + N+ +    K++ D II D+   G
Sbjct: 246 GFDVADEGADSNANAFVHGSVVLDIEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGVG 305

Query: 381 ARTCDYLEMLG--YHVYRVLGQKRAVDLE-----------FCRNRRTELHVKMAD----- 422
           A    + + L     V            E              N + +    + D     
Sbjct: 306 AGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRDRFYKT 365

Query: 423 --WLEFASLINHSGLI------------QNLKSLKSFIVPNTGELAIESK---RVKGAKS 465
              +++  +     LI            +   S       N G + +ESK   + +G  S
Sbjct: 366 YRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIPS 425

Query: 466 TDYSDGLMYTFAENPPRSDMDF 487
            + +D L+  +A   P+S +D 
Sbjct: 426 PNMADALVMCYAPTKPKSLLDL 447


>gi|319776448|ref|YP_004138936.1| phage terminase large subunit [Haemophilus influenzae F3047]
 gi|319897217|ref|YP_004135412.1| phage terminase large subunit [Haemophilus influenzae F3031]
 gi|329123931|ref|ZP_08252483.1| phage terminase large subunit [Haemophilus aegyptius ATCC 11116]
 gi|317432721|emb|CBY81084.1| predicted phage terminase large subunit [Haemophilus influenzae
           F3031]
 gi|317451039|emb|CBY87270.1| predicted phage terminase large subunit [Haemophilus influenzae
           F3047]
 gi|327468126|gb|EGF13613.1| phage terminase large subunit [Haemophilus aegyptius ATCC 11116]
          Length = 447

 Score =  152 bits (383), Expect = 1e-34,   Method: Composition-based stats.
 Identities = 72/442 (16%), Positives = 144/442 (32%), Gaps = 59/442 (13%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GK+   A  ++      P + V+C              E+ K +S    +   + 
Sbjct: 27  GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQMLAD- 72

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
           Q   L    ++       +G +   ++      +     +  G        +  +E    
Sbjct: 73  QIEMLGLQAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGID 262
                  ++  + E  +    I++ NP+ +    Y+ F   P +  K   ++ +      
Sbjct: 128 SKESWDILIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185

Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIM 320
               E +      D ++ R    G+       + I    IE A++   +          +
Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGEPVADSDLAIIKPVWIESAVDAHLKLGFTTKGMKKV 245

Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380
           G D+A+EG D        G V+  +  W   D+  + N+ +    K++ D II D+   G
Sbjct: 246 GFDVADEGADANANAFVHGSVVLGVEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGVG 305

Query: 381 ARTCDYLEMLG--YHVYRVLGQKRAVDLE-----------FCRNRRTELHVKMAD----- 422
           A    + + L     V            E              N + +    + D     
Sbjct: 306 AGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKDKKNQDMFSNIKAQSWWALRDRFYKT 365

Query: 423 --WLEFASLINHSGLI------------QNLKSLKSFIVPNTGELAIESK---RVKGAKS 465
              +++  +     LI            +   S       N G + +ESK   + +G  S
Sbjct: 366 YRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIPS 425

Query: 466 TDYSDGLMYTFAENPPRSDMDF 487
            + +D L+  +A   P+S +D 
Sbjct: 426 PNMADALVMCYAPTKPKSLLDL 447


>gi|145629503|ref|ZP_01785301.1| predicted phage terminase large subunit [Haemophilus influenzae
           22.1-21]
 gi|145641440|ref|ZP_01797019.1| predicted phage terminase large subunit [Haemophilus influenzae
           R3021]
 gi|144978346|gb|EDJ88110.1| predicted phage terminase large subunit [Haemophilus influenzae
           22.1-21]
 gi|145273983|gb|EDK13850.1| predicted phage terminase large subunit [Haemophilus influenzae
           22.4-21]
 gi|309750959|gb|ADO80943.1| Probable bacteriophage terminase, large subunit [Haemophilus
           influenzae R2866]
          Length = 447

 Score =  151 bits (381), Expect = 2e-34,   Method: Composition-based stats.
 Identities = 72/442 (16%), Positives = 144/442 (32%), Gaps = 59/442 (13%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GK+   A  ++      P + V+C              E+ K +S    +   + 
Sbjct: 27  GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQMLAD- 72

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
           Q   L    ++       +G +   ++      +     +  G        +  +E    
Sbjct: 73  QVEMLGLQDFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGID 262
                  ++  + E  +    I++ NP+ +    Y+ F   P +  K   ++ +      
Sbjct: 128 SKESWDILIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185

Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIM 320
               E +      D ++ R    G+       + I    IE A++   +          +
Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGEPVADSDLAIIKPVWIESAVDAHLKLGFTTKGMKKV 245

Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380
           G D+A+EG D        G V+  +  W   D+  + N+ +    K++ D II D+   G
Sbjct: 246 GFDVADEGADANANAFVHGSVVLGVEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGVG 305

Query: 381 ARTCDYLEMLG--YHVYRVLGQKRAVDLE-----------FCRNRRTELHVKMAD----- 422
           A    + + L     V            E              N + +    + D     
Sbjct: 306 AGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKDKKNQDMFSNIKAQSWWALRDRFYKT 365

Query: 423 --WLEFASLINHSGLI------------QNLKSLKSFIVPNTGELAIESK---RVKGAKS 465
              +++  +     LI            +   S       N G + +ESK   + +G  S
Sbjct: 366 YRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIPS 425

Query: 466 TDYSDGLMYTFAENPPRSDMDF 487
            + +D L+  +A   P+S +D 
Sbjct: 426 PNMADALVMCYAPTKPKSLLDL 447


>gi|330958838|gb|EGH59098.1| hypothetical protein PMA4326_09820 [Pseudomonas syringae pv.
           maculicola str. ES4326]
          Length = 512

 Score =  151 bits (381), Expect = 3e-34,   Method: Composition-based stats.
 Identities = 55/239 (23%), Positives = 89/239 (37%), Gaps = 22/239 (9%)

Query: 267 EGIIARYGLDSDVTRVEVC---GQFPQQDIDSF--------IPLNIIEEALNREPC-PDP 314
           E +  R G  S     +V     ++P     +F        I    +  A  +E      
Sbjct: 253 EQMAWRAGKISSDFANDVDFFNQEYPATPDLAFQKVGHKPLIKTVKVSLARKKEIKHERR 312

Query: 315 YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI-I 373
               ++G D A  GGD +  + R+G V   +   +  D      + + ++   +   +  
Sbjct: 313 IGAHVVGLDPAR-GGDTSTFIHRQGRVAWGIERNNIPDTMAVVGQAARMLMDDKTIRMMF 371

Query: 374 IDANNTGARTCDYLEMLGY--HVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE---FAS 428
           ID    GA   D L  LG+   V  V     A D     N+R E+  +MA+W+      S
Sbjct: 372 IDIGGLGAGIYDRLVELGFGDRVTAVNFGSSASDSRKYANKRCEMWGEMAEWIHDDITPS 431

Query: 429 LINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPRSD 484
           + +   L  +L S       + G+L +  K   + K  +S D  D L  TFAE     D
Sbjct: 432 IPDDDQLHSDLTSAAKDKYTSNGQLKLLPKEDAKKKIGRSPDDGDALALTFAEPVSADD 490


>gi|68250076|ref|YP_249188.1| phage terminase large subunit [Haemophilus influenzae 86-028NP]
 gi|68058275|gb|AAX88528.1| predicted phage terminase large subunit [Haemophilus influenzae
           86-028NP]
          Length = 447

 Score =  149 bits (377), Expect = 8e-34,   Method: Composition-based stats.
 Identities = 72/442 (16%), Positives = 144/442 (32%), Gaps = 59/442 (13%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GK+   A  ++      P + V+C              E+ K +S    +   + 
Sbjct: 27  GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQMLAD- 72

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
           Q   L    ++       +G +   ++      +     +  G        +  +E    
Sbjct: 73  QIEMLGLQNFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGID 262
                  ++  + E  +    I++ NP+ +    Y+ F   P +  K   ++ +      
Sbjct: 128 SKESWDILIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185

Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIM 320
               E +      D ++ R    G+       + I    IE A++   +          +
Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGEPVADSDLAIIKPVWIECAVDAHLKLGFTAKGMKKV 245

Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380
           G D+A+EG D+       G V+  +  W   D+  + N+ +    K++ D II D+   G
Sbjct: 246 GFDVADEGADSNDNAFVHGSVVLDIEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGVG 305

Query: 381 ARTCDYLEMLG--YHVYRVLGQKRAVDLE-----------FCRNRRTELHVKMAD----- 422
           A    + + L     V            E              N + +    + D     
Sbjct: 306 AGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRDRFYKT 365

Query: 423 --WLEFASLINHSGLI------------QNLKSLKSFIVPNTGELAIESK---RVKGAKS 465
              ++   +     LI            +   S       N G + +ESK   + +G  S
Sbjct: 366 YRAVKHGDVYPDDELISLSSNIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIPS 425

Query: 466 TDYSDGLMYTFAENPPRSDMDF 487
            + +D L+  +A   P+S +D 
Sbjct: 426 PNMADALVMCYATTKPKSLLDL 447


>gi|301170180|emb|CBW29784.1| predicted phage terminase large subunit [Haemophilus influenzae
           10810]
          Length = 447

 Score =  147 bits (372), Expect = 3e-33,   Method: Composition-based stats.
 Identities = 70/442 (15%), Positives = 141/442 (31%), Gaps = 59/442 (13%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GK+   A  ++      P + V+C              E+ K +S    +   + 
Sbjct: 27  GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQMLADQ 73

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
             +      +            S+      +T +     +  G        +  +E    
Sbjct: 74  VEMLGLQDFFDVQKTQIIEQNGSRFTFAGLKT-NITSIKSMTGID-----VVWVEEGENV 127

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGID 262
                  ++  + E  +    I++ NP+ +    Y+ F   P +  K   ++ +      
Sbjct: 128 SKESWDILIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185

Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIM 320
               E +      D ++ R    G+       + I    IE A++   +          +
Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGEPVADSDLAIIKPVWIESAVDAHLKLGFTTKGMKKV 245

Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380
           G D+A+EG D+       G V+  +  W    +  + N+ +    K++ D II D+   G
Sbjct: 246 GFDVADEGADSNANAFVHGSVVLDIEVWKNGYVIDSANRTNQSAVKFKADLIIFDSIGVG 305

Query: 381 ARTCDYLEMLG--YHVYRVLGQKRAVDLE-----------FCRNRRTELHVKMAD----- 422
           A    + + L     V            E              N + +    + D     
Sbjct: 306 AGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRDRFYKT 365

Query: 423 --WLEFASLINHSGLI------------QNLKSLKSFIVPNTGELAIESK---RVKGAKS 465
              +++  +     LI            +   S       N G + +ESK   + +G  S
Sbjct: 366 YRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIPS 425

Query: 466 TDYSDGLMYTFAENPPRSDMDF 487
            + +D L+  +A   P+S +D 
Sbjct: 426 PNMADALVMCYAPTKPKSLLDL 447


>gi|16273317|ref|NP_439561.1| terminase large subunit-like protein [Haemophilus influenzae Rd
           KW20]
 gi|1175785|sp|P44184|Y1410_HAEIN RecName: Full=Uncharacterized protein HI_1410
 gi|1574247|gb|AAC23058.1| predicted coding region HI1410 [Haemophilus influenzae Rd KW20]
          Length = 394

 Score =  146 bits (369), Expect = 7e-33,   Method: Composition-based stats.
 Identities = 63/402 (15%), Positives = 133/402 (33%), Gaps = 46/402 (11%)

Query: 124 LWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDT 183
           ++ E+ K +S    +   + Q   L    ++       +G +   ++      +     +
Sbjct: 1   MFREIQKSISDSVIQMLAD-QIEMLSLQAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKS 59

Query: 184 FVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN- 242
             G        +  +E           ++  + E  +    I++ NP+ +    Y+ F  
Sbjct: 60  MTGID-----VVWVEEGENVSKESWDILIPTIREDGSQI--IVSFNPKNILDDTYQRFVI 112

Query: 243 KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
            P +  K   ++ +          E +      D ++ R    G+       + I    I
Sbjct: 113 HPPERCKSVLVNWQDNPYFPKELMEDMEQMRERDYELYRHVYEGEPVADSDLAIIKPVWI 172

Query: 303 EEALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360
           E A++   +          +G D+A+EG D+       G V+  +  W   D+  + N+ 
Sbjct: 173 EYAVDAHLKLGFTAKGMKKVGFDVADEGADSNANAFVHGSVVLDIEVWKNGDVIDSANRT 232

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLE----------- 407
           +    K++ D II D+   GA    + + L     V            E           
Sbjct: 233 NQSAVKFKADLIIFDSIGVGAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQD 292

Query: 408 FCRNRRTELHVKMAD-------WLEFASLINHSGLI------------QNLKSLKSFIVP 448
              N + +    + D        +++  +     LI            +   S       
Sbjct: 293 MFSNIKAQSWWALRDRFYKTYRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYD 352

Query: 449 NTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPRSDMDF 487
           N G + +ESK   + +G  S + +D L+  +A   P+S +D 
Sbjct: 353 NNGRVKVESKKDMKKRGIPSPNMADALVMCYAPTKPKSLLDL 394


>gi|68249883|ref|YP_248995.1| phage terminase large subunit [Haemophilus influenzae 86-028NP]
 gi|68058082|gb|AAX88335.1| predicted phage terminase large subunit [Haemophilus influenzae
           86-028NP]
          Length = 438

 Score =  145 bits (366), Expect = 2e-32,   Method: Composition-based stats.
 Identities = 65/441 (14%), Positives = 152/441 (34%), Gaps = 64/441 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GK+   A L++  ++ R  + V C    +  +  ++   ++  +  L     FE+
Sbjct: 12  GGRGSGKSWGVAQLLI-EIAVRTKVRVFCGRELQNSMSDSVIKLIADTIEDLGYLEEFEV 70

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
           Q  +++     S+ +   +  +              +  +  G        +  +EA   
Sbjct: 71  QRNAIYCLKTGSEFMFYGIKNNP------------NKIKSLEGID-----LVWIEEAENV 113

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQIDTRTVEGI 261
            +     ++  + +  +   W+ T NP+ +    Y+ F    P + + R +I+       
Sbjct: 114 SNESWDILIPTIRKERSE-IWV-TFNPKNILDPTYQRFVIAPPKNSFVR-KINYDENPYF 170

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLI 319
             +    +      D ++ R    G+         I    IE A++  ++    P    I
Sbjct: 171 PETLRLEMEECKERDYELYRHIWLGEPVADSDKVIIKPVWIECAVDAHKKLGFLPAGRKI 230

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
           +G D+A++G D+       G V+  + +W   D+  + ++      ++  + I+ D+   
Sbjct: 231 VGFDVADDGVDSNANAFVHGSVVLRVDEWRGEDVIGSADRTRLNALEFGANEIVYDSIGV 290

Query: 380 GART---CDYLEMLGYHVYRVLGQK-----------RAVDLEFCRNRRTELHVKMA---- 421
           GA        L+     +                     + +   N + +   ++     
Sbjct: 291 GAGVKAHYHRLDDKSIRINGFNAGGAVFEPDAEYVYGKTNRDMFANIKAQAWWRLRDRFY 350

Query: 422 ---------------DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGA 463
                          + +  +S I     ++   +         G + +ESK   + +G 
Sbjct: 351 KTYRAITYEEQYPVDEMISLSSDIRDLEYLKAELARPYVDYDGNGRVKVESKKDMKKRGI 410

Query: 464 KSTDYSDGLMYTFAENPPRSD 484
            S + +D L+  FA   P+ D
Sbjct: 411 PSPNKADALVMCFA---PKED 428


>gi|329119006|ref|ZP_08247700.1| phage terminase large subunit [Neisseria bacilliformis ATCC
           BAA-1200]
 gi|327464879|gb|EGF11170.1| phage terminase large subunit [Neisseria bacilliformis ATCC
           BAA-1200]
          Length = 449

 Score =  145 bits (366), Expect = 2e-32,   Method: Composition-based stats.
 Identities = 56/322 (17%), Positives = 105/322 (32%), Gaps = 42/322 (13%)

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQI 253
             +EA          ++  +        W+   NP+ +    Y+ F  + P D     + 
Sbjct: 114 WVEEAEAVTKNSWDVLIPSIRGDKNAEIWVSF-NPKNILDDTYQRFIVHPPKDS-IVLKA 171

Query: 254 DTRTVEGI-DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
           +        D      ++     D D+ R    G+       + I  + IE A++     
Sbjct: 172 NYDINPHFADTPLLADMLECKERDEDLYRHIWLGEPVADSELAIIKPSWIEAAIDAHEKL 231

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370
                   I+G D+A+EG D    VLR G V+  +  W   D+  + +K+    ++   D
Sbjct: 232 GFSAAGRRILGFDVADEGDDANATVLRHGSVVTDMQQWRGQDVIYSADKVYLYAQEQNVD 291

Query: 371 AIIIDANNTGART-------CDYLEMLGYHVYRVLGQKRA------VDLEFCRNRRTELH 417
            I+ D    GA            ++ LG++    + +  A       + +   N + +  
Sbjct: 292 RIVYDNIGVGAGVKAQFRRKNGKVQTLGFNAGGAVYKPDAKYTDDKKNRDMFANIKAQAW 351

Query: 418 VKMAD-------WLEFASLINHSGLIQ------------NLKSLKSFIVPNTGELAIESK 458
             + D        +          LI                S         G +  ESK
Sbjct: 352 WMVRDRFYKTWRAVHHGDSYPEDQLISLSSSLHELEYLTAELSRPQVDYDQNGRVKAESK 411

Query: 459 ---RVKGAKSTDYSDGLMYTFA 477
              + +G  S + +D L+  FA
Sbjct: 412 KDMKKRGIPSPNRADALVMVFA 433


>gi|309379923|emb|CBX21334.1| unnamed protein product [Neisseria lactamica Y92-1009]
          Length = 449

 Score =  144 bits (362), Expect = 4e-32,   Method: Composition-based stats.
 Identities = 55/322 (17%), Positives = 104/322 (32%), Gaps = 42/322 (13%)

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQI 253
             +EA          ++  +        W+   NP+ +    Y  F  + P D     + 
Sbjct: 114 WVEEAEAVTKNSWDVLIPSIRGDKNAEIWVSF-NPKNILDDTYRRFIVHPPQDS-IVLKA 171

Query: 254 DTRTVEGI-DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
           +        D      ++     D D+ R    G+       + I  + IE A++     
Sbjct: 172 NYDINPHFADTPLLADMLECKERDEDLYRHIWLGEPVADSELAIIKPSWIEAAIDAHEKL 231

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370
                   I+G D+A+EG D    VLR G V+  +  W   D+  + +K+    ++   D
Sbjct: 232 GFQAAGKRILGFDVADEGDDANATVLRHGSVVTDMRQWRGQDVIYSADKVYLYAQEQDID 291

Query: 371 AIIIDANNTGARTC-------DYLEMLGYHVYRVLGQKRA------VDLEFCRNRRTELH 417
            I+ D    GA            ++ LG++    + +  A       + +   N + +  
Sbjct: 292 RIVYDNIGVGAGVKAQFRRKRGKVQTLGFNAGGAVYKPDAKYTDDKKNRDMFANIKAQAW 351

Query: 418 VKMAD-------WLEFASLINHSGL------------IQNLKSLKSFIVPNTGELAIESK 458
             + D        +          L            +    S         G +  ESK
Sbjct: 352 WMVRDRFYKTWRAVHHGDSYPEDQLVSLSSSLHELEYLTAELSRPQVDYDQNGRVKAESK 411

Query: 459 ---RVKGAKSTDYSDGLMYTFA 477
              + +G  S + +D L+  FA
Sbjct: 412 KDMKKRGIPSPNRADALVMAFA 433


>gi|145629819|ref|ZP_01785613.1| predicted phage terminase large subunit [Haemophilus influenzae
           22.1-21]
 gi|148827544|ref|YP_001292297.1| hypothetical protein CGSHiGG_04845 [Haemophilus influenzae PittGG]
 gi|144977965|gb|EDJ87753.1| predicted phage terminase large subunit [Haemophilus influenzae
           22.1-21]
 gi|148718786|gb|ABQ99913.1| hypothetical protein CGSHiGG_04845 [Haemophilus influenzae PittGG]
          Length = 449

 Score =  143 bits (361), Expect = 6e-32,   Method: Composition-based stats.
 Identities = 65/441 (14%), Positives = 151/441 (34%), Gaps = 64/441 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GK+   A L++  ++ R  + V C    +  +  ++   ++  +  L     FE+
Sbjct: 23  GGRGSGKSWGVAQLLV-EIAVRTKVRVFCGRELQNSMSDSVIKLIADTIEDLGYLEEFEV 81

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
           Q  +++     S+ +   +  +              +  +  G        +  +EA   
Sbjct: 82  QRNAIYCLKTGSEFMFYGIKNNP------------NKIKSLEGID-----LVWIEEAENV 124

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQIDTRTVEGI 261
            +     ++  + +  +   W+ T NP+ +    Y+ F    P + + R +I+       
Sbjct: 125 SNESWDILIPTIRKERSE-IWV-TFNPKNILDPTYQRFVIAPPKNSFVR-KINYDENPYF 181

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLI 319
             +    +      D ++ R    G+         I    IE A++  ++    P    I
Sbjct: 182 PETLRLEMEECKERDYELYRHIWLGEPVADSDKVIIKPVWIECAVDAHKKLGFLPAGRKI 241

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
           +G D+A++G D+       G V+  + +W   D+  + ++      ++  + I+ D+   
Sbjct: 242 VGFDVADDGVDSNANAFVHGSVVLRVDEWHGEDVIGSADRTRLNALEFGANEIVYDSIGV 301

Query: 380 GART---CDYLEMLGYHVYRVLGQK-----------RAVDLEFCRNRRTELHVKMA---- 421
           GA        L+     +                     + +   N + +    +     
Sbjct: 302 GAGVKAHYHRLDDKSIRINGFNAGGAVFEPDAEYVYGKTNRDMFANIKAQAWWCLRDRFY 361

Query: 422 ---------------DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGA 463
                          + +  +S I     ++   +         G + +ESK   + +G 
Sbjct: 362 KTYRAITYEEQYPVDEMISLSSDIRDLEYLKAELARPYVDYDGNGRVKVESKKDMKKRGI 421

Query: 464 KSTDYSDGLMYTFAENPPRSD 484
            S + +D L+  FA   P+ D
Sbjct: 422 PSPNKADALVMCFA---PKED 439


>gi|319775727|ref|YP_004138215.1| phage terminase large subunit [Haemophilus influenzae F3047]
 gi|319896735|ref|YP_004134928.1| phage terminase large subunit [Haemophilus influenzae F3031]
 gi|317432237|emb|CBY80589.1| predicted phage terminase large subunit [Haemophilus influenzae
           F3031]
 gi|317450318|emb|CBY86534.1| predicted phage terminase large subunit [Haemophilus influenzae
           F3047]
          Length = 449

 Score =  143 bits (360), Expect = 7e-32,   Method: Composition-based stats.
 Identities = 65/441 (14%), Positives = 151/441 (34%), Gaps = 64/441 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GK+   A L++  ++ R  + V C    +  +  ++   ++  +  L     FE+
Sbjct: 23  GGRGSGKSWGVAQLLV-EIAVRTKVRVFCGRELQNSMSDSVIKLIADTIEDLGYLEDFEV 81

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
           Q  +++     S+ +   +  +              +  +  G        +  +EA   
Sbjct: 82  QRNAIYCLKTGSEFMFYGIKNNP------------NKIKSLEGID-----LVWIEEAENV 124

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQIDTRTVEGI 261
            +     ++  + +  +   W+ T NP+ +    Y+ F    P + + R +I+       
Sbjct: 125 SNESWDILIPTIRKERSE-IWV-TFNPKNILDPTYQRFVIAPPKNSFVR-KINYDENPYF 181

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLI 319
             +    +      D ++ R    G+         I    IE A++  ++    P    I
Sbjct: 182 PETLRLEMEECKERDYELYRHIWLGEPVADSDKVIIKPVWIECAVDAHKKLGFLPAGRKI 241

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
           +G D+A++G D+       G V+  + +W   D+  + ++      ++  + I+ D+   
Sbjct: 242 VGFDVADDGVDSNANAFVHGSVVLRVDEWRGEDVIGSADRTRLNALEFGANEIVYDSIGV 301

Query: 380 GART---CDYLEMLGYHVYRVLGQK-----------RAVDLEFCRNRRTELHVKMA---- 421
           GA        L+     +                     + +   N + +    +     
Sbjct: 302 GAGVKAHYHRLDDKSIRINGFNAGGAVFEPDAEYVYGKTNRDMFANIKAQAWWCLRDRFY 361

Query: 422 ---------------DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGA 463
                          + +  +S I     ++   +         G + +ESK   + +G 
Sbjct: 362 KTYRAITYEEQYPVDEMISLSSDIRDLEYLKAELARPYVDYDGNGRVKVESKKDMKKRGI 421

Query: 464 KSTDYSDGLMYTFAENPPRSD 484
            S + +D L+  FA   P+ D
Sbjct: 422 PSPNKADALIMCFA---PKED 439


>gi|260583110|ref|ZP_05850891.1| phage terminase large subunit [Haemophilus influenzae NT127]
 gi|260093822|gb|EEW77729.1| phage terminase large subunit [Haemophilus influenzae NT127]
          Length = 445

 Score =  143 bits (360), Expect = 7e-32,   Method: Composition-based stats.
 Identities = 65/441 (14%), Positives = 151/441 (34%), Gaps = 64/441 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GK+   A L++  ++ R  + V C    +  +  ++   ++  +  L     FE+
Sbjct: 19  GGRGSGKSWGVAQLLV-EIAVRTKVRVFCGRELQNSMSDSVIKLIADTIEDLGYLEEFEV 77

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
           Q  +++     S+ +   +  +              +  +  G        +  +EA   
Sbjct: 78  QRNAIYCLKTGSEFMFYGIKNNP------------NKIKSLEGID-----LVWIEEAENV 120

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQIDTRTVEGI 261
            +     ++  + +  +   W+ T NP+ +    Y+ F    P + + R +I+       
Sbjct: 121 SNESWDILIPTIRKERSE-IWV-TFNPKNILDPTYQRFVIAPPKNSFVR-KINYDENPYF 177

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLI 319
             +    +      D ++ R    G+         I    IE A++  ++    P    I
Sbjct: 178 PETLRLEMEECKERDYELYRHIWLGEPVADSDKVIIKPVWIECAVDAHKKLGFLPAGRKI 237

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
           +G D+A++G D+       G V+  + +W   D+  + ++      ++  + I+ D+   
Sbjct: 238 VGFDVADDGVDSNANAFVHGSVVLRVDEWHGEDVIGSADRTRLNALEFGANEIVYDSIGV 297

Query: 380 GART---CDYLEMLGYHVYRVLGQK-----------RAVDLEFCRNRRTELHVKMA---- 421
           GA        L+     +                     + +   N + +    +     
Sbjct: 298 GAGVKAHYHRLDDKSIRINGFNAGGAVFEPDAEYVYGKTNRDMFANIKAQAWWCLRDRFY 357

Query: 422 ---------------DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGA 463
                          + +  +S I     ++   +         G + +ESK   + +G 
Sbjct: 358 KTYRAITYEEQYPVDEMISLSSDIRDLEYLKAELARPYVDYDGNGRVKVESKKDMKKRGI 417

Query: 464 KSTDYSDGLMYTFAENPPRSD 484
            S + +D L+  FA   P+ D
Sbjct: 418 PSPNKADALVMCFA---PKED 435


>gi|145638997|ref|ZP_01794605.1| terminase large subunit-like protein [Haemophilus influenzae
           PittII]
 gi|145271969|gb|EDK11878.1| terminase large subunit-like protein [Haemophilus influenzae
           PittII]
          Length = 379

 Score =  142 bits (359), Expect = 1e-31,   Method: Composition-based stats.
 Identities = 56/332 (16%), Positives = 111/332 (33%), Gaps = 40/332 (12%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252
            +  +E           ++  + E  +    I++ NP+ +    Y+ F   P +  K   
Sbjct: 50  VVWVEEGENVSKESWDILIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVL 107

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           ++ +          E +      D ++ R    G+       + I    IE A++   + 
Sbjct: 108 VNWQDNPYFPKELMEDMEQMRERDYELYRHVYEGEPVADSDLAIIKPVWIESAVDAHLKL 167

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370
                    +G D+A+EG D        G V+  +  W   D+  + N+ +    K++ D
Sbjct: 168 GFTTKGMKKVGFDVADEGADANANAFVHGSVVLGVEVWKNGDVIDSANRTNQSAVKFKAD 227

Query: 371 AIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLE-----------FCRNRRTELH 417
            II D+   GA    + + L     V            E              N + +  
Sbjct: 228 LIIFDSIGVGAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKDKKNQDMFSNIKAQSW 287

Query: 418 VKMAD-------WLEFASLINHSGLI------------QNLKSLKSFIVPNTGELAIESK 458
             + D        +++  +     LI            +   S       N G + +ESK
Sbjct: 288 WALRDRFYKTYRAVKYGDVYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESK 347

Query: 459 ---RVKGAKSTDYSDGLMYTFAENPPRSDMDF 487
              + +G  S + +D L+  +A   P+S +D 
Sbjct: 348 KDMKKRGIPSPNMADALVMCYAPTKPKSLLDL 379


>gi|307251380|ref|ZP_07533296.1| hypothetical protein appser4_21360 [Actinobacillus pleuropneumoniae
           serovar 4 str. M62]
 gi|306856621|gb|EFM88761.1| hypothetical protein appser4_21360 [Actinobacillus pleuropneumoniae
           serovar 4 str. M62]
          Length = 384

 Score =  135 bits (340), Expect = 1e-29,   Method: Composition-based stats.
 Identities = 57/376 (15%), Positives = 121/376 (32%), Gaps = 46/376 (12%)

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
            E Q   L+  P++       +G +   ++      +     +  G        +  +E 
Sbjct: 2   LEDQIEILNLKPFFEVQKTQIIGRNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEG 56

Query: 201 SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVE 259
                     ++  + E  +    I++ NP+ L    Y+ F   P +      ++ +   
Sbjct: 57  ENVSKESWDVLIPTIREDGSQI--IVSFNPKNLLDDTYQRFVINPPERCCSVLVNWQDNP 114

Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAP 317
                  E +      D ++ R    GQ       + I    IE+A++  ++        
Sbjct: 115 YFPKELMEDMKQMKERDFELYRHVYEGQPVADSDLAIIKPLWIEKAVDAHKKLGFTASGR 174

Query: 318 LIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDAN 377
            ++G D+A+EG D        G V+  + +W   D+  + ++       +  D I+ D+ 
Sbjct: 175 KVVGFDVADEGIDANANCFAHGSVVLQVDEWRGDDVIQSAHRTHTNAVMWGVDEIVFDSI 234

Query: 378 NTGART---CDYLEMLGYHVYRVLGQKRAVDL-----------EFCRNRRTELHVKMAD- 422
             GA        ++                +            E   N + +    + D 
Sbjct: 235 GVGAGVKAEYRRMDTKRILCSGFNAGASVFEPDEYYTQDKTNGEMFANIKAQAWWLLRDR 294

Query: 423 ------WLEFASLINHSGLI------------QNLKSLKSFIVPNTGELAIESKR---VK 461
                  +EF  +     +I            +   S       N G++ +ESK+    +
Sbjct: 295 FYKTYRAIEFGDVYPVDEMISLSSDIKDLEYLKAELSRPRVDHDNNGKVRVESKKDMRKR 354

Query: 462 GAKSTDYSDGLMYTFA 477
           G  S + +D L+  FA
Sbjct: 355 GIPSPNKADSLVMCFA 370


>gi|85058727|ref|YP_454429.1| phage terminase large subunit [Sodalis glossinidius str.
           'morsitans']
 gi|84779247|dbj|BAE74024.1| phage terminase large subunit [Sodalis glossinidius str.
           'morsitans']
          Length = 456

 Score =  135 bits (340), Expect = 2e-29,   Method: Composition-based stats.
 Identities = 73/456 (16%), Positives = 141/456 (30%), Gaps = 68/456 (14%)

Query: 74  NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133
            P  +K A   GRG GK+   A   L +   R G      A            E    ++
Sbjct: 13  QPHRYKIA-KGGRGSGKSW--AIARLLVEIARRGTYRFLCA-----------REFQASMA 58

Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM 193
               +   +      +   +     +         +       +  +  +  G       
Sbjct: 59  DSVIQLIADTIQREGYLKEFEIQKAYIRYLATDSLFMFYGIKNNVTKIKSLEGID----- 113

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252
               +EA          ++  + +  +   W+   NP+ +    Y+ F   PLDD     
Sbjct: 114 IAWVEEAEAVTKESWDILIPTIRKPGSE-IWVSF-NPKNILDDTYQRFVVNPLDDICLLT 171

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
           +               +      D D+      G+       + I    I  A++     
Sbjct: 172 VHYTDNPHFPEVLRLEMEECKCKDYDLYLHIWEGEPVADSDLAIIKPLWIAAAVDAHITL 231

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370
             +P     +G D+A+EG D+  ++L  G V+ HL  W+K D+  + +++    E    D
Sbjct: 232 GFEPAGKKRIGFDVADEGEDSNALILSHGSVVMHLETWNKGDVIQSADRVKNYAESVIAD 291

Query: 371 AIIIDANNTGARTCDYLEML------GYHVYRVLGQKRA------VDLEFCRNRRTELHV 418
            II D+   GA     L  +      G++    + +  A       + +   N + +   
Sbjct: 292 EIIFDSIGVGAGVKARLRRVSRITASGFNAGGGVFKPDAKYVDGKTNKDMFVNLKAQAWW 351

Query: 419 KMADW----LEFASLI----NHSGLIQNLK---------------------SLKSFIVPN 449
            + +           I    + S  ++ L+                     S       N
Sbjct: 352 GVRERFYNTWHAVEYIKHHPDDSDFVKGLRDDQLISLSSRLSSLDYLKAELSRPWVDYDN 411

Query: 450 TGELAIESK---RVKGAKSTDYSDGLMYTFAENPPR 482
            G + +ESK   + +G  S + +D L+  FA     
Sbjct: 412 NGRVKVESKKDMKKRGIPSPNRADALIMAFAPTYKP 447


>gi|149174861|ref|ZP_01853485.1| hypothetical protein PM8797T_10814 [Planctomyces maris DSM 8797]
 gi|148846198|gb|EDL60537.1| hypothetical protein PM8797T_10814 [Planctomyces maris DSM 8797]
          Length = 568

 Score =  132 bits (332), Expect = 1e-28,   Method: Composition-based stats.
 Identities = 84/530 (15%), Positives = 153/530 (28%), Gaps = 141/530 (26%)

Query: 52  RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVI 111
             WQ + +E +              + +  +    G GK                   +I
Sbjct: 57  DDWQWDILESLFD----------LTIRRVFVKGNTGCGKGAAAGIACCTYFHIWNDAKII 106

Query: 112 CLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYST 171
              +S    +   + EV KW   +  K   ++ +  +     +S  L             
Sbjct: 107 ITRDSVRTAQKIAFGEVDKWWRKMRFKPPGKLLTSGVFDNNQHSISL------------- 153

Query: 172 MCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPR 231
                + +  + F G H+ + +    DEA+     +        T+    + ++  SNP 
Sbjct: 154 ----ANPQHIEGFRGAHSPH-VFFWFDEAT--APNLEDKYKLANTQA---KKFLALSNPS 203

Query: 232 RLSGKFYEIFNKPLDDW-----------KRFQIDTRTVEGIDPSFHEGIIARYG------ 274
            LSG F + F     D            +   +       +     E  +A  G      
Sbjct: 204 TLSGTFRDSFPVVNPDKTQTIIDQYGNTRCITVSGWECTNVKEKCLEQPVAPIGGIKISD 263

Query: 275 ------------------------------------LDSDVTRVEVCGQFPQQDID-SFI 297
                                                D  +  V   G+FP QD D   I
Sbjct: 264 NYYPHGSPIAADDFEKVQPRIPGQTCYDEFMALLNDADPLIRNVYALGKFPDQDPDKQVI 323

Query: 298 PLNIIEE------ALNREPCPDPYAPLIM--------------GCDIA--EEGGDNTVVV 335
             + + E        NR          I+              G D+A    G D +V+ 
Sbjct: 324 LPDWLIEPVKFWTRWNRLCLRAREQFHILALKLLEQILPVEGFGLDVAASRFG-DASVLA 382

Query: 336 LRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA------IIID-ANNTGARTCDYLE 388
           +     I  + +   +D + T + +      +  D       I ID     G    D L+
Sbjct: 383 VGGRYGIRAIHECQFSDTQQTMSWVLETANSHGVDLEQGIVPIAIDWGGGYGNAVGDPLK 442

Query: 389 MLGYHVYRVLG-QKRAVDLEFCRNRRTELHVKMADWLEFAS--------LINHSGLIQNL 439
               +V  + G     +D +   N+R EL+ + A  L+ A         L ++  L   L
Sbjct: 443 KRNVNVIEIHGNASSNLDSKKYANKRAELYGEAARRLDPAGDFRMMPFALPDNQRLKAEL 502

Query: 440 KSLKSFIVPNTG-ELAIESKRVKG--------------AKSTDYSDGLMY 474
            + +     + G +  I  K  +G               +S D +D ++Y
Sbjct: 503 VAPEKIYAGHDGEKYYITPKGRRGSDANYNGKTLHEILGRSPDRADAVVY 552


>gi|261402679|ref|YP_003246903.1| protein of unknown function DUF264 [Methanocaldococcus vulcanius
           M7]
 gi|261369672|gb|ACX72421.1| protein of unknown function DUF264 [Methanocaldococcus vulcanius
           M7]
          Length = 437

 Score =  132 bits (332), Expect = 1e-28,   Method: Composition-based stats.
 Identities = 74/390 (18%), Positives = 138/390 (35%), Gaps = 47/390 (12%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141
           ++AGR  GK+ L  +L+++L  T+       +A      +  ++ E+  ++         
Sbjct: 50  VAAGRRFGKSKLMCFLLIFLSCTQKDKKFAVIAPYYANAR-IIFKELRTYIEKNKTLQKL 108

Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201
                 +  +P+          ID +         S + P +  G        +I DEA+
Sbjct: 109 ---VKRITESPYMVIEFKTGCIIDFR---------SADNPTSIRG---ESYHLVILDEAA 153

Query: 202 GT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF---NKPLDDWKRFQIDTRT 257
               DV+   I   L + +A    I  S P   +  FYE F       +    F+  T +
Sbjct: 154 FIKDDVVKYVIKPLLIDYDAP--LIEISTPNGHN-HFYESFLMGENRQNRHISFRFPTWS 210

Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI----DSFIPLNIIEEALNREPCPD 313
              +  S  E I   +G DS V + E C +F           +I    I+  +      +
Sbjct: 211 NPFLPKSVIEEIKREFGEDSLVWKQEFCAEFIDDQDAVFKWEYI-QQCIDSNIELLTVGE 269

Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRT---TNNKISGLVEKYRPD 370
                +MG D+A+      +++L        L  + +   +       +I  L  K++P 
Sbjct: 270 KGHRYVMGVDLAKYQDYTVIIILDVSENPYKLVYFERFKDKPYSYVVERIKELYIKFKP- 328

Query: 371 AIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI 430
            + +D+   G    + LE      ++   Q +            +L  K+   LE   +I
Sbjct: 329 VVCVDSTGVGDPVVEQLEDCNPIPFKFTNQSK-----------MQLITKLQTALERKEVI 377

Query: 431 --NHSGLIQNLKSLKSFIVPNTGELAIESK 458
                 LI  LK  +   V     ++ E+K
Sbjct: 378 FPYIDTLITELKYFRY--VKKKTTISFEAK 405


>gi|241763591|ref|ZP_04761642.1| phage terminase large subunit [Acidovorax delafieldii 2AN]
 gi|241367184|gb|EER61538.1| phage terminase large subunit [Acidovorax delafieldii 2AN]
          Length = 521

 Score =  130 bits (327), Expect = 5e-28,   Method: Composition-based stats.
 Identities = 58/233 (24%), Positives = 94/233 (40%), Gaps = 26/233 (11%)

Query: 275 LDSDVTRVEVCGQFPQQDID---SFIPLNIIEEALNR-EPCPDPYAPLIMGCDIAEEGGD 330
           L   +    + G F     D     IP   ++ A  R +P  D     ++G D A  G D
Sbjct: 267 LPEPLRSQMLRGDFSAGAADPAWQLIPTEWVKAAQARWQPRQDKGPMTVLGLDPARGGTD 326

Query: 331 NTVVVLRRGPVIEHLFDWSK---TDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387
            T V  R     + L         D  TT    + LV       I +DA   G+   D++
Sbjct: 327 KTSVARRHDCWFDVLISEPGIVTKDGPTTAAFTAPLVR--NGAPIAVDAIGIGSSALDFI 384

Query: 388 EMLGYHVYRVLGQKRAVDLE-----FCRNRRTELHVKMADWL-----EFASLINHSGLIQ 437
           + LG  VY V+G +R+  ++       RNRR E++ ++ + L     +  +L     L+ 
Sbjct: 385 QGLGLLVYAVVGSERSDHMDKAGTMRFRNRRAEMYWRLREALDPTAEQPIALPPDQELLG 444

Query: 438 NLKSLKSFIVPNTGE---LAIESK---RVKGAKSTDYSDGLMYTFAENPPRSD 484
           +L +++ + V   G+   + I  K   R    +S D  D +  TF E  P  D
Sbjct: 445 DLTAVR-YKVVTMGQGAAIQIRDKDEIREALGRSPDKGDSVAMTFCEGIPLLD 496


>gi|303243859|ref|ZP_07330199.1| protein of unknown function DUF264 [Methanothermococcus okinawensis
           IH1]
 gi|302485795|gb|EFL48719.1| protein of unknown function DUF264 [Methanothermococcus okinawensis
           IH1]
          Length = 445

 Score =  128 bits (322), Expect = 2e-27,   Method: Composition-based stats.
 Identities = 66/321 (20%), Positives = 121/321 (37%), Gaps = 31/321 (9%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141
           ++AGR  GK+ L A+L+++L ST+       +A      +  ++ E+ K++      +  
Sbjct: 56  VAAGRRFGKSKLMAFLLIFLCSTQKNKKYAVIAPFYANAR-IIFRELKKYIEKS---NVL 111

Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201
                 +  +P+ +        ID +         S + P +  G        +I DEA+
Sbjct: 112 SRLVKRMVESPYMAIEFKTGCTIDFR---------SADNPTSIRG---ESYHLVILDEAA 159

Query: 202 GT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF---NKPLDDWKRFQIDTRT 257
               DV+   I   L + +A    I  S P   +  FYE F       +    F+  T T
Sbjct: 160 FIKDDVVKYVIKPLLLDYDAP--LIEISTPNGHN-HFYESFLMGKNKQNRHISFRFPTWT 216

Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI----DSFIPLNIIEEALNREPCPD 313
              +  +  E I    G DS V + E C +F   +       +I    I+  +      +
Sbjct: 217 NPFLPKNAIEEIKQEVGEDSPVWKQEYCAEFIDNNEAVFNWEYI-QQCIDGTIKLLKSGE 275

Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRT---TNNKISGLVEKYRPD 370
                +MG D+A+      + VL        L  + + +L       +K+  L + +   
Sbjct: 276 SGHQYVMGVDLAKFEDYTVITVLDVSVKPYKLVYFERFNLMPYSFVADKVKELYQLFNKP 335

Query: 371 AIIIDANNTGARTCDYLEMLG 391
            + +DA   GA   + +E L 
Sbjct: 336 QVCMDATGPGAAVVEQVESLN 356


>gi|187476925|ref|YP_784949.1| phage terminase large subunit [Bordetella avium 197N]
 gi|115421511|emb|CAJ48020.1| Putative phage terminase large subunit [Bordetella avium 197N]
          Length = 512

 Score =  120 bits (302), Expect = 4e-25,   Method: Composition-based stats.
 Identities = 75/359 (20%), Positives = 123/359 (34%), Gaps = 60/359 (16%)

Query: 194 AIINDEASGTPDVINLGILG--FLTERNANRFWIMTSNP-RRLSGKFYEIFNKPLDDWKR 250
            I+ DEA+   +     +LG    T+       +MT NP   + G++   +  P  D K 
Sbjct: 144 LIVLDEATELREHQARFVLGWNRTTKAGQRCRVLMTFNPPTTVEGRWVVEYFAPWLDPKH 203

Query: 251 FQ------------IDTRTVEGI---------------DPSFHEGIIAR----------- 272
                         ID + VE                   +F    IA            
Sbjct: 204 PHPAKPGELRWFAVIDGKEVEVEGGAPFAHNGETIVPRSRTFIPSRIADNPFLMGTGYES 263

Query: 273 --YGLDSDVTRVEVCGQF---PQQDIDSFIPLNIIEEALNREPCPDPYAPLI-MGCDIAE 326
               L   +    + G F    + D    IP   +E A  R   PD  AP+  +G D+A 
Sbjct: 264 VLQSLPEPLRSQMLYGDFNAGIEDDPWQVIPTAWVEAAQARWKRPDRLAPMDSLGLDVAR 323

Query: 327 EGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII-IDANNTGARTCD 385
            G D T++  R G   +    +   D           +   R  A+I +D    GA   D
Sbjct: 324 GGRDKTILARRHGWWFDEPLVYPGKDTPDGPTVAGLAISALRDHAVIHLDVIGVGASPYD 383

Query: 386 YLEMLGYHVYRVLGQKRAVDLE-----FCRNRRTELHVKMADWLE-----FASLINHSGL 435
           +L      V  V   + A   +        NRR+EL  +M + L+       +L     L
Sbjct: 384 FLVTAKQQVVGVNVAEAACGTDKSGRLRFFNRRSELWWRMREALDPIHNTGIALPPDPRL 443

Query: 436 IQNLKSLKSFIVPNTGELA-IESKRVKGAKSTDYSDGLMYTFAENPPRSDMD-FGRCPS 492
           + +L +    +   T ++A  E    K  +S D+    +    + P R+ ++  G+  S
Sbjct: 444 LADLTAPTWSLSGATLKVASREDIIDKIGRSPDFGSAYVLALMDTPKRAAVEALGQARS 502


>gi|41179386|ref|NP_958694.1| Bbp25 [Bordetella phage BPP-1]
 gi|45569518|ref|NP_996587.1| hypothetical protein BMP-1p24 [Bordetella phage BMP-1]
 gi|45580769|ref|NP_996635.1| hypothetical protein BIP-1p24 [Bordetella phage BIP-1]
 gi|40950125|gb|AAR97691.1| Bbp25 [Bordetella phage BPP-1]
          Length = 533

 Score =  118 bits (295), Expect = 2e-24,   Method: Composition-based stats.
 Identities = 51/237 (21%), Positives = 87/237 (36%), Gaps = 21/237 (8%)

Query: 275 LDSDVTRVEVCGQF---PQQDIDSFIPLNIIEEALNREPCPDPYAPLI-MGCDIAEEGGD 330
           L   +    + G F    + D    IP   +E A  R   PD  AP+  +G D+A  G D
Sbjct: 289 LPEPLRSQMLYGDFNAGIEDDPWQVIPTAWVEAAQARWKRPDRLAPMDSLGVDVARGGRD 348

Query: 331 NTVVVLRRGPVIEHLFDWSKTDL---RTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387
           NT++  R     +    +   D     T        +  +    I +D    GA   D+L
Sbjct: 349 NTILARRHAMWFDVPLTYPGKDTPDGPTVAGLAIAALRDH--AVIHLDVIGVGASPYDFL 406

Query: 388 EMLGYHVYRVLGQKRAVDLE-----FCRNRRTELHVKMADWLE-----FASLINHSGLIQ 437
                 V  V   + A   +        N R+EL  +M + L+       +L     L+ 
Sbjct: 407 AQAKQQVVGVNVAEAARGTDKSGRLRFFNLRSELWWRMREALDPTNNTGIALPPDPRLLA 466

Query: 438 NLKSLKSFIVPNTGELA-IESKRVKGAKSTDYSDGLMYTFAENPPRSDMD-FGRCPS 492
           +L +    +   T ++A  E    K  +S D+    +    + P R+ ++  G+  S
Sbjct: 467 DLTAPTWSLSGATLKVASREDIIEKIGRSPDFGSAYVLALMDTPKRAAVEALGQARS 523


>gi|300907068|ref|ZP_07124735.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           84-1]
 gi|301304068|ref|ZP_07210185.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           124-1]
 gi|300401186|gb|EFJ84724.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           84-1]
 gi|300840675|gb|EFK68435.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           124-1]
 gi|315257729|gb|EFU37697.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           85-1]
          Length = 440

 Score =  114 bits (285), Expect = 4e-23,   Method: Composition-based stats.
 Identities = 52/340 (15%), Positives = 105/340 (30%), Gaps = 52/340 (15%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252
            +  +EA    +     +   + +  +  ++    NP  ++   +      P +D    +
Sbjct: 96  VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 153

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D D  +    G     D  + I L+ IE A++  +  
Sbjct: 154 INYDENPFLSDTMLKVIEAAKRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 213

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368
             +P     +G D+A+ G D    V R G V+    +W   + +L  +  +      + R
Sbjct: 214 NFEPSGRKRIGFDVADSGADKCANVYRHGSVVYWADEWKAKEDELLKSCQRTYQAALE-R 272

Query: 369 PDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQKRA----------VDL 406
              I+ D+   GA        +              +  R                  + 
Sbjct: 273 DADIVYDSIGVGASAGAKFAEINEDRKRENMNASRINYQRFNAGAGVNEPDYEYIGIPNK 332

Query: 407 EFCRNRRTELHVKMAD-------WLEFASLINHSGLIQNLKSLKSF------------IV 447
           +F  N + +    +AD        ++         LI    S                  
Sbjct: 333 DFFANLKAQAWWLVADRFRNTFNAVKNGEQYPVDELISIDSSCPLLEKLKLELTTPHRDF 392

Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
              G + +ESK+    +   S + +D  +  FA      D
Sbjct: 393 DKNGRVMVESKKDLAKRDVPSPNVADAFIMAFAPTDTAMD 432


>gi|226940437|ref|YP_002795511.1| Terminase large subunit [Laribacter hongkongensis HLHK9]
 gi|226715364|gb|ACO74502.1| Terminase large subunit [Laribacter hongkongensis HLHK9]
          Length = 133

 Score =  113 bits (282), Expect = 8e-23,   Method: Composition-based stats.
 Identities = 39/126 (30%), Positives = 53/126 (42%), Gaps = 11/126 (8%)

Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170
           +  AN++TQL+T    EV KW  L    HWF+ QS S+                 +K + 
Sbjct: 1   MITANTDTQLRTKTSPEVGKWQRLSITSHWFDPQSASIAA----------RDKEHAKTWR 50

Query: 171 TMCRTYSEERPDTFVGHHNTYG-MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229
                +SE   + F G HN    + +I DEAS   D +     G LT+      WI   N
Sbjct: 51  ADFVPWSEHNTEAFAGLHNKGKRIVLIFDEASAIADKVWEVAEGALTDEETEIIWIAFGN 110

Query: 230 PRRLSG 235
           P R  G
Sbjct: 111 PTRNIG 116


>gi|229125159|ref|ZP_04254306.1| hypothetical protein bcere0016_54220 [Bacillus cereus 95/8201]
 gi|228658294|gb|EEL13987.1| hypothetical protein bcere0016_54220 [Bacillus cereus 95/8201]
          Length = 164

 Score =  112 bits (280), Expect = 1e-22,   Method: Composition-based stats.
 Identities = 29/147 (19%), Positives = 53/147 (36%), Gaps = 21/147 (14%)

Query: 354 RTTNNKISGLVEKY--------RPDAIIIDANNTGARTCDYLEM------LGYHVYRVLG 399
                 +    +KY        +   I ID    G    D L+           V  +  
Sbjct: 1   MYVTGLLIKEAKKYFSWCERTGKRIPIRIDDTGVGGGVTDRLKEVVAENDYPIDVIPINF 60

Query: 400 QKRAVDLEFCRNRRTELHVKMADW-LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK 458
             +           + ++    D  LEF S+ +   LI  L S++ + + + G + IE K
Sbjct: 61  ASK--GNAEYACIVSVMYGHFKDNCLEFVSIPDDEDLIAQL-SVRKYQINSDGRIKIEPK 117

Query: 459 ---RVKGAKSTDYSDGLMYTFAENPPR 482
              + +G KS D ++ ++  FA   P+
Sbjct: 118 KAMKDRGLKSPDRAEAVVMAFAPFYPK 144


>gi|218290759|ref|ZP_03494841.1| protein of unknown function DUF264 [Alicyclobacillus acidocaldarius
           LAA1]
 gi|218239297|gb|EED06496.1| protein of unknown function DUF264 [Alicyclobacillus acidocaldarius
           LAA1]
          Length = 422

 Score =  111 bits (277), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 65/365 (17%), Positives = 125/365 (34%), Gaps = 42/365 (11%)

Query: 49  SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108
           S P S QL  + +   H      + +   F+ A + GR  GKT   A  +       PG 
Sbjct: 7   SEPTSKQLR-LRLYTPHSGQVALHRSTARFRVA-TCGRRWGKTYACANEIAKWAWEHPGA 64

Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168
               +A +  Q                       + +  +    ++  +   +       
Sbjct: 65  MTWWVAPTYRQ----------------------TLTAYRIITRNFHGAIEKATTTHMRIE 102

Query: 169 YSTMCRT--YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGIL-GFLTERNANRFWI 225
           + +   T   S E  D   G        ++ DEA+  P       L   L+++      I
Sbjct: 103 WKSGSITEFRSTENFDALRG---EGLDFLVVDEAAMVPKEAWEAALRPTLSDKAGRA--I 157

Query: 226 MTSNPRRLSGKFYEIFNKP----LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTR 281
           + S P+  +  FY ++ +       +W+ F+  T     I P   E   AR  L SDV R
Sbjct: 158 IVSTPKGRN-WFYHVWARGQDPAFPEWESFRFPTLANPYIPPEEVEE--ARTTLPSDVFR 214

Query: 282 VEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLR-RGP 340
            E   +F +     F  +        +E  P P    ++G D+A+    + +VV+     
Sbjct: 215 QEYEAEFLEDSAGVFRGIRDCIS--GQEEEPQPGRRYVVGWDVAKHQDFSVLVVMDLERA 272

Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQ 400
            +  +  +++ D      ++  + ++Y    +++DA   G    + +  +G         
Sbjct: 273 HVVKMDRFNQVDYALQLERVKHICQRYNNARLLMDATGVGDPLLEQVRRMGIQAEGYSLS 332

Query: 401 KRAVD 405
             A  
Sbjct: 333 NTAKQ 337


>gi|74311301|ref|YP_309720.1| putative bacteriophage protein [Shigella sonnei Ss046]
 gi|73854778|gb|AAZ87485.1| putative bacteriophage protein [Shigella sonnei Ss046]
          Length = 473

 Score =  110 bits (274), Expect = 7e-22,   Method: Composition-based stats.
 Identities = 50/340 (14%), Positives = 103/340 (30%), Gaps = 52/340 (15%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-Q 252
            +  +EA    +     +   + +  +  ++    NP  ++   +  F     +     +
Sbjct: 128 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEGTLIRK 185

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D D  +    G     D  + I L+ IE A++  +  
Sbjct: 186 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKTL 245

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368
             +P     +G D+A+ G D    V R G V+    +W   + +L  +  +      +  
Sbjct: 246 NFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAALERE 305

Query: 369 PDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRAVDL 406
            D I+ D+   GA        +              +  R                  + 
Sbjct: 306 AD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 364

Query: 407 EFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ------------NLKSLKSFIV 447
           +F  N + +    +AD        +          LI                +      
Sbjct: 365 DFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSRCPLLEKLKLELTTPHRDF 424

Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
              G + +ESK+    +   S + +D  +  FA      D
Sbjct: 425 DRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 464


>gi|188492395|ref|ZP_02999665.1| phage terminase large subunit [Escherichia coli 53638]
 gi|188487594|gb|EDU62697.1| phage terminase large subunit [Escherichia coli 53638]
          Length = 467

 Score =  110 bits (274), Expect = 8e-22,   Method: Composition-based stats.
 Identities = 50/340 (14%), Positives = 103/340 (30%), Gaps = 52/340 (15%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-Q 252
            +  +EA    +     +   + +  +  ++    NP  ++   +  F     +     +
Sbjct: 122 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEGTLIRK 179

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D D  +    G     D  + I L+ IE A++  +  
Sbjct: 180 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKTL 239

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368
             +P     +G D+A+ G D    V R G V+    +W   + +L  +  +      +  
Sbjct: 240 NFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAALERE 299

Query: 369 PDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRAVDL 406
            D I+ D+   GA        +              +  R                  + 
Sbjct: 300 AD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 358

Query: 407 EFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ------------NLKSLKSFIV 447
           +F  N + +    +AD        +          LI                +      
Sbjct: 359 DFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSRCPLLEKLKLELTTPHRDF 418

Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
              G + +ESK+    +   S + +D  +  FA      D
Sbjct: 419 DRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 458


>gi|16759908|ref|NP_455525.1| prophage terminase large subunit [Salmonella enterica subsp.
           enterica serovar Typhi str. CT18]
 gi|29142320|ref|NP_805662.1| prophage terminase large subunit [Salmonella enterica subsp.
           enterica serovar Typhi str. Ty2]
 gi|213583175|ref|ZP_03365001.1| putative prophage terminase large subunit [Salmonella enterica
           subsp. enterica serovar Typhi str. E98-0664]
 gi|213647535|ref|ZP_03377588.1| putative prophage terminase large subunit [Salmonella enterica
           subsp. enterica serovar Typhi str. J185]
 gi|213855100|ref|ZP_03383340.1| putative prophage terminase large subunit [Salmonella enterica
           subsp. enterica serovar Typhi str. M223]
 gi|25512685|pir||AF0621 probable prophage terminase large chain STY1047 [imported] -
           Salmonella enterica subsp. enterica serovar Typhi
           (strain CT18)
 gi|16502201|emb|CAD05440.1| putative prophage terminase large subunit [Salmonella enterica
           subsp. enterica serovar Typhi]
 gi|29137950|gb|AAO69511.1| putative prophage terminase large subunit [Salmonella enterica
           subsp. enterica serovar Typhi str. Ty2]
          Length = 467

 Score =  110 bits (274), Expect = 8e-22,   Method: Composition-based stats.
 Identities = 57/340 (16%), Positives = 110/340 (32%), Gaps = 52/340 (15%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252
            +  +EA    +     +   + +  +  ++    NP  ++   +      P +D    +
Sbjct: 122 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 179

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D +       G     D  + I L+ IE A++  +  
Sbjct: 180 INYDENPFLSDTMLKVIDAARRRDPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 239

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P     +G D+A+ G D    V R G VI    +W   + +L  +  +      + R
Sbjct: 240 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKEDELLKSCQRTYQAAME-R 298

Query: 369 PDAIIIDANNTGAR-------TCDYLEMLGYHVYRVLGQ---------------KRAVDL 406
              I+ D+   GA          D  +    +  RV  Q                   + 
Sbjct: 299 DADIVYDSIGVGASAGAKFSEINDDRKRENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 358

Query: 407 EFCRNRRTELHVKMADWLE-FASLINHSG--LIQNLKSL----------------KSFIV 447
           +F  N + +    +AD      + IN+    L+  L S+                     
Sbjct: 359 DFFANLKAQAWWLVADRFRNTFNAINNGEQYLVDELISIDSRCPLLEKLKLELTTPHRDF 418

Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
              G + +ESK+    +   S + +D  +  FA      D
Sbjct: 419 DRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 458


>gi|16760783|ref|NP_456400.1| bacteriophage protein [Salmonella enterica subsp. enterica serovar
           Typhi str. CT18]
 gi|25512494|pir||AE0735 probable bacteriophage protein STY2040 [imported] - Salmonella
           enterica subsp. enterica serovar Typhi (strain CT18)
 gi|16503080|emb|CAD05583.1| putative bacteriophage protein [Salmonella enterica subsp. enterica
           serovar Typhi]
          Length = 467

 Score =  109 bits (273), Expect = 8e-22,   Method: Composition-based stats.
 Identities = 50/340 (14%), Positives = 103/340 (30%), Gaps = 52/340 (15%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-Q 252
            +  +EA    +     +   + +  +  ++    NP  ++   +  F     +     +
Sbjct: 122 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEGTLIRK 179

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D D  +    G     D  + I L+ IE A++  +  
Sbjct: 180 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKTL 239

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368
             +P     +G D+A+ G D    V R G V+    +W   + +L  +  +      +  
Sbjct: 240 NFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAALERE 299

Query: 369 PDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRAVDL 406
            D I+ D+   GA        +              +  R                  + 
Sbjct: 300 AD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 358

Query: 407 EFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ------------NLKSLKSFIV 447
           +F  N + +    +AD        +          LI                +      
Sbjct: 359 DFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSRCPLLEKLKLELTTPHRDF 418

Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
              G + +ESK+    +   S + +D  +  FA      D
Sbjct: 419 DRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 458


>gi|213161040|ref|ZP_03346750.1| putative prophage terminase large subunit [Salmonella enterica
           subsp. enterica serovar Typhi str. E00-7866]
          Length = 421

 Score =  109 bits (273), Expect = 9e-22,   Method: Composition-based stats.
 Identities = 57/340 (16%), Positives = 110/340 (32%), Gaps = 52/340 (15%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252
            +  +EA    +     +   + +  +  ++    NP  ++   +      P +D    +
Sbjct: 76  VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 133

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D +       G     D  + I L+ IE A++  +  
Sbjct: 134 INYDENPFLSDTMLKVIDAARRRDPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 193

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P     +G D+A+ G D    V R G VI    +W   + +L  +  +      + R
Sbjct: 194 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKEDELLKSCQRTYQAAME-R 252

Query: 369 PDAIIIDANNTGAR-------TCDYLEMLGYHVYRVLGQ---------------KRAVDL 406
              I+ D+   GA          D  +    +  RV  Q                   + 
Sbjct: 253 DADIVYDSIGVGASAGAKFSEINDDRKRENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 312

Query: 407 EFCRNRRTELHVKMADWLE-FASLINHSG--LIQNLKSL----------------KSFIV 447
           +F  N + +    +AD      + IN+    L+  L S+                     
Sbjct: 313 DFFANLKAQAWWLVADRFRNTFNAINNGEQYLVDELISIDSRCPLLEKLKLELTTPHRDF 372

Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
              G + +ESK+    +   S + +D  +  FA      D
Sbjct: 373 DRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 412


>gi|324012808|gb|EGB82027.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           60-1]
          Length = 441

 Score =  109 bits (273), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 50/340 (14%), Positives = 103/340 (30%), Gaps = 52/340 (15%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-Q 252
            +  +EA    +     +   + +  +  ++    NP  ++   +  F     +     +
Sbjct: 96  VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEGTLIRK 153

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D D  +    G     D  + I L+ IE A++  +  
Sbjct: 154 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKTL 213

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368
             +P     +G D+A+ G D    V R G V+    +W   + +L  +  +      +  
Sbjct: 214 NFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAALERE 273

Query: 369 PDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRAVDL 406
            D I+ D+   GA        +              +  R                  + 
Sbjct: 274 AD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 332

Query: 407 EFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ------------NLKSLKSFIV 447
           +F  N + +    +AD        +          LI                +      
Sbjct: 333 DFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSRCPLLEKLKLELTTPHRDF 392

Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
              G + +ESK+    +   S + +D  +  FA      D
Sbjct: 393 DRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 432


>gi|194434997|ref|ZP_03067239.1| phage terminase, large subunit, pbsx family [Shigella dysenteriae
           1012]
 gi|194416779|gb|EDX32906.1| phage terminase, large subunit, pbsx family [Shigella dysenteriae
           1012]
 gi|323166781|gb|EFZ52535.1| phage terminase, large subunit, PBSX family [Shigella sonnei 53G]
          Length = 447

 Score =  109 bits (273), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 50/340 (14%), Positives = 103/340 (30%), Gaps = 52/340 (15%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-Q 252
            +  +EA    +     +   + +  +  ++    NP  ++   +  F     +     +
Sbjct: 102 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEGTLIRK 159

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D D  +    G     D  + I L+ IE A++  +  
Sbjct: 160 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKTL 219

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368
             +P     +G D+A+ G D    V R G V+    +W   + +L  +  +      +  
Sbjct: 220 NFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAALERE 279

Query: 369 PDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRAVDL 406
            D I+ D+   GA        +              +  R                  + 
Sbjct: 280 AD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 338

Query: 407 EFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ------------NLKSLKSFIV 447
           +F  N + +    +AD        +          LI                +      
Sbjct: 339 DFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSRCPLLEKLKLELTTPHRDF 398

Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
              G + +ESK+    +   S + +D  +  FA      D
Sbjct: 399 DRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 438


>gi|213423381|ref|ZP_03356369.1| putative prophage terminase large subunit [Salmonella enterica
           subsp. enterica serovar Typhi str. E01-6750]
          Length = 414

 Score =  109 bits (273), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 57/340 (16%), Positives = 110/340 (32%), Gaps = 52/340 (15%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252
            +  +EA    +     +   + +  +  ++    NP  ++   +      P +D    +
Sbjct: 69  VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 126

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D +       G     D  + I L+ IE A++  +  
Sbjct: 127 INYDENPFLSDTMLKVIDAARRRDPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 186

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P     +G D+A+ G D    V R G VI    +W   + +L  +  +      + R
Sbjct: 187 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKEDELLKSCQRTYQAAME-R 245

Query: 369 PDAIIIDANNTGAR-------TCDYLEMLGYHVYRVLGQ---------------KRAVDL 406
              I+ D+   GA          D  +    +  RV  Q                   + 
Sbjct: 246 DADIVYDSIGVGASAGAKFSEINDDRKRENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 305

Query: 407 EFCRNRRTELHVKMADWLE-FASLINHSG--LIQNLKSL----------------KSFIV 447
           +F  N + +    +AD      + IN+    L+  L S+                     
Sbjct: 306 DFFANLKAQAWWLVADRFRNTFNAINNGEQYLVDELISIDSRCPLLEKLKLELTTPHRDF 365

Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
              G + +ESK+    +   S + +D  +  FA      D
Sbjct: 366 DRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 405


>gi|260557981|ref|ZP_05830193.1| phage terminase large subunit [Acinetobacter baumannii ATCC 19606]
 gi|260408491|gb|EEX01797.1| phage terminase large subunit [Acinetobacter baumannii ATCC 19606]
          Length = 529

 Score =  109 bits (272), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 52/239 (21%), Positives = 90/239 (37%), Gaps = 31/239 (12%)

Query: 275 LDSDVTRVEVCGQF---PQQDIDSFIPLNIIEEALNREPCPDPYAPLI--------MGCD 323
           L   +    + G F    + D    IP   +E A  R    +    L          G D
Sbjct: 280 LPEPLRSQMLYGDFGAGIEDDPWQVIPTEWVEAAQARWKPLEDMRILHRGDFKMDSYGLD 339

Query: 324 IAEEGGDNTVVVLRRGPVIEHLFDWSKTDLR---TTNNKISGLVEKYRPDAIIIDANNTG 380
           +A  GGDNT+   R G   ++       D     T+ +     V  + P  I +D    G
Sbjct: 340 VARGGGDNTIGFARYGYWYDNPNVLEGKDSPDGPTSASFAVSHVRDHAP--IHVDVIGVG 397

Query: 381 ARTCDYLEMLGYHVYRVLGQKRAVDLEF-----CRNRRTELHVKMADWLE-----FASLI 430
           A T D+L+  G HV  V  +  A   +        N R++L  +  + L+       +L 
Sbjct: 398 ASTYDFLKQSGIHVVPVDVRNAATAFDRSGQLSFYNLRSQLWWQFREALDPAYGSTVALP 457

Query: 431 NHSGLIQNLKSLKSFIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSDMD 486
               L+ +L + + + +  T ++ +ES+     +  +S DY   ++    + P R  M 
Sbjct: 458 PEPKLLADLTAPR-WGLQGT-KIKVESREEIIKRIGRSPDYGSAIINAQIDTPKRHIMQ 514


>gi|293396491|ref|ZP_06640767.1| phage terminase large subunit [Serratia odorifera DSM 4582]
 gi|291420755|gb|EFE94008.1| phage terminase large subunit [Serratia odorifera DSM 4582]
          Length = 430

 Score =  109 bits (272), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 53/343 (15%), Positives = 107/343 (31%), Gaps = 57/343 (16%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252
            + N+EA    +     +   + +  +  +++   NPR  +   +      P  D    +
Sbjct: 80  VLWNEEAHAMTEAQWEVLEPTIRKEGSECWFL--FNPRLTTDFVWRNFVVAPPPDTLVRK 137

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +    I A    D+++      G     D ++ I L+ IE A++  +  
Sbjct: 138 INYDENPFLSRTIMNVIEAAKARDAEMFEHVYLGMPRTDDDEAIIKLSWIEAAVDAHKAL 197

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368
             +P     +G D+A+ G D    V   G V     +W   + +L  +  +   +  + R
Sbjct: 198 NIEPAGHRRVGFDVADSGADKCANVYAHGSVALWADEWKAREDELMKSCKRTYNVALE-R 256

Query: 369 PDAIIIDANNTGARTCDYLEMLG-------------YHVYRVLGQKRAVDLE-------- 407
             AII D+   GA +      +                 ++        + E        
Sbjct: 257 EAAIIYDSIGVGASSGSKFAEINEERESASDWNVRTVDYFKFNAGGAVFEPERDYQPGIT 316

Query: 408 ---FCRNRRTELHVKMADWL----------EFASLINHSGLI------------QNLKSL 442
              F  N + +    +AD            E         LI            +   S 
Sbjct: 317 NKDFFANIKAQAWWLVADRFRNTYNVINGKEKRESFADDQLISIDSACPLLDKLKFELST 376

Query: 443 KSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPPR 482
                   G + +E+K   + +   S + +D  +  FA     
Sbjct: 377 PKRDFDKNGRVKVETKDDLKKRDIPSPNVADAFIMAFAPIETP 419


>gi|323175059|gb|EFZ60673.1| phage terminase large subunit [Escherichia coli LT-68]
          Length = 399

 Score =  109 bits (271), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 50/340 (14%), Positives = 103/340 (30%), Gaps = 52/340 (15%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-Q 252
            +  +EA    +     +   + +  +  ++    NP  ++   +  F     +     +
Sbjct: 54  VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEGTLIRK 111

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D D  +    G     D  + I L+ IE A++  +  
Sbjct: 112 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKTL 171

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368
             +P     +G D+A+ G D    V R G V+    +W   + +L  +  +      +  
Sbjct: 172 NFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAALERE 231

Query: 369 PDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRAVDL 406
            D I+ D+   GA        +              +  R                  + 
Sbjct: 232 AD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 290

Query: 407 EFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ------------NLKSLKSFIV 447
           +F  N + +    +AD        +          LI                +      
Sbjct: 291 DFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSRCPLLEKLKLELTTPHRDF 350

Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
              G + +ESK+    +   S + +D  +  FA      D
Sbjct: 351 DRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 390


>gi|289581321|ref|YP_003479787.1| hypothetical protein Nmag_1649 [Natrialba magadii ATCC 43099]
 gi|289530874|gb|ADD05225.1| hypothetical protein Nmag_1649 [Natrialba magadii ATCC 43099]
          Length = 602

 Score =  107 bits (268), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 77/512 (15%), Positives = 152/512 (29%), Gaps = 113/512 (22%)

Query: 49  SAPRSWQLEFMEVVDAHCLNSVNNPNPEVF----KGAISAGRGIGKTTLNAWLVLWLMST 104
           +   +W  + +E      +               +  +    G+GK+ + A + +  ++ 
Sbjct: 22  AGDETWLEDAIEDYLGITVTGAQAQICRGIAANERLLVVTANGLGKSYILAAITIVWLTV 81

Query: 105 RPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI 164
           R        + +E ++K T    V                     P  + S      +  
Sbjct: 82  RYPACSFATSGTERKMKRTYCKPVENLHGDARVPL----------PGEYKSRPERIEIDG 131

Query: 165 DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA--SGTPDVINLGILGFLTERNANR 222
           + +H+       S +      G H  Y +AII +EA        +   +   +T+     
Sbjct: 132 EPEHFFEAA---SPQDAGELEGVHAAYTLAII-EEADKKDVDAEVLDAMKSLVTDEQDRI 187

Query: 223 FWIMTSNP---------------RRLSGKF-------YEIF------------------- 241
             I  +NP                  + K+       ++                     
Sbjct: 188 --IAIANPPKDETNSIYPILDEQDDPTSKWEVLEFSSFDSHNVQVELGNVDDEKVDGLAS 245

Query: 242 -NKPLDDWKRF-----------------QIDTRTVEGI--------DPSFHEGIIARYGL 275
            +K  DDW+ +                 ++D               +P F   +  R+  
Sbjct: 246 LHKIQDDWEDYNKEPWPGAETARTLSAPKLDADGNPVFSHSDALEDNPEFRTDLDQRWYR 305

Query: 276 DSDVTRVEVCGQFP--QQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTV 333
                     G  P      +    ++ +  A  R+  P    P   G D+A +GGD T 
Sbjct: 306 -------RRAGIIPPGGASKNRPFTIDDVNAAWGRDWQPV-GRPQATGIDVARDGGDRTP 357

Query: 334 VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYH 393
           V+   G V+E  ++    D     + ++ ++E    + + IDA   G+   D +      
Sbjct: 358 VISVDGDVLEVRYEEPCHDYTAHADDVTDVLEDDPDNPMPIDAVGEGSGFADIMHQRFPE 417

Query: 394 VYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSL------KSFIV 447
             R      A D    ++   E    +  WL+    IN   L + L         +   +
Sbjct: 418 TIRFKSLGVAEDSANYKDCWAEGVALLGKWLQNGGSINDRTLREELLVAARTLEYEETHI 477

Query: 448 PNTGE-----LAIESK---RVKGAKSTDYSDG 471
            + G      L +  K   + +  +S DY D 
Sbjct: 478 GSRGTNGEDVLKLTPKEKVKERLGRSPDYLDA 509


>gi|213426918|ref|ZP_03359668.1| putative prophage terminase large subunit [Salmonella enterica
           subsp. enterica serovar Typhi str. E02-1180]
          Length = 374

 Score =  107 bits (267), Expect = 4e-21,   Method: Composition-based stats.
 Identities = 57/340 (16%), Positives = 110/340 (32%), Gaps = 52/340 (15%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252
            +  +EA    +     +   + +  +  ++    NP  ++   +      P +D    +
Sbjct: 29  VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 86

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D +       G     D  + I L+ IE A++  +  
Sbjct: 87  INYDENPFLSDTMLKVIDAARRRDPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 146

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P     +G D+A+ G D    V R G VI    +W   + +L  +  +      + R
Sbjct: 147 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKEDELLKSCQRTYQAAME-R 205

Query: 369 PDAIIIDANNTGAR-------TCDYLEMLGYHVYRVLGQ---------------KRAVDL 406
              I+ D+   GA          D  +    +  RV  Q                   + 
Sbjct: 206 DADIVYDSIGVGASAGAKFSEINDDRKRENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 265

Query: 407 EFCRNRRTELHVKMADWLE-FASLINHSG--LIQNLKSL----------------KSFIV 447
           +F  N + +    +AD      + IN+    L+  L S+                     
Sbjct: 266 DFFANLKAQAWWLVADRFRNTFNAINNGEQYLVDELISIDSRCPLLEKLKLELTTPHRDF 325

Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
              G + +ESK+    +   S + +D  +  FA      D
Sbjct: 326 DRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 365


>gi|332091158|gb|EGI96248.1| phage terminase large subunit [Shigella dysenteriae 155-74]
          Length = 346

 Score =  107 bits (267), Expect = 5e-21,   Method: Composition-based stats.
 Identities = 50/340 (14%), Positives = 103/340 (30%), Gaps = 52/340 (15%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-Q 252
            +  +EA    +     +   + +  +  ++    NP  ++   +  F     +     +
Sbjct: 1   MLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEGTLIRK 58

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D D  +    G     D  + I L+ IE A++  +  
Sbjct: 59  INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKTL 118

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368
             +P     +G D+A+ G D    V R G V+    +W   + +L  +  +      +  
Sbjct: 119 NFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAALERE 178

Query: 369 PDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRAVDL 406
            D I+ D+   GA        +              +  R                  + 
Sbjct: 179 AD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 237

Query: 407 EFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ------------NLKSLKSFIV 447
           +F  N + +    +AD        +          LI                +      
Sbjct: 238 DFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSRCPLLEKLKLELTTPHRDF 297

Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
              G + +ESK+    +   S + +D  +  FA      D
Sbjct: 298 DRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 337


>gi|328952976|ref|YP_004370310.1| hypothetical protein Desac_1270 [Desulfobacca acetoxidans DSM
           11109]
 gi|328453300|gb|AEB09129.1| hypothetical protein Desac_1270 [Desulfobacca acetoxidans DSM
           11109]
          Length = 466

 Score =  107 bits (266), Expect = 6e-21,   Method: Composition-based stats.
 Identities = 74/382 (19%), Positives = 126/382 (32%), Gaps = 58/382 (15%)

Query: 51  PRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISV 110
           P  WQ +F+          V+ P   +   +  +    GK+T  A L L      PG  +
Sbjct: 27  PDPWQQDFL----------VSRPEQALLLCSRQS----GKSTSAAALALHEALFHPGALI 72

Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170
           + L+ S  Q    L+ + +     LP+         +   +    +  H S  I      
Sbjct: 73  LLLSPSLRQ-SQELFRKAAGLYQRLPHAP------AACRTSALRLEFDHGSRIISLPGQE 125

Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230
              R +SE R              ++ DEA+  PD +   +   L            S P
Sbjct: 126 ETIRGFSEVR-------------LLVIDEAALVPDELYYAVRPMLAVSRGR--LTALSTP 170

Query: 231 RRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290
               G FY  + +  D W+R+ I       I   F      +  L +   R E   +F  
Sbjct: 171 AGKRGWFYHCYTEGGDQWQRYTIPATQCPRISADFLAA--EQRSLPAAWFRAEYFCEF-G 227

Query: 291 QDIDSFIPLNIIEEALNREPCP--------DPYAPLIMGCDIAEEGGDNTVVVLRRGPVI 342
           +  +   P ++++ A   +  P         P     +G D+ +    + + ++ R P +
Sbjct: 228 EAANQLFPAHLLQTAQCSQVSPLFAEITPSPPTGTFFIGLDLGQSQDYSALTIIHRSPAL 287

Query: 343 E----HLFDWSKTDLRTTNNKISGLVEKY-------RPDAIIIDANNTGARTCDYLEMLG 391
                HL    +  LRT    I   V +            +I+D    GA   D L   G
Sbjct: 288 PDPPCHLRHLQRFPLRTPYPDIVRQVRELLQQPQIGPNPLLIVDKTGVGAPVVDMLTQAG 347

Query: 392 YHVYRVLGQKRAVDLEFCRNRR 413
            + Y V         +  R+ R
Sbjct: 348 MNPYAVTIHGGEAVSQNGRDLR 369


>gi|289829424|ref|ZP_06547036.1| putative prophage terminase large subunit [Salmonella enterica
           subsp. enterica serovar Typhi str. E98-3139]
          Length = 346

 Score =  107 bits (266), Expect = 6e-21,   Method: Composition-based stats.
 Identities = 57/340 (16%), Positives = 110/340 (32%), Gaps = 52/340 (15%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252
            +  +EA    +     +   + +  +  ++    NP  ++   +      P +D    +
Sbjct: 1   MLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 58

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D +       G     D  + I L+ IE A++  +  
Sbjct: 59  INYDENPFLSDTMLKVIDAARRRDPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 118

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P     +G D+A+ G D    V R G VI    +W   + +L  +  +      + R
Sbjct: 119 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKEDELLKSCQRTYQAAME-R 177

Query: 369 PDAIIIDANNTGAR-------TCDYLEMLGYHVYRVLGQ---------------KRAVDL 406
              I+ D+   GA          D  +    +  RV  Q                   + 
Sbjct: 178 DADIVYDSIGVGASAGAKFSEINDDRKRENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 237

Query: 407 EFCRNRRTELHVKMADWLE-FASLINHSG--LIQNLKSL----------------KSFIV 447
           +F  N + +    +AD      + IN+    L+  L S+                     
Sbjct: 238 DFFANLKAQAWWLVADRFRNTFNAINNGEQYLVDELISIDSRCPLLEKLKLELTTPHRDF 297

Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
              G + +ESK+    +   S + +D  +  FA      D
Sbjct: 298 DRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 337


>gi|211731806|gb|ACJ10127.1| terminase [Bacteriophage APSE-3]
          Length = 469

 Score =  107 bits (266), Expect = 6e-21,   Method: Composition-based stats.
 Identities = 64/349 (18%), Positives = 102/349 (29%), Gaps = 67/349 (19%)

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ--- 252
             +EA    +     ++  + +  +   W    NP    G  Y+ F KP       Q   
Sbjct: 105 WVEEAETVSEKSLDTLIPTIRKPGSE-LWFSF-NPAEEDGAVYKRFVKPYKAIIDKQGYY 162

Query: 253 ---------IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303
                    +       +              +    R    G+      D+ I    +E
Sbjct: 163 EDDDLYVGKVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANYEDALIQPEWVE 222

Query: 304 EALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS 361
            A++        P    ++  D A+ G D   +  R G +IE    WS+ D+        
Sbjct: 223 AAIDAHIKLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSEGDVADATMTAF 282

Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLEMLG----YHVYRVLGQKRAVD------------ 405
                YR D  I D    GA T       G      V    G   + D            
Sbjct: 283 DEAFDYRADDFIYDNIGLGAGTVKTHLRHGNDGNKMVVTGFGAGDSPDYPDEIYVPGNGE 342

Query: 406 ------------LEFCRNRRTELHVKMAD-------WLEFASLINHSGLI---------- 436
                        +  RN+R +  V +AD        +E    ++   LI          
Sbjct: 343 YLPSSNNDDRTHRDTFRNKRAQYWVYLADRFYKTWRAVEKGEYLDPEALISLSSKIAKLS 402

Query: 437 ---QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479
                L   +    P    + + SK   R+KG KS + +D LM +FA  
Sbjct: 403 QLKSELIKQQRKRTPGNRLIQLMSKDEMRLKGIKSPNMADTLMMSFANP 451


>gi|161614489|ref|YP_001588454.1| hypothetical protein SPAB_02238 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
 gi|161363853|gb|ABX67621.1| hypothetical protein SPAB_02238 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
          Length = 441

 Score =  106 bits (265), Expect = 7e-21,   Method: Composition-based stats.
 Identities = 56/340 (16%), Positives = 109/340 (32%), Gaps = 52/340 (15%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252
            +  +EA    +     +   + +  +  ++    NP  ++   +      P +D    +
Sbjct: 96  VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 153

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A      +       G     D  + I L+ IE A++  +  
Sbjct: 154 INYDENPFLSDTMLKVIDAARRRYPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 213

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P     +G D+A+ G D    V R G VI    +W   + +L  +  +      + R
Sbjct: 214 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKEDELLKSCQRTYQAAME-R 272

Query: 369 PDAIIIDANNTGAR-------TCDYLEMLGYHVYRVLGQ---------------KRAVDL 406
              I+ D+   GA          D  +    +  RV  Q                   + 
Sbjct: 273 DADIVYDSIGVGASAGAKFSEINDDRKRENAYARRVNYQRFNAGAGVHEPDDEYNGIPNK 332

Query: 407 EFCRNRRTELHVKMADWLE-FASLINHSG--LIQNLKSL----------------KSFIV 447
           +F  N + +    +AD      + IN+    L+  L S+                     
Sbjct: 333 DFFANLKAQAWWLVADRFRNTFNAINNGEQYLVDELISIDSRCPLLEKLKLELTTPHRDF 392

Query: 448 PNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
              G + +ESK+    +   S + +D  +  FA      D
Sbjct: 393 DRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 432


>gi|148826888|ref|YP_001291641.1| phage terminase large subunit [Haemophilus influenzae PittGG]
 gi|148718130|gb|ABQ99257.1| predicted phage terminase large subunit [Haemophilus influenzae
           PittGG]
          Length = 366

 Score =  106 bits (265), Expect = 7e-21,   Method: Composition-based stats.
 Identities = 56/355 (15%), Positives = 114/355 (32%), Gaps = 37/355 (10%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GK+   A  ++      P + V+C              E+ K +S    +   + 
Sbjct: 27  GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQMLAD- 72

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
           Q   L    ++       +G +   ++      +     +  G        +  +E    
Sbjct: 73  QIEMLGLRAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGID 262
                  ++  + E  +    I++ NP+ +    Y+ F   P +  K   ++ +      
Sbjct: 128 SKESWDILIPTIREDGSQI--IVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185

Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIM 320
               E +      D ++ R    G+       + I    IE A++   +          +
Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGEPVADSDLAIIKPVWIEYAVDAHLKLGFTAKGMKKV 245

Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380
           G D+A+EG D+       G V+  +  W   D+  + N+ +    K++ D II D+   G
Sbjct: 246 GFDVADEGADSNANAFVHGSVVLDIEVWKNGDVIDSANRTNQSAVKFKADLIIFDSIGVG 305

Query: 381 ARTCDYLEMLG--YHVYRVLGQKRAVDLE-----------FCRNRRTELHVKMAD 422
           A    + + L     V            E              N + +    + D
Sbjct: 306 AGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRD 360


>gi|212499721|ref|YP_002308529.1| terminase [Bacteriophage APSE-2]
 gi|238898754|ref|YP_002924436.1| APSE-2 prophage; terminase [Bacteriophage APSE-2]
 gi|211731690|gb|ACJ10178.1| terminase [Bacteriophage APSE-2]
 gi|229466514|gb|ACQ68288.1| APSE-2 prophage; terminase [Bacteriophage APSE-2]
          Length = 469

 Score =  106 bits (264), Expect = 9e-21,   Method: Composition-based stats.
 Identities = 64/349 (18%), Positives = 103/349 (29%), Gaps = 67/349 (19%)

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ--- 252
             +EA    +     ++  + +  +   W    NP    G  Y+ F KP       Q   
Sbjct: 105 WVEEAETVSEKSLDTLIPTIRKPGSE-LWFSF-NPAEEDGAVYKRFVKPYKAIIDKQGYY 162

Query: 253 ---------IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303
                    +       +              +    R    G+      D+ I    +E
Sbjct: 163 EDDDLYVGKVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANYEDALIQPEWVE 222

Query: 304 EALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS 361
            A++        P    ++  D A+ G D   +  R G +IE    WS+ D+        
Sbjct: 223 AAIDAHIKLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSEGDVADATMTAF 282

Query: 362 GLVEKYRPDAIIIDANNTGARTC-DYLEML---GYHVYRVLGQKRAVD------------ 405
                YR D  I D    GA T   +L         V    G   + D            
Sbjct: 283 DEAFDYRADDFIYDNIGLGAGTVKTHLRHSNDGNKIVVTGFGAGDSPDYPDEIYVPGNGE 342

Query: 406 ------------LEFCRNRRTELHVKMAD-------WLEFASLINHSGLI---------- 436
                        +  RN+R +  V +AD        +E    ++   LI          
Sbjct: 343 YLPSSNNDDRTHRDTFRNKRAQYWVYLADRFYKTWRAVEKGEYLDPEALISLSSKIAKLS 402

Query: 437 ---QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479
                L   +    P    + + SK   R+KG KS + +D LM +FA  
Sbjct: 403 QLKSELIKQQRKRTPGNRLIQLMSKDEMRLKGIKSPNMADTLMMSFANP 451


>gi|294663744|gb|ADF29298.1| terminase [Pseudomonas phage JG024]
          Length = 460

 Score =  106 bits (264), Expect = 9e-21,   Method: Composition-based stats.
 Identities = 63/351 (17%), Positives = 119/351 (33%), Gaps = 62/351 (17%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252
            +  +EA          I   + + N+   WI   NP  ++   Y+ F  KP  D     
Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWI-IFNPNEVTDFVYQNFVVKPPKDSCVKM 172

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ-QDIDSFIPLNIIEEALN--RE 309
           I+      +  +  + I   Y  D +     + G  P+     S I L  I  A++  ++
Sbjct: 173 INWNENPFLSETMLKVIHEAYERDREQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLVEKY 367
              +P     +G D+A++G D     L  G VI  + +W     +L  +++++  L  K 
Sbjct: 232 LGWEPAGSKRIGFDVADDGDDANATTLMHGNVIMEVDEWDGLEDELLKSSSRVYNLA-KL 290

Query: 368 RPDAIIIDANNTGARTCDYLEMLG------YHVYRVLGQKRAVD---------------- 405
           +  ++  D+   GA        L         +Y       AVD                
Sbjct: 291 KGASVTYDSIGVGAHVGSKFAELNDASPDFKLIYDPFNAGGAVDKPDDVYMKLPHTTIKN 350

Query: 406 LEFCRNRRTELHVKMA-------DWLEFASLINHSGLIQ----------------NLKSL 442
            +   N + +   ++A       + +E   +     LI                  L S 
Sbjct: 351 KDHFSNIKAQKWEEVATRFRKTYEAVEHGKVYPFDELISINSETIHPDKLNQLCIELSSP 410

Query: 443 KSFIVPNTGELAIESKR----VKGAKSTDYSDGLMYTFAENP--PRSDMDF 487
           +   +   G   +ESK+     +  KS + +D ++ +       P+   DF
Sbjct: 411 RK-DLDMNGRFKVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF 460


>gi|211731737|gb|ACJ10086.1| terminase [Bacteriophage APSE-5]
          Length = 469

 Score =  105 bits (263), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 64/349 (18%), Positives = 104/349 (29%), Gaps = 67/349 (19%)

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ--- 252
             +EA    +     ++  + +  +   W    NP    G  Y+ F KP  +    Q   
Sbjct: 105 WVEEAETVSEKSLDTLIPTIRKPGSE-LWFSF-NPAEEDGAVYKRFVKPYKELIDTQGYY 162

Query: 253 ---------IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303
                    +       +              +    R    G+      D+ I    +E
Sbjct: 163 EDDDLYVGKVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANYEDALIQPEWVE 222

Query: 304 EALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS 361
            A++        P    ++  D A+ G D   +  R G +IE    WS+ D+        
Sbjct: 223 AAIDAHIKLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSEGDVADATMTAF 282

Query: 362 GLVEKYRPDAIIIDANNTGARTC-DYLEML---GYHVYRVLGQKRAVD------------ 405
                YR D  I D    GA T   +L         V    G   + D            
Sbjct: 283 DEAFDYRADDFIYDNIGLGAGTVKTHLRHSNDGNKIVVTGFGAGDSPDYPDEIYVPGNGE 342

Query: 406 ------------LEFCRNRRTELHVKMAD-------WLEFASLINHSGLI---------- 436
                        +  RN+R +  V +AD        +E    ++   LI          
Sbjct: 343 YLPSSNNDDRTHRDTFRNKRAQYWVYLADRFYKTWRAVEKGEYLDPEALISLSSKIAKLS 402

Query: 437 ---QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479
                L   +    P    + + SK   R+KG KS + +D LM +FA  
Sbjct: 403 QLKSELIKQQRKRTPGNRLIQLMSKDEMRLKGIKSPNMADTLMMSFANP 451


>gi|284008456|emb|CBA74928.1| phage terminase large subunit [Arsenophonus nasoniae]
          Length = 477

 Score =  105 bits (263), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 81/470 (17%), Positives = 137/470 (29%), Gaps = 104/470 (22%)

Query: 84  AGRGIGKTTLNAWLVLW--------LMSTRPGISVICLANSETQLKTTLWAEVSKWLSLL 135
            GRG  KT   A + L          +  R  ++ I     E  +   L AE+   L L 
Sbjct: 21  GGRGGMKTVSFAKIALITASINKRRFLCLREFMNSI-----EDSVHAVLQAEIET-LRLQ 74

Query: 136 PNKHWFEMQSLSLHPAPW-YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
                 +     ++ + + Y  +      I SKH   +                      
Sbjct: 75  NRFRILDNCIKGINDSIFKYGQLARNIASIKSKHDFDVA--------------------- 113

Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ-- 252
              +EA    +     ++  + +  +   W    NP    G  Y+ F KP  D    +  
Sbjct: 114 -WVEEAETVSEKSLDILIPTIRKPGSE-LWFSF-NPAEEDGAVYKRFVKPYKDIIDDKGY 170

Query: 253 ----------IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
                     +       +              +         G+      D+ I    +
Sbjct: 171 YEDDDLYVGKVSYLDNPWLPEELKNDAEKMKRDNYKKWLHVYGGECDANYDDAIIQPEWV 230

Query: 303 EEALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360
           + A++        P    ++  D A+ G D   +  R G ++E    WS+ D+     K 
Sbjct: 231 DAAIDAHIKLGFKPKGIRVITFDPADSGQDEKALSKRYGVLVEDCVSWSEGDVADATIKA 290

Query: 361 SGLVEKYRPDAIIIDANNTGARTC----------DYLEMLGY------------------ 392
                 YR D  I D    GA T           + + + G+                  
Sbjct: 291 FDEAFDYRADDFIYDNIGLGAGTVKTYLRSSNDGNKMVVTGFGAGDSPDYPDEIYVPGNG 350

Query: 393 HVYRVLGQKRAVDLEFCRNRRTELHVKMAD-------WLEFASLINHSGLI--------- 436
                L      + +  RN+R +  V +AD        +E    I+   LI         
Sbjct: 351 EYIPSLNNDDRTNRDTFRNKRAQYWVYLADRFYKTWCAVEKKEYIDPEELISLSSKIDKL 410

Query: 437 ----QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479
                 L   +    P    + + SK   R KG KS + +D LM +FA  
Sbjct: 411 SQLKSELVKQQRKRTPGNRLIQLISKEEMRSKGIKSPNMADTLMMSFANP 460


>gi|332884414|gb|EGK04674.1| hypothetical protein HMPREF9456_03377 [Dysgonomonas mossii DSM
           22836]
          Length = 450

 Score =  105 bits (261), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 53/302 (17%), Positives = 108/302 (35%), Gaps = 33/302 (10%)

Query: 197 NDEASGTPDVINLGILGFLTERNANRFWI--MTS--NPRRLS--GKFYEIFNKPL--DDW 248
            DE S   +     ++  +    A    I  M    NP +     +FY+     +  DD 
Sbjct: 133 IDENSQITEKCWNIVMSRIRHDVAKNGLIPKMFGACNPTKNFVYNRFYKPHRDGILPDDK 192

Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDIDSFIPLNIIEEALN 307
              Q        +D  + E +         ++R  +  G++ + D D ++ +   +    
Sbjct: 193 AFIQALVTDNPFVDKFYIENLKNL----DPISRARLLDGEW-EYDDDPYVLMQYEKIVDL 247

Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVI--EHLFDWSKTDLRTTNNKISGLVE 365
                    P  M  D+A  G D+T + +  G +   + +    + D  T   +      
Sbjct: 248 FTNSHVSGGPRYMTIDVARLGKDDTTIRIWEGLISIYKKVIPKCRIDDLTVLARKLQTEY 307

Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAV---DLEFCRNRRTELHVKMAD 422
                  I D +  G    D L   G+    V   K      ++   +N R++ + K+A+
Sbjct: 308 SVPNSNTIADEDGVGGGLVDNLRCKGF----VNNSKPLPIYGEVRNYQNLRSQCYFKLAE 363

Query: 423 -------WLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGL 472
                  +L+   +++   +++ L+ +K        +L + +K   +    KSTD +D L
Sbjct: 364 IVNSNLMYLKNEPIVDRERVVKELEQIKQIDADKDTKLKVITKEMLKSILGKSTDEADNL 423

Query: 473 MY 474
           M 
Sbjct: 424 MM 425


>gi|211731828|gb|ACJ10140.1| terminase [Bacteriophage APSE-6]
          Length = 469

 Score =  105 bits (261), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 82/470 (17%), Positives = 132/470 (28%), Gaps = 104/470 (22%)

Query: 84  AGRGIGKTTLNAWLVLW--------LMSTRPGISVICLANSETQLKTTLWAEVSKWLSLL 135
            GRG  KT   A + L          +  R  ++ I     E  +   L AEV   L L 
Sbjct: 12  GGRGGMKTVSFAKIALITASMHKRRFLCLREFMNSI-----EDSVHAVLQAEVET-LGLQ 65

Query: 136 PNKHWFEMQSLSLHPAPW-YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
                       ++ + + Y  +      I SKH   +                      
Sbjct: 66  VRFRVLNSCIEGINDSIFKYGQLARNIASIKSKHDFDVA--------------------- 104

Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ-- 252
              +EA    +     ++  + +  +   +  + NP    G  Y+ F KP       Q  
Sbjct: 105 -WVEEAETVSEKSLDTLIPTIRKPGSELRF--SFNPAEEDGAVYKRFVKPYKAIIDKQGY 161

Query: 253 ----------IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
                     +       +              +    R    G+      D+ I    +
Sbjct: 162 YEDDDLYVGNVSYLDNPWLPVELKNDAQKMKRENYKKWRHVYGGECDANYEDALIQPEWV 221

Query: 303 EEALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360
             A++        P    ++  D A  G D   +  R G +IE    W + D+       
Sbjct: 222 GAAIDAHIKLGFKPSGIRVVTFDPAGSGQDEKALSKRYGVLIEDCVSWLEGDVADATMTA 281

Query: 361 SGLVEKYRPDAIIIDANNTGARTC-DYLEMLG---YHVYRVLGQKRAVDLEF-------- 408
                 YR D  I D    GA T   +L         V    G   + D           
Sbjct: 282 FDEAFDYRADDFIYDNIGLGAGTVKTHLRHSNDGSKMVVTGFGAGDSPDYPHEIYVPGNG 341

Query: 409 ----------------CRNRRTELHVKMAD-------WLEFASLINHSGLI--------- 436
                            RN+R +  V +AD        +E    ++   LI         
Sbjct: 342 EYLPSSNNDDRTHRDTFRNKRAQYWVYLADRFYKTWRAVEKGEYLDPDALISLSSKIAKL 401

Query: 437 ----QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479
                 L   +    P    + + SK   R+KG KS + +D LM +FA  
Sbjct: 402 SQLKSELIKQQRKRTPGNRLIQLMSKDEMRLKGIKSPNMADTLMMSFANP 451


>gi|238027628|ref|YP_002911859.1| Bbp25 [Burkholderia glumae BGR1]
 gi|237876822|gb|ACR29155.1| Bbp25 [Burkholderia glumae BGR1]
          Length = 486

 Score =  105 bits (261), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 43/225 (19%), Positives = 80/225 (35%), Gaps = 30/225 (13%)

Query: 275 LDSDVTRVEVCGQFP---QQDIDSFIPLNIIEEALNREPCPDPYAPLI----MGCDIAEE 327
           L   +    + G F    + D    IP   +  A  R        P I    +G D+A  
Sbjct: 255 LPEPLRSKMLYGDFAAGREDDPWQVIPSEWVRLAQERWRARSR--PRIPMTALGVDVARG 312

Query: 328 GGDNTVVVLRRGPVIEHLFDWSKT---DLRTTNNKISGLVEKYRPDAIIIDANNTGARTC 384
           G D ++   R G   +           D      ++  L  +     + +D    GA   
Sbjct: 313 GQDQSIYTPRYGNWFDEQVCQPGLATPDGFVVAQQVFNL--REPSTLVNLDVVGVGASPF 370

Query: 385 DYL-EMLGYHVYRVLGQKRAVDLEF-----CRNRRTELHVKMADWL-----EFASLINHS 433
           D + +++G  ++ + G  R  +L+        N R  L  +M + L     E  ++    
Sbjct: 371 DIIHQVIGDKIWGISGAARTDELDMSGQFGFVNLRALLWWRMREALDPINGEDLAIPPDP 430

Query: 434 GLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYT 475
            L  +L + + +     G + +ESK   + +  +S D  D  +Y 
Sbjct: 431 ALAADLCAPR-YRKAPRG-ILVESKEEIKKRIGRSPDRGDSAVYA 473


>gi|85059798|ref|YP_455500.1| phage terminase large subunit [Sodalis glossinidius str.
           'morsitans']
 gi|84780318|dbj|BAE75095.1| phage terminase large subunit [Sodalis glossinidius str.
           'morsitans']
          Length = 483

 Score =  104 bits (260), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 48/309 (15%), Positives = 96/309 (31%), Gaps = 44/309 (14%)

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQID 254
             +EA          ++  + +  +   W+   NP+ +    Y+ F   PLDD     + 
Sbjct: 116 WVEEAEAVTKESWDILIPTIRKPGSE-IWVSF-NPKNILDDTYQRFVVNPLDDICLLTVH 173

Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--EPCP 312
                         +      D D+      G+       + I    I  A++       
Sbjct: 174 YTDNPHFPEVLRLEMEECKCKDYDLYLHIWEGEPVADSDLAIIKPLWIAAAVDAHMTLGF 233

Query: 313 DPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI 372
           D      +G D+A+EG D   +   +G V+  L +W + D+  ++N+++    +     I
Sbjct: 234 DAVGEKRLGFDVADEGEDCNALCFVQGSVVLDLDEWHRGDVIASSNRVNRYAIERGITCI 293

Query: 373 IIDANNTGARTCDYLEMLGYHVYRVLGQKRA------------VDLEFCRNRRTELHVKM 420
           I D+   GA    +L+ +     +      A             + +   N + +    +
Sbjct: 294 IYDSIGVGAGVKAHLKRIAAINVKGFNAGEAVKDPDALYMPGKTNKDMFANIKAQAWWAV 353

Query: 421 ADWL--------------EFASLINHSGLI-------------QNLKSLKSFIVPNTGEL 453
            +                + A L     LI             +   S       N G +
Sbjct: 354 RERFYKTWRCIEAKKQDPKAALLYPTDELISLSTTNIKRLEYLKAELSRPRVDYDNNGHV 413

Query: 454 AIESKRVKG 462
            +ESK+   
Sbjct: 414 KVESKKDMK 422


>gi|218148543|ref|YP_002364311.1| terminase, large subunit [Pseudomonas phage 14-1]
 gi|218059739|emb|CAU13815.1| terminase, large subunit [Pseudomonas phage 14-1]
          Length = 460

 Score =  104 bits (260), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 63/351 (17%), Positives = 119/351 (33%), Gaps = 62/351 (17%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252
            +  +EA          I   + + N+   WI   NP  ++   Y+ F  KP  D     
Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWI-IFNPNEVTDFVYQNFVVKPPKDSCVKM 172

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ-QDIDSFIPLNIIEEALN--RE 309
           I+      +  +  + I   Y  D +     + G  P+     S I L  I  A++  ++
Sbjct: 173 INWNENPFLSETMLKVIHEAYERDREQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLVEKY 367
              +P     +G D+A++G D     L  G VI  + +W     +L  +++++  L  K 
Sbjct: 232 LGWEPAGSKRIGFDVADDGEDANATTLMHGNVIMEVDEWDGLEDELLKSSSRVYNLA-KM 290

Query: 368 RPDAIIIDANNTGARTCDYLEMLG------YHVYRVLGQKRAVD---------------- 405
           +  ++  D+   GA        L         +Y       AVD                
Sbjct: 291 KGASVTYDSIGVGAHVGSKFAELNDASPDFKLIYDPFNAGGAVDKPDDIYMKLPHTTIKN 350

Query: 406 LEFCRNRRTELHVKMA-------DWLEFASLINHSGLIQ----------------NLKSL 442
            +   N + +   ++A       + +E   +     LI                  L S 
Sbjct: 351 KDHFSNIKAQKWEEVATRFRKTYEAVEHGKVYPFDELISINSETIHPDKLNQLCIELSSP 410

Query: 443 KSFIVPNTGELAIESKR----VKGAKSTDYSDGLMYTFAENP--PRSDMDF 487
           +   +   G   +ESK+     +  KS + +D ++ +       P+   DF
Sbjct: 411 RK-DLDMNGRFKVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF 460


>gi|218457805|ref|YP_002418810.1| terminase, large subunit [Pseudomonas phage SN]
 gi|218379073|emb|CAT99652.1| terminase, large subunit [Pseudomonas phage SN]
          Length = 460

 Score =  104 bits (259), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 63/351 (17%), Positives = 119/351 (33%), Gaps = 62/351 (17%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252
            +  +EA          I   + + N+   WI   NP  ++   Y+ F  KP  D     
Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWI-IFNPNEVTDFVYQNFVVKPPKDSCVKM 172

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ-QDIDSFIPLNIIEEALN--RE 309
           I+      +  +  + I   Y  D +     + G  P+     S I L  I  A++  ++
Sbjct: 173 INWNENPFLSETMLKVIHEAYERDREQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLVEKY 367
              +P     +G D+A++G D     L  G VI  + +W     +L  +++++  L  K 
Sbjct: 232 LGWEPAGSKRIGFDVADDGEDANATTLMHGNVIMEVDEWDGLEDELLKSSSRVYNLA-KV 290

Query: 368 RPDAIIIDANNTGARTCDYLEMLG------YHVYRVLGQKRAVD---------------- 405
           +  ++  D+   GA        L         +Y       AVD                
Sbjct: 291 KGASVTYDSIGVGAHVGSKFAELNDASPDFKLIYDPFNAGGAVDKPDDVYMKLPHTTIKN 350

Query: 406 LEFCRNRRTELHVKMA-------DWLEFASLINHSGLIQ----------------NLKSL 442
            +   N + +   ++A       + +E   +     LI                  L S 
Sbjct: 351 KDHFSNIKAQKWEEVATRFRKTYEAVEHGKVYPFDELISINSETIHPDKLNQLCIELSSP 410

Query: 443 KSFIVPNTGELAIESKR----VKGAKSTDYSDGLMYTFAENP--PRSDMDF 487
           +   +   G   +ESK+     +  KS + +D ++ +       P+   DF
Sbjct: 411 RK-DLDMNGRFKVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF 460


>gi|9633565|ref|NP_050979.1| P18 [Acyrthosiphon pisum bacteriophage APSE-1]
 gi|6118013|gb|AAF03961.1|AF157835_18 P18 [Endosymbiont phage APSE-1]
          Length = 469

 Score =  104 bits (259), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 64/349 (18%), Positives = 103/349 (29%), Gaps = 67/349 (19%)

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ--- 252
             +EA    +     ++  + +  +   W    NP    G  Y+ F KP  +    Q   
Sbjct: 105 WVEEAETVSEKSLDSLIPTIRKPGSE-LWFSF-NPAEEDGAVYKRFVKPYKELIDTQGYY 162

Query: 253 ---------IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303
                    +       +              +    R    G+      D+ I    +E
Sbjct: 163 EDDDLYVGKVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANYEDALIQPEWVE 222

Query: 304 EALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS 361
            A++        P    ++  D A+ G D   +  R G +IE    WS+ D+        
Sbjct: 223 AAIDAHIKLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSEGDVADATMTAF 282

Query: 362 GLVEKYRPDAIIIDANNTGARTC-DYLEMLGYHVYRVLGQKRAVDLEFC----------- 409
                YR D  I D    GA T   +L         V+    A D               
Sbjct: 283 DDAFDYRADDFIYDNIGLGAGTVKTHLRHSNDGNKMVVTGFGAGDSPDYPDEIYVPGNGE 342

Query: 410 ----------------RNRRTELHVKMAD-------WLEFASLINHSGLI---------- 436
                           RN+R +  V +AD        +E    ++   LI          
Sbjct: 343 YLPSSNNDDRTHRDTFRNKRAQYWVYLADRFYKTWRAVEKGEYLDPDALISLSSKIAKLS 402

Query: 437 ---QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479
                L   +    P    + + SK   R+KG KS + +D LM +FA  
Sbjct: 403 QLKSELIKQQRKRTPGNRLIQLMSKDEMRLKGIKSPNMADTLMMSFANP 451


>gi|197261331|ref|YP_002154147.1| putative terminase, large subunit [Pseudomonas phage LBL3]
 gi|197244421|emb|CAR31156.1| putative terminase, large subunit [Pseudomonas phage LBL3]
          Length = 460

 Score =  104 bits (259), Expect = 4e-20,   Method: Composition-based stats.
 Identities = 63/351 (17%), Positives = 119/351 (33%), Gaps = 62/351 (17%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252
            +  +EA          I   + + N+   WI   NP  ++   Y+ F  KP  D     
Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWI-IFNPNEVTDFVYQNFVVKPPKDSCVKM 172

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ-QDIDSFIPLNIIEEALN--RE 309
           I+      +  +  + I   Y  D +     + G  P+     S I L  I  A++  ++
Sbjct: 173 INWNENPFLSETMLKVIHEAYERDREQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLVEKY 367
              +P     +G D+A++G D     L  G VI  + +W     +L  +++++  L  K 
Sbjct: 232 LGWEPAGSKRIGFDVADDGEDANATTLMHGNVIMEVDEWDGLEDELLKSSSRVYNLA-KM 290

Query: 368 RPDAIIIDANNTGARTCDYLEMLG------YHVYRVLGQKRAVD---------------- 405
           +  ++  D+   GA        L         +Y       AVD                
Sbjct: 291 KGASVTYDSIGVGAHVGSKFAELNDASPDFKLIYDPFNAGGAVDKPDDIYMKLPHTTIKN 350

Query: 406 LEFCRNRRTELHVKMA-------DWLEFASLINHSGLIQ----------------NLKSL 442
            +   N + +   ++A       + +E   +     LI                  L S 
Sbjct: 351 KDHFSNIKAQKWEEVATRFRKTYEAVEHGKVYPFDELISINSETIHPDKLNQLCIELSSP 410

Query: 443 KSFIVPNTGELAIESKR----VKGAKSTDYSDGLMYTFAENP--PRSDMDF 487
           +   +   G   +ESK+     +  KS + +D ++ +       P+   DF
Sbjct: 411 RK-DLDMNGRFKVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF 460


>gi|149408318|ref|YP_001294421.1| conserved hypothetical protein ORF004 [Pseudomonas phage F8]
 gi|219523873|ref|YP_002455934.1| terminase large subunit [Pseudomonas phage PB1]
 gi|190333469|gb|ACE73724.1| terminase large subunit [Pseudomonas phage PB1]
          Length = 460

 Score =  104 bits (259), Expect = 4e-20,   Method: Composition-based stats.
 Identities = 62/351 (17%), Positives = 117/351 (33%), Gaps = 62/351 (17%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252
            +  +EA          I   + + N+   WI   NP  ++   Y+ F  KP  D     
Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWI-IFNPNEVTDFVYQNFVVKPPKDAFVKM 172

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ-QDIDSFIPLNIIEEALN--RE 309
           I+      +  +  + I   Y  D D     + G  P+     S I L  I  A++  ++
Sbjct: 173 INWNENPFLSETMLKVIHEAYERDKDQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLVEKY 367
              +P     +G D+A++G D     L  G VI  + +W     +L  +++++  L  K 
Sbjct: 232 LGWEPAGSKRIGFDVADDGEDANATTLMHGNVIMEVDEWDGLEDELLKSSSRVYNLA-KM 290

Query: 368 RPDAIIIDANNTGARTCDYLEMLG---------YHVYRVLGQKRAVD------------- 405
           +  ++  D+   GA        L          Y  +   G     D             
Sbjct: 291 KGASVTYDSIGVGAHVGSKFAELNDSSPDFKLTYDPFNAGGAVDKPDDIYMKLPHTTIKN 350

Query: 406 LEFCRNRRTELHVKMA-------DWLEFASLINHSGLIQ----------------NLKSL 442
            +   N + +   ++A       + +    +     LI                  L S 
Sbjct: 351 KDHFSNIKAQKWEEVATRFRKTYEAVVHGKVYPFDELISINSETIHPDKLNQLCIELSSP 410

Query: 443 KSFIVPNTGELAIESKR----VKGAKSTDYSDGLMYTFAENP--PRSDMDF 487
           +   +   G   +ESK+     +  KS + +D ++ +       P+   DF
Sbjct: 411 RK-DLDMNGRFKVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF 460


>gi|157265379|ref|YP_001467938.1| terminase large subunit [Thermus phage P23-45]
 gi|156905274|gb|ABU96918.1| terminase large subunit [Thermus phage P23-45]
          Length = 485

 Score =  104 bits (258), Expect = 4e-20,   Method: Composition-based stats.
 Identities = 69/395 (17%), Positives = 134/395 (33%), Gaps = 47/395 (11%)

Query: 85  GRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144
           GR  GK+   +   ++ +  RPG     +A +  Q +      V K   L       E+Q
Sbjct: 38  GRQSGKSEAASVEAVFELFARPGSQGWIIAPTYDQAEIIFGRVVEKVERLAEVFPATEVQ 97

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMC-RTYSEERPDTFVGHHNTYGMAIINDEASGT 203
                                +K  +T   R  S +RPD   G        +I DEA+  
Sbjct: 98  LQRRRLRLLVHHYDRPVNAPGAKRVATSEFRGKSADRPDNLRGATLD---FVILDEAAMI 154

Query: 204 PDVIN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIF-----------------NKPL 245
           P  +    I   L+ R+   + ++ S P+ L+  FYE F                 N+  
Sbjct: 155 PFSVWSEAIEPTLSVRDG--WALIISTPKGLN-WFYEFFLMGWRGGLKEGIPNSGVNQTH 211

Query: 246 DDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII--- 302
            D++ F   +  V      ++  +  R  +     R E   +F       F  L+++   
Sbjct: 212 PDFESFHAASWDVWPERREWY--MERRLYIPDLEFRQEYGAEFVSHSNSVFSGLDMLILL 269

Query: 303 -EEALNREPCPDPYAP---LIMGCDIAEEGGDN--TVVVLRRGPVIEHLFDWSKTDLRTT 356
             E        + Y P     +G D  +    +  +V+ L  G ++  L   +       
Sbjct: 270 PYERRGTRLVVEDYRPDHIYCIGADFGKNQDYSVFSVLDLDTGAIV-CLERMNGATWSDQ 328

Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTEL 416
             ++  L E Y    ++ D    G    + L+  G +   +  +  +V  +   N     
Sbjct: 329 VARLKALSEDYGHAYVVADTWGVGDAIAEELDAQGINYTPLPVKSSSVKEQLISN----- 383

Query: 417 HVKMADWLEFA--SLINHSGLIQNLKSLKSFIVPN 449
              +A  +E    ++ N   ++  L++ + +   +
Sbjct: 384 ---LALLMEKGQVAVPNDKTILDELRNFRYYRTAS 415


>gi|211731785|gb|ACJ10115.1| terminase [Bacteriophage APSE-7]
          Length = 469

 Score =  104 bits (258), Expect = 5e-20,   Method: Composition-based stats.
 Identities = 82/470 (17%), Positives = 133/470 (28%), Gaps = 104/470 (22%)

Query: 84  AGRGIGKTTLNAWLVLW--------LMSTRPGISVICLANSETQLKTTLWAEVSKWLSLL 135
            GRG  KT   A + L          +  R  ++ I     E  +   L AEV   L L 
Sbjct: 12  GGRGGMKTVSFAKIALITAAMHKRRFLCLREFMNSI-----EDSVHAVLQAEVET-LGLH 65

Query: 136 PNKHWFEMQSLSLHPAPW-YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
                       ++ + + Y  +      I SKH   +                      
Sbjct: 66  ARFRVLNSCIEGINASIFKYGQLARNIASIKSKHDFDVA--------------------- 104

Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ-- 252
              +EA    +     ++  + +  +   W    NP    G  Y+ F KP       +  
Sbjct: 105 -WVEEAETVSEKSLDTLISTIRKPGSE-LWFSF-NPSEEDGAVYQRFVKPYKAIIDKKGY 161

Query: 253 ----------IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
                     +       +              +    R    G+      D+ I    +
Sbjct: 162 YEDDDLYVGNVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANYDDALIQPEWV 221

Query: 303 EEALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360
           + A++        P    ++  D A+ G D   +  R G +IE    WS+ D+       
Sbjct: 222 DAAIDAHIKLGFPPRGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSEGDVADATITA 281

Query: 361 SGLVEKYRPDAIIIDANNTGARTC-DYLEMLGYHVYRVLGQKRAVDLEFC---------- 409
                 YR D  I D    GA T   +L         V+    A D              
Sbjct: 282 FDEAFDYRADDFIYDNIGLGAGTVKTHLRHSNDGNKMVVTGFGAGDSPDYPDEVYVPSNA 341

Query: 410 -----------------RNRRTELHVKMAD-------WLEFASLINHSGLI--------- 436
                            RN+  +  V +AD        +E    ++   LI         
Sbjct: 342 EYLPSSNNDDRTHRDTFRNKHAQYWVYLADRFYKTWRAVEKGEYLDPDELISLSSKIEKL 401

Query: 437 ----QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479
                 L       +P    + + SK   R+KG KS + +D LM +FA  
Sbjct: 402 SQLKSELVKQPRKRMPGNRLIQLMSKDEMRLKGIKSPNMADTLMMSFANP 451


>gi|197261421|ref|YP_002154236.1| putative terminase, large subunit [Pseudomonas phage LMA2]
 gi|197244511|emb|CAR31245.1| putative terminase, large subunit [Pseudomonas phage LMA2]
          Length = 460

 Score =  104 bits (258), Expect = 5e-20,   Method: Composition-based stats.
 Identities = 62/351 (17%), Positives = 119/351 (33%), Gaps = 62/351 (17%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252
            +  +EA          I   + + N+   WI   NP  ++   Y+ F  KP  D     
Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWI-IFNPNEVTDFVYQNFVVKPPKDSCVKM 172

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ-QDIDSFIPLNIIEEALN--RE 309
           I+      +  +  + I   Y  D +     + G  P+     S I L  I  A++  ++
Sbjct: 173 INWNENPFLSETMLKVIHEAYERDREQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLVEKY 367
              +P     +G D+A++G D     L  G +I  + +W     +L  +++++  L  K 
Sbjct: 232 LGWEPAGSKRIGFDVADDGEDANATTLMHGNIIMEVDEWDGLEDELLKSSSRVYNLA-KM 290

Query: 368 RPDAIIIDANNTGARTCDYLEMLG------YHVYRVLGQKRAVD---------------- 405
           +  ++  D+   GA        L         +Y       AVD                
Sbjct: 291 KGTSVTYDSIGVGAHVGSKFAELNDASPDFKLIYDPFNAGGAVDKPDDVYMKLPHTTIKN 350

Query: 406 LEFCRNRRTELHVKMA-------DWLEFASLINHSGLIQ----------------NLKSL 442
            +   N + +   ++A       + +E   +     LI                  L S 
Sbjct: 351 KDHFSNIKAQKWEEVATRFRKTYEAVEHGKVYPFDELISINSETIHPDKLNQLCIELSSP 410

Query: 443 KSFIVPNTGELAIESKR----VKGAKSTDYSDGLMYTFAENP--PRSDMDF 487
           +   +   G   +ESK+     +  KS + +D ++ +       P+   DF
Sbjct: 411 RK-DLDMNGRFKVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF 460


>gi|157265496|ref|YP_001468054.1| phage terminase large subunit [Thermus phage P74-26]
 gi|156905391|gb|ABU97034.1| phage terminase large subunit [Thermus phage P74-26]
          Length = 485

 Score =  104 bits (258), Expect = 5e-20,   Method: Composition-based stats.
 Identities = 69/395 (17%), Positives = 134/395 (33%), Gaps = 47/395 (11%)

Query: 85  GRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144
           GR  GK+   +   ++ +  RPG     +A +  Q +      V K   L       E+Q
Sbjct: 38  GRQSGKSEAASVEAVFELFARPGSQGWIIAPTYDQAEIIFGRVVEKVERLAEVFPATEVQ 97

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMC-RTYSEERPDTFVGHHNTYGMAIINDEASGT 203
                                +K  +T   R  S +RPD   G        +I DEA+  
Sbjct: 98  LQRRRLRLLVHHYDRPVNAPGAKRVATSEFRGKSADRPDNLRGATLD---FVILDEAAMI 154

Query: 204 PDVIN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIF-----------------NKPL 245
           P  +    I   L+ R+   + ++ S P+ L+  FYE F                 N+  
Sbjct: 155 PFSVWSEAIEPTLSVRDG--WALIISTPKGLN-WFYEFFLMGWRGGLKEGIPNSGINQTH 211

Query: 246 DDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII--- 302
            D++ F   +  V      ++  +  R  +     R E   +F       F  L+++   
Sbjct: 212 PDFESFHAASWDVWPERREWY--MERRLYIPDLEFRQEYGAEFVSHSNSVFSGLDMLILL 269

Query: 303 -EEALNREPCPDPYAP---LIMGCDIAEEGGDN--TVVVLRRGPVIEHLFDWSKTDLRTT 356
             E        + Y P     +G D  +    +  +V+ L  G ++  L   +       
Sbjct: 270 PYERRGTRLVVEDYRPDHIYCIGADFGKNQDYSVFSVLDLDTGAIV-CLERMNGATWSDQ 328

Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTEL 416
             ++  L E Y    ++ D    G    + L+  G +   +  +  +V  +   N     
Sbjct: 329 VARLKALSEDYGHAYVVADTWGVGDAIAEELDAQGINYTPLPVKSSSVKEQLISN----- 383

Query: 417 HVKMADWLEFA--SLINHSGLIQNLKSLKSFIVPN 449
              +A  +E    ++ N   ++  L++ + +   +
Sbjct: 384 ---LALLMEKGQVAVPNDKTILDELRNFRYYRTAS 415


>gi|159904490|ref|YP_001548152.1| hypothetical protein MmarC6_0096 [Methanococcus maripaludis C6]
 gi|159885983|gb|ABX00920.1| protein of unknown function DUF264 [Methanococcus maripaludis C6]
          Length = 505

 Score =  102 bits (255), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 76/437 (17%), Positives = 138/437 (31%), Gaps = 72/437 (16%)

Query: 55  QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114
           Q E  E +D+   +             I+ GR  GKT +   +     S   G SV+ +A
Sbjct: 65  QEEIAEAIDSEMYDV----------ITINIGRRGGKTEVMGGVGPKFCSKYRGFSVLVVA 114

Query: 115 NSETQLKTTLWAEVSKWL-SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173
               Q KT ++ ++ + L S   ++   + +      +P+     +    I+ K      
Sbjct: 115 PVYNQAKT-MYKKIKRGLESNKESRQLVKPKKEGFKESPFPLITFYNGSTIEFK------ 167

Query: 174 RTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG-ILGFLTERNANRFWIMTSNPRR 232
              S E PD      +     II DEA+   D I    +   L +       +  S P  
Sbjct: 168 ---SAETPDNLR---SEGYDLIIVDEAAFVDDEIISAVLEPMLMDSGG--ILVKISTPWG 219

Query: 233 LSGKFYEIFNK----------------PLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLD 276
               FY+ + K                    +K F+  +     +   F  G     G D
Sbjct: 220 TGNHFYDSYIKGELQAKMLEEGEGIPEDELRYKSFKFPSWVNPYLSKRFLMGKKKDLGED 279

Query: 277 SDVTRVEVCGQFPQQD-------------IDSFIPLNIIEEALNREPCPDPYAPLIMGCD 323
           + V   E C +F + D              D+F      E  +      +     ++G D
Sbjct: 280 NPVWLQEYCAEFIEDDTTVFSTAHVQACLSDAFETHYKTENLIYLIDEGERNKEYVIGLD 339

Query: 324 IAEEGGDNTVVVLRRGPV----IEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
           +A+       +VL         + +   ++  D      K   L E +      +D    
Sbjct: 340 LAKHNDYTVFIVLDITTGPPYTLVYFERFNGIDYTDIAEKHLALSEAFNDAPACVDQTGI 399

Query: 380 GARTCDYLEMLGY-HVYRVLGQKRAVDLEFCRNRRTELHVKMADWL--EFASLINHSGLI 436
           G    D  + +G  ++        +          TE+  K++     +   +     L+
Sbjct: 400 GEAYMDIAKKVGLDNLTGFKFTNESK---------TEIITKLSTSFRNKEVVMPKIRVLL 450

Query: 437 QNLKSLKSFIVPNTGEL 453
             LK+   F    T +L
Sbjct: 451 TELKAFMRFRTKTTFKL 467


>gi|150021340|ref|YP_001306694.1| hypothetical protein Tmel_1462 [Thermosipho melanesiensis BI429]
 gi|149793861|gb|ABR31309.1| protein of unknown function DUF264 [Thermosipho melanesiensis
           BI429]
          Length = 421

 Score =  101 bits (251), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 57/316 (18%), Positives = 105/316 (33%), Gaps = 34/316 (10%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141
           I AGR  GKT   A  + +  +  P   VI    S  Q                   +  
Sbjct: 39  ICAGRRFGKTNYVAGKIFYYATIHPKSRVIVGGPSLDQ---------------AKIYYDL 83

Query: 142 EMQSLSLHPAPWYSDVLHCS--LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
             +++ L P   +      S    I  K+ S++    +        G        ++  E
Sbjct: 84  LTEAIELSPLKGFVKKTKDSPFPTIYLKNGSSITVRSTAHNGKYLRGRKVN---LVVLTE 140

Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR---FQIDTR 256
           A+   D +   ++  + + +     I+ S P  ++  FYE + + L + K    F     
Sbjct: 141 AAFIKDSVYEQVITPM-KLDTGAPVILESTPNGMN-YFYEEYQRGLKNKKHTISFHATVY 198

Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPCPDP 314
               +D    E   A+      V R E   +F   D   F P  I+ EA    +      
Sbjct: 199 DNPFLDQEEIENAKAK--TPDYVWRQEYLAEFVD-DDTVFFPWKILVEAFEDYKPEGYKD 255

Query: 315 YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTD---LRTTNNKISGLVEKYRPDA 371
                +G D+A+      ++VL        + ++ + +          ++ L  KYR   
Sbjct: 256 GRKYSIGVDLAKYRDYTVIIVLDVTEEPFKIAEFHRFNQIPYEEVIRIVNDLQAKYRA-Q 314

Query: 372 IIIDANNTGARTCDYL 387
           + +DA   G    + +
Sbjct: 315 VYLDATGVGDPISERI 330


>gi|118590957|ref|ZP_01548357.1| hypothetical protein SIAM614_19891 [Stappia aggregata IAM 12614]
 gi|118436479|gb|EAV43120.1| hypothetical protein SIAM614_19891 [Stappia aggregata IAM 12614]
          Length = 526

 Score =  101 bits (251), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 43/203 (21%), Positives = 80/203 (39%), Gaps = 18/203 (8%)

Query: 290 QQDIDSFIPLNIIEEALNR--EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347
           Q      IP + ++ A  R  +         ++  D+A+ G D TV+    G   E    
Sbjct: 294 QDHEWQVIPSDWVDLAFERYDQGIDRDEPMTVLAVDVAQGGKDRTVLQPLHGRRFETNIV 353

Query: 348 WSKTDLRTTNNKISGLVEKYRPDA-IIID-ANNTGARTCDYLEMLGYH-----VYRVLGQ 400
              TD +   +  S ++ + R +A I++D     G  T  +L+          V+     
Sbjct: 354 RKGTDTKDGADVGSLIIRERRDNAMIVVDCTGGWGGDTVGFLKRENNIPAEKCVFSAQSG 413

Query: 401 KRAVDLE-FCRNRRTELHVKMADWLE----FASLINHSGLIQNLKSLKSFIVPNTGELAI 455
           +RA D      N R EL+ ++ + L         I  S  ++   +   + + N G++ I
Sbjct: 414 ERAKDSRIPFYNLRAELYWRLREALHPKSGLGLAIRRSATVKAQLTAHRWKMRN-GKILI 472

Query: 456 ESK---RVKGAKSTDYSDGLMYT 475
           ESK   + +   S D +D ++  
Sbjct: 473 ESKEEIKDRLGSSPDEADAIVEA 495


>gi|126011061|ref|YP_001039811.1| TerL-like protein [Burkholderia ambifaria phage BcepF1]
 gi|119712637|gb|ABL96858.1| TerL-like protein [Burkholderia ambifaria phage BcepF1]
          Length = 459

 Score =  101 bits (251), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 55/339 (16%), Positives = 107/339 (31%), Gaps = 57/339 (16%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQ 252
            +  +EA    +     I   +    +  + I   NP + +   Y+ F   P  D    Q
Sbjct: 115 ILWLEEAQYLTEEQWNVINPTIRREGSQIWLIW--NPDQYTDFIYQNFVVNPPADCLSKQ 172

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ-QDIDSFIPLNIIEEALN--RE 309
           I+      +  +  + I   Y  D  +    V G  P+     + I L  +  A++  ++
Sbjct: 173 INWTENPFLSDTMLKVIYDEYQRDPKLAE-HVYGGAPKMGGDKAIIQLQYVLAAIDAHKK 231

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNN--KISGLVEKY 367
                      G DIA++G D   +V   G V+    +W   +     +  K+     + 
Sbjct: 232 LGWKIEGSKRTGFDIADDGDDANAIVDAIGNVVVWAEEWDGLEDELLKSSTKVFNHALE- 290

Query: 368 RPDAIIIDANNTGARTCDYLEMLGY-----HVYRVLGQKRA----------------VDL 406
           +  +II D+   GA        L        +Y       A                 + 
Sbjct: 291 KGSSIIFDSIGVGAHAGSKFSELNEARSLEIIYEPFNAGGAVYDPDGTYMKLPHVVITNR 350

Query: 407 EFCRNRRTELHVKMADWLE-------FASLINHSGLI---------------QNLKSLKS 444
           E   N + ++  ++A           + +   H  LI               +   +   
Sbjct: 351 EHFSNVKAQMWDRVATRFRKTYEVVTYGANHPHDELISISSEHVPAKILDKLKIELASPH 410

Query: 445 FIVPNTGELAIESKR----VKGAKSTDYSDGLMYTFAEN 479
             V   G+  +ESK+     +G KS + +D  +    + 
Sbjct: 411 KDVDGMGKFKVESKKDMREKRGIKSPNIADAFIMAMIQP 449


>gi|211731761|gb|ACJ10100.1| terminase [Bacteriophage APSE-4]
          Length = 469

 Score =  100 bits (250), Expect = 4e-19,   Method: Composition-based stats.
 Identities = 63/349 (18%), Positives = 101/349 (28%), Gaps = 67/349 (19%)

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ--- 252
             +EA    +     ++  + +  +   W    NP    G  Y  F KP       Q   
Sbjct: 105 WVEEAETVSEKSLDTLIPTIRKPGSE-LWFSF-NPAEEDGAVYRRFVKPYKAIIDKQGYY 162

Query: 253 ---------IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303
                    +       +              +    R    G+      D+ I    ++
Sbjct: 163 EDDEVYVGKVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANYGDALIQPEWVD 222

Query: 304 EALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS 361
            A++        P    ++  D A+ G D   +  R G +IE    WS+ D+        
Sbjct: 223 AAIDAHIKLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSEGDVADATITAF 282

Query: 362 GLVEKYRPDAIIIDANNTGARTC-DYLEMLGYHVYRVLGQKRAVDLEFC----------- 409
                YR D  I D    GA T   +L         V+    A D               
Sbjct: 283 DDAFDYRADDFIYDNIGLGAGTVKTHLRHSNDGTKMVVTGFGAGDSPDYPDEIYVPGNGE 342

Query: 410 ----------------RNRRTELHVKMAD-------WLEFASLINHSGLI---------- 436
                           RN+R +  V +AD        +E    ++   LI          
Sbjct: 343 YLPSSNNDDRTHRDTFRNKRAQYWVYLADRFYKTWRAVERGEYLDPDALISLSSKIAKLS 402

Query: 437 ---QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479
                L   +    P    + + SK   R+KG KS + +D LM +FA  
Sbjct: 403 QLKSELIKQQRKRTPGNRLIQLMSKDEMRLKGIKSPNMADTLMMSFANP 451


>gi|256422889|ref|YP_003123542.1| hypothetical protein Cpin_3879 [Chitinophaga pinensis DSM 2588]
 gi|256037797|gb|ACU61341.1| hypothetical protein Cpin_3879 [Chitinophaga pinensis DSM 2588]
          Length = 471

 Score =  100 bits (248), Expect = 7e-19,   Method: Composition-based stats.
 Identities = 52/286 (18%), Positives = 107/286 (37%), Gaps = 38/286 (13%)

Query: 229 NPRRLSGKFYEIFNKPL------DDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRV 282
           NP++     + +F KP       D  K  Q   +    IDP + + +++   +   V + 
Sbjct: 196 NPKKN--WCHTVFWKPFKAGQLPDKVKFLQALVQDNPFIDPGYIDNLMS---ITDKVKKQ 250

Query: 283 EVC-GQFP-QQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGP 340
            +  G F    D ++ +  + I +    E   +      +  DIA  G D +VV++  G 
Sbjct: 251 RLLYGNFDYDDDDNALMEYDSINDIFTNEFVVE--GKKYITADIARFGSDKSVVMVWNGL 308

Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRVL 398
            +  +  + K       ++I  +  KY      +++D +  G    D L+      +  +
Sbjct: 309 RVVEIRKFEKMRTTKVADEIEKIRNKYGIPLSHVVVDEDGVGGGVVDKLDG----CHGFV 364

Query: 399 GQKRAVDLEF------CRNRRTELHVKMADWL---EFASLINHSG----LIQNLKSLKSF 445
                +D          +N +++ +  +A+ +   +     +       L + L+ +K +
Sbjct: 365 NNSAPIDNPQDQQQQNYKNLKSQCYYMLAERINDHKIFVRCDDYEMRELLSEELEQVKKW 424

Query: 446 IVPNTGELAIESK---RVKGAKSTDYSDGLMY-TFAENPPRSDMDF 487
              N  +L +  K   +    +S DYSD LM   F E  P      
Sbjct: 425 DADNDKKLEVMPKKVVKELLGRSPDYSDTLMMRMFFELKPEQRWQI 470


>gi|313760829|gb|ADR79391.1| terminase [APSE phage Eptesicus fuscus/P5/IT/USA/2009]
          Length = 394

 Score =  100 bits (248), Expect = 8e-19,   Method: Composition-based stats.
 Identities = 66/342 (19%), Positives = 102/342 (29%), Gaps = 67/342 (19%)

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT 255
             +EA    +     ++  + +  +   W    NP    G  Y  F KP       Q   
Sbjct: 44  WVEEAETVSEKSLDTLIPTIRKPGSE-LWFSF-NPAEEDGAVYRRFVKPYKAIIDKQGYY 101

Query: 256 RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD-----IDSFIPLNIIEEALNREP 310
              E               LD+     E+     + +      D+ I    +E A +   
Sbjct: 102 EDDEVYVGKVS-------YLDNPWLPAELKNDAQKGECDANYEDALIQPEWVEAATDAHI 154

Query: 311 C--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYR 368
                P    ++  D A+ G D   +  R G +IE    WS+ D+             YR
Sbjct: 155 KLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSEGDVADATMTAFDEAFDYR 214

Query: 369 PDAIIIDANNTGARTC-DYLEML---GYHVYRVLGQKRAVDLEF---------------- 408
            D  I D    GA T   +L         V    G   + D                   
Sbjct: 215 ADDFIYDNIGLGAGTVKTHLRHSNDGNKMVVTGFGAGDSPDYPHEIYVPGNGEYLPSSNN 274

Query: 409 --------CRNRRTELHVKMAD-------WLEFASLINHSGLI-------------QNLK 440
                    RN+R +  V +AD        +E    ++   LI               L 
Sbjct: 275 DDRTHRDTFRNKRAQYWVYLADRFYKTWRAVEKGEYLDPEALISLSSKIAKLSQLKSELI 334

Query: 441 SLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAEN 479
             +    P    + + SK   R+KG KS + +D LM +FA  
Sbjct: 335 KQQRKRTPGNRLIQLMSKDEMRLKGIKSPNMADTLMMSFANP 376


>gi|161525001|ref|YP_001580013.1| hypothetical protein Bmul_1828 [Burkholderia multivorans ATCC
           17616]
 gi|189350256|ref|YP_001945884.1| bacteriophage TerL protein [Burkholderia multivorans ATCC 17616]
 gi|160342430|gb|ABX15516.1| conserved hypothetical protein [Burkholderia multivorans ATCC
           17616]
 gi|189334278|dbj|BAG43348.1| bacteriophage TerL protein [Burkholderia multivorans ATCC 17616]
          Length = 531

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 56/332 (16%), Positives = 104/332 (31%), Gaps = 55/332 (16%)

Query: 190 TYGMAIINDEASGTP-DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDW 248
                 I DE++      +    L   T    +      S P  +   F +   +     
Sbjct: 195 DRASFYIVDESAFLERPQLVDASLSATTNCRQD-----ISTPNGMGNSFAQ--RRHSGKV 247

Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN- 307
           K F    R     D +++    A   LD  V   E+   +        IP   ++ A+  
Sbjct: 248 KVFTFHWRDDPRKDDAWYAKQCAE--LDPVVVAQEIDINYAASVEGVVIPSAWVQAAIGA 305

Query: 308 -REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLV 364
             +   +P      G D+A+EG D      R G ++  L  WS    D+  T  K  G+ 
Sbjct: 306 HLKLGIEPSGTRRGGLDVADEGKDKNAFAGRYGFLLNFLRSWSGKGGDIYETVEKTFGIC 365

Query: 365 EKYRPDAIIIDANNTGARTCDYLE----------MLGYHVYRVLGQKRAVDLE------- 407
           ++   ++   DA+  GA                     +     G     D E       
Sbjct: 366 DELGYESFDYDADGLGAGVRGDARVINEQRIAIGKRPINDEPFRGSGPVHDPEGEMVPER 425

Query: 408 ----FCRNRRTELHVKMADWLEFA-------------SLINHSGLIQNLKSL------KS 444
               +  N + +    +    +                +I+    ++ L +L       +
Sbjct: 426 KNKDYFANLKAQSWWALRLRFQATYRAVVEGKPYNPDDIISIDPELKELAALTMELSQPT 485

Query: 445 FIVPNTGELAIESKRVKGAKSTDYSDGLMYTF 476
           + V   G++ I+ K   G KS + +D +M  +
Sbjct: 486 YTVNGVGKIVID-KAPDGTKSPNLADAVMIAY 516


>gi|255321082|ref|ZP_05362250.1| gp33 TerL [Acinetobacter radioresistens SK82]
 gi|262379515|ref|ZP_06072671.1| bacteriophage TerL protein [Acinetobacter radioresistens SH164]
 gi|255301852|gb|EET81101.1| gp33 TerL [Acinetobacter radioresistens SK82]
 gi|262298972|gb|EEY86885.1| bacteriophage TerL protein [Acinetobacter radioresistens SH164]
          Length = 558

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 53/344 (15%), Positives = 106/344 (30%), Gaps = 61/344 (17%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
               DE +         +   +++       I  S P  +  KF++  ++    +  F +
Sbjct: 210 MYFLDEWAFVERQ--EAVDAAISQ--NTNVHIKGSTPNGIGDKFHQ--DRFSGRYAVFTM 263

Query: 254 DTRTVEGIDP--SFHEGIIARYGL------DSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305
             R     +        +I  +        D  V   EV   +        IP   ++ A
Sbjct: 264 AWRDNPDKNWQVELDGKLIYPWYEKQLATLDDIVLAQEVDIDYAASVEGVLIPSAWVQAA 323

Query: 306 LNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKIS 361
           ++       +P        D+A+EG D      R G V+++L  WS    D+  T  K  
Sbjct: 324 VDAHIKLGIEPSGERNGALDVADEGKDKNSFAARHGIVLQYLDTWSGIGDDIFGTTQKAI 383

Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLEMLG----------YHVYRVLGQKRAVDLE---- 407
                 + +    DA+  GA       ++                  G     + E    
Sbjct: 384 DACLDLKLNIFFYDADGLGAGVRGDARVINELNKAKGIPEIEANPFRGSGAVHNPEQEMV 443

Query: 408 -------FCRNRRTELHVKMA-------DWLEFASLINHS--------------GLIQNL 439
                  F  N + ++   +          L+       S                ++  
Sbjct: 444 EARKNVDFFANLKAQMWWSLRLRFQNTYRALQGMQYDPDSLISLSTKDINKQELEQLKRE 503

Query: 440 KSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRS 483
            S  ++     G++ + +K+  GA S + +DG+M  F++  P +
Sbjct: 504 LSQPTYSKNGAGKILV-NKQPDGALSPNRADGVMICFSDIRPPA 546


>gi|308097723|gb|ADO14402.1| AB1gp31 [Acinetobacter phage AB1]
          Length = 313

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Composition-based stats.
 Identities = 48/292 (16%), Positives = 89/292 (30%), Gaps = 52/292 (17%)

Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--RE 309
            I+      +  +  + I  +   D +       G     D  S I  + +E AL+  + 
Sbjct: 21  HINYNENPFLSQTALDVIADKKRRDPEGFAHIYDGMPRADDDMSIIKASWVEAALDAHKL 80

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKY 367
              D      +G D+A+ G D   +V R+G V     +W   + +L  +  +      + 
Sbjct: 81  LNLDDTGRSYLGFDVADAGKDKCALVHRKGIVAYWSDEWKAREDELLKSATRTYNEAIRL 140

Query: 368 RPDAIIIDANNTGARTCDYLEML-----------------GYHVYRVLGQKRAVDLEFCR 410
               I  D+   GA     +  L                 G H      Q +  + +F  
Sbjct: 141 -NALIHYDSTGVGAGVGAKVNELNKEKKTNVQHSKFVAGGGVHEPDKFYQPKITNKDFFA 199

Query: 411 NRRTELHVKMADWLEF-----------ASLINH--SGLI------------QNLKSLKSF 445
           N + +    +AD                 +  H    LI            +   S+   
Sbjct: 200 NAKAQAWWLVADKFRLTYQVIQAIKNGTEIPKHKPEDLISISSDMPNLHRLKVELSIPHR 259

Query: 446 IVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSDMDFGRCPSYQ 494
                G + +ESK+    +  KS + +D  +  +A  P +  M         
Sbjct: 260 DEDRLGRVMVESKQDLAKRDVKSPNLADAFIMAYA--PVKRSMQINIADVES 309


>gi|169633984|ref|YP_001707720.1| putative bacteriophage protein; putative prophage terminase large
           subunit [Acinetobacter baumannii SDF]
 gi|169152776|emb|CAP01795.1| putative bacteriophage protein; putative prophage terminase large
           subunit [Acinetobacter baumannii]
          Length = 552

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Composition-based stats.
 Identities = 58/337 (17%), Positives = 107/337 (31%), Gaps = 61/337 (18%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
               DE +         +   +++       I  S P  +  +F++  ++    +  F +
Sbjct: 210 MYFLDEWAFVEQQ--EAVDAAISQ--NTNVHIKGSTPNGIGDRFHQ--DRFSGRYAVFTM 263

Query: 254 DTRTVEGIDPSFHE--GIIARYGL------DSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305
             R     + +      +I  +        D  V   EV   +        IP   ++ A
Sbjct: 264 PWRDNPDKNWTVTYNGKVIYPWYEKQLATLDDVVLAQEVDINYAASVEGVLIPSTWVQAA 323

Query: 306 LN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKIS 361
           ++  ++   +P    I G D+A+EG D      R G V+ +L  WS    D+  T  K  
Sbjct: 324 IDAHKKLQIEPTGDRIGGLDVADEGKDKNSFAARHGVVMTYLATWSGKGDDIFGTTQKAM 383

Query: 362 GLVEKYRPDAIIIDANNTGAR-------TCDYLEMLG---YHVYRVLGQKRAVDLE---- 407
            L  +   D +  DA+  GA          +    LG    +V    G     D E    
Sbjct: 384 DLCFEKSIDTLFYDADGLGAGCRGDARVINEKRRELGLSEINVESFRGSGSVHDPEGEMV 443

Query: 408 -------FCRNRRTELHVKMA-------DWLEFASLINHS--------------GLIQNL 439
                  F  N + +    +          LE                       L+   
Sbjct: 444 EKRLNKDFFANLKAQSWWSLRLRFQETFRALEGRDYDPDMIISLSSEDIDAKELALLTTE 503

Query: 440 KSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTF 476
            S  ++     G++ + +K+  G  S + +D +M  F
Sbjct: 504 LSQPTYTKNGVGKILV-NKQPDGTASPNRADSVMICF 539


>gi|298480040|ref|ZP_06998239.1| PBSX family phage terminase [Bacteroides sp. D22]
 gi|298273849|gb|EFI15411.1| PBSX family phage terminase [Bacteroides sp. D22]
          Length = 476

 Score = 96.7 bits (239), Expect = 9e-18,   Method: Composition-based stats.
 Identities = 55/281 (19%), Positives = 105/281 (37%), Gaps = 33/281 (11%)

Query: 216 TERNANRFWIMTSNPRRLSGKFYEIFNKP-----LDDWKRFQID-TRTVEGIDPSFHEGI 269
            E    R   +T NP++     Y+ F KP     L ++  +     +    IDP + EG+
Sbjct: 184 NELGLRRKLFITCNPKKN--WMYDTFYKPDKKGELPEYMYYLACLVQENPFIDPDYIEGL 241

Query: 270 IARYGLDSDVTRVEVC-GQFP-QQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE 327
                    V R  +  G +    + ++    + I E    +         I G DIA  
Sbjct: 242 KTTK---DKVKRERLLKGNWEYDDNPNALCSHDAICEIFGNKISIKTGTNYITG-DIARF 297

Query: 328 GGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYR--PDAIIIDANNTGARTCD 385
           G D   + +  G  I  L  +  +        I    +KYR      I+D +  G    D
Sbjct: 298 GADYARLAVWDGWHIIELQCFPVSKTTDIQTWIINKQKKYRIPNHKCIVDEDGVGGGVVD 357

Query: 386 YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL---------EFASLINHSGLI 436
             ++ G+     +      + E  +N +T+   K+AD +         +  S  +   +I
Sbjct: 358 NCDIQGF-----VNNSTPFNGENYQNLQTQCGYKLADHINATEVGIDEDLISTADKEEII 412

Query: 437 QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMY 474
           + L+ L+++   + G+L ++ K   ++    S D+ D  + 
Sbjct: 413 RELEQLQTWKADSDGKLKLKPKEEIKMDIGCSPDWRDMFLM 453


>gi|167763812|ref|ZP_02435939.1| hypothetical protein BACSTE_02192 [Bacteroides stercoris ATCC
           43183]
 gi|167697928|gb|EDS14507.1| hypothetical protein BACSTE_02192 [Bacteroides stercoris ATCC
           43183]
          Length = 476

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 55/281 (19%), Positives = 105/281 (37%), Gaps = 33/281 (11%)

Query: 216 TERNANRFWIMTSNPRRLSGKFYEIFNKP-----LDDWKRFQID-TRTVEGIDPSFHEGI 269
            E    R   +T NP++     Y+ F KP     L ++  +     +    IDP + EG+
Sbjct: 184 NELGLRRKLFITCNPKKN--WMYDTFYKPDKKGELPEYMYYLACLVQENPFIDPDYIEGL 241

Query: 270 IARYGLDSDVTRVEVC-GQFP-QQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE 327
                    V R  +  G +    + ++    + I E    +         I G DIA  
Sbjct: 242 KTTK---DKVKRERLLKGNWEYDDNPNALCSHDAICEIFGNKISIKTGTNYITG-DIARF 297

Query: 328 GGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYR--PDAIIIDANNTGARTCD 385
           G D   + +  G  I  L  +  +        I    +KYR      I+D +  G    D
Sbjct: 298 GADYARLAVWDGWHIIELQCFPVSKTTDIQTWIINKQKKYRIPNHKCIVDEDGVGGGVVD 357

Query: 386 YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL---------EFASLINHSGLI 436
             ++ G+     +      + E  +N +T+   K+AD +         +  S  +   +I
Sbjct: 358 NCDIQGF-----VNNSTPFNGENYQNLQTQCGYKLADHINATEVGIDEDLISTADKEEII 412

Query: 437 QNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMY 474
           + L+ L+++   + G+L ++ K   ++    S D+ D  + 
Sbjct: 413 RELEQLQTWEADSDGKLKLKPKEEIKMDIGCSPDWRDMFLM 453


>gi|168260952|ref|ZP_02682925.1| phage terminase, large subunit, pbsx family [Salmonella enterica
           subsp. enterica serovar Hadar str. RI_05P066]
 gi|205349913|gb|EDZ36544.1| phage terminase, large subunit, pbsx family [Salmonella enterica
           subsp. enterica serovar Hadar str. RI_05P066]
          Length = 471

 Score = 95.6 bits (236), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 65/480 (13%), Positives = 137/480 (28%), Gaps = 79/480 (16%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  +  +  +  W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTI-RKTFSEIWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       + I    +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           A +  ++        ++   D ++ G D      R G V++ + +    D+    +  + 
Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283

Query: 363 LVEKYRPDAIIIDANNTGART--------------------------CDYLEMLGYHVYR 396
           L  +   D  + D +  GA                             D L   G     
Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEVFSGKKITATMFKGSESPFDEDALYQAGAWADE 343

Query: 397 -VLGQKRAVDLEFCRNRRTELHVKMADWLEFA---------SLINH-------------- 432
            V G       +  RN+R + +  +AD L            +  +               
Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403

Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDFG 488
             L   L  ++     N G+L     +E K+  G  S + +D LM         +  D+ 
Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCPESAAQPDYS 462


>gi|163716617|gb|ABY40529.1| putative TerL [Burkholderia phage Bups phi1]
          Length = 531

 Score = 95.2 bits (235), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 63/343 (18%), Positives = 106/343 (30%), Gaps = 57/343 (16%)

Query: 190 TYGMAIINDEASGTP-DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDW 248
                 + DE++      +    L   T    +      S P  +   F +   +     
Sbjct: 195 DRASFYVVDESAFLERPQLVDASLSATTNCRQD-----ISTPNGMGNSFAQ--RRHSGKI 247

Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308
           K F    R     D +++   +A   LD  V   E+   +        IP   ++ AL  
Sbjct: 248 KVFTFHWRDDPRKDDAWYAKQVAE--LDPVVVAQEIDINYAASVEGVVIPSAWVQAALGA 305

Query: 309 EPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLV 364
                 +P      G D+A+EG D      R G ++EHL  WS    D+  T +++ G+ 
Sbjct: 306 HVKLGIEPSGTRRGGLDVADEGKDKNAFAGRYGFLLEHLESWSGVGGDIFGTVDRVLGIC 365

Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLG----------YHVYRVLGQKRAVDLE------- 407
           +    +    DA+  GA       +L                  G     D E       
Sbjct: 366 DVRDYEVFDYDADGLGAGVRGDARVLNEQRVAAGKRSIRNEPFRGSGPVYDPEGEMVKER 425

Query: 408 ----FCRNRRTELHVKMADWL----------------EFASLINHSGLIQNLK---SLKS 444
               +  N + +    +                    E  S+         L    S  +
Sbjct: 426 KNKDYFANLKAQSWWALRLRFQATYRAVVEGKPFDPDEIISIDPDLPERAALSMELSQPT 485

Query: 445 FIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDF 487
           F V   G++ I+ K   G KS + +D +M   A  P    +D 
Sbjct: 486 FTVNGVGKIVID-KAPDGTKSPNLADAVMI--AYQPAVRGIDI 525


>gi|260868683|ref|YP_003235085.1| putative terminase large subunit [Escherichia coli O111:H- str.
           11128]
 gi|293446697|ref|ZP_06663119.1| phage terminase large subunit [Escherichia coli B088]
 gi|257765039|dbj|BAI36534.1| putative terminase large subunit [Escherichia coli O111:H- str.
           11128]
 gi|291323527|gb|EFE62955.1| phage terminase large subunit [Escherichia coli B088]
 gi|323177130|gb|EFZ62720.1| phage terminase, large subunit, PBSX family [Escherichia coli 1180]
          Length = 471

 Score = 94.8 bits (234), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 63/480 (13%), Positives = 137/480 (28%), Gaps = 79/480 (16%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  + +  +   W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       + I    +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           A +  ++        ++   D ++ G D      R G V++ + +    D+    +  + 
Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283

Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408
           L  +   D  + D +  GA     T +             G +   D +           
Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343

Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432
                         RN+R + +  +AD L            +  +               
Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403

Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDFG 488
             L   L  ++     N G+L     +E K+  G  S + +D LM         +  D+ 
Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCPESAAQPDYS 462


>gi|237704849|ref|ZP_04535330.1| terminase large subunit [Escherichia sp. 3_2_53FAA]
 gi|226901215|gb|EEH87474.1| terminase large subunit [Escherichia sp. 3_2_53FAA]
 gi|315288241|gb|EFU47640.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           110-3]
          Length = 471

 Score = 94.8 bits (234), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 63/480 (13%), Positives = 137/480 (28%), Gaps = 79/480 (16%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  + +  +   W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       + I    +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           A +  ++        ++   D ++ G D      R G V++ + +    D+    +  + 
Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283

Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408
           L  +   D  + D +  GA     T +             G +   D +           
Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343

Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432
                         RN+R + +  +AD L            +  +               
Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVYGEYADPDDMLSFDKEAIGEKML 403

Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDFG 488
             L   L  ++     N G+L     +E K+  G  S + +D LM         +  D+ 
Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCPESAAQPDYS 462


>gi|324019922|gb|EGB89141.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           117-3]
          Length = 471

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 63/480 (13%), Positives = 137/480 (28%), Gaps = 79/480 (16%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  + +  +   W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       + I    +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           A +  ++        ++   D ++ G D      R G V++ + +    D+    +  + 
Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKCIAEGLLMDINEGADWATS 283

Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408
           L  +   D  + D +  GA     T +             G +   D +           
Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343

Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432
                         RN+R + +  +AD L            +  +               
Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVYGEYADPDDMLSFDKEAIGEKML 403

Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDFG 488
             L   L  ++     N G+L     +E K+  G  S + +D LM         +  D+ 
Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCPESAAQPDYS 462


>gi|294492319|gb|ADE91075.1| phage terminase, large subunit, PBSX family [Escherichia coli
           IHE3034]
          Length = 471

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 64/480 (13%), Positives = 137/480 (28%), Gaps = 79/480 (16%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYTAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  + +  +   W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       + I    +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           A +  ++        ++   D ++ G D      R G V++ + +    D+    +  + 
Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283

Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408
           L  +   D  + D +  GA     T +             G +   D +           
Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343

Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432
                         RN+R + +  +AD L            +  N               
Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPNDMLSFDKEAIGEKML 403

Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDFG 488
             L   L  ++     N G+L     +E K+  G  S + +D LM         +  D+ 
Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCPESAAQPDYS 462


>gi|307544683|ref|YP_003897162.1| hypothetical protein HELO_2093 [Halomonas elongata DSM 2581]
 gi|307216707|emb|CBV41977.1| K06909 [Halomonas elongata DSM 2581]
          Length = 531

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 51/338 (15%), Positives = 109/338 (32%), Gaps = 57/338 (16%)

Query: 194 AIINDEASGTP-DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
             I DE++      +    L   T    +      S P  +   F +   +       F 
Sbjct: 199 FYIVDESAFLERPHLVDASLSATTNCRQD-----VSTPNGMGNPFAQ--RRHSGKISVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
              R     D +++   +    LD      E+   +        IP   ++ A++  ++ 
Sbjct: 252 FHWRDDPRKDDAWYAKQVDE--LDPVTVAQEIDINYSASVEGVLIPSAWVQAAVDAHKKL 309

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368
             +     +   D+A+EG D      R G +++ + +W+   +D+  T  K     +++ 
Sbjct: 310 GIEITGERLGALDVADEGKDQNAYAGRHGILLDLVDEWTGKGSDIFGTVQKAFDHTDEHG 369

Query: 369 PDAIIIDANNTGARTCDYLEMLG----------YHVYRVLGQK-----------RAVDLE 407
                 DA+  G+       ++             V    G             +  + +
Sbjct: 370 GSRFDYDADGLGSGVRGDARVINEQRAEQKRPKLKVNPFRGSGGVIEPDKEMVPKRKNKD 429

Query: 408 FCRNRRTELHVKM--------ADWLEFASLINHS------------GLIQNLKSLKSFIV 447
           F  N + +    +           +E                     L+  L S  ++ V
Sbjct: 430 FFANLKAQAWWALRLRFQRTYRAVVEGMEFDPDDIISIDSRLPILSKLMLEL-SQPTYHV 488

Query: 448 PNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSDM 485
             TG++ ++ K  +G KS + +D +M  +A N   +D 
Sbjct: 489 NGTGKVVVD-KAPEGTKSPNLADAVMILYAPNKSVTDR 525


>gi|157159763|ref|YP_001457081.1| PBSX family phage terminase large subunit [Escherichia coli HS]
 gi|300935792|ref|ZP_07150755.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           21-1]
 gi|157065443|gb|ABV04698.1| phage terminase, large subunit, pbsx family [Escherichia coli HS]
 gi|300459025|gb|EFK22518.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           21-1]
          Length = 471

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 63/480 (13%), Positives = 137/480 (28%), Gaps = 79/480 (16%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  + +  +   W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       + I    +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           A +  ++        ++   D ++ G D      R G V++ + +    D+    +  + 
Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINDGADWATS 283

Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408
           L  +   D  + D +  GA     T +             G +   D +           
Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343

Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432
                         RN+R + +  +AD L            +  +               
Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403

Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDFG 488
             L   L  ++     N G+L     +E K+  G  S + +D LM         +  D+ 
Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCPESAAQPDYS 462


>gi|91211665|ref|YP_541651.1| terminase large subunit [Escherichia coli UTI89]
 gi|117624554|ref|YP_853467.1| phage terminase large subunit [Escherichia coli APEC O1]
 gi|218559279|ref|YP_002392192.1| Terminase large subunit [Escherichia coli S88]
 gi|91073239|gb|ABE08120.1| terminase large subunit [Escherichia coli UTI89]
 gi|115513678|gb|ABJ01753.1| phage terminase large subunit [Escherichia coli APEC O1]
 gi|148566126|gb|ABQ88401.1| phage terminase large subunit [Enterobacteria phage CUS-3]
 gi|218366048|emb|CAR03793.1| Terminase large subunit [Escherichia coli S88]
 gi|307626097|gb|ADN70401.1| terminase large subunit [Escherichia coli UM146]
 gi|323948780|gb|EGB44679.1| phage terminase large subunit [Escherichia coli H252]
          Length = 471

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Composition-based stats.
 Identities = 63/480 (13%), Positives = 137/480 (28%), Gaps = 79/480 (16%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYTAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  + +  +   W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       + I    +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           A +  ++        ++   D ++ G D      R G V++ + +    D+    +  + 
Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283

Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408
           L  +   D  + D +  GA     T +             G +   D +           
Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343

Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432
                         RN+R + +  +AD L            +  +               
Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403

Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDFG 488
             L   L  ++     N G+L     +E K+  G  S + +D LM         +  D+ 
Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCPESAAQPDYS 462


>gi|167725769|ref|ZP_02409005.1| hypothetical protein BpseD_42528 [Burkholderia pseudomallei DM98]
          Length = 517

 Score = 93.6 bits (231), Expect = 7e-17,   Method: Composition-based stats.
 Identities = 63/343 (18%), Positives = 105/343 (30%), Gaps = 57/343 (16%)

Query: 190 TYGMAIINDEASGTP-DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDW 248
                 + DE++      +    L   T    +      S P  +   F +   +     
Sbjct: 181 DRASFYVVDESAFLERPQLVDASLSATTNCRQD-----ISTPNGMGNSFAQ--RRHSGKI 233

Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308
           K F    R     D +++   +A   LD  V   E+   +        IP   ++ AL  
Sbjct: 234 KVFTFHWRDDPRKDDAWYAKQVAE--LDPVVVAQEIDINYAASVEGVVIPSAWVQAALGA 291

Query: 309 EPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLV 364
                 +P      G D+A+EG D      R G ++EHL  WS    D+  T ++  G+ 
Sbjct: 292 HVKLGIEPSGTRRGGLDVADEGKDKNAFAGRYGFLLEHLESWSGVGGDIFGTVDRALGIC 351

Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLG----------YHVYRVLGQKRAVDLE------- 407
           +    +    DA+  GA       +L                  G     D E       
Sbjct: 352 DVRDYEVFDYDADGLGAGVRGDARVLNEQRVAAGKRSIRNEPFRGSGPVYDPEGEMVKER 411

Query: 408 ----FCRNRRTELHVKMADWL----------------EFASLINHSGLIQNLK---SLKS 444
               +  N + +    +                    E  S+         L    S  +
Sbjct: 412 KNKDYFANLKAQSWWALRLRFQATYRAVVEGKPFDPDEIISIDPDLPERAALSMELSQPT 471

Query: 445 FIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDF 487
           F V   G++ I+ K   G KS + +D +M   A  P    +D 
Sbjct: 472 FTVNGVGKIVID-KAPDGTKSPNLADAVMI--AYQPAVRGIDI 511


>gi|41057280|ref|NP_958178.1| gene 2 protein [Enterobacteria phage Sf6]
 gi|191165541|ref|ZP_03027382.1| phage terminase, large subunit, pbsx family [Escherichia coli B7A]
 gi|218695968|ref|YP_002403635.1| Terminase large subunit [Escherichia coli 55989]
 gi|331678314|ref|ZP_08378989.1| phage terminase, large subunit, PBSX family [Escherichia coli H591]
 gi|33334159|gb|AAQ12192.1| gene 2 protein [Shigella phage Sf6]
 gi|190904464|gb|EDV64172.1| phage terminase, large subunit, pbsx family [Escherichia coli B7A]
 gi|218352700|emb|CAU98482.1| Terminase large subunit [Escherichia coli 55989]
 gi|324114096|gb|EGC08069.1| phage terminase large subunit [Escherichia fergusonii B253]
 gi|331074774|gb|EGI46094.1| phage terminase, large subunit, PBSX family [Escherichia coli H591]
          Length = 470

 Score = 93.2 bits (230), Expect = 8e-17,   Method: Composition-based stats.
 Identities = 62/466 (13%), Positives = 134/466 (28%), Gaps = 79/466 (16%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  + +  +   W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       + I    +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           A +  ++        ++   D ++ G D      R G V++ + +    D+    +  + 
Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283

Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408
           L  +   D  + D +  GA     T +             G +   D +           
Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343

Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432
                         RN+R + +  +AD L            +  +               
Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403

Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474
             L   L  ++     N G+L     +E K+  G  S + +D LM 
Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448


>gi|13559866|ref|NP_112076.1| terminase large subunit [Enterobacteria phage HK620]
 gi|13517602|gb|AAK28891.1|AF335538_43 terminase large subunit [Salmonella phage HK620]
          Length = 470

 Score = 93.2 bits (230), Expect = 8e-17,   Method: Composition-based stats.
 Identities = 62/466 (13%), Positives = 134/466 (28%), Gaps = 79/466 (16%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  + +  +   W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       + I    +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           A +  ++        ++   D ++ G D      R G V++ + +    D+    +  + 
Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283

Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408
           L  +   D  + D +  GA     T +             G +   D +           
Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343

Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432
                         RN+R + +  +AD L            +  +               
Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGENIL 403

Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474
             L   L  ++     N G+L     +E K+  G  S + +D LM 
Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448


>gi|325497784|gb|EGC95643.1| gene 2 protein [Escherichia fergusonii ECD227]
          Length = 470

 Score = 93.2 bits (230), Expect = 9e-17,   Method: Composition-based stats.
 Identities = 62/466 (13%), Positives = 134/466 (28%), Gaps = 79/466 (16%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  + +  +   W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       + I    +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           A +  ++        ++   D ++ G D      R G V++ + +    D+    +  + 
Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283

Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408
           L  +   D  + D +  GA     T +             G +   D +           
Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343

Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432
                         RN+R + +  +AD L            +  +               
Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVYGEYADPDDMLSFDKEAIGEKML 403

Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474
             L   L  ++     N G+L     +E K+  G  S + +D LM 
Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448


>gi|293604595|ref|ZP_06686998.1| phage terminase large subunit [Achromobacter piechaudii ATCC 43553]
 gi|292817011|gb|EFF76089.1| phage terminase large subunit [Achromobacter piechaudii ATCC 43553]
          Length = 463

 Score = 92.9 bits (229), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 54/321 (16%), Positives = 99/321 (30%), Gaps = 49/321 (15%)

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF--QI 253
             +E  G  +     I   + +  A  + +   NP  L   F +     L         I
Sbjct: 135 WIEEGEGLTEEQWSIIDPTIRKEGAEVWVLW--NP-HLITDFVQAKLPALLGADCIIRHI 191

Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REPC 311
           +      +  +           D D  R    GQ    D  S I  + IE A++   +  
Sbjct: 192 NYPDNPFLSATAKRKAERLKEADPDAYRHIYLGQPLSSDDASVIKFHWIEAAVDAHLKLG 251

Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYRP 369
            +      +G D+A+ G D     +  G + + L +W   + +L  +  +    V   R 
Sbjct: 252 IELGGARTVGYDVADSGADKNACSVFDGAICDELDEWAAPEDELNQSTKRAWAHV---RN 308

Query: 370 DAIIIDANNTGARTCDYLE----MLGYHVYRVLGQKRAVDLEF---------CRNRRTEL 416
             ++ D+   GA     L       GYH +   G   + D E+           N + + 
Sbjct: 309 GILVYDSIGVGAHVGSTLADAGIRTGYHKFNAGGAVISPDKEYAPKIKNKEKFENLKAQA 368

Query: 417 HVKMADWLE--------------------FASLINHSGLIQNLKSLKSFIVPNTGELAIE 456
              +AD L                      + +     L   L + +       G   +E
Sbjct: 369 WQDVADRLRNTYNAVTKGMVFPASELISISSGISKLEQLKIELSAPRK-RYSKRGLDMVE 427

Query: 457 SKR---VKGAKSTDYSDGLMY 474
           +K     +G  S + +D  + 
Sbjct: 428 TKEDMARRGIPSPNLADSFIM 448


>gi|222032743|emb|CAP75482.1| Terminase large subunit [Escherichia coli LF82]
          Length = 470

 Score = 92.9 bits (229), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 62/466 (13%), Positives = 134/466 (28%), Gaps = 79/466 (16%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  + +  +   W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       + I    +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           A +  ++        ++   D ++ G D      R G V++ + +    D+    +  + 
Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283

Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408
           L  +   D  + D +  GA     T +             G +   D +           
Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343

Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432
                         RN+R + +  +AD L            +  +               
Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADSLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403

Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474
             L   L  ++     N G+L     +E K+  G  S + +D LM 
Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448


>gi|168239626|ref|ZP_02664684.1| phage terminase, large subunit, pbsx family protein [Salmonella
           enterica subsp. enterica serovar Schwarzengrund str.
           SL480]
 gi|197287704|gb|EDY27095.1| phage terminase, large subunit, pbsx family protein [Salmonella
           enterica subsp. enterica serovar Schwarzengrund str.
           SL480]
          Length = 470

 Score = 92.9 bits (229), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 62/466 (13%), Positives = 134/466 (28%), Gaps = 79/466 (16%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYAAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  + +  +   W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILVPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       + I    +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           A +  ++        ++   D ++ G D      R G V++ + +    D+    +  + 
Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283

Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408
           L  +   D  + D +  GA     T +             G +   D +           
Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343

Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432
                         RN+R + +  +AD L            +  +               
Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEVIGEKML 403

Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474
             L   L  ++     N G+L     +E K+  G  S + +D LM 
Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448


>gi|323936486|gb|EGB32774.1| phage terminase large [Escherichia coli E1520]
          Length = 470

 Score = 92.9 bits (229), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 62/466 (13%), Positives = 134/466 (28%), Gaps = 79/466 (16%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  + +  +   W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       + I    +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           A +  ++        ++   D ++ G D      R G V++ + +    D+    +  + 
Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283

Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDL------------ 406
           L  +   D  + D +  GA     T +             G +   D             
Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDGPYQAGAWADE 343

Query: 407 -----------EFCRNRRTELHVKMADWLEFA---------SLINH-------------- 432
                      +  RN+R + +  +AD L            +  +               
Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIDEKML 403

Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474
             L   L  ++     N G+L     +E K+  G  S + +D LM 
Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448


>gi|300897414|ref|ZP_07115839.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           198-1]
 gi|300358826|gb|EFJ74696.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           198-1]
          Length = 470

 Score = 92.1 bits (227), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 62/466 (13%), Positives = 134/466 (28%), Gaps = 79/466 (16%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  + +  +   W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       + I    +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           A +  ++        ++   D ++ G D      R G V++ + +    D+    +  + 
Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283

Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408
           L  +   D  + D +  GA     T +             G +   D +           
Sbjct: 284 LAIEDGSDHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343

Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432
                         RN+R + +  +AD L            +  +               
Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403

Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474
             L   L  ++     N G+L     +E K+  G  S + +D LM 
Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448


>gi|324114526|gb|EGC08494.1| hypothetical protein ERIG_00518 [Escherichia fergusonii B253]
          Length = 540

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 57/396 (14%), Positives = 122/396 (30%), Gaps = 65/396 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              + DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R     D  ++     +   +  V   E+   +        IP   ++ A++     
Sbjct: 252 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D      R G ++E++ +WS   +D+  +  K+ G  E+  
Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 370

Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401
            +    D +  GA         + L                           V    GQ 
Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 430

Query: 402 RAVDLEFCRNRRTELHVKMADWL--------EFASLINH------------SGLIQNLKS 441
             ++ +F  N + +   ++            E  +                  LI  L S
Sbjct: 431 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMAYNPDEIISISSSMALKDKLIIEL-S 489

Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
             ++ + + G++ I+ K+  G +S + +D +M  +A
Sbjct: 490 QPTYSINSVGKIVID-KQPDGTRSPNLADSVMINYA 524


>gi|238027169|ref|YP_002911400.1| hypothetical protein bglu_1g15550 [Burkholderia glumae BGR1]
 gi|237876363|gb|ACR28696.1| Hypothetical protein bglu_1g15550 [Burkholderia glumae BGR1]
          Length = 531

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 59/332 (17%), Positives = 101/332 (30%), Gaps = 55/332 (16%)

Query: 190 TYGMAIINDEASGTP-DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDW 248
                 + DE++      +    L   T    +      S P  +   F +   +     
Sbjct: 195 DRASFYVVDESAFLERPQLVDASLSATTNCRQD-----ISTPNGMGNSFAQ--RRHSGKI 247

Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308
           K F    R     D +++   +A   LD  V   E+   +        IP   ++ AL  
Sbjct: 248 KVFTFHWRDDPRKDDAWYAKQVAE--LDPVVVAQEIDINYAASVEGVVIPSAWVQAALGA 305

Query: 309 EPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLV 364
                  P      G D+A+EG D      R G ++EHL  WS    D+  T ++  G+ 
Sbjct: 306 HVKLGISPSGARRGGLDVADEGKDKNAFAGRYGFLLEHLESWSGVGGDIFGTVDRALGIC 365

Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLG----------YHVYRVLGQKRAVDL-------- 406
           +    +    DA+  GA       +L                  G     D         
Sbjct: 366 DVRGYEVFDYDADGLGAGVRGDARVLNEQRAAAGKRSIRSEPFRGSGPVYDPDGEMVKER 425

Query: 407 ---EFCRNRRTELHVKMADWL----------------EFASLINHSGLIQNLK---SLKS 444
              ++  N + +    +                    E  S+         L    S  +
Sbjct: 426 KNKDYFANLKAQSWWALRLRFQATYRAVVEGKPFDPDEIISIDPDLPERAALSMELSQPT 485

Query: 445 FIVPNTGELAIESKRVKGAKSTDYSDGLMYTF 476
           F V   G++ I+ K   G KS + +D +M  +
Sbjct: 486 FTVNGVGKIVID-KAPDGTKSPNLADAVMIAY 516


>gi|260856407|ref|YP_003230298.1| putative terminase large subunit [Escherichia coli O26:H11 str.
           11368]
 gi|257755056|dbj|BAI26558.1| putative terminase large subunit [Escherichia coli O26:H11 str.
           11368]
          Length = 470

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 62/466 (13%), Positives = 133/466 (28%), Gaps = 79/466 (16%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  + +  +   W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       + I    +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           A +   +        ++   D ++ G D      R G V++ + +    D+    +  + 
Sbjct: 224 ATDAHTKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283

Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDL------------ 406
           L  +   D  + D +  GA     T +             G +   D             
Sbjct: 284 LAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDGPYQAGAWADE 343

Query: 407 -----------EFCRNRRTELHVKMADWLEFA---------SLINH-------------- 432
                      +  RN+R + +  +AD L            +  +               
Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403

Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474
             L   L  ++     N G+L     +E K+  G  S + +D LM 
Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448


>gi|330910791|gb|EGH39301.1| phage terminase, large subunit [Escherichia coli AA86]
          Length = 540

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 57/397 (14%), Positives = 127/397 (31%), Gaps = 67/397 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              + DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
              R     D  ++     +  +D+ V    E+   +        IP   ++ A++    
Sbjct: 252 FHWRDDPRKDEEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIK 309

Query: 312 --PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKY 367
               P    +   D+A+EG D      R G ++E++ +WS   +D+  +  K+ G  E+ 
Sbjct: 310 LGIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQD 369

Query: 368 RPDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQ 400
             +    D +  GA         + L                           V    GQ
Sbjct: 370 NLEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQ 429

Query: 401 KRAVDLEFCRNRRTELHVKMADWLE--------------------FASLINHSGLIQNLK 440
              ++ +F  N + +   ++    +                     +++ +   LI  L 
Sbjct: 430 AARLNKDFFANAKAQSWWRLRKLFQNTYRAVVEGMAYNPDEIISISSAMASKDKLIIEL- 488

Query: 441 SLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
           S  ++ +   G++ ++ K+  G KS + +D +M ++A
Sbjct: 489 SQPTYSINGVGKIVVD-KQPDGTKSPNLADSVMISYA 524


>gi|319789040|ref|YP_004150673.1| protein of unknown function DUF264 [Thermovibrio ammonificans HB-1]
 gi|317113542|gb|ADU96032.1| protein of unknown function DUF264 [Thermovibrio ammonificans HB-1]
          Length = 419

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 70/419 (16%), Positives = 146/419 (34%), Gaps = 58/419 (13%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            +Q+E ++ +D+H  +             I   R  GK+ + ++      +T+P  +++ 
Sbjct: 6   PYQIEIVKGIDSHKFSV------------IKMARQTGKSFVVSYWATRRATTKPNHAIVV 53

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
           ++ +E Q              L  +K    ++++ L    ++ D     L ++  + S +
Sbjct: 54  VSPTERQ------------SKLFVDKVKLHIKAMRLTGVKFFEDTELKKLEVNFPNGSQI 101

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNP 230
                   PD   G        +I DE +   +   +   +   +T +  +   +  S P
Sbjct: 102 --IALPANPDGIRGFSGD----VIMDEVAFFKNWQEVYRAVFPIITRK-KDYKLVAISTP 154

Query: 231 RRLSGKFYEIF--NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF 288
              +  FY ++  ++    W R+ ++               +     + D  R E   +F
Sbjct: 155 FGKNDLFYYLWSISENNPKWFRYSLNIFEAVAKGLKVDVEELRAGIKNEDAWRTEYLVEF 214

Query: 289 PQQDIDSFIPLNIIEEA-LNREPCPDPY-----APLIMGCDIAEEGGDNTVVV----LRR 338
             +  D+ +P  +I++  + +E             L  G D+     D TV+     L  
Sbjct: 215 IDEA-DAVLPYELIQKCEMPKEELLVEDIKELKGELYCGVDVGRR-KDLTVITLLEKLGD 272

Query: 339 GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML-GYHVYRV 397
              +  + + SK   R     IS     +    + ID    G +  + L+   G  V  V
Sbjct: 273 VLYVRRIEELSKKPFREQLELISHYA--HYARRLAIDETGLGMQLAEELKERFGSKVIPV 330

Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIE 456
                A + E    +   L  K  D      +     L ++L S++   V N G +  E
Sbjct: 331 YF--SAKNKEELAEK---LRAKFQD--RLIRVPADPDLREDLHSVRK-TVTNAGNVRYE 381


>gi|268589862|ref|ZP_06124083.1| phage terminase, large subunit, PBSX family [Providencia rettgeri
           DSM 1131]
 gi|291314845|gb|EFE55298.1| phage terminase, large subunit, PBSX family [Providencia rettgeri
           DSM 1131]
          Length = 470

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 67/487 (13%), Positives = 138/487 (28%), Gaps = 79/487 (16%)

Query: 66  CLNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTL 124
            +N +  P  E  +  +   GRG GK+        W +       ++  A     ++   
Sbjct: 3   QINPIFMPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILC 49

Query: 125 WAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTF 184
             E+   +S    +   +      +   +               +       +  +  + 
Sbjct: 50  ARELQNSISDSVIRLLEDTIEREGYNNEFEIQRTMIKHLGTGAEFMFYGIKNNPTKIKSL 109

Query: 185 VGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-K 243
            G           +EA          ++  + + N+   W+   NP+ +    Y+ F   
Sbjct: 110 EGVD-----VCWVEEAEAVTKESWDILIPTIRKPNSE-IWVSF-NPKNILDDTYQRFVVN 162

Query: 244 PLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303
           P DD      +              +      +  + R    G+       + I    +E
Sbjct: 163 PPDDICLLTANYTDNPHFPDVLRLEMEECKRKNPTLYRHIWLGEPVSASDMAIIKREWLE 222

Query: 304 EALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS 361
            A +  ++        +I   D ++ GGD     +R G V++ + +    D+    +  +
Sbjct: 223 AATDAHKKLGWKAKGAIIATHDPSDVGGDAKGYAMRHGSVVKRISEGLLMDVNDGADWAT 282

Query: 362 GLVEKYRPDAIIIDANNTGART--------------------------CDYLEMLGYHVY 395
               +   D  + D +  GA                             D L   G    
Sbjct: 283 EKAIQDGADHFLWDGDGLGAALRRQVTDAFTGKQTTVTMFKGSESPFDEDALYQSGAWAD 342

Query: 396 RVLGQKRAVDL-EFCRNRRTELHVKMADWL-------EFASLINHSGLIQ---------- 437
            V+    +  + +  RN+R + +  +AD L       E     N   +I           
Sbjct: 343 EVVSGDNSRTIGDVFRNKRAQFYYALADRLYLTYRAVEHGEYANPDDMISFDKEAIGEQM 402

Query: 438 ------NLKSLKSFIVPNTGELAIESKRVK----GAKSTDYSDGLMYTFAENPPRSDMDF 487
                  L  ++       G+L + +K       G  S + +D LM +        D   
Sbjct: 403 LEKLFAELTQIQR-KFNGNGKLELMTKVDMKVKLGIPSPNLADSLMMSMYCPVIIHDDTE 461

Query: 488 GRCPSYQ 494
              PS  
Sbjct: 462 IYVPSSS 468


>gi|85716479|ref|ZP_01047450.1| prophage MuMc02, terminase, ATPase subunit, putative [Nitrobacter
           sp. Nb-311A]
 gi|85696668|gb|EAQ34555.1| prophage MuMc02, terminase, ATPase subunit, putative [Nitrobacter
           sp. Nb-311A]
          Length = 250

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 50/262 (19%), Positives = 78/262 (29%), Gaps = 38/262 (14%)

Query: 51  PRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISV 110
           P  WQ E +            NP   +   +  +    GKTT+ A + L       G  V
Sbjct: 24  PDPWQAELLR----------LNPKRALLLCSRQS----GKTTVTALMALHRAIYETGALV 69

Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170
           + ++ S  Q    L  ++ K    L         ++                        
Sbjct: 70  VIVSPSNRQSGEML-RQIKKLHGSLKGAPELVGDAVLKVELA--------------NGSR 114

Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230
            +    +E+      G        +I DEAS   D +   +   L  R A+   I  + P
Sbjct: 115 IIALPGTEKTIRGIAG-----VSLVIIDEASRVDDELLAAVRPMLATR-ADGSLIALTTP 168

Query: 231 RRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290
               G FYE ++     W R ++       I   F    +   G        E    F  
Sbjct: 169 AGKRGFFYEAWHSDDQTWHRVRVAASDCPRISKEFLADELRSLG--PARYSEEYELAFVD 226

Query: 291 QDIDSFIPLNIIEEALNREPCP 312
               +F P  +IE A   E  P
Sbjct: 227 DAASAF-PTAVIERAFTTEVEP 247


>gi|298381518|ref|ZP_06991117.1| phage terminase large subunit [Escherichia coli FVEC1302]
 gi|298278960|gb|EFI20474.1| phage terminase large subunit [Escherichia coli FVEC1302]
          Length = 470

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 61/466 (13%), Positives = 133/466 (28%), Gaps = 79/466 (16%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  + +  +   W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       +      +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIFKREWLEA 223

Query: 305 ALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
           A +  ++        ++   D ++ G D      R G V++ + +    D+    +  + 
Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATS 283

Query: 363 LVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF---------- 408
           L  +   D  + D +  GA     T +             G +   D +           
Sbjct: 284 LAIEDGSDHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADE 343

Query: 409 -------------CRNRRTELHVKMADWLEFA---------SLINH-------------- 432
                         RN+R + +  +AD L            +  +               
Sbjct: 344 VVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKML 403

Query: 433 SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474
             L   L  ++     N G+L     +E K+  G  S + +D LM 
Sbjct: 404 EKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 448


>gi|167753387|ref|ZP_02425514.1| hypothetical protein ALIPUT_01661 [Alistipes putredinis DSM 17216]
 gi|167658012|gb|EDS02142.1| hypothetical protein ALIPUT_01661 [Alistipes putredinis DSM 17216]
          Length = 472

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 43/259 (16%), Positives = 92/259 (35%), Gaps = 31/259 (11%)

Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFP-QQDIDSFIPLNIIEEALNREPCPDP 314
               I+  + E + +       V +  +  G +    + ++    + I E    +     
Sbjct: 230 DNPFIEKDYIEALKST---TDKVKKERLLKGNWDYDDNPNALCSYDNIREIFYPKIH-TR 285

Query: 315 YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYR--PDAI 372
                +  DIA  G D   +++  G  I     + ++        I  L  K+R     I
Sbjct: 286 TGIKYITADIARFGSDRARILVWDGWAIIEQVSFDRSATTEIAACIESLAAKHRIPRYRI 345

Query: 373 IIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINH 432
           I D +  G    D   + G+     +   + ++ E   N +T+   K+A+ +   ++   
Sbjct: 346 IADEDGVGGGVVDMCRISGF-----VNNSQCLNGENFSNLQTQCGYKLANKINSFAISFD 400

Query: 433 SGL--------IQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMYTFAENPP 481
             L         + L+ L+++ V N  +L ++ K   +    +S D+ D L+        
Sbjct: 401 CELSDGQKDEITEELEQLQTWNVDNDRKLFLKPKDEIKQDIGRSPDWRDALLM------- 453

Query: 482 RSDMDFGRCPSYQYEGVDL 500
           R   D+ +      E + L
Sbjct: 454 RVWFDYKQIIPLSKEDLGL 472


>gi|238765385|ref|ZP_04626308.1| Gp33 TerL [Yersinia kristensenii ATCC 33638]
 gi|238696377|gb|EEP89171.1| Gp33 TerL [Yersinia kristensenii ATCC 33638]
          Length = 501

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 58/402 (14%), Positives = 121/402 (30%), Gaps = 64/402 (15%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F     S     W        + ++      + +  + +             
Sbjct: 106 ALFWKARKFVETLPSEFRGSWSEKKHAPYMRVEFPDTGAVIKGEAGDNIGR-----GDRT 160

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
                DE++     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 161 TLYFVDESAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHGGKIPVFT 214

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCP 312
              R+    D  +          +  V   E+   +        IP   ++ A++     
Sbjct: 215 FHWRSDPRKDDEW-YRKECEKIDNPVVVAQELDLNYQASAEGILIPSEWVQAAIDAHIHL 273

Query: 313 D--PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNN--KISGLVEKYR 368
           D  P    +   D+A+EG D     +R G +++ + +WS        +  K+ G  ++Y 
Sbjct: 274 DIQPSGARLGAMDVADEGRDKNGFAIRYGFLLQDVKEWSGEGSDIYASVVKVFGYCDEYG 333

Query: 369 PDAIIIDANNTGAR------TCDYLEM---------------------LGYHVYRVLGQK 401
            D    D +  GA         + L                           V    G+ 
Sbjct: 334 LDEFRFDEDGLGAGVRGDARVINELRQSERLGPITATPFRGSGAVFDPDDEAVIGDNGKP 393

Query: 402 RAVDLEFCRNRRTELHVKMADWLE-------------------FASLINHSGLIQNLKSL 442
             ++ +F  N + +    +                         +++ N   LI  L S 
Sbjct: 394 ARLNKDFFANAKAQGWWHLRKLFRNTFRAMKGMDYNPDEIISINSTMENKDRLIMEL-SQ 452

Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSD 484
            ++     G++ I+ K+ +G KS + +D +M  +A      D
Sbjct: 453 PTWSKNAVGKIVID-KQPEGTKSPNLADAVMINYAPMDSSLD 493


>gi|300824951|ref|ZP_07105051.1| conserved hypothetical protein [Escherichia coli MS 119-7]
 gi|300522580|gb|EFK43649.1| conserved hypothetical protein [Escherichia coli MS 119-7]
          Length = 540

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 58/396 (14%), Positives = 121/396 (30%), Gaps = 65/396 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              + DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R     D  ++     +   +  V   E+   +        IP   ++ A++     
Sbjct: 252 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D      R G ++E++ +WS   +D+  +  KI G  E+  
Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKIFGFCEQDN 370

Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401
            +    D +  GA         + L                           V    GQ 
Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 430

Query: 402 RAVDLEFCRNRRTELHVKMADWL--------EFASLINH------------SGLIQNLKS 441
             ++ +F  N + +   ++            E  +                  LI  L S
Sbjct: 431 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMAYNPDEIISISSSMALKDKLIIEL-S 489

Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
             ++ +   G++ I+ K+  G +S + +D +M  +A
Sbjct: 490 QPTYSINGVGKIVID-KQPDGTRSPNLADSVMINYA 524


>gi|254160843|ref|YP_003043951.1| hypothetical protein ECB_00733 [Escherichia coli B str. REL606]
 gi|253972744|gb|ACT38415.1| conserved hypothetical protein [Escherichia coli B str. REL606]
          Length = 540

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 57/396 (14%), Positives = 122/396 (30%), Gaps = 65/396 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              + DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R     D  ++     +   +  V   E+   +        IP   ++ A++     
Sbjct: 252 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D      R G ++E++ +WS   +D+  +  K+ G  E+  
Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 370

Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401
            +    D +  GA         + L  +                        V    GQ 
Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRNVARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 430

Query: 402 RAVDLEFCRNRRTELHVKMADWL--------EFASLINH------------SGLIQNLKS 441
             ++ +F  N + +   ++            E  +                  LI  L S
Sbjct: 431 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMAYNPDEIISISSSMALKDKLIIEL-S 489

Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
             ++ +   G++ I+ K+  G +S + +D +M  +A
Sbjct: 490 QPTYSINGVGKIVID-KQPDGTRSPNLADSVMINYA 524


>gi|168467237|ref|ZP_02701079.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Newport str.
           SL317]
 gi|195630466|gb|EDX49092.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Newport str.
           SL317]
          Length = 539

 Score = 90.6 bits (223), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 54/394 (13%), Positives = 117/394 (29%), Gaps = 62/394 (15%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F     +     W        + ++      + +  + +             
Sbjct: 143 ALFWKVRKFIATLPAEFRGGWDERKHSRFMSVEFPDTGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
                DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYFVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHSGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R+    D  +          +  +   E+   +        IP   ++ A++     
Sbjct: 252 FHWRSDPRKDDEW-YRKECEKIDNPIIVAQELDLNYQASAEGILIPSEWVQAAVDAHIKL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D     LR G ++  + +WS   +D+  +  K+ GL + + 
Sbjct: 311 GIQPSGQRLGAMDVADEGRDKNACSLRYGFLLSDVQEWSGKGSDIYDSVVKVFGLCDDFG 370

Query: 369 PDAIIIDANNTGART---------------CDYL-----EMLGYHVYRV-------LGQK 401
            D    D +  GA                  D +        G   Y          G+ 
Sbjct: 371 ADEFRFDEDGLGAGVRGDARAINELREAEGTDQITATPFRGSGRVFYPENEAVPGDNGKP 430

Query: 402 RAVDLEFCRNRRTELHVKMAD-------WLEFASLINHS-----------GLIQNLKSLK 443
             ++ +F  N + +    +          L+                     +    S  
Sbjct: 431 SRLNKDFFANAKAQGWWHLRKLFRNTFRALKGMEYDPDEIISISSTMENKDRLLMELSQP 490

Query: 444 SFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
           ++     G++ ++ K+  G KS + +D +M  +A
Sbjct: 491 TWSKNAVGKILVD-KQPDGTKSPNLADSVMIAYA 523


>gi|194430118|ref|ZP_03062621.1| gp33 TerL [Escherichia coli B171]
 gi|215487586|ref|YP_002330017.1| predicted terminase, large subunit [Escherichia coli O127:H6 str.
           E2348/69]
 gi|260845222|ref|YP_003223000.1| putative terminase large subunit [Escherichia coli O103:H2 str.
           12009]
 gi|194411828|gb|EDX28147.1| gp33 TerL [Escherichia coli B171]
 gi|215265658|emb|CAS10061.1| predicted terminase, large subunit [Escherichia coli O127:H6 str.
           E2348/69]
 gi|257760369|dbj|BAI31866.1| predicted terminase large subunit [Escherichia coli O103:H2 str.
           12009]
 gi|309702924|emb|CBJ02255.1| putative phage gp33 TerL [Escherichia coli ETEC H10407]
 gi|323159191|gb|EFZ45181.1| gp33 TerL [Escherichia coli E128010]
          Length = 540

 Score = 90.2 bits (222), Expect = 7e-16,   Method: Composition-based stats.
 Identities = 57/396 (14%), Positives = 121/396 (30%), Gaps = 65/396 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              + DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R     D  ++     +   +  V   E+   +        IP   ++ A++     
Sbjct: 252 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D      R G ++E++ +WS   +D+  +  K+ G  E+  
Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 370

Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401
            +    D +  GA         + L                           V    GQ 
Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 430

Query: 402 RAVDLEFCRNRRTELHVKMADWL--------EFASLINH------------SGLIQNLKS 441
             ++ +F  N + +   ++            E  +                  LI  L S
Sbjct: 431 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMAYNPDEIISISSSMALKDKLIIEL-S 489

Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
             ++ +   G++ I+ K+  G +S + +D +M  +A
Sbjct: 490 QPTYSINGVGKIVID-KQPDGTRSPNLADSVMINYA 524


>gi|168820654|ref|ZP_02832654.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Weltevreden
           str. HI_N05-537]
 gi|205342611|gb|EDZ29375.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Weltevreden
           str. HI_N05-537]
          Length = 539

 Score = 90.2 bits (222), Expect = 7e-16,   Method: Composition-based stats.
 Identities = 52/394 (13%), Positives = 116/394 (29%), Gaps = 62/394 (15%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F     +     W        + ++      + +  + +             
Sbjct: 143 ALFWKVRKFIATLPAEFRGGWDERKHSRFMSVEFPDTGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
                DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYFVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHSGKISVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R+    D  +          +  +   E+   +        IP   ++ A++     
Sbjct: 252 FHWRSDPRKDDEW-YRKECEKIDNPIIVAQELDLNYQASAEGILIPSEWVQAAVDAHIKL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D     LR G ++  + +WS   +D+  +  K+ GL + + 
Sbjct: 311 GIQPSGQRLGAMDVADEGRDKNACSLRYGFLLSDVQEWSGKGSDIYDSVVKVFGLCDDFG 370

Query: 369 PDAIIIDANNTGART---------------CDYLEMLGYHVYRV------------LGQK 401
            D    D +  GA                  D +    +                  G+ 
Sbjct: 371 ADEFRFDEDGLGAGVRGDARAINELREAEGTDQITATPFRGSGSVFYPENEAVPGDNGKP 430

Query: 402 RAVDLEFCRNRRTELHVKMAD-------WLEFASLINHS-----------GLIQNLKSLK 443
             ++ +F  N + +    +          L+                     +    S  
Sbjct: 431 ARLNKDFFANAKAQGWWHLRKLFRNTFRALKGMEYDPDEIISISSTMENKDRLLMELSQP 490

Query: 444 SFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
           ++     G++ ++ K+  G KS + +D +M  +A
Sbjct: 491 TWSKNAVGKILVD-KQPDGTKSPNLADSVMIAYA 523


>gi|218555117|ref|YP_002388030.1| hypothetical protein ECIAI1_2647 [Escherichia coli IAI1]
 gi|218361885|emb|CAQ99485.1| conserved hypothetical protein from bacteriophage origin
           [Escherichia coli IAI1]
          Length = 540

 Score = 90.2 bits (222), Expect = 8e-16,   Method: Composition-based stats.
 Identities = 57/396 (14%), Positives = 121/396 (30%), Gaps = 65/396 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              + DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R     D  ++     +   +  V   E+   +        IP   ++ A++     
Sbjct: 252 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D      R G ++E++ +WS   +D+  +  K+ G  E+  
Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLMENVREWSGVGSDIYQSVEKVFGFCEQDN 370

Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401
            +    D +  GA         + L                           V    GQ 
Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 430

Query: 402 RAVDLEFCRNRRTELHVKMADWL--------EFASLINH------------SGLIQNLKS 441
             ++ +F  N + +   ++            E  +                  LI  L S
Sbjct: 431 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMAYNPDEIISISSSMALKDKLIIEL-S 489

Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
             ++ +   G++ I+ K+  G +S + +D +M  +A
Sbjct: 490 QPTYSINGVGKIVID-KQPDGTRSPNLADSVMINYA 524


>gi|291283815|ref|YP_003500633.1| hypothetical protein G2583_3121 [Escherichia coli O55:H7 str.
           CB9615]
 gi|290763688|gb|ADD57649.1| hypothetical protein G2583_3121 [Escherichia coli O55:H7 str.
           CB9615]
          Length = 540

 Score = 90.2 bits (222), Expect = 8e-16,   Method: Composition-based stats.
 Identities = 57/395 (14%), Positives = 122/395 (30%), Gaps = 63/395 (15%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              + DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R     D  ++     +   +  V   E+   +        IP   ++ A++     
Sbjct: 252 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D      R G ++E++ +WS   +D+  +  K+ G  E+  
Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 370

Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401
            +    D +  GA         + L                           V    GQ 
Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 430

Query: 402 RAVDLEFCRNRRTELHVKMADWL----------------EFASLINHSGLIQNL---KSL 442
             ++ +F  N + +   ++                    E  S+ +   L   L    S 
Sbjct: 431 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMAYNPDEIISISSSMALKDKLIIALSQ 490

Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
            ++ +   G++ I+ K+  G +S + +D +M  +A
Sbjct: 491 PTYSINGVGKIVID-KQPDGTRSPNLADSVMINYA 524


>gi|62181180|ref|YP_217597.1| hypothetical protein SC2610 [Salmonella enterica subsp. enterica
           serovar Choleraesuis str. SC-B67]
 gi|62128813|gb|AAX66516.1| orf, partial conserved hypothetical protein [Salmonella enterica
           subsp. enterica serovar Choleraesuis str. SC-B67]
 gi|322715669|gb|EFZ07240.1| hypothetical protein SCA50_2790 [Salmonella enterica subsp.
           enterica serovar Choleraesuis str. A50]
          Length = 540

 Score = 89.8 bits (221), Expect = 9e-16,   Method: Composition-based stats.
 Identities = 55/396 (13%), Positives = 125/396 (31%), Gaps = 65/396 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWNEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              + DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R+    D  ++     +   +  V   E+   +        IP + ++ A++     
Sbjct: 252 FHWRSDPRKDDEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSDWVQAAVDAHIRL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D      R G ++E++ +WS   +D+  +  K+ G  E+  
Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 370

Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401
            +    D +  GA         + L                           V    GQ 
Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRKAARRPPILATPFRGSGAVFDPDDEAVRGDNGQA 430

Query: 402 RAVDLEFCRNRRTELHVKMADWLE--------------------FASLINHSGLIQNLKS 441
             ++ +F  N + +    +                          +++ +   LI  L S
Sbjct: 431 ARLNKDFFANAKAQSWWYLRKLFRNTYRAVVEGMAYNPDEIISISSTMESKDKLIIEL-S 489

Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
             ++ +   G++ ++ K+  G +S + +D +M ++A
Sbjct: 490 QPTYSINGVGKIVVD-KQPDGTRSPNLADSVMISYA 524


>gi|194445851|ref|YP_002040314.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Newport str.
           SL254]
 gi|194404514|gb|ACF64736.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Newport str.
           SL254]
          Length = 540

 Score = 89.8 bits (221), Expect = 9e-16,   Method: Composition-based stats.
 Identities = 55/396 (13%), Positives = 125/396 (31%), Gaps = 65/396 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWNEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              + DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R+    D  ++     +   +  V   E+   +        IP + ++ A++     
Sbjct: 252 FHWRSDPRKDDEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSDWVQAAVDAHIRL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D      R G ++E++ +WS   +D+  +  K+ G  E+  
Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 370

Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401
            +    D +  GA         + L                           V    GQ 
Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRKAARRPPILATPFRGSGAVFDPDDEAVRGDNGQA 430

Query: 402 RAVDLEFCRNRRTELHVKMADWLE--------------------FASLINHSGLIQNLKS 441
             ++ +F  N + +    +                          +++ +   LI  L S
Sbjct: 431 ARLNKDFFANAKAQSWWYLRKLFRNTYRAVVEGMAYNPDEIISISSTMESKDKLIIEL-S 489

Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
             ++ +   G++ ++ K+  G +S + +D +M ++A
Sbjct: 490 QPTYSINGVGKIVVD-KQPDGTRSPNLADSVMISYA 524


>gi|188494674|ref|ZP_03001944.1| gp33 TerL [Escherichia coli 53638]
 gi|188489873|gb|EDU64976.1| gp33 TerL [Escherichia coli 53638]
          Length = 539

 Score = 89.8 bits (221), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 56/395 (14%), Positives = 121/395 (30%), Gaps = 64/395 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              + DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHSGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R+    D  +          +  +   E+   +        IP   ++ A++     
Sbjct: 252 FHWRSDPRKDDEW-YHKECEKIDNPVIVAQELDLNYQASAEGILIPSEWVQAAVDAHIRL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D     LR G ++  + +WS   +D+  +  K+ GL + + 
Sbjct: 311 GIQPGGQRLGAMDVADEGRDKNACSLRYGILLNDVQEWSGKGSDIYDSVVKVFGLCDDFG 370

Query: 369 PDAIIIDANNTGART------CDYLEM---------------------LGYHVYRVLGQK 401
            D    D +  GA         + L                           V    G+ 
Sbjct: 371 ADEFRFDEDGLGAGVRGDARAINELREAEGICQITATPFRGSGSVFHPENEAVPGDNGKP 430

Query: 402 RAVDLEFCRNRRTELHVKMADWLE-------------------FASLINHSGLIQNLKSL 442
             ++ +F  N + +    +                         +++ N   L+  L S 
Sbjct: 431 ARLNKDFFVNAKAQGWWHLRKLFRNTFRALQGMEYDPDEIISISSTMENKDRLLMEL-SQ 489

Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
            ++    TG++ ++ K+  G KS + +D +M  +A
Sbjct: 490 PTWSKNATGKILVD-KQPDGTKSPNLADSVMIAYA 523


>gi|167553969|ref|ZP_02347711.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Saintpaul
           str. SARA29]
 gi|205321713|gb|EDZ09552.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Saintpaul
           str. SARA29]
          Length = 539

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 52/394 (13%), Positives = 116/394 (29%), Gaps = 62/394 (15%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F     +     W        + ++      + +  + +             
Sbjct: 143 ALFWKVRKFIATLPAEFRGGWDERKHSRFMSVEFPDTGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
                DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYFVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHSGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R+    D  +          +  +   E+   +        IP   ++ A++     
Sbjct: 252 FHWRSDPRKDDEW-YHKECEKIDNPIIVAQELDLNYQASTEGILIPSEWVQAAVDAHIKL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D     LR G ++  + +WS   +D+  +  K+ GL + + 
Sbjct: 311 GIQPSGQRLGAMDVADEGRDKNACSLRYGFLLSDVQEWSGKGSDIYDSVVKVFGLCDDFG 370

Query: 369 PDAIIIDANNTGART---------------CDYLEMLGYHVYRV------------LGQK 401
            D    D +  GA                  D +    +                  G+ 
Sbjct: 371 ADEFRFDEDGLGAGVRGDARAINELREAEGTDQITATPFRGSGSVFYPENEAVPGDNGKP 430

Query: 402 RAVDLEFCRNRRTELHVKMAD-------WLEFASLINHS-----------GLIQNLKSLK 443
             ++ +F  N + +    +          L+                     +    S  
Sbjct: 431 SRLNKDFFANAKAQGWWHLRKLFRNTFRALKGMEYDPDEIISISSTMENKDRLLMELSQP 490

Query: 444 SFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
           ++     G++ ++ K+  G KS + +D +M  +A
Sbjct: 491 TWSKNAVGKILVD-KQPDGTKSPNLADSVMIAYA 523


>gi|224582844|ref|YP_002636642.1| hypothetical protein SPC_1035 [Salmonella enterica subsp. enterica
           serovar Paratyphi C strain RKS4594]
 gi|224467371|gb|ACN45201.1| hypothetical protein SPC_1035 [Salmonella enterica subsp. enterica
           serovar Paratyphi C strain RKS4594]
          Length = 540

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 55/396 (13%), Positives = 125/396 (31%), Gaps = 65/396 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWNEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              + DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R+    D  ++     +   +  V   E+   +        IP + ++ A++     
Sbjct: 252 FHWRSDPRKDDEWYRRECEKI-DNPVVVAQELDLNYSASAEGILIPSDWVQAAVDAHIRL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D      R G ++E++ +WS   +D+  +  K+ G  E+  
Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 370

Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401
            +    D +  GA         + L                           V    GQ 
Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRKAARRPPILATPFRGSGAVFDPDDEAVRGDNGQA 430

Query: 402 RAVDLEFCRNRRTELHVKMADWLE--------------------FASLINHSGLIQNLKS 441
             ++ +F  N + +    +                          +++ +   LI  L S
Sbjct: 431 ARLNKDFFANAKAQSWWYLRKLFRNTYRAVVEGMAYNPDEIISISSTMESKDKLIIEL-S 489

Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
             ++ +   G++ ++ K+  G +S + +D +M ++A
Sbjct: 490 QPTYSINGVGKIVVD-KQPDGTRSPNLADSVMISYA 524


>gi|332088044|gb|EGI93169.1| gp33 TerL [Shigella boydii 5216-82]
          Length = 539

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 56/395 (14%), Positives = 120/395 (30%), Gaps = 64/395 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
                DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYFVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHSGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R+    D  +          +  +   E+   +        IP   ++ A++     
Sbjct: 252 FHWRSDPRKDDEW-YHKECEKIDNPVIVAQELDLNYQASAEGILIPSEWVQAAVDAHIRL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D     LR G ++  + +WS   +D+  +  K+ GL + + 
Sbjct: 311 GIQPGGQRLGAMDVADEGRDKNACSLRYGILLNDVQEWSGKGSDIYDSVVKVFGLCDDFG 370

Query: 369 PDAIIIDANNTGART------CDYLEM---------------------LGYHVYRVLGQK 401
            D    D +  GA         + L                           V    G+ 
Sbjct: 371 ADEFRFDEDGLGAGVRGDARAINELREAEGICQITATPFRGSGSVFHPENEAVPGDNGKP 430

Query: 402 RAVDLEFCRNRRTELHVKMADWLE-------------------FASLINHSGLIQNLKSL 442
             ++ +F  N + +    +                         +++ N   L+  L S 
Sbjct: 431 ARLNKDFFVNAKAQGWWHLRKLFRNTFRALQGMEYDPDEIISISSTMENKDRLLMEL-SQ 489

Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
            ++    TG++ ++ K+  G KS + +D +M  +A
Sbjct: 490 PTWSKNATGKILVD-KQPDGTKSPNLADSVMIAYA 523


>gi|323173153|gb|EFZ58784.1| gp33 TerL protein [Escherichia coli LT-68]
          Length = 539

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 56/395 (14%), Positives = 120/395 (30%), Gaps = 64/395 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
                DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYFVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHSGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R+    D  +          +  +   E+   +        IP   ++ A++     
Sbjct: 252 FHWRSDPRKDDEW-YHKECEKIDNPVIVAQELDLNYQASAEGILIPSEWVQAAVDAHIRL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D     LR G ++  + +WS   +D+  +  K+ GL + + 
Sbjct: 311 GIQPGGQRLGAMDVADEGRDKNACSLRYGILLNDVQEWSGKGSDIYDSVVKVFGLCDDFG 370

Query: 369 PDAIIIDANNTGART------CDYLEM---------------------LGYHVYRVLGQK 401
            D    D +  GA         + L                           V    G+ 
Sbjct: 371 ADEFRFDEDGLGAGVRGDARAINELREAEGICQITATPFRGSGSVFHPENEAVPGDNGKP 430

Query: 402 RAVDLEFCRNRRTELHVKMADWLE-------------------FASLINHSGLIQNLKSL 442
             ++ +F  N + +    +                         +++ N   L+  L S 
Sbjct: 431 ARLNKDFFVNAKAQGWWHLRKLFRNTFRALQGMEYDPDEIISISSTMENKDRLLMEL-SQ 489

Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
            ++    TG++ ++ K+  G KS + +D +M  +A
Sbjct: 490 PTWSKNATGKILVD-KQPDGTKSPNLADSVMIAYA 523


>gi|332759085|gb|EGJ89395.1| gp33 TerL [Shigella flexneri 4343-70]
          Length = 519

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 56/396 (14%), Positives = 121/396 (30%), Gaps = 65/396 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 122 ALFWKARKFVETLPVEFRGSWDEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 176

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              + DE++     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 177 TLYLVDESAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 230

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R     D  ++     +   +  V   E+   +        IP   ++ A++     
Sbjct: 231 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 289

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D      R G ++E++ +WS   +D+  +  K+ G  E+  
Sbjct: 290 GIQPTGKRLGAMDVADEGRDKNSFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 349

Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401
            +    D +  GA         + L                           V    GQ 
Sbjct: 350 LEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 409

Query: 402 RAVDLEFCRNRRTELHVKMADWL--------EFASLINH------------SGLIQNLKS 441
             ++ +F  N + +   ++            E                     LI  L S
Sbjct: 410 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMDYNPDEIISISSSMALKDKLIIEL-S 468

Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
             ++ +   G++ I+ K+  G +S + +D +M ++A
Sbjct: 469 QPTYSINGVGKIVID-KQPDGTRSPNLADSVMISYA 503


>gi|191172603|ref|ZP_03034142.1| gp33 TerL [Escherichia coli F11]
 gi|190907076|gb|EDV66676.1| gp33 TerL [Escherichia coli F11]
 gi|324014340|gb|EGB83559.1| hypothetical protein HMPREF9533_01599 [Escherichia coli MS 60-1]
          Length = 540

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 56/396 (14%), Positives = 120/396 (30%), Gaps = 65/396 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              + DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R     D  ++     +   +  V   E+   +        IP   ++ A++     
Sbjct: 252 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D      R G ++E++ +WS   +D+  +   + G  E+  
Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVENVFGFCEQDN 370

Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401
            +    D +  GA         + L                           V    GQ 
Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 430

Query: 402 RAVDLEFCRNRRTELHVKMADWL--------EFASLINH------------SGLIQNLKS 441
             ++ +F  N + +   ++            E  +                  LI  L S
Sbjct: 431 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMAYNPDEIISISSSMALKDKLIIEL-S 489

Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
             ++ +   G++ I+ K+  G +S + +D +M  +A
Sbjct: 490 QPTYSINGVGKIVID-KQPDGTRSPNLADSVMINYA 524


>gi|333006277|gb|EGK25786.1| gp33 TerL [Shigella flexneri K-218]
          Length = 540

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 56/395 (14%), Positives = 123/395 (31%), Gaps = 63/395 (15%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWDEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              + DE++     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYLVDESAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R     D  ++     +   +  V   E+   +        IP   ++ A++     
Sbjct: 252 FHWRDDPRKDEEWYRRECEKI-DNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D      R G ++E++ +WS   +D+  +  K+ G  E+  
Sbjct: 311 GIQPTGKRLGAMDVADEGRDKNSFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDN 370

Query: 369 PDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQK 401
            +    D +  GA         + L                           V    GQ 
Sbjct: 371 LEEFRFDEDGLGAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQA 430

Query: 402 RAVDLEFCRNRRTELHVKMADWL----------------EFASLINHSGLIQNL---KSL 442
             ++ +F  N + +   ++                    E  S+ +   L   L    S 
Sbjct: 431 ARLNKDFFANAKAQSWWRLRKLFQNTWRAVVEGMDYNPDEIISISSSMALKDKLIIELSQ 490

Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
            ++ +   G++ I+ K+  G +S + +D +M ++A
Sbjct: 491 PTYSINGVGKIVID-KQPDGTRSPNLADSVMISYA 524


>gi|320179507|gb|EFW54461.1| Phage terminase, large subunit [Shigella boydii ATCC 9905]
          Length = 539

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 56/395 (14%), Positives = 120/395 (30%), Gaps = 64/395 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
                DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYFVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHSGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R+    D  +          +  +   E+   +        IP   ++ A++     
Sbjct: 252 FHWRSDPRKDDEW-YHKECDKIDNPVIVAQELDLNYQASAEGILIPSEWVQAAVDAHIRL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D     LR G ++  + +WS   +D+  +  K+ GL + + 
Sbjct: 311 GIQPGGQRLGAMDVADEGRDKNACSLRYGILLNDVQEWSGKGSDIYDSVVKVFGLCDDFG 370

Query: 369 PDAIIIDANNTGART------CDYLEM---------------------LGYHVYRVLGQK 401
            D    D +  GA         + L                           V    G+ 
Sbjct: 371 ADEFRFDEDGLGAGVRGDARAINELREAEGICQITATPFRGSGSVFHPENEAVPGDNGKP 430

Query: 402 RAVDLEFCRNRRTELHVKMADWLE-------------------FASLINHSGLIQNLKSL 442
             ++ +F  N + +    +                         +++ N   L+  L S 
Sbjct: 431 ARLNKDFFVNAKAQGWWHLRKLFRNTFRALQGMEYDPDEIISISSTMENKDRLLMEL-SQ 489

Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
            ++    TG++ ++ K+  G KS + +D +M  +A
Sbjct: 490 PTWSKNATGKILVD-KQPDGTKSPNLADSVMIAYA 523


>gi|224583103|ref|YP_002636901.1| terminase large subunit [Salmonella enterica subsp. enterica
           serovar Paratyphi C strain RKS4594]
 gi|224467630|gb|ACN45460.1| terminase large subunit [Salmonella enterica subsp. enterica
           serovar Paratyphi C strain RKS4594]
          Length = 492

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 56/384 (14%), Positives = 108/384 (28%), Gaps = 79/384 (20%)

Query: 180 RPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL------TERN---ANRFWIMTSNP 230
             +   G+ N     +  +EA          ++  +       E      +  W+   NP
Sbjct: 104 NVENIKGYANFDAALV--EEAENVSKDSWETLIPTVRKEFYSAEYGRVVESEIWVA-YNP 160

Query: 231 RRLSGKFYEIF--NKPLDDW--------KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVT 280
           +      ++ F  N+   D+           QI+         +    +      + ++ 
Sbjct: 161 KNRLSDTHQRFVTNRIYPDYDENGNRYCIVKQINYTANPWFPETLRRDMEIMKKANHELY 220

Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRR 338
           R    G+       + I    +E A +            +I   D ++ G D     +R 
Sbjct: 221 RHVYLGEPVGASEMAIIKFAWLEAATDAHIKLGWKAKGAVIAAHDPSDTGPDAKGYAVRH 280

Query: 339 GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHV 394
           G V++ + +    D+    +  S L      D  + D +  GA       DY       V
Sbjct: 281 GSVVKRVCEGLLMDINEGADWASSLAVIDDVDHFLFDGDGLGAGLRRQITDYFSGKKVTV 340

Query: 395 YRVLGQKRAVDLEF-----------------------CRNRRTELHVKMADWL------- 424
               G +   D +                         RN+R + +  +AD L       
Sbjct: 341 TMFKGSESPFDEDAPYQAGAWTDEVVQGDNVRTIGDVFRNKRAQFYYTLADRLYRTYRAV 400

Query: 425 EFASLINHSG----------------LIQNLKSLKSFIVPNTGEL----AIESKRVKGAK 464
           E     +                   L   L  ++       G+L     +E K+  G  
Sbjct: 401 EHGEYADPDEMLSFDKEAIGENILNKLFAELTQIQR-KFNGNGKLELMTKVEMKQKLGIP 459

Query: 465 STDYSDGLMYTFAENPPRSDMDFG 488
           S + +D LM         +  D+ 
Sbjct: 460 SPNLADALMMCMHCPESVAQPDYS 483


>gi|110804738|ref|YP_688258.1| putative bacteriophage protein [Shigella flexneri 5 str. 8401]
 gi|110614286|gb|ABF02953.1| putative bacteriophage protein [Shigella flexneri 5 str. 8401]
          Length = 255

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 38/238 (15%), Positives = 72/238 (30%), Gaps = 49/238 (20%)

Query: 295 SFIPLNIIEEALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SK 350
           + I L+ IE A++  +    +P     +G D+A+ G D    V R G V+    +W   +
Sbjct: 10  AIIKLSWIEAAVDAHKTLNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKE 69

Query: 351 TDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG------------YHVYRVL 398
            +L  +  +      +   D I+ D+   GA        +              +  R  
Sbjct: 70  DELLKSCQRTYQAALEREAD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFN 128

Query: 399 GQ----------KRAVDLEFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ---- 437
                           + +F  N + +    +AD        +          LI     
Sbjct: 129 AGAGVHEPDDEYNGIPNKDFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSR 188

Query: 438 --------NLKSLKSFIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
                      +         G + +ESK+    +   S + +D  +  FA      D
Sbjct: 189 CPLLEKLKLELTTPHRDFDRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 246


>gi|167993618|ref|ZP_02574712.1| gp33 TerL [Salmonella enterica subsp. enterica serovar 4,[5],12:i:-
           str. CVM23701]
 gi|205328294|gb|EDZ15058.1| gp33 TerL [Salmonella enterica subsp. enterica serovar 4,[5],12:i:-
           str. CVM23701]
          Length = 539

 Score = 87.9 bits (216), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 56/404 (13%), Positives = 127/404 (31%), Gaps = 67/404 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWNEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              + DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
              R+    D  ++     +  +D+ V    E+   +        IP + ++ A++    
Sbjct: 252 FHWRSDPRKDDEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSDWVQAAVDAHIR 309

Query: 312 --PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKY 367
               P    +   D+A+EG D      R G ++E++ +WS   +D+  +  ++ G  E+ 
Sbjct: 310 LGIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVERVFGFCEQD 369

Query: 368 RPDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQ 400
             +    D +  GA         + L                           V    GQ
Sbjct: 370 NLEEFRFDEDGLGAGVRGDARAINELRKAARRPPILATPFRGSGAVFDPDDEAVRGDNGQ 429

Query: 401 KRAVDLEFCRNRRTELHVKMADWLE--------------------FASLINHSGLIQNLK 440
              ++ +F  N + +    +                          +++ +   LI  L 
Sbjct: 430 AARLNKDFFANAKAQSWWYLRKLFRNTYRAVVEGMAYNPDEIISISSTMESKDKLIIEL- 488

Query: 441 SLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSD 484
           S  ++ +   G++ ++ K+  G +S + +D  M ++A      D
Sbjct: 489 SQPTYSINGVGKIVVD-KQPDGTRSPNLADSAMISYAPMDSSLD 531


>gi|294650848|ref|ZP_06728195.1| bacteriophage terminase large subunit TerL [Acinetobacter
           haemolyticus ATCC 19194]
 gi|292823266|gb|EFF82122.1| bacteriophage terminase large subunit TerL [Acinetobacter
           haemolyticus ATCC 19194]
          Length = 552

 Score = 87.9 bits (216), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 54/337 (16%), Positives = 106/337 (31%), Gaps = 61/337 (18%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
               DE +         +   +++       I  S P  +  +F++  ++    +  F +
Sbjct: 210 MYFLDEWAFVERQ--EAVDAAISQ--NTNVHIKGSTPNGIGDRFHQ--DRFSGRYAVFSM 263

Query: 254 DTRTVEGIDP--SFHEGIIARYGL------DSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305
             R     +    ++   I  +        D  V   EV   +        IP   ++ A
Sbjct: 264 PWRANPDKNWTVEYNGKQIHPWYEKQLATLDDVVLAQEVDINYAASVEGVLIPSTWVQLA 323

Query: 306 LNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT--DLRTTNNKIS 361
           ++       +P    I G D+A+EG D      R G V+ +L  WS    D+  T  K  
Sbjct: 324 IDAHIKLGIEPTGDRIAGLDVADEGKDKNSFASRHGIVMTYLDTWSGKGDDIFGTTQKAM 383

Query: 362 GLVEKYRPDAIIIDANNTGAR------TCDYL-EMLGYHVYRVLGQKRA----------- 403
            L      D +  DA+  GA         + L    G     V   + +           
Sbjct: 384 DLSIDQSIDTLFYDADGLGAGCRGDARVVNELRREQGLSEVDVQPFRGSGAVHEPDEQMV 443

Query: 404 ---VDLEFCRNRRTELHVKMADWLEF-------------------ASLINHSGL--IQNL 439
               + +F  N + +    +    +                    +  I+   L  +   
Sbjct: 444 EMRFNKDFFANLKAQSWWSLRLRFQETFRALEGREYDRDMIISFSSEHIDPKELAMLTTE 503

Query: 440 KSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTF 476
            S  ++     G++ + +K+  G  S + +D +M  F
Sbjct: 504 LSQPTYTKNGVGKILV-NKQPDGTASPNRADSVMICF 539


>gi|322835667|ref|YP_004215693.1| terminase large subunit [Rahnella sp. Y9602]
 gi|321170868|gb|ADW76566.1| terminase large subunit [Rahnella sp. Y9602]
          Length = 539

 Score = 87.5 bits (215), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 63/404 (15%), Positives = 120/404 (29%), Gaps = 67/404 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W +      + ++      + +  + +             
Sbjct: 143 ALFWKARKFVEMLPVEFRGGWSAKKHAPYMRVEFPTTGAVLKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
                DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYFVDEAAFLQRPLL--IEASLSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGRIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
              R+    D +++    A+  +D+ V    E+   +        IP   I  A+N    
Sbjct: 252 FHWRSDPRKDEAWYAKECAK--IDNPVVVAQELDLNYSASAEGVLIPNEWIRAAINAHIK 309

Query: 312 --PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKY 367
               P    +   D+A+EG D      R G ++  + +WS   +D+  ++ K  GL +K+
Sbjct: 310 LGIQPTGKRLGAMDVADEGRDKNAFSARYGFLLTEVEEWSGVGSDIYKSSEKAFGLCDKH 369

Query: 368 RPDAIIIDANNTGARTCDYLEMLG----------YHVYRVLGQKRAVDLE---------- 407
             +    D +  GA        +                  G     D E          
Sbjct: 370 GLEEFRFDEDGLGAGVRGDARAINEIRKAEGARYILATPFRGSASVFDPEAEAVPGDNGQ 429

Query: 408 -------FCRNRRTELHVKMADWLE--------------------FASLINHSGLIQNLK 440
                  F  N + +    +                            + N   LI  L 
Sbjct: 430 PARINKDFFANAKAQSWWHLRKLFRNVYRAVEEKMDYNPDEIISISGDIKNLDKLIIEL- 488

Query: 441 SLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSD 484
           S  ++ +   G++ I  K+  G KS + SD +M  +A      D
Sbjct: 489 SQPTYSINGVGKI-IVDKQPDGTKSPNLSDSVMINYAPMDTTMD 531


>gi|238790716|ref|ZP_04634478.1| Gp33 TerL [Yersinia frederiksenii ATCC 33641]
 gi|238721211|gb|EEQ12889.1| Gp33 TerL [Yersinia frederiksenii ATCC 33641]
          Length = 538

 Score = 87.5 bits (215), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 58/356 (16%), Positives = 107/356 (30%), Gaps = 63/356 (17%)

Query: 172 MCRTYSEERPDTFVGH-HNTYGMAIINDEASGTP-DVINLGILGFLTERNANRFWIMTSN 229
              T S    +   G          I DE++      +    L   T    +      S 
Sbjct: 176 FPETESAMTGEAGDGIGRGDRTSFYIVDESAFLERPYLVDASLSATTNCRQD-----VST 230

Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFP 289
           P  ++  F E   +     K F    R     D ++++  +    LD      E+   + 
Sbjct: 231 PNGMANSFAE--RRHSGKIKVFTFHWRDDPRKDDAWYQKQVEN--LDPVTVAQEIDINYS 286

Query: 290 QQDIDSFIPLNIIEEALNREPC--PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD 347
                  IP   ++ A+N        P    +   DIA+EG D      R G ++E + +
Sbjct: 287 ASVEGVLIPSAWVQAAINAHEVLGIVPTGQRLGALDIADEGKDTNSFAGRHGFLLESIEE 346

Query: 348 WSKT--DLRTTNNKISGLVEKYRPDAIIIDANNTGAR------TC--DYLEMLGYHVYRV 397
           WS    D+  T  K   + +    +    D +  GA              E    H+   
Sbjct: 347 WSGKGDDIFGTVQKAFDICDAQNLETFRFDTDGLGAGARGDARVINEQREEQRRRHIVAT 406

Query: 398 -------------------LGQKRAVDLEFCRNRRTELHVKMADWL-------------- 424
                               GQ+  ++ +F  N + +    +                  
Sbjct: 407 PFRGSGGVTDPDDEAVPGDNGQQGRLNKDFFANAKAQGWWSLRTRFQKTYRAVKENMEFD 466

Query: 425 --EFASLINHSGLIQNLK---SLKSFIVPNTGELAIESKRVKGAKSTDYSD-GLMY 474
             E  S+      +  L    S  ++ V   G++ ++ K   G KS + +D  ++ 
Sbjct: 467 PDEIISIPKDLKNLTKLTSELSQPTYSVNGVGKIVVDKKPD-GTKSPNLADSAMIL 521


>gi|227113418|ref|ZP_03827074.1| Terminase large subunit [Pectobacterium carotovorum subsp.
           brasiliensis PBR1692]
          Length = 472

 Score = 87.5 bits (215), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 49/337 (14%), Positives = 94/337 (27%), Gaps = 61/337 (18%)

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQID 254
             +EA          ++  + +  +   W+   NP+ +    Y+ F   P DD     ++
Sbjct: 116 WVEEAEAVTKESWDILIPTIRKPGSE-IWVSF-NPKNILDDTYQRFVVTPPDDICLLTVN 173

Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC--P 312
                         +      +  + R    G+       + I    +E A +       
Sbjct: 174 YTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASEMAIIKREWLEAATDAHIKLGW 233

Query: 313 DPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKISGLVEKYRPDA 371
                ++   D ++ G D+    +R G V++ +       D+    +  + L      D 
Sbjct: 234 KAKGAIVAAHDPSDTGPDDKGYAMRHGSVVKRIASPPAPLDVNDGADWATDLAIADGADH 293

Query: 372 IIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLEF------------------- 408
            + D +  GA       D        V    G +   D +                    
Sbjct: 294 FLFDGDGLGAGLRRQVTDSFTGKKVTVTMFKGSESPFDEDSPYQAGAWFDEVVDGDNIRT 353

Query: 409 ----CRNRRTELHVKMADWL-----------------------EFASLINHSGLIQNLKS 441
                RN+R + +  +AD L                       E         L   L  
Sbjct: 354 IGDVFRNKRAQFYYTLADRLYLTYRAIVHGEYANPDDMLSFDKEAIGDQMLEKLFAELTQ 413

Query: 442 LKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474
           ++       G+L     +E K   G  S + +D LM 
Sbjct: 414 IQR-KFNGNGKLELMTKVEMKSKLGIPSPNLADSLMM 449


>gi|260906962|ref|ZP_05915284.1| hypothetical protein BlinB_16637 [Brevibacterium linens BL2]
          Length = 249

 Score = 86.7 bits (213), Expect = 8e-15,   Method: Composition-based stats.
 Identities = 45/258 (17%), Positives = 79/258 (30%), Gaps = 40/258 (15%)

Query: 50  APRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGIS 109
            P  WQ   +                +  +  +   R +GKTT  A+  L      PG  
Sbjct: 23  DPELWQERLLRT--------------QEARVLVLCARQVGKTTATAYKALHAAMFNPGRD 68

Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169
           V+ ++ S+ Q    L             +     + +   P    S+     L   S+  
Sbjct: 69  VLIVSPSQRQSDEML------------RRVASLYRGMKEAPKLSRSNTSEMGLSNGSR-- 114

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229
             +    SE     F G        +I DEAS   D +   +L  +         +  S 
Sbjct: 115 -VVSLPGSEGGIRGFAGVK-----LLILDEASRVDDDVFASVLPMVASDGQ---MVALST 165

Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFP 289
           P    G F+E+  +  + W+R ++     +   P     + A  G  S V   +   +F 
Sbjct: 166 PWGRRGWFHELHQETRNGWERHKVTVYESDQYTPPRIAEVKASLG--SFVFSSDYLCEF- 222

Query: 290 QQDIDSFIPLNIIEEALN 307
                       +  A +
Sbjct: 223 GDTDSQLFSTENVRAAFS 240


>gi|315426011|dbj|BAJ47659.1| prophage MuMc02, terminase, ATPase subunit [Candidatus
           Caldiarchaeum subterraneum]
          Length = 439

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 63/333 (18%), Positives = 112/333 (33%), Gaps = 25/333 (7%)

Query: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQ- 119
            +  H        +P  F+  +   RG G T   A        T P  +++ ++ S  Q 
Sbjct: 18  DIRLHPWQKRFIDDPSRFRIILKH-RGAGATFTIAAEACAEALTHPASTILLISYSLRQS 76

Query: 120 LKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE 179
           L+  ++  V   LS L NK      S+    A   +  +    G                
Sbjct: 77  LE--IFRHVRTILSRLENKRLKHGHSIYRLAAKIGARTVELGNGSRI--------ISLPN 126

Query: 180 RPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYE 239
            P++  G+      A+  DEA+      NL      T    N    + S P+   G F+E
Sbjct: 127 NPESLRGYRAD---AVYVDEAAFFRGDTNLKTAIMFTTVARNGRVTLVSTPKGKRGWFHE 183

Query: 240 IFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPL 299
            +      W +  +       I     E +  R  +     R E+  +F   ++++FIP 
Sbjct: 184 AWTTDNT-WSKHLVKLGDSPHITMHDLEEL--RKTMSPLEWRQEMMCEFLD-EVNAFIPY 239

Query: 300 NIIEEALNRE-PCPDPYAPLIMGCDIAEEGGDNTVV--VLRRGPVIE--HLFDWSKTDLR 354
             I E +    P       + +G D      D+TV+  V+  G      ++ +  +    
Sbjct: 240 EKILECVEDYVPARVVGGRVYVGVDFGRF-RDSTVIIAVVEDGERFRVCYVEELRQKPFA 298

Query: 355 TTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387
                I+       P  + +D+   GA   + L
Sbjct: 299 AQLEAINRANMVLHPAIVAVDSTGMGAPLAETL 331


>gi|297520464|ref|ZP_06938850.1| hypothetical protein EcolOP_22727 [Escherichia coli OP50]
          Length = 313

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 47/270 (17%), Positives = 90/270 (33%), Gaps = 56/270 (20%)

Query: 262 DPSFHEGIIARYGLDSD---VTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC--PDPYA 316
           DP   E    R     D   V   E+   +        IP   ++ A++        P  
Sbjct: 30  DPRKDEEWYRRECEKIDNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKLGIQPTG 89

Query: 317 PLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYRPDAIII 374
             +   D+A+EG D      R G ++E++ +WS   +D+  +  K+ G  E+   +    
Sbjct: 90  KRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDNLEEFRF 149

Query: 375 DANNTGART------CDYLEMLGYH---------------------VYRVLGQKRAVDLE 407
           D +  GA         + L  +                        V    GQ   ++ +
Sbjct: 150 DEDGLGAGVRGDARAINELRNVARRPSILATPFRGSGAVFDPDDEAVRGDNGQAARLNKD 209

Query: 408 FCRNRRTELHVKMADWL--------EFASLINH------------SGLIQNLKSLKSFIV 447
           F  N + +   ++            E  +                  LI  L S  ++ +
Sbjct: 210 FFANAKAQSWWRLRKLFQNTWRAVVEGMAYNPDEIISISSSMALKDKLIIEL-SQPTYSI 268

Query: 448 PNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
              G++ I+ K+  G +S + +D +M  +A
Sbjct: 269 NGVGKIVID-KQPDGTRSPNLADSVMINYA 297


>gi|327191373|gb|EGE58399.1| prophage MuMc02, terminase, ATPase subunit, putative [Rhizobium
           etli CNPAF512]
          Length = 248

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 45/262 (17%), Positives = 86/262 (32%), Gaps = 38/262 (14%)

Query: 50  APRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGIS 109
            P  WQ   +            NP   +   +  +    GK+T+ A+LV+      P   
Sbjct: 22  EPDPWQANLLRA----------NPRRSMLLCSRQS----GKSTVAAFLVIQTALFVPAAQ 67

Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169
           ++ ++ ++ Q    L+  +  +LS LP       +S                    S   
Sbjct: 68  IVVVSPTQRQ-SNELFRTIVGFLSRLPGAPRPTAESKQGTEL--------------SNGA 112

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229
             +    +E+      G        ++ DEA+   D +   +   +  +  +   +  + 
Sbjct: 113 RVLSLPGTEKTIRGIAGVD-----LVVMDEAARVEDALLTAVRPMMATK-PDARLVALTT 166

Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFP 289
           P    G FYE +      W+R ++       I   F +  +   G        E   +F 
Sbjct: 167 PAGKRGWFYEAWVSDDPSWERVRVPASACPRITQQFLDEELKALGA--IKFSEEYGLEFH 224

Query: 290 QQDIDSFIPLNIIEEALNREPC 311
             +  +  PL IIE A  +E  
Sbjct: 225 DPEE-AVFPLAIIEAAFTQEVR 245


>gi|315576663|gb|EFU88854.1| conserved hypothetical protein [Enterococcus faecalis TX0630]
          Length = 519

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 67/430 (15%), Positives = 137/430 (31%), Gaps = 62/430 (14%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS----LLPNKHWFEMQ 144
           GK+ L++ + +WL           +A  +      +   V+  L      +  K    + 
Sbjct: 92  GKSWLSSRIAVWLA---DHNRRCYVAGGKKDTTDIIMQHVTDTLQTVDESIARKLLEPVD 148

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204
            L           +  S G   +  S        +  +  +G    Y    I DE++   
Sbjct: 149 KLERLQTGLSKRKISFSGGGSIEGISLGEHFKGNKSGNQAIGRGGDY----IIDESAFVS 204

Query: 205 DVIN--LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL--DDWKRFQIDTRTVEG 260
           +     LG   F      N      SNP    G+FY+   +            D RT   
Sbjct: 205 NETYAELGRRNFANVDGKNYLSFEISNP-HNKGRFYDKLTQENIPKGMLVVWADVRTAFE 263

Query: 261 IDP-SFHEGIIARYGLDSDVTRVE------VCGQFPQQDIDSFIPLNIIEEALNREPCPD 313
            D     E +I      S+  + +         + P ++ D        EE        +
Sbjct: 264 EDRVKSIEQVI-----SSEFFQNKSTCQRYFLCELPDENEDGMFGTPQTEE-----EHTE 313

Query: 314 PYAPLIMGCDIAEEGGDN-----TVVVLRRGPVIEHLFDWSK---TDLRTTNNKISGL-- 363
                 +G D A +G D      + +  +    +    +  K    D  T+   I+ L  
Sbjct: 314 KNWEYFLGVDSAYKGKDKIKATLSALDAQGQVHVIDTIEIEKGDWQDGVTSKKIITQLLM 373

Query: 364 -VEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAV---------DLEFCRN 411
            +E +    + +D    G    + L  +   + ++ +                  ++  N
Sbjct: 374 IIEHFEVKGVCVDV-GYGVYIVEGLAHINGDFELHGINFGAGTTKERVEKKHYSAKYGAN 432

Query: 412 RRTELHVKMADWLEFASLINHSGLIQ---NLKSLKSFIVPNTGELAIESK---RVKGAKS 465
           +R E+H+ + + ++  ++     + +   +   L S  + + G+ AI  K   + K   S
Sbjct: 433 KRAEMHIDLQENIDNRNIFFTEKVYEEVIDELVLVSSKIKSNGKTAIVPKEEIKAKLGHS 492

Query: 466 TDYSDGLMYT 475
            D  D ++ +
Sbjct: 493 PDTLDSVLLS 502


>gi|255975409|ref|ZP_05425995.1| predicted protein [Enterococcus faecalis T2]
 gi|255968281|gb|EET98903.1| predicted protein [Enterococcus faecalis T2]
          Length = 519

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 67/430 (15%), Positives = 137/430 (31%), Gaps = 62/430 (14%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS----LLPNKHWFEMQ 144
           GK+ L++ + +WL           +A  +      +   V+  L      +  K    + 
Sbjct: 92  GKSWLSSRIAVWLA---DHNRRCYVAGGKKDTTDIIMQHVTDTLQTVDESIARKLLEPVD 148

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204
            L           +  S G   +  S        +  +  +G    Y    I DE++   
Sbjct: 149 KLERLQTGLSKRKISFSGGGSIEGISLGEHFKGNKSGNQAIGRGGDY----IIDESAFVS 204

Query: 205 DVIN--LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL--DDWKRFQIDTRTVEG 260
           +     LG   F      N      SNP    G+FY+   +            D RT   
Sbjct: 205 NETYAELGRRNFANVDGKNYLSFEISNP-HNKGRFYDKLTQENIPKGMLVVWADVRTAFE 263

Query: 261 IDP-SFHEGIIARYGLDSDVTRVE------VCGQFPQQDIDSFIPLNIIEEALNREPCPD 313
            D     E +I      S+  + +         + P ++ D        EE        +
Sbjct: 264 EDRVKSIEQVI-----SSEFFQNKSTCQRYFLCELPDENEDGMFGTPQTEE-----EHTE 313

Query: 314 PYAPLIMGCDIAEEGGDN-----TVVVLRRGPVIEHLFDWSK---TDLRTTNNKISGL-- 363
                 +G D A +G D      + +  +    +    +  K    D  T+   I+ L  
Sbjct: 314 KNWEYFLGVDSAYKGKDKIKATLSALDAQGQVHVIDTIEIEKGNWQDGVTSKKIITQLLM 373

Query: 364 -VEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAV---------DLEFCRN 411
            +E +    + +D    G    + L  +   + ++ +                  ++  N
Sbjct: 374 IIEHFEVKGVCVDV-GYGVYIVEGLAHINGDFELHGINFGAGTTKERVEKNHYSAKYGAN 432

Query: 412 RRTELHVKMADWLEFASLINHSGLIQ---NLKSLKSFIVPNTGELAIESK---RVKGAKS 465
           +R E+H+ + + ++  ++     + +   +   L S  + + G+ AI  K   + K   S
Sbjct: 433 KRAEMHIDLQENIDNRNIFFTEKVYEEVIDELVLVSSKIKSNGKTAIVPKEEIKAKLGHS 492

Query: 466 TDYSDGLMYT 475
            D  D ++ +
Sbjct: 493 PDTLDSVLLS 502


>gi|29376621|ref|NP_815775.1| hypothetical protein EF2112 [Enterococcus faecalis V583]
 gi|257090386|ref|ZP_05584747.1| predicted protein [Enterococcus faecalis CH188]
 gi|307276045|ref|ZP_07557178.1| hypothetical protein HMPREF9521_01673 [Enterococcus faecalis
           TX2134]
 gi|29344085|gb|AAO81845.1| hypothetical protein EF_2112 [Enterococcus faecalis V583]
 gi|256999198|gb|EEU85718.1| predicted protein [Enterococcus faecalis CH188]
 gi|306507375|gb|EFM76512.1| hypothetical protein HMPREF9521_01673 [Enterococcus faecalis
           TX2134]
          Length = 519

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 67/430 (15%), Positives = 137/430 (31%), Gaps = 62/430 (14%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS----LLPNKHWFEMQ 144
           GK+ L++ + +WL           +A  +      +   V+  L      +  K    + 
Sbjct: 92  GKSWLSSRIAVWLA---DHNRRCYVAGGKKDTTDIIMQHVTDTLQTVDESIARKLLEPVD 148

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204
            L           +  S G   +  S        +  +  +G    Y    I DE++   
Sbjct: 149 KLERLQTGLSKRKISFSGGGSIEGISLGEHFKGNKSGNQAIGRGGDY----IIDESAFVS 204

Query: 205 DVIN--LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL--DDWKRFQIDTRTVEG 260
           +     LG   F      N      SNP    G+FY+   +            D RT   
Sbjct: 205 NETYAELGRRNFANVDGKNYLSFEISNP-HNKGRFYDKLTQENIPKGMLVVWADVRTAFE 263

Query: 261 IDP-SFHEGIIARYGLDSDVTRVE------VCGQFPQQDIDSFIPLNIIEEALNREPCPD 313
            D     E +I      S+  + +         + P ++ D        EE        +
Sbjct: 264 EDRVKSIEQVI-----SSEFFQNKSTCQRYFLCELPDENEDGMFGTPQTEE-----EHTE 313

Query: 314 PYAPLIMGCDIAEEGGDN-----TVVVLRRGPVIEHLFDWSK---TDLRTTNNKISGL-- 363
                 +G D A +G D      + +  +    +    +  K    D  T+   I+ L  
Sbjct: 314 KNWEYFLGVDSAYKGKDKIKATLSALDAQGQVHVIDTIEIEKGDWQDGVTSKKIITQLLM 373

Query: 364 -VEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAV---------DLEFCRN 411
            +E +    + +D    G    + L  +   + ++ +                  ++  N
Sbjct: 374 IIEHFEVKGVCVDV-GYGVYIVEGLAHINGDFELHGINFGAGTTKERVEKNHYSAKYGAN 432

Query: 412 RRTELHVKMADWLEFASLINHSGLIQ---NLKSLKSFIVPNTGELAIESK---RVKGAKS 465
           +R E+H+ + + ++  ++     + +   +   L S  + + G+ AI  K   + K   S
Sbjct: 433 KRAEMHIDLQENIDNRNIFFTEKVYEEVIDELVLVSSKIKSNGKTAIVPKEEIKAKLGHS 492

Query: 466 TDYSDGLMYT 475
            D  D ++ +
Sbjct: 493 PDTLDSVLLS 502


>gi|315575102|gb|EFU87293.1| conserved hypothetical protein [Enterococcus faecalis TX0309B]
 gi|315582529|gb|EFU94720.1| conserved hypothetical protein [Enterococcus faecalis TX0309A]
          Length = 407

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 52/320 (16%), Positives = 106/320 (33%), Gaps = 51/320 (15%)

Query: 195 IINDEASGTPDVIN--LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL--DDWKR 250
            I DE++   +     LG   F      N      SNP    G+FY+   +         
Sbjct: 83  YIIDESAFVSNETYAELGRRNFANVDGKNYLSFEISNP-HNKGRFYDKLTQENIPKGMLV 141

Query: 251 FQIDTRTVEGIDP-SFHEGIIARYGLDSDVTRVE------VCGQFPQQDIDSFIPLNIIE 303
              D RT    D     E +I      S+  + +         + P ++ D        E
Sbjct: 142 VWADVRTAFEEDRVKSIEQVI-----SSEFFQNKSTCQRYFLCELPDENEDGMFGTPQTE 196

Query: 304 EALNREPCPDPYAPLIMGCDIAEEGGDN-----TVVVLRRGPVIEHLFDWSK---TDLRT 355
           E        +      +G D A +G D      + +  +    +    +  K    D  T
Sbjct: 197 E-----EHTEKNWEYFLGVDSAYKGKDKIKATLSALDAQGQVHVIDTIEIEKGDWQDGVT 251

Query: 356 TNNKISGL---VEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAV------ 404
           +   I+ L   +E +    + +D    G    + L  +   + ++ +             
Sbjct: 252 SKKIITQLLMIIEHFEVKGVCVDV-GYGVYIVEGLAHINGDFELHGINFGAGTTKERVEK 310

Query: 405 ---DLEFCRNRRTELHVKMADWLEFASLINHSGLIQ---NLKSLKSFIVPNTGELAIESK 458
                ++  N+R E+H+ + + ++  ++     + +   +   L S  + + G+ AI  K
Sbjct: 311 NHYSAKYGANKRAEMHIDLQENIDNRNIFFTEKVYEEVIDELVLVSSKIKSNGKTAIVPK 370

Query: 459 ---RVKGAKSTDYSDGLMYT 475
              + K   S D  D ++ +
Sbjct: 371 EEIKAKLGHSPDTLDSVLLS 390


>gi|315034678|gb|EFT46610.1| conserved hypothetical protein [Enterococcus faecalis TX0027]
          Length = 519

 Score = 85.2 bits (209), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 68/431 (15%), Positives = 138/431 (32%), Gaps = 64/431 (14%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS----LLPNKHWFEMQ 144
           GK+ L++ + +WL           +A  +      +   V+  L      +  K    + 
Sbjct: 92  GKSWLSSRIAVWLA---DHNRRCYVAGGKKDTTDIIMQHVTDTLQTVDESIARKLLEPVD 148

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204
            L           +  S G   +  S        +  +  +G    Y    I DE++   
Sbjct: 149 KLERLQTGLSKRKISFSGGGSIEGISLGEHFKGNKSGNQAIGRGGDY----IIDESAFVS 204

Query: 205 DVIN--LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL--DDWKRFQIDTRTVEG 260
           +     LG   F      N      SNP    G+FY+   +            D RT   
Sbjct: 205 NETYAELGRRNFANVDGKNYLSFEISNP-HNKGRFYDKLTQENIPKGMLVVWADVRTAFE 263

Query: 261 IDP-SFHEGIIARYGLDSDVTRVE------VCGQFPQQDIDSFIPLNIIEEALNREPCPD 313
            D     E +I      S+  + +         + P ++ D        EE        +
Sbjct: 264 EDRVKSIEQVI-----SSEFFQNKSTCQRYFLCELPDENEDGMFGTPQTEE-----EHTE 313

Query: 314 PYAPLIMGCDIAEEGGDN-----TVVVLRRGPVIEHLFDWSK---TDLRTTNNKISGL-- 363
                 +G D A +G D      + +  +    +    +  K    D  T+   I+ L  
Sbjct: 314 KDWEYFLGVDSAYKGKDKIKATLSALDAQGQVHVIDTIEIEKGDWQDGVTSKKIITQLLM 373

Query: 364 -VEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAV---------DLEFCRN 411
            +E +    + +D    G    + L  +   + ++ +                  ++  N
Sbjct: 374 IIEHFDVKGVCVDV-GYGVYIVEGLAHINGDFELHGINFGAGTTKERVEKNHYSAKYGAN 432

Query: 412 RRTELHVKMADWLEFASLI----NHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAK 464
           +R E+H+ + + ++  ++      +  +I  L  + S  + + G+ AI  K   + K   
Sbjct: 433 KRAEMHIDLQENIDNRNIFFTEKVYEEVIDELVLISS-KIKSNGKTAIVPKEEIKAKLGH 491

Query: 465 STDYSDGLMYT 475
           S D  D ++ +
Sbjct: 492 SPDTLDSVLLS 502


>gi|53793591|ref|YP_112491.1| terminase large subunit [Flavobacterium phage 11b]
 gi|53748181|emb|CAH56642.1| terminase large subunit [Flavobacterium phage 11b]
          Length = 432

 Score = 83.2 bits (204), Expect = 9e-14,   Method: Composition-based stats.
 Identities = 52/314 (16%), Positives = 114/314 (36%), Gaps = 38/314 (12%)

Query: 196 INDEASGTPDVINLGILG----FLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD----- 246
             DE +         +       L +       + T NP +    + + + K  +     
Sbjct: 126 FIDECNQITYKAWQIVKSRIRYKLNQYGIEPKMLGTCNPAKNW-VYAQFYLKDKNGTLDN 184

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDS-FIPLNIIEEA 305
           D K  Q        +  S+   +++   LD +  +    G +   +  +  I    I+  
Sbjct: 185 DKKFIQALPTDNPHLPASYLTSLLS---LDENSKQRLYYGNWEYDNDPAKLIDYEKIQNC 241

Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365
                 P  +  + +  DIA  G D  V+ +  G  +  +F  +K+ +      + GL  
Sbjct: 242 FTNTFIP--FGEMYISADIARFGSDKMVICVWSGFRVVEIFSMAKSSITEIAEAVRGLSI 299

Query: 366 KYRP--DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLE----FCRNRRTELHVK 419
           K++     +I D +           +        +   RA++++      +N +T+ + K
Sbjct: 300 KHKVPLSNVICDED-----GVGGGVVDVLGCTGFINNSRAMEVDNQVVQYQNLKTQCYYK 354

Query: 420 MAD-------WLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYS 469
           +A+       ++       +  + + L+ +K   + + G+L + SK   +    +S DYS
Sbjct: 355 LAEVIQSNNLYIHSEDATVNDEITKELEQVKRDKIDSDGKLQLISKDKVKQAIGRSPDYS 414

Query: 470 DGLMY-TFAENPPR 482
           D LM   + E  P+
Sbjct: 415 DALMMRMYFEFKPK 428


>gi|312126991|ref|YP_003991865.1| hypothetical protein Calhy_0759 [Caldicellulosiruptor
           hydrothermalis 108]
 gi|311777010|gb|ADQ06496.1| conserved hypothetical protein [Caldicellulosiruptor hydrothermalis
           108]
          Length = 444

 Score = 80.5 bits (197), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 57/335 (17%), Positives = 108/335 (32%), Gaps = 39/335 (11%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           AGR  GK+T+    V+   +T+        A S  Q K   + E  +      N    + 
Sbjct: 54  AGRRFGKSTVTLIDVVHECATKTKQVWYITAPSIDQAK-IYFQEFEQ---RAANNSLLDA 109

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
                  +P+    L     I  +         +        G        +   EA+  
Sbjct: 110 LVKDFKWSPFPEITLINGSKILGRS--------TSRNGVYLRGKGADG---VAITEAAFI 158

Query: 204 PDVIN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD----WKRFQIDTRTV 258
            D +    I   + +RN       T N        Y++F + L+D    +K F       
Sbjct: 159 KDKVYHDVIRAMVLDRNGVLRLETTPN---GMNYVYKLFQEGLNDSTGYYKSFHATVYDN 215

Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFI----PLNIIEEALNREPCPDP 314
           E +D    E I     +     R+E   +F +   DSFI     L  + +    +  P  
Sbjct: 216 ERLDREELERIRRE--IPELAWRIEYLAEFVE--DDSFIFPWNLLCEVFDDYELKKEPQN 271

Query: 315 YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLR---TTNNKISGLVEKYRPDA 371
                +G D+A+      ++VL        + ++ +   R        ++ L  KY    
Sbjct: 272 GHRYSIGVDLAKYQDYTVIIVLDITREPYQIVEYHRYQGRLYTDVVAHVNELQAKY-NAR 330

Query: 372 IIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406
           + +DA   G    + +     +    +  +++ + 
Sbjct: 331 VYLDATGVGDPIAEQVR----NCEPFVFSEKSRNK 361


>gi|333010190|gb|EGK29625.1| phage terminase large subunit domain protein [Shigella flexneri
           K-272]
 gi|333021147|gb|EGK40404.1| phage terminase large subunit domain protein [Shigella flexneri
           K-227]
          Length = 235

 Score = 80.5 bits (197), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 33/223 (14%), Positives = 63/223 (28%), Gaps = 47/223 (21%)

Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVE 365
           +    +P     +G D+A+ G D    V R G V+    +W   + +L  +  +      
Sbjct: 5   KTLNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAAL 64

Query: 366 KYRPDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRA 403
           +   D I+ D+   GA        +              +  R                 
Sbjct: 65  EREAD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGI 123

Query: 404 VDLEFCRNRRTELHVKMAD-------WLEFASLINHSGLIQ------------NLKSLKS 444
            + +F  N + +    +AD        +          LI                +   
Sbjct: 124 PNKDFFANLKAQAWWLVADRFRNTFNAINNGEQYPVDELISIDSRCPLLEKLKLELTTPH 183

Query: 445 FIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
                 G + +ESK+    +   S + +D  +  FA      D
Sbjct: 184 RDFDRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 226


>gi|48697520|ref|YP_024878.1| gp33 TerL [Burkholderia phage BcepB1A]
 gi|47717490|gb|AAT37736.1| gp33 TerL [Burkholderia phage BcepB1A]
          Length = 532

 Score = 79.4 bits (194), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 52/338 (15%), Positives = 105/338 (31%), Gaps = 60/338 (17%)

Query: 196 INDEASGTPDV-INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQID 254
             DEA+   +       L   T    +      S+   L+  F E   +     K   + 
Sbjct: 203 FVDEAAHLENAQAVDTALAATTNCRID-----ISSVNGLNNPFAE--KRFSGRVKVKTMH 255

Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC--P 312
            R     D  +++    ++  ++ V   E+   +        IPL  I+ A++ +     
Sbjct: 256 WRDDPRKDDEWYKKQKQKF--NALVVAQEIDIDYSASAEGVLIPLEWIDAAIDADVKLGL 313

Query: 313 DPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLR---TTNNKISGLVEKYRP 369
                     D+A+EG D      R G  +++   WS        TT   I  ++ +   
Sbjct: 314 TVTGQRFSSLDVADEGKDMNAFGSRLGIRMDYAESWSGKGSNIYGTTLRTIGLVIAQNGR 373

Query: 370 DAIIIDANNTGARTCDYLEMLGY--------HVYRVLGQKRAV----------------D 405
           D    D++  G       E +           +  +  +  +                 +
Sbjct: 374 DFQF-DSDGLGVGVRGDAEAINALPERKAYPKIDAIAFRGSSSVREPDKQVPGAYKGVKN 432

Query: 406 LEFCRNRRTELHVKMADWLE-------------------FASLINHSGLIQNLKSLKSFI 446
           ++F +NR+ + +  +    E                    +S I     I+       + 
Sbjct: 433 VDFFQNRKAQEYWALRMRFEATYRAVVEKLEYDPDEIISISSRIPDLQKIRMELHQPLYK 492

Query: 447 VPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSD 484
              TG++ I+ K   G  S +Y+D  M  +A    +  
Sbjct: 493 PSTTGKIMIQ-KTPDGMVSPNYADMTMMLYAPQQTKRG 529


>gi|269941618|emb|CBI50024.1| phage protein [Staphylococcus aureus subsp. aureus TW20]
          Length = 599

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 82/446 (18%), Positives = 132/446 (29%), Gaps = 100/446 (22%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           A RG+GKT L+A   L      PG  +I  A +++Q    L             K   E+
Sbjct: 82  ASRGLGKTFLSAVYCLTRCILYPGTKIIITAPTKSQGINVL------------EKIENEL 129

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
            S  +H      +  +    I   + S +    S    D   GH       ++ DE    
Sbjct: 130 LSPLIHREIESINTGNQKPMIAFHNGSWIRVVASN---DNARGHRAN---LLLVDEFVKV 183

Query: 204 P-DVINLGILGFLTERNANRFW---------------IMTSNPRRLSGKFYEIFN----- 242
             D+I+      LT +    F                +  S+    S   Y+        
Sbjct: 184 DEDLIDTVFKKMLTSQREPAFLHKAKYKNYPREENTQMYLSSAWMKSHWAYDSMRSFTKQ 243

Query: 243 ----KPLDDWKRF--QIDTRTVEGIDPSFHEGIIAR-------------------YGLDS 277
               K  DD K F   I   T        H+ + A                    +G   
Sbjct: 244 MLKKKSEDDLKSFVCHIPYYTGVMEKLYSHKQMKAEAQAEGFNKMKFAMEMEAVWWGETE 303

Query: 278 DVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGG---DNTVV 334
                     F ++   +F P  ++ +A    P  +P    ++  D+A  GG   D +V 
Sbjct: 304 SAFFNFNTIDFNRKLSQAFYPKEVLVQADINNPIKEPKEKRLLAVDVARMGGNSNDASVF 363

Query: 335 VLRR---------GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385
            L R            + ++ D    D +T   +I  L + +  D I++D  N GA   D
Sbjct: 364 SLIRLLPKGKQQYERQLNYMEDMEGIDFQTQAIRIRQLYDDFDCDYIVLDLKNVGAGILD 423

Query: 386 YLE------MLGYHVYRVLG------QKRAVDLEF--------CRNRR-TELHVKMADWL 424
            L         G     +               E           N R  E+   +AD  
Sbjct: 424 NLRIPLTDIDRGVEYEPLNVSNDDDLASTCKYPEAPRVIHVINATNERNMEMANLLADNF 483

Query: 425 EFASLINHSGLIQNLKSLKSFIVPNT 450
                     LI+  ++ + F     
Sbjct: 484 MRGKF---RLLIREEQAEELFRQDKK 506


>gi|57867562|ref|YP_189190.1| prophage, terminase, ATPase subunit [Staphylococcus epidermidis
           RP62A]
 gi|57638220|gb|AAW55008.1| prophage, terminase, ATPase subunit, putative [Staphylococcus
           epidermidis RP62A phage SP-beta]
          Length = 599

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 82/446 (18%), Positives = 132/446 (29%), Gaps = 100/446 (22%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           A RG+GKT L+A   L      PG  +I  A +++Q    L             K   E+
Sbjct: 82  ASRGLGKTFLSAVYCLTRCILYPGTKIIITAPTKSQGINVL------------EKIENEL 129

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
            S  +H      +  +    I   + S +    S    D   GH       ++ DE    
Sbjct: 130 LSPLIHREIESINTGNQKPMIAFHNGSWIRVVASN---DNARGHRAN---LLLVDEFVKV 183

Query: 204 P-DVINLGILGFLTERNANRFW---------------IMTSNPRRLSGKFYEIFN----- 242
             D+I+      LT +    F                +  S+    S   Y+        
Sbjct: 184 DEDLIDTVFKKMLTSQREPAFLHKAKYKNYPREENTQMYLSSAWMKSHWAYDSMRSFTRQ 243

Query: 243 ----KPLDDWKRF--QIDTRTVEGIDPSFHEGIIAR-------------------YGLDS 277
               K  DD K F   I   T        H+ + A                    +G   
Sbjct: 244 MLKKKSEDDLKSFVCHIPYYTGVMEKLYSHKQMKAEAQAEGFNKMKFAMEMEAVWWGETE 303

Query: 278 DVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGG---DNTVV 334
                     F ++   +F P  ++ +A    P  +P    ++  D+A  GG   D +V 
Sbjct: 304 SAFFNFNTIDFNRKLSQAFYPKEVLVQADINNPIKEPKEKRLLAVDVARMGGNSNDASVF 363

Query: 335 VLRR---------GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385
            L R            + ++ D    D +T   +I  L + +  D I++D  N GA   D
Sbjct: 364 SLIRLLPKGKQQYERQLNYMEDMEGIDFQTQAIRIRQLYDDFDCDYIVLDLKNVGAGILD 423

Query: 386 YLE------MLGYHVYRVLG------QKRAVDLEF--------CRNRR-TELHVKMADWL 424
            L         G     +               E           N R  E+   +AD  
Sbjct: 424 NLRIPLTDIDRGVEYEPLNVSNDDDLASTCKYPEAPRVIHVINATNERNMEMANLLADNF 483

Query: 425 EFASLINHSGLIQNLKSLKSFIVPNT 450
                     LI+  ++ + F     
Sbjct: 484 MRGKF---RLLIREEQAEELFRQDKK 506


>gi|326784324|ref|YP_004324722.1| terminase DNA packaging enzyme large subunit [Synechococcus phage
           S-SSM5]
 gi|310003555|gb|ADO97951.1| terminase DNA packaging enzyme large subunit [Synechococcus phage
           S-SSM5]
          Length = 549

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 66/377 (17%), Positives = 126/377 (33%), Gaps = 51/377 (13%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GK+T+    +LW +   P ++V  LAN     +  L          L   +    + L  
Sbjct: 85  GKSTIVTSYLLWYVLFNPNVNVAILANKAATAREML--------QRLQLSYENLPKWLQQ 136

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP---- 204
               W    L    G      ST          +            I  DE +  P    
Sbjct: 137 GILQWNRGSLELENGSKIMAASTSASAVRGMSFN-----------VIFLDEFAFIPNHIA 185

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEGI 261
           D     +   ++    +   I+ S P  ++  FY++++   +  +++   ++    V G 
Sbjct: 186 DQFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDAERGSNEYVPTEVHWSEVPGR 243

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD-------- 313
           D  + E  I          RVE   +F    +D+ I  + +      EP           
Sbjct: 244 DEVWKEQTIKNTSEQQ--FRVEFECEFL-GSVDTLISPSKLRIMPYHEPMNQNRGLAVFE 300

Query: 314 ---PYAPLIMGCDIAEE-GGDNTVV-VLRRGPVIEHLFDWSKTD---LRTTNNKISGLVE 365
              P    I+  D++   G D +   V+    +   +    K +        N I  + +
Sbjct: 301 QAIPEHNYILTVDVSRGVGNDYSAFTVMDTTTIPYKMVARYKNNEIKPIVLPNIIVDVAK 360

Query: 366 KYRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421
            Y    I+ + N+ G +  D     LE     +  + G+      +    ++T+L VKM+
Sbjct: 361 AYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMS 420

Query: 422 DWLEFASLINHSGLIQN 438
             ++     N   LI++
Sbjct: 421 TAVKQVGCSNLKALIED 437


>gi|158337379|ref|YP_001518554.1| hypothetical protein AM1_4258 [Acaryochloris marina MBIC11017]
 gi|158307620|gb|ABW29237.1| conserved domain protein [Acaryochloris marina MBIC11017]
          Length = 476

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 71/443 (16%), Positives = 133/443 (30%), Gaps = 77/443 (17%)

Query: 38  WGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWL 97
           W + G  L+ F     WQ + ++ ++     S +     + K     GR +G + L    
Sbjct: 41  WIKSGGSLKQFILWD-WQKDVVDWIEEPQSLSDSPKLSVIIK-----GRQLGLSQL---C 91

Query: 98  VLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDV 157
             W +                      W + S W+ ++ ++   +   L+       S  
Sbjct: 92  CSWFLY-------------------KAW-QNSAWVGVIISRTQSDSSLLASRMREMASTA 131

Query: 158 LHCSLGIDSKHYSTMCRT----YSEERPDTFVGHHNTYGMAIINDEASGTPDV--INLGI 211
                  DS     +       +     D   G        I+ DEA+   ++       
Sbjct: 132 GLVDFSTDSLLKLEISGGGTLHFRSAAVDAVRGI--DSVSGILFDEAAFQTNLKLSLSAA 189

Query: 212 LGFLTERNANRFWIMTSNPRRLSGKFYEIFN-----------------KPLDDW------ 248
              +++  ++   I+ S P   SG F++  N                  P++ W      
Sbjct: 190 TPAMSQVGSDARIILCSTPNGASGHFFDTLNGFDNCVSDIERIRSGELPPVNKWQREDGN 249

Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308
               I  ++V G +PS+ E +     L       E      +          ++  A   
Sbjct: 250 IAIAIHWKSVYGDNPSYLEDLEKSLSLPKAQIAQEYDLSLTESSS-VVFSFAVVRAAATG 308

Query: 309 EPCPD--PYAPLIMGCDIAEEGGDN--TVVVLRRGP--VIEHLFDWSKTDLRTTNNKISG 362
           E  P         +G D A  G D   +V + + G    +  L+      L     +I  
Sbjct: 309 EYEPQFTEDELYYVGVDPAGSGADYFCSVFLKKTGETFTVSKLYRKRTGTLEVHMGRIDE 368

Query: 363 LVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422
            ++   P  + ++ N  G    + LE     V                N +  L  ++  
Sbjct: 369 FIKASNPIKVTVETNGLGQFVYESLESRYGSVIERFNTT--------ANSKGALIGRLQL 420

Query: 423 WLEFASL--INHSGLIQNLKSLK 443
            LE   +     S L Q L S +
Sbjct: 421 ALERGHISYPAGSPLEQELLSFR 443


>gi|113200627|ref|YP_717790.1| terminase large subunit [Synechococcus phage syn9]
 gi|76574526|gb|ABA47091.1| terminase large subunit [Synechococcus phage syn9]
          Length = 549

 Score = 77.8 bits (190), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 57/377 (15%), Positives = 126/377 (33%), Gaps = 51/377 (13%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GK+T+    +LW +     ++V  LAN     +  L          L   +    + L  
Sbjct: 85  GKSTIVTSYLLWYVLFNANVNVAILANKAATAREML--------QRLQLSYENLPKWLQQ 136

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP---- 204
               W    L    G      ST          +            I  DE +  P    
Sbjct: 137 GILQWNRGSLELENGSKILAASTSASAVRGMSFN-----------VIFLDEFAFVPNHVA 185

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEGI 261
           D     +   ++    +   I+ S P  ++  FY++++   +  +++   ++    V G 
Sbjct: 186 DQFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDAERKANEYIPTEVHWSEVPGR 243

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA----- 316
           D ++ E  I          RVE   +F    +D+ I  + +   +  +P  +        
Sbjct: 244 DAAWKEQTIKNTSEQQ--FRVEFECEFL-GSVDTLISPSKLRTMVYGDPIAEKNGLSMYE 300

Query: 317 ------PLIMGCDIAEE--GGDNTVVVLRRGPVIEHLFDWSKTDLRT---TNNKISGLVE 365
                   ++  D++    G  +  +V+    +   L    + +        N I  +  
Sbjct: 301 KTIQGHTYVITADVSRGVSGDYSAFLVIDTTTIPYKLVAKYRNNDIKPILFPNIIVDVAR 360

Query: 366 KYRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421
            Y    ++++ N+ G +  D     LE     +  + G+      +    ++T++ +KM+
Sbjct: 361 NYNHAFVLVEVNDVGGQVADIIQYDLEYDNLLMCAMRGRAGQQLGQGFSGKKTQMGIKMS 420

Query: 422 DWLEFASLINHSGLIQN 438
              +     N   L+++
Sbjct: 421 SATKQVGCSNLKALLED 437


>gi|262276634|ref|ZP_06054439.1| P-loop protein [alpha proteobacterium HIMB114]
 gi|262225214|gb|EEY75661.1| P-loop protein [alpha proteobacterium HIMB114]
          Length = 409

 Score = 77.5 bits (189), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 46/302 (15%), Positives = 102/302 (33%), Gaps = 32/302 (10%)

Query: 78  FKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137
           F+  I+ GR  GKT L    +L          +  ++ +    K  +W ++ K       
Sbjct: 17  FRVLIT-GRRFGKTHLCLVEILRQARHCDNGKIFYVSPTYRMSKEIMWKQIKKL------ 69

Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197
                     +    W   +    L I   +   +    +++  D   G        ++ 
Sbjct: 70  ----------VKELRWDKYINETELTIVLVNNCQISLKGADKSADNLRGV---GLNFLVL 116

Query: 198 DEASGTPDVIN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL---DDWKRFQI 253
           DE +  P+      +   ++++ AN   +    P+      Y++F +      +WK ++ 
Sbjct: 117 DEFADIPEEAWTEVLRPTISDKYANGKVLFVGTPKGYGNWSYDMFQRGQAGDPEWKSWKY 176

Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313
            T     ++P   E   A+  LD+   R E    F        +  N       +    D
Sbjct: 177 TTIEGGQVEPHEIEQ--AKKDLDARSFRQEYEASFETYA--GVVYYNFDRAKNVKPVPYD 232

Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRG-PVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI 372
             A + +G D   +     +  +++G            ++ +   ++I+    +Y P  +
Sbjct: 233 QNAVIHIGMDFNIDPMSACLFYVKQGISYFFKEIVIYSSNTQEMIDEIT---RQYDPKRV 289

Query: 373 II 374
           I+
Sbjct: 290 IV 291


>gi|61806303|ref|YP_214662.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage
           P-SSM4]
 gi|61563847|gb|AAX46902.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage
           P-SSM4]
          Length = 550

 Score = 77.1 bits (188), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 57/376 (15%), Positives = 123/376 (32%), Gaps = 49/376 (13%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GK+T+    +LW +     ++V  LAN     +  L          L   +    + +  
Sbjct: 86  GKSTIVTAYLLWYVLFNANVNVAILANKAPTAREML--------GRLQLSYENLPKWMQQ 137

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVIN 208
               W    L    G      ST          +            I  DE +  P+ I 
Sbjct: 138 GILGWNKGSLELENGSKILASSTSASAVRGMSFN-----------IIFLDEFAFVPNHIA 186

Query: 209 LGILGFL---TERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEGID 262
                 +        +   I+ S P  ++ +FY++++   +  +++   ++    V G D
Sbjct: 187 EQFFASVYPTISSGKSTKVIIISTPHGMN-QFYKLWHDAERGANNYVATEVHWSQVPGRD 245

Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD--------- 313
             + +  I          RVE   +F    +D+ I  + +     ++P  +         
Sbjct: 246 DKWKQQTIEN--TSEAQFRVEFECEFL-GSVDTLITPSKLRIMPYKDPIQENRGLAVYEH 302

Query: 314 --PYAPLIMGCDIAEE-GGDNTVVVLRRGPVIEHLFDWSKTDL----RTTNNKISGLVEK 366
                  I+  D++   G D +   +     + +       +         N I  +   
Sbjct: 303 VQENHNYIITVDVSRGVGNDYSAFCVIDTTTVPYKVVARYKNNQIKPLVFPNLIVDVATN 362

Query: 367 YRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422
           Y    ++ + N+ G +  D     LE     +  + G+      +    ++T+L +KM+ 
Sbjct: 363 YNGAYVLCEVNDIGGQVADIIQYDLEYENLLMVSMRGRAGQQLGQGFSGKKTQLGIKMST 422

Query: 423 WLEFASLINHSGLIQN 438
            ++     N   LI++
Sbjct: 423 AVKQVGCSNLKALIED 438


>gi|326782611|ref|YP_004323017.1| terminase DNA packaging enzyme large subunit [Synechococcus phage
           S-SM1]
 gi|310002825|gb|ADO97224.1| terminase DNA packaging enzyme large subunit [Synechococcus phage
           S-SM1]
          Length = 549

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 60/377 (15%), Positives = 123/377 (32%), Gaps = 51/377 (13%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GK+T+    +LW +     ++V  LAN     +  L          L   +    + L  
Sbjct: 85  GKSTIVTSYLLWYVLFNDNVNVAILANKAATAREML--------QRLQLSYENLPKWLQQ 136

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP---- 204
               W    L    G      ST          +            I  DE +  P    
Sbjct: 137 GILQWNRGSLELENGSKIMAASTSASAVRGMSFN-----------VIFLDEFAFIPNHIA 185

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEGI 261
           D     +   ++    +   I+ S P  ++  FY++++   +  +++   ++    V G 
Sbjct: 186 DQFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDAERGTNEYIPTEVHWSEVPGR 243

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE---------EALNREPCP 312
           D  + E  I          RVE   +F    +D+ I  + +          E        
Sbjct: 244 DDVWKEQTIKNTSEQQ--FRVEFECEFL-GSVDTLISPSKLRIMPYHDPMKENRGLAIFE 300

Query: 313 D--PYAPLIMGCDIAEE-GGDNTVVV-LRRGPVIEHLFDWSKTD---LRTTNNKISGLVE 365
              P    ++  D++   G D +    +    +   +    + +        N +  + +
Sbjct: 301 QSIPDHNYVITVDVSRGVGNDYSAFCVMDTTTIPYKMVARYRNNEIKPIILPNIVVDVAK 360

Query: 366 KYRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421
            Y    I+ + N+ G +  D     LE     +  + G+      +    ++T+L VKM+
Sbjct: 361 NYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMS 420

Query: 422 DWLEFASLINHSGLIQN 438
             ++     N   LI++
Sbjct: 421 TAVKQVGCSNLKALIED 437


>gi|170023468|ref|YP_001719973.1| hypothetical protein YPK_1222 [Yersinia pseudotuberculosis YPIII]
 gi|169750002|gb|ACA67520.1| conserved hypothetical protein [Yersinia pseudotuberculosis YPIII]
          Length = 534

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 54/402 (13%), Positives = 122/402 (30%), Gaps = 65/402 (16%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F     +     W +      + I+     ++ +  + +             
Sbjct: 143 ALFWKARKFIETLPAEFRGSWDNKKHAPYMRIEFPDSGSIIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
                DE++     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TMYFVDESAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHGGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
              R+    D ++          +  +   E+   +        IP   ++ A+    + 
Sbjct: 252 FHWRSDPRKDDAW-YKKECEKIDNPVIVAQELDLNYNAAAEGILIPSEWVQAAIGAHTKL 310

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYR 368
              P    I   D+A+EG D      R G +++ L  WS   +D+  T      L ++  
Sbjct: 311 GITPSGARIGALDVADEGIDLNAFSSRTGVLLDRLKAWSGKGSDIYATTQDAMILSDEND 370

Query: 369 PDAIIIDANNTGARTCDYLEMLG----------YHVYRVLGQKR---------------- 402
            D ++ D++  GA       ++             +    G                   
Sbjct: 371 CDYLLYDSDGLGAGCRGDGRVINETRQKAGQRQVEIKPFRGSGEVIYPDKPVFKADTKRD 430

Query: 403 -AVDLEFCRNRRTELHVKMADWLEFA--------------------SLINHSGLIQNLKS 441
              + ++  NR+ +    +    +                      +L     LI  L S
Sbjct: 431 ARTNKDYFANRKAQGWWALRMRFQEVYRAVVKGMPFDPDEIISIDENLPEKEKLIAEL-S 489

Query: 442 LKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRS 483
             ++ +   G++ ++ K   G +S +++D +M  +A    R 
Sbjct: 490 QPTYTINGAGKVTVD-KAPSGTRSPNHADTVMICYAPEKIRR 530


>gi|326783331|ref|YP_004323723.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage
           Syn33]
 gi|310005278|gb|ADO99667.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage
           Syn33]
          Length = 549

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 64/393 (16%), Positives = 127/393 (32%), Gaps = 64/393 (16%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GK+T+    +LW +   P ++V  LAN     +  L          L   +    + L  
Sbjct: 85  GKSTIVTAYLLWYVLFNPNVNVAILANKAATAREML--------GRLQLSYENLPKWLQQ 136

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP---- 204
               W    L    G      ST          +            I  DE +  P    
Sbjct: 137 GILQWNRGSLELENGSKILAASTSASAVRGMSFN-----------VIFLDEFAFVPNHIA 185

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEGI 261
           D     +   ++    +   I+ S P  ++  FY++++   +  +++   ++    V G 
Sbjct: 186 DQFFSSVYPTVSS-GKSTKVIIISTPHGMN-MFYKLWHDAEQGKNEYLPTEVHWSQVPGR 243

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA-----------LNREP 310
           D ++ E  I          +VE   +F    +D+ I  + +              L    
Sbjct: 244 DAAWKEQTIKNTSEQQ--FKVEFECEFL-GSVDTLISPSKLRTMPYVDPVAQNKGLAIYE 300

Query: 311 CPDPYAPLIMGCDIAEE-GGDNTV-VVLRRGPVIEHLFDWSKTD---LRTTNNKISGLVE 365
             +     I+  D++   G D +  VV+    +   +    + +        N I  + +
Sbjct: 301 RVEAEHNYIITVDVSRGIGNDYSAFVVVDTTTMPYKVVARYRNNEIKPIIFPNIIIDVAK 360

Query: 366 KYRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421
            Y    I+ + N+ G +  D     LE     +  + G+      +    ++T+L VKM+
Sbjct: 361 NYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMS 420

Query: 422 DWLEFAS-------------LINHSGLIQNLKS 441
             ++                LI     I  L +
Sbjct: 421 SAVKQVGCSNLKALIEEDKLLIPDYETIAELTT 453


>gi|294508906|ref|YP_003566117.1| hypothetical protein PSR_11004 [Salinibacter ruber M8]
 gi|294342043|emb|CBH22709.1| conserved hypothetical protein [Salinibacter ruber M8]
          Length = 255

 Score = 75.9 bits (185), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 50/261 (19%), Positives = 79/261 (30%), Gaps = 40/261 (15%)

Query: 50  APRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGIS 109
            P  WQ   +           ++    +   A  +    GKTT +A L L          
Sbjct: 7   DPDPWQEALL----------TSDWERALLNCARQS----GKTTASAALALETALEATDSL 52

Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169
           V+ LA +  Q K  L   V             + QS           + + S  I     
Sbjct: 53  VLILAPARRQSKEFL-RSVRSLYRDAAPDGGLDKQS------ELRLRLENESRIIALPGK 105

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229
               R Y+ +               +I DEA+  PD   +     L        ++  S 
Sbjct: 106 EGTVRGYTAD--------------LVIADEAARVPDAAYVATRPMLAVTGGR--FVGLST 149

Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFP 289
           P    G FYE +  P  +W++ ++  +    +  +F E      G      R E   +F 
Sbjct: 150 PAGQRGWFYEAWTDPGQEWEQVKVTGQDCPRMTEAFLEQERREMG--DWQFRSEYMCEFT 207

Query: 290 QQDIDSFIPLNIIEEALNREP 310
               D       IE +L  E 
Sbjct: 208 D-TEDQLFATEHIESSLTSEV 227


>gi|323186590|gb|EFZ71927.1| gp33 TerL protein [Escherichia coli 1357]
          Length = 503

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 50/369 (13%), Positives = 105/369 (28%), Gaps = 58/369 (15%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
                DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 198 TLYFVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMNNPFAQ--KRHSGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC- 311
              R+    D  +          +  +   E+   +        IP   ++ A++     
Sbjct: 252 FHWRSDPRKDDEW-YHKECEKIDNPVIVAQELDLNYQASAEGILIPSEWVQAAVDAHIRL 310

Query: 312 -PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--TDLRTTNNKISGLVEKYR 368
              P    +   D+A+EG D     LR G ++  + +WS   +D+  +  K+ GL + + 
Sbjct: 311 GIQPGGQRLGAMDVADEGRDKNACSLRYGILLNDVQEWSGKGSDIYDSVVKVFGLCDDFG 370

Query: 369 PDAIIIDANNTGART------CDYLEM---------------------LGYHVYRVLGQK 401
            D    D +  GA         + L                           V    G+ 
Sbjct: 371 ADEFRFDEDGLGAGVRGDARAINELREAEGICQITATPFRGSGSVFHPENEAVPGDNGKP 430

Query: 402 RAVDLEFCRNRRTELHVKMADWLE-----FASLINHSGLIQNLKSLKSFIVPNTGELAIE 456
             ++ +F  N + +    +             +      I ++ S     + N   L +E
Sbjct: 431 ARLNKDFFVNAKAQGWWHLRKLFRNTFRALQGMEYDPDEIISISST----MENKDRLLME 486

Query: 457 ------SKR 459
                 SK+
Sbjct: 487 LSQPTWSKK 495


>gi|326782863|ref|YP_004323261.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage
           P-RSM4]
 gi|310004122|gb|ADO98516.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage
           P-RSM4]
          Length = 547

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 62/377 (16%), Positives = 120/377 (31%), Gaps = 51/377 (13%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GK+T+    +LW +     ++V  LAN     +  L          L   +      +  
Sbjct: 83  GKSTIVTAYLLWYVLFNANVNVAILANKAATAREML--------QRLQLSYENLPNWMQQ 134

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP---- 204
               W    L    G      ST          +            I  DE +  P    
Sbjct: 135 GILQWNRGSLELENGSKIMAASTSASAVRGMSFN-----------VIFLDEFAFIPNHIA 183

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEGI 261
           D     +   ++    +   I+ S P  ++  FY++++   +  +++   ++    V G 
Sbjct: 184 DQFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDAERGTNEYVPTEVHWSEVPGR 241

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE-----------EALNREP 310
           D  + E  I          RVE   +F    +D+ I  + +              L    
Sbjct: 242 DDVWKEQTIKNTSESQ--FRVEFECEFL-GSVDTLIAPSKLRIMPYHDPITSNRGLAVYE 298

Query: 311 CPDPYAPLIMGCDIAEE-GGDNTVVV-LRRGPVIEHLFDWSKTD---LRTTNNKISGLVE 365
              P    I+  D++   G D +    +    +   +    K +        N I  + +
Sbjct: 299 QVIPEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNIIVDIAK 358

Query: 366 KYRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421
            Y    I+ + N+ G +  D     LE     +  + G+      +    ++T+L VKM+
Sbjct: 359 NYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMS 418

Query: 422 DWLEFASLINHSGLIQN 438
              +     N   LI+ 
Sbjct: 419 TATKQVGCSNLKALIEE 435


>gi|326783550|ref|YP_004323947.1| terminase DNA packaging enzyme large subunit [Synechococcus phage
           Syn19]
 gi|310005053|gb|ADO99443.1| terminase DNA packaging enzyme large subunit [Synechococcus phage
           Syn19]
          Length = 549

 Score = 74.8 bits (182), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 64/377 (16%), Positives = 126/377 (33%), Gaps = 51/377 (13%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GK+T+    +LW +     ++V  LAN     +  L          L   +    + L  
Sbjct: 85  GKSTIVTSYLLWYVLFNQNVNVAILANKAATSREML--------QRLQLSYENLPKWLQQ 136

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP---- 204
               W    L    G           + S  R  +F          I  DE +  P    
Sbjct: 137 GILQWNRGSLELENGSKI---MAASTSSSAVRGMSFN--------VIFLDEFAFVPNHIA 185

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEGI 261
           D     +   ++    +   I+ S P  ++  FY++++   +  +++   ++    V G 
Sbjct: 186 DQFFSSVYPTISS-GQSTKVIIISTPHGMN-MFYKLWHDAERSKNEYIPTEVHWSEVPGR 243

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC---------- 311
           D  + E  IA         +VE   +F    +D+ I  + +      +P           
Sbjct: 244 DAKWKEQTIANTSEQQ--FKVEFECEFL-GSVDTLISPSKLRVMPYHDPIAQNKGLAVYK 300

Query: 312 -PDPYAPLIMGCDIAEE--GGDNTVVVLRRGPVIEHLFDWSKTD---LRTTNNKISGLVE 365
             +P    I+  D+A       +   V+    V   +    + +        N I  + +
Sbjct: 301 RAEPDHNYIITVDVARGTSNDYSAFCVMDTTTVPYEMVARYRNNEIKPIVFPNIIVDVAK 360

Query: 366 KYRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421
            Y    I+ + N+ G +  D     LE     +  + G+      +    ++T+L VKM+
Sbjct: 361 NYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMS 420

Query: 422 DWLEFASLINHSGLIQN 438
             ++     N   LI+ 
Sbjct: 421 TAVKQVGCSNLKALIEE 437


>gi|18138498|ref|NP_542602.1| probable terminase [Halorubrum phage HF2]
 gi|32453919|ref|NP_861683.1| hypothetical protein HalHV1gp095 [Halovirus HF1]
 gi|18000439|gb|AAL55022.1| probable terminase [Halorubrum phage HF2]
 gi|32346487|gb|AAO61393.1| hypothetical protein [Halovirus HF1]
          Length = 563

 Score = 74.8 bits (182), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 74/464 (15%), Positives = 127/464 (27%), Gaps = 82/464 (17%)

Query: 85  GRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144
           GR IG + +    +L     +P      L+ ++ Q      + +S   +L+ N       
Sbjct: 75  GRRIGVSYIIGICILIEALLKPDTFYPILSKTKGQSN----SRISDIKTLIKNAK----- 125

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204
            + +       D +    G   K Y+    +   E P             +  DE +   
Sbjct: 126 -IDIPLEKDNQDEIVLPNGSRIKAYTGDPDSARGEDPPK----------TVFIDEMAFLE 174

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYE---------------------IFNK 243
           D          T    +   +  S P+  + +F +                      F  
Sbjct: 175 DQSATLDAYLPTISLGSSQMVQVSTPKAQNDEFMDANERGTPDGRNDFGILALKQPTFKN 234

Query: 244 PLDDWKRFQIDTRTVEGIDPSFHEGIIA-RYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
             +      +  + VE +   F       +   D +    E   + P  D   F  +  I
Sbjct: 235 ADEIQTDVSLFEQDVEPVRGDFDLMAAETQRASDPNGFAQEYLCR-PVSDEYRFFSMPTI 293

Query: 303 EEALNREPCPD---------PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW----- 348
           E+A+ R    D             L+MG DI     D  +VV        +L        
Sbjct: 294 EDAMGRGAADDYSYGLRRYDTPNTLVMGVDIGFNSDDTAIVVFEHEGPRRYLRYHEVVND 353

Query: 349 -----------SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML-GYHVYR 396
                      S+ +      +IS +        +I+D    G    D +    G     
Sbjct: 354 RVLEQAGITPSSRQNPAAVAERISQVYNGMGVSNVIMDMTGVGQGFHDEVRRRIGRGYTG 413

Query: 397 VLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIE 456
                +    +   N    LH  +  WL          L + L ++      +  +    
Sbjct: 414 FNFSAKDKVEKMMGNMNYALHNDL-VWL-----PEDDSLREQLGAIVKQQKEDWQKPKFT 467

Query: 457 SKRVKGAKSTDYSDGLMYT--FAENPPRSDMDFGRCPSYQYEGV 498
            K      + D  D L      A  PP    D  R    Q E V
Sbjct: 468 GKE----HAPDGKDDLAMATVLAAFPPNFKSDKSRN-LQQREDV 506


>gi|326784562|ref|YP_004324947.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage
           P-SSM7]
 gi|310004595|gb|ADO98987.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage
           P-SSM7]
          Length = 550

 Score = 74.4 bits (181), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 61/378 (16%), Positives = 128/378 (33%), Gaps = 51/378 (13%)

Query: 88  IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147
            GK+T+    +LW +  +  ++V  LAN     +  L          L   +    + L 
Sbjct: 85  TGKSTIVTSYLLWYVLFKANVNVAILANKAATSREML--------QRLQLSYENLPKWLQ 136

Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP--- 204
                W    L    G           + S  R  +F          I  DE +  P   
Sbjct: 137 QGILQWNRGSLELENGSKI---MAASTSSSAVRGMSFN--------VIFLDEFAFVPNHI 185

Query: 205 -DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEG 260
            D     +   ++    +   I+ S P  ++  FY++++   +  +++   ++    V G
Sbjct: 186 ADQFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDAERGKNEYIPTEVHWSAVPG 243

Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC--------- 311
            D ++ +  IA         +VE   +F    +D+ I  + +      +P          
Sbjct: 244 RDAAWKDQTIANTSEQQ--FKVEFECEFL-GSVDTLISPSKLRTMPYEDPIIQNRGLAVY 300

Query: 312 --PDPYAPLIMGCDIAEEGG-DNTVVVLRRGPVI--EHLFDWSKTDL--RTTNNKISGLV 364
              +     I+  D+A     D +   +     +  E +  +   D+      N I  + 
Sbjct: 301 KQVEKDHNYIVTVDVARGVSQDYSAFCIIDTTTVPYELVAKYRNNDIKPIIFPNVIVDVA 360

Query: 365 EKYRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420
           + Y    ++ + N+ G +  D     LE        + G+      +    ++T+L VKM
Sbjct: 361 KNYNNAYVLCEVNDIGGQVADIIQFDLEYENLLQVAMRGRAGQQLGQGFSGKKTQLGVKM 420

Query: 421 ADWLEFASLINHSGLIQN 438
           +  ++     N   L++ 
Sbjct: 421 STAVKAVGCSNLKALLEE 438


>gi|218296727|ref|ZP_03497433.1| protein of unknown function DUF264 [Thermus aquaticus Y51MC23]
 gi|218242816|gb|EED09350.1| protein of unknown function DUF264 [Thermus aquaticus Y51MC23]
          Length = 425

 Score = 74.4 bits (181), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 73/377 (19%), Positives = 127/377 (33%), Gaps = 46/377 (12%)

Query: 88  IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147
            GK+               G + + L+  E Q +    AE +K       +    M+S  
Sbjct: 28  TGKSFALTLEAALHAVEHRGSTWVLLSAGERQSREL--AEKAKAHLDAMKQVGTLMES-R 84

Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD-- 205
                     L   L   S+             P T  G    Y   ++ DE +   D  
Sbjct: 85  FFEGGESVTQLEIRLPNLSRLIFLPA------NPRTARG----YTGNVVLDEFAFHQDSE 134

Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE----GI 261
            I   +   +T R  +    + S P    GKF+E++ K    W R ++           +
Sbjct: 135 AIWAAMYPIIT-RRPDLKIRVMSTPNGPRGKFWELWEKGGPAWSRHKVTIYDAVAQGLPV 193

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP--LI 319
           DP      +A    D  + + E   +F   +  +F+P ++I EA  RE    P+ P    
Sbjct: 194 DPEELRAGLA----DDFIWQQEYLCEFLSAEE-AFLPWSLILEAEAREDPRGPWNPDQAY 248

Query: 320 MGCDIAEEGGDNTVVVL--RRGPV--IEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIID 375
           +G D+     D TV V+  R G V  +  L    +        ++  L+ + R   +  D
Sbjct: 249 LGVDVGRH-RDLTVFVVLERVGDVYWVRLLETLHRAPFAQQEARLHALLPQVRRACL--D 305

Query: 376 ANNTGARTCDYLEM-LGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF--ASLINH 432
           A   G    +      GY V  V                 +L  ++  + E     +   
Sbjct: 306 ATGLGEMLAENARRAFGYKVEPVKFTPEVK---------ADLAQRLRLFFEDRRVRIPED 356

Query: 433 SGLIQNLKSLKSFIVPN 449
             L ++L S++  + P+
Sbjct: 357 RALREDLHSVRRIVTPS 373


>gi|182682964|ref|YP_001837088.1| terminase, large subunit [Enterobacteria phage EPS7]
 gi|182630676|gb|ACB97608.1| terminase, large subunit [Enterobacteria phage EPS7]
          Length = 438

 Score = 73.6 bits (179), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 60/329 (18%), Positives = 115/329 (34%), Gaps = 48/329 (14%)

Query: 66  CLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
            +N++ +P        +S  R +GK+ + A+ + +L    P + V+ +A + + L    W
Sbjct: 45  IINALEDPRHRFVTACVS--RRVGKSFI-AYTLGFLKLLEPNVKVLVVAPNYS-LANIGW 100

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
           +++                   +      ++  +           ++ +  S  + D+ V
Sbjct: 101 SQIR----------------GLIKKYGLQTERENAKDKEIELANGSLFKLASAAQADSAV 144

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIF-- 241
           G        II DEA+   DV        L  T    N   +  S PR   G +++ F  
Sbjct: 145 GRSYD---FIIFDEAA-ISDVGGAAFDIQLRPTLDKPNSKALFISTPRG--GNWFKEFYE 198

Query: 242 ---NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298
              N+ L +W       R     D +  E   AR  +  +  R E    F   +   F  
Sbjct: 199 KGFNETLPNWVSIHGTYRDNPRADLNDIEE--ARRTVSKNYFRQEYEADFSVFEGQIFDT 256

Query: 299 LNIIE-----EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRR----GPVIEHLFDWS 349
            N IE     + +      D     ++G D+     D T V+  +      V   L ++ 
Sbjct: 257 FNAIEHVKDLKGMRHFFKDDEAFETLLGIDVGY--RDPTAVLTIKYHYDTDVYYVLEEYQ 314

Query: 350 KTD--LRTTNNKISGLVEKYRPDAIIIDA 376
           + +         I   +++Y  D I +D+
Sbjct: 315 QAEKTTAQHATYIQHCIDRYNVDRIFVDS 343


>gi|46401884|ref|YP_006983.1| terminase, large subunit [Enterobacteria phage T5]
 gi|45775062|gb|AAS77194.1| terminase, large subunit [Enterobacteria phage T5]
 gi|59897286|gb|AAX12081.1| ORF144 [Enterobacteria phage T5]
          Length = 438

 Score = 73.6 bits (179), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 56/329 (17%), Positives = 113/329 (34%), Gaps = 48/329 (14%)

Query: 66  CLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
            +N++ +P        +S  R +GK+ + A+ + +L    P + V+ +A + + L    W
Sbjct: 45  IINALEDPRHRFVTACVS--RRVGKSFI-AYTLGFLKLLEPNVKVLVVAPNYS-LANIGW 100

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
           +++                   +      ++  +           ++ +  S  + D+ V
Sbjct: 101 SQIR----------------GLIKKYGLQTERENAKDKEIELANGSLFKLASAAQADSAV 144

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIF-- 241
           G        II DEA+   DV        L  T    N   +  S PR   G +++ F  
Sbjct: 145 GRSYD---FIIFDEAA-ISDVGGDAFRVQLRPTLDKPNSKALFISTPRG--GNWFKEFYA 198

Query: 242 ---NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298
              +  L +W       R     D +  E   AR  +  +  R E    F   +   F  
Sbjct: 199 YGFDDTLPNWVSIHGTYRDNPRADLNDIEE--ARRTVSKNYFRQEYEADFSVFEGQIFDT 256

Query: 299 LNIIE-----EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRR------GPVIEHLFD 347
            N I+     + +      D     ++G D+     D T V+  +         +   + 
Sbjct: 257 FNAIDHVKDLKGMRHFFKDDEAFETLLGIDVGY--RDPTAVLTIKYHYDTDTYYVLEEYQ 314

Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
            ++         I   +++Y+ D I +D+
Sbjct: 315 QAEKTTAQHAAYIQHCIDRYKVDRIFVDS 343


>gi|326633035|ref|YP_004306624.1| terminase large subunit [Enterobacteria phage SPC35]
 gi|321272229|gb|ADW80121.1| terminase large subunit [Enterobacteria phage SPC35]
          Length = 438

 Score = 73.2 bits (178), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 55/329 (16%), Positives = 113/329 (34%), Gaps = 48/329 (14%)

Query: 66  CLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
            +N++ +P        +S  R +GK+ + A+ + +L    P + V+ +A + + L    W
Sbjct: 45  IINALEDPRHRFVTACVS--RRVGKSFI-AYTLGFLKLLEPNVKVLVVAPNYS-LANIGW 100

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
           +++                   +      ++  +           ++ +  S  + D+ V
Sbjct: 101 SQIR----------------GLIKKYGLQTERENAKDKEIELANGSLFKLASAAQADSAV 144

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIF-- 241
           G        II DEA+   DV        L  T    N   +  S PR   G +++ F  
Sbjct: 145 GRSYD---FIIFDEAA-ISDVGGDAFRVQLRPTLDKPNSKALFISTPRG--GNWFKEFYA 198

Query: 242 ---NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298
              +  L +W       R     D +  E   AR  +  +  R E    F   +   F  
Sbjct: 199 YGFDDTLPNWVSIHGTYRDNPRADLNDIEE--ARRTVSKNYFRQEYEADFSVFEGQIFDT 256

Query: 299 LNIIE-----EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRR------GPVIEHLFD 347
            N I+     + +      D     ++G D+     D T V+  +         +   + 
Sbjct: 257 FNAIDHVKDLKGMRHFFKDDEAFETLLGIDVGY--RDPTAVLTIKYHYDTDTYYVLEEYQ 314

Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
            ++         I   +++Y+ D + +D+
Sbjct: 315 QAEKTTAQHAAYIQHCIDRYKVDRVFVDS 343


>gi|116624478|ref|YP_826634.1| hypothetical protein Acid_5400 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116227640|gb|ABJ86349.1| hypothetical protein Acid_5400 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 260

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 46/260 (17%), Positives = 82/260 (31%), Gaps = 27/260 (10%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            W    +          V +   +  +  ++  R  GK+T+ A   +       G   I 
Sbjct: 25  EWARRALGFEADAAQARVLDTRSK--RVLLNCTRQWGKSTVTAARAVHEAVKNAGSLTIA 82

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
           +  +  Q  T  +  V K  +        EM+              + S  +        
Sbjct: 83  VTPTARQ--TGEF--VRKAATFAS---GLEMRVKGDGHNEMSLAFPNGSRIVGLPGTEAT 135

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232
            R +S                 ++ DEAS   D + + +   L   +A   W+M S P  
Sbjct: 136 VRGFSA-------------VTLLLIDEASRVGDDLYMAMRPMLA-VSAGTLWLM-STPHG 180

Query: 233 LSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD 292
             G FYE +    + W+R  +           + E      G    + R E C +F  + 
Sbjct: 181 KRGFFYEAWANGGETWERVSVKAEDCPRFKAEYLEEERQVMGER--IYRQEYCCEF-GET 237

Query: 293 IDSFIPLNIIEEALNREPCP 312
             +    ++IE A + E  P
Sbjct: 238 SGAVFDRDLIEAAFSDEVTP 257


>gi|114320225|ref|YP_741908.1| hypothetical protein Mlg_1066 [Alkalilimnicola ehrlichii MLHE-1]
 gi|114226619|gb|ABI56418.1| hypothetical protein Mlg_1066 [Alkalilimnicola ehrlichii MLHE-1]
          Length = 463

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 71/473 (15%), Positives = 133/473 (28%), Gaps = 64/473 (13%)

Query: 15  LFDLMWSDEIKLS-FSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNP 73
           + D+M    +    F         W      L GF        E           S    
Sbjct: 5   IRDVMTDPALFGGQFGGDT-----WAAWRALLSGFYGLPLDDAEAQHWHALTDRESAPQS 59

Query: 74  NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSK--- 130
             +     +  GR  GK+   A L ++    +     +  A            EV+    
Sbjct: 60  AHDELWLVV--GRRGGKSNAAALLAVYEACFKDHRDAL--AP----------GEVATTRV 105

Query: 131 -WLSLLPNKHWFEMQSLSLHPAPWYSDVL-----HCSLGIDSKHYSTMCRTYSEERPDTF 184
                   +  F   S  +H  P    ++           +         ++   R  TF
Sbjct: 106 MAADRAQARSVFRYISGLMHANPMLERLIVREDRESIELSNRAVIEVGTASFRTTRGYTF 165

Query: 185 VGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP 244
                       +D+++     I   +   L   N     I  S+P    G+ +E + + 
Sbjct: 166 AAVIADEVAFWRSDDSANPDSEIIAAVRPGLATLNGK--LIALSSPYARRGELWENYRRH 223

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIE 303
                   +       ++PS  E ++             E   +F + D+++F+   ++E
Sbjct: 224 YGKASPILVAQAPSRTMNPSLPERVVTEAMERDPASAAAEYLAEF-RTDVETFLQREVVE 282

Query: 304 EALNREPCPDPYA---PLIMGCDIAEEGGDN--TVVVLRRGPV-IEHLFDWSKTDLRTTN 357
            A    P   PY          D A  G D     +  R G   +  +    K       
Sbjct: 283 AATRPTPLELPYNKRVTYTAFVDPAGGGADEFTAAIGHREGERVVVDVLRARKGTPAEIV 342

Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELH 417
            + + L++ YR    I D    G+   D     G  V     Q      +  R+    ++
Sbjct: 343 AEYADLLKSYRITRAISDRY-AGSWPADEFSRHGITVE----QAAKPKSDLYRDMLASMN 397

Query: 418 VKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAK-STDYS 469
                      L     L+  L             +++E +  +G + S D++
Sbjct: 398 SAR------VELPPDDRLMTQL-------------ISLERRTARGGRDSIDHA 431


>gi|51512091|gb|AAU05290.1| terminase large subunit [Enterobacteria phage T5]
          Length = 438

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 55/329 (16%), Positives = 112/329 (34%), Gaps = 48/329 (14%)

Query: 66  CLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
            +N++ +P        +S  R +GK+ + A+ + +L    P + V+ +A + + L    W
Sbjct: 45  IINALEDPRHRFVTACVS--RRVGKSFI-AYTLGFLKLLEPNVKVLVVAPNYS-LANIGW 100

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
           +++                   +      ++  +           ++ +  S  + D+ V
Sbjct: 101 SQIR----------------GLIKKYGLQTERENAKDKEIELANGSLFKLASAAQADSAV 144

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIF-- 241
           G        II DEA+   DV        L  T    N   +  S PR   G +++ F  
Sbjct: 145 GRSYD---FIIFDEAA-ISDVGGDAFRVQLRPTLDKPNSKALFISTPRG--GNWFKEFYA 198

Query: 242 ---NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298
              +  L +W       R     D +  E   AR  +  +  R E    F   +   F  
Sbjct: 199 YGFDDTLPNWVSIHGTYRDNPRADLNDIEE--ARRTVSKNYFRQEYEADFSVFEGQIFDT 256

Query: 299 LNIIE-----EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRR------GPVIEHLFD 347
            N  +     + +      D     ++G D+     D T V+  +         +   + 
Sbjct: 257 FNATDHVKDLKGMRHFFKDDEAFETLLGIDVGY--RDPTAVLTIKYHYDTDTYYVLEEYQ 314

Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
            ++         I   +++Y+ D I +D+
Sbjct: 315 QAEKTTAQHAAYIQHCIDRYKVDRIFVDS 343


>gi|307308946|ref|ZP_07588629.1| hypothetical protein SinmeBDRAFT_4513 [Sinorhizobium meliloti
           BL225C]
 gi|306900580|gb|EFN31193.1| hypothetical protein SinmeBDRAFT_4513 [Sinorhizobium meliloti
           BL225C]
          Length = 408

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 36/202 (17%), Positives = 71/202 (35%), Gaps = 24/202 (11%)

Query: 88  IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN--KHWFEMQS 145
            GKT + A  V W +     + V     SE+ +K  +W+ +    + + +  K  F++ +
Sbjct: 208 WGKTYVAAIAVWWSLVCFDDVKVTIFGPSESLIKNGMWSNLQALHARMASSFKDLFDVSA 267

Query: 146 LSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD 205
             +                         R  S +      G H      +  D+A G  +
Sbjct: 268 TRVSRKTAAP------------SCFAEYRLVSADNASAARGIHAVNN-FVFVDDADGVSE 314

Query: 206 VINLGILGFLTERNANRFWI--MTSN--PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGI 261
           V+   ++  + + N     +  M +N  P+  +    E+FN+ L   +   +        
Sbjct: 315 VVIAYLMNIMIDPNPKLCLLSTMFANETPKLETVTEAELFNEALSSLRAM-VSGEV--RT 371

Query: 262 DPSFHEGIIARYGLDSDVTRVE 283
           DP + E I  RY L++      
Sbjct: 372 DPVWLEAI--RYQLENAEYLAR 391


>gi|331650684|ref|ZP_08351739.1| conserved hypothetical protein [Escherichia coli M605]
 gi|331040472|gb|EGI12647.1| conserved hypothetical protein [Escherichia coli M605]
          Length = 414

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 44/325 (13%), Positives = 98/325 (30%), Gaps = 45/325 (13%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 89  ALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GDRT 143

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              + DEA+     +   I   L++    R  I  S+   ++  F +   +       F 
Sbjct: 144 TLYLVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMANPFAQ--KRHGGKIPVFT 197

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
              R     D  ++     +  +D+ V    E+   +        IP   ++  ++    
Sbjct: 198 FHWRDDPRKDEEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSEWVQATVDAHIK 255

Query: 312 --PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKY 367
               P    +   D+A+EG D      R G ++E++ +WS   +D+  +  K+ G  E+ 
Sbjct: 256 LGIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQD 315

Query: 368 RPDAIIIDANNTGART------CDYLEMLGYH---------------------VYRVLGQ 400
             +    D +  GA         + L  +                        V    GQ
Sbjct: 316 NLEEFRFDEDGLGAGVRGDARAINELRNVARRPSILATPFRGSGAVFDPDDEAVRGDNGQ 375

Query: 401 KRAVDLEFCRNRRTELHVKMADWLE 425
              ++ +F  N + +   ++    +
Sbjct: 376 AARLNKDFFANAKAQSWWRLRKLFQ 400


>gi|61806000|ref|YP_214360.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage
           P-SSM2]
 gi|61374509|gb|AAX44506.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage
           P-SSM2]
 gi|265525210|gb|ACY76007.1| terminase large subunit gp17 [Prochlorococcus phage P-SSM2]
          Length = 547

 Score = 72.1 bits (175), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 62/378 (16%), Positives = 124/378 (32%), Gaps = 51/378 (13%)

Query: 88  IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147
            GK+T     +L        ++V  LAN  +  +  L          L   +    + + 
Sbjct: 82  TGKSTTCISYLLHYAVFNDNVNVAVLANKASTARDLL--------GRLQLAYENLPRWMQ 133

Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP--- 204
                W    L    G      ST          +            I  DE +  P   
Sbjct: 134 QGIISWNKGSLELENGSKISANSTSSSAVRGGSYN-----------VIFLDEFAFIPNHI 182

Query: 205 -DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEG 260
            D     +   +T    +   I+ S PR ++  FY +++   K   ++    +    V G
Sbjct: 183 ADDFFASVYPTITS-GQSTKVIIVSTPRGMN-HFYRMWHDSEKGKSEYVATDVHWSEVPG 240

Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFI----PLNIIEEA-------LNRE 309
            D  + E  IA         ++E   +F    +++ I      N++ EA       L+  
Sbjct: 241 RDEEWKEQTIANTSEQQ--FKIEFECEFL-GSVNTLINPAKLRNLVYEAPKTRNAGLDIY 297

Query: 310 PCPDPYAPLIMGCDIAEE-GGDNTV-VVLRRGPVIEHLFDWSKTD---LRTTNNKISGLV 364
             P      I+  D+A   G D +  +V         +    + +        N I  + 
Sbjct: 298 ETPVKEHNYIITVDVARGLGNDYSAFIVFDTTEFPYKVVAKYRNNEIKPMLFPNIILDVA 357

Query: 365 EKYRPDAIIIDANNTGARTCD----YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420
           + Y    ++I+ N+ G +        LE     +  + G+   +  +    ++T+L V+M
Sbjct: 358 KGYNNAYLLIEVNDIGDQVASILQYDLEYENVLMASMRGRAGQIVGQGFSGKKTQLGVRM 417

Query: 421 ADWLEFASLINHSGLIQN 438
              ++     N   ++++
Sbjct: 418 TSAVKKLGCSNLKTMMED 435


>gi|255929035|ref|YP_003097347.1| DNA terminase packaging enzyme large subunit [Synechococcus phage
           S-RSM4]
 gi|255705321|emb|CAR63310.1| DNA terminase packaging enzyme large subunit [Synechococcus phage
           S-RSM4]
          Length = 550

 Score = 72.1 bits (175), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 47/358 (13%), Positives = 103/358 (28%), Gaps = 59/358 (16%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            +Q E +     +  N    P               GK+T     +L+       +++  
Sbjct: 62  DFQKEILRDFHENRFNIAKLPRQ------------TGKSTTVVAYLLYYAIFYDSVNIGI 109

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
           LAN  +  +  L          L   +    + +      W    +    G      ST 
Sbjct: 110 LANKASTARELL--------GRLQLAYENLPKWMQHGILVWNKGNVELENGSKILAASTS 161

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI----NLGILGFLTERNANRFWIMTS 228
                    +            +  DE +  P+ +       +   +T    +   I+ S
Sbjct: 162 ASAVRGMSFN-----------ILFLDEFAFVPNHVAEQFFASVYPTITS-GKSTKVIIIS 209

Query: 229 NPRRLSGKFYEIF---NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC 285
            P  ++  FY+++    +  +D+   ++    V G D  + E  I            E  
Sbjct: 210 TPNGMN-HFYKMWEDARRGKNDYVTNEVHWSQVPGRDAKWKEETIKN--TSPRQFAQEFE 266

Query: 286 GQFPQQDIDSFIPLNIIE-----------EALNREPCPDPYAPLIMGCDIAE--EGGDNT 332
             F     D+ I    ++             L+           I+  D+A    G  + 
Sbjct: 267 CDFL-GSADTLISPAKLQNIPFHDPIQSNAGLDVYERVQKDHEYIITVDVARGIGGDYSA 325

Query: 333 VVVLRRGPVIEHLFDWSKTDLRT---TNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387
            +V     +   +    + +        + I  + ++Y    ++++ N+ G      L
Sbjct: 326 FIVFDITTMPYKIVAKYRNNEIKPVLFPSVIFQVCKEYNNPYVLVEVNDIGDSIAATL 383


>gi|329849103|ref|ZP_08264131.1| phage terminase, large subunit, PBSX family [Asticcacaulis
           biprosthecum C19]
 gi|328844166|gb|EGF93735.1| phage terminase, large subunit, PBSX family [Asticcacaulis
           biprosthecum C19]
          Length = 430

 Score = 71.3 bits (173), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 69/435 (15%), Positives = 133/435 (30%), Gaps = 43/435 (9%)

Query: 58  FMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE 117
            +E + A+   +        F+ A   GRG  K+   A   ++     PG  V+ +   +
Sbjct: 24  ILEPIPAYRFLTKKPLGSFRFRAA-YGGRGAAKSWEFANAAIYHSLNTPGARVVFVREIQ 82

Query: 118 TQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS 177
             L  + +  V   L     +  F   +   H     +++L   L             + 
Sbjct: 83  GSLADSAFTLVRNRLEAYGLEGAFRQANGRFHHVENGAEILFLGL-------------WR 129

Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
             +P+             I +EAS         ++  +     +  W +  NP   +   
Sbjct: 130 GNKPEGIKSL--EGATLTIWEEASEGRQRSLDVLIPTVLRTPQSELWCLW-NPMLPTDPV 186

Query: 238 YEIFNKPLDDWKRF--QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDS 295
              F   ++  K    +++  +      +  E +      D         G +     ++
Sbjct: 187 DRFFRGDVEPQKTICRRVNWDSNPHFPEALREQMALDRKKDPLRAAWIWDGAYMPSAQNA 246

Query: 296 FIPLNIIEEAL--NREPCPDPYAPLIMGCDIAEEGGDNT--VVVLRRGPVIEHLFDWSKT 351
                +++ A    R+   +    +++G D A  GGD    VV  R G     + D    
Sbjct: 247 LWTRELLDRAWVQGRDKVMEAVGRVVVGVDPAGGGGDEVGIVVAGRYGAEGYIVLDDRSV 306

Query: 352 DLRTT---NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEF 408
             R+      ++   V+ Y  D ++++ N  G      L      V  V  + R V    
Sbjct: 307 AARSPEGWATEVLRAVDAYAADCVVVEKN-FGG----DLVASNLRVNGVHCRIREVTASR 361

Query: 409 CRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDY 468
            +  R E    + +  +         L   L       +   G            KS D 
Sbjct: 362 GKQVRAEPIAALYEQHKVYHRRPFPALEGQLL-----QMTPNGYAV-------KGKSPDR 409

Query: 469 SDGLMYTFAENPPRS 483
            D L++   E   RS
Sbjct: 410 LDALVWALTELSRRS 424


>gi|291336011|gb|ADD95601.1| large terminase protein [uncultured phage MedDCM-OCT-S09-C7]
          Length = 526

 Score = 71.3 bits (173), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 58/355 (16%), Positives = 110/355 (30%), Gaps = 47/355 (13%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141
           + A R  GK+  +   +LW +   P ++V  LAN                      +   
Sbjct: 80  VLASRQSGKSITSCAYLLWFLLFNPEVTVAVLANKG-----------------AIAREMI 122

Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCR-TYSEERPDTFVGHHNTYGMAIINDEA 200
                 L   P++       L   S  ++   +   +     +  G        +  DE 
Sbjct: 123 ARMVTMLESVPFFLQPGVKILNKGSIEFANDSKVVAAATSSSSIRGL---SINLLYLDEF 179

Query: 201 SGTPDV--INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD---WKRFQIDT 255
           +   D           +T    +   I+TS    +   FY+I+   + D   +K F I+ 
Sbjct: 180 AFVDDAETFYTATYPVVTS-GKDSKVIITSTANGVGNMFYKIYESAVHDQSEYKHFLINW 238

Query: 256 RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP----- 310
             V G D  + +  IA     S+    +  G       ++ I  N +   +++EP     
Sbjct: 239 FDVPGRDEEWKKETIAN---TSEAQFEQEYGNSFLGTGNTLINSNTLLGLMSKEPDWNKD 295

Query: 311 ------CPDPYAPLIMGCDI--AEEGGDNTVVVLRRGPVIEHLFDWSKTDL---RTTNNK 359
                  P      I   D+        +T  ++             + ++       + 
Sbjct: 296 GVKVYEKPKEGHTYITTVDVSKGRGIDYSTFTIMDISVKPFRQVCTYRDNMISPMLFPDL 355

Query: 360 ISGLVEKYRPDAIIIDANNTGARTCDYLE-MLGYHVYRVLGQKRAVDLEFCRNRR 413
           I+   + Y    +II+ N  G      L   + Y    V G  +A D+     +R
Sbjct: 356 IAKYTKPYNESLVIIENNAEGGMVATQLHYDIEYPNVFVQGMSKAEDIGVTMTKR 410


>gi|229605025|ref|YP_002875724.1| hypothetical protein P087_gp56 [Lactococcus phage P087]
 gi|227826008|gb|ACP41732.1| hypothetical protein [Lactococcus phage P087]
          Length = 578

 Score = 71.3 bits (173), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 71/460 (15%), Positives = 137/460 (29%), Gaps = 87/460 (18%)

Query: 85  GRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144
           G G GK+ +++   L  +    G  +   A +   L + ++ E+   ++  P       +
Sbjct: 105 GTGFGKSFVSSQCNL--VRANRGELITAFAPNRE-LNSVIFKEMVSAVNHSPKLKKVLFE 161

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERP-DTFVGHHNTYGMAIINDEASGT 203
           + S   A           G+  K ++     + +        G H++  M    DE +  
Sbjct: 162 AESKEEA--------LQRGVSQKRFAFPSGGFVDLTIAKNATGVHSSSYM----DEYALL 209

Query: 204 PDVINLGILG----FLTERNANRFWIMTSNPRRLSGKFYEIFNKPL--------DDWK-- 249
                    G    ++ +         TSNP  ++  + ++   PL         DW+  
Sbjct: 210 TKEEYNLAEGRAYAYVDKDGKPGKIFKTSNPHIMNFSYDDMIRNPLPPHEAVLWGDWRLN 269

Query: 250 ----------RFQIDTRTVEGIDPSFHEGIIARYGLD--------SDVT------RVEVC 285
                       Q+D       D          Y LD        S         R+   
Sbjct: 270 IGEGKFMELVYSQLDDEHKYLKDKFPLNREERDYLLDQAIQQVIWSPFFNDEDNLRILYL 329

Query: 286 GQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHL 345
            +F      +F         ++  P     +    G D+A  G D  +  L      +  
Sbjct: 330 SEFGVNTESAFFTTT---PKIDDSPIDWDNSTFYAGNDVAIRGTDACIYALLEYNPNKSY 386

Query: 346 FDWSKTDL------------RTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYH 393
                 +                   +   ++      + IDA+  G    + L      
Sbjct: 387 SRIVAFNNVKPQLWIDHETPMEMAQNVIRQLKHDNARLLAIDASGVGEGQFNLLTTDDAE 446

Query: 394 ----VYRVLGQKRAV------DLEFCRNRRTELHVKMADWLEFASLINHSG----LIQNL 439
               V  V     A       +     N+R+EL +   ++++  +L   S     L   +
Sbjct: 447 TSCPVVPVRFGDGASKWRKDKNAVRSHNKRSELFLDFKEFIDTDTLRVTSEVWEFLQAEM 506

Query: 440 KSLKSFIVPNTGELAIESK---RVK-GAKSTDYSDGLMYT 475
           +++         ++ IE K   + + G KSTDY D  M  
Sbjct: 507 QAVTKMSNDENKKIKIEPKDAIKKRLGGKSTDYLDSSMLA 546


>gi|116625333|ref|YP_827489.1| hypothetical protein Acid_6278 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116228495|gb|ABJ87204.1| hypothetical protein Acid_6278 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 260

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 40/225 (17%), Positives = 72/225 (32%), Gaps = 25/225 (11%)

Query: 88  IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147
            GK+T+ A   +    T+     I ++ +  Q  T  +  V K  +        +M+   
Sbjct: 58  WGKSTVTAARAVHEAVTKADSLTIAVSPTARQ--TGEF--VRKAEAFAGM---LKMKVKG 110

Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 207
                      + S  +         R +S                 ++ DEAS   D +
Sbjct: 111 DGSNEMSLAFPNGSRIVGLPGTEATVRGFSA-------------VALLLVDEASRVEDDL 157

Query: 208 NLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHE 267
            + +   L        W+M S P    G FYE +      W+R  +           + E
Sbjct: 158 YMAMRPMLAVSG-GTLWLM-STPWGKRGFFYEAWANGGPTWERVSVKAEDCPRFGAEYLE 215

Query: 268 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCP 312
                 G    + R E C +F +    +    ++IE A + +  P
Sbjct: 216 EERRVMGER--IYRQEYCCEFGESSS-AVFDRDLIEAAFSDDFGP 257


>gi|86372240|gb|ABC95184.1| GP17-terminase [Stenotrophomonas phage Smp14]
          Length = 536

 Score = 70.1 bits (170), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 48/326 (14%), Positives = 107/326 (32%), Gaps = 48/326 (14%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GKTT+ A ++LW         +  LAN   Q +  L    ++   +     WF    + +
Sbjct: 92  GKTTVVAAILLWYAIFNEEYRIAILANKGDQSREIL----ARLQLMYEELPWF----MQV 143

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI- 207
             + W  +  +  LG  S+ ++      +     +  G        +  DE +   + + 
Sbjct: 144 GVSVW--NKGNIKLGNRSEVFT------AATGGSSIRG---KSVNLMYLDEFAFVENDVD 192

Query: 208 -NLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTR---TVEGIDP 263
                   +T        I+TS P  ++  FY+I+    +    +  +          D 
Sbjct: 193 FYTSTYPVVTS-GTKTKVIITSTPNGMN-LFYKIWTDSTNGKNNYVHNEAFWHDHPKRDQ 250

Query: 264 SFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC------------ 311
           ++ +  +            E   +F Q   D+ +    +E+   ++              
Sbjct: 251 AWKDEQLRNMSERQ--FEQEFLCKF-QGSSDTLLSPAKLEQLTYQDHIRELGGNRDFKIY 307

Query: 312 --PDPYAPLIMGCDIAEE-GGDNTVV-VLRRGPVIEHLFDWSKTDLR---TTNNKISGLV 364
             P   A  ++  D++E  G D +V+ V              ++++       +  + + 
Sbjct: 308 EDPIKDASYVVTVDVSEGIGKDYSVISVFDTTEAPFRQVAMLRSNIIAPLILADLANRIG 367

Query: 365 EKYRPDAIIIDANNTGARTCDYLEML 390
             Y    +I++ N+ G      L   
Sbjct: 368 HLYNQAVLIVECNSIGNTVVTALWED 393


>gi|30044056|ref|NP_835653.1| similar to terminase DNA packaging enzyme, large subunit
           [Rhodothermus phage RM378]
          Length = 508

 Score = 70.1 bits (170), Expect = 9e-10,   Method: Composition-based stats.
 Identities = 55/335 (16%), Positives = 100/335 (29%), Gaps = 54/335 (16%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           G T       L  M       V+  AN E   K  L          +   +    + L +
Sbjct: 66  GVTWCAVAYALHQMIFNSNYKVLIAANKEATAKNVL--------ERIKFAYEQLPRFLQI 117

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD--V 206
               W     +      S   +   ++ S           +     +I +EA+   +   
Sbjct: 118 KKRTWNKT--YIEFSNYSSARAVSSKSDSGR---------SESITLLIVEEAAFISNMEE 166

Query: 207 INLGILGFLTERNANRFWIMTSNPRRLS-GKFYE----IFNKPLDDWKRFQIDTRTVEGI 261
           +   +   L             N      G +YE       +   ++K F I        
Sbjct: 167 LWASVQQTLATGGK-----CIVNSTYNGVGNWYERTIRAAKEGKSEFKYFGIKWSDHPER 221

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP---- 317
           D  + E       L   V   E+    PQ   ++ IP ++I E    +P    Y      
Sbjct: 222 DEKWFEEQKRL--LPPRVFAQEILCI-PQGSGENVIPFHLIREEEFIDPFVVKYGGDYWE 278

Query: 318 -------LIMGCDIA-EEGGDNTVVVLR------RGPVIEHLFDWS--KTDLRTTNNKIS 361
                    +  D A   G D + V ++      +   IE + +++  KT L      I 
Sbjct: 279 WYRKPGYYFISVDPASGRGEDRSAVGVQVLWVDPQTLTIEQVAEFASDKTSLPVMRQVIK 338

Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYR 396
            + ++++P  I I+ N  G     ++E     +  
Sbjct: 339 QIYDEFKPQLIFIETNGIGMGLYQFMEAYTPSIVG 373


>gi|213029404|ref|ZP_03343851.1| putative prophage terminase large subunit [Salmonella enterica
           subsp. enterica serovar Typhi str. 404ty]
          Length = 282

 Score = 69.8 bits (169), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 35/193 (18%), Positives = 70/193 (36%), Gaps = 8/193 (4%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252
            +  +EA    +     +   + +  +  ++    NP  ++   +      P +D    +
Sbjct: 75  VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 132

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D +       G     D  + I L+ IE A++  +  
Sbjct: 133 INYDENPFLSDTMLKVIDAARRRDPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 192

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKYR 368
              P     +G D+A+ G D    V R G VI    +W   + +L  +  +      + R
Sbjct: 193 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKEDELLKSCQRTYQAAME-R 251

Query: 369 PDAIIIDANNTGA 381
              I+ D+   GA
Sbjct: 252 DADIVYDSIGVGA 264


>gi|331648285|ref|ZP_08349374.1| conserved hypothetical protein [Escherichia coli M605]
 gi|331042834|gb|EGI14975.1| conserved hypothetical protein [Escherichia coli M605]
          Length = 219

 Score = 69.8 bits (169), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 36/205 (17%), Positives = 72/205 (35%), Gaps = 51/205 (24%)

Query: 322 CDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
            D+A+EG D      R G ++E++ +WS   +D+  +  K+ G  E+   +    D +  
Sbjct: 1   MDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDNLEEFRFDEDGL 60

Query: 380 GART------CDYLEMLGYH---------------------VYRVLGQKRAVDLEFCRNR 412
           GA         + L  +                        V    GQ   ++ +F  N 
Sbjct: 61  GAGVRGDARAINELRNVARRPSILATPFRGSGAVFDPDDEAVRGDNGQAARLNKDFFANA 120

Query: 413 RTELHVKMADWL--------EFASLINH------------SGLIQNLKSLKSFIVPNTGE 452
           + +   ++            E  +                  LI  L S  ++ +   G+
Sbjct: 121 KAQSWWRLRKLFQNTWRAVAEGMAYNPDEIISISSSMALKDKLIIEL-SQPTYSINGVGK 179

Query: 453 LAIESKRVKGAKSTDYSDGLMYTFA 477
           + I+ K+  G +S + +D +M  +A
Sbjct: 180 IVID-KQPDGTRSPNLADSVMINYA 203


>gi|256819733|ref|YP_003141012.1| hypothetical protein Coch_0896 [Capnocytophaga ochracea DSM 7271]
 gi|256581316|gb|ACU92451.1| hypothetical protein Coch_0896 [Capnocytophaga ochracea DSM 7271]
          Length = 450

 Score = 69.4 bits (168), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 43/295 (14%), Positives = 104/295 (35%), Gaps = 38/295 (12%)

Query: 217 ERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTR---------TVEGIDPSFHE 267
           E N     ++T+NP +       ++ +    +K   ++ R           + +   + +
Sbjct: 162 EYNLKGKLLITANPSKNF-----LYKEFYTPYKEGTLNKRRAFIQALPYDNKMLPKEYIQ 216

Query: 268 GIIAR-YGLDSDVTRVEVC-GQFP-QQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDI 324
            +     G +    +  +  G +    D +S    + I    N +  P       +  DI
Sbjct: 217 NLENTLRGAE----KQRLLNGLWEYDDDPNSLCDYDKILAIFNNDQLPKESTTY-LTADI 271

Query: 325 AEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY---RPDAIIIDANNTGA 381
           A  G D  V+ + +G  +  ++  + +        I+ L  KY   + + I  D +  G 
Sbjct: 272 ARFGSDLCVIGVWQGWELIEVYTLATSATTEIQALINTLRMKYNIPKGNCIA-DEDGVGG 330

Query: 382 RTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQ---- 437
              D   ++G+       ++        +N +T+   K+A+ +    +   + + +    
Sbjct: 331 GVVDNTGIVGFKNNSTPFEENG-QPTNYKNLQTQCLYKLAERINSNGIYISAEVSERTKE 389

Query: 438 --NLKSLKSFIVPNTG-ELAIESK---RVKGAKSTDYSDGLMY-TFAENPPRSDM 485
               +  +       G  L++ +K   +    +S DY D L+   + +  P+   
Sbjct: 390 MIIEEIEQIKSDNKDGQRLSVINKDTVKQAIGRSPDYRDMLLMREYFDLKPKRIF 444


>gi|291334534|gb|ADD94186.1| hypothetical protein Syncc9605_0456 [uncultured phage
           MedDCM-OCT-S04-C1220]
 gi|291335526|gb|ADD95137.1| hypothetical protein Syncc9605_0456 [uncultured phage
           MedDCM-OCT-S04-C491]
 gi|291335665|gb|ADD95272.1| hypothetical protein Syncc9605_0456 [uncultured phage
           MedDCM-OCT-S04-C846]
          Length = 354

 Score = 68.6 bits (166), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 51/270 (18%), Positives = 100/270 (37%), Gaps = 38/270 (14%)

Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE----RPDTFVGHHNTYGMAIINDEASG 202
            L P  W        L I+  + ST+    +E     R  +  G        ++ DEA+ 
Sbjct: 12  KLVPKVWIRTKNETDLRIELINGSTIELKGTENAMALRGRSLSG--------VVLDEAAF 63

Query: 203 T-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF----NKPLDDWKRFQIDTRT 257
              +V    I   L ++    + +  S P   +  FY+++    +   ++W+R+   T  
Sbjct: 64  MDAEVWFEVIRPALADKEG--WALFISTPDGTASWFYDLWCYVPDDETNEWQRWSYTTID 121

Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317
              +     E   A+  LD+   R E    F  +++   + ++  +E +++E       P
Sbjct: 122 GGNVSKHEVEAARAQ--LDTRTFRQEFEASF--ENLTGLVAISFSDENISQEAKDISIQP 177

Query: 318 LIMGCDIAEEGGD--NTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIII- 374
           L++G D      D  + +  ++ G  +    +   T   TT +    +  +Y  D  II 
Sbjct: 178 LLLGVD---FNVDPMSGICAVKNGETLYVFDEVMLTGGATTWDFAEEVTRRYGVDRRIIA 234

Query: 375 --DANN-----TGARTCDY--LEMLGYHVY 395
             D        +G    D+  L   G+ V 
Sbjct: 235 CPDPTGGARKTSGVGVTDHAILRRSGFTVQ 264


>gi|326804661|ref|YP_004327532.1| Gp17 terminase subunit for DNA packaging, nuclease and ATPase
           [Salmonella phage Vi01]
 gi|301795311|emb|CBW38029.1| Gp17 terminase subunit for DNA packaging, nuclease and ATPase
           [Salmonella phage Vi01]
          Length = 736

 Score = 68.6 bits (166), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 66/393 (16%), Positives = 121/393 (30%), Gaps = 66/393 (16%)

Query: 91  TTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHP 150
           TT+ A  +LW         +  LAN E Q    L   + K                +   
Sbjct: 269 TTVVAAFLLWYAMFHSDKEIAVLANKEKQAIEIL-DRIRK----------------AYQD 311

Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYS-EERPDTFVGHHNTYGMAIINDEASGTPDV--I 207
            P++        G     +    + Y+     D+  G        +  DE +   +    
Sbjct: 312 LPFFLQQGCEKFGSTLIEFENGSKIYAYATSSDSIRGR---SVSLLYVDEVAFIENDFEF 368

Query: 208 NLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD---WKRFQIDT---RTVEGI 261
                  +   + +R  I+TS P+   G FY+I  K       +  F +       V   
Sbjct: 369 WESTFPAIASADTSR-CILTSTPKGQRGLFYDIVTKADPRHPQYNDFHLTEVPWYKVPAY 427

Query: 262 --DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN---REP-----C 311
             DP +     AR G        E   +F +  + S IP   +++  +   REP      
Sbjct: 428 TKDPDWETKQRARLG--DARFDQEFGIKF-RGSVGSLIPAKCLDKMTSKLYREPNEFTKI 484

Query: 312 PDPYAPLIMGCDIAEEG----GDNTVV-VLRRGPVIEHLFDWSKTDL---RTTNNKISGL 363
              Y P  +   IA+ G    GD +V+ +L        +    + +          I+ +
Sbjct: 485 YKEYDPQRIYFGIADTGKGVEGDYSVLTILDITEYPHVIAAKYRNNTIPPMMYAYTIADM 544

Query: 364 VEKYRPDAIIIDA-NNTGARTCDYLEMLGYHVYRVLG-----------QKRAVDLEFCRN 411
             +Y    ++++  N+ G +    L     +   +               R  +     N
Sbjct: 545 CTEYGECPVLVETNNDVGGQVITILYQEIEYPEIIFTSTDNKGTGKRIGGRKPEPGINTN 604

Query: 412 R--RTELHVKMADWLE-FASLINHSGLIQNLKS 441
           R  R+     +   +E    +I     I  L +
Sbjct: 605 RKVRSIGCANLKALIEKEMLVIEDQDTIDELST 637


>gi|282599341|ref|YP_003358653.1| Gp17 terminase DNA packaging enzyme large subunit [Shigella phage
           phiSboM-AG3]
 gi|226973647|gb|ACO94400.1| Gp17 terminase DNA packaging enzyme large subunit [Shigella phage
           phiSboM-AG3]
          Length = 736

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 61/393 (15%), Positives = 124/393 (31%), Gaps = 66/393 (16%)

Query: 91  TTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHP 150
           TT+ A  +LW         +  LAN E Q    L   + K                +   
Sbjct: 269 TTVVAAFLLWYAMFHSDKEIAVLANKEKQAIEIL-DRIRK----------------AYQD 311

Query: 151 APWYSDVLHCSLGIDSKHYSTMCRTYS-EERPDTFVGHHNTYGMAIINDEASGTPDV--I 207
            P++        G     +    + Y+     D+  G        +  DE +   +    
Sbjct: 312 LPFFLQQGCEKFGSTLIEFENGSKIYAYATSSDSIRGR---SVSLLYVDEVAFIENDFEF 368

Query: 208 NLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD---WKRFQIDTRTVEGI--- 261
                  +   + +R  I+TS P+   G FY+I  K   +   +  F++       +   
Sbjct: 369 WESTFPAIASADTSR-CILTSTPKGQRGLFYDIVTKANPEHPQYNDFKLTEVPWYRVPTY 427

Query: 262 --DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR--------EPC 311
             DP++     A+ G        E   +F +  + S IP   +++  ++           
Sbjct: 428 TKDPNWESKQRAKLG--DARFDQEFGIKF-RGSVGSLIPAKCLDKMTSKLYQEPNEFTKI 484

Query: 312 PDPYAPLIMGCDIAEEG----GDNTVV-VLRRGPVIEHLFDWSKTDL---RTTNNKISGL 363
              Y P  +   IA+ G    GD +V+ +L        +    + +          I+ +
Sbjct: 485 YHDYDPKRIYMGIADTGKGVEGDYSVLTILDITDYPHKIAAKYRNNTIPPMMYAYTIADM 544

Query: 364 VEKYRPDAIIIDA-NNTGARTCDYLEMLGYHVYRVLG-----------QKRAVDLEFCRN 411
            EKY    ++++  N+ G +    L     +   +               R  +     N
Sbjct: 545 GEKYGTCPMLVETNNDVGGQVITILYQEIEYPEIIFTTTDAKGTGKRIGGRRPEPGINTN 604

Query: 412 R--RTELHVKMADWLE-FASLINHSGLIQNLKS 441
           +  R+     +   +E    +++    I  L +
Sbjct: 605 KKVRSNGCANLKALIEREMLVVDDQDTIDELST 637


>gi|291334627|gb|ADD94276.1| hypothetical protein Syncc9605_0456 [uncultured phage
           MedDCM-OCT-S04-C231]
          Length = 320

 Score = 68.2 bits (165), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 40/218 (18%), Positives = 83/218 (38%), Gaps = 26/218 (11%)

Query: 195 IINDEASGT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF----NKPLDDWK 249
           ++ DEA+    +V    I   L ++    + +  S P   +  FY+++         +W+
Sbjct: 56  VVLDEAAFMDAEVWFEVIRPALADKEG--WALFISTPDGTASWFYDLWCYVPEDETGEWQ 113

Query: 250 RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309
           R+   T     +     E   A+  LD+   R E    F  +++   + ++  +E +++E
Sbjct: 114 RWSYTTIEGGNVSKHEVEAARAQ--LDNRTFRQEFEASF--ENLTGLVAISFSDENISQE 169

Query: 310 PCPDPYAPLIMGCDIAEEGGD--NTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY 367
                  PL++G D      D  + +  ++ G  +    +   T   TT +    +  +Y
Sbjct: 170 AKDISIQPLLLGVD---FNVDPMSGICAVKNGETLYVFDEIMLTGGATTWDFAEEVTRRY 226

Query: 368 RPDAIII---DANN-----TGARTCDY--LEMLGYHVY 395
             D  +I   D        +G    D+  L   G+ V 
Sbjct: 227 GVDRRVIACPDPTGGARKTSGVGVTDHAILRRSGFTVQ 264


>gi|297566322|ref|YP_003685294.1| hypothetical protein Mesil_1911 [Meiothermus silvanus DSM 9946]
 gi|296850771|gb|ADH63786.1| protein of unknown function DUF264 [Meiothermus silvanus DSM 9946]
          Length = 427

 Score = 67.8 bits (164), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 68/386 (17%), Positives = 134/386 (34%), Gaps = 44/386 (11%)

Query: 88  IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147
           +GK+   +   +      P    + L+  E Q         SK L+    +H   +Q ++
Sbjct: 32  VGKSFAASLEAVLDCVAHPRSLWVFLSRGERQ---------SKELAEKAQRHLEAIQVVA 82

Query: 148 -LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD- 205
            ++  P+ ++     + + +              PDT  G+       ++ DE +   D 
Sbjct: 83  EMYDEPFDAESTQTVIRLPNGSRIISLPA----NPDTARGYSGN----VLLDEFALHKDS 134

Query: 206 -VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP--LDDWKRFQIDTRTVEGID 262
             I   +   +T R+      + S P+   GKFYEI+      D W R ++D        
Sbjct: 135 REIWGALYPTIT-RSKRYRLRVLSTPKGQQGKFYEIWQPEPGGDLWSRHRVDIYDAVQQG 193

Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP--YAPLIM 320
                  + +   D  + + E   +F  +   +++P  +I    + +   D      L +
Sbjct: 194 LEVDPEELRKGLKDPVLWQQEYLLEFVDEAS-AWLPYELITSCESSQARTDGALEGDLYL 252

Query: 321 GCDIAEEGGDNTVV--VLRRGPVI--EHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
           G DI     D +V+    R G V+    +    +T   T    +  L+ + R     IDA
Sbjct: 253 GMDIGRH-RDLSVIWVAERVGDVLWTRRVIWLERTPFATQREVLYSLLPQVRRAC--IDA 309

Query: 377 NNTGARTCDYLE-MLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL-INHSG 434
           +  G +  +  +   G  V  V+  +   +          L V +    E   + I    
Sbjct: 310 SGLGMQLAEEAQSRFGSRVEPVMFTRAVKED---------LAVTLRRKFEDRLIRIPPDD 360

Query: 435 LIQNLKSLKSFIVPNTGELAIESKRV 460
            I+        I  + G +  ++ R 
Sbjct: 361 RIRESLHAVRRITTSAGHIRFDADRD 386


>gi|300775654|ref|ZP_07085515.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910]
 gi|300505681|gb|EFK36818.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910]
          Length = 475

 Score = 67.4 bits (163), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 46/284 (16%), Positives = 101/284 (35%), Gaps = 35/284 (12%)

Query: 217 ERNANRFWIMTSNPRRLSGKFYEIFNKPLD------DWKRFQIDTRTVEGIDPSFHEGII 270
           E        +T NP++     Y  F KP+         K  Q   +    I   + E + 
Sbjct: 179 EYGLKPKIFVTCNPKKN--WMYSYFYKPMKEGLLKLKQKFIQAFVQENPFITTDYIEQLE 236

Query: 271 ARYGLDSDVTRVEVC-GQFPQQDIDSFIPLNIIEEALN--REPCPDPYAPLIMGCDIAEE 327
           +         R  +  G +  +  D+   L I +  L+  +    +      +  D+A  
Sbjct: 237 ST---TDKAKRERLLKGNW--EYDDNPYKLTIYDRILDLWKNDHIEKKGRKYITADVARF 291

Query: 328 GGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI--SGLVEKYRPDAIIIDANNTGARTCD 385
           G D   V +     +  + ++  +        I    +  K      I DA+  G    D
Sbjct: 292 GSDLATVGVWEDWDLIEVHEFEISKTTEIQACIQAMRIKHKIPKHNCIADADGVGGGVVD 351

Query: 386 YLEMLGYHVYRVLGQ---KRAVDLEFCRNRRTELHVKMADWLEFASLINHSG-------- 434
            L+++G+       +    ++ +    +N +T+L V +A+ +   + +N S         
Sbjct: 352 NLDIIGFVNNAKPFEENTGQSKNAPKYKNMQTQLLVYLAEKIINQNKMNISADISEKQKE 411

Query: 435 -LIQNLKSLKSFIVPNTGELAIESK---RVKGAKSTDYSDGLMY 474
            + + L +++   +P+   + +  K   +    +S DY D ++ 
Sbjct: 412 YIKEELDTIE--RIPDVDIVTLVDKTQIKQNIGRSPDYRDMILM 453


>gi|329954246|ref|ZP_08295340.1| phage terminase, large subunit, PBSX family [Bacteroides clarus YIT
           12056]
 gi|328527952|gb|EGF54938.1| phage terminase, large subunit, PBSX family [Bacteroides clarus YIT
           12056]
          Length = 438

 Score = 67.1 bits (162), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 52/319 (16%), Positives = 109/319 (34%), Gaps = 45/319 (14%)

Query: 196 INDEASGTPDVINLGILG----FLTE-RNANRFWIMTSNPRRLSGKFYEIFNKP------ 244
             +EA     +    +       L +  N     ++T NP++     Y+ F KP      
Sbjct: 126 WIEEAGQVNRLAFEVLQTRIGRHLNDVYNVPGKILITCNPKKN--WLYDKFYKPWKEHKL 183

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDIDSFIPLNIIE 303
            D +   Q   +        +   +      +  VT+  +  G +   +     P  + +
Sbjct: 184 KDGYAFVQALVQDNPFATEDYINTLKNT---NDKVTKERLYFGNWEYDND----PAVLCD 236

Query: 304 EALNREPCPDPY-APLIMGC---DIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK 359
                +   + +  P+ +     D+A +G D  V     G V     D   +  ++    
Sbjct: 237 YDAICDLFVNEHVQPVGLSTGSSDLAMKGRDRFVSGHWIGNVCYIRLDQEYSTGKSIEAD 296

Query: 360 ISGLVEKY--RPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELH 417
           +  ++ ++      +I+D++  G     YLE     +    G  R ++ E   N ++E  
Sbjct: 297 LKNMMIQWSIPRSMMIVDSDGLG----SYLESYLNGIKEFHGGNRPINPE-FDNLKSECA 351

Query: 418 VKMADWLEFASL------INHSGLIQNLKSLKSFIVP----NTGELAIESKRVKGAKSTD 467
            K+A+ +    +           +I+ L  LK   +       G ++ E  +     S D
Sbjct: 352 FKLAELINNRQIRIICTEAQRERIIEELGVLKQDHIDADTRKKGIISKEKMKEILGHSPD 411

Query: 468 YSDGLMYT-FA--ENPPRS 483
           Y D L+   F   +  P+ 
Sbjct: 412 YLDMLIMAMFFRIKPIPKR 430


>gi|167623253|ref|YP_001673547.1| hypothetical protein Shal_1320 [Shewanella halifaxensis HAW-EB4]
 gi|167353275|gb|ABZ75888.1| protein of unknown function DUF264 [Shewanella halifaxensis
           HAW-EB4]
          Length = 617

 Score = 67.1 bits (162), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 40/249 (16%), Positives = 87/249 (34%), Gaps = 34/249 (13%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP-------- 317
            E +   Y  + D  +      F   D DS    + +E+ +        + P        
Sbjct: 380 IEELRDEY--NDDDFKNLFMCIFVD-DADSVFKFSDLEKCMVESARWQDHKPKEQRPFGN 436

Query: 318 --LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRP 369
             + +G D +    + T+VV+    ++G     L    W   +     ++I  +  +YR 
Sbjct: 437 REVWLGYDPSRTRDNATLVVIAPGEKKGEKFRVLEKHYWRGLNFSHHVSEIQKVYARYRV 496

Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL 429
             I +D    GA   D +  L          + A  + +  + +T L +KM D +E   +
Sbjct: 497 TYIGVDTTGIGAGVFDSISTL--------FPREATAIHYSVSSKTRLVLKMIDVVESGRI 548

Query: 430 I---NHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSDMD 486
               +H  +  +  S++       G +  ++ R        ++D + +  A       ++
Sbjct: 549 EWDASHKDIAMSCLSIRKTTTDTGGAITFKASRDNVT---GHAD-VFFAIAHAVINEPLN 604

Query: 487 FGRCPSYQY 495
           F    +  +
Sbjct: 605 FAHKRTSSW 613


>gi|291337121|gb|ADD96636.1| hypothetical protein Syncc9605_0456 [uncultured organism
           MedDCM-OCT-S12-C92]
          Length = 354

 Score = 66.7 bits (161), Expect = 9e-09,   Method: Composition-based stats.
 Identities = 50/270 (18%), Positives = 99/270 (36%), Gaps = 38/270 (14%)

Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE----RPDTFVGHHNTYGMAIINDEASG 202
            L P  W        L I+  + ST+    +E     R  +  G        ++ DEA+ 
Sbjct: 12  KLVPKVWIRTKNETDLRIELINGSTIELKGTENAMALRGRSLSG--------VVLDEAAF 63

Query: 203 T-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF----NKPLDDWKRFQIDTRT 257
              +V    I   L ++    + +  S P   +  FY+++    +   ++W+R+   T  
Sbjct: 64  MDAEVWFEVIRPALADKEG--WALFISTPDGTASWFYDLWCYVPDDETNEWQRWSYTTID 121

Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317
              +     E   A+  LD+   R E    F  +++   + ++  ++ ++ E       P
Sbjct: 122 GGNVSKHEVEAARAQ--LDTRTFRQEFEASF--ENLTGLVAISFSDDNISTEAKDISIQP 177

Query: 318 LIMGCDIAEEGGD--NTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIII- 374
           L++G D      D  + +  ++ G  +    +   T   TT +    +  +Y  D  II 
Sbjct: 178 LLLGVD---FNVDPMSGICAVKNGETLYVFDEVMLTGGATTWDFAEEVTRRYGVDRRIIA 234

Query: 375 --DANN-----TGARTCDY--LEMLGYHVY 395
             D        +G    D+  L   G+ V 
Sbjct: 235 CPDPTGGARKTSGVGVTDHAILRRSGFTVQ 264


>gi|326784094|ref|YP_004324487.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage
           Syn1]
 gi|310004826|gb|ADO99217.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage
           Syn1]
          Length = 550

 Score = 66.3 bits (160), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 49/323 (15%), Positives = 103/323 (31%), Gaps = 47/323 (14%)

Query: 88  IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147
            GK+T     +L  +     ++V  LAN  +  +  L    ++  +   N   +  Q + 
Sbjct: 84  TGKSTTVVSYLLHYLIFNDSVNVGILANKASTARDLL----ARLATAYENLPKWIQQGVV 139

Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 207
           +    W    +    G      ST          +            I  DE +  P+ I
Sbjct: 140 V----WNKGNIELENGSKILAASTSASAVRGMSFN-----------IIFLDEFAFVPNHI 184

Query: 208 ----NLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEG 260
                  +   +T    +   I+ S P+ ++  FY+++       +D+   ++    V G
Sbjct: 185 ADSFFASVYPTITS-GKSTKVIIISTPQGMN-HFYKMWQDAVNGRNDYTYHEVHWSQVPG 242

Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC--------- 311
            D  + E  I            E   +F    +D+ I  + ++     EP          
Sbjct: 243 RDAKWKEETIKNTSQRQ--FTQEFECEFL-GSVDTLISASKLKALAFDEPITRNKGLDIY 299

Query: 312 --PDPYAPLIMGCDIAE--EGGDNTVVVLRRGPVIEHLFDWSKTD---LRTTNNKISGLV 364
             P      ++  D++    G  +  +V     V   +    + +        N I+ + 
Sbjct: 300 EKPKDKNEYLLTVDVSRGIGGDYSAFIVYDITTVPYKIVGKYRNNEIKPMLFPNVINDVA 359

Query: 365 EKYRPDAIIIDANNTGARTCDYL 387
             Y    ++ + N+ G +    L
Sbjct: 360 RAYNNAWVLCEVNDVGDQVASIL 382


>gi|58532911|ref|YP_195134.1| terminase DNA packaging enzyme large subunit [Synechococcus phage
           S-PM2]
 gi|58331378|emb|CAF34164.1| terminase DNA packaging enzyme large subunit [Synechococcus phage
           S-PM2]
          Length = 548

 Score = 66.3 bits (160), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 47/323 (14%), Positives = 104/323 (32%), Gaps = 47/323 (14%)

Query: 88  IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147
            GK+T     +L  +     +++  LAN  +  +  L    ++  +   N   +  Q + 
Sbjct: 84  TGKSTTVVSYLLHYLIFNDNVNIGILANKASTARDLL----ARLATAYENLPKWIQQGVV 139

Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 207
           +    W    +    G      ST          +            I  DE +  P+ I
Sbjct: 140 V----WNKGNIELENGSKILAASTSASAVRGMSFN-----------IIFLDEFAFVPNHI 184

Query: 208 ----NLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF---NKPLDDWKRFQIDTRTVEG 260
                  +   +T    +   I+ S P+ ++  FY+++       + +   ++    V G
Sbjct: 185 ADSFFASVYPTITS-GKSTKVIIISTPQGMN-HFYKMWVDATNGRNGYTFHEVHWSQVPG 242

Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC--------- 311
            D  + E  I            E   +F    +D+ I  + ++  +  +P          
Sbjct: 243 RDEKWKEETIKNTSERQ--FTQEFECEFL-GSVDTLIAASKLKALVFNDPIKRNKGLDIY 299

Query: 312 --PDPYAPLIMGCDIAE--EGGDNTVVVLRRGPVIEHLFDWSKTD---LRTTNNKISGLV 364
             P   +  +M  D++    G  +  ++     V   +    + +        N I+ L 
Sbjct: 300 EEPKEKSEYLMTVDVSRGIGGDYSAFIIFDITTVPYKVVGKYRNNEIKPMLFPNIINDLA 359

Query: 365 EKYRPDAIIIDANNTGARTCDYL 387
             Y    ++ + N+ G +    L
Sbjct: 360 RSYNNAWVLCEVNDIGDQVASIL 382


>gi|319775358|ref|YP_004137846.1| Terminase, ATPase subunit [Haemophilus influenzae F3047]
 gi|317449949|emb|CBY86161.1| Terminase, ATPase subunit [Haemophilus influenzae F3047]
          Length = 603

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 55/375 (14%), Positives = 118/375 (31%), Gaps = 70/375 (18%)

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
              K   + +S  ++ A   +DV      I   + + +  T+      T   +H      
Sbjct: 210 ASKKQALQFRSYIVNYAKQTADVDLKGETIKLPNGAEL--TFLGTNSATAQSYHGN---- 263

Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247
           +  DE    P  DV+     G   ++   +     S P  ++   Y     + FN+    
Sbjct: 264 LYFDEVFWVPKFDVMRKVASGMAAQKMYRQT--YFSTPTTIAHPAYAFFSGKAFNRNRAK 321

Query: 248 WKRFQIDT------------------------RTVEGIDPSFHEGIIARYGLDSDVTRVE 283
            ++ +ID                             G +    + +IA    +       
Sbjct: 322 SEKIEIDISHENLKSGKLCADRQWKQIVSIYDAMEGGCNLFNIDDLIAENSKEE--FEQL 379

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV 333
              QF   +  +F   ++    ++       Y P          + +G D A  G    +
Sbjct: 380 FLCQFADDNSSAFKFSDLQLCQVDSLEEWHDYKPFYQRPFGNREVWLGYDPAFTGDRAAL 439

Query: 334 VVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387
           V++      R    + H   +   D  T  ++I    + Y    I+ID    G+     +
Sbjct: 440 VIVAPPKVERGDYRVLHKQTFHGMDYETQASRIKQFCDDYNVTRIVIDKTGMGSGVYQEV 499

Query: 388 EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL---INHSGLIQNLKSLKS 444
                 V  +         E+  + + E+ +K  + ++   L      + ++ +  ++K 
Sbjct: 500 RKFYPMVQGL---------EYNADLKNEMVLKTQNLIQKRRLKFDSGDNDIVSSFMTVKK 550

Query: 445 FIVPNTGELAIESKR 459
             +  TG++   S R
Sbjct: 551 -RITGTGKITYVSDR 564


>gi|320162476|ref|YP_004175701.1| hypothetical protein ANT_30750 [Anaerolinea thermophila UNI-1]
 gi|319996330|dbj|BAJ65101.1| hypothetical protein ANT_30750 [Anaerolinea thermophila UNI-1]
          Length = 506

 Score = 64.7 bits (156), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 57/329 (17%), Positives = 94/329 (28%), Gaps = 43/329 (13%)

Query: 120 LKTTLWAEVSKWLSLLPNKHWFEMQSL--SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS 177
           L +   AE+ K    L  +    M+ L   L+  P          G   +         S
Sbjct: 72  LFSGTSAEMVKASPTLRPQSLTAMRRLERVLNANPLTRGRWRRESGNTFRLGQARIHFLS 131

Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLT--ERNANRFWIMTSNPRRLSG 235
                + VG   T  + +  DEA           L  +T        FW        L G
Sbjct: 132 AAPGASIVG--ATASLLLEVDEAQAVSIEKFDTELAPMTASTGAVRVFWGTAWTASTLLG 189

Query: 236 K---FYEIFNKPLDDWKRFQIDTRTVEGIDPSFH---EGIIARYGLDSDVTRVEVCGQFP 289
           +     +         + F++    V    P +    E  I + G +    R +   +  
Sbjct: 190 RELRLAQAEQARDGVRRVFRLTAAEVIADHPRYARTVERAIQQLGRNHPAVRTQYFSEEV 249

Query: 290 QQDIDSFIPLNIIEEALNREPCPDP---------------YAPLIMGC--DIAEEGGDNT 332
                +  P   +       P  D                 AP+ +    D A    D++
Sbjct: 250 DAA-GTLFPEERLALLRGTHPWQDAPLPGRTYAFLLDVGGTAPVQLPLMDDYAGNRRDSS 308

Query: 333 VVVLRR-----------GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGA 381
            +V+                  HL  W+         ++  L  ++ P  I+IDA   GA
Sbjct: 309 ALVIVEVEPPQDGRPAPRYRAVHLCQWTGVSQTRLFEQVLALARQWSPRRIVIDATGVGA 368

Query: 382 RTCDYLEML--GYHVYRVLGQKRAVDLEF 408
              D+L+    G  V  V       DL +
Sbjct: 369 GLADFLDRALPGRVVRFVFSSASKSDLGY 397


>gi|163758712|ref|ZP_02165799.1| prophage MuMc02, terminase, ATPase subunit, putative [Hoeflea
           phototrophica DFL-43]
 gi|162284002|gb|EDQ34286.1| prophage MuMc02, terminase, ATPase subunit, putative [Hoeflea
           phototrophica DFL-43]
          Length = 460

 Score = 64.4 bits (155), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 49/304 (16%), Positives = 86/304 (28%), Gaps = 32/304 (10%)

Query: 129 SKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHH 188
           +K   L    H ++ Q           D+ H S             T     PDT  G  
Sbjct: 79  AKAYDLAIEAHEYDWQGQEGSYRAMEVDLPHGSK-----------ITALPANPDTARGFS 127

Query: 189 NTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD 246
                 +  DE +   D   I   +   ++   A     +TS P     KFYE+     D
Sbjct: 128 AN----VFLDEFAFHKDSGAIWKALFPVIS---AGWKLRITSTPNGKGNKFYELMTAEGD 180

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDID----SFIPLNII 302
            W + ++D               +     D D    E   ++  +         I     
Sbjct: 181 RWSKHEVDIYRAVADGLPRDIEELREGLADEDAWAQEYELKWLDEASAWLSYELISSVED 240

Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHLFDWSKTDLRTTNN 358
           E A   +P      P  +G DI     D  V+     +          +  +    + + 
Sbjct: 241 ERA--GDPYLYQGGPCYVGRDIGRRN-DLHVIWVWELVGDVLWERERIEQKRATFASMDA 297

Query: 359 KISGLVEKYRPDAIIIDANNTGARTCDYLEML-GYHVYRVLGQKRAVDLEFCRNRRTELH 417
               ++E+YR     ID    G +  +  +   G  +  VL       +     +     
Sbjct: 298 AFDDVMERYRVVRACIDQTGMGEKVVEDAQTRHGSRIEGVLFTGPNKLVMATAGKEAFED 357

Query: 418 VKMA 421
            ++ 
Sbjct: 358 RRVR 361


>gi|78212008|ref|YP_380787.1| hypothetical protein Syncc9605_0456 [Synechococcus sp. CC9605]
 gi|78196467|gb|ABB34232.1| conserved hypothetical protein [Synechococcus sp. CC9605]
          Length = 414

 Score = 64.4 bits (155), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 50/254 (19%), Positives = 88/254 (34%), Gaps = 39/254 (15%)

Query: 82  ISAGRGIGKTTLN-AWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140
           +++GR  GKT +   WL+   + T  G  +  LA +  Q K   W ++            
Sbjct: 25  VNSGRRFGKTRMALTWLLEGALLT-SGSRMWFLAPTRVQAKQIAWRDLK----------- 72

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
                  + P  W S V   +L I+ ++ S + +    +  D+  G           DE 
Sbjct: 73  ------EMVPGSWASQVRESTLTIELRNGSHI-QLAGADYADSLRGQRADR---FAIDEY 122

Query: 201 SGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD--WKRFQIDTR 256
               D   +    L  +   + +   I +S P        E++ +      W R+   + 
Sbjct: 123 CYIRDLQEMWQAALLPMLGTSDDGSVIFSSTPAGGGTFSAELWERAETAEGWARWNFPSV 182

Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC---PD 313
               + P + E   AR  +D  + R E  G            L  +  A N++      D
Sbjct: 183 AGGWVKPEYVEQ--ARQTMDPSLWRQEFFGSIES-------LLGAVYPAFNQQNISDTVD 233

Query: 314 PYAPLIMGCDIAEE 327
              PL++GCD    
Sbjct: 234 NGGPLLVGCDFNRS 247


>gi|319762771|ref|YP_004126708.1| prophage mumc02, terminase, atpase subunit, putative
           [Alicycliphilus denitrificans BC]
 gi|317117332|gb|ADU99820.1| prophage MuMc02, terminase, ATPase subunit, putative
           [Alicycliphilus denitrificans BC]
          Length = 454

 Score = 64.0 bits (154), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 46/241 (19%), Positives = 77/241 (31%), Gaps = 21/241 (8%)

Query: 175 TYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRR 232
           T     PDT  G        ++ DE +   D   I   +   +++          S P  
Sbjct: 113 TALPANPDTARGFSAN----VLLDEFAFHQDSRAIWKALFPVISKPGLKLRV--ISTPNG 166

Query: 233 LSGKFYEIFNKPLDDWKRFQIDTR-TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQ 291
              KFY++     D W R   D    V    P   E +    G D D+   E   ++  +
Sbjct: 167 KGNKFYDLMTGADDGWSRHTTDIYQAVADGLPRNIEELRKGAG-DDDLWAQEFELKWLDE 225

Query: 292 DIDSFIPLNIIEEA---LNREPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEH 344
              +++P  +I         +P      P  +G DIA    D  V+     +     +  
Sbjct: 226 AS-AWLPFELITACEHEAAGKPEHYQGGPCFVGVDIASRN-DLFVIWVFELVGDVLWVRE 283

Query: 345 LFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG-ARTCDYLEMLGYH-VYRVLGQKR 402
           + +  +      +  + G+  +YR     +D    G     D     G   V  VL    
Sbjct: 284 IIERRRITFAEQDMLLDGVFRRYRVIRACMDQTGMGEKPVEDAQRRHGSSRVQGVLFTSS 343

Query: 403 A 403
           A
Sbjct: 344 A 344


>gi|146277344|ref|YP_001167503.1| hypothetical protein Rsph17025_1297 [Rhodobacter sphaeroides ATCC
           17025]
 gi|146278140|ref|YP_001168299.1| hypothetical protein Rsph17025_2103 [Rhodobacter sphaeroides ATCC
           17025]
 gi|145555585|gb|ABP70198.1| protein of unknown function DUF264 [Rhodobacter sphaeroides ATCC
           17025]
 gi|145556381|gb|ABP70994.1| protein of unknown function DUF264 [Rhodobacter sphaeroides ATCC
           17025]
          Length = 476

 Score = 64.0 bits (154), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 41/239 (17%), Positives = 69/239 (28%), Gaps = 21/239 (8%)

Query: 175 TYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRR 232
           T     PDT  G        +I DE +       I   +   +++    +   + S P  
Sbjct: 133 TALPANPDTARGFSAN----VILDEFAFHAKSREIWAALFPVISKG--RQKLRVISTPNG 186

Query: 233 LSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD 292
              KFYE+       W R  +D              ++     D D    E   ++  + 
Sbjct: 187 KGNKFYELMTAEGSVWSRHVVDIYEAVRQGLDRDVDMLRAGMADEDAWAQEYELKWLDEA 246

Query: 293 ----IDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEH 344
                   I     E  L  +P      P  +G DIA    D  V+     +        
Sbjct: 247 SAWLDYDLISS--CESELAGKPEGYQGGPCFVGVDIAARN-DLFVIWVMELVGDVLWTRE 303

Query: 345 LFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG-ARTCDYLEMLG-YHVYRVLGQK 401
           +    +      +  +  ++ +YR   + +D    G     D     G   V  VL   
Sbjct: 304 IIARRRISFAEQDALLDDVMRRYRVIRVQMDQTGMGEKPVEDAKRRHGQLRVEGVLFSA 362


>gi|171914351|ref|ZP_02929821.1| hypothetical protein VspiD_24270 [Verrucomicrobium spinosum DSM
           4136]
          Length = 450

 Score = 64.0 bits (154), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 61/349 (17%), Positives = 102/349 (29%), Gaps = 59/349 (16%)

Query: 88  IGKTTLNAWLVLWLMSTRPGISVICLANSETQ-LKTTLWAEVSKWLSLLPNKHWFEMQSL 146
            GK   +A  ++     R   + +  A SE Q L+T           L     W E   L
Sbjct: 31  TGKDFSSAAEIVRDCKLRDKTTWMIAAPSERQSLET-----------LAKCSEWSEAFDL 79

Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCR-TYSEERPDTFVGHHNTYGM---AIINDEASG 202
           +        D     L      ++   R      RPDT  G      M   A   D    
Sbjct: 80  ASEGIREERDGPEALLKQGEIKFANGSRVIAVPGRPDTVRGFSANVLMTEFAFFED---- 135

Query: 203 TPDVINLGILGFLTE--RNANRFWIMTSNPRRLSGKFYEIFNKP---LDDWKRFQIDTRT 257
            PD     IL  +T   R   +   + + P     K ++++ K       W + ++    
Sbjct: 136 -PDATWRAILPSITNPLRGGEKKVRLITTPNGQGNKAHDLWTKENSTKHKWSKHKVTIHD 194

Query: 258 VE----GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD----IDSFIPLNIIEEALNRE 309
                  +DP     ++     D +    E   +F            I      EA   +
Sbjct: 195 AVAAGLPVDPEELRAMLD----DPEGWAQEYECEFLDAAGVLLSYELIGSCEAPEATTTQ 250

Query: 310 P----CPDPYAPLIMGCDIAEEGGDNTVV--VLRRGP--VIEHLFDWSKTDLRTTNNKIS 361
           P       P  PL  G D A +  D +V+    + GP  V + +         +T  ++ 
Sbjct: 251 PDAFWAARPQFPLYAGWDFARK-KDLSVLWTAQKIGPLLVTKEVLVMRG---MSTPKQVE 306

Query: 362 GLVEKYRPDA-IIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409
            +  + +    + +D    G    D L             +   D    
Sbjct: 307 LVSHRLKNITRLCLDYTGAGVGAGDLLVE--------KFGEWNFDKHQF 347


>gi|227500282|ref|ZP_03930349.1| terminase [Anaerococcus tetradius ATCC 35098]
 gi|227217568|gb|EEI82880.1| terminase [Anaerococcus tetradius ATCC 35098]
          Length = 466

 Score = 64.0 bits (154), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 55/364 (15%), Positives = 122/364 (33%), Gaps = 51/364 (14%)

Query: 50  APRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGIS 109
           +P  WQ + ++ + A   + +   +   +          GKT +     LW +    G +
Sbjct: 35  SPYPWQEKLIKDIFAVNDDGLWTHSKFGYAVPRRN----GKTEIVYMAELWFLM--DGKN 88

Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169
           +I  A+  +   +  + ++ K+L  +      + +S+         +++          +
Sbjct: 89  IIHTAHRISTSHS-SFKKLKKYLEKMGLVDKVDFKSIKAK----GQEMIELIKTGGVIQF 143

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229
            T  RT +    + F          ++ DEA    +     +   +T+ + N   +M   
Sbjct: 144 RT--RTETGGLGEGFD--------LLVIDEAQEYTEGQESALKYTVTDSD-NPMILMCGT 192

Query: 230 P------RRLSGKFYE---IFNKPLDDWKRFQIDTRTVE-------GIDPS-----FHEG 268
           P        +  K+ +      K  + W  + +   T           +PS         
Sbjct: 193 PPTLVSGGTVFSKYRDLILSGGKNHNGWAEWSVSEMTNPYDIDAWYKTNPSMGYKLRERA 252

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEE 327
           +    G D     ++  G + + +  S I  L+     L     P     L +G     +
Sbjct: 253 VEEEIGPDETDFNIQRLGYWVKYNQKSVISKLDWDR--LKLTRLPSLVGKLHVGIKYGND 310

Query: 328 GGDNTVVVLRRGPVIEHLFD-WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY 386
           G +  + +  +        +      +R  N+ I   ++K +P +++ID    GA   D 
Sbjct: 311 GRNVALSIAVKTLSNRIFIESIDCQSIRNGNDWIVDFLKKTKPISVVID----GASRQDI 366

Query: 387 LEML 390
           LE  
Sbjct: 367 LEEQ 370


>gi|326782381|ref|YP_004322781.1| terminase DNA packaging enzyme large subunit [Synechococcus phage
           S-ShM2]
 gi|310003329|gb|ADO97726.1| terminase DNA packaging enzyme large subunit [Synechococcus phage
           S-ShM2]
          Length = 362

 Score = 63.6 bits (153), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 46/287 (16%), Positives = 92/287 (32%), Gaps = 44/287 (15%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GK+T+    +LW +     ++V  LAN     +  L     +      N   +  Q +  
Sbjct: 85  GKSTIVTSYLLWYVIFNDNVNVAILANKAATSREML----QRLQRSYENLPKWLQQGIVQ 140

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP---- 204
               W    L    G           + S  R  +F          I  DE +  P    
Sbjct: 141 ----WNRGSLELENGSKI---MAASTSSSAVRGMSFN--------VIFLDEFAFVPNHIA 185

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN---KPLDDWKRFQIDTRTVEGI 261
           D     +   ++    +   I+ S P  ++  FY++++   +  +++   ++    V G 
Sbjct: 186 DEFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDSERKKNEYISTEVHWSEVPGR 243

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC---------- 311
           D  +    IA         +VE   +F    +D+ I  + +   +  +P           
Sbjct: 244 DAKWKAQTIANTSEQQ--FKVEFECEFL-GSVDTLISPSKLRTMVYNDPLVQNKGLSIYE 300

Query: 312 -PDPYAPLIMGCDIAEE--GGDNTVVVLRRGPVIEHLFDWSKTDLRT 355
                   ++  D+A    G  +  VV+    +   L    K +   
Sbjct: 301 HVQKDHNYVITVDVARGVSGDFSAFVVIDTTTIPYKLVAKYKNNTIK 347


>gi|171915351|ref|ZP_02930821.1| hypothetical protein VspiD_29290 [Verrucomicrobium spinosum DSM
           4136]
          Length = 451

 Score = 63.6 bits (153), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 61/348 (17%), Positives = 104/348 (29%), Gaps = 57/348 (16%)

Query: 88  IGKTTLNAWLVLWLMSTRPGISVICLANSETQ-LKTTLWAEVSKWLSLLPNKHWFEMQSL 146
            GK   +A  ++     R   + +  A SE Q L+T           L     W E   L
Sbjct: 32  TGKDFSSAAEIVRDCKLRDKTTWMIAAPSERQSLET-----------LAKCGEWSEAFDL 80

Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCR-TYSEERPDTFVGHHNTYGM---AIINDEASG 202
           +        D     L      ++   R      RPDT  G      M   A   D    
Sbjct: 81  ASEGIREERDGPEALLKQGEIKFANGSRVIAVPGRPDTVRGFSANVLMTEFAFFED---- 136

Query: 203 TPDVINLGILGFLTE--RNANRFWIMTSNPRRLSGKFYEIFNKP---LDDWKRFQIDTRT 257
            PD     IL  +T   R   +   + + P     K ++++ K       W + ++    
Sbjct: 137 -PDATWRAILPSITNPLRGGEKKVRLITTPNGQGNKAHDLWTKENSTKHKWSKHKVTIHD 195

Query: 258 VE----GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII---EEALNREP 310
                  +DP     ++     D +    E   +F        +P  +I   E A     
Sbjct: 196 AVAAGLPVDPEELRAMLD----DPEGWAQEYECEFLD-SAGVLLPYELIATCEAAEATTT 250

Query: 311 CPDPYA------PLIMGCDIAEEGGDNTVV--VLRRGPVIEHLFDWSKTDLRTTNNKISG 362
             D +       PL  G D A +  D +V+    + GP I+   +       +T  ++  
Sbjct: 251 QADAFWNARQQFPLYAGWDFARK-KDLSVLWTAQKVGP-IKVTKEVLIMRGMSTPAQVEL 308

Query: 363 LVEKYRPDA-IIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409
           +  + +    + +D    G    D L             +   D    
Sbjct: 309 VSHRLKHITRLCLDYTGAGVGAGDLLVE--------KFGEWNFDKHQF 348


>gi|68250195|ref|YP_249307.1| terminase, ATPase subunit [Haemophilus influenzae 86-028NP]
 gi|68058394|gb|AAX88647.1| terminase, ATPase subunit [Haemophilus influenzae 86-028NP]
          Length = 593

 Score = 63.2 bits (152), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 54/375 (14%), Positives = 116/375 (30%), Gaps = 70/375 (18%)

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
              K   + +S  ++ A   +DV      I   + + +   +      T   +H      
Sbjct: 200 ASKKQALQFRSYIVNYAKQTADVDLKGETIKLPNGAEL--IFLGTNSATAQSYHGN---- 253

Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247
           +  DE    P  DV+     G   ++   +     S P  ++   Y     + FN+    
Sbjct: 254 LYFDEVFWVPKFDVMRKVASGMAAQKMYRQT--YFSTPTTIAHPAYAFFSGKAFNRNRAK 311

Query: 248 WKRFQIDT------------------------RTVEGIDPSFHEGIIARYGLDSDVTRVE 283
            ++ +ID                             G +    + +IA    +       
Sbjct: 312 SEKIEIDISHENLKSGKLCADRQWKQIVSIYDAMESGCNLFNIDDLIAENSKEE--FEQL 369

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV 333
              QF   +  +F   ++    ++       Y P          + +G D A  G    +
Sbjct: 370 FLCQFADDNSSAFKFSDLQLCQVDSLEEWHDYKPFYQRPFGNREVWLGYDPAFTGDRAAL 429

Query: 334 VVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387
           V++           + H   +   D  T  ++I    + Y    I+ID    G+     +
Sbjct: 430 VIVAPPKVEGGDYRVLHKQTFHGMDYETQASRIKQFCDDYNVTRIVIDKTGMGSGVYQEV 489

Query: 388 EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL---INHSGLIQNLKSLKS 444
                          A  LE+  + + E+ +K  + ++   L      + ++ +  ++K 
Sbjct: 490 R---------KFYPMAQGLEYNADLKNEMVLKTQNLIQKRRLKFDSGDNDIVSSFMTVKK 540

Query: 445 FIVPNTGELAIESKR 459
             +  TG++   S R
Sbjct: 541 -RITGTGKITYVSDR 554


>gi|326783799|ref|YP_004324193.1| terminase DNA packaging enzyme large subunit [Synechococcus phage
           S-SSM7]
 gi|310003811|gb|ADO98206.1| terminase DNA packaging enzyme large subunit [Synechococcus phage
           S-SSM7]
          Length = 552

 Score = 63.2 bits (152), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 61/407 (14%), Positives = 124/407 (30%), Gaps = 56/407 (13%)

Query: 59  MEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET 118
           M       +N+ +N    + K    +    GK+T     +L        +++  LAN   
Sbjct: 60  MYDFQEKLVNNFHNNRFNICKMPRQS----GKSTTVVSYLLHYAIFNDSVTIGILANKAQ 115

Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178
             +  L          L   +    + +      W    +           ST       
Sbjct: 116 TARDLL--------GRLQIAYENLPKWMQQGIIAWNKGSMELENKSKIIAASTSASAVRG 167

Query: 179 ERPDTFVGHHNTYGMAIINDE----ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLS 234
              +            I  DE    A+   D     +   ++    +   I+ S PR ++
Sbjct: 168 MSFN-----------IIFLDEFAFVANHLADDFFSSVYPTISS-GKSTKVIIVSTPRGMN 215

Query: 235 GKFYEIFNK---PLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQ 291
             FY +++      +++    +    V G D ++ E  I          RVE   +F   
Sbjct: 216 -HFYRLWHDAELGRNEYVTTDVHWSEVPGRDEAWKEQTIKN--TSEAQFRVEFECEFL-G 271

Query: 292 DIDSFIPLNIIEEALNREPC------------PDPYAPLIMGCDIAEEG--GDNTVVVLR 337
            +D+ I  + ++  +  EP             P       +  D+A       +  +V  
Sbjct: 272 SVDTLIAPSKLKTMVYDEPINTGKRGGEIYQNPIEKHNYSITVDVARGVEKDYSAFIVFD 331

Query: 338 RGPVIEHLFDWSKTDL---RTTNNKISGLVEKYRPDAIIIDANNTGARTCD----YLEML 390
                  +    + +        + I+     Y    I+ + N+ G +        LE  
Sbjct: 332 TTTFPYKVVAKYRNNTIKPMLFPSVIAEFARAYNNAFILCEVNDIGDQIASILFYDLEYE 391

Query: 391 GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQ 437
              +  V G+   V  +     + +L VKM+  ++    +N   LI+
Sbjct: 392 NVLMTAVRGRAGQVLGQGFSGSKVQLGVKMSKTVKKIGALNLKTLIE 438


>gi|323146129|gb|ADX32368.1| putative terminase ATPase subunit [Cronobacter phage ESSI-2]
          Length = 639

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 37/211 (17%), Positives = 69/211 (32%), Gaps = 28/211 (13%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  E +  RY  +     +     F     DS      +
Sbjct: 377 PDGQWRYVITMEDAIAGGFNLASIEKLRNRY--NPTTFNMLYMCVFVDSK-DSVFSYGDL 433

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346
           E           + P          +  G D A  G  +  V++           +  +F
Sbjct: 434 EACAVETETWQDHKPDAPRPFGDREVWGGFDPARSGDFSCFVIVAPPLFAGEKFRVLRVF 493

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406
           +W   + R    +I  L +KY    + +D    G    D ++     V        AV +
Sbjct: 494 NWKGMNFRWQAKQIEQLFKKYNFAYLGVDVTGIGQGVFDNIQHFALRV--------AVPI 545

Query: 407 EFCRNRRTELHVKMADWLEFASLINHSGLIQ 437
            + RN + +L +K AD +E   +     L +
Sbjct: 546 RYDRNTKNQLVLKAADVVESQRIEWDKELKE 576


>gi|318603823|emb|CBY25321.1| phage terminase, ATPase subunit [Yersinia enterocolitica subsp.
           palearctica Y11]
          Length = 257

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 31/170 (18%), Positives = 47/170 (27%), Gaps = 27/170 (15%)

Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIMGCDIAEEG 328
            R     +F      S  P   ++  +                   Y P+ MG D +  G
Sbjct: 31  FRNLFLCEFVDDKA-SVFPFEELQACMVDSLVEWEDFAPFAEQPFNYHPVWMGYDPSHTG 89

Query: 329 GDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGAR 382
                VV+      G     L    W   D       I  L EKY  + I IDA   G  
Sbjct: 90  DSAGCVVMAPPWVPGGKFRILERHQWKGMDFADQAESIKKLTEKYNVEYIGIDATGIGQG 149

Query: 383 TCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINH 432
               +               A ++ +    +T + +K  D +    L   
Sbjct: 150 VYQLVR---------NFFPAAREIRYSAEVKTNMVLKAKDLITTGRLEYD 190


>gi|120599697|ref|YP_964271.1| hypothetical protein Sputw3181_2900 [Shewanella sp. W3-18-1]
 gi|120559790|gb|ABM25717.1| protein of unknown function DUF264 [Shewanella sp. W3-18-1]
          Length = 602

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 38/208 (18%), Positives = 71/208 (34%), Gaps = 32/208 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP-------- 317
            + +   Y  + D         F   D DS    + +E+ +        Y P        
Sbjct: 365 IDELRDEY--NGDDFANLFMCIFVD-DADSVFKFSDLEKCMVEAARWQDYKPAAPRPFGN 421

Query: 318 --LIMGCDIAEEGGDNTVVVL-----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             + +G D +    DN V+ +     ++G     L    W   +      +I  +  KYR
Sbjct: 422 REVWLGYDPSRT-RDNAVLAVVAPGEKKGEKFRVLERHRWRGMNFAHHVAEIQKIYAKYR 480

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
              I +D    GA   D +  L          + A  + +    +T L +KM D +E   
Sbjct: 481 VTYIGVDTTGIGAGVFDSISTL--------YPREATAIHYSVGSKTRLVLKMIDVVEGGR 532

Query: 429 LINHSGLIQ---NLKSLKSFIVPNTGEL 453
           +   +GL     +  S++  +  + G +
Sbjct: 533 IEWDAGLKDIAMSFLSIRRTVTDSGGAI 560


>gi|225872083|ref|YP_002753538.1| putative bacteriophage portal protein [Acidobacterium capsulatum
           ATCC 51196]
 gi|225792593|gb|ACO32683.1| putative bacteriophage portal protein [Acidobacterium capsulatum
           ATCC 51196]
          Length = 507

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 66/363 (18%), Positives = 111/363 (30%), Gaps = 65/363 (17%)

Query: 78  FKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137
           FK A+ + R IG +   A          P  +   L+ S+ Q  +  + E     +    
Sbjct: 46  FKIAVKSAR-IGFSFATALEAALDCLAHPNTTWTVLSASKAQ--SVEFIE-----TCHRL 97

Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS-EERPDTFVGHHNTYGMAII 196
                  +   H   WY ++ H         ++   R  +    P T  G+        I
Sbjct: 98  IEVMTGTAELYHDEDWYDELGHIEAIQQRITFANGARIIALPANPRTARGYPGNA----I 153

Query: 197 NDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN------------ 242
            DE +   +   I   I   +   +  R     S P    GKFY++              
Sbjct: 154 LDEFAHHEESYAIWAAITRQVALGHKVRVL---STPNGEQGKFYDLCKELGLTDGVAPEN 210

Query: 243 --KPLDDWKRFQIDTR----TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF 296
             K +  W    ID          I+      +I     D+D+   E    F +    ++
Sbjct: 211 NFKIVKGWSIHWIDAPMAIADGCPINMDEMRQLIQ----DADIVNQEFYCVFLKSG-GAW 265

Query: 297 IPLNIIEEALNREPCPD------PYAPLIMGCDIAEEGGDNT---------VVVLRRGPV 341
           IPL++I+ A +     +      P   L  G D+       T         V+V R    
Sbjct: 266 IPLDLIQRAESETATVEWPGGYAPRGRLFGGIDVGRFSNRTTFWVKEDLGDVLVTRMAMA 325

Query: 342 IEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML-GYHVYRVLGQ 400
           I  +    + +L     K++ +          ID+   G    D L  L    V  V   
Sbjct: 326 IHEMPFPDQANLIAPWMKMTQV--------TAIDSTGMGIGLFDDLNKLCPGRVMGVNFA 377

Query: 401 KRA 403
             +
Sbjct: 378 GSS 380


>gi|198242430|ref|YP_002214959.1| hypothetical protein SeD_A1100 [Salmonella enterica subsp. enterica
           serovar Dublin str. CT_02021853]
 gi|193876434|gb|ACF24836.1| ORF11 [Salmonella enterica subsp. enterica serovar Dublin]
 gi|197936946|gb|ACH74279.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Dublin str. CT_02021853]
 gi|326622711|gb|EGE29056.1| hypothetical protein SD3246_1075 [Salmonella enterica subsp.
           enterica serovar Dublin str. 3246]
          Length = 423

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 64/372 (17%), Positives = 115/372 (30%), Gaps = 68/372 (18%)

Query: 58  FMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTT-LNAWLVLWLMSTRPGISVICLANS 116
            +E +  H        +P   K  I AGR  GKTT L      W               +
Sbjct: 6   VIEFLPFHAGQKKIYRSPAKRKV-IRAGRRFGKTTMLEQAGGNW---------------A 49

Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY 176
             Q++   +A   K L  LP+          +  +   +D +   +G     +      +
Sbjct: 50  ARQMRVGWFAPSYKIL--LPSFKTIRDLLKPITISSSKTDSIIELIGGGLVEF------W 101

Query: 177 SEERPDTFVGHHNTYGMAIINDEAS----GTPDVINLGILGFLTERNANRFWIMTSNPRR 232
           + + PD      +     +I DE S    G  D+    I   L + + +   +M   P+ 
Sbjct: 102 TLDNPDAGR---SRKYHKVIIDEGSLVKKGMRDIWEQAIEPTLLDFDGDA--VMAGTPKG 156

Query: 233 --LSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290
                 FY+  N     W+     T     I+P+    II   G    V + E   +F  
Sbjct: 157 VDDENFFYQACNDKSMGWEEHHAPTAANPTINPAALARIID--GRPPLVVQQEYNAEFVD 214

Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEG---GDNTV-------------- 333
               +F  L+ + E       P     +    D A++G    D +               
Sbjct: 215 WRGQNFFKLDWLLENGAPVDYPFSCDTVYGVVDCAQKGKLQNDGSACIWFALDNLPSPHL 274

Query: 334 ------VVLRRGPVIEHLF-DWSKTDLRTTNNKISGLVE-KYRPDAIIIDANNTGARTCD 385
                 ++   G  ++ +   W           +S +   +     + I+   TG     
Sbjct: 275 IILDWDIIQIDGYFLKDVVPQWEGK-----AKHLSEICRARMGTTGLFIEDKATGITLLQ 329

Query: 386 YLEMLGYHVYRV 397
                G++V+ V
Sbjct: 330 QDANEGWNVHPV 341


>gi|223939800|ref|ZP_03631671.1| protein of unknown function DUF264 [bacterium Ellin514]
 gi|223891576|gb|EEF58066.1| protein of unknown function DUF264 [bacterium Ellin514]
          Length = 449

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 49/345 (14%), Positives = 101/345 (29%), Gaps = 48/345 (13%)

Query: 137 NKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS-EERPDTFVGHHNTYGMAI 195
            K W ++  +  H                   ++T  R YS    P+   G        +
Sbjct: 71  CKAWAQLLDVVAHDLGEIIFDREKKFSAYVLEFATKLRIYSLSSNPNALAGKRGH----V 126

Query: 196 INDEASGTPDV--INLGILGFLTERNA-NRFWIMTSNPRRLSGKFYEIFNKPLDD-WKRF 251
           I DE +   D   +        T                  +G  ++I ++     W   
Sbjct: 127 ILDEFALHGDQRMLYRIAKPVTTWGGQLEIISTHRGVGTVFNGIIHDIHHRGNPMGWSHH 186

Query: 252 QIDTRTVEGIDPSFHEGIIARYGL--DSDVTRVEVCGQ-------------FPQQDIDSF 296
           ++  +    I+    E I  + G     +     V  +              P  +   F
Sbjct: 187 KVTLQEA--IEQGVVERINGKTGEAESREGYLARVRAECLDEEQWLQEYCCVPADESSVF 244

Query: 297 IPLNIIEEALNREPCPDPY-----APLIMGCDIAEEGGDNTVVVLRRGPVIEHL------ 345
           I  ++I+   +       Y      PL +G D+  +  D +V+    G  +  +      
Sbjct: 245 IGYDLIDACEDDCLKDFEYLRKCENPLYLGFDVGRK-RDLSVI--DVGEKVGDVMWDRMR 301

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE-MLGYHVYRVLGQKRAV 404
            + +         ++  L+E  +     IDA   G +  +  +   G+ V  V       
Sbjct: 302 IELAGKTFSEQEAELYRLLELPKLKRACIDATGLGMQLAERAKYRFGWKVEAVTFTGHVK 361

Query: 405 DLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPN 449
           + E   N R  +  +         +     L  +L+ +K  +  +
Sbjct: 362 E-ELAYNLR--MAFEDRR----VRITRDPLLRADLRGIKKEVTTS 399


>gi|223940405|ref|ZP_03632258.1| protein of unknown function DUF264 [bacterium Ellin514]
 gi|223890900|gb|EEF57408.1| protein of unknown function DUF264 [bacterium Ellin514]
          Length = 447

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 49/345 (14%), Positives = 100/345 (28%), Gaps = 48/345 (13%)

Query: 137 NKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS-EERPDTFVGHHNTYGMAI 195
            K W ++  +  H                   ++T  R YS    P+   G        +
Sbjct: 71  CKAWAQLLDVVAHDLGEIIFDREKKFSAYVLEFATKLRIYSLSSNPNALAGKRGH----V 126

Query: 196 INDEASGTPDV--INLGILGFLTERNA-NRFWIMTSNPRRLSGKFYEIFNKPLDD-WKRF 251
           I DE +   D   +        T                  +G  ++I  +     W   
Sbjct: 127 ILDEFALHGDQRMLYRIAKPVTTWGGQLEIISTHRGVGTVFNGIIHDIHQRGNPMGWSHH 186

Query: 252 QIDTRTVEGIDPSFHEGIIARYGL--DSDVTRVEVCGQ-------------FPQQDIDSF 296
           ++  +    I+    E I  + G     +     V  +              P  +   F
Sbjct: 187 KVTLQEA--IEQGVVERINEKTGEAESREGYLARVRAECLDEEQWLQEYCCVPADESSVF 244

Query: 297 IPLNIIEEALNREPCPDPY-----APLIMGCDIAEEGGDNTVVVLRRGPVIEHL------ 345
           I  ++I+   +       Y      PL +G D+  +  D +V+    G  +  +      
Sbjct: 245 IGYDLIDACEDDCLKDFEYLRKCENPLYLGFDVGRK-RDLSVI--DVGEKVGDVMWDRMR 301

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE-MLGYHVYRVLGQKRAV 404
            + +         ++  L+E  +     IDA   G +  +  +   G+ V  V       
Sbjct: 302 IELAGKTFSEQEAELYRLLELPKLKRACIDATGLGMQLAERAKYRFGWKVEAVTFTGHVK 361

Query: 405 DLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPN 449
           + E   N R  +  +         +     L  +L+ +K  +  +
Sbjct: 362 E-ELAYNLR--MAFEDRR----VRITRDPLLRADLRGIKKEVTTS 399


>gi|146310462|ref|YP_001175536.1| hypothetical protein Ent638_0800 [Enterobacter sp. 638]
 gi|145317338|gb|ABP59485.1| conserved hypothetical protein [Enterobacter sp. 638]
          Length = 445

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 43/290 (14%), Positives = 83/290 (28%), Gaps = 28/290 (9%)

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
           +L      F           W        + ++      + +  + +             
Sbjct: 143 ALFWKARKFVETLPVEFRGSWDEKKHAPYMRVEFPDTGAVIKGEAGDNIGR-----GDRT 197

Query: 193 MAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
                DEA+     +   I   L++    R  I  S+   +S  F +   +       F 
Sbjct: 198 TLYFVDEAAFLQRPLL--IDAALSQ--TTRCRIDLSSVNGMSNPFAQ--KRHSGKIPVFT 251

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
              R+    D  ++     +  +D+ V    E+   +        IP   ++ A++    
Sbjct: 252 FHWRSDPRKDNEWYRKECEK--IDNPVIVAQELDLNYQASAEGILIPSEWVQAAVDAHIK 309

Query: 312 --PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS--GLVEKY 367
               P    +   DIA+EG D      R G +++ + +WS        + +   G  + Y
Sbjct: 310 LGIQPSGQRLGSMDIADEGKDKNGFSSRYGFLLQSVHEWSGEGSDIYASVVKSFGYCDDY 369

Query: 368 RPDAIIIDANNTGAR------TCDYLEML----GYHVYRVLGQKRAVDLE 407
             D    D +  GA         + L               G     D E
Sbjct: 370 GLDEFRFDEDGLGAGARGDARVINELRQAEGRGTIAATPFRGSGSVFDPE 419


>gi|145636853|ref|ZP_01792518.1| terminase, ATPase subunit [Haemophilus influenzae PittHH]
 gi|145269934|gb|EDK09872.1| terminase, ATPase subunit [Haemophilus influenzae PittHH]
          Length = 593

 Score = 61.7 bits (148), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 54/375 (14%), Positives = 116/375 (30%), Gaps = 70/375 (18%)

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
              K   + +S  ++ A   +DV      I   + + +   +      T   +H      
Sbjct: 200 ASKKQALQFRSYIVNYAKQTADVDLKGETIKLPNGAEL--IFLGTNSATAQSYHGN---- 253

Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247
           +  DE    P  DV+     G   ++   +     S P  ++   Y     + FN+    
Sbjct: 254 LYFDEVFWVPKFDVMRKVASGMAAQKMYRQT--YFSTPTTIAHPAYAFFSGKAFNRNRTK 311

Query: 248 WKRFQIDT------------------------RTVEGIDPSFHEGIIARYGLDSDVTRVE 283
            ++ +ID                             G +    + +IA    +       
Sbjct: 312 SEKIEIDISHENLKSGKLCADRQWKQIVSIYDAMEGGCNLFNIDDLIAENSKEE--FEQL 369

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV 333
              QF   +  +F   ++    ++       Y P          + +G D A  G    +
Sbjct: 370 FLCQFADDNSSAFKFSDLQLCQVDSLEEWHDYKPFYQRPFGNREVWLGYDPAFTGDRAAL 429

Query: 334 VVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387
           V++           + H   +   D  T  ++I    + Y    I+ID    G+     +
Sbjct: 430 VIVAPPKVEGGDYRVLHKQTFHGMDYETQASRIKQFCDDYNVTRIVIDKTGMGSGVYQEV 489

Query: 388 EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL---INHSGLIQNLKSLKS 444
                          A  LE+  + + E+ +K  + ++   L      + ++ +  ++K 
Sbjct: 490 R---------KFYPMAQGLEYNADLKNEMVLKTQNLIQKRRLKFDSGDNDIVSSFMTVKK 540

Query: 445 FIVPNTGELAIESKR 459
             +  TG++   S R
Sbjct: 541 -RITGTGKITYVSDR 554


>gi|53802921|ref|YP_115325.1| prophage MuMc02, terminase, ATPase subunit [Methylococcus
           capsulatus str. Bath]
 gi|53756682|gb|AAU90973.1| putative prophage MuMc02, terminase, ATPase subunit [Methylococcus
           capsulatus str. Bath]
          Length = 443

 Score = 61.7 bits (148), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 51/276 (18%), Positives = 80/276 (28%), Gaps = 25/276 (9%)

Query: 180 RPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
            PDT  G   +    ++ DE +   D   I   +   ++  +        S P     KF
Sbjct: 114 NPDTARGFTAS----VLLDEFAFHADSRKIWQALFPVVSRSDLKLRV--ISTPNGKGNKF 167

Query: 238 YEIFNKPLDDWKRFQIDTR-TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDID-- 294
           Y++       W R   D    V    P   E + A  G D D    E   Q+  +     
Sbjct: 168 YDLITGDHPVWSRHVTDIYQAVADGLPRDIEELKAGVG-DDDAWAQEYELQWLDEASAWL 226

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVV----VLRRGPVIEHLFDWSK 350
           SF  +N +E      P      P  +G DIA    D  V+     +        +    +
Sbjct: 227 SFELINSVEHDHAGIPEHYAGGPCFLGVDIAARN-DLFVIWVLEAVGDVYWTREILARRR 285

Query: 351 TDLRTTNNKISGLVEKYRPDAIIIDANNTG-ARTCDYLEMLGYH-VYRVLGQKRAVDLEF 408
                 +  ++    +YR     +D    G     D     G   V  VL      +   
Sbjct: 286 ISFAEQDALLADAFNRYRVIRCCMDQTGMGEKPVEDAQRRFGSSRVEGVLFTG--PNKLA 343

Query: 409 CRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
                 E        +       +  L  +L  LK 
Sbjct: 344 LATTGKEAFEDRRIRIPEG----NQELRNDLHKLKK 375


>gi|145630909|ref|ZP_01786686.1| terminase, ATPase subunit [Haemophilus influenzae R3021]
 gi|144983569|gb|EDJ91037.1| terminase, ATPase subunit [Haemophilus influenzae R3021]
          Length = 593

 Score = 61.7 bits (148), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 54/375 (14%), Positives = 116/375 (30%), Gaps = 70/375 (18%)

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
              K   + +S  ++ A   +DV      I   + + +   +      T   +H      
Sbjct: 200 ASKKQALQFRSYIVNYAKQTADVDLKGETIKLPNGAEL--IFLGTNSATAQSYHGN---- 253

Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247
           +  DE    P  DV+     G   ++   +     S P  ++   Y     + FN+    
Sbjct: 254 LYFDEVFWVPKFDVMRKVASGMAAQKMYRQT--YFSTPTTIAHPAYAFFSGKAFNRNRAK 311

Query: 248 WKRFQIDT------------------------RTVEGIDPSFHEGIIARYGLDSDVTRVE 283
            ++ +ID                             G +    + +IA    +       
Sbjct: 312 SEKIEIDISHENLKSGKLCADRQWKQIVSIYDAMEGGCNLFNIDDLIAENSKEE--FEQL 369

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV 333
              QF   +  +F   ++    ++       Y P          + +G D A  G    +
Sbjct: 370 FLCQFADDNSSAFKFSDLQLCQVDSLEEWHDYKPFYQRPFGNREVWLGYDPAFTGDRAAL 429

Query: 334 VVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387
           V++           + H   +   D  T  ++I    + Y    I+ID    G+     +
Sbjct: 430 VIVAPPKVEGGDYRVLHKQTFHGMDYETQASRIKQFCDDYNVTRIVIDKTGMGSGVYQEV 489

Query: 388 EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL---INHSGLIQNLKSLKS 444
                          A  LE+  + + E+ +K  + ++   L      + ++ +  ++K 
Sbjct: 490 R---------KFYPMAQGLEYNADLKNEMVLKTQNLIQKRRLKFDSGDNDIVSSFMTVKK 540

Query: 445 FIVPNTGELAIESKR 459
             +  TG++   S R
Sbjct: 541 -RITGTGKITYVSDR 554


>gi|300723941|ref|YP_003713254.1| Terminase, ATPase subunit [Xenorhabdus nematophila ATCC 19061]
 gi|297630471|emb|CBJ91136.1| Terminase, ATPase subunit (GpP) [Xenorhabdus nematophila ATCC
           19061]
          Length = 573

 Score = 61.3 bits (147), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 30/144 (20%), Positives = 54/144 (37%), Gaps = 23/144 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE-----------PCPDP 314
            + +  +Y    D  +  +  +F   DI+S   L +++  +                P  
Sbjct: 333 IDRLRRQY--SPDEYQNLLMCEF-MDDIESIFSLQLMQGCMVDSWEIWHDVQPLMLRPYG 389

Query: 315 YAPLIMGCDIAEEG--GDNT---VVVLRR--GPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           Y P+ +G D A+ G  GD+    V+   +  G     L    W   D R  ++ I  L E
Sbjct: 390 YHPVWIGYDPAKGGENGDSAGCVVIAPPQVPGGKFRILERHQWRGMDFRAQSDAIRQLTE 449

Query: 366 KYRPDAIIIDANNTGARTCDYLEM 389
           +Y  + I ID+   G      ++ 
Sbjct: 450 QYNVEYIGIDSTGIGHGVYQNVKE 473


>gi|120602517|ref|YP_966917.1| hypothetical protein Dvul_1472 [Desulfovibrio vulgaris DP4]
 gi|120562746|gb|ABM28490.1| protein of unknown function DUF264 [Desulfovibrio vulgaris DP4]
          Length = 599

 Score = 61.3 bits (147), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 36/197 (18%), Positives = 57/197 (28%), Gaps = 28/197 (14%)

Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-- 306
               +      G D      +   Y    +  R     +F       F  L  +E  +  
Sbjct: 346 NIITLADAEAGGCDLFDVAQLKLEY--TPEEFRQLFGCEFIDDTQGVF-RLAQLEACMVD 402

Query: 307 --------NREPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTD 352
                     +P P    P+  G D A  G D +  VL    R G  I  +    W    
Sbjct: 403 PADWQDVRQGDPHPVGNLPVWGGYDPARSGDDASFAVLLPDLRDGGGIRCIERHKWKGRS 462

Query: 353 LRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNR 412
                 +I  L EKYR   + ID    G    + ++              A  + +    
Sbjct: 463 YLWQAERIRELAEKYRFAHLGIDTTGPGIGVFEQVQQ---------FCPVATPINYGVQS 513

Query: 413 RTELHVKMADWLEFASL 429
           +  L +K  + +E   L
Sbjct: 514 KAMLVLKAREVIEEGRL 530


>gi|120603805|ref|YP_968205.1| hypothetical protein Dvul_2767 [Desulfovibrio vulgaris DP4]
 gi|120564034|gb|ABM29778.1| protein of unknown function DUF264 [Desulfovibrio vulgaris DP4]
          Length = 599

 Score = 61.3 bits (147), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 36/197 (18%), Positives = 57/197 (28%), Gaps = 28/197 (14%)

Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-- 306
               +      G D      +   Y    +  R     +F       F  L  +E  +  
Sbjct: 346 NIITLADAEAGGCDLFDVAQLKLEY--TPEEFRQLFGCEFIDDTQGVF-RLAQLEACMVD 402

Query: 307 --------NREPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTD 352
                     +P P    P+  G D A  G D +  VL    R G  I  +    W    
Sbjct: 403 PADWQDVRQGDPHPVGNLPVWGGYDPARSGDDASFAVLLPDLRDGGGIRCIERHKWKGRS 462

Query: 353 LRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNR 412
                 +I  L EKYR   + ID    G    + ++              A  + +    
Sbjct: 463 YLWQAERIRELAEKYRFAHLGIDTTGPGIGVFEQVQQ---------FCPVATPINYGVQS 513

Query: 413 RTELHVKMADWLEFASL 429
           +  L +K  + +E   L
Sbjct: 514 KAMLVLKAREVIEEGRL 530


>gi|302339289|ref|YP_003804495.1| hypothetical protein Spirs_2798 [Spirochaeta smaragdinae DSM 11293]
 gi|301636474|gb|ADK81901.1| conserved hypothetical protein [Spirochaeta smaragdinae DSM 11293]
          Length = 295

 Score = 61.3 bits (147), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 49/257 (19%), Positives = 85/257 (33%), Gaps = 45/257 (17%)

Query: 85  GRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144
            R  GK+T+ A           G  +I ++ +  Q K  L  +V  +++L  +      +
Sbjct: 53  CRQAGKSTVIAAKAAHKAKFFSGSLIILVSPALRQSKE-LMRKVEDFIALDKSFPPASEE 111

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204
              L                       +    SE+      G        II DEAS  P
Sbjct: 112 DNQLTKE-------------FKNRSRIVALPGSEKTIRGLSGP-----TLIIIDEASRIP 153

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDP- 263
           D +   I   +   +     ++ + P    G FY+ +++    W + ++  R + G  P 
Sbjct: 154 DELYKAIRPMMAGADTE--LVLMTTPFGKRGVFYDAWSRSK-RWTKIEVVGRDILGRFPN 210

Query: 264 -------SFHEGIIARYGLDSDV--------------TRVEVCGQFPQQDIDSFIPLNII 302
                     +GI A Y     V               R E  G+F    IDS   +  +
Sbjct: 211 EQVYAQLRRKDGIKACYSPRHSVEFLGEELEEMGEWWYRQEYGGEFMDP-IDSVFNMEDV 269

Query: 303 EEALNREPCPDPYAPLI 319
             A+  +     +AP+I
Sbjct: 270 RAAIINDTPAISFAPII 286


>gi|273810556|ref|YP_003344937.1| gp2 [Sodalis phage SO-1]
 gi|258619841|gb|ACV84094.1| gp2 [Sodalis phage SO-1]
          Length = 461

 Score = 61.3 bits (147), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 69/335 (20%), Positives = 120/335 (35%), Gaps = 46/335 (13%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           +G G GK+ + A  V+ L++  PG   I    +   L   ++ E+ K       +  F  
Sbjct: 58  SGFGGGKSWVAARKVIQLLTLNPGHDGIVTEPTIPLLVKIMYPELEKAFDEAGFRWKFNK 117

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
           Q           D ++  L +  K    +C   S E     +G +  +   I+ DE   T
Sbjct: 118 Q-----------DKIYSVL-VKGKWTRVICE--SMENYTRLIGVNAAW---IVADEFDTT 160

Query: 204 PDVINLGILGFLTER---NANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR-FQIDTRTVE 259
              + L     L  R      R +++ S P       Y+IF    D  KR  +  T    
Sbjct: 161 KQDVALAAYHKLLGRLRAGFVRQFVIVSTPEGYRAM-YQIFEVEKDSQKRLIRAKTTDNH 219

Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI 319
            +   F + + ++Y   +++    + G F      +   +   EE  + E    P   LI
Sbjct: 220 HLPADFIDTLRSQY--PANLIDAYLNGLFVNLTSGAVYKMFNREENASTEEVQ-PEDTLI 276

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-------TDLRTTNNKISGLVEKYR---- 368
           +G D         VV +RR  + E+     +        DL  T   I  + E+Y     
Sbjct: 277 IGMDFNVTKM-AAVVYVRRQRITENKEFLDEIHAVDEFVDLFDTPAMIEAIEERYPDHCA 335

Query: 369 PDAIII--DANN-----TGARTCD--YLEMLGYHV 394
              +++  D++        A + D   LE  G+ V
Sbjct: 336 AGRVVVYPDSSGKSRKTVNASSSDIAQLEDAGFEV 370


>gi|330874284|gb|EGH08433.1| hypothetical protein PSYMP_06646 [Pseudomonas syringae pv.
           morsprunorum str. M302280PT]
          Length = 684

 Score = 61.3 bits (147), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 54/306 (17%), Positives = 86/306 (28%), Gaps = 56/306 (18%)

Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT-YSEERPDTFVGHHN 189
           +LS    +       +      W+   L  +  + SK         +      T  GHH 
Sbjct: 206 FLSASRAQSEIFRSYIIAFAQAWFGLELTGNPIVLSKDGKPWAELRFLSTNSSTAQGHHG 265

Query: 190 TYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFN 242
                +  DE     D   +N       T +   +     S P  +S + Y     E F 
Sbjct: 266 H----VYVDEYFWIRDFEKLNTVASAMATHKKWRKT--YFSTPSAVSHQAYPFWQGEKFR 319

Query: 243 K----------------------PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDV 279
                                  P   W++   I      G D    E +   Y  D D 
Sbjct: 320 NSKRKNAKEPWPSDKQISAGALCPDGQWRKVITILDAIAGGCDLFDLEQLQLEY--DDDK 377

Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNR----------EPCPDPYAPLIMGCDIAEEGG 329
            +     +F      +F  L  +E   +           +P P   +P+ +G D +    
Sbjct: 378 FQQLFMCKFIDSSQSAF-SLADLERCYSDLSLWADFDPDDPRPYGNSPVWIGYDPSRTRD 436

Query: 330 DNTVVV----LRRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGART 383
           D T VV    L  G     L    W     +    ++  L E++    I ID    G   
Sbjct: 437 DATCVVIAPPLENGGKFRILEKHSWRGQSFKYQAEQVKKLTERFNVQHIGIDTTGIGYGV 496

Query: 384 CDYLEM 389
            D +  
Sbjct: 497 FDLVRD 502


>gi|301386048|ref|ZP_07234466.1| hypothetical protein PsyrptM_25573 [Pseudomonas syringae pv. tomato
           Max13]
 gi|302060830|ref|ZP_07252371.1| hypothetical protein PsyrptK_12639 [Pseudomonas syringae pv. tomato
           K40]
 gi|302129770|ref|ZP_07255760.1| hypothetical protein PsyrptN_00140 [Pseudomonas syringae pv. tomato
           NCPPB 1108]
          Length = 684

 Score = 61.3 bits (147), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 54/306 (17%), Positives = 86/306 (28%), Gaps = 56/306 (18%)

Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT-YSEERPDTFVGHHN 189
           +LS    +       +      W+   L  +  + SK         +      T  GHH 
Sbjct: 206 FLSASRAQSEIFRSYIIAFAQAWFGLELTGNPIVLSKDGKPWAELRFLSTNSSTAQGHHG 265

Query: 190 TYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFN 242
                +  DE     D   +N       T +   +     S P  +S + Y     E F 
Sbjct: 266 H----VYVDEYFWIRDFEKLNTVASAMATHKKWRKT--YFSTPSAVSHQAYPFWQGEKFR 319

Query: 243 K----------------------PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDV 279
                                  P   W++   I      G D    E +   Y  D D 
Sbjct: 320 NSKRKAAKDPWPSDKQISAGALCPDGQWRKVITILDAIAGGCDLFDLEQLQLEY--DDDK 377

Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNR----------EPCPDPYAPLIMGCDIAEEGG 329
            +     +F      +F  L  +E   +           +P P   +P+ +G D +    
Sbjct: 378 FQQLFMCKFIDSSQSAF-SLADLERCYSDLSLWADFDPDDPRPYGNSPVWIGYDPSRTRD 436

Query: 330 DNTVVV----LRRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGART 383
           D T VV    L  G     L    W     +    ++  L E++    I ID    G   
Sbjct: 437 DATCVVIAPPLENGGKFRILEKHSWRGQSFKYQAEQVKKLTERFNVQHIGIDTTGIGYGV 496

Query: 384 CDYLEM 389
            D +  
Sbjct: 497 FDLVRD 502


>gi|152973346|ref|YP_001337126.1| putative prophage large terminase protein [Klebsiella pneumoniae
           subsp. pneumoniae MGH 78578]
 gi|150958195|gb|ABR80225.1| putative prophage large terminase protein [Klebsiella pneumoniae
           subsp. pneumoniae MGH 78578]
          Length = 589

 Score = 61.3 bits (147), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 38/207 (18%), Positives = 58/207 (28%), Gaps = 30/207 (14%)

Query: 244 PLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W++   I+     G      E +     +D    R     +F      S  P   +
Sbjct: 328 PDGQWRQIVTIEDALAGGCTLFNLEQLKRENSVDD--FRNLFMCEFVDDKA-SVFPFEDL 384

Query: 303 EEALNREPCPDPY-----------APLIMGCDIAEEGGDNTVVVL----RRGPVIEHL-- 345
           +  +                     P+ +G D +  G     VVL      G     L  
Sbjct: 385 QRCMVDSLEEWEDFAPFADNPFGSRPVWVGYDPSHSGDSAGCVVLAPPVVAGGKFRILER 444

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405
             W   D  T    I  L EKY  + I IDA   G      +               A D
Sbjct: 445 HQWKGMDFATQAESIRQLTEKYNVEYIGIDATGLGIGVFQLVR---------SFYPAARD 495

Query: 406 LEFCRNRRTELHVKMADWLEFASLINH 432
           + +    +T + +K  D +    L   
Sbjct: 496 IRYTPEMKTAMVLKAKDVIRRGCLEYD 522


>gi|296141561|ref|YP_003648804.1| terminase [Tsukamurella paurometabola DSM 20162]
 gi|296029695|gb|ADG80465.1| Terminase [Tsukamurella paurometabola DSM 20162]
          Length = 489

 Score = 61.3 bits (147), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 77/407 (18%), Positives = 115/407 (28%), Gaps = 74/407 (18%)

Query: 28  FSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRG 87
           F  F   F     KGT  +G    R WQ++    V      +V    P          RG
Sbjct: 27  FLAFADKFLR-VPKGTGAKGKLHLRDWQVDVARDVLDSGARTVGIMFP----------RG 75

Query: 88  IGKTTLNAWLVLWLMST-RPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSL 146
            GKTTLNA + L+   T   G +V  +A  E Q            L+    +   E+   
Sbjct: 76  QGKTTLNAAIALYRFFTGGEGANVCVVAVDERQAG----------LAFSAARRMVELNEE 125

Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDV 206
                  + D L+    + +      C   S   P    G      +  + DEA      
Sbjct: 126 LSARCQIFKDRLY----LPTTDSVFQCLPAS---PTALEGL---DYVLALVDEAGVVNRD 175

Query: 207 INLGILGFLTERNANRFWIMTSNPRRLSG--------KFYEIFNKPLD-DWKRFQI---- 253
           +   +      +      +    P              ++          W+ F      
Sbjct: 176 VFEVVQLA-QGKREKSVLVAIGTPGPNLDDQVLLSLRDYHLEHPDDASLRWREFSAAGFE 234

Query: 254 -----DTRTVEGIDPSFHEGIIAR--------YGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
                 T   E  +P+  + +              +S   R     QF      SF+P  
Sbjct: 235 DHPVDCTHCWELANPALDDFLHRDALVALLPPKTRESTFRRAR-LCQFAADTEGSFLPAG 293

Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRR---GPVIEHLFDWSKTDLR--- 354
           + E     EP P   A +++  D      D T ++L      P    L  W +       
Sbjct: 294 VWEGLSTGEPVP-LGAEVVIALD-GSFSDDTTALLLGTVAAAPHFHPLRVWERPADNDDW 351

Query: 355 -----TTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYR 396
                   N I      Y+   II D      RT   LE  G  V  
Sbjct: 352 RVPVLEVENTIRQACRDYQVVEIIADPFRW-TRTLQVLEQEGLPVVE 397


>gi|114046227|ref|YP_736777.1| hypothetical protein Shewmr7_0720 [Shewanella sp. MR-7]
 gi|113887669|gb|ABI41720.1| protein of unknown function DUF264 [Shewanella sp. MR-7]
          Length = 602

 Score = 61.3 bits (147), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 31/164 (18%), Positives = 54/164 (32%), Gaps = 20/164 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    I+     G D    + +   Y  + D         F   D DS    + +
Sbjct: 342 PDKQWRYVVTIEDALAGGCDLFDIDELREEY--NGDDFNNLFMCIFVD-DADSVFKFSDL 398

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD- 347
           E+ +        + P          + +G D +    + T+VV+    ++G     L   
Sbjct: 399 EKCMVDAARWQDHKPAAPRPFGNREVWLGYDPSRTRDNATLVVVAPGEKKGEKFRVLEKH 458

Query: 348 -WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390
            W   +      +I  +  KYR   I +D    GA   D +  L
Sbjct: 459 YWRGMNFSHHVAEIQKIYAKYRVTYIGVDTTGIGAGVFDSISTL 502


>gi|291334706|gb|ADD94352.1| hypothetical protein Ddes_0719 [uncultured phage
           MedDCM-OCT-S04-C890]
          Length = 311

 Score = 60.9 bits (146), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 37/266 (13%), Positives = 81/266 (30%), Gaps = 32/266 (12%)

Query: 107 GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166
                 +A +  Q K+  W  + ++ + +PN  + E +     P      +L        
Sbjct: 6   NPRFAYIAPTFKQAKSIAWDYMKQFTAKIPNTKFNETELRVDLPNGSRITLLG------- 58

Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVIN-LGILGFLTERNANRFWI 225
                       E  D   G +       + DE +     +    I   L++R    + +
Sbjct: 59  -----------AENSDGLRGIYLDGC---VIDEYANIDGKLFAEIIRPALSDR--KGYCV 102

Query: 226 MTSNPRRLSGKFYEIFNKPLD--DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283
               P  ++  FY+++       DW  ++      + +DP   E      G        E
Sbjct: 103 FIGTPAGMNNNFYDLYQHANGAEDWFNYKAKASDTKIVDPEELEKAKEVMGEKK--YLQE 160

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE---GGDNTVVVLRRGP 340
               +      +     I +     +    PY P  +    A +      ++++  ++  
Sbjct: 161 FECDWIANIEGAIYGEEIAKIEDKNQIARVPYDP-TLPVSTAWDLGVADHSSIIFFQQKG 219

Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEK 366
               + D+ +       + I  L EK
Sbjct: 220 TGVQIIDYHEERGHGLPHYIQMLEEK 245


>gi|330830158|ref|YP_004393110.1| phage-related terminase [Aeromonas veronii B565]
 gi|328805294|gb|AEB50493.1| Phage-related terminase [Aeromonas veronii B565]
          Length = 588

 Score = 60.9 bits (146), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 34/210 (16%), Positives = 67/210 (31%), Gaps = 35/210 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-NREPCPDPYAP------- 317
            + + + Y  D    R  +  +F      S  PL  ++  + +     + Y P       
Sbjct: 349 LDQLRSEYSEDE--YRNLLMCEFMDDTE-SLFPLATLQRCMVDSWLVWEDYKPHTLRPLA 405

Query: 318 ---LIMGCDIAEEGGDNTV--------VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEK 366
              + +G D A+ G  ++         +V      +     W   D       I  + ++
Sbjct: 406 NRAVWIGYDPAKGGKGDSAGCAVLAPPLVPGGKFRVLERHRWQGMDFDAQAKSIRAICDR 465

Query: 367 YRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF 426
           Y    I ID    G      ++                 +++  N +  + +K  D +  
Sbjct: 466 YNVAYIGIDTTGIGEGVYQLVKQ---------FYPAVTAIQYNPNVKMRMVMKAQDVMNK 516

Query: 427 ASLINHSG---LIQNLKSLKSFIVPNTGEL 453
             L   SG   L Q   S++   V  +G+L
Sbjct: 517 GRLEFDSGWTDLAQAFMSIRR-AVTQSGKL 545


>gi|330939345|gb|EGH42730.1| hypothetical protein PSYPI_10145 [Pseudomonas syringae pv. pisi
           str. 1704B]
          Length = 650

 Score = 60.9 bits (146), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 54/306 (17%), Positives = 86/306 (28%), Gaps = 56/306 (18%)

Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT-YSEERPDTFVGHHN 189
           +LS    +       +      W+   L  +  + SK         +      T  GHH 
Sbjct: 206 FLSASRAQSEIFRSYIIAFAQAWFGLELTGNPIVLSKDGKPWAELRFLSTNSSTAQGHHG 265

Query: 190 TYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFN 242
                +  DE     D   +N       T +   +     S P  +S + Y     E F 
Sbjct: 266 H----VYVDEYFWIRDFEKLNTVASAMATHKKWRKT--YFSTPSAVSHQAYPFWQGEKFR 319

Query: 243 K----------------------PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDV 279
                                  P   W++   I      G D    E +   Y  D D 
Sbjct: 320 NSKRKAAKDPWPSDKQISAGALCPDGQWRKVITILDAIAGGCDLFDLEQLQLEY--DDDK 377

Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNR----------EPCPDPYAPLIMGCDIAEEGG 329
            +     +F      +F  L  +E   +           +P P   +P+ +G D +    
Sbjct: 378 FQQLFMCKFIDSSQSAF-SLADLERCYSDLSLWADFDPDDPRPYGNSPVWIGYDPSRTRD 436

Query: 330 DNTVVV----LRRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGART 383
           D T VV    L  G     L    W     +    ++  L E++    I ID    G   
Sbjct: 437 DATCVVIAPPLENGGKFRILEKHSWRGQSFKYQAEQVKKLTERFNVQHIGIDTTGIGYGV 496

Query: 384 CDYLEM 389
            D +  
Sbjct: 497 FDLVRD 502


>gi|330985172|gb|EGH83275.1| hypothetical protein PLA107_09108 [Pseudomonas syringae pv.
           lachrymans str. M301315]
          Length = 684

 Score = 60.9 bits (146), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 54/306 (17%), Positives = 86/306 (28%), Gaps = 56/306 (18%)

Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT-YSEERPDTFVGHHN 189
           +LS    +       +      W+   L  +  + SK         +      T  GHH 
Sbjct: 206 FLSASRAQSEIFRSYIIAFAQAWFGLELTGNPIVLSKDGKPWAELRFLSTNSSTAQGHHG 265

Query: 190 TYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFN 242
                +  DE     D   +N       T +   +     S P  +S + Y     E F 
Sbjct: 266 H----VYVDEYFWIRDFEKLNTVASAMATHKKWRKT--YFSTPSAVSHQAYPFWQGEKFR 319

Query: 243 K----------------------PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDV 279
                                  P   W++   I      G D    E +   Y  D D 
Sbjct: 320 NSKRKAAKDPWPSDKQISAGALCPDGQWRKVITILDAIAGGCDLFDLEQLQLEY--DDDK 377

Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNR----------EPCPDPYAPLIMGCDIAEEGG 329
            +     +F      +F  L  +E   +           +P P   +P+ +G D +    
Sbjct: 378 FQQLFMCKFIDSSQSAF-SLADLERCYSDLSLWADFDPDDPRPYGNSPVWIGYDPSRTRD 436

Query: 330 DNTVVV----LRRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGART 383
           D T VV    L  G     L    W     +    ++  L E++    I ID    G   
Sbjct: 437 DATCVVIAPPLENGGKFRILEKHSWRGQSFKYQAEQVKKLTERFNVQHIGIDTTGIGYGV 496

Query: 384 CDYLEM 389
            D +  
Sbjct: 497 FDLVRD 502


>gi|331017153|gb|EGH97209.1| hypothetical protein PLA106_13994 [Pseudomonas syringae pv.
           lachrymans str. M302278PT]
          Length = 684

 Score = 60.9 bits (146), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 54/306 (17%), Positives = 86/306 (28%), Gaps = 56/306 (18%)

Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT-YSEERPDTFVGHHN 189
           +LS    +       +      W+   L  +  + SK         +      T  GHH 
Sbjct: 206 FLSASRAQSEIFRSYIIAFAQAWFGLELTGNPIVLSKDGKPWAELRFLSTNSSTAQGHHG 265

Query: 190 TYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFN 242
                +  DE     D   +N       T +   +     S P  +S + Y     E F 
Sbjct: 266 H----VYVDEYFWIRDFEKLNTVASAMATHKKWRKT--YFSTPSAVSHQAYPFWQGEKFR 319

Query: 243 K----------------------PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDV 279
                                  P   W++   I      G D    E +   Y  D D 
Sbjct: 320 NSKRKAAKDPWPSDKQISAGALCPDGQWRKVITILDAIAGGCDLFDLEQLQLEY--DDDK 377

Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNR----------EPCPDPYAPLIMGCDIAEEGG 329
            +     +F      +F  L  +E   +           +P P   +P+ +G D +    
Sbjct: 378 FQQLFMCKFIDSSQSAF-SLADLERCYSDLSLWADFDPDDPRPYGNSPVWIGYDPSRTRD 436

Query: 330 DNTVVV----LRRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGART 383
           D T VV    L  G     L    W     +    ++  L E++    I ID    G   
Sbjct: 437 DATCVVIAPPLENGGKFRILEKHSWRGQSFKYQAEQVKKLTERFNVQHIGIDTTGIGYGV 496

Query: 384 CDYLEM 389
            D +  
Sbjct: 497 FDLVRD 502


>gi|190890121|ref|YP_001976663.1| hypothetical protein RHECIAT_CH0000492 [Rhizobium etli CIAT 652]
 gi|190695400|gb|ACE89485.1| hypothetical conserved protein [Rhizobium etli CIAT 652]
          Length = 465

 Score = 60.9 bits (146), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 58/406 (14%), Positives = 128/406 (31%), Gaps = 61/406 (15%)

Query: 85  GRGIGKTTLNAWLVLWLMSTRPG---------ISVICLANSETQLKTTLWAEVSKWLSLL 135
           GR  GK+   A + ++L                +V+ +A    Q +  L   V    ++L
Sbjct: 68  GRRGGKSFTMALIAVFLACFFDYRQYLAPGERATVLVIATDRRQARVIL-RYVR---AML 123

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
            N    +          +  D        +S        ++   R  T+           
Sbjct: 124 DNIPLLQAMVERDTADSFDLD--------NSTTIEVGTASFRSTRGYTYAAVLCDELAFW 175

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT 255
             D+A+     I   I   +     N   +  S+P    G  ++ F +         +  
Sbjct: 176 RTDDAAEPDYAILDAIRPGMASI-PNSMLLCASSPHARRGALWDAFKRFWGKDDAPLVWR 234

Query: 256 RTVEGIDPSFHEGIIAR-YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR---EPC 311
                ++P+  + ++ R    D      E   +F + DI+ F+ + ++E+ ++R   E  
Sbjct: 235 AATREMNPTISQSVVDRALERDHASAMAEYGAEF-RSDIEQFVNIEVVEDCVSRGVYERA 293

Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-----TDLRTTNNKISGLVEK 366
           P P        D +    D+  + +       ++ D  +         +   + +  + K
Sbjct: 294 PLPNIRYRAFVDPSGGSNDSMTLAIGHKEGERNILDCVRERKPPFSPESVVAEFADTLAK 353

Query: 367 YRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF 426
           YR   +               +       R   +K+ +  +     R++L+  M   L  
Sbjct: 354 YRVREV-------------EGDRYAGEWPREQFRKKGITYKIAEKPRSDLYRDMLPLLNS 400

Query: 427 A--SLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAK-STDYS 469
               L++   L+  +               +E +  +G K S D++
Sbjct: 401 GVADLLDSDRLVTQIVG-------------LERRVSRGGKESIDHA 433


>gi|83943081|ref|ZP_00955541.1| hypothetical protein EE36_12908 [Sulfitobacter sp. EE-36]
 gi|83846089|gb|EAP83966.1| hypothetical protein EE36_12908 [Sulfitobacter sp. EE-36]
          Length = 259

 Score = 60.9 bits (146), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 45/260 (17%), Positives = 75/260 (28%), Gaps = 40/260 (15%)

Query: 49  SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108
             P +WQ++ +        +  +N    +      +GR  GK+T    L         G 
Sbjct: 30  GEPDAWQVDLLRS------DPRSNEADRMILAL--SGRQSGKSTTAGGLG--YDDFSRGK 79

Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLP-NKHWFEMQSLSLHPAPWYSDVLHCSLGIDSK 167
           +VI  A S  Q  T L+  + ++ +  P            L   P +   +      D  
Sbjct: 80  TVILTAPSLRQ-STELFRRILEYKNTDPFCPPIVRQTQTELEAHPRHGGRIIVVPATDQ- 137

Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMT 227
                 R  + +               II DEA    D           E        + 
Sbjct: 138 -----ARGMTAD--------------TIIADEACFLDDDALTAFFPMRKETG---RIFLL 175

Query: 228 SNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQ 287
           S P    G FYE +       +R    +  +     +  E   A   +     R E   +
Sbjct: 176 STPNMRQGYFYETWTSAKRV-RRITARSIDIPR-RKAQVEFDKAT--MSEATFRREHLCE 231

Query: 288 FPQQDIDSFIPLNIIEEALN 307
           F        +    +E+A N
Sbjct: 232 FI-GAGTPLVSWEALEKASN 250


>gi|332185581|ref|ZP_08387329.1| terminase-like family protein [Sphingomonas sp. S17]
 gi|332014559|gb|EGI56616.1| terminase-like family protein [Sphingomonas sp. S17]
          Length = 436

 Score = 60.9 bits (146), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 68/409 (16%), Positives = 134/409 (32%), Gaps = 58/409 (14%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141
           I AGRG GKT   A  V  L    PG  +  +  +   ++  +          +  +   
Sbjct: 60  IRAGRGFGKTRAGAEWVSALARDNPGARIALMGATLRDVERVM----------VRGESGL 109

Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVG--HHNTY--GMAIIN 197
              +       W   +        +  ++     YS   P+   G  HH  +   +    
Sbjct: 110 LAVARKGEAPKWIGSLGQVHFTSGAIGFA-----YSAAAPEALRGPQHHAAWCDELGKWK 164

Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRT 257
            EA G  +++    LG       +   ++T+ PR        +  K +      +   RT
Sbjct: 165 GEA-GWDNLMMTLRLG------EHPRVLVTTTPRATP-----LMRKVMALPDCVETIGRT 212

Query: 258 VE--GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315
            +   +  SF + ++++YG D+ + R E+ G+       +     +++    R       
Sbjct: 213 SDNAHLPDSFQDAMLSQYG-DTRLGRQELDGEMVDDREGALWTRALLDR--QRVKTVPAL 269

Query: 316 APLIMGCD-IAEEGGDNTVVV---LRRGPVIEHLFDWSKTDLRTT--NNKISGLVEKYRP 369
             +++G D  A   GD   +V   L R      L D S+  L       +++G   + R 
Sbjct: 270 DRVVVGVDPPATSSGDACGIVAVGLGRDGHGYVLEDASEAGLSPEGWAARVAGCARRNRA 329

Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL 429
           D ++ + N  G    + +  L      V     ++         + L+ +   W      
Sbjct: 330 DRVVAERNQ-GGDMVESVLRLADPTLPVHLVYASIGKAARAEPVSFLYAQGRVW-HARGF 387

Query: 430 INHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478
                 +  L    ++  P                S D +D L++   E
Sbjct: 388 PALEDELCGLGVAGAYDGP--------------GHSPDRADALVWALTE 422


>gi|320172719|gb|EFW47954.1| Phage terminase, ATPase subunit [Shigella dysenteriae CDC 74-1112]
          Length = 590

 Score = 60.9 bits (146), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 35/209 (16%), Positives = 61/209 (29%), Gaps = 33/209 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGELA 454
            + +     ++ S        + ++G +A
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGRIA 546


>gi|239502629|ref|ZP_04661939.1| hypothetical protein AbauAB_09982 [Acinetobacter baumannii AB900]
          Length = 414

 Score = 60.9 bits (146), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 49/307 (15%), Positives = 98/307 (31%), Gaps = 38/307 (12%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141
           + AGR  GKT+L+  L++   S +P   +  +A +    K  +W ++             
Sbjct: 26  VVAGRRWGKTSLSRTLII-SKSRKPRQRIWYVAPTYRMAKQIMWKDL------------- 71

Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201
               +   P  W   + H SL I+  +  T+      + PD+  G        ++ DE  
Sbjct: 72  ----IEAIPRKWVVKINHSSLSIELVN-GTLIELKGADDPDSLRGVGID---FLVLDEFQ 123

Query: 202 GTPDVINL-GILGFLTERNANRFWIMTSNPRRLSGKF------YEIFNKPLDDWKRFQID 254
              +      +   L     +  +     P+  +  +       +        W+ +Q  
Sbjct: 124 DISEEAWTQCLRPTLASTGGHAIF--IGTPKAYNQLYTVYMQGQDPKKVKAGQWQSWQFP 181

Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP 314
           T T   I  S  E   A     S   + E    F       + P +  E     +   DP
Sbjct: 182 TITSPFIPESEIEAARADMDEKS--FKQEFLASFETMSGRVYYPFDRKEH--VGKYPFDP 237

Query: 315 YAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVEKY-RPDA 371
             P+ +G D   +     ++  +    +  + +     ++      +I     +Y +   
Sbjct: 238 KLPIWIGMDFNIDPMSTVIMQPQPNGEVWVVDEIVQFGSNTEEICEEIERKYWRYMKQIV 297

Query: 372 IIIDANN 378
           I  D   
Sbjct: 298 IFPDPAG 304


>gi|291336431|gb|ADD95986.1| hypothetical protein Ddes_0719 [uncultured organism
           MedDCM-OCT-S04-C1073]
          Length = 311

 Score = 60.9 bits (146), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 37/266 (13%), Positives = 81/266 (30%), Gaps = 32/266 (12%)

Query: 107 GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166
                 +A +  Q K+  W  + ++ + +PN  + E +     P      +L        
Sbjct: 6   NPRYAYIAPTFKQAKSIAWDYMKQFTAKIPNTKFNETELRVDLPNGSRITLLG------- 58

Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVIN-LGILGFLTERNANRFWI 225
                       E  D   G +       + DE +     +    I   L++R    + +
Sbjct: 59  -----------AENSDGLRGIYLDGC---VIDEYANIDGKLFAEIIRPALSDR--KGYCV 102

Query: 226 MTSNPRRLSGKFYEIFNKPLD--DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283
               P  ++  FY+++       DW  ++      + +DP   E      G        E
Sbjct: 103 FIGTPAGMNNNFYDLYQHANGAEDWFNYKAKASDTKIVDPEELEKAKEVMGEKK--YLQE 160

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE---GGDNTVVVLRRGP 340
               +      +     I +     +    PY P  +    A +      ++++  ++  
Sbjct: 161 FECDWIANIEGAIYGEEIAKIEDKNQIARVPYDP-TLPVSTAWDLGVADHSSIIFFQQKG 219

Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEK 366
               + D+ +       + I  L EK
Sbjct: 220 TGVQIIDYHEERGHGLPHYIQMLEEK 245


>gi|67920466|ref|ZP_00513986.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
 gi|67857950|gb|EAM53189.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
          Length = 244

 Score = 60.5 bits (145), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 39/234 (16%), Positives = 74/234 (31%), Gaps = 37/234 (15%)

Query: 74  NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP-------------GISVICLANSETQL 120
           +P+ F+  +  GR  GK+ L     +      P               +V+    +  Q 
Sbjct: 18  DPQKFQVLV-CGRRFGKSHLQVTKHVIDCLMFPKLMPGYNVKQQTMETAVLVGMPTLKQA 76

Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180
           +  LW  + K L   P          ++       D++   L  ++   +   + +    
Sbjct: 77  RKILWKPLVKTLENCPYVDKISRSDYTIRFKGNRPDIILAGLNDNAGDRARGLKLWR--- 133

Query: 181 PDTFVGHHNTYGMAIINDEASGT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYE 239
                         +  DE     P VI+  I+  + +   +   + T  P+  +   Y 
Sbjct: 134 --------------VCIDEVQDVRPSVIDAVIIPAMADT-PHSRALFTGTPKGKNNHLYN 178

Query: 240 IFN--KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQ 291
           +F   +  DDWK +   T T   I     E    R  L   +   E   Q+ + 
Sbjct: 179 LFTMERDNDDWKSYNFPTWTNPLISKDEVERARKR--LSPRLFSQEFEAQWKES 230


>gi|294085818|ref|YP_003552578.1| hypothetical protein SAR116_2251 [Candidatus Puniceispirillum
           marinum IMCC1322]
 gi|292665393|gb|ADE40494.1| protein of unknown function DUF264 [Candidatus Puniceispirillum
           marinum IMCC1322]
          Length = 454

 Score = 60.5 bits (145), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 74/401 (18%), Positives = 133/401 (33%), Gaps = 54/401 (13%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           AGRG GKT   A  + WL  +     +  +  +    +  +    S  LS+ PN      
Sbjct: 82  AGRGFGKTRAGAEWIRWLAQSGRARRIALVGETFDDARQVMVEGASGILSVCPN------ 135

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE-ASG 202
                    W                 T+ R YS + P+   G    YG     DE A  
Sbjct: 136 ---------WARPAWRAGQRTLIWPSGTIARCYSADDPEQLRGPEFDYG---WADEIAKW 183

Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE-GI 261
                   ++  L     +   I T+ P R      ++     +D    Q  +R     +
Sbjct: 184 RYPSAWDNLMLAL-RIGKSPQCIATTTP-RPVRWLADL--AAAEDTVLVQGASRENAANL 239

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMG 321
            P+F   +  R+G DS + R E+ G       D+    N I       P    +  +++G
Sbjct: 240 SPAFMAAMHRRFG-DSYLARQELEGIMMSNLPDALWCRNDILRLHRPMPKRHRFIRIVIG 298

Query: 322 CDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK----ISGLVEKYRPDAIIIDAN 377
            D A  GGD T ++        H++  +   L  T ++    I  +  ++R D++I + N
Sbjct: 299 VDPAMGGGDETGIITAGKDQDGHIWILADDSLHATPDRWAVQIQRVFRQWRADSVIAEIN 358

Query: 378 NTGARTCDYLEMLG--YHVYRVLG-QKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434
             G+     L   G    V  V   + +++  E            ++   +F +L +   
Sbjct: 359 QGGSLIRTLLAQAGCALPVREVRAMRSKSIRAEPVA--AAYARGDVSHAGQFGALED--- 413

Query: 435 LIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
               + +                   +   S D  D +++ 
Sbjct: 414 ---QMCACVP--------------GQRQTPSPDRLDAMVWA 437


>gi|126173520|ref|YP_001049669.1| hypothetical protein Sbal_1282 [Shewanella baltica OS155]
 gi|125996725|gb|ABN60800.1| protein of unknown function DUF264 [Shewanella baltica OS155]
          Length = 602

 Score = 60.5 bits (145), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 37/207 (17%), Positives = 71/207 (34%), Gaps = 30/207 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP-------- 317
            + +   Y  + D         F   D DS    + +E+ +        Y P        
Sbjct: 365 IDELRDEY--NGDDFANLFMCIFVD-DADSVFKFSDLEKCMVEAARWQDYKPAAPRPFGN 421

Query: 318 --LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRP 369
             + +G D +    + T+VV+    ++G     L    W   +      +I  +  KYR 
Sbjct: 422 REVWLGYDPSRTRDNATLVVVAPGEKKGEKFRVLEKHYWRGMNFSHHVAEIQKIYAKYRV 481

Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL 429
             I +D    GA   D +  L          + A  + +    +T L +KM D +E   +
Sbjct: 482 TYIGVDTTGIGAGVFDSISTL--------YPREATAIHYSVGSKTRLVLKMIDVIEGGRI 533

Query: 430 I---NHSGLIQNLKSLKSFIVPNTGEL 453
                H  +  +  S++  +  + G +
Sbjct: 534 EWDAGHKDIAMSCLSIRRTVTDSGGAI 560


>gi|152985800|ref|YP_001350388.1| hypothetical protein PSPA7_5052 [Pseudomonas aeruginosa PA7]
 gi|152986886|ref|YP_001346099.1| hypothetical protein PSPA7_0704 [Pseudomonas aeruginosa PA7]
 gi|150960958|gb|ABR82983.1| conserved hypothetical protein, putative [Pseudomonas aeruginosa
           PA7]
 gi|150962044|gb|ABR84069.1| conserved hypothetical protein, putative [Pseudomonas aeruginosa
           PA7]
          Length = 682

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Composition-based stats.
 Identities = 34/179 (18%), Positives = 53/179 (29%), Gaps = 20/179 (11%)

Query: 244 PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W++   I      G +    E +   Y  D +        +F      +F  L  +
Sbjct: 340 PDGQWRKVITIQDAIAGGCNLFDLERLQLEY--DEERFEQLFMCKFIDSTQAAF-ALADL 396

Query: 303 EEALNR----------EPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHLFD- 347
           E   +            P P    P+ +G D +    D T VV    L  G     L   
Sbjct: 397 ERCYSDLGLWTDYDPDSPRPFDNRPVWLGYDPSRTRDDATCVVVAPPLEPGGKFRILEKH 456

Query: 348 -WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405
            W  T       +I  L E++    I ID    G    D ++        +     A +
Sbjct: 457 SWRGTSFTHQAKQIEKLCERFNVQHIGIDITGVGYGVFDLVKDFFPRATPIHYSLEAKN 515


>gi|260582917|ref|ZP_05850701.1| terminase ATPase subunit [Haemophilus influenzae NT127]
 gi|260094017|gb|EEW77921.1| terminase ATPase subunit [Haemophilus influenzae NT127]
          Length = 593

 Score = 60.1 bits (144), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 33/219 (15%), Positives = 73/219 (33%), Gaps = 31/219 (14%)

Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP-- 317
           G +    + +IA    +          QF   +  +F   ++    ++       Y P  
Sbjct: 348 GCNLFNIDDLIAENSKEE--FEQLFLCQFADDNSSAFKFSDLQLCQVDSLEEWHDYKPFY 405

Query: 318 --------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGL 363
                   + +G D A  G    +V++           + H   +   D  T  ++I   
Sbjct: 406 QRPFGNREVWLGYDPAFTGDRAALVIVAPPKVEGGDYRVLHKQTFHGMDYETQASRIKQF 465

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            + Y    I+ID    G+     +               A  LE+  + + E+ +K  + 
Sbjct: 466 CDDYNVTRIVIDKTGMGSGVYQEVR---------KFYPMAQGLEYNADLKNEMVLKTQNL 516

Query: 424 LEFASL---INHSGLIQNLKSLKSFIVPNTGELAIESKR 459
           ++   L      + ++ +  ++K   +  TG++   S R
Sbjct: 517 IQKRRLKFDSGDNDIVSSFMTVKK-RITGTGKITYVSDR 554


>gi|332654528|ref|ZP_08420271.1| phage terminase, large subunit, PBSX family [Ruminococcaceae
           bacterium D16]
 gi|332516492|gb|EGJ46098.1| phage terminase, large subunit, PBSX family [Ruminococcaceae
           bacterium D16]
          Length = 418

 Score = 60.1 bits (144), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 55/347 (15%), Positives = 112/347 (32%), Gaps = 46/347 (13%)

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVL-WLMSTRPGISVICLANSETQLKTTLWA 126
           +   N    +  GA+ +    GKT         W MS     +      S   ++  L +
Sbjct: 21  SPFRNCQAIICDGAVRS----GKTLCTGLSFFCWAMSCYQDKTFALCGKSIPSVRRNLLS 76

Query: 127 EVSKWLSLL--PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTF 184
           E+   L  L    +       L++      S+  +   G+D        R+ +  +  T 
Sbjct: 77  ELLPILRQLGFSCRERASRNQLTVTM-GHRSNTFYLFGGLDE-------RSAALVQGITL 128

Query: 185 VGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP 244
            G         + DE +  P      +    +   + R W    NP   +  FY+ + + 
Sbjct: 129 AGA--------LLDEVALMPRSFVEQVCARCSVEGS-RLWFSC-NPESPAHWFYQEWIQK 178

Query: 245 LDDWKRFQID--TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDS--FIPLN 300
            ++ K  ++         + P+  E     +       R  V G++   +     F   +
Sbjct: 179 AEEKKVLRLSFAMTDNPSLSPAMLERYRTMF--QGAFYRRFVLGEWVNAEGLVYDFFSQD 236

Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--------SKTD 352
           ++     REP  D   P  + CD       +  +  R+  V   L ++         +  
Sbjct: 237 LV-----REPPLDVSGPFYVSCDYGTVNPTSMGLWGRKNGVWYRLEEYYYNSRQARRQKT 291

Query: 353 LRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLG 399
            +   + +  LV+     A+++D +   A   + L   G  V +   
Sbjct: 292 DQEYADDLGALVKGRPLGAVVVDPSA--ASFIEVLRRRGVPVRKANN 336


>gi|289628558|ref|ZP_06461512.1| hypothetical protein PsyrpaN_26063 [Pseudomonas syringae pv.
           aesculi str. NCPPB3681]
 gi|289648058|ref|ZP_06479401.1| hypothetical protein Psyrpa2_09957 [Pseudomonas syringae pv.
           aesculi str. 2250]
 gi|330870325|gb|EGH05034.1| hypothetical protein PSYAE_24348 [Pseudomonas syringae pv. aesculi
           str. 0893_23]
          Length = 684

 Score = 60.1 bits (144), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 54/306 (17%), Positives = 86/306 (28%), Gaps = 56/306 (18%)

Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT-YSEERPDTFVGHHN 189
           +LS    +       +      W+   L  +  + SK         +      T  GHH 
Sbjct: 206 FLSASRAQSEIFRSYIIAFAQSWFGLELTGNPIVLSKDGKPWAELRFLSTNSSTAQGHHG 265

Query: 190 TYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFN 242
                +  DE     D   +N       T +   +     S P  +S + Y     E F 
Sbjct: 266 H----VYVDEYFWIRDFEKLNTVASAMATHKKWRKT--YFSTPSAVSHQAYPFWQGEKFR 319

Query: 243 K----------------------PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDV 279
                                  P   W++   I      G D    E +   Y  D D 
Sbjct: 320 NSKRKAAKDPWPSDKQISAGALCPDGQWRKVITILDAIAGGCDLFDLEQLQLEY--DEDK 377

Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNR----------EPCPDPYAPLIMGCDIAEEGG 329
            +     +F      +F  L  +E   +           +P P   +P+ +G D +    
Sbjct: 378 FQQLFMCKFIDSSQSAF-SLADLERCYSDLSLWADFDPDDPRPYGNSPVWIGYDPSRTRD 436

Query: 330 DNTVVV----LRRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGART 383
           D T VV    L  G     L    W     +    ++  L E++    I ID    G   
Sbjct: 437 DATCVVIAPPLENGGKFRILEKHSWRGQSFKYQAEQVKKLTERFNVQHIGIDTTGIGYGV 496

Query: 384 CDYLEM 389
            D +  
Sbjct: 497 FDLVRD 502


>gi|116751218|ref|YP_847905.1| hypothetical protein Sfum_3801 [Syntrophobacter fumaroxidans MPOB]
 gi|116700282|gb|ABK19470.1| conserved hypothetical protein [Syntrophobacter fumaroxidans MPOB]
          Length = 507

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 62/394 (15%), Positives = 108/394 (27%), Gaps = 69/394 (17%)

Query: 38  WGEKGTPLEGFSAPRSW--QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNA 95
           WG+        S    W  Q+E +     + ++                GR +GK+ + +
Sbjct: 20  WGQAYLYNRDGSGRDYWPHQVEDLRCPAKNIIHLD--------------GRDVGKSIVLS 65

Query: 96  WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYS 155
              L    T  G   +  A  +  L T +  E+   L   P+     M S++L       
Sbjct: 66  TDALHYAFTTRGGQGLIAAPHQGHLDTII-EEIEFQLDSNPD----LMNSIALTKYGKPK 120

Query: 156 DVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL 215
                   ++  + S +    +    D F   H      +  DE +   +     +   L
Sbjct: 121 IHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLTERAWKALRQCL 177

Query: 216 TERNANRFWIMTSNPRRL-SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARY- 273
                 R +   S P  L    +Y +     D +  F+  +             ++  Y 
Sbjct: 178 KAGGTLRIY---STPNGLRDTTYYRL--TSSDQFHVFRWPSWLNPLWTEDREAELLEFYG 232

Query: 274 GLDSDVTRVEVCGQFPQQDIDSF-----------------IPLNII--------EEALNR 308
           G DS   + EV G+  +    +F                 I +           E A +R
Sbjct: 233 GRDSSGWQHEVAGEHGKPSYGAFNVEQFNLCRQDLLEYQKIVITDSELRDCDTEEAAHDR 292

Query: 309 -----EPCPDPYAPLIMGCDIAEEGG-------DNTVVVLRRGPVIEHLFDWSKTDLRTT 356
                   P      + G D+              T +  R    +              
Sbjct: 293 LEMLLNLTPRSGQFWVGG-DLGYTNDPTEIVVFQETEIGERTLLKMILRVHLEHVSYPHI 351

Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390
              I+ L   Y P  I +D    G      L  L
Sbjct: 352 AQIIALLERYYTPAGIGVDNGGNGLAVVQELLTL 385


>gi|170748408|ref|YP_001754668.1| hypothetical protein Mrad2831_1990 [Methylobacterium radiotolerans
           JCM 2831]
 gi|170654930|gb|ACB23985.1| conserved hypothetical protein [Methylobacterium radiotolerans JCM
           2831]
          Length = 478

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 52/334 (15%), Positives = 106/334 (31%), Gaps = 35/334 (10%)

Query: 154 YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILG 213
            S+ +    G+D +      +T    R +T  G           + ++    +I   +  
Sbjct: 145 TSETIRLLSGVDIEVRPANYKTI---RGETLAGCLADEVAFWHLENSANPDTLILDAVRP 201

Query: 214 FLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI------DTRTVEGIDPSFHE 267
            L          + S+P    G+ Y    +         +             +DP+  +
Sbjct: 202 GLATTGGP--LCVLSSPYARKGELYRTHQRDFGPSGDPAVLVLRAPSQTMNPSLDPAVVK 259

Query: 268 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN---REPCPDPYAPLIMGCDI 324
                Y  D      E   +F + D+++FI L  ++  +     E  P P       CD 
Sbjct: 260 ---RAYTRDPAAASAEYGAEF-RADVEAFISLEAVQACMAGDLLERAPAPGLTYQAFCDP 315

Query: 325 AEEGGDNTVVVLRRGPVIEHLFD-----WSKTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
           +  G D+  + +          D     +         +  + L++ Y   ++  D    
Sbjct: 316 SGGGADSMTLAIGHAENGIAYLDAVREMYPGGSPEAVVSTFAELLQSYGLGSVTGDHY-A 374

Query: 380 GARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNL 439
           G    +   + G    R    K  +  EF     ++   +         ++  + L   L
Sbjct: 375 GEWPKERFRVHGITYERSERSKSDIYREFLPVLNSQ---RCR-------MLPVAKLEAQL 424

Query: 440 KSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLM 473
            SL+      TG+  I+  +VKGA   D ++ + 
Sbjct: 425 VSLERRTTRGTGKDTIDHPQVKGAHD-DVANAVA 457


>gi|323699495|ref|ZP_08111407.1| protein of unknown function DUF264 [Desulfovibrio sp. ND132]
 gi|323459427|gb|EGB15292.1| protein of unknown function DUF264 [Desulfovibrio desulfuricans
           ND132]
          Length = 428

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 55/334 (16%), Positives = 98/334 (29%), Gaps = 43/334 (12%)

Query: 79  KGAISAGRG-IGKTTLN-AWLVLWLMSTR-PGISVICLANSETQLKTTLWAEVSKWLSLL 135
           + A+       GKT L+   L+     TR        +A    Q KT +W E+ ++    
Sbjct: 21  RFAVLVCHRRFGKTVLSVNRLINAARETRRDDWRGAYIAPLYRQAKTVVWDELKRY---- 76

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
                 +  ++  +     +D  + S            R +    PD+  G +      +
Sbjct: 77  -CGFGLDGCTVKFNETELRADFDNGSR----------IRLFGANNPDSLRGMYLDG---V 122

Query: 196 INDEASGTPDVIN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL--DDWKRFQ 252
           + DE +  P  +    I   L++R     +     PR  +   YEI+ K     DW    
Sbjct: 123 VFDEVAQMPLRVWTEVIRPALSDRKGWAMF--IGTPRGKNA-LYEIWEKGKTDPDWLAAM 179

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPL---NIIEEALNRE 309
                   +     E       +  +    E    F      ++      +   E    +
Sbjct: 180 YRASETGILPVEELEASARE--MSPEEYEQEFECSFTAAIRGAYFGQLLADADREGRMTD 237

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVL---RRGPVIEHLFDWS--KTDLRTTNNKISGLV 364
              DP  P+    D+     D+T +     R G     +  +      L      +    
Sbjct: 238 VPADPSMPVHTAWDLGM--SDSTSIWFVQARPGGTFAVIDYYEACGEGLDHYARILDDKG 295

Query: 365 EKYR----PDAIIIDANNTGARTCDYLEMLGYHV 394
            KY     P  I +    TG    +    LG   
Sbjct: 296 YKYGTHIAPHDIRVRELGTGKSRLETARSLGIRF 329


>gi|145639982|ref|ZP_01795581.1| terminase, ATPase subunit [Haemophilus influenzae PittII]
 gi|145270948|gb|EDK10866.1| terminase, ATPase subunit [Haemophilus influenzae PittII]
 gi|309751635|gb|ADO81619.1| Probable bacteriophage terminase, ATPase subunit [Haemophilus
           influenzae R2866]
          Length = 591

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 55/373 (14%), Positives = 118/373 (31%), Gaps = 68/373 (18%)

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
              K   + +S  ++ A   +DV      I   + + +   +      T   +H      
Sbjct: 200 ASKKQALQFRSYIVNYAKQTADVDLKGETIKLPNGAEL--IFLGTNSATAQSYHGN---- 253

Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247
           +  DE    P  DV+     G   ++   +     S P  ++   Y     + FNK    
Sbjct: 254 LYFDEVFWVPKFDVMRKVASGMAAQKMYRQT--YFSTPTTIAHPAYAFFSGKAFNKNRAK 311

Query: 248 -----------------------WKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283
                                  WK+   I+     G +    + +IA    +       
Sbjct: 312 ADKVEIDISHENLRIGKLCADRQWKQIVTINDAMEGGCNLFNIDDLIAENSKEE--FEQL 369

Query: 284 VCGQFPQQD-------IDSFIPLNIIEEALNREP---CPDPYAPLIMGCDIAEEGGDNT- 332
              QF   +             ++ +EE  + +P    P     + +G D A  G     
Sbjct: 370 FLCQFADDNTSAFKFADLQLCQVDSLEEWHDYKPFYQRPFGNREVWLGYDPAFTGDRAAL 429

Query: 333 -VVVLRR----GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387
            ++   +       + H   +   D     ++I    + Y    I+ID    G+     +
Sbjct: 430 AIIAPPKVEGGDYRVLHWQTFHGMDYEAQASRIKSFCDDYNVTRIVIDKTGMGSGVFQEV 489

Query: 388 EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL-INHSGLIQNLKSLKSFI 446
           +              A+ L++  + + E+ +K  + ++   L  + + +I +  ++K   
Sbjct: 490 K---------KFYPMAIGLDYNADLKNEMVLKTQNLIQKRRLKFDGNEIITSFMTVKK-R 539

Query: 447 VPNTGELAIESKR 459
           +  TG++   S R
Sbjct: 540 ITGTGKITYVSDR 552


>gi|229845311|ref|ZP_04465443.1| terminase, ATPase subunit [Haemophilus influenzae 6P18H1]
 gi|229811764|gb|EEP47461.1| terminase, ATPase subunit [Haemophilus influenzae 6P18H1]
          Length = 593

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 54/374 (14%), Positives = 114/374 (30%), Gaps = 70/374 (18%)

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
              K   + +S  ++ A   +DV      I   + + +   +      T   +H      
Sbjct: 200 ASKKQALQFRSYIVNYAKQTADVDLKGETIKLPNGAEL--IFLGTNSATAQSYHGN---- 253

Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247
           +  DE    P  DV+     G   ++   +     S P  ++   Y     + FN+    
Sbjct: 254 LYFDEVFWVPKFDVMRKVASGMAAQKMYRQT--YFSTPTTIAHPAYAFFSGKAFNRNRAK 311

Query: 248 WKRFQIDT------------------------RTVEGIDPSFHEGIIARYGLDSDVTRVE 283
            ++ +ID                             G +    + +IA    +       
Sbjct: 312 SEKIEIDISHENLKSGKLCADRQWKQIVSIYDAMEGGCNLFNIDDLIAENSKEE--FEQL 369

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV 333
              QF   +  +F   ++    ++       Y P          + +G D A  G    +
Sbjct: 370 FLCQFADDNSSAFKFADLQLCQVDSLEEWHDYKPFYQRPFGNREVWLGYDPAFTGDRAAL 429

Query: 334 VVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387
           V++           + H   +   D  T  ++I    E Y    I+ID    G      +
Sbjct: 430 VIVAPPKVEGGDYRVLHKQTFHGMDYETQASRIKQFCEDYNVTRIVIDKTGMGTGVYQEV 489

Query: 388 EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL---INHSGLIQNLKSLKS 444
                          A  LE+  + + E+ +K  + ++   L      + ++ +  ++K 
Sbjct: 490 R---------KFYPMAQGLEYNADLKNEMVLKTQNLIQKRRLKFDSGDNDIVSSFMTVKK 540

Query: 445 FIVPNTGELAIESK 458
             +  TG++   S 
Sbjct: 541 -RITGTGKITYVSD 553


>gi|301155044|emb|CBW14507.1| terminase, atpase subunit [Haemophilus parainfluenzae T3T1]
          Length = 591

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 53/373 (14%), Positives = 116/373 (31%), Gaps = 68/373 (18%)

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
              K   + +S  ++ A   +DV      I   + + +   +      T   +H      
Sbjct: 200 ASKKQALQFRSYIVNYAKQTADVDLKGETIKLPNGAEL--IFLGTNSATAQSYHGN---- 253

Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247
           +  DE    P  DV+     G   ++   +     S P  ++   Y     + FNK    
Sbjct: 254 LYFDEVFWVPKFDVMRKVASGMAAQKMYRQT--YFSTPTTIAHPAYAFFSGKAFNKNRAK 311

Query: 248 WKRFQIDT------------------------RTVEGIDPSFHEGIIARYGLDSDVTRVE 283
             + +ID                             G +    + +IA    +       
Sbjct: 312 ADKVEIDISHENLKSGKLCADRQWKQIVSIYDAMEGGCNLFNIDDLIAENSKEE--FEQL 369

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNT- 332
              QF   +  +F   ++    ++       Y P          + +G D A  G     
Sbjct: 370 FLCQFADDNSSAFKFADLQLCQVDSLEEWHDYKPFYQRPFGNREVWLGYDPAFTGDRAAL 429

Query: 333 -VVVLRR----GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387
            ++   +       + H   +   D     ++I    + Y    I+ID    G+     +
Sbjct: 430 AIIAPPKVEGGDYRVLHWQTFHGMDYEAQASRIKSFCDDYNVTRIVIDKTGMGSGVFQEV 489

Query: 388 EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL-INHSGLIQNLKSLKSFI 446
           +              A+ L++  + + E+ +K  + ++   L  + + +I +  ++K   
Sbjct: 490 K---------KFYPMAIGLDYNADLKNEMVLKTQNLIQKRRLKFDGNEIITSFMTVKK-R 539

Query: 447 VPNTGELAIESKR 459
           +  TG++   S R
Sbjct: 540 ITGTGKITYVSDR 552


>gi|163735142|ref|ZP_02142578.1| hypothetical protein RLO149_23000 [Roseobacter litoralis Och 149]
 gi|161391600|gb|EDQ15933.1| hypothetical protein RLO149_23000 [Roseobacter litoralis Och 149]
          Length = 267

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 43/265 (16%), Positives = 83/265 (31%), Gaps = 41/265 (15%)

Query: 49  SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108
             P  WQ   M       +   +  + +     + AG+ + K               P  
Sbjct: 28  GPPDPWQRSLMNSTSDVIMVLASRRSGKSTTVGVMAGQELAK---------------PDH 72

Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168
            VI L+ +  Q    L+A+++           F  + ++L        +    L   S  
Sbjct: 73  QVIILSPTLAQ-SQLLFAKIA-----------FTWEKMALPIETRRRTMTELHLKNGS-- 118

Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTS 228
            S +C    ++  +   G+    G+    DEA+  PD +       L+    N   +  +
Sbjct: 119 -SVVCVPAGQD-GEGARGYGVKNGIL-AFDEAAFIPDKVFGA---TLSIAEDNAKTVFIT 172

Query: 229 NPRRLSGKFYEIF--NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCG 286
            P   SGK YE++  +    + +R +  +  +  +             ++ DV      G
Sbjct: 173 TPGGKSGKAYEMWTNHDLYPEVERIRACSLDLPRMAKLVARQRKTLSKMEFDVEH----G 228

Query: 287 QFPQQDIDSFIPLNIIEEALNREPC 311
                    F   + I  A    P 
Sbjct: 229 LQWMGRGTPFFDPDTIRAAYTDTPE 253


>gi|146313136|ref|YP_001178210.1| hypothetical protein Ent638_3501 [Enterobacter sp. 638]
 gi|145320012|gb|ABP62159.1| protein of unknown function DUF264 [Enterobacter sp. 638]
          Length = 589

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 38/229 (16%), Positives = 64/229 (27%), Gaps = 32/229 (13%)

Query: 244 PLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W++   I+     G      + +       +D  R     +F      S  P   +
Sbjct: 328 PDGQWRQIVTIEDALAGGCTLFNLDQLKQE--NSADDFRNLFMCEFVDDKA-SVFPFEEL 384

Query: 303 EEALNREPCPDP-----------YAPLIMGCDIAEEGGDN--TVVV--LRRGPVIEHL-- 345
           +  +                   + P+ +G D +  G      V+   L  G     L  
Sbjct: 385 QRCMVDAMEEWEDFEQFADRPFNWRPVWIGYDPSHTGDSAGCAVLAPPLVAGGKFRILER 444

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405
             W   D       I  L EKY  D I IDA   G      +               A  
Sbjct: 445 HQWKGMDFAAQAEAIRSLTEKYTVDYIGIDATGIGQGVYQLVR---------SFFPAARA 495

Query: 406 LEFCRNRRTELHVKMADWLEFASLINHSGL--IQNLKSLKSFIVPNTGE 452
           + +    +T + +K  D +    L   +G   I          + ++G 
Sbjct: 496 IRYTPEMKTAMVLKAKDTIRRGCLEYDAGATDITQSFMAIRKTMTSSGR 544


>gi|293417393|ref|ZP_06660017.1| terminase [Escherichia coli B185]
 gi|291430913|gb|EFF03909.1| terminase [Escherichia coli B185]
          Length = 590

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRELTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|258544092|ref|ZP_05704326.1| probable terminase (atpase subunit) related protein
           [Cardiobacterium hominis ATCC 15826]
 gi|258520720|gb|EEV89579.1| probable terminase (atpase subunit) related protein
           [Cardiobacterium hominis ATCC 15826]
          Length = 562

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 36/201 (17%), Positives = 62/201 (30%), Gaps = 29/201 (14%)

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--- 307
             I+     G D    E +  ++          +  QF   D DS   +  ++  +    
Sbjct: 306 ITIEDAINSGFDRVTMEKLRIKF--PPGQFENLLMCQFV-NDTDSIFKMAELQRCMVDAW 362

Query: 308 --------REPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD--WSKTDL 353
                     P P   AP+ +G D +    D ++VV+      G V   +    ++  D 
Sbjct: 363 TLWKDYTPLAPRPLDDAPVWIGYDPSRSQDDASLVVIAPPRVEGGVFRIVDKQSFNGLDF 422

Query: 354 RTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRR 413
                KI      Y    I IDA   G    D +              RA  + +    +
Sbjct: 423 DGQAQKIREFCAIYNVANIAIDATGIGQAVYDLVRQ---------FYPRARKIIYTVEAK 473

Query: 414 TELHVKMADWLEFASLINHSG 434
            E+ +K    +    L   +G
Sbjct: 474 NEMVLKAKQLIHHGRLQWDAG 494


>gi|300022629|ref|YP_003755240.1| hypothetical protein Hden_1105 [Hyphomicrobium denitrificans ATCC
           51888]
 gi|299524450|gb|ADJ22919.1| protein of unknown function DUF264 [Hyphomicrobium denitrificans
           ATCC 51888]
          Length = 500

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 71/420 (16%), Positives = 127/420 (30%), Gaps = 68/420 (16%)

Query: 84  AGRGIGKTTLNA-WLVLWLMSTRPGISVIC-------LANSETQLKTTLWAEVSKWLSLL 135
            GRG GKT   A W+        PG             A ++   +  L   V K L+ +
Sbjct: 104 GGRGSGKTRAGAEWIRGLACGEEPGPRSAAGSRNASRRAPTKESPRIAL---VGKTLADV 160

Query: 136 PNKHWFEMQSL-SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
            N        L ++HPA     V   S          +   +S +  +   G       A
Sbjct: 161 RNVMIEGQSGLLAVHPARERP-VFEPSKRRLIWPNGAVAELFSADEAEALRG---PQFTA 216

Query: 195 IINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              DE +     +     +   L   +A R   +T+ PR       ++    + D     
Sbjct: 217 AWCDELAKWRNAEKAWDMLQFALRLGDAPR-ACVTTTPRAT-----KLLKSIIADEATVT 270

Query: 253 IDTRTVEG---IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309
           ++  T +    + P+F   +  RY   S + R E+ G+  +   D     + IEEA  R 
Sbjct: 271 VNLATADNALNLAPTFLAEMTRRY-AGSAIGRQELLGEIVEDASDGLWRRHWIEEA--RV 327

Query: 310 PCPDPYAPLIMGCDI---AEEGGDNTVVV-----LRRGPVIEHLFDWSKTDLRTTNNKIS 361
                   +++  D    A    D   +V     + +   +                   
Sbjct: 328 DAAPEMQRVVVAVDPPVTATAASDACGIVVAGLGVDKRAYVLADRTVQGRTPEIWARAAL 387

Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLEM--LGYHVYRVLGQKRAVDLEFCRNRRTE---- 415
              + Y  D ++ + N  G      L+     + V +V   +           R E    
Sbjct: 388 SAFDDYEADRMVAEVNQGGDLVVSVLQQFRQNFPVVKVRATRGKW-------VRAEPVAA 440

Query: 416 LHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
           L+ +         L     L   + +       + G +          +S D SD L++ 
Sbjct: 441 LYAEGRVA-HVGRL---DALEDQMCT-----FGSDGTVK--------GRSPDRSDALVWA 483


>gi|156564098|ref|YP_001429607.1| terminase large subunit [Bacillus phage 0305phi8-36]
 gi|154622795|gb|ABS83675.1| terminase large subunit [Bacillus phage 0305phi8-36]
          Length = 635

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 33/206 (16%), Positives = 64/206 (31%), Gaps = 22/206 (10%)

Query: 40  EKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVL 99
           E+   L     P+ W  E ++         +     +  +  +  GR +GKT     ++L
Sbjct: 45  EELHYLAILDKPKFWAAETLKWFCRDYQEPMLQEMADSKRTVLRLGRRLGKTETMCIMIL 104

Query: 100 WLMSTRPGIS------VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPW 153
           W   T+P         ++ +A  E Q+   ++  +S+ +             +S    P 
Sbjct: 105 WHAFTQPNKGPNNQYDILIIAPYEEQV-DLIFKRLSQLID------------MSGDVNPS 151

Query: 154 YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILG 213
                H  L   +  +     + S        G        I+ DE     +     I+ 
Sbjct: 152 RDIDKHIELPNGTVIHGITAGSKSGSGAANTRGQRAD---LIVLDEMDYMGESEITNIMN 208

Query: 214 FLTERNANRFWIMTSNPRRLSGKFYE 239
              E       I+ S P      +Y+
Sbjct: 209 IRNEAPERIKMIVASTPSGRRDSYYK 234


>gi|322420465|ref|YP_004199688.1| hypothetical protein GM18_2968 [Geobacter sp. M18]
 gi|320126852|gb|ADW14412.1| hypothetical protein GM18_2968 [Geobacter sp. M18]
          Length = 507

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 62/394 (15%), Positives = 109/394 (27%), Gaps = 69/394 (17%)

Query: 38  WGEKGTPLEGFSAPRSW--QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNA 95
           WG+        S    W  Q+E +     + ++                GR +GK+ + +
Sbjct: 20  WGQAYLYNRDGSGRDYWPHQVEDLRCPAKNIIHLD--------------GRDVGKSIVLS 65

Query: 96  WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYS 155
              L    T  G   +  A  +  L T +  E+   L   P+     M S++L      +
Sbjct: 66  TDALHYAFTTRGGQGLIAAPHQGHLDTII-EEIEFQLDTNPD----LMNSIALTKYGKPN 120

Query: 156 DVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL 215
                   ++  + S +    +    D F   H      +  DE +   +     +   L
Sbjct: 121 IHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLTERAWKALRQCL 177

Query: 216 TERNANRFWIMTSNPRRL-SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARY- 273
                 R +   S P  L    +Y +     D +  F+  +             ++  Y 
Sbjct: 178 KAGGTLRIY---STPNGLRDTTYYRL--TSSDQFHVFRWPSWLNPLWTEDREAELLEFYG 232

Query: 274 GLDSDVTRVEVCGQFPQQDIDSF-----------------IPLNII--------EEALNR 308
           G DS   + EV G+  +    +F                 I +           E A +R
Sbjct: 233 GRDSSGWQHEVAGEHGKPSYGAFNVEQFNLCRQDLLEYQKIVITDSELRDCDTEEAAHDR 292

Query: 309 -----EPCPDPYAPLIMGCDIAEEGG-------DNTVVVLRRGPVIEHLFDWSKTDLRTT 356
                   P      + G D+              T +  R    +              
Sbjct: 293 LEMLLNLTPRSGQFWVGG-DLGYTNDPTEIVVFQETEIGERTLLKMILRVHLEHVSYPHI 351

Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390
              I+ L   Y P  I +D    G      L  L
Sbjct: 352 AQIIALLERYYTPAGIGVDNGGNGLAVVQELLTL 385


>gi|161521371|ref|YP_001584798.1| hypothetical protein Bmul_4835 [Burkholderia multivorans ATCC
           17616]
 gi|189352462|ref|YP_001948089.1| ATPase subunit of bacteriophage terminase [Burkholderia multivorans
           ATCC 17616]
 gi|327198040|ref|YP_004306409.1| gp42 [Burkholderia phage KS5]
 gi|160345421|gb|ABX18506.1| protein of unknown function DUF264 [Burkholderia multivorans ATCC
           17616]
 gi|189336484|dbj|BAG45553.1| ATPase subunit of bacteriophage terminase [Burkholderia multivorans
           ATCC 17616]
 gi|310657174|gb|ADP02289.1| gp42 [Burkholderia phage KS5]
          Length = 588

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 27/138 (19%), Positives = 44/138 (31%), Gaps = 20/138 (14%)

Query: 265 FHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-NREPCPDPYAP------ 317
             + +   Y    +     +  QF    + S  PL +++  + +     D + P      
Sbjct: 350 NLDRLRLEY--SPEEYANLLLCQFIDDSL-SVFPLTVLQPCMVDTWEVWDDFKPLYLRPF 406

Query: 318 ----LIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367
               + +G D +  G     VV+    R G     L  F W   D      +I  L  +Y
Sbjct: 407 GDEEVWIGYDPSHTGDSAGCVVIAPPKRPGGKFRVLERFQWHGLDFEAQAAQIEALTRRY 466

Query: 368 RPDAIIIDANNTGARTCD 385
           R   I ID    G     
Sbjct: 467 RVTYIGIDTTGIGQGVYQ 484


>gi|168822445|ref|ZP_02834445.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar Weltevreden str. HI_N05-537]
 gi|205341120|gb|EDZ27884.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar Weltevreden str. HI_N05-537]
          Length = 594

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 25/162 (15%), Positives = 51/162 (31%), Gaps = 20/162 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  + +  RY  D+    +     F     DS    + +
Sbjct: 331 PDGQWRYIITLEDAIAGGFNLASIDKLRNRYNRDT--FNMLYMCVFVDSK-DSVFSFSHV 387

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV------VVLRRGPVIEHLF 346
           E         + +            +  G D A  G  +T       +V      +  +F
Sbjct: 388 ERCCVDPDIWEDHDENLPRPFGNREVWAGYDPARSGDTSTFVIIAPPIVAGEKFRVLRVF 447

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
            W   + +    +I  L  +Y    I ID    G+   + ++
Sbjct: 448 HWQGMNWKWQAAQIKKLFGQYNMTYIGIDITGLGSGVFEDVQ 489


>gi|323943519|gb|EGB39636.1| terminase [Escherichia coli H120]
          Length = 367

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 128 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 184

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 185 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 244

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 245 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 294

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 295 CLEYDVSATDITSSFMAIRKTMTSSGR 321


>gi|78356952|ref|YP_388401.1| hypothetical protein Dde_1909 [Desulfovibrio desulfuricans subsp.
           desulfuricans str. G20]
 gi|78219357|gb|ABB38706.1| hypothetical protein Dde_1909 [Desulfovibrio desulfuricans subsp.
           desulfuricans str. G20]
          Length = 507

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 61/394 (15%), Positives = 108/394 (27%), Gaps = 69/394 (17%)

Query: 38  WGEKGTPLEGFSAPRSW--QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNA 95
           WG+        S    W  Q+E +     + ++                GR +GK+ + +
Sbjct: 20  WGQAYLYNRDGSGRDYWPHQVEDLRCPAKNIIHLD--------------GRDVGKSIVLS 65

Query: 96  WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYS 155
              L    T  G   +  A  +  L T +  E+   L   P+     M S++L       
Sbjct: 66  TDALHYAFTTRGGQGLIAAPHQGHLDTII-EEIEFQLDTNPD----LMNSIALTKYGKPK 120

Query: 156 DVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL 215
                   ++  + S +    +    D F   H      +  DE +   +     +   L
Sbjct: 121 IHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLTERAWKALRQCL 177

Query: 216 TERNANRFWIMTSNPRRL-SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARY- 273
                 R +   S P  L    +Y +     + +  F+  +             ++  Y 
Sbjct: 178 KAGGTLRIY---STPNGLRDTTYYRL--TSSEQFHVFRWPSWLNPLWTEDREAELLEFYG 232

Query: 274 GLDSDVTRVEVCGQFPQQDIDSF-----------------IPLNII--------EEALNR 308
           G DS   + EV G+  +    +F                 I +           E A +R
Sbjct: 233 GRDSSGWQHEVAGEHGKPSYGAFNVEQFNLCRQDLLEYQKIVITDSELRDCDTEEAAHDR 292

Query: 309 -----EPCPDPYAPLIMGCDIAEEGG-------DNTVVVLRRGPVIEHLFDWSKTDLRTT 356
                   P      + G D+              T +  R    +              
Sbjct: 293 LEMLLNLTPRSGQFWVGG-DLGYTNDPTEIVVFQETEIGERTLLKMILRVHLEHVSYPHI 351

Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390
              I+ L   Y P  I +D    G      L  L
Sbjct: 352 AQIIALLERYYTPAGIGVDNGGNGLAVVQELLTL 385


>gi|197251462|ref|YP_002147591.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar Agona str. SL483]
 gi|197215165|gb|ACH52562.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar Agona str. SL483]
 gi|312913681|dbj|BAJ37655.1| hypothetical protein STMDT12_C27120 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. T000240]
          Length = 594

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 25/162 (15%), Positives = 51/162 (31%), Gaps = 20/162 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  + +  RY  D+    +     F     DS    + +
Sbjct: 331 PDGQWRYIITLEDAIAGGFNLASIDKLRNRYNRDT--FNMLYMCVFVDSK-DSVFSFSHV 387

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV------VVLRRGPVIEHLF 346
           E         + +            +  G D A  G  +T       +V      +  +F
Sbjct: 388 ERCCVDPDIWEDHDENLPRPFGNREVWAGYDPARSGDTSTFVIIAPPIVAGEKFRVLRVF 447

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
            W   + +    +I  L  +Y    I ID    G+   + ++
Sbjct: 448 HWQGMNWKWQAAQIKKLFGQYNMTYIGIDITGLGSGVFEDVQ 489


>gi|322832199|ref|YP_004212226.1| terminase, ATPase subunit [Rahnella sp. Y9602]
 gi|321167400|gb|ADW73099.1| terminase, ATPase subunit [Rahnella sp. Y9602]
          Length = 588

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 53/349 (15%), Positives = 104/349 (29%), Gaps = 67/349 (19%)

Query: 195 IINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPL-- 245
           +  DE    P+   +     G  ++ +        S P  L+   Y     E+FNK    
Sbjct: 249 LYVDEIFWIPNFQKLRKVASGMASQEHLRTT--YFSTPSALTHGAYPFWSGELFNKGREN 306

Query: 246 ----------------------DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRV 282
                                   W++   I+     G +    + +      +    R 
Sbjct: 307 PNDRIELDIGHHSLAKGRLCEDGQWRQIVTIEDALAGGCNLFNIDTLKQENSAED--FRN 364

Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIMGCDIAEEGGDN 331
               +F      S  P   ++  +                   Y  + +G D +  G   
Sbjct: 365 LFMCEFVDDQ-TSVFPFAELQRCMVESAEEWQDFSPFAVRPFGYRAVWIGYDPSHTGDSA 423

Query: 332 --TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385
              VV   L  G     L    W   D       I  L ++Y  + I +DA   G     
Sbjct: 424 GCAVVAPPLVDGGKFRVLERHQWKGMDFAAQAKSIEELTKRYCVEYIGVDATGIGQGVFQ 483

Query: 386 YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI---NHSGLIQNLKSL 442
            +               A+++ +    +T++ +K  D +    L    NH  +  +  ++
Sbjct: 484 LVRQ---------FFPAAMEIRYSPETKTKMVLKAKDTITSGRLEYDTNHKDITSSFMAI 534

Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPRSDMDFGRCP 491
           +  +  +      E+ R + A   D +  +M+    N P +  + G+ P
Sbjct: 535 RKTMTASGSRSTYEASRSEEASHADVAWAIMHALL-NEPLTAANGGQSP 582


>gi|154247076|ref|YP_001418034.1| hypothetical protein Xaut_3147 [Xanthobacter autotrophicus Py2]
 gi|154161161|gb|ABS68377.1| protein of unknown function DUF264 [Xanthobacter autotrophicus Py2]
          Length = 416

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 68/415 (16%), Positives = 121/415 (29%), Gaps = 68/415 (16%)

Query: 82  ISAGRGIGKTTLNA-WLVLWLM-----STRPGISVICLANSETQLKTTLWAEVSKWLSLL 135
           +  GRG GKT   A W+    +     + RP   +  +A +   ++  +   VS  L++ 
Sbjct: 31  VLGGRGAGKTRAGAEWVRGLALGRPPFAGRPVGRIALVAETMADVREVMVEGVSGLLAVH 90

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
           P       +                           + + +S E P++  G       A 
Sbjct: 91  PRAERPRWEPTR---------------RRLEWANGAVAQGFSAEDPESLRG---PQFAAA 132

Query: 196 INDEASGTPDVINLGILGFLTERNANRFW-----IMTSNPRRLSGKFYEIFNKPLDDWKR 250
             DE +                             M +   R +     +   P     R
Sbjct: 133 WLDELAK-----WKRAEATFDMLQFGLRLGAQPRQMVTTTPRPTALLRRLLADPSTAVTR 187

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
            +        + PSF   ++ RYG    + R E+ G+  +   D+      +E    RE 
Sbjct: 188 AR-TADNAFHLAPSFLGQVLTRYGGT-RLGRQELDGELIEDRADALFSRPALEAL--REA 243

Query: 311 CPDPYAPLIMGCDI---AEEGGDNTVVV---LRRGPVIEHLFDWSKTDLRTT--NNKISG 362
              P   +++  D    +  G D   +V   +    V+  L D S   LR      K   
Sbjct: 244 QVPPLTRIVVAVDPPASSRAGADACGIVCAGMDATGVVHVLADDSAAGLRPAQWAAKAVA 303

Query: 363 LVEKYRPDAIIIDANNTGARTCDYLEM--LGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420
           L  ++  D I+ + N  G      +     G  V +V   +           R E    +
Sbjct: 304 LFRRFEADLIVAEVNQGGEMVRAVIAEVDDGVPVEQVRATRGKF-------LRAEPVAAL 356

Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
            +            L   +           G   + S      +S D  D L++ 
Sbjct: 357 YEQGRVRHAGAFPALEDEMC-----DFGTDG---LSS-----GRSPDRLDALVWA 398


>gi|307315386|ref|ZP_07594955.1| protein of unknown function DUF264 [Escherichia coli W]
 gi|307315408|ref|ZP_07594975.1| protein of unknown function DUF264 [Escherichia coli W]
 gi|306905258|gb|EFN35804.1| protein of unknown function DUF264 [Escherichia coli W]
 gi|306905260|gb|EFN35805.1| protein of unknown function DUF264 [Escherichia coli W]
          Length = 385

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 58/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +      D    +     +F      S  P   ++  +                   
Sbjct: 146 IEQLKRENSADD--FKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 202

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 203 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 262

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 263 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 312

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 313 CLEYDVSATDITSSFMAIRKTMTSSGR 339


>gi|289805729|ref|ZP_06536358.1| putative prophage terminase large subunit [Salmonella enterica
           subsp. enterica serovar Typhi str. AG3]
          Length = 257

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 29/167 (17%), Positives = 58/167 (34%), Gaps = 5/167 (2%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252
            +  +EA    +     +   + +  +  ++    NP  ++   +      P +D    +
Sbjct: 82  VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 139

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D +       G     D  + I L+ IE A++  +  
Sbjct: 140 INYDENPFLSDTMLKVIDAARRRDPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 199

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTN 357
              P     +G D+A+ G D    V R G VI    +W   +     
Sbjct: 200 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKEDELLK 246


>gi|213618708|ref|ZP_03372534.1| putative prophage terminase large subunit [Salmonella enterica
           subsp. enterica serovar Typhi str. E98-2068]
          Length = 282

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 29/163 (17%), Positives = 58/163 (35%), Gaps = 5/163 (3%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK-FYEIFNKPLDDWKRFQ 252
            +  +EA    +     +   + +  +  ++    NP  ++   +      P +D    +
Sbjct: 122 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEDTLIRK 179

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D +       G     D  + I L+ IE A++  +  
Sbjct: 180 INYDENPFLSDTMLKVIDAARRRDPEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKVL 239

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDL 353
              P     +G D+A+ G D    V R G VI    +W   + 
Sbjct: 240 DFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKED 282


>gi|323973818|gb|EGB68992.1| terminase [Escherichia coli TA007]
          Length = 589

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 35/196 (17%), Positives = 55/196 (28%), Gaps = 29/196 (14%)

Query: 276 DSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIMGCDI 324
            +D  R     +F      S  P   ++  +                   + P+ +G D 
Sbjct: 359 SADDFRNLFMCEFVDDKA-SVFPFEELQRCMVDAMEEWEDFEPFADRPFNWRPVWIGYDP 417

Query: 325 AEEGGDN--TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378
           +  G      V+   L  G     L    W   D       I  L EKY  D I IDA  
Sbjct: 418 SHTGDSAGCAVLAPPLVAGGKFRILERHQWKGMDFAAQAEAIRALTEKYTVDYIGIDATG 477

Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGL--I 436
            G      +               A  + +    +T + +K  D +    L   +G   I
Sbjct: 478 IGQGVYQLVR---------SFFPAARAIRYTPEMKTAMVLKAKDTIRRGCLEYDAGATDI 528

Query: 437 QNLKSLKSFIVPNTGE 452
                     + N+G 
Sbjct: 529 TQSFMAIRKTMTNSGR 544


>gi|261340099|ref|ZP_05967957.1| terminase, ATPase subunit [Enterobacter cancerogenus ATCC 35316]
 gi|288318026|gb|EFC56964.1| terminase, ATPase subunit [Enterobacter cancerogenus ATCC 35316]
          Length = 589

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 34/196 (17%), Positives = 55/196 (28%), Gaps = 29/196 (14%)

Query: 276 DSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIMGCDI 324
            +D  R     +F      S  P   ++  +                   + P+ +G D 
Sbjct: 359 SADDFRNLFMCEFVDDKA-SVFPFEELQRCMVDAMEEWEDFEPFADRPFNWRPVWIGYDP 417

Query: 325 AEEGGDN--TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378
           +  G      V+   L  G     L    W   D       I  L EKY  D I IDA  
Sbjct: 418 SHTGDSAGCAVLAPPLVAGGKFRILERHQWKGMDFAAQAEAIRALTEKYTVDYIGIDATG 477

Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGL--I 436
            G      +               A  + +    +T + +K  D +    L   +G   I
Sbjct: 478 IGQGVYQLVR---------SFFPAARAIRYTPEMKTAMVLKAKDTIRRGCLEYDAGATDI 528

Query: 437 QNLKSLKSFIVPNTGE 452
                     + ++G 
Sbjct: 529 TQSFMAIRKTMTSSGR 544


>gi|218558996|ref|YP_002391909.1| Terminase, ATPase subunit (GpP) [Escherichia coli S88]
 gi|218365765|emb|CAR03503.1| Terminase, ATPase subunit (GpP) [Escherichia coli S88]
          Length = 600

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 361 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 417

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 418 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 477

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 478 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 527

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 528 CLEYDVSATDITSSFMAIRKTMTSSGR 554


>gi|212709268|ref|ZP_03317396.1| hypothetical protein PROVALCAL_00303 [Providencia alcalifaciens DSM
           30120]
 gi|212688180|gb|EEB47708.1| hypothetical protein PROVALCAL_00303 [Providencia alcalifaciens DSM
           30120]
          Length = 585

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 38/165 (23%), Positives = 58/165 (35%), Gaps = 24/165 (14%)

Query: 246 DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
             W++   ++     G D    E +   Y    D     +  +F   DI S   L ++++
Sbjct: 324 GQWRQIVTVEDAIRGGCDLFEIEQLSLEY--SPDEFENLLMCEFVD-DIASIFNLQLMQK 380

Query: 305 ALNRE-----------PCPDPYAPLIMGCDIAE--EGGDN--TVVV---LRRGPVIEHLF 346
            +                P  Y P+ +G D A+  + GD+   VVV   LR G     L 
Sbjct: 381 CMVDSWEVWNDVQPLMVRPYAYHPVWIGYDPAKGTQNGDSAGCVVVAPPLRAGDKFRILE 440

Query: 347 D--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
              W   D R   N I  L E+Y    I ID+   G      +  
Sbjct: 441 HHQWRGMDFRAQANAIKELTERYNVQYIGIDSTGIGHGVLQNVRD 485


>gi|194444881|ref|YP_002043300.1| hypothetical protein SNSL254_A4364 [Salmonella enterica subsp.
           enterica serovar Newport str. SL254]
 gi|194403544|gb|ACF63766.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar Newport str. SL254]
          Length = 589

 Score = 59.0 bits (141), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 37/230 (16%), Positives = 71/230 (30%), Gaps = 34/230 (14%)

Query: 244 PLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W++   I+    +G      + +     +D    R     +F      S  P   +
Sbjct: 328 PDGQWRQIVTIEDALAKGCTLFNIDTLKRENSVDE--FRNLFMCEFVDDKA-SVFPFEEL 384

Query: 303 EEALNR-----------EPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL-- 345
           +  +                P  + P+ +G D +  G     VV+      G     L  
Sbjct: 385 QRCMVDSLEKWEDYAPFADRPFGHRPVWIGYDPSLRGDSAGCVVIAPPVVAGGKFRILER 444

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405
             W   D       I  L +KY  + I IDA   G      +               A +
Sbjct: 445 HQWKGMDFAQQAESIRELTQKYTVEYIGIDATGLGQGVFQLVR---------SFYPAARE 495

Query: 406 LEFCRNRRTELHVKMADWLEFASLINH---SGLIQNLKSLKSFIVPNTGE 452
           + +    +T + +K  D +    L      + + Q+  S++   + ++G 
Sbjct: 496 IRYTPEMKTAMVLKAKDTIRRGCLEYDVSATDITQSFMSIRK-TMTSSGR 544


>gi|154248423|ref|YP_001419381.1| hypothetical protein Xaut_4503 [Xanthobacter autotrophicus Py2]
 gi|154162508|gb|ABS69724.1| protein of unknown function DUF264 [Xanthobacter autotrophicus Py2]
          Length = 457

 Score = 58.6 bits (140), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 42/240 (17%), Positives = 68/240 (28%), Gaps = 20/240 (8%)

Query: 175 TYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRR 232
           T     PDT  G        +  DE +   D   I   +   +++         TS P  
Sbjct: 113 TALPANPDTARGFSAN----VFLDEFAIHKDSKAIWGALFPVISKNGLRLRV--TSTPNG 166

Query: 233 LSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD 292
              KFYEI     + W R  +D               +     D D+   E   ++  + 
Sbjct: 167 KGNKFYEIMTAADEVWSRHVVDIYQAVADGLPRDIDELRAGLADDDLWAQEYELKWLDEA 226

Query: 293 ID----SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEH 344
                   I     E A   +P         +G DI     D  V+ +            
Sbjct: 227 SAWLSYDLISSCEDERA--GDPALYQGGVCFVGRDIGRRQ-DLHVIWVWEQVGDVLWERE 283

Query: 345 LFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTC-DYLEMLGYHVYRVLGQKRA 403
             +  +      ++    ++ +YR     ID    G +   D     G  V  VL    +
Sbjct: 284 RIEQKRATFAEMDDAFDDIMVRYRVGRACIDQTGMGEKVVEDAQRRWGSRVEGVLFTGPS 343


>gi|225220117|ref|YP_002720084.1| phage terminase large subunit [Enterobacteria phage SSL-2009a]
 gi|224986058|gb|ACN74622.1| phage terminase large subunit [Enterobacteria phage SSL-2009a]
          Length = 461

 Score = 58.6 bits (140), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 66/336 (19%), Positives = 117/336 (34%), Gaps = 48/336 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           +G G GK+ + A  V+ L++  PG   I    +   L   ++ E+ K       +  F  
Sbjct: 58  SGFGGGKSWVAARKVIQLLTLNPGYDGIVTEPTIPLLVKIMYPELEKAFDEAGFRWKFNK 117

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
           Q           D ++  L +  K    +C   S E     +G +  +   I+ DE   T
Sbjct: 118 Q-----------DKIYNVL-VKGKWTRVICE--SMENYTRLIGVNAAW---IVADEFDTT 160

Query: 204 PDVINLGILGFLTER---NANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR-FQIDTRTVE 259
              + +     L  R      R +++ S P       Y+IF       KR  +  T    
Sbjct: 161 KQDVAMAAYHKLLGRLRAGFVRQFVIVSTPEGYRAM-YQIFEVEKGSQKRLIRAKTTDNH 219

Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI 319
            +   F + + ++Y   +++    + G F      +   +   E   + E    P   LI
Sbjct: 220 HLPADFIDTLRSQY--PANLIDAYLNGLFVNLTSGAVYKMFNREGNASTEE-VHPDDTLI 276

Query: 320 MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK--------TDLRTTNNKISGLVEKYR--- 368
           +G D         VV +RR   I    ++           DL  T   I  + E+Y    
Sbjct: 277 IGMDFNVTKM-AAVVYVRR-QRITENKEFRDEIHAVDEFVDLFDTPAMIEAIEERYPEHC 334

Query: 369 -PDAIII--DANN-----TGARTCD--YLEMLGYHV 394
               +++  D++        A + D   LE  G+ V
Sbjct: 335 AAGRVVVYPDSSGKSRKTVNASSSDIAQLEDAGFEV 370


>gi|326783087|ref|YP_004323484.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage
           P-HM2]
 gi|310005505|gb|ADO99893.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage
           P-HM2]
          Length = 560

 Score = 58.6 bits (140), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 72/414 (17%), Positives = 135/414 (32%), Gaps = 65/414 (15%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            +Q E +E    H  N    P               GK+T     +L  +     ++V  
Sbjct: 60  DFQQELIESFHEHRFNIAKLPRQ------------TGKSTTCVSYLLHYILFNDNVNVGI 107

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
           LAN  +  +  L    S+          +  Q + ++      +     L   SK     
Sbjct: 108 LANKLSTARDLL----SRLQLAYEQLPLWIQQGIVVY------NKGSMELENGSK-ILAA 156

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI----NLGILGFLTERNANRFWIMTS 228
             + S  R  +F          I  DE +  P+ I       +   +T    +   I+ S
Sbjct: 157 STSASAVRGMSFN--------IIFLDEFAFIPNHIAEQFFSSVYPTITS-GTSTKVIIIS 207

Query: 229 NPRRLSGKFYEIF---NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC 285
            P  ++  FY+++    K  + +   ++    V G D  + E  IA           E  
Sbjct: 208 TPNGMN-HFYKLWVDAQKGRNGYAWNEVHWSKVPGRDAKWKEQTIANTSERQ--FTQEFD 264

Query: 286 GQFPQQDIDSFIPLNIIEE-----------ALNREPCPDPYAPLIMGCDIAEE--GGDNT 332
            +F    +D+ I  + +             +L+    P      I+  D++       + 
Sbjct: 265 CEFL-GSVDTLITASKLRVLTYDDVMTTNGSLDIYEKPIDKHEYIITVDVSRGLAQDYSA 323

Query: 333 VVVLRRGPVIEHLF-DWSKTDLRTT--NNKISGLVEKYRPDAIIIDANNTGARTCD---- 385
            VV+        L   +   D+R     N I  +   Y    ++ + N+ G         
Sbjct: 324 FVVIDITHAPWRLVAKYRDKDVRPMLFPNIIFNVATNYNKAYVLTEVNDIGEAVAGSLFY 383

Query: 386 YLEMLGYHVYRVLG-QKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQN 438
            LE     +  + G   + V   F  N+ T++ VKM+  ++     N   LI++
Sbjct: 384 DLEYENTLMCAMRGRAGQIVGQGFSGNK-TQMGVKMSKTVKAQGCSNLKTLIED 436


>gi|322662586|gb|EFY58794.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 81038-01]
          Length = 280

 Score = 58.6 bits (140), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 26/143 (18%), Positives = 49/143 (34%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +           +    P  
Sbjct: 40  LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWTDFQSLALRPFG 96

Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+    G     VV+      G     L    W   D R   + I  L +
Sbjct: 97  WREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 156

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 157 QYNVTYIGIDSTGVGHGVYENVK 179


>gi|323183894|gb|EFZ69285.1| terminase, ATPase subunit [Escherichia coli 1357]
          Length = 590

 Score = 58.6 bits (140), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|320180747|gb|EFW55673.1| Phage terminase, ATPase subunit [Shigella boydii ATCC 9905]
 gi|323167352|gb|EFZ53060.1| terminase, ATPase subunit [Shigella sonnei 53G]
          Length = 590

 Score = 58.6 bits (140), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 60/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F    + S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKV-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|117623093|ref|YP_852006.1| putative phage terminase [Escherichia coli APEC O1]
 gi|117624286|ref|YP_853199.1| Phage protein P [Escherichia coli APEC O1]
 gi|115512217|gb|ABJ00292.1| putative phage terminase [Escherichia coli APEC O1]
 gi|115513410|gb|ABJ01485.1| Phage protein P [Escherichia coli APEC O1]
          Length = 590

 Score = 58.6 bits (140), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|331675382|ref|ZP_08376132.1| terminase, ATPase subunit (GpP) [Escherichia coli TA280]
 gi|331067442|gb|EGI38847.1| terminase, ATPase subunit (GpP) [Escherichia coli TA280]
          Length = 590

 Score = 58.6 bits (140), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSTDDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|330967816|gb|EGH68076.1| hypothetical protein PSYAC_24858 [Pseudomonas syringae pv.
           actinidiae str. M302091]
          Length = 774

 Score = 58.6 bits (140), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 51/163 (31%), Gaps = 20/163 (12%)

Query: 244 PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W++   I      G D    E +   Y  D D  +     +F      +F  L  +
Sbjct: 343 PDGQWRKVITILDAISGGCDLFDLEQLQLEY--DEDKFQQLFMCKFIDSSQSAF-SLADL 399

Query: 303 EEALNR----------EPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHLFD- 347
           E   +           +P     +P+ +G D +    D T VV    L  G     L   
Sbjct: 400 ERCYSDLSLWADFDPDDPRLYGNSPVWIGYDPSRTRDDATCVVIAPPLENGGKFRILEKH 459

Query: 348 -WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
            W     +    ++  L E++    I ID    G    D +  
Sbjct: 460 SWRGQSFKYQAEQVKKLTERFNVQHIGIDTTGIGYGVFDLVRD 502


>gi|294634584|ref|ZP_06713119.1| terminase, ATPase subunit [Edwardsiella tarda ATCC 23685]
 gi|291092098|gb|EFE24659.1| terminase, ATPase subunit [Edwardsiella tarda ATCC 23685]
          Length = 588

 Score = 58.6 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 33/134 (24%), Positives = 44/134 (32%), Gaps = 17/134 (12%)

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE----ALNREPCPDPYA------PLIMG 321
           +    +D  +     +F      S  P   ++     AL      +PYA      P+ +G
Sbjct: 355 KRENSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDALEAWTDVNPYADHPFDRPVWIG 413

Query: 322 CDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIID 375
            D +  G     VVL      G     L    W   D  T    I  L EKYR D I ID
Sbjct: 414 YDPSHTGDSAGCVVLAPPAVPGGKFRMLERHQWKGMDFSTQAEAIRALTEKYRVDYIGID 473

Query: 376 ANNTGARTCDYLEM 389
           A   G      +  
Sbjct: 474 ATGIGQGVFQLVRE 487


>gi|323936689|gb|EGB32974.1| terminase [Escherichia coli E1520]
          Length = 590

 Score = 58.6 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFATNPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|210062534|gb|ACJ06274.1| probable terminase subunit [Photorhabdus luminescens]
          Length = 585

 Score = 58.6 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 31/144 (21%), Positives = 53/144 (36%), Gaps = 23/144 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE-----------PCPDP 314
            + +   Y    D  +  +  +F   DI+S   L +++  +                P  
Sbjct: 345 IDQLRLEY--SPDEYQNLLMCEF-MDDIESIFSLQLMQGCMVDSWEIWDDVQPLMLRPYG 401

Query: 315 YAPLIMGCDIAEEG--GDNT---VVVLRR--GPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           Y P+ +G D A+ G  GD+    VV   R  G     L    W   + R  ++ I  L E
Sbjct: 402 YHPVWIGYDPAKGGENGDSAGCVVVAPPRVPGDKFRILERHQWRGMNFRAQSDAIKRLTE 461

Query: 366 KYRPDAIIIDANNTGARTCDYLEM 389
           +Y  + I ID+   G      ++ 
Sbjct: 462 QYNVEYIGIDSTGVGHGVYQNVKE 485


>gi|324113792|gb|EGC07767.1| terminase [Escherichia fergusonii B253]
          Length = 590

 Score = 58.6 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFATNPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|332088966|gb|EGI94078.1| terminase, ATPase subunit [Shigella boydii 5216-82]
          Length = 590

 Score = 58.6 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|323961666|gb|EGB57270.1| terminase [Escherichia coli H489]
          Length = 590

 Score = 58.6 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|294494147|gb|ADE92903.1| terminase, ATPase subunit [Escherichia coli IHE3034]
 gi|323951869|gb|EGB47743.1| terminase [Escherichia coli H252]
          Length = 590

 Score = 58.6 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|254039145|ref|ZP_04873195.1| terminase [Escherichia sp. 1_1_43]
 gi|226838581|gb|EEH70610.1| terminase [Escherichia sp. 1_1_43]
          Length = 590

 Score = 58.6 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|30065706|ref|NP_839851.1| gpP [Yersinia phage L-413C]
 gi|300947250|ref|ZP_07161455.1| conserved hypothetical protein [Escherichia coli MS 116-1]
 gi|301022960|ref|ZP_07186775.1| conserved hypothetical protein [Escherichia coli MS 69-1]
 gi|331678021|ref|ZP_08378696.1| terminase, ATPase subunit (GpP) [Escherichia coli H591]
 gi|30025900|gb|AAP04439.1| gpP [Yersinia phage L-413C]
 gi|33413700|gb|AAN28220.1| gpP [Enterobacteria phage WPhi]
 gi|300397301|gb|EFJ80839.1| conserved hypothetical protein [Escherichia coli MS 69-1]
 gi|300453115|gb|EFK16735.1| conserved hypothetical protein [Escherichia coli MS 116-1]
 gi|315061386|gb|ADT75713.1| terminase, ATPase subunit [Escherichia coli W]
 gi|315063221|gb|ADT77548.1| phage large terminase subunit [Escherichia coli W]
 gi|323380714|gb|ADX52982.1| phage large terminase subunit GpP [Escherichia coli KO11]
 gi|325499372|gb|EGC97231.1| Terminase, ATPase subunit (GpP) [Escherichia fergusonii ECD227]
 gi|331074481|gb|EGI45801.1| terminase, ATPase subunit (GpP) [Escherichia coli H591]
          Length = 590

 Score = 58.6 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|323378035|gb|ADX50303.1| phage large terminase subunit GpP [Escherichia coli KO11]
          Length = 589

 Score = 58.6 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 350 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 406

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 407 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 466

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 467 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 516

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 517 CLEYDVSATDITSSFMAIRKTMTSSGR 543


>gi|315296184|gb|EFU55492.1| conserved hypothetical protein [Escherichia coli MS 16-3]
          Length = 590

 Score = 58.6 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|213646682|ref|ZP_03376735.1| Phage protein P [Salmonella enterica subsp. enterica serovar Typhi
           str. J185]
          Length = 590

 Score = 58.6 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|9630329|ref|NP_046758.1| gpP [Enterobacteria phage P2]
 gi|168789033|ref|ZP_02814040.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC869]
 gi|188492656|ref|ZP_02999926.1| phage large terminase subunit GpP [Escherichia coli 53638]
 gi|261225041|ref|ZP_05939322.1| Terminase, ATPase subunit (GpP) [Escherichia coli O157:H7 str.
           FRIK2000]
 gi|261257612|ref|ZP_05950145.1| Terminase, ATPase subunit (GpP) [Escherichia coli O157:H7 str.
           FRIK966]
 gi|301048706|ref|ZP_07195715.1| conserved hypothetical protein [Escherichia coli MS 185-1]
 gi|139354|sp|P25479|VPP_BPP2 RecName: Full=Terminase, ATPase subunit; AltName: Full=GpP
 gi|3139088|gb|AAD03269.1| gpP [Enterobacteria phage P2]
 gi|188487855|gb|EDU62958.1| phage large terminase subunit GpP [Escherichia coli 53638]
 gi|189371250|gb|EDU89666.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC869]
 gi|300299452|gb|EFJ55837.1| conserved hypothetical protein [Escherichia coli MS 185-1]
 gi|324020535|gb|EGB89754.1| hypothetical protein HMPREF9542_00768 [Escherichia coli MS 117-3]
          Length = 590

 Score = 58.6 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|320196848|gb|EFW71470.1| Phage terminase, ATPase subunit [Escherichia coli WV_060327]
          Length = 590

 Score = 58.6 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|170683976|ref|YP_001746268.1| phage large terminase subunit GpP [Escherichia coli SMS-3-5]
 gi|170521694|gb|ACB19872.1| phage large terminase subunit GpP [Escherichia coli SMS-3-5]
          Length = 590

 Score = 58.6 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|170769222|ref|ZP_02903675.1| phage large terminase subunit GpP [Escherichia albertii TW07627]
 gi|170121874|gb|EDS90805.1| phage large terminase subunit GpP [Escherichia albertii TW07627]
          Length = 590

 Score = 58.6 bits (140), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 33/184 (17%), Positives = 51/184 (27%), Gaps = 29/184 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRGC 518

Query: 429 LINH 432
           L   
Sbjct: 519 LEYD 522


>gi|18466735|ref|NP_569542.1| hypothetical protein HCM2.0070c [Salmonella enterica subsp.
           enterica serovar Typhi str. CT18]
 gi|16506051|emb|CAD09937.1| hypothetical protein [Salmonella enterica subsp. enterica serovar
           Typhi str. CT18]
          Length = 418

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 50/335 (14%), Positives = 106/335 (31%), Gaps = 46/335 (13%)

Query: 59  MEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET 118
           + +V  H        +P  FK  + AGR  GK+ L+   ++   +      V  +A +  
Sbjct: 7   LSLVQLHSGQMQVFQSPHRFKV-VCAGRRWGKSRLSISTIIRAAAKEKKQRVWYVAPTYQ 65

Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178
             +  LW ++ + L                 P  W       ++ I  K+ S +      
Sbjct: 66  MARQILWDDLQEVL-----------------PRKWVRKKNDTTMTIVLKNGSEIALK-GA 107

Query: 179 ERPDTFVGHHNTYGMAIINDEASGT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
           ++PDT  G        ++ DE     PD     +   L+        ++   P+    +F
Sbjct: 108 DKPDTLRGV---ALHFVVLDEFQDMKPDTWYKVLRPTLSS--TRGGALIIGTPKG-FSEF 161

Query: 238 YEIFN-------KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290
           ++++        +    WK +Q  T     +  +  E       +D      E    F  
Sbjct: 162 HKLWTIGQNKDLQRKGQWKSWQFVTADSPFVPSAEIEAAKND--MDPKSFAQEYLASFEN 219

Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK 350
                + P +       +    +P  P+ +G D      D    V+ +      L+   +
Sbjct: 220 MSGRVYYPFD--RNVHVKPLQFNPKLPIWVGQD---FNIDPMSSVILQPQPNGELWAVDE 274

Query: 351 -----TDLRTTNNKISGLVEKYRPD-AIIIDANNT 379
                ++     +++     +++    I  D    
Sbjct: 275 VVLFSSNTAEVCDELERRFWRWKSQVTIFPDPAGA 309


>gi|16082806|ref|NP_395360.1| hypothetical protein YPMT1.24c [Yersinia pestis CO92]
 gi|31795361|ref|NP_857813.1| hypothetical protein Y1030 [Yersinia pestis KIM]
 gi|40787951|ref|NP_857660.2| hypothetical protein YPKMT021 [Yersinia pestis KIM]
 gi|45478613|ref|NP_995469.1| hypothetical protein YP_pMT025 [Yersinia pestis biovar Microtus
           str. 91001]
 gi|52788073|ref|YP_093901.1| hypothetical protein pG8786_021 [Yersinia pestis]
 gi|108793557|ref|YP_636707.1| hypothetical protein YPA_MT0025 [Yersinia pestis Antiqua]
 gi|108793757|ref|YP_636595.1| hypothetical protein YPN_MT0025 [Yersinia pestis Nepal516]
 gi|145597216|ref|YP_001154679.1| hypothetical protein YPDSF_4052 [Yersinia pestis Pestoides F]
 gi|149192775|ref|YP_001294006.1| hypothetical protein YPE_4292 [Yersinia pestis CA88-4125]
 gi|162417876|ref|YP_001604588.1| hypothetical protein YpAngola_0076 [Yersinia pestis Angola]
 gi|165939469|ref|ZP_02228016.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis
           str. IP275]
 gi|166214433|ref|ZP_02240468.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str.
           B42003004]
 gi|167402343|ref|ZP_02307808.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str.
           UG05-0454]
 gi|167422791|ref|ZP_02314544.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis
           str. MG05-1020]
 gi|167466683|ref|ZP_02331387.1| hypothetical protein YpesF_02065 [Yersinia pestis FV-1]
 gi|229896952|ref|ZP_04512111.1| hypothetical protein YPS_4795 [Yersinia pestis Pestoides A]
 gi|229897756|ref|ZP_04512911.1| hypothetical protein YPH_4790 [Yersinia pestis biovar Orientalis
           str. PEXU2]
 gi|229900293|ref|ZP_04515428.1| hypothetical protein YPF_4819 [Yersinia pestis biovar Orientalis
           str. India 195]
 gi|229904817|ref|ZP_04519927.1| hypothetical protein YP516_4657 [Yersinia pestis Nepal516]
 gi|270491004|ref|ZP_06208077.1| phage terminase, large subunit, PBSX family [Yersinia pestis KIM
           D27]
 gi|294502015|ref|YP_003565752.1| hypothetical protein YPZ3_pMT0023 [Yersinia pestis Z176003]
 gi|3883031|gb|AAC82691.1| unknown [Yersinia pestis KIM 10]
 gi|5834709|emb|CAB55206.1| hypothetical protein YPMT1.24c [Yersinia pestis CO92]
 gi|45357266|gb|AAS58660.1| hypothetical protein YP_pMT025 [Yersinia pestis biovar Microtus
           str. 91001]
 gi|52538002|emb|CAG27427.1| hypothetical protein [Yersinia pestis]
 gi|108777821|gb|ABG20339.1| hypothetical protein YPN_MT0025 [Yersinia pestis Nepal516]
 gi|108782104|gb|ABG16161.1| hypothetical protein YPA_MT0025 [Yersinia pestis Antiqua]
 gi|145212984|gb|ABP42389.1| hypothetical protein YPDSF_4052 [Yersinia pestis Pestoides F]
 gi|148872433|gb|ABR14922.1| hypothetical protein YPMT1.24c [Yersinia pestis CA88-4125]
 gi|162350848|gb|ABX84797.1| conserved hypothetical protein [Yersinia pestis Angola]
 gi|165912657|gb|EDR31287.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis
           str. IP275]
 gi|166204381|gb|EDR48861.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str.
           B42003004]
 gi|166958284|gb|EDR55305.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis
           str. MG05-1020]
 gi|167048235|gb|EDR59643.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str.
           UG05-0454]
 gi|229678132|gb|EEO74238.1| hypothetical protein YP516_4657 [Yersinia pestis Nepal516]
 gi|229686652|gb|EEO78733.1| hypothetical protein YPF_4819 [Yersinia pestis biovar Orientalis
           str. India 195]
 gi|229693337|gb|EEO83387.1| hypothetical protein YPH_4790 [Yersinia pestis biovar Orientalis
           str. PEXU2]
 gi|229699988|gb|EEO88028.1| hypothetical protein YPS_4795 [Yersinia pestis Pestoides A]
 gi|262363909|gb|ACY60628.1| hypothetical protein YPD4_pMT0023 [Yersinia pestis D106004]
 gi|262364065|gb|ACY64401.1| hypothetical protein YPD8_pMT0023 [Yersinia pestis D182038]
 gi|270334985|gb|EFA45763.1| phage terminase, large subunit, PBSX family [Yersinia pestis KIM
           D27]
 gi|294352486|gb|ADE66542.1| hypothetical protein YPZ3_pMT0023 [Yersinia pestis Z176003]
 gi|320017547|gb|ADW01117.1| hypothetical protein YPC_4788 [Yersinia pestis biovar Medievalis
           str. Harbin 35]
          Length = 418

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 50/335 (14%), Positives = 106/335 (31%), Gaps = 46/335 (13%)

Query: 59  MEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET 118
           + +V  H        +P  FK  + AGR  GK+ L+   ++   +      V  +A +  
Sbjct: 7   LSLVQLHSGQMQVFQSPHRFKV-VCAGRRWGKSRLSISTIIRAAAKEKKQRVWYVAPTYQ 65

Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178
             +  LW ++ + L                 P  W       ++ I  K+ S +      
Sbjct: 66  MARQILWDDLQEVL-----------------PRKWVRKKNDTTMTIVLKNGSEIALK-GA 107

Query: 179 ERPDTFVGHHNTYGMAIINDEASGT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
           ++PDT  G        ++ DE     PD     +   L+        ++   P+    +F
Sbjct: 108 DKPDTLRGV---ALHFVVLDEFQDMKPDTWYKVLRPTLSS--TRGGALIIGTPKG-FSEF 161

Query: 238 YEIFN-------KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290
           ++++        +    WK +Q  T     +  +  E       +D      E    F  
Sbjct: 162 HKLWTIGQNKDLQRKGQWKSWQFVTADSPFVPSAEIEAAKND--MDPKSFAQEYLASFEN 219

Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK 350
                + P +       +    +P  P+ +G D      D    V+ +      L+   +
Sbjct: 220 MSGRVYYPFD--RNVHVKPLQFNPKLPIWVGQD---FNIDPMSSVILQPQPNGELWAVDE 274

Query: 351 -----TDLRTTNNKISGLVEKYRPD-AIIIDANNT 379
                ++     +++     +++    I  D    
Sbjct: 275 VVLFSSNTAEVCDELERRFWRWKSQVTIFPDPAGA 309


>gi|324115403|gb|EGC09352.1| terminase [Escherichia coli E1167]
          Length = 572

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 357 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 413

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 414 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 473

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 474 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 523

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 524 CLEYDVSATDITSSFMAIRKTMTSSGR 550


>gi|302343251|ref|YP_003807780.1| hypothetical protein Deba_1821 [Desulfarculus baarsii DSM 2075]
 gi|301639864|gb|ADK85186.1| conserved hypothetical protein [Desulfarculus baarsii DSM 2075]
          Length = 507

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 63/394 (15%), Positives = 109/394 (27%), Gaps = 69/394 (17%)

Query: 38  WGEKGTPLEGFSAPRSW--QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNA 95
           WG+        S    W  Q+E +     + ++                GR +GK+ + +
Sbjct: 20  WGQAYLYNRDGSGRDYWPHQVEDLRCPAKNIIHLD--------------GRDVGKSIVLS 65

Query: 96  WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYS 155
              L    T  G   +  A  +  L T +  E+   L   P+     M S++L       
Sbjct: 66  TDALHYAFTTRGGQGLIAAPHQGHLDTII-EEIEFQLDTNPD----LMNSIALTKYGKPK 120

Query: 156 DVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL 215
                   ++  + S +    +    D F   H      +  DE +   +     +   L
Sbjct: 121 IHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLTERAWKALRQCL 177

Query: 216 TERNANRFWIMTSNPRRL-SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARY- 273
                 R +   S P  L    +Y +     D +  F+  +             ++  Y 
Sbjct: 178 KAGGTLRIY---STPNGLRDTTYYRL--TSSDQFHVFRWPSWLNPLWTEDREAELLEFYG 232

Query: 274 GLDSDVTRVEVCGQFPQQDIDSF-----------------IPLNII--------EEALNR 308
           G DS   + EV G+  +    +F                 I +           E A +R
Sbjct: 233 GRDSSGWQHEVAGEHGKPSYGAFNVEQFNLCRQDLLEYQKIVITDSEMRDCDTEEAAHDR 292

Query: 309 -----EPCPDPYAPLIMGCDIAEEGGDNTVVVL-------RRGPVIEHLFDWSKTDLRTT 356
                   P      + G D+        +VV        R    +              
Sbjct: 293 LEMLLNLTPRSGQFWVGG-DLGYTNDPTEIVVFQEMEIGERTLLKMILRVHLEHVSYPHI 351

Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390
              I+ L   Y P  I +D    G      L  L
Sbjct: 352 AQIIALLERYYTPAGIGVDNGGNGLAVVQELLTL 385


>gi|300715671|ref|YP_003740474.1| Terminase, ATPase [Erwinia billingiae Eb661]
 gi|299061507|emb|CAX58621.1| Terminase, ATPase subunit [Erwinia billingiae Eb661]
          Length = 588

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 26/144 (18%), Positives = 46/144 (31%), Gaps = 23/144 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y       +  +  +F   D+ S  PL  ++  +                P  
Sbjct: 348 IDQLRLEY--SPPEYQNLLMCEFID-DLASVFPLADLQACMVDSWEVWQDFEALALRPFG 404

Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+    G     VV+      G     L    W   D R   + I  L  
Sbjct: 405 WREVWIGYDPAKGTQHGDSAGCVVIAPPSVPGGKFRILERHQWRGMDFRAQADAIKELTR 464

Query: 366 KYRPDAIIIDANNTGARTCDYLEM 389
           +Y    I ID+   G    + ++M
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVKM 488


>gi|221633560|ref|YP_002522786.1| hypothetical protein trd_1584 [Thermomicrobium roseum DSM 5159]
 gi|221155562|gb|ACM04689.1| conserved hypothetical protein [Thermomicrobium roseum DSM 5159]
          Length = 489

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 53/352 (15%), Positives = 96/352 (27%), Gaps = 55/352 (15%)

Query: 89  GKTTLNAWLVLWLMSTRP--GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSL 146
           GK    A  + WL+      G  V+    S                +L   +    + + 
Sbjct: 65  GKDEALAQFLAWLLLRFHRRGGEVVVALPSWR-----------PQGALARERLLAVLAAP 113

Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDV 206
            L        +     G        + R  S        G   T  + ++ +EA      
Sbjct: 114 RLAALLAGLGLAPEVAGARVALGRAVVRYASAGPSANVRGL--TASLLLVANEAQDIAPD 171

Query: 207 INLGILGFLTERNANRFWIMTSNPRRLSG------KFYEIFNKPLDDWKRFQIDTRTVEG 260
                   +   +     +    P           ++     +     + +++   TV  
Sbjct: 172 RWDSAFAPMA-ASTGAPALYLGTPWGSDSLLARELRYLTALERQDGQQRVWRVPWTTVAA 230

Query: 261 IDPSF---HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC---PDP 314
             P++       +A+ G      R E  G           P   +       P    P P
Sbjct: 231 ELPAYGDHVRERMAQLGAGHPFVRTEY-GLEELAGEGRLFPPERLALVRGDHPALLAPRP 289

Query: 315 YAPLIMGCDIAEEGGDN-------------------TVVVLRRGPVIEHLFDWS----KT 351
                +  D+A  G D                    TVV +  G +  +   W       
Sbjct: 290 GERYALTVDVA--GEDEASAGELRDDPGARRDATALTVVRVVPGTLPRYEAVWRARWVGA 347

Query: 352 DLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE-MLGYHVYRVLGQKR 402
                +  +  L   +R + +++DA+  GA    +LE  LG  V RV+   R
Sbjct: 348 RQVRQHEALVQLARAWRAERVVVDASGVGAGLAAFLEHALGERVRRVVFSPR 399


>gi|307826152|ref|ZP_07656363.1| protein of unknown function DUF264 [Methylobacter tundripaludum
           SV96]
 gi|307732791|gb|EFO03657.1| protein of unknown function DUF264 [Methylobacter tundripaludum
           SV96]
          Length = 598

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 44/261 (16%), Positives = 78/261 (29%), Gaps = 52/261 (19%)

Query: 189 NTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIF 241
           + +G   + DE    PD   +     G    +   R     S P  +S + Y     E +
Sbjct: 261 SYHGHLYV-DECFWIPDFDKMWKVASGMAAHKKWRRTL--FSTPSAISHQAYPMWCGEKY 317

Query: 242 NKPLDDWKRFQIDT------------------------RTVEGIDPSFHEGIIARYGLDS 277
           N+   D K+ + D                            +G D    + +   Y  D 
Sbjct: 318 NQGKADDKKAEFDVSHAALKDGLMGADKIWRHMVTVVDAEAQGCDLFDIDELQDEYSKDD 377

Query: 278 DVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE---------PCPDPYAPLIMGCDIAEEG 328
                    +F   D  S   L I+     RE         P P    P+ +G D +   
Sbjct: 378 --FANLFMCKFID-DAKSVFNLGIMMTCYAREDYTDYNDKAPRPYGNRPVAIGYDPSRTR 434

Query: 329 GDNT----VVVLRRG--PVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGAR 382
            + +     + LR G    +    D+   + +   N+I  +V+ +    + ID    G  
Sbjct: 435 DNASLAILAIPLRPGDKWRVLKTMDFHGQNFQYQANRIKEIVDSHNVQHVGIDVTGIGYG 494

Query: 383 TCDYLEMLGYHVYRVLGQKRA 403
             + +E     V  +      
Sbjct: 495 LFELVEQFYRRVTPINYSNET 515


>gi|318064508|gb|ADV36483.1| phage terminase large subunit [Edwardsiella phage eiDWF]
 gi|318064606|gb|ADV36532.1| phage terminase large subunit [Edwardsiella phage eiMSLS]
          Length = 460

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 63/346 (18%), Positives = 113/346 (32%), Gaps = 38/346 (10%)

Query: 44  PLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMS 103
           P++     R+W+++ +     H    +N+   ++      +G G GKT   A   + L  
Sbjct: 27  PVKKERKSRTWRIKTL----PHQRGLINDTTTKILGLC--SGFGGGKTWSAARKAVQLAI 80

Query: 104 TRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLG 163
             PG   I    +   L   ++ E+ K L+    K  F  Q    H              
Sbjct: 81  LNPGCDGIITEPTIPLLVKIMYPELEKALNEAGIKWKFNKQDKIYHC------------R 128

Query: 164 IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG---ILGFLTERNA 220
           I  +    +C   S E     +G +  + +    D     PD+       +LG L   N 
Sbjct: 129 IAGQMTRIICD--SMENYTRLIGVNAAWCVCDEFDTTK--PDIAMEAYRKLLGRLRTGNV 184

Query: 221 NRFWIMTSNPRRLSGKFYEIFNKPLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDV 279
            +  I  S P       Y+IF    DD KR  +  T     +   + + + A+Y    ++
Sbjct: 185 RQMVI-VSTPEGFRAM-YQIFISEADDQKRLIKARTTDNHYLPQDYIDTLRAQY--PPEL 240

Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRG 339
               + G+F      +    N      N +   +    L++G D         V V R  
Sbjct: 241 IEAYLNGEFVNLTGGAVY-RNFSRTLNNCDTVAEDDDTLMIGMDFNVGQMAGAVYVQRIA 299

Query: 340 PVIEHLFDWSKT----DLRTTNNKISGLVEKYRP---DAIIIDANN 378
             +E +    +     D     + I      +       I  D++ 
Sbjct: 300 DGVEEMHLVDEFCGLLDTDAMIDAIKERYPDHHARGLIEIFPDSSG 345


>gi|318064394|gb|ADV36428.1| phage terminase large subunit [Edwardsiella phage eiAU]
          Length = 460

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 63/346 (18%), Positives = 113/346 (32%), Gaps = 38/346 (10%)

Query: 44  PLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMS 103
           P++     R+W+++ +     H    +N+   ++      +G G GKT   A   + L  
Sbjct: 27  PVKKERKSRTWRIKTL----PHQRGLINDTTTKILGLC--SGFGGGKTWSAARKAVQLAI 80

Query: 104 TRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLG 163
             PG   I    +   L   ++ E+ K L+    K  F  Q    H              
Sbjct: 81  LNPGCDGIITEPTIPLLVKIMYPELEKALNEAGIKWKFNKQDKIYHC------------R 128

Query: 164 IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG---ILGFLTERNA 220
           I  +    +C   S E     +G +  + +    D     PD+       +LG L   N 
Sbjct: 129 IAGQMTRIICD--SMENYTRLIGVNAAWCVCDEFDTTK--PDIAMEAYRKLLGRLRTGNV 184

Query: 221 NRFWIMTSNPRRLSGKFYEIFNKPLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSDV 279
            +  I  S P       Y+IF    DD KR  +  T     +   + + + A+Y    ++
Sbjct: 185 RQMVI-VSTPEGFRAM-YQIFISEADDQKRLIKARTTDNHYLPQDYIDTLRAQY--PPEL 240

Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRG 339
               + G+F      +    N      N +   +    L++G D         V V R  
Sbjct: 241 IEAYLNGEFVNLTGGAVY-RNFSRTLNNCDTVAEDDDTLMIGMDFNVGQMAGAVYVQRIA 299

Query: 340 PVIEHLFDWSKT----DLRTTNNKISGLVEKYRP---DAIIIDANN 378
             +E +    +     D     + I      +       I  D++ 
Sbjct: 300 DGVEEMHLVDEFCGLLDTDAMIDAIKERYPDHHARGLIEIFPDSSG 345


>gi|293411885|ref|ZP_06654610.1| predicted protein [Escherichia coli B354]
 gi|220980013|emb|CAP72205.1| Hypothetical protein [Escherichia coli LF82]
 gi|291469440|gb|EFF11929.1| predicted protein [Escherichia coli B354]
 gi|323934319|gb|EGB30739.1| PBSX family protein phage terminase [Escherichia coli E1520]
          Length = 418

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 48/335 (14%), Positives = 105/335 (31%), Gaps = 46/335 (13%)

Query: 59  MEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET 118
           + +V  H        +P  FK  + AGR  GK+ L+   ++   +      V  +A +  
Sbjct: 7   LSLVQLHSGQMKVFQSPHRFKV-VCAGRRWGKSRLSISTIIRAAAKEKKQRVWYVAPTYQ 65

Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178
             +  LW ++ + L                 P  W       ++ I  K+ S +      
Sbjct: 66  MARQILWDDLQEVL-----------------PRKWVRKKNDTTMTIVLKNGSEIALK-GA 107

Query: 179 ERPDTFVGHHNTYGMAIINDEASGT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
           ++PDT  G        ++ DE      D     +   L+        ++   P+    +F
Sbjct: 108 DKPDTLRGV---ALHFVVLDEFQDMKADTWYKVLRPTLSS--TRGGALIIGTPKG-FSEF 161

Query: 238 YEIFN-------KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290
           ++++        +    WK +Q  T     +  +  E       +D      E    F  
Sbjct: 162 HKLWTIGQNVELQRKGQWKSWQFVTADSPFVPTAEIEAAKND--MDPKSFAQEYLASFEN 219

Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK 350
                + P +       +    +P  P+ +G D      D    V+ +      L+   +
Sbjct: 220 MSGRVYYPFD--RNVHVKPLQFNPRLPIWVGQD---FNIDPMSSVILQPQPNGELWAIDE 274

Query: 351 -----TDLRTTNNKISGLVEKYRP-DAIIIDANNT 379
                ++     +++     +++    +  D    
Sbjct: 275 LVLFSSNTAEVCDELERRFWRWKSQITVFPDPAGA 309


>gi|322614428|gb|EFY11359.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 315996572]
 gi|322621507|gb|EFY18360.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 495297-1]
 gi|322624368|gb|EFY21201.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 495297-3]
 gi|322626565|gb|EFY23370.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 495297-4]
 gi|322633573|gb|EFY30315.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 515920-1]
 gi|322638384|gb|EFY35082.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 515920-2]
 gi|322647317|gb|EFY43813.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. NC_MB110209-0054]
 gi|322649287|gb|EFY45724.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. OH_2009072675]
 gi|322655993|gb|EFY52293.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. CASC_09SCPH15965]
 gi|322661388|gb|EFY57613.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 19N]
 gi|322666960|gb|EFY63135.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. MD_MDA09249507]
 gi|322671329|gb|EFY67452.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 414877]
 gi|322677664|gb|EFY73727.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 366867]
 gi|322681510|gb|EFY77540.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 413180]
 gi|322683910|gb|EFY79920.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 446600]
 gi|323195479|gb|EFZ80657.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 609458-1]
 gi|323200466|gb|EFZ85546.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 556150-1]
 gi|323203030|gb|EFZ88062.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 609460]
 gi|323205271|gb|EFZ90246.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 507440-20]
 gi|323210579|gb|EFZ95463.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 556152]
 gi|323218140|gb|EGA02852.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. MB101509-0077]
 gi|323221594|gb|EGA06007.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. MB102109-0047]
 gi|323227645|gb|EGA11800.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. MB110209-0055]
 gi|323230903|gb|EGA15021.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. MB111609-0052]
 gi|323234745|gb|EGA18831.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 2009083312]
 gi|323238784|gb|EGA22834.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 2009085258]
 gi|323241484|gb|EGA25515.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. 315731156]
 gi|323248370|gb|EGA32306.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2009159199]
 gi|323252865|gb|EGA36699.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008282]
 gi|323257014|gb|EGA40723.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008283]
 gi|323260513|gb|EGA44124.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008284]
 gi|323264430|gb|EGA47936.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008285]
 gi|323269565|gb|EGA53018.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Montevideo str. IA_2010008287]
          Length = 588

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 26/143 (18%), Positives = 49/143 (34%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +           +    P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWTDFQSLALRPFG 404

Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+    G     VV+      G     L    W   D R   + I  L +
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|262194129|ref|YP_003265338.1| hypothetical protein Hoch_0830 [Haliangium ochraceum DSM 14365]
 gi|262077476|gb|ACY13445.1| protein of unknown function DUF264 [Haliangium ochraceum DSM 14365]
          Length = 503

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 41/288 (14%), Positives = 85/288 (29%), Gaps = 52/288 (18%)

Query: 227 TSNPRRLSGKFYEI----------FNKPLDDWKRFQIDTRTVEGIDPSF----HEGIIAR 272
            S P    G F+EI            +    W R +     ++           E  +A 
Sbjct: 208 CSTPLGRRGIFWEISTEELRKYPHHTRDEVPWWRCRFFCLDIDRAMREAPHMPTEERVAA 267

Query: 273 YGLDSDV----------TRVEVCGQFPQQDIDSFIPLNIIEEALNRE--------PCPDP 314
           +G  + V           + E    F  +   S+ P  +I    + +          P+P
Sbjct: 268 FGTQAIVQQLDSLPLEDFQQEFECSFVDESY-SYYPYELILPCTSEDLVPAGDFTDLPEP 326

Query: 315 YAPLIMGCDIAEEGGD-NTVVVLRRGPV--IEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371
              ++ G D+          V    G       L  + +         +   +++     
Sbjct: 327 EGRIVAGFDVGRTRDRSELAVFEDTGGHFVCRLLRRYDQVPFAEQEADLRRFLDRVPVAR 386

Query: 372 IIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL---EFAS 428
           + ID +  G    + L      V            +   N   E        L   +  +
Sbjct: 387 LSIDQSGIGMHLAENLARDYAQVVG----------DTFTNDNKERWATDLKILFQRKDIA 436

Query: 429 LINHSGLIQNLKSLKSFIVPNTGELAIESKRV-KGAKSTDYSDGLMYT 475
           L     L+  + S+K  ++P+ G++  +++R  +G  + D    +   
Sbjct: 437 LPRDRELVGQIHSIKRRVLPS-GKVGFDAERSTRGGHA-DRFWAIALA 482


>gi|213865314|ref|ZP_03387433.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Typhi str. M223]
          Length = 171

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 11  LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 67

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 68  WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 127

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 128 QYNVTYIGIDSTGVGHGVYENVK 150


>gi|253991767|ref|YP_003043123.1| putative phage terminase subunit [Photorhabdus asymbiotica subsp.
           asymbiotica ATCC 43949]
 gi|211638542|emb|CAR67163.1| probable phage terminase subunit [Photorhabdus asymbiotica subsp.
           asymbiotica ATCC 43949]
 gi|253783217|emb|CAQ86382.1| probable phage terminase subunit [Photorhabdus asymbiotica]
          Length = 585

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 32/144 (22%), Positives = 54/144 (37%), Gaps = 23/144 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE-----------PCPDP 314
            + +   Y    D  +  +  +F   DI+S   L +++  +                P  
Sbjct: 345 IDQLRLEY--SPDEYQNLLMCEF-MDDIESIFSLQLMQGCMVDSWEIWNDVQPLMLRPYG 401

Query: 315 YAPLIMGCDIAEEG--GDN--TVVV---LRRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           Y P+ +G D A+ G  GD+   VVV   L  G     L    W   + R  ++ I  L E
Sbjct: 402 YNPVWIGYDPAKGGKNGDSAGCVVVAPPLVPGGKFRILERHQWRGMNFRAQSDAIKRLTE 461

Query: 366 KYRPDAIIIDANNTGARTCDYLEM 389
           +Y  + I ID+   G      ++ 
Sbjct: 462 QYNVEYIGIDSTGVGHGVYQNVKE 485


>gi|322831306|ref|YP_004211333.1| terminase, ATPase subunit [Rahnella sp. Y9602]
 gi|321166507|gb|ADW72206.1| terminase, ATPase subunit [Rahnella sp. Y9602]
          Length = 596

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 50/338 (14%), Positives = 98/338 (28%), Gaps = 66/338 (19%)

Query: 195 IINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPL-- 245
           +  DE    P+   +     G  ++ +        S P  L+   Y     E+FNK    
Sbjct: 257 LYVDEIFWIPNFQKLRKVASGMASQEHLRTT--YFSTPSALTHGAYPFWSGELFNKGREN 314

Query: 246 ----------------------DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRV 282
                                   W++   I+     G +    + +      +    R 
Sbjct: 315 PNDRIELDIGHHALAKGRLCEDGQWRQIVTIEDALAGGCNLFNIDTLKQENSAED--FRN 372

Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIMGCDIAEEGGDN 331
               +F      S  P   ++  +                   Y  + +G D +  G   
Sbjct: 373 LFMCEFVDDQ-TSVFPFAELQRCMVESAEEWQDFSPFAMRPFGYRAVWIGYDPSHTGDSA 431

Query: 332 --TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385
              VV   L  G     L    W   D       I  L ++Y  + I +DA   G     
Sbjct: 432 GCAVVAPPLVDGGKFRVLERHQWKGMDFAAQAKSIEELTKRYCVEYIGVDATGIGQGVFQ 491

Query: 386 YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI---NHSGLIQNLKSL 442
            +               A+++ +    +T++ +K  D +    L    NH  +  +  ++
Sbjct: 492 LVRQ---------FFPAAMEIRYSPETKTKMVLKAKDTITSGRLEYDTNHKDITSSFMAI 542

Query: 443 KSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480
           +  +  +      E+ R + A   D +  +M+     P
Sbjct: 543 RKTMTASGSRSTYEASRSEEASHADVAWAIMHALLNEP 580


>gi|312601717|gb|ADQ92391.1| terminase ATPase subunit [Salmonella phage RE-2010]
 gi|321223512|gb|EFX48577.1| Phage terminase, ATPase subunit [Salmonella enterica subsp.
           enterica serovar Typhimurium str. TN061786]
          Length = 572

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 332 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 388

Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+    G     VV+      G     L    W   D R   + I  L +
Sbjct: 389 WREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 448

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 449 QYNVTYIGIDSTGVGHGVYENVK 471


>gi|298346517|ref|YP_003719204.1| phage terminase protein [Mobiluncus curtisii ATCC 43063]
 gi|298236578|gb|ADI67710.1| phage terminase protein [Mobiluncus curtisii ATCC 43063]
          Length = 470

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 63/406 (15%), Positives = 117/406 (28%), Gaps = 63/406 (15%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQ    +V              +     +   R  GKTTL   L+  +    PG  V  
Sbjct: 32  PWQKLVADVAGERQAEHPERARYQTVVVTVP--RQSGKTTLIKALMAAVAQANPGCQVYY 89

Query: 113 LANSETQLKTTL--WAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170
            A +    K  +  W E++K L             + + P       +    G +   + 
Sbjct: 90  TAQTR---KDAVEKWGELAKQLRKD----------MGIAPDGKPRVKVLEGTGNERIVFR 136

Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP----DVINLGILGF------------ 214
                     P T  G H      ++ DEA        D +                   
Sbjct: 137 GTESMIMPFAP-TVEGIHGKTSPLVVVDEAWAFDQARGDDLMAAFNPVGLTIPHSQVWII 195

Query: 215 LTERNANRFWI---------MTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPS- 264
            T  +    W+           ++P   +  F    ++ +       + +       P+ 
Sbjct: 196 STAGDTRSEWLRSLVDKGRQAINDPGTTTAFFEWSADEEMAAAN---LRSDEALAFHPAI 252

Query: 265 ------FHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL 318
                 +    +A+   D  + R      +P     S + L   E+    EP   P   +
Sbjct: 253 GFTQELWKIQSLAQTEPDH-LYRRSYLNLWPTAAETSIVDLEAWEKLAEPEPASMPPD-V 310

Query: 319 IMGCDIAEEGGDNTVVVL-RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDAN 377
            +G D+A      T+    + G  ++     SK         I+ L E   P A++ D +
Sbjct: 311 AIGFDVATARTGATIYAAWQDGETVQIHRLVSKAGAAWVEKAIAHLQETLAPMAVVADDS 370

Query: 378 NTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
                  + L   G  +Y       A+      +  +E   +++D 
Sbjct: 371 GDNRPIIEALRRNGKEIY-------ALRPREYASANSEFFARISDN 409


>gi|300088757|ref|YP_003759279.1| hypothetical protein Dehly_1680 [Dehalogenimonas
           lykanthroporepellens BL-DC-9]
 gi|299528490|gb|ADJ26958.1| conserved hypothetical protein [Dehalogenimonas
           lykanthroporepellens BL-DC-9]
          Length = 507

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 62/394 (15%), Positives = 109/394 (27%), Gaps = 69/394 (17%)

Query: 38  WGEKGTPLEGFSAPRSW--QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNA 95
           WG+        S    W  Q+E +     + ++                GR +GK+ + +
Sbjct: 20  WGQAYLYNRDGSGRDYWPHQVEDLRCPAKNIIHLD--------------GRDVGKSIVLS 65

Query: 96  WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYS 155
              L    T  G   +  A  +  L T +  E+   L   P+     M S++L       
Sbjct: 66  TDALHYAFTTRGGQGLIAAPHQGHLDTII-EEIEFQLDSNPD----LMNSIALTKYGKPK 120

Query: 156 DVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL 215
                   ++  + S +    +    D F   H      +  DE +   +     +   L
Sbjct: 121 IHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLTERAWKALRQCL 177

Query: 216 TERNANRFWIMTSNPRRL-SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARY- 273
                 R +   S P  L    +Y +     + +  F+  +             ++  Y 
Sbjct: 178 KAGGTLRIY---STPNGLRDTTYYRL--TSSEQFHVFRWPSWLNPLWTEDREAELLEFYG 232

Query: 274 GLDSDVTRVEVCGQFPQQDIDSF-----------------IPLNII--------EEALNR 308
           G DS   + EV G+  +    +F                 I +           E A +R
Sbjct: 233 GRDSSGWQHEVAGEHGKPSYGAFNVEQFNLCRQDLLEYQKIVITDSELRDCDTEEAAHDR 292

Query: 309 -----EPCPDPYAPLIMGCDIAEEGGDNTVVVL-------RRGPVIEHLFDWSKTDLRTT 356
                   P      + G D+        +VV        R    +              
Sbjct: 293 LEMLLNLTPRSGQFWVGG-DLGYTNDPTEIVVFQEMEIGERTLLKMILRVHLEHVSYPHI 351

Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390
              I+ L   Y P  I +D    G      L  L
Sbjct: 352 AQIIALLERYYTPAGIGVDNGGNGLAVVQELLTL 385


>gi|262194298|ref|YP_003265507.1| hypothetical protein Hoch_1017 [Haliangium ochraceum DSM 14365]
 gi|262077645|gb|ACY13614.1| protein of unknown function DUF264 [Haliangium ochraceum DSM 14365]
          Length = 478

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 41/288 (14%), Positives = 85/288 (29%), Gaps = 52/288 (18%)

Query: 227 TSNPRRLSGKFYEI----------FNKPLDDWKRFQIDTRTVEGIDPSF----HEGIIAR 272
            S P    G F+EI            +    W R +     ++           E  +A 
Sbjct: 183 CSTPLGRRGIFWEISTEELRKYPHHTRDEVPWWRCRFFCLDIDRAVREAPHMPTEERVAA 242

Query: 273 YGLDSDV----------TRVEVCGQFPQQDIDSFIPLNIIEEALNRE--------PCPDP 314
           +G  + V           + E    F  +   S+ P  +I    + +          P+P
Sbjct: 243 FGTQAIVQQLDSLALEDFQQEFECSFVDESY-SYYPYELILPCTSEDLVLAGDFTDLPEP 301

Query: 315 YAPLIMGCDIAEEGG-DNTVVVLRRGPV--IEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371
              ++ G D+          V    G       L  + +         +   +++     
Sbjct: 302 EGRIVAGFDVGRTRDHSELAVFEDTGGHFVCRLLRRYDQVPFAEQEADLRRFLDRVPVAR 361

Query: 372 IIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL---EFAS 428
           + ID +  G    + L      V            +   N   E        L   +  +
Sbjct: 362 LSIDQSGIGMHLAENLARDYAQVVG----------DTFTNDNKERWATDLKILFQRKDIA 411

Query: 429 LINHSGLIQNLKSLKSFIVPNTGELAIESKRV-KGAKSTDYSDGLMYT 475
           L     L+  + S+K  ++P+ G++  +++R  +G  + D    +   
Sbjct: 412 LPRDRELVGQIHSIKRRVLPS-GKVGFDAERSTRGGHA-DRFWAIALA 457


>gi|255103207|ref|ZP_05332184.1| hypothetical protein CdifQCD-6_20513 [Clostridium difficile
           QCD-63q42]
          Length = 582

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 69/505 (13%), Positives = 144/505 (28%), Gaps = 118/505 (23%)

Query: 47  GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106
             + P  + +++           +     +  +    A RG+GK+ L       +   +P
Sbjct: 31  YLANPHRFCMDYFGFNLHLFQQILIYMMMKSDQFVFIASRGLGKSWLLGVFCCVIAVLKP 90

Query: 107 GISVICLANSETQLKTTLWAEVS-----KWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCS 161
           G  V+  A  + Q K  + +++      K  +L      F++ +  +    W    +   
Sbjct: 91  GTCVLIAAKRKKQAKLLITSKILGDLYLKSDTLKREIKSFQVNAQEVSIDFWNGSRIEAV 150

Query: 162 LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD-VINLGILGFLTERNA 220
           +  D        R Y                  +I DE     +  +N  ++ FLT    
Sbjct: 151 VSNDD------ARGYRAN--------------VLIVDEYRMVDEGTVNDVLVPFLTNPRQ 190

Query: 221 NRFWIMTSNPRR-----------LSGKFYEIFNKPLDDWKRFQI---------------D 254
                   NP+            LS  +Y          +  +                 
Sbjct: 191 PG---YLQNPKYRYMQEENKEIYLSSGWYSQHWSYKKFMETVKGMLSGEDMFACSIPFTC 247

Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFI----------------P 298
           +     +        + +  +      +E CG F  +  D+F                 P
Sbjct: 248 SLEHGLLTKKRILKEMKKESMSDASFMMEYCGVFYNESDDAFFKSSWVNPCRVLESMFYP 307

Query: 299 LNIIEEALNREPCPDPY-------APLIMGCDI--AEEGGDNTVVV------LRRGPV-- 341
            + IE   N++     Y          I+G DI  A    ++  +          G    
Sbjct: 308 PSDIEYLENKKKRDKKYHLNKIKGEIRIIGADIALARGVKNDNSIYTLMRMLPNEGTYKR 367

Query: 342 -IEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGART--------CDYLEMLGY 392
            + H+  ++  +      ++  L   ++ D +I+D    G            D      Y
Sbjct: 368 CVVHIEAYNGMEAEKQAIRLKQLFSDFQADYMILDTQGIGTTVWSYIQKANYDSDRDEWY 427

Query: 393 HVYRVLGQKRAVDL-------------EFCRNRRTELHVKMADWLEFASL------INHS 433
             Y    +   VD              +   +   ++ + + D L   +L      I   
Sbjct: 428 DAYTCFNEDNTVDKSLAKKSLPVVYSMKAYADENHKMAMSLRDVLTNRTLELPISDIEAK 487

Query: 434 GLIQNLKSLKSFIVPNTGELAIESK 458
            +I   + +K+  +    E  +E+K
Sbjct: 488 EMILEKEMIKADEIDKKAE--LEAK 510


>gi|197249763|ref|YP_002147654.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Agona str. SL483]
 gi|197213466|gb|ACH50863.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Agona str. SL483]
          Length = 588

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 404

Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+    G     VV+      G     L    W   D R   + I  L +
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|152982949|ref|YP_001353896.1| hypothetical protein mma_2206 [Janthinobacterium sp. Marseille]
 gi|151283026|gb|ABR91436.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille]
          Length = 436

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 42/276 (15%), Positives = 84/276 (30%), Gaps = 35/276 (12%)

Query: 82  ISAGRGIGKT-TLNAWLVLWLMSTRP-GISVICLANSETQLKTTLWAEVSKWLSLLPNKH 139
           + A R  GKT      L+   ++          +A    Q K+  W  V ++ +++P   
Sbjct: 29  VVAHRRAGKTVACVNELIKAALTFHGNDGRFAYVAPFYRQAKSVAWDYVKRFSAVIPGIS 88

Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
             E +    +P                       + +  +  D   G        ++ DE
Sbjct: 89  INESELRIDYPNGSR------------------IQLFGADNADALRGLFFDG---VVADE 127

Query: 200 ASGTPDVINL-GILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP--LDDWKRFQIDTR 256
                  +    I   L +R    + ++   P+  +  + EI+      +DW    I   
Sbjct: 128 YGDWKPSVWGYVIRPALADRGG--WAVIIGTPKGRNQFW-EIYQHAGVNEDWLCLTIRAS 184

Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA 316
               + P   E +  +  L  D  R E+   F      +     I +   +     D Y 
Sbjct: 185 ESGLLPPKEIEAL--QLELTEDAWRQEMECDFDAALPGAIFGKEIWQAEQDGRVKDDLYD 242

Query: 317 P---LIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS 349
           P   +    D+     D  +   + G  +  +  +S
Sbjct: 243 PELKVHAVLDLG-FTDDTAIWWFQVGKELRIIDCYS 277


>gi|331656886|ref|ZP_08357848.1| terminase, ATPase subunit [Escherichia coli TA206]
 gi|331055134|gb|EGI27143.1| terminase, ATPase subunit [Escherichia coli TA206]
          Length = 531

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 291 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 347

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 348 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 407

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 408 QYNVTYIGIDSTGVGHGVYENVK 430


>gi|78355964|ref|YP_387413.1| hypothetical protein Dde_0917 [Desulfovibrio desulfuricans subsp.
           desulfuricans str. G20]
 gi|78218369|gb|ABB37718.1| hypothetical protein Dde_0917 [Desulfovibrio desulfuricans subsp.
           desulfuricans str. G20]
          Length = 507

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 62/394 (15%), Positives = 109/394 (27%), Gaps = 69/394 (17%)

Query: 38  WGEKGTPLEGFSAPRSW--QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNA 95
           WG+        S    W  Q+E +     + ++                GR +GK+ + +
Sbjct: 20  WGQAYLYNRDGSGRDYWPHQVEDLRCPAKNIIHLD--------------GRDVGKSIVLS 65

Query: 96  WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYS 155
              L    T  G   +  A  +  L T +  E+   L   P+     M S++L       
Sbjct: 66  TDALHYAFTTRGGQGLVAAPHQGHLDTII-EEIEFQLDTNPD----LMNSIALTKYGKPK 120

Query: 156 DVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL 215
                   ++  + S +    +    D F   H      +  DE +   +     +   L
Sbjct: 121 IHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLTERAWKALRQCL 177

Query: 216 TERNANRFWIMTSNPRRL-SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARY- 273
                 R +   S P  L    +Y +     + +  F+  +             ++  Y 
Sbjct: 178 KAGGTLRIY---STPNGLRDTTYYRL--TSSEQFHVFRWPSWLNPLWTEDREAELLEFYG 232

Query: 274 GLDSDVTRVEVCGQFPQQDIDSF-----------------IPLNII--------EEALNR 308
           G DS   + EV G+  +    +F                 I +           E A +R
Sbjct: 233 GRDSSGWQHEVAGEHGKPSYGAFNVEQFNLCRQDLLEYQKIVITDSELRDCDTEEAAHDR 292

Query: 309 -----EPCPDPYAPLIMGCDIAEEGGDNTVVVL-------RRGPVIEHLFDWSKTDLRTT 356
                   P      + G D+        +VV        R    +              
Sbjct: 293 LEMLLNLTPRSGQFWVGG-DLGYTNDPTEIVVFQEMEVGERTLLKMILRVHLEHVSYPHI 351

Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390
              I+ L   Y P  I +D    G      L  L
Sbjct: 352 AQIIALLERYYTPAGIGVDNGGNGLAVVQELLTL 385


>gi|34335039|gb|AAQ65014.1| unknown [synthetic construct]
 gi|301159280|emb|CBW18795.1| probable terminase subunit [Salmonella enterica subsp. enterica
           serovar Typhimurium str. SL1344]
 gi|323131065|gb|ADX18495.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Typhimurium str. 4/74]
          Length = 588

 Score = 57.4 bits (137), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 404

Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+    G     VV+      G     L    W   D R   + I  L +
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|200387487|ref|ZP_03214099.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Virchow str. SL491]
 gi|199604585|gb|EDZ03130.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Virchow str. SL491]
          Length = 588

 Score = 57.4 bits (137), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 404

Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+    G     VV+      G     L    W   D R   + I  L +
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|221196218|ref|ZP_03569265.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
 gi|221202891|ref|ZP_03575910.1| conserved hypothetical protein [Burkholderia multivorans CGD2]
 gi|221176825|gb|EEE09253.1| conserved hypothetical protein [Burkholderia multivorans CGD2]
 gi|221182772|gb|EEE15172.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
          Length = 424

 Score = 57.4 bits (137), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 43/240 (17%), Positives = 67/240 (27%), Gaps = 35/240 (14%)

Query: 67  LNSVNNPNPEVFKGAISAGRGIGKTTL-NAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
              +     E  +  I  GR  GKTTL       W      G+ V     +         
Sbjct: 14  QAEIGRAFNESRRVVIRCGRRFGKTTLLERCASKWA---YNGLKVGWFGPTYK------- 63

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
                 L+L   K         ++       V+  + G   + ++           D   
Sbjct: 64  ------LNLPTYKRILRTVQPVVYSKSKIDQVIELNSGGCIEFWTL---------QDEDA 108

Query: 186 GHHNTYGMAIINDEASGTPD---VINLGILGFLTERNANRFWIMTSNPRR--LSGKFYEI 240
           G    Y   +I DE S  P     I    +   T  +     IM   P+       FYE 
Sbjct: 109 GRSRFYD-RVIIDEGSLVPKGLRSIWEQAI-APTLLDRKGHAIMAGTPKGIDPENFFYEA 166

Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
                  W+ F   T +   +DP     +   Y   + V + E    F   +  +F    
Sbjct: 167 CTDKTLGWREFHAPTASNPMLDPEAVARLKDEY--PALVYQQEYLADFVDWNGAAFFSEE 224


>gi|16763092|ref|NP_458709.1| terminase subunit [Salmonella enterica subsp. enterica serovar
           Typhi str. CT18]
 gi|25315565|pir||AH1037 probable terminase chain [imported] - Salmonella enterica subsp.
           enterica serovar Typhi (strain CT18)
 gi|16505400|emb|CAD06749.1| probable terminase subunit [Salmonella enterica subsp. enterica
           serovar Typhi]
          Length = 588

 Score = 57.4 bits (137), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 26/143 (18%), Positives = 50/143 (34%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+    V+      G     L    W   D R   + I  L +
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVIAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|309797383|ref|ZP_07691776.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           145-7]
 gi|308119007|gb|EFO56269.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           145-7]
          Length = 418

 Score = 57.4 bits (137), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 48/335 (14%), Positives = 105/335 (31%), Gaps = 46/335 (13%)

Query: 59  MEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET 118
           + +V  H        +P  FK  + AGR  GK+ L+   ++   +      V  +A +  
Sbjct: 7   LSLVQLHSGQMKVFQSPHRFKV-VCAGRRWGKSRLSISTIIRAAAKEKKQRVWYVAPTYQ 65

Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178
             +  LW ++ + L                 P  W       ++ I  K+ S +      
Sbjct: 66  MARQILWDDLQEVL-----------------PRKWVRKKNDTTMTIVLKNGSEIALK-GA 107

Query: 179 ERPDTFVGHHNTYGMAIINDEASGT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
           ++PDT  G        ++ DE      D     +   L+        ++   P+    +F
Sbjct: 108 DKPDTLRGV---ALHFVVLDEFQDMKADTWYKVLRPTLSS--TRGGALIIGTPKG-FSEF 161

Query: 238 YEIFN-------KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290
           ++++        +    WK +Q  T     +  +  E       +D      E    F  
Sbjct: 162 HKLWTIGQNVELQRKGQWKSWQFVTADSPFVPTAEIEAAKND--MDPKSFAQEYLASFEN 219

Query: 291 QDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK 350
                + P +       +    +P  P+ +G D      D    V+ +      L+   +
Sbjct: 220 MSGRVYYPFD--RNVHVKPLQFNPRLPIWVGQD---FNIDPMSSVILQPQPNGELWAIDE 274

Query: 351 -----TDLRTTNNKISGLVEKYRPD-AIIIDANNT 379
                ++     +++     +++    +  D    
Sbjct: 275 LVLFSSNTAEVCDELERRFWRWKSQVTVFPDPAGA 309


>gi|163801735|ref|ZP_02195633.1| hypothetical protein 1103602000597_AND4_09782 [Vibrio sp. AND4]
 gi|159174652|gb|EDP59454.1| hypothetical protein AND4_09782 [Vibrio sp. AND4]
          Length = 546

 Score = 57.4 bits (137), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 34/260 (13%), Positives = 71/260 (27%), Gaps = 55/260 (21%)

Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247
           +  DE    P  D +N       T +N  +     S P   + + Y     + + +  D 
Sbjct: 211 VYVDEYFWIPKFDELNKLASAMATHKNWRKT--YFSTPSAKTHQAYTFWTGDQWRRGRDT 268

Query: 248 WKRFQIDT----RTVEGIDPSF--------------------HEGIIARYGLDSDVTRVE 283
               +  T    R    + P                       + +   Y  D       
Sbjct: 269 RANIEFPTFDEYRDGGRLCPDKQWRYVVTIEDAAAGGCELFDIDELRDEYSKDD--FDNL 326

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNT- 332
               F      S    + +E+A+        + P          + +G D +    +   
Sbjct: 327 FMCIFVDGAS-SVFKFSALEKAMVDISRWQDFKPNDNDPFERREVWLGYDPSRTRDNACL 385

Query: 333 ------VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY 386
                 V+ + +   +     W   + +    ++S + E+Y    + ID    GA   D 
Sbjct: 386 VVVAPPVIAIEK-FRVLEKHYWRGLNFQYQAQQVSKVFERYNVSYLGIDTTGIGAGVYDL 444

Query: 387 L-EMLGYHVYRVLGQKRAVD 405
           L +        +     + +
Sbjct: 445 LSKKHPRETVAIQYSNESKN 464


>gi|253689540|ref|YP_003018730.1| hypothetical protein PC1_3171 [Pectobacterium carotovorum subsp.
           carotovorum PC1]
 gi|251756118|gb|ACT14194.1| protein of unknown function DUF264 [Pectobacterium carotovorum
           subsp. carotovorum PC1]
          Length = 589

 Score = 57.4 bits (137), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 34/205 (16%), Positives = 62/205 (30%), Gaps = 30/205 (14%)

Query: 246 DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
             W++   ++     G +    + ++  Y       +  +  +F      S  P   ++ 
Sbjct: 329 GQWRQIVTVEDALSGGCNLFDLDQLMLEY--SPAEYQNLLMCEFVDDKA-SVFPFEELQR 385

Query: 305 ALNREPCPDP-----------YAPLIMGCDIAEEGGDNTVVVLRR----GPVIEHL--FD 347
            +                   Y P+ +G D +  G     VVL      G     L  F 
Sbjct: 386 CMVDALEEWEDFNPYALRPFAYKPVWIGYDPSHTGDSAGCVVLAPPQAPGGKFRILERFQ 445

Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLE 407
           W   D     + I  L EKY  + I IDA   G      +               A +++
Sbjct: 446 WKGMDFAAQADAIKLLTEKYIVEYIGIDATGIGQGVYQLVRG---------FFPAAREIK 496

Query: 408 FCRNRRTELHVKMADWLEFASLINH 432
           +    +T + +K  D +    L   
Sbjct: 497 YSPEIKTAMVLKAKDTITSGRLEYD 521


>gi|213650797|ref|ZP_03380850.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Typhi str. J185]
          Length = 518

 Score = 57.4 bits (137), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 278 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 334

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 335 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 394

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 395 QYNVTYIGIDSTGVGHGVYENVK 417


>gi|309795387|ref|ZP_07689805.1| conserved hypothetical protein [Escherichia coli MS 145-7]
 gi|308121037|gb|EFO58299.1| conserved hypothetical protein [Escherichia coli MS 145-7]
          Length = 588

 Score = 57.4 bits (137), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|16766035|ref|NP_461650.1| terminase-like protein [Enterobacteria phage Fels-2]
 gi|169936048|ref|YP_001718747.1| P2 gpP-like protein [Enterobacteria phage Fels-2]
 gi|16421269|gb|AAL21609.1| Fels-2 prophage protein [Enterobacteria phage Fels-2]
 gi|312913743|dbj|BAJ37717.1| terminase-like protein [Salmonella enterica subsp. enterica serovar
           Typhimurium str. T000240]
          Length = 588

 Score = 57.4 bits (137), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 404

Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+    G     VV+      G     L    W   D R   + I  L +
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|323938219|gb|EGB34479.1| terminase [Escherichia coli E1520]
          Length = 588

 Score = 57.4 bits (137), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|291335343|gb|ADD94958.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C148]
          Length = 234

 Score = 57.4 bits (137), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 36/187 (19%), Positives = 73/187 (39%), Gaps = 23/187 (12%)

Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE----RPDTFVGHHNTYGMAIINDEASG 202
            L P PW        L ++  + ST+    +E     R  +  G        ++ DEA+ 
Sbjct: 12  KLVPKPWIKTKNETDLKLELVNGSTIELKGTENAMALRGRSLSG--------VVLDEAAF 63

Query: 203 T-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF----NKPLDDWKRFQIDTRT 257
              +V    I   L ++    + +  S P   +  FY+++    + P ++WKR+   T  
Sbjct: 64  MDAEVWFEVIRPALADKQG--WALFISTPDGTASWFYDLWCYCEDDPTNEWKRWCYTTIE 121

Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317
              +     E   A+  LD    R E    F  +++   + ++  ++ ++ +       P
Sbjct: 122 GGNVPQEEVEAARAQ--LDPRTFRQEFEASF--ENLTGLVAISFSDDNISTDAKDISIQP 177

Query: 318 LIMGCDI 324
           L++G D 
Sbjct: 178 LLLGVDF 184


>gi|198245759|ref|YP_002216726.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Dublin str. CT_02021853]
 gi|197940275|gb|ACH77608.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Dublin str. CT_02021853]
 gi|326624483|gb|EGE30828.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Dublin str. 3246]
          Length = 588

 Score = 57.4 bits (137), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|170020778|ref|YP_001725732.1| hypothetical protein EcolC_2777 [Escherichia coli ATCC 8739]
 gi|169755706|gb|ACA78405.1| protein of unknown function DUF264 [Escherichia coli ATCC 8739]
          Length = 588

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|306812733|ref|ZP_07446926.1| Terminase, ATPase subunit (GpP) [Escherichia coli NC101]
 gi|305853496|gb|EFM53935.1| Terminase, ATPase subunit (GpP) [Escherichia coli NC101]
          Length = 588

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|300907199|ref|ZP_07124862.1| hypothetical protein HMPREF9536_05153 [Escherichia coli MS 84-1]
 gi|301303626|ref|ZP_07209748.1| hypothetical protein HMPREF9347_02221 [Escherichia coli MS 124-1]
 gi|300401074|gb|EFJ84612.1| hypothetical protein HMPREF9536_05153 [Escherichia coli MS 84-1]
 gi|300841125|gb|EFK68885.1| hypothetical protein HMPREF9347_02221 [Escherichia coli MS 124-1]
 gi|315257856|gb|EFU37824.1| conserved hypothetical protein [Escherichia coli MS 85-1]
          Length = 588

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWSDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|253774139|ref|YP_003036970.1| hypothetical protein ECBD_2764 [Escherichia coli
           'BL21-Gold(DE3)pLysS AG']
 gi|254160943|ref|YP_003044051.1| Terminase, ATPase subunit [Escherichia coli B str. REL606]
 gi|242376647|emb|CAQ31358.1| ybl37 [Escherichia coli BL21(DE3)]
 gi|253325183|gb|ACT29785.1| protein of unknown function DUF264 [Escherichia coli
           'BL21-Gold(DE3)pLysS AG']
 gi|253972844|gb|ACT38515.1| Terminase, ATPase subunit [Escherichia coli B str. REL606]
 gi|253977058|gb|ACT42728.1| Terminase, ATPase subunit [Escherichia coli BL21(DE3)]
          Length = 588

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEIWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|16762249|ref|NP_457866.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Typhi str. CT18]
 gi|29143738|ref|NP_807080.1| terminase ATPase subunit [Salmonella enterica subsp. enterica
           serovar Typhi str. Ty2]
 gi|215485952|ref|YP_002328383.1| predicted terminase, ATPase subunit [Escherichia coli O127:H6 str.
           E2348/69]
 gi|312969111|ref|ZP_07783318.1| terminase, ATPase subunit [Escherichia coli 2362-75]
 gi|25315563|pir||AB0927 terminase, ATPase chain [imported] - Salmonella enterica subsp.
           enterica serovar Typhi (strain CT18)
 gi|16504553|emb|CAD09436.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Typhi]
 gi|29139373|gb|AAO70940.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Typhi str. Ty2]
 gi|215264024|emb|CAS08365.1| predicted terminase, ATPase subunit [Escherichia coli O127:H6 str.
           E2348/69]
 gi|312286513|gb|EFR14426.1| terminase, ATPase subunit [Escherichia coli 2362-75]
          Length = 588

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|324112701|gb|EGC06677.1| terminase [Escherichia fergusonii B253]
          Length = 588

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|323953478|gb|EGB49344.1| terminase [Escherichia coli H252]
          Length = 588

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|157160343|ref|YP_001457661.1| terminase, ATPase subunit [Escherichia coli HS]
 gi|218559567|ref|YP_002392480.1| Terminase, ATPase subunit (GpP) [Escherichia coli S88]
 gi|256021061|ref|ZP_05434926.1| Terminase, ATPase subunit (GpP) [Shigella sp. D9]
 gi|300817075|ref|ZP_07097294.1| conserved hypothetical protein [Escherichia coli MS 107-1]
 gi|331662228|ref|ZP_08363151.1| terminase, ATPase subunit [Escherichia coli TA143]
 gi|331676606|ref|ZP_08377302.1| terminase, ATPase subunit [Escherichia coli H591]
 gi|332282288|ref|ZP_08394701.1| DNA-dependent ATPase terminase subunit [Shigella sp. D9]
 gi|157066023|gb|ABV05278.1| terminase, ATPase subunit [Escherichia coli HS]
 gi|218366336|emb|CAR04087.1| Terminase, ATPase subunit (GpP) [Escherichia coli S88]
 gi|300530427|gb|EFK51489.1| conserved hypothetical protein [Escherichia coli MS 107-1]
 gi|315615257|gb|EFU95893.1| terminase, ATPase subunit [Escherichia coli 3431]
 gi|323172219|gb|EFZ57857.1| terminase, ATPase subunit [Escherichia coli LT-68]
 gi|323190830|gb|EFZ76098.1| terminase, ATPase subunit [Escherichia coli RN587/1]
 gi|323942735|gb|EGB38900.1| terminase [Escherichia coli E482]
 gi|323946304|gb|EGB42336.1| terminase [Escherichia coli H120]
 gi|323963883|gb|EGB59377.1| terminase [Escherichia coli M863]
 gi|327252355|gb|EGE64027.1| terminase, ATPase subunit [Escherichia coli STEC_7v]
 gi|331060650|gb|EGI32614.1| terminase, ATPase subunit [Escherichia coli TA143]
 gi|331075295|gb|EGI46593.1| terminase, ATPase subunit [Escherichia coli H591]
 gi|332104640|gb|EGJ07986.1| DNA-dependent ATPase terminase subunit [Shigella sp. D9]
          Length = 588

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|312970940|ref|ZP_07785119.1| terminase, ATPase subunit [Escherichia coli 1827-70]
 gi|310336701|gb|EFQ01868.1| terminase, ATPase subunit [Escherichia coli 1827-70]
          Length = 588

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|307314499|ref|ZP_07594102.1| protein of unknown function DUF264 [Escherichia coli W]
 gi|306905922|gb|EFN36444.1| protein of unknown function DUF264 [Escherichia coli W]
 gi|315060102|gb|ADT74429.1| terminase, ATPase subunit [Escherichia coli W]
 gi|323379340|gb|ADX51608.1| terminase ATPase subunit [Escherichia coli KO11]
 gi|332342200|gb|AEE55534.1| phage terminase, ATPase subunit [Escherichia coli UMNK88]
          Length = 588

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|26246838|ref|NP_752878.1| terminase, ATPase subunit [Escherichia coli CFT073]
 gi|26107238|gb|AAN79421.1|AE016758_25 Terminase, ATPase subunit [Escherichia coli CFT073]
          Length = 588

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|300916285|ref|ZP_07133032.1| conserved hypothetical protein [Escherichia coli MS 115-1]
 gi|300416374|gb|EFJ99684.1| conserved hypothetical protein [Escherichia coli MS 115-1]
          Length = 588

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|271499312|ref|YP_003332337.1| hypothetical protein Dd586_0742 [Dickeya dadantii Ech586]
 gi|270342867|gb|ACZ75632.1| protein of unknown function DUF264 [Dickeya dadantii Ech586]
          Length = 591

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 27/162 (16%), Positives = 54/162 (33%), Gaps = 20/162 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  E +  RY +D+    +     F   + D+    + +
Sbjct: 329 PDGQWRYVITMEDAIRGGFNLASLEKLRNRYNVDT--FNMLYMCVFVD-NKDAVFSFDDL 385

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV------VVLRRGPVIEHLF 346
           E           + P          +  G D A  G  +T       +       +  + 
Sbjct: 386 ERCGVDPATWQDHDPTAPRPFGNREVWGGYDPARSGDLSTFVIVAPPIYEGEKFRVLLVV 445

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
           +W   + R   N+I  L ++Y    I ID    GA   + ++
Sbjct: 446 NWHGMNFRYQANQIKKLFQRYHFTYIGIDVTGIGAGVFENIQ 487


>gi|222034345|emb|CAP77086.1| Terminase, ATPase subunit [Escherichia coli LF82]
          Length = 588

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|218690765|ref|YP_002398977.1| terminase, ATPase subunit (GpP) [Escherichia coli ED1a]
 gi|218428329|emb|CAR09255.2| Terminase, ATPase subunit (GpP) [Escherichia coli ED1a]
          Length = 588

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|300896792|ref|ZP_07115295.1| terminase, ATPase subunit family protein [Escherichia coli MS
           198-1]
 gi|300359367|gb|EFJ75237.1| terminase, ATPase subunit family protein [Escherichia coli MS
           198-1]
          Length = 391

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 33/184 (17%), Positives = 51/184 (27%), Gaps = 29/184 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 218 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAAHPFG 274

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 275 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 334

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 335 VEYIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPELKTAMVLKAKDVIRRGC 385

Query: 429 LINH 432
           L   
Sbjct: 386 LEYD 389


>gi|320199051|gb|EFW73648.1| Phage terminase, ATPase subunit [Escherichia coli EC4100B]
          Length = 588

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|304360765|ref|YP_003856886.1| gp8 [Mycobacterium phage Angelica]
 gi|302858349|gb|ADL71097.1| gp8 [Mycobacterium phage Angelica]
          Length = 473

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 69/389 (17%), Positives = 123/389 (31%), Gaps = 57/389 (14%)

Query: 52  RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVI 111
             WQ +  ++V A    S      ++F  +I   R  GKT     +V       PG +VI
Sbjct: 43  DQWQDDLGKLVCAK--RSDGLYAADMFAMSIP--RQTGKTYFLGAIVFAFCKMNPGTTVI 98

Query: 112 CLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYST 171
             A+     +T   AE  K +  L  +       L++H             G ++  ++ 
Sbjct: 99  WTAH-----RTRTAAETFKSMQALAKREQIAPHILNVH----------TGNGKEAVLFTN 143

Query: 172 MCRTYSEERPDTF-VGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230
             R     R   F  G        +I DEA    +     ++   T  + N   +    P
Sbjct: 144 GSRILFGAREKGFGRGF--AKVDVLIFDEAQILSENAMDDMIPA-TNASPNGLILFAGTP 200

Query: 231 RRLS--GKFY-----EIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGI------------ 269
            + +  G+ +     +  N   DD  +     D       + ++ +              
Sbjct: 201 PKPTDPGEVFTNLRMDALNGESDDVAYVEISADENDDPDEESTWRKMNPSYPHRTSARAI 260

Query: 270 -IARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC----PDPYA-PLIMGCD 323
              R  L  D  R E  G + +  + +     +I+  L R+      P+P A P  +G D
Sbjct: 261 RRMRKALSWDSFRREAMGIWDKISVHA----QVIKAGLWRDLADPLGPEPGAKPASLGVD 316

Query: 324 IAEEGGDNTVVVLRRGPVIEHLFD-WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGAR 382
           ++  G  +          + H+   W+ TD       I       R   ++ID  +    
Sbjct: 317 MSHGGAISIGGCWLIDDELRHVEQVWAGTDTAAAVEFIVERAG--RRIPVVIDDASPAKA 374

Query: 383 TCDYLEMLGYHVYRVLGQKRAVDLEFCRN 411
               L+     V        A      +N
Sbjct: 375 LVPELKRRKVKVRITYAGDMAKACGLFKN 403


>gi|82543312|ref|YP_407259.1| terminase, ATPase subunit [Shigella boydii Sb227]
 gi|81244723|gb|ABB65431.1| terminase, ATPase subunit [Shigella boydii Sb227]
 gi|320185726|gb|EFW60482.1| Phage terminase, ATPase subunit [Shigella flexneri CDC 796-83]
 gi|332097052|gb|EGJ02035.1| terminase, ATPase subunit [Shigella boydii 3594-74]
          Length = 588

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|307129625|ref|YP_003881641.1| hypothetical protein Dda3937_02574 [Dickeya dadantii 3937]
 gi|306527154|gb|ADM97084.1| Possible phage protein [Dickeya dadantii 3937]
          Length = 591

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 30/162 (18%), Positives = 57/162 (35%), Gaps = 20/162 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  E +  RY +D+    +     F   + D+    + +
Sbjct: 329 PDGQWRYVITMEDAIRGGFNLASLEKLRNRYNVDT--FNMLYMCVFVD-NKDAVFSFDDL 385

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVV----LRRGPVIEHLF-- 346
           E           + P          +  G D A  G  +T+V+    +  G     L   
Sbjct: 386 ERCGVDPATWQDHDPTAPRPFGNREVWGGYDPARSGDLSTLVIVAPPIYDGEKFRVLLVV 445

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
           +W   + R   N+I  L ++Y    I ID    GA   + ++
Sbjct: 446 NWHGMNFRYQANQIKKLFQRYHFTYIGIDVTGIGAGVFENIQ 487


>gi|260599032|ref|YP_003211603.1| Terminase, ATPase subunit [Cronobacter turicensis z3032]
 gi|260218209|emb|CBA33092.1| Terminase, ATPase subunit [Cronobacter turicensis z3032]
          Length = 590

 Score = 57.0 bits (136), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 26/143 (18%), Positives = 40/143 (27%), Gaps = 18/143 (12%)

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY-----------APLIM 320
           +    ++  R     +F      S  P   ++  +                     P+ +
Sbjct: 355 KRENSAEDFRNLFMCEFVDDKA-SVFPFEELQRCMVDSLEEWEDFSPFAARPFGSRPVWI 413

Query: 321 GCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIII 374
           G D +  G     VVL      G     L    W   D  T    I  L EKY+ + I I
Sbjct: 414 GYDPSHTGDSAGCVVLAPPVVSGGKFRILERHQWKGMDFATQAQAIRELTEKYQVEYIGI 473

Query: 375 DANNTGARTCDYLEMLGYHVYRV 397
           DA   G      +         +
Sbjct: 474 DATGIGQGVFQLVRAFWPAAREI 496


>gi|291334416|gb|ADD94071.1| hypothetical protein GobsU_33659 [uncultured phage
           MedDCM-OCT-S04-C1035]
 gi|291334470|gb|ADD94124.1| hypothetical protein GobsU_33659 [uncultured phage
           MedDCM-OCT-S04-C1161]
          Length = 223

 Score = 57.0 bits (136), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 34/235 (14%), Positives = 72/235 (30%), Gaps = 31/235 (13%)

Query: 107 GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166
                 +A +  Q K+  W  + ++ + +PN  + E +     P      +L        
Sbjct: 6   NPRFAYIAPTFKQAKSIAWDYMKQFTAKIPNTKFNETELRVDLPNGSRITLLG------- 58

Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVIN-LGILGFLTERNANRFWI 225
                       E  D   G +       + DE +     +    I   L++R    + +
Sbjct: 59  -----------AENSDGLRGIYLDGC---VIDEYANIDGKLFAEIIRPALSDR--KGYCV 102

Query: 226 MTSNPRRLSGKFYEIFNKPLD--DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283
               P  ++  FY+++       DW  ++      + +DP   E      G        E
Sbjct: 103 FIGTPAGMNNNFYDLYQHANGAEDWFNYKAKASDTKIVDPEELEKAKEVMGEKK--YLQE 160

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGG--DNTVVVL 336
               +      +     I +     +    PY P  +    A + G  D++ ++ 
Sbjct: 161 FECDWIANIEGAIYGEEIAKIEDKNQIARVPYDP-TLPVSTAWDLGVADHSSIIF 214


>gi|296103195|ref|YP_003613341.1| hypothetical protein ECL_02853 [Enterobacter cloacae subsp. cloacae
           ATCC 13047]
 gi|295057654|gb|ADF62392.1| hypothetical protein ECL_02853 [Enterobacter cloacae subsp. cloacae
           ATCC 13047]
          Length = 591

 Score = 57.0 bits (136), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 28/169 (16%), Positives = 55/169 (32%), Gaps = 20/169 (11%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  E +  RY  ++    +     F     DS    + +
Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NTATFDMLYMCVFVDSK-DSVFSFSDL 384

Query: 303 EEA---LNREPCPDPYA-------PLIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346
           E     ++     DP A       P+  G D A  G  +  V++           +  + 
Sbjct: 385 EACGVEMDTWQDHDPDAKRPFGDRPVWGGFDPARSGDLSCFVIVAPPMFAVEKFRVLKVI 444

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVY 395
            W   + R    +I  L ++Y    + +D    G    D ++     V 
Sbjct: 445 YWKGMNFRYQAKQIEKLFDQYNFTYLGVDVTGIGQGVFDNIQHFAMKVV 493


>gi|310815629|ref|YP_003963593.1| Putative large terminase [Ketogulonicigenium vulgare Y25]
 gi|308754364|gb|ADO42293.1| Putative large terminase [Ketogulonicigenium vulgare Y25]
          Length = 427

 Score = 57.0 bits (136), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 68/424 (16%), Positives = 113/424 (26%), Gaps = 75/424 (17%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGIS---------VICLANSETQLKTTLWAEVSKWL 132
           I  GRG GKT   A    W+ S   G           V  +A +  Q +  +        
Sbjct: 36  IMGGRGAGKTRAGA---EWVRSMVEGPRPDTPGRAKRVGLIAQTMDQAREVMV------- 85

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
                   F    L     P           +         R +S   P+   G      
Sbjct: 86  --------FGDSGLMACCPPARRPEWIAGRAMLRWPNGAEARLFSAHDPEALRGPQFD-- 135

Query: 193 MAIINDEASG--TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250
            AI  DE +           ++  L   +  R            G F             
Sbjct: 136 -AIWADEVAKWRLAQEAWDMLVMGLRLGDDPR---ACLTTTPRGGPFLRKLLAQSGTVMT 191

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
                     + P F   + A +   S + R E+ G    +   +  P ++++ AL R+ 
Sbjct: 192 HAPTRANRANLAPGFVAAVEAMF-EGSHLGRQELDGLLVDEAEGTLWPQHLLDAALQRQA 250

Query: 311 CPDPYAPLIMGCDI---AEEGGDNTVVVLRRGPVIEHLFDWS----------KTDLRTTN 357
            P     +++  D       G D   +++          DW                T  
Sbjct: 251 PP--LDRIVVAVDPPVTGHAGSDACGIIVAGVEQRGAPTDWRLWVIEDATVQGASPHTWA 308

Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELH 417
           +       ++  D ++ + N  GA     L  L  H+       RAV     +  R E  
Sbjct: 309 SAAIAAFHRHGADRLVAEVNQGGALVESVLRQLDPHI-----PYRAVRASKSKGARAE-- 361

Query: 418 VKMADWLEFASLINHSGLI---QNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMY 474
             ++   E     +  GL      +  +        G             S D  D L++
Sbjct: 362 -PVSTIYERGRACHLPGLALLEAQMSLMTLQGFTGKG-------------SPDRVDALVW 407

Query: 475 TFAE 478
              E
Sbjct: 408 AAHE 411


>gi|258545857|ref|ZP_05706091.1| probable terminase (atpase subunit) related protein
           [Cardiobacterium hominis ATCC 15826]
 gi|258518873|gb|EEV87732.1| probable terminase (atpase subunit) related protein
           [Cardiobacterium hominis ATCC 15826]
          Length = 595

 Score = 57.0 bits (136), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 34/201 (16%), Positives = 60/201 (29%), Gaps = 29/201 (14%)

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
             I+     G D    E +  ++          +  QF     DS   +  ++  +    
Sbjct: 339 ITIEDAINSGFDRVTLEKLRIKF--PPGQFENLLMCQFVNDG-DSIFKMAELQRCMVDAW 395

Query: 311 CPDPYA-----------PLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD--WSKTDL 353
                            P+ +G D +    D ++VV+      G V   +    ++  D 
Sbjct: 396 TVWQDYTPLAARPLGDVPVWIGYDPSRSQDDASLVVIAPPQVEGGVFRIIDKQSFNGLDF 455

Query: 354 RTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRR 413
                KI      Y    I IDA   G    D +      V ++L    A +        
Sbjct: 456 DAQARKIRDFCRMYNVVHIAIDATGIGQAVYDLVRQFFPRVRKILYSVEAKN-------- 507

Query: 414 TELHVKMADWLEFASLINHSG 434
            E+ +K    +  A L   +G
Sbjct: 508 -EMVLKAKQLIAHARLQWDNG 527


>gi|291334530|gb|ADD94183.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201]
 gi|291334650|gb|ADD94297.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695]
          Length = 223

 Score = 57.0 bits (136), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 32/240 (13%), Positives = 72/240 (30%), Gaps = 31/240 (12%)

Query: 102 MSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCS 161
           M          +A +  Q K+  W  + ++   +P+  + E +     P      +L   
Sbjct: 1   MCPHKNPRFAYIAPTFKQAKSIAWDYMKQFTDKIPSTKFNETELRVDLPNGARITLLG-- 58

Query: 162 LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVIN-LGILGFLTERNA 220
                            E  D   G +       + DE +     +    I   L++R  
Sbjct: 59  ----------------AENSDGLRGIYLDGC---VIDEYANIDGKLFAEIIRPALSDR-- 97

Query: 221 NRFWIMTSNPRRLSGKFYEIFNKPLD--DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSD 278
             + +    P  ++  FY+++       DW  ++      + +D    +      G    
Sbjct: 98  KGYCVFIGTPAGMNNNFYDLYQHANGAEDWFNYKAKASETKIVDQEELDKAKEVMGEKK- 156

Query: 279 VTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGG--DNTVVVL 336
               E    +      +     I +    ++    PY P  +    A + G  D++ ++ 
Sbjct: 157 -YLQEFECDWIANIEGAIYGEEIAKLDDKKQLARVPYDP-TLPVSTAWDLGVADHSSIIF 214


>gi|51597451|ref|YP_071642.1| orf16-like phage protein [Yersinia pseudotuberculosis IP 32953]
 gi|51590733|emb|CAH22378.1| Possible [Haemophilus phage HP1] orf16-like phage protein [Yersinia
           pseudotuberculosis IP 32953]
          Length = 601

 Score = 57.0 bits (136), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 22/140 (15%), Positives = 47/140 (33%), Gaps = 19/140 (13%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP-------- 317
            E +  +Y  ++    +    QF     D+    + +E+          + P        
Sbjct: 354 IERLRNKY--NATAFAMLYMCQFVDSK-DAVFKFSELEKCAVDAGMWQDHDPKAARPFGN 410

Query: 318 --LIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369
             +  G D +  G ++T V++           +  ++ W   +     ++I  L+ +Y  
Sbjct: 411 REVWGGFDPSRSGDNSTFVIVAPPLYDGERFRVLAVYYWQGLNFNYQADQIKQLMRRYNM 470

Query: 370 DAIIIDANNTGARTCDYLEM 389
             I ID    G    D +E 
Sbjct: 471 TYIGIDITGIGRGVFDLVER 490


>gi|332560992|ref|ZP_08415310.1| hypothetical protein RSWS8N_18139 [Rhodobacter sphaeroides WS8N]
 gi|332274790|gb|EGJ20106.1| hypothetical protein RSWS8N_18139 [Rhodobacter sphaeroides WS8N]
          Length = 468

 Score = 57.0 bits (136), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 38/237 (16%), Positives = 70/237 (29%), Gaps = 18/237 (7%)

Query: 175 TYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRR 232
           T     PDT  G        +I DE +       I   +   +++          S P  
Sbjct: 133 TALPANPDTARGFSAN----VILDEFAFHAKSREIWAALFPVISKGGQKLRV--ISTPNG 186

Query: 233 LSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD 292
              KFYE+       W R  +D              ++     D D    E   ++  + 
Sbjct: 187 KGNKFYELMTAEGSVWSRHVVDIHEAVRQGLDRDIDMLRAGMADEDAWAQEYELKWLDEA 246

Query: 293 IDSFIPLNIIEEA---LNREPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHL 345
            ++++  ++I          P      P  +G DIA    D  V+     +        +
Sbjct: 247 -NAWLDYDLISACEHPAAGMPGLYMGGPCFVGVDIAARN-DLFVIWVLELVGDVLWTREV 304

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG-ARTCDYLEMLGYHVYRVLGQK 401
               +   +  +  ++ +  ++R     ID    G     D     G  V  +L   
Sbjct: 305 IARRRVSFQEQDRLLAEVFRRFRVVRCRIDQTGMGEKPVEDAKRAHGDRVEGILFSA 361


>gi|188495109|ref|ZP_03002379.1| terminase [Escherichia coli 53638]
 gi|188490308|gb|EDU65411.1| terminase [Escherichia coli 53638]
          Length = 607

 Score = 57.0 bits (136), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 29/173 (16%), Positives = 54/173 (31%), Gaps = 20/173 (11%)

Query: 244 PLDDWK-RFQIDTRTVEGID-PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF--IPL 299
           P   W+    ++    +G+      E +  RY        +    +F       F    L
Sbjct: 336 PDGIWRYVITMEDACAKGLSARVNIEKLRNRYSAT--AFAMLYMCEFTDSRDTVFKFSDL 393

Query: 300 NIIEEALNREPCPDPYA-------PLIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346
              E         DP A        +  G D +  G ++T V++      +    +  ++
Sbjct: 394 EKCEVEFGIWQDFDPSALRPFGNREVWGGFDPSRTGDNSTFVIVAPPVEPKEKFRVLAVY 453

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVL 398
            W   +      +I  L+++YR   I +D    G    D L       V  + 
Sbjct: 454 QWVGLNFTWQVKQIEELMKRYRFTHIGVDITGIGRGVYDQLVRSAPREVMGIN 506


>gi|149911893|ref|ZP_01900493.1| putative bacteriophage terminase, ATPase subunit [Moritella sp.
           PE36]
 gi|149805043|gb|EDM65069.1| putative bacteriophage terminase, ATPase subunit [Moritella sp.
           PE36]
          Length = 601

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 56/350 (16%), Positives = 101/350 (28%), Gaps = 77/350 (22%)

Query: 88  IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147
           IG T   A+   +         +   A S  Q      AE+ K   +   +  F ++   
Sbjct: 181 IGATFYFAFEAFYDAVVNGRNKIFISA-SRDQ------AEIFKANIIALCREQFGIE--- 230

Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD-- 205
           L  +P        +  +  K  ST  RT      D            +  DE    P   
Sbjct: 231 LSGSPLTMRNKGKTTTLYFK--STNARTAQSASGD------------LYIDEVFWIPKFK 276

Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSF 265
            +        T ++        S P   S + Y+++N     W R        E      
Sbjct: 277 ELRSLAQAMATHKDFRIT--YFSTPSVTSHEAYDLWN---GRWYRKTKACNDPEFAIDVS 331

Query: 266 HEGIIARYGLDSDVTRVEV------------------CGQFPQQDIDSFIPLNIIEEALN 307
           H+ +      D  + R ++                    ++ +++ D+      I++A +
Sbjct: 332 HKTLKHGLLCDDGIWRQKLNVYDVVEQGFDRIDISMLENEYSKEEFDNLFMCKFIDDAHS 391

Query: 308 ----------------------REPCPDPYAPLIMGCDIAEEGGDNTVVVL------RRG 339
                                     P    P+++G D A      +VVVL         
Sbjct: 392 AFSLKQLMACVGNSKKWTDFDPTWSRPYAMKPVVIGFDPARTRDIASVVVLSLPLGPDDK 451

Query: 340 PVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
             +    + S  D  T  ++I  L  KY    I +D    G    + ++ 
Sbjct: 452 FRLLESLNLSGNDFETMASEIKELTLKYHVVHIGVDTTGMGLGVFELIQK 501


>gi|329122644|ref|ZP_08251223.1| terminase [Haemophilus aegyptius ATCC 11116]
 gi|327472658|gb|EGF18087.1| terminase [Haemophilus aegyptius ATCC 11116]
          Length = 202

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 25/158 (15%), Positives = 56/158 (35%), Gaps = 19/158 (12%)

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLV 364
            P     + +G D A  G    +V++           + H   +   D  T  ++I    
Sbjct: 16  RPFGNREVWLGYDPAFTGDRAALVIVAPPKVEGGDYRVLHKQTFHGMDYETQASRIKQFC 75

Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL 424
           + Y    I+ID    G+     +      V  +         E+  + + E+ +K  + +
Sbjct: 76  DDYNVTRIVIDKTGMGSGVYQEVRKFYPMVQGL---------EYNADLKNEMVLKTQNLI 126

Query: 425 EFASL---INHSGLIQNLKSLKSFIVPNTGELAIESKR 459
           +   L      + ++ +  ++K   +  TG++   S R
Sbjct: 127 QKRRLKFDSGDNDIVSSFMTVKK-RITGTGKITYVSDR 163


>gi|83954308|ref|ZP_00963028.1| terminase, large subunit, putative [Sulfitobacter sp. NAS-14.1]
 gi|83841345|gb|EAP80515.1| terminase, large subunit, putative [Sulfitobacter sp. NAS-14.1]
          Length = 408

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 64/423 (15%), Positives = 114/423 (26%), Gaps = 73/423 (17%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGIS---------VICLANSETQLKTTLWAEVSKWL 132
           I  GRG GKT   A    W+ +   G           V  +  +  Q++  +        
Sbjct: 16  IMGGRGAGKTRAGA---EWVRAQVEGSRPLDAGRCRRVALVGETIEQVREVM-------- 64

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
             +         S +     W +                +   ++   P+   G      
Sbjct: 65  --IFGDSGILACSPADRRPDWEATRKRLVWPN-----GAVATVHTAHDPEGLRGPQFD-- 115

Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250
            A   DE +     +     +   L     +    +T+ P R  G    +   P      
Sbjct: 116 -AAWVDELAKWKKAEETWDQLQFAL-RLGEDPRACVTTTP-RNVGVLKNLLASPSTV-TT 171

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
                     +  SF E + ARY   + + R E+ G        +      IE    R+ 
Sbjct: 172 HAPTEANAANLAGSFLEEVRARY-RGTRLGRQELDGVLLADAEGALWTSERIEAGRVRDV 230

Query: 311 CPDPYAPLIMGCDIA---EEGGDNTVVVLRRGPVIEHLFDWS----------KTDLRTTN 357
                  +++G D A     G D   +V+          DW                   
Sbjct: 231 PL--LDRIVVGLDPATTAGAGADECGIVVVGAQTQGPPQDWRAVVLADCTVQGATPSGWA 288

Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRRTE 415
                 +E+Y  D ++ + N  G    + L  +     V  V   +  V        R E
Sbjct: 289 RAAISAMEQYGADRLVAEVNQGGQMVAEVLRQVDPLVPVKSVHASRGKV-------ARAE 341

Query: 416 LHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
               + +      ++    L   +       +   G         +G  S D  D L++ 
Sbjct: 342 PVAALYEQGRVGHVVGLDALEDQMC-----RMTARGY--------EGGGSPDRVDALVWA 388

Query: 476 FAE 478
             E
Sbjct: 389 LHE 391


>gi|169344384|ref|ZP_02865357.1| phage terminase, large subunit, pbsx family [Clostridium
           perfringens C str. JGS1495]
 gi|169297509|gb|EDS79616.1| phage terminase, large subunit, pbsx family [Clostridium
           perfringens C str. JGS1495]
          Length = 415

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 51/334 (15%), Positives = 107/334 (32%), Gaps = 37/334 (11%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            G G GK+      +++     PG   + +    + LK +++A     L       W   
Sbjct: 31  GGGGSGKSHFVVQKMIYKYLKYPGRKCLVVRKVNSTLKESIFA-----LFRSVLSDWQIY 85

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
               ++      ++        +K           E+  +  G  +     I+ +E +  
Sbjct: 86  DECKINKTDLTIELP-------NKSLFIFKGIDDPEKIKSIAGIDD-----IVVEECTEI 133

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK---PLDDWKRFQIDTRTVEG 260
            +     +   L  +N      +  NP   S   Y+ + K      D        +  + 
Sbjct: 134 DEFDFDQLNLRLRSKNPYNQIHVMFNPVSKSNWVYKRWFKNGYDTKDTIVLHTTYKNNKF 193

Query: 261 IDPSFHEGIIARYGLDSDVT-RVEVCGQFPQQDIDSFIPLNIIEEALNREPC--PDPYAP 317
           +   + + ++ +   D+ V  R+   G+F    +D  I  N  EE+ + +     +    
Sbjct: 194 LPKDYIDSLL-KLEKDNPVYFRIYALGEF--ATLDKLIYTNWKEESFDYKEILKNNRNTK 250

Query: 318 LIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTD-----LRTTNNKISGLVEKYRPDAI 372
            I   D          V      + + L+ + +             KI  L   YR + I
Sbjct: 251 AIFSLDFGYTNDPTAFVCSIIDKINKKLWIFDEFQEKGLLNDEIAEKIIDL--GYRKEVI 308

Query: 373 IIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406
           + D+     ++ + L+  G    RV G  +  D 
Sbjct: 309 VCDS--AEPKSIEELKRNGLS--RVKGAVKGRDS 338


>gi|166012063|ref|ZP_02232961.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str.
           E1979001]
 gi|167427125|ref|ZP_02318878.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis
           str. K1973002]
 gi|2996304|gb|AAC13184.1| P-loop protein [Yersinia pestis KIM 10]
 gi|165988997|gb|EDR41298.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str.
           E1979001]
 gi|167053876|gb|EDR63708.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis
           str. K1973002]
          Length = 402

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 48/321 (14%), Positives = 102/321 (31%), Gaps = 46/321 (14%)

Query: 73  PNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWL 132
            +P  FK  + AGR  GK+ L+   ++   +      V  +A +    +  LW ++ + L
Sbjct: 5   QSPHRFKV-VCAGRRWGKSRLSISTIIRAAAKEKKQRVWYVAPTYQMARQILWDDLQEVL 63

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
                            P  W       ++ I  K+ S +      ++PDT  G      
Sbjct: 64  -----------------PRKWVRKKNDTTMTIVLKNGSEIALK-GADKPDTLRGV---AL 102

Query: 193 MAIINDEASGT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-------KP 244
             ++ DE     PD     +   L+        ++   P+    +F++++        + 
Sbjct: 103 HFVVLDEFQDMKPDTWYKVLRPTLSS--TRGGALIIGTPKG-FSEFHKLWTIGQNKDLQR 159

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
              WK +Q  T     +  +  E       +D      E    F       + P +    
Sbjct: 160 KGQWKSWQFVTADSPFVPSAEIEAAKND--MDPKSFAQEYLASFENMSGRVYYPFD--RN 215

Query: 305 ALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-----TDLRTTNNK 359
              +    +P  P+ +G D      D    V+ +      L+   +     ++     ++
Sbjct: 216 VHVKPLQFNPKLPIWVGQD---FNIDPMSSVILQPQPNGELWAVDEVVLFSSNTAEVCDE 272

Query: 360 ISGLVEKYRPD-AIIIDANNT 379
           +     +++    I  D    
Sbjct: 273 LERRFWRWKSQVTIFPDPAGA 293


>gi|323146172|gb|ADX32410.1| terminase ATPase subunit [Cronobacter phage ENT90]
          Length = 587

 Score = 56.7 bits (135), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 30/176 (17%), Positives = 53/176 (30%), Gaps = 27/176 (15%)

Query: 276 DSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE-----------PCPDPYAPLIMGCDI 324
                +  +  +F   +  S  P   ++  +              P P  Y P+ +G D 
Sbjct: 357 SPSEYQNLLMCEFVDDEA-SVFPFAELQTCMIDSLEEWSDFNPYLPRPFDYRPVWIGYDP 415

Query: 325 AEEGGDN--TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378
           +  G      V+   L  G     L    W   D       I  L +KY  + I +DA  
Sbjct: 416 SHTGDSAGCAVIAPPLVAGGKFRVLERHQWRGMDFAAQAKSIEDLTKKYTVEYIGVDATG 475

Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434
            G      +               A ++ +    +T + +K  D +    L   +G
Sbjct: 476 IGQGVFQLVRQ---------FYPAAREIRYSPEVKTAMVLKAKDTISSGRLEYDAG 522


>gi|119869106|ref|YP_939058.1| phage terminase [Mycobacterium sp. KMS]
 gi|119695195|gb|ABL92268.1| phage Terminase [Mycobacterium sp. KMS]
          Length = 489

 Score = 56.7 bits (135), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 63/373 (16%), Positives = 108/373 (28%), Gaps = 71/373 (19%)

Query: 41  KGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLW 100
           KGT       PR WQ++ +  V      +V    P          RG GKTTL+A ++L+
Sbjct: 41  KGTGAREVFRPREWQMDIVRDVLDSGARTVGLMMP----------RGQGKTTLSAAILLY 90

Query: 101 LMSTR-PGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLH 159
           +  TR  G +V+  A  E Q             SL        +Q      +  Y     
Sbjct: 91  IFFTRGEGANVVLFAVDERQ------------ASLAFRVAARMVQLSEDLSSRCYVYADK 138

Query: 160 CSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERN 219
             L +    Y  M        P +         +A + DEA      +          + 
Sbjct: 139 LVLPLTDSTYQVM--------PASAAAAEGLDYVACLCDEAGVINRDVFEVAQLA-QGKR 189

Query: 220 ANRFWIMTSNPRRLSGK--------FYEIFNKPLDD-WKRFQIDTRTV---------EGI 261
                I    P              +           W+ F                E  
Sbjct: 190 ERSVLIAIGTPGPDPNDQVLADLRAYAAEHPDDKSLVWREFSAAGFEDHGADCPHCWELA 249

Query: 262 DPSFHEGIIAR--------YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313
           +P+  + +              ++   R     QF      +F+P  + E      P P 
Sbjct: 250 NPALDDFLHRDALHALLPPKTREATFRRAR-LCQFSTDTDGAFLPAGVWEGLSTSSPVP- 307

Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRR---GPVIEHLFDWS-------KTDLRTTNNKISGL 363
           P   +++  D     GD T +++      P  + +  W        +  +    + I   
Sbjct: 308 PGVDVVLALD-GSYNGDTTALLVGTVSAEPHFDVVQVWDPKGDPDYRVPVAEVEDVIRRS 366

Query: 364 VEKYRPDAIIIDA 376
            ++++   II D 
Sbjct: 367 AKEWQVVEIIADP 379


>gi|171316543|ref|ZP_02905759.1| protein of unknown function DUF264 [Burkholderia ambifaria MEX-5]
 gi|171098271|gb|EDT43077.1| protein of unknown function DUF264 [Burkholderia ambifaria MEX-5]
          Length = 583

 Score = 56.7 bits (135), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 29/137 (21%), Positives = 43/137 (31%), Gaps = 20/137 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-NREPCPDPYAP------- 317
            E +   Y   +D     +  QF    + S  PL  ++  + +     D + P       
Sbjct: 346 LERLKLEY--SADEYANLLLCQFIDDSL-SVFPLATLQTCMVDTWEVWDDFKPLYLRPFG 402

Query: 318 ---LIMGCDIAEEGGDNTVVVLRR----GPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
              + +G D +  G     VVL      G     L  F W   D      +I  L  +YR
Sbjct: 403 DEEVWIGYDPSHTGDSAGCVVLAPPKYPGGKFRVLERFQWHGLDFEAQAAQIEALTRRYR 462

Query: 369 PDAIIIDANNTGARTCD 385
              I ID    G     
Sbjct: 463 VTYIGIDTTGIGQGVYQ 479


>gi|331646084|ref|ZP_08347187.1| terminase, ATPase subunit [Escherichia coli M605]
 gi|331044836|gb|EGI16963.1| terminase, ATPase subunit [Escherichia coli M605]
          Length = 588

 Score = 56.7 bits (135), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 28/143 (19%), Positives = 49/143 (34%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F      S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVDDFA-SVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|213419227|ref|ZP_03352293.1| terminase subunit [Salmonella enterica subsp. enterica serovar
           Typhi str. E01-6750]
          Length = 442

 Score = 56.7 bits (135), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 26/139 (18%), Positives = 47/139 (33%), Gaps = 23/139 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 282 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 338

Query: 315 YAPLIMGCDIAE--EGGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+    V+      G     L    W   D R   + I  L +
Sbjct: 339 WREVWIGYDPAKGTQNGDSAGCVVIAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 398

Query: 366 KYRPDAIIIDANNTGARTC 384
           +Y    I ID+   G    
Sbjct: 399 QYNVTYIGIDSTGVGHGVY 417


>gi|120436787|ref|YP_862473.1| phage terminase large subunit [Gramella forsetii KT0803]
 gi|117578937|emb|CAL67406.1| phage terminase large subunit [Gramella forsetii KT0803]
          Length = 506

 Score = 56.7 bits (135), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 35/210 (16%), Positives = 66/210 (31%), Gaps = 35/210 (16%)

Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY--RPDA 371
                 +  D+A  G D  V+ L  G  I  L    K+  +   ++I  +  K+      
Sbjct: 296 KAGEKYITVDVATSGKDKLVIWLWEGFRIRDLMKIDKSTGKDIIDEIKAMALKWGVPNRR 355

Query: 372 IIIDANNTGA--RTCDYLEMLGYHVYRVLG-QKRAVDLEFCRNRRTELHVKMADWLEFAS 428
           I  DAN  GA     D   +     +      +   D    +N +T+ +V   + +E   
Sbjct: 356 IAYDANGVGAFIGGADNAFIPNSIAFDSNNRPRETKDGRKFKNLKTQCYVLSGERVERNE 415

Query: 429 L----------INHSGLIQNLKSLKSFIVPNTGE--------LAIESKRVKG--AKSTDY 468
           +           +    I+     +   +    +        +  E  + K    +STD 
Sbjct: 416 IWVMPQVANMMFDEKQTIRQRMLAERKAIKKQPKKDEEPQALIKKEEMKAKYLNGESTDL 475

Query: 469 SDGLMY----------TFAENPPRSDMDFG 488
            D  M           T ++      ++FG
Sbjct: 476 LDPFMMREIFELEPPITISKPTKPKGLNFG 505


>gi|331672362|ref|ZP_08373153.1| terminase, ATPase subunit [Escherichia coli TA280]
 gi|331070557|gb|EGI41921.1| terminase, ATPase subunit [Escherichia coli TA280]
          Length = 588

 Score = 56.7 bits (135), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 30/143 (20%), Positives = 52/143 (36%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y LD    +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEYSLDE--YQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|323190958|gb|EFZ76225.1| terminase, ATPase subunit [Escherichia coli RN587/1]
          Length = 591

 Score = 56.7 bits (135), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 26/168 (15%), Positives = 50/168 (29%), Gaps = 20/168 (11%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  E +  RY  +     +     F     DS    + +
Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NDATFNMLYMCVFVDSK-DSVFSFSDL 384

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346
           E           + P          +  G D A  G  +  V++           +  + 
Sbjct: 385 EACGVEIDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIIAPPMLAVEKFRVLKVI 444

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHV 394
            W   + R    +I  L +KY    + +D    G    D ++     V
Sbjct: 445 YWKGMNFRYQAKQIEQLFKKYNFTYLGVDVTGIGQGVFDNIQHFAMRV 492


>gi|204929563|ref|ZP_03220637.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Javiana str. GA_MM04042433]
 gi|204321282|gb|EDZ06482.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Javiana str. GA_MM04042433]
          Length = 588

 Score = 56.7 bits (135), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 28/143 (19%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L +
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|167647878|ref|YP_001685541.1| hypothetical protein Caul_3918 [Caulobacter sp. K31]
 gi|167350308|gb|ABZ73043.1| protein of unknown function DUF264 [Caulobacter sp. K31]
          Length = 439

 Score = 56.7 bits (135), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 73/451 (16%), Positives = 133/451 (29%), Gaps = 67/451 (14%)

Query: 37  PWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTL-NA 95
           PW E           ++W      V +AH    V+     +F      GRG GKT    +
Sbjct: 33  PWTELAPWPVVQDGLKTW-----RVTEAHQKPPVDPWITWLFL----GGRGAGKTFAGAS 83

Query: 96  WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYS 155
           W+       +PG ++  +  +   ++  +          +      +   L      W +
Sbjct: 84  WIAN---QAKPGRNLALVGPTFHDVREVM----------IEGPSGIKSLYLPGDRPKWQA 130

Query: 156 DVLHCSLGIDSKHYSTMCRTYSEERPDTFVG--HHNTYGMAIINDEASGTPDVINL-GIL 212
                           + + +S E PD   G   H         DE    P       +L
Sbjct: 131 SRRRLEFRN-----GAIAQAFSAEDPDALRGPQFHAA-----WADEFCAWPKPAETLAML 180

Query: 213 GFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR 272
            F      +   ++T+ P R       +  +P    +     +   + + P+F   +   
Sbjct: 181 RFGLRLGTDPRLVVTTTP-RPIRALRNLIAEPGAV-QTRAPTSANADHLAPAFLSTLRGL 238

Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDI-AEEGGDN 331
           YG    +   E+ G   +           +  A  R   P  +  +++  D  A   GD 
Sbjct: 239 YGGT-RLAAQELDGLIVEG-EGGLFRAEDL--ARCRGAPPAAFDRVVVAIDPPATATGDA 294

Query: 332 T--VVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387
              VV  R G     L D +           +      ++  DA++ +AN  G      L
Sbjct: 295 CGIVVCGRFGDRAFVLADRTAKGLSPNGWARRAVDAAVRFDADALVAEANQGGDMVRSVL 354

Query: 388 EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIV 447
                    V   K +V           L+ +    +   +       +  L S      
Sbjct: 355 -AQAAPPCPVKLVKASVGKRARAEPVAALYEQGRV-VHCGAFPALEEELMALGS------ 406

Query: 448 PNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478
              G+L           S D +D L++  +E
Sbjct: 407 ---GDL---------GHSPDRADALVWALSE 425


>gi|99080642|ref|YP_612796.1| hypothetical protein TM1040_0801 [Ruegeria sp. TM1040]
 gi|99036922|gb|ABF63534.1| hypothetical protein TM1040_0801 [Ruegeria sp. TM1040]
          Length = 416

 Score = 56.7 bits (135), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 51/307 (16%), Positives = 93/307 (30%), Gaps = 37/307 (12%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            G G GKT +    +       P       A +   ++ T W  V          H    
Sbjct: 27  GGFGSGKTYVGCLDLGLFAGQHPKTVQGYFAPTYRDIRDTFWPTV------DEAAHSLGF 80

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
            +         +D         S + +T+CR  S + P   VG      +    DE    
Sbjct: 81  TTKVKS-----ADKEVEFYRGRSYYGTTICR--SMDDPGGIVGFKIARAL---VDE---I 127

Query: 204 PDVINLGILGFLTERNANRFWIM------TSNPRRLSG--KFYEIFNK-PLDDWKRFQID 254
             +          +  A    ++              G    Y+ F + P  ++   Q  
Sbjct: 128 DILSKDKAQAAWRKIIARMRLVLPGVVNGIGVTTTPEGFRFVYDSFKREPKSNYSMVQAS 187

Query: 255 TRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313
           T   E  + P +   ++  Y    ++ +  + G+F      +    +             
Sbjct: 188 TYENEAFLPPDYISTLLEDY--PEELIKAYLMGEFVNLTSGTVY-RSYDRLRHRSTQSIQ 244

Query: 314 PYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII 373
           P  PL +G D    G   +VV ++RG     + +        T + I  L ++Y    + 
Sbjct: 245 PREPLHIGQDF-NVGNMASVVFVQRGEDWHAVDELQGLQ--DTPHLIEVLCDRYEGHHLT 301

Query: 374 I--DANN 378
           I  DA+ 
Sbjct: 302 IYPDASG 308


>gi|296104758|ref|YP_003614904.1| terminase, ATPase subunit [Enterobacter cloacae subsp. cloacae ATCC
           13047]
 gi|295059217|gb|ADF63955.1| terminase, ATPase subunit [Enterobacter cloacae subsp. cloacae ATCC
           13047]
          Length = 572

 Score = 56.7 bits (135), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 28/143 (19%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 332 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWADFQALALRPFG 388

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L +
Sbjct: 389 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 448

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 449 QYNVTYIGIDSTGVGHGVYENVK 471


>gi|146311014|ref|YP_001176088.1| hypothetical protein Ent638_1356 [Enterobacter sp. 638]
 gi|145317890|gb|ABP60037.1| protein of unknown function DUF264 [Enterobacter sp. 638]
          Length = 254

 Score = 56.7 bits (135), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 14  IDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQVCMVDSWEVWSDFHALALRPFG 70

Query: 315 YAPLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+    G     VV+      G     L    W   D R   + I  L +
Sbjct: 71  WREVWIGYDPAKGTQNGDSAGCVVMAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 130

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 131 QYNVTYIGIDSTGVGHGVYENVK 153


>gi|55163155|emb|CAH61098.1| large terminase subunit [Yersinia enterocolitica subsp. palearctica
           Y11]
          Length = 202

 Score = 56.7 bits (135), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 28/129 (21%), Positives = 40/129 (31%), Gaps = 15/129 (11%)

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGL 363
             P  Y P+ MG D +  G     VV+      G     L    W   D       I  L
Sbjct: 16  EQPFNYHPVWMGYDPSHTGDSAGCVVMAPPWVPGGKFRILERHQWKGMDFADQAESIKKL 75

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
            EKY  + I IDA   G      +               A ++ +    +T + +K  D 
Sbjct: 76  TEKYNVEYIGIDATGIGQGVYQLVR---------NFFPAAREIRYSAEVKTNMVLKAKDL 126

Query: 424 LEFASLINH 432
           +    L   
Sbjct: 127 ITTGRLEYD 135


>gi|200389255|ref|ZP_03215867.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar Virchow str. SL491]
 gi|199606353|gb|EDZ04898.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar Virchow str. SL491]
          Length = 591

 Score = 56.3 bits (134), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 35/211 (16%), Positives = 69/211 (32%), Gaps = 28/211 (13%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  E +  RY  ++    +     F   + DS    + +
Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NTATFNMLYMCVFVD-NKDSVFSFSDL 384

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346
           E           + P          +  G D A  G  +  V++           +  + 
Sbjct: 385 EACGVEVDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIVAPPMFAVEKFRVLKVI 444

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406
            W   + R    +I  L +KY    + +D    G    D ++     V        AV +
Sbjct: 445 YWKGMNFRYQAKQIEQLFKKYNFTYLGVDVTGIGQGVFDNIQHFAMRV--------AVAI 496

Query: 407 EFCRNRRTELHVKMADWLEFASLINHSGLIQ 437
            +  N + +L +K AD +E   +     L +
Sbjct: 497 RYDLNTKNQLVLKAADVVESQRIEWDKNLKE 527


>gi|194443211|ref|YP_002041983.1| hypothetical protein SNSL254_A2937 [Salmonella enterica subsp.
           enterica serovar Newport str. SL254]
 gi|194401874|gb|ACF62096.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar Newport str. SL254]
          Length = 591

 Score = 56.3 bits (134), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 35/211 (16%), Positives = 69/211 (32%), Gaps = 28/211 (13%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  E +  RY  ++    +     F   + DS    + +
Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NTATFNMLYMCVFVD-NKDSVFSFSDL 384

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346
           E           + P          +  G D A  G  +  V++           +  + 
Sbjct: 385 EACGVEVDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIVAPPMFAVEKFRVLKVI 444

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406
            W   + R    +I  L +KY    + +D    G    D ++     V        AV +
Sbjct: 445 YWKGMNFRYQAKQIEQLFKKYNFTYLGVDVTGIGQGVFDNIQHFAMRV--------AVAI 496

Query: 407 EFCRNRRTELHVKMADWLEFASLINHSGLIQ 437
            +  N + +L +K AD +E   +     L +
Sbjct: 497 RYDLNTKNQLVLKAADVVESQRIEWDKNLKE 527


>gi|324009700|gb|EGB78919.1| hypothetical protein HMPREF9532_00529 [Escherichia coli MS 57-2]
          Length = 588

 Score = 56.3 bits (134), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRIEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|168752369|ref|ZP_02777391.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4113]
 gi|168756331|ref|ZP_02781338.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4401]
 gi|168770046|ref|ZP_02795053.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4486]
 gi|168775976|ref|ZP_02800983.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4196]
 gi|168782400|ref|ZP_02807407.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4076]
 gi|168799001|ref|ZP_02824008.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC508]
 gi|195938306|ref|ZP_03083688.1| Phage protein P [Escherichia coli O157:H7 str. EC4024]
 gi|208807993|ref|ZP_03250330.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4206]
 gi|208814612|ref|ZP_03255941.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4045]
 gi|208819940|ref|ZP_03260260.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4042]
 gi|209400321|ref|YP_002271352.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4115]
 gi|254793895|ref|YP_003078732.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           TW14359]
 gi|187768594|gb|EDU32438.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4196]
 gi|188013771|gb|EDU51893.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4113]
 gi|189000116|gb|EDU69102.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4076]
 gi|189356443|gb|EDU74862.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4401]
 gi|189360957|gb|EDU79376.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4486]
 gi|189378450|gb|EDU96866.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC508]
 gi|208727794|gb|EDZ77395.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4206]
 gi|208735889|gb|EDZ84576.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4045]
 gi|208740063|gb|EDZ87745.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4042]
 gi|209161721|gb|ACI39154.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           EC4115]
 gi|254593295|gb|ACT72656.1| phage large terminase subunit GpP [Escherichia coli O157:H7 str.
           TW14359]
 gi|326344901|gb|EGD68646.1| Phage terminase, ATPase subunit [Escherichia coli O157:H7 str.
           1125]
          Length = 590

 Score = 56.3 bits (134), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 59/207 (28%), Gaps = 33/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 351 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 407

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 408 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 467

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +               A D+ +    +T + +K  D +    
Sbjct: 468 VEYIGIDATGLGVGVFLLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG- 517

Query: 429 LINHSGLIQNLKS---LKSFIVPNTGE 452
            + +     ++ S        + ++G 
Sbjct: 518 CLEYDVSATDITSSFMAIRKTMTSSGR 544


>gi|213620832|ref|ZP_03373615.1| putative prophage terminase large subunit [Salmonella enterica
           subsp. enterica serovar Typhi str. E98-2068]
          Length = 130

 Score = 56.3 bits (134), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 18/103 (17%), Positives = 34/103 (33%), Gaps = 22/103 (21%)

Query: 404 VDLEFCRNRRTELHVKMADWLE-FASLINHSG--LIQNLKSL----------------KS 444
            + +F  N + +    +AD      + IN+    L+  L S+                  
Sbjct: 19  PNKDFFANLKAQAWWLVADRFRNTFNAINNGEQYLVDELISIDSRCPLLEKLKLELTTPH 78

Query: 445 FIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
                 G + +ESK+    +   S + +D  +  FA      D
Sbjct: 79  RDFDRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 121


>gi|213027436|ref|ZP_03341883.1| Phage protein P [Salmonella enterica subsp. enterica serovar Typhi
           str. 404ty]
          Length = 222

 Score = 56.3 bits (134), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 27/140 (19%), Positives = 39/140 (27%), Gaps = 20/140 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 81  IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 137

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 138 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 197

Query: 369 PDAIIIDANNTGARTCDYLE 388
            + I IDA   G      + 
Sbjct: 198 VEYIGIDATGLGVGVFQLVR 217


>gi|213026708|ref|ZP_03341155.1| putative prophage terminase large subunit [Salmonella enterica
           subsp. enterica serovar Typhi str. 404ty]
          Length = 143

 Score = 56.3 bits (134), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 18/103 (17%), Positives = 34/103 (33%), Gaps = 22/103 (21%)

Query: 404 VDLEFCRNRRTELHVKMADWLE-FASLINHSG--LIQNLKSL----------------KS 444
            + +F  N + +    +AD      + IN+    L+  L S+                  
Sbjct: 32  PNKDFFANLKAQAWWLVADRFRNTFNAINNGEQYLVDELISIDSRCPLLEKLKLELTTPH 91

Query: 445 FIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
                 G + +ESK+    +   S + +D  +  FA      D
Sbjct: 92  RDFDRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 134


>gi|29144574|ref|NP_807916.1| terminase subunit [Salmonella enterica subsp. enterica serovar
           Typhi str. Ty2]
 gi|29140212|gb|AAO71776.1| probable terminase subunit [Salmonella enterica subsp. enterica
           serovar Typhi str. Ty2]
          Length = 588

 Score = 56.3 bits (134), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 26/139 (18%), Positives = 47/139 (33%), Gaps = 23/139 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+    V+      G     L    W   D R   + I  L +
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVIAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464

Query: 366 KYRPDAIIIDANNTGARTC 384
           +Y    I ID+   G    
Sbjct: 465 QYNVTYIGIDSTGVGHGVY 483


>gi|76788305|ref|YP_329267.1| prophage LambdaSa03, terminase, large subunit [Streptococcus
           agalactiae A909]
 gi|76563362|gb|ABA45946.1| prophage LambdaSa03, terminase, large subunit, putative
           [Streptococcus agalactiae A909]
          Length = 471

 Score = 56.3 bits (134), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 58/346 (16%), Positives = 114/346 (32%), Gaps = 47/346 (13%)

Query: 54  WQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICL 113
           WQ   +  +    +N  N    + +  AI   R  GKT +   L LW +    G+ ++  
Sbjct: 44  WQENML--IPMMAINEDNLWVHQKYGYAIP--RRNGKTEVVYILELWAL--HKGLKILHT 97

Query: 114 ANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173
           A+  +   +  + +V K+L +     + + +    + A     +   S G   +  +   
Sbjct: 98  AHRISTSHS-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKSSGSVIQFRT--- 150

Query: 174 RTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR- 232
           RT +    + F          +I DEA          +   +T+ + N   IM   P   
Sbjct: 151 RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPTM 201

Query: 233 -LSGKFYEIFNKP-------LDDWKRFQIDTRTVEGIDPSF------------HEGIIAR 272
             +G  +E + K           W  + +D         S+               I A 
Sbjct: 202 VSTGTVFESYRKECLKGDRRYSGWAEWSVDEMQPIHDVKSWYVANPSMGYHLNERKIEAE 261

Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN 331
            G D     ++  G +P  +  S I      +  L  E  P+  + L +G    ++G + 
Sbjct: 262 LGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNNV 319

Query: 332 T-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
           +  +  R       +       +R     I   ++      ++ID 
Sbjct: 320 SLSIAARASENKVFVEAIDCLSIRNGTQWIINFLKSADIAKVVIDG 365


>gi|194450112|ref|YP_002047236.1| hypothetical protein SeHA_C3493 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. SL476]
 gi|194408416|gb|ACF68635.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar Heidelberg str. SL476]
          Length = 591

 Score = 56.3 bits (134), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 26/168 (15%), Positives = 51/168 (30%), Gaps = 20/168 (11%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  E +  RY  ++    +     F     DS    + +
Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NTATFNMLYMCVFVDSK-DSVFSFSDL 384

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346
           E           + P          +  G D A  G  +  V++           +  + 
Sbjct: 385 EACGVEVDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIVAPPMFAVEKFRVLKVI 444

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHV 394
            W   + R    +I  L +KY    + +D    G    D ++     V
Sbjct: 445 YWKGMNFRYQAKQIEQLFKKYNFTYLGVDVTGIGQGVFDNIQHFAMRV 492


>gi|320177430|gb|EFW52430.1| Phage terminase, ATPase subunit [Shigella dysenteriae CDC 74-1112]
          Length = 588

 Score = 56.3 bits (134), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 50/143 (34%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQACMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R     I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQAYAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|317152051|ref|YP_004120099.1| hypothetical protein Daes_0328 [Desulfovibrio aespoeensis Aspo-2]
 gi|316942302|gb|ADU61353.1| protein of unknown function DUF264 [Desulfovibrio aespoeensis
           Aspo-2]
          Length = 428

 Score = 56.3 bits (134), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 50/333 (15%), Positives = 95/333 (28%), Gaps = 41/333 (12%)

Query: 79  KGAISAGRG-IGKTTLNAWLVLWLMST--RPGISVICLANSETQLKTTLWAEVSKWLSLL 135
           + ++       GKT L+   ++    T  R       +A    Q KT +W E+ ++    
Sbjct: 21  RFSVLVCHRRFGKTVLSVNRLIRAARTTSRTDWRGAYIAPLYKQAKTVVWDELKRY---- 76

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
                 +  ++  +     +D                 R +  E PD+  G +       
Sbjct: 77  -CGLGLDGCTVKFNETELRADF----------DNGARIRLFGAENPDSLRGMYLDGA--- 122

Query: 196 INDEASGTPDVIN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL--DDWKRFQ 252
           + DE +  P  +    I   L++R     +     PR  +   Y ++       DW    
Sbjct: 123 VFDEVAQMPHRVWTEVIRPALSDRMGWAMF--IGTPRGKNA-LYRLWQDARRDPDWFAAM 179

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCP 312
                   I+P           +  +    E    F      ++    I E         
Sbjct: 180 YRASQTGIIEPGELAAAARE--MSPEEYEQEFECSFTAAIRGAYFSALIGEADKGGRITK 237

Query: 313 DPYAPLIMGCDIAEEGG--DNTVVVL---RRGPVIEHLFDWSKTDL------RTTNNKIS 361
            P+ P  +    A + G  D+T +     R G     +  +  +        R  + +  
Sbjct: 238 VPHDP-SLPVHTAWDLGMSDSTAIWFVQARPGNAFAIVDYYEASGEGLDHYARVLDERRY 296

Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLEMLGYHV 394
                  P  I +    TG    +    LG   
Sbjct: 297 AYGSHIAPHDIRVRELGTGKSRLEIARALGIRF 329


>gi|324115391|gb|EGC09344.1| terminase [Escherichia coli E1167]
          Length = 492

 Score = 55.9 bits (133), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 27/140 (19%), Positives = 39/140 (27%), Gaps = 20/140 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +       +D  +     +F      S  P   ++  +                   
Sbjct: 322 IEQLKRE--NSADDFKNLFMCEFVDDKA-SVFPFEELQRCMVDTLEEWEDYAPFAANPFG 378

Query: 317 --PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
             P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 379 SRPVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYN 438

Query: 369 PDAIIIDANNTGARTCDYLE 388
            + I IDA   G      + 
Sbjct: 439 VEYIGIDATGLGVGVFQLVR 458


>gi|323185221|gb|EFZ70586.1| terminase, ATPase subunit [Escherichia coli 1357]
          Length = 588

 Score = 55.9 bits (133), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 29/143 (20%), Positives = 51/143 (35%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFVD-DLASVFPLSELQAGMVDSWEVWTDFHALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+   VVV      G     L    W   D R   + I  L E
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTE 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGHGVYENVK 487


>gi|315655961|ref|ZP_07908859.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 51333]
 gi|315490025|gb|EFU79652.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 51333]
          Length = 460

 Score = 55.9 bits (133), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 73/432 (16%), Positives = 123/432 (28%), Gaps = 73/432 (16%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GKT   A LV       PG  +  +A  E+ +++  +                  
Sbjct: 62  TGRGWGKTRTAAELVRDWAK-NPGTQIAVVAKKESLVRSICF---------EHKTSGLLH 111

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE-ASG 202
                  A + +        +  K+ ST+   +  E PD   G           DE A+ 
Sbjct: 112 VIPKSDQARFNASGGSGRFFLQLKNGSTI-YGFGAEVPDNLRGFAFDKA---WFDEFAAW 167

Query: 203 TPDVINLGILGFLTE-RNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGI 261
                         + R +    ++ S   +      ++ +KP     R       +  +
Sbjct: 168 NKQTAQEVYDMMWYDLRESPSPQMVISTTPKPLKHVRDLVSKPGVVITRGH-TKDNLPNL 226

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMG 321
                E +   YG    + R E+ G+  +    +   + + ++ + R     P   +++G
Sbjct: 227 SAIALEKLERDYGKT-RLGRQELAGELIESIEGALWDVTMFQDPVFRPDTMPPLEDIVVG 285

Query: 322 CDIA---EEGGDNTVV---------------VLRRGPVIEHLFDWSKTDLRTTNNKISGL 363
            D A    EG D T                  L  G ++E +        R    K   L
Sbjct: 286 VDPAVRSSEGADMTAFTVAARAEDAPGMFPDHLNHGYILEAIQGH--YTPRDAMAKAGEL 343

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEML--GYHVYRVLG-----QKRAVDLEFCRNRRTEL 416
             KY    ++++ANN G      L+M+  G     V        +           R   
Sbjct: 344 ARKYGASRVVLEANNGGEYLPTVLQMVAPGVPWKIVHAQQDKRGRAMPVATLYEQGRIHH 403

Query: 417 HVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSD----GL 472
           +                 L   +       V  TG          G KS D  D     L
Sbjct: 404 Y---------GGAEKFEDLESQM-------VTYTG--------AAGEKSPDLLDSMVWAL 439

Query: 473 MYTFAENPPRSD 484
              F       D
Sbjct: 440 TELFLSPVGHGD 451


>gi|94970433|ref|YP_592481.1| hypothetical protein Acid345_3406 [Candidatus Koribacter versatilis
           Ellin345]
 gi|94552483|gb|ABF42407.1| protein of unknown function DUF264 [Candidatus Koribacter
           versatilis Ellin345]
          Length = 482

 Score = 55.9 bits (133), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 60/331 (18%), Positives = 109/331 (32%), Gaps = 54/331 (16%)

Query: 180 RPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYE 239
            PDT  G+H      ++ DE  G            +   +      + S P   +GKF+E
Sbjct: 123 NPDTVRGYHGD----VVLDE-FGFHRDAKKIYKAAIAIASRGYQLEVISTPNEQAGKFWE 177

Query: 240 IFNKP--------------LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC 285
           I                     W    +D  T           ++ +   D D  + E C
Sbjct: 178 IAKAAGVPADGGSERTHWTKGVWSVHWLDIYTAVKEGCPIDVEVMRQACYDDDTWQQEYC 237

Query: 286 GQFPQQDIDSFIPLNIIEEALNR------EPCPDPYAPLIMGCDIAEEGGDNTVVVLR-- 337
             F     + +IP+ +I  A ++       P       L +G DI  +  D TV+ +   
Sbjct: 238 CVFLADAQN-YIPMELIIAAESQMASLDARPEDLAGRELYLGMDIGRK-KDRTVIWIDEK 295

Query: 338 --RGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVY 395
                +   +    +T       + +  +   R     ID+   GA+  + LE       
Sbjct: 296 LGDVMITRAVETLERTPFAKQFEQAAAWMPYVRRGC--IDSTGIGAQIGEDLERK----- 348

Query: 396 RVLGQKRAVDLEF-CRNRRTELHVKMADWLE--FASLINHSGLIQNLKSLKSFIVPNTGE 452
              G  +   +EF   N+ T +       LE   A +     + + + ++K +  P TG 
Sbjct: 349 --FGAAKVEKVEFNIANKET-MAGLAKRKLEDRQARIPESPSIRRAINAVKRYTSP-TGH 404

Query: 453 LAIESKRVKGAKSTDYSD-----GLMYTFAE 478
              ++ R +      ++D      L  + AE
Sbjct: 405 FRFDADRTEAG----HADEFWAFALCLSAAE 431


>gi|56414723|ref|YP_151798.1| phage gene [Salmonella enterica subsp. enterica serovar Paratyphi A
           str. ATCC 9150]
 gi|197363650|ref|YP_002143287.1| phage gene [Salmonella enterica subsp. enterica serovar Paratyphi A
           str. AKU_12601]
 gi|56128980|gb|AAV78486.1| putative phage gene [Salmonella enterica subsp. enterica serovar
           Paratyphi A str. ATCC 9150]
 gi|197095127|emb|CAR60674.1| putative phage gene [Salmonella enterica subsp. enterica serovar
           Paratyphi A str. AKU_12601]
          Length = 591

 Score = 55.9 bits (133), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 26/168 (15%), Positives = 52/168 (30%), Gaps = 20/168 (11%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  E +  RY  ++    +     F   + DS    + +
Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NTATFNMLYMCVFVD-NKDSVFSFSDL 384

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346
           E           + P          +  G D A  G  +  V++           +  + 
Sbjct: 385 EACGVEVDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIVAPPMFAVEKFRVLKVI 444

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHV 394
            W   + R    +I  L +KY    + +D    G    D ++     V
Sbjct: 445 YWKGMNFRYQAKQIEQLFKKYNFTYLGVDVTGIGQGVFDNIQHFAMRV 492


>gi|262279834|ref|ZP_06057619.1| conserved hypothetical protein [Acinetobacter calcoaceticus
           RUH2202]
 gi|262260185|gb|EEY78918.1| conserved hypothetical protein [Acinetobacter calcoaceticus
           RUH2202]
          Length = 416

 Score = 55.9 bits (133), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 54/322 (16%), Positives = 98/322 (30%), Gaps = 50/322 (15%)

Query: 78  FKGAISAGRGIGKTTLNAW--------LVLWLMSTRPGISVICLANSETQLKTTLWAEVS 129
           F+ A+  GR  GKT L              W +S      +   A +  Q K   W  + 
Sbjct: 10  FRDAV-CGRRFGKTFLAKAEMRRAARLAAKWNVSVEDE--IWYAAPTFKQAKRVFWKRLK 66

Query: 130 KWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189
           + +                 PA W +   + +    +     + R    +  D   G   
Sbjct: 67  QAI-----------------PASWRAGKPNETECSITLRSGHVIRVVGLDNYDDLRG--- 106

Query: 190 TYGMAIINDEASGTPDVIN-LGILGFLTE--------RNANRFWIMTSNPRRLSGKFYEI 240
           +    +I DE +          +   L+         +      +    P+      Y+ 
Sbjct: 107 SGLFFLIIDEWADCKWAAWEEVLRPMLSTCKYVVNGVQRVGGHVLRIGTPKG-FNHCYDT 165

Query: 241 F--NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298
           F   +P  +         +++G +    E I+A+  +D      E    F       F  
Sbjct: 166 FMDGQPGHEPDCKSFSYTSLQGGNIPESEIIVAKRKMDPKTFSQEYEASFESYQGVIFYC 225

Query: 299 LNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNN 358
            N +  A            L +G D         VV +RRG  +  + ++   +L  T  
Sbjct: 226 FNRLLSA--STETVQANDVLHVGMDFNVTKM-AAVVYVRRGEQMHAVDEF--VNLFDTPA 280

Query: 359 KISGLVEKYRPDAIII--DANN 378
            I  + E+Y    I +  DA+ 
Sbjct: 281 MIEAIQERYPDHEIAVYPDASG 302


>gi|308187132|ref|YP_003931263.1| Terminase, ATPase subunit (GpP) [Pantoea vagans C9-1]
 gi|308057642|gb|ADO09814.1| Terminase, ATPase subunit (GpP) [Pantoea vagans C9-1]
          Length = 587

 Score = 55.9 bits (133), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 30/176 (17%), Positives = 54/176 (30%), Gaps = 27/176 (15%)

Query: 276 DSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE-----------PCPDPYAPLIMGCDI 324
                +  +  +F   +  S  P   ++  +              P P  Y P+ +G D 
Sbjct: 357 SPAEYQNLLMCEFVDDEA-SVFPFAELQTCMIDSLEEWEDFNPYLPRPFAYRPVWIGYDP 415

Query: 325 AEEGGDN--TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378
           +  G      V+   L  G     L    W   D       I  L +KY  + I +DA  
Sbjct: 416 SHTGDSAGCAVIAPPLVAGGKFRVLERHQWRGMDFAAQAKSIEDLTKKYTVEYIGVDATG 475

Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434
            G      +               A ++++    +T + +K  D +    L   +G
Sbjct: 476 IGQGVFQLVRQ---------FYPAAREIKYSPEVKTAMVLKAKDTISSGRLEYDAG 522


>gi|221214652|ref|ZP_03587622.1| putative ATPase subunit of terminase (gpP-like) [Burkholderia
           multivorans CGD1]
 gi|221165542|gb|EED98018.1| putative ATPase subunit of terminase (gpP-like) [Burkholderia
           multivorans CGD1]
          Length = 583

 Score = 55.9 bits (133), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 28/137 (20%), Positives = 44/137 (32%), Gaps = 20/137 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-NREPCPDPYAP------- 317
            E +   Y   +D     +  QF    + S  PL+ ++  + +     D + P       
Sbjct: 346 LERLKLEY--SADEYANLLLCQFIDDSL-SVFPLSALQPCMVDTWEVWDDFKPLYLRPFG 402

Query: 318 ---LIMGCDIAEEGGDNTVVVLRR----GPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
              + +G D +  G     VV+      G     L  F W   D      +I  L  +YR
Sbjct: 403 DEDVWIGYDPSHTGDSAGCVVVAPPKYPGGKFRVLERFQWHGLDFEAQAGQIEALTRRYR 462

Query: 369 PDAIIIDANNTGARTCD 385
              I ID    G     
Sbjct: 463 VTYIGIDTTGIGQGVYQ 479


>gi|257460901|ref|ZP_05626002.1| transposase family protein [Campylobacter gracilis RM3268]
 gi|257442232|gb|EEV17374.1| transposase family protein [Campylobacter gracilis RM3268]
          Length = 518

 Score = 55.9 bits (133), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 60/372 (16%), Positives = 117/372 (31%), Gaps = 61/372 (16%)

Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS----EERPDTFVGHHNTYGMAIIN 197
           E   + ++ +  ++  L  S   DS++  ++    +         T  G        I  
Sbjct: 171 EQARILMNYSQMWAKKLGVSFAKDSEYEKSLDNGATIRVMAHNFRTVQGFTGD----IWM 226

Query: 198 DEASGTPDV--INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDW--KRFQI 253
           DE +  P+   I    +  +          + S P   +  F E+F   L  +   R ++
Sbjct: 227 DEFAWYPNQKRIWHAFVPSI--GAVAGRLTILSTPFEENSFFAELFGDELKFYMFSRHRV 284

Query: 254 DT-RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP-- 310
           D  R + G      E + A +  D+D        QF   +  + + +++I+  ++     
Sbjct: 285 DIYRAMAGGLKFDLETMRALF--DADTWASAYECQFVDDES-ALLGIDLIKSCVSDFTPT 341

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVV-VLRRGPVIEHL---FDWSKTDLRTTNNKISGLVEK 366
            P    P+  G D+      +  + V   G  I+ L      +K       N ++  +  
Sbjct: 342 LPPKNIPVFSGYDVGRTKDRSVHMGVYDAGEGIKRLCLYDVIAKASFEAQENLLTDFLRL 401

Query: 367 YRPDAIIIDANNTGARTCDYLE-MLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE 425
                + ID    G    + L+      V  V       +     N +           +
Sbjct: 402 NLLAYLKIDKTGIGMPVAERLKSRFTSRVSGVYFTASVKEA-LALNLKKHFED------K 454

Query: 426 FASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKS----TDY-----SD-----G 471
             S+ N   LI +L ++               KR  G KS    +D      +D      
Sbjct: 455 SISIPNDPLLIADLHAI---------------KRKAGQKSFLYDSDRNEHGHADRFWALA 499

Query: 472 LMYTFAENPPRS 483
           L  ++ E     
Sbjct: 500 LALSYFEKVRER 511


>gi|152994622|ref|YP_001339457.1| hypothetical protein Mmwyl1_0587 [Marinomonas sp. MWYL1]
 gi|150835546|gb|ABR69522.1| protein of unknown function DUF264 [Marinomonas sp. MWYL1]
          Length = 601

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 39/267 (14%), Positives = 80/267 (29%), Gaps = 35/267 (13%)

Query: 244 PLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W++   I+     G D    E +       +D    +   ++   D +S   L  +
Sbjct: 345 PDKMWRQIVTIEDAEEGGCDLFDIEQLRDE--NSTDAFNNKYLCKWID-DANSVFTLAKL 401

Query: 303 EEALNREPCPDPYA----------PLIMGCDIAEEGGDNTV------VVLRRGPVIEHLF 346
              +        Y           P+ +G D +    + ++      +       +    
Sbjct: 402 LSCMVDTETWTDYHKDAGQPFGNRPVAIGYDPSRTTDNASLALLSIPLGASDPWRLLKKD 461

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406
            W   + +    +I    EK+    I ID +  G    + +E     V  +         
Sbjct: 462 SWRGVNFQWQAARIKEEKEKHNVKHIGIDVSGIGRGVFELVEQFYRRVTPI--------- 512

Query: 407 EFCRNRRTELHVKMADWLEFAS---LINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGA 463
            +    +TEL +K  D +E            + Q    +    V + G++   + R    
Sbjct: 513 TYSVQTKTELVLKALDLIENGLFKFSAGDKEVAQAFMMITK-KVTDNGQITYVANRSNAT 571

Query: 464 KSTDYSDGLMYTFAENP--PRSDMDFG 488
              D +  +M+ F   P  P+      
Sbjct: 572 GHADVAWAIMHAFNYEPIAPKRKTTVA 598


>gi|170731562|ref|YP_001763509.1| hypothetical protein Bcenmc03_0207 [Burkholderia cenocepacia MC0-3]
 gi|169814804|gb|ACA89387.1| protein of unknown function DUF264 [Burkholderia cenocepacia MC0-3]
          Length = 583

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 29/137 (21%), Positives = 43/137 (31%), Gaps = 20/137 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-NREPCPDPYAP------- 317
            E +   Y   +D     +  QF    + S  PL  ++  + +     D + P       
Sbjct: 346 LERLKLEY--SADEYANLLLCQFIDDSL-SVFPLATLQTCMVDTWEVWDDFKPLYLRPFG 402

Query: 318 ---LIMGCDIAEEGGDNTVVVLRR----GPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
              + +G D +  G     VVL      G     L  F W   D      +I  L  +YR
Sbjct: 403 DEEVWIGYDPSHTGDSAGCVVLAPPKYPGGKFRVLERFQWHGLDFEAQAAQIEALTTRYR 462

Query: 369 PDAIIIDANNTGARTCD 385
              I ID    G     
Sbjct: 463 VTYIGIDTTGIGQGVYQ 479


>gi|262193957|ref|YP_003265166.1| hypothetical protein Hoch_0641 [Haliangium ochraceum DSM 14365]
 gi|262077304|gb|ACY13273.1| protein of unknown function DUF264 [Haliangium ochraceum DSM 14365]
          Length = 478

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 43/287 (14%), Positives = 87/287 (30%), Gaps = 52/287 (18%)

Query: 227 TSNPRRLSGKFYEI----------FNKPLDDWKRFQIDTRTVEGIDPSF----HEGIIAR 272
            S P    G F+EI            +    W R +     ++           E  +A 
Sbjct: 183 CSTPLGRRGIFWEISTEELRKYPHHTRDEVPWWRCRFFCIDIDRAMREAPHMPTEERVAA 242

Query: 273 YG-------LDS---DVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE--------PCPDP 314
           +G       LDS   +  + E    F  +   S+ P  +I    + +          P+P
Sbjct: 243 FGTQAIAQQLDSLALEDFQQEFECSFVDESY-SYYPYELILPCTSEDLVLAGDFTDLPEP 301

Query: 315 YAPLIMGCDIAEEGG-DNTVVVLRRGPV--IEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371
              ++ G D+          V   +G       L  + +         +   +++     
Sbjct: 302 EGRIVAGFDVGRTRDHSELAVFEDKGGHFVCRLLRRYDQVPFAEQEADLRRFLDRVPVAR 361

Query: 372 IIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL---EFAS 428
           + ID +  G    + L      V            +   N   E        L   +   
Sbjct: 362 LSIDQSGIGMHLAENLARDYAQVVG----------DTFTNDNKERWATDLKILFQRKDIV 411

Query: 429 LINHSGLIQNLKSLKSFIVPNTGELAIESKRV-KGAKSTDYSDGLMY 474
           L     L+  + S+K  ++P+ G++  +++R  +G  + D    +  
Sbjct: 412 LPRDRELVGQIHSIKRRVLPS-GKVGFDAERSTRGGHA-DRFWAIAL 456


>gi|156976253|ref|YP_001447159.1| hypothetical protein VIBHAR_05025 [Vibrio harveyi ATCC BAA-1116]
 gi|156527847|gb|ABU72932.1| hypothetical protein VIBHAR_05025 [Vibrio harveyi ATCC BAA-1116]
          Length = 593

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 34/259 (13%), Positives = 70/259 (27%), Gaps = 53/259 (20%)

Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247
           +  DE    P  D +N       T +N  +     S P   + + Y     + + +  D 
Sbjct: 258 VYVDEYFWIPKFDELNKLASAMATHKNWRKT--YFSTPSAKTHQAYTFWTGDQWRQGRDT 315

Query: 248 WKRFQIDT----RTVEGIDPSF--------------------HEGIIARYGLDSDVTRVE 283
               +  T    R    + P                       + +   Y  D       
Sbjct: 316 RANIEFPTFDDYRDGGRLCPDKQWRYVVTIEDAAAGGCELFDIDELRDEYSKDD--FDNL 373

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV 333
               F      S    + +E+A+        + P          + +G D +    +  +
Sbjct: 374 FMCIFVDGAS-SVFKFSALEKAMVDISRWQDFKPNDNDPFDRREVWLGYDPSRTRDNACL 432

Query: 334 ------VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387
                 +V      +     W   + +    ++S + E+Y    + ID    GA   D L
Sbjct: 433 VVVAPPIVAIEKFRVLEKHYWRGLNFQYQAQQVSKVFERYNVSYLGIDTTGIGAGVYDLL 492

Query: 388 -EMLGYHVYRVLGQKRAVD 405
            +        +     + +
Sbjct: 493 SKKHPRETVAIQYSNESKN 511


>gi|261347084|ref|ZP_05974728.1| terminase, ATPase subunit [Providencia rustigianii DSM 4541]
 gi|282564814|gb|EFB70349.1| terminase, ATPase subunit [Providencia rustigianii DSM 4541]
          Length = 119

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 36/89 (40%), Gaps = 9/89 (10%)

Query: 310 PCPDPYAPLIMGCDIAE--EGGDNT---VVV--LRRGPVIEHLFD--WSKTDLRTTNNKI 360
             P  Y P+ +G D A+  + GD+    V+   +R+G     L    W   D R  ++ I
Sbjct: 21  IRPYAYHPVWIGYDPAKGTQNGDSAGCVVIAPPMRKGDKFRILEHHQWRGMDFRAQSDAI 80

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEM 389
             L E+Y    I ID+   G      +  
Sbjct: 81  KELTERYNVQYIGIDSTGIGHGVLQNVRE 109


>gi|149909656|ref|ZP_01898309.1| putative bacteriophage terminase, ATPase subunit [Moritella sp.
           PE36]
 gi|149807360|gb|EDM67313.1| putative bacteriophage terminase, ATPase subunit [Moritella sp.
           PE36]
          Length = 601

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 58/350 (16%), Positives = 98/350 (28%), Gaps = 77/350 (22%)

Query: 88  IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147
           IG T   A+   +         +   A S  Q      AE+ K   +   +  F ++   
Sbjct: 181 IGATFYFAFEAFYDAVVNGRNKIFISA-SRDQ------AEIFKANIIALCREQFGIE--- 230

Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD-- 205
           L  +P        +  +  K  ST  RT      D            +  DE    P   
Sbjct: 231 LSGSPLTMRNKGKTTTLYFK--STNARTAQSASGD------------LYIDEVFWIPKFK 276

Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLS--------GKFY---EIFNKP---------- 244
            +        T ++        S P   S        G++Y   +  N P          
Sbjct: 277 ELRSLAQAMATHKDFRIT--YFSTPSVTSHEAYDLWNGRWYRKTKACNDPEFAIDVSRKT 334

Query: 245 -------LDDWKRFQIDTRTV--EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDS 295
                   D   R +++   V  +G D      +   Y  +          +F      +
Sbjct: 335 LKHGLLCDDGIWRQKLNVYDVVEQGFDRIDISMLENEYSKEE--FDNLFMCKFIDDAHSA 392

Query: 296 FIPLNIIEEALN----------REPCPDPYAPLIMGCDIAEEGGDNTVVVL------RRG 339
           F  L  +   +               P    P+++G D A      +VVVL         
Sbjct: 393 F-SLKQLMACVGNSKKWTDFDPTWSRPYAMKPVVIGFDPARTRDIASVVVLSLPLGPDDK 451

Query: 340 PVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
             +    + S  D  T  ++I  L  KY    I +D    G    + ++ 
Sbjct: 452 FRLLESLNLSGNDFETMASEIKELTLKYHVVHIGVDTTGMGLGVFELIQK 501


>gi|317049635|ref|YP_004117283.1| hypothetical protein Pat9b_3434 [Pantoea sp. At-9b]
 gi|316951252|gb|ADU70727.1| protein of unknown function DUF264 [Pantoea sp. At-9b]
          Length = 588

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 26/144 (18%), Positives = 50/144 (34%), Gaps = 23/144 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y       +  +  +F   D+ S  PL ++++ +                P  
Sbjct: 348 LDQLRMEY--SPPEYQNLLMCEFVD-DLASVFPLQLLQKCMVDSWEVWTDFEALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+    V+      G     L    W   D R     I  L +
Sbjct: 405 WREVWIGYDPAKGTQNGDSAGCVVIAPPAVPGGKFRILERHQWRGMDFRAQAESIRKLTQ 464

Query: 366 KYRPDAIIIDANNTGARTCDYLEM 389
           +Y    I ID+   G    + ++M
Sbjct: 465 QYNVTYIGIDSTGVGLGVYENVKM 488


>gi|293433090|ref|ZP_06661518.1| terminase [Escherichia coli B088]
 gi|291323909|gb|EFE63331.1| terminase [Escherichia coli B088]
          Length = 591

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 35/211 (16%), Positives = 67/211 (31%), Gaps = 28/211 (13%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  E +  RY  ++    +     F     DS    + +
Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NTATFNMLYMCVFVDSK-DSVFSFSDL 384

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVV------VLRRGPVIEHLF 346
           E           + P          +  G D A  G  +  V             +  + 
Sbjct: 385 EACGVEVDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIIAPPMYAAEKFRVLKVI 444

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406
           +W   + R    +I  L +KY    + +D    G    D ++     V        AV +
Sbjct: 445 NWKGMNFRYQARQIELLFKKYNFTYLGVDVTGIGQGVFDNIQHFAMRV--------AVAI 496

Query: 407 EFCRNRRTELHVKMADWLEFASLINHSGLIQ 437
            +  N + +L +K AD +E   +     L +
Sbjct: 497 RYDMNTKNQLVLKAADVVESQRIEWDKNLKE 527


>gi|326782137|ref|YP_004322538.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage
           P-HM1]
 gi|310004344|gb|ADO98737.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage
           P-HM1]
          Length = 560

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 68/414 (16%), Positives = 130/414 (31%), Gaps = 65/414 (15%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            +Q E +E    +  N    P               GK+T     +L  +     ++V  
Sbjct: 60  DFQEELIESFHENRFNIAKLPRQ------------TGKSTTCVSYLLHYILFNDNVNVGI 107

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
           LAN  +  +  L     +          +  Q + ++      +     L   SK     
Sbjct: 108 LANKLSTARDLL----GRLQLAYEQLPLWLQQGIVVY------NKGSMELENGSK-ILAA 156

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI----NLGILGFLTERNANRFWIMTS 228
             + S  R  +F          I  DE +  P+ I       +   +T    +   I+ S
Sbjct: 157 STSASAVRGMSFN--------IIFLDEFAFIPNHIAEQFFSSVYPTITS-GTSTKVIIIS 207

Query: 229 NPRRLSGKFYEIF---NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC 285
            P  ++  FY+++    K  + +   ++    V G D  + E  IA           E  
Sbjct: 208 TPNGMN-HFYKLWVDAQKGRNGYAWSEVHWSKVPGRDAKWKEQTIANTSERQ--FTQEFD 264

Query: 286 GQFPQQDIDSFIPLNIIEEALNREPC-----------PDPYAPLIMGCDIAEE--GGDNT 332
            +F    +D+ I    +      +P            P      I+  D++       + 
Sbjct: 265 CEFL-GSVDTLITAAKLRTLTYDDPLTTNGSLDVYENPVRDHDYIICVDVSRGLAQDYSA 323

Query: 333 VVVLRRGPVIEHLF-DWSKTDL--RTTNNKISGLVEKYRPDAIIIDANNTG----ARTCD 385
            VV+        L   +   D+      N I  +   Y    ++ + N+ G         
Sbjct: 324 FVVIDITHAPWRLVAKYRDHDVRPMVYPNIIFNVATNYNKAYVLTEVNDIGEAVSGSLFY 383

Query: 386 YLEMLGYHVYRVLG-QKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQN 438
            LE     +  + G   + V   F  N+  ++ VKM+  ++     N   LI++
Sbjct: 384 DLEYENVLMCAMRGRAGQIVGQGFSGNK-VQMGVKMSKTVKAQGCSNLKTLIED 436


>gi|218704209|ref|YP_002411728.1| putative Terminase, ATPase subunit from bacteriophage origin
           [Escherichia coli UMN026]
 gi|218431306|emb|CAR12184.1| Putative Terminase, ATPase subunit from bacteriophage origin
           [Escherichia coli UMN026]
          Length = 591

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 35/211 (16%), Positives = 67/211 (31%), Gaps = 28/211 (13%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  E +  RY  ++    +     F     DS    + +
Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NTATFNMLYMCVFVDSK-DSVFSFSDL 384

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVV------VLRRGPVIEHLF 346
           E           + P          +  G D A  G  +  V             +  + 
Sbjct: 385 EACGVEVDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIIAPPMYAAEKFRVLKVI 444

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406
           +W   + R    +I  L +KY    + +D    G    D ++     V        AV +
Sbjct: 445 NWKGMNFRYQARQIELLFKKYNFTYLGVDVTGIGQGVFDNIQHFAMRV--------AVAI 496

Query: 407 EFCRNRRTELHVKMADWLEFASLINHSGLIQ 437
            +  N + +L +K AD +E   +     L +
Sbjct: 497 RYDMNTKNQLVLKAADVVESQRIEWDKNLKE 527


>gi|328912284|gb|AEB63880.1| SPBc2 prophage-derived uncharacterized protein yonF [Bacillus
           amyloliquefaciens LL3]
          Length = 589

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 62/431 (14%), Positives = 129/431 (29%), Gaps = 82/431 (19%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           A RG GKT L +          PG  ++  + ++ Q +  +                   
Sbjct: 84  ASRGQGKTWLTSVYCCVQAILFPGTKIVIASGTKGQAREVI-----------EKIDDLRK 132

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
           +S +L               ++  + S +    S +      G  +     +I DE    
Sbjct: 133 ESPNLRREIEDLKTSTNDAKVEFHNGSWIKIVASND------GARSKRANLLIVDEFRMV 186

Query: 204 P-DVINLGILGFLTERNANRFW-------IMTSNPR-RLSGKFYE---IFNKPLDDWKR- 250
             ++I+  +  FLT   + ++        +   N    LS  +Y+    FN+ +  +   
Sbjct: 187 DFEIISKVLRKFLTAPRSPKYLEKEEYAHLKERNKEIYLSSCWYKVHWSFNRFITYYNAM 246

Query: 251 ------------FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298
                       +QI  +    +D       +A    D     +E+   +  +   ++  
Sbjct: 247 MKGSKYFVCGLPYQIAIKEG-LLDKDQVRDEMAEEDFDPIGWSMEMEALWFGESEKAYFK 305

Query: 299 LNIIEEALN-------------------REPCPDPYAPLIMGCDIAEEGG---DNTVVVL 336
              IE+                      +     P    ++  DIA   G   D +V  +
Sbjct: 306 FEDIEKNRKLASPLFPPDYYSLIKDSNFKYESKKPGEIRLVSNDIAGMAGKDNDASVYTV 365

Query: 337 RR--------GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
            R           I ++         T   +I  + E Y  D I++D  + G    D L 
Sbjct: 366 FRLIPNSNGYDRHIVYMESIVGGHTGTQATRIRQIYEDYDCDYIVLDTQSIGLGVYDAL- 424

Query: 389 MLGYHVYRVLGQKRAVDLEFC---RNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSF 445
                   +  ++RA + E      + R        +  +    I  +  + +  ++   
Sbjct: 425 -----CQPLYDKERAKEYEPFSCINDERMAERCTYQNAEKVIYSIKGNAQLNSEIAVLLK 479

Query: 446 IVPNTGELAIE 456
                G++ I 
Sbjct: 480 DGFKRGKIKIP 490


>gi|326775607|ref|ZP_08234872.1| phage terminase, large subunit, PBSX family [Streptomyces cf.
           griseus XylebKG-1]
 gi|326655940|gb|EGE40786.1| phage terminase, large subunit, PBSX family [Streptomyces cf.
           griseus XylebKG-1]
          Length = 416

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 64/412 (15%), Positives = 127/412 (30%), Gaps = 49/412 (11%)

Query: 79  KGAISAGR-GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137
           +  I++G    GKT   + L+ W+M        +  A S  +L     A ++K  +   +
Sbjct: 23  RINIASGSIRAGKT--ISTLLRWIMY-------VATAPSGGEL-----AVIAKTTNTAAS 68

Query: 138 KHWFEMQSLSL-HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER-PDTFVGHHNTYGMAI 195
             +  +Q  +L  P   +      +               ++ R  +   G      +  
Sbjct: 69  NVFIPLQDPNLFGPLAQHVHYTRGAPTATILGRQVRVIGANDSRAEERLRGMTCAGAL-- 126

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD----DWKRF 251
             DEA+  P      +LG ++   A      ++NP   +      F    D     +  +
Sbjct: 127 -VDEATLVPQEFWTQLLGRMSVPGAK--LFASTNPGSPAHWLKRDFIDRRDELGIRYWHY 183

Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
            +D      +   +   I   +       R  V G++      S   +   EE    +  
Sbjct: 184 VLD--DNPSLGDDYKNSIKNEF--VGLWYRRFVLGEWIA-AEGSIFDM-WDEEKHVVDTL 237

Query: 312 PDPYAPLIMGCDIAEEGG-DNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370
           P+    + +G D  +      T++ L R   +    +W + D R    +++ +    R  
Sbjct: 238 PEIAKWISVGVDYGQTNPFHATLLGLGRDRRLYAASEW-RYDGRQQRRQLTDIEYSERMR 296

Query: 371 AIIIDANNTGARTCDYLEMLGYHVYRVLGQKRA------VDLEFCRNRRTELHVKMADWL 424
             + +    G      +      V        A      +      N   +    MA  L
Sbjct: 297 GWLSNVAGIG-----PVRPQFVTVDPSAASFSAQLRRDRLTPTPANNAVLDGIRTMASLL 351

Query: 425 EFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS-DGLMYT 475
               L+ HS     +  +  +   +    A E    +  K  D+  D L Y 
Sbjct: 352 SAGKLVVHSSCKALIGEMPGYAWDDK---AAEKGEDRPIKVADHGVDALRYA 400


>gi|171779706|ref|ZP_02920662.1| hypothetical protein STRINF_01543 [Streptococcus infantarius subsp.
           infantarius ATCC BAA-102]
 gi|171281808|gb|EDT47242.1| hypothetical protein STRINF_01543 [Streptococcus infantarius subsp.
           infantarius ATCC BAA-102]
          Length = 470

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 55/350 (15%), Positives = 101/350 (28%), Gaps = 53/350 (15%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQ   ++ V     + +       +          GKT +   L LW +    G++++ 
Sbjct: 42  PWQQNLLKSVMGIEEDGLWTHQKFGYSIPRRN----GKTEIIYMLELWGL--YHGLNILH 95

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
            A+  +   +  + +V ++L  +  +      S+               + +        
Sbjct: 96  TAHRISTSHS-SFEKVKRYLEKMGLEDGKSFNSIRA--------KGQERIELYETGGVVQ 146

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232
            RT          G        ++ DEA          +   +T+ + N   IM   P  
Sbjct: 147 YRT-RTSNGGLGEGFD-----LLVIDEAQEYTTEQESALKYTVTDSD-NPMTIMCGTPPT 199

Query: 233 L------SGKFYEIFNKPLDD---WKRFQIDTRT---------VEGIDPSFH---EGIIA 271
                    K+ E           W  + +                    FH     I A
Sbjct: 200 PVSSGTVFTKYRETCLFGRGKYLGWAEWSVSEEKEIDDIDAWYNSNPSMGFHLNERKIEA 259

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330
             G D     V+  G +P  +  S I        AL  +  P     L +G    + G D
Sbjct: 260 ELGEDKLDHNVQRLGYWPTYNQKSAISETEW--NALKIDDLPQLQGQLFVGI---KYGQD 314

Query: 331 NT----VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
            T     V +R       +       +R  N+ I   +++     I+ID 
Sbjct: 315 GTNVALSVAVRTKDKHIFVETVDCQSVRNGNHWIINFLKQADIAQIVIDG 364


>gi|262042498|ref|ZP_06015656.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
 gi|259040136|gb|EEW41249.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
          Length = 587

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL  ++  +                P  
Sbjct: 347 LDQLRLEY--SPDEYQNLLMCEFID-DLASVFPLADLQACMVDSWEVWEDFQALALRPFG 403

Query: 315 YAPLIMGCDIAE--EGGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+    V+      G     L    W   D R     I  L +
Sbjct: 404 WREVWIGYDPAKGTQNGDSAGCVVIAPPTVPGGKFRILERHQWRGMDFRAQAEAIRKLTQ 463

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 464 QYNVTYIGIDSTGVGHGVYENVK 486


>gi|9630181|ref|NP_046608.1| hypothetical protein SPBc2p056 [Bacillus phage SPBc2]
 gi|16079170|ref|NP_389994.1| prophage terminase, ATP subunit; phage SPbeta [Bacillus subtilis
           subsp. subtilis str. 168]
 gi|221310018|ref|ZP_03591865.1| hypothetical protein Bsubs1_11626 [Bacillus subtilis subsp.
           subtilis str. 168]
 gi|221314340|ref|ZP_03596145.1| hypothetical protein BsubsN3_11547 [Bacillus subtilis subsp.
           subtilis str. NCIB 3610]
 gi|221319262|ref|ZP_03600556.1| hypothetical protein BsubsJ_11473 [Bacillus subtilis subsp.
           subtilis str. JH642]
 gi|221323538|ref|ZP_03604832.1| hypothetical protein BsubsS_11602 [Bacillus subtilis subsp.
           subtilis str. SMY]
 gi|81342066|sp|O31952|YONF_BACSU RecName: Full=SPBc2 prophage-derived uncharacterized protein yonF
 gi|2634531|emb|CAB14029.1| putative prophage terminase, ATP subunit; phage SPbeta [Bacillus
           subtilis subsp. subtilis str. 168]
 gi|3025534|gb|AAC13029.1| unknown [Bacillus phage SPbeta]
          Length = 589

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 62/431 (14%), Positives = 129/431 (29%), Gaps = 82/431 (19%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           A RG GKT L +          PG  ++  + ++ Q +  +                   
Sbjct: 84  ASRGQGKTWLTSVYCCVQAILFPGTKIVIASGTKGQAREVI-----------EKIDDLRK 132

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
           +S +L               ++  + S +    S +      G  +     +I DE    
Sbjct: 133 ESPNLRREIEDLKTSTNDAKVEFHNGSWIKIVASND------GARSKRANLLIVDEFRMV 186

Query: 204 P-DVINLGILGFLTERNANRFW-------IMTSNPR-RLSGKFYE---IFNKPLDDWKR- 250
             ++I+  +  FLT   + ++        +   N    LS  +Y+    FN+ +  +   
Sbjct: 187 DFEIISKVLRKFLTAPRSPKYLEKEEYAHLKERNKEIYLSSCWYKVHWSFNRFITYYNAM 246

Query: 251 ------------FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298
                       +QI  +    +D       +A    D     +E+   +  +   ++  
Sbjct: 247 MKGSKYFVCGLPYQIAIKEG-LLDKDQVRDEMAEEDFDPIGWSMEMEALWFGESEKAYFK 305

Query: 299 LNIIEEALN-------------------REPCPDPYAPLIMGCDIAEEGG---DNTVVVL 336
              IE+                      +     P    ++  DIA   G   D +V  +
Sbjct: 306 FEDIEKNRKLASPLFPPDYYSLIKDSNFKYEGKKPGEIRLVSNDIAGMAGKDNDASVYTV 365

Query: 337 RR--------GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
            R           I ++         T   +I  + E Y  D I++D  + G    D L 
Sbjct: 366 FRLIPNSNGYDRHIVYMESIVGGHTGTQATRIRQIYEDYDCDYIVLDTQSIGLGVYDAL- 424

Query: 389 MLGYHVYRVLGQKRAVDLEFC---RNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSF 445
                   +  ++RA + E      + R        +  +    I  +  + +  ++   
Sbjct: 425 -----CQPLYDKERAKEYEPFSCINDERMAARCTYQNAEKVIYSIKGNAQLNSEIAVLLK 479

Query: 446 IVPNTGELAIE 456
                G++ I 
Sbjct: 480 DGFKRGKIKIP 490


>gi|190893406|ref|YP_001979948.1| hypothetical protein RHECIAT_CH0003832 [Rhizobium etli CIAT 652]
 gi|190698685|gb|ACE92770.1| hypothetical protein RHECIAT_CH0003832 [Rhizobium etli CIAT 652]
          Length = 443

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 60/384 (15%), Positives = 109/384 (28%), Gaps = 53/384 (13%)

Query: 84  AGRGIGKTTLNAWLVLW---------LMSTRPGISVICLANSETQLKTTLWAEVSKWLSL 134
           AGR  GKT     L  +          ++T    ++  +A S  Q          K    
Sbjct: 34  AGRRSGKTRAAGTLAGYVATLVDHSAYLATSERATIPVMAASTVQ--------AQKAFQA 85

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE-ERPDTFVGHHNTYGM 193
                   + S  +  +   S+ +      D +      RT      P           +
Sbjct: 86  CMVLEESSLLSKQIESS--NSETIKLKTRCDIEVRPANHRTIRGITSPLAIA-----DEV 138

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI---FNKPLDDWKR 250
           A    +   T   I   +   L   N     +  S+P    G+ Y+     + P  D   
Sbjct: 139 AFYFTDGQNTDSQILDAVRPSLQSGNHAGPLVCISSPYAKRGELYDAFKNHHGPNGDAHV 198

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-- 308
                 ++E         I   Y  +  V   E  G F + D+ +   L  +E A +   
Sbjct: 199 IVAKGASLEFNSTIDPATIARAYKRNPTVADAEYGGNF-RSDVTNLFTLEAVEAATDLGV 257

Query: 309 -EPCPDPYAPLIMGCDIAEE-GGDNTVVVLRRGPVIEHLFDWSKT-----DLRTTNNKIS 361
            E  P          D A   G D   + +        + D  +      +  T     +
Sbjct: 258 TERAPREGVQYFAHADPAGGSGADGFTLAIGHRENNVAVIDLIRERKSPYNPETVVADYA 317

Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421
            L+  YR   ++ DA  +                R   +K  ++ +      +E    + 
Sbjct: 318 DLLRLYRCATVVTDAYAS-------------EWNRAAWRKAGIEPKSAPMTASEFFAALV 364

Query: 422 DWLEFA--SLINHSGLIQNLKSLK 443
             +     +L++   L   L  L+
Sbjct: 365 PAVNSGQVALLDDETLKHQLVGLE 388


>gi|228904911|ref|ZP_04068965.1| hypothetical protein bthur0014_60350 [Bacillus thuringiensis IBL
           4222]
 gi|228854925|gb|EEM99529.1| hypothetical protein bthur0014_60350 [Bacillus thuringiensis IBL
           4222]
          Length = 954

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 62/426 (14%), Positives = 117/426 (27%), Gaps = 83/426 (19%)

Query: 92  TLNAWLVLWLMSTRPGISV------ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145
           T+ A ++    +   G  +      +     + Q +  ++ ++    + + N    +   
Sbjct: 442 TMCAHMLWVAFTCNGGTRMAKGAACVVATPYDNQAR-LIFDQLK---TFIDNNPVLQESI 497

Query: 146 LSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD 205
            S+   P+   V+    G   + ++   R+ SE    +  G    +   +  DE     D
Sbjct: 498 KSITKNPY---VIEFKNGSVIRLFTAGTRSGSE--GGSLRGQRADW---LYMDEVDYMGD 549

Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL-------------------- 245
                I   + E       ++ S P    G FY+   +                      
Sbjct: 550 KDFESIFAIVNEAPDRIGCMIASTPTGRRGMFYKTCTQMKLNQDVKMNKNNVYDMRSYNR 609

Query: 246 ---DDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
              + W  F   T       P     +   +         EV  +F  + +  F   + +
Sbjct: 610 TLSEGWAEFYFPTMVNPEWGPKMERELRKLFSE--AAYEHEVLAEFGTEMVGVF-NKDYV 666

Query: 303 EEA----LNREPCPDPYAPLIMGCDIAEEGGDNTVVVL---------------------- 336
           +EA     N    P    P+ +G D  + G    +VV                       
Sbjct: 667 DEASSIGYNYTTSPTHDGPIAIGIDWDKAGAATQIVVTQYNPFEVRRPRPELGETEPSFG 726

Query: 337 RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM-LGYHVY 395
           R   +        +        KI  L   Y P  I  DA   G    + L   LG  V 
Sbjct: 727 RFQIINRIEIPKGEFTYDIAVKKIIELDGVYNPFGIYADA-GAGEYQIELLRKTLGDKVK 785

Query: 396 RVLGQKRAV--DLE----FCRNRRTELHVKMADWLEFASL-----INHSGLIQNLKSLKS 444
           RV      +  D        +  +  +  +    LE   L          L + + + + 
Sbjct: 786 RVHLGSSQMVRDPHSREFEKKPLKAFIVDQTKLMLERGQLRIPHREKDETLARQMTNYQV 845

Query: 445 FIVPNT 450
                 
Sbjct: 846 TRYSPK 851


>gi|109897022|ref|YP_660277.1| hypothetical protein Patl_0695 [Pseudoalteromonas atlantica T6c]
 gi|109699303|gb|ABG39223.1| protein of unknown function DUF264 [Pseudoalteromonas atlantica
           T6c]
          Length = 577

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 38/250 (15%), Positives = 72/250 (28%), Gaps = 50/250 (20%)

Query: 195 IINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247
           +  DE     D   I     G  + ++  R     S P   S + Y     + +N+   D
Sbjct: 241 LYVDEVFWMNDFNNIWHVAKGMASHKHWTRTL--ISTPSSKSHEAYTMWSGDRYNQGRKD 298

Query: 248 -----------------------WKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283
                                  W+ F  ++    +G      + +        D     
Sbjct: 299 EDKQELKVNHATLYGGHLAKDRIWRDFVTVEDAANDGCTLFDIDELKDE--NTPDEFDNL 356

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNRE---------PCPDPYAPLIMGCDIAEEGGDNTV- 333
              +F       F   ++++ A +R+         P P     + +G D A    D T+ 
Sbjct: 357 YMCEFVDDSHSVFKLADLLKCATDRQHWKDYRERAPRPFLNRAVWLGYDPARSRDDATLA 416

Query: 334 -----VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
                V+      +     +   DLR    +I  + +++    I ID    G    D ++
Sbjct: 417 IVAPPVLPGEKFRVLERIYFKGKDLREQAVEIQKICKRFNVVHIGIDVTGIGWGVFDLVK 476

Query: 389 MLGYHVYRVL 398
                V    
Sbjct: 477 AFYPRVQGFH 486


>gi|322833247|ref|YP_004213274.1| hypothetical protein Rahaq_2540 [Rahnella sp. Y9602]
 gi|321168448|gb|ADW74147.1| hypothetical protein Rahaq_2540 [Rahnella sp. Y9602]
          Length = 595

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 27/162 (16%), Positives = 48/162 (29%), Gaps = 20/162 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  + +  RY  D+    +     F     DS    + +
Sbjct: 332 PDGQWRYVITMEDAVKGGFNRASIDKLRNRYNRDT--FNMLYMCVFVDSK-DSVFKFSDL 388

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV------VVLRRGPVIEHLF 346
           E           + P          +  G D A  G  +T       +       +  LF
Sbjct: 389 EICGVDVADWQDHDPNAERPFGNREVWGGFDPARSGDTSTFAIVAPPLYAVEKFRVLCLF 448

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
            W   +      +I  L  KY    I +D    G    + +E
Sbjct: 449 HWKGMNFAYQAAQIKKLFGKYNMTYIGVDVTGIGRGVFELIE 490


>gi|320087122|emb|CBY96890.1| Terminase, ATPase subunit GpP [Salmonella enterica subsp. enterica
           serovar Weltevreden str. 2007-60-3289-1]
          Length = 589

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 36/201 (17%), Positives = 62/201 (30%), Gaps = 31/201 (15%)

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL----NREPCPDPYA-------PLIM 320
           R     +  +     +F      S  P   ++  +           P+A       P+ +
Sbjct: 355 RRENSDEDFKNLFMCEFVDDKA-SVFPFEELQRCMVDVMETWEDFAPFADHPFGSRPVWI 413

Query: 321 GCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIII 374
           G D +  G     VVL      G     L    W   D       I  L EKY  + I I
Sbjct: 414 GYDPSHTGDSAGCVVLAPPVVSGGKFRMLERHQWKGMDFAAQAEAIRRLTEKYNVEYIGI 473

Query: 375 DANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434
           DA   G      +               A  + +    +T + +K  D +    L   +G
Sbjct: 474 DATGLGLGVFQLVR---------SFYPAARGIRYTPEMKTAMVLKAKDTIRRGCLEYDAG 524

Query: 435 ---LIQNLKSLKSFIVPNTGE 452
              + Q+  S++   + ++G 
Sbjct: 525 ATDVTQSFMSIRK-TMTSSGR 544


>gi|238027334|ref|YP_002911565.1| phage terminase ATPase subunit [Burkholderia glumae BGR1]
 gi|237876528|gb|ACR28861.1| Phage terminase ATPase subunit [Burkholderia glumae BGR1]
          Length = 605

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 44/297 (14%), Positives = 75/297 (25%), Gaps = 57/297 (19%)

Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLG 210
              D +  S G +     T  RT      D            +  DE     +   +N  
Sbjct: 239 LTGDPMRLSNGAELIFLGTSSRTAQSYNGD------------LYFDEYFWVSNFATLNKV 286

Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFYEI---FNKPLDDWKRFQID------TRTVEGI 261
            +G  T  +       T +        +     FN+   D +R +ID          +  
Sbjct: 287 AMGMATHSHLRMTHFSTPSTTTHEAYPFWTGAHFNRDRADDERVEIDISHTSLAPGRQCG 346

Query: 262 DPSFHEGIIARYGLDSDV----------------TRVEVCGQFPQQDIDSFIPLNIIEEA 305
           D  + + + A   + S                         QF      S     +++  
Sbjct: 347 DGQWRQIVTAEDAVASGFTKLDLEDLRSTNSPADFENLYMCQFVDDTS-SVFAFRLVQAC 405

Query: 306 LN-----------REPCPDPYAPLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDW 348
           +                P  +  + +G D A  G     VV+           +     W
Sbjct: 406 MVDSWDVWTDVKPLLDRPFGWKQVWIGYDPALTGDSAGCVVIAPPEQPNGKFRVLERHRW 465

Query: 349 SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405
              D  T   KI  L ++Y    I ID    G      +      V  +       +
Sbjct: 466 KGIDFETQAEKIRELTQRYHVTYIAIDTTGIGHGVHQLVRQFFPRVVPINYSPEVKN 522


>gi|222149246|ref|YP_002550203.1| hypothetical protein Avi_3048 [Agrobacterium vitis S4]
 gi|221736230|gb|ACM37193.1| conserved hypothetical protein [Agrobacterium vitis S4]
          Length = 452

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 45/241 (18%), Positives = 71/241 (29%), Gaps = 22/241 (9%)

Query: 175 TYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRR 232
           T     PDT  G        +  DE +   D   I   +   +++    R    TS P  
Sbjct: 114 TALPANPDTARGFSAN----VFLDEFAFHKDSQQIWRALFPVISKGWNIRV---TSTPNG 166

Query: 233 LSGKFYEIFNKPLDD-WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQ 291
            S KFYE+   P+DD W R  +D               +     D D    E   ++  +
Sbjct: 167 KSNKFYELATGPIDDPWSRHVVDIYQAVRDGLPRDIEELRAGLADEDSWAQEFELKWLDE 226

Query: 292 DID----SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIE 343
                    I     E A   +P         +G DI     D  V+     +       
Sbjct: 227 ASAWLSYDLISSCEDERA--GDPEGYQGNVCFVGRDIGRR-EDLHVIWVWEQIGDVLWER 283

Query: 344 HLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE-MLGYHVYRVLGQKR 402
              +  +      +     ++ +YR     ID    G +  +  +   G  V  VL    
Sbjct: 284 ERIEQKRATFAEMDEAFDDVMTRYRVGRACIDQTGMGEKVTEDAQIRYGSRVEGVLFTGP 343

Query: 403 A 403
            
Sbjct: 344 N 344


>gi|88858953|ref|ZP_01133594.1| Mu-like prophage FluMu protein gp28 [Pseudoalteromonas tunicata D2]
 gi|88819179|gb|EAR28993.1| Mu-like prophage FluMu protein gp28 [Pseudoalteromonas tunicata D2]
          Length = 593

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 28/142 (19%), Positives = 43/142 (30%), Gaps = 19/142 (13%)

Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF----I------PLNIIEEALN 307
             G D      +   Y   +D        +F      +F    I           +  L 
Sbjct: 350 NSGFDRIDISVLENEY--STDEFNNLFMCKFIDDAHSAFNLKQIMDCVGDSTKWTDFNLE 407

Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHL--FDWSKTDLRTTNNKIS 361
               P    P+++G D A  G   +V V    ++ G     L   D S  D     ++I 
Sbjct: 408 -WERPFALRPVVIGFDPARFGDKASVAVLSAPMKPGEKFRLLEAIDLSGNDFEAMASEIK 466

Query: 362 GLVEKYRPDAIIIDANNTGART 383
            L +KY    I +D    G   
Sbjct: 467 LLTDKYNVQHIGVDTTGIGYGV 488


>gi|290474053|ref|YP_003466928.1| hypothetical protein XBJ1_0997 [Xenorhabdus bovienii SS-2004]
 gi|289173361|emb|CBJ80138.1| putative phage gene [Xenorhabdus bovienii SS-2004]
          Length = 594

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 34/239 (14%), Positives = 64/239 (26%), Gaps = 52/239 (21%)

Query: 198 DEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP----------- 244
           DE    PD    N       T  +        S P   +   Y ++              
Sbjct: 258 DEYFWIPDFKRFNEVASAMATHDHWRTT--YFSTPSAKTHPAYSLWTGDEWRGNDPKRKN 315

Query: 245 --LDDWKRFQIDTRTVE----------------GIDPSFHEGIIARYGLDSDVTRVEVCG 286
                +   +   R                   G + +  + +  +Y  DS    +    
Sbjct: 316 VAFPAFDELRDGGRDCPDGQWRYVITLEDAIKGGFNLASIDRLRNKYNPDS--FNMLFMC 373

Query: 287 QFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL 336
            F      S    + +++        + +AP          +  G D A  G  +T V++
Sbjct: 374 VFVDSGA-SVFTYSQVDKCGVDINLWEDHAPNASRPFGEREVWGGFDPARSGDTSTFVIV 432

Query: 337 ------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
                      +   F W   + +     I  L ++YR   I ID    G    + ++ 
Sbjct: 433 APPMMANEAFRVLATFYWQGMNWKHQAKLIEELFKRYRFTHIGIDTTGIGHGVYEMVQD 491


>gi|320198795|gb|EFW73395.1| Phage terminase, ATPase subunit [Escherichia coli EC4100B]
          Length = 603

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 20/135 (14%), Positives = 42/135 (31%), Gaps = 17/135 (12%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP---------CPDPYA 316
            E +  +Y        +    QF       F    ++   ++R            P    
Sbjct: 356 IERLRNKYSPT--AFAMLYMCQFVDSKDAVFKFSALVGCEVDRATWGDFDLTATRPFGNR 413

Query: 317 PLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370
            +  G D +  G ++T V++           +  ++ W   +     ++I  L+ ++   
Sbjct: 414 EVWAGFDPSRSGDNSTFVLIAPPIEDGERFRVLAVWQWQGFNFSWQADQIKQLMRRFNIT 473

Query: 371 AIIIDANNTGARTCD 385
            I ID    G    D
Sbjct: 474 YIGIDTTGIGKGVYD 488


>gi|193062794|ref|ZP_03043887.1| putative conserved hypothetical protein [Escherichia coli E22]
 gi|192931437|gb|EDV84038.1| putative conserved hypothetical protein [Escherichia coli E22]
          Length = 603

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 20/135 (14%), Positives = 42/135 (31%), Gaps = 17/135 (12%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE---------PCPDPYA 316
            E +  +Y        +    QF       F    ++   ++R            P    
Sbjct: 356 IERLRNKYSPT--AFAMLYMCQFVDSKDAVFKFSALVGCEVDRATWGDFDLTAARPFGNR 413

Query: 317 PLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370
            +  G D +  G ++T V++           +  ++ W   +     ++I  L+ ++   
Sbjct: 414 EVWAGFDPSRSGDNSTFVLIAPPIEDGERFRVLAVWQWQGFNFSWQADQIKQLMRRFNIT 473

Query: 371 AIIIDANNTGARTCD 385
            I ID    G    D
Sbjct: 474 YIGIDTTGIGKGVYD 488


>gi|89071120|ref|ZP_01158320.1| Putative large terminase [Oceanicola granulosus HTCC2516]
 gi|89043331|gb|EAR49553.1| Putative large terminase [Oceanicola granulosus HTCC2516]
          Length = 444

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 61/418 (14%), Positives = 114/418 (27%), Gaps = 67/418 (16%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGIS---------VICLANSETQLKTTLWAEVSKWL 132
           I  GRG GKT   A    W+ +   G           V  +  +  Q +  +        
Sbjct: 58  ILGGRGAGKTRAGA---EWVRAQVEGPRATDPGRARRVALVGETIDQAREVMV------- 107

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
                   F    L     P           +         + +S   P+   G      
Sbjct: 108 --------FGDSGLLACAPPDRRPEWIAGRRLLVWPNGAQAQLFSAHDPEALRGPQFD-- 157

Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250
            A   DE +     +     +   L   +  R     +   R +     +  +     + 
Sbjct: 158 -AAWVDELAKWKKAEEAWDMLQLALRLGDDPR--CCVTTTPRPTALMRALLERD-GTART 213

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
                     +  +F   +  RY   S + R E+ G    +   +      I  A N + 
Sbjct: 214 HAPTEANAANLARAFLAEVRRRY-AGSPLGRQELDGVMLSEIEGALWSAGAI-AAANCDV 271

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTT-NNKIS 361
            PD +  +++  D +  GGD   +V+                L D S     TT      
Sbjct: 272 VPDLH-RVVVAVDPSAGGGDVCGIVVAGACYDGGADNWRAWVLEDASVAGSSTTWARAAI 330

Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421
              E+++ D I+ + N  G      L  +        G +              L+ +  
Sbjct: 331 AAYERHQADRIVAEVNQGGDMVAAMLRQV-APTVPYKGVRAMRGKAARAEPVAALYEQGR 389

Query: 422 -DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478
              +     +        L + + +               +G  S D +D L++  +E
Sbjct: 390 VRHVRGLGALEDQ---MALMTHQGY---------------RGRGSPDRADALVWALSE 429


>gi|167553298|ref|ZP_02347048.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar Saintpaul str. SARA29]
 gi|205322236|gb|EDZ10075.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar Saintpaul str. SARA29]
          Length = 589

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 50/310 (16%), Positives = 90/310 (29%), Gaps = 67/310 (21%)

Query: 195 IINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPL-- 245
           +  DE    P+   +     G  ++++        S P  L+   Y     E+FNK    
Sbjct: 250 LYVDEIFWIPNFQKLRKVASGMASQKHLRST--YFSTPSTLAHGAYPFWSGELFNKGRAS 307

Query: 246 ----------------------DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRV 282
                                   W++   I+     G      + +        +  + 
Sbjct: 308 AADRIEIDISHSALAGGLLCADGQWRQIVTIEDALASGCTLFDLDQLRRE--NSDEDFKN 365

Query: 283 EVCGQFPQQDIDSFIPLNIIEEAL----NREPCPDPYA-------PLIMGCDIAEEGGDN 331
               +F      S  P   ++  +           P+A       P+ +G D +  G   
Sbjct: 366 LFMCEFVDDKA-SVFPFEELQRCMVDVMETWEDFTPFADHPFGSRPVWIGYDPSHTGDSA 424

Query: 332 TVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385
             VVL      G     L    W   D       I  L EKY  + I IDA   G     
Sbjct: 425 GCVVLAPPVVSGGKFRMLERHQWKGMDFAAQAEGIRRLTEKYNVEYIGIDATGLGLGVFQ 484

Query: 386 YLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG---LIQNLKSL 442
            +               A  + +    +T + +K  D +    L   +G   + Q+  S+
Sbjct: 485 LVR---------SFYPAARGIRYTPEMKTAMVLKAKDTIRRGCLEYDAGATDVTQSFMSI 535

Query: 443 KSFIVPNTGE 452
           +   + ++G 
Sbjct: 536 RK-TMTSSGR 544


>gi|83943173|ref|ZP_00955633.1| terminase, large subunit, putative [Sulfitobacter sp. EE-36]
 gi|83846181|gb|EAP84058.1| terminase, large subunit, putative [Sulfitobacter sp. EE-36]
          Length = 408

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 64/423 (15%), Positives = 113/423 (26%), Gaps = 73/423 (17%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGIS---------VICLANSETQLKTTLWAEVSKWL 132
           I  GRG GKT   A    W+ +   G           V  +  +  Q++  +        
Sbjct: 16  IMGGRGAGKTRAGA---EWVRAQVEGSRPLDAGRCRRVALVGETIEQVREVM-------- 64

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
             +         S +     W +                +   ++   P+   G      
Sbjct: 65  --IFGDSGILACSPADRRPDWEATRKRLVWPN-----GAVATVHTAHDPEGLRGPQFD-- 115

Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250
            A   DE +     +     +   L     +    +T+ P R  G    +   P      
Sbjct: 116 -AAWVDELAKWKKAEETWDQLQFAL-RLGEDPRVCVTTTP-RNVGVLKNLLASPSTV-TT 171

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
                     +  SF E + ARY   + + R E+ G        +      IE    R+ 
Sbjct: 172 HAPTEANAANLAGSFLEEVRARY-RGTRLGRQELDGVLLADAEGALWTSERIEAGRVRDV 230

Query: 311 CPDPYAPLIMGCDIA---EEGGDNTVVVLRRGPVIEHLFDWS----------KTDLRTTN 357
                  +++G D A     G D   +V+          DW                   
Sbjct: 231 PL--LDRIVVGLDPATTAGAGSDECGIVVVGAQTQGPPQDWRAVVMADCTVQGATPSGWA 288

Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRRTE 415
                 +E+Y  D ++ + N  G    + L  +     V  V   +  V        R E
Sbjct: 289 RAAISAMEQYGADRLVAEVNQGGQMVAEVLRQVDPLVPVKSVHASRGKV-------ARAE 341

Query: 416 LHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
               + +      +     L   +  + +     TG             S D  D L++ 
Sbjct: 342 PVAALYEQGRVGHVAGLDALEDQMCRMTARGYEATG-------------SPDRVDALVWA 388

Query: 476 FAE 478
             E
Sbjct: 389 LHE 391


>gi|224582696|ref|YP_002636494.1| hypothetical protein SPC_0883 [Salmonella enterica subsp. enterica
           serovar Paratyphi C strain RKS4594]
 gi|224467223|gb|ACN45053.1| putative phage protein [Salmonella enterica subsp. enterica serovar
           Paratyphi C strain RKS4594]
          Length = 591

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 26/168 (15%), Positives = 51/168 (30%), Gaps = 20/168 (11%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  E +  RY  ++    +     F     DS    + +
Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIEKLRNRY--NTATFNMLYMCVFVDSK-DSVFSFSDL 384

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346
           E           + P          +  G D A  G  +  V++           +  + 
Sbjct: 385 EACGVEVDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIVAPPMFAVEKFRVLKVI 444

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHV 394
            W   + R    +I  L +KY    + +D    G    D ++     V
Sbjct: 445 YWKGMNFRYQAKQIEQLFKKYNFTYLGVDVTGIGQGIFDNIQHFAMRV 492


>gi|170023192|ref|YP_001719697.1| hypothetical protein YPK_0943 [Yersinia pseudotuberculosis YPIII]
 gi|169749726|gb|ACA67244.1| protein of unknown function DUF264 [Yersinia pseudotuberculosis
           YPIII]
          Length = 697

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 21/157 (13%), Positives = 45/157 (28%), Gaps = 20/157 (12%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E +    G             F       F   + +E+ L      + +          
Sbjct: 367 IEELREENGES--AFNQLYMCLFVDTGDCVF-RFDQLEKCLVTVSNWEDHDVNAARPFGN 423

Query: 317 -PLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369
             +  G D A  G   + V++           + H+  W     +    +I   + +Y  
Sbjct: 424 REVWAGYDPARSGDTASFVLVAPPQADGEPFRVLHIETWHGFAFKYQVGRIKEYMARYNI 483

Query: 370 DAIIIDANNTGARTCDYLEML-GYHVYRVLGQKRAVD 405
             I ID+   G   C+ ++      V ++     + +
Sbjct: 484 THIGIDSTGIGGPVCELVQEFARREVTQIHYSPESKN 520


>gi|317153313|ref|YP_004121361.1| hypothetical protein Daes_1602 [Desulfovibrio aespoeensis Aspo-2]
 gi|316943564|gb|ADU62615.1| hypothetical protein Daes_1602 [Desulfovibrio aespoeensis Aspo-2]
          Length = 507

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 60/394 (15%), Positives = 110/394 (27%), Gaps = 69/394 (17%)

Query: 38  WGEKGTPLEGFSAPRSW--QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNA 95
           WG+        S    W  Q+E +  +  + ++                GR +GK+ + +
Sbjct: 20  WGQAYLYNRDGSGRDYWPHQVEDLRCLARNIIHLD--------------GRDVGKSIVLS 65

Query: 96  WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYS 155
              L    T  G   +  A  +  L + +  E+   L   P+     M S+++       
Sbjct: 66  TDALHYAFTTRGGQGLIAAPHQGHLDSII-EEIEYQLDTNPD----LMNSIAVTKYGKPK 120

Query: 156 DVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL 215
                   ++  + S +    +    D+F   H      +  DE +   +     +   L
Sbjct: 121 IHRKPYFRLEFTNGSVLYFRPAGAYGDSFRSLHVGR---VWVDEGAWLTERAWKALRQCL 177

Query: 216 TERNANRFWIMTSNPRRL-SGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYG 274
                 R +   S P  L    +Y +     D +  F+  +             ++  YG
Sbjct: 178 KTGGILRIY---STPNGLRDTTYYRL--TSSDQFHVFRWPSWLNPLWTEDRESELLEFYG 232

Query: 275 -LDSDVTRVEVCGQFPQQDIDSF-----------------IPLNIIE--------EALNR 308
             DS   + EV G+  +    +F                 I +   E         A +R
Sbjct: 233 GRDSSGWQHEVAGEHGKPSYGAFNVEQFNLCRQDLLEYQKIVITDSELRDCDTEEAAHDR 292

Query: 309 -----EPCPDPYAPLIMGCDIAEEGGDNTVVVL-------RRGPVIEHLFDWSKTDLRTT 356
                   P      I G D+        ++V        R    +              
Sbjct: 293 MEMLLNLTPRSGQFWIGG-DLGYTNDPTEIIVFQEMEVGERTLLKMILRVHLEHVSYPHI 351

Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390
               + L   Y P  I +D    G      L  L
Sbjct: 352 AQIFALLERYYTPAGIGVDNGGNGLAVVQELLTL 385


>gi|159044464|ref|YP_001533258.1| hypothetical protein Dshi_1915 [Dinoroseobacter shibae DFL 12]
 gi|157912224|gb|ABV93657.1| hypothetical protein Dshi_1915 [Dinoroseobacter shibae DFL 12]
          Length = 260

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 52/294 (17%), Positives = 89/294 (30%), Gaps = 62/294 (21%)

Query: 35  FFPWGE-------KGTPLEGFSA--PRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAG 85
             PW E         + L  +    P  WQ+E                       A+  G
Sbjct: 6   LIPWAEDLERRLDPVSRLTHWMGHAPDPWQVEAFTTRATE--------------VALRVG 51

Query: 86  RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145
           R  GKT++ A   +  +   P    +C+A +E Q K  +  E+ + L             
Sbjct: 52  RQSGKTSVLAARAVEELHV-PESLTLCVAPAERQAK-IIAREIGRQLQRTS--------- 100

Query: 146 LSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERP-DTFVGHHNTYGMAIINDE----- 199
                      ++           +   R  +     DT  G        +I DE     
Sbjct: 101 -----------LVINRPTQTELEIANGARVIALPSTSDTIRGF--PAVSCLIIDECAFLQ 147

Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQIDTRT 257
             G  + +   +L  LTE         +S P   +  F  +F   KP D   R  +    
Sbjct: 148 GDGGGEDLISSVLPMLTEDGQ---VFFSSTPAGKNNYFARLFLDAKPGDGIHRIVVRGTD 204

Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
           +  +          R  L +   R E+  +       ++  L+IIE+A ++   
Sbjct: 205 IPRLADKVERM---RRTLSATKFRQEILVEMLADGQ-AYFDLSIIEQATSKTEK 254


>gi|323527775|ref|YP_004229928.1| bacteriophage terminase ATPase subunit [Burkholderia sp. CCGE1001]
 gi|323384777|gb|ADX56868.1| bacteriophage terminase, ATPase subunit [Burkholderia sp. CCGE1001]
          Length = 588

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 25/170 (14%), Positives = 47/170 (27%), Gaps = 21/170 (12%)

Query: 247 DWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305
            W++   ++     G +    + +   Y    +     +  QF      S   L  ++  
Sbjct: 330 QWRQIVTVEDAARAGCNLFNLDELRREY--SDEEYANLLMCQFIDDTA-SIFTLANLQRC 386

Query: 306 LN-----------REPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGP--VIEHLFDW 348
           +                P  + P+ +G D A  G     VV    +  G    +     W
Sbjct: 387 MVDSWELWADYKPLAARPFAWHPVWVGYDPALSGDSAGCVVVAPPMVEGGPFRVLEKHQW 446

Query: 349 SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVL 398
              D       I  + E+Y    + ID    G      ++     V    
Sbjct: 447 RGLDFEAQAQSIKEITERYNVAYMAIDTTGIGQGVYQLVKQFYPRVVAFN 496


>gi|103487487|ref|YP_617048.1| hypothetical protein Sala_2004 [Sphingopyxis alaskensis RB2256]
 gi|98977564|gb|ABF53715.1| protein of unknown function DUF264 [Sphingopyxis alaskensis RB2256]
          Length = 436

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 66/407 (16%), Positives = 124/407 (30%), Gaps = 56/407 (13%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           AGRG GKT   A  V     T PG  +  +A              +  L         E 
Sbjct: 56  AGRGFGKTRTGAEWVRAFAETTPGARIALVA--------------ASLLEARQVMVEGES 101

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
             L++ P     +    SL   +     +   YS   PD+  G  +    A   DE +  
Sbjct: 102 GLLAIAPDHLRPEY-ESSLRRLTWPNGAVATLYSAVEPDSLRGPEHD---AAWCDEIAKW 157

Query: 204 P--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGI 261
           P  +     ++  +    A    + T+ PR +         +                 +
Sbjct: 158 PKGEAAWDNLMLTM-RIGARPQVVATTTPRCV--PLVRRLIQERGVATTRGRTASNRRNL 214

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMG 321
              +   + A YG    + R E+ G+  +   D+     +IE           +A +++G
Sbjct: 215 SVQWLATMDAIYGGT-RLGRQELDGELLEDVEDALWTRALIERCRVDAGSIGKFARVVIG 273

Query: 322 CD-IAEEGGDNTVVV----LRRGPVIEHLFDWS-KTDLRTTNNKISGLVEKYRPDAIIID 375
            D  A  GGD   +V    LR G +       + +         ++    ++  + ++ +
Sbjct: 274 VDPPASAGGDACGIVVAALLRDGRLAVVEDASALRPLPGVWAQAVAAAAARWGAERVVAE 333

Query: 376 ANNTGARTCDYLEM--LGYHVYRVLGQKRAVDLEFCRNRRTE---LHVKMADWLEFASLI 430
           +N  G      L    +   V  +              RR E   L  +    +   +  
Sbjct: 334 SNMGGDMVAAVLRQADMTLPVVAIHASVGKA-------RRAEPVALAYERGQVVHAGAFA 386

Query: 431 NHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
           +    +  L+    +  P               +S D +D  ++  A
Sbjct: 387 DLEDQLCGLQMGGGYAGP--------------GRSPDRADACVWALA 419


>gi|254485756|ref|ZP_05098961.1| phage DNA Packaging Protein [Roseobacter sp. GAI101]
 gi|214042625|gb|EEB83263.1| phage DNA Packaging Protein [Roseobacter sp. GAI101]
          Length = 452

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 62/426 (14%), Positives = 118/426 (27%), Gaps = 79/426 (18%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGIS---------VICLANSETQLKTTLWAEVSKWL 132
           I  GRG GKT   A    W+ S   G           V  +  +  Q++  +        
Sbjct: 60  IMGGRGAGKTRAGA---EWVRSMVEGARPLDAGRCRRVALVGETIEQVREVM-------- 108

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
             +         S +     W +                +   ++   P+   G      
Sbjct: 109 --IFGDSGILACSPADRRPDWEATRKRLVWPN-----GAVASVHTAHDPEGLRGPQFD-- 159

Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250
            A   DE +     +     +   L     +    +T+ PR +     ++  K L     
Sbjct: 160 -AAWVDELAKWKKAEETWDQLQFAL-RLGEDPRVCVTTTPRNV-----DVLKKLLASPST 212

Query: 251 FQIDTRT---VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
                 T      +  SF E + ARY   + + R E+ G        +     ++E    
Sbjct: 213 VTTHAPTEANAANLAGSFLEEVRARY-RGTRLGRQELDGVLLADAEGALWTSEMLER--G 269

Query: 308 REPCPDPYAPLIMGCDIA---EEGGDNTVVVLRRGPVIEHLFDWS----------KTDLR 354
           R      +  +++G D A     G D   +V+          +W                
Sbjct: 270 RIEKLPTFDRIVVGVDPATTAGAGSDECGIVVVGAQTQGAPQNWRAVVLADCTAQGATPS 329

Query: 355 TTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNR 412
                    +E+Y  D ++++ N  G    + L  +     +  V   +  V        
Sbjct: 330 GWARAAVSAMEQYGADRLVVETNQGGLMVGEVLRQIDPLVPLKSVHASRGKV-------A 382

Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGL 472
           R E    + +      +     L   +  + +     +G             S D  D L
Sbjct: 383 RAEPVAALYEQGRVGHVAGLVALEDQMCRMTARGFEGSG-------------SPDRVDAL 429

Query: 473 MYTFAE 478
           ++   E
Sbjct: 430 VWALHE 435


>gi|24112089|ref|NP_706599.1| putative bacteriophage protein [Shigella flexneri 2a str. 301]
 gi|30062202|ref|NP_836373.1| putative bacteriophage protein [Shigella flexneri 2a str. 2457T]
 gi|24050918|gb|AAN42306.1| putative bacteriophage protein [Shigella flexneri 2a str. 301]
 gi|30040447|gb|AAP16179.1| putative bacteriophage protein [Shigella flexneri 2a str. 2457T]
 gi|281600053|gb|ADA73037.1| putative bacteriophage protein [Shigella flexneri 2002017]
 gi|332768291|gb|EGJ98476.1| hypothetical protein SF293071_0835 [Shigella flexneri 2930-71]
          Length = 179

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 25/150 (16%), Positives = 49/150 (32%), Gaps = 27/150 (18%)

Query: 295 SFIPLNIIEEALN--REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SK 350
           + I L+ IE A++  +    +P     +G D+A+ G D    V R G V+    +W   +
Sbjct: 10  AIIKLSWIEAAVDAHKTLNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKE 69

Query: 351 TDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG------------YHVYRVL 398
            +L  +  +      +   D I+ D+   GA        +              +  R  
Sbjct: 70  DELLKSCQRTYQAALEREAD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFN 128

Query: 399 GQ----------KRAVDLEFCRNRRTELHV 418
                           + +F  N + +   
Sbjct: 129 AGAGVHEPDDEYNGIPNKDFFANLKAQAWW 158


>gi|75758280|ref|ZP_00738405.1| Hypothetical protein RBTH_06375 [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|74494334|gb|EAO57425.1| Hypothetical protein RBTH_06375 [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
          Length = 660

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 62/426 (14%), Positives = 117/426 (27%), Gaps = 83/426 (19%)

Query: 92  TLNAWLVLWLMSTRPGISV------ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145
           T+ A ++    +   G  +      +     + Q +  ++ ++    + + N    +   
Sbjct: 148 TMCAHMLWVAFTCNGGTRMAKGAACVVATPYDNQAR-LIFDQLK---TFIDNNPVLQESI 203

Query: 146 LSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD 205
            S+   P+   V+    G   + ++   R+ SE    +  G    +   +  DE     D
Sbjct: 204 KSITKNPY---VIEFKNGSVIRLFTAGTRSGSE--GGSLRGQRADW---LYMDEVDYMGD 255

Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPL-------------------- 245
                I   + E       ++ S P    G FY+   +                      
Sbjct: 256 KDFESIFAIVNEAPDRIGCMIASTPTGRRGMFYKTCTQMKLNQDVKMNKNNVYDMRSYNR 315

Query: 246 ---DDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
              + W  F   T       P     +   +         EV  +F  + +  F   + +
Sbjct: 316 TLSEGWAEFYFPTMVNPEWGPKMERELRKLFSE--AAYEHEVLAEFGTEMVGVF-NKDYV 372

Query: 303 EEA----LNREPCPDPYAPLIMGCDIAEEGGDNTVVVL---------------------- 336
           +EA     N    P    P+ +G D  + G    +VV                       
Sbjct: 373 DEASSIGYNYTTSPTHDGPIAIGIDWDKAGAATQIVVTQYNPFEVRRPRPELGETEPSFG 432

Query: 337 RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM-LGYHVY 395
           R   +        +        KI  L   Y P  I  DA   G    + L   LG  V 
Sbjct: 433 RFQIINRIEIPKGEFTYDIAVKKIIELDGVYNPFGIYADA-GAGEYQIELLRKTLGDKVK 491

Query: 396 RVLGQKRAV--DLE----FCRNRRTELHVKMADWLEFASL-----INHSGLIQNLKSLKS 444
           RV      +  D        +  +  +  +    LE   L          L + + + + 
Sbjct: 492 RVHLGSSQMVRDPHSREFEKKPLKAFIVDQTKLMLERGQLRIPHREKDETLARQMTNYQV 551

Query: 445 FIVPNT 450
                 
Sbjct: 552 TRYSPK 557


>gi|17975126|ref|NP_536648.1| putative terminase, ATPase subunit [Vibrio phage K139]
 gi|153820795|ref|ZP_01973462.1| terminase [Vibrio cholerae B33]
 gi|165970256|ref|YP_001650887.1| putative terminase ATPase subunit [Vibrio phage kappa]
 gi|229512054|ref|ZP_04401533.1| hypothetical protein VCE_003464 [Vibrio cholerae B33]
 gi|229519190|ref|ZP_04408633.1| hypothetical protein VCC_003218 [Vibrio cholerae RC9]
 gi|229607255|ref|YP_002877903.1| hypothetical protein VCD_002166 [Vibrio cholerae MJ-1236]
 gi|254849294|ref|ZP_05238644.1| terminase [Vibrio cholerae MO10]
 gi|17865408|gb|AAL47515.1|AF125163_21 orf16 [Vibrio phage K139]
 gi|126521587|gb|EAZ78810.1| terminase [Vibrio cholerae B33]
 gi|165292233|dbj|BAF98815.1| putative terminase ATPase subunit [Vibrio phage kappa]
 gi|229343879|gb|EEO08854.1| hypothetical protein VCC_003218 [Vibrio cholerae RC9]
 gi|229352019|gb|EEO16960.1| hypothetical protein VCE_003464 [Vibrio cholerae B33]
 gi|229369910|gb|ACQ60333.1| hypothetical protein VCD_002166 [Vibrio cholerae MJ-1236]
 gi|254844999|gb|EET23413.1| terminase [Vibrio cholerae MO10]
          Length = 605

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 30/181 (16%), Positives = 48/181 (26%), Gaps = 23/181 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    I+     G D    E +   Y              F      S    N I
Sbjct: 345 PDKQWRYVVTIEDAAKGGCDLFDIEELREEYSETD--FNNLFMCVFVDGAS-SIFEFNKI 401

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV-------VVLRRGPVIEHL 345
           E  +        Y P          + +G D +    DN V       +V      +   
Sbjct: 402 ERCMVDSDIWQDYKPNAARPFGSREVWLGYDPSRT-RDNAVLMVVAPPIVAVEKFRVLEK 460

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAV 404
             W     +   ++IS + E++    + ID    GA   D L          +       
Sbjct: 461 HTWRGLSFQHQASEISKVFERFNVTYLGIDITGIGAGVHDLLVNKHPRETVAIHYSNENK 520

Query: 405 D 405
           +
Sbjct: 521 N 521


>gi|118475162|ref|YP_891824.1| hypothetical protein CFF8240_0649 [Campylobacter fetus subsp. fetus
           82-40]
 gi|261886523|ref|ZP_06010562.1| hypothetical protein CfetvA_16765 [Campylobacter fetus subsp.
           venerealis str. Azul-94]
 gi|118414388|gb|ABK82808.1| hypothetical protein CFF8240_0649 [Campylobacter fetus subsp. fetus
           82-40]
          Length = 523

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 54/344 (15%), Positives = 108/344 (31%), Gaps = 38/344 (11%)

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYS----EERPDTFVGHHNTYGMAIINDEASGTP 204
           +   W  +    S   DS+H   +              T  G        I  DE +  P
Sbjct: 183 YMRHWAKEYG-ISFKKDSEHEVVLENGAYIKSFANNFRTVQGFAGD----IWMDEFAWYP 237

Query: 205 DV--INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP--LDDWKRFQIDTRTVEG 260
           +   I    +  +          + S P      F++I++       +KRF +       
Sbjct: 238 NPKRIWHAFVPSI--GAIKGRLTILSTPFEERSLFHQIYSDKTKFHMFKRFCVSIYKAIE 295

Query: 261 IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP---CPDPYAP 317
               F    +     D+D        QF   +  S + +++I+  ++ +     P     
Sbjct: 296 DGLDFDLETMRDL-FDTDTWASAYECQFVDDES-SLLSISLIKSCVDNKAHYFTPKSSEC 353

Query: 318 LIMGCDIAEEGGDNTV--VVLRRGPVIEHLFDW-SKTDLRTTNNKISGLVEKYRPDAIII 374
           +  G D+      +T+  VVL  G     L D  +K         ++  ++ Y    + I
Sbjct: 354 IYAGYDVGRVSDRSTLAGVVLENGVYKTALMDILAKARFEEQKEHLTSFLKTYPISVLKI 413

Query: 375 DANNTGARTCDYL-EMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE--FASLIN 431
           D    G    + + +     V  V       +         E+ + +    E     + N
Sbjct: 414 DKTGIGMNLAENMHDKFKSRVSGVWFSNTRKE---------EMALNLKKAFEDKLIKIPN 464

Query: 432 HSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
              LI ++ ++K  I   + +   ++KR +   + D    L   
Sbjct: 465 DPLLIADIHAIKRTIGAKSFKY--DAKRNEYGHA-DRFWALALA 505


>gi|83643297|ref|YP_431732.1| Mu-like prophage FluMu protein gp28 [Hahella chejuensis KCTC 2396]
 gi|83631340|gb|ABC27307.1| Mu-like prophage FluMu protein gp28 [Hahella chejuensis KCTC 2396]
          Length = 581

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 72/391 (18%), Positives = 124/391 (31%), Gaps = 83/391 (21%)

Query: 88  IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147
           IG T   AW       T     +   A S +Q      A++ K   L   + +F+ +   
Sbjct: 168 IGATYYFAWEAFQDAITSGDNQIFLSA-SRSQ------ADIFKAYILKFAREYFDTELKG 220

Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 207
           +       DV+  S G + +  ST  RT          G+H      +  DE    PD  
Sbjct: 221 V-------DVIPLSNGAELRFVSTNGRTA--------QGYHGH----LYIDEVFWIPD-- 259

Query: 208 NLGILGFLTERNANRFW--IMTSNPRRLSGKFYEIF----------NK------------ 243
              +    +   A++ W     S P   S   Y ++          +K            
Sbjct: 260 FDRLNKLASGMAAHKKWRKTYFSTPSVKSHGAYTLWSGERYNESRRHKVEFDLSRAALRE 319

Query: 244 ----PLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298
               P   W+    ++    +G D    + +   Y              F +  +  F  
Sbjct: 320 GQLGPDKVWRNVVTVEDAANQGCDLFDIDELKQEY--TEAEFNNLFMCAFMEAGLSVFKL 377

Query: 299 LNIIEEAL----------NREPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEH 344
            +++  A+           R+P P    P+ +G D A  G  +TVVV    +        
Sbjct: 378 DDLLSCAVCSSDVWPDFKPRQPRPFANYPVWLGYDPARTGDRSTVVVVAPPMHPAGKFRV 437

Query: 345 LFDWS-KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRA 403
           L     K       N+I  L+ +Y    + ID    G    + ++             RA
Sbjct: 438 LEKIQLKGAFSYQANRIKDLLGRYNVQFVGIDCTGPGLGVFEQVKA---------FYPRA 488

Query: 404 VDLEFCRNRRTELHVKMADWLEFASLINHSG 434
             + +  N +T L +K  D +E A +   + 
Sbjct: 489 TPIHYSLNAKTALVLKAMDVIENARIEWDAE 519


>gi|12276099|gb|AAG50261.1|AF311654_1 probable terminase [Phage GMSE-1]
          Length = 268

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 33/202 (16%), Positives = 64/202 (31%), Gaps = 27/202 (13%)

Query: 267 EGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP--------- 317
           E + ++Y   +    +    QF     DS      +E           + P         
Sbjct: 35  ERLRSKY--PARYFNMLYQCQFVDSG-DSVFSFGDLERCGVETVRWQDHQPNAARPFGNR 91

Query: 318 -LIMGCDIAEEGGDNTVVVLR----RGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPD 370
            +  G D A  G  +T V++      G     L    W   + +   ++I  L  +Y   
Sbjct: 92  EVWAGFDPARSGDTSTFVIMAPPQYEGERFRVLVTFYWQGMNWKYQASQIKALFARYHMT 151

Query: 371 AIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI 430
            I ID    G+          + +      ++ V + +    +T + +KM D +E   + 
Sbjct: 152 HIGIDTTGIGSGV--------FEMVEAFAPRQTVAIRYGVETKTRMVLKMVDLVESKRIE 203

Query: 431 NHSGLIQNLKSLKSFIVPNTGE 452
                 +   S  S    +T +
Sbjct: 204 WDGEQKEIAASFLSIRRTSTAK 225


>gi|3337256|gb|AAC34148.1| terminase subunit [Enterobacteria phage 186]
          Length = 589

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 33/201 (16%), Positives = 60/201 (29%), Gaps = 31/201 (15%)

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY-----------APLIM 320
           +     +  +     +F      S  P   ++  +                     P+ +
Sbjct: 355 KRENSDEDFKNLFMCEFVDDKA-SVFPFEELQRCMVDVMEEWEDFAPFADHPFGSRPVWI 413

Query: 321 GCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIII 374
           G D +  G     VVL      G     L    W   D     + I  L EKY  + I I
Sbjct: 414 GYDPSHTGDSAGCVVLAPPVVSGGKFRMLERHQWKGMDFAAQADGIRKLTEKYSVEYIGI 473

Query: 375 DANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434
           DA   G      +               A  + +    +T + +K  D +    L   +G
Sbjct: 474 DATGLGLGVFQLVR---------SFYPAARGIRYTPEMKTAMVLKAKDTIRRGCLEYDAG 524

Query: 435 ---LIQNLKSLKSFIVPNTGE 452
              + Q+  S++   + ++G 
Sbjct: 525 ATDVTQSFMSIRK-TMTSSGR 544


>gi|83649379|ref|YP_437814.1| Mu-like prophage FluMu protein gp28 [Hahella chejuensis KCTC 2396]
 gi|83637422|gb|ABC33389.1| Mu-like prophage FluMu protein gp28 [Hahella chejuensis KCTC 2396]
          Length = 581

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 72/391 (18%), Positives = 124/391 (31%), Gaps = 83/391 (21%)

Query: 88  IGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 147
           IG T   AW       T     +   A S +Q      A++ K   L   + +F+ +   
Sbjct: 168 IGATYYFAWEAFQDAITSGDNQIFLSA-SRSQ------ADIFKAYILKFAREYFDTELKG 220

Query: 148 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 207
           +       DV+  S G + +  ST  RT          G+H      +  DE    PD  
Sbjct: 221 V-------DVIPLSNGAELRFVSTNGRTA--------QGYHGH----LYIDEVFWIPD-- 259

Query: 208 NLGILGFLTERNANRFW--IMTSNPRRLSGKFYEIF----------NK------------ 243
              +    +   A++ W     S P   S   Y ++          +K            
Sbjct: 260 FDRLNKLASGMAAHKKWRKTYFSTPSVKSHGAYTLWSGERYNESRRHKVEFDLSRAALRE 319

Query: 244 ----PLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298
               P   W+    ++    +G D    + +   Y              F +  +  F  
Sbjct: 320 GQLGPDKVWRNVVTVEDAANQGCDLFDIDELKQEY--TEAEFNNLFMCAFMEAGLSVFKL 377

Query: 299 LNIIEEAL----------NREPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEH 344
            +++  A+           R+P P    P+ +G D A  G  +TVVV    +        
Sbjct: 378 DDLLSCAVCSGEVWPDFKPRQPRPFANYPVWLGYDPARTGDRSTVVVVAPPMHPAGKFRV 437

Query: 345 LFDWS-KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRA 403
           L     K       N+I  L+ +Y    + ID    G    + ++             RA
Sbjct: 438 LEKIQLKGAFSYQANRIKDLLGRYNVQFVGIDCTGPGLGVFEQVKA---------FYPRA 488

Query: 404 VDLEFCRNRRTELHVKMADWLEFASLINHSG 434
             + +  N +T L +K  D +E A +   + 
Sbjct: 489 TPIHYSLNAKTALVLKAMDVIENARIEWDAE 519


>gi|237745794|ref|ZP_04576274.1| conserved hypothetical protein [Oxalobacter formigenes HOxBLS]
 gi|229377145|gb|EEO27236.1| conserved hypothetical protein [Oxalobacter formigenes HOxBLS]
          Length = 585

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 47/284 (16%), Positives = 77/284 (27%), Gaps = 62/284 (21%)

Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA--SGTPDVINLG 210
              D +  S G       T  RT          G+H         DE   +   + +N  
Sbjct: 216 LKGDPIVLSNGAHLYFLGTNARTA--------QGYHGN----FYFDEFFWTFKFEELNKV 263

Query: 211 ILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDDWKRFQIDTRTVEGIDPSF 265
             G    +   +     S P  +S + Y     + FNK     ++ +ID       +   
Sbjct: 264 ASGMALHKRWRKT--YFSTPSAMSHEAYPFWTGDAFNKRRRKEEQVRIDVSHKWLAEGRL 321

Query: 266 HEGIIAR-----------------------YGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
            E  I R                       Y    D     +   F      S  PL+ +
Sbjct: 322 CEDRIWRQIVTIEDAEKGGCDLFDIDELRNYEYSPDQFDNLLMCNFIDDTA-SVFPLSEL 380

Query: 303 EEALNR-----------EPCPDPYAPLIMGCDIAEEGGDNTVVVLRR------GPVIEHL 345
           +  +                P    P+ +G D +  G     VV+           I   
Sbjct: 381 QRCMVDSWEAWNDYKPFTARPFGNRPVWVGYDPSRSGDSAGCVVMAPPLTFPGKFRIIEK 440

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
             +   D      +I  + +KY  + I IDA   G    + +  
Sbjct: 441 HQYRGMDFAAQAEQIRQITQKYNVEYIGIDATGMGLGVYEIVRQ 484


>gi|139473519|ref|YP_001128235.1| phage terminase [Streptococcus pyogenes str. Manfredo]
 gi|134271766|emb|CAM29999.1| putative phage terminase [Streptococcus pyogenes str. Manfredo]
          Length = 471

 Score = 54.0 bits (128), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 53/347 (15%), Positives = 109/347 (31%), Gaps = 47/347 (13%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQ   +  + A   + +       +          GKT +   + LW +    G+ ++ 
Sbjct: 43  PWQENMLIPIMAVDEDGLWVHQKYGYAIPRRN----GKTEVVYIVELWAL--HKGLKILH 96

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
            A+  +      + +V K+L +     + + +    + A     +   S G   +  +  
Sbjct: 97  TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKSSGAVIQFRT-- 150

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232
            RT +    + F          +I DEA          +   +T+ + N   IM   P  
Sbjct: 151 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 200

Query: 233 --LSGKFYEIFNKP-------LDDWKRFQIDTRTVEGIDPSF------------HEGIIA 271
              +G  +E + K           W  + +D         S+               I A
Sbjct: 201 MVSTGTVFEAYRKDCLKGNKRYSGWAEWSVDEMQPIHDVKSWYIANPSMGFHLNERKIEA 260

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330
             G D     ++  G +P  +  S I      +  L  E  P+  + L +G    ++G +
Sbjct: 261 ELGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 318

Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
            +  +  R       +       +R     I   ++      ++ID 
Sbjct: 319 VSLSIAARTSENKVFVETIDCLSVRNGTQWIINFLKSADIAKVVIDG 365


>gi|82776052|ref|YP_402399.1| putative bacteriophage protein [Shigella dysenteriae Sd197]
 gi|81240200|gb|ABB60910.1| putative bacteriophage protein [Shigella dysenteriae Sd197]
          Length = 272

 Score = 54.0 bits (128), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 23/143 (16%), Positives = 51/143 (35%), Gaps = 5/143 (3%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-Q 252
            +  +EA    +     +   + +  +  ++    NP  ++   +  F     +     +
Sbjct: 131 VLWLEEAHALTEYQWKILEPTIRKEGSECWF--IFNPGLVTDFVWRNFVVDPPEGTLIRK 188

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN--REP 310
           I+      +  +  + I A    D D  +    G     D  + I L+ IE A++  +  
Sbjct: 189 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGVPESDDDAAIIKLSWIEAAVDAHKTL 248

Query: 311 CPDPYAPLIMGCDIAEEGGDNTV 333
             +P     +G D+A+ G D   
Sbjct: 249 NFEPSGRKRIGFDVADSGTDKCA 271


>gi|238920149|ref|YP_002933664.1| hypothetical protein NT01EI_2255 [Edwardsiella ictaluri 93-146]
 gi|238869718|gb|ACR69429.1| conserved hypothetical protein [Edwardsiella ictaluri 93-146]
          Length = 601

 Score = 54.0 bits (128), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 27/161 (16%), Positives = 53/161 (32%), Gaps = 18/161 (11%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF----IP 298
           P   W+    ++    +G + +  E +  RYG   +         F       F    + 
Sbjct: 334 PDKQWRYVVTMEDAIADGFNRADIEELRERYGE--NAFNRLYMCVFVDDKDSVFDFAKLV 391

Query: 299 LNIIEEALNREPCPDPYAP-----LIMGCDIAEEGGDNTVVVL------RRGPVIEHLFD 347
              ++  + ++  PD  AP     +  G D A  G + T VV+           +     
Sbjct: 392 RCGVDPHIWQDFHPDEAAPLGNREVWGGFDPARSGDNATFVVIAVPLLAVERFRVLEKHH 451

Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
           W     +   ++I  +  +Y    I ID    G    + ++
Sbjct: 452 WRGLSFQWMADQIRTIKSRYNMTHIGIDVTGIGYGVYELVQ 492


>gi|168699883|ref|ZP_02732160.1| hypothetical protein GobsU_10183 [Gemmata obscuriglobus UQM 2246]
          Length = 205

 Score = 54.0 bits (128), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 27/139 (19%), Positives = 48/139 (34%), Gaps = 17/139 (12%)

Query: 179 ERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFY 238
           +  +  VG        ++ DE S   D +   +   L    +    +  S P    G F+
Sbjct: 63  DSQEGVVGFSAPR--LVVIDEGSRVSDELYKSVRPMLAV--SKGQLLTLSTPFGNQGWFF 118

Query: 239 EIF----------NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF 288
           +I+          +K  + W+R  +    +  I P F E   A  G      + E   +F
Sbjct: 119 DIWDDSAEGLKRRSKLHEPWQRTAVPASQIPRITPEFLEDERAELGER--WFQQEYFLRF 176

Query: 289 PQQDIDSFIPLNIIEEALN 307
               ID+     +I  A +
Sbjct: 177 LD-SIDAVFSQAVIHGARS 194


>gi|56414686|ref|YP_151761.1| terminase subunit [Salmonella enterica subsp. enterica serovar
           Paratyphi A str. ATCC 9150]
 gi|197363613|ref|YP_002143250.1| terminase subunit [Salmonella enterica subsp. enterica serovar
           Paratyphi A str. AKU_12601]
 gi|56128943|gb|AAV78449.1| probable terminase subunit [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. ATCC 9150]
 gi|197095090|emb|CAR60636.1| probable terminase subunit [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. AKU_12601]
          Length = 588

 Score = 54.0 bits (128), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 26/139 (18%), Positives = 47/139 (33%), Gaps = 23/139 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            + +   Y    D  +  +  +F   D+ S  PL+ ++  +                P  
Sbjct: 348 LDQLRMEY--SPDEYQNLLMCEFID-DLASVFPLSELQACMVDSWEVWTDFQALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+    V+      G     L    W   D R   + I  L +
Sbjct: 405 WREVWIGYDSAKGTQNGDSAGCVVIAPPTVPGGKFRILERHQWRGMDFRAQADAIKKLTQ 464

Query: 366 KYRPDAIIIDANNTGARTC 384
           +Y    I ID+   G    
Sbjct: 465 QYNVTYIGIDSTGVGHGVY 483


>gi|254286518|ref|ZP_04961475.1| terminase [Vibrio cholerae AM-19226]
 gi|150423467|gb|EDN15411.1| terminase [Vibrio cholerae AM-19226]
          Length = 605

 Score = 54.0 bits (128), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 29/181 (16%), Positives = 48/181 (26%), Gaps = 23/181 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    I+     G D    + +   Y              F      S    N I
Sbjct: 345 PDKQWRYVVTIEDAAKGGCDLFDIDELREEYSETD--FNNLFMCVFVDGAS-SIFEFNKI 401

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV-------VVLRRGPVIEHL 345
           E  +        Y P          + +G D +    DN V       +V      +   
Sbjct: 402 ERCMVDSEIWQDYKPNAARPFGSREVWLGYDPSRT-RDNAVLMVVAPPIVAVEKFRVLEK 460

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAV 404
             W     +   ++IS + E++    + ID    GA   D L          +       
Sbjct: 461 HTWRGLSFQHQASEISKVFERFNVTYLGIDITGIGAGVHDLLVNKHPRETVAIHYSNENK 520

Query: 405 D 405
           +
Sbjct: 521 N 521


>gi|22536761|ref|NP_687612.1| hypothetical protein SAG0585 [Streptococcus agalactiae 2603V/R]
 gi|22533605|gb|AAM99484.1|AE014218_1 conserved hypothetical protein [Streptococcus agalactiae 2603V/R]
          Length = 471

 Score = 54.0 bits (128), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 57/346 (16%), Positives = 114/346 (32%), Gaps = 47/346 (13%)

Query: 54  WQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICL 113
           WQ   +  +    +N  N    + +  AI   R  GKT +   L LW +    G+ ++  
Sbjct: 44  WQENML--IPMMAINEDNLWVHQKYGYAIP--RRNGKTEVVYILELWAL--HKGLKILHT 97

Query: 114 ANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173
           A+  +   +  + +V K+L +     + + +    + A     +   S G   +  +   
Sbjct: 98  AHRISTSHS-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKSSGSVIQFRT--- 150

Query: 174 RTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR- 232
           RT +    + F          +I DEA          +   +T+ + N   IM   P   
Sbjct: 151 RTSNGGLGEGFD--------LLIIDEAQEYTAEQESALKYTVTDSD-NPMTIMCGTPPTM 201

Query: 233 -LSGKFYEIFNKP-------LDDWKRFQIDTRTVEGIDPSF------------HEGIIAR 272
             +G  +E + K           W  + +D         S+               I A 
Sbjct: 202 VSTGTVFESYRKECLKGDRRYSGWAEWSVDEMQPIHDVKSWYVANPSMGYHLNERKIEAE 261

Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN 331
            G D     ++  G +P  +  S I      +  L  E  P+  + L +G    ++G + 
Sbjct: 262 LGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNNV 319

Query: 332 T-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
           +  +  R       +       +R     I   ++      +++D 
Sbjct: 320 SLSIAARASENKVFVEAIDCLSVRNGTQWIINFLKSADIAKVVVDG 365


>gi|261248365|emb|CBG26202.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Typhimurium str. D23580]
          Length = 589

 Score = 54.0 bits (128), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 36/201 (17%), Positives = 62/201 (30%), Gaps = 31/201 (15%)

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL----NREPCPDPYA-------PLIM 320
           R     +  +     +F      S  P   ++  +           P+A       P+ +
Sbjct: 355 RRENSDEDFKNLFMCEFVDDKA-SVFPFEELQRCMVDVMETWEDFAPFADHPFGSRPVWI 413

Query: 321 GCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIII 374
           G D +  G     VVL      G     L    W   D       I  L EKY  + I I
Sbjct: 414 GYDPSHTGDSAGCVVLAPPVVSGGKFRMLERHQWKGMDFAAQAEGIRRLTEKYNVEYIGI 473

Query: 375 DANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434
           DA   G      +               A  + +    +T + +K  D +    L   +G
Sbjct: 474 DATGLGLGVFQLVR---------SFYPAARGIRYTPEMKTAMVLKAKDTIRRGCLEYDAG 524

Query: 435 ---LIQNLKSLKSFIVPNTGE 452
              + Q+  S++   + ++G 
Sbjct: 525 ATDVTQSFMSIRK-TMTSSGR 544


>gi|41057355|ref|NP_958058.1| gp3 [Enterobacteria phage PsP3]
 gi|37548561|gb|AAN08365.1| gp3 [Enterobacteria phage PsP3]
          Length = 589

 Score = 54.0 bits (128), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 36/201 (17%), Positives = 62/201 (30%), Gaps = 31/201 (15%)

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL----NREPCPDPYA-------PLIM 320
           R     +  +     +F      S  P   ++  +           P+A       P+ +
Sbjct: 355 RRENSDEDFKNLFMCEFVDDKA-SVFPFEELQRCMVDVMETWEDFAPFADHPFGSRPVWI 413

Query: 321 GCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIII 374
           G D +  G     VVL      G     L    W   D       I  L EKY  + I I
Sbjct: 414 GYDPSHTGDSAGCVVLAPPVVSGGKFRMLERHQWKGMDFAAQAEGIRRLTEKYNVEYIGI 473

Query: 375 DANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434
           DA   G      +               A  + +    +T + +K  D +    L   +G
Sbjct: 474 DATGLGLGVFQLVR---------SFYPAARGIRYTPEMKTAMVLKAKDTIRRGCLEYDAG 524

Query: 435 ---LIQNLKSLKSFIVPNTGE 452
              + Q+  S++   + ++G 
Sbjct: 525 ATDVTQSFMSIRK-TMTSSGR 544


>gi|167991605|ref|ZP_02572704.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar 4,[5],12:i:- str. CVM23701]
 gi|205329999|gb|EDZ16763.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar 4,[5],12:i:- str. CVM23701]
          Length = 590

 Score = 54.0 bits (128), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 36/201 (17%), Positives = 62/201 (30%), Gaps = 31/201 (15%)

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL----NREPCPDPYA-------PLIM 320
           R     +  +     +F      S  P   ++  +           P+A       P+ +
Sbjct: 355 RRENSDEDFKNLFMCEFVDDKA-SVFPFEELQRCMVDVMETWEDFTPFADHPFGSRPVWI 413

Query: 321 GCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIII 374
           G D +  G     VVL      G     L    W   D       I  L EKY  + I I
Sbjct: 414 GYDPSHTGDSAGCVVLAPPVVSGGKFRMLERHQWKGMDFAAQAEGIRRLTEKYNVEYIGI 473

Query: 375 DANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434
           DA   G      +               A  + +    +T + +K  D +    L   +G
Sbjct: 474 DATGLGLGVFQLVR---------SFYPAARGIRYTPEMKTAMVLKAKDTIRRGCLEYDAG 524

Query: 435 ---LIQNLKSLKSFIVPNTGE 452
              + Q+  S++   + ++G 
Sbjct: 525 ATDVTQSFMSIRK-TMTSSGR 544


>gi|153816772|ref|ZP_01969439.1| terminase [Vibrio cholerae NCTC 8457]
 gi|126512575|gb|EAZ75169.1| terminase [Vibrio cholerae NCTC 8457]
          Length = 605

 Score = 54.0 bits (128), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 30/181 (16%), Positives = 48/181 (26%), Gaps = 23/181 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    I+     G D    E +   Y              F      S    N I
Sbjct: 345 PDRQWRYVVTIEDAAKCGCDLFDIEELREEYSETD--FNNLFMCVFVDGAS-SIFEFNKI 401

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV-------VVLRRGPVIEHL 345
           E  +        Y P          + +G D +    DN V       +V      +   
Sbjct: 402 ERCMVDSEIWQDYKPNAARPFGSREVWLGYDPSRT-RDNAVLMVVAPPIVAVEKFRVLEK 460

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAV 404
             W     +   ++IS + E++    + ID    GA   D L          +       
Sbjct: 461 HTWRGLSFQHQASEISKVFERFNVTYLGIDITGIGAGVHDLLVNKHPRETVAIHYSNENK 520

Query: 405 D 405
           +
Sbjct: 521 N 521


>gi|168262802|ref|ZP_02684775.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar Hadar str. RI_05P066]
 gi|205348497|gb|EDZ35128.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar Hadar str. RI_05P066]
          Length = 591

 Score = 54.0 bits (128), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 25/168 (14%), Positives = 51/168 (30%), Gaps = 20/168 (11%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  + +  RY  ++    +     F     DS    + +
Sbjct: 328 PDGQWRYVITMEDAIAGGFNLANIDKLRNRY--NTATFNMLYMCVFVDSK-DSVFSFSDL 384

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346
           E           + P          +  G D A  G  +  V++           +  + 
Sbjct: 385 EACGVEVDTWQDHNPDAARPFGDRPVWGGFDPARSGDLSCFVIVAPPMFAVEKFRVLKVI 444

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHV 394
            W   + R    +I  L +KY    + +D    G    D ++     V
Sbjct: 445 YWKGMNFRYQAKQIEQLFKKYNFTYLGVDVTGIGQGIFDNIQHFAMRV 492


>gi|218781804|ref|YP_002433122.1| hypothetical protein Dalk_3968 [Desulfatibacillum alkenivorans
           AK-01]
 gi|218763188|gb|ACL05654.1| protein of unknown function DUF264 [Desulfatibacillum alkenivorans
           AK-01]
          Length = 443

 Score = 54.0 bits (128), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 49/334 (14%), Positives = 99/334 (29%), Gaps = 46/334 (13%)

Query: 79  KGAISAGRG-IGKTTLNAWLVLWLMSTR---PGISVICLANSETQLKTTLWAEVSKWLSL 134
           + ++       GKT + A   L + + +   P      +A    Q K+ +W  + K+   
Sbjct: 37  RFSVLVCHRRFGKT-VAAVNELIMKACQNPLPAPRYAYIAPLYKQAKSVVWDYLKKFAG- 94

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
                   +   + H      D+ +                   + PD   G +      
Sbjct: 95  -------AINGTTFHETELRCDLPN----------GARITLLGADNPDRLRGIYLDGA-- 135

Query: 195 IINDEASGTPDVIN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEI--FNKPLDDWKRF 251
            + DE +  P+ +    I   L++R     +     PR  +  FY++  F +   DW   
Sbjct: 136 -VLDEMAQMPERVWGEIIRPALSDRLGWAMF--IGTPRGHNA-FYDLYQFARSDPDWFCA 191

Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
                    +     +   A+  +  +    E    F    + ++    +I +A      
Sbjct: 192 MYRASETGIVGRDELDA--AKKEMTPEQYEQEFECSFSAAIVGAYYGP-LIAQAEKEGRI 248

Query: 312 PDPYAPLIMGCDIAEEGG--DNTVVVLRR---GPVIEHLF--DWSKTDLRTTNNKISGLV 364
                   +    A + G  D+T V   +   G  I  +   + +   L      +    
Sbjct: 249 VTLPVERALPVHTAWDLGMSDSTAVWFFQVSPGGEIRVVDYLEDAGQGLDYYVRALRERD 308

Query: 365 EKYR----PDAIIIDANNTGARTCDYLEMLGYHV 394
             Y     P  I +    TG    +  + LG   
Sbjct: 309 YLYGTHLAPHDIRVRELGTGKSRLESAKSLGVSF 342


>gi|163792602|ref|ZP_02186579.1| hypothetical protein BAL199_17183 [alpha proteobacterium BAL199]
 gi|159182307|gb|EDP66816.1| hypothetical protein BAL199_17183 [alpha proteobacterium BAL199]
          Length = 422

 Score = 54.0 bits (128), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 74/421 (17%), Positives = 124/421 (29%), Gaps = 67/421 (15%)

Query: 82  ISAGRGIGKTTLNA-WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140
           I AGRG GKT   A W+     S R    +  +A +    +  +                
Sbjct: 45  ILAGRGFGKTRTGAEWVRGLAESGR-ARRIALVAETAADARDVM---------------I 88

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE- 199
                L    APW       S    +     +  ++S + PD   G       A   DE 
Sbjct: 89  EGESGLLACCAPWGRPKYEPSKRRVTWPNGAIATSFSADDPDQLRGPQFD---AAWADEI 145

Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE 259
           A    +     ++  L    A+   + T+ P +       +   P        + TR   
Sbjct: 146 AKWRYEAAWDNLMLGL-RLGADPRCVATTTP-KPRAWLARLMADPG------TVVTRGAT 197

Query: 260 GID-----PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP 314
             +     P F + I+ARY   + + R E+ G+F  +   +     +IE A         
Sbjct: 198 RENAGNLAPGFLDQILARY-AGTRLGRQEIDGEFLTEIPGALWTRTLIEGARALPGAVPG 256

Query: 315 YAPLIMGCDIA---EEGGDNTVVVLRRGPVIEHLFDWSKTDLRTT----NNKISGLVEKY 367
            A +I+  D A       D T +V+         +       R +      + +    ++
Sbjct: 257 LARIIVAVDPAVTSGSDSDETGIVVAGVDGEGRFWVLEDLSGRMSPDLWARRSADAYRRH 316

Query: 368 RPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFA 427
             DA++ + N  G      L  +   +       RAV     +  R E    + +     
Sbjct: 317 HADAVVCEVNQGGDLVVATLRTVDGSL-----PVRAVRATRGKRLRAEPVAALYEQGRVR 371

Query: 428 SLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLM-----YTFAENPPR 482
                  L   +                      G  S D  D L+       F   P R
Sbjct: 372 HAGPFPELEDQMAG-----FTGAS----------GDASPDRLDALVWALTDLAFDRPPAR 416

Query: 483 S 483
           S
Sbjct: 417 S 417


>gi|192289100|ref|YP_001989705.1| hypothetical protein Rpal_0670 [Rhodopseudomonas palustris TIE-1]
 gi|192282849|gb|ACE99229.1| protein of unknown function DUF264 [Rhodopseudomonas palustris
           TIE-1]
          Length = 441

 Score = 53.6 bits (127), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 45/235 (19%), Positives = 75/235 (31%), Gaps = 19/235 (8%)

Query: 175 TYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRR 232
           T     PDT  G  +     +  DE +   D   I   +   ++  + N     T N   
Sbjct: 114 TALPANPDTARGFSSN----VFLDEFAFHKDSREIWKALFPVISAGH-NLRVTSTGN--G 166

Query: 233 LSGKFYEIFNKPLDDWKRFQIDTR-TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQ 291
              KFYE+     D W R  +D    VE   P   + + A    D D    E   ++  +
Sbjct: 167 KDNKFYELATGKDDVWSRHFVDIYKAVEDGLPRNIDELKAGI-NDDDAWAQEYELKWLDE 225

Query: 292 DID--SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHL 345
                S+  +   E+    +P      P  +G DI     D  V+     +         
Sbjct: 226 ASAWLSYDLITACEDPRAGDPSGYRNNPCFVGRDIGRRN-DLHVIWVWELIGDVLWERER 284

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE-MLGYHVYRVLG 399
            +  +      +     ++ +YR     ID    G +  +  +   G  V  VL 
Sbjct: 285 IEQKRATFAAMDAAFDDVMTRYRVARACIDQTGMGEKVVEDAQAKWGSVVEGVLF 339


>gi|282599774|ref|ZP_05971828.2| terminase, ATPase subunit [Providencia rustigianii DSM 4541]
 gi|282567779|gb|EFB73314.1| terminase, ATPase subunit [Providencia rustigianii DSM 4541]
          Length = 594

 Score = 53.6 bits (127), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 40/263 (15%), Positives = 80/263 (30%), Gaps = 31/263 (11%)

Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWK-RFQIDTRTVEGIDPS 264
               G        +  R  +       L     +    P   W+    ++     G + +
Sbjct: 302 PFWTGDE--WRGSDPARKKVKFPQFDELRDGGRDC---PDGQWRYVITLEDAIKGGFNLA 356

Query: 265 FHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF---------IPLNIIEEALNREPCPDPY 315
             E +  +Y  DS    +     F       F         + +++ E+     P P   
Sbjct: 357 SIERLRNKYNPDS--FNMLFMCVFVDSGASVFKYHQLDKCGVDVHLWEDHNPDAPRPFGD 414

Query: 316 APLIMGCDIAEEGGDNTVVVLR------RGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369
             +  G D A  G  +T  ++           +  +F W   + +     I  L ++YR 
Sbjct: 415 REVWGGFDPARSGDTSTFAIVAPPMMAPEVFRVLAIFYWQGMNWKHQAKLIEDLTKRYRF 474

Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL 429
             I ID    G    + ++            ++   + + +  + +L +KM D +    L
Sbjct: 475 TYIGIDTTGIGHGVYEMVQD--------FAPRQTHSIHYSQQTKNQLVMKMIDVVSEERL 526

Query: 430 INHSGLIQNLKSLKSFIVPNTGE 452
                  + L S  S     TG+
Sbjct: 527 EWDEEQKEILASFLSIRHTTTGK 549


>gi|331650737|ref|ZP_08351769.1| conserved hypothetical protein [Escherichia coli M605]
 gi|331040418|gb|EGI12616.1| conserved hypothetical protein [Escherichia coli M605]
          Length = 158

 Score = 53.6 bits (127), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 23/133 (17%), Positives = 46/133 (34%), Gaps = 29/133 (21%)

Query: 322 CDIAEEGGDNTVVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYRPDAIIIDANNT 379
            D+A+EG D      R G ++E++ +WS   +D+  +  K+ G  E+   +    D +  
Sbjct: 1   MDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEKVFGFCEQDNLEEFRFDEDGL 60

Query: 380 GART------CDYLEMLGYH---------------------VYRVLGQKRAVDLEFCRNR 412
           GA         + L                           V    GQ   ++ +F  N 
Sbjct: 61  GAGVRGDARAINELRNAARRPSILATPFRGSGAVFDPDDEAVRGDNGQAARLNKDFFANA 120

Query: 413 RTELHVKMADWLE 425
           + +   ++    +
Sbjct: 121 KAQSWWRLRKLFQ 133


>gi|168704532|ref|ZP_02736809.1| hypothetical protein GobsU_33659 [Gemmata obscuriglobus UQM 2246]
          Length = 209

 Score = 53.6 bits (127), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 27/139 (19%), Positives = 48/139 (34%), Gaps = 17/139 (12%)

Query: 179 ERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFY 238
           +  +  VG        ++ DE S   D +   +   L    +    +  S P    G F+
Sbjct: 63  DSQEGVVGFSAPR--LVVIDEGSRVSDELYKSVRPMLAV--SKGQLLTLSTPFGNQGWFF 118

Query: 239 EIFN----------KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF 288
           +I++          K  + W+R  +    +  I P F E   A  G      + E   +F
Sbjct: 119 DIWDDSAEGLKRRAKLHEPWQRTAVPASQIPRITPEFLEDERAELGER--WFQQEYFLRF 176

Query: 289 PQQDIDSFIPLNIIEEALN 307
               ID+     +I  A +
Sbjct: 177 LD-SIDAVFSQAVIHGARS 194


>gi|134295281|ref|YP_001119016.1| hypothetical protein Bcep1808_1170 [Burkholderia vietnamiensis G4]
 gi|134138438|gb|ABO54181.1| protein of unknown function DUF264 [Burkholderia vietnamiensis G4]
          Length = 458

 Score = 53.2 bits (126), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 37/247 (14%), Positives = 78/247 (31%), Gaps = 31/247 (12%)

Query: 246 DDWKRFQID-TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            D+  FQI+     + +   + + +    G+   + +  + G+F     +       IE+
Sbjct: 201 GDYAHFQINPGDNAQNLSADYLDTLK---GMSPRLQKRFLRGEFSDATPNQLFAEETIEK 257

Query: 305 ALNREPCPDPYAP-LIMGCDIAEEGG-DNT--------VVVLRRGPVIEHLFDWSKTDLR 354
             +    P P    +++  D +  G  DN         VV L        L D +     
Sbjct: 258 WRHGTDQPLPDFVRVVVAVDPSGSGDVDNADNDAIGIIVVGLGTDGRAYVLDDCTVKAGP 317

Query: 355 TTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRT 414
            T   +           +++   N G     ++        R     + V     +  R 
Sbjct: 318 ATWGSVVASAYDRHAGDVVVGETNYGGAMVQHV----VQTARARTPFKQVTASRGKAVRA 373

Query: 415 ELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMY 474
           E    + +       + H G+ + L+  +       G +        G +S + +D L++
Sbjct: 374 EPFSSLYEQ----GKVRHVGIFRELED-ELTAFSTVGYI--------GERSPNRADALIW 420

Query: 475 TFAENPP 481
              E  P
Sbjct: 421 ALTEIFP 427


>gi|161613293|ref|YP_001587258.1| hypothetical protein SPAB_01003 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
 gi|161362657|gb|ABX66425.1| hypothetical protein SPAB_01003 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
          Length = 443

 Score = 53.2 bits (126), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 59/360 (16%), Positives = 105/360 (29%), Gaps = 65/360 (18%)

Query: 81  AISAGRGIGKTTLNAWLVLWLMST---RPGISV-------ICLANSETQLKTTLWAEVSK 130
           A+  GR  GKT + +   +   +    RPG+ +       I  A            E  +
Sbjct: 27  AVRCGRRWGKTFMLSSAAVTYATAPFKRPGMDIELGGRVGIFTA------------EYRQ 74

Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT 190
           +  +        +    L  +    +            +            +   G    
Sbjct: 75  YQEIYDKLEEILLP---LKKSFSRQEKRLLLKNGGKIDFWVT-------NDNKLAGRGRE 124

Query: 191 YGMAIINDEASGTPDV-----IN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP 244
           Y + +I DEA+ T        I    I   L       +   T +       FY I +  
Sbjct: 125 YEIILI-DEAAFTKSPEMLREIWPKSIKPTLLTTKGRAYVFSTPDGVDEENFFYAICHDK 183

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
              +      T +   + P   E   A    D  V R E   +F      S   +    E
Sbjct: 184 NLGFIEHHAPTSSNPFVPPEELEKEEAN--NDPRVFRQEFLAEFVDWSAASLFDIRKWFE 241

Query: 305 ALNREPC---PDPYAPLIMGCDIAEEGG---DNTVVVL-----RRGPVIEHLFDWSKTDL 353
             N++     P+    +    D A +GG   D T VV      R G     + DW    +
Sbjct: 242 GENQDQPVDYPEMCQAVFAVMDTAVKGGSEHDGTAVVYYAVDTRPGIQRLTILDWDVVQI 301

Query: 354 R---------TTNNKISGLVEKYRPD----AIIIDANNTGARTCDYLEMLGYHVYRVLGQ 400
                     +  ++++ L  +         + I+  + G+      E LG+ V ++   
Sbjct: 302 DGALLETWMPSVFDRLNELSGQCVAINGSLGVFIEDASMGSILLQKGESLGWPVNKIESA 361


>gi|238762068|ref|ZP_04623041.1| terminase, ATPase subunit [Yersinia kristensenii ATCC 33638]
 gi|238699796|gb|EEP92540.1| terminase, ATPase subunit [Yersinia kristensenii ATCC 33638]
          Length = 595

 Score = 53.2 bits (126), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 26/163 (15%), Positives = 52/163 (31%), Gaps = 20/163 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  E +  +Y  D+    +     F     DS    +++
Sbjct: 333 PDGQWRYVITLEDAIDGGFNLANIERLRNKYNRDT--FNMLYMCVFVDSG-DSVFKFHML 389

Query: 303 EEALNREPCPDPYA----------PLIMGCDIAEEGGDNTVVVLR----RGPVIEHLFD- 347
           E+          +            +  G D A  G  +T V++      G     L   
Sbjct: 390 EKCGVDIEMWQDHDFSAPRPFGNREVWGGFDPARSGDTSTFVIIAPPQFEGERFRVLATF 449

Query: 348 -WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
            W   +     N+I  L ++Y    I +D    G    + ++ 
Sbjct: 450 YWQGLNFNYQANQIKELFQRYNMTYIGVDITGIGNGVFELVQN 492


>gi|168822412|ref|ZP_02834412.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar Weltevreden str. HI_N05-537]
 gi|205341123|gb|EDZ27887.1| putative conserved hypothetical protein [Salmonella enterica subsp.
           enterica serovar Weltevreden str. HI_N05-537]
          Length = 589

 Score = 53.2 bits (126), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 36/201 (17%), Positives = 62/201 (30%), Gaps = 31/201 (15%)

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL----NREPCPDPYA-------PLIM 320
           R     +  +     +F      S  P   ++  +           P+A       P+ +
Sbjct: 355 RRENSDEDFKNLFMCEFVDDKA-SVFPFEELQRCMVDVMETWEDFAPFADHPFGSRPVWI 413

Query: 321 GCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIII 374
           G D +  G     VVL      G     L    W   D       I  L EKY  + I I
Sbjct: 414 GYDPSHTGDSAGCVVLAPPVVSGGKFRMLERHQWKGMDFAAQAEGIRKLTEKYNVEYIGI 473

Query: 375 DANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434
           DA   G      +               A  + +    +T + +K  D +    L   +G
Sbjct: 474 DATGLGLGVFQLVR---------SFYPAARGIRYTPEMKTAMVLKAKDTIRRGCLEYDAG 524

Query: 435 ---LIQNLKSLKSFIVPNTGE 452
              + Q+  S++   + ++G 
Sbjct: 525 ATDVTQSFMSIRK-TMTSSGR 544


>gi|300310242|ref|YP_003774334.1| DNA-dependent ATPase terminase subunit [Herbaspirillum seropedicae
           SmR1]
 gi|300073027|gb|ADJ62426.1| DNA-dependent ATPase terminase subunit [Bacteriophage phi CTX]
           related protein [Herbaspirillum seropedicae SmR1]
          Length = 593

 Score = 53.2 bits (126), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 22/132 (16%), Positives = 36/132 (27%), Gaps = 18/132 (13%)

Query: 270 IARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-----------EPCPDPYAPL 318
           +  +    D     +   F      S  PL  ++  +                P  + P+
Sbjct: 358 LRDFEYSPDQFDNLLMCNFIDDSA-SVFPLADLQRGMVDSWVDWDDYKPFTARPFGHRPV 416

Query: 319 IMGCDIAEEGGDN--TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAI 372
            +G D +  G     +V+   L  G     L    W   D       I  +  +Y    I
Sbjct: 417 WIGYDPSLTGDSAGCSVIAPPLIPGGNFRILERHQWRGKDFAEQAALIKEMCGRYNVQYI 476

Query: 373 IIDANNTGARTC 384
            ID    G    
Sbjct: 477 GIDTTGMGVGVY 488


>gi|51596097|ref|YP_070288.1| phage terminase subunit GpP [Yersinia pseudotuberculosis IP 32953]
 gi|51589379|emb|CAH21001.1| terminase subunit [Enterobacteria phage 186] gb|AAC3414 [Yersinia
           pseudotuberculosis IP 32953]
          Length = 590

 Score = 53.2 bits (126), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 26/137 (18%), Positives = 42/137 (30%), Gaps = 22/137 (16%)

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIM 320
            YG      +  +  +F      S  P   ++  +                   + P+ +
Sbjct: 355 EYGPSE--YQNLLMCEFVDDQA-SVFPFAELQACMVDSLEEWEDYNPYSLRPFGHRPVWI 411

Query: 321 GCDIAE-EGGDNT---VVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAI 372
           G D +E  GGD+    V+   +  G     L    W   D       I  L +KY  + I
Sbjct: 412 GYDPSEANGGDSAGCAVIAPPMVPGGKFRVLERHQWKGMDFEAQAKHIEELTQKYCVEYI 471

Query: 373 IIDANNTGARTCDYLEM 389
            IDA   G      +  
Sbjct: 472 GIDATTVGQGVFQLVRQ 488


>gi|15675368|ref|NP_269542.1| hypothetical protein SPy_1460 [Streptococcus pyogenes M1 GAS]
 gi|13622552|gb|AAK34263.1| conserved hypothetical protein - phage associated [Streptococcus
           pyogenes M1 GAS]
          Length = 471

 Score = 53.2 bits (126), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 53/347 (15%), Positives = 112/347 (32%), Gaps = 47/347 (13%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQ+  +  + A   N +       +          GKT +   + LW +    G+ ++ 
Sbjct: 43  PWQVNMLIPIMAIDENGLWVHQKYGYAIPRRN----GKTEVVYIVQLWAL--HKGLKILH 96

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
            A+  +      + +V K+L +     + + +    + A     +   + G   +  +  
Sbjct: 97  TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKASGAVIQFRT-- 150

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232
            RT +    + F          +I DEA          +   +T+ + N   IM   P  
Sbjct: 151 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 200

Query: 233 --LSGKFYEIFNKP----------LDDWKRFQ------IDTRTVEGIDPSFH---EGIIA 271
              +G  +E + K             +W   +      + +  +      FH     I A
Sbjct: 201 MVSTGTVFEAYRKDCLKGNKRYSGWAEWSVPEMVKINDVSSWYISNPSMGFHLNERKIEA 260

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330
             G D     ++  G +P  +  S I      +  L  E  P+  + L +G    ++G +
Sbjct: 261 ELGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 318

Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
            +  +  R       +       +R     I   ++      ++ID 
Sbjct: 319 VSLSIAARTSENKVFVETIDCLSVRNGTQWIINFLKSADIAKVVIDG 365


>gi|261885730|ref|ZP_06009769.1| hypothetical protein CfetvA_11664 [Campylobacter fetus subsp.
           venerealis str. Azul-94]
          Length = 560

 Score = 53.2 bits (126), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 43/307 (14%), Positives = 90/307 (29%), Gaps = 41/307 (13%)

Query: 195 IINDEASGTPDV--INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
           I  DE +  P+   I    +  +          + S P      F+E+++     +   +
Sbjct: 251 IWMDEFAWYPNPKKIWHAFVPSI--GAIKGRLTILSTPFEERSLFHELYSDESKYYMFKR 308

Query: 253 IDTRTVEGIDPSFHEGIIARYGL-DSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL---NR 308
                   I+      +     L D+D        QF   +  S + + +I+  +     
Sbjct: 309 FCVSIYSAIEDGLDFDLETMRNLFDADTWASAYECQFVDDES-SLLSIALIKSCVYDKAS 367

Query: 309 EPCPDPYAPLIMGCDIAEEGGDNT---VVVLRRGPVIEHLFDWSKTDLRTTNN------- 358
              P     +  G DI      +T   VV+             S+               
Sbjct: 368 YYTPKSNQVIYAGYDIGRVSDRSTLAGVVLEDVTNRARGQRSLSQGGRYIVAMMDVLAKA 427

Query: 359 -------KISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAVDLEFCR 410
                   ++  ++ Y    + ID    G    + + +     V  V       +     
Sbjct: 428 KFDEQKEHLTSFLKTYPLSVLKIDKTGIGMNLAENIHDKFRSRVSGVWFSNTRKE----- 482

Query: 411 NRRTELHVKMADWLE--FASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDY 468
               E+ + +    E    S+ N   LI ++ ++K  I   + +   ++KR +   + D 
Sbjct: 483 ----EMALNLKKAFEDKLISIPNDPLLIADIHAIKRTIGAKSFKY--DAKRNEYGHA-DR 535

Query: 469 SDGLMYT 475
              L   
Sbjct: 536 FWALALA 542


>gi|304360860|ref|YP_003856980.1| gp8 [Mycobacterium phage CrimD]
 gi|302858609|gb|ADL71354.1| gp8 [Mycobacterium phage CrimD]
          Length = 473

 Score = 53.2 bits (126), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 65/385 (16%), Positives = 118/385 (30%), Gaps = 49/385 (12%)

Query: 52  RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVI 111
             WQ +  ++V A    S      ++F  +I   R  GKT     +V  L    PG +VI
Sbjct: 43  DQWQDDLGKLVCAK--RSDGLYAADMFAMSIP--RQTGKTYFLGAIVFALCKMTPGTTVI 98

Query: 112 CLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYST 171
             A+     +T   AE  K +  L  +       L++H             G ++  ++ 
Sbjct: 99  WTAH-----RTRTAAETFKSMQALAKREQIAPHILNVH----------TGNGKEAVLFTN 143

Query: 172 MCRTYSEERPDTF-VGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230
             R     R   F  G        +I DEA    +     ++   T  + N   +    P
Sbjct: 144 GSRILFGAREKGFGRGF--AKVDVLIFDEAQILSENAMDDMVPA-TNASPNGLILFAGTP 200

Query: 231 RRLS--GKFY-----EIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGI------------ 269
            + +  G+ +     +  N   DD  +     D       + ++ +              
Sbjct: 201 PKPTDPGEVFTNLRLDAINGESDDVAYVEISADENDDPDEESTWRKMNPSYPHRTSARAI 260

Query: 270 -IARYGLDSDVTRVEVCGQFPQQDID-SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE 327
              R  L  D  R E  G + +  +    I  ++  +  +         P  +G D++  
Sbjct: 261 RRMRKALSWDSFRREAMGIWDKISVHAQVIKPSLWRDLADPLGPEPGAKPASLGVDMSHG 320

Query: 328 GGDNTVVVLRRGPVIEHLFD-WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY 386
           G  +          + H+   W+ TD       I       R   ++ID  +        
Sbjct: 321 GAISIGGCWLIDDELRHVEQVWAGTDTAAAVEFIVERAG--RRIPVVIDDASPAKSLVPE 378

Query: 387 LEMLGYHVYRVLGQKRAVDLEFCRN 411
           L+     V        A      +N
Sbjct: 379 LKRRKVKVRITYAGDMAKACGLFKN 403


>gi|50914563|ref|YP_060535.1| Phage terminase [Streptococcus pyogenes MGAS10394]
 gi|50903637|gb|AAT87352.1| Phage terminase [Streptococcus pyogenes MGAS10394]
          Length = 476

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 53/347 (15%), Positives = 112/347 (32%), Gaps = 47/347 (13%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQ+  +  + A   N +       +          GKT +   + LW +    G+ ++ 
Sbjct: 48  PWQVNMLIPIMAIDENGLWVHQKYGYAIPRRN----GKTEVVYIVELWAL--HKGLKILH 101

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
            A+  +      + +V K+L +     + + +    + A     +   + G   +  +  
Sbjct: 102 TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKASGAVIQFRT-- 155

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232
            RT +    + F          +I DEA          +   +T+ + N   IM   P  
Sbjct: 156 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 205

Query: 233 --LSGKFYEIFNKP----------LDDWKRFQ------IDTRTVEGIDPSFH---EGIIA 271
              +G  +E + K             +W   +      + +  +      FH     I A
Sbjct: 206 MVSTGTVFEAYRKDCLKGNKRYSGWAEWSVPEMVKINDVSSWYISNPSMGFHLNERKIEA 265

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330
             G D     ++  G +P  +  S I      +  L  E  P+  + L +G    ++G +
Sbjct: 266 ELGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 323

Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
            +  +  R       +       +R     I   ++      ++ID 
Sbjct: 324 VSLSIAARTSENKVFVETIDCLSVRNGTQWIINFLKSADIAKVVIDG 370


>gi|19746414|ref|NP_607550.1| hypothetical protein spyM18_1474 [Streptococcus pyogenes MGAS8232]
 gi|19748615|gb|AAL98049.1| conserved hypothetical phage protein [Streptococcus pyogenes
           MGAS8232]
          Length = 471

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 53/347 (15%), Positives = 112/347 (32%), Gaps = 47/347 (13%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQ+  +  + A   N +       +          GKT +   + LW +    G+ ++ 
Sbjct: 43  PWQVNMLIPIMAIDENGLWVHQKYGYAIPRRN----GKTEVVYIVELWAL--HKGLKILH 96

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
            A+  +      + +V K+L +     + + +    + A     +   + G   +  +  
Sbjct: 97  TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKASGAVIQFRT-- 150

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232
            RT +    + F          +I DEA          +   +T+ + N   IM   P  
Sbjct: 151 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 200

Query: 233 --LSGKFYEIFNKP----------LDDWKRFQ------IDTRTVEGIDPSFH---EGIIA 271
              +G  +E + K             +W   +      + +  +      FH     I A
Sbjct: 201 MVSTGTVFEAYRKDCLKGNKRYSGWAEWSVPEMVKINDVSSWYISNPSMGFHLNERKIEA 260

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330
             G D     ++  G +P  +  S I      +  L  E  P+  + L +G    ++G +
Sbjct: 261 ELGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 318

Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
            +  +  R       +       +R     I   ++      ++ID 
Sbjct: 319 VSLSIAARTSENKVFVETIDCLSVRNGTQWIINFLKSADIAKVVIDG 365


>gi|21910651|ref|NP_664919.1| putative terminase - phage associated [Streptococcus pyogenes
           MGAS315]
 gi|28876285|ref|NP_795519.1| putative terminase [Streptococcus pyogenes phage 315.3]
 gi|28895661|ref|NP_802011.1| hypothetical protein SPs0749 [Streptococcus pyogenes SSI-1]
 gi|21904853|gb|AAM79722.1| putative terminase - phage-associated [Streptococcus pyogenes
           MGAS315]
 gi|28810910|dbj|BAC63844.1| conserved hypothetical protein (phage associated) [Streptococcus
           pyogenes SSI-1]
          Length = 471

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 53/347 (15%), Positives = 112/347 (32%), Gaps = 47/347 (13%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQ+  +  + A   N +       +          GKT +   + LW +    G+ ++ 
Sbjct: 43  PWQVNMLIPIMAIDENGLWVHQKYGYAIPRRN----GKTEVVYIVELWAL--HKGLKILH 96

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
            A+  +      + +V K+L +     + + +    + A     +   + G   +  +  
Sbjct: 97  TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKASGAVIQFRT-- 150

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232
            RT +    + F          +I DEA          +   +T+ + N   IM   P  
Sbjct: 151 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 200

Query: 233 --LSGKFYEIFNKP----------LDDWKRFQ------IDTRTVEGIDPSFH---EGIIA 271
              +G  +E + K             +W   +      + +  +      FH     I A
Sbjct: 201 MVSTGTVFEAYRKDCLKGNKRYSGWAEWSVPEMVKINDVSSWYISNPSMGFHLNERKIEA 260

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330
             G D     ++  G +P  +  S I      +  L  E  P+  + L +G    ++G +
Sbjct: 261 ELGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 318

Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
            +  +  R       +       +R     I   ++      ++ID 
Sbjct: 319 VSLSIAARTSENKVFVETIDCLSVRNGTQWIINFLKSADIAKVVIDG 365


>gi|71911002|ref|YP_282552.1| phage terminase [Streptococcus pyogenes MGAS5005]
 gi|71853784|gb|AAZ51807.1| phage terminase [Streptococcus pyogenes MGAS5005]
          Length = 471

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 53/347 (15%), Positives = 112/347 (32%), Gaps = 47/347 (13%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQ+  +  + A   N +       +          GKT +   + LW +    G+ ++ 
Sbjct: 43  PWQVNMLIPIMAIDENGLWVHQKYGYAIPRRN----GKTEVVYIVELWAL--HKGLKILH 96

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
            A+  +      + +V K+L +     + + +    + A     +   + G   +  +  
Sbjct: 97  TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKASGAVIQFRT-- 150

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232
            RT +    + F          +I DEA          +   +T+ + N   IM   P  
Sbjct: 151 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 200

Query: 233 --LSGKFYEIFNKP----------LDDWKRFQ------IDTRTVEGIDPSFH---EGIIA 271
              +G  +E + K             +W   +      + +  +      FH     I A
Sbjct: 201 MVSTGTVFEAYRKDCLKGNKRYSGWAEWSVPEMVKINDVSSWYISNPSMGFHLNERKIEA 260

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330
             G D     ++  G +P  +  S I      +  L  E  P+  + L +G    ++G +
Sbjct: 261 ELGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 318

Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
            +  +  R       +       +R     I   ++      ++ID 
Sbjct: 319 VSLSIAARTSENKVFVETIDCLSVRNGTQWIINFLKSADIAKVVIDG 365


>gi|148727151|ref|YP_001285645.1| putative terminase large subunit [Aeromonas phage phiO18P]
 gi|110349286|gb|ABG73174.1| putative terminase large subunit [Aeromonas phage phiO18P]
          Length = 604

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 25/139 (17%), Positives = 46/139 (33%), Gaps = 19/139 (13%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP-------- 317
            E +   Y  +  V       +F   D  S      +E A       + Y P        
Sbjct: 363 IEELKDEYPEE--VFDRLYMCRFID-DALSVFKFQDMERAGVDPTRWEDYKPGRPDPFGR 419

Query: 318 --LIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369
             + MG D +    + T+VV+           +     W   + +    +I  + +K+R 
Sbjct: 420 REVWMGYDPSRTRDNATLVVVAPPTVAGERFRVLEKHYWRGLNFQYQAQEIERIAKKFRV 479

Query: 370 DAIIIDANNTGARTCDYLE 388
             + +D +  GA   D L+
Sbjct: 480 TYLGVDVSGIGAGVYDLLK 498


>gi|261251508|ref|ZP_05944082.1| terminase [Vibrio orientalis CIP 102891]
 gi|260938381|gb|EEX94369.1| terminase [Vibrio orientalis CIP 102891]
          Length = 594

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 23/180 (12%), Positives = 49/180 (27%), Gaps = 21/180 (11%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++    +G D    E +   Y    +  +      F      S    N +
Sbjct: 332 PDKQWRYVITMEDAVAQGFDLVDIEDLRDEY--SDNEFKNLFMCIFVDGAA-SIFEFNKV 388

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL------RRGPVIEHLF 346
              +        + P          + +G D +    +  +VV+           +    
Sbjct: 389 MRCMVDSKQWQDFDPKAKRPIGAREVWLGYDPSRTRDNACLVVVAPPALPGEKFRVLEKH 448

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAVD 405
            W   + +    +I  + + Y    + ID    GA   D L +        +       +
Sbjct: 449 YWKGLNFQYQAKQIGEVFKCYNVTYLGIDVTGIGAGVYDLLSKQHPREAVAIHYSNDNKN 508


>gi|295111846|emb|CBL28596.1| Mu-like prophage FluMu protein gp28 [Synergistetes bacterium SGP1]
          Length = 532

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 68/430 (15%), Positives = 134/430 (31%), Gaps = 62/430 (14%)

Query: 78  FKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137
           F+  + + + IG + L    VL     R    ++  A+ +      +     KW   L  
Sbjct: 143 FRVCLKSRQ-IGFSFLLGLEVLLGAIERGDNQIVISASQDQ--SDIVRNYAVKWCKDL-- 197

Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197
                          +  D  +      +  Y   C       P T  G+       +  
Sbjct: 198 ------------DVDYLEDGGNIIFPGGAIAYFLPC------NPRTVQGYTGD----VYL 235

Query: 198 DEAS----GTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP--LDDWKRF 251
           DE +    G   ++    +   T +       +TS P   +  F EI   P     + R 
Sbjct: 236 DEFAWHMRG--RLMWQAAVPAATTKGKR--LTVTSTPYTETDMFGEIVTNPDKYPRFSRH 291

Query: 252 QIDTRTVEGIDPSFHEGIIARYGL-DSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
            +       +       I    GL D+         +F   ++   +  + +   L+ + 
Sbjct: 292 TVTIYDA--VKDGHQVDIEELRGLFDAITFAQAYECRFFADELC-LLQPDEVRAVLDDDC 348

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVI--------EHLFDWSKTDLRTTNNKISG 362
                A +  G DI     D T +VL     +         H+   ++       + ++G
Sbjct: 349 LRHVSAWVNGGVDIGRT-KDVTAIVLAEQLQVEAEKLVFVRHMETLARMAFDGQRSHMAG 407

Query: 363 LVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422
           LVE ++   + +DA   G +  + ++ L           +   + F R ++ E+ + +  
Sbjct: 408 LVEGWKIRRLAMDATGIGMQLSEDMQRL--------YPGKVERVHFTREKKEEMALSVKK 459

Query: 423 WLEF--ASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480
             E     + N   L+  L ++K       G    ++ R +  K  D    L     E  
Sbjct: 460 LFETQRIRIPNDRDLVMQLHAIKR-KPTEKG-FTYDADRNEQIKHADLFWALALAVKEFG 517

Query: 481 PRSDMDFGRC 490
            R  +   R 
Sbjct: 518 GRRRVLTARN 527


>gi|15894418|ref|NP_347767.1| hypothetical protein CA_C1134 [Clostridium acetobutylicum ATCC 824]
 gi|15024053|gb|AAK79107.1|AE007629_13 Phage related protein, YonF B.subtilis homolog [Clostridium
           acetobutylicum ATCC 824]
 gi|325508547|gb|ADZ20183.1| Phage related protein [Clostridium acetobutylicum EA 2018]
          Length = 589

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 68/484 (14%), Positives = 151/484 (31%), Gaps = 87/484 (17%)

Query: 54  WQLEFMEVVDAHCLNSVNNPNPE--VFKGAISAG-----------RGIGKTTLNAWLVLW 100
           W  +++++   +  ++  +P       K  +              R +GKT   + ++  
Sbjct: 45  WWRQYLDIFIENYFSTEKSPVRFYDFQKVIVRECGNCSIVRDTEARSMGKTFKMSRVLAG 104

Query: 101 LMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHC 160
           L    P   ++ ++N+  Q        ++       N     +Q + +            
Sbjct: 105 LAILYPQNKILIVSNTVRQ-AILTVKYINDLGEENANFAREIIQPIKISKDG-------- 155

Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT-PDVINLGILGFL---- 215
              +  K+ S +      +      G        I  DE++     VI   ++  L    
Sbjct: 156 -AKVKFKNGSEIEAMAMNKDGSNIRG---ERRKIIYIDESAWVMSSVIQSVLIPMLRYNR 211

Query: 216 -----------TERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPS 264
                         + +   I TS+    S  +Y+ F + + D +    D         +
Sbjct: 212 KVVENNRLKGLAFEDFSSKLIETSSAYLKSCDYYQRFKETIQDIRDGYKDRFCCALSYKT 271

Query: 265 FHE--------GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA-----LNREPC 311
                       +  +  +   V  +E   +F  +   S+ P ++ E       +  +  
Sbjct: 272 AVRCGIVEEDFVLEQKKKMPLSVWEMEWNSKFVGETEGSYFPYDLTEPCRDFSHVEIQQP 331

Query: 312 PDPYAPLIMGCDIAEEGG---DNTVVVL------RRGPVIEHLF---DWSKTDLRTTNNK 359
            +  +  ++  D+A       DN  + +        G   +++     +    L     +
Sbjct: 332 KNSMSRYVLSLDVATSEDKKADNACITVIKIVPKNDGTYEKYIVFIRTYHGYSLEMLAEQ 391

Query: 360 ISGLVEKYRPD-AIIIDANNTGARTCDYL-----EMLGYHV-------YRVLGQKRAVDL 406
           +     ++     +IIDAN  G      L     + LG            V   ++A+++
Sbjct: 392 VRITCCRFPNIIKVIIDANAIGEGVVSLLNIPYVDDLGREYPPLIKDTIEVSDSRKAINI 451

Query: 407 ---EFCRNRRTE-LHVKMADWLEFASL---INHSGLIQNLKSLKSFIVPNTGELAIESKR 459
                  N++ E + V    +LE  SL   I    + + ++  K  I  +TG     SK 
Sbjct: 452 ISAIKADNKKNENMAVHTLLFLENHSLHIPIPSVKIRRQIEEQKIIIKDDTGTKRKISKE 511

Query: 460 VKGA 463
             G 
Sbjct: 512 EVGV 515


>gi|109302915|ref|YP_654730.1| hypothetical protein F108p19 [Pasteurella phage F108]
 gi|73918076|gb|AAZ93654.1| unknown [Pasteurella phage F108]
          Length = 603

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 30/183 (16%), Positives = 61/183 (33%), Gaps = 25/183 (13%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSF----IPLNIIEEALNREPCPDPYAP---- 317
            + +  +Y              +       F    +    ++ A  ++  P+   P    
Sbjct: 367 IDALKQKYSKY--AFAQLFMCVWVDDADSIFNIKKLLKCGVDIAKWKDHNPNDARPFGAR 424

Query: 318 -LIMGCDIAEEGGDNTVVV------LRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370
            +  G D A  G   + V+      L+    +   + W+    +    +I  L EKY   
Sbjct: 425 EVWGGYDPAHSGDGASFVIVAPPALLKEKYRVLARYQWNGLSYKYQAAQIKQLFEKYNMT 484

Query: 371 AIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI 430
            I IDA   G    + ++            ++AV L +    +TE+ +K+ D +E   + 
Sbjct: 485 YIGIDATGVGYGVYEQVKE--------FAGRKAVPLVYNPESKTEMVLKVHDLVEHEQIE 536

Query: 431 NHS 433
              
Sbjct: 537 WDE 539


>gi|123441220|ref|YP_001005207.1| terminase, ATPase subunit [Yersinia enterocolitica subsp.
           enterocolitica 8081]
 gi|122088181|emb|CAL10969.1| terminase, ATPase subunit [Yersinia enterocolitica subsp.
           enterocolitica 8081]
          Length = 725

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 23/157 (14%), Positives = 45/157 (28%), Gaps = 20/157 (12%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA--------- 316
            E I       S         QF       F   + +E+ L      + +          
Sbjct: 378 LEEIREENAESS--FNQLYMCQFVDTGDCVF-RFDQLEKCLTNVSTWEDHDVNAMRPFGN 434

Query: 317 -PLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369
             +  G D A  G   + V++           + H+  W     +    +I   + +Y  
Sbjct: 435 REVWAGYDPARTGDTASFVLVAPPQVDGEPFRVLHIETWHGFAFKYQVGRIKEYMARYNI 494

Query: 370 DAIIIDANNTGARTCDYLEML-GYHVYRVLGQKRAVD 405
             I ID    G   C+ ++      V  +   + + +
Sbjct: 495 THIGIDTTGIGGPVCEMVQDFARREVTPIRYSQESKN 531


>gi|330810733|ref|YP_004355195.1| phage terminase protein [Pseudomonas brassicacearum subsp.
           brassicacearum NFM421]
 gi|327378841|gb|AEA70191.1| Putative phage terminase protein [Pseudomonas brassicacearum subsp.
           brassicacearum NFM421]
          Length = 439

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 67/418 (16%), Positives = 122/418 (29%), Gaps = 75/418 (17%)

Query: 78  FKGAISAGRGIGKTTL--------NAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS 129
           F+ A+  GR  GKT L              W +S      +   A ++ Q +   W  + 
Sbjct: 32  FRDAV-CGRRFGKTFLGKAEMRRAAKLAAAWNVSVEDE--IWYAAPTQKQARRVFWRRLK 88

Query: 130 KWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189
           + +                 P  W     + +  + +     + R    E  D   G   
Sbjct: 89  QAI-----------------PKSWLVTKPNETDMLITLKSGHLLRCVGLENYDDLRG--- 128

Query: 190 TYGMAIINDEASGTPDVIN-LGILGFLT---------ERNANRFWIMTSNPRRLSGKFYE 239
           +    I+ DE +          I   L+         E       +    P+      Y+
Sbjct: 129 SGLFFILVDEWADCKYAAWEEVIRPMLSTCTYTLPNGEVRKGGHALRIGTPKG-FNHCYD 187

Query: 240 IF------NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI 293
            F      ++P      +   +     + P   E    +  +D    R E    F  ++ 
Sbjct: 188 TFQDGKPGHEPDHRSWLYT--SLDGGNVPPEEIEAARRK--MDPRTFRQEYEASF--ENY 241

Query: 294 DSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDL 353
              +      EA            L +G D         VV + R  +   L ++S  ++
Sbjct: 242 QGVVYYTFNREANRTSETIKRGEALHIGMDFNVMKM-AAVVHVIRDDLPLALSEFS--EV 298

Query: 354 RTTNNKISGLVEKYRPDAIIIDANNTGART---------CDYLEMLGYHVYRVLGQKRAV 404
           R T   I  +  ++   +I I  + +G  T            L+  G+ V  V     AV
Sbjct: 299 RDTPEMIEKIKLRFPDHSIAIYPDASGQNTSSKSASESDLSLLKKAGFTVI-VDSTNPAV 357

Query: 405 DLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKG 462
                 N    +      + E   L+N     +  + L+  I  + G    E  +  G
Sbjct: 358 KDR--VNAMCAMFAN--TYGEHRYLVNVDQCPKYTQCLERQIYTDKG----EPDKKAG 407


>gi|307591253|ref|YP_003900462.1| hypothetical protein Cyan7822_6211 [Cyanothece sp. PCC 7822]
 gi|306986818|gb|ADN18693.1| conserved hypothetical protein [Cyanothece sp. PCC 7822]
          Length = 474

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 39/249 (15%), Positives = 81/249 (32%), Gaps = 40/249 (16%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRL 233
           +    P+   G  +     I+ DE++   +   I    +   T   +    I+ S P   
Sbjct: 145 FRNSTPNGARGLESVSD--ILYDESAFVDEIEEIYKSSIPCTTVVGSEARIIILSTPNGQ 202

Query: 234 SGKFYEIFNKPLDD-----------------------------WKRFQIDTRTVEGIDPS 264
           SG +++  +    D                             +    +          +
Sbjct: 203 SGWYWDKLSSNNGDRDILEICEQIRTEKIEPIQYWIDNNQWCKFIVHWLGHPKFSQQKET 262

Query: 265 FHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI--MGC 322
           +   I A++ L  D+   E    F   ++       I+ +    +   DP +  I   G 
Sbjct: 263 YLRDIKAQFDLPEDIIEQEYNLSFTHSEV-IVFSSEIVRKNAIGQWENDPKSNCIYYFGI 321

Query: 323 DIAEEGGDNTVVVLRR----GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378
           D +  G D TV  + R       +  ++   K        +I+ L+++Y P  + I+ N+
Sbjct: 322 DTSLLGNDYTVCTILREIDNRYHLVKMYRQRKKTHEYNIYQIAELIKQYNPIIVGIEVNS 381

Query: 379 TGARTCDYL 387
           +G    + L
Sbjct: 382 SGQVYYEQL 390


>gi|147671611|ref|YP_001215893.1| terminase [Vibrio cholerae O395]
 gi|262167851|ref|ZP_06035552.1| terminase [Vibrio cholerae RC27]
 gi|146313994|gb|ABQ18534.1| terminase [Vibrio cholerae O395]
 gi|227014845|gb|ACP11054.1| putative terminase, ATPase subunit [Vibrio cholerae O395]
 gi|262023759|gb|EEY42459.1| terminase [Vibrio cholerae RC27]
          Length = 605

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 29/181 (16%), Positives = 48/181 (26%), Gaps = 23/181 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    I+     G D    E +   Y              F      S    N I
Sbjct: 345 PDRQWRYVVTIEDAAKGGCDLFDIEELREEYSETD--FNNLFMCVFVDGAS-SIFEFNKI 401

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV-------VVLRRGPVIEHL 345
           E  +        + P          + +G D +    DN V       +V      +   
Sbjct: 402 ERCMVDSEIWQDFKPNAARPFGSREVWLGYDPSRT-RDNAVLMVVAPPIVAVEKFRVLEK 460

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAV 404
             W     +   ++IS + E++    + ID    GA   D L          +       
Sbjct: 461 HTWRGLSFQHQASEISKVFERFNVTYLGIDITGIGAGVHDLLVNKHPRETVAIHYSNENK 520

Query: 405 D 405
           +
Sbjct: 521 N 521


>gi|332522503|ref|ZP_08398755.1| putative phage terminase, large subunit [Streptococcus porcinus
           str. Jelinkova 176]
 gi|332313767|gb|EGJ26752.1| putative phage terminase, large subunit [Streptococcus porcinus
           str. Jelinkova 176]
          Length = 470

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 51/345 (14%), Positives = 110/345 (31%), Gaps = 45/345 (13%)

Query: 54  WQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICL 113
           WQ   +  + A   + +       +          GKT +   L LW +    G+ ++  
Sbjct: 43  WQKNMLSPIMAIDEDGLWVHQKYGYAIPRRN----GKTEIVYILELWGL--HKGLKILHT 96

Query: 114 ANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173
           A+  +   +  + ++ K+L +     + + +    + A     +   S G   +  +   
Sbjct: 97  AHRISTSHS-SFEKLKKYLEMS---GYVDGEDFISNKAKGQERIEFKSSGSVIQWRT--- 149

Query: 174 RTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR- 232
           RT +    + F          ++ DEA          +   +T+ + N   +M   P   
Sbjct: 150 RTSNGGLGEGFD--------LLVIDEAQEYTSEQESALKYTVTDSD-NPMTVMCGTPPTM 200

Query: 233 -LSGKFYEIFNKP-------LDDWKRFQIDTRTV-------EGIDPS-----FHEGIIAR 272
             +G  +E + K           W  + +   T           +PS         I A 
Sbjct: 201 VSTGTVFESYRKEVLKGAKKYSGWAEWSVSEMTKIDDVQSWYIANPSMGFHLNERKIEAE 260

Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNT 332
            G D     ++  G +P  +  S I      + L  E  P+    L +G    ++G + +
Sbjct: 261 LGDDEIDHNIQRLGYWPTFNQKSVISEKEWGK-LKVEQTPELSGKLFVGIKFGQDGNNVS 319

Query: 333 -VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
             +  R       +       +R     I   ++      +++D 
Sbjct: 320 MSIAARTKENKIFVESIDCLSVRNGTQWIIDFLKSADIAKVVVDG 364


>gi|323940932|gb|EGB37119.1| hypothetical protein ERDG_02336 [Escherichia coli E482]
          Length = 443

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 60/361 (16%), Positives = 103/361 (28%), Gaps = 65/361 (18%)

Query: 80  GAISAGRGIGKTTLNAWLVLWLMST---RPGISV-------ICLANSETQLKTTLWAEVS 129
            A+  GR  GKT + +   +   ++   RPG+ +       I  A            E  
Sbjct: 26  NAVRCGRRWGKTFMLSSAAVTYATSQFRRPGMDIELGGRVGIFTA------------EYR 73

Query: 130 KWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189
           ++  +        +    L  +    +            +            +   G   
Sbjct: 74  QYQEIYDKLEEILLP---LKKSFSRQEKRLLLKNGGKIDFWVT-------NDNKLAGRGR 123

Query: 190 TYGMAIINDEASGTPDV-----IN-LGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK 243
            Y + +I DEA+ T        I    I   L       +   T +       FY I + 
Sbjct: 124 EYEIILI-DEAAFTKSPEMLKEIWPKSIKPTLLTTKGRAYVFSTPDGVDEENFFYAICHN 182

Query: 244 PLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303
               +      T +   + P   E    R   D  V R E   +F      S   +    
Sbjct: 183 KDLGFHEHHAPTSSNPFVPPEELEK--ERQNNDPRVFRQEFLAEFVDWSAASLFDVRKWF 240

Query: 304 EALNREPC---PDPYAPLIMGCDIAEEGG---DNTVVVL-----RRGPVIEHLFDWS--K 350
           E  N++     P+    +    D A +GG   D T VV      R G     + DW   +
Sbjct: 241 EGENQDQPVDYPEMCQAVFAVMDTAVKGGTDHDGTAVVYYAVDTRPGIQRLTILDWDVVQ 300

Query: 351 TDLRTTNNKISGLVEKY-----------RPDAIIIDANNTGARTCDYLEMLGYHVYRVLG 399
            D       I  +  +                + I+  + G+      E LG+ V ++  
Sbjct: 301 IDGALLEEWIPSVFTRLNELSGQCVAVNGSLGVFIEDASMGSILLQKGESLGWPVNKIES 360

Query: 400 Q 400
            
Sbjct: 361 A 361


>gi|319645791|ref|ZP_08000021.1| YonF protein [Bacillus sp. BT1B_CT2]
 gi|317391541|gb|EFV72338.1| YonF protein [Bacillus sp. BT1B_CT2]
          Length = 589

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 62/449 (13%), Positives = 127/449 (28%), Gaps = 108/449 (24%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           A RG GKT L +          PG  ++  + ++ Q +  +                   
Sbjct: 84  ASRGQGKTWLTSVYCCVQAILFPGTKIVIASGTKGQAREVI-----------EKIDDLRK 132

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
           +S +L               ++  + S +    S +      G  +     +I DE    
Sbjct: 133 ESPNLKREIEDLKTSTNDARVEFHNGSWIKIVASND------GARSKRANLLIVDEFRMV 186

Query: 204 P-DVINLGILGFLTERNANRFW-------IMTSNPR-RLSGKFYEIF------------- 241
             ++I+  +  FLT   + ++        +   N    LS  +Y++              
Sbjct: 187 DFEIISKVLRKFLTAPRSPKYLEKEEYAHLKERNKEIYLSSCWYKVHWSYGRFVTYFNAM 246

Query: 242 ---NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298
              +K       +QI  R    +D    +  ++    D     +E+   +  +   ++  
Sbjct: 247 MKGSKYFVCGLPYQIAIREG-LLDKDQVKDEMSEEDFDPIGWSMEMEALWFGESEKAYFK 305

Query: 299 LNIIEEALN-------------------REPCPDPYAPLIMGCDIAEEGG---DNTVVVL 336
              +E+                      +     P    ++  DIA   G   D +V  +
Sbjct: 306 FEDLEKNRKLASPLFPPDYYDLIKDSNFKFENKKPGELRLISNDIAGMAGKDNDASVYTV 365

Query: 337 RR--------GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
            R           I ++         +   +I  L E Y  D I++D  + G    D L 
Sbjct: 366 FRLIPNSNGYDRHIVYMESIVGGHTGSQATRIRQLFEDYACDYIVLDTQSIGLGVYDALC 425

Query: 389 MLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQ--NLKSLKSFI 446
                                   R + +       E  S IN   + +    ++ +  I
Sbjct: 426 Q-----------------PLYDKERAKEY-------EPLSCINDEKMAERCTYQNAEKLI 461

Query: 447 VPNTGELAIESK---------RVKGAKST 466
               G   + S+         + +  K  
Sbjct: 462 YSIKGNAQLNSEIAVLLKDGFKRRKIKIP 490


>gi|282599667|ref|ZP_05971423.2| terminase, ATPase subunit [Providencia rustigianii DSM 4541]
 gi|282568162|gb|EFB73697.1| terminase, ATPase subunit [Providencia rustigianii DSM 4541]
          Length = 574

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 45/257 (17%), Positives = 79/257 (30%), Gaps = 42/257 (16%)

Query: 241 FNKPLDDWKRFQ-IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI-----D 294
                  W++   I      G++    E I        D  R     +F +        D
Sbjct: 305 HYSGDSMWRQIVNIHDAIARGLNRVILEEIKDE--NPPDDFRNLYECEFVKTGERAFSYD 362

Query: 295 SFIPLNIIEEALNREPCPDPYAP-------LIMGCDIAEEGGDNTVVVLRR-------GP 340
           + I   +     +      PYAP       + +G D    G +   + L         G 
Sbjct: 363 ALINCGVDGYNSHIWSDWKPYAPRPLGNRPVWVGADPTGTGDNGDGLGLVIASPPAVSGG 422

Query: 341 VIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDAN-NTGARTCDYLEMLGYHVYRV 397
               +               +I  + ++Y   +I ID    TGA   + +         V
Sbjct: 423 KFRIIETVQLRGMAFEKQAEEIRRITQRYNVQSITIDGTGGTGAAVHELV---------V 473

Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG----LIQNLKSLKSFIVPNTGEL 453
                A  L +    +  + +KM   +        +G    LI +L ++K      +G +
Sbjct: 474 KFFPAANLLNYSAPIKRMMIMKMQMLIRSGRFEYDAGLHKPLITSLMTIKKIQ-TPSGII 532

Query: 454 AIESKRVKGAKSTDYSD 470
             ES RV+G    D+ D
Sbjct: 533 TYESSRVRGL---DHGD 546


>gi|182415227|ref|YP_001820293.1| hypothetical protein Oter_3416 [Opitutus terrae PB90-1]
 gi|177842441|gb|ACB76693.1| protein of unknown function DUF264 [Opitutus terrae PB90-1]
          Length = 521

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 53/328 (16%), Positives = 96/328 (29%), Gaps = 66/328 (20%)

Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPR-RLS 234
              P T  G        +I DE +   D   I       ++      F    S+      
Sbjct: 153 AANPRTARGFSGD----LILDEFAFHQDSRAIWEAAEPIISA--NPEFLCRISSTGNGRR 206

Query: 235 GKFYEIFNKPL---------DDWKRFQIDTRTV---EGIDPSFHEGIIARYGLDSDVTRV 282
             FY++  +           D WKR +I   +V   E I P       +    D      
Sbjct: 207 NMFYQLIAEGRIPYYRMRRSDAWKRGEIRIYSVVTGEEITPDQARAEAS----DKRAYDQ 262

Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL----------------IMGCDIAE 326
              G+F  +   + +   +I  A  RE     +                    +G D+  
Sbjct: 263 NYEGEFNDEAS-ALLTQELISAA-EREGIAIEHQEWSEATIERLRTKTIGDLFLGQDVGR 320

Query: 327 EGGDNTV--VVLRRGPVIEHLFDWSKTDLRTTN--NKISGLVEKYRPDAIIIDANNTGAR 382
           +  D +V  V+ R G     +      ++R      ++  + +  +  +  ID    G  
Sbjct: 321 K-RDFSVQTVIERIGSGYRVVAMLRMENMRLPAQQRELEKICKLPKFRSAEIDMTGLGLG 379

Query: 383 TCDYLEM--LGYHVYRVLGQKRAVDLEFCR-------------NRRTELHVKMADWLEFA 427
             +Y +    G +V  V            R                TEL     D  +  
Sbjct: 380 LVEYAQEEPWGGNVRGVNFGSSEPISLKLRADGKKGETAPVTELMATELLGVFED--KRI 437

Query: 428 SLINHSGLIQNLKSLKSFIVPNTGELAI 455
            +     L  +L+  +  + P+ G ++I
Sbjct: 438 EIPMDPELRDDLRKPEKLVSPS-GRVSI 464


>gi|238760573|ref|ZP_04621704.1| Terminase, ATPase subunit [Yersinia aldovae ATCC 35236]
 gi|238701192|gb|EEP93778.1| Terminase, ATPase subunit [Yersinia aldovae ATCC 35236]
          Length = 590

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 27/137 (19%), Positives = 42/137 (30%), Gaps = 22/137 (16%)

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIM 320
            YG      +  +  +F      S  P   ++  +                   Y P+ +
Sbjct: 355 EYGPSE--YQNLLMCEFVDDQA-SVFPFKELQACMVDSLEEWEDYNPYSLRPFGYRPVWI 411

Query: 321 GCDIAE-EGGDNT---VVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAI 372
           G D +E  GGD+    V+   +  G     L    W   D       I  L +KY  + I
Sbjct: 412 GYDPSEANGGDSAGCAVIAPPMVPGGKFRVLERHQWKGMDFEAQAKHIEELTQKYCVEYI 471

Query: 373 IIDANNTGARTCDYLEM 389
            IDA   G      +  
Sbjct: 472 GIDATTVGQGVFQLVRQ 488


>gi|186896884|ref|YP_001873996.1| hypothetical protein YPTS_3586 [Yersinia pseudotuberculosis PB1/+]
 gi|186699910|gb|ACC90539.1| protein of unknown function DUF264 [Yersinia pseudotuberculosis
           PB1/+]
          Length = 595

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 26/163 (15%), Positives = 52/163 (31%), Gaps = 20/163 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  E +  +Y  D+    +     F     DS    +++
Sbjct: 333 PDGQWRYVITLENAIDGGFNLADIERLRNKYNRDT--FNMLYMCVFVDSG-DSVFKFHML 389

Query: 303 EEALNREPCPDPYA----------PLIMGCDIAEEGGDNTVVVLR----RGPVIEHLFD- 347
           E+          +            +  G D A  G  +T V++      G     L   
Sbjct: 390 EKCGVDIEMWQDHDFSAPRPFGNREVWGGFDPARSGDTSTFVIIAPPQFEGERFRVLATF 449

Query: 348 -WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
            W   +     N+I  L ++Y    I +D    G    + ++ 
Sbjct: 450 YWQGLNFNYQANQIKELFQRYNMTYIGVDITGIGNGVFELVQN 492


>gi|226305996|ref|YP_002765956.1| hypothetical protein RER_25090 [Rhodococcus erythropolis PR4]
 gi|226185113|dbj|BAH33217.1| hypothetical protein RER_25090 [Rhodococcus erythropolis PR4]
          Length = 402

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 48/366 (13%), Positives = 96/366 (26%), Gaps = 71/366 (19%)

Query: 65  HCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTL 124
           H        +   FK  +  GR  GKTT     +       PG  V   A +  Q +  +
Sbjct: 5   HQSQRKIAESSSRFKV-LRCGRRFGKTTYAVEEMKGACLFEPGP-VAYFATTRDQARDIV 62

Query: 125 WAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTF 184
           WAE+    +++   ++       L       D     + +         R          
Sbjct: 63  WAEL--LENVIGTTNYVSHNEQRLEVTLRRPDGSLNRIRLFGWENIETARG--------- 111

Query: 185 VGHHNTYGMAIINDEAS---GTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFY-EI 240
                     ++ DE              I   L +      +     P+     +  E 
Sbjct: 112 -----KKYSLVVLDELDSMRAFEKQWREIIRATLADYRGRALF--MGTPKGYKSLYRLEK 164

Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
            +K   +++ F   +     +     + +     +       E+  ++ + +        
Sbjct: 165 LSKTNANYEVFHFTSFDNPFLSVEELDEMRGEMTVTQ--YAQEMLAEYHKME-------G 215

Query: 301 IIEEALNRE----PCPDPYAPLIMGCDIAEE----------GGDNTVVVLRRGPVIEHLF 346
           +I E  NR+      P       +  D              G DN+         ++ + 
Sbjct: 216 LIYEEFNRDQHIKALPFTPERWALSIDFGYNHPFAAGIFAIGSDNS-------LHLDRMV 268

Query: 347 DWSKTDLRTTNNKISGLVEKYR--------PDAIIIDANNTGARTCDYLEMLGYHVYRVL 398
              K       N +  L+   +         D + ID              LG  +  V+
Sbjct: 269 YKRKLSDEQRMNAVRDLIGDTKLDFQIGDSEDPLAIDT---------LNRQLGLKIQPVV 319

Query: 399 GQKRAV 404
               +V
Sbjct: 320 KGAGSV 325


>gi|94994695|ref|YP_602793.1| phage terminase [Streptococcus pyogenes MGAS10750]
 gi|94548203|gb|ABF38249.1| phage terminase [Streptococcus pyogenes MGAS10750]
          Length = 476

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 52/347 (14%), Positives = 111/347 (31%), Gaps = 47/347 (13%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQ   +  + A   + +       +          GKT +   + LW +    G+ ++ 
Sbjct: 48  PWQENMLIPIMAVDEDGLWVHQKYGYAIPRRN----GKTEVVYIVELWAL--HKGLKILH 101

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
            A+  +      + +V K+L +     + + +    + A     +   + G   +  +  
Sbjct: 102 TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKASGAVIQFRT-- 155

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232
            RT +    + F          +I DEA          +   +T+ + N   IM   P  
Sbjct: 156 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 205

Query: 233 --LSGKFYEIFNKP----------LDDWKRFQ------IDTRTVEGIDPSFH---EGIIA 271
              +G  +E + K             +W   +      + +  +      FH     I A
Sbjct: 206 MVSTGTVFEAYRKDCLKGNKRYSGWAEWSVPEMVKINDVSSWYISNPSMGFHLNERKIEA 265

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330
             G D     ++  G +P  +  S I      +  L  E  P+  + L +G    ++G +
Sbjct: 266 ELGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 323

Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
            +  +  R       +       +R     I   ++      ++ID 
Sbjct: 324 VSLSIAARTSENKVFVETIDCLSVRNGTQWIINFLKSADIAKVVIDG 370


>gi|320105341|ref|YP_004180931.1| hypothetical protein AciPR4_0096 [Terriglobus saanensis SP1PR4]
 gi|319923862|gb|ADV80937.1| hypothetical protein AciPR4_0096 [Terriglobus saanensis SP1PR4]
          Length = 484

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 57/342 (16%), Positives = 102/342 (29%), Gaps = 61/342 (17%)

Query: 106 PGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGID 165
           PG   + +A++    +  ++  V +    LP+         S       ++V   +    
Sbjct: 84  PGTMTVLVAHTREATEQ-MFRIVQRMWENLPDDLREGPAKRS------RANVGQMAFPAL 136

Query: 166 SKHYSTMCRT-YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFW 224
              +  +     +  R  +    H +       D AS        G+   L         
Sbjct: 137 DSEFRVVSAGEQNAGRSMSIQNLHCSELSRWPGDAASTLA-----GLKAALAPGGE---M 188

Query: 225 IMTSNPRRLSGKFYEIF---------NKPLDDWKRFQIDTRTVEGIDPSFHEG-IIARYG 274
           ++ S P    G FY+ +            L  W         VE  D +  E  +++R G
Sbjct: 189 VLESTPNGAYGCFYQEWMEAEAQRMARHFLPWWMEPTYLGARVEASDWTEEERALVSREG 248

Query: 275 LDSD----------VTRVEVCGQFPQQDI-------DSFIPLNIIEEALN---------- 307
           L  +            R     +F +  +       + F  L+ IE  L           
Sbjct: 249 LRPEQIGYRRELQRTYRGMARQEFAEDAVSCFRASGECFFELDAIEARLAELTPPLASRR 308

Query: 308 -----REPCPDPYAPLIMGCDIAEEGGD---NTVVVLRRGPVIEHLFDWSKTDLRTTNNK 359
                    P      ++  D A  G +       V+     ++      + + R     
Sbjct: 309 SGSLLLWMPPVKGRRYLIASDPAGGGSEGDFAAAQVVDIDLGLQCAELRQRLNPRELAEV 368

Query: 360 ISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQK 401
           +  L  +Y    I+++ NN GA    YLE  G  VY   GQ 
Sbjct: 369 LIDLAREYNGALIVVERNNHGAGVLAYLEKRGVAVYEEGGQA 410


>gi|238765012|ref|ZP_04625949.1| Terminase, ATPase subunit [Yersinia kristensenii ATCC 33638]
 gi|238696781|gb|EEP89561.1| Terminase, ATPase subunit [Yersinia kristensenii ATCC 33638]
          Length = 587

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 35/207 (16%), Positives = 61/207 (29%), Gaps = 32/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP----------- 314
            + +   Y       +  +  +F      S  P   ++  +                   
Sbjct: 349 LDQLALEY--SPAEYQNLLMCEFVDDK-TSVFPFEELQGCMVDSLEEWDDFNPYAYRPFG 405

Query: 315 YAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
           Y  + +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 406 YRAVWLGYDPSHTGDSAGCVVLAPPLVPGGKFRILERHQWKGMDFATQAESIKTLTEKYC 465

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +                 ++ +    +T L +K  D +    
Sbjct: 466 VEYIGIDATGIGQGVYQLVRE---------FFPAVQEIRYSPEIKTALVLKAKDLITSGR 516

Query: 429 LINHSG---LIQNLKSLKSFIVPNTGE 452
           L   SG   + Q+  +++  +  + G 
Sbjct: 517 LEYDSGHTDITQSFMAIRKTMTASGGR 543


>gi|238788385|ref|ZP_04632179.1| Terminase, ATPase subunit [Yersinia frederiksenii ATCC 33641]
 gi|238723631|gb|EEQ15277.1| Terminase, ATPase subunit [Yersinia frederiksenii ATCC 33641]
          Length = 587

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 35/207 (16%), Positives = 61/207 (29%), Gaps = 32/207 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP----------- 314
            + +   Y       +  +  +F      S  P   ++  +                   
Sbjct: 349 LDQLALEY--SPAEYQNLLMCEFVDDK-TSVFPFEELQGCMVDSLEEWDDFNPYAYRPFG 405

Query: 315 YAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
           Y  + +G D +  G     VVL      G     L    W   D  T    I  L EKY 
Sbjct: 406 YRAVWLGYDPSHTGDSAGCVVLAPPLVLGGKFRILERHQWKGMDFATQAESIKTLTEKYC 465

Query: 369 PDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFAS 428
            + I IDA   G      +                 ++ +    +T L +K  D +    
Sbjct: 466 VEYIGIDATGIGQGVYQLVRE---------FFPAVREIRYSPEIKTALVLKAKDLITSGR 516

Query: 429 LINHSG---LIQNLKSLKSFIVPNTGE 452
           L   SG   + Q+  +++  +  + G 
Sbjct: 517 LEYDSGHTDITQSFMAIRKTMTASGGR 543


>gi|225621691|ref|YP_002724049.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1]
 gi|225547649|gb|ACN93626.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1]
          Length = 450

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 61/332 (18%), Positives = 108/332 (32%), Gaps = 51/332 (15%)

Query: 55  QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLM----STRP-GIS 109
           Q E +  +++H  + V      +F G I++    GKT L ++L++  +    S      +
Sbjct: 49  QKEVLFDIESHDYSKV------IFSGGIAS----GKTFLASYLLVKKLIENKSFYEQDTN 98

Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169
              + NS   L T    ++ K  SL        +          +  +    L I     
Sbjct: 99  NFIIGNSIGLLMTNTVKQIEKICSL------LGIDYEKKKSGQSFCKIAGLKLNI----- 147

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229
                 Y  +  D F          I  +EA+       L ++  L  R      I  +N
Sbjct: 148 ------YGGKNRDAFSKIRGGNSAIIYVNEATVIHKETLLEVIKRL--RKGKEIIIFDTN 199

Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQF 288
           P   +  F   + +  D +K +   T         F +     Y       R  V  G++
Sbjct: 200 PESPAHYFKTDYIENTDVFKTYTFTTYDNPLNSADFIQTQEKLY-RRFPAYRARVLYGEW 258

Query: 289 PQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVLRRGPVIEHLFD 347
              +   F      E   N++         IM  D A   GGDNT + +      E  + 
Sbjct: 259 ILNESTLF-----NEMIFNQDYEFKSP---IMYIDPAFSVGGDNTAICVLERTF-EKFYA 309

Query: 348 WSKTDLRTTN-----NKISGLVEKYRPDAIII 374
           +   D +  +       I  L+E +  + + I
Sbjct: 310 YIYQDQKPVSDSLVLASIQVLIENFNVNTVYI 341


>gi|306818204|ref|ZP_07451935.1| possible phage terminase protein [Mobiluncus mulieris ATCC 35239]
 gi|304649168|gb|EFM46462.1| possible phage terminase protein [Mobiluncus mulieris ATCC 35239]
          Length = 470

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 58/389 (14%), Positives = 100/389 (25%), Gaps = 51/389 (13%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQ    EV       + N       +  ++  R  GKTTL   L+  +    PG  V  
Sbjct: 33  PWQKLVAEVACE--RQAANPERARYQRVIVTVPRQSGKTTLIKALMAAVAQANPGCKVYY 90

Query: 113 LANSETQLKTTL--WAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170
            A +    K  +  W E++K L             + +        ++        + ++
Sbjct: 91  TAQTR---KDAVEKWGELAKQLRKEMGTGPDGKPRVKVLEGTGNEKIVFQGTESVIQPFA 147

Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL--------------- 215
                          G H       I DEA          ++  L               
Sbjct: 148 PTP-----------GGLHGATSPLAIVDEAWAFDQAQGDDLMAALNPVGLTIPHSQVWII 196

Query: 216 -TERNANRFWI---------MTSNPRRLSGKF---YEIFNKPLDDWKRFQIDTRTVEGID 262
            T  +    W+          TS+P   +  F    +      D      +      G  
Sbjct: 197 STAGDTRSQWLKSLVDDGRAATSDPGATTAFFEWSADEETADADLRGDAALSFHPALGYT 256

Query: 263 PSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN-IIEEALNREPCPDPYAPLIMG 321
               +           + R      +P     S I L      A      P       + 
Sbjct: 257 QELWKLKALGKDEKDHLYRRAYLNLWPTNAQTSIIDLETWDGLATEISETPT---GATIA 313

Query: 322 CDIAEEGGDNTVVVLRRGPVIEHLFD-WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380
            D+A+     T+    +      L    SK         I  L +   P A++ D +   
Sbjct: 314 FDVADGRTGATIYAAWQQDNSVCLHRLISKAGAAWIEKAIEHLQDTLAPAALVADDSGDN 373

Query: 381 ARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409
               + L   G  +Y +  ++ A      
Sbjct: 374 RPIIEGLRRAGATIYALKPKEYATANSEF 402


>gi|170023146|ref|YP_001719651.1| hypothetical protein YPK_0897 [Yersinia pseudotuberculosis YPIII]
 gi|169749680|gb|ACA67198.1| protein of unknown function DUF264 [Yersinia pseudotuberculosis
           YPIII]
          Length = 595

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 25/163 (15%), Positives = 52/163 (31%), Gaps = 20/163 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  E +  +Y  D+    +     F     DS    +++
Sbjct: 333 PDGQWRYVITLEDAIDGGFNLADIERLRNKYNRDT--FNMLYMCVFVDSG-DSVFKFHML 389

Query: 303 EEALNREPCPDPYA----------PLIMGCDIAEEGGDNT--VVVLRR--GPVIEHLFD- 347
           E+          +            +  G D A  G  +T  ++   +  G     L   
Sbjct: 390 EKCGVDIEMWQDHDFSAPRPFGNREVWGGFDPARSGDTSTFAIIAPPQFEGERFRVLVTF 449

Query: 348 -WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
            W   +     N+I  L ++Y    I +D    G    + ++ 
Sbjct: 450 YWQGLNFNYQANQIKELFQRYNMTYIGVDITGIGNGVFELVQN 492


>gi|226953564|ref|ZP_03824028.1| possible ATPase terminase subunit [Acinetobacter sp. ATCC 27244]
 gi|226835689|gb|EEH68072.1| possible ATPase terminase subunit [Acinetobacter sp. ATCC 27244]
          Length = 374

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 64/357 (17%), Positives = 103/357 (28%), Gaps = 87/357 (24%)

Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE---ASGTPDVINL 209
              D +  S G + +   T  RT          G H      +  DE     G  D +  
Sbjct: 6   LTGDPIILSNGAELRFLGTNYRTA--------QGPHGN----LYFDEIFWTYGF-DELEK 52

Query: 210 GILGFLTERNANRFWIMTSNPRRLSGKFYE-----IFNKPLDDWKRFQID--------TR 256
              G  T     +     S P  ++ + Y+      FNK     ++F+ID         R
Sbjct: 53  VASGMATHDKWRKT--YFSTPSSITHEAYKFWTGTRFNKGKPKDQQFKIDLSHKALKHGR 110

Query: 257 TVE----------------GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
             E                G D    E +   Y  +          +F      S  PL+
Sbjct: 111 VCEDLMWRQIVTVEDAKEGGCDLFNIERLKFEYSPED--FANLFMCEFVDDGQ-SMFPLS 167

Query: 301 IIEEALNREP-----------CPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL 345
           +++  +                P    P+ +G D A  G +  +VV+      G     L
Sbjct: 168 MLQICMVDTLEIWNDFKIWHNRPFSNKPVWIGYDPALTGDNAGLVVVAPPAVAGGKFRVL 227

Query: 346 FD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD-------YLEMLGYHVYR 396
               +   D      +I  +  +Y    I ID    G    +        L    Y    
Sbjct: 228 EKHQFKGDDFSEQAERIRAITLRYNVTYIGIDTTGMGYGVAELVRAFFPALTTFNYSP-E 286

Query: 397 VLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGEL 453
           V  Q     L+  RN R E         +         L Q+L S+K  +  +  ++
Sbjct: 287 VKSQLVYKTLDVIRNGRLE--------FDAG----DKDLAQSLMSIKKTLTSSQKQI 331


>gi|291618711|ref|YP_003521453.1| P [Pantoea ananatis LMG 20103]
 gi|291153741|gb|ADD78325.1| P [Pantoea ananatis LMG 20103]
          Length = 588

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 26/143 (18%), Positives = 48/143 (33%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            E +  RY  +    +  +   F   D+ S   L +++  +                P  
Sbjct: 348 LEQLRTRYSPED--YQNLLMCVF-MDDLASVFQLAMLQRCMVDSWEVWDDFEALALRPFG 404

Query: 315 YAPLIMGCDIAEE--GGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+    GD+    V+      G     L    W   D R   + I  L +
Sbjct: 405 WKEVWIGYDPAKGTKNGDSAGCVVIAPPAVPGGKFRILERHQWRGMDFRAQADAIKSLTQ 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGLGVYENVK 487


>gi|281356810|ref|ZP_06243301.1| protein of unknown function DUF264 [Victivallis vadensis ATCC
           BAA-548]
 gi|281316937|gb|EFB00960.1| protein of unknown function DUF264 [Victivallis vadensis ATCC
           BAA-548]
          Length = 417

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 75/419 (17%), Positives = 128/419 (30%), Gaps = 72/419 (17%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICL---ANSETQLKTTLWAEVSKWLSLLPNKHW 140
            G   GKT +  + +L       G     L   AN+  Q         S     LP    
Sbjct: 29  GGSRSGKTFILVYAILVRALRAAGSRHAILRLHANTVRQ---------SIRFDTLPKVVK 79

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
                L L  +    D L          +  +    +EER D  +G        I  +E 
Sbjct: 80  LAFPGLGLSESK--VDQLIRLPNGSELWFGGLD---TEERADKILG---KEFATIYFNEC 131

Query: 201 SGTPDVINLGILGFLTERNA-----NRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT 255
           S   ++    +   LT         NR W    NP   S   Y +F +  D      +  
Sbjct: 132 S---ELGFEAVSTALTRLAQRTALKNRAWFDC-NPAGKSHWSYRLFIERRDPVSGLPLSF 187

Query: 256 RTVE---GIDPSFHEGIIARYGLDSDVT----RVEVC---GQFPQQDIDSFIPLNIIEEA 305
                   ++P+ +   +    L+  +     R  +    G +      +     +IE  
Sbjct: 188 PDNYASMLLNPAENRENLPEGYLEETLAGLTERQRLRFQEGAWLDDLSGALWSTAMIER- 246

Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNT----VVVLRRG--PVIEHLFDWSKTDLRT-TNN 358
            +R         +++G D A   G ++    +V   RG       L D S  +       
Sbjct: 247 -SRVEAAPSLERIVIGVDPAVTSGKDSDETGIVTAGRGADGHYYVLADASCRERPAGWAA 305

Query: 359 KISGLVEKYRPDAIIIDANNTGARTCDYLE--MLGYHVYRVLGQKRAVDLEFCRNRRTEL 416
           ++     ++R D ++ + NN G      L    L     +V   +  +        R E 
Sbjct: 306 RVRDEYRRFRADRVVAEVNNGGDLVETVLRSQELDLPFRQVRAMRGKI-------ARAE- 357

Query: 417 HVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
              +A   E    ++H G  + L+   +   P TG             S D  D L++ 
Sbjct: 358 --PVAALYEQGK-VHHVGCFRELEEQMTSFTPQTG-----------TGSPDRLDALVWA 402


>gi|323937704|gb|EGB33972.1| terminase [Escherichia coli E1520]
          Length = 433

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 70/417 (16%), Positives = 122/417 (29%), Gaps = 62/417 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140
           AG G GKT +    +   +   PGI+    A +  Q++   +    EV+    L    + 
Sbjct: 25  AGFGSGKTWVGCGGICKGIWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 84

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
              +    +   +    +              CR  S E+P T VG      +    DE 
Sbjct: 85  GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 125

Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248
              P          I+  +  + +  R  I  +         YE F K +         +
Sbjct: 126 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 185

Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
              Q  T   E  +   +   ++  Y    ++ +  + GQF      +        +  N
Sbjct: 186 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLRGQFTNLTSGTVYH-QFDRKLNN 242

Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL---- 363
            E    P  P+ +G D         V VLR G         +  D       I       
Sbjct: 243 CEEVEQPGEPIYIGMDFNVGKMAGIVHVLRLGLPCAVTEIINAYDTPDMIRIIKERFWLY 302

Query: 364 ----VEKYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412
                 K R   I  DA+      + A T D   L+  G++V  V+        +   + 
Sbjct: 303 DGNDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 360

Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS 469
              +         +   +    L     SL+  +    G    E  +  G    + +
Sbjct: 361 NA-MFCNANGERRYKVNVKRCPLYAE--SLEQQVWDEKG----EPDKKSGNDHPNDA 410


>gi|301025610|ref|ZP_07189133.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           69-1]
 gi|300395930|gb|EFJ79468.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           69-1]
          Length = 435

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 70/417 (16%), Positives = 122/417 (29%), Gaps = 62/417 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140
           AG G GKT +    +   +   PGI+    A +  Q++   +    EV+    L    + 
Sbjct: 27  AGFGSGKTWVGCGGICKGIWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 86

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
              +    +   +    +              CR  S E+P T VG      +    DE 
Sbjct: 87  GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 127

Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248
              P          I+  +  + +  R  I  +         YE F K +         +
Sbjct: 128 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 187

Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
              Q  T   E  +   +   ++  Y    ++ +  + GQF      +        +  N
Sbjct: 188 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLRGQFTNLTSGTVYH-QFDRKLNN 244

Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL---- 363
            E    P  P+ +G D         V VLR G         +  D       I       
Sbjct: 245 CEEVEQPGEPIYIGMDFNVGKMAGIVHVLRLGLPCAVTEIINAYDTPDMIRIIKERFWLY 304

Query: 364 ----VEKYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412
                 K R   I  DA+      + A T D   L+  G++V  V+        +   + 
Sbjct: 305 DGNDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 362

Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS 469
              +         +   +    L     SL+  +    G    E  +  G    + +
Sbjct: 363 NA-MFCNANGERRYKVNVKRCPLYAE--SLEQQVWDEKG----EPDKKSGNDHPNDA 412


>gi|209918626|ref|YP_002292710.1| hypothetical protein ECSE_1435 [Escherichia coli SE11]
 gi|209911885|dbj|BAG76959.1| conserved hypothetical protein [Escherichia coli SE11]
          Length = 436

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 70/417 (16%), Positives = 122/417 (29%), Gaps = 62/417 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140
           AG G GKT +    +   +   PGI+    A +  Q++   +    EV+    L    + 
Sbjct: 28  AGFGSGKTWVGCGGICKGIWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 87

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
              +    +   +    +              CR  S E+P T VG      +    DE 
Sbjct: 88  GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 128

Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248
              P          I+  +  + +  R  I  +         YE F K +         +
Sbjct: 129 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 188

Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
              Q  T   E  +   +   ++  Y    ++ +  + GQF      +        +  N
Sbjct: 189 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLRGQFTNLTSGTVYH-QFDRKLNN 245

Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL---- 363
            E    P  P+ +G D         V VLR G         +  D       I       
Sbjct: 246 CEEVEQPGEPIYIGMDFNVGKMAGIVHVLRLGLPCAVTEIINAYDTPDMIRIIKERFWLY 305

Query: 364 ----VEKYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412
                 K R   I  DA+      + A T D   L+  G++V  V+        +   + 
Sbjct: 306 DGNDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 363

Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS 469
              +         +   +    L     SL+  +    G    E  +  G    + +
Sbjct: 364 NA-MFCNANGERRYKVNVKRCPLYAE--SLEQQVWDEKG----EPDKKSGNDHPNDA 413


>gi|318604142|emb|CBY25640.1| phage terminase, ATPase subunit [Yersinia enterocolitica subsp.
           palearctica Y11]
 gi|318605359|emb|CBY26857.1| phage terminase, ATPase subunit [Yersinia enterocolitica subsp.
           palearctica Y11]
          Length = 590

 Score = 51.3 bits (121), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 24/133 (18%), Positives = 40/133 (30%), Gaps = 20/133 (15%)

Query: 276 DSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIMGCDI 324
                +  +  +F      S  P   ++  +                   + P+ +G D 
Sbjct: 357 SPAEYQNLLMCEFVDDQA-SVFPFAELQACMVDSLEEWEDYNPYSLRPFGHRPVWIGYDP 415

Query: 325 AE-EGGDNT---VVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
           +E  GGD+    V+   +  G     L    W   D       I  L +KY  + I IDA
Sbjct: 416 SEANGGDSAGCAVIAPPMVPGGKFRVLERHQWKGMDFEAQAKHIEELTQKYCVEYIGIDA 475

Query: 377 NNTGARTCDYLEM 389
              G      +  
Sbjct: 476 TTVGQGVFQLVRQ 488


>gi|254465926|ref|ZP_05079337.1| phage DNA Packaging Protein [Rhodobacterales bacterium Y4I]
 gi|206686834|gb|EDZ47316.1| phage DNA Packaging Protein [Rhodobacterales bacterium Y4I]
          Length = 428

 Score = 51.3 bits (121), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 65/433 (15%), Positives = 119/433 (27%), Gaps = 67/433 (15%)

Query: 82  ISAGRGIGKTTLNA-WLVLWLMSTRPGI-----SVICLANSETQLKTTLWAEVSKWLSLL 135
           I  GRG GKT   A W+       RP        +  +A +  Q++  +   +     +L
Sbjct: 36  ILGGRGAGKTRAGAEWVRSLAEGARPHDPGTARRIALVAETYDQVRDVM---IHGDSGIL 92

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
                           P        S            + +S   P+   G       A 
Sbjct: 93  ACSP------------PDRRPKWKASERKLIWPNGAEAQAFSAHDPEALRGPQFD---AA 137

Query: 196 INDE----ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF 251
             DE      G        +   L     +    +T+ P R +     +   P    +  
Sbjct: 138 WADELAKWRKG--QESWDMLQFAL-RLGQDPRVCVTTTP-RNAPVLKRLLASPSTV-QTH 192

Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
                    + PSF E + +RY   + + R E+ G        +      + EA   E  
Sbjct: 193 AATEANRANLAPSFLEEVRSRY-AGTRLGRQELDGVMLSDIQGALWTTAALVEAQVAEAP 251

Query: 312 PDPYAPLIMGCDIA---EEGGDNTVVVLRRGPVIEHLFDWS----------KTDLRTTNN 358
           P     +++  D A    +  D   +V+          DW                    
Sbjct: 252 P--LDRVVVAVDPAVSAGKDSDACGIVVAGAVTRGKPQDWQAYVLADCTVQGVGPLAWAQ 309

Query: 359 KISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHV 418
            +    +++  + ++ + N  GA     L      V       RA+     ++ R E   
Sbjct: 310 AVIAARDRFGAERVVAEVNQGGALVESVLRQADPLV-----PFRALHARKGKSARAEPVA 364

Query: 419 KMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478
            + +      L     L   +               +  +  +G  S D  D L++   E
Sbjct: 365 ALYEQGRVRHLPGLGELEDQMC-------------QMTPQGYRGGGSPDRVDALVWALHE 411

Query: 479 NPPRSDMDFGRCP 491
              +   +  R  
Sbjct: 412 LIIQPAANLRRPQ 424


>gi|294338167|emb|CBJ94203.1| Putative phage DNA packaging protein (terminase) [Campylobacter
           phage CPt10]
          Length = 731

 Score = 51.3 bits (121), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 58/365 (15%), Positives = 112/365 (30%), Gaps = 63/365 (17%)

Query: 64  AHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTT 123
            HC +     +   +   + +     K+T  +  +  L   +  I++  +A         
Sbjct: 260 DHCYDLTLEHHHLYYTNGVLS-HNSSKSTTTSVKLAHLYCFKKDINIGIVA--------- 309

Query: 124 LWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE-ERPD 182
                    S    + + +     L   P +      +    S       +  ++    D
Sbjct: 310 --------YSGNSAREFLDKTKKMLIGLPIWMQPGTVTWNKGSIECENNIKILTDVPSSD 361

Query: 183 TFVGHHNTYGMAIINDEASGTPDVIN-LGILGFLTERNANRF--WIMTSNPRRLSGKFYE 239
            F G   T    I+ DE +            G L  +    F   ++ S P+  +  FY+
Sbjct: 362 AFRG---TSTNIIVVDECAYLDPAGWIDFTDGVLPSQAGLAFKKLVILSTPKGKN-HFYD 417

Query: 240 IFN--------------KPLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV 284
           I+               +   DW+   +  +   +     F +  I   GL   V     
Sbjct: 418 IWQGAGDTLETSINGFVRHRVDWRLVPRFKSDGTKYDPEEFKQQQIKTGGL--VVWNSAY 475

Query: 285 CGQFPQQDIDSFIPLNIIEEALNREPC---------------PDPYAPLIMGCDIAEEGG 329
             +F +    + IP  I++    +EP                P P    +MG D A+EG 
Sbjct: 476 ECKF-EGSAMTLIPSEILDTYKPQEPIEVDNIKDSKILIYEEPIPGHKYVMGVDTAKEGA 534

Query: 330 DNT-VVVLRRGPVIEHLFDWSKT--DLRTTNNKISGLVEKYRPDAIIIDAN-NTGARTCD 385
           D T V +     +       +K   D       ++    ++    II++ N  +G    D
Sbjct: 535 DFTGVQIFDITDLNFRQVASAKLKIDYMLLPELLNEYGLRFNQALIIVENNEGSGQVVAD 594

Query: 386 YLEML 390
            L+  
Sbjct: 595 ILKRD 599


>gi|224796473|ref|YP_002641230.1| phage terminase, large subunit, pbsx family [Borrelia spielmanii
           A14S]
 gi|224497687|gb|ACN53304.1| phage terminase, large subunit, pbsx family [Borrelia spielmanii
           A14S]
          Length = 450

 Score = 51.3 bits (121), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 62/332 (18%), Positives = 110/332 (33%), Gaps = 51/332 (15%)

Query: 55  QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLM----STRP-GIS 109
           Q E +  +++H  + V      +F G I++    GKT L ++L++  +    S      +
Sbjct: 49  QKEVLFDIESHKYSKV------IFSGGIAS----GKTFLASYLLIKKLIENKSFYEQDTN 98

Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169
              + NS + L T    ++ K   LL   +  +    S      +   ++     D    
Sbjct: 99  NFIIGNSISLLMTNTIKQIEKICRLLGIDYQKKKSGQSFCKIAGFELNIYGGKNRD---- 154

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229
                         F          I  +EA+       L +L  L  R      I  +N
Sbjct: 155 -------------AFSKIRGGNSAIIYVNEATVIHKETLLEVLKRL--RKGKSIIIFDTN 199

Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQF 288
           P   +  F   + +  D +K +   T         F E     Y   S   +  V  G++
Sbjct: 200 PESPAHFFKTDYIENTDVFKTYNFTTYDNPLNSADFIETQEKLY-KHSPAYKARVLYGEW 258

Query: 289 PQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVLRRGPVIEHLFD 347
              +   F      E   N++         IM  D A   GGDNT + +      E  + 
Sbjct: 259 IVNESSLF-----NEMIFNQDYEFKSP---IMYIDPAFSVGGDNTAICVLERTF-EKFYA 309

Query: 348 WSKTDLRTTNN-----KISGLVEKYRPDAIII 374
           +   D +  N+      I  L+E +  + + I
Sbjct: 310 YIYQDQKPVNDSLMLNSIQVLIENFNVNTVYI 341


>gi|291336835|gb|ADD96368.1| phage terminase large subunit [uncultured organism
           MedDCM-OCT-S09-C20]
          Length = 454

 Score = 51.3 bits (121), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 49/333 (14%), Positives = 104/333 (31%), Gaps = 32/333 (9%)

Query: 40  EKGTPLEGFSAP---RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAW 96
           E   P+E    P    + + + +          +++ +  +   +   G G GK+   A 
Sbjct: 14  EPKRPVERAIDPGAADALRAKILADCLPAQREFLDDESHRIL--SYIGGFGSGKSFALAA 71

Query: 97  LVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSD 156
            +++L    PG +++    +   ++T L   +      +    W    S    P P Y  
Sbjct: 72  KLIFLGLRNPGGTLMACEPTFPMIRTVLVPAI-----DMALDQWDIEYSYRASPQPEY-- 124

Query: 157 VLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASG----TPDVINLGIL 212
               S+ + +   +  C++      + +         A + DE       T       +L
Sbjct: 125 ----SINLPTGPVTIYCQSA-----ENYQRIRGQNICAAVWDECDTSPVDTAQKAGEMLL 175

Query: 213 GFLTERNANRFWIMTSNPRRLSGKFYEIF-NKPLDDWKRFQIDTRTVEGIDPSFHEGIIA 271
             +     N+  +  S P       Y  F      D +  ++ T+    +   F   +  
Sbjct: 176 ARMRTGELNQLAVA-STPEG-FRWAYRTFVENDGPDKRLIRVRTQDNPHLPADFIPSLER 233

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN 331
            Y   S + +  + G F      S  P          +  P     + +G D+   G   
Sbjct: 234 NY--PSQLIQAYLEGHFVNLASCSLYP-EFDRSLNYCDTQPTENDTIWIGVDL-NVGNCV 289

Query: 332 TVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLV 364
           T  ++RRG       +    D +     +  + 
Sbjct: 290 TQHLVRRGDEFHFFAEKVYRDTQQIAQGLKEMY 322


>gi|299531659|ref|ZP_07045064.1| putative phage associated protein [Comamonas testosteroni S44]
 gi|298720375|gb|EFI61327.1| putative phage associated protein [Comamonas testosteroni S44]
          Length = 436

 Score = 51.3 bits (121), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 61/340 (17%), Positives = 122/340 (35%), Gaps = 41/340 (12%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GK+   A ++L + ++RP   V+C              E+ K +    ++   + 
Sbjct: 39  GGRGGGKSWTVAAVLLVMAASRPL-RVLCT------------REIQKSIKQSVHQ-LLKD 84

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHY-STMCRTYSEERPDTFVGHHNTYGMAIINDEASG 202
               L+   ++  +     GI+   +  +  ++++ +   +F G        +  +EA G
Sbjct: 85  VITRLNLHAFFEVLETEVRGINGSLFLFSGLQSHTVDSIKSFEGCD-----IVWVEEAHG 139

Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF-NKPLDDWKRFQIDTRTVEGI 261
                   ++  + +  +  +  +  NP   + + Y+ F   P  D    +I+ R     
Sbjct: 140 VSKKSWDTLIPTIRKEGSEIWLTL--NPDMETDETYQRFIATPSPDTWVVEINWRDNPWF 197

Query: 262 DPSFHEGII-ARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE---EALNREPCPDPYAP 317
                E    A+  + +D       G+  +    +     +     +   R+   DP  P
Sbjct: 198 PRVLDEERRKAKRTMLADDYAHIWEGKARRVAAGAIYRHEMESVYLDNRARDVPYDPTLP 257

Query: 318 LIMGCDIAEEGGDNTVVVLRRGP-----VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI 372
           +    D+     D   + L +       +I H+ D  +T L     K+  L  ++  D +
Sbjct: 258 VHTVWDLGWN--DAMSIALVQRGPQDVRIIGHIEDSHRT-LDWYVAKLEKLPYRWGTDYL 314

Query: 373 IIDAN----NTGARTCDYLEMLGYHVYRVLGQKRAVDLEF 408
             D       TG  T   L  LG     V+ Q RA D+E 
Sbjct: 315 PHDGKTKNFQTGKSTEQLLRELGRR--SVMVQPRATDVEE 352


>gi|330503113|ref|YP_004379982.1| phage P2 terminase ATPase subunit, gpP-like protein [Pseudomonas
           mendocina NK-01]
 gi|328917399|gb|AEB58230.1| phage P2 terminase ATPase subunit, gpP-like protein [Pseudomonas
           mendocina NK-01]
          Length = 585

 Score = 51.3 bits (121), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 43/238 (18%), Positives = 72/238 (30%), Gaps = 51/238 (21%)

Query: 244 PLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
               W++   I      G D    + +   Y  D       +  QF     DS  PL ++
Sbjct: 326 DDKVWRQIVTILDAEARGCDLFDLDELRLEY--DGPAFDNLLMCQFVDDG-DSIFPLTML 382

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL----RRGPVIEHL--F 346
           +  +        + P          + +G D AE G    ++VL      G     L  F
Sbjct: 383 QPCMVESWDWPDFKPFAARPFGDRQVWLGYDPAENGDSAGLMVLAPPTEPGGKFRVLDRF 442

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406
            +   D      KI  L + Y    I ID    G      +                   
Sbjct: 443 QFRGMDFEAQAEKIRQLTQIYWVTYIGIDTTGMGTGVAQLVRQ----------------- 485

Query: 407 EFCRNRRTELH-------VKMADW--LEFASLINHSG---LIQNLKSLKSFIVPNTGE 452
            F  N RT  +       + M  W  +    L   +G   + Q+L +++   +  +G+
Sbjct: 486 -FFPNLRTFSYSPEVKTQLVMKAWDVVRKGRLEFDAGATDIAQSLMAIRK-TMTPSGK 541


>gi|294337972|emb|CBJ93810.1| putative phage DNA packaging protein (terminase) [Campylobacter
           phage CP220]
          Length = 744

 Score = 51.3 bits (121), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 58/365 (15%), Positives = 112/365 (30%), Gaps = 63/365 (17%)

Query: 64  AHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTT 123
            HC +     +   +   + +     K+T  +  +  L   +  I++  +A         
Sbjct: 260 DHCYDLTLEHHHLYYTNGVLS-HNSSKSTTTSVKLAHLYCFKKDINIGIVA--------- 309

Query: 124 LWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE-ERPD 182
                    S    + + +     L   P +      +    S       +  ++    D
Sbjct: 310 --------YSGNSAREFLDKTKKMLIGLPIWMQPGTVTWNKGSIECENNIKILTDVPSSD 361

Query: 183 TFVGHHNTYGMAIINDEASGTPDVIN-LGILGFLTERNANRF--WIMTSNPRRLSGKFYE 239
            F G   T    I+ DE +            G L  +    F   ++ S P+  +  FY+
Sbjct: 362 AFRG---TSTNIIVVDECAYLDPAGWIDFTDGVLPSQAGLAFKKLVILSTPKGKN-HFYD 417

Query: 240 IFN--------------KPLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV 284
           I+               +   DW+   +  +   +     F +  I   GL   V     
Sbjct: 418 IWQGAGDTLETSINGFVRHRVDWRLVPRFKSDGTKYDPEEFKQQQIKTGGL--VVWNSAY 475

Query: 285 CGQFPQQDIDSFIPLNIIEEALNREPC---------------PDPYAPLIMGCDIAEEGG 329
             +F +    + IP  I++    +EP                P P    +MG D A+EG 
Sbjct: 476 ECKF-EGSAMTLIPSEILDTYKPQEPIEVDNIKDSKILIYEEPIPGHKYVMGVDTAKEGA 534

Query: 330 DNTVVVLRRGPVI---EHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDAN-NTGARTCD 385
           D T V +     +   +      K D       ++    ++    II++ N  +G    D
Sbjct: 535 DFTGVQIFDTTDLNFRQVASAKLKIDYMLLPELLNEYGLRFNQALIIVENNEGSGQVVAD 594

Query: 386 YLEML 390
            L+  
Sbjct: 595 ILKRD 599


>gi|186896569|ref|YP_001873681.1| hypothetical protein YPTS_3269 [Yersinia pseudotuberculosis PB1/+]
 gi|186699595|gb|ACC90224.1| protein of unknown function DUF264 [Yersinia pseudotuberculosis
           PB1/+]
          Length = 725

 Score = 51.3 bits (121), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 20/144 (13%), Positives = 43/144 (29%), Gaps = 18/144 (12%)

Query: 279 VTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA----------PLIMGCDIAEEG 328
                   QF       F   + +E+ L      + +            +  G D A  G
Sbjct: 389 AFNQLYMCQFVDSGDCVF-KFDQLEKCLTNVSTWEDHDVNAMRPFGNREVWAGYDPARTG 447

Query: 329 GDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGAR 382
              + V++           + H+  W     +    +I   + +Y    I ID    G  
Sbjct: 448 DTASFVLVAPPQVDGEPFRVLHIETWHGFAFKYQVGRIKEYMTRYNITHIGIDTTGIGGP 507

Query: 383 TCDYLEML-GYHVYRVLGQKRAVD 405
            C+ ++      V ++   + + +
Sbjct: 508 VCEMVQDFARREVTQIHYSQESKN 531


>gi|331677171|ref|ZP_08377867.1| putative phage terminase [Escherichia coli H591]
 gi|331075860|gb|EGI47158.1| putative phage terminase [Escherichia coli H591]
          Length = 436

 Score = 51.3 bits (121), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 70/417 (16%), Positives = 129/417 (30%), Gaps = 62/417 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140
           AG G GKT +    +   +   PGI+    A +  Q++   +    EV+    L    + 
Sbjct: 28  AGFGSGKTWVGCGGICKGIWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 87

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
              +    +   +    +              CR  S E+P T VG      +    DE 
Sbjct: 88  GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 128

Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248
              P          I+  +  + +  R  I  +         YE F K +         +
Sbjct: 129 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 188

Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
              Q  T   E  +   +   ++  Y    ++ +  + G+F      +        +  N
Sbjct: 189 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLLGRFTNLTSGTVYH-QFDRKLNN 245

Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGP---VIEHLFDWSKTDLRTTNNKISGLV 364
            E    P  P+ +G D         V VLR G    V E +  +   D+     +   L 
Sbjct: 246 CEEVEQPGEPIYIGMDFNVGKMAGIVHVLRLGLPCAVTEIINAYDTPDMIRLIKERFWLY 305

Query: 365 E-----KYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412
           +     K R   I  DA+      + A T D   L+  G++V  V+        +   + 
Sbjct: 306 DGHDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 363

Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS 469
              +         +   +    +     SL+  +  + G    E  +  G    + +
Sbjct: 364 NA-MFCNANGERRYKVNVKRCPVYAE--SLEQQVWDDKG----EPDKKSGNDHPNDA 413


>gi|300825021|ref|ZP_07105118.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           119-7]
 gi|300522485|gb|EFK43554.1| phage terminase, large subunit, PBSX family [Escherichia coli MS
           119-7]
          Length = 435

 Score = 51.3 bits (121), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 70/417 (16%), Positives = 129/417 (30%), Gaps = 62/417 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140
           AG G GKT +    +   +   PGI+    A +  Q++   +    EV+    L    + 
Sbjct: 27  AGFGSGKTWVGCGGICKGIWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 86

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
              +    +   +    +              CR  S E+P T VG      +    DE 
Sbjct: 87  GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 127

Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248
              P          I+  +  + +  R  I  +         YE F K +         +
Sbjct: 128 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 187

Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
              Q  T   E  +   +   ++  Y    ++ +  + G+F      +        +  N
Sbjct: 188 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLLGRFTNLTSGTVYH-QFDRKLNN 244

Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGP---VIEHLFDWSKTDLRTTNNKISGLV 364
            E    P  P+ +G D         V VLR G    V E +  +   D+     +   L 
Sbjct: 245 CEEVEQPGEPIYIGMDFNVGKMAGIVHVLRLGLPCAVTEIINAYDTPDMIRLIKERFWLY 304

Query: 365 E-----KYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412
           +     K R   I  DA+      + A T D   L+  G++V  V+        +   + 
Sbjct: 305 DGHDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 362

Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS 469
              +         +   +    +     SL+  +  + G    E  +  G    + +
Sbjct: 363 NA-MFCNANGERRYKVNVKRCPVYAE--SLEQQVWDDKG----EPDKKSGNDHPNDA 412


>gi|193062487|ref|ZP_03043581.1| terminase [Escherichia coli E22]
 gi|192931609|gb|EDV84209.1| terminase [Escherichia coli E22]
          Length = 433

 Score = 51.3 bits (121), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 70/417 (16%), Positives = 129/417 (30%), Gaps = 62/417 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140
           AG G GKT +    +   +   PGI+    A +  Q++   +    EV+    L    + 
Sbjct: 25  AGFGSGKTWVGCGGICKGIWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 84

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
              +    +   +    +              CR  S E+P T VG      +    DE 
Sbjct: 85  GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 125

Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248
              P          I+  +  + +  R  I  +         YE F K +         +
Sbjct: 126 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 185

Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
              Q  T   E  +   +   ++  Y    ++ +  + G+F      +        +  N
Sbjct: 186 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLLGRFTNLTSGTVYH-QFDRKLNN 242

Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGP---VIEHLFDWSKTDLRTTNNKISGLV 364
            E    P  P+ +G D         V VLR G    V E +  +   D+     +   L 
Sbjct: 243 CEEVEQPGEPIYIGMDFNVGKMAGIVHVLRLGLPCAVTEIINAYDTPDMIRLIKERFWLY 302

Query: 365 E-----KYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412
           +     K R   I  DA+      + A T D   L+  G++V  V+        +   + 
Sbjct: 303 DGHDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 360

Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS 469
              +         +   +    +     SL+  +  + G    E  +  G    + +
Sbjct: 361 NA-MFCNANGERRYKVNVKRCPVYAE--SLEQQVWDDKG----EPDKKSGNDHPNDA 410


>gi|251810445|ref|ZP_04824918.1| large terminase subunit [Staphylococcus epidermidis BCM-HMP0060]
 gi|251806049|gb|EES58706.1| large terminase subunit [Staphylococcus epidermidis BCM-HMP0060]
          Length = 420

 Score = 50.9 bits (120), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 45/338 (13%), Positives = 109/338 (32%), Gaps = 40/338 (11%)

Query: 56  LEFMEVVDAHCLNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114
           ++  E++  H  +             +   GRG GK++  + ++   +  R  ++ + + 
Sbjct: 4   IKLSELLPKHFHSLWKATKDREKLNIVAKGGRGSGKSSDISIIIT-QLIMRYPMNAVVVR 62

Query: 115 NSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCR 174
            ++  L T+++ ++   +      H F+++                 +    +    + R
Sbjct: 63  KTDNTLATSVFEQIKWAIEEQKVSHLFKVKVS------------PMEITYVPRGNRIIFR 110

Query: 175 TYSEERPDTFVGHHNTY---GMAIINDEASG-TPDVINLGILGFL---TERNANRFWIMT 227
               + P+      ++     +  I + A   T D +       L    +      +  +
Sbjct: 111 GA--QNPERLKSLKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDDGLFYKFFFS 168

Query: 228 SNPRRLSGKF----YEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283
            NP +    +    YE   +P + +            I   F +   +    +    R E
Sbjct: 169 YNPPKRKQSWVNKKYETSFQPDNTFVHHS-TYLDNPFISKQFIQEAESAKERNEQRYRWE 227

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRG 339
             G+         +P N ++     +   D +  +  G D      D    V     ++ 
Sbjct: 228 YMGEAIGS---GVVPFNNLQIETIPQEMIDGFDNIRNGLDFG-YADDPLAFVRWHYDKKK 283

Query: 340 PVIEHLFDWSKTDL--RTTNNKISGLVEKYRPDAIIID 375
            VI  + ++    +  R   N++     KY+ D I  D
Sbjct: 284 RVIYAIDEYYGVQISNRQYANEMWK--RKYQSDDIYAD 319


>gi|332768290|gb|EGJ98475.1| hypothetical protein SF293071_0834 [Shigella flexneri 2930-71]
          Length = 65

 Score = 50.9 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 11/55 (20%), Positives = 19/55 (34%), Gaps = 4/55 (7%)

Query: 433 SGLIQNLKSLKSFIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
             L   L +         G + +ESK+    +   S + +D  +  FA      D
Sbjct: 3   EKLKLELTT-PHRDFDRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 56


>gi|330996205|ref|ZP_08320095.1| phage terminase, large subunit, PBSX family [Paraprevotella
           xylaniphila YIT 11841]
 gi|329573709|gb|EGG55300.1| phage terminase, large subunit, PBSX family [Paraprevotella
           xylaniphila YIT 11841]
          Length = 430

 Score = 50.9 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 59/349 (16%), Positives = 111/349 (31%), Gaps = 38/349 (10%)

Query: 76  EVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLL 135
           E F   I+ GRG GK+   A        T         A+S    + T+   VS  LS++
Sbjct: 22  EHFIILITGGRGSGKSFNAA--TFIERLTFEQSRDRTFAHSILYCRYTM---VSANLSII 76

Query: 136 PN-KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
           P  +   ++   S +     SD+++   G          +T S  +       H      
Sbjct: 77  PEIQEKIDIDGGSKYFKTTRSDIVNMFSGGRIMFRGI--KTSSGNQTAKLKSIHG--ITT 132

Query: 195 IINDEA-SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-- 251
            + DEA   T +     I+  + ++      I+  NP   +   Y+ + +          
Sbjct: 133 FVCDEAEEWTNEQDFDKIMLSIRQKGIQNRIIIIMNPTDSNHFIYKKYIENTHKLVEIDG 192

Query: 252 ---QIDT------------RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF 296
              QI T              ++ + P F + +      + +     V G++      + 
Sbjct: 193 VPVQISTHPNVLHIHTTYFDNIDNLSPQFIKEVEQMKAENPEKYAHTVIGRWADVAEGA- 251

Query: 297 IPLNIIEEALNREPCPDPYAPLI-MGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRT 355
                I +          +   I +G D      D T +V+      +   D      + 
Sbjct: 252 -----IYKKWGVVKSIPQWCKKIALGLDFG-FTHDETAIVMCGVMDNDLYIDEICYKTQM 305

Query: 356 TNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAV 404
               I   +  Y+   +I D+ +   R    +   G  +Y V   K +V
Sbjct: 306 LTKDIIQTLRPYQGMKVIADSAD--PRLIQEIHNAGIRIYPVEKGKGSV 352


>gi|238025823|ref|YP_002910054.1| phage terminase, ATPase subunit [Burkholderia glumae BGR1]
 gi|237875017|gb|ACR27350.1| Phage terminase, ATPase subunit [Burkholderia glumae BGR1]
          Length = 589

 Score = 50.9 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 24/142 (16%), Positives = 38/142 (26%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            E +  RY   +D     +  QF    + S   L  ++  +                 P 
Sbjct: 350 LERLRRRY--SADAFANLLMCQFIDDSV-SVFKLAELQRCMVDSWEEWADDFSPLLLRPF 406

Query: 314 PYAPLIMGCDIAEEGGDN--TVVVLRR----GPVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            Y  + +G D A  G      VV   R       +     +   D       I  +  +Y
Sbjct: 407 GYREVWVGYDPALTGDSAGLVVVAPPRVEGGTFRVLERHQFRGNDFEEQAAAIEQITRRY 466

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488


>gi|167567112|ref|ZP_02360028.1| phage terminase, ATPase subunit [Burkholderia oklahomensis EO147]
          Length = 589

 Score = 50.9 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y  +       +  QF    + S   L+ ++  +                 P 
Sbjct: 350 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367
            +  + +G D A  G    +VV+      G     L    +   D       I  + ++Y
Sbjct: 407 GHREVWVGYDPALTGDSAGLVVVAPPRVDGGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 467 HVGYIAIDTTGMGQGVYQLVRK 488


>gi|145298582|ref|YP_001141423.1| phage terminase ATPase subunit [Aeromonas salmonicida subsp.
           salmonicida A449]
 gi|145300715|ref|YP_001143556.1| phage terminase ATPase subunit [Aeromonas salmonicida subsp.
           salmonicida A449]
 gi|142851354|gb|ABO89675.1| phage terminase ATPase subunit [Aeromonas salmonicida subsp.
           salmonicida A449]
 gi|142853487|gb|ABO91808.1| phage terminase ATPase subunit [Aeromonas salmonicida subsp.
           salmonicida A449]
          Length = 606

 Score = 50.9 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 23/139 (16%), Positives = 46/139 (33%), Gaps = 19/139 (13%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP-------- 317
            E +   Y    +V       +F   D  S      +E A       + Y P        
Sbjct: 365 IEELKDEY--PEEVFDRLYLCRFID-DALSVFKFQDMERAGVDPTRWEDYKPGRPDPFGR 421

Query: 318 --LIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369
             + +G D +    + T+VV+           +     W   + +    +I+ + +K+R 
Sbjct: 422 REVWLGYDPSRTRDNATLVVVAPPTVAGERFRVLEKHYWRGLNFQYQAQEITRIAKKFRV 481

Query: 370 DAIIIDANNTGARTCDYLE 388
             + +D +  G    D L+
Sbjct: 482 TYLGVDVSGIGVGVFDLLK 500


>gi|188532717|ref|YP_001906514.1| Terminase, ATPase subunit [Erwinia tasmaniensis Et1/99]
 gi|188027759|emb|CAO95616.1| Terminase, ATPase subunit [Erwinia tasmaniensis Et1/99]
          Length = 588

 Score = 50.9 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 26/143 (18%), Positives = 50/143 (34%), Gaps = 23/143 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-----------NREPCPDP 314
            E +  RY  +    +  +   F   D+ S   L ++++ +                P  
Sbjct: 348 LEQLRTRYSPED--YQNLLMCVF-MDDLASVFQLAMLQKCMVDSWEVWDDFEALALRPFG 404

Query: 315 YAPLIMGCDIAE--EGGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           +  + +G D A+  + GD+    V+      G     L    W   D R   + I  L +
Sbjct: 405 WKEVWIGYDPAKGTQNGDSAGCVVIAPPAVPGGKFRILERHQWRGMDFRAQADAIKTLTQ 464

Query: 366 KYRPDAIIIDANNTGARTCDYLE 388
           +Y    I ID+   G    + ++
Sbjct: 465 QYNVTYIGIDSTGVGLGVYENVK 487


>gi|219872451|ref|YP_002476937.1| phage terminase, large subunit, pbsx family [Borrelia garinii PBr]
 gi|219694305|gb|ACL34832.1| phage terminase, large subunit, pbsx family [Borrelia garinii PBr]
          Length = 450

 Score = 50.9 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 60/332 (18%), Positives = 108/332 (32%), Gaps = 51/332 (15%)

Query: 55  QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLM----STRP-GIS 109
           Q E +  +++H  + V      +F G I++    GKT L ++L++  +    S      +
Sbjct: 49  QKEVLFDIESHDYSKV------IFSGGIAS----GKTFLASYLLIKKLIENKSFYEKDTN 98

Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169
              + NS   L T    ++ K         +  +          +  +    L I     
Sbjct: 99  NFIIGNSIGLLMTNTIKQIEKICG------FLGIDYQKKKSGESFCKIAGLELNI----- 147

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229
                 Y  +  D+F          I  +EA+       L ++  L  R      I  +N
Sbjct: 148 ------YGGKNRDSFSKIRGGNSAIIYVNEATVIHKETLLEVIKRL--RKGKAIIIFDTN 199

Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQF 288
           P   +  F   F +  D +K +   T         F E     Y       +  V  G++
Sbjct: 200 PEGPTHFFKTDFIENKDVFKTYNFTTYDNPLNSADFIETQKKLY-KHLPAYKARVLYGEW 258

Query: 289 PQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVLRRGPVIEHLFD 347
              +   F      E   N++         IM  D A   GGDNT + +      E  + 
Sbjct: 259 ILNESTLF-----NEMIFNQDYEFKSP---IMYIDPAFSVGGDNTAICVLE-RAFEKFYA 309

Query: 348 WSKTDLRTTN-----NKISGLVEKYRPDAIII 374
           +   D +  +       I  L+E +  + + I
Sbjct: 310 YIYQDQKPVSDSLMLGSIQVLIENFNVNTVYI 341


>gi|329888629|ref|ZP_08267227.1| phage DNA packaging protein [Brevundimonas diminuta ATCC 11568]
 gi|328847185|gb|EGF96747.1| phage DNA packaging protein [Brevundimonas diminuta ATCC 11568]
          Length = 411

 Score = 50.9 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 43/312 (13%), Positives = 85/312 (27%), Gaps = 38/312 (12%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP---RR 232
           +   +P+             I +EA          +L  +     +  W    NP     
Sbjct: 106 WKGGKPEGIKSLEGAG--LTILEEAQEVRQASLDVLLPTILRTAISELW-AIWNPRLDTD 162

Query: 233 LSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD 292
               F+    KP     R +I+         +  E +   +  D         G +    
Sbjct: 163 PIDVFFRGPVKPKGAIVR-KINYDQNPHFPDALRELMELDFSKDKLRAAWIWLGGYMPSV 221

Query: 293 IDSFIPLNIIEEAL--NREPCPDPYAPLIMGCDIAEEGGDNTVVVL---RRGPVIEHLFD 347
             +      ++EA    R      +  +++G D +  G D  +VV      G +I     
Sbjct: 222 QGAIWNREGLDEAWREGRHAPEGSWGRVVVGVDPSGGGDDVGIVVAAEYGDGAIILEDAT 281

Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLE 407
              T         +  V+++  D ++ + N  G      L   G     V+     V   
Sbjct: 282 CPATSPMAWATATAKAVDRWGADCVVAEKNFGGDMVESTLRAGGVKARVVM-----VTAS 336

Query: 408 FCRNRRTE----LHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGA 463
             +  R E    L+ +    +              + +   +                G 
Sbjct: 337 RGKQVRAEPVAALYDQKR--IRHREQFPLMEAEMLMTTPAGYQ---------------GE 379

Query: 464 KSTDYSDGLMYT 475
            S +  D L++ 
Sbjct: 380 DSPNRMDALVWA 391


>gi|294010834|ref|YP_003544294.1| hypothetical protein SJA_C1-08480 [Sphingobium japonicum UT26S]
 gi|292674164|dbj|BAI95682.1| phage-related protein [Sphingobium japonicum UT26S]
          Length = 419

 Score = 50.9 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 64/428 (14%), Positives = 119/428 (27%), Gaps = 92/428 (21%)

Query: 89  GKTTLNAWLVLWLMST--RPGISVICLANSETQLKTTLWAEVSKWL-------SLLPNKH 139
           GK+   A L    M      G + + +  + T  +++LW +   W+       S  P + 
Sbjct: 17  GKS--IAILYYIFMRCLLYAGSTHLIVRRTRTACESSLWRQTLNWMLDHMADPSGAPLRE 74

Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
             ++ S  L    ++ +  +                  E R D  +G   T    I  +E
Sbjct: 75  KVKLNSSDL--IAYFDNGSYIMFD-----------GLDENRLDKVLG---TEYQTIWMNE 118

Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF-------YEIFNKPLDDWKRFQ 252
            S         + G L             +P              ++   +   DW    
Sbjct: 119 VSEFDWSDVQQLAGCLN-----------GSPTHNDNGLPIVRKMVFDCNPRFESDWDCKV 167

Query: 253 IDTRTVEGIDPSF--------------HEGIIARYGLDSDVTRVEVC-GQFPQQDIDSFI 297
                    +                  E  +A Y      TR     G +  Q+ ++  
Sbjct: 168 FRDGQNPVNNQPLNDVQKYGKVKVQNVDEEYLAIYANADPRTRARYLDGDWSAQNDNAIF 227

Query: 298 PLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNT------VVVLRRGPVIEHLFDWSKT 351
            L+  E              +++G D A +    +      V  + +           K 
Sbjct: 228 DLDNFERNRRFGIFAKDLERIVIGVDPASKSKKESDLTGIIVAGMLKDEAYILADLTGKY 287

Query: 352 DLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML--GYHVYRVLGQKRAVDLEFC 409
                  K++   + Y+ D+II++ NN G    + L        V +V   +  +     
Sbjct: 288 TPEQVAQKVTEAFDTYQADSIIVETNNGGDWIENGLRQYAPNLPVKQVTASRGKLT---- 343

Query: 410 RNRRTE--LHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTD 467
              R E    +   D +      N S L   +     +                 AKS D
Sbjct: 344 ---RAEPIALIYAQDKVHHVGH-NLSELETQM-----YEF---------GMERGAAKSPD 385

Query: 468 YSDGLMYT 475
             D L++ 
Sbjct: 386 RMDALVWA 393


>gi|329735579|gb|EGG71866.1| phage terminase, large subunit, PBSX family [Staphylococcus
           epidermidis VCU028]
          Length = 420

 Score = 50.9 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 45/338 (13%), Positives = 109/338 (32%), Gaps = 40/338 (11%)

Query: 56  LEFMEVVDAHCLNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114
           ++  E++  H  +             +   GRG GK++  + ++   +  R  ++ + + 
Sbjct: 4   IKLSELLPKHFHSLWKATKDRKKLNVVAKGGRGSGKSSDISIIIT-QLIMRYPMNAVVVR 62

Query: 115 NSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCR 174
            ++  L T+++ ++   +      H F+++                 +    +    + R
Sbjct: 63  KTDNTLATSVFEQIKWAIEEQKVSHLFKVKVS------------PMEITYVPRGNRIIFR 110

Query: 175 TYSEERPDTFVGHHNTY---GMAIINDEASG-TPDVINLGILGFL---TERNANRFWIMT 227
               + P+      ++     +  I + A   T D +       L    +      +  +
Sbjct: 111 GA--QNPERLKSLKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDDGLFYKFFFS 168

Query: 228 SNPRRLSGKF----YEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283
            NP +    +    YE   +P + +            I   F +   +    +    R E
Sbjct: 169 YNPPKRKQSWVNKKYETSFQPDNTFVHHS-TYLDNPFISKQFIQEAESTKERNELRYRWE 227

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRG 339
             G+         +P N ++     +   D +  +  G D      D    V     ++ 
Sbjct: 228 YMGEAIGS---GVVPFNNLQIETIPQEMIDGFDNIRNGLDFG-YADDPLAFVRWHYDKKK 283

Query: 340 PVIEHLFDWSKTDL--RTTNNKISGLVEKYRPDAIIID 375
            VI  + ++    +  R   N++     KY+ D I  D
Sbjct: 284 RVIYAIDEYYGVQISNRQYANEMWK--RKYQSDDIYAD 319


>gi|251782540|ref|YP_002996842.1| phage terminase [Streptococcus dysgalactiae subsp. equisimilis
           GGS_124]
 gi|242391169|dbj|BAH81628.1| phage terminase [Streptococcus dysgalactiae subsp. equisimilis
           GGS_124]
          Length = 476

 Score = 50.9 bits (120), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 53/347 (15%), Positives = 111/347 (31%), Gaps = 47/347 (13%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQ   +  + A   + +       +          GKT +   + LW +    G+ ++ 
Sbjct: 48  PWQENMLIPIMAVDEDGLWVHQKYGYAIPRRN----GKTEVVYIVELWAL--HKGLKILH 101

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
            A+  +      + +V K+L +     + + +    + A     +   + G   +  +  
Sbjct: 102 TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKASGAVIQFRT-- 155

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232
            RT +    + F          +I DEA          +   +T+ + N   IM   P  
Sbjct: 156 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 205

Query: 233 --LSGKFYEIFNKP-------LDDWKRFQI-------DTRTVEGIDPS-----FHEGIIA 271
              +G  +E + K           W  + +       D  +    +PS         I A
Sbjct: 206 MVSTGTVFEAYRKDCLKGNKRYSGWAEWSVPEMVKINDVSSWYIANPSMGFHLNERKIEA 265

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330
             G D     ++  G +P  +  S I      +  L  E  P+  + L +G    ++G +
Sbjct: 266 ELGEDEIDHNIQRLGYWPSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 323

Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
            +  +  R       +       +R     I   ++      ++ID 
Sbjct: 324 VSLSIAARTSENKVFVEVIDCLSVRNGTQWIINFLKSADIAKVVIDG 370


>gi|167584288|ref|ZP_02376676.1| bacteriophage terminase, ATPase subunit [Burkholderia ubonensis Bu]
          Length = 520

 Score = 50.5 bits (119), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 33/224 (14%), Positives = 54/224 (24%), Gaps = 47/224 (20%)

Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDDWKRFQIDTRTVEG 260
            +N       +++   +     S P  +S + Y     E +N+         +D      
Sbjct: 196 ELNKVAQAMASQKQWRKT--YFSTPSSISHQAYPFWSGEAYNRGRAKADHIHLDISHAAL 253

Query: 261 IDPSFHEGIIAR----------------------YGLDSDVTRVEVCGQFPQQDIDSFIP 298
                 E    R                          +D        QF      S   
Sbjct: 254 SGGRLCEDRQWRQIVTIEDAAAMGCDLFDLDELRLENSADDFAQLYLCQFIDDSA-SIFK 312

Query: 299 LNIIEEALNRE-----------PCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIE 343
              I+  +                P  + P+ +G D A  G    +V++      G    
Sbjct: 313 FADIQRCMIDSWEEWDDVEFLIQRPFGHRPVWLGYDPALSGDSAGLVIVAPPAVPGGKFR 372

Query: 344 HLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385
            L    W   D       I  L E+Y    + ID    G     
Sbjct: 373 VLEKMQWRGMDFEAQAESIRQLTERYTVTYMAIDTTGIGQGVYQ 416


>gi|300922729|ref|ZP_07138820.1| hypothetical protein HMPREF9548_00965 [Escherichia coli MS 182-1]
 gi|300420946|gb|EFK04257.1| hypothetical protein HMPREF9548_00965 [Escherichia coli MS 182-1]
          Length = 181

 Score = 50.5 bits (119), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 29/145 (20%), Positives = 47/145 (32%), Gaps = 19/145 (13%)

Query: 317 PLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPD 370
           P+ +G D +  G     VVL      G     L    W   D  T    I  L EKY  +
Sbjct: 1   PVWIGYDPSHRGDSAGCVVLAPPVVAGGKFRILERHQWKGMDFATQAESIRKLTEKYNVE 60

Query: 371 AIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI 430
            I IDA   G      +               A D+ +    +T + +K  D +     +
Sbjct: 61  YIGIDATGLGVGVFQLVR---------SFYPAARDIRYTPEMKTAMVLKAKDVIRRG-CL 110

Query: 431 NHSGLIQNLKS---LKSFIVPNTGE 452
            +     ++ S        + ++G 
Sbjct: 111 EYDVSATDITSSFMAIRKTMTSSGR 135


>gi|282534188|gb|ADA82296.1| putative terminase [Escherichia phage K1H]
 gi|282535239|gb|ADA82445.1| putative terminase [Escherichia phage K1ind3]
          Length = 416

 Score = 50.5 bits (119), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 65/413 (15%), Positives = 117/413 (28%), Gaps = 63/413 (15%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            G G GKT +    +L  M   PG  +     +   ++   +       +     +   +
Sbjct: 24  GGFGSGKTFVGCLDLLTFMLKHPGTRLGYFGPTYPAIRDIFYP------TFEEAANLLGL 77

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
             L         D         +   + +CR  S + P + VG       A + DE    
Sbjct: 78  DVLVKS-----GDKEVVVTRGKTVLGTVICR--SMDNPGSIVGF---KIAAAVVDELDVL 127

Query: 204 ----PDVINLGILG--FLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT-R 256
                ++    I+    L          +T+ P      + +    P   +   Q  T  
Sbjct: 128 SREKAELAWNKIVARMRLVIPGVTNHISVTTTPEGFKFVYAKFKENPTPSYSMVQASTHE 187

Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-----EPC 311
               + P +   +   Y   + +    + G+F      S      +  A +R     +  
Sbjct: 188 NARFLPPDYISSLTETY--PAQLINAYLNGEFVNLTSGS------VYYAYDRRKHRSKET 239

Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371
             P   L +G D       + V V R+             D   T   I+    K +   
Sbjct: 240 IQPGDTLYIGQDFNVTKNASAVYVQRKDGWHAVAELKGLFDTPDTVRVITEKW-KSQGHR 298

Query: 372 III--DANNTGART-------CDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422
           I++  DA+    +T          L+  G+ V          D     N           
Sbjct: 299 IVVYPDASGKNRKTNSASISDIALLQQAGFDVRAKSANPPVKDRVLAVN----------T 348

Query: 423 WLEFASLINHSGLIQNL-KSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMY 474
            LE   L  +  L   + K+L+     + G    E  +         +D L Y
Sbjct: 349 ALEKGKLWVNDHLCPEIAKTLEQQAYDDNG----EPAKDGIID--HMADALGY 395


>gi|84393331|ref|ZP_00992091.1| putative phage gene [Vibrio splendidus 12B01]
 gi|84376047|gb|EAP92935.1| putative phage gene [Vibrio splendidus 12B01]
          Length = 590

 Score = 50.5 bits (119), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 19/156 (12%), Positives = 49/156 (31%), Gaps = 18/156 (11%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSF----IPLNIIEEALNREPCPDPYAP---- 317
            + +   Y  +           F    +  F    +   +++ A  ++  P    P    
Sbjct: 353 IDELREEYSQED--FNNLFMCMFVDGALSVFKFSDLEKGMVDAAHWQDFKPKNKQPFARR 410

Query: 318 -LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPD 370
            + +G D +    +  +VV+      G     L    W   + +   ++I  + ++Y+  
Sbjct: 411 EVWLGYDPSRTRDNACLVVVAPPAVAGEKFRVLEKHYWKGLNFQYHVSEIDKVFQRYKVT 470

Query: 371 AIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAVD 405
            I +D    G    D + +      + +       +
Sbjct: 471 YIGVDTTGIGGGVWDLISKKYPREAHAIHYSNENKN 506


>gi|311993449|ref|YP_004010314.1| gp17 terminase DNA packaging enzyme large subunit [Acinetobacter
           phage Acj9]
 gi|295917406|gb|ADG60077.1| gp17 terminase DNA packaging enzyme large subunit [Acinetobacter
           phage Acj9]
          Length = 609

 Score = 50.5 bits (119), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 62/365 (16%), Positives = 113/365 (30%), Gaps = 62/365 (16%)

Query: 52  RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVI 111
           R +Q + ++++  + ++        V K +      +GKTT+ A  + W        ++ 
Sbjct: 141 RDYQKDMLKIMAENRMS--------VSKLSRQ----LGKTTVVAIFLAWFACFNKDKNIG 188

Query: 112 CLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYST 171
            LA+  +     + AEV   L             L      W    +    G     Y+ 
Sbjct: 189 ILAHKGS-----MSAEV---LDRTKQAIELLPDFLQPGIVEWNKGSIELDNGSAISAYAA 240

Query: 172 MCRTYSEERPDTFVGHHNTYGMAIINDEASGTP--DVINLGILGFLTERNANRFWIMTSN 229
                    PD   G        I  DE +  P  +   L I   ++    +   I+T+ 
Sbjct: 241 --------SPDAVRG---NSFSLIYIDECAFIPNFNDAWLAIQPVISS-GRHSKIIITTT 288

Query: 230 PRRLSGKFYEIFN----------KPLDDWKRFQIDTRTVEGI-DPSFHEGIIARYGLDSD 278
           P  ++  FY+I+               +W   +      + + D  +H       G   +
Sbjct: 289 PNGMN-HFYDIWTAAVEGISGFVPYESEWNAVKERLYDDKDVFDDGWHFSFTTIGGSSVE 347

Query: 279 VTRVEVCGQFPQQDIDSFI-----------PLNIIEEALNREPCPDPYAPLIMGCDIAEE 327
             R E  G F        +           P+N+ +    +   PDP    I   D AE 
Sbjct: 348 QFRQEHVGVFAGGQGTLSLRHETCNLKLETPINVGDSTFYKYKEPDPTRKYIATLDSAEG 407

Query: 328 -GGDNTVVVLRRGPVIE----HLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGAR 382
            G D   + +      +     +   +        + I   +  Y    I I+ N+TG  
Sbjct: 408 RGQDYHCMNIIDVTDEQWEQVAVLHSNTISHLILPDIILKHLLDYNEAPIYIELNSTGVS 467

Query: 383 TCDYL 387
               L
Sbjct: 468 IAKTL 472


>gi|223935635|ref|ZP_03627551.1| protein of unknown function DUF264 [bacterium Ellin514]
 gi|223895643|gb|EEF62088.1| protein of unknown function DUF264 [bacterium Ellin514]
          Length = 437

 Score = 50.5 bits (119), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 57/348 (16%), Positives = 99/348 (28%), Gaps = 54/348 (15%)

Query: 137 NKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS-EERPDTFVGHHNTYGMAI 195
            K W     ++   A                 + T  R YS    P+   G        +
Sbjct: 71  CKEWARTMQIAEPDADEVVFDSKTDFSAHVLQFKTGLRIYSLSSNPNALAGKRGH----V 126

Query: 196 INDEASGTPDV--INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN--KPLDD---W 248
           I DE +   D   +        T         + S  R  +  F E+    K   +   W
Sbjct: 127 ILDEFALHADQRLLYRIAKPVTTWGGQ---LEIISTHRGANSVFNEMIRGIKENGNKMGW 183

Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDS--DVTRVEVCGQ-------------FPQQDI 293
              ++       I     E I A+ G +   +     V  +              P  + 
Sbjct: 184 SHHKVTLHDA--IAEGLVERINAKTGRNESREAYLARVESECLDQEQWLQEYCCVPADET 241

Query: 294 DSFIPLNIIEEALNREPCPDPY-----APLIMGCDIAEEGGDNTV--VVLRRGPVI--EH 344
            +FI  ++I    +       Y      PL +G D+  +  D TV  V  + G VI    
Sbjct: 242 SAFITYDMISGCEDDCLKDFNYLAECKNPLYLGVDVGRK-RDLTVMDVGEKIGDVIWDRL 300

Query: 345 LFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML-GYHVYRVLGQKRA 403
             +           ++  L+   +     IDA   G +  +      G+ V  V+     
Sbjct: 301 RIEMQGRTFAEQEFELERLLALPKLRRACIDATGIGMQLAERARERFGWKVEPVMFTAP- 359

Query: 404 VDLEFCRNRRTELHVKMADWLE--FASLINHSGLIQNLKSLKSFIVPN 449
                    + EL   +    E     +     L  +L+ +K  I  +
Sbjct: 360 --------MKEELAFPLRGAFEDRTLRIARDPQLRADLRGIKKEITTS 399


>gi|83950455|ref|ZP_00959188.1| Putative large terminase [Roseovarius nubinhibens ISM]
 gi|83838354|gb|EAP77650.1| Putative large terminase [Roseovarius nubinhibens ISM]
          Length = 434

 Score = 50.5 bits (119), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 58/427 (13%), Positives = 117/427 (27%), Gaps = 75/427 (17%)

Query: 81  AISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTT-LWAEVSK 130
           AI  GRG GKT   A    W+ +   G           +  +  +  Q++   ++ E S 
Sbjct: 41  AIMGGRGAGKTRAGA---EWVRAAVEGATPGAPGRCRRIALVGETVDQVREVMIFGE-SG 96

Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT 190
            L+  P     E Q+                              +S   P+   G    
Sbjct: 97  ILACSPPDRRPEWQASR---------------RRLVWPNGAEAAVFSAHDPEALRGPQFD 141

Query: 191 YGMAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDW 248
                  DE +           +   L   +  +  I  +   R  G   ++   P    
Sbjct: 142 GA---WLDEMAKWKKARATWDMLQFALRLGDDPQ--ICVTTTPRNVGVLKDVLAAPSTV- 195

Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308
                       +  SF   + ARY + + + R E+ G   ++   +   L  ++ A  R
Sbjct: 196 VTQAPTEANRAHLAESFLAEVRARY-VGTRLGRQELDGILLEEAEGALWSLAALDAA--R 252

Query: 309 EPCPDPYAPLIMGCDI---AEEGGDNTVVVLRRGPVIEHLFDWS----------KTDLRT 355
                  + +++  D       G D   +++        + +W                 
Sbjct: 253 VSKLPELSRIVVAVDPPVTGHAGSDECGIIVAGVDQSGPVQEWRAYVLADRSVSGMSPTG 312

Query: 356 TNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRR 413
                   +E++  D ++ + N  G      L  +        V            +  R
Sbjct: 313 WAGAAIRAMEEFGADKMVAEVNQGGDLVETVLRQIDPLIPFRGVHAS-------RGKQAR 365

Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLM 473
            E    + +      +     L   ++++ +      G             S D +D L+
Sbjct: 366 AEPVAALYEQGRVHHMAGLDRLEDQMRAMTTRGYEGRG-------------SPDRADALV 412

Query: 474 YTFAENP 480
           +   E  
Sbjct: 413 WALHELV 419


>gi|284921252|emb|CBG34318.1| phage protein [Escherichia coli 042]
 gi|323942326|gb|EGB38497.1| terminase [Escherichia coli E482]
          Length = 433

 Score = 50.5 bits (119), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 69/417 (16%), Positives = 123/417 (29%), Gaps = 62/417 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140
           AG G GKT +    +   +   PGI+    A +  Q++   +    EV+    L    + 
Sbjct: 25  AGFGSGKTWVGCGGICKGIWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 84

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
              +    +   +    +              CR  S E+P T VG      +    DE 
Sbjct: 85  GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 125

Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248
              P          I+  +  + +  R  I  +         YE F K +         +
Sbjct: 126 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 185

Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
              Q  T   E  +   +   ++  Y    ++ +  + GQF      +        +  N
Sbjct: 186 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLRGQFTNLTSGTVYH-QFDRKLNN 242

Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL---- 363
            E    P  P+ +G D         V VLR G         +  D       I       
Sbjct: 243 CEEVEQPGEPIYIGMDFNVGKMAGIVHVLRLGLPCAVTEIINAYDTPDMIRIIKERFWLY 302

Query: 364 ----VEKYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412
                 K R   I  DA+      + A T D   L+  G++V  V+        +   + 
Sbjct: 303 DGNDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 360

Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS 469
              +         +   +    +     SL+  +  + G    E  +  G    + +
Sbjct: 361 NA-MFCNANGERRYKVNVKRCPVYAE--SLEQQVWDDKG----EPDKKSGNDHPNDA 410


>gi|117623621|ref|YP_852534.1| putative phage terminase [Escherichia coli APEC O1]
 gi|331672908|ref|ZP_08373694.1| putative phage terminase [Escherichia coli TA280]
 gi|115512745|gb|ABJ00820.1| putative phage terminase [Escherichia coli APEC O1]
 gi|309701027|emb|CBJ00325.1| phage protein [Escherichia coli ETEC H10407]
 gi|331070129|gb|EGI41498.1| putative phage terminase [Escherichia coli TA280]
          Length = 436

 Score = 50.5 bits (119), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 69/417 (16%), Positives = 123/417 (29%), Gaps = 62/417 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140
           AG G GKT +    +   +   PGI+    A +  Q++   +    EV+    L    + 
Sbjct: 28  AGFGSGKTWVGCGGICKGIWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 87

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
              +    +   +    +              CR  S E+P T VG      +    DE 
Sbjct: 88  GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 128

Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248
              P          I+  +  + +  R  I  +         YE F K +         +
Sbjct: 129 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 188

Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
              Q  T   E  +   +   ++  Y    ++ +  + GQF      +        +  N
Sbjct: 189 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLRGQFTNLTSGTVYH-QFDRKLNN 245

Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL---- 363
            E    P  P+ +G D         V VLR G         +  D       I       
Sbjct: 246 CEEVEQPGEPIYIGMDFNVGKMAGIVHVLRLGLPCAVTEIINAYDTPDMIRIIKERFWLY 305

Query: 364 ----VEKYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412
                 K R   I  DA+      + A T D   L+  G++V  V+        +   + 
Sbjct: 306 DGNDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 363

Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYS 469
              +         +   +    +     SL+  +  + G    E  +  G    + +
Sbjct: 364 NA-MFCNANGERRYKVNVKRCPVYAE--SLEQQVWDDKG----EPDKKSGNDHPNDA 413


>gi|319896520|ref|YP_004134713.1| phage terminase atpase subunit protein [Haemophilus influenzae
           F3031]
 gi|317432022|emb|CBY80370.1| Probable phage terminase ATPase subunit protein [Haemophilus
           influenzae F3031]
          Length = 556

 Score = 50.5 bits (119), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 28/194 (14%), Positives = 59/194 (30%), Gaps = 25/194 (12%)

Query: 265 FHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN---------REPCPDPY 315
             E +  RY              +       F    +++  ++         +   P   
Sbjct: 367 NIEKLKQRYSKY--AFNQLYMCVWIDDADSIFTVHQLLKCGVDISKWKDFNPKADRPFGD 424

Query: 316 APLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369
             +  G D A  G   + V++           +   + W         N+I  L EKY  
Sbjct: 425 REVWGGFDPAHSGDGASFVIIAPPALPSEKYRVLARYQWQGLSYVYQANQIRALYEKYNM 484

Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL 429
             I IDA   G    + ++           ++ A  + +    +T + +K+ D +E   +
Sbjct: 485 TYIGIDATGVGYGVYELVKE--------FARRAATAIIYNPESKTGMVLKVHDLVEHGQI 536

Query: 430 INHSGLIQNLKSLK 443
                 +  L + +
Sbjct: 537 EWSEKELDILSTNQ 550


>gi|282533135|gb|ADA82244.1| putative terminase [Escherichia phage K1G]
          Length = 416

 Score = 50.1 bits (118), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 65/413 (15%), Positives = 117/413 (28%), Gaps = 63/413 (15%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            G G GKT +    +L  M   PG  +     +   ++   +       +     +   +
Sbjct: 24  GGFGSGKTFVGCLDLLTFMLKHPGTRLGYFGPTYPAIRDIFYP------TFEEAANLLGL 77

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
             L         D         +   + +CR  S + P + VG       A + DE    
Sbjct: 78  DVLVKS-----GDKEVVVTRGKTVLGTVICR--SMDNPGSIVGF---KIAAAVVDELDVL 127

Query: 204 ----PDVINLGILG--FLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT-R 256
                ++    I+    L          +T+ P      + +    P   +   Q  T  
Sbjct: 128 SREKAELAWNKIVARMRLVIPGVINHISVTTTPEGFKFVYAKFKENPTPSYSMVQASTHE 187

Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-----EPC 311
               + P +   +   Y   + +    + G+F      S      +  A +R     +  
Sbjct: 188 NARFLPPDYISSLTETY--PAQLINAYLNGEFVNLTSGS------VYYAYDRRKHRSKEV 239

Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371
             P   L +G D       + V V R+             D   T   I+    K +   
Sbjct: 240 IQPGDTLYIGQDFNVTKNASAVYVQRKDGWHAVAELKGLFDTPDTVRVITEKW-KSQGHR 298

Query: 372 III--DANNTGART-------CDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422
           I++  DA+    +T          L+  G+ V          D     N           
Sbjct: 299 IVVYPDASGKNRKTNSASISDIALLQQAGFDVRAKSANPPVKDRVLAVN----------T 348

Query: 423 WLEFASLINHSGLIQNL-KSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMY 474
            LE   L  +  L   + K+L+     + G    E  +         +D L Y
Sbjct: 349 ALEKGKLWVNDHLCPEIAKTLEQQAYDDNG----EPAKDGIID--HMADALGY 395


>gi|282547341|gb|ADA82397.1| putative terminase [Escherichia phage K1ind2]
          Length = 416

 Score = 50.1 bits (118), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 65/413 (15%), Positives = 117/413 (28%), Gaps = 63/413 (15%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            G G GKT +    +L  M   PG  +     +   ++   +       +     +   +
Sbjct: 24  GGFGSGKTFVGCLDLLTFMLKHPGTRLGYFGPTYPAIRDIFYP------TFEEAANLLGL 77

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
             L         D         +   + +CR  S + P + VG       A + DE    
Sbjct: 78  DVLVKS-----GDKEVVVTRGKTVIGTVICR--SMDNPGSIVGF---KIAAAVVDELDVL 127

Query: 204 ----PDVINLGILG--FLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT-R 256
                ++    I+    L          +T+ P      + +    P   +   Q  T  
Sbjct: 128 SREKAELAWNKIVARMRLVIPGVINHISVTTTPEGFKFVYAKFKENPTPSYSMVQASTHE 187

Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-----EPC 311
               + P +   +   Y   + +    + G+F      S      +  A +R     +  
Sbjct: 188 NARFLPPDYISSLTETY--PAQLINAYLNGEFVNLTSGS------VYYAYDRRKHRSKET 239

Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371
             P   L +G D       + V V R+             D   T   I+    K +   
Sbjct: 240 IQPGDTLYIGQDFNVTKNASAVYVQRKDGWHAVAELKGLFDTPDTVRVITEKW-KSQGHR 298

Query: 372 III--DANNTGART-------CDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422
           I++  DA+    +T          L+  G+ V          D     N           
Sbjct: 299 IVVYPDASGKNRKTNSASISDIALLQQAGFDVRAKSANPPVKDRVLAVN----------T 348

Query: 423 WLEFASLINHSGLIQNL-KSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMY 474
            LE   L  +  L   + K+L+     + G    E  +         +D L Y
Sbjct: 349 ALEKGKLWVNDHLCPEIAKTLEQQAYDDNG----EPAKDGIID--HMADALGY 395


>gi|332533822|ref|ZP_08409678.1| phage terminase, ATPase subunit [Pseudoalteromonas haloplanktis
           ANT/505]
 gi|332036753|gb|EGI73216.1| phage terminase, ATPase subunit [Pseudoalteromonas haloplanktis
           ANT/505]
          Length = 382

 Score = 50.1 bits (118), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 19/80 (23%), Positives = 30/80 (37%), Gaps = 6/80 (7%)

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPV--IEHLFDWSKTDLRTTNNKISGL 363
             P    P+++G D A  G   +V +    ++ G    +    D S  D     ++I  L
Sbjct: 198 ERPYGLKPVVIGFDPARFGDKASVAILSAPMKPGEKFLLLEAIDLSGNDFEAMASEIKLL 257

Query: 364 VEKYRPDAIIIDANNTGART 383
            EKY    I +D    G   
Sbjct: 258 TEKYNVVHIGVDTTGIGYGV 277


>gi|220903520|ref|YP_002478832.1| hypothetical protein Ddes_0239 [Desulfovibrio desulfuricans subsp.
           desulfuricans str. ATCC 27774]
 gi|219867819|gb|ACL48154.1| protein of unknown function DUF264 [Desulfovibrio desulfuricans
           subsp. desulfuricans str. ATCC 27774]
          Length = 615

 Score = 50.1 bits (118), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 29/155 (18%), Positives = 50/155 (32%), Gaps = 18/155 (11%)

Query: 249 KRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN- 307
           +   +D     G +      +   Y  +    R     +F    +  F+ L ++E  +  
Sbjct: 361 RIVTLDDAEAGGCNLFNRADLEQEYSPED--MRQLFGCEFIDDTLAVFL-LGLLEGCMED 417

Query: 308 --------REPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDL 353
                   R+  P   A +  G D +    D + VVL    + G  I  L    W     
Sbjct: 418 PDGWGIDLRQARPVDNAGVWGGYDPSRTRDDASFVVLLPPQKAGDKIRTLERHTWKGKSY 477

Query: 354 RTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
                +I  L +KYR   + ID    G    + + 
Sbjct: 478 LWQVGRIRELHDKYRFQHMGIDVTGPGQAVLENVR 512


>gi|294624257|ref|ZP_06702968.1| phage-related terminase [Xanthomonas fuscans subsp. aurantifolii
           str. ICPB 11122]
 gi|292601451|gb|EFF45477.1| phage-related terminase [Xanthomonas fuscans subsp. aurantifolii
           str. ICPB 11122]
          Length = 587

 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 25/142 (17%), Positives = 44/142 (30%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR------------EPCPD 313
            + +   Y    D     +  +F      S  PL +++  +               P P 
Sbjct: 349 IDELREEY--SPDAFANLLMCEFVDDGA-SIFPLAMLQPCMVDSWIEWGQDYKPFAPRPY 405

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367
               + +G D AE G    +VVL    + G     L    +   D      +I  +  +Y
Sbjct: 406 GDRAVWIGYDPAETGDTAGLVVLAPPQQPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 465

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G+     ++ 
Sbjct: 466 WVTYIGIDTTGMGSGVAQLVKQ 487


>gi|327198086|ref|YP_004306453.1| gp41 [Burkholderia phage KL3]
 gi|310657220|gb|ADP02334.1| gp41 [Burkholderia phage KL3]
          Length = 611

 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/142 (16%), Positives = 38/142 (26%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            E +  RY  +       +  QF    + S   L  ++  +                 P 
Sbjct: 372 LERLRRRYSAE--AFANLLMCQFIDDSV-SVFKLAELQRCMVDSWEEWADDFSPLLLRPF 428

Query: 314 PYAPLIMGCDIAEEGGDN--TVVVLRR----GPVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            Y  + +G D A  G      VV   R       +     +   D       I  + ++Y
Sbjct: 429 GYREVWVGYDPALTGDSAGLVVVAPPRVEGGTFRVLERHQFRGNDFEEQAAAIEQITQRY 488

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 489 NVGYIAIDTTGMGQGVYQLVRK 510


>gi|320535831|ref|ZP_08035911.1| conserved domain protein [Treponema phagedenis F0421]
 gi|320147321|gb|EFW38857.1| conserved domain protein [Treponema phagedenis F0421]
          Length = 488

 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 57/356 (16%), Positives = 101/356 (28%), Gaps = 92/356 (25%)

Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSN-PRRLSGKFYEIF--NKPLDDWKRF 251
           I+ DE +  P+     I        A    +   + P    G+FY++F   K    + R+
Sbjct: 99  IVFDEMAIYPENKAEVIYTAGIPVTARGGCVEIGSTPLGKIGRFYDVFIDKKKYRTYNRY 158

Query: 252 QID-----------TRTVEGIDPSFHEGIIARYGLDSDV----------TRVEVCGQFPQ 290
            I               V        E  + RYG    +           + E    F  
Sbjct: 159 TIPWWFSAALCTNVEEAVRNAPAMDTEERVYRYGTPPLIEAFEAMLLEDFQQEFECTFID 218

Query: 291 QDIDSFIPLNIIEE------ALNREPCP-------------------------------- 312
               SFI L++I        A +R                                    
Sbjct: 219 -SALSFITLDLIYANTPGMRAEDRTEEIRGGNIEDADIEDEKDLEIKIFRTSDELCAGYS 277

Query: 313 -DPYAPLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVE 365
            + +  L +G D+A    D  V+ +      ++  V E      K + +   ++I  ++ 
Sbjct: 278 REEHGALYLGYDVARY-RDAAVIYVLGVVDGKKKCVAEIEMKNKKFEYQR--DEIRKIMR 334

Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTE-LHVKMADWL 424
           +       ID    G  T + L+            +  ++         E L + +   L
Sbjct: 335 QLPVVRGCIDRTGQGLDTTETLQKE--------FGESKLEGIDFTTPAKEVLAMGVRTGL 386

Query: 425 EFAS--LINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSD---GLMYT 475
           E     L N     + + S+K  I    G    +S R K      ++D        
Sbjct: 387 EKREFLLPNDQKFRKQIHSIKR-IPSAGGSFRYDSTRDKDG----HADSFWAFALA 437


>gi|327198304|ref|YP_004306879.1| gp35 [Burkholderia phage KS14]
 gi|310657267|gb|ADP02380.1| gp35 [Burkholderia phage KS14]
          Length = 604

 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/142 (16%), Positives = 42/142 (29%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y          +  QF      S  PL  ++  +               P P 
Sbjct: 366 IDELRLEYSAQE--YANLLMCQFIDDTA-SIFPLAELQRCMVDSWEEWADDFKPLAPRPF 422

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD--WSKTDLRTTNNKISGLVEKY 367
            + P+ +G D A  G    +VV+      G     L    +   D       I  + ++Y
Sbjct: 423 GFRPVWVGYDPALSGDSAGLVVVAPPAVPGGKFRVLHKCQFRGMDFEGQAEAIRQITQQY 482

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
             + + ID    G      ++ 
Sbjct: 483 NVEYMSIDTTGIGQGVYQLVKQ 504


>gi|21243371|ref|NP_642953.1| phage-related terminase [Xanthomonas axonopodis pv. citri str. 306]
 gi|21108918|gb|AAM37489.1| phage-related terminase [Xanthomonas axonopodis pv. citri str. 306]
          Length = 594

 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 26/142 (18%), Positives = 45/142 (31%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-------NREPCPDPYAP- 317
            E +   Y    D     +   F      S  PL +++  +        ++  P    P 
Sbjct: 356 IEELREEY--SPDAFANLLMCDFVDDGA-SIFPLAMLQPCMVDSWVEWGQDYKPFAARPY 412

Query: 318 ----LIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367
               + +G D AE G    +VVL    + G     L    +   D      +I  +  +Y
Sbjct: 413 GDRAVWIGYDPAETGDTAGLVVLAPPQQPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 472

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G+     ++ 
Sbjct: 473 WVTYIGIDTTGMGSGVAQLVKQ 494


>gi|317120885|ref|YP_004100888.1| hypothetical protein Tmar_0036 [Thermaerobacter marianensis DSM
           12885]
 gi|315590865|gb|ADU50161.1| hypothetical protein Tmar_0036 [Thermaerobacter marianensis DSM
           12885]
          Length = 410

 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 80/408 (19%), Positives = 129/408 (31%), Gaps = 56/408 (13%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141
           I AGRG GKT   A  V   +       +  +  +   ++  +    S  LS+ P     
Sbjct: 36  ILAGRGFGKTRTGAEWVREQVERHGRRRIAIVGRTAADVRDVMVEGESGILSISP----- 90

Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE-- 199
                     PW+  V   S    +     +   YS + PD   G  +    A   DE  
Sbjct: 91  ----------PWFRPVYEPSKRRLTWPNGAIATLYSADEPDLLRGPQHD---AAWADELA 137

Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE 259
           A   P+  +  + G     +       T  P +L     ++ N P     R         
Sbjct: 138 AWRRPEAWDNLMFGLRLGPDPRVVVTTTPRPVKL---IRDLLNDPTCVVTRGS-TYENAA 193

Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI 319
            + P+F E II+RY   + + R E+ G+       +      I+E   RE        ++
Sbjct: 194 NLAPAFLEQIISRY-EGTRLGRQELYGEVLDDVPGALWQRKRIDELRVREAP--ELVRVV 250

Query: 320 MGCDIA---EEGGDNT-VVVLRRG--PVIEHLFDWS-KTDLRTTNNKISGLVEKYRPDAI 372
           +  D A   EEG D T +VV  RG       L D S +        +       +  D I
Sbjct: 251 VAIDPAVTSEEGSDETGIVVAGRGVDGDAYVLADRSCRMSPDGWARRAVKAYYDFDGDRI 310

Query: 373 I--IDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI 430
           +  ++           L              +AV     +  R E    + +  +   + 
Sbjct: 311 VGEVNNGG-------DLVETVIRTVDPKVPYKAVRASRGKAVRAEPVAALYEQGKVHHVG 363

Query: 431 NHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478
               L   L               I     +GA S D +D L++   E
Sbjct: 364 TFDHLEDQLC-------------QITPDGYQGAGSPDRADALVWALTE 398


>gi|238920988|ref|YP_002934503.1| hypothetical protein NT01EI_3118 [Edwardsiella ictaluri 93-146]
 gi|238870557|gb|ACR70268.1| conserved hypothetical protein [Edwardsiella ictaluri 93-146]
          Length = 595

 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 28/162 (17%), Positives = 49/162 (30%), Gaps = 21/162 (12%)

Query: 246 DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
             W++   I+    +G D    + +   Y    +        QF      S  PL++++ 
Sbjct: 334 GQWRQIVTIEDAIRQGYDLFDIDQLRLEY--SPEEFANLFMCQFIDDTE-SVFPLSLLQG 390

Query: 305 AL-NREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD-- 347
            + +     D Y P          + +G D A  G     VV+      G     +    
Sbjct: 391 CMVDSWAVWDDYKPFALRPLGERSVWVGYDPALTGDSAGCVVVAPPVVEGGKFRVIEKHQ 450

Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
           W   D       I  +  +Y    I ID    G      ++ 
Sbjct: 451 WHGMDFAAQAENIRKITGRYNVTYIGIDVTGIGHGVHQLVKQ 492


>gi|251783038|ref|YP_002997341.1| terminase large subunit [Streptococcus dysgalactiae subsp.
           equisimilis GGS_124]
 gi|242391668|dbj|BAH82127.1| terminase large subunit [Streptococcus dysgalactiae subsp.
           equisimilis GGS_124]
          Length = 424

 Score = 49.3 bits (116), Expect = 0.001,   Method: Composition-based stats.
 Identities = 51/343 (14%), Positives = 101/343 (29%), Gaps = 50/343 (14%)

Query: 57  EFMEVVDAHCLNSVNNP-NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLAN 115
           +  +++       V    NP++   A   GRG GK++  A+++  L+   P ++ +C+  
Sbjct: 4   DLADIIPIGFRPVVQATWNPKILNIACKGGRGSGKSSNIAFIISRLIIQYP-VNAVCIRK 62

Query: 116 SETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT 175
           ++  L+ +++ ++ KW            +    + +P     +     I  +        
Sbjct: 63  TDNTLEQSVYEQI-KW----AISEQGLERYFKFNKSPLRITYIPRGNYIVFRG------- 110

Query: 176 YSEERPDTFVGHHNTYGMAII-----------NDEASGTPDVINLGILGFLTERNANRFW 224
              + P+      ++     I            DE     + +  G LG          +
Sbjct: 111 --AQNPERIKSLKDSRFPFAIGWIEELAEFKTEDEVKTITNSLLRGELG----DGLFYKF 164

Query: 225 IMTSNPRRLSGKF----YEIFNKPLDDWKRFQIDTR-TVEGIDPSFHEGIIARYGLDSDV 279
             T NP +    +    YE   +P + +      T      I   F     A        
Sbjct: 165 FYTYNPPKRKQSWVNKKYESQFQPKNTF--VHASTYKDNPFIAKEFIAEAEATRERSERR 222

Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRR- 338
            R E  G+         +P + +      +     +  +    D      D    V    
Sbjct: 223 YRWEYLGEAIGS---GVVPFDNLRFETIPDELYRSFDNIRNAVDFG-YATDPLAFVRWHY 278

Query: 339 -----GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
                G          K   R   N +    + Y  D I  DA
Sbjct: 279 DKKHNGIYAIDELYGQKISNRQLANWLKD--KSYSNDEIFADA 319


>gi|262273310|ref|ZP_06051125.1| terminase [Grimontia hollisae CIP 101886]
 gi|262222683|gb|EEY73993.1| terminase [Grimontia hollisae CIP 101886]
          Length = 594

 Score = 49.3 bits (116), Expect = 0.001,   Method: Composition-based stats.
 Identities = 33/251 (13%), Positives = 65/251 (25%), Gaps = 52/251 (20%)

Query: 180 RPDTFVGHHNTYGMAIINDEASGTP--DVINLGILGFLTERNANRFWIMTSN-------P 230
              T  G H      +  DE    P  D +N       T +   + +  T +       P
Sbjct: 245 NSKTAQGFHGH----VYVDEYFWIPKFDELNKLASAMATHKTWRKTYFSTPSSKTHQAYP 300

Query: 231 RRLSGKF--------------YEIFNK-----PLDDWK-RFQIDTRTVEGIDPSFHEGII 270
                 +              +E F       P   W+    ++     G D    + + 
Sbjct: 301 FWTGDTWRGNANTREHVEFPTFEDFRNGGALCPDKHWRYVVTLEDAAAGGCDLFDIDELR 360

Query: 271 ARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP----------LIM 320
             Y  +           F   +  S    + +E+ +        + P          + +
Sbjct: 361 DEYSKND--FANLFMCVFVDGNA-SVFTFSKLEKCMVDASKWKDFKPDAARPYANQEVWL 417

Query: 321 GCDIAEEGGDNT--VVVLRRG----PVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIII 374
           G D +    +    V+   +       +     W   + +   ++I     +Y    I I
Sbjct: 418 GYDPSRTRDNACLVVIAPPQTHAEVFRVLEKHYWKGLNFQYQASQIDEAFHRYHVTYIGI 477

Query: 375 DANNTGARTCD 385
           D    G    D
Sbjct: 478 DTTGVGYGVWD 488


>gi|282547289|gb|ADA82346.1| putative terminase [Escherichia phage K1ind1]
          Length = 416

 Score = 49.3 bits (116), Expect = 0.001,   Method: Composition-based stats.
 Identities = 65/413 (15%), Positives = 117/413 (28%), Gaps = 63/413 (15%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            G G GKT +    +L  M   PG  +     +   ++   +       +     +   +
Sbjct: 24  GGFGSGKTFVGCLDLLTFMLKHPGTRLGYFGPTYPAIRDIFYP------TFEEAANLLGL 77

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
             L         D         +   + +CR  S + P + VG       A + DE    
Sbjct: 78  DVLVKS-----GDKEVVVTCGKTVLGTVICR--SMDNPGSIVGF---KIAAAVVDELDVL 127

Query: 204 ----PDVINLGILG--FLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT-R 256
                ++    I+    L          +T+ P      + +    P   +   Q  T  
Sbjct: 128 SREKAELAWNKIVARMRLVIPGVTNHISVTTTPEGFKFVYAKFKENPTPSYSMVQASTHE 187

Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-----EPC 311
               + P +   +   Y   + +    + G+F      S      +  A +R     +  
Sbjct: 188 NARFLPPDYISSLTETY--PAQLINAYLNGEFVNLTSGS------VYYAYDRRKHRSKEV 239

Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371
             P   L +G D       + V V R+             D   T   I+    K +   
Sbjct: 240 IQPGDTLYIGQDFNVTKNASAVYVQRKDGWHAVAELKGLFDTPDTVRVITEKW-KSQGHR 298

Query: 372 III--DANNTGART-------CDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422
           I++  DA+    +T          L+  G+ V          D     N           
Sbjct: 299 IVVYPDASGKNRKTNSASISDIALLQQAGFDVRAKSANPPVKDRVLAVN----------T 348

Query: 423 WLEFASLINHSGLIQNL-KSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMY 474
            LE   L  +  L   + K+L+     + G    E  +         +D L Y
Sbjct: 349 ALEKGKLWVNDHLCPEIAKTLEQQAYDDNG----EPAKDGIID--HMADALGY 395


>gi|221316874|ref|YP_002527821.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           72a]
 gi|226246930|ref|YP_002776267.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           29805]
 gi|221237339|gb|ACM10180.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           72a]
 gi|226201508|gb|ACO38105.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           29805]
          Length = 450

 Score = 49.3 bits (116), Expect = 0.001,   Method: Composition-based stats.
 Identities = 33/163 (20%), Positives = 54/163 (33%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   ++ +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNVETFKTYNFTTYDNVFLSKGFIETQEKLY-KDIPAYKARVLLGEWLASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI E+ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITEDYMFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|169796738|ref|YP_001714531.1| putative phage terminase [Acinetobacter baumannii AYE]
 gi|169149665|emb|CAM87555.1| conserved hypothetical protein; putative phage terminase
           [Acinetobacter baumannii AYE]
          Length = 437

 Score = 49.3 bits (116), Expect = 0.001,   Method: Composition-based stats.
 Identities = 58/349 (16%), Positives = 99/349 (28%), Gaps = 55/349 (15%)

Query: 81  AISAGRGIGKTTLNAWLV---LWLMSTRPGISVIC--LANSETQLKTTLWAEVSKWLSLL 135
           A  AG G GKT +    +    W         V     A +  Q++   +  +       
Sbjct: 31  AFVAGFGSGKTWVGCSSLCNKAW-----EFPKVPLGYFAPTYPQIRDIFFPTI-----EE 80

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
               W     +         +          + Y T     S E+P T VG    + +  
Sbjct: 81  VAFDWGLKTKVY--------ETNKEVDIYYGRQYRTTIICRSMEKPATIVGFKIGHAL-- 130

Query: 196 INDE----ASGTPDVINLGILGFLTERNANRF-WIMTSNPRRLSGKFYEIFNKP------ 244
             DE    A          I+  +  + A+    I  +         YE F K       
Sbjct: 131 -IDELDVMAKVKAQQAWRKIIARMRYKQASLLNGIDVATTPEGFKFTYEQFVKEANKSEA 189

Query: 245 -LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
               +   Q  T   E  +   +   +   Y     +    + GQF      +  P +  
Sbjct: 190 KRKLYGMIQASTYDNEANLPDDYISSLYESY--PPQLISAYLRGQFVNLTSGAVYP-DFD 246

Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG 362
               + +       PL++G D         V V+R G     L +     +R T      
Sbjct: 247 RVLNHTDEEIKQGEPLLIGMDFNVLKMAAVVYVIREG-KPRALDELVG--VRDTPTMCYL 303

Query: 363 LVEKYRPDAIIIDANNTGARTCDY---------LEMLGYHVYRVLGQKR 402
           + E++    I +  + +G  T            L+  G+ V  V G   
Sbjct: 304 IKERFPDHDITVIPDASGQATSSKGFSESDHAILKKNGFKV-EVNGVNP 351


>gi|323137496|ref|ZP_08072573.1| hypothetical protein Met49242DRAFT_1961 [Methylocystis sp. ATCC
           49242]
 gi|322397122|gb|EFX99646.1| hypothetical protein Met49242DRAFT_1961 [Methylocystis sp. ATCC
           49242]
          Length = 323

 Score = 49.3 bits (116), Expect = 0.001,   Method: Composition-based stats.
 Identities = 58/284 (20%), Positives = 96/284 (33%), Gaps = 41/284 (14%)

Query: 59  MEVVDAHCLNSVNNPNPEVFKG----AISAGRGIGKTTLNAWLVLWLMSTRPG------- 107
           M   +A    SV + +P   +     A+  GR  GK ++ + +V W  +   G       
Sbjct: 38  MTEAEADFFRSVADRDPPSRRARELWAV-CGRRAGKDSIASAIVTWSAAMFDGADRLRPG 96

Query: 108 --ISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGID 165
                +CLA  + Q +  L              ++ E++ L         D L  S G+D
Sbjct: 97  ERALCLCLACDKDQARIVL---------SYVRAYFAELEPLRAMVTRETKDGLELSNGVD 147

Query: 166 SKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD-VINLGILGFLTERNANRFW 224
                   R          V       +A   DE S +PD  +   +   +         
Sbjct: 148 IYVGVNDFRAVRGRTILCAV----LDEIAYWRDENSASPDLELYRALKPGMATL-PEAML 202

Query: 225 IMTSNPRRLSGKFYEIFN----KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVT 280
           I  S+P R +G  +        +  D              +D +  +  +A    D    
Sbjct: 203 IGISSPYRRAGLLHAKHRQAYGRDGDTLVIRAPSAVMNPTLDQAEIDQAMAE---DPAAA 259

Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP----YAPLIM 320
           R E   +F + DI  F+ L++IE A++      P    YAP IM
Sbjct: 260 RAEWLAEF-RDDISGFLGLDLIEAAVDPTIVTRPPRGCYAPWIM 302


>gi|145639505|ref|ZP_01795109.1| putative phage gene [Haemophilus influenzae PittII]
 gi|145271296|gb|EDK11209.1| putative phage gene [Haemophilus influenzae PittII]
          Length = 629

 Score = 49.3 bits (116), Expect = 0.001,   Method: Composition-based stats.
 Identities = 27/180 (15%), Positives = 56/180 (31%), Gaps = 25/180 (13%)

Query: 265 FHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN---------REPCPDPY 315
             E +  RY              +       F    +++  ++         +   P   
Sbjct: 389 NIEKLKQRYSKY--AFNQLYMCVWIDDADSIFTVHQLLKCGVDISKWKDFNPKADRPFGD 446

Query: 316 APLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369
             +  G D A  G   + V++           +   + W+        N+I  L EKY  
Sbjct: 447 REVWGGFDPAHSGDGASFVIIAPPALPSEKYRVLARYQWNGLSYVYQANQIRALYEKYNM 506

Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL 429
             I IDA   G    + ++           ++ A  + +    +T + +K+ D +E   +
Sbjct: 507 TYIGIDATGVGYGVYELVKE--------FARRAATAIIYNPESKTGMVLKVHDLVEHGQI 558


>gi|209694587|ref|YP_002262515.1| terminase, ATPase subunit [Aliivibrio salmonicida LFI1238]
 gi|208008538|emb|CAQ78711.1| terminase, ATPase subunit [Aliivibrio salmonicida LFI1238]
          Length = 590

 Score = 49.3 bits (116), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/179 (11%), Positives = 54/179 (30%), Gaps = 19/179 (10%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF----IP 298
               W+    I+     G D    + +   Y +D           F    +  F    + 
Sbjct: 330 DDKQWRYVVTIEDAANGGCDLFDIDELREEYSVDD--FNNLFMCMFVDGSLSVFKFSDLE 387

Query: 299 LNIIEEA-----LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLR------RGPVIEHLFD 347
             +++ A       +   P  +  + +G D +    +  +VV+           +     
Sbjct: 388 KGMVDAAHWQDFKPKNKQPFEHREVWLGYDPSRTRDNACLVVVAPPAVVVEKFRVLEKHY 447

Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAVD 405
           W   + +   ++I  + ++Y+   I +D    G    D + +      + +       +
Sbjct: 448 WKGLNFQYHVSEIDKVFKRYKVTYIGVDTTGIGGGVWDLISKKYPREAHAIHYSNENKN 506


>gi|300922774|ref|ZP_07138861.1| conserved domain protein [Escherichia coli MS 182-1]
 gi|300420878|gb|EFK04189.1| conserved domain protein [Escherichia coli MS 182-1]
          Length = 199

 Score = 49.3 bits (116), Expect = 0.002,   Method: Composition-based stats.
 Identities = 26/177 (14%), Positives = 49/177 (27%), Gaps = 55/177 (31%)

Query: 352 DLRTTNNKISGLVEKYRPDAIIIDANNTGAR----TCDYLEMLGYHVYRVLGQKRAVDLE 407
           D+    +  + +  +   D  + D +  GA     T +             G +   D +
Sbjct: 2   DINEGADWATSMAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDED 61

Query: 408 F-----------------------CRNRRTELHVKMADWLEFA---------SLINH--- 432
                                    RN+R + +  +AD L            +  +    
Sbjct: 62  APYQAGAWADEVVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVAHGEYADPDDMLS 121

Query: 433 -----------SGLIQNLKSLKSFIVPNTGEL----AIESKRVKGAKSTDYSDGLMY 474
                        L   L  ++     N G+L     +E K+  G  S + +D LM 
Sbjct: 122 FDKEAIGEKMLEKLFAELTQIQR-KFNNNGKLELMTKVEMKQKLGIPSPNLADALMM 177


>gi|284008602|emb|CBA75192.1| phage terminase protein [Arsenophonus nasoniae]
          Length = 598

 Score = 49.3 bits (116), Expect = 0.002,   Method: Composition-based stats.
 Identities = 31/162 (19%), Positives = 55/162 (33%), Gaps = 18/162 (11%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF------ 296
           P   W+    ++     G + +  + +  RY  + D  ++     F +     F      
Sbjct: 336 PDGQWRYVITLEDAIKGGFNLASIDKLRQRY--NPDTFKMLYMCIFIEHGASVFKYDTLQ 393

Query: 297 ---IPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLR------RGPVIEHLFD 347
              + +N+ E+   + P P     +  G D A  G  +T V++           I   F 
Sbjct: 394 KCGVDVNLWEDHNPKAPRPFGEREVWGGYDPARSGDTSTFVIVAPPMMAPEVFRILATFY 453

Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
           W     R     I  L +KYR   I ID    G    + ++ 
Sbjct: 454 WQGFSWRHQAKLIEDLTKKYRFTHIGIDTTGIGQSVYEMVQD 495


>gi|326536310|ref|YP_004300751.1| gp17 terminase DNA packaging enzyme large subunit [Acinetobacter
           phage 133]
 gi|299483391|gb|ADJ19485.1| gp17 terminase DNA packaging enzyme large subunit [Acinetobacter
           phage 133]
          Length = 606

 Score = 49.3 bits (116), Expect = 0.002,   Method: Composition-based stats.
 Identities = 51/328 (15%), Positives = 94/328 (28%), Gaps = 51/328 (15%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GKT   A  +   +      SV  LA+  +  +  L+                    L  
Sbjct: 161 GKTAAVAIFLAHYVCFNESKSVGILAHKGSMSEEVLFR--------TKQAIELLPDFLQP 212

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP--DV 206
               W    +    G     ++          PD   G        I  DE +     D 
Sbjct: 213 GIVEWNKRSIELDNGSSIGAFA--------SSPDAVRG---NSFSLIYIDETAFVQNWDD 261

Query: 207 INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF------NKPLDDWKRFQIDTRT-VE 259
             L I   ++    +   IMT+ P  ++  FY+++            ++   +  +  + 
Sbjct: 262 CWLAIQPVISS-GRHSKIIMTTTPNGMN-HFYDLWQGAINGTSGFRPYEATWVSVKDRLY 319

Query: 260 GIDPSFHEGII---ARYGLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315
               +F +G        G  S      E  G F     ++ I    +     +E   D +
Sbjct: 320 NEADTFDDGWEFSARAIGSSSIEQFLQEHLGNF-AGGSNTLIDGTKLAVLFGQERIADQH 378

Query: 316 APL-----------IMGCDIAEE-GGDNTVVVLRRGPVIE----HLFDWSKTDLRTTNNK 359
             +           I   D AE  G D   + +      +     +   +K       + 
Sbjct: 379 EFIEFKPPVAGRKYIATLDSAEGRGQDYHALHIIDVTDEQWEQAGVLHSNKISHLILADI 438

Query: 360 ISGLVEKYRPDAIIIDANNTGARTCDYL 387
           I   + +Y    + I+ N+TG      L
Sbjct: 439 IFLYLTRYNEAPVYIELNSTGVSIAKTL 466


>gi|289661923|ref|ZP_06483504.1| phage-related terminase [Xanthomonas campestris pv. vasculorum
           NCPPB702]
          Length = 267

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 25/142 (17%), Positives = 46/142 (32%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-------NREPCPDPYAP- 317
            + +   Y    D     +  +F      S  PL +++  +        ++  P    P 
Sbjct: 29  IDELREEY--SPDAFANLLMCEFVDDGA-SIFPLAMLQPCMVDSWVEWGQDYKPFAARPY 85

Query: 318 ----LIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367
               + +G D AE G    +VVL    + G     L    +   D      +I  +  +Y
Sbjct: 86  GDRAVWIGYDPAETGDTAGLVVLAPPQQPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 145

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G+     ++ 
Sbjct: 146 WVTYIGIDTTGMGSGVAQLVKQ 167


>gi|188577619|ref|YP_001914548.1| phage terminase, ATPase subunit [Xanthomonas oryzae pv. oryzae
           PXO99A]
 gi|188522071|gb|ACD60016.1| phage terminase, ATPase subunit [Xanthomonas oryzae pv. oryzae
           PXO99A]
          Length = 533

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/142 (16%), Positives = 42/142 (29%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR------------EPCPD 313
            + +   Y    D     +   F      S  PL +++  +                 P 
Sbjct: 295 IDELREEY--SPDAFANLLMCDFVDDGA-SIFPLAMLQPCMVDSWVEWGQDYKPFAVRPY 351

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367
               + +G D AE G    +VVL    + G     L    +   D      +I  +  +Y
Sbjct: 352 GDRAVWIGYDPAETGDTAGLVVLAPPQQPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 411

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G+     ++ 
Sbjct: 412 WVTYIGIDTTGMGSGVAQLVKQ 433


>gi|325926090|ref|ZP_08187451.1| hypothetical protein XPE_1415 [Xanthomonas perforans 91-118]
 gi|325928218|ref|ZP_08189424.1| hypothetical protein XPE_3475 [Xanthomonas perforans 91-118]
 gi|325541407|gb|EGD12943.1| hypothetical protein XPE_3475 [Xanthomonas perforans 91-118]
 gi|325543435|gb|EGD14857.1| hypothetical protein XPE_1415 [Xanthomonas perforans 91-118]
          Length = 587

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 26/142 (18%), Positives = 45/142 (31%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-------NREPCPDPYAP- 317
            + +   Y    D     +   F      S  PL +++  +        +E  P    P 
Sbjct: 349 IDELREEY--SPDAFANLLMCDFVDDGA-SIFPLAMLQPCMVDSWVEWGQEYKPFAARPY 405

Query: 318 ----LIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367
               + +G D AE G    +VVL    + G     L    +   D      +I  +  +Y
Sbjct: 406 GDRAVWIGYDPAETGDTAGLVVLAPPQQPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 465

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G+     ++ 
Sbjct: 466 WVTYIGIDTTGMGSGVAQLVKQ 487


>gi|21232401|ref|NP_638318.1| phage-related terminase [Xanthomonas campestris pv. campestris str.
           ATCC 33913]
 gi|21114179|gb|AAM42242.1| phage-related terminase [Xanthomonas campestris pv. campestris str.
           ATCC 33913]
          Length = 594

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 25/142 (17%), Positives = 46/142 (32%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-------NREPCPDPYAP- 317
            + +   Y    D     +  +F      S  PL +++  +        ++  P    P 
Sbjct: 356 IDELREEY--SPDAFANLLMCEFVDDGA-SIFPLAMLQPCMVDSWVEWGQDYKPFAARPY 412

Query: 318 ----LIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367
               + +G D AE G    +VVL    + G     L    +   D      +I  +  +Y
Sbjct: 413 GDRAVWIGYDPAETGDTAGLVVLAPPQQPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 472

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G+     ++ 
Sbjct: 473 WVTYIGIDTTGMGSGVAQLVKQ 494


>gi|254183934|ref|ZP_04890525.1| putative terminase, ATPase subunit [Burkholderia pseudomallei 1655]
 gi|184214466|gb|EDU11509.1| putative terminase, ATPase subunit [Burkholderia pseudomallei 1655]
          Length = 589

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y  +       +  QF    + S   L+ ++  +                 P 
Sbjct: 350 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406

Query: 314 PYAPLIMGCDIAEEGGDNTVVV-----LRRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            Y  + +G D A  G    +VV     +  G   +     +   D       I  + ++Y
Sbjct: 407 GYREVWVGYDPALTGDSAGLVVVAPPRIDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488


>gi|160876026|ref|YP_001555342.1| hypothetical protein Sbal195_2916 [Shewanella baltica OS195]
 gi|160861548|gb|ABX50082.1| protein of unknown function DUF264 [Shewanella baltica OS195]
          Length = 589

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 52/343 (15%), Positives = 107/343 (31%), Gaps = 42/343 (12%)

Query: 163 GIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNA 220
            +    Y  + +     +  + +  H  +    I+  +S T D      G L       A
Sbjct: 249 NLYLDEYFWIHKFQEFRKVASGMAIHAKWRQTYISTPSSITHDAYPFWTGTLFNRGRPKA 308

Query: 221 NRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDV 279
           +R  I  S+    +G+           W++   +D    +G +    + +   Y    D 
Sbjct: 309 DRIEIDVSHSALANGR-----RCEDGQWRQVVTVDDAIRKGCNLFDPDTLHLEY--SPDE 361

Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNR-----------EPCPDPYAPLIMGCDIAEEG 328
               +  +F   D  S  P+ +++  +              P P  +  + +G D  + G
Sbjct: 362 YSNLLMCEFID-DTMSVFPMVMMQRCMVDSWEVWTDYKPFAPRPLAHREVWIGYDPNKGG 420

Query: 329 GDNTVVVLRR------GPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380
             ++   +        G     +    W+  D       I  +  KY    I ID    G
Sbjct: 421 KGDSAGCIVICPPAVPGGKFRVIEKHRWNGMDFEAQAKAIQDICNKYNVTFIGIDTTGLG 480

Query: 381 ARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG---LIQ 437
                 ++     V   L              ++++ +K  D +    L   +G   L Q
Sbjct: 481 EAVYQLVKKFFPQVTPFLYNPV---------LKSQMVIKAYDVISKGRLEYDAGWTDLAQ 531

Query: 438 NLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480
              S++  +  +  ++  ES R +     D +   M+     P
Sbjct: 532 AFMSIRKTLTASGKQVTYESARSEEISHADIAWAAMHALYNEP 574


>gi|332288320|ref|YP_004419172.1| terminase-like family protein [Gallibacterium anatis UMN179]
 gi|330431216|gb|AEC16275.1| terminase-like family protein [Gallibacterium anatis UMN179]
          Length = 590

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 43/248 (17%), Positives = 72/248 (29%), Gaps = 27/248 (10%)

Query: 244 PLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF--IPLN 300
           P   W++   I     +G +    + +   Y ++          QF   +   F  I L 
Sbjct: 330 PDGQWRQIVTIYDAMAQGCNLFDVDALKLEYSVEE--FEQLFLCQFIDDNSSVFKFIDLQ 387

Query: 301 ---------IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLR----RGPVIE--HL 345
                      +        P    P+ +G D A  G    + V+      G      H 
Sbjct: 388 KCGVDSLEVWSDFN-PLAKRPFADNPVWIGYDPAHTGDRAALAVVAPPAVEGGKYRLLHY 446

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405
                 D       I   ++ Y    I ID    G      +    Y + R L     + 
Sbjct: 447 KTVHGMDFEQQAGLIKDYLQIYNVQKITIDRTGLGEGVYQLVRKF-YPLTRGLTYNVDLK 505

Query: 406 LEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKS 465
            E        L++     LEF S      +I +  ++K        ++   S R K A  
Sbjct: 506 NEMVL---KTLNIIGKRRLEFDS--GDKEVINSFMTIKKQTTRTGQKITYISDRSKEASH 560

Query: 466 TDYSDGLM 473
            D +  +M
Sbjct: 561 GDIAWAIM 568


>gi|134288710|ref|YP_001111154.1| gp4, phage terminase, ATPase subunit [Burkholderia phage phiE12-2]
 gi|134132095|gb|ABO60770.1| gp4, phage terminase, ATPase subunit [Burkholderia phage phiE12-2]
          Length = 601

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y  +       +  QF    + S   L+ ++  +                 P 
Sbjct: 362 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 418

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            Y  + +G D A  G    +VV+       G   +     +   D       I  + ++Y
Sbjct: 419 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 478

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 479 NVGYIAIDTTGMGQGVYQLVRK 500


>gi|53722089|ref|YP_111074.1| bacteriophage terminase, ATPase subunit [Burkholderia pseudomallei
           K96243]
 gi|52212503|emb|CAH38529.1| putative bacteriophage terminase, ATPase subunit [Burkholderia
           pseudomallei K96243]
          Length = 601

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y  +       +  QF    + S   L+ ++  +                 P 
Sbjct: 362 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 418

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            Y  + +G D A  G    +VV+       G   +     +   D       I  + ++Y
Sbjct: 419 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 478

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 479 NVGYIAIDTTGMGQGVYQLVRK 500


>gi|72537721|ref|YP_293751.1| phage terminase ATPase subunit [Burkholderia phage phi52237]
 gi|72398411|gb|AAZ72646.1| phage terminase ATPase subunit [Burkholderia phage phi52237]
          Length = 601

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y  +       +  QF    + S   L+ ++  +                 P 
Sbjct: 362 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 418

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            Y  + +G D A  G    +VV+       G   +     +   D       I  + ++Y
Sbjct: 419 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 478

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 479 NVGYIAIDTTGMGQGVYQLVRK 500


>gi|53717814|ref|YP_106800.1| phage terminase, ATPase subunit [Burkholderia pseudomallei K96243]
 gi|52208228|emb|CAH34159.1| phage terminase, ATPase subunit [Burkholderia pseudomallei K96243]
          Length = 589

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y  +       +  QF    + S   L+ ++  +                 P 
Sbjct: 350 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            Y  + +G D A  G    +VV+       G   +     +   D       I  + ++Y
Sbjct: 407 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488


>gi|167916806|ref|ZP_02503897.1| bacteriophage terminase, ATPase subunit [Burkholderia pseudomallei
           112]
          Length = 589

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y  +       +  QF    + S   L+ ++  +                 P 
Sbjct: 350 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            Y  + +G D A  G    +VV+       G   +     +   D       I  + ++Y
Sbjct: 407 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488


>gi|167619947|ref|ZP_02388578.1| phage terminase, ATPase subunit [Burkholderia thailandensis Bt4]
          Length = 589

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y  +       +  QF    + S   L+ ++  +                 P 
Sbjct: 350 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            Y  + +G D A  G    +VV+       G   +     +   D       I  + ++Y
Sbjct: 407 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488


>gi|167821684|ref|ZP_02453364.1| phage terminase, ATPase subunit [Burkholderia pseudomallei 91]
 gi|254188172|ref|ZP_04894684.1| Putative ATPase subunit of terminase (gpP-like) [Burkholderia
           pseudomallei Pasteur 52237]
 gi|157935852|gb|EDO91522.1| Putative ATPase subunit of terminase (gpP-like) [Burkholderia
           pseudomallei Pasteur 52237]
          Length = 589

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y  +       +  QF    + S   L+ ++  +                 P 
Sbjct: 350 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            Y  + +G D A  G    +VV+       G   +     +   D       I  + ++Y
Sbjct: 407 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488


>gi|308389159|gb|ADO31479.1| Terminase, ATPase subunit [Neisseria meningitidis alpha710]
          Length = 610

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 36/182 (19%), Positives = 59/182 (32%), Gaps = 24/182 (13%)

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD--WSKTDLRTTNNKISGLV 364
            P    P+ +G D +     + +VV       G     L       TD  +    I  + 
Sbjct: 428 RPAGNLPVWVGYDPSYTADASGLVVAVPPQNNGEPFYILETALIPGTDFESQAANIRKIT 487

Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLGYHVYR------VLGQKRAVDLEFCRNRRTELHV 418
           E+Y    I+IDAN  GA   D +      V        + G          +N+R     
Sbjct: 488 ERYNVSKIVIDANGIGAAVFDLVRKFYPPVIGMTYTPDIKGMMVLKTQNLLKNKR----- 542

Query: 419 KMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478
                 +  ++     L     S++  +  +   +  ES R K A   D +   M  F +
Sbjct: 543 ---IKWDAGNI----DLQMAFLSVRRSVTASGRNITYESVRSKTASHGDLAWAAMMLFYQ 595

Query: 479 NP 480
            P
Sbjct: 596 EP 597


>gi|17981830|ref|NP_536821.1| terminase [Haemophilus phage HP2]
 gi|13752203|gb|AAK37798.1| orf16 [Haemophilus phage HP2]
 gi|309750513|gb|ADO80497.1| probable terminase, ATPase subunit [Haemophilus influenzae R2866]
          Length = 607

 Score = 48.6 bits (114), Expect = 0.002,   Method: Composition-based stats.
 Identities = 27/180 (15%), Positives = 56/180 (31%), Gaps = 25/180 (13%)

Query: 265 FHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN---------REPCPDPY 315
             E +  RY              +       F    +++  ++         +   P   
Sbjct: 367 NIEKLKQRYSKY--AFNQLYMCVWIDDADSIFTVHQLLKCGVDISKWKDFNPKADRPFGD 424

Query: 316 APLIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369
             +  G D A  G   + V++           +   + W+        N+I  L EKY  
Sbjct: 425 REVWGGFDPAHSGDGASFVIIAPPALPSEKYRVLARYQWNGLSYVYQANQIRALYEKYNM 484

Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL 429
             I IDA   G    + ++           ++ A  + +    +T + +K+ D +E   +
Sbjct: 485 TYIGIDATGVGYGVYELVKE--------FARRAATAIIYNPESKTGMVLKVHDLVEHGQI 536


>gi|84623266|ref|YP_450638.1| phage-related terminase [Xanthomonas oryzae pv. oryzae MAFF 311018]
 gi|84367206|dbj|BAE68364.1| phage-related terminase [Xanthomonas oryzae pv. oryzae MAFF 311018]
          Length = 594

 Score = 48.6 bits (114), Expect = 0.002,   Method: Composition-based stats.
 Identities = 24/142 (16%), Positives = 42/142 (29%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR------------EPCPD 313
            + +   Y    D     +   F      S  PL +++  +                 P 
Sbjct: 356 IDELREEY--SPDAFANLLMCDFVDDGA-SIFPLAMLQPCMVDSWVEWGQDYKPFAVRPY 412

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367
               + +G D AE G    +VVL    + G     L    +   D      +I  +  +Y
Sbjct: 413 GDRAVWIGYDPAETGDTAGLVVLAPPQQPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 472

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G+     ++ 
Sbjct: 473 WVTYIGIDTTGMGSGVAQLVKQ 494


>gi|163736656|ref|ZP_02144075.1| hypothetical protein RGBS107_16031 [Phaeobacter gallaeciensis
           BS107]
 gi|161390526|gb|EDQ14876.1| hypothetical protein RGBS107_16031 [Phaeobacter gallaeciensis
           BS107]
          Length = 430

 Score = 48.6 bits (114), Expect = 0.002,   Method: Composition-based stats.
 Identities = 65/416 (15%), Positives = 112/416 (26%), Gaps = 65/416 (15%)

Query: 82  ISAGRGIGKTTLNA-WLVLWLMSTRPGI-----SVICLANSETQLKTTLWAEVSKWLSLL 135
           I  GRG GKT   A W+        P        +  L  +  Q++  +    S  L+  
Sbjct: 38  ILGGRGAGKTRAGAEWVRTLAEGATPLSAGRARRIALLGETYDQVRDVMVQGDSGLLACT 97

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
           P                        +            + +S   P+   G       A 
Sbjct: 98  PRD---------------RRPTWKATERRLIWPNGATAQAFSAHDPEALRGPQFD---AA 139

Query: 196 INDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
             DE +           +   L   +  R  +  +   R  G   ++   P    +    
Sbjct: 140 WADELAKWKRGQDSWDMLQFALRLGDDPR--VCVTTTPRNVGVLRDLLASPSTV-QTHAA 196

Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA-LNREPCP 312
                  +  SF   +  RY   S + R E+ G   Q    +      +  A + + P  
Sbjct: 197 TEANRANLAASFIAEVRNRY-AGSRLGRQELDGILLQDVEGALWTNAGLVAAQIAKAPTL 255

Query: 313 DPYAPLIMGCDIAEEGG---DNTVVVLRRGPVIEHLFDW----------SKTDLRTTNNK 359
           D    +++  D A   G   D   +V+    +     DW                T    
Sbjct: 256 DR---VVVAVDPAVSAGKRSDACGIVVVGATLQGPPQDWCAYVLADCTVQGVGPLTWAQA 312

Query: 360 ISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVK 419
                ++Y  D ++ + N  GA     L  +   V        A+     +  R E    
Sbjct: 313 AIDARDRYGADRVVAEVNQGGALVESLLRQIDPLV-----PFTALHASRGKGARAEPVAA 367

Query: 420 MADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
           + +      +     L   L       +   G L        G  S D  D L++ 
Sbjct: 368 LYEQGRVRHVPGLGALEDQLC-----QMTPRGYL--------GQGSPDRLDALVWA 410


>gi|224542959|ref|ZP_03683498.1| hypothetical protein CATMIT_02153 [Catenibacterium mitsuokai DSM
           15897]
 gi|224524097|gb|EEF93202.1| hypothetical protein CATMIT_02153 [Catenibacterium mitsuokai DSM
           15897]
          Length = 479

 Score = 48.6 bits (114), Expect = 0.002,   Method: Composition-based stats.
 Identities = 56/385 (14%), Positives = 114/385 (29%), Gaps = 32/385 (8%)

Query: 34  HFFPWGEK--GTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT 91
           +  P+        +E ++      +E+ E+     +   ++      K   S  R  GK+
Sbjct: 14  YIIPYKSTLGNEAIELYNNTTRNAMEWQEIQMMDIMAVDDDGQWVHIKYGYSIPRRNGKS 73

Query: 92  TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPA 151
            +     LW +    G  ++  A+  T      W ++ + L                +  
Sbjct: 74  EILVMRELWGLL--HGEKILHTAH-RTTTSHASWEKLKQMLDENDYTEVKRADKEKTYEK 130

Query: 152 PWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGI 211
            + +        I          ++        +G        +I DEA    +     +
Sbjct: 131 SYTATAQFGLETIKILDDGGGSASFRTRSSKGGLG---EGFDLLIVDEAQEYTEDQQSAL 187

Query: 212 LGFLTERNANRFWIMTSNP--------------------RRLSGKFYEIFNKPLDDWKRF 251
              +T    N   +M   P                       +  + E   + + D K  
Sbjct: 188 QYVVTSSE-NPQTLMCGTPPTAVSSGTVFVNLRKECLSGGSDTSGWAEWSVEHMSDVKDR 246

Query: 252 QIDTRTVEGIDPSFHEGII-ARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
            I   T   +  +  E  + A    D     ++  G + Q +  S I  N    AL  E 
Sbjct: 247 DIWYETNPSLGQTLKERSVAAEDSSDEIDFNIQRFGLWLQYNQKSAISENE-WNALKVET 305

Query: 311 CPDPYAPLIMGCDIAEEGGDNT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369
            P+   PL +G     +G + +  + ++       +       +R  N  I   + K   
Sbjct: 306 IPEFKGPLFVGIKYGHDGSNVSMSIAVKTKNDNILVDVIGCRPIRKGNGWIVDFLRKADI 365

Query: 370 DAIIIDANNTGARTCDYLEMLGYHV 394
            A+ +D  N      + L+  G  +
Sbjct: 366 AAVTVDGANGQQMLINELKEAGIKL 390


>gi|293608730|ref|ZP_06691033.1| conserved hypothetical protein [Acinetobacter sp. SH024]
 gi|292829303|gb|EFF87665.1| conserved hypothetical protein [Acinetobacter sp. SH024]
          Length = 430

 Score = 48.6 bits (114), Expect = 0.002,   Method: Composition-based stats.
 Identities = 53/317 (16%), Positives = 88/317 (27%), Gaps = 45/317 (14%)

Query: 81  AISAGRGIGKTTLNAWLV---LWLMSTRPGISVIC--LANSETQLKTTLWAEVSKWLSLL 135
           A  AG G GKT +    +    W         V     A +  Q++   +  +       
Sbjct: 24  AFVAGFGSGKTWVGCSSLCNKAW-----EFPKVPLGYFAPTYPQIRDIFFPTI-----EE 73

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
               W     +         +          + Y T     S E+P T VG    + +  
Sbjct: 74  VAFDWGLKTKVY--------ETNKEVDIYYGRQYRTTIICRSMEKPATIVGFKIGHAL-- 123

Query: 196 INDE----ASGTPDVINLGILGFLTERNANRF-WIMTSNPRRLSGKFYEIFNKP------ 244
             DE    A          I+  +  + A     I  +         YE F K       
Sbjct: 124 -IDELDVMAKVKAQQAWRKIIARMRYKQAGLLNGIDVATTPEGFKFTYEQFVKEANKSEA 182

Query: 245 -LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
               +   Q  T   E  +   +   +   Y     +    + GQF      +  P +  
Sbjct: 183 KRKLYGMIQASTYDNEANLPDDYISSLYESY--PPQLISAYLRGQFVNLTSGAVYP-DFD 239

Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKIS 361
               + +       PL++G D         V V+R G     L +     D  T    I+
Sbjct: 240 RVLNHTDEEIKKGEPLLIGMDFNVLKMAAVVYVIREG-KPRALDELVGVRDTPTMCQLIN 298

Query: 362 GLVEKYRPDAIIIDANN 378
                +    +I DA+ 
Sbjct: 299 ERFPDH-DITVIPDASG 314


>gi|260556008|ref|ZP_05828228.1| PBSX family phage terminase, large subunit [Acinetobacter baumannii
           ATCC 19606]
 gi|260410919|gb|EEX04217.1| PBSX family phage terminase, large subunit [Acinetobacter baumannii
           ATCC 19606]
          Length = 435

 Score = 48.6 bits (114), Expect = 0.002,   Method: Composition-based stats.
 Identities = 53/317 (16%), Positives = 88/317 (27%), Gaps = 45/317 (14%)

Query: 81  AISAGRGIGKTTLNAWLV---LWLMSTRPGISVIC--LANSETQLKTTLWAEVSKWLSLL 135
           A  AG G GKT +    +    W         V     A +  Q++   +  +       
Sbjct: 29  AFVAGFGSGKTWVGCSSLCNKAW-----EFPKVPLGYFAPTYPQIRDIFFPTI-----EE 78

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
               W     +         +          + Y T     S E+P T VG    + +  
Sbjct: 79  VAFDWGLKTKVY--------ETNKEVDIYYGRQYRTTIICRSMEKPATIVGFKIGHAL-- 128

Query: 196 INDE----ASGTPDVINLGILGFLTERNANRF-WIMTSNPRRLSGKFYEIFNKP------ 244
             DE    A          I+  +  + A     I  +         YE F K       
Sbjct: 129 -IDELDVMAKVKAQQAWRKIIARMRYKQAGLLNGIDVATTPEGFKFTYEQFVKEANKSEA 187

Query: 245 -LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
               +   Q  T   E  +   +   +   Y     +    + GQF      +  P +  
Sbjct: 188 KRKLYGMIQASTYDNEANLPDDYISSLYESY--PPQLISAYLRGQFVNLTSGAVYP-DFD 244

Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKIS 361
               + +       PL++G D         V V+R G     L +     D  T    I+
Sbjct: 245 RVLNHTDEEIKKGEPLLIGMDFNVLKMAAVVYVIREG-KPRALDELVGVRDTPTMCQLIN 303

Query: 362 GLVEKYRPDAIIIDANN 378
                +    +I DA+ 
Sbjct: 304 ERFPDH-DITVIPDASG 319


>gi|94990333|ref|YP_598433.1| terminase large subunit [Streptococcus phage 10270.2]
 gi|94994256|ref|YP_602354.1| Terminase large subunit [Streptococcus phage 10750.2]
 gi|94543841|gb|ABF33889.1| Terminase large subunit [Streptococcus phage 10270.2]
 gi|94547764|gb|ABF37810.1| Terminase large subunit [Streptococcus phage 10750.2]
          Length = 432

 Score = 48.6 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 41/290 (14%), Positives = 89/290 (30%), Gaps = 41/290 (14%)

Query: 57  EFMEVVDAHCLNSVNNP-NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLAN 115
           +  +++       V    NP++   A   GRG GK++  A+++  L+   P ++ +C+  
Sbjct: 12  DLADIIPIGFKPVVQATWNPQILNIACKGGRGSGKSSNIAFIISRLIIQYP-VNAVCIRK 70

Query: 116 SETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT 175
           ++  L+ +++ ++ KW            +    + +P     +     I  +        
Sbjct: 71  TDNTLEQSVYEQI-KW----AISEQGLERYFKFNKSPLRITYIPRGNYIVFRG------- 118

Query: 176 YSEERPDTFVGHHNTYGMAII-----------NDEASGTPDVINLGILGFLTERNANRFW 224
              + P+      ++     I            DE     + +  G LG          +
Sbjct: 119 --AQNPERIKSLKDSRFPFAIGWIEELAEFKTEDEVKTITNSLLRGELG----DGLFYKF 172

Query: 225 IMTSNPRRLSGKF----YEIFNKPLDDWKRFQIDTR-TVEGIDPSFHEGIIARYGLDSDV 279
             T NP +    +    YE   +P + +      T      I   F     A        
Sbjct: 173 FYTYNPPKRKQSWVNKKYESQFQPSNTF--VHASTYKDNPFIAKEFIAEAEATRERSERR 230

Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGG 329
            R E  G+         +P + +      +     +  +  G D      
Sbjct: 231 YRWEYLGEAIGS---GVVPFDNLRFERITDEQVADFDNIRNGIDYGYATD 277


>gi|190572396|ref|YP_001970241.1| putative phage terminase, ATPase subunit (gpp) [Stenotrophomonas
           maltophilia K279a]
 gi|190010318|emb|CAQ43926.1| putative phage terminase, ATPase subunit (gpp) [Stenotrophomonas
           maltophilia K279a]
          Length = 597

 Score = 48.6 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 25/142 (17%), Positives = 39/142 (27%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPL------------NIIEEALNREPCPD 313
            E +   Y  +       +  +F      S  PL              +++       P 
Sbjct: 360 IEELRRDYSAEE--FANLLMCEFVDDSA-SIFPLTMLQPCQVDSWVEWVDDFKPLAIRPY 416

Query: 314 PYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367
               + +G D AE G    +VV    L  G     L    +   D       I  +  +Y
Sbjct: 417 GDRAVWIGYDPAETGDSAGIVVVAPPLVPGGKFRVLERHQFKGMDFAAQAAFIQQVTLRY 476

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I IDA   G      +  
Sbjct: 477 WVTYIGIDATGMGTGVAQLVRQ 498


>gi|328552921|gb|AEB23413.1| putative helicase, ATP-dependent, intein-containing [Bacillus
           amyloliquefaciens TA208]
          Length = 1021

 Score = 48.6 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 20/144 (13%), Positives = 43/144 (29%), Gaps = 33/144 (22%)

Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP----------YAPLIMGCDIAE--- 326
            R      +      + I ++ + +A                       ++G D+A    
Sbjct: 731 FRQNYLCDWIGASDGALINISKLIKARTITHPELSCPRDKNKNFLLHEYVIGVDVARSAA 790

Query: 327 EGGDNTVVV-----------LRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY-------- 367
           E  + T +V           +R+  V+  +   +    +  +  +  + + Y        
Sbjct: 791 ESNNKTAIVVLKIIRNSNNLIRQVQVVNIIEPPNGLSFKEQSIMVKRVFKNYGGNQDTSL 850

Query: 368 -RPDAIIIDANNTGARTCDYLEML 390
            R  A+I+D N  G    D L   
Sbjct: 851 SRVKAVIVDGNGVGGGLIDRLLED 874


>gi|167900122|ref|ZP_02487523.1| Putative ATPase subunit of terminase (gpP-like) protein
           [Burkholderia pseudomallei 7894]
          Length = 589

 Score = 48.6 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 20/142 (14%), Positives = 38/142 (26%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y  +       +   F    + S   L  ++  +                 P 
Sbjct: 350 IDELRREYSAEE--FANLLMCHFIDDSL-SVFKLAELQRCMVDSWEEWADDFSPLLLRPF 406

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367
            +  + +G D A  G    +VV+      G     L    +   D       I  + ++Y
Sbjct: 407 GHREVWVGYDPALTGDSAGLVVVAPPRVDGGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488


>gi|325921366|ref|ZP_08183223.1| hypothetical protein XGA_2215 [Xanthomonas gardneri ATCC 19865]
 gi|325548124|gb|EGD19121.1| hypothetical protein XGA_2215 [Xanthomonas gardneri ATCC 19865]
          Length = 594

 Score = 48.6 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 25/142 (17%), Positives = 45/142 (31%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-------NREPCPDPYAP- 317
            + +   Y    D     +  +F      S  PL +++  +        ++  P    P 
Sbjct: 356 IDELREEY--SPDAFANLLMCEFVDDGA-SIFPLAMLQPCMVDSWVEWGQDYKPFAARPY 412

Query: 318 ----LIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367
               + +G D AE G    +VVL      G     L    +   D      +I  +  +Y
Sbjct: 413 GDRAVWIGYDPAETGDTAGLVVLAPPQLPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 472

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G+     ++ 
Sbjct: 473 WVTYIGIDTTGMGSGVAQLVKQ 494


>gi|239590013|ref|YP_002941860.1| gp2 [Mycobacterium phage Angel]
 gi|238890545|gb|ACR77534.1| gp2 [Mycobacterium phage Angel]
          Length = 478

 Score = 48.6 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 56/372 (15%), Positives = 107/372 (28%), Gaps = 54/372 (14%)

Query: 54  WQLEFMEVVDAHCLNSVNNPNPEVFKGAISA-GRGIGKTTLNAWLVLWLMSTRPGISVIC 112
           W  +  +      + +++         ++ +  R +GKT L   +V  L    PG++VI 
Sbjct: 36  WTFDRWQDGLGRLILALDGTGLYAADTSVISIPRQVGKTYLIGCIVFALALLTPGLTVIW 95

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
            A+     +T    E                 +  ++             GI   + S +
Sbjct: 96  TAH-----RTKTAKE-------TFGSMKAMCATPLVNAHVRNVSDARGDEGIYLHNGSRI 143

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232
                    +   G        ++ DEA    D     ++  +     N   ++T  P R
Sbjct: 144 LFGAR----ENGFGLGFAGVGILVLDEAQRLTDKAMDDLIPTMNTVE-NPLILLTGTPPR 198

Query: 233 LS-------------------GKFYEIFN-----KPLDDWKRFQIDTRTVEGIDPSFHEG 268
            +                   G  Y  F+      P D  +  + +              
Sbjct: 199 PTDSGEVFTMLRQDALDGESEGTLYVEFSADEGAHPDDRAQLRKANPSYPHRTSERAIRR 258

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA----PLIMGCDI 324
           +      +S     E  G +     D  +   ++  A  R       A    P   G D+
Sbjct: 259 MRKNLTEES--FLREAFGIW-----DKVVHRPVVTAARWRRLESTGPAAGVKPNGFGVDM 311

Query: 325 AEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY-RPDAIIIDANNTGART 383
           +     +   V   G        W+  D       I+   ++  R   ++ID+ +  A  
Sbjct: 312 SHSRMVSVNAVWLDGDQAHTEEVWAGDDTDAAVAWIADAWKRAGRRTVVVIDSESPAASL 371

Query: 384 CDYLEMLGYHVY 395
              LE  G +VY
Sbjct: 372 VVDLENAGVNVY 383


>gi|9628620|ref|NP_043485.1| hypothetical protein HP1p21 [Haemophilus phage HP1]
 gi|1722793|sp|P51718|VPP_BPHP1 RecName: Full=Probable terminase, ATPase subunit; AltName:
           Full=ORF16
 gi|1046243|gb|AAB09201.1| orf16 [Haemophilus phage HP1]
          Length = 607

 Score = 48.6 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 29/180 (16%), Positives = 56/180 (31%), Gaps = 25/180 (13%)

Query: 265 FHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNI----IEEALNREPCPDPYAP--- 317
             E +  RY              +       F    +    ++ A  ++  P    P   
Sbjct: 367 NIEKLKQRYSKY--AFNQLYMCIWIDDADSIFNVKQLLKCGVDIAKWKDFNPKADRPFGD 424

Query: 318 --LIMGCDIAEEGGDNTVVVL------RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369
             +  G D A  G   + V++           +   + W         N+I  L EKY  
Sbjct: 425 REVWGGFDPAHSGDGASFVIIAPPALPGEKYRMLARYQWHGLSYVYQANQIRALYEKYNM 484

Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASL 429
             I IDA   G    + ++           ++ A  + +    +T + +K+ D +E   +
Sbjct: 485 TYIGIDATGVGYGVYELVKE--------FARRAATAIIYNPESKTGMVLKVHDLVEHGQI 536


>gi|215484220|ref|YP_002326447.1| Terminase-like family protein [Acinetobacter baumannii AB307-0294]
 gi|213985731|gb|ACJ56030.1| Terminase-like family protein [Acinetobacter baumannii AB307-0294]
          Length = 413

 Score = 48.6 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 53/317 (16%), Positives = 88/317 (27%), Gaps = 45/317 (14%)

Query: 81  AISAGRGIGKTTLNAWLV---LWLMSTRPGISVIC--LANSETQLKTTLWAEVSKWLSLL 135
           A  AG G GKT +    +    W         V     A +  Q++   +  +       
Sbjct: 7   AFVAGFGSGKTWVGCSSLCNKAW-----EFPKVPLGYFAPTYPQIRDIFFPTI-----EE 56

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
               W     +         +          + Y T     S E+P T VG    + +  
Sbjct: 57  VAFDWGLKTKVY--------ETNKEVDIYYGRQYRTTIICRSMEKPATIVGFKIGHAL-- 106

Query: 196 INDE----ASGTPDVINLGILGFLTERNANRF-WIMTSNPRRLSGKFYEIFNKP------ 244
             DE    A          I+  +  + A     I  +         YE F K       
Sbjct: 107 -IDELDVMAKVKAQQAWRKIIARMRYKQAGLLNGIDVATTPEGFKFTYEQFVKEANKSEA 165

Query: 245 -LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
               +   Q  T   E  +   +   +   Y     +    + GQF      +  P +  
Sbjct: 166 KRKLYGMIQASTYDNEANLPDDYISSLYESY--PPQLISAYLRGQFVNLTSGAVYP-DFD 222

Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKIS 361
               + +       PL++G D         V V+R G     L +     D  T    I+
Sbjct: 223 RVLNHTDEEIKKGEPLLIGMDFNVLKMAAVVYVIREG-KPRALDELVGVRDTPTMCQLIN 281

Query: 362 GLVEKYRPDAIIIDANN 378
                +    +I DA+ 
Sbjct: 282 ERFPDH-DITVIPDASG 297


>gi|184157353|ref|YP_001845692.1| putative phage terminase [Acinetobacter baumannii ACICU]
 gi|301345227|ref|ZP_07225968.1| putative phage terminase [Acinetobacter baumannii AB056]
 gi|301595737|ref|ZP_07240745.1| putative phage terminase [Acinetobacter baumannii AB059]
 gi|332851175|ref|ZP_08433263.1| phage terminase, large subunit, PBSX family [Acinetobacter
           baumannii 6013150]
 gi|332869110|ref|ZP_08438600.1| phage terminase, large subunit, PBSX family [Acinetobacter
           baumannii 6013113]
 gi|332875310|ref|ZP_08443140.1| phage terminase, large subunit, PBSX family [Acinetobacter
           baumannii 6014059]
 gi|183208947|gb|ACC56345.1| putative phage terminase [Acinetobacter baumannii ACICU]
 gi|332730195|gb|EGJ61521.1| phage terminase, large subunit, PBSX family [Acinetobacter
           baumannii 6013150]
 gi|332732895|gb|EGJ64103.1| phage terminase, large subunit, PBSX family [Acinetobacter
           baumannii 6013113]
 gi|332736478|gb|EGJ67475.1| phage terminase, large subunit, PBSX family [Acinetobacter
           baumannii 6014059]
          Length = 413

 Score = 48.6 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 53/317 (16%), Positives = 88/317 (27%), Gaps = 45/317 (14%)

Query: 81  AISAGRGIGKTTLNAWLV---LWLMSTRPGISVIC--LANSETQLKTTLWAEVSKWLSLL 135
           A  AG G GKT +    +    W         V     A +  Q++   +  +       
Sbjct: 7   AFVAGFGSGKTWVGCSSLCNKAW-----EFPKVPLGYFAPTYPQIRDIFFPTI-----EE 56

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
               W     +         +          + Y T     S E+P T VG    + +  
Sbjct: 57  VAFDWGLKTKVY--------ETNKEVDIYYGRQYRTTIICRSMEKPATIVGFKIGHAL-- 106

Query: 196 INDE----ASGTPDVINLGILGFLTERNANRF-WIMTSNPRRLSGKFYEIFNKP------ 244
             DE    A          I+  +  + A     I  +         YE F K       
Sbjct: 107 -IDELDVMAKVKAQQAWRKIIARMRYKQAGLLNGIDVATTPEGFKFTYEQFVKEANKSEA 165

Query: 245 -LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
               +   Q  T   E  +   +   +   Y     +    + GQF      +  P +  
Sbjct: 166 KRKLYGMIQASTYDNEANLPDDYISSLYESY--PPQLISAYLRGQFVNLTSGAVYP-DFD 222

Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKIS 361
               + +       PL++G D         V V+R G     L +     D  T    I+
Sbjct: 223 RVLNHTDEEIKKGEPLLIGMDFNVLKMAAVVYVIREG-KPRALDELVGVRDTPTMCQLIN 281

Query: 362 GLVEKYRPDAIIIDANN 378
                +    +I DA+ 
Sbjct: 282 ERFPDH-DITVIPDASG 297


>gi|261494619|ref|ZP_05991100.1| terminase ATPase subunit [Mannheimia haemolytica serotype A2 str.
           OVINE]
 gi|261309731|gb|EEY10953.1| terminase ATPase subunit [Mannheimia haemolytica serotype A2 str.
           OVINE]
          Length = 612

 Score = 48.6 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 18/132 (13%), Positives = 38/132 (28%), Gaps = 18/132 (13%)

Query: 276 DSDVTRVEVCGQFPQQDIDSF----IPLNIIEEA--------LNREPCPDPYAPLIMGCD 323
             D        +F   +   F    +   +++           +    P     + +G D
Sbjct: 378 SPDEFEQLFMCEFIDDNQSVFKFTMMQRCLVDSMEVWRDYVFTDGYQRPFGNKEVWVGYD 437

Query: 324 IAEEGGDNTVVVL----RRGPVIEHLF--DWSKTDLRTTNNKISGLVEKYRPDAIIIDAN 377
            +  G  + +VV+      G     L    +   D      +I  +  KY    + ID  
Sbjct: 438 PSYTGDRSALVVIAPPKVDGGKFRLLEYRTFKGADFAEQAAEIVAICAKYNVTRLAIDTT 497

Query: 378 NTGARTCDYLEM 389
             G    + ++ 
Sbjct: 498 GLGVGVYEIVKK 509


>gi|213425656|ref|ZP_03358406.1| hypothetical protein SentesTyphi_08397 [Salmonella enterica subsp.
           enterica serovar Typhi str. E02-1180]
          Length = 195

 Score = 48.6 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 22/87 (25%), Positives = 34/87 (39%), Gaps = 9/87 (10%)

Query: 311 CPDPYAPLIMGCDIAE--EGGDN--TVVVL---RRGPVIEHL--FDWSKTDLRTTNNKIS 361
            P  +  + +G D A+  + GD+   VVV      G     L    W   D R   + I 
Sbjct: 8   RPFGWREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIK 67

Query: 362 GLVEKYRPDAIIIDANNTGARTCDYLE 388
            L E+Y    I ID+   G    + ++
Sbjct: 68  KLTEQYNVTYIGIDSTGVGHGVYENVK 94


>gi|109392289|ref|YP_655519.1| gp2 [Mycobacterium phage Halo]
 gi|189043089|ref|YP_001936030.1| hypothetical protein BPs1_2 [Mycobacterium phage BPs]
 gi|91980539|gb|ABE67259.1| terminase [Mycobacterium phage Halo]
 gi|171909204|gb|ACB58161.1| hypothetical protein BPs1_2 [Mycobacterium phage BPs]
 gi|255927846|gb|ACU41466.1| gp2 [Mycobacterium phage Hope]
          Length = 478

 Score = 48.6 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 56/372 (15%), Positives = 107/372 (28%), Gaps = 54/372 (14%)

Query: 54  WQLEFMEVVDAHCLNSVNNPNPEVFKGAISA-GRGIGKTTLNAWLVLWLMSTRPGISVIC 112
           W  +  +      + +++         ++ +  R +GKT L   +V  L    PG++VI 
Sbjct: 36  WTFDRWQDGLGRLILALDGTGLYAADTSVISIPRQVGKTYLIGCIVFALALLTPGLTVIW 95

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
            A+     +T    E                 +  ++             GI   + S +
Sbjct: 96  TAH-----RTKTAKE-------TFGSMKAMCATPLVNAHVRNVSDARGDEGIYLHNGSRI 143

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232
                    +   G        ++ DEA    D     ++  +     N   ++T  P R
Sbjct: 144 LFGAR----ENGFGLGFAGVGILVLDEAQRLTDKAMDDLIPTMNTVE-NPLILLTGTPPR 198

Query: 233 LS-------------------GKFYEIFN-----KPLDDWKRFQIDTRTVEGIDPSFHEG 268
            +                   G  Y  F+      P D  +  + +              
Sbjct: 199 PTDSGEVFTMLRQDALDGESEGTLYVEFSADEGAHPDDRAQLRKANPSYPHRTSERAIRR 258

Query: 269 IIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA----PLIMGCDI 324
           +      +S     E  G +     D  +   ++  A  R       A    P   G D+
Sbjct: 259 MRKNLTEES--FLREAFGIW-----DKVVHRPVVTAARWRRLESTGPAAGVKPNGFGVDM 311

Query: 325 AEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY-RPDAIIIDANNTGART 383
           +     +   V   G        W+  D       I+   ++  R   ++ID+ +  A  
Sbjct: 312 SHSRMVSVNAVWLDGDQAHTEEVWAGDDTDAAVAWIADAWKRAGRRTVVVIDSESPAASL 371

Query: 384 CDYLEMLGYHVY 395
              LE  G +VY
Sbjct: 372 VVDLENAGVNVY 383


>gi|261492632|ref|ZP_05989185.1| terminase ATPase subunit [Mannheimia haemolytica serotype A2 str.
           BOVINE]
 gi|261311791|gb|EEY12941.1| terminase ATPase subunit [Mannheimia haemolytica serotype A2 str.
           BOVINE]
          Length = 612

 Score = 48.6 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 18/132 (13%), Positives = 38/132 (28%), Gaps = 18/132 (13%)

Query: 276 DSDVTRVEVCGQFPQQDIDSF----IPLNIIEEA--------LNREPCPDPYAPLIMGCD 323
             D        +F   +   F    +   +++           +    P     + +G D
Sbjct: 378 SPDEFEQLFMCEFIDDNQSVFKFTMMQRCLVDSMEVWRDYVFTDGYQRPFGNKEVWVGYD 437

Query: 324 IAEEGGDNTVVVL----RRGPVIEHLF--DWSKTDLRTTNNKISGLVEKYRPDAIIIDAN 377
            +  G  + +VV+      G     L    +   D      +I  +  KY    + ID  
Sbjct: 438 PSYTGDRSALVVIAPPKVDGGKFRLLEYRTFKGADFAEQAAEIVAICAKYNVTRLAIDTT 497

Query: 378 NTGARTCDYLEM 389
             G    + ++ 
Sbjct: 498 GLGVGVYEIVKK 509


>gi|33601198|ref|NP_888758.1| putative phage terminase [Bordetella bronchiseptica RB50]
 gi|33602480|ref|NP_890040.1| putative phage terminase [Bordetella bronchiseptica RB50]
 gi|33575633|emb|CAE32711.1| putative phage terminase [Bordetella bronchiseptica RB50]
 gi|33576919|emb|CAE33999.1| putative phage terminase [Bordetella bronchiseptica RB50]
          Length = 425

 Score = 48.6 bits (114), Expect = 0.003,   Method: Composition-based stats.
 Identities = 72/439 (16%), Positives = 127/439 (28%), Gaps = 62/439 (14%)

Query: 75  PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKW 131
           P  F+  + AG G GKT +    +       P ++    A +  Q++   +    EV+  
Sbjct: 15  PHKFRAFV-AGFGSGKTWVGGAGLCRHAWEFPRVNSGYFAPTYGQIRDIFYPTIEEVAHD 73

Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191
             L    +    +        +                  +CR  S E+P   VG     
Sbjct: 74  WGLAAKINESNKEVHLFAGRKYRGT--------------VICR--SMEKPGDIVGFKIGK 117

Query: 192 GMAIINDEASGTPDV----INLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIFNKPL 245
           G+    DE               I+  L  T         +T+ P       Y+ F K +
Sbjct: 118 GL---IDELDVMKTDKAALAWRKIIARLRHTAPGLINGVDVTTTPEG-FKFVYQQFVKQV 173

Query: 246 DD-------WKRFQIDTRTV-EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFI 297
            +       +   Q  T    + +   +   + A Y     +    + GQF      S  
Sbjct: 174 RERPDLVALYGLVQASTYENGKNLPEDYIPSLRASY--PPQLIAAYLRGQFTNLTSGSVY 231

Query: 298 PLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTN 357
           P N      + +   +P+  L +G D        TV V+R G  +         D     
Sbjct: 232 P-NFDRRLHHTDAAEEPHEELHIGMDFNVLNMTATVNVIRAGLPLTVGELTKVRDTPEMA 290

Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDY---------LEMLGYHVYRVLGQKRAVDLEF 408
             +    +  +   + I  + +G  T            L   G+ V RV  +  AV    
Sbjct: 291 RMLKERFKD-KGHGVTIYPDASGGNTSSKNASESDLSILRKAGFTV-RVNSRNPAVKDRI 348

Query: 409 CRNRRTELHVK-MADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTD 467
                  L+ +    WL     +N        ++L+       G    E  +  G    +
Sbjct: 349 NAVNGMLLNDEGARRWL-----VNTDRCPTLTEALEQQAYDKNG----EPDKSTGHDHPN 399

Query: 468 YSDGLMYTFAENPPRSDMD 486
            + G           + M 
Sbjct: 400 DAQGYFLVHRYPITPTGMS 418


>gi|289807324|ref|ZP_06537953.1| hypothetical protein Salmonellaentericaenterica_24067 [Salmonella
           enterica subsp. enterica serovar Typhi str. AG3]
          Length = 96

 Score = 48.2 bits (113), Expect = 0.003,   Method: Composition-based stats.
 Identities = 9/45 (20%), Positives = 16/45 (35%), Gaps = 3/45 (6%)

Query: 443 KSFIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
                   G + +ESK+    +   S + +D  +  FA      D
Sbjct: 43  PHRDFDRNGRVMVESKKDLAKRDIPSPNVADAFIMAFAPTDTSLD 87


>gi|109289938|ref|YP_655470.1| terminase ATPase subunit [Mannheimia phage phiMHaA1]
 gi|90110544|gb|ABD90554.1| terminase ATPase subunit [Mannheimia phage phiMhaA1-PHL101]
 gi|90110594|gb|ABD90603.1| terminase ATPase subunit [Mannheimia phage phiMhaA1-BAA410]
          Length = 605

 Score = 48.2 bits (113), Expect = 0.003,   Method: Composition-based stats.
 Identities = 18/132 (13%), Positives = 38/132 (28%), Gaps = 18/132 (13%)

Query: 276 DSDVTRVEVCGQFPQQDIDSF----IPLNIIEEA--------LNREPCPDPYAPLIMGCD 323
             D        +F   +   F    +   +++           +    P     + +G D
Sbjct: 371 SPDEFEQLFMCEFIDDNQSVFKFTMMQRCLVDSMEVWRDYVFTDGYQRPFGNKEVWVGYD 430

Query: 324 IAEEGGDNTVVVL----RRGPVIEHLF--DWSKTDLRTTNNKISGLVEKYRPDAIIIDAN 377
            +  G  + +VV+      G     L    +   D      +I  +  KY    + ID  
Sbjct: 431 PSYTGDRSALVVIAPPKVDGGKFRLLEYRTFKGADFAEQAAEIVAICAKYNVTRLAIDTT 490

Query: 378 NTGARTCDYLEM 389
             G    + ++ 
Sbjct: 491 GLGVGVYEIVKK 502


>gi|315268220|gb|ADT95073.1| terminase, ATPase subunit [Shewanella baltica OS678]
          Length = 589

 Score = 48.2 bits (113), Expect = 0.003,   Method: Composition-based stats.
 Identities = 52/343 (15%), Positives = 107/343 (31%), Gaps = 42/343 (12%)

Query: 163 GIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD--VINLGILGFLTERNA 220
            +    Y  + +     +  + +  H  +    I+  +S T D      G L       A
Sbjct: 249 NLYLDEYFWIHKFQEFRKVASGMAIHAKWRQTYISTPSSITHDAYPFWTGKLFNRGRPKA 308

Query: 221 NRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDV 279
           +R  I  S+    +G+           W++   +D    +G +    + +   Y    D 
Sbjct: 309 DRIEIDVSHSALANGR-----RCEDGQWRQVVTVDDAIRKGCNLFDPDTLHLEY--SPDE 361

Query: 280 TRVEVCGQFPQQDIDSFIPLNIIEEALNR-----------EPCPDPYAPLIMGCDIAEEG 328
               +  +F   D  S  P+ +++  +              P P  +  + +G D  + G
Sbjct: 362 YSNLLMCEFID-DTMSVFPMVMMQRCMVDSWEVWTDYKPFAPRPLAHREVWIGYDPNKGG 420

Query: 329 GDNTVVVLRR------GPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTG 380
             ++   +        G     +    W+  D       I  +  KY    I ID    G
Sbjct: 421 KGDSAGCIVICPPAVPGGKFRVIEKHRWNGMDFEAQAKAIQDICNKYNVTFIGIDTTGLG 480

Query: 381 ARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG---LIQ 437
                 ++     V   L              ++++ +K  D +    L   +G   L Q
Sbjct: 481 EAVYQLVKKFFPQVTPFLYNPV---------LKSQMVIKAYDVISKGRLEYDAGWTDLAQ 531

Query: 438 NLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENP 480
              S++  +  +  ++  ES R +     D +   M+     P
Sbjct: 532 AFMSIRKTLTASGKQVTYESARSEEISHADIAWAAMHALYNEP 574


>gi|254360872|ref|ZP_04977019.1| bacteriophage terminase large subunit [Mannheimia haemolytica
           PHL213]
 gi|153092346|gb|EDN73415.1| bacteriophage terminase large subunit [Mannheimia haemolytica
           PHL213]
          Length = 600

 Score = 48.2 bits (113), Expect = 0.003,   Method: Composition-based stats.
 Identities = 18/132 (13%), Positives = 38/132 (28%), Gaps = 18/132 (13%)

Query: 276 DSDVTRVEVCGQFPQQDIDSF----IPLNIIEEA--------LNREPCPDPYAPLIMGCD 323
             D        +F   +   F    +   +++           +    P     + +G D
Sbjct: 366 SPDEFEQLFMCEFIDDNQSVFKFTMMQRCLVDSMEVWRDYVFTDGYQRPFGNKEVWVGYD 425

Query: 324 IAEEGGDNTVVVL----RRGPVIEHLF--DWSKTDLRTTNNKISGLVEKYRPDAIIIDAN 377
            +  G  + +VV+      G     L    +   D      +I  +  KY    + ID  
Sbjct: 426 PSYTGDRSALVVIAPPKVDGGKFRLLEYRTFKGADFAEQAAEIVAICAKYNVTRLAIDTT 485

Query: 378 NTGARTCDYLEM 389
             G    + ++ 
Sbjct: 486 GLGVGVYEIVKK 497


>gi|322412171|gb|EFY03079.1| phage terminase [Streptococcus dysgalactiae subsp. dysgalactiae
           ATCC 27957]
          Length = 471

 Score = 48.2 bits (113), Expect = 0.003,   Method: Composition-based stats.
 Identities = 51/347 (14%), Positives = 111/347 (31%), Gaps = 47/347 (13%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQ   +  + A   + +       +          GKT +   + LW +    G+ ++ 
Sbjct: 43  PWQENMLIPIMAIDEDGLWVHQKYGYAIPRRN----GKTEVVYIVELWAL--HKGLKILH 96

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
            A+  +      + +V K+L +     + + +    + A     +   + G   +  +  
Sbjct: 97  TAHRISTSHA-SFEKVKKYLEMS---GYVDGEDFISNKAKGQERIEFKASGAVIQFRT-- 150

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232
            RT +    + F          +I DEA          +   +T+ + N   IM   P  
Sbjct: 151 -RTSNGGLGEGFD--------LLIIDEAQEYTSEQESALKYTVTDSD-NPMTIMCGTPPT 200

Query: 233 L--SGKFYEIFNKP----------LDDWKRFQ------IDTRTVEGIDPSFH---EGIIA 271
           +  +G  +E + K             +W   +      + +  +      FH     I A
Sbjct: 201 IVSTGTVFEAYRKDCLKGNKRYSGWAEWSVPEMVKINDVSSWYISNPSMGFHLNERKIEA 260

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330
             G D     ++  G +   +  S I      +  L  E  P+  + L +G    ++G +
Sbjct: 261 ELGEDEIDHNIQRLGYWSSFNQKSVISEKEWAK--LKVEQVPELKSKLFVGIKFGQDGNN 318

Query: 331 NT-VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
            +  +  R       +       +R     I   ++      ++ID 
Sbjct: 319 VSLSIAARTSENKVFVEVIDCLSVRNGTQWIINFLKSADIAKVVIDG 365


>gi|194289059|ref|YP_002004966.1| bacteriophage p2 gpp capsid protein; terminase, atpase subunit
           [Cupriavidus taiwanensis LMG 19424]
 gi|193222894|emb|CAQ68899.1| bacteriophage P2 GPP capsid protein; Terminase, ATPase subunit
           [Cupriavidus taiwanensis LMG 19424]
          Length = 593

 Score = 48.2 bits (113), Expect = 0.003,   Method: Composition-based stats.
 Identities = 23/137 (16%), Positives = 38/137 (27%), Gaps = 21/137 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE-----------PCPDP 314
            E +   Y          +   F      S  PL+++   +              P P  
Sbjct: 355 LEQLRREY--SDADFENLLMCGFIDDTA-SVFPLSMLMRCMVDSWEVWEDFRHWSPRPLG 411

Query: 315 YAPLIMGCDIAEEGGDNTVVVLRR-----GPVIEHLFD--WSKTDLRTTNNKISGLVEKY 367
              + +G D    GGD+  +V+       G     L    +   D       +  + E+Y
Sbjct: 412 NREVWVGYDPNGGGGDSAALVVVAPPLVPGGKFRVLEKHQFRGIDYEEQAAAVLKVCERY 471

Query: 368 RPDAIIIDANNTGARTC 384
               I ID    G    
Sbjct: 472 NVTYIGIDRTGVGDAVY 488


>gi|154488071|ref|ZP_02029188.1| hypothetical protein BIFADO_01641 [Bifidobacterium adolescentis
           L2-32]
 gi|154083544|gb|EDN82589.1| hypothetical protein BIFADO_01641 [Bifidobacterium adolescentis
           L2-32]
          Length = 477

 Score = 48.2 bits (113), Expect = 0.003,   Method: Composition-based stats.
 Identities = 59/376 (15%), Positives = 107/376 (28%), Gaps = 60/376 (15%)

Query: 52  RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISA-GRGIGKTTLNAWLVLWLMSTRPGISV 110
             WQ +   +V A   +   +      + A+ +  R  GKT    W+ +   +  PG+ +
Sbjct: 37  DPWQRQINRIVLAKSADGFWSA-----RNAVLSIPRQTGKTYDIGWVAIHRAARTPGMRI 91

Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLH---CSLGIDSK 167
           +  A               +  S++ +        +         D  H    + G +  
Sbjct: 92  VWTA---------------QHFSVIKDTFESLCAIVLRPEMSGLVDPDHGISLAAGKEEI 136

Query: 168 HYSTMCRT-YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIM 226
            +    R  +         G        ++ DEA    D     +L     R  N   I 
Sbjct: 137 RFRNGSRIFFRARERGALRGV--KKIALLVIDEAQHLSDSAMASMLPT-QNRAYNPQTIY 193

Query: 227 TSNPRRLSGKFYEIFNKPLDDWKRFQIDT---------RTVEGIDPSFHEGIIARY---- 273
              P        E F +  D  +  +  +         R  + +D          Y    
Sbjct: 194 MGTPPGPRDNG-EAFTRLRDKARAGRTHSTLYVEFAADRDADPLDREQWRKANPSYPAHT 252

Query: 274 ----------GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCD 323
                      L  D  R E  G + +  +   I     EEA      P     +  G D
Sbjct: 253 SDESIANLWENLTGDDFRREALGIWDEHALSRAIDRRQWEEATIDARRP--GGVMSFGID 310

Query: 324 IAEEGGDNTV---VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEK--YRPDAIIIDANN 378
           +  +    T+   +    G     L ++  T+   T      L++K   +  A++ID  +
Sbjct: 311 MNPQRTRLTIGACMRYDDGTAHIELAEYRDTNHDGT-MWAVNLIDKVWEQTAALVIDGQS 369

Query: 379 TGARTCDYLEMLGYHV 394
                   L   G  V
Sbjct: 370 PATALLPDLAEAGITV 385


>gi|264678567|ref|YP_003278474.1| hypothetical protein CtCNB1_2432 [Comamonas testosteroni CNB-2]
 gi|262209080|gb|ACY33178.1| hypothetical conserved protein [Comamonas testosteroni CNB-2]
          Length = 322

 Score = 48.2 bits (113), Expect = 0.003,   Method: Composition-based stats.
 Identities = 23/133 (17%), Positives = 40/133 (30%), Gaps = 19/133 (14%)

Query: 275 LDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR------------EPCPDPYAPLIMGC 322
              DV       QF   D  +  PL++++  +                 P  Y  + +G 
Sbjct: 86  KSEDVFNNLYMCQFVD-DALAVFPLSVLQRCMVDSWDAWRKDFKAFAQRPFGYKRVWVGY 144

Query: 323 DIAEEGGDNTVVVL----RRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
           D +  G    +VVL    + G     L+       D       I  + + Y  + + ID 
Sbjct: 145 DPSLTGDKAALVVLAPPDKPGGKCRILYKVQLHGVDFEAQAAAIKKVCDSYSVEKMTIDI 204

Query: 377 NNTGARTCDYLEM 389
              G      +  
Sbjct: 205 TGLGNGVYQLVRK 217


>gi|56808979|ref|ZP_00366686.1| COG1783: Phage terminase large subunit [Streptococcus pyogenes M49
           591]
 gi|71910836|ref|YP_282386.1| terminase large subunit [Streptococcus pyogenes MGAS5005]
 gi|71853618|gb|AAZ51641.1| terminase large subunit [Streptococcus pyogenes MGAS5005]
          Length = 424

 Score = 48.2 bits (113), Expect = 0.003,   Method: Composition-based stats.
 Identities = 37/287 (12%), Positives = 85/287 (29%), Gaps = 35/287 (12%)

Query: 57  EFMEVVDAHCLNSVNNP-NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLAN 115
           +  +++       V    NP++   A   GRG GK++  A+++  L+   P ++ +C+  
Sbjct: 4   DLADIIPIGFKPVVQATWNPQILNIACKGGRGSGKSSNIAFIISRLIIQYP-VNAVCIRK 62

Query: 116 SETQLKTTLWAEVSKWLSLLPNKHWFEMQ--------SLSLHPAPWYSDVLHCSLGIDSK 167
           ++  L+ +++ ++   +S    + +F+              +   +        +     
Sbjct: 63  TDNTLEQSVYEQIKWAISEQGLERYFKFNKSPLRITYIPRGNYIVFRGAQNPERIKSLKD 122

Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMT 227
                   +     +               DE     + +  G LG          +  T
Sbjct: 123 SRFPFAIGW----IEELAEFKTE-------DEVKTITNSLLRGELG----DGLFYKFFYT 167

Query: 228 SNPRRLSGKF----YEIFNKPLDDWKRFQIDTR-TVEGIDPSFHEGIIARYGLDSDVTRV 282
            NP +    +    YE   +P + +      T      I   F     A         R 
Sbjct: 168 YNPPKRKQSWVNKKYESQFQPSNTF--VHASTYKDNPFIAKEFIAEAEATRERSERRYRW 225

Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGG 329
           E  G+         +P + +      +     +  +  G D      
Sbjct: 226 EYLGEAIGS---GVVPFDNLRFERITDEQVADFDNIRNGIDYGYATD 269


>gi|225575978|ref|YP_002724813.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1]
 gi|225576296|ref|YP_002725339.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1]
 gi|225547342|gb|ACN93326.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1]
 gi|225547454|gb|ACN93434.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1]
          Length = 450

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 33/163 (20%), Positives = 54/163 (33%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294
            F   +      +K ++  T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNTATFKTYKFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NII++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINIIQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299


>gi|17546658|ref|NP_520060.1| terminase (ATPase subunit) related protein [Ralstonia solanacearum
           GMI1000]
 gi|17428957|emb|CAD15641.1| probable terminase (atpase subunit) related protein [Ralstonia
           solanacearum GMI1000]
          Length = 506

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 22/137 (16%), Positives = 40/137 (29%), Gaps = 21/137 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-----------EPCPDP 314
            + +   Y          +   F   +  S  PL+++   +              P P  
Sbjct: 355 LDQLRLEYSE--PEFANLLMCAFIDDNA-SVFPLSMLMRGMVDSWEAWEDFRPFAPRPFG 411

Query: 315 YAPLIMGCDIAEEGGDNTVVVLRR-----GPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367
             P+ +G D    GGD+  +V+       G     L    +   D       I  + E++
Sbjct: 412 NRPVWVGYDPNGGGGDSAALVVVAPPLVPGGKFRVLERHQFRGIDYEEQAGAIRRVCERF 471

Query: 368 RPDAIIIDANNTGARTC 384
               + ID    G    
Sbjct: 472 NVAYVGIDRTGIGDAVF 488


>gi|134288784|ref|YP_001111035.1| gp4, phage terminase, ATPase subunit [Burkholderia phage phiE202]
 gi|134131997|gb|ABO60745.1| gp4, phage terminase, ATPase subunit [Burkholderia phage phiE202]
          Length = 589

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 19/142 (13%), Positives = 39/142 (27%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y  +       +   F    + S   L+ ++  +                 P 
Sbjct: 350 IDELRREYSAEE--FANLLMCHFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            +  + +G D A  G    +VV+       G   +     +   D       I  + ++Y
Sbjct: 407 GHREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488


>gi|224593667|ref|YP_002641021.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           CA-11.2a]
 gi|224554694|gb|ACN56072.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           CA-11.2a]
          Length = 450

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 54/163 (33%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K ++  T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYKFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|323188196|gb|EFZ73489.1| terminase, ATPase subunit [Escherichia coli RN587/1]
          Length = 594

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 28/162 (17%), Positives = 49/162 (30%), Gaps = 21/162 (12%)

Query: 246 DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
             W++   ++    +G D    E +        +        QF      S  PL++++ 
Sbjct: 333 GQWRQIVTVEDAINQGYDLFDLEQLRLE--NSPEEFANLFMCQFIDDTA-SVFPLSMLQG 389

Query: 305 AL-NREPCPDPYAP----------LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFD-- 347
            + +     D Y P          + +G D A  G     VV+      G     +    
Sbjct: 390 CMVDSWEVWDDYKPFALRPLGERSVWVGYDPALSGDSAGCVVVAPPVIEGGKFRVIEKHQ 449

Query: 348 WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
           W   D       I  + E+Y    I ID    G      ++ 
Sbjct: 450 WHGMDFAAQAENIRKITERYNVTYIGIDVTGIGHGVHQLVKQ 491


>gi|221067857|ref|ZP_03543962.1| phage terminase, large subunit, PBSX family [Comamonas testosteroni
           KF-1]
 gi|220712880|gb|EED68248.1| phage terminase, large subunit, PBSX family [Comamonas testosteroni
           KF-1]
          Length = 434

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 56/330 (16%), Positives = 112/330 (33%), Gaps = 39/330 (11%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GK+   A ++L + ++RP   V+C              E+ K           + 
Sbjct: 39  GGRGGGKSWTVAAVLLVMAASRPL-RVLCT------------REIQK-SIKQSVHQLLKD 84

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHY-STMCRTYSEERPDTFVGHHNTYGMAIINDEASG 202
               L+   ++  +     GI+   +  +  ++++ +   +F G        +  +EA G
Sbjct: 85  VIARLNLHAFFEVLETEVRGINGSLFLFSGLQSHTVDSIKSFEGCD-----IVWVEEAHG 139

Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF-NKPLDDWKRFQIDTRTVEGI 261
                   ++  + +  +  +  +  NP   + + Y+ F   P  D    +I+ R     
Sbjct: 140 VSKKSWDTLIPTIRKEGSEIWLTL--NPDMETDETYQRFIATPCPDTWVVEINWRDNPWF 197

Query: 262 DPSFHEGII-ARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE---EALNREPCPDPYAP 317
                E    A+  + +D       G+  +    +     +     +   R+   DP  P
Sbjct: 198 PRVLDEERRKAKRTMLADDYAHIWEGKARRVAAGAIYRHEMESVYLDNRARDVPYDPTLP 257

Query: 318 LIMGCDIAEEGGDNTVVVLRRGP-----VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI 372
           +    D+     D   + L +       +I H+ D  +T L     K+  L  ++  D +
Sbjct: 258 VHTVWDLGWN--DAMSIALVQRGPQDVRIIGHIEDSHRT-LDWYVAKLEKLPYRWGTDYL 314

Query: 373 IIDAN----NTGARTCDYLEMLGYHVYRVL 398
             D       TG  T   L  LG     V 
Sbjct: 315 PHDGKTRNFQTGKSTEQLLRELGRRSVMVQ 344


>gi|254192775|ref|ZP_04899210.1| putative terminase, ATPase subunit [Burkholderia pseudomallei S13]
 gi|254197102|ref|ZP_04903525.1| putative terminase, ATPase subunit [Burkholderia pseudomallei S13]
 gi|169649529|gb|EDS82222.1| putative terminase, ATPase subunit [Burkholderia pseudomallei S13]
 gi|169653844|gb|EDS86537.1| putative terminase, ATPase subunit [Burkholderia pseudomallei S13]
          Length = 601

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y  +       +  QF    + S   L+ ++  +                 P 
Sbjct: 362 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 418

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            Y  + +G D A  G    +VV+       G   +     +   D       I  + ++Y
Sbjct: 419 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQATAIEAITQRY 478

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 479 NVGYIAIDTTGMGQGVYQLVRK 500


>gi|167839678|ref|ZP_02466362.1| phage terminase, ATPase subunit [Burkholderia thailandensis MSMB43]
          Length = 589

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 20/138 (14%), Positives = 37/138 (26%), Gaps = 21/138 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y  +       +   F    + S   L+ ++  +                 P 
Sbjct: 350 IDELRREYSAEE--FANLLMCHFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367
            +  + +G D A  G    +VV+      G     L    +   D       I  +  +Y
Sbjct: 407 GHREVWVGYDPALTGDSAGLVVVAPPRVDGGAFRVLERHQFRGNDFEEQAAAIEAITRRY 466

Query: 368 RPDAIIIDANNTGARTCD 385
               I ID    G     
Sbjct: 467 NVGYIAIDTTGMGQGVYQ 484


>gi|82776058|ref|YP_402405.1| hypothetical protein SDY_0732 [Shigella dysenteriae Sd197]
 gi|33323489|gb|AAQ07461.1| HI1410 hypothetical protein-like protein [Shigella flexneri]
 gi|81240206|gb|ABB60916.1| hypothetical bacteriophage protein [Shigella dysenteriae Sd197]
          Length = 97

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 9/45 (20%), Positives = 16/45 (35%), Gaps = 3/45 (6%)

Query: 443 KSFIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
                   G + +ESK+    +   S + +D  +  FA      D
Sbjct: 44  PHRDFDRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 88


>gi|308071887|emb|CBW54808.1| putative DNA maturase B [Pantoea phage LIMElight]
          Length = 614

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 67/461 (14%), Positives = 132/461 (28%), Gaps = 89/461 (19%)

Query: 2   SRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEV 61
            R +P++  TE  +        + ++F  F    +     G    GF      Q +  + 
Sbjct: 25  PRTIPSDKRTELAMM-------LAITFKEFRDFAY----VGMRFLGFELTDM-QADIADY 72

Query: 62  VDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQ-- 119
           +                K  ++A RG  K+TL A   +W +       V+ L+  E Q  
Sbjct: 73  MQYG-----------PRKKMVAAQRGEAKSTLAALYSVWRLIQDQRCRVLILSGGEQQAS 121

Query: 120 -LKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178
            + T +   +  W    P   W +  S       + +  +HC L    K  S  C   + 
Sbjct: 122 EVATLVIRLIETW----PLLCWLKADSTRGDRTSYTAYDVHCDLKPLDKSPSVACIGVTA 177

Query: 179 ERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRF--------------W 224
               +  G                 PD I     G                         
Sbjct: 178 ----SLQGKRADLL----------IPDDIETTKNGMTQTEREKLLTVSKDFAAICTHGDT 223

Query: 225 IMTSNPRRLSGKFYEIFNKPL------DDWKRFQIDTRTVEGIDPSFHEGI-----IARY 273
           +    P+     +  +  +              +++ R  E + P  HE I      + Y
Sbjct: 224 LYLGTPQTKDSIYKTLPARGFEVRVWPGRIPSLEMEERYGETLAPYIHELIAAGYSRSGY 283

Query: 274 GLDSDVTRVEVCGQFPQQD----IDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGG 329
           G+D  + +    G++ + D       F P     + +      D     I   D+   GG
Sbjct: 284 GVDGTLGQSTDTGRYSEDDLIEKELDFGPEGFQLQYMLDTSLLDAMRTKIKLSDLLIHGG 343

Query: 330 DNTV-----VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN------ 378
           D        +       +   ++  + +           + +Y+   ++ID         
Sbjct: 344 DTDTAPDRFMYAADRRNLVEEYEPIRGEKLYYPAGTGSEMLQYKHKLMVIDPAGCGGDEI 403

Query: 379 ---TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTEL 416
               G     Y+ +  + V    G     +++   +   E+
Sbjct: 404 SYAAGGAVSSYIHL--FSVGGFQGGVSTENIDKVIDLAIEM 442


>gi|254197041|ref|ZP_04903465.1| putative terminase, ATPase subunit [Burkholderia pseudomallei S13]
 gi|169653784|gb|EDS86477.1| putative terminase, ATPase subunit [Burkholderia pseudomallei S13]
          Length = 576

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 21/142 (14%), Positives = 40/142 (28%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y  +       +  QF    + S   L+ ++  +                 P 
Sbjct: 337 IDELRREYSAEE--FANLLMCQFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 393

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            Y  + +G D A  G    +VV+       G   +     +   D       I  + ++Y
Sbjct: 394 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQATAIEAITQRY 453

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 454 NVGYIAIDTTGMGQGVYQLVRK 475


>gi|237720954|ref|ZP_04551435.1| phage terminase large subunit [Bacteroides sp. 2_2_4]
 gi|229449789|gb|EEO55580.1| phage terminase large subunit [Bacteroides sp. 2_2_4]
          Length = 450

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 44/242 (18%), Positives = 81/242 (33%), Gaps = 40/242 (16%)

Query: 262 DPSFHEGIIARYGLDSDVTRVE-VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIM 320
           DP++   ++ +    SD  R   + G +  +     I      EAL R           +
Sbjct: 201 DPTYLANLVNQ----SDEQRARDLDGNWKYKAAGDDIIKLTHMEALYRNSMQIGDGIRRV 256

Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY--RPDAIIIDANN 378
            CD A EGGD+ V+ L  G  I  +F   K D + T + +  ++E++  R +    D N 
Sbjct: 257 SCDAAFEGGDSLVMWLWEGWHIRDIFV-CKLDSKKTVDTVKAMLEEWHVREECFTYDLNG 315

Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEF----CRNRRTELHVKMADWLEFASLINHSG 434
            G          G+    +    +    E       N +++     A  +    +     
Sbjct: 316 LGQI------FKGFFPNAIPFNNKEAVEEKFKYIYANLKSQAAYLFAQKIINREISIEPT 369

Query: 435 LIQNLKSLKSFIVPNTGELA-IESKRVKG---------------------AKSTDYSDGL 472
           L++   S K F      ++   E K ++                        S D+ + L
Sbjct: 370 LLERKFSGKGFEKVPLRQILDKERKAIRKDEDSEEKGWTIIKKIIMKKLVGHSPDFIEAL 429

Query: 473 MY 474
           + 
Sbjct: 430 LM 431


>gi|226941496|ref|YP_002796570.1| Terminase [Laribacter hongkongensis HLHK9]
 gi|226716423|gb|ACO75561.1| Terminase [Laribacter hongkongensis HLHK9]
          Length = 578

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 24/129 (18%), Positives = 40/129 (31%), Gaps = 18/129 (13%)

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIM 320
           R     D  R     +F      S  PL+++   +       P           +  + +
Sbjct: 345 RLENSPDEFRQLFECEFIDDG-KSVFPLSMLHRCMVDSMEAWPDYNPFTLRPLGHREVWI 403

Query: 321 GCDIAEEGGDNTVVVLR-----RGP-VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIII 374
           G D +E G    +VV+       GP  +     +   D       I    ++YR   I I
Sbjct: 404 GYDPSESGDSAAMVVVAPPAVPDGPFRLLECRQFRGLDYSAQAQAIKEATDRYRVTHIAI 463

Query: 375 DANNTGART 383
           D    G+  
Sbjct: 464 DRTGLGSAV 472


>gi|58581337|ref|YP_200353.1| phage-related terminase [Xanthomonas oryzae pv. oryzae KACC10331]
 gi|58425931|gb|AAW74968.1| phage-related terminase [Xanthomonas oryzae pv. oryzae KACC10331]
          Length = 594

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 23/142 (16%), Positives = 41/142 (28%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR------------EPCPD 313
            + +   Y    D     +   F      S   L +++  +                 P 
Sbjct: 356 IDELREEY--SPDAFANLLMCDFVDDGA-SIFSLAMLQPCMVDSWVEWGQDYKPFAVRPY 412

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367
               + +G D AE G    +VVL    + G     L    +   D      +I  +  +Y
Sbjct: 413 GDRAVWIGYDPAETGDTAGLVVLAPPQQPGGKFRLLERIQFRGMDFAKQAAEIERITRRY 472

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G+     ++ 
Sbjct: 473 WVTYIGIDTTGMGSGVAQLVKQ 494


>gi|154247555|ref|YP_001418513.1| hypothetical protein Xaut_3628 [Xanthobacter autotrophicus Py2]
 gi|154161640|gb|ABS68856.1| protein of unknown function DUF264 [Xanthobacter autotrophicus Py2]
          Length = 690

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 48/322 (14%), Positives = 96/322 (29%), Gaps = 63/322 (19%)

Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNANRF--WIMTSNPRRLSGKFYEIFNKPLDDW 248
           +   +  +E S  P    L     L +         +   NP       Y  F +  D  
Sbjct: 380 HNCTLYFNECSQIPYSSILVARTRLAQVVPGLMQRALYDLNPAGTGHWTYREFIEGRDPI 439

Query: 249 KRFQIDTRTV------------EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF 296
               +                 + + P F   + A              G +  +   + 
Sbjct: 440 SGAPLSAPDNFQHMFLNPGDNAKNLSPEFLRSLEALPEKQRRRF---FDGMYVAEIDGAL 496

Query: 297 IPLNIIEEALNREPCPDPYA--PLIMGCD------------------IAEEGGDNTVVVL 336
             L++IE   +    P  +    +++G D                  +A    D T V+L
Sbjct: 497 WTLDLIERCRSEPIAPGDHRLRRIVIGVDPSGAANKEDARSDEIGIVVAGMMDDGTAVIL 556

Query: 337 RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYR 396
             G V +    W K         ++GL  K+  D +I + N  G    +++ +       
Sbjct: 557 EDGTVRDGPSGWGKV--------VAGLYHKWGADRVIAERN-YGGAMVEFVILTADKSIP 607

Query: 397 VLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIE 456
           V      +     ++ R E    ++   E    + H+G    ++  +      +G +   
Sbjct: 608 V----SVITASRGKHIRAE---PVSALYEQGK-VRHAGRFPEMED-QFTNFSTSGYM--- 655

Query: 457 SKRVKGAKSTDYSDGLMYTFAE 478
                G +S D +D  ++   E
Sbjct: 656 -----GDRSPDRADAAVWALTE 672


>gi|226939350|ref|YP_002794423.1| Terminase [Laribacter hongkongensis HLHK9]
 gi|226714276|gb|ACO73414.1| Terminase [Laribacter hongkongensis HLHK9]
          Length = 578

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 25/129 (19%), Positives = 40/129 (31%), Gaps = 18/129 (13%)

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-----------YAPLIM 320
           R     D  R     +F      S  PL+++   +       P           +  + +
Sbjct: 345 RLENSPDEFRQLFECEFIDDG-KSVFPLSMLHRCMVDSMEAWPDYNPFTLRPLGHREVWI 403

Query: 321 GCDIAEEGGDNTVVVLR-----RGP-VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIII 374
           G D +E G    +VV+       GP  +     +   D       I    E+YR   I I
Sbjct: 404 GYDPSESGDSAAMVVVAPPAVPDGPFRLLECRQFRGLDYSAQAQAIKEATERYRVTHIAI 463

Query: 375 DANNTGART 383
           D    G+  
Sbjct: 464 DRTGLGSAV 472


>gi|213427183|ref|ZP_03359933.1| terminase subunit [Salmonella enterica subsp. enterica serovar
           Typhi str. E02-1180]
          Length = 195

 Score = 47.8 bits (112), Expect = 0.004,   Method: Composition-based stats.
 Identities = 19/83 (22%), Positives = 30/83 (36%), Gaps = 9/83 (10%)

Query: 311 CPDPYAPLIMGCDIAE--EGGDNT---VVVL--RRGPVIEHL--FDWSKTDLRTTNNKIS 361
            P  +  + +G D A+  + GD+    V+      G     L    W   D R   + I 
Sbjct: 8   RPFGWREVWIGYDPAKGTQNGDSAGCVVIAPPTVPGGKFRILERHQWRGMDFRAQADAIK 67

Query: 362 GLVEKYRPDAIIIDANNTGARTC 384
            L ++Y    I ID+   G    
Sbjct: 68  KLTQQYNVTYIGIDSTGVGHGVY 90


>gi|251778523|ref|ZP_04821443.1| phage terminase, large subunit, pbsx family [Clostridium botulinum
           E1 str. 'BoNT E Beluga']
 gi|243082838|gb|EES48728.1| phage terminase, large subunit, pbsx family [Clostridium botulinum
           E1 str. 'BoNT E Beluga']
          Length = 448

 Score = 47.8 bits (112), Expect = 0.005,   Method: Composition-based stats.
 Identities = 31/190 (16%), Positives = 61/190 (32%), Gaps = 10/190 (5%)

Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI-FNKPLDDWKRFQI 253
           I+ +E +         +   L  +N      +  NP   S   YE+ F    D+     +
Sbjct: 142 IVVEECTEIDKQEFSQLGLRLRSKNGYNQIHVMFNPISKSNWVYEMWFQNGYDESDTMVL 201

Query: 254 DT--RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNI--IEEALNRE 309
            T  +  + +   +   +I     D    R+   G+F    +D  I  N   ++    + 
Sbjct: 202 KTTYKDNKFLPYDYINALIKMKETDPVYYRIYALGEF--ASLDKLIYTNWEELDFDWRKL 259

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPV---IEHLFDWSKTDLRTTNNKISGLVEK 366
               PYA    G D       +  + +    V   I    ++ +  L      +  +   
Sbjct: 260 MQQRPYAKACFGLDFGYVNDPSAFIAMIVDEVNKEIYIFDEFYEKGLLNDALALKIVKRG 319

Query: 367 YRPDAIIIDA 376
           Y  + I  D+
Sbjct: 320 YGKEIIFADS 329


>gi|163746673|ref|ZP_02154030.1| hypothetical protein OIHEL45_14759 [Oceanibulbus indolifex HEL-45]
 gi|161379787|gb|EDQ04199.1| hypothetical protein OIHEL45_14759 [Oceanibulbus indolifex HEL-45]
          Length = 414

 Score = 47.8 bits (112), Expect = 0.005,   Method: Composition-based stats.
 Identities = 59/423 (13%), Positives = 109/423 (25%), Gaps = 73/423 (17%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGIS---------VICLANSETQLKTTLWAEVSKWL 132
           I  GRG GKT   A    W+ S   G           V  +  +  Q++  +        
Sbjct: 22  IMGGRGAGKTRAGA---EWVRSKVEGSRPLDPGECSRVALVGETIEQVREVM-------- 70

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
                   F    +     P        +          +   ++   P+   G      
Sbjct: 71  -------IFGDSGILACSPPDRRPDWEATRKRLVWPNGAIATVHTAHDPEGLRGPQFD-- 121

Query: 193 MAIINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250
            A   DE +           +   L          +T+ PR +     +   +       
Sbjct: 122 -AAWVDELAKWKRGQEAWDQLQFAL-RLGERPQVCVTTTPRNVD--VLKALLQSPSTVTT 177

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
                     +  SF E + ARY   + + R E+ G        +     ++E    R  
Sbjct: 178 HAPTEANAANLAGSFLEEVRARY-RGTRLGRQELDGVLLADAEGALWTSALLEA--GRVQ 234

Query: 311 CPDPYAPLIMGCDIA---EEGGDNTVVVLRRGPVIEHLFDWS----------KTDLRTTN 357
                  +++G D A     G D   +V+          +W                   
Sbjct: 235 VAPELDRIVVGLDPATTSGAGSDECGIVVVGAQTQGPPQEWRAVVLADCTVQGATPNGWA 294

Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRRTE 415
                 + +Y  D ++ + N  G    + L  +     +  V   +  V        R E
Sbjct: 295 QAAIAAMTRYGADRLVAEVNQGGQLVSEVLRQVDPLVSLKTVHAARGKV-------ARAE 347

Query: 416 LHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
               + +    + L     L   +  + +      G             S D  D L++ 
Sbjct: 348 PVAALYEQGRVSHLPGLDALEDQMCLMTARGYEGKG-------------SPDRVDALVWA 394

Query: 476 FAE 478
             E
Sbjct: 395 LHE 397


>gi|226940436|ref|YP_002795510.1| Terminase large subunit [Laribacter hongkongensis HLHK9]
 gi|226715363|gb|ACO74501.1| Terminase large subunit [Laribacter hongkongensis HLHK9]
          Length = 93

 Score = 47.8 bits (112), Expect = 0.005,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 35/80 (43%), Gaps = 10/80 (12%)

Query: 10 ETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNS 69
          + + +L +L    E       + LH + WG     LEG + PR+WQ E M  +  H  N 
Sbjct: 3  DIDDELIELA--AECATDPLRWALHAYDWGR--GELEGVTGPRAWQREVMSDIGNHLKNP 58

Query: 70 VNNPNPEVFKGAISAGRGIG 89
              +      A  AGRG+G
Sbjct: 59 ATRFS------AFDAGRGLG 72


>gi|219723016|ref|YP_002474442.1| phage terminase, large subunit, pbsx family protein [Borrelia
           burgdorferi 156a]
 gi|219692691|gb|ACL33908.1| phage terminase, large subunit, pbsx family protein [Borrelia
           burgdorferi 156a]
          Length = 450

 Score = 47.8 bits (112), Expect = 0.005,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNMATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|226246851|ref|YP_002776184.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           29805]
 gi|226202003|gb|ACO38584.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           29805]
          Length = 450

 Score = 47.8 bits (112), Expect = 0.005,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNMATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|83717940|ref|YP_439548.1| putative ATPase subunit of terminase (gpP-like) [Burkholderia
           thailandensis E264]
 gi|83651765|gb|ABC35829.1| Putative ATPase subunit of terminase (gpP-like) [Burkholderia
           thailandensis E264]
          Length = 601

 Score = 47.4 bits (111), Expect = 0.005,   Method: Composition-based stats.
 Identities = 20/142 (14%), Positives = 39/142 (27%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y  +       +   F    + S   L+ ++  +                 P 
Sbjct: 362 IDELRREYSAEE--FANLLMCHFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 418

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            Y  + +G D A  G    +VV+       G   +     +   D       I  + ++Y
Sbjct: 419 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 478

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 479 NVGYIAIDTTGMGQGVYQLVRK 500


>gi|257142677|ref|ZP_05590939.1| putative ATPase subunit of terminase (gpP-like) protein
           [Burkholderia thailandensis E264]
          Length = 589

 Score = 47.4 bits (111), Expect = 0.005,   Method: Composition-based stats.
 Identities = 20/142 (14%), Positives = 39/142 (27%), Gaps = 21/142 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
            + +   Y  +       +   F    + S   L+ ++  +                 P 
Sbjct: 350 IDELRREYSAEE--FANLLMCHFIDDSL-SVFKLSDLQRCMVDSWEEWADDFSPLLLRPF 406

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL-----RRG-PVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            Y  + +G D A  G    +VV+       G   +     +   D       I  + ++Y
Sbjct: 407 GYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRY 466

Query: 368 RPDAIIIDANNTGARTCDYLEM 389
               I ID    G      +  
Sbjct: 467 NVGYIAIDTTGMGQGVYQLVRK 488


>gi|269838926|ref|YP_003323618.1| hypothetical protein Tter_1890 [Thermobaculum terrenum ATCC
           BAA-798]
 gi|269790656|gb|ACZ42796.1| hypothetical protein Tter_1890 [Thermobaculum terrenum ATCC
           BAA-798]
          Length = 534

 Score = 47.4 bits (111), Expect = 0.005,   Method: Composition-based stats.
 Identities = 35/247 (14%), Positives = 71/247 (28%), Gaps = 33/247 (13%)

Query: 175 TYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLS 234
           T+    P+      NT  + ++ +EA          +   +          M +     +
Sbjct: 186 TFLSASPEASA-RGNTASLLLVANEAQDISPDRWDAVFDPMAASTNATTIFMGTVWTSRT 244

Query: 235 -----GKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGI---IARYGLDSDVTRVEVCG 286
                 ++       +   + F +  + V    P++ E +   IA+ G      R E   
Sbjct: 245 LLARQMRYLRQLELEVGRRRVFMVPWQEVARHVPAYGERVQARIAQLGRHHPFVRTEY-C 303

Query: 287 QFPQQDIDSFIPLNIIEEALN---REPCPDPYAPLIMGCDIAEEG--------GDNTVVV 335
                D   F P  + E       R+  P P        D+  E          D+T + 
Sbjct: 304 LEELSDDGGFFPPAVTERMRGDHPRQLLPTPGRTYAALLDVGGEDLAAGPSPRRDSTALT 363

Query: 336 LRRG-----------PVIEHLFDWSKTDLRTTNNKISGLVEK-YRPDAIIIDANNTGART 383
           +                +   + W+         ++  LV   +    +++DA   GA  
Sbjct: 364 IVEVCHPEGADLQPVYRVMTRYVWTGVGQPELLPQVVHLVRDVWACRRLVVDATGLGAGL 423

Query: 384 CDYLEML 390
              L  +
Sbjct: 424 ASALRRI 430


>gi|317490974|ref|ZP_07949410.1| terminase [Enterobacteriaceae bacterium 9_2_54FAA]
 gi|316920521|gb|EFV41844.1| terminase [Enterobacteriaceae bacterium 9_2_54FAA]
          Length = 590

 Score = 47.4 bits (111), Expect = 0.006,   Method: Composition-based stats.
 Identities = 29/182 (15%), Positives = 57/182 (31%), Gaps = 23/182 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +  E +  RY  +     +     F     D+      +
Sbjct: 328 PDGQWRYIITMEDAIRGGFNLADIERLRNRY--NDSTFAMLYMCVFVDSK-DAVFSFEDL 384

Query: 303 EEA-LNREP---------CPDPYAPLIMGCDIAEEGGDNT-------VVVLRRGPVIEHL 345
           E   ++R+           P     +  G D A  G  +T       V+ + +   +  +
Sbjct: 385 ERCGVDRDIWQDFDIKLKRPFGDREVWAGYDPARSGDLSTFAVLAPPVLAVEK-FRVLEI 443

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML-GYHVYRVLGQKRAV 404
            +W     R   N+I  L  KY    + ID    G    + ++   G     +    +  
Sbjct: 444 VNWHGMSFRWQANEIKKLFAKYNIRYLGIDVTGIGNAVFENIQHFAGRVAVPIRYSVKTK 503

Query: 405 DL 406
           D 
Sbjct: 504 DE 505


>gi|30062201|ref|NP_836372.1| hypothetical protein S0695 [Shigella flexneri 2a str. 2457T]
 gi|309786465|ref|ZP_07681089.1| phage terminase large subunit domain protein [Shigella dysenteriae
           1617]
 gi|30040446|gb|AAP16178.1| hypothetical bacteriophage protein [Shigella flexneri 2a str.
           2457T]
 gi|308925653|gb|EFP71136.1| phage terminase large subunit domain protein [Shigella dysenteriae
           1617]
 gi|313649746|gb|EFS14170.1| phage terminase large subunit domain protein [Shigella flexneri 2a
           str. 2457T]
 gi|332761021|gb|EGJ91309.1| phage terminase large subunit domain protein [Shigella flexneri
           4343-70]
 gi|332761177|gb|EGJ91463.1| phage terminase large subunit domain protein [Shigella flexneri
           2747-71]
 gi|332763392|gb|EGJ93632.1| phage terminase large subunit domain protein [Shigella flexneri
           K-671]
 gi|333008020|gb|EGK27496.1| phage terminase large subunit domain protein [Shigella flexneri
           K-218]
 gi|333021447|gb|EGK40697.1| phage terminase large subunit domain protein [Shigella flexneri
           K-304]
          Length = 77

 Score = 47.4 bits (111), Expect = 0.006,   Method: Composition-based stats.
 Identities = 9/45 (20%), Positives = 16/45 (35%), Gaps = 3/45 (6%)

Query: 443 KSFIVPNTGELAIESKR---VKGAKSTDYSDGLMYTFAENPPRSD 484
                   G + +ESK+    +   S + +D  +  FA      D
Sbjct: 24  PHRDFDRNGRVMVESKKDLAKREIPSPNVADAFIMAFAPIDTSLD 68


>gi|226315790|ref|YP_002776047.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           29805]
 gi|226201663|gb|ACO38256.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           29805]
          Length = 450

 Score = 47.4 bits (111), Expect = 0.006,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 54/163 (33%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D  + +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKPY-KDIPLYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|169794751|ref|YP_001712544.1| phage-related terminase, ATPase subunit (GPP-like) [Acinetobacter
           baumannii AYE]
 gi|169147678|emb|CAM85541.1| phage-related terminase, ATPase subunit (GPP-like) [Acinetobacter
           baumannii AYE]
          Length = 604

 Score = 47.4 bits (111), Expect = 0.006,   Method: Composition-based stats.
 Identities = 43/271 (15%), Positives = 84/271 (30%), Gaps = 53/271 (19%)

Query: 246 DDWKRFQIDTRTVEGI--DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303
           D   R  ++ +  E    D    E +IA      +        +F      S  PL++I+
Sbjct: 342 DKMWRHIVNIQDAERQGCDLFDIEELIAE--NSPEEFANLYMCEFVDDG-HSVFPLSVIQ 398

Query: 304 EALN------------REPCPDPYAPLIMGCDIAEEGGDN--TVVVLRRGPVIE-HLFDW 348
             +                 P    P+ +G D AE G      V+        +  L + 
Sbjct: 399 PCMVDSWEVWSKDFKPLALRPFGNKPVWIGYDPAESGDSAGLVVIAPPEPDYPKFRLLEH 458

Query: 349 SKTDLRTTNNK---ISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405
            +       ++   I  L  KY    I +D +  G      +                  
Sbjct: 459 HQFKGMDFASQAQYIKKLTTKYNVKYIGLDKSGMGTGVAQLV------------------ 500

Query: 406 LEFCRNRRTELH-VKMADWL--EFASLINHS---------GLIQNLKSLKSFIVPNTGEL 453
           L+F  N  T  + V +   L  +   +IN            +  ++ +++  +  +  ++
Sbjct: 501 LDFFPNLTTFNYSVDVKTQLVMKAMDVINKERFEFDAGSTDVAMSIMAIRKTLTASQRQM 560

Query: 454 AIESKRVKGAKSTDYSDGLMYTFAENPPRSD 484
             E+ R +     D +  + + FA  P   D
Sbjct: 561 TFEASRAENIGHADLAFAIFHAFANEPLTLD 591


>gi|239504148|ref|ZP_04663458.1| putative phage terminase [Acinetobacter baumannii AB900]
          Length = 413

 Score = 47.4 bits (111), Expect = 0.006,   Method: Composition-based stats.
 Identities = 53/317 (16%), Positives = 88/317 (27%), Gaps = 45/317 (14%)

Query: 81  AISAGRGIGKTTLNAWLV---LWLMSTRPGISVIC--LANSETQLKTTLWAEVSKWLSLL 135
           A  AG G GKT +    +    W         V     A +  Q++   +  +       
Sbjct: 7   AFVAGFGSGKTWVGCSSLCNKAW-----EFPKVPLGYFAPTYPQIRDIFFPTI-----EE 56

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
               W     +         +          + Y T     S E+P T VG    + +  
Sbjct: 57  VAFDWGLKTKVY--------ETNKEVDIYYGRQYRTTIICRSMEKPATIVGFKIGHAL-- 106

Query: 196 INDE----ASGTPDVINLGILGFLTERNANRF-WIMTSNPRRLSGKFYEIFNKP------ 244
             DE    A          I+  +  + A     I  +         YE F K       
Sbjct: 107 -IDELDVMAMTKAQQAWRKIIARMRFKQAGLLNGIDVATTPEGFKFTYEQFVKEANKSEA 165

Query: 245 -LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
               +   Q  T   E  +   +   +   Y     +    + GQF      +  P +  
Sbjct: 166 KRKLYGMIQASTYDNEANLPDDYISSLYESY--PPQLISAYLRGQFVNLTSGAVYP-DFD 222

Query: 303 EEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLRTTNNKIS 361
               + +       PL++G D         V V+R G     L +     D  T    I+
Sbjct: 223 RVLNHTDEEIKKGEPLLIGMDFNVLKMAAVVYVIREG-KPRALDELVGVRDTPTMCQLIN 281

Query: 362 GLVEKYRPDAIIIDANN 378
                +    +I DA+ 
Sbjct: 282 ERFPDH-DITVIPDASG 297


>gi|195942579|ref|ZP_03087961.1| hypothetical protein Bbur8_07059 [Borrelia burgdorferi 80a]
          Length = 450

 Score = 47.4 bits (111), Expect = 0.006,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 54/163 (33%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHRQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K ++  T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYKFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|70727357|ref|YP_254273.1| putative phage terminase large subunit [Staphylococcus haemolyticus
           JCSC1435]
 gi|68448083|dbj|BAE05667.1| putative phage terminase large subunit [Staphylococcus haemolyticus
           JCSC1435]
          Length = 421

 Score = 47.4 bits (111), Expect = 0.006,   Method: Composition-based stats.
 Identities = 32/244 (13%), Positives = 81/244 (33%), Gaps = 28/244 (11%)

Query: 56  LEFMEVVDAHCLNS-VNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114
           L   +++  H  +      NP++       GRG GK++  + ++   +  R  ++ + + 
Sbjct: 5   LNLSQLIPKHFHDLWRATKNPDILNVVAKGGRGSGKSSDISIIIT-QLIMRYPMNAVVVR 63

Query: 115 NSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCR 174
            ++  L T+++ ++   +      H F+++                 +    +    + R
Sbjct: 64  KTDNTLATSVFEQIKWAIEEQKVSHLFKIKVS------------PMEITFIPRGNRIIFR 111

Query: 175 TYSEERPDTFVGHHNTYG-MAIINDEASG---TPDVINLGILGFL---TERNANRFWIMT 227
               + P+      ++    +I+  E  G   T D +       L    +      +  +
Sbjct: 112 GA--QNPERLKSLKDSRFPFSIMWIEELGEFKTEDEVTTITNSMLRGELDEGLFYKFYFS 169

Query: 228 SNPRRLSGKF----YEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283
            NP +    +    YE   +P + +            I   F +   +    +    R E
Sbjct: 170 YNPAKRKQHWANKKYETSFQPDNTFVHHS-TYLNNPFISKQFIQEAESAKQRNELRYRWE 228

Query: 284 VCGQ 287
             G+
Sbjct: 229 YLGE 232


>gi|84687555|ref|ZP_01015431.1| hypothetical protein 1099457000249_RB2654_04994 [Maritimibacter
           alkaliphilus HTCC2654]
 gi|84664464|gb|EAQ10952.1| hypothetical protein RB2654_04994 [Rhodobacterales bacterium
           HTCC2654]
          Length = 260

 Score = 47.0 bits (110), Expect = 0.006,   Method: Composition-based stats.
 Identities = 39/258 (15%), Positives = 78/258 (30%), Gaps = 39/258 (15%)

Query: 49  SAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI 108
             P +WQ  F+              +P V        R IGK+   A L        PG 
Sbjct: 28  GPPDNWQRRFL-----------TTASPFVMALC---SRRIGKSQTTAILAA-QTIGAPGR 72

Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168
           +V+ L+ +  Q    L+  +            +   SL +         +  + G     
Sbjct: 73  TVLVLSPTLGQ-SQLLFKRI---------LEAWAAMSLPIEKTRLTQTTMELANG----- 117

Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTS 228
            S +    + +   +  G+    G  +I +E +   D +    +    +       ++ +
Sbjct: 118 -SVVACVPAGQDGSSARGYGVKDGGLLIYEEGAFLADAVYDATIPIREDGG---RILLIT 173

Query: 229 NPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQF 288
            P  + G  +  + +  +     +I  R+ E    +       RY +       E   ++
Sbjct: 174 TPGNVGGFAHTAWTENDEI---EKITARSTEIERMAEKVAFDRRY-MPPRQFATEHELRW 229

Query: 289 PQQDIDSFIPLNIIEEAL 306
                D       IE A 
Sbjct: 230 SSGG-DPLFASETIENAF 246


>gi|298247861|ref|ZP_06971666.1| conserved hypothetical protein [Ktedonobacter racemifer DSM 44963]
 gi|297550520|gb|EFH84386.1| conserved hypothetical protein [Ktedonobacter racemifer DSM 44963]
          Length = 499

 Score = 47.0 bits (110), Expect = 0.006,   Method: Composition-based stats.
 Identities = 64/381 (16%), Positives = 114/381 (29%), Gaps = 65/381 (17%)

Query: 58  FMEVVDAHCLNSVNNPNPEVFKGAISAGRG----------IGKTTLNAWLVLWLMSTRPG 107
           F   V    L        +    ++  GRG          +GK  L+A L  +L+     
Sbjct: 23  FAREVLGKPLYPYQELVGDAILESVLEGRGDTFTVMFARQMGKNQLSATLEAYLLFCMRE 82

Query: 108 ISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM-QSLSLHPAPW--YSDVLHCSLGI 164
            S++  A +             K  ++   +    +  +  +    W  Y   +  +   
Sbjct: 83  GSIVKAAPTY------------KPQTINSRQRLLSLLDNPLMRNRVWKHYGYTIGMAPRH 130

Query: 165 DSKHYSTMCRT--YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANR 222
           +   Y T  R   +S     + VG   T  + +  DEA           L  +       
Sbjct: 131 EQVPYQTGPRVMFFSAGPGASIVG--ATASLLLEIDEAQSIDPNKYDTDLRPMASTTNAT 188

Query: 223 FWIMTSNPRRLSGKFY-EIFN----KPLDDWKRFQIDTRTVEGID---PSFHEGIIARYG 274
             +  +     +        N    +     + F  D RT+  I+     + E  I R G
Sbjct: 189 TVLYGTAWSEETLLARMRTHNLELERLDGRQRHFAYDWRTLAAINDHYKRYVESEIKRLG 248

Query: 275 LDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR-----EPCPDP-YAPLIMGCDIAEEG 328
            D    R +     P         LN ++ +L R     E  PDP     + G DIA E 
Sbjct: 249 EDHISIRTQYR-LLPILGSGYL--LNDLQFSLLRGQHTWESSPDPAEGFYVAGLDIAGEQ 305

Query: 329 G----------DNTVVVLRR---------GPVIEHLFDWSKTDLRTTNNKISGLVEKYRP 369
                      D+T++ + R            I   +DW+          +S ++ ++  
Sbjct: 306 HARPGQPAGKHDSTILTIGRVTINDLGLPELRIVRHYDWTGMKYTDQYAAVSRILPEWNV 365

Query: 370 DAIIIDANNTGARTCDYLEML 390
              ++D    G      L   
Sbjct: 366 RRTVVDKTGLGEGLASLLSTR 386


>gi|108799880|ref|YP_640077.1| phage terminase [Mycobacterium sp. MCS]
 gi|119868990|ref|YP_938942.1| phage terminase [Mycobacterium sp. KMS]
 gi|108770299|gb|ABG09021.1| phage Terminase [Mycobacterium sp. MCS]
 gi|119695079|gb|ABL92152.1| phage Terminase [Mycobacterium sp. KMS]
          Length = 489

 Score = 47.0 bits (110), Expect = 0.007,   Method: Composition-based stats.
 Identities = 60/363 (16%), Positives = 105/363 (28%), Gaps = 70/363 (19%)

Query: 52  RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLW-LMSTRPGISV 110
           R WQ E           SV + +P          RG GK+TL A   L+   +   G  V
Sbjct: 49  REWQQELA--------GSVLDADPRPRTAGWMLPRGQGKSTLLAAYGLYDFFTGDEGAVV 100

Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170
             +A  E Q    ++  +++ +  L ++     Q                 L I  +   
Sbjct: 101 CVVAVDERQAG-IIFG-IARRMVELSDELASRCQVFK------------ERLYIPERDAH 146

Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP 230
             C       P    G   T  +    DEA G     +  +L     + A    I    P
Sbjct: 147 FHCLPA---EPKRLEGLDYTTALL---DEA-GVASRDSYEVLTLAQGKRAQSTLIAIGTP 199

Query: 231 RRLSG--------KFYEIFNKPLDD-WKRFQI---------DTRTVEGIDPSFHEGIIAR 272
                         +     +     W+ F            T   E  +P+  + +   
Sbjct: 200 GPDPNNQVLADLRNYAADHPEDASLVWREFSAAGFEDHPVDCTHCWELANPALDDFLHRD 259

Query: 273 --------YGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDI 324
                      ++   R     QF      +F+P  + +      P PD    +++  D 
Sbjct: 260 ALYALLPPKTREATFRRAR-LCQFASDTDGAFLPQGVWDGLSTGRPIPD-GTEVVVALD- 316

Query: 325 AEEGGDNTVVVLRR---GPVIEHLFDWSKTDLR--------TTNNKISGLVEKYRPDAII 373
                D T ++       P  + +  W +T+             ++I     ++R   II
Sbjct: 317 GSFSDDTTALLAGTVSAEPHFDTIHVWQRTNGDDSYRVPVAEVEDEIRAACRRWRVAEII 376

Query: 374 IDA 376
            D 
Sbjct: 377 ADP 379


>gi|237710644|ref|ZP_04541125.1| phage terminase large subunit [Bacteroides sp. 9_1_42FAA]
 gi|229455366|gb|EEO61087.1| phage terminase large subunit [Bacteroides sp. 9_1_42FAA]
          Length = 461

 Score = 47.0 bits (110), Expect = 0.007,   Method: Composition-based stats.
 Identities = 44/242 (18%), Positives = 81/242 (33%), Gaps = 40/242 (16%)

Query: 262 DPSFHEGIIARYGLDSDVTRVE-VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIM 320
           DP++   ++ +    SD  R   + G +  +     I      EAL R           +
Sbjct: 212 DPTYLANLVNQ----SDEQRARDLDGNWKYKAAGDDIIKLTHMEALYRNSMQIGDGIRRV 267

Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKY--RPDAIIIDANN 378
            CD A EGGD+ V+ L  G  I  +F   K D + T + +  ++E++  R +    D N 
Sbjct: 268 SCDAAFEGGDSLVMWLWEGWHIRDIFV-CKLDSKKTVDTVKAVLEEWHVREECFTYDLNG 326

Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEF----CRNRRTELHVKMADWLEFASLINHSG 434
            G          G+    +    +    E       N +++     A  +    +     
Sbjct: 327 LGQI------FKGFFPNAIPFNNKEAVEEKFKYIYTNLKSQAAYLFAQKIINREISIEPT 380

Query: 435 LIQNLKSLKSFIVPNTGELA-IESKRVKG---------------------AKSTDYSDGL 472
           L++   S K F      ++   E K ++                        S D+ + L
Sbjct: 381 LLERKFSGKGFEKVPLRQILDKERKAIRKDEDSEEKGWTIIKKIIMKKLVGHSPDFIEAL 440

Query: 473 MY 474
           + 
Sbjct: 441 LM 442


>gi|15668504|ref|NP_247302.1| hypothetical protein MJ_0330 [Methanocaldococcus jannaschii DSM
           2661]
 gi|2833503|sp|Q57776|Y330_METJA RecName: Full=Uncharacterized protein MJ0330
 gi|1591049|gb|AAB98318.1| hypothetical protein MJ_0330 [Methanocaldococcus jannaschii DSM
           2661]
          Length = 549

 Score = 47.0 bits (110), Expect = 0.007,   Method: Composition-based stats.
 Identities = 33/242 (13%), Positives = 65/242 (26%), Gaps = 35/242 (14%)

Query: 85  GRGIGKTTLNAWLVLWLMS--TRPG-----ISV--ICLANSETQLKTTLWAEVSKWLSLL 135
           G+G GK  + + L  ++M              +  + +A ++   K   + E   W    
Sbjct: 93  GKGGGKDFMVSLLFNYMMFRACVEDYYEKFTRIDFVNVAPNDHLAKNVFFKEFKAWFLKC 152

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
                  +       AP         +G     +S   R  S               + +
Sbjct: 153 KVWQMIGIDKKKRQKAPICVLETKAEIGDKITMHSGHSRATS---------FEGMNALCV 203

Query: 196 INDEASGTPDVINLGILGFLTERNANRF-------------WIMTSNPRRLSGKFYEIFN 242
           + DE     D           +  ++               W     P       Y ++ 
Sbjct: 204 VADE---ISDPDFKNAEQLFEQGLSSAKSRFKDKARVVAITWTRFPTPNPRDDVGYRLYL 260

Query: 243 KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
                 + +    +T E       E   A+Y  +  + R     + P+ +   FI L  +
Sbjct: 261 DYKAVDEAYTFKGKTWEVNTRVSKEDFKAQYQKNPILARCMYECEPPELNA-YFISLEAL 319

Query: 303 EE 304
           E 
Sbjct: 320 EA 321


>gi|297848822|ref|XP_002892292.1| hypothetical protein ARALYDRAFT_470549 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338134|gb|EFH68551.1| hypothetical protein ARALYDRAFT_470549 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 1406

 Score = 47.0 bits (110), Expect = 0.007,   Method: Composition-based stats.
 Identities = 27/155 (17%), Positives = 55/155 (35%), Gaps = 13/155 (8%)

Query: 55  QLEFMEVVDAHCLNSVNNPNPEVFKGAISAG-----R--GIGKTTLNAWLVLWLMSTRPG 107
           Q E  E +  +   ++     + F+ +   G        G GKT L    +   +   P 
Sbjct: 823 QQEGFEFIWKNLAGTILLNELKDFENSDETGGCIMSHAPGTGKTRLTIIFLQAYLQCFPD 882

Query: 108 ISVICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166
              + +A +   L    WA E  KW   +P  +   +       +     ++  +    S
Sbjct: 883 CKPVIIAPASLLL---TWAEEFKKWNISIPFHNLSSLDFTGKESSAALGLLMQKNATARS 939

Query: 167 KHYSTMCRTYSEERPDTFVG--HHNTYGMAIINDE 199
            +   M + YS  +  + +G  ++    +A + DE
Sbjct: 940 NNEIRMVKIYSWIKSKSILGISYNLYEKLAGVKDE 974


>gi|163742707|ref|ZP_02150092.1| terminase, large subunit, putative [Phaeobacter gallaeciensis 2.10]
 gi|161383962|gb|EDQ08346.1| terminase, large subunit, putative [Phaeobacter gallaeciensis 2.10]
          Length = 417

 Score = 47.0 bits (110), Expect = 0.007,   Method: Composition-based stats.
 Identities = 66/416 (15%), Positives = 112/416 (26%), Gaps = 65/416 (15%)

Query: 82  ISAGRGIGKTTLNA-WLVLWLMSTRPGI-----SVICLANSETQLKTTLWAEVSKWLSLL 135
           I  GRG GKT   A W+        P        +  L  +  Q++  +    S  L+  
Sbjct: 25  ILGGRGAGKTRAGAEWVRTLAEGATPLSAGRARRIALLGETYDQVRDVMVQGDSGLLACT 84

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
           P                        +            + +S   P+   G       A 
Sbjct: 85  PRD---------------RRPTWKATERRLIWPNGATAQAFSAHDPEALRGPQFD---AA 126

Query: 196 INDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
             DE +           +   L   +  R  +  +   R  G   E+   P    +    
Sbjct: 127 WADELAKWKRGQDSWDMLQFALRLGDDPR--VCVTTTPRNVGVLRELLASPSTV-QTHAA 183

Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA-LNREPCP 312
                  +  SF   +  RY   S + R E+ G   Q    +      +  A + + P  
Sbjct: 184 TEANRANLAASFLAEVRNRY-AGSRLGRQELDGILLQDIEGALWTNAGLVAAQIAKAPTL 242

Query: 313 DPYAPLIMGCDIAEEGG---DNTVVVLRRGPVIEHLFDW----------SKTDLRTTNNK 359
           D    +++  D A   G   D   +V+    +     DW                T    
Sbjct: 243 DR---VVVAVDPAVSAGKHSDACGIVVVGATLQGPPQDWCAYVLADCTVQGVGPLTWAQA 299

Query: 360 ISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVK 419
                ++Y  D ++ + N  GA     L  +   V        A+     +  R E    
Sbjct: 300 AIDARDRYGADRVVAEVNQGGALVESLLRQIDPLV-----PFTALHASRGKGARAEPVAA 354

Query: 420 MADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
           + +      +     L   L       +   G L        G  S D  D L++ 
Sbjct: 355 LYEQGRVRHVPGLGALEDQLC-----QMTPRGYL--------GQGSPDRLDALVWA 397


>gi|315180730|gb|ADT87644.1| terminase [Vibrio furnissii NCTC 11218]
          Length = 607

 Score = 47.0 bits (110), Expect = 0.007,   Method: Composition-based stats.
 Identities = 25/181 (13%), Positives = 47/181 (25%), Gaps = 23/181 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    I+     G D    E +   Y              F      S    N +
Sbjct: 340 PDRQWRYVVTIEDAAKGGCDLFDIEELREEYSEHD--FNNLFMCIFVDGAS-SIFEFNKV 396

Query: 303 EEALNREPCPDPYA----------PLIMGCDIAEEGGDNTV-------VVLRRGPVIEHL 345
           ++ +        +            + +G D +    DN V       +V      +   
Sbjct: 397 QKCMVDAGIWQDFKASAKRPFGSREVWLGYDPSRT-RDNAVLMVVAPPIVAAEKFRVLEK 455

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAV 404
             W     +   ++I  + E++    + ID    GA   D L          +       
Sbjct: 456 HTWRGLSFQHQASEIDKVFERFNVTYLGIDITGIGAGVYDLLSNKHPRETVAIHYSNENK 515

Query: 405 D 405
           +
Sbjct: 516 N 516


>gi|153213615|ref|ZP_01948888.1| terminase [Vibrio cholerae 1587]
 gi|124115814|gb|EAY34634.1| terminase [Vibrio cholerae 1587]
          Length = 606

 Score = 47.0 bits (110), Expect = 0.007,   Method: Composition-based stats.
 Identities = 25/181 (13%), Positives = 48/181 (26%), Gaps = 23/181 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G D +  + +                 +F      S    N I
Sbjct: 346 PDKQWRYVITMEDAVKSGFDLADIDILREENSERD--FNNLFMCEFVDGAS-SIFEYNKI 402

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTV-------VVLRRGPVIEHL 345
              +        + P          + +G D +    DN V       +V      +   
Sbjct: 403 LRCMVDIEIWQDFKPSSDRPFGSREVWLGYDPSRT-RDNAVLMVVAPPIVAAEKFRVLEK 461

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL-EMLGYHVYRVLGQKRAV 404
             W     +   ++IS + E++    + ID    GA   D L          +       
Sbjct: 462 HTWRGLSFQHQASEISKVFERFNVTYLGIDITGIGAGVYDLLSNKHPRETVAIHYSNENK 521

Query: 405 D 405
           +
Sbjct: 522 N 522


>gi|160700654|ref|YP_001552334.1| gp5 [Mycobacterium phage Giles]
 gi|159136604|gb|ABW88400.1| gp5 [Mycobacterium phage Giles]
          Length = 544

 Score = 47.0 bits (110), Expect = 0.007,   Method: Composition-based stats.
 Identities = 74/440 (16%), Positives = 130/440 (29%), Gaps = 62/440 (14%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GKT L A  +++ +    G  ++  A     +K        + + ++  +         L
Sbjct: 109 GKTQLIALRIIYGL-FFLGEKIVYTAQRWQTVKDVY----DRIVEIIKRRPSLL---RRL 160

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVIN 208
            P P   D  +   G   + Y+T   +         VG   T     I DEA    DV+ 
Sbjct: 161 KPMPGVPD-GYSEAGQHGEIYTTNGGSLDMGPRTKAVGRGQTKIDLAIFDEAYDIKDVLV 219

Query: 209 LGILG---FLTERNA---NRFWIMTSNP--RRLSGKFYEIFNKPLDDWKRFQIDTRTVEG 260
            G+ G     T       +   + + +P    L+G       K  D +         +  
Sbjct: 220 GGLTGAQKAATNPQTIYISTAAVASEHPDCGVLAGMRRNGQRKEPDLYAAEWCAPPGMAR 279

Query: 261 IDPSFHEGIIARYG---LDSDVTR--------VEVCGQFPQQDIDSFIPLNIIEEALNRE 309
            DP         +G    + D+ R          +   +   D D         +  N E
Sbjct: 280 DDPEAWRLACPSFGITVRERDLAREYRMARANARLLAIY---DADYLGWGEWPPDPENTE 336

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLR----------------RGPVIEHLFDWSKTDL 353
           P  DP     +        GD  + + R                 G V   +  W   ++
Sbjct: 337 PIIDPDWWEALTVLQPALVGDICIAIERTLDTRYWCIAAGQRTIDGRVHVEVGYWRAANI 396

Query: 354 RTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRR 413
                 +  LVE + P AII+D  +        +   G  +      K A+  +   +  
Sbjct: 397 GVVAAALLELVELWNPAAIIVDDRSKAKPIVGVMFNQGIEIETASTPKLAMYTQGFIDA- 455

Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLM 473
               V  AD            +I +  +  +      G+L  + K      +   +  L 
Sbjct: 456 ----VNAADVTHIG-----QKIITDGIAGAAMRELPRGDLVFDEKESGAPVAPLKAIALA 506

Query: 474 YT-----FAENPPRSDMDFG 488
           +       AE  P +  D G
Sbjct: 507 HGAVLEYAAEPKPAASPDTG 526


>gi|281416525|ref|YP_003347326.1| terminase large subunit [Enterococcus phage phiFL2A]
 gi|270209389|gb|ACZ63932.1| terminase large subunit [Enterococcus phage phiFL2A]
 gi|270209454|gb|ACZ63996.1| terminase large subunit [Enterococcus phage phiFL2B]
          Length = 430

 Score = 47.0 bits (110), Expect = 0.008,   Method: Composition-based stats.
 Identities = 51/354 (14%), Positives = 100/354 (28%), Gaps = 42/354 (11%)

Query: 86  RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145
           RG  KTT  A  +  LM   P  ++I L  ++T     +  E+   ++ + +  +F+   
Sbjct: 52  RGSFKTTTLAIAIALLMVLFPNKNIIFLRKTDT---DVV--EIILQVAKVLSSKYFKTLV 106

Query: 146 LSLH--PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
            +L+        +                 +        +  G H      +I D+    
Sbjct: 107 FALYGVELVLLKETTTEVDTNLKTSSRGTSQLLGMGIYASLTGKHAD---IVITDDIVNI 163

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI---FNKPLDDWKRF---QIDTRT 257
            D ++               +    N +   G+F      ++K     K     + D   
Sbjct: 164 KDRVSRA-----ERERTKLQYQELQNVKNRGGRFINTGTPWHKEDAISKMPNVKKFDCYE 218

Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317
              ID    + +  +  +   +       +        F      +  +N       +  
Sbjct: 219 TGLIDKEQRQAL--QQAMTPSLFAANYELKHIADSESLFTAPTYTDS-INLIYNGVAH-- 273

Query: 318 LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII 373
                D A  G D+T   +    + G +I +   W K        +I  L + Y+     
Sbjct: 274 ----VDAAYGGDDSTAFTIFKEQKDGTIIGYGRKWQKHVDDCLP-EILRLHQHYQAGTFH 328

Query: 374 IDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTEL--HVKMADWLE 425
            + N         L   G  V           +       T L  +  +  WLE
Sbjct: 329 TETNGDKGYLAKNLRECGQFVTEYHES-----MNKFIKISTYLRKYWHLIIWLE 377


>gi|291326278|ref|ZP_06123867.2| terminase, ATPase subunit [Providencia rettgeri DSM 1131]
 gi|291314958|gb|EFE55411.1| terminase, ATPase subunit [Providencia rettgeri DSM 1131]
          Length = 574

 Score = 47.0 bits (110), Expect = 0.008,   Method: Composition-based stats.
 Identities = 47/253 (18%), Positives = 79/253 (31%), Gaps = 48/253 (18%)

Query: 248 WKRFQ-IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI-----DSFIPLNI 301
           W++   I      G++    E I        D  R     +F +        D+ I   +
Sbjct: 312 WRQIVNIHDAIARGLNRVNLEEIKDE--NPPDDFRNLYECEFVKTGERAFSYDALINCGV 369

Query: 302 IEEALNREPCPDPYAP-------LIMGCDIAEEGGDNTVVVLRR-------GPVIEHL-- 345
                +  P   PYAP       + +G D    G +   + L         G     +  
Sbjct: 370 DGYNSDVWPDWKPYAPRPLGNRPVWVGADPTGTGDNGDGLGLVVASPPAVSGGKFRIIET 429

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDAN-NTGARTCDYLEM-------LGYHVYRV 397
                       ++I  + ++Y   +I ID    TGA   + +         L Y    +
Sbjct: 430 IQLRGMAFEKQADEIKRITQRYNVLSITIDGTGGTGAAVHELVVKFFPAANLLNYSA-PI 488

Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIES 457
                   L   RN R E    +           H  LI +  ++K  +   +G +  ES
Sbjct: 489 KRMMIMKMLMLIRNGRFEYDAGL-----------HKPLITSFMTIKK-VQTQSGIITYES 536

Query: 458 KRVKGAKSTDYSD 470
            RV+G    D+ D
Sbjct: 537 SRVRGL---DHGD 546


>gi|226329986|ref|ZP_03805504.1| hypothetical protein PROPEN_03899 [Proteus penneri ATCC 35198]
 gi|225200781|gb|EEG83135.1| hypothetical protein PROPEN_03899 [Proteus penneri ATCC 35198]
          Length = 584

 Score = 46.6 bits (109), Expect = 0.008,   Method: Composition-based stats.
 Identities = 27/140 (19%), Positives = 47/140 (33%), Gaps = 23/140 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN-----------REPCPDP 314
            E +   Y    D     +   F   DI+S    N+++  +                P  
Sbjct: 344 LEQLKKEY--SPDEYNNLLMCHF-MDDIESLFNFNMMQNCMVDSWEVWDDIQPLALRPYG 400

Query: 315 YAPLIMGCDIAEEG--GDNT---VVVLRR--GPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           Y P+ +G D ++ G  GD+    V+   +  G     L    W   D R   + I  + E
Sbjct: 401 YDPVWVGYDPSKGGENGDSAGCVVIAPPKVPGGKFRILERHQWRGMDFRAQADAIKKITE 460

Query: 366 KYRPDAIIIDANNTGARTCD 385
           ++  + + ID    G     
Sbjct: 461 RFYVEYMGIDTTGLGHGVYQ 480


>gi|197285843|ref|YP_002151715.1| phage terminase, ATPase subunit [Proteus mirabilis HI4320]
 gi|194683330|emb|CAR44037.1| phage terminase, ATPase subunit [Proteus mirabilis HI4320]
          Length = 584

 Score = 46.6 bits (109), Expect = 0.008,   Method: Composition-based stats.
 Identities = 27/140 (19%), Positives = 47/140 (33%), Gaps = 23/140 (16%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN-----------REPCPDP 314
            E +   Y    D     +   F   DI+S    N+++  +                P  
Sbjct: 344 LEQLKKEY--SPDEYNNLLMCHF-MDDIESLFNFNMMQNCMVDSWEVWDDIQPLALRPYG 400

Query: 315 YAPLIMGCDIAEEG--GDNT---VVVLRR--GPVIEHL--FDWSKTDLRTTNNKISGLVE 365
           Y P+ +G D ++ G  GD+    V+   +  G     L    W   D R   + I  + E
Sbjct: 401 YDPVWVGYDPSKGGENGDSAGCVVIAPPKVPGGKFRILERHQWRGMDFRAQADAIKKITE 460

Query: 366 KYRPDAIIIDANNTGARTCD 385
           ++  + + ID    G     
Sbjct: 461 RFYVEYMGIDTTGLGHGVYQ 480


>gi|224020497|ref|YP_002601287.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           64b]
 gi|223929730|gb|ACN24438.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           64b]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.009,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYINNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYIFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|313649747|gb|EFS14171.1| phage terminase large subunit domain protein [Shigella flexneri 2a
           str. 2457T]
 gi|332761022|gb|EGJ91310.1| phage terminase large subunit domain protein [Shigella flexneri
           4343-70]
 gi|332761328|gb|EGJ91614.1| phage terminase large subunit domain protein [Shigella flexneri
           2747-71]
 gi|332763393|gb|EGJ93633.1| phage terminase large subunit domain protein [Shigella flexneri
           K-671]
 gi|333007918|gb|EGK27394.1| phage terminase large subunit domain protein [Shigella flexneri
           K-218]
 gi|333021518|gb|EGK40768.1| phage terminase large subunit domain protein [Shigella flexneri
           K-304]
          Length = 159

 Score = 46.6 bits (109), Expect = 0.009,   Method: Composition-based stats.
 Identities = 20/135 (14%), Positives = 40/135 (29%), Gaps = 25/135 (18%)

Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW--SKTDLRTTNNKISGLVE 365
           +    +P     +G D+A+ G D    V R G V+    +W   + +L  +  +      
Sbjct: 5   KTLNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKEDELLKSCQRTYQAAL 64

Query: 366 KYRPDAIIIDANNTGARTCDYLEMLG------------YHVYRVLGQ----------KRA 403
           +   D I+ D+   GA        +              +  R                 
Sbjct: 65  EREAD-IVYDSIGVGASAGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGI 123

Query: 404 VDLEFCRNRRTELHV 418
            + +F  N + +   
Sbjct: 124 PNKDFFANLKAQAWW 138


>gi|84687436|ref|ZP_01015314.1| Putative large terminase [Maritimibacter alkaliphilus HTCC2654]
 gi|84664594|gb|EAQ11080.1| Putative large terminase [Rhodobacterales bacterium HTCC2654]
          Length = 426

 Score = 46.6 bits (109), Expect = 0.009,   Method: Composition-based stats.
 Identities = 67/437 (15%), Positives = 116/437 (26%), Gaps = 75/437 (17%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTTLWAEVSKWL 132
           I  GRG GKT   A    W+ +   G           V  +  +  Q++  +        
Sbjct: 33  ILGGRGAGKTRAGA---EWVRAQVEGPAPLSPGRAGRVALIGETFDQVRDVMV------- 82

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
                   F    +     P        +             ++S   P+   G      
Sbjct: 83  --------FGDSGIVACAPPDRRPAWEATKRRLVWPNGATATSFSASEPEGLRGPQFD-- 132

Query: 193 MAIINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250
            A   DE +     D     +   L   +  R  + T  PR +      I  + L     
Sbjct: 133 -AAWADELAKWKKVDDAWDMLQFALRLGDHPRQVVTT-TPRDVP-----ILRRLLTLSST 185

Query: 251 FQIDTRTVEG---IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
                 T      +  SF E I ARYG    + R E+ G        +F    ++E+   
Sbjct: 186 VTTHAPTTANRANLAKSFLEEIEARYGGT-RLGRQELEGVLLDDREGAFWSTAMLEDC-- 242

Query: 308 REPCPDPYAPLIMGCDI---AEEGGDNTVVVLRRGPVIEHLFDWSK----------TDLR 354
           R   P P + +++  D       G D   +V+           W                
Sbjct: 243 RIDGPPPLSRIVVAVDPPVTGHAGSDECGIVVAGAVTEGAPGAWRAVVLEDASVKAAKPI 302

Query: 355 TTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRT 414
                    +E+Y  D ++ + N  G    + +      +    G + A         R 
Sbjct: 303 DWARAALDAMERYGADRLVAEVNQ-GGDLVETVIRQIDPLVPYRGVRAAKGKS----ARA 357

Query: 415 ELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMY 474
           E    + +    + L     L   +  +        G             S D  D L++
Sbjct: 358 EPVAALYEQGRVSHLRGLGDLEDQMCLMTVQGFEGKG-------------SPDRVDALVW 404

Query: 475 TFAENPPRSDMDFGRCP 491
              +        + R  
Sbjct: 405 ALTDLVVEPGAKWRRPQ 421


>gi|66396341|ref|YP_240671.1| ORF008 [Staphylococcus phage 88]
 gi|66396415|ref|YP_240743.1| ORF009 [Staphylococcus phage 92]
 gi|62636756|gb|AAX91867.1| ORF008 [Staphylococcus phage 88]
 gi|62636829|gb|AAX91940.1| ORF009 [Staphylococcus phage 92]
          Length = 421

 Score = 46.6 bits (109), Expect = 0.009,   Method: Composition-based stats.
 Identities = 44/320 (13%), Positives = 101/320 (31%), Gaps = 35/320 (10%)

Query: 72  NPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKW 131
             + EV       GRG GK++  + ++   +  R  ++ + +  ++  L T+++ ++   
Sbjct: 22  TKDKEVLNVVAKGGRGSGKSSDISIIIT-QLIMRYPMNAVVIRKTDNTLATSVFEQIKWA 80

Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191
           +      H F+++                 +    +    + R    + P+      ++ 
Sbjct: 81  IEEQKVSHLFKVKVS------------PMEITYIPRGNRIIFRGA--QNPERLKSLKDSR 126

Query: 192 ---GMAIINDEASGTPDVINLGILGFL----TERNANRFWIMTSNPRRLSGKF----YEI 240
               +A I + A    +     I   L     +      +  + NP +    +    YE 
Sbjct: 127 FPFSIAWIEELAEFKTEDEVTTITNSLLRGELDEGLFYKFFFSYNPPKRKQSWVNKKYES 186

Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
             +  + +            I   F +   +    +    R E  G+     +  F  L 
Sbjct: 187 SFQADNTYVHHS-TYLNNPFISKQFIQEAESAKKRNEQRYRWEYMGEAIGSGVVPFNNLR 245

Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTT 356
            IEE   R+   D +  +    D      D    V     ++  VI  + ++    +   
Sbjct: 246 -IEEIPQRQY--DTFDNIRNAVDFG-YATDPLAFVRWHYDKKKRVIYAMDEYYGVQISNR 301

Query: 357 NNKISGLVEKYRPDAIIIDA 376
                   + Y+ D I  D+
Sbjct: 302 EFANWLKKKGYQSDEIFADS 321


>gi|195942518|ref|ZP_03087900.1| hypothetical protein Bbur8_06704 [Borrelia burgdorferi 80a]
 gi|312149990|gb|ADQ30051.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           N40]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.009,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   ++ +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNVETFKTYNFTTYDNVFLSKGFIETQEKLY-KDIPAYKARVLLGEWLASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI E+ +   P        I   D     GGDNT + +
Sbjct: 265 IFTQINITEDYMFTSP--------IAYLDPTFSVGGDNTALCV 299


>gi|222147998|ref|YP_002548955.1| large terminase [Agrobacterium vitis S4]
 gi|221734986|gb|ACM35949.1| large terminase [Agrobacterium vitis S4]
          Length = 459

 Score = 46.6 bits (109), Expect = 0.009,   Method: Composition-based stats.
 Identities = 63/403 (15%), Positives = 119/403 (29%), Gaps = 54/403 (13%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GKT   A            +  I  A  ++ ++  L  E       +       +
Sbjct: 83  GGRGSGKTRAGA----------EWVHEIASAGEKSAVRIALVGETLGDAREVMVDGLSGI 132

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
             ++ H  P     +  S          M + +S E P++  G           DE +  
Sbjct: 133 ARIARHKRP----EVEISRRRLVWPNGAMAQMFSAEDPESLRG---PQFHYAWCDEIAKW 185

Query: 204 --PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGI 261
              +     +   L   +  R  I T+ P R       +   P     R          +
Sbjct: 186 KHAEETFDMLQFSLRLGDDPRQVI-TTTP-RPVPILKRLLADPGTRLTRLSTFGNAC-NL 242

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMG 321
            P F E + ARYG    + R E+ G+  +   D+    + +E+   R    +P   +++G
Sbjct: 243 APGFIEALQARYGGT-RLGRQELDGELIEDREDALWRRDRLEQLTVR--LSEPLHRIVVG 299

Query: 322 CDIAEEGGDNTVVVL------RRGPVIEHLF-DWSKTDLRTTNNKISGLVEKYRPDAIII 374
            D     G  +V  +      R G  +       +     +    +     ++  D ++ 
Sbjct: 300 VDPPSGAGAQSVCGIIVAGLDRLGRAVVLADCSVTGESPASWATAVVRAFRRFEADRVVA 359

Query: 375 DANNTGARTCDYLEML--GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINH 432
           + N  G      L+ +     V  V   +           R E    + +          
Sbjct: 360 EVNQGGEMVGALLKSVDANLPVRMVRATRGKF-------LRAEPVAALYEQGRVFHAARF 412

Query: 433 SGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
           + L   +       + N              +S D  D L++ 
Sbjct: 413 ADLEDQMCDFGPEGLSN-------------GQSPDRLDALVWA 442


>gi|312148837|gb|ADQ31485.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           JD1]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|312148805|gb|ADQ31454.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           JD1]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|312147637|gb|ADQ30298.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           JD1]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|312147604|gb|ADQ30266.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           JD1]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|224590670|ref|YP_002640676.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           WI91-23]
 gi|224553765|gb|ACN55167.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           WI91-23]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|224983785|ref|YP_002641105.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           WI91-23]
 gi|224553986|gb|ACN55383.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           WI91-23]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|195942842|ref|ZP_03088224.1| hypothetical protein Bbur8_08565 [Borrelia burgdorferi 80a]
 gi|312150044|gb|ADQ30103.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           N40]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|225622041|ref|YP_002724986.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           94a]
 gi|225546350|gb|ACN92359.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           94a]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|225576422|ref|YP_002725451.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           118a]
 gi|225547005|gb|ACN92996.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           118a]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|226322171|ref|ZP_03797692.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           Bol26]
 gi|226232426|gb|EEH31184.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           Bol26]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|226246703|ref|YP_002776000.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           Bol26]
 gi|226202392|gb|ACO38050.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           Bol26]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|224022662|ref|YP_002606275.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           64b]
 gi|224593632|ref|YP_002640950.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           CA-11.2a]
 gi|223929246|gb|ACN23964.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           64b]
 gi|224554688|gb|ACN56067.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           CA-11.2a]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|219723193|ref|YP_002474612.1| phage terminase, large subunit, pbsx family protein [Borrelia
           burgdorferi 156a]
 gi|224591572|ref|YP_002640899.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           CA-11.2a]
 gi|219693035|gb|ACL34243.1| phage terminase, large subunit, pbsx family protein [Borrelia
           burgdorferi 156a]
 gi|224554907|gb|ACN56281.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           CA-11.2a]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|11497124|ref|NP_051248.1| hypothetical protein BB_S45 [Borrelia burgdorferi B31]
 gi|223987739|ref|YP_002601211.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           64b]
 gi|6382145|gb|AAF07462.1|AE001576_21 conserved hypothetical protein [Borrelia burgdorferi B31]
 gi|223929452|gb|ACN24166.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           64b]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|221316998|ref|YP_002533177.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           72a]
 gi|221237630|gb|ACM10461.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           72a]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI +           +A  I   D A   GGDNT + +
Sbjct: 265 IFTQINITD--------DYVFASPIAYLDPAFSVGGDNTALCV 299


>gi|238764966|ref|ZP_04625904.1| terminase, ATPase subunit [Yersinia kristensenii ATCC 33638]
 gi|238696825|gb|EEP89604.1| terminase, ATPase subunit [Yersinia kristensenii ATCC 33638]
          Length = 591

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 33/243 (13%), Positives = 64/243 (26%), Gaps = 46/243 (18%)

Query: 195 IINDEASGTPD--VINLGILGFLTERNANRFWIMTSNPRRLSGKFY---EIFNKPLDDWK 249
           +  DE    P+   +N    G  T ++    +  T + +   G  +   + + K     K
Sbjct: 253 LYIDEYLWIPNFRRLNEVASGMATHKHWRITYFSTPSAKTHQGYPFWSGDEWRKGDTKRK 312

Query: 250 RFQIDT--------RTVE----------------GIDPSFHEGIIARYGLDSDVTRVEVC 285
                +        R                   G + +    +  RY        +   
Sbjct: 313 DVVFPSFDEMRDGGRECPDGQWRYVVTLEDAIAGGFNLADINELRERYNES--AFNMLFM 370

Query: 286 GQFPQQDIDSF---------IPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL 336
             F       F         + +   E+    EP P     +  G D A    + T VV+
Sbjct: 371 CVFVDDKESVFKFGDLMRCGVDIRTWEDFHPDEPMPFGNREVWGGFDPARSNDNATFVVV 430

Query: 337 R------RGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEML 390
                      +     W     +    +I  +  +Y    I ID    G    + ++  
Sbjct: 431 APPLVAAERFRVLEKHHWRSMSFQFMAERIRSIKARYNMTYIGIDVTGLGYGVFELVQGF 490

Query: 391 GYH 393
            + 
Sbjct: 491 AHR 493


>gi|226246889|ref|YP_002776229.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           Bol26]
 gi|226202275|gb|ACO37943.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           Bol26]
          Length = 450

 Score = 46.6 bits (109), Expect = 0.010,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYIFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|94263678|ref|ZP_01287487.1| hypothetical protein MldDRAFT_1386 [delta proteobacterium MLMS-1]
 gi|93455983|gb|EAT06138.1| hypothetical protein MldDRAFT_1386 [delta proteobacterium MLMS-1]
          Length = 457

 Score = 46.6 bits (109), Expect = 0.011,   Method: Composition-based stats.
 Identities = 71/409 (17%), Positives = 112/409 (27%), Gaps = 66/409 (16%)

Query: 84  AGRGIGKTTLNAWLVLWLMS-----------TRPGISVIC--LANSETQLKTTLWAEVSK 130
           AGR  GKT   A + ++L +             PG   +   LA    Q K  L   ++ 
Sbjct: 64  AGRRSGKTNATAGIAVYLATIGAAVDGLLDKLAPGERGVVALLAVDRQQAKVAL-RYIA- 121

Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT 190
              +           +       + D         + + S   RT      D        
Sbjct: 122 --GMFEASPVLAQMVVKRDAEALHLDNRISIEVSTNNYRSVRGRTLLAAVLDEVA----- 174

Query: 191 YGMAIINDEASGTPDV-INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLD--- 246
                  D+ S  PDV     IL  L         +  S+P    G  Y+ + K      
Sbjct: 175 ----FFRDDQSANPDVETYRAILPGLATTGG--LLVGISSPYAKRGLLYQKWRKHYGQDG 228

Query: 247 DWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
           D    Q  T       P+    I      D    R E  G F + D++ F+   ++E   
Sbjct: 229 DILVIQGATPDFNPTIPTSV--ITDAEADDPAAARAEWFGLF-RDDVEGFLTREVVEACT 285

Query: 307 NREPCPDPYAP---LIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW---SKTDLRTTNNKI 360
              P   PY          D A  G D   + +        + D     +        + 
Sbjct: 286 RPSPLVIPYNRENIYTAFADPAGGGRDEFCLAIGHQEGEVVVVDNLQARRGAPAKIVAEY 345

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420
           + L++ Y   AI  D    G+   D     G         K    L+             
Sbjct: 346 ADLLKAYNVQAITADRY-AGSWPADEFARHGITCNPAANSKSVFYLDALA-----AFNSG 399

Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAK-STDY 468
                   L     L+  L +             +E +  +G + S D+
Sbjct: 400 R-----LQLPPDDMLLNQLTA-------------LERRTARGGRDSIDH 430


>gi|327198525|ref|YP_004327112.1| phage terminase large subunit [Pseudoalteromonas phage H105/1]
 gi|304367920|gb|ADM26679.1| phage terminase large subunit [Pseudoalteromonas phage H105/1]
          Length = 414

 Score = 46.3 bits (108), Expect = 0.011,   Method: Composition-based stats.
 Identities = 65/413 (15%), Positives = 112/413 (27%), Gaps = 50/413 (12%)

Query: 67  LNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA 126
            N   N     ++  +  G G GKT +    +L      P         S   ++   + 
Sbjct: 8   QNIFLNELNTKYRAYV-GGFGSGKTFVGCMDLLNFFGKHPRTRQGYFGTSYPSIRDIFYP 66

Query: 127 EVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVG 186
                         FE  +L +       +         +  Y       S +RP+T VG
Sbjct: 67  -------------TFEEAALMMGFTVDIKESNKEVHVYRNGFYYGTVICRSMDRPNTIVG 113

Query: 187 HHNTYGMAIINDEASGTPDV----INLGILGFL--TERNANRFWIMTSNPRRLSGKFYEI 240
              +  +    DE    P          I+  L            +T+ P      + + 
Sbjct: 114 FKVSRAL---VDEIDTLPKDKATNAWNKIVARLRLKIDGVENGIGVTTTPEGFLFVYSKF 170

Query: 241 FNKPLDDWKRFQIDTRTV-EGIDPSFHEGIIARYGLD-SDVTRVEVCGQFPQQDIDSFIP 298
            ++P   +   Q  T    E +   + + +   Y     D     + G+F      S  P
Sbjct: 171 KDEPTKSYSMVQASTYENAEFLPDDYIDTLKETYPEGLIDAY---LMGKFVNLTAGSVYP 227

Query: 299 LNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNN 358
                +  + E    P  PLI+G D         + V R G            D      
Sbjct: 228 QYGRNKNNSFESIQ-PQDPLIVGMDFNVNDMAACIFVERDGIYHCVEELTKGRDTDYMAR 286

Query: 359 KISG-LVEKYRPDAIIIDANN-----TGART--CDYLEMLGYHVYRVLGQKRAVDLEFCR 410
            +    ++K     +  DA+       GA     D L+  G  V       R  +   C 
Sbjct: 287 ILKERYLDKGHRVTVYPDASGKNTSSKGADKSDIDILKSYGLWVVAKDSNPRVRERVNCV 346

Query: 411 NRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGA 463
           NR  +    +   +        +  I+             G    E  +  G 
Sbjct: 347 NRGFQ---DLKIMINSMRCPETAKCIEQQP------YDKNG----EPDKKSGL 386


>gi|87201130|ref|YP_498387.1| hypothetical protein Saro_3118 [Novosphingobium aromaticivorans DSM
           12444]
 gi|87136811|gb|ABD27553.1| protein of unknown function DUF264 [Novosphingobium aromaticivorans
           DSM 12444]
          Length = 440

 Score = 46.3 bits (108), Expect = 0.011,   Method: Composition-based stats.
 Identities = 71/425 (16%), Positives = 130/425 (30%), Gaps = 55/425 (12%)

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE 127
            S   P  +     + AGRG GKT L A  V  +    P   +  +  S  + ++ +   
Sbjct: 43  QSQQAPPSDWRVWLVMAGRGFGKTRLGAEWVRKIAEEDPEARIALVGASLHEARSVMVE- 101

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
                             L    APW   V   S+             YS   P++  G 
Sbjct: 102 --------------GESGLLSIDAPWRRPVFESSVRRLVWPNGAQAFLYSAGEPESLRGP 147

Query: 188 HNTYGMAIINDE------ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF 241
            +++      DE       S         +L  L     +   + T+ P R       I 
Sbjct: 148 QHSHA---WCDEIAKWDNGSNRAMATWDNLLMGL-RLGRDPRLVATTTP-RPVPLVARIM 202

Query: 242 NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNI 301
           ++  D            + +   F E +   +G  + + R E+ G+  +  + +     +
Sbjct: 203 DEGDDVVVTRGSTFENQDNLPRRFVEAMRRTFGGTT-LGRQELLGEMIEDLVGALWSRAL 261

Query: 302 IEEALNREPCPDPYAPLIMGCD-IAEEGGDNTVVV---LRRGPVIEHLFD--WSKTDLRT 355
           IE A  RE        +++G D  A   GD   ++   +    +   L D    +     
Sbjct: 262 IENA--REDAAPAMTRVVVGVDPPASAHGDACGIIVCGIGDDRIARVLADCSVEQASPER 319

Query: 356 TNNKISGLVEKYRPDAIIIDANNTGARTCDYLE--MLGYHVYRVLGQKRAVDLEFCRNRR 413
               ++     +  D ++ +AN  G      L        +  V   +           R
Sbjct: 320 WARAVANAARAWSADRVVAEANQGGEMVAAVLRAAEASLPLRLVHASRGKA-------AR 372

Query: 414 TELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLM 473
            E        L  A  + H+G+   L+  +   +   GE           +S D +D  +
Sbjct: 373 AE----PVAALYEAGRVRHAGMFPQLED-ELCGLMPGGEYQGP------GRSPDRADACV 421

Query: 474 YTFAE 478
           +   E
Sbjct: 422 WALTE 426


>gi|188026021|ref|ZP_02997754.1| hypothetical protein PROSTU_02527 [Providencia stuartii ATCC 25827]
 gi|188021298|gb|EDU59338.1| hypothetical protein PROSTU_02527 [Providencia stuartii ATCC 25827]
          Length = 264

 Score = 46.3 bits (108), Expect = 0.011,   Method: Composition-based stats.
 Identities = 47/253 (18%), Positives = 78/253 (30%), Gaps = 48/253 (18%)

Query: 248 WKRFQ-IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI-----DSFIPLNI 301
           W++   I      G++    E I        D  R     +F +        D+ I   +
Sbjct: 2   WRQIVNIHDAIARGLNRVNLEEIKDE--NPPDDFRNLYECEFVKTGERAFSYDALINCGV 59

Query: 302 IEEALNREPCPDPYAP-------LIMGCDIAEEGGDNTVVVLRR-------GPVIEHL-- 345
                +  P   PYAP       + +G D    G +   + L         G     +  
Sbjct: 60  DGYNSDVWPDWKPYAPRPLGNRPVWVGADPTGTGDNGDGLGLVVASPPAVSGGKFRIIET 119

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDAN-NTGARTCDYLEM-------LGYHVYRV 397
                        +I  + ++Y   +I ID    TGA   + +         L Y    +
Sbjct: 120 IQLRGMAFEKQAEEIKRITQRYNVQSITIDGTGGTGAAVHELVVKFFPAANLLNYSA-PL 178

Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIES 457
                   L   RN R E    +           H  LI +  ++K  +   +G +  ES
Sbjct: 179 KRMMIMKMLMLIRNGRFEYDAGL-----------HKPLITSFMTIKK-VQTQSGIITYES 226

Query: 458 KRVKGAKSTDYSD 470
            RV+G    D+ D
Sbjct: 227 SRVRGL---DHGD 236


>gi|186895208|ref|YP_001872320.1| hypothetical protein YPTS_1896 [Yersinia pseudotuberculosis PB1/+]
 gi|186698234|gb|ACC88863.1| protein of unknown function DUF264 [Yersinia pseudotuberculosis
           PB1/+]
          Length = 587

 Score = 46.3 bits (108), Expect = 0.011,   Method: Composition-based stats.
 Identities = 23/133 (17%), Positives = 41/133 (30%), Gaps = 15/133 (11%)

Query: 311 CPDPYAPLIMGCDIAEEGGDN--TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLV 364
            P    P+ +G D A  G      V+   +  G     L    W   D     + I  + 
Sbjct: 403 RPFGDRPVWIGYDPASTGDSAGCAVIAPPVVAGGKFRVLERHQWKGMDFADQASNIKKIT 462

Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL 424
           E+Y    I ID    G      +       +  +       + +    + +L  K  + +
Sbjct: 463 ERYNVTYIGIDDTGLGRSVTQLVRQ----FFPAVNA-----IHYSLEMKADLIYKAKNII 513

Query: 425 EFASLINHSGLIQ 437
           +   L   +G I 
Sbjct: 514 QGGRLEFDAGCID 526


>gi|238786939|ref|ZP_04630739.1| Terminase, ATPase subunit [Yersinia frederiksenii ATCC 33641]
 gi|238724727|gb|EEQ16367.1| Terminase, ATPase subunit [Yersinia frederiksenii ATCC 33641]
          Length = 587

 Score = 46.3 bits (108), Expect = 0.012,   Method: Composition-based stats.
 Identities = 23/133 (17%), Positives = 41/133 (30%), Gaps = 15/133 (11%)

Query: 311 CPDPYAPLIMGCDIAEEGGDN--TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLV 364
            P    P+ +G D A  G      V+   +  G     L    W   D     + I  + 
Sbjct: 403 RPFGDRPVWIGYDPASTGDSAGCAVIAPPVVAGGKFRVLERHQWKGMDFADQASNIKKIT 462

Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL 424
           E+Y    I ID    G      +       +  +       + +    + +L  K  + +
Sbjct: 463 ERYNVTYIGIDDTGLGRSVTQLVRQ----FFPAVNA-----IHYSLEMKADLIYKAKNII 513

Query: 425 EFASLINHSGLIQ 437
           +   L   +G I 
Sbjct: 514 QGGRLEFDAGCID 526


>gi|145708080|ref|YP_001165255.1| terminase [Ralstonia phage phiRSA1]
 gi|139003869|dbj|BAF52383.1| terminase [Ralstonia phage phiRSA1]
          Length = 593

 Score = 46.3 bits (108), Expect = 0.012,   Method: Composition-based stats.
 Identities = 18/82 (21%), Positives = 27/82 (32%), Gaps = 7/82 (8%)

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTVVVLRR-----GPVIEHL--FDWSKTDLRTTNNKISG 362
           P P    P+ +G D    GGD+  +V+       G     L    +   D       I  
Sbjct: 407 PRPFGNRPVWVGYDPNGGGGDSAALVVVAPPLVPGGKFRVLERHQFRGIDYEEQAGAIRR 466

Query: 363 LVEKYRPDAIIIDANNTGARTC 384
           + E+Y    + ID    G    
Sbjct: 467 VAERYDVAYVGIDRTGIGDAVF 488


>gi|80159854|ref|YP_398598.1| conserved phage-related protein [Clostridium phage c-st]
 gi|78675444|dbj|BAE47866.1| conserved phage-related protein [Clostridium phage c-st]
          Length = 580

 Score = 46.3 bits (108), Expect = 0.012,   Method: Composition-based stats.
 Identities = 51/374 (13%), Positives = 106/374 (28%), Gaps = 50/374 (13%)

Query: 46  EGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTR 105
                   + +E + +        +           +   RG+GK+ L+A   +      
Sbjct: 49  YYRKYIDKFCIEVLGLKLYLFQRLILRAMARNQYVMLICCRGLGKSWLSAVFFVASCILY 108

Query: 106 PGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGID 165
            G+     +    Q +  +   + K    L        +   + P    +D    +    
Sbjct: 109 KGLKCGIASGQGQQARNVI---IQKVKGELAKNPSIAREI--VFPIKTGADDCVVNFRNG 163

Query: 166 SKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLT--------- 216
           S+  + +       + D   G  +     ++ DE     D +   IL  +T         
Sbjct: 164 SEIRAIV---LGRNQGD---GARSWRFHYLLVDECRLVSDKVINTILIPMTKTKRAVAIH 217

Query: 217 -ERNANRFWIMTSNPRRLSGKFYEIFN----KPLDDWKRFQIDTRTVE------GIDPSF 265
             +      I  S+    +   Y+ F     K       + + +            D   
Sbjct: 218 HNKREKGKVIFISSAYLKTSDLYKRFKYFCDKMSSGANNYFVCSLDYRVGIEAGIFDQDD 277

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA--LNREPCPDPYA---PLIM 320
            +    +  +  +  + E  G F     +S+ P      A  L R     P       I+
Sbjct: 278 IDEERNKPDMTIEEFQYEYEGIFVGSSGESYFPYETTTPARVLGRGEITQPKKSKSEYII 337

Query: 321 GCDIAEEGGDNT------VVVLR---RGPVIEHLFDWSKTDLRTTNNKISGLVEKYR--- 368
             D+A  G  ++      V+ L+    G  ++ +      +  +   +   L E Y    
Sbjct: 338 THDVAISGASDSDNACTHVIKLKPKPNGTYVKEVVYTKTHNGISLPEQRDFLRELYHLKF 397

Query: 369 --PDAIIIDANNTG 380
                I+ID    G
Sbjct: 398 PNAVKIVIDMRGNG 411


>gi|163849591|ref|YP_001637634.1| diguanylate cyclase [Methylobacterium extorquens PA1]
 gi|163661196|gb|ABY28563.1| diguanylate cyclase [Methylobacterium extorquens PA1]
          Length = 1428

 Score = 46.3 bits (108), Expect = 0.012,   Method: Composition-based stats.
 Identities = 36/229 (15%), Positives = 70/229 (30%), Gaps = 26/229 (11%)

Query: 83  SAGRGI--GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140
           S+G G   G+         WL +  P          + Q  TT W E+    +    +  
Sbjct: 644 SSGWGTYTGQPESAYIGYGWLDTVHPD---------DRQRVTTTWREIFASQAAGSFEFR 694

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY-SEERPDTFVGHHNTYGMAIINDE 199
              +  +       +  L  + G   +   T    + S +  +        Y +A++   
Sbjct: 695 ALCRDGAYRWTLTRAVPLKDASGQVQEWVGTDGDIHESRQASEAIRLQEERYRLAMLA-- 752

Query: 200 ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE 259
              T D I    LG  T   ++  + +          + +        W + +I     E
Sbjct: 753 ---TQDAIWDWDLGADTAEWSDGAYRLFG--------YDDAERADTGAWWKSKIHPDDRE 801

Query: 260 GIDPSFHEGIIARYGLDSDVTR-VEVCGQFPQQDIDSFIPLNIIEEALN 307
            +  S    I ++    SD  R     G + +     F+  +   +AL 
Sbjct: 802 RVTTSIKHIIESQEHRWSDEYRFARADGSYAEVTDCGFVIRDTEGQALR 850


>gi|51596194|ref|YP_070385.1| gpP phage P2 terminase [Yersinia pseudotuberculosis IP 32953]
 gi|170024552|ref|YP_001721057.1| hypothetical protein YPK_2327 [Yersinia pseudotuberculosis YPIII]
 gi|51589476|emb|CAH21098.1| similar to gpP phage P2 TERMINASE [Yersinia pseudotuberculosis IP
           32953]
 gi|169751086|gb|ACA68604.1| protein of unknown function DUF264 [Yersinia pseudotuberculosis
           YPIII]
          Length = 587

 Score = 46.3 bits (108), Expect = 0.012,   Method: Composition-based stats.
 Identities = 23/133 (17%), Positives = 40/133 (30%), Gaps = 15/133 (11%)

Query: 311 CPDPYAPLIMGCDIAEEGGDN--TVVV--LRRGPVIEHL--FDWSKTDLRTTNNKISGLV 364
            P    P+ +G D A  G      V+   +  G     L    W   D     + I  + 
Sbjct: 403 RPFGDRPVWIGYDPASTGDSAGCAVIAPPVVAGGKFRVLERHQWKGMDFADQASNIKKIT 462

Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL 424
           E+Y    I ID    G      +       +  +       + +    + +L  K  + +
Sbjct: 463 ERYNVTYIGIDDTGLGRSVTQLVRQ----FFPAVNA-----IHYSLEMKADLIYKAKNII 513

Query: 425 EFASLINHSGLIQ 437
               L   +G I 
Sbjct: 514 HGGRLEFDAGCID 526


>gi|169634245|ref|YP_001707981.1| phage-related terminase, ATPase subunit (GPP-like) [Acinetobacter
           baumannii SDF]
 gi|169153037|emb|CAP02098.1| phage-related terminase, ATPase subunit (GPP-like) [Acinetobacter
           baumannii]
          Length = 594

 Score = 46.3 bits (108), Expect = 0.012,   Method: Composition-based stats.
 Identities = 28/175 (16%), Positives = 51/175 (29%), Gaps = 26/175 (14%)

Query: 238 YEIFNKPL----DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD 292
           ++   +        W++   I      G D    + +   Y  +       +  QF    
Sbjct: 322 HDALKRGRYCEDKIWRQIVTILDAENGGCDLFDIDELRFEYSAEE--FANLLMCQFIDDG 379

Query: 293 IDSFIPLNIIEEALNR------------EPCPDPYAPLIMGCDIAEEGGDNTVVVL---- 336
             S  PLNI++  +                 P    P+ +G D AE G    +VV+    
Sbjct: 380 A-SIFPLNILQACMVDSWEAWADDYKPFHARPLASRPVWVGYDPAETGDSAGLVVVAPPS 438

Query: 337 --RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
                  I     +   D +    +I  +  +Y    I +D    G      +  
Sbjct: 439 VANGKFRILERHQFRGMDFKAQAEQIRQITLRYNVTYIGLDTTGMGTGVAQLVRQ 493


>gi|33594269|ref|NP_881913.1| putative phage terminase [Bordetella pertussis Tohama I]
 gi|33564344|emb|CAE43647.1| putative phage terminase [Bordetella pertussis Tohama I]
 gi|332383682|gb|AEE68529.1| putative phage terminase [Bordetella pertussis CS]
          Length = 425

 Score = 46.3 bits (108), Expect = 0.012,   Method: Composition-based stats.
 Identities = 70/440 (15%), Positives = 131/440 (29%), Gaps = 64/440 (14%)

Query: 75  PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKW 131
           P  F+  + AG G GKT +    +       P ++    A +  Q++   +    EV+  
Sbjct: 15  PHKFRAFV-AGFGSGKTWVGGAGLCRHAWEFPRVNSGYFAPTYGQIRDIFYPTIEEVAHD 73

Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191
             L    +    +        +                  +CR  S E+P   VG     
Sbjct: 74  WGLAAKINESNKEVHLFAGRKYRGT--------------VICR--SMEKPGDIVGFKIGK 117

Query: 192 GM-----AIINDEASGTPDVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIFNKP 244
           G+      +  D+A+    +    I+  L  T         +T+ P       Y+ F K 
Sbjct: 118 GLIDELDVMKADKAA----LAWRKIIARLRHTAPGLLNGVDVTTTPEG-FKFVYQQFVKQ 172

Query: 245 LDD-------WKRFQIDTRTV-EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF 296
           + +       +   Q  T    + +   +   + A Y     +    + GQF      S 
Sbjct: 173 VRERPELAALYGLVQASTYENGKNLPEDYIPSLRASY--PPQLIAAYLRGQFTNLTSGSV 230

Query: 297 IPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTT 356
              N      + +   +P+  L +G D        TV V+R G  +         D    
Sbjct: 231 YA-NFDRRLHHTDAAEEPHEELHIGMDFNVLNMTATVNVIRAGLPLTVGELTKVRDTPEM 289

Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDY---------LEMLGYHVYRVLGQKRAVDLE 407
              +    +  +   + I  + +G  T            L   G+ V RV  +  +V   
Sbjct: 290 ARMLKERFKD-KGHGVTIYPDASGGNTSSKNASESDLSILRKAGFTV-RVNSRNPSVKDR 347

Query: 408 FCRNRRTELHVK-MADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKST 466
                   L+ +    WL     +N        ++L+  +    G    E  +  G    
Sbjct: 348 INAVNGMLLNDEGARRWL-----VNTDRCPTLTEALEQQVYDKNG----EPDKSTGHDHP 398

Query: 467 DYSDGLMYTFAENPPRSDMD 486
           + + G           + M 
Sbjct: 399 NDAQGYFLVHRYPITPTGMS 418


>gi|224984406|ref|YP_002641809.1| phage terminase, large subunit, pbsx family [Borrelia valaisiana
           VS116]
 gi|224497005|gb|ACN52640.1| phage terminase, large subunit, pbsx family [Borrelia valaisiana
           VS116]
          Length = 450

 Score = 46.3 bits (108), Expect = 0.013,   Method: Composition-based stats.
 Identities = 58/332 (17%), Positives = 105/332 (31%), Gaps = 51/332 (15%)

Query: 55  QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMS-----TRPGIS 109
           Q E +  +++H  + V      +F G I++    GKT L ++L++  +           +
Sbjct: 49  QKEVLFDIESHDYSKV------IFSGGIAS----GKTFLASYLLIKKLIENKSLYERDTN 98

Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169
              + NS   L T    ++ K         +  +          +  +    L I     
Sbjct: 99  NFIIGNSIGLLMTNTIKQIEKICG------FLGIDYQKKKSGESFCKIAGLELNI----- 147

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229
                 Y     D F          I  +EA+       L ++  L  R      I  +N
Sbjct: 148 ------YGGRNRDAFSKIRGGNSAIIYVNEATVIHKETLLEVIKRL--RKGKSIIIFDTN 199

Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQF 288
           P   +  F   + +  D +K +   T         F E     Y       +  V  G++
Sbjct: 200 PESPAHYFKTDYIENTDVFKTYNFTTYDNPLNSADFIETQEKLY-KHFPAYKARVLYGEW 258

Query: 289 PQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVLRRGPVIEHLFD 347
              +   F      E   N++         IM  D A   GGDNT + +      E  + 
Sbjct: 259 ILNESALF-----NEMIFNQDYEFKSP---IMYIDPAFSVGGDNTAICVLERTF-EKFYA 309

Query: 348 WSKTDLRTTN-----NKISGLVEKYRPDAIII 374
           +   D +  +       I  L+E +  + + I
Sbjct: 310 YIYQDQKPVSDSLMLGSIQVLIENFNVNTVYI 341


>gi|293368016|ref|ZP_06614649.1| large terminase subunit [Staphylococcus epidermidis
           M23864:W2(grey)]
 gi|291317838|gb|EFE58251.1| large terminase subunit [Staphylococcus epidermidis
           M23864:W2(grey)]
          Length = 421

 Score = 46.3 bits (108), Expect = 0.013,   Method: Composition-based stats.
 Identities = 41/335 (12%), Positives = 103/335 (30%), Gaps = 40/335 (11%)

Query: 60  EVVDAHCLNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET 118
           E++  H  +             +   GRG GK++  + ++   +  R  ++ + +  ++ 
Sbjct: 9   ELLPKHFHSLWKATKDRKKLNVVAKGGRGSGKSSDISIIIT-QLIMRYPMNAVVVRKTDN 67

Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178
            L T+++ ++   +      H F+++                 +    +    + R    
Sbjct: 68  TLATSVFEQIKWAIEEQKVSHLFKVKVS------------PMEITYVPRGNRIIFRGA-- 113

Query: 179 ERPDTFVGHHNTY---GMAIINDEASG-TPDVINLGILGFL---TERNANRFWIMTSNPR 231
           + P+      ++     +  I + A   T D +       L    +      +  + NP 
Sbjct: 114 QNPERLKSLKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDDGLFYKFFFSYNPP 173

Query: 232 RLSGKF----YEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQ 287
           +    +    YE   +P + +            I   F +   +    +    R E  G+
Sbjct: 174 KRKQSWVNKKYETSFQPDNTFVHHS-TYLDNPFISKQFIQEAESTKERNELRYRWEYMGE 232

Query: 288 FPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIE 343
                    +P N ++     +     +  +    D      D    V     ++  +I 
Sbjct: 233 AIGS---GVVPFNNLQIEKIPDELYKSFDNIRNAVDFG-YATDPLAFVRWHYDKKKRIIY 288

Query: 344 HLFDWSKTDL--RTTNNKISGLVEKYRPDAIIIDA 376
            + +     +  R   N +      Y+ D I  D+
Sbjct: 289 AVDEHYGVQISNREFANWLKR--RGYQSDEIFADS 321


>gi|229589112|ref|YP_002871231.1| terminase ATPase subunit [Pseudomonas fluorescens SBW25]
 gi|229360978|emb|CAY47838.1| terminase, ATPase subunit [Pseudomonas fluorescens SBW25]
          Length = 585

 Score = 46.3 bits (108), Expect = 0.013,   Method: Composition-based stats.
 Identities = 28/161 (17%), Positives = 51/161 (31%), Gaps = 22/161 (13%)

Query: 248 WKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
           W++   I      G D    + +   Y  D++  +  +  QF      S  PL +++  +
Sbjct: 328 WRQIVTILDAEDRGCDLFDLDELRQEY--DAEAFQNLLMCQFIDDGA-SIFPLAMLQPCM 384

Query: 307 NR------------EPCPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHLFD--W 348
                            P     + +G D AE G    +VV    +  G     L    +
Sbjct: 385 VDSWDLWAQDYKPFAARPFGDRQVWVGYDPAESGDSAGLVVIAPPMVPGGKFRVLEKHQF 444

Query: 349 SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
              D       I  + ++Y    I ID    G+     ++ 
Sbjct: 445 RGMDFAAQAEAIRQVTKRYWVTYIGIDITGMGSGVAQLVKQ 485


>gi|219723069|ref|YP_002474484.1| phage terminase, large subunit, pbsx family protein [Borrelia
           burgdorferi 156a]
 gi|219693000|gb|ACL34209.1| phage terminase, large subunit, pbsx family protein [Borrelia
           burgdorferi 156a]
 gi|312147710|gb|ADQ30370.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           JD1]
          Length = 450

 Score = 46.3 bits (108), Expect = 0.013,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHRQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYINNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|291517493|emb|CBK71109.1| Phage terminase large subunit [Bifidobacterium longum subsp. longum
           F8]
          Length = 477

 Score = 46.3 bits (108), Expect = 0.014,   Method: Composition-based stats.
 Identities = 57/376 (15%), Positives = 105/376 (27%), Gaps = 60/376 (15%)

Query: 52  RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISA-GRGIGKTTLNAWLVLWLMSTRPGISV 110
             WQ +   ++ A   +   +      +  + +  R  GKT    W+ +   +  PG+ +
Sbjct: 37  DVWQRQINRIILAKSADGFWSA-----RNTVLSIPRQTGKTYDIGWVAIHRAARTPGMRI 91

Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLH---CSLGIDSK 167
           +  A               +  S++ +        +         D  H    + G +  
Sbjct: 92  VWTA---------------QHFSVIKDTFESLCAIVLRPEMSGLVDPDHGISLAAGKEEI 136

Query: 168 HYSTMCRT-YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIM 226
            +    R  +         G        ++ DEA    D     +L     R  N   I 
Sbjct: 137 RFRNGSRIFFRARERGALRGV--KKIALLVIDEAQHLSDSAMASMLPT-QNRAYNPQTIY 193

Query: 227 TSNPRRLSGKFYEIFNKPLDDWKRFQIDT---------RTVEGIDPSFHEGIIARY---- 273
              P        E F +  D  +  +  +         R  + +D          Y    
Sbjct: 194 MGTPPGPRDNG-EAFTRLRDKTRAGRTHSTLYVEFAADRDADPLDREQWRKANPSYPAHT 252

Query: 274 ----------GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCD 323
                      L  D  R E  G + +  +   I     EEA      P     +  G D
Sbjct: 253 SDESIANLWENLTGDDFRREALGIWDEHALSRAIDRRQWEEATIDARRP--GGVMSFGID 310

Query: 324 IAEEGGDNTV---VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEK--YRPDAIIIDANN 378
           +       T+   +    G     L ++  T+   T      L++K   +  A++ID  +
Sbjct: 311 MNPTRTRLTIGACMRYDDGTAHIELAEYRDTNHDGT-MWAVNLIDKVWEQTAALVIDGQS 369

Query: 379 TGARTCDYLEMLGYHV 394
                   L   G  V
Sbjct: 370 PATALLPDLAEAGVTV 385


>gi|260427953|ref|ZP_05781932.1| phage DNA Packaging Protein [Citreicella sp. SE45]
 gi|260422445|gb|EEX15696.1| phage DNA Packaging Protein [Citreicella sp. SE45]
          Length = 409

 Score = 46.3 bits (108), Expect = 0.014,   Method: Composition-based stats.
 Identities = 75/424 (17%), Positives = 134/424 (31%), Gaps = 70/424 (16%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTT-LWAEVSKW 131
           I  GRG GKT   A    W+ +   G           V  +  +  Q++   ++ E S  
Sbjct: 16  IMGGRGAGKTRAGA---EWVRACVEGAMPLSPGRCKRVALIGETMDQVREVMVFGE-SGI 71

Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191
           ++  P     E Q+       W +                    +S   P+   G     
Sbjct: 72  MNCSPPDRRPEWQATR-RCLVWPN--------------GAEAMVFSAHDPEGLRGPQFD- 115

Query: 192 GMAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWK 249
             A   DE +           +   L     +    +T+ P R  G   E+   P     
Sbjct: 116 --AAWVDELAKWKKARETWDMLQFAL-RLGEHPQVCVTTTP-RNVGILKELLELPSTVVT 171

Query: 250 RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309
           R +        +  SF E + ARYG +S + R E+ G        +     +++ A    
Sbjct: 172 RAK-TEANRANLAESFLEEVRARYG-NSRLARQELDGILVTDVDGALWTGEMLDRAQALA 229

Query: 310 PCPDPYAPLIMGCDI-AEEG--GDNTVVVLR--------RGPVIEHLFDWSKTDLRTTN- 357
           P P  +  +++  D  A +G   D   +V+         +      L D S   +  T  
Sbjct: 230 P-PATFDRIVVAVDPPAGDGKASDACGIVVAGVVCEGPPQAWRAWVLEDASVQGVSPTGW 288

Query: 358 -NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTEL 416
               +   E+++ D ++ + N  GA     L  +   V       R V     +  R E 
Sbjct: 289 AQAAAAAYERWQADRVVAEVNQGGAMVETVLRQVSPQV-----PLRKVHATRGKAARAEP 343

Query: 417 HVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTF 476
              + +          +GL + +      ++ + G         +G  S D  D L++  
Sbjct: 344 VAALYEQGRVGHAAGLAGLEEQMG-----LMTSAGY--------QGQGSPDRVDALVWAL 390

Query: 477 AENP 480
            E  
Sbjct: 391 TELV 394


>gi|111074104|ref|YP_709233.1| hypothetical protein BAPKO_4029 [Borrelia afzelii PKo]
 gi|110891215|gb|ABH02376.1| hypothetical protein BAPKO_4029 [Borrelia afzelii PKo]
          Length = 450

 Score = 45.9 bits (107), Expect = 0.014,   Method: Composition-based stats.
 Identities = 31/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNVATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  ++I ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQIDITQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299


>gi|148807391|gb|ABR13464.1| predicted ATPase terminase subunit [Pseudomonas aeruginosa]
          Length = 593

 Score = 45.9 bits (107), Expect = 0.014,   Method: Composition-based stats.
 Identities = 28/161 (17%), Positives = 50/161 (31%), Gaps = 22/161 (13%)

Query: 248 WKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
           W++   I      G D    E +   Y  D++  +  +  QF      S  PL +++  +
Sbjct: 336 WRQIVTILDAEARGCDLFDIEELRLEY--DAEAFQNLLMCQFVDDGA-SIFPLTMLQPCM 392

Query: 307 NR------------EPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDW 348
                            P     + +G D AE G    +VV+      G     L    +
Sbjct: 393 VDSWDLWSEDYKPFALRPFGDRQVWLGYDPAETGDTAGLVVVAPPAVPGGKFRVLERHQF 452

Query: 349 SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
              D       I  + ++Y    I +D    G+     +  
Sbjct: 453 RGKDFAEQAEFIRKVTQRYWVTYIGVDTTGMGSGVAQLVRQ 493


>gi|170718356|ref|YP_001783582.1| hypothetical protein HSM_0231 [Haemophilus somnus 2336]
 gi|168826485|gb|ACA31856.1| protein of unknown function DUF264 [Haemophilus somnus 2336]
          Length = 595

 Score = 45.9 bits (107), Expect = 0.015,   Method: Composition-based stats.
 Identities = 23/139 (16%), Positives = 45/139 (32%), Gaps = 17/139 (12%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN---------REPCPDPYA 316
            + +  +Y   +   +      +   +   F    +++ A+N           P P    
Sbjct: 358 LDQLKQKY--SALAFKQLFECHWIDDEDSIFTISKLLKCAVNINKWADFQPDTPRPFGDR 415

Query: 317 PLIMGCDIAEEGGDNTVVV----LRRGPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPD 370
            +  G D A      + V+    +  G     L  + W     R    +I  L ++Y   
Sbjct: 416 EVWGGYDPAHSSDGASFVIVAPPINEGEKFRVLARYQWFGLSYRWQAEQIKKLYQQYNFS 475

Query: 371 AIIIDANNTGARTCDYLEM 389
            I IDAN  G    + ++ 
Sbjct: 476 YIGIDANGVGQGVFEMIQE 494


>gi|224591489|ref|YP_002640832.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           CA-11.2a]
 gi|224554623|gb|ACN56003.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           CA-11.2a]
          Length = 450

 Score = 45.9 bits (107), Expect = 0.015,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKIDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|221641598|ref|YP_002527783.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           72a]
 gi|225622087|ref|YP_002725040.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           118a]
 gi|221237550|gb|ACM10383.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           72a]
 gi|225546885|gb|ACN92880.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           118a]
          Length = 450

 Score = 45.9 bits (107), Expect = 0.015,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKIDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|295096346|emb|CBK85436.1| Terminase-like family [Enterobacter cloacae subsp. cloacae NCTC
           9394]
          Length = 435

 Score = 45.9 bits (107), Expect = 0.016,   Method: Composition-based stats.
 Identities = 51/317 (16%), Positives = 99/317 (31%), Gaps = 42/317 (13%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           AG G GKT +    +   M   P I+    A +  Q++   +  +         +  F+ 
Sbjct: 26  AGFGSGKTWVGCGGICKGMWEHPKINQGYFAPTYPQIRDIFYPTI--------EEVAFDW 77

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE---- 199
               +          +     + + Y       S E+P + VG      M    DE    
Sbjct: 78  GLSVIINEGNKEVHFY-----EGRRYRGTTICRSMEKPGSIVGFKIGNAM---VDELDVM 129

Query: 200 ASGTPDVINLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKP-------LDDWKRF 251
           A+         I+  +  + +  R  I  +         Y+ F K           +   
Sbjct: 130 AAAKAQQAWRKIIARMRYKVDGLRNGIDVTTTPEGFKFVYQQFVKAVREKPELSALYGLI 189

Query: 252 QIDTRTV-EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
           Q  T    + + P +   +++ Y    ++ +  + G+F   +  + I      +  N   
Sbjct: 190 QASTFDNAKNLPPDYISSLLSSY--PDELIQAYLRGKFTNLNSGT-IYHTFNRKLNNCSD 246

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT-DLRTTNNKISGLVEKY-- 367
                 PL +G D    G    +V ++R  +   + +  K  D      +I     +Y  
Sbjct: 247 EIQDGDPLFIGMDF-NVGKMAAIVHVKRNGLPRAVRELVKVYDTPAMIKRIQEEFWRYED 305

Query: 368 ------RPDAIIIDANN 378
                 R   I  DA+ 
Sbjct: 306 GRYVKSREIYIYPDASG 322


>gi|226315871|ref|YP_002776346.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           Bol26]
 gi|226202080|gb|ACO37753.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           Bol26]
          Length = 450

 Score = 45.9 bits (107), Expect = 0.016,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHRQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|148724503|ref|YP_001285469.1| DNA packaging protein [Cyanophage Syn5]
 gi|145588148|gb|ABP87967.1| DNA packaging protein [Synechococcus phage Syn5]
          Length = 574

 Score = 45.9 bits (107), Expect = 0.016,   Method: Composition-based stats.
 Identities = 58/398 (14%), Positives = 117/398 (29%), Gaps = 96/398 (24%)

Query: 79  KGAISAGRGIGKTTLNAWLVLWLMSTRPGISV-ICLANSET--------Q--LKTTLWAE 127
           +  ISA RG+GK+ + A  VLW++   P   + +  A+ E         Q  +    W  
Sbjct: 47  RLQISAFRGVGKSWITAAFVLWVLFVDPDRKIMVISASKERADNFSIFCQKLILDIEW-- 104

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER---PDTF 184
           +S       ++ W  + S  + PA  +      S+GI  +   +       +    P   
Sbjct: 105 LSHLRPRDSDQRWSRI-SFDVGPAKPHQAPSVKSVGITGQMTGSRAHLMVFDDVEVPANS 163

Query: 185 VGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI---- 240
                   +  +  E+              + + +A   ++    P+     + ++    
Sbjct: 164 ATDMQREKLLQLVSESESI----------LVPDDDARIMFL--GTPQSTFTIYRKLAERS 211

Query: 241 ---------FNKPLDDWK-----RFQIDTRTVEGIDPSFHEGIIARY---GLDSDVTRVE 283
                    + + L  ++     +   D      +     +           +S + R  
Sbjct: 212 YRPFVWPARYPRDLSKYEGLLAPQLVADLEKDPELTWKPTDTRFNELNLMERESAMGRSN 271

Query: 284 VCGQF---PQQDIDSFIPL-----------NIIEEA---------LNREPCPD------- 313
              QF            PL               EA         + +E  P        
Sbjct: 272 FMLQFMLDTSLSDAEKFPLKFQDLIVTPLGAECAEAYAWSADPRYMRKELNPVGLPGDRF 331

Query: 314 -----------PYAPLIMGCDIAEEGGDNTV-VVLRRGP---VIEHLFDWSKTDLRTTNN 358
                      PY+  I+  D +  G D TV VVL +      +  +  +       T +
Sbjct: 332 YGPMYIDEGIVPYSETIVSVDPSGRGTDETVAVVLSQANGYIFVRDMKAFRDGYSDETLS 391

Query: 359 KISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYR 396
            I  L ++Y+   +++++N  G      L         
Sbjct: 392 DIVRLGKRYKASKLLVESN-FGDGMITELFKRHISQMG 428


>gi|319761996|ref|YP_004125933.1| hypothetical protein Alide_1284 [Alicycliphilus denitrificans BC]
 gi|317116557|gb|ADU99045.1| hypothetical protein Alide_1284 [Alicycliphilus denitrificans BC]
          Length = 633

 Score = 45.9 bits (107), Expect = 0.017,   Method: Composition-based stats.
 Identities = 30/186 (16%), Positives = 55/186 (29%), Gaps = 25/186 (13%)

Query: 231 RRLSGKFYEIFNKPLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFP 289
            RLSG     F      W+    I      G D    + +   Y          +   F 
Sbjct: 359 DRLSGG----FTGEDRVWRNIVTILDALAGGCDLFDLDELRLEY--SDAEFANLLMCGFV 412

Query: 290 QQDIDSFIPLNIIEEALNRE-----------PCPDPYAPLIMGCDIAEEGGDNTVVVL-- 336
                S  PL++++  +                P  + P+ +G D +  G    +VVL  
Sbjct: 413 DDSF-SVFPLSMLQACMVDSWELWADFKPFSQRPFGWMPVWVGYDPSHTGDSAGLVVLAP 471

Query: 337 --RRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGY 392
             + G  +  L    +   D       I  + ++Y   A+ +D    G      ++    
Sbjct: 472 PAKPGGQLRVLHTQQFKGMDFEAQAKAIKEITQRYNVAAMTLDTTGIGQGVFQLVQKFYP 531

Query: 393 HVYRVL 398
               + 
Sbjct: 532 AARGIN 537


>gi|270307731|ref|YP_003329789.1| hypothetical protein DhcVS_300 [Dehalococcoides sp. VS]
 gi|270153623|gb|ACZ61461.1| hypothetical protein DhcVS_300 [Dehalococcoides sp. VS]
          Length = 457

 Score = 45.9 bits (107), Expect = 0.017,   Method: Composition-based stats.
 Identities = 44/291 (15%), Positives = 85/291 (29%), Gaps = 45/291 (15%)

Query: 154 YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILG 213
           ++++ H   G   +         S E     VG  NT  + +  DEA             
Sbjct: 87  FTEIYHTEGGYIIRLNQARAVFLSAEPSANVVG--NTAHLLLEVDEAQDVSKEKYTKEFK 144

Query: 214 FLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD------WKRFQIDTRTVEGIDPSFHE 267
            +     N   I+       +    EI  + ++        + F+ D   V   +P++  
Sbjct: 145 PM-GATTNVTTILYGTTWDNASLLEEIKRQNIEKEQKDGLKRHFRYDWEEVAAHNPAYLA 203

Query: 268 GIIARY---GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC---PDPYAPLIMG 321
             ++     G +  +   +     P            ++   +  PC   P+     + G
Sbjct: 204 YALSEKDRLGENHPLFLTQYR-LLPVSGGGGMFSTEQLDLLKSSHPCQIYPENGKVYVAG 262

Query: 322 CDIAEE-----GGDNTVVVLRRGPVIEHLFD----------------------WSKTDLR 354
            D+A E     G     V LRR   +  + +                      W      
Sbjct: 263 LDLAGEDGQIDGDLPATVNLRRDSSVLTIAELDYTFAKAPCNLPQLKLVCHYSWQGARHA 322

Query: 355 TTNNKISGLVEK-YRPDAIIIDANNTGARTCDYLEM-LGYHVYRVLGQKRA 403
               K+  L+ K ++   + +DA   G     +L   LG  +     Q  +
Sbjct: 323 LLYEKLVELLGKVWKCRKVAVDATGLGQPVASFLRESLGSRILPFAFQPSS 373


>gi|221065290|ref|ZP_03541395.1| protein of unknown function DUF264 [Comamonas testosteroni KF-1]
 gi|220710313|gb|EED65681.1| protein of unknown function DUF264 [Comamonas testosteroni KF-1]
          Length = 632

 Score = 45.9 bits (107), Expect = 0.017,   Method: Composition-based stats.
 Identities = 21/138 (15%), Positives = 38/138 (27%), Gaps = 21/138 (15%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------------REPCPD 313
              ++  Y    D        +F       F  L +++  +                 P 
Sbjct: 393 LAELLEEY--PDDEFSNLFRCEFIDDSNSQF-TLQMMQACMVDSWEAWADDFKPLAARPF 449

Query: 314 PYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLF--DWSKTDLRTTNNKISGLVEKY 367
            + P+ +G D +  G    +VV+      G     L    +   D       I  + E+Y
Sbjct: 450 AWQPVWVGYDPSFTGDTAALVVIAPPKVPGGKFRLLHRQQFRGADFEAQAEYIRSITERY 509

Query: 368 RPDAIIIDANNTGARTCD 385
               + ID    G     
Sbjct: 510 NVTFMGIDTTGLGQGVYQ 527


>gi|206563738|ref|YP_002234501.1| putative phage terminase large subunit [Burkholderia cenocepacia
           J2315]
 gi|198039778|emb|CAR55749.1| putative phage terminase large subunit [Burkholderia cenocepacia
           J2315]
          Length = 436

 Score = 45.9 bits (107), Expect = 0.018,   Method: Composition-based stats.
 Identities = 52/347 (14%), Positives = 98/347 (28%), Gaps = 70/347 (20%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPD---VINLGILGFLTERNANRFWIMTSNPR- 231
           +S E      G        ++ DE + T D    I    +  +            S P  
Sbjct: 99  WSLESGLFGRGREYD---LLLFDETAFTKDGTLEIYRDAISPVVATRPGFRMFSFSTPLV 155

Query: 232 -RLSGKFYEIFNKPLDDW------------KRFQIDTRTVEGIDPSFHEGIIARYGLDSD 278
             LS  FY +       W            K     +     +D  +  G   +    S 
Sbjct: 156 MDLSNFFYALHEHKDYKWNPKIGADNYERFKVHHRPSWCNPLVDDKWLRGEYRKRSALS- 214

Query: 279 VTRVEVCGQFPQQDIDSFIP---------------LNIIEEALNREPCPDPYAPLIMGC- 322
             R E+ G+F      S  P                 +I+ A+      D  A + +G  
Sbjct: 215 -WRQEIEGEFVDWSGISLFPNLNKPVDPHPRYDAVFAVIDTAMKSGIEHDGTACMWLGYS 273

Query: 323 DIAEEGGDNTVVVLRRGPVIEHLFDWSKTD----LRTTNNKISGLVEKYRPDAI-IIDAN 377
           D+   G DN  ++      +  L    + D    +      ++   + ++   +  I+  
Sbjct: 274 DV--FGPDNLHIL---DWEVTSLDASGQYDWLKRILNHGEALARHYKSHQGFTVAYIEDK 328

Query: 378 NTGARTCDYLEMLGYHVYRVLGQKRAVDLEF---------CRNRRTELHVKMADWLEFAS 428
            +G       +  G  V  +  +  A+  +            NR   +H      +EF +
Sbjct: 329 QSGIVLLQQGKESGLPVEAINSKFTALGKDERMRICVDPVHANRVKFVHESFNKLVEFKN 388

Query: 429 LINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
              +  L Q L                   +    ++ D +D   Y+
Sbjct: 389 ESKNHALKQILSYR-------------IGDKEAYKRADDLADCFAYS 422


>gi|146310689|ref|YP_001175763.1| hypothetical protein Ent638_1030 [Enterobacter sp. 638]
 gi|145317565|gb|ABP59712.1| hypothetical protein Ent638_1030 [Enterobacter sp. 638]
          Length = 402

 Score = 45.9 bits (107), Expect = 0.018,   Method: Composition-based stats.
 Identities = 48/284 (16%), Positives = 84/284 (29%), Gaps = 32/284 (11%)

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
             L  +  P           I  K+   +    + +      G        ++ DEA+ T
Sbjct: 42  DKLVEYLEPLIKSSSRSEKRILLKNGGKIDFWVTNDNKLAGRGREYD---LVLIDEAAFT 98

Query: 204 PDVINLGILGFL----TERNANRFWIMTSNPRRLS--GKFYEIFNKPLDDWKRFQIDTRT 257
                L  +       T         + S P  +     FY I  K    +      T +
Sbjct: 99  KSPEMLAEIWAKSIKPTLLTTKGRAYIFSTPDGVDEDNFFYAICRKKELGFFEHYAPTSS 158

Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317
              + P   E    R   +  V R E   +F     D+   ++   E       P+    
Sbjct: 159 NPFVPPEELEK--ERLSCEPRVFRQEFLAEFVDWSADALFDVSKWLEDGKPVEFPEMCMA 216

Query: 318 LIMGCDIAEEGG---DNTVVVL-----RRGPVIEHLFDWSKTDLR---------TTNNKI 360
           +    D A +GG   D T VV      R G     + DW    +          +  +++
Sbjct: 217 VFAVMDTAVKGGIEHDGTAVVYYAIDTRPGRERLTILDWDVVQIDGALLEVWMPSVFDRL 276

Query: 361 SGL----VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQ 400
           + L    V       + I+  + G+      E LG+ V ++   
Sbjct: 277 NELSGLCVAVNGSLGVFIEDASMGSILLQKGESLGWQVNKIESA 320


>gi|145335142|ref|NP_172040.2| chr31 (chromatin remodeling 31); ATP binding / DNA binding /
           helicase/ nucleic acid binding [Arabidopsis thaliana]
 gi|332189724|gb|AEE27845.1| chromatin remodeling 31 [Arabidopsis thaliana]
          Length = 1410

 Score = 45.9 bits (107), Expect = 0.018,   Method: Composition-based stats.
 Identities = 27/155 (17%), Positives = 55/155 (35%), Gaps = 13/155 (8%)

Query: 55  QLEFMEVVDAHCLNSVNNPNPEVFKGAISAG-----R--GIGKTTLNAWLVLWLMSTRPG 107
           Q E  E +  +   ++     + F+ +   G        G GKT L    +   +   P 
Sbjct: 827 QQEGFEFIWKNLAGTIMLNELKDFENSDETGGCIMSHAPGTGKTRLTIIFLQAYLQCFPD 886

Query: 108 ISVICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166
              + +A +   L    WA E  KW   +P  +   +       +     ++  +    S
Sbjct: 887 CKPVIIAPASLLL---TWAEEFKKWNISIPFHNLSSLDFTGKENSAALGLLMQKNATARS 943

Query: 167 KHYSTMCRTYSEERPDTFVG--HHNTYGMAIINDE 199
            +   M + YS  +  + +G  ++    +A + DE
Sbjct: 944 NNEIRMVKIYSWIKSKSILGISYNLYEKLAGVKDE 978


>gi|110740804|dbj|BAE98499.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1410

 Score = 45.9 bits (107), Expect = 0.018,   Method: Composition-based stats.
 Identities = 27/155 (17%), Positives = 55/155 (35%), Gaps = 13/155 (8%)

Query: 55  QLEFMEVVDAHCLNSVNNPNPEVFKGAISAG-----R--GIGKTTLNAWLVLWLMSTRPG 107
           Q E  E +  +   ++     + F+ +   G        G GKT L    +   +   P 
Sbjct: 827 QQEGFEFIWKNLAGTIMLNELKDFENSDETGGCIMSHAPGTGKTRLTIIFLQAYLQCFPD 886

Query: 108 ISVICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166
              + +A +   L    WA E  KW   +P  +   +       +     ++  +    S
Sbjct: 887 CKPVIIAPASLLL---TWAEEFKKWNISIPFHNLSSLDFTGKENSAALGLLMQKNATARS 943

Query: 167 KHYSTMCRTYSEERPDTFVG--HHNTYGMAIINDE 199
            +   M + YS  +  + +G  ++    +A + DE
Sbjct: 944 NNEIRMVKIYSWIKSKSILGISYNLYEKLAGVKDE 978


>gi|8778726|gb|AAF79734.1|AC005106_15 T25N20.14 [Arabidopsis thaliana]
          Length = 1465

 Score = 45.9 bits (107), Expect = 0.018,   Method: Composition-based stats.
 Identities = 27/155 (17%), Positives = 55/155 (35%), Gaps = 13/155 (8%)

Query: 55   QLEFMEVVDAHCLNSVNNPNPEVFKGAISAG-----R--GIGKTTLNAWLVLWLMSTRPG 107
            Q E  E +  +   ++     + F+ +   G        G GKT L    +   +   P 
Sbjct: 882  QQEGFEFIWKNLAGTIMLNELKDFENSDETGGCIMSHAPGTGKTRLTIIFLQAYLQCFPD 941

Query: 108  ISVICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166
               + +A +   L    WA E  KW   +P  +   +       +     ++  +    S
Sbjct: 942  CKPVIIAPASLLL---TWAEEFKKWNISIPFHNLSSLDFTGKENSAALGLLMQKNATARS 998

Query: 167  KHYSTMCRTYSEERPDTFVG--HHNTYGMAIINDE 199
             +   M + YS  +  + +G  ++    +A + DE
Sbjct: 999  NNEIRMVKIYSWIKSKSILGISYNLYEKLAGVKDE 1033


>gi|319407675|emb|CBI81323.1| phage-related protein [Bartonella sp. 1-1C]
          Length = 442

 Score = 45.9 bits (107), Expect = 0.018,   Method: Composition-based stats.
 Identities = 31/193 (16%), Positives = 63/193 (32%), Gaps = 9/193 (4%)

Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNAN--RFWIMTSNPRRLSGKFYEIFN-KPLDD 247
             +    DEA    D     ++  L E          +T NP R +    + F      +
Sbjct: 122 RILLCWVDEAEPVTDAAWQILIPTLREEGKEWHSELWVTWNPCRENAAVEKRFRFTEDPN 181

Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
            K  +I+ R         +    A      +  +    G++ Q    ++    ++E    
Sbjct: 182 IKGVEINWRDNPKFPAKLNRDRKADLEQRPEQYQHIWEGEYLQAMQGAYYQKLLLEAEQE 241

Query: 308 REPCPDPYAPLI---MGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTNNKIS 361
                 P  PLI   +  DI   G   D T + + +       + D+ +   +  +  I 
Sbjct: 242 GRITIVPRDPLIQVKIFWDIGGTGAKADATALWVAQFVGREIRVLDYYEAQGQPLSEHIG 301

Query: 362 GLVEKYRPDAIII 374
            + +K    A+++
Sbjct: 302 WVCQKGYEKALMV 314


>gi|224535035|ref|ZP_03675589.1| phage terminase, large subunit, pbsx family [Borrelia spielmanii
           A14S]
 gi|224513696|gb|EEF84036.1| phage terminase, large subunit, pbsx family [Borrelia spielmanii
           A14S]
          Length = 379

 Score = 45.5 bits (106), Expect = 0.019,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299


>gi|216968428|ref|YP_002333693.1| phage terminase, large subunit, pbsx family [Borrelia afzelii
           ACA-1]
 gi|216752682|gb|ACJ73366.1| phage terminase, large subunit, pbsx family [Borrelia afzelii
           ACA-1]
          Length = 450

 Score = 45.5 bits (106), Expect = 0.019,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299


>gi|99080898|ref|YP_613052.1| hypothetical protein TM1040_1057 [Ruegeria sp. TM1040]
 gi|99037178|gb|ABF63790.1| DNA packaging protein Gp17 (Terminase) [Ruegeria sp. TM1040]
          Length = 425

 Score = 45.5 bits (106), Expect = 0.021,   Method: Composition-based stats.
 Identities = 64/422 (15%), Positives = 115/422 (27%), Gaps = 77/422 (18%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTTLWAEVSKWL 132
           I  GRG GKT   A    W+ S   G           V  +  +  Q++  +   +    
Sbjct: 33  ILGGRGAGKTRAGA---EWVRSQVEGAGPFGVGSARRVALVGETYDQVRDVM---IHGDS 86

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
            +L                P                     + +S   P+   G      
Sbjct: 87  GILACSP------------PDRRPEWRAGERRLLWPNGASAQAFSASDPEVLRGPQFD-- 132

Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250
            A   DE +           +   L          +T+ PR +     +   +       
Sbjct: 133 -AAWVDELAKWRRAQEAWDMLQFAL-RLGTAPRVCVTTTPRNV--PLLKGLLQSPSTVTT 188

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
                     + PSF   + ARY   S + R E+ G        +    +++ E   R+ 
Sbjct: 189 HAPTEANSANLAPSFLSEVRARY-AGSRLARQELDGVLLADVDGALWSSDMLAEIQRRDT 247

Query: 311 CPDPYAPLIMGCDI---AEEGGDNTVVVL----RRGP----VIEHLFDWSKTDLRTTN-- 357
                  +++  D    A +G D   +++     +GP        L D +   L  T   
Sbjct: 248 P--RLDRIVVAVDPSVSAHKGSDACGIIVAGAQTQGPISSWRAYVLADHTVQGLGPTGWA 305

Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRRTE 415
                  + Y+ D ++ + N  GA     L  +        V   K           R E
Sbjct: 306 RAAIAARDAYKADRLVAEVNQGGALVGTVLRQVDPLVPFTPVHASKGKA-------ARAE 358

Query: 416 LHVKMADWLEFASLINHSGLIQN--LKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLM 473
               + +            L +   L + + +               +G  S D  D L+
Sbjct: 359 PVAALYEQGRVHHAPGLQELEEQMCLMTAQGY---------------RGDASPDRVDALV 403

Query: 474 YT 475
           + 
Sbjct: 404 WA 405


>gi|11497347|ref|NP_051454.1| hypothetical protein BBN43 [Borrelia burgdorferi B31]
 gi|6382368|gb|AAF07680.1|AE001581_22 conserved hypothetical protein [Borrelia burgdorferi B31]
          Length = 450

 Score = 45.5 bits (106), Expect = 0.021,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKIYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITDDYIFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|216997755|ref|YP_002333847.1| phage terminase, large subunit, pbsx family protein [Borrelia
           afzelii ACA-1]
 gi|216752400|gb|ACJ73182.1| phage terminase, large subunit, pbsx family protein [Borrelia
           afzelii ACA-1]
          Length = 450

 Score = 45.5 bits (106), Expect = 0.021,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFAQINITQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299


>gi|216969097|ref|YP_002333737.1| PBSX family phage termninase large subunit [Borrelia afzelii ACA-1]
 gi|216753027|gb|ACJ73621.1| phage terminase, large subunit, PBSX family [Borrelia afzelii
           ACA-1]
          Length = 450

 Score = 45.5 bits (106), Expect = 0.022,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYDFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299


>gi|187939507|gb|ACD38655.1| terminase ATPase subunit [Pseudomonas aeruginosa]
          Length = 593

 Score = 45.1 bits (105), Expect = 0.025,   Method: Composition-based stats.
 Identities = 27/161 (16%), Positives = 50/161 (31%), Gaps = 22/161 (13%)

Query: 248 WKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
           W++   I      G D    + +   Y  D++  +  +  QF      S  PL +++  +
Sbjct: 336 WRQIVTILDAEARGCDLFDIDELRLEY--DAEAFQNLLMCQFVDDGA-SIFPLTMLQPCM 392

Query: 307 NR------------EPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDW 348
                            P     + +G D AE G    +VV+      G     L    +
Sbjct: 393 VDSWDLWSEDYKPFALRPFGDRQVWLGYDPAETGDTAGLVVVAPPAVPGGKFRVLERHQF 452

Query: 349 SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
              D       I  + ++Y    I +D    G+     +  
Sbjct: 453 RGKDFAEQAEFIRKVTQRYWVTYIGVDTTGMGSGVAQLVRQ 493


>gi|17313220|ref|NP_490600.1| predicted DNA-dependent ATPase terminase subunit [Pseudomonas phage
           phiCTX]
 gi|4063774|dbj|BAA36228.1| unnamed protein product [Pseudomonas phage phiCTX]
          Length = 594

 Score = 45.1 bits (105), Expect = 0.025,   Method: Composition-based stats.
 Identities = 27/161 (16%), Positives = 50/161 (31%), Gaps = 22/161 (13%)

Query: 248 WKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
           W++   I      G D    + +   Y  D++  +  +  QF      S  PL +++  +
Sbjct: 336 WRQIVTILDAEARGCDLFDIDELRLEY--DAEAFQNLLMCQFVDDGA-SIFPLTMLQPCM 392

Query: 307 NR------------EPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDW 348
                            P     + +G D AE G    +VV+      G     L    +
Sbjct: 393 VDSWDLWSEDYKPFALRPFGDRQVWLGYDPAETGDTAGLVVVAPPAVPGGKFRVLERHQF 452

Query: 349 SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
              D       I  + ++Y    I +D    G+     +  
Sbjct: 453 RGKDFAEQAEFIRKVTQRYWVTYIGVDTTGMGSGVAQLVRQ 493


>gi|260556808|ref|ZP_05829026.1| P-loop protein [Acinetobacter baumannii ATCC 19606]
 gi|260410067|gb|EEX03367.1| P-loop protein [Acinetobacter baumannii ATCC 19606]
          Length = 437

 Score = 45.1 bits (105), Expect = 0.025,   Method: Composition-based stats.
 Identities = 53/332 (15%), Positives = 93/332 (28%), Gaps = 70/332 (21%)

Query: 78  FKGAISAGRGIGKTTLNAW--------LVLWLMSTRPGISVICLANSETQLKTTLWAEVS 129
           F+ A+  GR  GKT L              W +S      +   A +  Q K   W  + 
Sbjct: 31  FRDAV-CGRRFGKTFLAKAEMRRAARLAQKWNVSVEDE--IWYAAPTFKQAKRVFWKRLK 87

Query: 130 KWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189
           + +                 P  W     + +    +     + R    +  D   G   
Sbjct: 88  QAI-----------------PPSWRFGKPNETECTITLKTGHVIRVVGLDNYDDLRG--- 127

Query: 190 TYGMAIINDEASGTPDVIN-LGILGFLTE--------RNANRFWIMTSNPRRLSGKFYEI 240
           +    +I DE +          +   L+         +      +    P+  +   Y+ 
Sbjct: 128 SGLFFLIIDEWADCKWAAWEEVLRPMLSTCKYTVNGVQRVGGNVLRIGTPKGYN-HCYDT 186

Query: 241 F-------NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI 293
           +             W    +    +        E  +AR  +D    R E    F     
Sbjct: 187 WMDGQNGREPDHKSWIYTSLQGGNIP-----ESEIDVARRKMDPKTFRQEYEASFETYQ- 240

Query: 294 DSFIPLNIIEEALNR-----EPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW 348
                  +I     R     E        L +G D   +     VV +R G  +  + ++
Sbjct: 241 ------GVIYYCFERTFNCTEKVVKEGDVLHIGMDFNVQKM-AAVVYVRDGEELYAVGEF 293

Query: 349 SKTDLRTTNNKISGLVEKYRPDAIII--DANN 378
              DL  T   I  +  KY+   II+  DA+ 
Sbjct: 294 --KDLFDTPAMIEAIKAKYQDHEIIVYPDASG 323


>gi|194436023|ref|ZP_03068125.1| putative conserved hypothetical protein [Escherichia coli 101-1]
 gi|194424751|gb|EDX40736.1| putative conserved hypothetical protein [Escherichia coli 101-1]
          Length = 595

 Score = 45.1 bits (105), Expect = 0.025,   Method: Composition-based stats.
 Identities = 35/238 (14%), Positives = 65/238 (27%), Gaps = 46/238 (19%)

Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY---EIFNKPLDDWK 249
           +  DE    P    +N    G  T ++    +  T + +   G  +   + + K     K
Sbjct: 257 LYIDEYLWIPGFRRLNEVASGMATHKHWRITYFSTPSSKTHQGYPFWSGDEWRKGDPKRK 316

Query: 250 RFQIDT--------RTVE----------------GIDPSFHEGIIARYGLDSDVTRVEVC 285
             +  +        R                   G + +    +  RY        +   
Sbjct: 317 GVEFPSFDELRDGGRECPDGQWRYVVTLEDAIAGGFNLADINELRERYNET--AFNMLFM 374

Query: 286 GQFPQQDIDSF---------IPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL 336
             F       F         + ++  E+    EP P     +  G D A  G + T VVL
Sbjct: 375 CVFVDDKESVFKFDDLVRCGVDVSTWEDFHPEEPMPFGNREVWGGFDPARSGDNATFVVL 434

Query: 337 R------RGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
                      +     W     +    +I  +  +Y    I ID    G    + ++
Sbjct: 435 APPLVSAERFRVLEKHHWRSMSFQFMAERIRSIKARYNMTFIGIDVTGLGYGVFELVQ 492


>gi|218964078|ref|YP_002455438.1| putative phage terminase, pbsx family protein [Borrelia afzelii
           ACA-1]
 gi|216752969|gb|ACJ73583.1| putative phage terminase, pbsx family protein [Borrelia afzelii
           ACA-1]
          Length = 450

 Score = 45.1 bits (105), Expect = 0.025,   Method: Composition-based stats.
 Identities = 55/289 (19%), Positives = 93/289 (32%), Gaps = 45/289 (15%)

Query: 55  QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLM----STRP-GIS 109
           Q E +  +++H  + V      +F G I++    GKT L ++L++  +    S      +
Sbjct: 49  QKEVLFDIESHTYSKV------IFSGGIAS----GKTFLASYLLIKKLIENKSFYEQDTN 98

Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169
              + NS   L T    ++ K   L        +          +  +    L I     
Sbjct: 99  NFIIGNSIGLLMTNTIKQIEKICGL------LGIDYQKKKSGQSFCKIAGLELNI----- 147

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229
                 Y  +  D F          I  +EA+       L ++  L  R      I  +N
Sbjct: 148 ------YGGKNRDAFSKIRGGNSAIIYVNEATVIHKETLLEVMKRL--RKGKSIIIFDTN 199

Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQF 288
           P   +  F   + +  D +K +   T         F E     Y       +  V  G++
Sbjct: 200 PESPAHYFKTDYIENTDVFKTYNFTTYDNPLNSADFIETQEKLY-KHFPAYKARVLYGEW 258

Query: 289 PQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
              +   F      E   N++         IM  D A   GGDNT V +
Sbjct: 259 VLNESSLF-----NEMIFNQDYEFKSP---IMYIDPAFSVGGDNTAVCV 299


>gi|170769336|ref|ZP_02903789.1| conserved hypothetical protein [Escherichia albertii TW07627]
 gi|170121988|gb|EDS90919.1| conserved hypothetical protein [Escherichia albertii TW07627]
          Length = 595

 Score = 45.1 bits (105), Expect = 0.025,   Method: Composition-based stats.
 Identities = 35/238 (14%), Positives = 65/238 (27%), Gaps = 46/238 (19%)

Query: 195 IINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY---EIFNKPLDDWK 249
           +  DE    P    +N    G  T ++    +  T + +   G  +   + + K     K
Sbjct: 257 LYIDEYLWIPGFRRLNEVASGMATHKHWRITYFSTPSSKTHQGYPFWSGDEWRKGDPKRK 316

Query: 250 RFQIDT--------RTVE----------------GIDPSFHEGIIARYGLDSDVTRVEVC 285
             +  +        R                   G + +    +  RY        +   
Sbjct: 317 GVEFPSFDELRDGGRECPDGQWRYVVTLEDAIAGGFNLADINELRERYNET--AFNMLFM 374

Query: 286 GQFPQQDIDSF---------IPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL 336
             F       F         + ++  E+    EP P     +  G D A  G + T VVL
Sbjct: 375 CVFVDDKESVFKFDDLVRCGVDVSTWEDFHPEEPMPFGNREVWGGFDPARSGDNATFVVL 434

Query: 337 R------RGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
                      +     W     +    +I  +  +Y    I ID    G    + ++
Sbjct: 435 APPLVSAERFRVLEKHHWRSMSFQFMAERIRSIKARYNMTFIGIDVTGLGYGVFELVQ 492


>gi|117621599|ref|YP_853855.1| hypothetical protein BAPKO_2028 [Borrelia afzelii PKo]
 gi|110890985|gb|ABH02150.1| hypothetical protein BAPKO_2028 [Borrelia afzelii PKo]
          Length = 450

 Score = 45.1 bits (105), Expect = 0.025,   Method: Composition-based stats.
 Identities = 55/289 (19%), Positives = 93/289 (32%), Gaps = 45/289 (15%)

Query: 55  QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLM----STRP-GIS 109
           Q E +  +++H  + V      +F G I++    GKT L ++L++  +    S      +
Sbjct: 49  QKEVLFDIESHTYSKV------IFSGGIAS----GKTFLASYLLIKKLIENKSFYEQDTN 98

Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169
              + NS   L T    ++ K   L        +          +  +    L I     
Sbjct: 99  NFIIGNSIGLLMTNTIKQIEKICGL------LGIDYQKKKSGQSFCKIAGLELNI----- 147

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229
                 Y  +  D F          I  +EA+       L ++  L  R      I  +N
Sbjct: 148 ------YGGKNRDAFSKIRGGNSAIIYVNEATVIHKETLLEVMKRL--RKGKSIIIFDTN 199

Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQF 288
           P   +  F   + +  D +K +   T         F E     Y       +  V  G++
Sbjct: 200 PESPAHYFKTDYIENTDVFKTYNFTTYDNPLNSADFIETQEKLY-KHFPAYKARVLYGEW 258

Query: 289 PQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
              +   F      E   N++         IM  D A   GGDNT V +
Sbjct: 259 VLNESSLF-----NEMIFNQDYEFKSP---IMYIDPAFSVGGDNTAVCV 299


>gi|224022826|ref|YP_002606317.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           64b]
 gi|223929278|gb|ACN23995.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           64b]
          Length = 450

 Score = 45.1 bits (105), Expect = 0.026,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI  + +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITNDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|224020463|ref|YP_002601168.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           64b]
 gi|223929158|gb|ACN23879.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           64b]
          Length = 450

 Score = 45.1 bits (105), Expect = 0.026,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI  + +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITNDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|219869985|ref|YP_002474251.1| phage terminase, large subunit, pbsx family protein [Borrelia
           burgdorferi 156a]
 gi|219692877|gb|ACL34089.1| phage terminase, large subunit, pbsx family protein [Borrelia
           burgdorferi 156a]
          Length = 450

 Score = 45.1 bits (105), Expect = 0.026,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI  + +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITNDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|195942413|ref|ZP_03087795.1| hypothetical protein Bbur8_06149 [Borrelia burgdorferi 80a]
 gi|312201120|gb|ADQ44434.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           297]
 gi|312201339|gb|ADQ44646.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           297]
          Length = 450

 Score = 45.1 bits (105), Expect = 0.026,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI  + +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITNDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|11497152|ref|NP_051291.1| hypothetical protein BB_R45 [Borrelia burgdorferi B31]
 gi|6382173|gb|AAF07489.1|AE001577_3 conserved hypothetical protein [Borrelia burgdorferi B31]
          Length = 450

 Score = 45.1 bits (105), Expect = 0.026,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI  + +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITNDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|11497247|ref|NP_051377.1| hypothetical protein BB_O44 [Borrelia burgdorferi B31]
 gi|6382268|gb|AAF07582.1|AE001579_11 conserved hypothetical protein [Borrelia burgdorferi B31]
          Length = 450

 Score = 45.1 bits (105), Expect = 0.026,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI  + +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITNDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|331035425|gb|AEC52982.1| large terminase protein [Synechococcus phage S-CRM01]
          Length = 567

 Score = 45.1 bits (105), Expect = 0.026,   Method: Composition-based stats.
 Identities = 47/329 (14%), Positives = 95/329 (28%), Gaps = 56/329 (17%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GK+T     ++     R  I++  LAN        +    +K      N   +  Q +  
Sbjct: 84  GKSTTVTAYLIHQAIFRDNINIAILANKRETAYELM----AKLQLSYENLPKWMQQGV-- 137

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP---- 204
               W    +    G      ST              G        ++ DE +  P    
Sbjct: 138 --LGWNKGSIELENGSRITASSTSSSAVR--------GF---AYNIVMLDEFAFVPTNVA 184

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF---NKPLDDWKRFQIDTRTVEGI 261
           D     +   ++    +   I+ S P  ++  FY+++    K  + +   +     V G 
Sbjct: 185 DDFFSSVYPTISS-GKSTKVIIVSTPCGMN-HFYKMWTDATKGRNSYNPIEAHWSEVPGR 242

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA----- 316
           D  F +  IA   L     + E    F    + + I    ++  +  EP           
Sbjct: 243 DEKFKQETIANTSLSQ--WQQEFETDFI-GSVGTLINPAKLKSLVYDEPLLSSGGLDVYE 299

Query: 317 ---------------PLIMGCDIAEEG--GDNTVVVLRRGPVIEHLFDWSKTD---LRTT 356
                            ++  D++       +  +V         L    + +       
Sbjct: 300 HPIMKDENDENSRDHEYMITVDVSRGMKLDYSAFLVFDITQYPHRLVAKYRNNEIKPMLF 359

Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCD 385
            + I  + +KY    I+ + N+ G +   
Sbjct: 360 PDVIVPVAKKYNNAWILCEVNDIGDQVAS 388


>gi|45597419|ref|NP_996704.1| TerL [Lactococcus phage phiLC3]
 gi|45504639|gb|AAS66808.1| large subunit terminase [Lactococcus phage phiLC3]
          Length = 469

 Score = 45.1 bits (105), Expect = 0.026,   Method: Composition-based stats.
 Identities = 56/349 (16%), Positives = 103/349 (29%), Gaps = 51/349 (14%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQ   ++ + A   + +       +          GKT +   L LW +    G+S++ 
Sbjct: 41  PWQKNLLKEIMAIDEDGLWTHQKFGYSIPRRN----GKTEIVYILELWAL--EQGLSILH 94

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
            A+  +   ++    + K+L         + +S+         + L          + T 
Sbjct: 95  TAHRISTSHSSYEK-LKKYLEDSGYVEGEDFKSIKAK----GQERLELIESGGVIQFRT- 148

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232
            RT S    + F          +  DEA          +   +T+ + N   IM   P  
Sbjct: 149 -RTSSGGLGEGFD--------ILFIDEAQEYTTEQESALKYTVTDSD-NPMTIMCGTPPT 198

Query: 233 L------SGKFYE---IFNKPLDDWKRFQI-------DTRTVEGIDPS-----FHEGIIA 271
                     + +           W  + +       D       +PS         I A
Sbjct: 199 PVSSGTVFTNYRDNTLAGKAKYSGWAEWSVEDVKDIHDVEAWYNSNPSMGYHLNERKIEA 258

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN 331
             G D     V+  G +P+ +  S I       AL     P     L +G    + G D 
Sbjct: 259 ELGEDKLDHNVQRLGYWPKYNQKSVISEQE-WNALKVNRLPVIKGKLFVGI---KYGNDG 314

Query: 332 TVVVLRRGPVIEH----LFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
             V +            +       +R  N  I   ++K   + ++ID 
Sbjct: 315 ANVAMSIAVKTLSGKVFVETIDCQSIRNGNQWIINFLKKADVEKVVIDG 363


>gi|219723219|ref|YP_002474654.1| phage terminase, large subunit, pbsx family protein [Borrelia
           burgdorferi 156a]
 gi|219692798|gb|ACL34012.1| phage terminase, large subunit, pbsx family protein [Borrelia
           burgdorferi 156a]
 gi|312148753|gb|ADQ31404.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           JD1]
          Length = 450

 Score = 45.1 bits (105), Expect = 0.027,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI  + +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITADYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|119953680|ref|YP_950600.1| putative large terminase subunit [Staphylococcus phage CNPH82]
 gi|112361306|gb|ABI15678.1| putative large terminase subunit [Staphylococcus phage CNPH82]
 gi|329736010|gb|EGG72285.1| phage terminase, large subunit, PBSX family [Staphylococcus
           epidermidis VCU045]
          Length = 421

 Score = 44.7 bits (104), Expect = 0.032,   Method: Composition-based stats.
 Identities = 29/240 (12%), Positives = 76/240 (31%), Gaps = 28/240 (11%)

Query: 60  EVVDAHCLNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET 118
           E++  H  +             +   GRG GK++  + ++   +  R  ++ + +  ++ 
Sbjct: 9   ELLPKHFHSLWKATKDREKLNIVAKGGRGSGKSSDISIIIT-QLIMRYPMNAVVVRKTDN 67

Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178
            L T+++ ++   +      H F+++                 +    +    + R    
Sbjct: 68  TLATSVFEQIKWAIEEQKVSHLFKVKVS------------PMEITYVPRGNRIIFRGA-- 113

Query: 179 ERPDTFVGHHNTY---GMAIINDEASG-TPDVINLGILGFL---TERNANRFWIMTSNPR 231
           + P+      ++     +  I + A   T D +       L    +      +  + NP 
Sbjct: 114 QNPERLKSLKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDDGLFYKFFFSYNPP 173

Query: 232 RLSGKF----YEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQ 287
           +    +    YE   +P + +            I   F +   +    +    R E  G+
Sbjct: 174 KRKQSWVNKKYETSFQPDNTFVHHS-TYLDNPFISKQFIQEAESAKERNEQRYRWEYMGE 232


>gi|56560912|ref|YP_161331.1| hypothetical protein BGP046 [Borrelia garinii PBi]
 gi|52696553|gb|AAU85896.1| hypothetical protein BGP046 [Borrelia garinii PBi]
          Length = 450

 Score = 44.7 bits (104), Expect = 0.032,   Method: Composition-based stats.
 Identities = 31/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSTLIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  +    +  V  G++      
Sbjct: 206 YFKTDYIDNVATFKTYNFTTYDNVLLSKGFIETQEKLY-KEIPTYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299


>gi|56560985|ref|YP_161401.1| hypothetical protein BGP116 [Borrelia garinii PBi]
 gi|52696625|gb|AAU85966.1| hypothetical protein BGP116 [Borrelia garinii PBi]
          Length = 336

 Score = 44.7 bits (104), Expect = 0.033,   Method: Composition-based stats.
 Identities = 34/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           + RG GKT   A + L    +  G   + +   + +   ++  E+ + LS+   + +F +
Sbjct: 26  SSRGTGKTYDIATVNLERKFSVDGGDTLAIRKKKNKTTQSIHKEILELLSIYSLRKFFNI 85

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196
               +                           + ++R   F G H+T  +        + 
Sbjct: 86  SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYTHFKDLW 124

Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254
            +EA+         ++  + E+    +  M+SNP   S   Y+ +  N+        +  
Sbjct: 125 LEEANQFSSDDIEMLIPTMREQGGRIY--MSSNPVPKSHWLYKRYLANQDNPAVCIIKST 182

Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287
            R    ++    E  + +    Y  +    R+EV G+
Sbjct: 183 YRDNPFLNGGDVEAWLEKQKLAYHGNDIGFRIEVLGE 219


>gi|312984196|ref|ZP_07791542.1| putative phage terminase, large subunit [Lactobacillus crispatus
           CTV-05]
 gi|310894415|gb|EFQ43491.1| putative phage terminase, large subunit [Lactobacillus crispatus
           CTV-05]
          Length = 632

 Score = 44.7 bits (104), Expect = 0.034,   Method: Composition-based stats.
 Identities = 27/166 (16%), Positives = 48/166 (28%), Gaps = 21/166 (12%)

Query: 54  WQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAW----LVLWLMSTRPGIS 109
           WQ   + +++    +  +         ++  GRG GKT +        VL          
Sbjct: 101 WQKFILAMING-WKDENDEKRFTDIHISV--GRGQGKTQIAGIQMCKAVLIDTLNYTNKD 157

Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169
            +  AN+  Q  T L+  + K L  +     F   +         + ++           
Sbjct: 158 FLVTANTSDQ-STKLFGYIKKMLEAVIKIEPFASLAKESGLDLQTNQIIEKRTNNKVWKI 216

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP--DVINLGILG 213
           S     Y            +T+ +  I DE       D I     G
Sbjct: 217 SYEADKYD-----------STHNVLAIYDETGALNTYDRITDITDG 251


>gi|224591529|ref|YP_002640858.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           WI91-23]
 gi|224554111|gb|ACN55505.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           WI91-23]
          Length = 450

 Score = 44.7 bits (104), Expect = 0.034,   Method: Composition-based stats.
 Identities = 31/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    +  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALVFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI  + +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITNDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|219873383|ref|YP_002477648.1| phage terminase, large subunit, pbsx family [Borrelia garinii
           Far04]
 gi|219694616|gb|ACL35135.1| phage terminase, large subunit, pbsx family [Borrelia garinii
           Far04]
          Length = 267

 Score = 44.7 bits (104), Expect = 0.034,   Method: Composition-based stats.
 Identities = 43/248 (17%), Positives = 78/248 (31%), Gaps = 36/248 (14%)

Query: 55  QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLM----STRP-GIS 109
           Q E +  +++H  + V      +F G I++    GKT L ++L++  +    S      +
Sbjct: 49  QKEVLFDIESHDYSKV------IFSGGIAS----GKTFLASYLLIKKLIENKSFYEKDTN 98

Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169
              + NS   L T    ++ K         +  +          +  +    L I     
Sbjct: 99  NFIIGNSIGLLMTNTIKQIEKICG------FLGIDYQKKKSGESFCKIAGLELNI----- 147

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN 229
                 Y  +  D+F          I  +EA+       L  +  L  R      I  +N
Sbjct: 148 ------YGGKNRDSFSKIRGGNSAIIYVNEATVIHKETLLEAIKRL--RKGKAIIIFDTN 199

Query: 230 PRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQF 288
           P   +  F   F +  D +K +   T         F E     Y       +  V  G++
Sbjct: 200 PESPTHFFKTDFIENKDVFKTYNFTTYDNPLNSADFIETQKKLY-KHLPAYKARVLYGEW 258

Query: 289 PQQDIDSF 296
              +   F
Sbjct: 259 ILNESTLF 266


>gi|326387547|ref|ZP_08209153.1| hypothetical protein Y88_0459 [Novosphingobium nitrogenifigens DSM
           19370]
 gi|326207593|gb|EGD58404.1| hypothetical protein Y88_0459 [Novosphingobium nitrogenifigens DSM
           19370]
          Length = 656

 Score = 44.7 bits (104), Expect = 0.035,   Method: Composition-based stats.
 Identities = 39/251 (15%), Positives = 64/251 (25%), Gaps = 61/251 (24%)

Query: 185 VGHHNTYGMAIINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFY---- 238
            G+H      +I DE       + +        T +   R     S P  L  + Y    
Sbjct: 310 QGYHGD----VIVDECFWIYGFEELFKVASAMATHKQYTRTL--FSTPSTLDHEAYGMWS 363

Query: 239 -EIFNKPLDDWKRFQIDTRT------------------------VEGIDPSFHEGIIARY 273
            + FN+      + +ID                            +G+D    + +    
Sbjct: 364 GDRFNRRRAKADKVRIDIANEHLRDGSLGPDGVWRQVVTIFDAIAKGLDLVDVDELQREN 423

Query: 274 GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-NREPCPDPYAP---------LIMGCD 323
           G+D           F      S  P  ++   + +       Y P         + +G D
Sbjct: 424 GIDE--FDNLFRCIFLDDSQ-SMFPFALMRRCMVDAWEVWQDYQPYALRPYAGEVWLGYD 480

Query: 324 IAEEGG----DNTVVVLR-----RGPVIEHLFDWS--KTDLRTTNNKISGLVEKYRPDAI 372
                     D+  +V        G     L        D     + I  +  KYR   I
Sbjct: 481 PNASEDNPTSDDAALVAIAPPSAIGGKFRILEKKRLKGLDFAGQADAIREMAGKYRVTKI 540

Query: 373 IIDANNTGART 383
            ID    G   
Sbjct: 541 GIDTTGAGKAV 551


>gi|255652557|ref|ZP_05399459.1| hypothetical protein CdifQCD_20411 [Clostridium difficile
           QCD-37x79]
          Length = 591

 Score = 44.7 bits (104), Expect = 0.036,   Method: Composition-based stats.
 Identities = 47/366 (12%), Positives = 102/366 (27%), Gaps = 75/366 (20%)

Query: 86  RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145
           RG  K+ + A          P   +   A +++Q +  +  ++ K L         E++ 
Sbjct: 71  RGFAKSWIAAVYACCRAVLYPNSKIGIAAFTKSQAELIIREKIEKELVKQSPMLAREIKK 130

Query: 146 LSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPD 205
           +  +   +     H    I++   +   R +                  +I DE      
Sbjct: 131 IE-YNNKFSKVTFHNGSTIEAIVSNEQSRGFRFN--------------ILIVDEFRLVKK 175

Query: 206 VINLGILGFLTERNANRFWIMTS-------NPRR---LSGKFYEIFNKP--LDDWKRFQI 253
            I   IL      + N  +            P +   LS  ++ +         + +  +
Sbjct: 176 EIQDRILKPFLNVSRNLKFKKDGKYEDYPPEPNKELYLSSAWFRMHEAYDKFKLYVKDMV 235

Query: 254 DTRTV-------------EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFI--- 297
           D R                 +D    + +     +D+    +E+   F  ++ D+     
Sbjct: 236 DGRDKFVLNCNYKLSLHHGILDKERADEMKRE--MDAVSWIMEMESLFFGENEDAIFKSS 293

Query: 298 -------------PLNIIEEALNREPCPD------PYAPLIMGCDIA---EEGGDNTV-- 333
                        P   +E    +                I+  DIA    +  DN+V  
Sbjct: 294 YVNPCRTLKNPFYPPTDLEILSAKNGKVKCNLQKRKGELRIISADIAVAEGDNNDNSVYT 353

Query: 334 ---VVLRRGPVIE---HLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYL 387
              ++  +        H+   +         ++  L   +  D ++ID    G      L
Sbjct: 354 CWRLLPEKDYYERMVVHIESHNGMKPDKQAIRLKQLFFDFEADFLVIDTQGVGQSVLSDL 413

Query: 388 EMLGYH 393
             + Y 
Sbjct: 414 LRVNYD 419


>gi|225621943|ref|YP_002724616.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1]
 gi|225547242|gb|ACN93227.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1]
          Length = 450

 Score = 44.7 bits (104), Expect = 0.037,   Method: Composition-based stats.
 Identities = 31/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  +    +  V  G++      
Sbjct: 206 YFKTDYIDNVATFKTYNFTTYDNVLLSKGFIETQEKLY-KEIPTYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299


>gi|23455748|ref|NP_695057.1| putative terminase [Lactococcus phage r1t]
 gi|1353546|gb|AAB18704.1| ORF29 [Lactococcus phage r1t]
          Length = 469

 Score = 44.7 bits (104), Expect = 0.038,   Method: Composition-based stats.
 Identities = 57/349 (16%), Positives = 104/349 (29%), Gaps = 51/349 (14%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQ   ++ V A   + +       +          GKT +   L LW +    G+S++ 
Sbjct: 41  PWQKNLLKEVMAIDEDGLWTHQKFGYSIPRRN----GKTEIVYILELWSLV--QGLSILH 94

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
            A+  +   ++    + K+L         + +S+         + L          + T 
Sbjct: 95  TAHRISTSHSSYEK-LKKYLEDSGYVEGEDFKSIKAK----GQERLELIESGGVIQFRT- 148

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232
            RT S    + F          ++ DEA          +   +T+ + N   IM   P  
Sbjct: 149 -RTSSGGLGEGFD--------ILVIDEAQEYTTEQESALKYTVTDSD-NPMTIMCGTPPT 198

Query: 233 L------SGKFYE---IFNKPLDDWKRFQI-------DTRTVEGIDPS-----FHEGIIA 271
                     + +           W  + +       D       +PS         I A
Sbjct: 199 PVSSGTVFTNYRDNTIAGKAKYSGWAEWSVEDVKDIHDVEAWYNSNPSMGYHLNERKIEA 258

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN 331
             G D     V+  G +P+ +  S I       AL     P     L +G    + G D 
Sbjct: 259 ELGEDKLDHNVQRLGYWPKYNQKSVISEQE-WNALKVNRLPVIKGKLFVGI---KYGNDG 314

Query: 332 TVVVLRRGPVIEH----LFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
             V +            +       +R  N  I   ++K   + ++ID 
Sbjct: 315 ANVAMSIAVKTLSGKVFVETIDCQSIRNGNQWIINFLKKADVEKVVIDG 363


>gi|269978258|ref|ZP_06185208.1| putative phage terminase, large subunit [Mobiluncus mulieris 28-1]
 gi|269933767|gb|EEZ90351.1| putative phage terminase, large subunit [Mobiluncus mulieris 28-1]
          Length = 477

 Score = 44.7 bits (104), Expect = 0.038,   Method: Composition-based stats.
 Identities = 39/237 (16%), Positives = 78/237 (32%), Gaps = 26/237 (10%)

Query: 200 ASGTPDVINLGILG----FLTERNANRFWIM---TSNPRRLSGKFYEIFNKPLDDWKRFQ 252
           ++G  D     + G     +T  NA     +   T +   ++      ++     W+   
Sbjct: 183 STGLADS--EVLEGLRTRAVTGENAGSLCYLEWSTKSWDEMTVSERSHWDDDRAKWRA-- 238

Query: 253 IDTRTVEGIDPSFHEGIIARY------GLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEA 305
            D       +P+F   I A Y         SD+    E  G + +   DS IP+++ +  
Sbjct: 239 -DPEVWREANPAFEIRISADYMQKELASEMSDIDFEREHLGIWERIGGDSLIPVDVWQSL 297

Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365
            N +  P     L +    + E     +  +R           +   L     ++  L  
Sbjct: 298 ANEKSQPGENIVLALDVPPSREQAFIAMASIRDDGKTHLELVDTADGLAWITPRLQQLQR 357

Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422
           KYRP+AI++DA +        L+       ++ G               + +  + +
Sbjct: 358 KYRPEAIVVDAQSAAGSLLPELKANRVRTLQISG-------RDYAKACGQFYDAVRE 407


>gi|322507236|gb|ADX02690.1| Putative phage terminase [Acinetobacter baumannii 1656-2]
          Length = 378

 Score = 44.7 bits (104), Expect = 0.039,   Method: Composition-based stats.
 Identities = 44/279 (15%), Positives = 77/279 (27%), Gaps = 35/279 (12%)

Query: 114 ANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173
           A +  Q++   +  +           W     +         +          + Y T  
Sbjct: 5   APTYPQIRDIFFPTI-----EEVAFDWGLKTKVY--------ETNKEVDIYYGRQYRTTI 51

Query: 174 RTYSEERPDTFVGHHNTYGMAIINDE----ASGTPDVINLGILGFLTERNANRF-WIMTS 228
              S E+P T VG    + +    DE    A          I+  +  + A     I  +
Sbjct: 52  ICRSMEKPATIVGFKIGHAL---IDELDVMAKVKAQQAWRKIIARMRYKQAGLLNGIDVA 108

Query: 229 NPRRLSGKFYEIFNKP-------LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVT 280
                    YE F K           +   Q  T   E  +   +   +   Y     + 
Sbjct: 109 TTPEGFKFTYEQFVKEANKSEAKRKLYGMIQASTYDNEANLPDDYISSLYESY--PPQLI 166

Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGP 340
              + GQF      +  P +      + +       PL++G D         V V+R G 
Sbjct: 167 SAYLRGQFVNLTSGAVYP-DFDRVLNHTDEEIKKGEPLLIGMDFNVLKMAAVVYVIREG- 224

Query: 341 VIEHLFDWSK-TDLRTTNNKISGLVEKYRPDAIIIDANN 378
               L +     D  T    I+     +    +I DA+ 
Sbjct: 225 KPRALDELVGVRDTPTMCQLINERFPDH-DITVIPDASG 262


>gi|239502405|ref|ZP_04661715.1| putative phage terminase [Acinetobacter baumannii AB900]
          Length = 427

 Score = 44.7 bits (104), Expect = 0.039,   Method: Composition-based stats.
 Identities = 54/326 (16%), Positives = 97/326 (29%), Gaps = 52/326 (15%)

Query: 75  PEVFKGAISAGRGIGKTTLN-------AWLVLWLMSTRPGISVIC--LANSETQLKTTLW 125
           P  F+  + AG G GKT +        +W             V     A +  Q++   +
Sbjct: 19  PNKFRAFV-AGFGSGKTWVGCSSLCDKSWS---------FPKVPLGYFAPTYPQIRDIFF 68

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
             +           W       ++ +    D+ +       + Y +     S E+P+T V
Sbjct: 69  PTI-----DEVAFDWGL--KTKIYESNKEVDLYYG------RQYRSTIICRSMEKPNTIV 115

Query: 186 GHHNTYGMAIINDEASGT-PDVINLGILGFLTERNANRF-WIMTSNPRRLSGKFYEIFNK 243
           G    + +    D  +          I+  +  + A     I  +         +E F K
Sbjct: 116 GFKIGHALIDELDVMTKVKAQQAWRKIIARMRYKQAGLLNGIDVATTPEGFKFTHEQFVK 175

Query: 244 P-------LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDS 295
                      +   Q  T   E  +   +   +   Y     +    + GQF      +
Sbjct: 176 EANLSDAKRALYGMIQASTYDNEVNLPDDYIASLFESY--PPQLISAYLKGQFVNLTSGA 233

Query: 296 FIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLR 354
             P +      + +    P   L++G D         V V+R G     L +     D  
Sbjct: 234 VYP-DFDRTLNHTDEEIRPNEALLIGMDFNVLKMAAVVYVIRDG-KPRALDELVGVRDTP 291

Query: 355 TTNNKISGLVEKYRPD--AIIIDANN 378
           T  +    L+EK+      II DA  
Sbjct: 292 TMADL---LIEKFPNHEMTIIPDAAG 314


>gi|169633422|ref|YP_001707158.1| putative phage terminase [Acinetobacter baumannii SDF]
 gi|169152214|emb|CAP01118.1| conserved hypothetical protein; Putative phage terminase
           [Acinetobacter baumannii]
          Length = 432

 Score = 44.7 bits (104), Expect = 0.039,   Method: Composition-based stats.
 Identities = 54/326 (16%), Positives = 97/326 (29%), Gaps = 52/326 (15%)

Query: 75  PEVFKGAISAGRGIGKTTLN-------AWLVLWLMSTRPGISVIC--LANSETQLKTTLW 125
           P  F+  + AG G GKT +        +W             V     A +  Q++   +
Sbjct: 24  PNKFRAFV-AGFGSGKTWVGCSSLCDKSWS---------FPKVPLGYFAPTYPQIRDIFF 73

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
             +           W       ++ +    D+ +       + Y +     S E+P+T V
Sbjct: 74  PTI-----DEVAFDWGL--KTKIYESNKEVDLYYG------RQYRSTIICRSMEKPNTIV 120

Query: 186 GHHNTYGMAIINDEASGT-PDVINLGILGFLTERNANRF-WIMTSNPRRLSGKFYEIFNK 243
           G    + +    D  +          I+  +  + A     I  +         +E F K
Sbjct: 121 GFKIGHALIDELDVMTKVKAQQAWRKIIARMRYKQAGLLNGIDVATTPEGFKFTHEQFVK 180

Query: 244 P-------LDDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDS 295
                      +   Q  T   E  +   +   +   Y     +    + GQF      +
Sbjct: 181 EANQSDAKRALYGMIQASTYDNEANLPDDYIASLFESY--PPQLISAYLKGQFVNLTSGA 238

Query: 296 FIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSK-TDLR 354
             P +      + +    P   L++G D         V V+R G     L +     D  
Sbjct: 239 VYP-DFDRTLNHTDEEIRPNEALLIGMDFNVLKMAAVVYVIRDG-KPRALDELVGVRDTP 296

Query: 355 TTNNKISGLVEKYRPD--AIIIDANN 378
           T  +    L+EK+      II DA  
Sbjct: 297 TMADL---LIEKFPNHEMTIIPDAAG 319


>gi|117530337|ref|YP_851180.1| superfamily II DNA/RNA helicase [Microcystis phage Ma-LMM01]
 gi|117165949|dbj|BAF36257.1| superfamily II DNA/RNA helicase [Microcystis phage Ma-LMM01]
          Length = 483

 Score = 44.7 bits (104), Expect = 0.041,   Method: Composition-based stats.
 Identities = 31/177 (17%), Positives = 53/177 (29%), Gaps = 27/177 (15%)

Query: 58  FMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE 117
            ++                 ++G + A  G GK+ + A L+++          + +  + 
Sbjct: 96  AIDARLRLDQQEAVAGILRGYRGYVRAATGYGKSAVIATLMMYF-----EARRLIVVPTV 150

Query: 118 TQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY 176
             L     AE V +W SL P                   D L+  +    + Y       
Sbjct: 151 RLLYQM--AEDVQEWASLSPGLVGDG-NDDISTMTIATVDTLYERIKRGDRRY------- 200

Query: 177 SEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRL 233
                +   G         + DEA    +    GI   L+  NA     MT+ P R 
Sbjct: 201 ----IEWLSGIE-----VAVFDEAHTYMNA--SGITTALSLVNARYKIGMTATPTRT 246


>gi|11497404|ref|NP_051512.1| hypothetical protein BB_Q50 [Borrelia burgdorferi B31]
 gi|6382425|gb|AAF07735.1|AE001584_32 conserved hypothetical protein [Borrelia burgdorferi B31]
          Length = 450

 Score = 44.7 bits (104), Expect = 0.041,   Method: Composition-based stats.
 Identities = 31/161 (19%), Positives = 51/161 (31%), Gaps = 13/161 (8%)

Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
            ++   F     +    I  +EA+         +L  L  R      I  +NP      F
Sbjct: 150 GDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEHYF 207

Query: 238 YEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDIDSF 296
              +   +  +K +   T     +   F E     Y  D    +  V  G++       F
Sbjct: 208 KTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIF 266

Query: 297 IPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
             +NI  + +   P        I   D A   GGDNT + +
Sbjct: 267 TQINITNDYVFTSP--------IAYLDPAFSVGGDNTALCV 299


>gi|313499430|gb|ADR60796.1| P-loop protein [Pseudomonas putida BIRD-1]
          Length = 374

 Score = 44.3 bits (103), Expect = 0.041,   Method: Composition-based stats.
 Identities = 45/329 (13%), Positives = 92/329 (27%), Gaps = 62/329 (18%)

Query: 78  FKGAISAGRGIGKTTLNAW--------LVLWLMSTRPGISVICLANSETQLKTTLWAEVS 129
           F+ A+  GR  GKT L              W +S      +   A +  Q K   W  + 
Sbjct: 34  FRDAV-CGRRFGKTFLGKAEMRRAARLAAEWGVSVEDE--IWYGAPTFKQAKRVFWRRLK 90

Query: 130 KWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189
           + +                 P  W +   + +    +     + R    +  D   G   
Sbjct: 91  QAI-----------------PEAWRAARPNETECSITLKSGHIMRVVGLDNYDNLRG--- 130

Query: 190 TYGMAIINDEASGTPDVINLGILGFLTERNANRF-----------WIMTSNPRRLSGKFY 238
           +    ++ DE +         +L  +                    +    P+      Y
Sbjct: 131 SGLFFVLVDEWADCSWAAWEEVLRPMLSTCQYTIPQTGESRKGGHALRIGTPKG-FNHCY 189

Query: 239 EIFN-------KPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQ 291
           + +             W+   +    V  ++        AR  +D    R E    F  +
Sbjct: 190 DTYRDGQPGGEPDHKSWQYTSLQGGNVPAVELD-----AARRKMDPRTFRQEYEAGF--E 242

Query: 292 DIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKT 351
           +    +         +      P   + +G D         V V+R G     L ++   
Sbjct: 243 NYAGVVYSTFDRAECHTSERIKPGEAIHIGMDFNVMKMAAVVYVVRDGL-PLALDEFH-- 299

Query: 352 DLRTTNNKISGLVEKYRPDAIII--DANN 378
            +R T + I  +  ++   ++ +  DA+ 
Sbjct: 300 SVRDTPDMIEKIKVRFSGHSVSVYPDASG 328


>gi|319404714|emb|CBI78316.1| phage-related protein [Bartonella rochalimae ATCC BAA-1498]
          Length = 442

 Score = 44.3 bits (103), Expect = 0.042,   Method: Composition-based stats.
 Identities = 31/193 (16%), Positives = 63/193 (32%), Gaps = 9/193 (4%)

Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNAN--RFWIMTSNPRRLSGKFYEIFN-KPLDD 247
             +    DEA    D     ++  L E          +T NP R +    + F      +
Sbjct: 122 RILLCWVDEAEPVTDAAWQILIPTLREEGKEWHSELWVTWNPCRENAAVEKRFRFTKDPN 181

Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
            K  +I+ R         +    A      +  +    G++ Q    ++    ++E    
Sbjct: 182 IKGVEINWRDNPKFPAKLNRDRQADLEQRPEQYQHIWEGEYLQAMQGAYYQKLLLEAEQE 241

Query: 308 REPCPDPYAPLI---MGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTNNKIS 361
                 P  PLI   +  DI   G   D T + + +       + D+ +   +  +  I 
Sbjct: 242 GRITTVPRDPLIQVKIFWDIGGTGAKADATALWVAQFVGREIRVLDYYEAQGQPLSEHIG 301

Query: 362 GLVEKYRPDAIII 374
            + +K    A+++
Sbjct: 302 WVCQKGYEKALMV 314


>gi|306818632|ref|ZP_07452355.1| possible phage-related terminase [Mobiluncus mulieris ATCC 35239]
 gi|304648805|gb|EFM46107.1| possible phage-related terminase [Mobiluncus mulieris ATCC 35239]
          Length = 470

 Score = 44.3 bits (103), Expect = 0.042,   Method: Composition-based stats.
 Identities = 39/237 (16%), Positives = 78/237 (32%), Gaps = 26/237 (10%)

Query: 200 ASGTPDVINLGILG----FLTERNANRFWIM---TSNPRRLSGKFYEIFNKPLDDWKRFQ 252
           ++G  D     + G     +T  NA     +   T +   ++      ++     W+   
Sbjct: 176 STGLADS--EVLEGLRTRAVTGENAGSLCYLEWSTKSWDEMTVSERSHWDDDRAKWRA-- 231

Query: 253 IDTRTVEGIDPSFHEGIIARY------GLDSDV-TRVEVCGQFPQQDIDSFIPLNIIEEA 305
            D       +P+F   I A Y         SD+    E  G + +   DS IP+++ +  
Sbjct: 232 -DPEVWREANPAFEIRISADYMQKELASEMSDIDFEREHLGIWERIGGDSLIPVDVWQSL 290

Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365
            N +  P     L +    + E     +  +R           +   L     ++  L  
Sbjct: 291 ANEKSQPGENIVLALDVPPSREQAFIAMASIRDDGKTHLELVDTADGLAWITPRLQQLQR 350

Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMAD 422
           KYRP+AI++DA +        L+       ++ G               + +  + +
Sbjct: 351 KYRPEAIVVDAQSAAGSLLPELKANRVRTLQISG-------RDYAKACGQFYDAVRE 400


>gi|332290535|ref|YP_004421387.1| conserved hypothetical protein, Terminase-like family
           [Gallibacterium anatis UMN179]
 gi|330433431|gb|AEC18490.1| conserved hypothetical protein, Terminase-like family
           [Gallibacterium anatis UMN179]
          Length = 597

 Score = 44.3 bits (103), Expect = 0.043,   Method: Composition-based stats.
 Identities = 20/135 (14%), Positives = 40/135 (29%), Gaps = 17/135 (12%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN----REPCPDPYAP---- 317
            E +  +Y  +          ++       F    +++ A +    ++  PD   P    
Sbjct: 361 LEALKRKY--NKAAFDQLFMCKWIDDADSIFNISQLLKCATDISKWQDFRPDSDRPLDNR 418

Query: 318 -LIMGCDIA--EEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPD 370
            +  G D A   +G    V+           I     W          +I  + ++Y   
Sbjct: 419 EVWCGYDPAKSYDGASFVVIAPPVLPGEKYRILERHQWHGLSYSYQAEQIKQIYQRYNVS 478

Query: 371 AIIIDANNTGARTCD 385
            I ID +  G    +
Sbjct: 479 YIGIDTSGVGVGVYE 493


>gi|307544941|ref|YP_003897420.1| cobalamin synthesis protein, P47K [Halomonas elongata DSM 2581]
 gi|307216965|emb|CBV42235.1| cobalamin synthesis protein, P47K [Halomonas elongata DSM 2581]
          Length = 399

 Score = 44.3 bits (103), Expect = 0.043,   Method: Composition-based stats.
 Identities = 29/136 (21%), Positives = 51/136 (37%), Gaps = 7/136 (5%)

Query: 300 NIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHL--FDWSKTDLRTTN 357
           ++I+  L  +P  + +A +I   +    G D T+       +IE L              
Sbjct: 81  SLIQHLLAHKPAGERWAVVIN--EFGRVGIDQTMFEAHDDLIIESLPGGCLCCQQAVVLR 138

Query: 358 NKISGLVEKYRPDAIIIDANNTG--ARTCDYLEMLGY-HVYRVLGQKRAVDLEFCRNRRT 414
             +  L+ ++RPD +II+ +  G  A   D L   G+     + G    +D     + R 
Sbjct: 139 ASLVRLLRRHRPDRLIIEPSGLGHPAGLLDLLRGEGFADALDIRGVVAVLDPRRLDDTRA 198

Query: 415 ELHVKMADWLEFASLI 430
             H    D L  A  +
Sbjct: 199 MAHETFLDQLRMADAV 214


>gi|226949140|ref|YP_002804231.1| putative phage terminase, large subunit [Clostridium botulinum A2
           str. Kyoto]
 gi|226841904|gb|ACO84570.1| putative phage terminase, large subunit [Clostridium botulinum A2
           str. Kyoto]
          Length = 572

 Score = 44.3 bits (103), Expect = 0.043,   Method: Composition-based stats.
 Identities = 54/306 (17%), Positives = 97/306 (31%), Gaps = 34/306 (11%)

Query: 57  EFMEVVDAHCLNSVNNPNP-EVFKGA-ISAGRGIGKTTLNAWLVLWL--MSTRPGISVIC 112
           +F E +    +  V        F+ + I  GR  GK+ LN  L  +L   S      + C
Sbjct: 77  QFQEFILGSLIGWVTKDKEYRRFRSSYIQLGRQNGKSFLNGILGTYLGNFSGYKYGKIFC 136

Query: 113 LANSETQLKTTLWAEVSKW-LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYST 171
           +A    Q K  +W E++K+  S       F +Q          ++ +  +LG D+K    
Sbjct: 137 VATKHDQAK-IVWDEMNKFIQSDDDLGELFTVQEYKSTIICNLTNTVIKALGRDTKGLD- 194

Query: 172 MCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNAN---------- 221
             R       +     H    M  + +   G    I   ++  +T               
Sbjct: 195 GLRPLLTVIDEYHA--HKDNQMYKLME---GGQKKIKQSLISVITTAGFELESPCHKMYK 249

Query: 222 -RFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGI--DPSFHEGIIARY----- 273
               I+    +  S   Y       DD   +Q   +    +  D    E +I  Y     
Sbjct: 250 YCKQILEGTEKNESKFIYIAEMDEEDDLNNYQNWIKANPMLQYDREALENLIPVYKSAKA 309

Query: 274 --GLDSDVTRVEVCGQFPQQDIDSFI-PLNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330
               D +    +    + +     ++      + A N+          I+G D++  GGD
Sbjct: 310 IGSKDWNDFLTKQLNMWVEFTETKYMNMTAWNKCASNKTLEDFRGQEFILGIDLS-SGGD 368

Query: 331 NTVVVL 336
            T +  
Sbjct: 369 LTSICF 374


>gi|163850863|ref|YP_001638906.1| hypothetical protein Mext_1434 [Methylobacterium extorquens PA1]
 gi|163662468|gb|ABY29835.1| protein of unknown function DUF264 [Methylobacterium extorquens
           PA1]
          Length = 458

 Score = 44.3 bits (103), Expect = 0.043,   Method: Composition-based stats.
 Identities = 62/296 (20%), Positives = 98/296 (33%), Gaps = 43/296 (14%)

Query: 56  LEFMEVVDAHCLNSVNNPNPEVFKG-AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114
           L  +E    H       P P  +   A+  GRG GKT   A    W+     G  V    
Sbjct: 46  LRLLEADWLHLARHDQLPPPGNWTTWAVIGGRGSGKTRTGA---EWVRGLAHGDPVFTPE 102

Query: 115 NSET-QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173
             E   L    +A+V   +   P+     +  L   P  W         G        + 
Sbjct: 103 PVERIALVGETFADVRDVMIEGPSG-LLALPRLGGAPPVWQPSRRRVVFGN-----GAVA 156

Query: 174 RTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN-PRR 232
             +S E PD+  G       A+ +DE +                 +  +F +     PR 
Sbjct: 157 LAFSAEEPDSLRG---PQFGAVWSDEVAK-----WREAEAT---YDMIQFGLRLGTHPRG 205

Query: 233 LSGKFYEIFNKPLDDWKRFQIDTRTV----------EGIDPSFHEGIIARYGLDSDVTRV 282
           L         +P+   +R   D RTV          + + PSF E ++ RY   + + R 
Sbjct: 206 LVT----TTPRPVPLIRRLLADPRTVVTRSRTADNAQNLAPSFLEEVVGRY-AGTRLRRQ 260

Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDI---AEEGGDNTVVV 335
           E+ G+  +   D+    + IE A  R     P   + +  D    +  G D   +V
Sbjct: 261 ELDGELIEDRPDALWTRDAIERA--RVSEAPPLQRIAVAIDPPASSRVGADACGIV 314


>gi|225626397|ref|YP_002727892.1| terminase large subunit [Enterococcus phage EFAP-1]
 gi|225346568|gb|ACN86334.1| terminase large subunit [Enterococcus phage EFAP-1]
          Length = 574

 Score = 44.3 bits (103), Expect = 0.045,   Method: Composition-based stats.
 Identities = 49/303 (16%), Positives = 97/303 (32%), Gaps = 52/303 (17%)

Query: 68  NSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLW-LMST-RPGIS--VICLANSETQLKTT 123
               N      K  IS  R  GK+ L A + L+  +    P  S  ++  AN++ Q    
Sbjct: 89  RKKKNKMRRFRKVYISLARKNGKSILVAGISLYEFLLGQYPNASRQIVAAANTKDQ---- 144

Query: 124 LWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDT 183
                     ++ N    ++++L                 I+     +  +  S +  D+
Sbjct: 145 --------AGIVFNMLKSQLKALRAVSDGTRKVTKVNKKDIEHLEDESTVKPLSSD-VDS 195

Query: 184 FVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFY----- 238
             G     G+     EA  T   +   +    +++      I+++  + L+G  +     
Sbjct: 196 LDGLDVLCGVLDEYGEAKSTA--MIEVLESSQSQQLQGLILIISTTTKNLNGPMHSIEYP 253

Query: 239 ---EIFNKPLDD-------WKRFQIDTRTVE---------GIDPSFHEGI-------IAR 272
              ++ N+ ++        W+   +     E           +   HE +       +A 
Sbjct: 254 FITKLLNEEVEADAYLALCWEMDSLSEVDDEANWIKSNPLFENAQLHETMYEHKVNSLAE 313

Query: 273 YGLDSDV--TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330
           Y    D+     +    + Q   DSFI     E     +P      P+ +G D+A  G  
Sbjct: 314 YKAKGDMSGWLTKEMNFWVQSSQDSFIDKESWEAVKQTKPYDIKGRPVYIGLDLARTGDM 373

Query: 331 NTV 333
             V
Sbjct: 374 TAV 376


>gi|328543446|ref|YP_004303555.1| endopeptidase Clp ATP-binding chain A [Polymorphum gilvum
           SL003B-26A1]
 gi|326413190|gb|ADZ70253.1| Endopeptidase Clp ATP-binding chain A [Polymorphum gilvum
           SL003B-26A1]
          Length = 813

 Score = 44.3 bits (103), Expect = 0.047,   Method: Composition-based stats.
 Identities = 34/215 (15%), Positives = 73/215 (33%), Gaps = 19/215 (8%)

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTR-VEVCGQFPQQDIDSFIPLNIIE 303
               K   ++   V   + +    I    G DS++ R +++  +  + +     PL + +
Sbjct: 171 KPKKKTDALEAYCVNLNEKATKGKIDPLIGRDSEIARTIQILCRRSKNN-----PLFVGD 225

Query: 304 EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL 363
             + +    +  A  I+  D+ E   D T+  L  G ++       + D      ++   
Sbjct: 226 PGVGKTAIAEGLARRIVKGDVPEVLKDATIFALDMGALL--AGTRYRGDFEERLKQVVKE 283

Query: 364 VEKYRPDAIIID----ANNTGARTCDYLEMLG-YHVYRVLGQKRAVDLEFCRNRRTELHV 418
           +E+Y    + ID        GA +   ++           G  R +     +  R +   
Sbjct: 284 IEEYPGAVMFIDEIHTVIGAGATSGGAMDASNLLKPALASGAIRCIGSTTYKEYR-QFFE 342

Query: 419 KMADWLEFASLINHSG-----LIQNLKSLKSFIVP 448
           K    +     I+ +       I+ LK LK +   
Sbjct: 343 KDRALVRRFQKIDVNEPSVPDAIEILKGLKPYFED 377


>gi|293393565|ref|ZP_06637875.1| conserved hypothetical protein [Serratia odorifera DSM 4582]
 gi|291423900|gb|EFE97119.1| conserved hypothetical protein [Serratia odorifera DSM 4582]
          Length = 572

 Score = 44.3 bits (103), Expect = 0.048,   Method: Composition-based stats.
 Identities = 58/338 (17%), Positives = 99/338 (29%), Gaps = 62/338 (18%)

Query: 195 IINDEASGTPDVIN--LGILGFLTERNANRFWIMTSNPRRLSGKFY-----EIFNKPLDD 247
           +  DEA    + +N      G  T     R     S P     + Y     ++FNK    
Sbjct: 230 LYLDEAFWISNFLNLRKVAAGMATHEGLRRT--YFSTPSSEEHEAYQFWTGDLFNKSRRK 287

Query: 248 WKRFQIDTRTVEGIDPSFHEGIIAR---------------------YGLDSDV-TRVEVC 285
            +R +ID       +       I R                      G +S         
Sbjct: 288 AERVEIDISHKALKNGRLGGDGIWRQIVTIEDAIKLGFNRVKIETIKGENSPEDYDNLYR 347

Query: 286 GQFPQQDIDSF-----IPLNIIEEALNREPCPDPYAP-------LIMGCDI--AEEGGDN 331
            +F      +F     I   +     +  P  +P+AP       + +G D       GD+
Sbjct: 348 CRFVTVGERAFNYNAMIGCCVDGFNDDVWPDWNPFAPRPIGDRGVWIGYDPNGGSGNGDS 407

Query: 332 TVVVLRR-----GPVIEHL--FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTC 384
             +V+       G     +        +       I GL E+Y    I ID         
Sbjct: 408 AGLVVIVPPAVAGGKFRIIERVQLRGMEFEEQAKVIEGLTERYNVQHIAIDGTG---GFG 464

Query: 385 DYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKS 444
           D +  L      V     AV  ++    +  + +K    +    L   +G++  ++S  +
Sbjct: 465 DAVWQL-----VVKFFPLAVKYQYSVQLKRAMVLKALMLVRAGRLELDAGMMDLIQSFMT 519

Query: 445 FIVPNTGE-LAIESKRVKGAKSTDYSDGLMYTFAENPP 481
                 G  +   S R +G+   D +     T   N P
Sbjct: 520 VRKVQKGNVMTYVSDRKRGSNHGDLAWA-SMTALYNEP 556


>gi|56560881|ref|YP_161301.1| hypothetical protein BGP016 [Borrelia garinii PBi]
 gi|52696522|gb|AAU85866.1| hypothetical protein BGP016 [Borrelia garinii PBi]
          Length = 396

 Score = 44.3 bits (103), Expect = 0.052,   Method: Composition-based stats.
 Identities = 34/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           + RG GKT   A + L    +  G   + +   + +   ++  E+ + LS+   + +F +
Sbjct: 26  SSRGTGKTYDIATVNLERKFSVDGGDTLAIRKKKNKTTQSIHKEILELLSIYGLRKFFNI 85

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196
               +                           + ++R   F G H+T  +        + 
Sbjct: 86  SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124

Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254
            +EA+         ++  + E+    +  M+SNP   S   Y+ +  N+        +  
Sbjct: 125 LEEANQFSSDDIEMLIPTMREQGGRIY--MSSNPVPKSHWLYKRYLANQDNPAVCIIKST 182

Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287
            R    ++    E  + +    Y  +    R+EV G+
Sbjct: 183 YRDNPFLNGGDVEAWLEKQKLAYHGNDIGFRIEVLGE 219


>gi|213580952|ref|ZP_03362778.1| hypothetical protein SentesTyph_07004 [Salmonella enterica subsp.
           enterica serovar Typhi str. E98-0664]
          Length = 67

 Score = 44.3 bits (103), Expect = 0.052,   Method: Composition-based stats.
 Identities = 12/48 (25%), Positives = 18/48 (37%)

Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
            I     W   D R   + I  L E+Y    I ID+   G    + ++
Sbjct: 16  RILERHQWRGMDFRAQADAIKKLTEQYNVTYIGIDSTGVGHGVYENVK 63


>gi|213162920|ref|ZP_03348630.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Typhi str. E00-7866]
          Length = 113

 Score = 44.3 bits (103), Expect = 0.052,   Method: Composition-based stats.
 Identities = 12/48 (25%), Positives = 18/48 (37%)

Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
            I     W   D R   + I  L E+Y    I ID+   G    + ++
Sbjct: 16  RILERHQWRGMDFRAQADAIKKLTEQYNVTYIGIDSTGVGHGVYENVK 63


>gi|323139470|ref|ZP_08074518.1| hypothetical protein Met49242DRAFT_3906 [Methylocystis sp. ATCC
           49242]
 gi|322395272|gb|EFX97825.1| hypothetical protein Met49242DRAFT_3906 [Methylocystis sp. ATCC
           49242]
          Length = 439

 Score = 44.3 bits (103), Expect = 0.052,   Method: Composition-based stats.
 Identities = 65/412 (15%), Positives = 125/412 (30%), Gaps = 62/412 (15%)

Query: 82  ISAGRGIGKTTLNA----WLVLW--LMSTRPGISVICLANSETQLKTTLWAEVSKWLSLL 135
           I  GRG GKT   A     L L      TRP   +  +  +   ++  +   VS  L++ 
Sbjct: 54  ILGGRGAGKTRAGAEWVKGLALGRPHFCTRPVSRIALIGETAADVREVMIEGVSGLLAIH 113

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVG--HHNTYGM 193
             +     +S                          + + +S E P++  G   H     
Sbjct: 114 GKRDRPRWESSR---------------RRLVWDSGVVAQAFSAEDPESLRGPQFHAA--- 155

Query: 194 AIINDEASG--TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF 251
               DE +           +   L   +  R  + T+ P R +    ++   P     R 
Sbjct: 156 --WCDELAKWRYARETWDMLQFGLRLGDWPRQLV-TTTP-RPTPLLKDLIAHPATVLTR- 210

Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
            +       + PSF E ++A+Y   + + R E+ G+  ++  D+    ++IE   +R   
Sbjct: 211 ALTRENAANLAPSFLESVVAQY-AGTRLGRQELDGEIVEERKDALWTRDLIEA--SRVAD 267

Query: 312 PDPYAPLIMGCD-IAEEGG--DNTVVVLRRGPVIEHLFDWSKTDLRTT-----NNKISGL 363
               A +++  D  A  G   DN  ++         +F  + + +              L
Sbjct: 268 APRLARIVVAVDPPASFGKRADNCGIIAAGADAGGAIFVLADSTISAARPAQWARAAIAL 327

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW 423
             K   D ++ + N  G      L         V   +             +L+ +    
Sbjct: 328 YHKLSADVLVAEVNQGGEMVRAVLNEAD-PAAPVTMVRATRGKYLRAAPVAQLYEQGR-- 384

Query: 424 LEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
           ++         L   +           G            +S D  D L++ 
Sbjct: 385 VKHVGAFP--ALEDEMC-----DFGFDGLSC--------GRSPDRLDALVWA 421


>gi|323978427|gb|EGB73511.1| terminase [Escherichia coli TW10509]
          Length = 595

 Score = 43.9 bits (102), Expect = 0.054,   Method: Composition-based stats.
 Identities = 23/162 (14%), Positives = 44/162 (27%), Gaps = 20/162 (12%)

Query: 244 PLDDWK-RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII 302
           P   W+    ++     G + +    +  RY        +     F      S    + +
Sbjct: 334 PDGQWRYVVTLEDAIAGGFNLADINELRERYNET--AFNMLFMCVFVDDKE-SVFKFDDL 390

Query: 303 EEALNREPCPDPYAP----------LIMGCDIAEEGGDNTVVVLR------RGPVIEHLF 346
                     + + P          +  G D A  G + T VVL           +    
Sbjct: 391 VRCGVDVSTWEDFHPEDAMPFGNREVWGGFDPARSGDNATFVVLSPPLVAAERFRVLEKH 450

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
            W     +    +I  +  +Y    I ID    G    + ++
Sbjct: 451 HWRSMSFQFMAERIRSIKARYNMTFIGIDVTGLGYGVFELVQ 492


>gi|291085166|ref|ZP_06570961.1| terminase, ATPase subunit [Citrobacter youngae ATCC 29220]
 gi|291072161|gb|EFE10270.1| terminase, ATPase subunit [Citrobacter youngae ATCC 29220]
          Length = 106

 Score = 43.9 bits (102), Expect = 0.055,   Method: Composition-based stats.
 Identities = 22/84 (26%), Positives = 28/84 (33%), Gaps = 6/84 (7%)

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLV 364
            P  + P  +G D +  G      VL      G     L    W   D  T    I  L 
Sbjct: 17  RPFNWRPAWIGYDPSHTGDSAGCAVLAPPLVAGGKFRILERHQWRGMDFATQAEAIRELT 76

Query: 365 EKYRPDAIIIDANNTGARTCDYLE 388
           EKY  + I IDA + G      + 
Sbjct: 77  EKYCVEYIGIDATDIGQGVYQLVR 100


>gi|259418958|ref|ZP_05742875.1| phage DNA Packaging Protein [Silicibacter sp. TrichCH4B]
 gi|259345180|gb|EEW57034.1| phage DNA Packaging Protein [Silicibacter sp. TrichCH4B]
          Length = 478

 Score = 43.9 bits (102), Expect = 0.056,   Method: Composition-based stats.
 Identities = 59/423 (13%), Positives = 110/423 (26%), Gaps = 79/423 (18%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTTLWAEVSKWL 132
           I  GRG GKT   A    W+ S   G           +  +  +  Q++  +        
Sbjct: 86  ILGGRGAGKTRAGA---EWVRSEVEGAEPFGIGRARRMALVGETYDQVRDVM-------- 134

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
             +         S       W +                  + +S   P+   G      
Sbjct: 135 --IHGDSGILACSPPDRRPEWRAGERRLVWPN-----GATAQAFSASDPEALRGPQFD-- 185

Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250
            A   DE +           +   L    A R  +  +   R      ++   P      
Sbjct: 186 -AAWVDELAKWRRAQDAWDMLQFALRLGAAPR--VCVTTTPRNVPLLKQLLESPSTV-TT 241

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
                     + P F   + ARYG  S + R E+ G        +     ++E+   R+ 
Sbjct: 242 HAPTEANRANLAPGFLTEVRARYG-GSRLARQELDGVMLADVDGALWTSGMLEQLQRRDR 300

Query: 311 CPDPYAPLIMGCDI---AEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISG----- 362
            P     +++  D    A +G D   +++        + +W         +         
Sbjct: 301 PP--LDRIVVAVDPSVSAHKGSDACGIIVAGAQTQGPISEWRAY---VLADHTVQGLGPT 355

Query: 363 --------LVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNR 412
                     + YR + ++ + N  GA     L  +        V   K           
Sbjct: 356 GWARAAIAARDAYRAERLVAEVNQGGALVGTVLRQVDPLVPFTPVHASKGKA-------A 408

Query: 413 RTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGL 472
           R E    + +            L + +  + +      G             S D  D L
Sbjct: 409 RAEPVAALYEQGRVHHAPGLQELEEQMCLMTAQGYRGDG-------------SPDRVDAL 455

Query: 473 MYT 475
           ++ 
Sbjct: 456 VWA 458


>gi|254474412|ref|ZP_05087798.1| phage DNA Packaging Protein [Ruegeria sp. R11]
 gi|214028655|gb|EEB69490.1| phage DNA Packaging Protein [Ruegeria sp. R11]
          Length = 417

 Score = 43.9 bits (102), Expect = 0.057,   Method: Composition-based stats.
 Identities = 64/415 (15%), Positives = 113/415 (27%), Gaps = 63/415 (15%)

Query: 82  ISAGRGIGKTTLNA-WLVLWLMSTRPGI-----SVICLANSETQLKTTLWAEVSKWLSLL 135
           I  GRG GKT   A W+      + P        +  L  +  Q++  +    S  L+  
Sbjct: 25  ILGGRGAGKTRAGAEWVRALAEGSTPLSAGRARRIALLGETYDQVRDVMVQGDSGILACT 84

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
           P               P        +            + +S   P+   G       A 
Sbjct: 85  P---------------PDRRPQWKATERRLIWPNGATAQAFSAHDPEALRGPQFD---AA 126

Query: 196 INDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
             DE +           +   L     +    +T+ P R      ++   P    +    
Sbjct: 127 WADELAKWKRGQDSWDMLQFAL-RLGTDPRVCVTTTP-RNVSVLRDLLASPSTV-QTHAA 183

Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313
                  +  SF E +  RY   S + R E+ G   Q    +      +  A  R   P 
Sbjct: 184 TEANRANLATSFIEEVRNRY-AGSRLGRQELDGVLLQDVEGALWCNAGLVGAQVRSAPP- 241

Query: 314 PYAPLIMGCDIA---EEGGDNTVVVLR--------RGPVIEHLFDWSKTDLRTT--NNKI 360
               +++  D A    +  D   +++         +      L D +    R       +
Sbjct: 242 -LDRVVVAVDPAVSAGKSSDACGILVVGAVLQGPPQDWRAYVLADCTVQGARPLVWAQAV 300

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420
                ++  D ++ + N  GA     L  +   V       R       +  R E    +
Sbjct: 301 VDAAHRFDADRVVAEVNQGGALVESLLRQIDPLV-----PFRPRHAARSKGARAEPVAAL 355

Query: 421 ADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
            +      L     L   +       +   G L        G  S D  D L++ 
Sbjct: 356 YEQGRVRHLPGLGALEDQMC-----QMTPRGYL--------GQGSPDRLDALVWA 397


>gi|219723512|ref|YP_002476767.1| phage terminase, large subunit, pbsx family [Borrelia garinii PBr]
 gi|219694406|gb|ACL34930.1| phage terminase, large subunit, pbsx family [Borrelia garinii PBr]
          Length = 396

 Score = 43.9 bits (102), Expect = 0.059,   Method: Composition-based stats.
 Identities = 34/217 (15%), Positives = 74/217 (34%), Gaps = 36/217 (16%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           + RG GKT   A + L    +  G   + +   + +   ++  E+ + LS    + +F +
Sbjct: 26  SSRGTGKTYDIATVNLERKFSADGGDTLAIRKKKNKTTQSIHKEILELLSRYNLRKFFNI 85

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196
               +                           + ++R   F G H+T  +        + 
Sbjct: 86  SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124

Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254
            +EA+         ++  + E+    +  M+SNP   S   Y+ +  N+        +  
Sbjct: 125 LEEANQFSSDDIEMLIPTMREQGGRIY--MSSNPVPKSHWLYKRYLANQDNPAVCIIKST 182

Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287
            R    ++    E  + +    Y  +    R+EV G+
Sbjct: 183 YRDNPFLNGGDVEAWLEKQKLAYHGNDIGFRIEVLGE 219


>gi|212712878|ref|ZP_03321006.1| hypothetical protein PROVALCAL_03975 [Providencia alcalifaciens DSM
           30120]
 gi|212684570|gb|EEB44098.1| hypothetical protein PROVALCAL_03975 [Providencia alcalifaciens DSM
           30120]
          Length = 436

 Score = 43.9 bits (102), Expect = 0.059,   Method: Composition-based stats.
 Identities = 67/424 (15%), Positives = 127/424 (29%), Gaps = 58/424 (13%)

Query: 75  PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSL 134
           P  FK  + AG G GKT +    +   M   P I+    A +  Q++   +  +      
Sbjct: 18  PHKFKAYV-AGFGSGKTWVGCGGICKGMWEFPKINQGYFAPTYPQIRDIFYPTI----EE 72

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
           +      ++  +  +    +          + + Y       S E+P+T VG      + 
Sbjct: 73  VALDWGLKVNIVESNKEVHF---------YEGRRYRGTVICRSMEKPETIVGFKIGNAL- 122

Query: 195 IINDE----ASGTPDVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIFNKPLDD- 247
              DE     S         I+  +            +T+ P       Y+ F K + D 
Sbjct: 123 --IDELDVMKSDKAQKAWRKIIARMRYNVAGLRNGIDVTTTPEG-FKFVYQQFVKAVRDK 179

Query: 248 ------WKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
                 +   Q  T   E  +   +   +++ Y    ++ +  + GQF      + I   
Sbjct: 180 PELSTLYGIVQASTFDNEKNLPADYIPSLMSSY--PPELIKAYLKGQFTNLTSGT-IYHT 236

Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGP---VIEHLFDWSKTDLRTTN 357
              +  N E    P   L +G D         V VLR G    V E +  +   D+    
Sbjct: 237 FDRKLNNSEEEEQPGETLYIGMDFNVGKMAGIVHVLRLGLPHAVTEIINAYDTPDMVRII 296

Query: 358 NKISGLVE-----KYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVD 405
            +   L +     K R   I  DA+        A + D   L   G+HV  ++       
Sbjct: 297 KERFWLYDGSDYKKVREIYIYPDASGDSRKSNNASSTDIAQLRQAGFHV--IVNDSNPPV 354

Query: 406 LEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKS 465
            +   +    +         +   +    +       + +   +      E  +  G   
Sbjct: 355 KDRINSMNA-MFCNAKGERRYKVNVKRCPVYTESLEQQVWDPTSG-----EPDKKSGNDH 408

Query: 466 TDYS 469
            +  
Sbjct: 409 PNDG 412


>gi|320160638|ref|YP_004173862.1| hypothetical protein ANT_12280 [Anaerolinea thermophila UNI-1]
 gi|319994491|dbj|BAJ63262.1| hypothetical protein ANT_12280 [Anaerolinea thermophila UNI-1]
          Length = 1068

 Score = 43.9 bits (102), Expect = 0.062,   Method: Composition-based stats.
 Identities = 32/204 (15%), Positives = 65/204 (31%), Gaps = 17/204 (8%)

Query: 43  TPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLM 102
           T L+ F  P S        +   C+    N  P  F   +    G GKT  +    L   
Sbjct: 539 TYLKQFENPTSDINRKRNEILHACIEKGENEKPGFFSLTVPT--GGGKTLASIAFALHHA 596

Query: 103 STRPGISVICLAN--SETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHC 160
           +T     +I +    +  +    ++ E+    ++L +   F+        A   ++ +  
Sbjct: 597 ATHGLKRIIYVIPFTTIIEQNAQVFKEIFGEENVLEHHSNFDWNDGKREDADNRTNSILA 656

Query: 161 SLGIDSKHYSTMCRTYS---------EERPDTFVGHHNTYGMAIINDEASGTPD----VI 207
            L + ++++       +         + +       HN     II DEA   P       
Sbjct: 657 KLKLAAENWDIPIVVTTNVQFFESLFDNKSSRCRKLHNIAKSVIIFDEAQMLPKEYIRPA 716

Query: 208 NLGILGFLTERNANRFWIMTSNPR 231
              +   +T   A+  +   + P 
Sbjct: 717 MAAVWELVTNYGASAVFCTATQPG 740


>gi|119967835|ref|YP_950664.1| putative large terminase subunit [Staphylococcus phage PH15]
 gi|112790059|gb|ABI21779.1| putative large terminase subunit [Staphylococcus phage PH15]
          Length = 446

 Score = 43.9 bits (102), Expect = 0.063,   Method: Composition-based stats.
 Identities = 41/365 (11%), Positives = 109/365 (29%), Gaps = 41/365 (11%)

Query: 56  LEFMEVVDAHCLNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114
           ++  E++  H  +             +   GRG GK++  + ++   +  R  ++ + + 
Sbjct: 4   IKLSELLPKHFHSLWKATKDREKLNIVAKGGRGSGKSSDISIIIT-QLIMRYPMNAVVVR 62

Query: 115 NSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCR 174
            ++  L T+++ ++   +      H F+++                 +    +    + R
Sbjct: 63  KADNTLATSVFEQIKWAIEEQKVSHLFKVKVS------------PMEITYVPRGNRIIFR 110

Query: 175 TYSEERPDTFVGHHNTY---GMAIINDEASG-TPDVINLGILGFL---TERNANRFWIMT 227
               + P+      ++     +  I + A   T D +       L    +      +  +
Sbjct: 111 GA--QNPERLKSLKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDDGLFYKFFFS 168

Query: 228 SNPRRLSGKF----YEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVE 283
            NP +    +    YE   +P + +            I   F +   +    +    R E
Sbjct: 169 YNPPKRKQSWVNKKYETSFQPDNTFVHHS-TYLDNPFISKQFIQEAESAKERNEQRYRWE 227

Query: 284 VCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN--TVVVLRRGPV 341
             G+         +P N ++     +     +  +    D          + V  + G  
Sbjct: 228 YMGEAIGS---GVVPFNNLQIEKIPDELYKSFDNIRNAVDFGLTKTAPLHSDVYSKLGEH 284

Query: 342 IEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY-----LEMLGYHVYR 396
           I  +   +          +    +K +     +D +  G +  +      L+  GY    
Sbjct: 285 ISGVRKKACATDPL--AFVRWHYDKKKRIIYAVDEH-YGVQISNREFANWLKRRGYQSDE 341

Query: 397 VLGQK 401
           +    
Sbjct: 342 IYADS 346


>gi|219048282|ref|YP_002455497.1| PbsX family phage terminase large subunit [Borrelia afzelii ACA-1]
 gi|216752464|gb|ACJ73223.1| phage terminase, large subunit, pbsx family protein [Borrelia
           afzelii ACA-1]
          Length = 396

 Score = 43.9 bits (102), Expect = 0.064,   Method: Composition-based stats.
 Identities = 34/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           + RG GKT   A + L    +  G   + +   + +   ++  E+ + LS+   + +F +
Sbjct: 26  SSRGTGKTYDIATVNLERKFSVDGGDTLAIRKKKNKTTQSIHKEILELLSIYGLRKFFNI 85

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196
               +                           + ++R   F G H+T  +        + 
Sbjct: 86  SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124

Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254
            +EA+         ++  + E+    +  M+SNP   S   Y+ +  N+        +  
Sbjct: 125 LEEANQFSSDDIEMLIPTMREQGGRIY--MSSNPVPKSHWLYKRYLANEDNPAVCIIKST 182

Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287
            R    ++    E  + +    Y  +    R+EV G+
Sbjct: 183 YRDNPFLNGGDVEAWLEKQKLAYHGNDIGFRIEVLGE 219


>gi|289824955|ref|ZP_06544345.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Typhi str. E98-3139]
          Length = 98

 Score = 43.9 bits (102), Expect = 0.066,   Method: Composition-based stats.
 Identities = 12/48 (25%), Positives = 18/48 (37%)

Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
            I     W   D R   + I  L E+Y    I ID+   G    + ++
Sbjct: 2   RILERHQWRGMDFRAQADAIKKLTEQYNVTYIGIDSTGVGHGVYENVK 49


>gi|307294267|ref|ZP_07574111.1| hypothetical protein SphchDRAFT_1737 [Sphingobium chlorophenolicum
           L-1]
 gi|306880418|gb|EFN11635.1| hypothetical protein SphchDRAFT_1737 [Sphingobium chlorophenolicum
           L-1]
          Length = 438

 Score = 43.9 bits (102), Expect = 0.067,   Method: Composition-based stats.
 Identities = 65/403 (16%), Positives = 115/403 (28%), Gaps = 55/403 (13%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           AGRG GKT   A  V  +    P   +  +  +  + +  +                 E 
Sbjct: 59  AGRGFGKTRAGAEWVRSVAEGDPAARIALVGATLGEARAVM----------------VEG 102

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHY-STMCRTYSEERPDTFVGHHNTYGMAIINDE--- 199
            S  L  +PW++                 +   +     ++  G   ++G     DE   
Sbjct: 103 ASGVLAVSPWWNRPAFLPALRKLVWRNGAVATLFGAAEAESLRGPQFSHG---WADEIAK 159

Query: 200 -ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTV 258
            A G     +  ++G             T  P  L     E      D            
Sbjct: 160 WAGGQA-AWDNLMMGMRLGIAPRVLATTTPRPVALVRGLVE--RNGSDVVVTRGRSADNA 216

Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL 318
             +   F   +   YG    + R E+ G+  ++   +    +++E            A +
Sbjct: 217 SHLADGFLAAMERNYGGT-RLGRQELDGELIEEVEGALWSRDLLERCRVAHVRGT-LARV 274

Query: 319 IMGCDI-AEEGGDNT---VVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYRPDAI 372
           ++  D  A   GD     VV L        + D +            ++     +  D +
Sbjct: 275 VVAVDPPASVHGDACGIVVVGLGGDGRAYVIADATVEGATPEGWARAVAAAALVHGADRV 334

Query: 373 IIDANNTGARTCDYLE--MLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLI 430
           + +ANN GA     L     G  V  V   +  V        R E    + +    A   
Sbjct: 335 VAEANNGGAMVESVLRAAEAGLPVRLVHASRGKV-------ARAEPVAALYEAGRVAHRG 387

Query: 431 NHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLM 473
             + L   L  L    +   G +          +S D +D L+
Sbjct: 388 GFAELEDQLCGL----MLGGGYV-------GPGRSPDRADALV 419


>gi|158422729|ref|YP_001524021.1| hypothetical protein AZC_1105 [Azorhizobium caulinodans ORS 571]
 gi|158329618|dbj|BAF87103.1| conserved hypothetical protein [Azorhizobium caulinodans ORS 571]
          Length = 436

 Score = 43.9 bits (102), Expect = 0.067,   Method: Composition-based stats.
 Identities = 65/410 (15%), Positives = 123/410 (30%), Gaps = 58/410 (14%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPG------ISVICLANSETQLKTTLWAEVSKWLSLL 135
           +  GRG GKT   A  V  L   RP         +  +A +   L+  +   VS  L++ 
Sbjct: 51  VLGGRGAGKTRAGAEWVRGLALGRPPFAPAPVGRIALVAETMGDLREVMVEGVSGLLAVH 110

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
           P       +                           + + +S E P++  G       A 
Sbjct: 111 PAAERPRWEPTR---------------RRLVWPNGAVAQGFSAEDPESLRG---PQFEAA 152

Query: 196 INDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
             DE +     + +   +   L      R   M +   R S     +   P     R   
Sbjct: 153 WLDELAKWRRAEAVFDMLQFGLRLGAQPRQ--MVTTTPRPSALLRRLMADPSTVLSRAT- 209

Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313
             +    + P+F + ++ARYG    + R E+ G+  +   D+    +       RE    
Sbjct: 210 TAQNAFHLAPAFLDTVLARYGGT-RLGRQELEGEIIEDRPDALWTRSA--LEAAREAAAP 266

Query: 314 PYAPLIMGCDI---AEEGGDNTVVVL--RRGPVIEHLF---DWSKTDLRTTNNKISGLVE 365
           P A +++  D    +  G D   ++     G  + H+      +         +   L  
Sbjct: 267 PLARVVVALDPPASSRAGADACGIIAAGIDGEGLVHVLADATAAGLRPAQWAARAIDLWR 326

Query: 366 KYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE 425
            +  DA++ + N  G      L  +   V  V+  +              L+ +    + 
Sbjct: 327 THEADAVVAEVNQGGEMVRSVLAEVDASV-PVVSVRATRGKYLRAEPVAALYEQGR--VR 383

Query: 426 FASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
            A       L   +       + +              +S D  D L++ 
Sbjct: 384 HAGAFP--ALEDEMCDFGPEGLSS-------------GRSPDRLDALVWA 418


>gi|254776419|ref|ZP_05217935.1| phage terminase [Mycobacterium avium subsp. avium ATCC 25291]
          Length = 491

 Score = 43.9 bits (102), Expect = 0.069,   Method: Composition-based stats.
 Identities = 71/462 (15%), Positives = 132/462 (28%), Gaps = 81/462 (17%)

Query: 52  RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGIS-V 110
           R WQ+          L    +P+P    GAI   RG+GKT + A L L+ +   P  + +
Sbjct: 51  RPWQM--------GMLRPFLDPDPRPLVGAIMGPRGLGKTGIFAALGLYELFCGPDGNEI 102

Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYS 170
             +A  E      L                              + V    + +  K  +
Sbjct: 103 PIVAVDERMAGRLL--------------KPAAQMVELNDELAARAVVYRDRIEVPGKRST 148

Query: 171 TMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG------------ILGFLTER 218
                   +R +       T+ +A + DE                        LG  T  
Sbjct: 149 LTALPAEAKRIEGL----GTWTLA-LADELGEIDPDTWSTLLLGAGKLDGAMALGIGTPP 203

Query: 219 NANRFWI------MTSNPRRLSGKFYEIFNKPLDDWKRFQIDT-RTVEGIDPSFHEGIIA 271
           N     +        +NP   +  FYE      D ++   +     +E  +P   + +  
Sbjct: 204 NRETSVLTDLREACRANPDDRTMAFYEF---SADGFEHHPVSCVHCLELANPQLDDLLSR 260

Query: 272 RYG------LDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIA 325
                          R +   Q    +   F+  +  +      P PD  A +++  D  
Sbjct: 261 DRATALLKQTTEGEYRRKRLCQVVTTNESPFVDADTWDGLKAPHPVPD-GADVVIALD-G 318

Query: 326 EEGGDNTVVV---LRRGPVIEHLFDWS-------KTDLRTTNNKISGLVEKYRPDAIIID 375
               D+T +V   + + P  + L  W        +  +      I    +++R   I  D
Sbjct: 319 SLKDDSTALVVGTVGKVPHFDRLDAWENPGDEAWRVPVLDVEQAIREAAKRWRVREIAFD 378

Query: 376 ANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGL 435
                 R+   L   G  +     Q  A       + R+     + + L  +       L
Sbjct: 379 PY-LFTRSAQILAAEGLPMVEFR-QSPARQTAATNDLRS---AAVNEQLTHSG---DEVL 430

Query: 436 IQNLKSLKSFIVPNTGELAIESKRVKGAKST--DYSDGLMYT 475
            +++ +           +A   K  +   +   D    LM  
Sbjct: 431 RRHVLAATVLESDKGIRIA---KVNRSKHAPKIDLCTALMMA 469


>gi|296445591|ref|ZP_06887546.1| protein of unknown function DUF264 [Methylosinus trichosporium
           OB3b]
 gi|296256836|gb|EFH03908.1| protein of unknown function DUF264 [Methylosinus trichosporium
           OB3b]
          Length = 442

 Score = 43.6 bits (101), Expect = 0.071,   Method: Composition-based stats.
 Identities = 67/418 (16%), Positives = 126/418 (30%), Gaps = 75/418 (17%)

Query: 82  ISAGRGIGKTTLNA-WLVLWL-----MSTRPGISVICLANSETQLKTTLWAEVSKWLSLL 135
           I +GRG GKT   A W+          +TRP   +  +  +   ++  +   VS  L+  
Sbjct: 58  ILSGRGAGKTRAGAEWVKGIARGRPQFATRPLSPIALIGETAADVRDVMIEGVSGILAAH 117

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
                   +S                          + + +S E P++  G       A 
Sbjct: 118 SRSERPLWESSRRRLTF---------------DNGVVAQAFSAEDPESLRG---PQFAAA 159

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF---- 251
             DE +                 +  +F +   +  R   +      +P+   KR     
Sbjct: 160 WCDELAK-----WRYAEETW---DMLQFGLRLGDWPR---QLVTTTPRPMPLIKRLLTEN 208

Query: 252 -----QIDTR-TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305
                +  TR     + PSF E ++++YG    + R E+ G+  +Q  D+    +++E A
Sbjct: 209 GVAVTRAKTRANAANLAPSFLETVLSQYGGT-RLGRQELDGEIVEQRADALWTRDMLERA 267

Query: 306 LNREPCPDPYAPLIMGCD-IAEEGG--DNTVVVLRRGPVIEHLFDWSKTDLRTT-----N 357
             R   P P   +++  D  A  G   D   +V   G     +   +   +         
Sbjct: 268 --RILAPPPLERIVVAIDPPASSGKRADRCGIVA-VGIAQNIVHVLADATVEAARPAQWA 324

Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELH 417
                L  K   DA++ + N  G      +         V   +             +L+
Sbjct: 325 RAAIALYHKLSADALVAEVNQ-GGEMVRAVIHEADPSVPVKEARATRGKYLRAAPAAQLY 383

Query: 418 VKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
            +        +      L   +           G   + S      +S D  D L++ 
Sbjct: 384 EQGRAR-HVGAFPA---LEDEMC-----DFGPDG---LSS-----GRSPDRLDALVWA 424


>gi|89054122|ref|YP_509573.1| hypothetical protein Jann_1631 [Jannaschia sp. CCS1]
 gi|88863671|gb|ABD54548.1| protein of unknown function DUF264 [Jannaschia sp. CCS1]
          Length = 483

 Score = 43.6 bits (101), Expect = 0.071,   Method: Composition-based stats.
 Identities = 62/424 (14%), Positives = 113/424 (26%), Gaps = 75/424 (17%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTTLWAEVSKWL 132
           +  GRG GKT   A    W+ S   G           V  +  +  Q    +        
Sbjct: 91  VLGGRGAGKTRAGA---EWVRSMVEGATPEAPGRAKRVALIGETYDQAMAVMVK------ 141

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
                +      S       W +          ++      + +S   P+   G      
Sbjct: 142 ----GESGLIACSPPDRVPRWIAGERKLVWPNGAE-----AQVFSANDPEALRGPQFD-- 190

Query: 193 MAIINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250
                DE +  P        +   L     +   I+T+ P R      ++  +       
Sbjct: 191 -LAWADELAKWPKAQETWDMLQFGL-RLGQHPQQIVTTTP-RNVNVLKDLLARD-GVAHT 246

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
                     +  SF   + +RYG D+ + R E+ G       ++      I+E   R  
Sbjct: 247 HAPTEANSAYLADSFLTEVRSRYG-DTRLGRQELDGVLLDDVDNALWVRGAIDE--GRLT 303

Query: 311 CPDPYAPLIMGCDI---AEEGGDNTVVVLRRGPVIE-HLFDWS----------KTDLRTT 356
                  +I+  D       G D   +V+  G +       W                  
Sbjct: 304 DAPDVTRVIVAVDPPVTGHAGSDACGIVV-VGIIERGDPAQWRAVVLEDCSVQGVSPNQW 362

Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRRT 414
            N       ++    ++ + N  G    D +  +    ++  V      V        R 
Sbjct: 363 ANAAVAAYHRHGASRMVAEVNQGGVMVADTIRTVDPTINLRTVHASTGKV-------ARA 415

Query: 415 ELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMY 474
           E    + +  + A L  H+ L   +  +        G             S D  D L++
Sbjct: 416 EPVAALYEQGKVAHLGTHAELEDEMCKMALTGYEGQG-------------SPDRVDALVW 462

Query: 475 TFAE 478
              E
Sbjct: 463 ALTE 466


>gi|312149784|gb|ADQ29854.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           N40]
          Length = 304

 Score = 43.6 bits (101), Expect = 0.074,   Method: Composition-based stats.
 Identities = 31/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEG-GDNTVVVL 336
            F  +NI ++ +   P        I   D A    GDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVRGDNTALCV 299


>gi|203288763|ref|YP_002223713.1| hypothetical protein BDU_2013 [Borrelia duttonii Ly]
 gi|201084613|gb|ACH94190.1| uncharacterized conserved protein [Borrelia duttonii Ly]
          Length = 398

 Score = 43.6 bits (101), Expect = 0.074,   Method: Composition-based stats.
 Identities = 36/244 (14%), Positives = 79/244 (32%), Gaps = 23/244 (9%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           + RG GKT   A + L     + G   + +   + +   ++  E+ + LS    +  F +
Sbjct: 27  SSRGTGKTYDIATVNLERKFAKDGGDTLAVRKKKNKTTQSIHKEILELLSRYNLRREFTI 86

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
               +       +      G             ++ +          +   +  +EA+  
Sbjct: 87  SKAKI-------ETKKLIYGRKRAFVFEGGHDTTDLKSYA-------HFKDLWLEEANQF 132

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQIDTRTVEGI 261
            +     ++  + ER    +  M+SNP   S   Y+ +  N+        +   R    +
Sbjct: 133 TESDIEKLIPTMRERGGRIY--MSSNPVPRSHWLYKRYIANEDNPSVCVIKSTYRDNPFL 190

Query: 262 DP----SFHEGIIARYGLDSDVTRVEVCG-QFPQQDIDSFIPLNIIEEALNREPCPDPYA 316
           +     ++ E     Y  +    R+EV G +F            I +E+L        Y 
Sbjct: 191 NGGDVNAWLEKQKLAYHGNDIGFRIEVLGEEFEFGTARFIKEFTICDESLISRVQGSFYT 250

Query: 317 PLIM 320
            + +
Sbjct: 251 GIHI 254


>gi|195942758|ref|ZP_03088140.1| hypothetical protein Bbur8_08065 [Borrelia burgdorferi 80a]
          Length = 312

 Score = 43.6 bits (101), Expect = 0.074,   Method: Composition-based stats.
 Identities = 31/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEG-GDNTVVVL 336
            F  +NI ++ +   P        I   D A    GDNT + +
Sbjct: 265 IFTQINITDDYVFTSP--------IAYLDPAFSVRGDNTALCV 299


>gi|66394679|ref|YP_240816.1| ORF009 [Staphylococcus phage X2]
 gi|62636903|gb|AAX92014.1| ORF009 [Staphylococcus phage X2]
          Length = 421

 Score = 43.6 bits (101), Expect = 0.075,   Method: Composition-based stats.
 Identities = 40/309 (12%), Positives = 97/309 (31%), Gaps = 35/309 (11%)

Query: 83  SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFE 142
             GRG GK++  + ++   +  R  ++ + +  ++  L T+++ ++   +      H F+
Sbjct: 33  KGGRGSGKSSDISIIIT-QLIMRYPMNAVVIRKTDNTLATSVFEQIKWAIEEQKVSHLFK 91

Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY---GMAIINDE 199
           ++                 +    +    + R    + P+      ++     ++ I + 
Sbjct: 92  VKVS------------PMEITYIPRGNRIIFRGA--QNPERLKSLKDSRFPFSISWIEEL 137

Query: 200 ASGTPDVINLGILGFL----TERNANRFWIMTSNPRRLSGKF----YEIFNKPLDDWKRF 251
           A    +     I   L     +      +  + NP +    +    YE   +  + +   
Sbjct: 138 AEFKTEDEVTTITNSLLRGELDEGLFYKFFFSYNPPKRKQSWVNKKYESSFQADNTYVHH 197

Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
                    I   F +   +    +    R E  G+     +  F  L  IEE   R+  
Sbjct: 198 S-TYLNNPFISKQFIQEAESAKKRNEQRYRWEYMGEAIGSGVVPFNNLR-IEEIPQRQY- 254

Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            D +  +    D      D    V     ++  VI  + +     +           + Y
Sbjct: 255 -DTFDNIRNAVDFG-YATDPLAFVRWHYDKKKRVIYAMDEHYGVQISNREFANWLKKKGY 312

Query: 368 RPDAIIIDA 376
           + D +  D+
Sbjct: 313 QSDEVFADS 321


>gi|216996657|ref|YP_002333778.1| phage terminase, large subunit, PBSX family [Borrelia afzelii
           ACA-1]
 gi|216752579|gb|ACJ73283.1| phage terminase, large subunit, PBSX family [Borrelia afzelii
           ACA-1]
          Length = 450

 Score = 43.6 bits (101), Expect = 0.076,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKIDYIDNVATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIAITDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITQDYVFTSP--------IAYLDPAFSIGGDNTALCV 299


>gi|265753755|ref|ZP_06089110.1| terminase [Bacteroides sp. 3_1_33FAA]
 gi|263235469|gb|EEZ20993.1| terminase [Bacteroides sp. 3_1_33FAA]
          Length = 521

 Score = 43.6 bits (101), Expect = 0.082,   Method: Composition-based stats.
 Identities = 34/240 (14%), Positives = 72/240 (30%), Gaps = 33/240 (13%)

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQF---PQQDIDSFIPLNIIEEALNREPCPDPYAPL 318
           +P++   + A  G    + +  + G F   P+++    IP    +   N  P  +     
Sbjct: 244 NPNYIGSVAASGGK---MAQAIIEGNFNVDPEENEKIPIPSTSAQGVFNNNPAVN--GDK 298

Query: 319 IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANN 378
            +  D+A+ G DN V +   G     +   SK+  R     +     ++      I  + 
Sbjct: 299 WITVDLADYGTDNLVALAWDGFHAYDILILSKSTPRENAMAVKTFAFEHGTAESHIIFDA 358

Query: 379 TGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGL--- 435
           T  R  +           +        L     +    ++++   +E  +L     L   
Sbjct: 359 TAGRYFNDYIPDAVPYISLNKPFGLYQLTAMTVKDM-CYIRLCKMIEEGNLTFDDKLAVQ 417

Query: 436 ----------------IQNLKSLKSFIVPNTGELAIESKRVK-----GAKSTDYSDGLMY 474
                                S+  F    +G+  + +K+         +S D  D    
Sbjct: 418 TYTHQNLKYKVTVENEFMEECSVVRFDDMQSGKKRLWNKKKMNQMLGKGRSMDLLDPCAM 477


>gi|319409256|emb|CBI82900.1| phage-related protein [Bartonella schoenbuchensis R1]
          Length = 441

 Score = 43.6 bits (101), Expect = 0.083,   Method: Composition-based stats.
 Identities = 33/194 (17%), Positives = 66/194 (34%), Gaps = 11/194 (5%)

Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNAN--RFWIMTSNPRRLSGKFYEIFNKPLDD- 247
             +    DEA    +     ++  L E   N      +T NP R +    + F    D  
Sbjct: 122 RILLCWVDEAEPVTETAWQTLIPTLREEGENWHCELWVTWNPLRENAPVEKRFRAVKDPH 181

Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF---IPLNIIEE 304
            K  +I+ R         +    A +    +       G++ Q    ++   + L   +E
Sbjct: 182 IKGVEINWRDNPQFPDRLNRAREADFTQRPEQYNHIWEGEYLQAVQGAYYQKLLLEAEQE 241

Query: 305 ALNREPCPDPYAPLIMGCDIAEEG--GDNTVVVLRR--GPVIEHLFDWSKTDLRTTNNKI 360
                   DP   + +  DI   G   D T + + +  G  I  L D+ +   +  +  I
Sbjct: 242 GRITHVPRDPLIQIKIFWDIGGTGAKADATALWVAQFIGREIRVL-DYYEAQGQPLSEHI 300

Query: 361 SGLVEKYRPDAIII 374
             + ++    A+++
Sbjct: 301 GWMCQRGYDKALMV 314


>gi|317152167|ref|YP_004120215.1| hypothetical protein Daes_0447 [Desulfovibrio aespoeensis Aspo-2]
 gi|316942418|gb|ADU61469.1| hypothetical protein Daes_0447 [Desulfovibrio aespoeensis Aspo-2]
          Length = 590

 Score = 43.6 bits (101), Expect = 0.083,   Method: Composition-based stats.
 Identities = 46/293 (15%), Positives = 92/293 (31%), Gaps = 40/293 (13%)

Query: 164 IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRF 223
           +    Y  + +     +  + +  H  + + +       TP  +        T    N+ 
Sbjct: 255 VYIDEYFWITKFNELYKVASAMAAHKKWRITLF-----STPSAVTHEAYDLWTGDRFNKR 309

Query: 224 WIMTSNPRRLSGKFYEIFNK----PLDDWKR-FQIDTRTVEGIDPSFHEGIIARYGLDSD 278
           W      +R     +E   +    P   W++   I      G D    E +  +Y   +D
Sbjct: 310 WSR--QAKRKEFPSFEAMQRGVVCPDKVWRKVITIKDAEAGGCDLFDFEDLNLQY--STD 365

Query: 279 VTRVEVCGQFPQQDIDSFIPLNIIEEALNR----------EPCPDPYAPLIMGCDIAEEG 328
             R     +F   D+ +   L+ +E                  P    P+  G D +   
Sbjct: 366 EFRNLFMCEFVD-DLQAVFRLHNLEACYGDMDEWTDFNPDAARPFGNLPVWGGYDPSRNR 424

Query: 329 GDNTVVVL----RRGPVIEHLFDWSKTDLRTT--NNKISGLVEKYRPDAIIIDANNTGAR 382
            D + V+L    + G +   L  +   D   T    +I  L +++    I ID    G  
Sbjct: 425 DDASFVILAPPLQPGGMFRVLARYKWVDKSYTWQAQRIKELTQQFNFVHIGIDVTGPGIG 484

Query: 383 TCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGL 435
             + ++              A+ + +    +T L +K  D +E   +   + L
Sbjct: 485 VFESVQA---------FFPAAMPITYGVQTKTTLVLKAKDVIESGRIQWDASL 528


>gi|328857391|gb|EGG06508.1| hypothetical protein MELLADRAFT_36161 [Melampsora larici-populina
           98AG31]
          Length = 824

 Score = 43.6 bits (101), Expect = 0.084,   Method: Composition-based stats.
 Identities = 29/151 (19%), Positives = 52/151 (34%), Gaps = 9/151 (5%)

Query: 87  GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEMQS 145
           G+GKT     L+L       G   + +A +   +    W  E+ K+   L    W     
Sbjct: 237 GMGKTIQTISLILSDRKAGDGKQTLVIAPT---VAIIQWRNEIEKFTKGLKVNVWHGGNR 293

Query: 146 LSLHPAPWYSDVLHCSLGIDSKHYS---TMCRTYSEERPDTFVGHHNTYGMAIINDEASG 202
            +        D++  S  +    +    +  R + E R +  +  H+ +   +I DEA  
Sbjct: 294 STDKKTMKSYDIVLTSYAVLESSFRRQNSGYRKFGELRKEASL-LHSIHWHRVILDEAHN 352

Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRL 233
             D       G   E  A   W ++  P + 
Sbjct: 353 IKDRSCNTAKGAF-ELQATFKWCLSGTPLQN 382


>gi|160897385|ref|YP_001562967.1| PBSX family phage terminase large subunit [Delftia acidovorans
           SPH-1]
 gi|160362969|gb|ABX34582.1| phage terminase, large subunit, PBSX family [Delftia acidovorans
           SPH-1]
          Length = 433

 Score = 43.6 bits (101), Expect = 0.086,   Method: Composition-based stats.
 Identities = 41/295 (13%), Positives = 77/295 (26%), Gaps = 25/295 (8%)

Query: 147 SLHPAPWYSDVLHCSLG-IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI-INDEASGTP 204
           ++   PW +         I S+           +R        +   + +   DEA    
Sbjct: 78  AIEDEPWLAAYYDVGDKYIKSRDGRITFAFAGLDR--NIASIKSKGRLLLCWVDEAEPVT 135

Query: 205 DVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFN-KPLDDWKRFQIDTRTVEGI 261
           D     ++  L E     N    +T NP+R S    + F        K  + + +     
Sbjct: 136 DEAWTTLIPTLREEGTDWNAELWVTWNPKRKSAPVEKRFKGSSDPRMKYVRCNWKDNPKF 195

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII----EEALNREPCPDPYAP 317
                   +       +       G +      ++   +I+    +  + R P  DP   
Sbjct: 196 PALLERVRLRDLAERPEQYAHIWEGDYATVIEGAYFASHIVKARQDNRIGRVPA-DPLMT 254

Query: 318 LIMGCDIAEEGGDNTVVVLRR----GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII 373
           L    DI   G       +      G  I  L  + +     +++      + Y    I 
Sbjct: 255 LRAFVDIGGTGARADAFAMWIAQFVGKEIRVLDYYEQVGQPLSSHLNWMREKGYDKAQIW 314

Query: 374 IDANNTGARTC------DYLEMLGYHVYRVLGQKRAVDLEFCRNRR---TELHVK 419
           +  +               L   GY V  V  Q +          R     +   
Sbjct: 315 LPHDGATQDKVHDVSYESALRQAGYTVTVVPNQGKGAAKARIEAGRRLFGSMWFN 369


>gi|319899324|ref|YP_004159421.1| hypothetical protein BARCL_1179 [Bartonella clarridgeiae 73]
 gi|319403292|emb|CBI76851.1| phage-related protein [Bartonella clarridgeiae 73]
          Length = 442

 Score = 43.6 bits (101), Expect = 0.087,   Method: Composition-based stats.
 Identities = 31/193 (16%), Positives = 64/193 (33%), Gaps = 9/193 (4%)

Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNAN--RFWIMTSNPRRLSGKFYEIFN-KPLDD 247
             +    DEA    D     ++  L E   +      +T NP R +    + F      +
Sbjct: 122 RILLCWVDEAEPVTDAAWQILIPTLREEGKDWHSELWVTWNPCRENAAVEKRFRFTKDPN 181

Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
            K  +I+ R         +    A      +  +    G++ Q    ++    ++E    
Sbjct: 182 VKGVEINWRDNPKFPAKLNRDRKADLEQRPEQYQYIWEGEYLQAMQGAYYQKLLLEAEQE 241

Query: 308 REPCPDPYAPLI---MGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTNNKIS 361
                 P  PLI   +  DI   G   D T + + +       + D+ +   +  +  I 
Sbjct: 242 GRITKVPRDPLIQIKIFWDIGGTGAKADATALWVAQFVGREIRVLDYYEAQGQPLSEHIG 301

Query: 362 GLVEKYRPDAIII 374
            + +K    A+++
Sbjct: 302 WICQKGYEKALMV 314


>gi|16127022|ref|NP_421586.1| hypothetical protein CC_2790 [Caulobacter crescentus CB15]
 gi|221235816|ref|YP_002518253.1| phage DNA packaging protein [Caulobacter crescentus NA1000]
 gi|13424390|gb|AAK24754.1| conserved hypothetical protein [Caulobacter crescentus CB15]
 gi|220964989|gb|ACL96345.1| phage DNA packaging protein [Caulobacter crescentus NA1000]
          Length = 567

 Score = 43.6 bits (101), Expect = 0.087,   Method: Composition-based stats.
 Identities = 62/405 (15%), Positives = 109/405 (26%), Gaps = 60/405 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
            GRG GKT   A  + W     P  ++I    +   ++  +          +      + 
Sbjct: 199 GGRGAGKTFAGARWITWNALAYPSQALI--GPTLHDVREVM----------IEGPSGLKA 246

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVG--HHNTYGMAIINDEAS 201
                +   W +                +   +S E P++  G   H         DE  
Sbjct: 247 MGGPAYRPRWEASRRRLVWPN-----GAVAYAFSAEDPESLRGPQFHAA-----WADEFC 296

Query: 202 GTPDVINLGI---LGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTV 258
             P           G     +       T  P R      +               +   
Sbjct: 297 AWPKPAETLAMLRFGLRLGEDPRLVVTTTPKPHRAL----KTLMAEPGVALTRAGTSANA 352

Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL 318
             + P+F   + + YG    +   E+ G   +           +  A  R   P     +
Sbjct: 353 GNLAPAFLRTLASLYGGT-RLAAQELDGVVVE-TDGGLFRAEDL--ARCRAARPARLDRV 408

Query: 319 IMGCD-IAEEGGDNT--VVVLRRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRPDAII 373
           ++  D  A   GD    VVV RR      L D             +       +  DA++
Sbjct: 409 VVAVDPPATATGDACGIVVVGRRDDRAFVLADETARGLSPAGWAGRAVAAARAWTADALV 468

Query: 374 IDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHS 433
            +AN  G      L        RV   + ++           L+ +    L   + +   
Sbjct: 469 AEANQGGDMVRSVLAQAD-PPCRVKLVRASLGKRARAEPVAALYEQGRV-LHCGAFVALE 526

Query: 434 GLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478
             +  L S         G+L           S D +D L++  +E
Sbjct: 527 EELMALGS---------GDLE---------HSPDRADALVWAVSE 553


>gi|282598984|ref|YP_003358901.1| gp17 terminase DNA packaging enzyme large subunit [Deftia phage
           phiW-14]
 gi|257219054|gb|ACV50069.1| gp17 terminase DNA packaging enzyme large subunit [Deftia phage
           phiW-14]
          Length = 585

 Score = 43.6 bits (101), Expect = 0.091,   Method: Composition-based stats.
 Identities = 50/286 (17%), Positives = 95/286 (33%), Gaps = 39/286 (13%)

Query: 130 KWLSLLPNKHWFEMQSLSLH-----PAPWYSDVLHCSLGIDSKHYSTMCRTYS-EERPDT 183
           K  ++L NK    ++ +  +       P++  +      +       M + +S    PDT
Sbjct: 139 KNWAVLANKSSAALEVMDRYRVMFQELPYFMQIGAVRFNLAEVELENMSKVFSGTSDPDT 198

Query: 184 FVGHHNTYGMAIINDEASGTP--DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF 241
             G     G+    DE++ T   +         L+  + +   I+TS P    G FY+I+
Sbjct: 199 VRGK-ALNGIYW--DESAFTARDEEFWTSTFPVLSSGDTS-KAILTSTPNGARGVFYKIW 254

Query: 242 NKPLDD-------WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDID 294
            +  D        + R  +        D ++ E  I + G      + E    F      
Sbjct: 255 KESEDPNSDVYNGFARLAVPWYRHPRRDEAWKELSIRKIGPTK--FKQEHELSFL-GSSG 311

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIM--GCD------IAEEGG----DNTV-VVLRRGPV 341
             IP   +E      P  +     I     D      IA+ GG    D +V  ++    +
Sbjct: 312 CLIPPMTLERMGFINPLREDEHLKIFVEPVDDHKYIGIADSGGGVGADYSVCTIIDVTEI 371

Query: 342 IEHLFDWSKTD---LRTTNNKISGLVEKYRPDAIII-DANNTGART 383
              +    + +         +I  L   Y    ++I + N+ G + 
Sbjct: 372 PYRVVAKYRNNEIAPIVFPYQIVSLCGLYNDCPVLIENNNDVGGQV 417


>gi|260433583|ref|ZP_05787554.1| putative phage terminase, large subunit [Silicibacter
           lacuscaerulensis ITI-1157]
 gi|260417411|gb|EEX10670.1| putative phage terminase, large subunit [Silicibacter
           lacuscaerulensis ITI-1157]
          Length = 504

 Score = 43.2 bits (100), Expect = 0.092,   Method: Composition-based stats.
 Identities = 52/298 (17%), Positives = 86/298 (28%), Gaps = 54/298 (18%)

Query: 64  AHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVL------WLMSTRPGISVICLANSE 117
              ++    P   V   ++  GRG GK+TL+A L L      W  +       I +A   
Sbjct: 33  NKFIDGAYGPGINVGVLSV--GRGNGKSTLSAILALGELVGAWSDA---KEREILIAAKT 87

Query: 118 TQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS 177
            Q     W  V      LP      +                  +  D ++   + R  S
Sbjct: 88  QQQAQICWHYVVSLSKTLPEDVQAAITIRR---------QPRFEIQFDDENGPHILRAIS 138

Query: 178 EERPDTFVGHHNTYGMAIINDEASGTP----DVINLGILGFLTERNANRFWIMT--SNPR 231
            +          T     I DE    P    D +   +L  L++R+     I T  SN  
Sbjct: 139 ADGKSAL----GTSPTLAILDERGHWPLAQGDELEAALLTGLSKRDGKALIISTSASNDM 194

Query: 232 RLSGKF--------YEIFNKPLDDWKRFQIDTR--TVEGID----------PSFHEGIIA 271
                +        Y   ++P        + +      G                   I 
Sbjct: 195 HPFSLWLDREAPGVYRQEHRPEPGLPADDVASLIIANPGTKYGIGPSLKRLKDDAALAIE 254

Query: 272 RYGLDSDVTRVEVCGQFPQQDI-DSFIPL-NIIEEALNREPCPDPYAPLIMGCDIAEE 327
           R G      R+    +  Q+D  D  I L + ++     +  P    P ++G D+   
Sbjct: 255 RGGSALSRFRLLSRNERVQEDNRDILISLDDWLK--CETDALPPKSGPCVIGLDLGGS 310


>gi|154489097|ref|ZP_02029946.1| hypothetical protein BIFADO_02409 [Bifidobacterium adolescentis
           L2-32]
 gi|154083234|gb|EDN82279.1| hypothetical protein BIFADO_02409 [Bifidobacterium adolescentis
           L2-32]
          Length = 1055

 Score = 43.2 bits (100), Expect = 0.092,   Method: Composition-based stats.
 Identities = 46/242 (19%), Positives = 75/242 (30%), Gaps = 33/242 (13%)

Query: 1   MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60
           +S E+ +     + L D  W    +  F  +     P      P+E  S  ++ Q   M+
Sbjct: 194 LSEEIESQISESKPLTD-AWLKLYEEDFKKYA----PQRPNRKPIEKTSQSQTIQPNAMQ 248

Query: 61  VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
           V     +N          +  I +  G GKT L+A+ V                    Q+
Sbjct: 249 V--EALMNLAQLRKQGESRAIIVSATGTGKTYLSAFDV-------------------RQV 287

Query: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180
           K      +++    +  K     Q +   P          S   D K+     +T S  R
Sbjct: 288 KPNRMLYIAQ-QEQILKKAEESFQKVLGCPKSELGLFSGGSKESDRKYVFATVQTMS--R 344

Query: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG-KFYE 239
           P+T           I+ DE               +     N    MT+ P R  G   +E
Sbjct: 345 PETLAQFDADEFDYILVDE---VHHAAAESYKRVIDHFQPNFMLGMTATPERTDGANIFE 401

Query: 240 IF 241
           +F
Sbjct: 402 LF 403


>gi|87307615|ref|ZP_01089759.1| hypothetical protein DSM3645_28877 [Blastopirellula marina DSM
           3645]
 gi|87289785|gb|EAQ81675.1| hypothetical protein DSM3645_28877 [Blastopirellula marina DSM
           3645]
          Length = 429

 Score = 43.2 bits (100), Expect = 0.093,   Method: Composition-based stats.
 Identities = 52/348 (14%), Positives = 93/348 (26%), Gaps = 63/348 (18%)

Query: 51  PRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLW---------- 100
           P   Q E +     H                +  GR  GKT       +           
Sbjct: 26  PLPHQREILRDRHRHKR--------------VICGRRWGKTGAGLIAAILGHGDPSGPGH 71

Query: 101 LMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHC 160
                 G ++  +A +  Q K                    E   +          V   
Sbjct: 72  WKGMVDGGTLYWVAPTFAQSKKI------------------ERDIMLAFANSGLVYVKSE 113

Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVIN-LGILGFLTERN 219
                    S   +T +        G        +I DE +     +    +   L++R 
Sbjct: 114 GRIEHPSGGSITIKTAAAPVSLRGEGLDG-----MIGDEFAFVRKEVWSDALRPALSDRR 168

Query: 220 ANRFWIMTSNPRRLSGKFYEIFNKPLDD--WKRFQIDTRTVEGIDPSFHEGIIARYGLDS 277
               ++ T  P   +    +  +    D  +K +Q  T     ID +  +  +   G  S
Sbjct: 169 GWSMFLTT--PNGPN-WMKDQHDLDGVDPTYKSWQCPTSDNCLIDQAELDSALLDLGQAS 225

Query: 278 DVTRVEVCGQFPQQDIDSFIPL--NIIEEALNREPCPDPYAPLIMGCDIAEEGGDNT--- 332
                E   QF       F  L     +   +  P        ++G D ++   D +   
Sbjct: 226 --FDQEYRAQFVDVSGAEFSGLYFQTPKFWFDDWPPESEIRFRVIGLDPSKGKNDKSDYS 283

Query: 333 ---VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDAN 377
              ++ L     I    D  + D+R    K   L   + P  +I++ N
Sbjct: 284 AFVMLALAGDGQIYVDADIERRDVRKIAEKAFELCALFEPTGMIVETN 331


>gi|307275425|ref|ZP_07556567.1| phage uncharacterized protein [Enterococcus faecalis TX2134]
 gi|306507813|gb|EFM76941.1| phage uncharacterized protein [Enterococcus faecalis TX2134]
          Length = 418

 Score = 43.2 bits (100), Expect = 0.098,   Method: Composition-based stats.
 Identities = 45/305 (14%), Positives = 89/305 (29%), Gaps = 35/305 (11%)

Query: 86  RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145
           RG  KTT  A  +  LM   P  ++I L  ++T     +  E+   ++ + +  +F+   
Sbjct: 52  RGSFKTTTLAIAIALLMVLFPNKNIIFLRKTDT---DVV--EIILQVAKVLSSKYFKTLV 106

Query: 146 LSLH--PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
            +L+        +                 +        +  G H      +I D+    
Sbjct: 107 FALYGVELVLLKETTTEIDTNLKTSSRGTSQLLGMGIYASLTGKHAD---IVITDDIVNI 163

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI---FNKPLDDWKRF---QIDTRT 257
            D ++               +    N +   G+F      ++K     K     + D   
Sbjct: 164 KDRVSRA-----EREKTKLQYQELQNVKNRGGRFINTGTPWHKEDAISKMPNVKKFDCYE 218

Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317
              ID    + +  +  +   +       +        F     I+   N       +  
Sbjct: 219 TGLIDKEQRKAL--QQSMTPSLFAANYELKHIADSESLFTAPTYID-NTNLIYNGVAH-- 273

Query: 318 LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII 373
                D A  GGD+T   +    + G +I     W K        +I  L + Y+     
Sbjct: 274 ----IDAAYGGGDSTAFTIFKEQKDGTIIGFGKKWQKHVDDCLP-EILQLHQYYQAGTFY 328

Query: 374 IDANN 378
            + N 
Sbjct: 329 TETNG 333


>gi|219053375|ref|YP_002455734.1| phage terminase, large subunit, pbsx family protein [Borrelia
           afzelii ACA-1]
 gi|226234324|ref|YP_002775459.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           Bol26]
 gi|216752668|gb|ACJ73353.1| phage terminase, large subunit, pbsx family protein [Borrelia
           afzelii ACA-1]
 gi|226202138|gb|ACO37810.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           Bol26]
          Length = 396

 Score = 43.2 bits (100), Expect = 0.098,   Method: Composition-based stats.
 Identities = 34/217 (15%), Positives = 74/217 (34%), Gaps = 36/217 (16%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           + RG GKT   A + L    +  G   + +   + +   ++  E+ + LS    + +F +
Sbjct: 26  SSRGTGKTYDIATVNLERKFSVDGGDTLAIRKKKNKTTQSIHKEILELLSRYNLRKFFNI 85

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196
               +                           + ++R   F G H+T  +        + 
Sbjct: 86  SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124

Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254
            +EA+         ++  + E+    +  M+SNP   S   Y+ +  N+        +  
Sbjct: 125 LEEANQFSADDIEMLIPTMREQGGRIY--MSSNPVPKSHWLYKRYLANEDNPAVCIIKST 182

Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287
            R    ++    E  + +    Y  +    R+EV G+
Sbjct: 183 YRDNPFLNGGDVEAWLEKQKLAYHGNDIGFRIEVLGE 219


>gi|57234875|ref|YP_181104.1| hypothetical protein DET0357 [Dehalococcoides ethenogenes 195]
 gi|57225323|gb|AAW40380.1| hypothetical protein DET0357 [Dehalococcoides ethenogenes 195]
          Length = 441

 Score = 43.2 bits (100), Expect = 0.10,   Method: Composition-based stats.
 Identities = 44/291 (15%), Positives = 83/291 (28%), Gaps = 45/291 (15%)

Query: 154 YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILG 213
           + D+     G   +         S E     VG  NT  + +  DEA             
Sbjct: 71  FGDIYQTEGGYIIRLNQARAVFLSAEPSANVVG--NTAHLLLEVDEAQDVNQEKYSKEFK 128

Query: 214 FLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD------WKRFQIDTRTVEGIDPSFHE 267
            +     N   ++       S    EI  + ++        + F+ D   V   +P++  
Sbjct: 129 PM-GATTNVTTVLYGTTWDSSSLLEEIKRQNIEKEHKDGLKRHFRYDWEEVAAHNPAYLA 187

Query: 268 GIIARY---GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC---PDPYAPLIMG 321
             ++     G +  +   +     P            +       PC   P+     + G
Sbjct: 188 YALSEKDRLGENHPLFLTQYR-LLPVSGGGGMFSSEQLGLLKGSHPCQLYPENGKVYVAG 246

Query: 322 CDIAEE-----GGDNTVVVLRRGPVIEHLFD----------------------WSKTDLR 354
            D+A E         T V LRR  ++  + +                      W      
Sbjct: 247 LDLAGEDVQSAADLPTAVNLRRDSIVLTIAELDYTFAKAPFNLPQVRLVCHCSWQGARHA 306

Query: 355 TTNNKISGLVEK-YRPDAIIIDANNTGARTCDYLEM-LGYHVYRVLGQKRA 403
               K+  L+ K ++   + +DA   G     +L   LG  +   + Q  +
Sbjct: 307 LLYEKLVELLGKVWKCRKVAVDATGLGQPVASFLRESLGSRILPFVFQPSS 357


>gi|331662794|ref|ZP_08363717.1| putative phage terminase [Escherichia coli TA143]
 gi|331061216|gb|EGI33180.1| putative phage terminase [Escherichia coli TA143]
          Length = 407

 Score = 43.2 bits (100), Expect = 0.11,   Method: Composition-based stats.
 Identities = 63/362 (17%), Positives = 107/362 (29%), Gaps = 55/362 (15%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVSKWLSLLPNKHW 140
           AG G GKT +    +       PGI+    A +  Q++   +    EV+    L    + 
Sbjct: 28  AGFGSGKTWVGCGGICKGTWEHPGINQGYFAPTYPQIRDIFYPTVEEVAADWGLNVKINE 87

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
              +    +   +    +              CR  S E+P T VG      +    DE 
Sbjct: 88  GNKEVHFYYGRQYRGTTI--------------CR--SMEKPQTIVGFKIGNAL---VDEL 128

Query: 201 SGTPDV----INLGILGFLTER-NANRFWIMTSNPRRLSGKFYEIFNKPL-------DDW 248
              P          I+  +  + +  R  I  +         YE F K +         +
Sbjct: 129 DILPKEKARTAWRKIIARMRYKIDGLRNGIDVTTTPEGFKFVYEQFVKAVREKTELASLY 188

Query: 249 KRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
              Q  T   E  +   +   ++  Y    ++ +  + GQF      +        +  N
Sbjct: 189 GLVQASTFDNEKNLPADYIPSLLESY--PPELIKAYLRGQFTNLTSGTVYH-QFDRKLNN 245

Query: 308 REPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL---- 363
            E    P  P+ +G D         + VLR G         +  D       I       
Sbjct: 246 CEEVEQPGEPIYIGMDFNVGKMAGIIHVLRLGLPCAVTEIINAYDTPDMIRIIKERFWLY 305

Query: 364 ----VEKYRPDAIIIDANN-----TGARTCD--YLEMLGYHVYRVLGQKRAVDLEFCRNR 412
                 K R   I  DA+      + A T D   L+  G++V  V+        +   + 
Sbjct: 306 DGNDYRKVREIYIYPDASGDSRKSSNASTTDIAQLKQAGFNV--VVNSSNPPVKDRVNSM 363

Query: 413 RT 414
             
Sbjct: 364 NA 365


>gi|269836053|ref|YP_003318281.1| hypothetical protein Sthe_0020 [Sphaerobacter thermophilus DSM
           20745]
 gi|269785316|gb|ACZ37459.1| conserved hypothetical protein [Sphaerobacter thermophilus DSM
           20745]
          Length = 497

 Score = 43.2 bits (100), Expect = 0.11,   Method: Composition-based stats.
 Identities = 58/385 (15%), Positives = 100/385 (25%), Gaps = 72/385 (18%)

Query: 50  APRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP--G 107
            PR +Q E M  V A  +              I   R  GK    A L+ +L++     G
Sbjct: 29  RPRRYQAEPMRAVAAAVVARARGDRSHPADFGIVFSRQAGKDEALAQLIAYLLTLFQRAG 88

Query: 108 ISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSK 167
            S++    +           +     L  ++    +                   G   +
Sbjct: 89  GSIVVALPT-----------LRPQGILARDRLIERLTCERARALGLRP---RVQDGTIVR 134

Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMT 227
                C   S        G   T  + ++ +E           +   +          M 
Sbjct: 135 LGRAACHFVSAGPQSNARGQ--TASLLLVANECQDIRPERWDSVFAPMAASTDAVTLSMG 192

Query: 228 SNPRRLS---------------GKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR 272
           +     +                    +F  P   W   Q+  + V      + E  IA 
Sbjct: 193 TVWTADTLLARQMRHLAALEAEDGRRRLFRVP---W---QVVAQEVPSYGR-YVERQIAL 245

Query: 273 YGLDSDVTRVEVC--------GQFPQQDIDSF----------IPLNIIEEALNREPCPDP 314
            G D    R E          G FP                 +P    E AL  +   + 
Sbjct: 246 LGADHPFIRTEYELLELDGQGGLFPPSRQGQMQGDHPPLTRAVPGE--EYALLLDVAGEE 303

Query: 315 YAPLIMG--CDIAEEGGDNTVVVLR--------RGPVIEHLFDWSKTDLRTTNNKISGLV 364
              +  G   D A       + V+R        R  V+     W+       + ++  L 
Sbjct: 304 EESVDPGRAYDPAARRDSTALTVVRVVHQDARPRYEVVRRYL-WTGVKHTALHAQLVDLA 362

Query: 365 EK-YRPDAIIIDANNTGARTCDYLE 388
              +R   +++DA   GA    +L 
Sbjct: 363 RHVWRARYVVVDATGVGAGLASFLR 387


>gi|296393586|ref|YP_003658470.1| terminase [Segniliparus rotundus DSM 44985]
 gi|296180733|gb|ADG97639.1| Terminase [Segniliparus rotundus DSM 44985]
          Length = 498

 Score = 42.8 bits (99), Expect = 0.12,   Method: Composition-based stats.
 Identities = 70/388 (18%), Positives = 118/388 (30%), Gaps = 70/388 (18%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQ    E +  H L    +  P        A R  GK+ L   L LW +  + G+ V  
Sbjct: 45  PWQ----EWLLTHALEVRPDGLPRFRTVIALAARQNGKSLLMIVLALWRVYVKGGVVVGT 100

Query: 113 ---LANSETQLKTTLWAEV----------------------SKWLSLLPNKHWFEMQSLS 147
              LANSE       W E                        K L L     +    +  
Sbjct: 101 AQDLANSE-----KAWGEAVELAEGTPELASEVLHVDKTNGKKSLRLHSGAQYRIAAASR 155

Query: 148 LHPAPWYSDVLHCSLGIDSKH---YSTMCRTYSEERPDTF------VGHHNTYGMAII-- 196
                + +D++      + +    ++ + +T +  RPD         G H +  +A +  
Sbjct: 156 RGARGFTADLILLDELREHQSFDSWAAVTKT-TMARPDAQVWCLSNAGDHLSVVLAHLRN 214

Query: 197 -----NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF 251
                 D   G PD +         +  A    +         G       K    W + 
Sbjct: 215 IAHRQLDWPDGKPDHVEDQAP----DDEAEDDSVGIFEWSAPPG----CDPKDRHAWAQA 266

Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDS-DVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
                 V   + +    I + Y  D   V   EV  Q+P        P    +   +   
Sbjct: 267 N-PALGVTITERA----IASAYATDPAPVFAAEVLCQWPLTVTPGPFPPGSWDSTRDDNS 321

Query: 311 CPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKIS--GLVEKYR 368
                +P ++G D+A     +TV +   G   + +     T  R  ++ ++      + +
Sbjct: 322 TIATDSPRVVGLDMAWN--RSTVTLALAGRRDDGMAHVEITAQRAGSDWVAPWLAERREK 379

Query: 369 PDAIIIDANNTGA-RTCDYLEMLGYHVY 395
             A+I+ AN   A      LE  G  V 
Sbjct: 380 IAAVIVQANGAPASSLVADLEAAGLPVI 407


>gi|114569469|ref|YP_756149.1| hypothetical protein Mmar10_0918 [Maricaulis maris MCS10]
 gi|114339931|gb|ABI65211.1| protein of unknown function DUF264 [Maricaulis maris MCS10]
          Length = 450

 Score = 42.8 bits (99), Expect = 0.12,   Method: Composition-based stats.
 Identities = 66/409 (16%), Positives = 113/409 (27%), Gaps = 59/409 (14%)

Query: 84  AGRGIGKTTLNA-WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFE 142
            GRG GKT   A W+    + T     +  +  +   ++  +          +      +
Sbjct: 67  GGRGAGKTRAGAEWVRHRALRTV--SRIALVGPTFNDVREVM----------IEGPSGLK 114

Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASG 202
               ++    + +          S+ Y+     +S E  D   G    Y      DE + 
Sbjct: 115 HLGSAMERPRYEASRKRLVFPSGSQAYA-----FSAEDADGLRGPQFDYA---WGDEFAA 166

Query: 203 TPDV---INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE 259
            PD    ++   +G             T  P        + ++         Q       
Sbjct: 167 WPDPQRVLDTLRMGVRLGGAPRILLTTTPRPIPALKALVKAWDPRGPIRVTHQPTAANAA 226

Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLI 319
            + P F E + A YG  S + R EV G        +      IE A            ++
Sbjct: 227 NLAPGFVEALNAAYG-GSMLGRQEVEGLLIDDPDGALWTRPKIEAARLAAGQMPELDRIV 285

Query: 320 MGCDIAEEGG---DNT--VVVLRRGP------VIEHLFDWSKTDLRTTNNKISGLVEKYR 368
           +  D    GG   D    VV    G       V+     +          + +   + Y 
Sbjct: 286 VALDPPATGGPRSDECGIVVAGAHGEGPARIAVVLADLSFGPALPADWAARAASAFDDYS 345

Query: 369 PDAIIIDANNTGARTCDYLEML--GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF 426
            DA+I +AN  G      L+    G  V  V   +           R E    +      
Sbjct: 346 ADALIAEANQGGEMVRSVLQAAAPGLPVRLVHASRGKR-------ARAEPVAALYAAGRV 398

Query: 427 ASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
                   L   + +   F  P+  +            S D  D L++ 
Sbjct: 399 RHARPFPALEDQMCA---FGAPDGPK-----------SSPDRVDALVWA 433


>gi|308188181|ref|YP_003932312.1| Terminase, ATPase subunit (GpP) [Pantoea vagans C9-1]
 gi|308058691|gb|ADO10863.1| Terminase, ATPase subunit (GpP) [Pantoea vagans C9-1]
          Length = 190

 Score = 42.8 bits (99), Expect = 0.12,   Method: Composition-based stats.
 Identities = 18/81 (22%), Positives = 30/81 (37%), Gaps = 9/81 (11%)

Query: 317 PLIMGCDIAEE---GGDNTVVVL----RRGPVIEHL--FDWSKTDLRTTNNKISGLVEKY 367
            + +G D A+    G     VV+      G     L    W   D R   + I  L ++Y
Sbjct: 9   EVWIGYDPAKGTQNGDSAGCVVMAPPAVPGGKFRILERHQWRGMDFRAQADAIRTLTQQY 68

Query: 368 RPDAIIIDANNTGARTCDYLE 388
               I ID+ + G    + ++
Sbjct: 69  NVTYIGIDSTSVGLGVYENVK 89


>gi|58040880|ref|YP_192844.1| Phage DNA packaging protein [Gluconobacter oxydans 621H]
 gi|58003294|gb|AAW62188.1| Phage DNA Packaging Protein [Gluconobacter oxydans 621H]
          Length = 435

 Score = 42.8 bits (99), Expect = 0.13,   Method: Composition-based stats.
 Identities = 63/436 (14%), Positives = 119/436 (27%), Gaps = 87/436 (19%)

Query: 83  SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFE 142
             G   GKT L    V+      PG                ++            KH   
Sbjct: 26  RGGSRSGKTFLLVRAVVIRAVKAPGSR------------HGIFR-----HRFNALKHTII 68

Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY--------SEERPDTFVGHHNTYGMA 194
             +        + D+ +     D   Y T+            S +R +  +G        
Sbjct: 69  GDTFPKVMRLCFPDLPYTLNRTD--WYVTLPNGSEILFHGLDSSDRTEKILGL---EFAT 123

Query: 195 IINDEASGTPDVINLGILGFLTERNANRFWIMT-SNPRRLSGKFYEIFNKPL-------- 245
           +  +EAS         +L  L ++          +NP   S   Y +F + +        
Sbjct: 124 VYMNEASQISYAARNMLLTRLAQKTCLSVKEYIDANPPTTSHWLYSLFEQKIEPKSGEPL 183

Query: 246 ---DDWKRFQIDTRTVE-GIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDIDSFIPLN 300
              DD+   QI+  +    + P +   + A      +  R     G +      +   L+
Sbjct: 184 PYPDDYATMQINPDSNRANLSPEYLAQLEAL----PEKERQRFLFGNYQTAIDGALWTLD 239

Query: 301 IIEEALN-----REPCPDPYAPLIMGCDIAE------EGGDNTVVVL----RRGPVIEHL 345
            I          R         +++  D +          D   + +    R G      
Sbjct: 240 RIRRLAQVTNETRAAVLADMRRIVVSVDPSGCSGNEDYKSDEIGISVCGIDRDGNGHVFA 299

Query: 346 FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405
               +             ++ +  D I+ + N  GA              R     + V 
Sbjct: 300 DLTCRAGPAGWAKVAIDAMDLWGADRIVAEKNFGGAMV-----EQTIRSVRATAPVKLVT 354

Query: 406 LEFCRNRRTELHVKMADWLEFASLINH---SGLIQNLKSLKSFIVPNTGELAIESKRVKG 462
               +  R E    +A   E   + +H     L + L          +G         +G
Sbjct: 355 ASRGKTARAE---PIAALYEQGKVFHHGRFPDLEEQLC-----QFSASGF--------QG 398

Query: 463 AKSTDYSDGLMYTFAE 478
           A+S D +D +++  +E
Sbjct: 399 ARSPDRADSMVWGLSE 414


>gi|29374972|ref|NP_814125.1| hypothetical protein EF0333 [Enterococcus faecalis V583]
 gi|29342430|gb|AAO80196.1| conserved hypothetical protein TIGR01630 [Enterococcus faecalis
           V583]
          Length = 418

 Score = 42.8 bits (99), Expect = 0.13,   Method: Composition-based stats.
 Identities = 45/305 (14%), Positives = 89/305 (29%), Gaps = 35/305 (11%)

Query: 86  RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145
           RG  KTT  A  +  LM   P  ++I L  ++T     +  E+   ++ + +  +F+   
Sbjct: 52  RGSFKTTTLAIAIALLMVLFPNKNIIFLRKTDT---DVV--EIILQVAKVLSSKYFKTLV 106

Query: 146 LSLH--PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
            +L+        +                 +        +  G H      +I D+    
Sbjct: 107 FALYGVELVLLKETTTEIDTNLKTSTRGTSQLLGMGIYASLTGKHAD---IVITDDIVNI 163

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI---FNKPLDDWKRF---QIDTRT 257
            D ++               +    N +   G+F      ++K     K     + D   
Sbjct: 164 KDRVSRA-----EREKTKLQYQELQNVKNREGRFINTGTPWHKEDAISKMPNVKKFDCYE 218

Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317
              ID    + +  +  +   +       +        F     I+   N       +  
Sbjct: 219 TGLIDKEQRKAL--QQSMTPSLFAANYELKHIADSESLFTAPTYID-NTNLIYNGVAH-- 273

Query: 318 LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII 373
                D A  GGD+T   +    + G +I     W K        +I  L + Y+     
Sbjct: 274 ----IDAAYGGGDSTAFTIFKEQKDGTIIGFGKKWQKHVDDCLP-EILQLHQYYQAGTFY 328

Query: 374 IDANN 378
            + N 
Sbjct: 329 TETNG 333


>gi|59712621|ref|YP_205397.1| terminase, ATPase subunit [Vibrio fischeri ES114]
 gi|59480722|gb|AAW86509.1| terminase, ATPase subunit [Vibrio fischeri ES114]
          Length = 588

 Score = 42.8 bits (99), Expect = 0.14,   Method: Composition-based stats.
 Identities = 29/243 (11%), Positives = 64/243 (26%), Gaps = 35/243 (14%)

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
             ID     G      + +  +Y  D  V    +   F      SF  +  +        
Sbjct: 332 ITIDDAIEGGATFFNMDKLRRKY-PDKSVFNNLLRCVFLDDAS-SFFSIKSLLACKTDTD 389

Query: 311 CPDPYA----------PLIMGCDIAEEG-----GDNTVVV----LRRGPVIEHLFDWS-- 349
                            +++G D    G      D  ++V    + +G     +      
Sbjct: 390 NWKDVDLESLHPVGRREVLVGYDPRGGGQGEGADDAGLIVSLKPIIKGGAFRFIERVRLK 449

Query: 350 KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409
            +        I  + +KY    + ID    G+   + +                  L++ 
Sbjct: 450 GSSYEDQAAAIEAICKKYNVVYLAIDVGGVGSAVAELVR---------KFYPGLTTLDYS 500

Query: 410 RNRRTELHVKMADWLEFASLINH---SGLIQNLKSLKSFIVPNTGELAIESKRVKGAKST 466
              +  +  K  + +    L        L+ +   ++      + ++   S R K     
Sbjct: 501 PEMKRMMAYKAREIINAGRLQFDNEWDDLVHSFLMIRQHTTKMSNQITFVSARNKVGSHA 560

Query: 467 DYS 469
           D +
Sbjct: 561 DLA 563


>gi|327198111|ref|YP_004306641.1| terminase large subunit [Enterococcus phage EFRM31]
 gi|297179206|gb|ADI23907.1| terminase large subunit [Enterococcus phage EFRM31]
          Length = 574

 Score = 42.8 bits (99), Expect = 0.14,   Method: Composition-based stats.
 Identities = 48/295 (16%), Positives = 92/295 (31%), Gaps = 52/295 (17%)

Query: 76  EVFKGAISAGRGIGKTTLNAWLVLW-LMST-RPGIS--VICLANSETQLKTTLWAEVSKW 131
              K  IS  R  GK+ L A + L+  +    P  S  ++  AN++ Q            
Sbjct: 97  RFRKVYISLARKNGKSILVAGISLYEFLLGQYPQASRQIVAAANTKDQ------------ 144

Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191
             ++ N    ++++L                 I+     +  +  S +  D+  G     
Sbjct: 145 AGIVFNMLKSQLKALRAVSDGTRKVTKVNKKDIEHLEDESTVKPLSSDA-DSLDGLDVLC 203

Query: 192 GMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD---- 247
           G+     EA  T   +   +    +++      I+++  + L+G  + I    +      
Sbjct: 204 GVLDEYGEAKSTA--MIEVLESSQSQQLQGLILIISTTTKNLNGPMHSIEYPFITKLLNE 261

Query: 248 -----------WKRFQIDTRTVE---------GIDPSFHEGI-------IARYGLDSDV- 279
                      W+   +     E           +   HE +       +A Y    D+ 
Sbjct: 262 EVEADAYLALCWEMDSLSEVDDEANWIKSNPLFENAQLHETMYEHKVNSLAEYKAKGDMS 321

Query: 280 -TRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTV 333
               +    + Q   DSFI     E     +P      P+ +G D+A  G    V
Sbjct: 322 GWLTKEMNFWVQSSQDSFIDKESWEAVKQTQPYDIKGRPVYIGLDLARTGDMTAV 376


>gi|238694889|ref|YP_002922083.1| Dda DNA helicase [Enterobacteria phage JSE]
 gi|220029025|gb|ACL77960.1| Dda DNA helicase [Enterobacteria phage JSE]
          Length = 463

 Score = 42.8 bits (99), Expect = 0.14,   Method: Composition-based stats.
 Identities = 29/177 (16%), Positives = 53/177 (29%), Gaps = 34/177 (19%)

Query: 57  EFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANS 116
           + +     H  + +     +     +    G GKT +  ++V      R G++ + LA  
Sbjct: 8   DMLTDGQKHAFDVLMKRIEQKKHTTVRGAAGTGKTAMMKFIVQ--EMVRRGVTGVVLATP 65

Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY 176
             Q K  L   V +                        +  LH  L ++  +Y       
Sbjct: 66  THQAKKVLSKAVGR-----------------------QAFTLHALLRLNPTNYEDTQVFE 102

Query: 177 SEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRL 233
            ++ P             II DEAS     +   +   +   N     I   +P +L
Sbjct: 103 QKDTPKL------DDVQIIIVDEASMVDKKLFDIL---MKSINGRIVIIAVGDPHQL 150


>gi|157311312|ref|YP_001469355.1| Dda DNA helicase [Enterobacteria phage Phi1]
 gi|149380516|gb|ABR24521.1| Dda DNA helicase [Enterobacteria phage Phi1]
          Length = 463

 Score = 42.8 bits (99), Expect = 0.14,   Method: Composition-based stats.
 Identities = 29/177 (16%), Positives = 53/177 (29%), Gaps = 34/177 (19%)

Query: 57  EFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANS 116
           + +     H  + +     +     +    G GKT +  ++V      R G++ + LA  
Sbjct: 8   DMLTDGQKHAFDVLMKRIEQKKHTTVRGAAGTGKTAMMKFIVQ--EMVRRGVTGVVLATP 65

Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY 176
             Q K  L   V +                        +  LH  L ++  +Y       
Sbjct: 66  THQAKKVLSKAVGR-----------------------QAFTLHALLRLNPTNYEDTQVFE 102

Query: 177 SEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRL 233
            ++ P             II DEAS     +   +   +   N     I   +P +L
Sbjct: 103 QKDTPKL------DDVQIIIVDEASMVDKKLFDIL---MKSINGRIVIIAVGDPHQL 150


>gi|49474625|ref|YP_032667.1| phage related protein [Bartonella quintana str. Toulouse]
 gi|49240129|emb|CAF26575.1| phage related protein [Bartonella quintana str. Toulouse]
          Length = 402

 Score = 42.8 bits (99), Expect = 0.14,   Method: Composition-based stats.
 Identities = 31/191 (16%), Positives = 63/191 (32%), Gaps = 11/191 (5%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFNK-PLDDWKR 250
               DEA    +     ++  L E     N    +T NP R +    + F      + K 
Sbjct: 86  LCWVDEAEPVTETAWQTLIPTLREEGKDWNAELWVTWNPCRENAPVEKRFRNVENPNIKG 145

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
            +I+ R         +    A      +       G++ Q    ++    ++E  L    
Sbjct: 146 AEINWRDNPLFPQKLNRDRKADLQQRPENYNHIWEGEYLQSVQGAYYQKALLEAELEGRI 205

Query: 311 CPDPYAP---LIMGCDIAEEG--GDNTVVVLRR--GPVIEHLFDWSKTDLRTTNNKISGL 363
              P  P   + +  DI   G   D T + + +  G  I  L D+ +   +     +  +
Sbjct: 206 TNVPRDPLMQIKIFWDIGGTGAKADATALWVAQFIGREIRVL-DYYEAQGQPLAEHVGWV 264

Query: 364 VEKYRPDAIII 374
            ++    A+++
Sbjct: 265 FQRGYEKALMV 275


>gi|294011207|ref|YP_003544667.1| hypothetical protein SJA_C1-12210 [Sphingobium japonicum UT26S]
 gi|292674537|dbj|BAI96055.1| conserved hypothetical protein [Sphingobium japonicum UT26S]
          Length = 437

 Score = 42.8 bits (99), Expect = 0.14,   Method: Composition-based stats.
 Identities = 64/406 (15%), Positives = 114/406 (28%), Gaps = 61/406 (15%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           AGRG GKT   A  V  +  + P   +  +  +  + +  +                 E 
Sbjct: 58  AGRGFGKTRAGAEWVRSVAESDPKARIALVGATLGEARAVM----------------VEG 101

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKH-YSTMCRTYSEERPDTFVGHHNTYGMAIINDE--- 199
            S  L  APW++  +              +   Y     ++  G   ++G     DE   
Sbjct: 102 ASGILAVAPWWNRPVFAPALRKLVWPNGAVATLYGAAEAESLRGPQFSHG---WADEIAK 158

Query: 200 -ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTV 258
            A G     +  ++G             T  P  L      +     D           V
Sbjct: 159 WAGGQA-AWDNLMMGMRLGGAPRVLATTTPRPVPLVRGL--VARAGGDVVVTRGRTADNV 215

Query: 259 EGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPL 318
             +   F   +   YG    + R E+ G+  ++   +    +++E              +
Sbjct: 216 AHLADGFLAAMERSYGGT-RLGRQELDGELIEEVEGALWSRDLLERCRVAHVRG-GLTRV 273

Query: 319 IMGCD-IAEEGGDNT---VVVLRRGPVIEHLFDWS--KTDLRTTNNKISGLVEKYRPDAI 372
           ++  D  A   GD     VV L        + D +             SG    +  D +
Sbjct: 274 VVAVDPPASAHGDACGIVVVGLGEDRRAYVIADATVEGATPEGWARAASGAALVHGADRV 333

Query: 373 IIDANNTGARTCDYLE--MLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADW---LEFA 427
           + +ANN GA     L        V  V   +  V        R E    + +    +   
Sbjct: 334 VAEANNGGAMVESVLRAAEAALPVRLVHASRGKV-------ARAEPVAALYEAGRVVHRG 386

Query: 428 SLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLM 473
                   +  L     ++ P               +S D +D L+
Sbjct: 387 GFAELEDQLCGLMLGGGYVGP--------------GRSPDRADALV 418


>gi|315181719|gb|ADT88632.1| terminase, ATPase subunit [Vibrio furnissii NCTC 11218]
          Length = 574

 Score = 42.8 bits (99), Expect = 0.14,   Method: Composition-based stats.
 Identities = 36/244 (14%), Positives = 72/244 (29%), Gaps = 37/244 (15%)

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
             ID    +G      E +  +Y  D  V    +   F       F    ++  A   + 
Sbjct: 318 ITIDDAIEKGATFFNMEKLRRKY-PDKTVFDNLLRCVFLDDSASIFALKALL--ACKTDS 374

Query: 311 -----------CPDPYAPLIMGCDI----AEEGGDNTVVVL-----RRGPVIEHLFDWS- 349
                       P   A +++G D       EG D+  +V+     R+G V   +     
Sbjct: 375 SLWKDVDHNKARPAGNAEVLVGYDPRGGGQGEGSDDAGLVVALKPKRKGGVFRLIERARL 434

Query: 350 -KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEF 408
             +        I  + EKY    + ID +  G+   + +      +  +           
Sbjct: 435 KGSSYEQQALAIKAMTEKYNVVHLAIDVSGVGSAVAELVRKFYPSLIELDYSPEV----- 489

Query: 409 CRNRRTELHVKMADWLEFASLINH---SGLIQNLKSLKSFIVPNTGELAIESKRVKGAKS 465
              +R  +  K  + +    L        L+ +   ++      + ++   S R K    
Sbjct: 490 ---KRM-MVYKAREIINDGRLQFDGEWDDLVHSFLMIRQQTTKASNQVTFISNRSKVGSH 545

Query: 466 TDYS 469
            D +
Sbjct: 546 ADLA 549


>gi|260769184|ref|ZP_05878117.1| terminase ATPase subunit [Vibrio furnissii CIP 102972]
 gi|260614522|gb|EEX39708.1| terminase ATPase subunit [Vibrio furnissii CIP 102972]
          Length = 574

 Score = 42.8 bits (99), Expect = 0.15,   Method: Composition-based stats.
 Identities = 36/244 (14%), Positives = 72/244 (29%), Gaps = 37/244 (15%)

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
             ID    +G      E +  +Y  D  V    +   F       F    ++  A   + 
Sbjct: 318 ITIDDAIEKGATFFNMEKLRRKY-PDKTVFDNLLRCVFLDDSASIFALKALL--ACKTDS 374

Query: 311 -----------CPDPYAPLIMGCDI----AEEGGDNTVVVL-----RRGPVIEHLFDWS- 349
                       P   A +++G D       EG D+  +V+     R+G V   +     
Sbjct: 375 SLWKDVDHNKARPAGNAEVLVGYDPRGGGQGEGSDDAGLVVALKPKRKGGVFRLIERARL 434

Query: 350 -KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEF 408
             +        I  + EKY    + ID +  G+   + +      +  +           
Sbjct: 435 KGSSYEQQALAIKAMTEKYNVVHLAIDVSGVGSAVAELVRKFYPSLIELDYSPEV----- 489

Query: 409 CRNRRTELHVKMADWLEFASLINH---SGLIQNLKSLKSFIVPNTGELAIESKRVKGAKS 465
              +R  +  K  + +    L        L+ +   ++      + ++   S R K    
Sbjct: 490 ---KRM-MVYKAREIINDGRLQFDGEWDDLVHSFLMIRQQTTKASNQVTFISNRSKVGSH 545

Query: 466 TDYS 469
            D +
Sbjct: 546 ADLA 549


>gi|240137990|ref|YP_002962462.1| hypothetical protein MexAM1_META1p1321 [Methylobacterium extorquens
           AM1]
 gi|240007959|gb|ACS39185.1| conserved hypothetical protein [Methylobacterium extorquens AM1]
          Length = 421

 Score = 42.8 bits (99), Expect = 0.15,   Method: Composition-based stats.
 Identities = 63/296 (21%), Positives = 100/296 (33%), Gaps = 43/296 (14%)

Query: 56  LEFMEVVDAHCLNSVNNPNPEVFKG-AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114
           L  +E    H       P P  +   A+  GRG GKT   A    W+     G  V    
Sbjct: 9   LRLLEADWLHLARHDQLPPPGNWTTWAVIGGRGSGKTRTGA---EWVRGLAQGDPVFTPE 65

Query: 115 NSET-QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173
             E   L    +A+V   +   P+     +  L   P  W         G        + 
Sbjct: 66  PVERIALVGETFADVRDVMIEGPSG-LLALPRLGGAPPVWQPSRRRVMFGN-----GAVA 119

Query: 174 RTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN-PRR 232
             +S E PD+  G       A+ +DE +              T  +  +F +     PR 
Sbjct: 120 LAFSAEEPDSLRG---PQFGAVWSDEVAK-----WREAE---TTYDMIQFGLRLGTHPRG 168

Query: 233 LSGKFYEIFNKPLDDWKRFQIDTRTV----------EGIDPSFHEGIIARYGLDSDVTRV 282
           L         +P+   +R   D RTV          + + PSF E ++ RY   + + R 
Sbjct: 169 LVT----TTPRPVPLIQRLLADPRTVVTRSRTADNAQNLAPSFLEEVVGRY-AGTRLGRQ 223

Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDI---AEEGGDNTVVV 335
           E+ G+  +   D+    + IE A   E  P  +  + +  D    +  G D   +V
Sbjct: 224 ELDGELIEDRPDALWTRDSIERARVFEAPPLQH--IAVAIDPPASSGVGADACGIV 277


>gi|213865421|ref|ZP_03387540.1| probable terminase subunit [Salmonella enterica subsp. enterica
           serovar Typhi str. M223]
          Length = 85

 Score = 42.8 bits (99), Expect = 0.15,   Method: Composition-based stats.
 Identities = 11/44 (25%), Positives = 15/44 (34%)

Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTC 384
            I     W   D R   + I  L ++Y    I ID+   G    
Sbjct: 17  RILERHQWRGMDFRAQADAIKKLTQQYNVTYIGIDSTGVGHGVY 60


>gi|213586958|ref|ZP_03368784.1| probable terminase subunit [Salmonella enterica subsp. enterica
           serovar Typhi str. E98-0664]
          Length = 67

 Score = 42.8 bits (99), Expect = 0.15,   Method: Composition-based stats.
 Identities = 11/44 (25%), Positives = 15/44 (34%)

Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTC 384
            I     W   D R   + I  L ++Y    I ID+   G    
Sbjct: 16  RILERHQWRGMDFRAQADAIKKLTQQYNVTYIGIDSTGVGHGVY 59


>gi|213162921|ref|ZP_03348631.1| probable terminase subunit [Salmonella enterica subsp. enterica
           serovar Typhi str. E00-7866]
          Length = 113

 Score = 42.8 bits (99), Expect = 0.15,   Method: Composition-based stats.
 Identities = 11/44 (25%), Positives = 15/44 (34%)

Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTC 384
            I     W   D R   + I  L ++Y    I ID+   G    
Sbjct: 16  RILERHQWRGMDFRAQADAIKKLTQQYNVTYIGIDSTGVGHGVY 59


>gi|291484314|dbj|BAI85389.1| hypothetical protein BSNT_02825 [Bacillus subtilis subsp. natto
           BEST195]
          Length = 577

 Score = 42.8 bits (99), Expect = 0.15,   Method: Composition-based stats.
 Identities = 43/278 (15%), Positives = 88/278 (31%), Gaps = 43/278 (15%)

Query: 89  GKTTLNAWLVLWLMSTRPGIS----VICLANSETQLKTTLWAEVSKWLSLLPNKHWF--E 142
           GK+ L A L L+ +           +   ANS  Q KT  +  +S  L  + +K  F  +
Sbjct: 112 GKSVLVAGLSLYELIYGEAPKFDRQIYATANSRGQAKTV-FKMISMQLKKIRSKSKFMRK 170

Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKH--------------YSTMCRTYSEERPDTFVGHH 188
              +  +   +  D                          Y T   T   E  ++  G  
Sbjct: 171 WTKIIQNEIRYLKDDCVIMPLSRDTDNLDSLNVLIGILDEYHTASNTKMMEVLESSQGQQ 230

Query: 189 NTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD- 247
           +   + II+   +G        + G +  +       + S  +     F  ++ +  ++ 
Sbjct: 231 DQGLILIIS--TAGFK------LNGPMYSQEYPYVDDILSGRKENENYFAIVYEQDDEEE 282

Query: 248 ------WKRFQIDTRTVEGIDPSFHEGIIARYGL-----DSDVTRVEVCGQFPQQDIDSF 296
                 W +       VEG+     + +  +        D + T V+    +     +SF
Sbjct: 283 IYDESTWIKSN-PLLEVEGLQKKILKNLRKKLKEALDKDDLNGTLVKNFNIWQSASSESF 341

Query: 297 IPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVV 334
           I  N  ++            P+ +G D++    D + +
Sbjct: 342 INGNDWKKRGVDVAPDITGKPVYIGIDLSRT-DDLSAL 378


>gi|253583367|ref|ZP_04860565.1| helicase [Fusobacterium varium ATCC 27725]
 gi|251833939|gb|EES62502.1| helicase [Fusobacterium varium ATCC 27725]
          Length = 1624

 Score = 42.4 bits (98), Expect = 0.16,   Method: Composition-based stats.
 Identities = 35/287 (12%), Positives = 81/287 (28%), Gaps = 24/287 (8%)

Query: 179  ERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFY 238
              P  F G         + D   G   +   G  G + + N  R+ +  S          
Sbjct: 1262 GNPSHFQGDERDVVFLSMVDSNDGVGPLAMKG-EG-IEDSNKKRYNVAVS---------- 1309

Query: 239  EIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIP 298
                     W    +D      +        +  Y  D     +E   +  +++ DS   
Sbjct: 1310 ---RAKDQLWIVHSLDMAN--DLKKGDIRRGLLEYSEDPKAFMIE---ESVKKNSDSVFE 1361

Query: 299  LNIIEEALNREPCPDPYAPL-IMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLR--T 355
              + +    R         +     D+     +  V +   G       +  K D+    
Sbjct: 1362 EEVAKYLYARGYNIIQQWEVGAYRIDMVAFFENKRVAIECDGERWHSTEEQVKQDIERQD 1421

Query: 356  TNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTE 415
               +      + R      +  +T     + LE  G +  +   +   +  E   N+   
Sbjct: 1422 ILERCGWDFIRIRGSRYFRNPEDTMKEVLEKLEKKGIYPEKTKSENYEIREEELLNKIKS 1481

Query: 416  LHVKMADWLEFASLINHSGLIQNLKSLKSFIVP-NTGELAIESKRVK 461
               ++ +  +    I    + + + +++   +      L IE+ ++K
Sbjct: 1482 RSFEIMELWKEQGNIEEIEITKEVNNIEDKEIKIPELVLKIENSKIK 1528


>gi|325848842|ref|ZP_08170352.1| putative phage terminase, large subunit [Anaerococcus hydrogenalis
           ACS-025-V-Sch4]
 gi|325480486|gb|EGC83548.1| putative phage terminase, large subunit [Anaerococcus hydrogenalis
           ACS-025-V-Sch4]
          Length = 462

 Score = 42.4 bits (98), Expect = 0.16,   Method: Composition-based stats.
 Identities = 36/205 (17%), Positives = 67/205 (32%), Gaps = 24/205 (11%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRL--SGKFYEIFNK------PL 245
            +I DEA    D     +   +T    N   IM   P     SG  +  F K      P 
Sbjct: 154 LLIIDEAQEYTDDQESALKYTVTSS-KNPQTIMCGTPPTPISSGMVFVNFRKQCLTSRPN 212

Query: 246 DDWKRFQI--------DTRTVEGIDPSF-----HEGIIARYGLDSDVTRVEVCGQFPQQD 292
           + +             D+      +PS         I    G D     ++  G +   +
Sbjct: 213 NAYWAEWSVPEMSDIHDSELWYKTNPSLGTIFTERSIEDEIGSDETDFNIQRLGLWISYN 272

Query: 293 IDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGP-VIEHLFDWSKT 351
             S I      + L  +  P     + +G     +G + ++ V  +    +  +      
Sbjct: 273 QKSAI-TEKEWQRLKLKSLPILTGEMHVGIKFGNDGTNVSLAVACKTLSKMIFIEAIDCQ 331

Query: 352 DLRTTNNKISGLVEKYRPDAIIIDA 376
           ++R  +N I   + K +P +++ID 
Sbjct: 332 NVRNGDNWIIDFLVKTKPKSVVIDG 356


>gi|254560550|ref|YP_003067645.1| hypothetical protein METDI2093 [Methylobacterium extorquens DM4]
 gi|254267828|emb|CAX23679.1| conserved hypothetical protein [Methylobacterium extorquens DM4]
          Length = 421

 Score = 42.4 bits (98), Expect = 0.16,   Method: Composition-based stats.
 Identities = 62/296 (20%), Positives = 98/296 (33%), Gaps = 43/296 (14%)

Query: 56  LEFMEVVDAHCLNSVNNPNPEVFKG-AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114
           L  +E    H       P P  +   A+  GRG GKT   A    W+     G  V    
Sbjct: 9   LRLLEADWLHLARHDQLPPPGNWTTWAVIGGRGSGKTRTGA---EWVRGLAYGDPVFSPE 65

Query: 115 NSET-QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173
             E   L    +A+V   +   P+     +  L   P  W         G        + 
Sbjct: 66  PVERIALVGETFADVRDVMIEGPSG-LLALPRLGGAPPVWQPSRRRVVFGN-----GAVA 119

Query: 174 RTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN-PRR 232
             +S E PD+  G       A+ +DE +                 +  +F +     PR 
Sbjct: 120 LAFSAEEPDSLRG---PQFGAVWSDEVAK-----WREAEAT---YDMIQFGLRLGTHPRG 168

Query: 233 LSGKFYEIFNKPLDDWKRFQIDTRTV----------EGIDPSFHEGIIARYGLDSDVTRV 282
           L         +P+   +R   D RTV          + + PSF E ++ RY   + + R 
Sbjct: 169 LVT----TTPRPVPLIRRLLADPRTVVTRSRTADNAQNLAPSFLEEVVGRY-AGTRLGRQ 223

Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDI---AEEGGDNTVVV 335
           E+ G+  +   D+    + IE A  R     P   + +  D    +  G D   +V
Sbjct: 224 ELDGELIEDRPDALWTRDSIERA--RVSEVPPLQRIAVAIDPPASSRVGADACGIV 277


>gi|224797098|ref|YP_002642985.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           CA-11.2a]
 gi|224554508|gb|ACN55891.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           CA-11.2a]
          Length = 396

 Score = 42.4 bits (98), Expect = 0.16,   Method: Composition-based stats.
 Identities = 33/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           + RG GKT   A + L    +  G   + +   + +   ++  E+ + LS+   + +F +
Sbjct: 26  SSRGTGKTYDIATVNLERKFSADGGDTLAIRKKKNKTTQSIHKEILELLSIYNLRKFFNI 85

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196
               +                           + ++R   F G H+T  +        + 
Sbjct: 86  SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124

Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254
            +EA+         ++  + E+    +  M+SNP   S   Y+ +  N+        +  
Sbjct: 125 LEEANQFSSDDIEMLIPTMREQGGRIY--MSSNPVPKSHWLYKRYLSNQDNPAVCIIKST 182

Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287
            R    ++    +  + +    Y  +    R+EV G+
Sbjct: 183 YRDNPFLNGGDVQAWLEKQKLAYHGNDIGFRIEVLGE 219


>gi|168029927|ref|XP_001767476.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162681372|gb|EDQ67800.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1075

 Score = 42.4 bits (98), Expect = 0.16,   Method: Composition-based stats.
 Identities = 22/134 (16%), Positives = 46/134 (34%), Gaps = 10/134 (7%)

Query: 80  GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKH 139
            A++A RG GK+      +          ++   A S   LKT L+  + K    +  K 
Sbjct: 279 VALTAARGRGKSAALGVAIA-GAVAFGYSNIFVTAPSPENLKT-LFEFIFKGFDAMEYKE 336

Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
             +   +    + +   ++  ++    +      +    E+              ++ DE
Sbjct: 337 HIDYDLVESTNSAFNKAIVRVNIFRQHRQTIQYIQPKDHEKLAQAE--------LLVIDE 388

Query: 200 ASGTPDVINLGILG 213
           A+  P  I   +LG
Sbjct: 389 AAAIPLPIVKALLG 402


>gi|320590344|gb|EFX02787.1| dead deah box DNA helicase [Grosmannia clavigera kw1407]
          Length = 2423

 Score = 42.4 bits (98), Expect = 0.17,   Method: Composition-based stats.
 Identities = 29/166 (17%), Positives = 52/166 (31%), Gaps = 25/166 (15%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    RPG  V+ +A         L  E +  W   L       
Sbjct: 1194 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAP-----MKALVRERIKDWGRRLAGPAGLR 1248

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA- 200
            +  L+    P    +    + +   + +  + R++         G+     + II DE  
Sbjct: 1249 LVELTGDNTPDTRTIGEADVIVTTPEKWDGISRSWQT------RGYVRKVSLVII-DEIH 1301

Query: 201  --SGTPDVINLGI------LGFLTERNANRFWI--MTSNPRRLSGK 236
              +G    I   I      +G  T  +     +    +N   L+  
Sbjct: 1302 LLAGDRGPILEIIVSRMNYIGAATGSSVRLLGMSTACANATDLASW 1347


>gi|66395738|ref|YP_240074.1| ORF009 [Staphylococcus phage 37]
 gi|62636161|gb|AAX91272.1| ORF009 [Staphylococcus phage 37]
          Length = 419

 Score = 42.4 bits (98), Expect = 0.17,   Method: Composition-based stats.
 Identities = 53/375 (14%), Positives = 122/375 (32%), Gaps = 43/375 (11%)

Query: 57  EFMEVVDAHCLNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLAN 115
           +  E++  H  +  +    +     +   GRG GK++  A +++ L+  R  ++ + L  
Sbjct: 4   KLSELIPEHFHSLWHAAKDKGKLNIVAKGGRGSGKSSDIAIIIV-LLIMRYPVNALILRK 62

Query: 116 SETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT 175
            +  L  +++ ++   ++++   H F+++                 +    +    + R 
Sbjct: 63  IDNTLALSVFEQIKWAINVMGVSHLFKIKVS------------PMEITYVPRGNKMVFRG 110

Query: 176 YSEERPDTFVGHHN---TYGMAIINDEASGTPDVINLGILGFL----TERNANRFWIMTS 228
              + P+      +    Y +A I + A    +     I   L     +      +  T 
Sbjct: 111 A--QNPERIKSLKDAQFPYAIAWIEELAEFKTEDEVTTITNSLLRGELDNGLFYKFFYTY 168

Query: 229 NPRRLSGKF----YEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV 284
           NP +    +    YE   +P + +            I   F E   A   ++    R E 
Sbjct: 169 NPPKRKQSWVNKKYESSFQPDNTFVHHS-TYLNNPFIAKEFIEEAKAAKAINELRYRWEY 227

Query: 285 CGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGP 340
            G+         +P N +      +   D +  +    D      D    V     ++  
Sbjct: 228 LGEAIGS---GVVPFNNLRIETIPKEQFDTFDNIRNAVDFG-YATDPLAFVRWHYDKKKR 283

Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQ 400
           +I  + +     +           + Y+ D I  D+     ++   L+   + + R+ G 
Sbjct: 284 IIYAVDEHYGVQISNREFANWLKKKGYQSDEIYADS--AEPKSIAELKQE-HSIRRIKGV 340

Query: 401 KRAVDL----EFCRN 411
           K+  D     E   N
Sbjct: 341 KKGPDSVEHGEQWLN 355


>gi|13242438|ref|NP_077457.1| DNA packaging terminase subunit 1 [Cercopithecine herpesvirus 9]
 gi|11036590|gb|AAG27219.1|AF275348_40 unknown [Cercopithecine herpesvirus 9]
          Length = 745

 Score = 42.4 bits (98), Expect = 0.17,   Method: Composition-based stats.
 Identities = 33/152 (21%), Positives = 49/152 (32%), Gaps = 22/152 (14%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GKT     L+  LMST  GI V   A             + K       +  FE     L
Sbjct: 271 GKTWFIVSLIALLMSTFRGIKVGYTA------------HIRK-----ATEPVFEEIKARL 313

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTM---CRTYSEERPDTFVGHHNTYGMAIINDEASGTPD 205
               W+       +  +S  +S     C T          G        +  DEA+    
Sbjct: 314 --EQWFGTERIEHVKGESITFSFSDGCCSTAVFSSSHNTNGIRGQTFNLLFVDEANFIRP 371

Query: 206 VINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
                I+GFL + N    ++ ++N  + S  F
Sbjct: 372 DAVQTIVGFLNQTNCKIIFVSSTNTGKASTSF 403


>gi|226315677|ref|YP_002775693.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           29805]
 gi|226202054|gb|ACO38634.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           29805]
          Length = 396

 Score = 42.4 bits (98), Expect = 0.18,   Method: Composition-based stats.
 Identities = 33/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           + RG GKT   A + L    +  G   + +   + +   ++  E+ + LS+   + +F +
Sbjct: 26  SSRGTGKTYDIATVNLERKFSVDGGDTLAIRKKKNKTTQSIHKEILELLSIHNLRKFFNI 85

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196
               +                           + ++R   F G H+T  +        + 
Sbjct: 86  SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124

Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254
            +EA+         ++  + E+    +  M+SNP   S   Y+ +  N+        +  
Sbjct: 125 LEEANQFSADDIEMLIPTMREQGGRIY--MSSNPVPKSHWLYKRYLSNQDNPAVCIIKST 182

Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287
            R    ++    +  + +    Y  +    R+EV G+
Sbjct: 183 YRDNPFLNGGDVQAWLEKQRLAYHGNDIGFRIEVLGE 219


>gi|260431843|ref|ZP_05785814.1| conserved hypothetical protein [Silicibacter lacuscaerulensis
           ITI-1157]
 gi|260415671|gb|EEX08930.1| conserved hypothetical protein [Silicibacter lacuscaerulensis
           ITI-1157]
          Length = 176

 Score = 42.4 bits (98), Expect = 0.18,   Method: Composition-based stats.
 Identities = 26/127 (20%), Positives = 39/127 (30%), Gaps = 9/127 (7%)

Query: 182 DTFVGHHNTYGMAIINDEASGT-PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240
           +   G        II DEA    PD                 F + T N  R SG FYE 
Sbjct: 49  ENARGETAD---LIIGDEACFIQPDEALTAFFPMRRSTG-RIFLLSTPNGTR-SGYFYET 103

Query: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
           +    +  +R +  +      D         R  +     R E   ++      S +  N
Sbjct: 104 WESDANV-RRIRARSMDTTREDRLAQIEFDRR-TMSDATFRREHLCEWVGAGE-SLLSWN 160

Query: 301 IIEEALN 307
            +E A+ 
Sbjct: 161 TLERAMQ 167


>gi|289432252|ref|YP_003462125.1| hypothetical protein DehalGT_0302 [Dehalococcoides sp. GT]
 gi|288945972|gb|ADC73669.1| conserved hypothetical protein [Dehalococcoides sp. GT]
          Length = 420

 Score = 42.4 bits (98), Expect = 0.18,   Method: Composition-based stats.
 Identities = 49/295 (16%), Positives = 86/295 (29%), Gaps = 59/295 (20%)

Query: 154 YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILG 213
           ++D+ H   G   +         S E   + VG  NT  + +  DEA             
Sbjct: 87  FTDIYHTEGGYIIRLNQARAVFLSAEPSASVVG--NTAHLLLEVDEAQDVNKEKY----- 139

Query: 214 FLTERNANRFWIMTSNPRRLSGKFYEIFN-------------KPLDDWKRFQIDTRTVEG 260
               +        T+    L G  ++ F+             +     + F+ D   V  
Sbjct: 140 ---SKEFKPMGATTNVTTVLYGTTWDSFSLLEEIKEQNIEKEQKDGLKRHFRYDWEAVAA 196

Query: 261 IDPSFHE---GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC---PDP 314
            +P++         R G +  +   +     P            ++      PC   P+ 
Sbjct: 197 HNPAYLAYALSEKERLGENHPLFLTQYR-LLPVSGGGGMFSNEQLDLLKGNHPCQVYPEK 255

Query: 315 YAPLIMGCDIAEE-----GGDNTVVVLRRGPVIEHL----------------------FD 347
               + G D+A E     G   T V LRR   +  +                      + 
Sbjct: 256 GKVYVAGLDLAGEDSQTGGISPTTVNLRRDSSVLTIAQLDYTFAKAPYNLPQVRLVCHYS 315

Query: 348 WSKTDLRTTNNKISGLVEK-YRPDAIIIDANNTGARTCDYLEM-LGYHVYRVLGQ 400
           W  T       K+  L+ K ++   + +DA   G     +L   LG  +  V  Q
Sbjct: 316 WQGTRHALLYEKLVELLGKVWKCRKVAVDATGLGQPVASFLRESLGSRILPVPFQ 370


>gi|298290710|ref|YP_003692649.1| hypothetical protein Snov_0699 [Starkeya novella DSM 506]
 gi|296927221|gb|ADH88030.1| protein of unknown function DUF264 [Starkeya novella DSM 506]
          Length = 428

 Score = 42.4 bits (98), Expect = 0.18,   Method: Composition-based stats.
 Identities = 65/414 (15%), Positives = 116/414 (28%), Gaps = 65/414 (15%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGI---SVICLANSETQLKTTLWAEVSKWLSLLPNK 138
           +  GRG GKT   A  V  L   R G     +  +A S   L+  +   VS  L++ P  
Sbjct: 42  VLGGRGAGKTRAGAEWVRALAFGRAGPPAGRIALVAESLGDLREVMVEGVSGLLAVHPRG 101

Query: 139 HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIND 198
                +                           + + +S + P++  G       A   D
Sbjct: 102 ERPTWEPTR---------------KRLEWPNGAVAQGFSADDPESLRGPQFD---AAWCD 143

Query: 199 EASGTPDVINLGILGFLTERNANRFW-----IMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
           E +                             M +   R +     +   P     R   
Sbjct: 144 ELAK-----WRYAQAAFDNLQFGLRLGARPRQMVTTTPRPTTLLRALLADPRTAVTRM-G 197

Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE--PC 311
                  + P F E ++ RY   + + R E+ G+  +   D+     +IE          
Sbjct: 198 TAENAAHLAPHFLETVVGRY-AGTRLGRQELDGELIEDRPDALWSRALIEAGREAAAPEM 256

Query: 312 PDPYAPLIMGCDI---AEEGGDNTVVV---LRRGPVIEHLFDWSKTDLRTT--NNKISGL 363
                 +++  D    + +  D   +V   + R  ++  L D S   L  T    +  GL
Sbjct: 257 VRQMERIVVAVDPPASSRKHADACGLVAAGIDRDGLVHVLADESAQGLTPTGWGGRAVGL 316

Query: 364 VEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRRTELHVKMA 421
             +   D ++++ N  G      L  +     V  V   +           R E    + 
Sbjct: 317 FHRLEADRVVVEVNQGGEMVKSILAGIDPSVPVREVRATRGKW-------LRAEPVAALY 369

Query: 422 DWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
           +            L   L         + G            +S D  D L++ 
Sbjct: 370 EQGRVRHAGAFPALEDELC-----DFGSDGL--------SNGRSPDRLDALVWA 410


>gi|225552551|ref|ZP_03773490.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1]
 gi|225370879|gb|EEH00310.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1]
          Length = 450

 Score = 42.4 bits (98), Expect = 0.19,   Method: Composition-based stats.
 Identities = 31/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  ++A+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNKATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNVATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI +  +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITQNYVFTSP--------IAYLDPAFSIGGDNTALCV 299


>gi|224534955|ref|ZP_03675523.1| conserved hypothetical protein [Borrelia spielmanii A14S]
 gi|224513774|gb|EEF84100.1| conserved hypothetical protein [Borrelia spielmanii A14S]
          Length = 285

 Score = 42.4 bits (98), Expect = 0.19,   Method: Composition-based stats.
 Identities = 21/122 (17%), Positives = 36/122 (29%), Gaps = 4/122 (3%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDS 264

Query: 295 SF 296
            F
Sbjct: 265 IF 266


>gi|23335598|ref|ZP_00120832.1| hypothetical protein Blon03000707 [Bifidobacterium longum DJO10A]
 gi|189440021|ref|YP_001955102.1| phage terminase large subunit [Bifidobacterium longum DJO10A]
 gi|189428456|gb|ACD98604.1| Phage terminase large subunit [Bifidobacterium longum DJO10A]
          Length = 477

 Score = 42.4 bits (98), Expect = 0.19,   Method: Composition-based stats.
 Identities = 55/376 (14%), Positives = 105/376 (27%), Gaps = 60/376 (15%)

Query: 52  RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISA-GRGIGKTTLNAWLVLWLMSTRPGISV 110
             WQ +   ++ A   +   +      +  + +  R  GKT    W+ +   +  PG+ +
Sbjct: 37  DVWQRQINRIILAKSADGFWSA-----RNTVLSIPRQTGKTYDIGWVAIHRAARTPGMRI 91

Query: 111 ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLH---CSLGIDSK 167
           +  A               +  S++ +        +         D  H    + G +  
Sbjct: 92  VWTA---------------QHFSVIKDTFESLCAIVLRPEMSGLVDPDHGISLAAGKEEI 136

Query: 168 HYSTMCRT-YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIM 226
            +    R  +         G        ++ DEA    D     +L     R  N   I 
Sbjct: 137 RFRNGSRIFFRARERGALRGV--KKIALLVIDEAQHLSDSAMASMLPT-QNRAWNPQTIY 193

Query: 227 TSNPRRLSGKFYEIFNKPLDDWKRFQIDT---------RTVEGIDPSFHEGIIARY---- 273
              P        E F +  D  +  +  +         R  + +D          Y    
Sbjct: 194 MGTPPGPRDNG-EAFTRLRDKARAGRTHSTLYVEFTADRDADPLDRQQWRKANPSYPSHT 252

Query: 274 ----------GLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCD 323
                      L  D  R E  G + +  +   I     EEA        P   +  G D
Sbjct: 253 SDESIANLWENLTGDDFRREALGIWDEHALSRAIDRRQWEEATI--ERRRPGGVMSFGID 310

Query: 324 IAEEGGDNTV---VVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEK--YRPDAIIIDANN 378
           +  +    T+   +          L ++  T+   T      L++K   +  +++ID  +
Sbjct: 311 MNPQRTRLTIGACMRYDDNTAHIELAEYRDTNQDGT-MWAVNLIDKVWEQTASLVIDGQS 369

Query: 379 TGARTCDYLEMLGYHV 394
                   L   G  V
Sbjct: 370 PATALLPDLAQAGVTV 385


>gi|254471818|ref|ZP_05085219.1| phage DNA Packaging Protein [Pseudovibrio sp. JE062]
 gi|211959020|gb|EEA94219.1| phage DNA Packaging Protein [Pseudovibrio sp. JE062]
          Length = 428

 Score = 42.4 bits (98), Expect = 0.20,   Method: Composition-based stats.
 Identities = 55/322 (17%), Positives = 104/322 (32%), Gaps = 35/322 (10%)

Query: 166 SKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLG-ILGFLTERNANRFW 224
                 + R +S E P+   G       A   DEA    +      +L F          
Sbjct: 119 EWPNGAIARAFSSEDPEALRGPQFD---AAWCDEAGKWSNATETFDMLQFGLRLGTQPQQ 175

Query: 225 IMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV 284
           ++T+ P+  S    ++  +                 +  +F + +  RYG    + R E+
Sbjct: 176 LVTTTPK--STPLLKMLLQDQRVVVTKAGTKSNAAFLAEAFLQQMAERYGGT-RLGRQEL 232

Query: 285 CGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGG---DNT-VVVLRRGP 340
            G+  +   D+       E  +NR         +++  D     G   D   ++      
Sbjct: 233 DGELIEDREDALFARKWFE--MNRVRHVPELKRIVVAIDPPATSGKSADACGIIAAGITE 290

Query: 341 VIE--HLFDWSKTDLRTTN--NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYR 396
             E   L D +   LR     ++   L  +   D ++ + N  G    + +E +   V  
Sbjct: 291 AAELFVLRDRTAQGLRPAAWADQAIRLYHELEADCLLAEVNQGGEMVREVIEGVDASV-- 348

Query: 397 VLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIE 456
                ++V     + RR E    +A   E    ++H G+   L+  +     + G     
Sbjct: 349 ---PVKSVHATRSKRRRAE---PVALLYEQGR-VHHCGVFPELED-ELADFGSGGL---- 396

Query: 457 SKRVKGAKSTDYSDGLMYTFAE 478
                  KS D  D L++   E
Sbjct: 397 ----SNGKSPDRLDALVWAITE 414


>gi|330507947|ref|YP_004384375.1| phage terminase, large subunit, PBSx family [Methanosaeta concilii
           GP-6]
 gi|328928755|gb|AEB68557.1| phage terminase, large subunit, PBSx family [Methanosaeta concilii
           GP-6]
          Length = 422

 Score = 42.0 bits (97), Expect = 0.21,   Method: Composition-based stats.
 Identities = 19/152 (12%), Positives = 46/152 (30%), Gaps = 9/152 (5%)

Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
            +   ++        +    DE +  P+     +L  L+++ A   ++   NP       
Sbjct: 101 ADNTSSYKKIEGESLLRAYVDEGTTIPENFTNMLLSRLSDKGA-CLYLTC-NPETPRNYI 158

Query: 238 YEIF--NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDS 295
           Y  +   +   + K ++        +   +   +   Y   +      + G +   +   
Sbjct: 159 YRNWIARQDELNIKVWKFTLDDNPYLPLEYKRDLEKEYPKGTVFYDRFILGNWVAAEGRV 218

Query: 296 FIPLNIIEEALNREPCPDPYAP--LIMGCDIA 325
           F         ++ E  P    P  L +G D  
Sbjct: 219 FGLFA---RGMHCEVPPATLRPKELRIGADYG 247


>gi|121606179|ref|YP_983508.1| hypothetical protein Pnap_3289 [Polaromonas naphthalenivorans CJ2]
 gi|120595148|gb|ABM38587.1| protein of unknown function DUF264 [Polaromonas naphthalenivorans
           CJ2]
          Length = 596

 Score = 42.0 bits (97), Expect = 0.21,   Method: Composition-based stats.
 Identities = 21/137 (15%), Positives = 39/137 (28%), Gaps = 20/137 (14%)

Query: 266 HEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL-NREPCPDPYAP------- 317
            + +   Y   ++     +   F      S  P+  +++ + +     + Y P       
Sbjct: 359 IDELRLEY--STEEFENLLMCGFIDDTQ-SVFPMAELQKCMVDSWVDWEDYKPFTARPYG 415

Query: 318 ---LIMGCDIAEEGGDNTVVVLRR----GPVIEHL--FDWSKTDLRTTNNKISGLVEKYR 368
              + +G D +  G     VVL      G     L    W   D       I  +  ++ 
Sbjct: 416 YRAVWVGYDPSHTGDTAGCVVLAAPLTPGGKFRVLERHQWRGLDFEAQAEAIRQITLRFN 475

Query: 369 PDAIIIDANNTGARTCD 385
              I ID    G     
Sbjct: 476 VQHIGIDTTGLGQGVYQ 492


>gi|301092109|ref|XP_002896227.1| N-acetyltransferase 10 [Phytophthora infestans T30-4]
 gi|262094857|gb|EEY52909.1| N-acetyltransferase 10 [Phytophthora infestans T30-4]
          Length = 1102

 Score = 42.0 bits (97), Expect = 0.22,   Method: Composition-based stats.
 Identities = 30/163 (18%), Positives = 57/163 (34%), Gaps = 17/163 (10%)

Query: 55  QLEFMEVVDAHCLNSVNNPNPEVFKGAIS--AGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
           Q   ++   A  L  V   + +  +  ++  AGRG GK+      +          ++  
Sbjct: 254 QARTLDQAKA-ILTFVEAVSEKTLRSTVALTAGRGRGKSAALGMSLA-GAVAYGYSNIFV 311

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
            A S   LKT  +  V K    L  K   + + +      +   V+  ++  + +     
Sbjct: 312 TAPSPENLKTV-FEFVFKGFDALKYKEHLDYEIVQSTNPEFNHAVVRVNIFREHR----- 365

Query: 173 CRTYSEERPDTFVGHHNT--YGMAIINDEASGTPDVINLGILG 213
            +T    +P     HH        +  DEA+  P  +   +LG
Sbjct: 366 -QTIQYIQPT----HHEKLAQAELVAIDEAAAIPLPVVKNLLG 403


>gi|94497317|ref|ZP_01303888.1| hypothetical protein SKA58_07183 [Sphingomonas sp. SKA58]
 gi|94423180|gb|EAT08210.1| hypothetical protein SKA58_07183 [Sphingomonas sp. SKA58]
          Length = 437

 Score = 42.0 bits (97), Expect = 0.22,   Method: Composition-based stats.
 Identities = 71/412 (17%), Positives = 130/412 (31%), Gaps = 63/412 (15%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           AGRG GKT   A  V  +    P   +  +  S  + ++ +                 E 
Sbjct: 58  AGRGFGKTRAGAEWVRGIAEADPAARIALVGASLGEARSVM----------------VEG 101

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKH-YSTMCRTYSEERPDTFVGHHNTYGMAIINDE--- 199
           +S  L  AP ++   +             +   +    P+   G   ++G     DE   
Sbjct: 102 ESGLLAIAPHWARPAYAPALRRLTWPNGAVAMLFGAADPEGLRGPQFSHG---WADEIAK 158

Query: 200 -ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTV 258
            ASG     +  ++G    R+       T  P  L      +  +  DD    +  T   
Sbjct: 159 WASGEA-AWHNLMMGMRLGRDPRVLVTTTPRPVPL---VRSLVARDGDDVVVTRGRTADN 214

Query: 259 E-GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317
           E  + P F   + A YG    + R E+ G+  ++   +     +IE+       P     
Sbjct: 215 EANLAPGFVAAMTAGYGGT-RLGRQELDGELIEEVEGALWTRALIEQCRV-VHVPGVLTR 272

Query: 318 LIMGCD-IAEEGGDNTVVV---LRRGPVIEHLFDWSKTDLRTT--NNKISGLVEKYRPDA 371
           +++  D  A  GGD   +V   +        + D S +  R       ++     +  D 
Sbjct: 273 VVVAVDPPASVGGDACGIVVAGMGGDGRAYVIADASVSGARPEGWARAVAAAAMVHGADR 332

Query: 372 IIIDANNTGARTCDYLE--MLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEF--- 426
           ++ +ANN GA     L        V  V   +           R E    + +       
Sbjct: 333 VVAEANNGGAMVESVLRAAEKTLPVKLVHASRGKA-------ARAEPVAALYEAGRVAHR 385

Query: 427 ASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478
            +       +  L +   ++ P               +S D +D L++  +E
Sbjct: 386 GAFPELEDEMCGLLAGGGYVGP--------------GRSPDRADALVWAMSE 423


>gi|323352542|gb|EGA85041.1| Kre33p [Saccharomyces cerevisiae VL3]
          Length = 966

 Score = 42.0 bits (97), Expect = 0.22,   Method: Composition-based stats.
 Identities = 26/136 (19%), Positives = 50/136 (36%), Gaps = 10/136 (7%)

Query: 78  FKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137
           F  A++AGRG GK+      +   +S     ++   + S   LKT L+  + K    L  
Sbjct: 187 FTVALTAGRGRGKSAALGISIAAAVS-HGYSNIFVTSPSPENLKT-LFEFIFKGFDALGY 244

Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197
           +   +   +      +   ++   +  D  H  T+     ++               ++ 
Sbjct: 245 QEHIDYDIIQSTNPDFNKAIVRVDIKRD--HRQTIQYIVPQDHQVLGQAE------LVVI 296

Query: 198 DEASGTPDVINLGILG 213
           DEA+  P  I   +LG
Sbjct: 297 DEAAAIPLPIVKNLLG 312


>gi|323335941|gb|EGA77219.1| Kre33p [Saccharomyces cerevisiae Vin13]
          Length = 961

 Score = 42.0 bits (97), Expect = 0.22,   Method: Composition-based stats.
 Identities = 26/136 (19%), Positives = 50/136 (36%), Gaps = 10/136 (7%)

Query: 78  FKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137
           F  A++AGRG GK+      +   +S     ++   + S   LKT L+  + K    L  
Sbjct: 187 FTVALTAGRGRGKSAALGISIAAAVS-HGYSNIFVTSPSPENLKT-LFEFIFKGFDALGY 244

Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197
           +   +   +      +   ++   +  D  H  T+     ++               ++ 
Sbjct: 245 QEHIDYDIIQSTNPDFNKAIVRVDIKRD--HRQTIQYIVPQDHQVLGQAE------LVVI 296

Query: 198 DEASGTPDVINLGILG 213
           DEA+  P  I   +LG
Sbjct: 297 DEAAAIPLPIVKNLLG 312


>gi|190409119|gb|EDV12384.1| hypothetical protein SCRG_03266 [Saccharomyces cerevisiae RM11-1a]
          Length = 1056

 Score = 42.0 bits (97), Expect = 0.22,   Method: Composition-based stats.
 Identities = 26/136 (19%), Positives = 50/136 (36%), Gaps = 10/136 (7%)

Query: 78  FKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137
           F  A++AGRG GK+      +   +S     ++   + S   LKT L+  + K    L  
Sbjct: 277 FTVALTAGRGRGKSAALGISIAAAVS-HGYSNIFVTSPSPENLKT-LFEFIFKGFDALGY 334

Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197
           +   +   +      +   ++   +  D  H  T+     ++               ++ 
Sbjct: 335 QEHIDYDIIQSTNPDFNKAIVRVDIKRD--HRQTIQYIVPQDHQVLGQAE------LVVI 386

Query: 198 DEASGTPDVINLGILG 213
           DEA+  P  I   +LG
Sbjct: 387 DEAAAIPLPIVKNLLG 402


>gi|151944405|gb|EDN62683.1| killer toxin resistant protein [Saccharomyces cerevisiae YJM789]
 gi|207341763|gb|EDZ69729.1| YNL132Wp-like protein [Saccharomyces cerevisiae AWRI1631]
 gi|256273837|gb|EEU08759.1| Kre33p [Saccharomyces cerevisiae JAY291]
 gi|259149229|emb|CAY82471.1| Kre33p [Saccharomyces cerevisiae EC1118]
          Length = 1056

 Score = 42.0 bits (97), Expect = 0.22,   Method: Composition-based stats.
 Identities = 26/136 (19%), Positives = 50/136 (36%), Gaps = 10/136 (7%)

Query: 78  FKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137
           F  A++AGRG GK+      +   +S     ++   + S   LKT L+  + K    L  
Sbjct: 277 FTVALTAGRGRGKSAALGISIAAAVS-HGYSNIFVTSPSPENLKT-LFEFIFKGFDALGY 334

Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197
           +   +   +      +   ++   +  D  H  T+     ++               ++ 
Sbjct: 335 QEHIDYDIIQSTNPDFNKAIVRVDIKRD--HRQTIQYIVPQDHQVLGQAE------LVVI 386

Query: 198 DEASGTPDVINLGILG 213
           DEA+  P  I   +LG
Sbjct: 387 DEAAAIPLPIVKNLLG 402


>gi|6324197|ref|NP_014267.1| Kre33p [Saccharomyces cerevisiae S288c]
 gi|1730777|sp|P53914|KRE33_YEAST RecName: Full=UPF0202 protein KRE33; AltName: Full=Killer
           toxin-resistance protein 33
 gi|854505|emb|CAA86893.1| orf16 [Saccharomyces cerevisiae]
 gi|1302072|emb|CAA96014.1| unnamed protein product [Saccharomyces cerevisiae]
 gi|285814522|tpg|DAA10416.1| TPA: Kre33p [Saccharomyces cerevisiae S288c]
          Length = 1056

 Score = 42.0 bits (97), Expect = 0.22,   Method: Composition-based stats.
 Identities = 26/136 (19%), Positives = 50/136 (36%), Gaps = 10/136 (7%)

Query: 78  FKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137
           F  A++AGRG GK+      +   +S     ++   + S   LKT L+  + K    L  
Sbjct: 277 FTVALTAGRGRGKSAALGISIAAAVS-HGYSNIFVTSPSPENLKT-LFEFIFKGFDALGY 334

Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197
           +   +   +      +   ++   +  D  H  T+     ++               ++ 
Sbjct: 335 QEHIDYDIIQSTNPDFNKAIVRVDIKRD--HRQTIQYIVPQDHQVLGQAE------LVVI 386

Query: 198 DEASGTPDVINLGILG 213
           DEA+  P  I   +LG
Sbjct: 387 DEAAAIPLPIVKNLLG 402


>gi|260890025|ref|ZP_05901288.1| 3-isopropylmalate dehydratase, small subunit [Leptotrichia
           hofstadii F0254]
 gi|260860631|gb|EEX75131.1| 3-isopropylmalate dehydratase, small subunit [Leptotrichia
           hofstadii F0254]
          Length = 191

 Score = 42.0 bits (97), Expect = 0.22,   Method: Composition-based stats.
 Identities = 28/135 (20%), Positives = 51/135 (37%), Gaps = 6/135 (4%)

Query: 339 GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY-----LEMLGYH 393
           G       +W   + RT N   +    +Y+   I+I  +N G  +        L+  G+H
Sbjct: 37  GFGQYVFDEWRYNEDRTDNMDFNLNKPEYKTGTILITGDNFGCGSSREHAAWALQDYGFH 96

Query: 394 VYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGEL 453
           V    G      + +  N    + +  AD LE A L   + ++ +L++ K          
Sbjct: 97  VIVAGGYSGIFYMNWLNNGHLPITLPEADRLELAKLPGDAKVVVDLENNKLTANRKDYFF 156

Query: 454 AI-ESKRVKGAKSTD 467
            + ES + +  K  D
Sbjct: 157 ELEESWKQRLLKGLD 171


>gi|167462274|ref|ZP_02327363.1| hypothetical protein Plarl_06915 [Paenibacillus larvae subsp.
           larvae BRL-230010]
 gi|322382817|ref|ZP_08056660.1| phage-related terminase-like protein large subunit [Paenibacillus
           larvae subsp. larvae B-3650]
 gi|321153200|gb|EFX45647.1| phage-related terminase-like protein large subunit [Paenibacillus
           larvae subsp. larvae B-3650]
          Length = 423

 Score = 42.0 bits (97), Expect = 0.23,   Method: Composition-based stats.
 Identities = 46/302 (15%), Positives = 97/302 (32%), Gaps = 42/302 (13%)

Query: 179 ERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFY 238
           ++P      HN     +  +E S         +LG L     +   ++++NP       +
Sbjct: 105 DKPAKLKSIHNVS--IVWIEECSEVKYEGFKELLGRLRHPALDLHMLLSTNPVGEDNWTF 162

Query: 239 EIFNKP----------LDDWKRFQI----------DTRTVEGIDPSFHEGIIARYGLDSD 278
           + F K            D +++  I                 +  S+   +      D D
Sbjct: 163 KHFFKDELKNHIVLEDTDLYEKRTIVKNDTFYHHSTAEDNLFLPKSYVAQLDELKAYDPD 222

Query: 279 VTRVEVCGQFPQQDIDSF-----IPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTV 333
           + R+   G+F    +             + EA++R   P        G D   E   N V
Sbjct: 223 LYRIAREGRFGVNGVRVLPQFEVASHEEVIEAISRIRKPIE----RTGMDFGFEDSYNAV 278

Query: 334 VVLRRGPVIEHL-FDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGY 392
           V L      + L   W     + T+++ +  ++++     +I A++   +T  Y    G+
Sbjct: 279 VRLAVDHEQKILYIYWEYYKNQMTDDRTAEALQEFARTKELIKADSAEPKTIRYFRQKGF 338

Query: 393 HVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGE 452
           ++        +         R +   K+  + +          I+ LK L ++     G 
Sbjct: 339 NMRPAKKFPGS---------RLQYTKKIKRFKKIICSEKCPNTIRELKYL-TYKTDKNGR 388

Query: 453 LA 454
           + 
Sbjct: 389 IL 390


>gi|149190524|ref|ZP_01868794.1| terminase, ATPase subunit [Vibrio shilonii AK1]
 gi|148835648|gb|EDL52615.1| terminase, ATPase subunit [Vibrio shilonii AK1]
          Length = 584

 Score = 42.0 bits (97), Expect = 0.23,   Method: Composition-based stats.
 Identities = 37/243 (15%), Positives = 75/243 (30%), Gaps = 34/243 (13%)

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
             +D    +G D  F+   + R     +V    +  +F      SF  L  +        
Sbjct: 327 ITVDDAIAKGGDKLFNMAKLKRKYPVKEVFDNLLRCKFLDDS-TSFFALKALLACKTDTE 385

Query: 311 ----------CPDPYAPLIMGCDI----AEEGGDNT--VVVLR---RGPVIEHLFDWS-- 349
                      P     +++G D       EG D+   VV L+   +G V   +      
Sbjct: 386 NWKDVDHNKARPVGNEEVLVGYDPRGGGTGEGSDDAGLVVALKPKTKGGVFRAIEKVRLK 445

Query: 350 KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409
            +        I G+ EKY    + +D    G+   + +      +  +         E  
Sbjct: 446 GSSYEQQAETIRGITEKYNVVYLAMDTGGVGSAVAELVRKFYPALVELN-----YSPEM- 499

Query: 410 RNRRTELHVKMADWLEFASLI---NHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKST 466
             +R  +  K  + +     +   +   L+ +   ++      +G++   S R K     
Sbjct: 500 --KRM-MAYKAREIINNGRFLFDDDWDDLVHSFLMIRQQTTDRSGQVTFVSNRSKIGSHA 556

Query: 467 DYS 469
           D +
Sbjct: 557 DLA 559


>gi|262047916|ref|ZP_06020862.1| terminase large subunit [Lactobacillus crispatus MV-3A-US]
 gi|260571794|gb|EEX28369.1| terminase large subunit [Lactobacillus crispatus MV-3A-US]
          Length = 644

 Score = 42.0 bits (97), Expect = 0.25,   Method: Composition-based stats.
 Identities = 25/156 (16%), Positives = 45/156 (28%), Gaps = 19/156 (12%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAW----LVLWLMSTRPGI 108
            WQ   + +++    ++           ++  GRG GKT +        VL         
Sbjct: 112 DWQKFILAMING-WKDANGERRYTDIHISV--GRGQGKTQIAGIQMCKAVLIDTLNFTNK 168

Query: 109 SVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKH 168
             +  AN+  Q  T L+  V K L  +     F   +         + ++          
Sbjct: 169 DFLITANTSDQ-STKLFGYVKKMLEAVIKIEPFASIAKESGLDLQTNQIIEKETNNKVWK 227

Query: 169 YSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204
            S     Y            +T+ +  I DE     
Sbjct: 228 ISYEADKYD-----------STHNVLAIYDETGALD 252


>gi|66395973|ref|YP_240307.1| ORF008 [Staphylococcus phage ROSA]
 gi|62636393|gb|AAX91504.1| ORF008 [Staphylococcus phage ROSA]
          Length = 421

 Score = 42.0 bits (97), Expect = 0.25,   Method: Composition-based stats.
 Identities = 38/309 (12%), Positives = 94/309 (30%), Gaps = 35/309 (11%)

Query: 83  SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFE 142
             GRG GK++  + ++   +  R  ++ + +  ++  L T+++ ++   +      H F+
Sbjct: 33  KGGRGSGKSSDISIIIT-QLIMRYPMNAVVIRKTDNTLATSVFEQIKWAIEEQKVTHLFK 91

Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY---GMAIINDE 199
           ++                 +    +    + R    + P+      ++     +A I + 
Sbjct: 92  VKVS------------PMEITYIPRGNRIIFRGA--QNPERLKSLKDSRFPFSIAWIEEL 137

Query: 200 ASGTPDVINLGILGFL----TERNANRFWIMTSNPRRLSGKF----YEIFNKPLDDWKRF 251
           A    +     I   L     +      +  + NP +    +    YE   +  + +   
Sbjct: 138 AEFKTEDEVTTITNSLLRGELDEGLFYKFFFSYNPPKRKQSWVNKKYESSFQADNTFVHH 197

Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
                    I   F +   +    +    R E  G+         +P N +      +  
Sbjct: 198 S-TYLNNPFISKQFIQEAESAKKRNEQRYRWEYMGEAIGS---GVVPFNNLRIEEIPQGQ 253

Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            D +  +    D      D    V     ++  VI  + +     +           + Y
Sbjct: 254 YDTFDNIRNAVDFG-YATDPLAFVRWHYDKKKRVIYAMDEHYGVQISNREFANWLKKKGY 312

Query: 368 RPDAIIIDA 376
           + D I  D+
Sbjct: 313 QSDEIFADS 321


>gi|78043214|ref|YP_360500.1| prophage LambdaCh01, PBSX family terminase large subunit
           [Carboxydothermus hydrogenoformans Z-2901]
 gi|77995329|gb|ABB14228.1| prophage LambdaCh01, terminase, large subunit, PBSX family
           [Carboxydothermus hydrogenoformans Z-2901]
          Length = 420

 Score = 42.0 bits (97), Expect = 0.25,   Method: Composition-based stats.
 Identities = 54/339 (15%), Positives = 100/339 (29%), Gaps = 42/339 (12%)

Query: 83  SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFE 142
             GRG GK++  +  ++  M   P  + + L   +  LK +++ ++   +  L    +++
Sbjct: 32  KGGRGSGKSSFASIEIILGMMKDPNANAVVLRKVKETLKDSVFEQLIWAIEKLKVSDYWD 91

Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAII----ND 198
                +   P     +     I  +      +  S +         +   +  I     D
Sbjct: 92  -----IKHNPMEMTYIPTGQKILFRGADKPKKIRSTKV--------SKGYIKFIWYEEVD 138

Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSN-PRRLSGKFYEIFNKPLDDWKRFQIDTRT 257
           E +G  ++    I   L           T N P R++    E       D K       T
Sbjct: 139 EFNGMEEI--RIINQSLMRGGEQFVVFYTYNPPNRVNAWVNEEILIERPDRKVHHSTYLT 196

Query: 258 VEG--IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF----IPLNIIEE--ALNRE 309
           V    +   F         ++    R E  G+      + F    I     +E  A +R 
Sbjct: 197 VPREWLGEQFLIEAEHLKRINERAYRHEYLGEITGTGGEIFSNITIRKITDDEIKAFDRI 256

Query: 310 PCPDPYAPLIMGCDIAEEGG---DNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEK 366
                +       D         D T    RR   I +         R     I    + 
Sbjct: 257 RRGIDWG---YAVDPVHYTVCHYDRT----RRRLFIFYEIHQVGLSNRRLAELIKEENKL 309

Query: 367 YRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVD 405
             P    I A++   ++   L+  G  VY       +V+
Sbjct: 310 NSP----ITADSAEPKSIAELKSYGLKVYGAKKGPGSVE 344


>gi|253682970|ref|ZP_04863757.1| hypothetical protein CLG_B2294 [Clostridium phage D-1873]
 gi|253560896|gb|EES90358.1| hypothetical protein CLG_B2294 [Clostridium phage D-1873]
          Length = 611

 Score = 42.0 bits (97), Expect = 0.25,   Method: Composition-based stats.
 Identities = 29/135 (21%), Positives = 50/135 (37%), Gaps = 30/135 (22%)

Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCP-------DPYAPLIMGCDIAEEGG---DNT 332
           E    + +    +F  L+ I      + C              I+  D+A +GG   D +
Sbjct: 310 EYRSIWIKFSDKAFFKLDDINNCRVIKHCELEADFKNHKDDFYIISYDVARQGGTANDAS 369

Query: 333 VVVLRR------GPVIEHLFD-WSKTDLRTTNNK-------------ISGLVEKYRPDAI 372
           +  + R      G   +++   +S  D    NN              +  LVEKY+  A+
Sbjct: 370 IATIFRCTPRTDGSYFKNVVAMYSCEDKNKNNNNVNSIMHFKNQCIMLKRLVEKYQAKAL 429

Query: 373 IIDANNTGARTCDYL 387
           ++D N  G+   DYL
Sbjct: 430 LVDINGIGSGLLDYL 444


>gi|320532097|ref|ZP_08032978.1| hypothetical protein HMPREF9057_00846 [Actinomyces sp. oral taxon
           171 str. F0337]
 gi|320135702|gb|EFW27769.1| hypothetical protein HMPREF9057_00846 [Actinomyces sp. oral taxon
           171 str. F0337]
          Length = 370

 Score = 42.0 bits (97), Expect = 0.25,   Method: Composition-based stats.
 Identities = 33/175 (18%), Positives = 53/175 (30%), Gaps = 12/175 (6%)

Query: 60  EVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQ 119
            V++    +  + P        I+  RG+GKT     L     S R    V+    +   
Sbjct: 21  RVIEEFLESLDDGPGAPGLLELITGARGVGKTV---MLTALGDSARERGWVVIDETAREG 77

Query: 120 LKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE 179
           L   L AE ++ LS L  K    + SLSL                    +    R  ++ 
Sbjct: 78  LMDRLAAEFTRQLSQLAGKERSRLTSLSLSTPLGGGSATLEHAPTPEPSWRQKARALTQW 137

Query: 180 RPDTFVGHHNTYGMAIINDEASGTPDVINLGILGF---LTERNANRFWIMTSNPR 231
             +   G      + +  DE    P      +      L    A    +M   P+
Sbjct: 138 LAEHGTG------LLLTIDEVHAIPREELRALSAEVQHLIREGAPIGLLMAGLPK 186


>gi|171681273|ref|XP_001905580.1| hypothetical protein [Podospora anserina S mat+]
 gi|170940595|emb|CAP65823.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1721

 Score = 42.0 bits (97), Expect = 0.26,   Method: Composition-based stats.
 Identities = 26/129 (20%), Positives = 42/129 (32%), Gaps = 18/129 (13%)

Query: 87   GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSL 146
            G GKT     ++L L S  P   ++  A +   +   L     ++LSL P      + + 
Sbjct: 1332 GTGKTETILSIILSLQSHFPDSRILLTAPTHNAVDNVL----RRYLSLNPTHPPLRISTE 1387

Query: 147  SLHPAP-WYSDVLHCSLGIDSKHYSTMCRT-------------YSEERPDTFVGHHNTYG 192
                +P      L    GI+     +   T             +S     +     N   
Sbjct: 1388 IRKVSPDVTPYTLDAMAGIELNTLHSRAETTKAKKRVKAAKIVFSTCIGSSLGLLRNEMF 1447

Query: 193  MAIINDEAS 201
              +I DEAS
Sbjct: 1448 DIVIIDEAS 1456


>gi|323487253|ref|ZP_08092556.1| hypothetical protein HMPREF9474_04307 [Clostridium symbiosum
           WAL-14163]
 gi|323399479|gb|EGA91874.1| hypothetical protein HMPREF9474_04307 [Clostridium symbiosum
           WAL-14163]
          Length = 550

 Score = 42.0 bits (97), Expect = 0.26,   Method: Composition-based stats.
 Identities = 17/84 (20%), Positives = 33/84 (39%), Gaps = 10/84 (11%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKG-AISAGRGIGKTTLNAWLVLW--LMSTRPGIS 109
           +WQ   + ++       V+     +F+   I  GR  GKT   + ++ +   +    G  
Sbjct: 71  TWQKSTVSIM----FGIVDEAGIRIFREFLIVIGRKNGKTLFASGIIAYCLFLDGEYGAK 126

Query: 110 VICLANSETQ---LKTTLWAEVSK 130
           V C+A    Q   +  + W  + K
Sbjct: 127 VFCVAPKLDQADLVYQSFWQTIQK 150


>gi|326772022|ref|ZP_08231307.1| conserved hypothetical protein [Actinomyces viscosus C505]
 gi|326638155|gb|EGE39056.1| conserved hypothetical protein [Actinomyces viscosus C505]
          Length = 370

 Score = 41.6 bits (96), Expect = 0.27,   Method: Composition-based stats.
 Identities = 33/175 (18%), Positives = 53/175 (30%), Gaps = 12/175 (6%)

Query: 60  EVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQ 119
            V++    +  + P        I+  RG+GKT     L     S R    V+    +   
Sbjct: 21  RVIEEFLESLDDGPGAPGLLELITGARGVGKTV---MLTALGDSARERGWVVVDETAREG 77

Query: 120 LKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE 179
           L   L AE ++ LS L  K    + SLSL                    +    R  ++ 
Sbjct: 78  LMDRLAAEFTRQLSQLAGKERSRLTSLSLSTPLGGGSATLEHAPTPEPSWRQKARALTQW 137

Query: 180 RPDTFVGHHNTYGMAIINDEASGTPDVINLGILGF---LTERNANRFWIMTSNPR 231
             +   G      + +  DE    P      +      L    A    +M   P+
Sbjct: 138 LAEHGTG------LLLTIDEVHAIPREELRALSAEVQHLIREGAPIGLLMAGLPK 186


>gi|118380585|ref|XP_001023456.1| Type III restriction enzyme, res subunit family protein
           [Tetrahymena thermophila]
 gi|89305223|gb|EAS03211.1| Type III restriction enzyme, res subunit family protein
           [Tetrahymena thermophila SB210]
          Length = 1858

 Score = 41.6 bits (96), Expect = 0.27,   Method: Composition-based stats.
 Identities = 23/121 (19%), Positives = 43/121 (35%), Gaps = 11/121 (9%)

Query: 80  GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKH 139
             ++   G+GKT + A ++L      P   +  LA +   +   +  E      L+    
Sbjct: 154 TLVALPTGLGKTFIAATVILNYYLWFPKGKIFFLAPTRPLVNQQM--ECLSQFELINKND 211

Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            FEM     +P P   DV      +  + +    +T   +  +    +       +I DE
Sbjct: 212 IFEMTGN--YPIPKRRDVY-----LRKRIFFCTPQTLENDLIE--QRYDGYNLSLVIFDE 262

Query: 200 A 200
           A
Sbjct: 263 A 263


>gi|281491541|ref|YP_003353521.1| phage terminase [Lactococcus lactis subsp. lactis KF147]
 gi|281375259|gb|ADA64772.1| Phage protein, terminase [Lactococcus lactis subsp. lactis KF147]
          Length = 469

 Score = 41.6 bits (96), Expect = 0.27,   Method: Composition-based stats.
 Identities = 57/350 (16%), Positives = 105/350 (30%), Gaps = 53/350 (15%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            WQ   ++ V A   + +       +          GKT +   L LW +    G+S++ 
Sbjct: 41  PWQKNLLKEVMAIDEDGLWTHQKFGYSIPRRN----GKTEIVYILELWSL--EQGLSILH 94

Query: 113 LANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTM 172
            A+  +   ++    + K+L         + +S+         + L          + T 
Sbjct: 95  TAHRISTSHSSYEK-LKKYLEDSGYVEGEDFKSIKAK----GQERLELIESGGVIQFRT- 148

Query: 173 CRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRR 232
            RT S    + F          ++ DEA          +   +T+ + N   IM   P  
Sbjct: 149 -RTSSGGLGEGFD--------ILVIDEAQEYTTEQESALKYTVTDSD-NPMTIMCGTPPT 198

Query: 233 L------SGKFYE---IFNKPLDDWKRFQI-------DTRTVEGIDPS-----FHEGIIA 271
                     + +           W  + +       D       +PS         I A
Sbjct: 199 PISSGTVFTNYRDNTLAGKAKYSGWAEWSVEDVKDIHDVEAWYNSNPSMGYHLNERKIEA 258

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIP-LNIIEEALNREPCPDPYAPLIMGCDIAEEGGD 330
             G D     V+  G +P+ +  S I         +NR P       L +G    + G D
Sbjct: 259 ELGEDKLDHNVQRLGYWPKYNQKSVISEQEWNVLKVNRLPVIK--GKLFVGI---KYGND 313

Query: 331 NTVVVLRRGPVIEH----LFDWSKTDLRTTNNKISGLVEKYRPDAIIIDA 376
              V +            +       +R  N  I   ++K   + ++ID 
Sbjct: 314 GANVAMSIAVKTLSGKVFVETIDCQSIRNGNQWIINFLKKADVEKVVIDG 363


>gi|29826542|ref|NP_821176.1| hypothetical protein SAV_2 [Streptomyces avermitilis MA-4680]
 gi|29603638|dbj|BAC67711.1| hypothetical protein [Streptomyces avermitilis MA-4680]
          Length = 77

 Score = 41.6 bits (96), Expect = 0.27,   Method: Composition-based stats.
 Identities = 10/47 (21%), Positives = 20/47 (42%), Gaps = 3/47 (6%)

Query: 74  NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
            P+  +G I +  G GKT++ A      ++  P   ++    +   L
Sbjct: 2   PPQGARGTIVSATGSGKTSMAAAST---LNCFPEGRILVTVPTLDLL 45


>gi|66396048|ref|YP_240381.1| ORF008 [Staphylococcus phage 71]
 gi|62636467|gb|AAX91578.1| ORF008 [Staphylococcus phage 71]
          Length = 421

 Score = 41.6 bits (96), Expect = 0.27,   Method: Composition-based stats.
 Identities = 38/309 (12%), Positives = 94/309 (30%), Gaps = 35/309 (11%)

Query: 83  SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFE 142
             GRG GK++  + ++   +  R  ++ + +  ++  L T+++ ++   +      H F+
Sbjct: 33  KGGRGSGKSSDISIIIT-QLIMRYPMNAVVIRKTDNTLATSVFEQIKWAIEEQKVSHLFK 91

Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY---GMAIINDE 199
           ++                 +    +    + R    + P+      ++     +A I + 
Sbjct: 92  VKVS------------PMEITYIPRGNRIIFRGA--QNPERLKSLKDSRFPFSVAWIEEL 137

Query: 200 ASGTPDVINLGILGFL----TERNANRFWIMTSNPRRLSGKF----YEIFNKPLDDWKRF 251
           A    +     I   L     +      +  + NP +    +    YE   +  + +   
Sbjct: 138 AEFKTEDEVTTITNSLLRGELDEGLFYKFFFSYNPPKRKQSWVNKKYESSFQADNTFVHH 197

Query: 252 QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC 311
                    I   F +   +    +    R E  G+         +P N +      +  
Sbjct: 198 S-TYLNNPFISKQFIQEAESAKKRNEQRYRWEYMGEAIGS---GVVPFNNLRIEEIPQGQ 253

Query: 312 PDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKY 367
            D +  +    D      D    V     ++  VI  + +     +           + Y
Sbjct: 254 YDTFDNIRNAVDFG-YATDPLAFVRWHYDKKKRVIYAMDEHYGVQISNREFANWLKKKGY 312

Query: 368 RPDAIIIDA 376
           + D I  D+
Sbjct: 313 QSDEIFADS 321


>gi|326385269|ref|ZP_08206932.1| putative phage terminase protein [Gordonia neofelifaecis NRRL
           B-59395]
 gi|326196012|gb|EGD53223.1| putative phage terminase protein [Gordonia neofelifaecis NRRL
           B-59395]
          Length = 439

 Score = 41.6 bits (96), Expect = 0.28,   Method: Composition-based stats.
 Identities = 58/384 (15%), Positives = 115/384 (29%), Gaps = 49/384 (12%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTR-PGISVI 111
            WQ+   +++     +        V      +    GKT L A ++L  +     G  V 
Sbjct: 11  PWQILAADLIGECDASGRLIHPLVVVTVPRQS----GKTALLAAVMLHRLIMLGEGGRVW 66

Query: 112 CLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYST 171
             A +  + +  +W E+   +               +       D     LG  ++    
Sbjct: 67  YTAQTGIKAREQMW-EMMDAIDRSALGPL-------IKSKRGAGDTSMELLGTGARA--- 115

Query: 172 MCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLT---ERNANRFWIMTS 228
                    PD+  G+ +      + DEA    +    G++G +T       N   I+ S
Sbjct: 116 ---KMHPPTPDSLHGNQSDLN---VIDEAWFFDEPQAHGLMGAITPTQSTRPNAQTIIIS 169

Query: 229 NPRRLSGKF-YEIFNKPLDDWKRFQIDTRTVEGIDPS----------------FHEGIIA 271
                   + +++  +  D      +D    +G+ P                     + A
Sbjct: 170 TAGTAESVWFHDLVARGHDGALCL-VDYGVADGVTPDDYPAIAAAHPAIGHTQKAAILPA 228

Query: 272 RYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDN 331
                S    +   G    +     +P  I++ A    P P   A ++ GC ++ E  D 
Sbjct: 229 AREQLSSGEFLRAYGNVRTRTESRLLPAEIVDAATTTTPLPATGA-VVFGCALSFERDDA 287

Query: 332 TVV---VLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
            +V       G  +  L     T       + + L +++    + I          D  +
Sbjct: 288 AIVACMAADDGTPVVELVARF-TSAEGVAARCAELTDRHGGH-VAIAPAGPAGSIADDAD 345

Query: 389 MLGYHVYRVLGQKRAVDLEFCRNR 412
            LG  V R    + +       +R
Sbjct: 346 RLGATVTRYADAELSSSTADFLDR 369


>gi|224586458|ref|YP_002640348.1| phage terminase, large subunit, pbsx family [Borrelia valaisiana
           VS116]
 gi|224497449|gb|ACN53076.1| phage terminase, large subunit, pbsx family [Borrelia valaisiana
           VS116]
          Length = 359

 Score = 41.6 bits (96), Expect = 0.28,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 53/163 (32%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 57  YGGDKASDFERFIGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDHPEH 114

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC-GQFPQQDID 294
            F   +   +  +K +   T     +   F E     Y  +    +  V  G +      
Sbjct: 115 YFKTDYIDNVATFKTYNFTTYDNVLLSKVFIETQEKLY-KEIPTYKARVLLGAWIASTDS 173

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NII++ +   P        I   D A   GGDNT + +
Sbjct: 174 IFTQINIIQDYVFTSP--------IAYLDPAFSIGGDNTALCV 208


>gi|297618941|ref|YP_003707046.1| hypothetical protein Mvol_0413 [Methanococcus voltae A3]
 gi|297377918|gb|ADI36073.1| hypothetical protein Mvol_0413 [Methanococcus voltae A3]
          Length = 576

 Score = 41.6 bits (96), Expect = 0.28,   Method: Composition-based stats.
 Identities = 43/265 (16%), Positives = 80/265 (30%), Gaps = 53/265 (20%)

Query: 55  QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWL-----VLWLMSTRPGIS 109
           Q E ++ +  + L+             +  G+G GK  + + L     +  +++  P   
Sbjct: 93  QAEILKKMKKNYLS------------TVLVGKGGGKDFMTSLLFNDELIDLILTDIPYTR 140

Query: 110 V--ICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSK 167
           V  I +A +        + E  +W S    K W   ++    P    +  +     +   
Sbjct: 141 VDFINIAPNADLAHNVFFREFKQWFSR--CKLWKLFKNSEKSPIKINNTFIKIGDLVKI- 197

Query: 168 HYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFL--------TERN 219
                  T    R  +F G   T    I+ DE     D   +              T   
Sbjct: 198 -------TSGHSRSASFEG---TNPKCIVIDE---ISDENFMNAEKIFYQAKSSVQTRWG 244

Query: 220 ANRFWIMTS-------NPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIAR 272
            +   I+ S       NP    G  Y+I+++ L     F     T E             
Sbjct: 245 KDGKVILISWTRFPTPNPLDDIG--YKIYSENLGIDDVFSFKGATWEVNSHRSKFDFEDD 302

Query: 273 YGLDSDVTRVEVCGQFPQQDIDSFI 297
           Y  +  + +     + P+   + FI
Sbjct: 303 YKRNGVLAKKMYECKPPELS-NYFI 326


>gi|34365522|tpg|DAA01288.1| TPA_exp: replicase/helicase/endonuclease [Danio rerio]
          Length = 3007

 Score = 41.6 bits (96), Expect = 0.29,   Method: Composition-based stats.
 Identities = 28/123 (22%), Positives = 44/123 (35%), Gaps = 10/123 (8%)

Query: 1    MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60
            M  +L    E E+ + DL      K++      +      +   L    +    QL    
Sbjct: 2248 MKDKLQQVEEHEEHIPDLASEANQKVAHLEKKNNIM---CRRDGLALIRSLNDTQLSIFY 2304

Query: 61   VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTL------NAWLVLWLMSTRPG-ISVICL 113
             +   CL+ V   NP      I+ G G GK+ L       A  +L  +   P  ISV+  
Sbjct: 2305 EIRQWCLDKVMGKNPSPVHLFITGGAGTGKSHLIKAIQYEAMRILSTVCRHPDNISVLLT 2364

Query: 114  ANS 116
            A +
Sbjct: 2365 APT 2367


>gi|85709622|ref|ZP_01040687.1| Phage DNA Packaging Protein [Erythrobacter sp. NAP1]
 gi|85688332|gb|EAQ28336.1| Phage DNA Packaging Protein [Erythrobacter sp. NAP1]
          Length = 441

 Score = 41.6 bits (96), Expect = 0.29,   Method: Composition-based stats.
 Identities = 72/413 (17%), Positives = 126/413 (30%), Gaps = 56/413 (13%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141
           I AGRG GKT   A  V  +  +     +  +++S  + +  +    S  L+  P     
Sbjct: 55  IMAGRGFGKTRAGAEWVRSIAESHSEARIALVSSSLAEARAVMVEGESGLLACSP----- 109

Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE-- 199
                     P        SL             YS   P+   G   ++      DE  
Sbjct: 110 ----------PDRRPEFEPSLRRVRFPNGAEAHLYSAGEPEALRGPQFSHA---WCDEVG 156

Query: 200 ----ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDT 255
               +          +L  L   +  R  + T+ PR +      +  +        +  T
Sbjct: 157 KWPISHSRATRAWDNLLMGLRLGDDPRIAV-TTTPRAVPLVQRLLKQETSQATAVTRGST 215

Query: 256 RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315
                  P+     IA     S + R E+ G+  +    +    +++E++   E  P  +
Sbjct: 216 YDNSANLPARFLEAIADEFAGSQLGRQEIEGELIEDIEGALWSRSLLEQSKE-EAGPPGF 274

Query: 316 APLIMGCD-IAEEGGDNT---VVVLRRGPVIEHLFD--WSKTDLRTTNNKISGLVEKYRP 369
             +++G D      GD     V  L        L D   ++         ++     +R 
Sbjct: 275 RRIVIGVDPPTSSTGDECGIVVAALGEDNKAWVLADCSVARAQPEQWARAVAEAAHHWRS 334

Query: 370 DAIIIDANNTGARTCDYLE--MLGYHVYRVLGQKRAVDLEFCRNRRTE--LHVKMADWLE 425
           D II +AN  G      L     G  V  V   +  V        R E    +  +D + 
Sbjct: 335 DRIIAEANQGGEMVESVLRAADAGLPVKLVHASRGKV-------ARAEPVAALYASDRVR 387

Query: 426 FASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478
            A   N   L   +             + I  +     +S D  D L++  +E
Sbjct: 388 HAG--NFPQLQDQMCG-----------MLIGGEYAGPGRSPDRLDALVWALSE 427


>gi|300173892|ref|YP_003773058.1| phage terminase large subunit [Leuconostoc gasicomitatum LMG 18811]
 gi|299888271|emb|CBL92239.1| phage terminase, large subunit, pbsx family [Leuconostoc
           gasicomitatum LMG 18811]
          Length = 427

 Score = 41.6 bits (96), Expect = 0.30,   Method: Composition-based stats.
 Identities = 58/331 (17%), Positives = 104/331 (31%), Gaps = 26/331 (7%)

Query: 74  NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICL---ANSETQLKTTLWAEVSK 130
           N +    A    RG GK+   A  V+  + T+P ++ + L   AN+  Q   + +  + K
Sbjct: 21  NSKARYIAYKGSRGSGKSEGVATKVILDIVTKPYVNWLVLRRYANTNRQ---STFTLLQK 77

Query: 131 WLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT 190
             + +     F+    SL    +             K  S    +               
Sbjct: 78  VANRMGVGSLFQFNG-SLPEITFKPTGQKILFRGADKPLSITSISVETGNLCRLW-VEEA 135

Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWK- 249
           Y M +   E S   + ++  + G + + +     ++T NP        + F         
Sbjct: 136 YQMEL---EESF--ETVDESMRGVIDDPDGFYQTVLTFNPWNERHWLKKRFFDEDTRVNN 190

Query: 250 --RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
                   +    +D  +   ++     +    RV V G++     +  I  N I E  +
Sbjct: 191 SLAITTTYKDNPFLDVDYVNRLLEMKKRNPRRARVAVDGEW--GVAEGLIYENTIVEKFD 248

Query: 308 -REPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISG 362
            RE     +  ++ G D    G D T  +      +   I  L +  K  + T       
Sbjct: 249 IREVLKGSH--IVRGMDWG-YGPDPTTFIEYAINTKTKDIYILKEMYKQHMLTDEIFKWL 305

Query: 363 LVEKYRPDAIIIDANNTGARTCDYLEMLGYH 393
            V  Y+   I  D  N G R    L   G  
Sbjct: 306 YVHGYQQGDIRADYANGGDRMIQELRNKGIR 336


>gi|238581544|ref|XP_002389644.1| hypothetical protein MPER_11197 [Moniliophthora perniciosa FA553]
 gi|215452133|gb|EEB90574.1| hypothetical protein MPER_11197 [Moniliophthora perniciosa FA553]
          Length = 633

 Score = 41.6 bits (96), Expect = 0.31,   Method: Composition-based stats.
 Identities = 25/159 (15%), Positives = 52/159 (32%), Gaps = 18/159 (11%)

Query: 87  GIGKTTLNAWLVLWLMSTRPGISVICLANS-----------ETQLKTTLWAEVSKWLSLL 135
           G GKT      +L L+S  P   ++  A S            +  ++ L+   +      
Sbjct: 480 GTGKTVTAVEAILQLLSANPNARILACAPSNSAADLIAMRLRSLGESGLFRAYAPSRDRE 539

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
              H   +     +    +S  L   +    K +  +  T         +G    +   I
Sbjct: 540 QVPHEL-LPFTYQNATGHFSVPLLSRM----KRFRAVVTTCVSANIIAGIGIPRGHYTHI 594

Query: 196 INDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLS 234
             DEA    +     ++   T  + N   +++ +P++L 
Sbjct: 595 FVDEAGQATEP--EVMIAIKTMADMNTNVVLSGDPKQLG 631


>gi|156933807|ref|YP_001437723.1| hypothetical protein ESA_01633 [Cronobacter sakazakii ATCC BAA-894]
 gi|156532061|gb|ABU76887.1| hypothetical protein ESA_01633 [Cronobacter sakazakii ATCC BAA-894]
          Length = 575

 Score = 41.6 bits (96), Expect = 0.31,   Method: Composition-based stats.
 Identities = 56/367 (15%), Positives = 95/367 (25%), Gaps = 56/367 (15%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP---GIS 109
            WQ     V       S      +     +   R  GK+ L A  V   M       G  
Sbjct: 84  DWQKFCFCVSFGWVRKSDGLRRFQEIYIEVP--RKNGKS-LIAASVGIYMFCADDEHGAE 140

Query: 110 VICLANSETQLKTTLWAE---VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166
           V C A +E Q       E   V K  +L               P       +    G   
Sbjct: 141 VYCGATTEKQAFKVFEPERQMVQKLPALRKRFSIKPWAKKMTRPDGSVFAPIVGDPGDGD 200

Query: 167 KHYSTMCRTYSEERPDTF-------VGHHNTYGMAII----NDEASGTPDV---INLGIL 212
                +   Y E   D          G        II     D AS   D    +   + 
Sbjct: 201 SPSCAIIDEYHEHATDALYTTMTTGQGAREQPLTLIITTAGYDIASPCYDKRSQVVEILE 260

Query: 213 GFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGI----DPSFHEG 268
           G  T+      + +     +             DDW   +   +    +     P F   
Sbjct: 261 GIRTDGANETIFGIIYTLDKD------------DDWTSEEAIRKANPNLGVSLKPEFLRA 308

Query: 269 IIARYGLDSDVTRVEVCGQ----FPQQDIDSFIPLNIIEEA-LNREPCPDPYAPLIMGCD 323
                   +     ++  +    +       +      E A  +         P  +G D
Sbjct: 309 K-QELAKTTPSQTNKILTKHFNLWVSSKAAFYNMQRWQEAADPSLTLADFEGEPCYLGID 367

Query: 324 IAEEGGDNTV--VVLRRGPVIEHLFD-----WSKTDLRTTNN-KISGLVEKYR---PDAI 372
           +A +   N V  V +R    ++H +      W   D   + + ++    E+Y+      +
Sbjct: 368 LASKLDLNAVVPVFMREIDGLKHFYCIGAQFWVPEDTVYSTDPQLKRTAERYQSFVNQGV 427

Query: 373 IIDANNT 379
           +I  +  
Sbjct: 428 LIPTDGA 434


>gi|195942183|ref|ZP_03087565.1| hypothetical protein Bbur8_04905 [Borrelia burgdorferi 80a]
 gi|219786709|ref|YP_002477434.1| phage terminase, large subunit, pbsx family protein [Borrelia
           burgdorferi 156a]
 gi|219692709|gb|ACL33925.1| phage terminase, large subunit, pbsx family protein [Borrelia
           burgdorferi 156a]
 gi|312148688|gb|ADQ31340.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           JD1]
 gi|312148897|gb|ADQ31544.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           JD1]
 gi|312201269|gb|ADQ44578.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi
           297]
          Length = 396

 Score = 41.6 bits (96), Expect = 0.31,   Method: Composition-based stats.
 Identities = 33/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           + RG GKT   A + L    +  G   + +   + +   ++  E+ + LS+   + +F +
Sbjct: 26  SSRGTGKTYDIATVNLERKFSADGGDTLAIRKKKNKTTQSIHKEILELLSIYNLRKFFNI 85

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196
               +                           + ++R   F G H+T  +        + 
Sbjct: 86  SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124

Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254
            +EA+         ++  + E+    +  M+SNP   S   Y+ +  N+        +  
Sbjct: 125 LEEANQFSADDIEMLIPTMREQGGRIY--MSSNPVPKSHWLYKRYLSNQDNPAVCIIKST 182

Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287
            R    ++    +  + +    Y  +    R+EV G+
Sbjct: 183 YRDNPFLNGGDVQAWLEKQRLAYHGNDIGFRIEVLGE 219


>gi|324504396|gb|ADY41899.1| ATP-dependent RNA helicase DDX20 [Ascaris suum]
          Length = 937

 Score = 41.6 bits (96), Expect = 0.32,   Method: Composition-based stats.
 Identities = 29/209 (13%), Positives = 61/209 (29%), Gaps = 19/209 (9%)

Query: 74  NPEVFKGAISAGRGIGKTTLNAWLVLWLM-STRPGISVICLANSETQLKTTLWAEVSKWL 132
               F   + A  G GKT + A + L  + + R    V+ +A +          E++  +
Sbjct: 53  GLMGFDMLVQAKSGTGKTLVFALMALEGLNAQRRQPQVMIIAPT---------REIAMQI 103

Query: 133 SLLPNK---HWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHN 189
           ++   +       +            D+     G+      T  R       D    +H 
Sbjct: 104 AVTVRRLAPPVIHVGVFVGGGRSVADDIKEIRKGVHIA-VGTTGRLCQLVNDDLLPTNH- 161

Query: 190 TYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWK 249
                 + DEA    +      + FL     N   +   +     G   E   + +    
Sbjct: 162 --VHLFVLDEADKLMEENFQKDINFLFSSLPNNKQMAVFSATYP-GDLDETLARYMKKAH 218

Query: 250 RFQIDTRTVEGID-PSFHEGIIARYGLDS 277
             +++   V+ +    +     +  G  S
Sbjct: 219 LIRLNAEDVQLLGIKQYVAMSYSEDGPTS 247


>gi|225621767|ref|YP_002724125.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1]
 gi|225547658|gb|ACN93635.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1]
          Length = 450

 Score = 41.6 bits (96), Expect = 0.33,   Method: Composition-based stats.
 Identities = 32/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDNPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294
            F   +      +K +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNTATFKTYNFTTYDNVLLGKGFIEPQEKLY-KDIPTYKARVLLGEWIASIDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITQDYVFSSP--------IAYLDPAFSVGGDNTALCV 299


>gi|283786098|ref|YP_003365963.1| ATP-dependent acetyltransferase [Citrobacter rodentium ICC168]
 gi|282949552|emb|CBG89168.1| putative ATP-dependent acetyltransferase [Citrobacter rodentium
           ICC168]
          Length = 669

 Score = 41.6 bits (96), Expect = 0.33,   Method: Composition-based stats.
 Identities = 25/112 (22%), Positives = 38/112 (33%), Gaps = 17/112 (15%)

Query: 18  LMWSD-EIKLSFSNFVLHFFP----------WGEKGT-PLEGFSAPRSWQLEFMEVVDAH 65
           L WSD    +   NFV HF            W +  +  +  F    +WQ    E +   
Sbjct: 121 LRWSDCPQPIPTPNFVRHFCRVLLADGQTLCWRQPQSLSVTHFPGRPAWQSATGEPLPEQ 180

Query: 66  CLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE 117
                          A++A RG GK+ L   L+   +S       I  A ++
Sbjct: 181 SAILAQLRQMPERIAAVTAARGRGKSALAGQLIA-HLSG----QAIVTAPTK 227


>gi|310005737|gb|ADP00124.1| DNA maturase beta subunit [Cyanophage NATL1A-7]
          Length = 577

 Score = 41.6 bits (96), Expect = 0.35,   Method: Composition-based stats.
 Identities = 25/124 (20%), Positives = 41/124 (33%), Gaps = 11/124 (8%)

Query: 297 IPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVL---RRGPVIEH-LFDWSKTD 352
           +P +     +  +     Y   I   D +  G D T       + G +  H +  +    
Sbjct: 324 LPGDYFYSPMQLQGEWSKYTETICSVDPSGRGSDETAAAYLSQKNGFIYLHEMRAYRDGY 383

Query: 353 LRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCR-N 411
              T   I    +KY    ++I+ N  G      L             K+A+D+E  R N
Sbjct: 384 TDNTLLNILRGCQKYGVTKLVIETN-FGDGIVAELFKKHLQ-----NTKQAIDIEEVRAN 437

Query: 412 RRTE 415
            R E
Sbjct: 438 VRKE 441


>gi|260433350|ref|ZP_05787321.1| phage DNA Packaging Protein [Silicibacter lacuscaerulensis
           ITI-1157]
 gi|260417178|gb|EEX10437.1| phage DNA Packaging Protein [Silicibacter lacuscaerulensis
           ITI-1157]
          Length = 427

 Score = 41.2 bits (95), Expect = 0.36,   Method: Composition-based stats.
 Identities = 55/423 (13%), Positives = 109/423 (25%), Gaps = 69/423 (16%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTTLWAEVSKWL 132
           I  GRG GKT   A    W+ S   G           V  +  +  Q++  +        
Sbjct: 35  IMGGRGAGKTRAGA---EWVRSMVEGAKPFDEGEARRVALVGETFDQVRDVM-------- 83

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
                   F    +     P        S            + +S   P+   G      
Sbjct: 84  -------IFGDSGIMQCSPPDRRPQWKASERKLVWPNGAEAQAFSAHDPEGLRGPQFD-- 134

Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250
            A   DE +           +   L          +T+ P R      ++   P      
Sbjct: 135 -AAWVDELAKWKKAGETWDMLQFAL-RLGERPRVCVTTTP-RNVKVLKDLLAAPSTVM-T 190

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
                     +  SF + + ARY   + + R E+ G        +    ++++ A  R  
Sbjct: 191 HAPTEANRANLAESFLQEVRARY-AGTRLGRQELDGVLLADAEGALWTGSMLDGA--RVG 247

Query: 311 CPDPYAPLIMGCDIA---EEGGDNTVVVLRRGPVIEHLFDWS----------KTDLRTTN 357
                  +++  D A     G D   +V+    +     DW                   
Sbjct: 248 AVPELDRVVVALDPAVTGGSGADACGIVVVGAQLQGPPEDWRAYVLADRTVQGVGPAGWA 307

Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELH 417
                 ++++  + ++ + N  G +  + +      +      + +            L+
Sbjct: 308 RAAIDAMDEFGAERLVAEVNQ-GGQLVEEVVRQVDPLVPFRAVRASRGKVARAEPVAALY 366

Query: 418 VKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
            +       A L      +  + +                    G  S D  D L++   
Sbjct: 367 EQGRV-FHVAGLDALEEQMCQMTARGFE----------------GQGSPDRVDALVWALH 409

Query: 478 ENP 480
           E  
Sbjct: 410 ELV 412


>gi|255945291|ref|XP_002563413.1| Pc20g09170 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211588148|emb|CAP86246.1| Pc20g09170 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 944

 Score = 41.2 bits (95), Expect = 0.36,   Method: Composition-based stats.
 Identities = 30/190 (15%), Positives = 63/190 (33%), Gaps = 22/190 (11%)

Query: 87  GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW-AEVSKWLS---LLPNKHWFE 142
           G+GKT + A  ++     +P  +++ +      +    W +E+ ++      +   H  +
Sbjct: 365 GMGKT-IQAVSLIMSDFPQPDPTLVIVPP----VALMQWVSEIKEYTDGKLKVLVYHNSD 419

Query: 143 MQSLSLHPAPWYSDVLHCS-----LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197
            +   L PA      +          I  K      R  +  + D+    H  +   ++ 
Sbjct: 420 AKVKRLTPAEIRKYDVIMISYASLESIYRKQEKGFSRGETMVKADSV--IHAVHYHRLVL 477

Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLS-GKFYEIFN----KPLDDWKRFQ 252
           DEA                   AN  W ++  P +   G+F+ +      KP   +   Q
Sbjct: 478 DEAHSIKSRTTGVARACFALE-ANYKWCLSGTPVQNRIGEFFSLLRFLQVKPFACYFCKQ 536

Query: 253 IDTRTVEGID 262
            D   ++   
Sbjct: 537 CDCEQLQWTS 546


>gi|319406198|emb|CBI79835.1| phage-related protein [Bartonella sp. AR 15-3]
          Length = 442

 Score = 41.2 bits (95), Expect = 0.37,   Method: Composition-based stats.
 Identities = 29/193 (15%), Positives = 61/193 (31%), Gaps = 9/193 (4%)

Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNAN--RFWIMTSNPRRLSGKFYEIFN-KPLDD 247
             +    DEA    D     ++  L E          +T NP R +    + F      +
Sbjct: 122 RILLCWVDEAEPVTDAAWQVLIPTLREEGKEWHSELWVTWNPCRENAAVEKRFRFTKDPN 181

Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF---IPLNIIEE 304
            K  +I+ R         +    A      +  +    G++      ++   + L   +E
Sbjct: 182 IKGVEINWRDNPKFPAKLNRDRTADLEQRPEQYQHIWEGEYLLAMQGAYYQKLLLEAEQE 241

Query: 305 ALNREPCPDPYAPLIMGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTNNKIS 361
                   DP   + +  DI   G   D T + + +       + D+ +   +  +  I 
Sbjct: 242 GRITTVPRDPLIQVKIFWDIGGTGAKADATALWVAQFVGREIRVLDYYEAQGQPLSEHIG 301

Query: 362 GLVEKYRPDAIII 374
            +  K    A+++
Sbjct: 302 WICHKGYEKALMV 314


>gi|182438394|ref|YP_001826113.1| hypothetical protein SGR_4601 [Streptomyces griseus subsp. griseus
           NBRC 13350]
 gi|178466910|dbj|BAG21430.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus
           NBRC 13350]
          Length = 609

 Score = 41.2 bits (95), Expect = 0.37,   Method: Composition-based stats.
 Identities = 31/125 (24%), Positives = 42/125 (33%), Gaps = 24/125 (19%)

Query: 5   LPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDA 64
           +P +PE E           +  +F        PWG  G         R+WQ   ME    
Sbjct: 7   VPESPEPETVTTTTASH-HLSPAFPGRA----PWGTAG-------KLRAWQQGAME---- 50

Query: 65  HCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTL 124
                V     +    A     G GKTT    L  WL+       +  +A +E  LK   
Sbjct: 51  ---RYVQEQPRDFLAVATP---GAGKTTFALTLASWLLHHHVVQQITVVAPTEH-LKKQ- 102

Query: 125 WAEVS 129
           WAE +
Sbjct: 103 WAEAA 107


>gi|73748202|ref|YP_307441.1| hypothetical protein cbdb_A296 [Dehalococcoides sp. CBDB1]
 gi|73659918|emb|CAI82525.1| hypothetical protein cbdbA296 [Dehalococcoides sp. CBDB1]
          Length = 405

 Score = 41.2 bits (95), Expect = 0.38,   Method: Composition-based stats.
 Identities = 49/295 (16%), Positives = 86/295 (29%), Gaps = 59/295 (20%)

Query: 154 YSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILG 213
           ++D+ H   G   +         S E   + VG  NT  + +  DEA             
Sbjct: 72  FTDIYHTEGGYIIRLNQARAVFLSAEPSASVVG--NTAHLLLEVDEAQDVNKEKY----- 124

Query: 214 FLTERNANRFWIMTSNPRRLSGKFYEIFN-------------KPLDDWKRFQIDTRTVEG 260
               +        T+    L G  ++ F+             +     + F+ D   V  
Sbjct: 125 ---SKEFKPMGATTNVTTVLYGTTWDSFSLLEEIKEQNIEKEQKDGLKRHFRYDWEAVAA 181

Query: 261 IDPSFHE---GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPC---PDP 314
            +P++         R G +  +   +     P            ++      PC   P+ 
Sbjct: 182 HNPTYLAYALSEKERLGKNHPLFLAQYR-LLPVSGGGGMFSNEQLDLLKGNHPCQVYPEK 240

Query: 315 YAPLIMGCDIAEE-----GGDNTVVVLRRGPVIEHL----------------------FD 347
               + G D+A E     G   T V LRR   +  +                      + 
Sbjct: 241 GKVYVAGLDLAGEDSQTGGISPTTVNLRRDSSVLTIAQLDYTFAKAPYNLPQVRLVCHYS 300

Query: 348 WSKTDLRTTNNKISGLVEK-YRPDAIIIDANNTGARTCDYLEM-LGYHVYRVLGQ 400
           W  T       K+  L+ K ++   + +DA   G     +L   LG  +  V  Q
Sbjct: 301 WQGTRHALLYEKLVELLGKVWKCRKVAVDATGLGQPVASFLRESLGSRILPVPFQ 355


>gi|149913871|ref|ZP_01902403.1| hypothetical protein RAZWK3B_17748 [Roseobacter sp. AzwK-3b]
 gi|149812155|gb|EDM71986.1| hypothetical protein RAZWK3B_17748 [Roseobacter sp. AzwK-3b]
          Length = 419

 Score = 41.2 bits (95), Expect = 0.38,   Method: Composition-based stats.
 Identities = 68/428 (15%), Positives = 125/428 (29%), Gaps = 83/428 (19%)

Query: 82  ISAGRGIGKTTLNA-WLVLWLMSTRPGISVIC-----LANSETQLKTT-LWAEVSKWLSL 134
           I  GRG GKT   A W+   +   RP    +C     +  +  Q++   ++ E S  ++ 
Sbjct: 27  ILGGRGAGKTRAGAEWVRAQVEGARPLSEGLCRRMALVGETIDQVREVMIFGE-SGIMAC 85

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
            P     + Q+                          + + +S   P+   G        
Sbjct: 86  SPPDRRPDWQATR---------------KRLVWPNGAVAQAFSAHEPEALRGPQFDGA-- 128

Query: 195 IINDEA---SGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF 251
              DE        D  ++   G     +     +  +   R  G   ++    L+     
Sbjct: 129 -WVDEMAKWKKARDTWDMLQFGLRLGDHPQ---VCITTTPRNVGVLKDL----LEQKSTV 180

Query: 252 QIDTRTVEG---IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308
                T      +  SF E + ARY   + + R E+ G    +   +    + IE    R
Sbjct: 181 VTSAPTEANRAFLAQSFLEEVRARY-AGTRLGRQELDGVLLSEAEGALWTNSGIEAC--R 237

Query: 309 EPCPDPYAPLIMGCDI---AEEGGDNTVVVL--------RRGPVIEHLFDWSKTDLRTT- 356
                    +++  D       G D   +V+         +      L D S    R   
Sbjct: 238 VDNLPELDRIVVAIDPPVTGRAGSDECGIVVAGAVTRGPVQDWRAYVLADCSVGAARPLS 297

Query: 357 -NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRR 413
             N     +E +  D ++ + N  G      +  +     V  V  ++  V        R
Sbjct: 298 WANAAISAMEHWGADRLVAEVNQGGDMVAQVIRQVDPLVPVKSVHARRGKVT-------R 350

Query: 414 TELHVKMADWLEFASLINHSG---LIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSD 470
            E    +A   E   + +  G   L   + ++ +      G             S D  D
Sbjct: 351 AE---PVAALYEQGRVHHLRGLGTLEDQMCAMTARGFEGKG-------------SPDRVD 394

Query: 471 GLMYTFAE 478
            L++   E
Sbjct: 395 ALVWALTE 402


>gi|160898677|ref|YP_001564259.1| hypothetical protein Daci_3236 [Delftia acidovorans SPH-1]
 gi|160364261|gb|ABX35874.1| protein of unknown function DUF264 [Delftia acidovorans SPH-1]
          Length = 428

 Score = 41.2 bits (95), Expect = 0.39,   Method: Composition-based stats.
 Identities = 49/320 (15%), Positives = 92/320 (28%), Gaps = 38/320 (11%)

Query: 75  PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSL 134
           P  F+  + AG G GKT +    +       P I+    A S  Q++   +  +      
Sbjct: 19  PHKFRAFV-AGFGSGKTWVGCSGLSAHAWEFPRINAGYFAPSYPQIRDIFFPTI------ 71

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
               H + +++          +          + Y T     S +RP++ VG      + 
Sbjct: 72  EEVAHDWGLRTEI-------RESNKEVHLYSGRQYRTTVICRSMDRPESIVGFKIGQAL- 123

Query: 195 IINDE----ASGTPDVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIFNKPLDD- 247
              DE    A    +     I+  +            +T+ P       ++ F K + D 
Sbjct: 124 --VDELDVMAKQKAEQAWRKIIARMRYNVDGLKNGVDVTTTPEG-FKFTHQQFVKAVQDK 180

Query: 248 ------WKRFQIDT-RTVEGIDPSFHEGIIARYGLD-SDVTRVEVCGQFPQQDIDSFIPL 299
                 +   Q  T    + +   +   +   Y     D     + G F      S  P 
Sbjct: 181 PELAKLYGLIQASTFENAKNLPADYIPSLFDSYPKQLIDAY---LRGLFVNLTSGSVYP- 236

Query: 300 NIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK 359
           +   +  +         P+++G D           VLR G  +         D       
Sbjct: 237 DFDRKLNHSFESLQEGEPVMLGMDFNRLHMAAVAYVLRDGWPVAVDEITDGRDTPYMARL 296

Query: 360 ISGLV-EKYRPDAIIIDANN 378
                 +K     +  DA+ 
Sbjct: 297 FRERYQDKGHAVTVYPDASG 316


>gi|329928970|ref|ZP_08282780.1| Tex-like protein N-terminal domain protein [Paenibacillus sp. HGF5]
 gi|328937222|gb|EGG33649.1| Tex-like protein N-terminal domain protein [Paenibacillus sp. HGF5]
          Length = 731

 Score = 41.2 bits (95), Expect = 0.40,   Method: Composition-based stats.
 Identities = 22/107 (20%), Positives = 35/107 (32%), Gaps = 9/107 (8%)

Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVI 342
           EV G+  ++  +  I +       +    P      ++G D A   G    VV   G ++
Sbjct: 293 EVRGELTEKGENQAISIF-AGNLRSLLLQPPVKGRCVLGVDPAYRTGCKLAVVDDTGKLL 351

Query: 343 EHLFDWSK---TDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY 386
           E    +        R    K   L+ KY    I+I     G  T   
Sbjct: 352 EVAVTYPTPPANKKREAAAKFKELIAKYGIKLIVI-----GNGTASR 393


>gi|219872329|ref|YP_002476730.1| phage terminase, large subunit, pbsx family [Borrelia garinii PBr]
 gi|219694371|gb|ACL34896.1| phage terminase, large subunit, pbsx family [Borrelia garinii PBr]
          Length = 396

 Score = 41.2 bits (95), Expect = 0.41,   Method: Composition-based stats.
 Identities = 32/217 (14%), Positives = 74/217 (34%), Gaps = 36/217 (16%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           + RG GKT   A + L       G   + +   + +   ++  E+ + L++   + +F +
Sbjct: 26  SSRGTGKTYDIATVNLERKFNPDGGDTLAIRKKKNKTTQSIHKEICELLNIYNLRKFFNI 85

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196
               +                           + ++R   F G H+T  +        + 
Sbjct: 86  SKSKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124

Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254
            +EA+         ++  + E+    +  M+SNP   S   Y+ +  N+        +  
Sbjct: 125 LEEANQFTSEDIEMLIPTMREQGGRVY--MSSNPVPKSHWLYKRYLSNEDNPAVCIIKST 182

Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287
            R    ++    +  + +    Y  +    R+EV G+
Sbjct: 183 YRDNPFLNGGNVQAWLEKQKLAYHGNDIGFRIEVLGE 219


>gi|330015975|ref|ZP_08308363.1| putative ATPase subunit of terminase [Klebsiella sp. MS 92-3]
 gi|328529845|gb|EGF56736.1| putative ATPase subunit of terminase [Klebsiella sp. MS 92-3]
          Length = 575

 Score = 41.2 bits (95), Expect = 0.41,   Method: Composition-based stats.
 Identities = 37/269 (13%), Positives = 75/269 (27%), Gaps = 36/269 (13%)

Query: 237 FYEIFNK---PLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI 293
           + +  +    P   W++  +  +        + +    R     D        +F +   
Sbjct: 304 WKKTHSGVLYPDKTWRQI-VTIQDAINNGWDYTDIDEIRDENSPDEFENLYMCEFVKDGE 362

Query: 294 DSFIPLNIIEEALNREPCPDPYAP----------LIMGCDI--AEEGGDN-----TVVVL 336
            +F    ++    +       + P          + +G D       GD      TV  L
Sbjct: 363 SAFNLSQLLGCGADGYDDWPDWKPFASRPMGQREVWLGYDANGGSGNGDAGALSVTVPPL 422

Query: 337 RRGPVIE--HLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHV 394
             G       L      +       I    E+Y    I ID    G           + +
Sbjct: 423 VAGGRFRTVELKQLRGLEFEQQAAVIKEAAERYNVTHIAIDGQGVGEAV--------WQI 474

Query: 395 YRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINH---SGLIQNLKSLKSFIVPNTG 451
            +              ++R  L +KM   +            GL++   +++  +V   G
Sbjct: 475 VKNWFPAAICYQMSLSSKRA-LVLKMLQVIRAGRWEYDRSEQGLVRAFNAVRK-VVTPGG 532

Query: 452 ELAIESKRVKGAKSTDYSDGLMYTFAENP 480
            +  E+ R +G    D +   M +    P
Sbjct: 533 FITYETDRSRGVSHGDMAWATMLSIINEP 561


>gi|224796986|ref|YP_002642738.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           WI91-23]
 gi|224553700|gb|ACN55104.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           WI91-23]
 gi|312149848|gb|ADQ29915.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           N40]
          Length = 396

 Score = 41.2 bits (95), Expect = 0.41,   Method: Composition-based stats.
 Identities = 33/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           + RG GKT   A + L    +  G   + +   + +   ++  E+ + LS+   + +F +
Sbjct: 26  SSRGTGKTYDIATVNLERKFSADGGDTLAIRKKKNKTTQSIHKEILELLSIYNLRKFFNI 85

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196
               +                           + ++R   F G H+T  +        + 
Sbjct: 86  SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124

Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254
            +EA+         ++  + E+    +  M+SNP   S   Y+ +  N+        +  
Sbjct: 125 LEEANQFSADDIEMLVPTMREQGGRIY--MSSNPVPKSHWLYKRYLSNQDNPAVCIIKST 182

Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287
            R    ++    +  + +    Y  +    R+EV G+
Sbjct: 183 YRDNPFLNGGDVQAWLEKQRLAYHGNDIGFRIEVLGE 219


>gi|226234361|ref|YP_002775493.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           29805]
 gi|226201889|gb|ACO38473.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           29805]
          Length = 396

 Score = 41.2 bits (95), Expect = 0.41,   Method: Composition-based stats.
 Identities = 33/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           + RG GKT   A + L    +  G   + +   + +   ++  E+ + LS+   + +F +
Sbjct: 26  SSRGTGKTYDIATVNLERKFSADGGDTLAIRKKKNKTTQSIHKEILELLSIYNLRKFFNI 85

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196
               +                           + ++R   F G H+T  +        + 
Sbjct: 86  SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124

Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254
            +EA+         ++  + E+    +  M+SNP   S   Y+ +  N+        +  
Sbjct: 125 LEEANQFSADDIEMLVPTMREQGGRIY--MSSNPVPKSHWLYKRYLSNQDNPAVCIIKST 182

Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287
            R    ++    +  + +    Y  +    R+EV G+
Sbjct: 183 YRDNPFLNGGDVQAWLEKQRLAYHGNDIGFRIEVLGE 219


>gi|11496682|ref|NP_045481.1| hypothetical protein BBG21 [Borrelia burgdorferi B31]
 gi|218868779|ref|YP_002455248.1| phage terminase, large subunit, pbsx family protein [Borrelia
           burgdorferi ZS7]
 gi|224796961|ref|YP_002642637.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           WI91-23]
 gi|224985496|ref|YP_002642672.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           64b]
 gi|225548803|ref|YP_002724009.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           118a]
 gi|2690026|gb|AAC66069.1| predicted coding region BBG21 [Borrelia burgdorferi B31]
 gi|218165273|gb|ACK75330.1| phage terminase, large subunit, pbsx family protein [Borrelia
           burgdorferi ZS7]
 gi|223929545|gb|ACN24257.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           64b]
 gi|224554186|gb|ACN55578.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           WI91-23]
 gi|225546810|gb|ACN92808.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi
           118a]
          Length = 396

 Score = 41.2 bits (95), Expect = 0.41,   Method: Composition-based stats.
 Identities = 33/217 (15%), Positives = 75/217 (34%), Gaps = 36/217 (16%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEM 143
           + RG GKT   A + L    +  G   + +   + +   ++  E+ + LS+   + +F +
Sbjct: 26  SSRGTGKTYDIATVNLERKFSADGGDTLAIRKKKNKTTQSIHKEILELLSIYNLRKFFNI 85

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA-------II 196
               +                           + ++R   F G H+T  +        + 
Sbjct: 86  SKAKIESKSL---------------------IFGKKRAFVFEGGHDTRDLKSYAHFKDLW 124

Query: 197 NDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQID 254
            +EA+         ++  + E+    +  M+SNP   S   Y+ +  N+        +  
Sbjct: 125 LEEANQFSADDIEMLVPTMREQGGRIY--MSSNPVPKSHWLYKRYLSNQDNPAVCIIKST 182

Query: 255 TRTVEGIDPSFHEGIIAR----YGLDSDVTRVEVCGQ 287
            R    ++    +  + +    Y  +    R+EV G+
Sbjct: 183 YRDNPFLNGGDVQAWLEKQRLAYHGNDIGFRIEVLGE 219


>gi|257088841|ref|ZP_05583202.1| predicted protein [Enterococcus faecalis CH188]
 gi|256997653|gb|EEU84173.1| predicted protein [Enterococcus faecalis CH188]
 gi|315160590|gb|EFU04607.1| phage uncharacterized protein [Enterococcus faecalis TX0645]
 gi|315579436|gb|EFU91627.1| phage uncharacterized protein [Enterococcus faecalis TX0630]
          Length = 418

 Score = 41.2 bits (95), Expect = 0.42,   Method: Composition-based stats.
 Identities = 46/323 (14%), Positives = 95/323 (29%), Gaps = 35/323 (10%)

Query: 86  RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145
           RG  KTT  A  +  LM   P  ++I L  ++T     +  E+   ++ + +  +F+   
Sbjct: 52  RGSFKTTTLAIAIALLMVLFPNKNIIFLRKTDT---DVV--EIILQVAKVLSSKYFKTLV 106

Query: 146 LSLH--PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
            +L+        +                 +        +  G H      +I D+    
Sbjct: 107 FALYNVELVLLKETTTEIDTNLKTSSRGTSQLLGMGIYASLTGKHAD---IVITDDIVNI 163

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI---FNKPLDDWKRF---QIDTRT 257
            D ++               +    N +  +G+F      ++K     K     + D   
Sbjct: 164 KDRVSRA-----EREKTKLQYQELQNVKNRAGRFINTGTPWHKEDAISKMPNVKKFDCYE 218

Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317
              ID    + +  +  +   +       +        F      +   N       +  
Sbjct: 219 TGLIDKEQRKAL--QQSMTPSLFAANYELKHIADSESLFTAPTYTD-NTNLIYNGVAH-- 273

Query: 318 LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII 373
                D A  G D+T   +    + G +I     W K  +     +I  L + Y+     
Sbjct: 274 ----IDAAYGGDDSTAFTIFKEQKDGTIIGFGKKWQKH-VDDCIPEILQLHQHYQAGTFY 328

Query: 374 IDANNTGARTCDYLEMLGYHVYR 396
            + N        +L   G +V +
Sbjct: 329 NETNGDKGYLAKHLIERGQYVQK 351


>gi|158300801|ref|XP_320633.4| AGAP011893-PA [Anopheles gambiae str. PEST]
 gi|157013336|gb|EAA00145.5| AGAP011893-PA [Anopheles gambiae str. PEST]
          Length = 607

 Score = 41.2 bits (95), Expect = 0.42,   Method: Composition-based stats.
 Identities = 46/286 (16%), Positives = 86/286 (30%), Gaps = 34/286 (11%)

Query: 3   RELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVV 62
           R L       +  + L ++  + +S  +F    FP  E   P         W    +   
Sbjct: 151 RSLKIEFPLNRLQYKLEYTALVHMSRLDFSSILFPKIESAKPTTPAKTFD-WFQSCI--- 206

Query: 63  DAHCLNSVNNPNPEVFKGAISAGR------GIGKTT--LNAWLVLWLMSTRPGISVICLA 114
            A            V + A  A        G GKT   + A L +W M  RP   ++  A
Sbjct: 207 -AENEQQTQAIKNIVNRTAYPAPYILFGPPGTGKTCTIVEAVLQIWKM--RPKSRILVTA 263

Query: 115 NS--------ETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166
            S        +  LK     ++ ++ S    +    M    +  +  +  +       D 
Sbjct: 264 TSNYACNELAKRLLKYVTVNDLFRYFSQTSQRDINGMDLKVVQVSNMHYGIYETPAMQDF 323

Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFW-- 224
                +  T         +G   +    I  DE     ++  L  +G +     N     
Sbjct: 324 VQTRILVCTVMTSGRLLQLGVDRSMYDYIFIDECGSCRELSALVPIGCVGTDTTNNRLQA 383

Query: 225 --IMTSNPRRLSGKFYEIFNKPLDD-----W--KRFQIDTRTVEGI 261
             ++  +P +L  +FY+   +   D     W      +  R +  +
Sbjct: 384 SVVLAGDPLQLGPQFYDAELRAKGDPTITHWAVNWHHLPNRKLPML 429


>gi|86138748|ref|ZP_01057320.1| terminase, large subunit, putative [Roseobacter sp. MED193]
 gi|85824395|gb|EAQ44598.1| terminase, large subunit, putative [Roseobacter sp. MED193]
          Length = 417

 Score = 41.2 bits (95), Expect = 0.42,   Method: Composition-based stats.
 Identities = 64/428 (14%), Positives = 117/428 (27%), Gaps = 89/428 (20%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTTLWAEVSKWL 132
           I  GRG GKT   A    W+ +   G           +  +  +  Q++  +        
Sbjct: 25  ILGGRGAGKTRAGA---EWIRTQVEGATPLGPGRGRRLALIGETYDQVRDVM-------- 73

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
             +         S S     W +                M + +S   P+   G      
Sbjct: 74  --ILGDSGILACSPSDRRPQWKAGERKLIWAN-----GAMAQAFSAHDPEALRGPQFDTA 126

Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250
                DE +           +   L   +  R  +  +   R +    ++   P    K 
Sbjct: 127 ---WADELAKWRRAREAWDMLQFSLRLGDDPR--VCVTTTPRNAALLRQLLASPSTV-KS 180

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE----EAL 306
                     + PSF   + ARY   S + R E+ G          I L+ +E     A 
Sbjct: 181 HAATEANRANLAPSFLSEVRARY-AGSRLGRQELDG----------ILLSDVEGAIWRAA 229

Query: 307 NREPCPDPYAP----LIMGCDIA---EEGGDNTVVVL--------RRGPVIEHLFDWS-- 349
                  P AP    +++  D A    +G D   +++                L D +  
Sbjct: 230 QLAELQVPTAPALDRIVVAVDPAVSSGKGSDACGIIVAGACLQGPVETWRAYVLADRTVQ 289

Query: 350 KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLE 407
                     +    +++  D ++ + N  GA   + L  +        V   +  V   
Sbjct: 290 GVGPLAWAKAVIAAHQEFAADRVVAEVNQGGALVENLLRQIDPLVGFQPVHASRGKV--- 346

Query: 408 FCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTD 467
                R E    + +      L   + L + +  +        G             S D
Sbjct: 347 ----VRAEPVAALYEQHRVHHLPGLAELEEQMCQMSQQGFQGQG-------------SPD 389

Query: 468 YSDGLMYT 475
             D L++ 
Sbjct: 390 RVDALVWA 397


>gi|256023437|ref|ZP_05437302.1| predicted type I site-specific deoxyribonuclease, HsdR family
           protein [Escherichia sp. 4_1_40B]
          Length = 1031

 Score = 41.2 bits (95), Expect = 0.43,   Method: Composition-based stats.
 Identities = 49/327 (14%), Positives = 99/327 (30%), Gaps = 44/327 (13%)

Query: 86  RGIGKTTLNAWLVLWLMSTRPGISV-ICLANSE--TQLKTTLWAEVSKWLSLLPNKHWFE 142
           +G GK+    WL  W+    P   V I    +E   Q+++     V++   +       +
Sbjct: 278 QGSGKSLTMVWLAKWIRENVPNSRVLIVTDRTELDEQIESVFMG-VNE--DIYRTSSGND 334

Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY--SEERPDTFVGHHNTYGMAIINDE- 199
           + +   HP PW    L    G  S+           +E +            + +  DE 
Sbjct: 335 LIATLNHPNPWLICSLVHKFGRRSEAEDNAATDAFITELQQSLTKTFRAKGDLFVFVDEC 394

Query: 200 ------------ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD 247
                        +  PD + +G  G    +   +  +       + G +   +      
Sbjct: 395 HRTQSGKLHNAMTAILPDALFIGFTGTPLMKKDKKKSV------EVFGPYIHTYKFDEAV 448

Query: 248 WKRFQIDTR------TVEGIDPSFHEGIIARYGLD-SDVTRVEVCGQFPQQDIDSFIPLN 300
                +D R                +          S++ R ++  ++           +
Sbjct: 449 ADGVVLDLRYEARDIDQYLTSEKKVDDWFEAKTRGLSNLARTQLKQKWGSMQ-KLLSSKS 507

Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360
            +E+ +N         P +M      +G  N ++V     + +    +          K+
Sbjct: 508 RLEQIVNDILLDMDTRPRLM------DGRGNAMLVC--SSIYQACKVYEMFSQTELAGKV 559

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYL 387
           + +V  YRPDA  I    TG    + L
Sbjct: 560 A-IVTSYRPDAASIKGEETGEGLTEKL 585


>gi|327400267|ref|YP_004341106.1| hypothetical protein Arcve_0358 [Archaeoglobus veneficus SNP6]
 gi|327315775|gb|AEA46391.1| protein of unknown function DUF699 ATPase [Archaeoglobus veneficus
           SNP6]
          Length = 807

 Score = 41.2 bits (95), Expect = 0.45,   Method: Composition-based stats.
 Identities = 29/166 (17%), Positives = 57/166 (34%), Gaps = 35/166 (21%)

Query: 60  EVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMS------TRPGISVICL 113
           +V       +  +   E     I+A RG GKT +   +  +L+S       RP + ++ +
Sbjct: 255 QVRVLQLFETFFDREKERKAVVITADRGRGKTAVLGIVTPYLISRMHRVLKRP-VRIMVV 313

Query: 114 ANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS----LSLHPAPWYSDVLHCSLGIDSKHY 169
           A +   ++T  +  + K L     K++   +S      ++      + +     +  K Y
Sbjct: 314 APTPQAVQTY-FRFLKKALVRQGMKNYKVKESNGLITVINSKFARVEYVVPRRAMIEKDY 372

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP-DVINLGILGF 214
           +                        II DEA+G    V+     G 
Sbjct: 373 AD----------------------IIIVDEAAGIDVPVLWQITEGA 396


>gi|148241989|ref|YP_001227146.1| hypothetical protein SynRCC307_0890 [Synechococcus sp. RCC307]
 gi|147850299|emb|CAK27793.1| Hypothetical protein SynRCC307_0890 [Synechococcus sp. RCC307]
          Length = 98

 Score = 41.2 bits (95), Expect = 0.45,   Method: Composition-based stats.
 Identities = 18/48 (37%), Positives = 24/48 (50%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS 129
           + +GR  GKT L     + L  T+PG  V  LA S  Q K   WA++ 
Sbjct: 25  VFSGRRFGKTRLMLTAGVELCLTKPGAKVFHLAPSRKQAKDIAWADLK 72


>gi|331238525|ref|XP_003331917.1| DNA repair protein RAD5 [Puccinia graminis f. sp. tritici CRL
           75-36-700-3]
 gi|309310907|gb|EFP87498.1| DNA repair protein RAD5 [Puccinia graminis f. sp. tritici CRL
           75-36-700-3]
          Length = 1036

 Score = 41.2 bits (95), Expect = 0.45,   Method: Composition-based stats.
 Identities = 37/206 (17%), Positives = 63/206 (30%), Gaps = 16/206 (7%)

Query: 38  WGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAG-----RGIGKTT 92
           WG+ G  +E     ++ Q + +E+              +   G  S G      G+GKT 
Sbjct: 395 WGDLGQKVEVVQPSKAEQPDGLELTLLPFQLEGLYWMKKQETGPWSGGVLADEMGMGKTI 454

Query: 93  LNAWLVLWLMSTRPGIS--VICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEMQSLSLH 149
               L+L      PG     + +A +   +    W  E+ K+   L    W      +  
Sbjct: 455 QTIALIL--SDRVPGHRKQTLVIAPT---VAIMQWRNEIEKFAKGLTVNVWHGGNRSNAQ 509

Query: 150 PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVG--HHNTYGMAIINDEASGTPDVI 207
                 DV+  S  +    +      +  +          H      +I DEA    D  
Sbjct: 510 EEMENFDVVLTSFAVLESAFRRQNSGFRRKGQIIKESSLLHQINWHRVILDEAHNIKDRS 569

Query: 208 NLGILGFLTERNANRFWIMTSNPRRL 233
                G   E  A   W ++  P + 
Sbjct: 570 CNTAKGAF-ELKATYRWCLSGTPLQN 594


>gi|315618351|gb|EFU98939.1| type I site-specific deoxyribonuclease, HsdR family protein
           [Escherichia coli 3431]
          Length = 1028

 Score = 40.9 bits (94), Expect = 0.47,   Method: Composition-based stats.
 Identities = 49/327 (14%), Positives = 99/327 (30%), Gaps = 44/327 (13%)

Query: 86  RGIGKTTLNAWLVLWLMSTRPGISV-ICLANSE--TQLKTTLWAEVSKWLSLLPNKHWFE 142
           +G GK+    WL  W+    P   V I    +E   Q+++     V++   +       +
Sbjct: 275 QGSGKSLTMVWLAKWIRENVPNSRVLIVTDRTELDEQIESVFMG-VNE--DIYRTSSGND 331

Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY--SEERPDTFVGHHNTYGMAIINDE- 199
           + +   HP PW    L    G  S+           +E +            + +  DE 
Sbjct: 332 LIATLNHPNPWLICSLVHKFGRRSEAEDNAATDAFITELQQSLTKTFRAKGDLFVFVDEC 391

Query: 200 ------------ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD 247
                        +  PD + +G  G    +   +  +       + G +   +      
Sbjct: 392 HRTQSGKLHNAMTAILPDALFIGFTGTPLMKKDKKKSV------EVFGPYIHTYKFDEAV 445

Query: 248 WKRFQIDTR------TVEGIDPSFHEGIIARYGLD-SDVTRVEVCGQFPQQDIDSFIPLN 300
                +D R                +          S++ R ++  ++           +
Sbjct: 446 ADGVVLDLRYEARDIDQYLTSEKKVDDWFEAKTRGLSNLARTQLKQKWGSMQ-KLLSSKS 504

Query: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKI 360
            +E+ +N         P +M      +G  N ++V     + +    +          K+
Sbjct: 505 RLEQIVNDILLDMDTRPRLM------DGRGNAMLVC--SSIYQACKVYEMFSQTELAGKV 556

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYL 387
           + +V  YRPDA  I    TG    + L
Sbjct: 557 A-IVTSYRPDAASIKGEETGEGLTEKL 582


>gi|290954633|ref|YP_003485815.1| helicase-like protein [Streptomyces scabiei 87.22]
 gi|290963375|ref|YP_003494557.1| helicase-like protein [Streptomyces scabiei 87.22]
 gi|260644159|emb|CBG67232.1| putative helicase-like protein [Streptomyces scabiei 87.22]
 gi|260652901|emb|CBG76036.1| putative helicase-like protein [Streptomyces scabiei 87.22]
          Length = 889

 Score = 40.9 bits (94), Expect = 0.47,   Method: Composition-based stats.
 Identities = 10/54 (18%), Positives = 22/54 (40%), Gaps = 3/54 (5%)

Query: 67  LNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
            ++ ++  P+  +G I +  G GKT + A      +   P   ++    +   L
Sbjct: 23  FSARSSVPPQGARGTIVSATGSGKTIMAAASA---LECFPEGRILVTVPTLDLL 73


>gi|326779045|ref|ZP_08238310.1| type III restriction protein res subunit [Streptomyces cf. griseus
           XylebKG-1]
 gi|326659378|gb|EGE44224.1| type III restriction protein res subunit [Streptomyces cf. griseus
           XylebKG-1]
          Length = 609

 Score = 40.9 bits (94), Expect = 0.48,   Method: Composition-based stats.
 Identities = 31/124 (25%), Positives = 41/124 (33%), Gaps = 24/124 (19%)

Query: 6   PTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAH 65
           P +PE E           +  +F        PWG  G         R+WQ   ME     
Sbjct: 8   PESPEPETVTTTTASH-HLSPAFPGRA----PWGTAG-------KLRAWQQGAME----- 50

Query: 66  CLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
               V     +    A     G GKTT    L  WL+       +  +A +E  LK   W
Sbjct: 51  --RYVQEQPRDFLAVATP---GAGKTTFALTLASWLLHHHVVQQITVVAPTEH-LKKQ-W 103

Query: 126 AEVS 129
           AE +
Sbjct: 104 AEAA 107


>gi|156847104|ref|XP_001646437.1| hypothetical protein Kpol_1048p9 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156117114|gb|EDO18579.1| hypothetical protein Kpol_1048p9 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 1055

 Score = 40.9 bits (94), Expect = 0.48,   Method: Composition-based stats.
 Identities = 24/134 (17%), Positives = 48/134 (35%), Gaps = 10/134 (7%)

Query: 80  GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKH 139
             ++AGRG GK+      +   +S     ++   + S   LKT L+  + K    L  + 
Sbjct: 279 VTLTAGRGRGKSAALGISIAAAVS-HGYSNIFVTSPSPENLKT-LFEFIFKAFDALGYQE 336

Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
             +   +      +   ++   +  D  H  T+     ++               ++ DE
Sbjct: 337 HIDYDIIQSTNPQFNKAIVRVDIKRD--HRQTIQYIMPQDHQVLGQAE------LVVIDE 388

Query: 200 ASGTPDVINLGILG 213
           A+  P  I   +LG
Sbjct: 389 AAAIPLPIVKKLLG 402


>gi|49476071|ref|YP_034112.1| phage related protein [Bartonella henselae str. Houston-1]
 gi|49238879|emb|CAF28172.1| phage related protein [Bartonella henselae str. Houston-1]
          Length = 402

 Score = 40.9 bits (94), Expect = 0.49,   Method: Composition-based stats.
 Identities = 35/257 (13%), Positives = 79/257 (30%), Gaps = 13/257 (5%)

Query: 129 SKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLG-IDSKHYSTMCRTYSEERPDTFVGH 187
           ++      N+   E    ++   P+  D        I SK    +      +R       
Sbjct: 21  ARQFQNSLNESSLEEIKRAIESYPFLQDYYEIGDKYIKSKDGRIVYVFAGLDR--NIASI 78

Query: 188 HNTYGMAI-INDEASGTPDVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFNK- 243
            +   + +   DEA    +     ++  L E     N    +T NP R +    + F   
Sbjct: 79  KSMGRVFLCWVDEAEPVTETAWQTLIPTLREEGNDWNAELWVTWNPCRENAPVEKRFRNV 138

Query: 244 PLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIE 303
                K  +I  R         +    A      +       G++ Q    ++    ++E
Sbjct: 139 NNPHIKGAEITWRDNPQFPEKLNRDRKADLEQRPEHYNHIWEGEYLQTVEGAYYQKALLE 198

Query: 304 EALNREPCPDPYAP---LIMGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTN 357
            +        P  P   + +  DI   G   D T + + +       + D+ +   +  +
Sbjct: 199 ASREGRITTVPRDPLMQIRIFWDIGGTGAKADATALWVAQFVGREIRVLDYYEAQGQPLS 258

Query: 358 NKISGLVEKYRPDAIII 374
             +  + ++    A+++
Sbjct: 259 EHVGWVFQRGYDKALMV 275


>gi|6467533|gb|AAF13179.1|AF181080_1 putative gene transfer agent large terminase [Rhodobacter
           capsulatus]
          Length = 393

 Score = 40.9 bits (94), Expect = 0.50,   Method: Composition-based stats.
 Identities = 66/419 (15%), Positives = 120/419 (28%), Gaps = 68/419 (16%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTTLWAEVSKWLSL 134
            GRG GKT   A    W+     G           V  +  +  Q++  +          
Sbjct: 2   GGRGAGKTRAGA---EWVRMQVEGAGPADAGPAHRVALVGETFDQVRDVM---------- 48

Query: 135 LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
                 F    +     P        +            + YS + P+   G       A
Sbjct: 49  -----IFGESGILACSPPDRRPEWEATKRRLVWANGATAQAYSAQEPEALRGPQFD---A 100

Query: 195 IINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
              DE +     +     +   L     +   ++T+ P R  G    I N P        
Sbjct: 101 AWVDELAKWRRAEETWDMLQFAL-RLGKHPQQVITTTP-RNVGVLKAILNNPSTV-VTHA 157

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCP 312
                   +  SF   + ARY   + + R E+ G   +    +      +E    R   P
Sbjct: 158 PTEANRAYLAESFLAEVQARY-AGTRLGRQELEGVLLEDVEGALWTTAQLEGL--RLASP 214

Query: 313 DPYAPLIMGCDIA---EEGGDNTVVVL--------RRGPVIEHLFDWS-KTDLRTTNNKI 360
                +++  D A     G D   +V+         +      L D S +          
Sbjct: 215 PAMDRVVVALDPAVTGGAGSDECGIVVAGAVTRGPVQDWRAFVLEDASVRGRPTDWARAA 274

Query: 361 SGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKM 420
              +E++  + ++ + N  G      L  +   V       +A+     ++ R E    +
Sbjct: 275 IAAMERWGAEKLVAEVNQGGEMVESVLRQIDPLV-----PFKALRASRGKSARAE---PV 326

Query: 421 ADWLEFASLINH-SGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAE 478
           A   E   + +   G +  L+  +   +   G          G  S D  D L++   E
Sbjct: 327 AALYEQGRVKHCRDGRLGALED-QMCRMTVRGY--------AGKGSPDRVDALVWAMTE 376


>gi|148548588|ref|YP_001268690.1| hypothetical protein Pput_3380 [Pseudomonas putida F1]
 gi|148512646|gb|ABQ79506.1| protein of unknown function DUF264 [Pseudomonas putida F1]
          Length = 433

 Score = 40.9 bits (94), Expect = 0.51,   Method: Composition-based stats.
 Identities = 55/303 (18%), Positives = 93/303 (30%), Gaps = 43/303 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVS-KWLSLLPNKH 139
           AG G GKT +    +   +   P I     A +  Q++   +    EV+  W   +  K 
Sbjct: 23  AGFGSGKTWVGCAALCKHVWEWPRIDSGYFAPTYPQIRDIFFPTIEEVAFDWGLKVKTKE 82

Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
                          SD             +T+CR  S E+P T VG    + +    DE
Sbjct: 83  ---------------SDKEVEFYSGGQYRSTTICR--SMEKPQTIVGFKIGHAL---VDE 122

Query: 200 ASGTP----DVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIFNKPL-------D 246
               P    +     I+  +            +T+ P       Y+ F K L        
Sbjct: 123 LDVLPALKAEHAWRKIIARMRYNVPGLKNGVDVTTTPEG-FKFVYQQFVKQLREKPALQG 181

Query: 247 DWKRFQIDTRTVEG-IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305
            +   Q  T   E  + P +   ++  Y   + +    + GQF   +  S I      + 
Sbjct: 182 MYGLVQASTFDNELNLPPDYIPSLMESY--PAQLILAYLNGQFVNLNAGS-IYHAYDRKL 238

Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFD-WSKTDLRTTNNKISGLV 364
            +     +P  PL +G D         V V R       + +     D      +I    
Sbjct: 239 NSCFDTVEPGEPLFIGMDFNVGKMAAIVHVKRPDGKPRAVDELIDGFDTPDMIRRIKERY 298

Query: 365 EKY 367
            ++
Sbjct: 299 WRH 301


>gi|51557524|ref|YP_068358.1| DNA packaging terminase subunit 1 [Suid herpesvirus 1]
 gi|40253983|tpg|DAA02178.1| TPA_exp: UL15 protein [Suid herpesvirus 1]
          Length = 735

 Score = 40.9 bits (94), Expect = 0.52,   Method: Composition-based stats.
 Identities = 26/153 (16%), Positives = 49/153 (32%), Gaps = 24/153 (15%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS----KWLSLLPNKHWFEMQ 144
           GKT     L+   ++T  GI V   A+     +   + E+     +W       H     
Sbjct: 277 GKTWFLVPLIALALATFRGIRVGYTAHIRKATEPV-FEEIHARLRRWCRDARVDHVKGEN 335

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204
                P    S ++                  S    +   G    + +  + DEA+   
Sbjct: 336 ITVTFPDGARSTIVF----------------ASSHNTNGIRGQ--DFNLLFV-DEANFIR 376

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
                 ILGF+ + +    ++ ++N  + S  F
Sbjct: 377 PDAVQTILGFMNQASCKIIFVSSTNTGKASTSF 409


>gi|28395422|gb|AAO38880.1| UL15 [Suid herpesvirus 1]
          Length = 753

 Score = 40.9 bits (94), Expect = 0.52,   Method: Composition-based stats.
 Identities = 26/153 (16%), Positives = 49/153 (32%), Gaps = 24/153 (15%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVS----KWLSLLPNKHWFEMQ 144
           GKT     L+   ++T  GI V   A+     +   + E+     +W       H     
Sbjct: 293 GKTWFLVPLIALALATFRGIRVGYTAHIRKATEPV-FEEIHARLRRWCRDARVDHVKGEN 351

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204
                P    S ++                  S    +   G    + +  + DEA+   
Sbjct: 352 ITVTFPDGARSTIVF----------------ASSHNTNGIRGQ--DFNLLFV-DEANFIR 392

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
                 ILGF+ + +    ++ ++N  + S  F
Sbjct: 393 PDAVQTILGFMNQASCKIIFVSSTNTGKASTSF 425


>gi|330989588|gb|EGH87691.1| hypothetical protein PLA107_31509 [Pseudomonas syringae pv.
           lachrymans str. M301315]
          Length = 433

 Score = 40.9 bits (94), Expect = 0.53,   Method: Composition-based stats.
 Identities = 54/303 (17%), Positives = 90/303 (29%), Gaps = 43/303 (14%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA---EVS-KWLSLLPNKH 139
           AG G GKT +    +   +   P I+    A +  Q++   +    EV+  W   +  K 
Sbjct: 23  AGFGSGKTWVGCAGICKHVWEWPRINSGYFAPTYPQIRDIFFPTIEEVAFDWGLKVKTKE 82

Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
                          SD             +T+CR  S E+P T VG    + +    DE
Sbjct: 83  ---------------SDKEVEFYSGGQYRSTTICR--SMEKPQTIVGFKIGHAL---VDE 122

Query: 200 ASGTP----DVINLGILGFL--TERNANRFWIMTSNPRRLSGKFYEIFNKPL-------D 246
               P    +     I+  +            +T+ P       Y+ F K L        
Sbjct: 123 LDVLPALKAEHAWRKIIARMRYNAPGLKNGVDVTTTPEG-FKFVYQQFVKQLREKPGMQG 181

Query: 247 DWKRFQIDTRTVEG-IDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305
            +   Q  T   E  + P +   ++  Y     +    + GQF   +  S I      + 
Sbjct: 182 MYGLVQASTFDNELNLPPDYIPSLMESY--PPQLILAYLNGQFVNLNAGS-IYHAYDRKL 238

Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW-SKTDLRTTNNKISGLV 364
                      PL +G D           V R       + ++    D      +I    
Sbjct: 239 NGCFDSVQDGEPLFIGMDFNVGKMAAITHVKRADGKPRAVDEFIDGFDTPDMIRRIKERY 298

Query: 365 EKY 367
            +Y
Sbjct: 299 WRY 301


>gi|307940746|gb|ADN95987.1| polyprotein [Chionodraco hamatus]
          Length = 2968

 Score = 40.9 bits (94), Expect = 0.53,   Method: Composition-based stats.
 Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 7/69 (10%)

Query: 55   QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWL------VLWLMSTRPGI 108
            Q+     +   CL+ ++  NP+ F   I+ G G GK+ L   L      +L  +   P  
Sbjct: 2284 QMSIFYQIRQWCLDKISGKNPDPFHVFITGGAGTGKSHLIKALQYETTRLLSPLCDHPDS 2343

Query: 109  S-VICLANS 116
              V+  A +
Sbjct: 2344 VCVLLTAPT 2352


>gi|294677220|ref|YP_003577835.1| terminase-like family protein [Rhodobacter capsulatus SB 1003]
 gi|294476040|gb|ADE85428.1| terminase-like family protein [Rhodobacter capsulatus SB 1003]
          Length = 455

 Score = 40.9 bits (94), Expect = 0.53,   Method: Composition-based stats.
 Identities = 67/421 (15%), Positives = 121/421 (28%), Gaps = 68/421 (16%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGI---------SVICLANSETQLKTTLWAEVSKWL 132
           I  GRG GKT   A    W+     G           V  +  +  Q++  +        
Sbjct: 62  IMGGRGAGKTRAGA---EWVRMQVEGAGPADAGPAHRVALVGETFDQVRDVM-------- 110

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYG 192
                   F    +     P        +            + YS + P+   G      
Sbjct: 111 -------IFGESGILACSPPDRRPEWEATKRRLVWANGATAQAYSAQEPEALRGPQFD-- 161

Query: 193 MAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250
            A   DE +     +     +   L     +   ++T+ P R  G    I N P      
Sbjct: 162 -AAWVDELAKWRRAEETWDMLQFAL-RLGKHPQQVITTTP-RNVGVLKAILNNPSTV-VT 217

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
                     +  SF   + ARY   + + R E+ G   +    +      +E    R  
Sbjct: 218 HAPTEANRAYLAESFLAEVQARY-AGTRLGRQELEGVLLEDVEGALWTTAQLEGL--RLA 274

Query: 311 CPDPYAPLIMGCDIA---EEGGDNTVVVL--------RRGPVIEHLFDWS-KTDLRTTNN 358
            P     +++  D A     G D   +V+         +      L D S +        
Sbjct: 275 SPPAMDRVVVALDPAVTGGAGSDECGIVVAGAVTRGPVQDWRAFVLEDASVRGRPTDWAR 334

Query: 359 KISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHV 418
                +E++  + ++ + N  G      L  +   V       +A+     ++ R E   
Sbjct: 335 AAIAAMERWGAEKLVAEVNQGGEMVESVLRQIDPLV-----PFKALRASRGKSARAE--- 386

Query: 419 KMADWLEFASLINH-SGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFA 477
            +A   E   + +   G +  L+  +   +   G          G  S D  D L++   
Sbjct: 387 PVAALYEQGRVKHCRDGRLGALED-QMCRMTVRGY--------AGKGSPDRVDALVWAMT 437

Query: 478 E 478
           E
Sbjct: 438 E 438


>gi|225683146|gb|EEH21430.1| activating signal cointegrator 1 complex subunit 3 [Paracoccidioides
            brasiliensis Pb03]
          Length = 2011

 Score = 40.9 bits (94), Expect = 0.54,   Method: Composition-based stats.
 Identities = 24/118 (20%), Positives = 40/118 (33%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    RPG  V+ +A         L  E V  W   L      +
Sbjct: 1163 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAP-----MKALVRERVHDWKRRLTVPMGLK 1217

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    +    + I   + +  + R++         G+     + II DE
Sbjct: 1218 LVELTGDNTPDTKTIRDSDIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1268


>gi|219846951|ref|YP_002333526.2| DNA packaging terminase subunit 1 [Equid herpesvirus 9]
 gi|226423816|dbj|BAH02470.2| DNA packaging protein [Equid herpesvirus 9]
          Length = 734

 Score = 40.9 bits (94), Expect = 0.55,   Method: Composition-based stats.
 Identities = 28/153 (18%), Positives = 52/153 (33%), Gaps = 24/153 (15%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLAN----SETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144
           GKT     L+   ++T  GI +   A+    +E  +   + A + +W    P  H     
Sbjct: 264 GKTWFLVPLIALALATFKGIKIGYTAHIRKATEP-VFDEIGARLRQWFGNSPVDHVKGEN 322

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204
                P    S ++                  S    +   G    + +  + DEA+   
Sbjct: 323 ISFSFPDGSKSTIVF----------------ASSHNTNGIRGQ--DFNLLFV-DEANFIR 363

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
                 I+GFL + N    ++ ++N  + S  F
Sbjct: 364 PEAVQTIIGFLNQTNCKIIFVSSTNTGKASTSF 396


>gi|38640180|ref|NP_944136.1| Dda DNA helicase [Aeromonas phage Aeh1]
 gi|33414865|gb|AAQ17908.1| Dda DNA helicase [Aeromonas phage Aeh1]
          Length = 454

 Score = 40.9 bits (94), Expect = 0.55,   Method: Composition-based stats.
 Identities = 28/156 (17%), Positives = 50/156 (32%), Gaps = 28/156 (17%)

Query: 58  FMEVVDAHCLNSVNNPNPEVFK-GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANS 116
             +++   C  +  +      K   IS   G GK+ L   L+  L+    G  + C A +
Sbjct: 8   LAKIILTDCQKTAIDAVLTDKKHITISGPAGSGKSFLTKILIQKLLDLNSGAVITC-APT 66

Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY 176
             Q K  L                          + + +  +H  L I    Y  + R +
Sbjct: 67  -HQAKIVL-----------------------SKMSGFTASTIHSVLKIHPDTYEDV-REF 101

Query: 177 SEERPDTFVGHHNTYGMAIINDEASGTPDVINLGIL 212
            + + D            +I DEAS   + +   +L
Sbjct: 102 KQSKSDK-AKEDLKAVRYLIVDEASMVDNDLFEILL 136


>gi|9629774|ref|NP_045262.1| DNA packaging terminase subunit 1 [Equid herpesvirus 4]
 gi|2605992|gb|AAC59564.1| 47/44 [Equid herpesvirus 4]
          Length = 734

 Score = 40.9 bits (94), Expect = 0.55,   Method: Composition-based stats.
 Identities = 28/153 (18%), Positives = 52/153 (33%), Gaps = 24/153 (15%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLAN----SETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144
           GKT     L+   ++T  GI +   A+    +E  +   + A + +W    P  H     
Sbjct: 264 GKTWFLVPLIALALATFKGIKIGYTAHIRKATEP-VFDEIGARLRQWFGNSPVDHVKGEN 322

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204
                P    S ++                  S    +   G    + +  + DEA+   
Sbjct: 323 ISFSFPDGSKSTIVF----------------ASSHNTNGIRGQ--DFNLLFV-DEANFIR 363

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
                 I+GFL + N    ++ ++N  + S  F
Sbjct: 364 PEAVQTIIGFLNQTNCKIIFVSSTNTGKASTSF 396


>gi|50313286|ref|YP_053090.1| DNA packaging terminase subunit 1 [Equid herpesvirus 1]
 gi|139648|sp|P28969|TRM3_EHV1B RecName: Full=Tripartite terminase subunit UL15 homolog; AltName:
           Full=DNA-packaging protein 44; AltName: Full=Terminase
           large subunit
 gi|59798996|sp|P84396|TRM3_EHV1V RecName: Full=Tripartite terminase subunit UL15 homolog; AltName:
           Full=DNA-packaging protein 44; AltName: Full=Terminase
           large subunit
 gi|42795172|gb|AAS45929.1| putative terminase [Equid herpesvirus 1]
 gi|49617029|gb|AAT67302.1| DNA packaging protein [Equid herpesvirus 1]
          Length = 734

 Score = 40.9 bits (94), Expect = 0.55,   Method: Composition-based stats.
 Identities = 28/153 (18%), Positives = 52/153 (33%), Gaps = 24/153 (15%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLAN----SETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144
           GKT     L+   ++T  GI +   A+    +E  +   + A + +W    P  H     
Sbjct: 264 GKTWFLVPLIALALATFKGIKIGYTAHIRKATEP-VFDEIGARLRQWFGNSPVDHVKGEN 322

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204
                P    S ++                  S    +   G    + +  + DEA+   
Sbjct: 323 ISFSFPDGSKSTIVF----------------ASSHNTNGIRGQ--DFNLLFV-DEANFIR 363

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
                 I+GFL + N    ++ ++N  + S  F
Sbjct: 364 PEAVQTIIGFLNQTNCKIIFVSSTNTGKASTSF 396


>gi|116196286|ref|XP_001223955.1| hypothetical protein CHGG_04741 [Chaetomium globosum CBS 148.51]
 gi|88180654|gb|EAQ88122.1| hypothetical protein CHGG_04741 [Chaetomium globosum CBS 148.51]
          Length = 2013

 Score = 40.9 bits (94), Expect = 0.56,   Method: Composition-based stats.
 Identities = 28/152 (18%), Positives = 48/152 (31%), Gaps = 19/152 (12%)

Query: 55   QLEFMEVVDAHCLNSVNNPN-----PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGIS 109
             LE +     H  N +                + +  G GKT      + W    RPG  
Sbjct: 1135 ALEEIYAQRFHFFNPMQTQLFHTLYHRPANVLLGSPTGSGKTVAAELAMWWAFRERPGSK 1194

Query: 110  VICLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI-DSK 167
            V+ +A         L  E V  W + L      ++  L+    P    +    + I   +
Sbjct: 1195 VVYIAP-----MKALVRERVKDWGARLAKPLGLKLVELTGDNTPDTRTIQDADIIITTPE 1249

Query: 168  HYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
             +  + R++         G+     + II DE
Sbjct: 1250 KWDGISRSWQT------RGYVRKVSLVII-DE 1274


>gi|317499861|ref|ZP_07958099.1| pbsx family Phage terminase [Lachnospiraceae bacterium 8_1_57FAA]
 gi|316898763|gb|EFV20796.1| pbsx family Phage terminase [Lachnospiraceae bacterium 8_1_57FAA]
          Length = 428

 Score = 40.9 bits (94), Expect = 0.57,   Method: Composition-based stats.
 Identities = 32/207 (15%), Positives = 64/207 (30%), Gaps = 28/207 (13%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD------ 247
            +  +E S         ILG L     +   I+++NP       Y+ F +          
Sbjct: 121 IVWIEECSEVKYAGFKEILGRLRHPTLSNHIILSTNPVSKGNWCYKYFFQDKKKKVFVLD 180

Query: 248 ----------------WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQ 291
                           +    +D      +   + E +      D D+ RV   G+F   
Sbjct: 181 DEKLYKERTVVVGNTYYHHSTVD--DNFFVPKEYVEQLDDLQTHDPDLYRVARQGRFGVN 238

Query: 292 DIDSFIPLNIIEEA--LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS 349
               F P  ++E A  + +E           G D       N  + +      + L+ + 
Sbjct: 239 GSLVF-PQFVVEPANQVEKEIKAIRTPLEKNGMDFGFVTSYNAALRMIVDHDEKILYIYR 297

Query: 350 K-TDLRTTNNKISGLVEKYRPDAIIID 375
           +      T+ +I+  ++ ++   I  D
Sbjct: 298 EYYSRNKTDPEIAEDMKDWKDIVIKAD 324


>gi|189913376|ref|YP_001964605.1| ATP-dependent RNA helicase, DEAD-box family (DeaD) [Leptospira
           biflexa serovar Patoc strain 'Patoc 1 (Paris)']
 gi|167781444|gb|ABZ99741.1| ATP-dependent RNA helicase, DEAD-box family (DeaD) [Leptospira
           biflexa serovar Patoc strain 'Patoc 1 (Paris)']
          Length = 534

 Score = 40.9 bits (94), Expect = 0.57,   Method: Composition-based stats.
 Identities = 42/270 (15%), Positives = 73/270 (27%), Gaps = 41/270 (15%)

Query: 21  SDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKG 80
             E+   F +F L   P   +G    GF +P   Q + + +V                  
Sbjct: 10  DTEVGNDFQSFGLR--PEILQGITEAGFESPSPIQKQAIPLVLEGKDLIAQAQT------ 61

Query: 81  AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET---QLKTTLWAEVSKWLSLLPN 137
                 G GKT       L  +    G+ V+ L  +     Q+   L+            
Sbjct: 62  ------GTGKTAAYGLPCLNRIKVEDGMQVLVLTPTRELALQVSDELFK----------L 105

Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197
                +++ +++    YS  +           +T  R     +         +    +I 
Sbjct: 106 GKHLGIKTTTIYGGSSYSKQITQVAKGAQVAVATPGRLLDLLKGKELKNFKPSM---VIL 162

Query: 198 DEAS-----GTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
           DEA      G  D I        T+R    F      P +     Y+            +
Sbjct: 163 DEADEMLDMGFMDDIESIFNLLPTKRQTLLFSATMPEPIKKLASKYQTHP------AHVK 216

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRV 282
           I        +      II     +  V R+
Sbjct: 217 IAATEKSSKNIEQVYYIIDEAEREIAVVRI 246


>gi|189913047|ref|YP_001964936.1| ATP-dependent RNA helicase (superfamily II) [Leptospira biflexa
           serovar Patoc strain 'Patoc 1 (Ames)']
 gi|167777723|gb|ABZ96023.1| ATP-dependent RNA helicase (superfamily II) [Leptospira biflexa
           serovar Patoc strain 'Patoc 1 (Ames)']
          Length = 529

 Score = 40.9 bits (94), Expect = 0.57,   Method: Composition-based stats.
 Identities = 42/270 (15%), Positives = 73/270 (27%), Gaps = 41/270 (15%)

Query: 21  SDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKG 80
             E+   F +F L   P   +G    GF +P   Q + + +V                  
Sbjct: 5   DTEVGNDFQSFGLR--PEILQGITEAGFESPSPIQKQAIPLVLEGKDLIAQAQT------ 56

Query: 81  AISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET---QLKTTLWAEVSKWLSLLPN 137
                 G GKT       L  +    G+ V+ L  +     Q+   L+            
Sbjct: 57  ------GTGKTAAYGLPCLNRIKVEDGMQVLVLTPTRELALQVSDELFK----------L 100

Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197
                +++ +++    YS  +           +T  R     +         +    +I 
Sbjct: 101 GKHLGIKTTTIYGGSSYSKQITQVAKGAQVAVATPGRLLDLLKGKELKNFKPSM---VIL 157

Query: 198 DEAS-----GTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQ 252
           DEA      G  D I        T+R    F      P +     Y+            +
Sbjct: 158 DEADEMLDMGFMDDIESIFNLLPTKRQTLLFSATMPEPIKKLASKYQTHP------AHVK 211

Query: 253 IDTRTVEGIDPSFHEGIIARYGLDSDVTRV 282
           I        +      II     +  V R+
Sbjct: 212 IAATEKSSKNIEQVYYIIDEAEREIAVVRI 241


>gi|315649164|ref|ZP_07902254.1| Tex-like protein protein-like protein [Paenibacillus vortex V453]
 gi|315275383|gb|EFU38741.1| Tex-like protein protein-like protein [Paenibacillus vortex V453]
          Length = 737

 Score = 40.9 bits (94), Expect = 0.58,   Method: Composition-based stats.
 Identities = 21/107 (19%), Positives = 35/107 (32%), Gaps = 9/107 (8%)

Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVI 342
           EV G+  ++  +  I +       +    P      ++G D A   G    VV   G ++
Sbjct: 300 EVRGELTEKGENQAISIF-AGNLRSLLLQPPVKGRCVLGVDPAYRTGCKLAVVDDTGKLL 358

Query: 343 EHLFDWSK---TDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY 386
           E    +        +    K   L+ KY    I+I     G  T   
Sbjct: 359 EVAVTYPTPPANKRQEAAAKFKQLIAKYGIKLIVI-----GNGTASR 400


>gi|116625332|ref|YP_827488.1| hypothetical protein Acid_6277 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116228494|gb|ABJ87203.1| hypothetical protein Acid_6277 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 212

 Score = 40.5 bits (93), Expect = 0.60,   Method: Composition-based stats.
 Identities = 23/140 (16%), Positives = 45/140 (32%), Gaps = 13/140 (9%)

Query: 323 DIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGAR 382
           D A      T + LR   V           +     K+   +    P  +++DA   GA 
Sbjct: 33  DPATYEFRKT-ITLRLRHVERIPLATEYVQVVERVAKVMRKLGAQGPAHLVVDATGVGAP 91

Query: 383 TCDYLEMLG-----YHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLE------FASLIN 431
             + L   G     + V    G        + R  + +L V +    E         L+ 
Sbjct: 92  VVELLRRAGMGCRLWPVSITGGPAEGYGDGYYRVPKRDLVVGLQVMFEQGALEIAGGLVE 151

Query: 432 HSGLIQNLKSLKSFIVPNTG 451
            + L++ +  ++   + + G
Sbjct: 152 RAALVKEMTDMRV-KMTSRG 170


>gi|261409036|ref|YP_003245277.1| Tex-like protein [Paenibacillus sp. Y412MC10]
 gi|261285499|gb|ACX67470.1| Tex-like protein protein-like protein [Paenibacillus sp. Y412MC10]
          Length = 740

 Score = 40.5 bits (93), Expect = 0.61,   Method: Composition-based stats.
 Identities = 22/107 (20%), Positives = 35/107 (32%), Gaps = 9/107 (8%)

Query: 283 EVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVI 342
           EV G+  ++  +  I +       +    P      ++G D A   G    VV   G ++
Sbjct: 300 EVRGELTEKGENQAISIF-AGNLRSLLLQPPVKGRRVLGVDPAYRTGCKLAVVDDTGKLL 358

Query: 343 EHLFDWSK---TDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY 386
           E    +        R    K   L+ KY    I+I     G  T   
Sbjct: 359 EVAVTYPTPPANKRREAAAKFKELIAKYGIKLIVI-----GNGTASR 400


>gi|72021085|ref|XP_793570.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]
 gi|115928806|ref|XP_001188414.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]
          Length = 1117

 Score = 40.5 bits (93), Expect = 0.61,   Method: Composition-based stats.
 Identities = 21/129 (16%), Positives = 39/129 (30%), Gaps = 11/129 (8%)

Query: 82  ISAGRGIGKTTLNAWLVLWLM-STRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140
           + A  G GKT + + + L  +  T P   V+ LA +          E++  +        
Sbjct: 8   VQAKSGTGKTCVFSVIALEGIDLTNPSTQVLILAPT---------REIAVQIQDTIRAIG 58

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
            EM+ L  H     +        +   H +        ++   +           + DEA
Sbjct: 59  CEMEGLRSHVFIGGTLFGPDRQKLKKCHIAVGTPG-RIKQLIEYEVLKTGTIRLFVLDEA 117

Query: 201 SGTPDVINL 209
               D    
Sbjct: 118 DKLLDDTFQ 126


>gi|209694357|ref|YP_002262285.1| putative bacteriophage terminase [Aliivibrio salmonicida LFI1238]
 gi|208008308|emb|CAQ78458.1| putative bacteriophage terminase [Aliivibrio salmonicida LFI1238]
          Length = 598

 Score = 40.5 bits (93), Expect = 0.62,   Method: Composition-based stats.
 Identities = 34/246 (13%), Positives = 73/246 (29%), Gaps = 41/246 (16%)

Query: 251 FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
             +D     G      + +  +Y + S +    +   F       F     I+  L  + 
Sbjct: 342 ITVDDAIKGGATFFNMDKLRRKYPIKS-IFDNVLRCVFLDDSASFF----NIKALLACKT 396

Query: 311 CPDPYAPLIMG-CDIAEE----------------GGDNTVVV-----LRRGPVIEHLFDW 348
               +  + MG C  A +                G D+  +V     L++G V   L   
Sbjct: 397 DTSKWKTIDMGKCRPAGDLEVLVGYDPRGGGQADGSDDAGLVISLKPLKKGGVFRFLERI 456

Query: 349 S--KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDL 406
               +        I G+ EKY    + +D +  G+   + +      +  V         
Sbjct: 457 RLKGSSYEEQAKAIEGITEKYHVVHLEMDTSGVGSAVAELVRKFYPSLKEVNYSPEV--- 513

Query: 407 EFCRNRRTELHVKMADWLEFASLINH---SGLIQNLKSLKSFIVPNTGELAIESKRVKGA 463
                +R  +  K  + +    L        ++ +   ++      + ++ + S R K  
Sbjct: 514 -----KRM-MAYKAREIINAGRLQFDDSWDDVVHSFLMIRQHTTKASNQITMISTRTKRG 567

Query: 464 KSTDYS 469
              D +
Sbjct: 568 SHADLA 573


>gi|19114536|ref|NP_593624.1| ATP-dependent 3' to 5' DNA helicase (predicted)
           [Schizosaccharomyces pombe 972h-]
 gi|74698622|sp|Q9HE09|MFH2_SCHPO RecName: Full=Putative ATP-dependent RNA helicase mfh2; AltName:
           Full=FancM homolog protein 2
 gi|12038920|emb|CAC19734.1| ATP-dependent 3' to 5' DNA helicase (predicted)
           [Schizosaccharomyces pombe]
          Length = 783

 Score = 40.5 bits (93), Expect = 0.62,   Method: Composition-based stats.
 Identities = 18/115 (15%), Positives = 36/115 (31%), Gaps = 13/115 (11%)

Query: 87  GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSL 146
           G+GKT + A ++L      P   +I LA ++  L              +   +   M   
Sbjct: 134 GLGKTFIAAVVMLNYFRWFPESKIIFLAPTKPLL----------LQQRVACSNVAGMSPG 183

Query: 147 SLHPAPWYSDVLHCSLGIDSKH-YSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
           +                 ++K  +    +T   +  +          + +I DEA
Sbjct: 184 ATAELNGEVSPDRRLFEYNTKRVFFMTPQTLQNDLKEHL--LDAKSIICLIFDEA 236


>gi|329936128|ref|ZP_08285927.1| helicase-like protein [Streptomyces griseoaurantiacus M045]
 gi|329304446|gb|EGG48325.1| helicase-like protein [Streptomyces griseoaurantiacus M045]
          Length = 1056

 Score = 40.5 bits (93), Expect = 0.62,   Method: Composition-based stats.
 Identities = 13/66 (19%), Positives = 21/66 (31%), Gaps = 3/66 (4%)

Query: 55  QLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLA 114
           Q            N+ +    +  +G I +  G GKT   A      +   PG  V+   
Sbjct: 18  QKARFREWAGSPFNTRSPVPEQGSRGTIVSATGSGKTITAAACA---LECFPGARVLVTV 74

Query: 115 NSETQL 120
            +   L
Sbjct: 75  PTLDLL 80


>gi|147668985|ref|YP_001213803.1| hypothetical protein DehaBAV1_0339 [Dehalococcoides sp. BAV1]
 gi|146269933|gb|ABQ16925.1| hypothetical protein DehaBAV1_0339 [Dehalococcoides sp. BAV1]
          Length = 457

 Score = 40.5 bits (93), Expect = 0.63,   Method: Composition-based stats.
 Identities = 39/251 (15%), Positives = 69/251 (27%), Gaps = 53/251 (21%)

Query: 249 KRFQIDTRTVEGIDPSFHE---GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305
           + F+ D   V   +P++         R G +  +   +     P            ++  
Sbjct: 185 RHFRYDWEAVAAHNPAYLAYALSEKERLGENHPLFLTQYR-LLPVSGGGGMFSNEQLDLL 243

Query: 306 LNREPC---PDPYAPLIMGCDIAEE-----GGDNTVVVLRRGPVIEHLFD---------- 347
               PC   P+     + G D+A E     G   T V LRR   +  + +          
Sbjct: 244 KGNHPCQIYPEKGKVYVAGLDLAGEDSQTGGISPTTVNLRRDSSVLTIAELDYTFAKAPY 303

Query: 348 ------------WSKTDLRTTNNKISGLVEK-YRPDAIIIDANNTGARTCDYLEM-LGYH 393
                       W  T       K+  L+ K ++   + +DA   G     +L   LG  
Sbjct: 304 NLPQVRLVCHYSWQGTRHALLYEKLVELLGKVWKCRKVAVDATGLGQPVASFLRESLGSR 363

Query: 394 VYRVLGQKRAVDLEFCRNR-------RTELHVKMAD------WLEFASLINHSGLIQNLK 440
           +     Q  A       N        R +++           W E           Q + 
Sbjct: 364 ILPFAFQ-PASKSRLGFNLLSAVNSGRLKMYAANGSSEYTLFWQEMGLARADYRQSQQMN 422

Query: 441 SLKSFIVPNTG 451
               ++    G
Sbjct: 423 ---FYVETTRG 430


>gi|325849110|ref|ZP_08170602.1| phage terminase, large subunit, PBSX family [Anaerococcus
           hydrogenalis ACS-025-V-Sch4]
 gi|325480355|gb|EGC83418.1| phage terminase, large subunit, PBSX family [Anaerococcus
           hydrogenalis ACS-025-V-Sch4]
          Length = 439

 Score = 40.5 bits (93), Expect = 0.64,   Method: Composition-based stats.
 Identities = 19/173 (10%), Positives = 45/173 (26%), Gaps = 14/173 (8%)

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK----------PLDDWKRFQID 254
           D I          +    FW +  NP   +   Y                   +      
Sbjct: 149 DSIKEAFNRTAAAKRRKFFWDL--NPSSPNHFIYSDHIDKYQNMIDEGIDFGGYNYKHFT 206

Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP 314
                 I     + I  +Y  +S   + ++ G     +   +     I +    +  P  
Sbjct: 207 IDDNINISDQRKKEIKLQYDPNSVWYKRDILGLRVVAEGLIYKQFADIPDNYLIKEKPHE 266

Query: 315 YAPLIMGCDIAEEGGDNTVVV--LRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365
              + +G D       +  +   + RG    +     + +     +  + L++
Sbjct: 267 LQLIQIGVDFGGNNSKHAFICTGISRGFKKVYALRSERLEPDKPTDLYNQLID 319


>gi|193671687|ref|XP_001946103.1| PREDICTED: SWI/SNF-related matrix-associated actin-dependent
           regulator of chromatin subfamily A containing DEAD/H box
           1-like [Acyrthosiphon pisum]
          Length = 848

 Score = 40.5 bits (93), Expect = 0.67,   Method: Composition-based stats.
 Identities = 23/153 (15%), Positives = 48/153 (31%), Gaps = 15/153 (9%)

Query: 87  GIGKTTLNAWLVLWLMS----TRPGISVICLANSETQLKTTLWAEVSKW-LSLLPNKHWF 141
           G+GKT +     L  +     T P +  + +  + T      + E  +W  +++  K+  
Sbjct: 337 GLGKT-VQVIAFLAHLKETGRTHPDLPQLIVVPAST--LDNWYQEFKRWCPTMIVEKYHG 393

Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201
            M         W         G           + +   P+            I+ DEA 
Sbjct: 394 SMDERRYMRTKW------IRKGFGDVDVILTTYSCAANSPEEKKLFKTKEFHYIVYDEAH 447

Query: 202 GTPDVINLGILGFLTERNANRFWIMTSNPRRLS 234
              ++ +       +  N N   ++T  P + +
Sbjct: 448 KLKNMTSQTFE-VFSNFNGNYKILLTGTPLQNN 479


>gi|22855048|ref|NP_690654.1| hypothetical protein SPP1p003 [Bacillus phage SPP1]
 gi|1729903|sp|P54308|TERL_BPSPP RecName: Full=Large terminase protein; AltName: Full=DNA-packaging
           protein G2P; AltName: Full=Terminase large subunit
 gi|15466|emb|CAA39537.1| terminase [Bacillus phage SPP1]
 gi|2764840|emb|CAA66573.1| unnamed protein product [Bacillus phage SPP1]
          Length = 422

 Score = 40.5 bits (93), Expect = 0.69,   Method: Composition-based stats.
 Identities = 68/418 (16%), Positives = 136/418 (32%), Gaps = 48/418 (11%)

Query: 75  PEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC--LANSETQLKTTLWAEVSKWL 132
            +  K  +  GRG  K+T  A  ++ LM   P   ++   + N+  Q   +++ ++ + +
Sbjct: 24  AQHLKYVLKGGRGSAKSTHIAMWIILLMMMMPITFLVIRRVYNTVEQ---SVFEQLKEAI 80

Query: 133 SLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDT-FVGHHNTY 191
            +L   H +      +  +P     +     I  +    + +  S +       G     
Sbjct: 81  DMLEVGHLW-----KVSKSPLRLTYIPRGNSIIFRGGDDVQKIKSIKASKFPVAGMWIEE 135

Query: 192 GMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSN-PRRLSGKFYEIFNKPLDDWKR 250
                 +E       I   +L           +  + N P+R      ++FN        
Sbjct: 136 LAEFKTEEEVSV---IEKSVLRAELPPGCRYIFFYSYNPPKRKQSWVNKVFNSSFLPANT 192

Query: 251 FQIDT--RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNR 308
           F   +       +  +F E        +    R E  G+     +  F  L  IEE +  
Sbjct: 193 FVDHSTYLQNPFLSKAFIEEAEEVKRRNELKYRHEYLGEALGSGVVPFENLQ-IEEGIIT 251

Query: 309 EPCPDPYAPLIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLV 364
           +     +  +  G D    G D    V     +R   I  + +    D + +  + +  V
Sbjct: 252 DAEVARFDNIRQGLDFG-YGPDPLAFVRWHYDKRKNRIYAIDEL--VDHKVSLKRTADFV 308

Query: 365 EKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWL 424
            K + ++  I A+++  R+ D L+ L + + R+ G K+  D      R          WL
Sbjct: 309 RKNKYESARIIADSSEPRSIDALK-LEHGINRIEGAKKGPDSVEHGER----------WL 357

Query: 425 EFASLINHSGL-----IQNLKSLKSFIVPNTGEL-AIESKRVKGAKSTDYSDGLMYTF 476
           +    I    L      +  +++      N   +  +E K           D   Y F
Sbjct: 358 DELDAIVIDPLRTPNIAREFENIDYQTDKNGDPIPRLEDKDNHTI------DATRYAF 409


>gi|281416465|ref|YP_003347385.1| terminase large subunit [Enterococcus phage phiFL4A]
 gi|270209641|gb|ACZ64180.1| terminase large subunit [Enterococcus phage phiFL4A]
          Length = 418

 Score = 40.5 bits (93), Expect = 0.70,   Method: Composition-based stats.
 Identities = 46/323 (14%), Positives = 95/323 (29%), Gaps = 35/323 (10%)

Query: 86  RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQS 145
           RG  KTT  A  +  LM   P  ++I L  ++T     +  E+   ++ + +  +F+   
Sbjct: 52  RGSFKTTTLAIAIALLMVLFPNKNIIFLRKTDT---DVV--EIILQVAKVLSSKYFKTLV 106

Query: 146 LSLH--PAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
            +L+        +                 +        +  G H      +I D+    
Sbjct: 107 FALYGVELVLLKETTTEIDTNLKTSSRGTSQLLGMGIYASLTGKHAD---IVITDDIVNI 163

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI---FNKPLDDWKRF---QIDTRT 257
            D ++               +    N +   G+F      ++K     K     + D   
Sbjct: 164 KDRVSRA-----EREKTKLQYQELQNVKNRGGRFINTGTPWHKEDAISKMPNVKKFDCYE 218

Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317
              ID    + +  +  +   +       +        F      +   N       +  
Sbjct: 219 TGLIDKEQRKAL--QQAMTPSLFAANYELKHIADSESLFTAPTYTD-NTNLIYNGVAH-- 273

Query: 318 LIMGCDIAEEGGDNTVVVL----RRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAII 373
                D A  G D+T   +    + G +I +   W K  +     +I  L + Y+     
Sbjct: 274 ----IDAAYGGDDSTAFTIFKEQKDGTLIGYGKKWQKH-VDDCIPEILQLHQHYQAGTFY 328

Query: 374 IDANNTGARTCDYLEMLGYHVYR 396
            + N        +L   G +V +
Sbjct: 329 NETNGDKGYLAKHLIERGQYVQK 351


>gi|331229057|ref|XP_003327195.1| DNA repair protein rad16 [Puccinia graminis f. sp. tritici CRL
           75-36-700-3]
 gi|309306185|gb|EFP82776.1| DNA repair protein rad16 [Puccinia graminis f. sp. tritici CRL
           75-36-700-3]
          Length = 968

 Score = 40.5 bits (93), Expect = 0.72,   Method: Composition-based stats.
 Identities = 28/152 (18%), Positives = 46/152 (30%), Gaps = 11/152 (7%)

Query: 87  GIGKTTLNAWLVLWLMSTRPGIS--VICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEM 143
           G+GKT     L+L      PG     + +A +   +    W  E+ K+   L    W   
Sbjct: 398 GMGKTIQTIALIL--SDRVPGHRKQTLVIAPT---VAIMQWRNEIEKFAKGLTVNVWHGG 452

Query: 144 QSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVG--HHNTYGMAIINDEAS 201
              +        DV+  S  +    +      +  +          H      +I DEA 
Sbjct: 453 NRSNAQEEMENFDVVLTSFAVLESAFRRQNSGFRRKGQIIKESSLLHQINWHRVILDEAH 512

Query: 202 GTPDVINLGILGFLTERNANRFWIMTSNPRRL 233
              D       G   E  A   W ++  P + 
Sbjct: 513 NIKDRSCNTAKGAF-ELKATYRWCLSGTPLQN 543


>gi|289976633|gb|ADD21678.1| DNA maturase B [Caulobacter phage Cd1]
          Length = 602

 Score = 40.5 bits (93), Expect = 0.72,   Method: Composition-based stats.
 Identities = 33/219 (15%), Positives = 62/219 (28%), Gaps = 29/219 (13%)

Query: 35  FFPWGEKGTPLEGFSAPRSWQLEFMEVVDA------HCLNSVNNPNPEVFKGAISAGRGI 88
              W + G     + +   +  +  E +        H +       P      + A RG 
Sbjct: 11  LLRWEQVGLLQRHYESFHDFLDDAFEHLGFSASWVQHDIGGFLAHGPNSLM--VQAQRGQ 68

Query: 89  GKTTLNAWLVLWLMSTRPGISVICL------ANSETQLKTTLWAEVSKWLSLLPNKHWFE 142
            KTT+ A   +W +   P   V+ L      AN  + L       + K L  +       
Sbjct: 69  AKTTITAAFAVWTLIHNPKARVLILSAGGTQANEISTL-------IVKLLLTMDELECLR 121

Query: 143 MQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASG 202
             + +       +  +H +L    K  S  C   +        G      +A   + A  
Sbjct: 122 PDASNGDRTSVEAFDIHYTLKGVDKSPSVACSGITG----NLQGKRADLLIADDIESAKN 177

Query: 203 TPDVINLGILGFL----TERNANRFWIMTSNPRRLSGKF 237
           +   +    +  L    T        I    P+ +   +
Sbjct: 178 SATAMMREFIMNLTRDFTSICTEGRIIYLGTPQSMDSIY 216


>gi|330791351|ref|XP_003283757.1| hypothetical protein DICPUDRAFT_147464 [Dictyostelium purpureum]
 gi|325086380|gb|EGC39771.1| hypothetical protein DICPUDRAFT_147464 [Dictyostelium purpureum]
          Length = 1580

 Score = 40.5 bits (93), Expect = 0.75,   Method: Composition-based stats.
 Identities = 29/154 (18%), Positives = 44/154 (28%), Gaps = 16/154 (10%)

Query: 82   ISAGRGIGKTTLNAWLVLWLMSTR-----PGISVIC--LANSETQLK---TTLWAEVSKW 131
            +    G GKT   A +VL  M +      P    I     N+   L     +L  E S  
Sbjct: 1094 VRGPPGTGKTHFLALIVLIFMESYKRLGKPFRIAITSFTHNAIDNLLIRIASLKKEYSTS 1153

Query: 132  LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191
            +    N   F+ Q+            L      + +H+     ++S    D         
Sbjct: 1154 VGQDINFPLFKKQTKLSEDLKLNKIQLFDKKEFEREHFCVGATSWSLSNMD------YEN 1207

Query: 192  GMAIINDEASGTPDVINLGILGFLTERNANRFWI 225
               +I DEAS     I       L +        
Sbjct: 1208 FDLLIIDEASQLSSYIGAIPFSRLNKDTGRVIVC 1241


>gi|153955889|ref|YP_001396654.1| hypothetical protein CKL_3280 [Clostridium kluyveri DSM 555]
 gi|219856242|ref|YP_002473364.1| hypothetical protein CKR_2899 [Clostridium kluyveri NBRC 12016]
 gi|146348747|gb|EDK35283.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555]
 gi|219569966|dbj|BAH07950.1| hypothetical protein [Clostridium kluyveri NBRC 12016]
          Length = 450

 Score = 40.5 bits (93), Expect = 0.75,   Method: Composition-based stats.
 Identities = 44/284 (15%), Positives = 94/284 (33%), Gaps = 36/284 (12%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
            +  +E S         +LG L   + +   I+++NP       Y+ F K  +  K F +
Sbjct: 119 IVWIEECSEVKYEGFKELLGRLRHPSLSLHMILSTNPVSKDNWTYKHFFK-NEKKKTFIL 177

Query: 254 D---------------------TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD 292
           D                           +  S+ E +      D D+ R+   G+F   +
Sbjct: 178 DDEELYKKRIVVRNNTYYHHSLADDNLFLPKSYIEQLEELKTYDIDLYRIARKGRF-GIN 236

Query: 293 IDSFIPLNIIEEALN-REPCPDPYAPL-IMGCDIAEEGGDNTVVVLRRGPVIEHL-FDWS 349
               +P           +   +   P+  +G D   E   N ++ L      + L   W 
Sbjct: 237 GRRVLPQFEARPHYEVLQAIGNIKNPIKRVGFDFGFEDSYNALLRLAVDDKEKILYIYWE 296

Query: 350 KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409
               + T+++ +  + +++    +I A+    +T  Y    G+++ R      +      
Sbjct: 297 YYKNQMTDDRTAIEIAEFKSTQELIRADGAEPKTIKYFNQQGFNIRRAKKFPGSRLQNTK 356

Query: 410 RNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGEL 453
           + +R          +     IN    +++L     + V   GE+
Sbjct: 357 KVKR------FKKIICSEDCINTVDELKDLT----YAVDKNGEI 390


>gi|195498547|ref|XP_002096570.1| GE25739 [Drosophila yakuba]
 gi|194182671|gb|EDW96282.1| GE25739 [Drosophila yakuba]
          Length = 1495

 Score = 40.1 bits (92), Expect = 0.80,   Method: Composition-based stats.
 Identities = 28/200 (14%), Positives = 62/200 (31%), Gaps = 24/200 (12%)

Query: 16  FDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPR--SWQLEFMEVVDAH-CLNSVNN 72
            D+ W D+ +   +   +H     E+    +G   P       +  +V   H  +   N 
Sbjct: 1   MDVNWIDDDEDLVAALAMHEEQKTEEADGADGHPRPELSDESCDGFDVATGHNWIYPNNL 60

Query: 73  PNPEVFKGAISAG----------RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKT 122
           P     +  + +            G+GKT + A L+       P   ++ +A +   +  
Sbjct: 61  PLRSYQQTIVQSALFKNTLVVLPTGLGKTFIAAVLMFNFYRWYPKGKIVFMAPTRPLVSQ 120

Query: 123 TLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE--R 180
               ++     ++P      +Q     P P  +++         + +    +    +   
Sbjct: 121 ----QIHASQKIMPFPSADTVQLTGQLPRPKRAELWGSK-----RVFFATPQVVHSDMLE 171

Query: 181 PDTFVGHHNTYGMAIINDEA 200
            D            I+ DEA
Sbjct: 172 TDGGSTFPFESIKLIVVDEA 191


>gi|310792137|gb|EFQ27664.1| Sec63 Brl domain-containing protein [Glomerella graminicola M1.001]
          Length = 1974

 Score = 40.1 bits (92), Expect = 0.82,   Method: Composition-based stats.
 Identities = 24/118 (20%), Positives = 41/118 (34%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    RPG  V+ +A         L  E V  W + L      +
Sbjct: 1153 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAP-----MKALVRERVKDWGARLARPLGLK 1207

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    +    + I   + +  + R++         G+     + II DE
Sbjct: 1208 LVELTGDNTPDTRTIKDADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1258


>gi|226288385|gb|EEH43897.1| activating signal cointegrator 1 complex subunit 3 [Paracoccidioides
            brasiliensis Pb18]
          Length = 2011

 Score = 40.1 bits (92), Expect = 0.92,   Method: Composition-based stats.
 Identities = 24/118 (20%), Positives = 40/118 (33%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    RPG  V+ +A         L  E V  W   L      +
Sbjct: 1163 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAP-----MKALVRERVHDWKRRLTVPMGLK 1217

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    +    + I   + +  + R++         G+     + II DE
Sbjct: 1218 LVELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1268


>gi|295672069|ref|XP_002796581.1| activating signal cointegrator 1 complex subunit 3 [Paracoccidioides
            brasiliensis Pb01]
 gi|226283561|gb|EEH39127.1| activating signal cointegrator 1 complex subunit 3 [Paracoccidioides
            brasiliensis Pb01]
          Length = 2012

 Score = 40.1 bits (92), Expect = 0.92,   Method: Composition-based stats.
 Identities = 24/118 (20%), Positives = 40/118 (33%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    RPG  V+ +A         L  E V  W   L      +
Sbjct: 1163 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAP-----MKALVRERVHDWKRRLTVPMGLK 1217

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    +    + I   + +  + R++         G+     + II DE
Sbjct: 1218 LVELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1268


>gi|213402789|ref|XP_002172167.1| antiviral helicase SLH1 [Schizosaccharomyces japonicus yFS275]
 gi|212000214|gb|EEB05874.1| antiviral helicase SLH1 [Schizosaccharomyces japonicus yFS275]
          Length = 1949

 Score = 40.1 bits (92), Expect = 0.92,   Method: Composition-based stats.
 Identities = 28/155 (18%), Positives = 48/155 (30%), Gaps = 22/155 (14%)

Query: 55   QLEFMEVVDAHCLNSVNNPNPEVFKGA--------ISAGRGIGKTTLNAWLVLWLMSTRP 106
            Q   +E + A   +  N    + F           I A  G GKT        W     P
Sbjct: 1125 QNPVLEEICAKRFSFFNAVQSQFFHTVYHTPTNVFIGAPTGSGKTMAAELATWWAFREHP 1184

Query: 107  GISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI- 164
            G  V+ +A         L  E +  W + L       M  L+   +P    ++   + I 
Sbjct: 1185 GSKVVYIAP-----MKALVKERLKDWGARLVEPMHINMIELTGDTSPDSKTIMGADIIIT 1239

Query: 165  DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
              + +  + R +   +       +      +I DE
Sbjct: 1240 TPEKWDGITRNWRTRK-------YVQNVSLVIIDE 1267


>gi|209544598|ref|YP_002276827.1| hypothetical protein Gdia_2467 [Gluconacetobacter diazotrophicus
           PAl 5]
 gi|209532275|gb|ACI52212.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus
           PAl 5]
          Length = 491

 Score = 40.1 bits (92), Expect = 0.96,   Method: Composition-based stats.
 Identities = 39/199 (19%), Positives = 67/199 (33%), Gaps = 26/199 (13%)

Query: 87  GIGKTTLNAW-LVLWLMSTRPGISVI------CLANSETQLKTTLWAEVSKWLSLLPNKH 139
           G GK++   W +VL  +   PG   +       + NS  QL+ T    V +W   +    
Sbjct: 32  GSGKSSGCVWEMVLRGLKQAPGPDGVRRSRWAVIRNSYRQLEDTTIRTVHQWFPPMQFGR 91

Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
           W         P+     +   +   D K         + +RPD      +        +E
Sbjct: 92  W--------KPSEHSYTINRLAAQGDEKPAEIELLFRALDRPDQVGNLLSLELTGAWINE 143

Query: 200 ASGTPDVINLGILGFL----TERNANRFW---IMTSNPRRLSGKFYEIF----NKPLDDW 248
           A   P  +   + G +     +R+    W   IM +NP     ++Y+ F    +    + 
Sbjct: 144 AREVPWAVIEAVQGRVGRYPAKRDGGATWSGIIMDTNPPDAESEWYKFFEEKDHTDAVEA 203

Query: 249 KRFQIDTRTVEGIDPSFHE 267
               I   TVE     F +
Sbjct: 204 IAQVIPGMTVERYARIFKQ 222


>gi|329945026|ref|ZP_08292976.1| hypothetical protein HMPREF9056_00859 [Actinomyces sp. oral taxon
           170 str. F0386]
 gi|328529487|gb|EGF56391.1| hypothetical protein HMPREF9056_00859 [Actinomyces sp. oral taxon
           170 str. F0386]
          Length = 370

 Score = 40.1 bits (92), Expect = 0.98,   Method: Composition-based stats.
 Identities = 32/175 (18%), Positives = 52/175 (29%), Gaps = 12/175 (6%)

Query: 60  EVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQ 119
            V++    +  + P        I+  RG+GKT     L     S R    V+    +   
Sbjct: 21  RVIEEFLESLDDGPGAPGLLELITGARGVGKTV---MLTALGDSARERGWVVVDETAREG 77

Query: 120 LKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE 179
           L   L  E ++ LS L  K    + SLSL                    +    R  ++ 
Sbjct: 78  LMDRLATEFTRQLSQLAGKERSRLTSLSLSTPLGGGSATLEHAPAPEPSWRQKARALTQW 137

Query: 180 RPDTFVGHHNTYGMAIINDEASGTPDVINLGILGF---LTERNANRFWIMTSNPR 231
             +   G      + +  DE    P      +      L    A    +M   P+
Sbjct: 138 LAEHGTG------LLLTIDEVHAIPREELRALSAEVQHLIREGAPIGLLMAGLPK 186


>gi|302422104|ref|XP_003008882.1| DEAD/DEAH box helicase [Verticillium albo-atrum VaMs.102]
 gi|261352028|gb|EEY14456.1| DEAD/DEAH box helicase [Verticillium albo-atrum VaMs.102]
          Length = 1801

 Score = 39.7 bits (91), Expect = 1.0,   Method: Composition-based stats.
 Identities = 57/353 (16%), Positives = 108/353 (30%), Gaps = 53/353 (15%)

Query: 21   SDEIKLSFSNFVL-HFFPWGEKGTPLEGFSA----PRSWQLEFMEVVDAHCLNSVNNPNP 75
               I L+ + F L H  P+ E+    +        P +WQ + ++ +DA+    V  P  
Sbjct: 719  DLAIPLNLTEFQLEHSGPYMERNFDSKPDDRVTFDPDAWQRKVLDTIDANNSLMVVAPTS 778

Query: 76   EVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLL 135
                         GKT ++ + +  ++       ++ +A ++  +     AEV+   S  
Sbjct: 779  ------------AGKTFISFYAMKKILQANDDDVLVYVAPTKALVNQIA-AEVAARYSKS 825

Query: 136  PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS-EERPDTFVGHHNTYGMA 194
              +    + ++        +      L        TM    S  +RP  +          
Sbjct: 826  YTREGKSVWAIHTRDYRVNNPTGCQVLVTVPHVLQTMLLAPSNSDRPSAWA----RRVKR 881

Query: 195  IINDEASGTPDV----INLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKR 250
            II DE           +   +L       A    I  S     +G+F++        W  
Sbjct: 882  IIFDEVHCIGQAEDGIVWEQLLLL-----APCPIIALSATVGNAGEFHD--------W-- 926

Query: 251  FQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF-----IPLNIIEEA 305
                   V      +   ++      SD+ R  V     Q  +D       +P+  I++ 
Sbjct: 927  -----LAVSQAQKGYKMELVVHNARYSDL-RKFVYCPPKQLKMDVLAKQDQLPIPGIDQG 980

Query: 306  LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNN 358
              R P      P+    DI     D+  +  R    +    D  +T       
Sbjct: 981  EERNPRFFFTHPIAALLDINRGSLDDVSLEPRDCWTLWKCMDKHQTTDFPVAK 1033


>gi|85702762|ref|ZP_01033866.1| Putative large terminase [Roseovarius sp. 217]
 gi|85671690|gb|EAQ26547.1| Putative large terminase [Roseovarius sp. 217]
          Length = 419

 Score = 39.7 bits (91), Expect = 1.0,   Method: Composition-based stats.
 Identities = 43/270 (15%), Positives = 82/270 (30%), Gaps = 43/270 (15%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGIS---------VICLANSETQLKTT-LWAEVSKW 131
           I  GRG GKT   A    W+ +   G           +  +  +  Q++   ++ E S  
Sbjct: 27  IMGGRGAGKTRAGA---EWVRAQVEGSRPLDEGRCKRIALVGETIDQVREVMVFGE-SGI 82

Query: 132 LSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTY 191
           ++  P     + Q+                          + + YS   P+   G     
Sbjct: 83  MACSPPDRRPDWQATR---------------KRLIWPNGAVAQAYSAHDPEALRGPQFDG 127

Query: 192 GMAIINDEASGT--PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWK 249
                 DE +           +   L   +A R  +  +   R  G   +I   P     
Sbjct: 128 A---WVDELAKWKRARETWDMLQFGLRLGDAPR--VCVTTTPRNVGVLKDIVAVPSTV-V 181

Query: 250 RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309
                      +  SF + + ARY   + + R E+ G    +  D+     ++E A  R 
Sbjct: 182 TSAPTEANRAYLAESFLDEVRARY-AGTRLGRQELDGLLIDEAEDALWTPAMLEAA--RV 238

Query: 310 PCPDPYAPLIMGCDI---AEEGGDNTVVVL 336
                +  +++  D       G D   +++
Sbjct: 239 ESLPEFDRVVVAVDPPVTGHAGSDECGIIM 268


>gi|83310928|ref|YP_421192.1| protein-tyrosine-phosphatase [Magnetospirillum magneticum AMB-1]
 gi|82945769|dbj|BAE50633.1| Protein-tyrosine-phosphatase [Magnetospirillum magneticum AMB-1]
          Length = 152

 Score = 39.7 bits (91), Expect = 1.1,   Method: Composition-based stats.
 Identities = 32/155 (20%), Positives = 56/155 (36%), Gaps = 16/155 (10%)

Query: 215 LTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYG 274
           +TE   N   + T N  R       I +     W+ +   +R V  ++P   E ++A  G
Sbjct: 1   MTESTINVLVLCTGNSARSVLGEALINHLGGAKWRAYSAGSRPVGRVNPLSLE-VLAEKG 59

Query: 275 LDSDVTRVEVCGQFPQQDI-DSFIPLNIIEEALNREPCPDPYAP--LIMGC-DIAEEGGD 330
           L +   R +   +F   D     + + + + A        P  P  L MG  D A+    
Sbjct: 60  LPTAGYRSKSWDEFAAADAPRMDLVITVCDNAAGEVCPVWPGHPSKLHMGFPDPADA--- 116

Query: 331 NTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365
                  +G   E L ++ K        K+  L++
Sbjct: 117 -------KGSHEEQLAEFRKVYAMIEA-KVRRLIQ 143


>gi|288554856|ref|YP_003426791.1| Tex transcription access, protein (S1 RNA binding) [Bacillus
           pseudofirmus OF4]
 gi|288546016|gb|ADC49899.1| Tex transcription access, protein (S1 RNA binding) [Bacillus
           pseudofirmus OF4]
          Length = 726

 Score = 39.7 bits (91), Expect = 1.1,   Method: Composition-based stats.
 Identities = 24/144 (16%), Positives = 52/144 (36%), Gaps = 12/144 (8%)

Query: 246 DDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA 305
             +     D+   E +  +  +G      ++  + R E+  +   +  +  I +   E  
Sbjct: 255 KRYMSRAGDSPAAEYVKLAIQDGYKRL--IEPSIER-EIRNELTAKAEEQAIHIF-SENL 310

Query: 306 LNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDW---SKTDLRTTNNKISG 362
            N    P     +++G D A   G    VV   G V++    +    + ++     K+  
Sbjct: 311 RNLLLQPPIKDKVVLGVDPAYRTGCKLAVVDGTGKVLDIGVVYPTPPRNEVEKAAAKVKQ 370

Query: 363 LVEKYRPDAIIIDANNTGARTCDY 386
           LV++++ + I I     G  T   
Sbjct: 371 LVKEHKVEMIAI-----GNGTASR 389


>gi|289167314|ref|YP_003445583.1| terminase large subunit [Streptococcus mitis B6]
 gi|288906881|emb|CBJ21715.1| terminase large subunit [Streptococcus mitis B6]
          Length = 418

 Score = 39.7 bits (91), Expect = 1.2,   Method: Composition-based stats.
 Identities = 45/353 (12%), Positives = 104/353 (29%), Gaps = 42/353 (11%)

Query: 74  NPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLS 133
           +P++       GRG GK++     ++  +  R  ++ +C+  ++  L+ +++ ++   +S
Sbjct: 22  DPKILHVVEKGGRGSGKSSDLGHTII-QLIMRYPVNAVCIRKTDNTLEQSVYEQLKWAIS 80

Query: 134 LLPNKHWFEMQ--------SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
                H F++             +   +        +             +     +   
Sbjct: 81  EQGVSHLFKINKSPLKITYIPRGNYIIFRGAQDPERIKSLKDSRFPFAIGW----IEELA 136

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF----YEIF 241
                       DE     + +  G L    +      +  + NP +    +    YE  
Sbjct: 137 EFKTE-------DEVKTITNSLLRGEL----DDGLFYKFFYSYNPPKRKQSWVNKKYESV 185

Query: 242 NKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNI 301
            +P  +             I  +F E   A         R E  G+     +    P   
Sbjct: 186 IQP-PNTHVHHSTYLDNPYISQAFIEEAEATRERSEKRYRWEYLGEAIGSGVA---PFEN 241

Query: 302 IEEALNREPCPDPYAPLIMGCDIAEEGGDNTVV---VLRRGPVIEHLFDWSKT--DLRTT 356
           +      +     +  +  G D          V     ++  VI  + +        R  
Sbjct: 242 LVFRKITDEEIARFDNIRQGNDFGYANDPLAFVRWHYDKKKRVIYAIDEIYGVKISNREL 301

Query: 357 NNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409
             +I    + Y+   I  D+     ++ D L++   ++  V G K+  D    
Sbjct: 302 AERIRE--KGYQSQMITCDS--AEPKSIDELKLQ-LNIPLVQGAKKGPDSREY 349


>gi|269986940|gb|EEZ93216.1| type III restriction protein res subunit [Candidatus Parvarchaeum
           acidiphilum ARMAN-4]
          Length = 508

 Score = 39.7 bits (91), Expect = 1.2,   Method: Composition-based stats.
 Identities = 12/60 (20%), Positives = 25/60 (41%)

Query: 58  FMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE 117
           F E ++     +      +     +    G+GKT ++A L  + +   P   V+ LA ++
Sbjct: 4   FKEEIENREYQTKIFETAKTGNTLVVLPTGLGKTIISAMLANYRLEKYPSSKVLFLAPTK 63


>gi|302412431|ref|XP_003004048.1| ATP-dependent DNA helicase MER3 [Verticillium albo-atrum VaMs.102]
 gi|261356624|gb|EEY19052.1| ATP-dependent DNA helicase MER3 [Verticillium albo-atrum VaMs.102]
          Length = 709

 Score = 39.7 bits (91), Expect = 1.3,   Method: Composition-based stats.
 Identities = 24/118 (20%), Positives = 41/118 (34%), Gaps = 14/118 (11%)

Query: 84  AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
           +  G GKT      + W    RPG  V+ +A         L  E V  W + L      +
Sbjct: 279 SPTGSGKTVAAELAMWWAFKERPGSKVVYIAP-----MKALVRERVKDWGARLAKPLGLK 333

Query: 143 MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
           +  L+    P    +    + I   + +  + R++         G+     + II DE
Sbjct: 334 LVELTGDNTPDTRTIKDADVIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 384


>gi|188580687|ref|YP_001924132.1| hypothetical protein Mpop_1430 [Methylobacterium populi BJ001]
 gi|179344185|gb|ACB79597.1| protein of unknown function DUF264 [Methylobacterium populi BJ001]
          Length = 421

 Score = 39.7 bits (91), Expect = 1.3,   Method: Composition-based stats.
 Identities = 61/357 (17%), Positives = 108/357 (30%), Gaps = 52/357 (14%)

Query: 56  LEFMEVVDAHCLNSVNNPNPEVFKG-AISAGRGIGKTTLNA-WLVLWLMSTRPGISVICL 113
           L  +E    H       P P  +   A+  GRG GKT   A W+             +  
Sbjct: 9   LRLLEADWLHRARHDQLPPPGDWTTWAVIGGRGSGKTRTGAEWV---RGLAHGDP--VFT 63

Query: 114 ANSETQLKTT--LWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYST 171
           A +  ++      +A+V   +   P+     +  L   P  W         G        
Sbjct: 64  AEAVGRIALVGETFADVRDVMIEGPSG-LLALPRLGGPPPVWQPSRRRVMFGN-----GA 117

Query: 172 MCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTS-NP 230
           +   +S E PD+  G       A  +DE +                 +  +F +    +P
Sbjct: 118 VALAFSAEEPDSLRG---PQFGAAWSDEVAK-----WREAEAA---YDMIQFGLRLGAHP 166

Query: 231 RRLSGKFYEIFNKPLDDWKRFQIDTRTV----------EGIDPSFHEGIIARYGLDSDVT 280
           R L         +P+   +R   D RTV          + + P F E ++ RY   + + 
Sbjct: 167 RGLVT----TTPRPVPLIRRLLADPRTVVTRSRTADNAQNLAPRFLEEVVGRY-AGTRIG 221

Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDI---AEEGGDNT----- 332
           R E+ G+  +   D+    + IE    R     P   + +  D    +  G D       
Sbjct: 222 RQELDGELIEDRPDALWTRDGIER--TRIHAAPPLQRIAVAVDPPASSRAGADACGIVAA 279

Query: 333 VVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEM 389
            +       +       +            L  + + D ++ + N  G      L  
Sbjct: 280 GIAADGTAYVLADATLERAAPAAWAQGALALYHRLKADVLVAEVNQGGEMVVAVLAE 336


>gi|15618661|ref|NP_224947.1| exodeoxyribonuclease V, Alpha [Chlamydophila pneumoniae CWL029]
 gi|4377059|gb|AAD18890.1| Exodeoxyribonuclease V, Alpha [Chlamydophila pneumoniae CWL029]
          Length = 493

 Score = 39.7 bits (91), Expect = 1.3,   Method: Composition-based stats.
 Identities = 41/212 (19%), Positives = 71/212 (33%), Gaps = 28/212 (13%)

Query: 62  VDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE---T 118
           + +   N + N   +     +S G G GKT L A L+L L+  +P + +  ++ +    +
Sbjct: 132 ILSEEQNFIFNKITQGCFSIVSGGPGTGKTFLAAQLILSLVKQQPKLRIAIVSPTGKATS 191

Query: 119 QLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS 177
            ++  L    +   + L+   H F  +        +        L +D     T    YS
Sbjct: 192 HIRQILMKYNIFDDMVLMQTVHHFLQEY------AYRRYNSIDVLLVDEGSMVTFDLLYS 245

Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP-RRLSGK 236
             +     G+     +         T  +I LG    L         I   NP + L G 
Sbjct: 246 LVQT--LQGYEKDKKLY--------TSSLIILGDTNQL-----PPIGIGVGNPLQDLIGY 290

Query: 237 FYEI--FNKPLDDWKRFQIDTRTVEGIDPSFH 266
           F+E   F K     K   +D  T   +     
Sbjct: 291 FHENTFFLKTSHRAKTGVVDQLTQSVLRGEMI 322


>gi|15836285|ref|NP_300809.1| exodeoxyribonuclease V, alpha [Chlamydophila pneumoniae J138]
 gi|16752288|ref|NP_445657.1| exodeoxyribonuclease V, alpha subunit, putative [Chlamydophila
           pneumoniae AR39]
 gi|33242111|ref|NP_877052.1| exonuclease V alpha-subunit [Chlamydophila pneumoniae TW-183]
 gi|7190033|gb|AAF38887.1| exodeoxyribonuclease V, alpha subunit, putative [Chlamydophila
           pneumoniae AR39]
 gi|8979125|dbj|BAA98960.1| exodeoxyribonuclease V, alpha [Chlamydophila pneumoniae J138]
 gi|33236621|gb|AAP98709.1| exonuclease V alpha-subunit [Chlamydophila pneumoniae TW-183]
          Length = 493

 Score = 39.7 bits (91), Expect = 1.3,   Method: Composition-based stats.
 Identities = 41/212 (19%), Positives = 71/212 (33%), Gaps = 28/212 (13%)

Query: 62  VDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE---T 118
           + +   N + N   +     +S G G GKT L A L+L L+  +P + +  ++ +    +
Sbjct: 132 ILSEEQNFIFNKITQGCFSIVSGGPGTGKTFLAAQLILSLVKQQPKLRIAIVSPTGKATS 191

Query: 119 QLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS 177
            ++  L    +   + L+   H F  +        +        L +D     T    YS
Sbjct: 192 HIRQILMKYNIFDDMVLMQTVHHFLQEY------AYRRYNSIDVLLVDEGSMVTFDLLYS 245

Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP-RRLSGK 236
             +     G+     +         T  +I LG    L         I   NP + L G 
Sbjct: 246 LVQT--LQGYEKDKKLY--------TSSLIILGDTNQL-----PPIGIGVGNPLQDLIGY 290

Query: 237 FYEI--FNKPLDDWKRFQIDTRTVEGIDPSFH 266
           F+E   F K     K   +D  T   +     
Sbjct: 291 FHENTFFLKTSHRAKTGVVDQLTQSVLRGEMI 322


>gi|226945807|ref|YP_002800880.1| phage P2 terminase ATPase subunit, gpP-like protein [Azotobacter
           vinelandii DJ]
 gi|226720734|gb|ACO79905.1| Phage P2 terminase ATPase subunit, gpP-like protein [Azotobacter
           vinelandii DJ]
          Length = 585

 Score = 39.7 bits (91), Expect = 1.3,   Method: Composition-based stats.
 Identities = 30/162 (18%), Positives = 50/162 (30%), Gaps = 22/162 (13%)

Query: 246 DDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPL----- 299
             W++   I      G D    E +   Y  +++     +  +F      S  PL     
Sbjct: 326 KIWRQIVTILDAERRGCDLFDLEELRFEY--NAEQFANLLMCEFVDDGA-SIFPLAMLQP 382

Query: 300 ----NIIEEALNREP---CPDPYAPLIMGCDIAEEGGDNTVVV----LRRGPVIEHL--F 346
               + +E A + +P    P     + +G D AE G    +VV    L  G     L   
Sbjct: 383 CQVDSWVEWAEDFKPFAARPFGDRQVWVGYDPAETGDSAGLVVVAPPLVPGGKFRVLERH 442

Query: 347 DWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
            +   D       I  +  +Y    I +D    G      + 
Sbjct: 443 QFRGMDFAAQAEFIRQVTRRYWVTYIGLDTTGMGTGVAQLVR 484


>gi|213423446|ref|ZP_03356429.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Typhi str. E01-6750]
          Length = 72

 Score = 39.7 bits (91), Expect = 1.3,   Method: Composition-based stats.
 Identities = 11/48 (22%), Positives = 17/48 (35%)

Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
            I     W   D R   +    L E+Y    I ID+   G    + ++
Sbjct: 10  RILERHQWRGMDFRAQADANKKLTEQYNVTYIGIDSTGVGHGVYENVK 57


>gi|209964492|ref|YP_002297407.1| EAL domain proteni [Rhodospirillum centenum SW]
 gi|209957958|gb|ACI98594.1| EAL domain proteni [Rhodospirillum centenum SW]
          Length = 587

 Score = 39.7 bits (91), Expect = 1.3,   Method: Composition-based stats.
 Identities = 16/81 (19%), Positives = 28/81 (34%), Gaps = 2/81 (2%)

Query: 304 EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGL 363
            A  +       A L +  D   EG    V  + +   +  + + S+ D      ++   
Sbjct: 82  AATFKRLVGTAAAKLFLNVDPRLEGAVPLVTAIGQRYGVPIVHEISELDTTAVGERLEAA 141

Query: 364 VEKYRP--DAIIIDANNTGAR 382
           VE+YR     I +D    G  
Sbjct: 142 VEQYRRRDIGIALDDFGVGFG 162


>gi|213027809|ref|ZP_03342256.1| terminase, ATPase subunit [Salmonella enterica subsp. enterica
           serovar Typhi str. 404ty]
          Length = 141

 Score = 39.7 bits (91), Expect = 1.3,   Method: Composition-based stats.
 Identities = 10/40 (25%), Positives = 16/40 (40%)

Query: 349 SKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
              D R   + I  L E+Y    I ID+   G    + ++
Sbjct: 1   RGMDFRAQADAIKKLTEQYNVTYIGIDSTGVGHGVYENVK 40


>gi|54025903|ref|YP_120145.1| putative phage terminase [Nocardia farcinica IFM 10152]
 gi|54017411|dbj|BAD58781.1| putative phage terminase [Nocardia farcinica IFM 10152]
          Length = 436

 Score = 39.7 bits (91), Expect = 1.3,   Method: Composition-based stats.
 Identities = 23/171 (13%), Positives = 49/171 (28%), Gaps = 9/171 (5%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG----KFYEIFNKPLDDWK 249
             + DEA+  P+     +   L+   A    + T+NP          F +   +      
Sbjct: 125 LAMVDEATLLPENFWTQLGARLSVPGAK--LLATTNPDNPQHYLKVNFIDRAGERGMRLC 182

Query: 250 RFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNRE 309
            +        G+D  +   + A            + GQ+   D   +   +  +  +   
Sbjct: 183 AWDFTLDDNPGLDDEYVASLKAE--NQGLFYLRNILGQWVAADGAVYDCYDPAKHLVKWS 240

Query: 310 PCPDPYAPLIMGCDIAEEGGDNTV-VVLRRGPVIEHLFDWSKTDLRTTNNK 359
             P+    + +G D         V + L    V+  + +W           
Sbjct: 241 ELPEMQFYVGVGVDHGTTNPTAAVLIGLGADNVLYAVDEWRYAPSNKEARW 291


>gi|227496997|ref|ZP_03927248.1| phage Terminase [Actinomyces urogenitalis DSM 15434]
 gi|226833491|gb|EEH65874.1| phage Terminase [Actinomyces urogenitalis DSM 15434]
          Length = 480

 Score = 39.7 bits (91), Expect = 1.3,   Method: Composition-based stats.
 Identities = 35/246 (14%), Positives = 69/246 (28%), Gaps = 25/246 (10%)

Query: 194 AIINDEASGTPDVINLGILGF-LTERNANRFWIMTSNPRRLSGKFYEIFNKPL------- 245
            I+ DEA    D     +         AN   I T  P        E+F +         
Sbjct: 169 IIVLDEAQDLTDEALEALRSTNAAGPQANPQIIYTGTPPSPKNDG-EVFTRFRSGALSGT 227

Query: 246 ------DDWKRFQ----IDTRTVEGIDPSFHEGIIARYGLD------SDVTRVEVCGQFP 289
                  +W         D  T+   +P++   + A+   D       +    E  G + 
Sbjct: 228 TASTCWHEWSAAPDADLDDETTIAQANPAYQIRLSAKTVADEREDISEEGFARERLGMWD 287

Query: 290 QQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWS 349
           +    + I         +     +    L +        G   V   R+          +
Sbjct: 288 EVSTSAVIDQATWLRCADMASQVNDRLALAVDVQPDRTSGSVAVAGQRKDGRWHIEVIDN 347

Query: 350 KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFC 409
           + ++     +++G+  + R   ++ID     A   + L+  G  V      + A      
Sbjct: 348 RNNVGWILQRVAGIWARQRIRTVVIDRRGPAASLIEPLQQKGIKVTTTDAAQMAASCGAF 407

Query: 410 RNRRTE 415
            +   E
Sbjct: 408 YDAVME 413


>gi|125552219|gb|EAY97928.1| hypothetical protein OsI_19844 [Oryza sativa Indica Group]
          Length = 1367

 Score = 39.7 bits (91), Expect = 1.3,   Method: Composition-based stats.
 Identities = 20/92 (21%), Positives = 31/92 (33%), Gaps = 15/92 (16%)

Query: 55  QLEFMEVVDAHCLNSV-NNPNPEVFKGAISAG----R--GIGKTTLNAWLVLWLMSTRPG 107
           Q E  E +  + +  +  N      K  +  G       G GKT L    +   M   P 
Sbjct: 782 QREAFEFMWTNLVGDIRLNEIKHGAKPDVVGGCVICHAPGTGKTRLAIVFIQTYMKVFPD 841

Query: 108 ISVICLANSETQLKTTLWA---EVSKWLSLLP 136
              + +A      +  L+A   E  KW   +P
Sbjct: 842 CRPVIIAP-----RGMLFAWEQEFKKWNVNVP 868


>gi|269302541|gb|ACZ32641.1| putative exodeoxyribonuclease V, alpha subunit [Chlamydophila
           pneumoniae LPCoLN]
          Length = 493

 Score = 39.3 bits (90), Expect = 1.4,   Method: Composition-based stats.
 Identities = 41/212 (19%), Positives = 71/212 (33%), Gaps = 28/212 (13%)

Query: 62  VDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE---T 118
           + +   N + N   +     +S G G GKT L A L+L L+  +P + +  ++ +    +
Sbjct: 132 ILSEEQNFIFNKITQGCFSIVSGGPGTGKTFLAAQLILSLVKQQPKLRIAIVSPTGKATS 191

Query: 119 QLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS 177
            ++  L    +   + L+   H F  +        +        L +D     T    YS
Sbjct: 192 HIRQILMKYNIFDDMVLMQTVHHFLQEY------AYRRYNSIDVLLVDEGSMVTFDLLYS 245

Query: 178 EERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNP-RRLSGK 236
             +     G+     +         T  +I LG    L         I   NP + L G 
Sbjct: 246 LVQT--LQGYEKDKKLY--------TSSLIILGDTNQL-----PPIGIGVGNPLQDLIGY 290

Query: 237 FYEI--FNKPLDDWKRFQIDTRTVEGIDPSFH 266
           F+E   F K     K   +D  T   +     
Sbjct: 291 FHENTFFLKTSHRAKTGAVDQLTQSVLRGEMI 322


>gi|222631484|gb|EEE63616.1| hypothetical protein OsJ_18433 [Oryza sativa Japonica Group]
          Length = 1364

 Score = 39.3 bits (90), Expect = 1.4,   Method: Composition-based stats.
 Identities = 20/92 (21%), Positives = 31/92 (33%), Gaps = 15/92 (16%)

Query: 55  QLEFMEVVDAHCLNSV-NNPNPEVFKGAISAG----R--GIGKTTLNAWLVLWLMSTRPG 107
           Q E  E +  + +  +  N      K  +  G       G GKT L    +   M   P 
Sbjct: 779 QREAFEFMWTNLVGDIRLNEIKHGAKPDVVGGCVICHAPGTGKTRLAIVFIQTYMKVFPD 838

Query: 108 ISVICLANSETQLKTTLWA---EVSKWLSLLP 136
              + +A      +  L+A   E  KW   +P
Sbjct: 839 CRPVIIAP-----RGMLFAWEQEFKKWNVNVP 865


>gi|300922509|ref|ZP_07138621.1| phage terminase large subunit [Escherichia coli MS 182-1]
 gi|300421167|gb|EFK04478.1| phage terminase large subunit [Escherichia coli MS 182-1]
          Length = 240

 Score = 39.3 bits (90), Expect = 1.4,   Method: Composition-based stats.
 Identities = 29/243 (11%), Positives = 69/243 (28%), Gaps = 22/243 (9%)

Query: 67  LNSVNNPNPEVFKGAI-SAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
           +N +  P  E  +  +   GRG GK+        W +       ++  A     ++    
Sbjct: 4   INPIFEPFIEAHRYKVAKGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCA 50

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
            E+   +S    +   +      + A +            +  +       +  +  +  
Sbjct: 51  RELQNSISDSVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLE 110

Query: 186 GHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFN-KP 244
           G           +EA          ++  + +  +   W+   NP+ +    Y+ F   P
Sbjct: 111 GID-----ICWVEEAEAVTKESWDILIPTIRKPFSE-IWVSF-NPKNILDDTYQRFVVNP 163

Query: 245 LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE 304
            DD     ++              +      +  + R    G+       + I    +E 
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 305 ALN 307
           A +
Sbjct: 224 ATD 226


>gi|293401139|ref|ZP_06645283.1| SNF2 domain protein [Erysipelotrichaceae bacterium 5_2_54FAA]
 gi|291305265|gb|EFE46510.1| SNF2 domain protein [Erysipelotrichaceae bacterium 5_2_54FAA]
          Length = 447

 Score = 39.3 bits (90), Expect = 1.4,   Method: Composition-based stats.
 Identities = 28/160 (17%), Positives = 54/160 (33%), Gaps = 30/160 (18%)

Query: 50  APRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKT--TLNAWLVLWLMSTRPG 107
           +P ++Q   ++ ++ H + +V                G+GKT   L A   L   S    
Sbjct: 4   SPHNYQSYAIDYIETHPVAAVLLDM------------GLGKTVIFLTAIADLLFDS-FEA 50

Query: 108 ISVICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDS 166
             ++ +A     +    W  E+SKW  L    +   + ++    A   +      +  ++
Sbjct: 51  HRILVVAPLR--VARDTWPAEISKWQHLKHLTYAVAVGTVKERKAALSAGADITIINREN 108

Query: 167 KHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDV 206
             +                G+   Y M II DE S   + 
Sbjct: 109 LGWLIDS-----------SGYEFDYDMVII-DELSSFKNH 136


>gi|227499654|ref|ZP_03929757.1| PbsX family phage terminase, large subunit [Anaerococcus tetradius
           ATCC 35098]
 gi|227218251|gb|EEI83510.1| PbsX family phage terminase, large subunit [Anaerococcus tetradius
           ATCC 35098]
          Length = 439

 Score = 39.3 bits (90), Expect = 1.4,   Method: Composition-based stats.
 Identities = 22/184 (11%), Positives = 45/184 (24%), Gaps = 36/184 (19%)

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK----------PLDDWKRFQID 254
           D I          +    FW +  NP   +   Y                   +      
Sbjct: 149 DSIKEAFNRTAAAKRRKFFWDL--NPSSPNHFIYADHIDKYQNMIDEGIDFGGYNYKHFT 206

Query: 255 TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCG-----------QFPQQDIDSFIPLNIIE 303
                 I     + I  +Y  +S   + ++ G           QF     D  I      
Sbjct: 207 IDDNINISDQRKKEIKLQYDPNSVWYKRDILGLRVVAEGLIYKQFADNPDDYLI------ 260

Query: 304 EALNREPCPDPYAPLIMGCDIAEEGGDNTVVV--LRRGPVIEHLFDWSKTDLRTTNNKIS 361
                +  P     + +G D       +  +   + RG    +     + +     +  +
Sbjct: 261 -----KEKPHELQMIQIGVDFGGNNSKHAFICCGISRGFKKVYALRSERLEPDKPTDLYN 315

Query: 362 GLVE 365
            L++
Sbjct: 316 QLID 319


>gi|170764163|ref|ZP_02633320.2| phage terminase, large subunit, pbsx family [Clostridium
           perfringens E str. JGS1987]
 gi|170661287|gb|EDT13970.1| phage terminase, large subunit, pbsx family [Clostridium
           perfringens E str. JGS1987]
          Length = 441

 Score = 39.3 bits (90), Expect = 1.5,   Method: Composition-based stats.
 Identities = 23/189 (12%), Positives = 51/189 (26%), Gaps = 33/189 (17%)

Query: 204 PDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD----------WKRFQI 253
           PD I       +       FW +  NP   +   Y+ +                +  +  
Sbjct: 145 PDSIKEAFNRTIAAHKRKVFWDL--NPDNPNAFIYKDYIDNYKSKYENGELKGGYNYYHF 202

Query: 254 DTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPD 313
                  I     E I ++Y  +S   + ++ G+         +   +I       P   
Sbjct: 203 TIDDNINISDERKEEIKSQYDKNSIWYQRDILGKR-------CVAEGLIYRRFANNPNSY 255

Query: 314 PYAP--------LIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVE 365
                       +++G D    G  +  +           F + K  + ++       ++
Sbjct: 256 RAEESDVSNLMKIVIGVDFGGTGSGHAFIA------SAITFGYKKVIILSSERHFGDDID 309

Query: 366 KYRPDAIII 374
             +   I I
Sbjct: 310 SEKLGKIFI 318


>gi|284018161|sp|A3GH78|MPH1_PICST RecName: Full=ATP-dependent DNA helicase MPH1
          Length = 1050

 Score = 39.3 bits (90), Expect = 1.5,   Method: Composition-based stats.
 Identities = 10/44 (22%), Positives = 23/44 (52%), Gaps = 4/44 (9%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE----TQLK 121
           ++   G+GKT + + ++L  +   P   +I +A ++     Q+K
Sbjct: 106 VALPTGLGKTFIASTVMLNFLRWFPESKMIFVAPTKPLVAQQIK 149


>gi|121602586|ref|YP_988560.1| PBSX family phage terminase large subunit [Bartonella bacilliformis
           KC583]
 gi|120614763|gb|ABM45364.1| putative phage terminase, large subunit, PBSX family [Bartonella
           bacilliformis KC583]
          Length = 402

 Score = 39.3 bits (90), Expect = 1.5,   Method: Composition-based stats.
 Identities = 31/194 (15%), Positives = 65/194 (33%), Gaps = 11/194 (5%)

Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNAN--RFWIMTSNPRRLSGKFYEIFN-KPLDD 247
             +    DEA    +     ++  L E   +      +T NP R +    + F      +
Sbjct: 83  RILLCWVDEAEPVTETAWQTLIPTLREEGQDWHSELWVTWNPLRENAPVEKRFRLTKDPN 142

Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSF---IPLNIIEE 304
            K  +I+ R         +   +A      +       G + Q    ++   + L+  +E
Sbjct: 143 IKGVEINWRDNPQFPDKLNRDRLADLHQRPEQYGHIWEGDYLQAVQGAYYQKLLLDAEQE 202

Query: 305 ALNREPCPDPYAPLIMGCDIAEEG--GDNTVVVLRR--GPVIEHLFDWSKTDLRTTNNKI 360
                   DP   + +  DI   G   D T + + +  G  I  L D+ +   +  +  I
Sbjct: 203 GRIAHVSRDPLIQIKIFWDIGGTGAKADATALWVAQFIGREIRIL-DYYEAQGQPLSEHI 261

Query: 361 SGLVEKYRPDAIII 374
             +  +    A+++
Sbjct: 262 GWICHRGYDKALMV 275


>gi|328870919|gb|EGG19291.1| DEAD/DEAH box helicase [Dictyostelium fasciculatum]
          Length = 2224

 Score = 39.3 bits (90), Expect = 1.6,   Method: Composition-based stats.
 Identities = 42/288 (14%), Positives = 80/288 (27%), Gaps = 38/288 (13%)

Query: 66   CLNSVNNPNPEVFKGA-----ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSET-- 118
              N V                ++A    GKT      VL  +   P    + +A  E+  
Sbjct: 1386 YFNPVQTQVFSSLYTTDENVFVAAPANTGKTVCAELAVLRTLINNPEARCVYIAPVESMV 1445

Query: 119  QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178
             +++  WA   K+             +++ +     S ++  +    ++ +  + R + +
Sbjct: 1446 TVRSRDWAY--KFGQKFGKVSVLTGDAVTDNKILEASRIIVTT----AERWDILSRKWRQ 1499

Query: 179  ERPDTFVGHHNTYGMAIINDE----ASGTPDVINLGILGFL----TERNANRFWIMTSNP 230
            +                I DE     SG        +L  +    T+  +   +I  S+P
Sbjct: 1500 KNSRV------QSVSLFIVDELQMIGSGESGSTMEIVLSRMRYIATQTGSPIRFIGLSSP 1553

Query: 231  RRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQ 290
               +        + L +W      T      D    E  I   G D    +         
Sbjct: 1554 VANA--------RDLAEWMGATPATMFNFHPDVRPVEMEIQMQGFDYPNFQERQMAM--- 1602

Query: 291  QDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRR 338
                 +   ++   A      P   A   M  DI         +  RR
Sbjct: 1603 TKPALYAVSHMDRTAQTLVYVPTRKAARQMAADIILFVDSEDDMNTRR 1650


>gi|308198038|ref|XP_001387028.2| predicted protein [Scheffersomyces stipitis CBS 6054]
 gi|149389001|gb|EAZ63005.2| predicted protein [Pichia stipitis CBS 6054]
          Length = 941

 Score = 39.3 bits (90), Expect = 1.6,   Method: Composition-based stats.
 Identities = 10/44 (22%), Positives = 23/44 (52%), Gaps = 4/44 (9%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE----TQLK 121
           ++   G+GKT + + ++L  +   P   +I +A ++     Q+K
Sbjct: 42  VALPTGLGKTFIASTVMLNFLRWFPESKMIFVAPTKPLVAQQIK 85


>gi|220915119|ref|YP_002490424.1| hypothetical protein Mnod_7767 [Methylobacterium nodulans ORS 2060]
 gi|219952973|gb|ACL63358.1| hypothetical protein Mnod_7767 [Methylobacterium nodulans ORS 2060]
          Length = 846

 Score = 39.3 bits (90), Expect = 1.6,   Method: Composition-based stats.
 Identities = 34/215 (15%), Positives = 67/215 (31%), Gaps = 36/215 (16%)

Query: 124 LWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDT 183
           L+A +  WL+    +   +  +  +   P  S     +     +         S ERP+ 
Sbjct: 513 LFAWLMNWLAHAAQRPHEKPGTAPIFKGPQGS--GKTTFTNLLRAIFHPAHVVSAERPEA 570

Query: 184 FVGHHNTYG---MAIINDEASGTPDV-INLGILGFLTERNANR--------------FWI 225
            +G HN +    + ++ DEA    D   N  +   +T+                    ++
Sbjct: 571 LLGKHNAHLREALFVMADEAVFAGDPAANNRLKAMVTDATLTIEPKGIDAVSVPSFHRFV 630

Query: 226 MTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVC 285
           MTSN   +     +        W  F +    V  +  ++   + A    ++   R  + 
Sbjct: 631 MTSNEDHVIRAEADA-----RRWAVFDVSGEQVGNV--AYFRELYAVLKPETPEVRAFLR 683

Query: 286 GQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIM 320
                        + I E A+ R P        I+
Sbjct: 684 ---------DLAVMEIDEAAVRRAPTTSALVGQIV 709


>gi|158318502|ref|YP_001511010.1| helicase domain-containing protein [Frankia sp. EAN1pec]
 gi|158113907|gb|ABW16104.1| helicase domain protein [Frankia sp. EAN1pec]
          Length = 1143

 Score = 39.3 bits (90), Expect = 1.6,   Method: Composition-based stats.
 Identities = 24/136 (17%), Positives = 47/136 (34%), Gaps = 5/136 (3%)

Query: 110 VICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHY 169
            + +A+     KT +  E+     +L  +    +   +L  + W   +   +L  D+  Y
Sbjct: 273 GVIIADEVGLGKTYIAGELLHEAVILNRQKALVVAPATLRDSTWKPFLRETNLPADTVSY 332

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGF---LTERNANRFWIM 226
             + R             H      +I DEA     +           LT +   R  ++
Sbjct: 333 EELTRGMPAAGQQGAALQHPDAYALVIVDEAHALRSLGTQRAEAMRLLLTGKVPKRLVLL 392

Query: 227 TSNPRRLSGKFYEIFN 242
           T+ P   S   Y+++N
Sbjct: 393 TATPVNNS--LYDLYN 406


>gi|111184763|gb|ABH08471.1| putative terminase ATPase subunit [Human herpesvirus 3]
 gi|157965750|gb|ABW06896.1| DNA packaging protein [Human herpesvirus 3]
          Length = 743

 Score = 39.3 bits (90), Expect = 1.6,   Method: Composition-based stats.
 Identities = 32/155 (20%), Positives = 54/155 (34%), Gaps = 28/155 (18%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GKT     L+  +M+T  GI V   A             + K    +        + +  
Sbjct: 268 GKTWFLVPLIALVMATFRGIKVGYTA------------HIRKATEPV-------FEGIKS 308

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM------AIINDEASG 202
               W+       +  +S  +S    +YS      F   HNT G+       +  DEA+ 
Sbjct: 309 RLEQWFGANYVDHVKGESITFSFTDGSYSTAV---FASSHNTNGIRGQDFNLLFVDEANF 365

Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
                   I+GFL + N    ++ ++N  + S  F
Sbjct: 366 IRPDAVQTIVGFLNQTNCKIIFVSSTNTGKASTSF 400


>gi|83721852|emb|CAI44887.1| putative terminase ATPase subunit [Human herpesvirus 3]
 gi|94481989|gb|ABF21689.1| putative ATPase subunit of terminase [Human herpesvirus 3]
 gi|94482063|gb|ABF21762.1| putative ATPase subunit of terminase [Human herpesvirus 3]
 gi|94482137|gb|ABF21835.1| putative ATPase subunit of terminase [Human herpesvirus 3]
 gi|116489977|gb|ABJ98890.1| ORF45/42 [Human herpesvirus 3]
          Length = 747

 Score = 39.3 bits (90), Expect = 1.6,   Method: Composition-based stats.
 Identities = 32/155 (20%), Positives = 54/155 (34%), Gaps = 28/155 (18%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GKT     L+  +M+T  GI V   A             + K    +        + +  
Sbjct: 272 GKTWFLVPLIALVMATFRGIKVGYTA------------HIRKATEPV-------FEGIKS 312

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM------AIINDEASG 202
               W+       +  +S  +S    +YS      F   HNT G+       +  DEA+ 
Sbjct: 313 RLEQWFGANYVDHVKGESITFSFTDGSYSTAV---FASSHNTNGIRGQDFNLLFVDEANF 369

Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
                   I+GFL + N    ++ ++N  + S  F
Sbjct: 370 IRPDAVQTIVGFLNQTNCKIIFVSSTNTGKASTSF 404


>gi|9625919|ref|NP_040165.1| DNA packaging terminase subunit 1 [Human herpesvirus 3]
 gi|139650|sp|P09294|TRM3_VZVD RecName: Full=Tripartite terminase subunit UL15 homolog; AltName:
           Full=DNA-packaging protein 45; AltName: Full=Terminase
           large subunit; Contains: RecName: Full=Gene 42 protein
 gi|5869808|emb|CAB55553.1| putative ATPase subunit of terminase [Human herpesvirus 3 strain
           Dumas]
 gi|46981453|gb|AAT07724.1| DNA packaging protein [Human herpesvirus 3]
 gi|46981524|gb|AAT07800.1| DNA packaging protein [Human herpesvirus 3]
 gi|94481841|gb|ABF21543.1| putative ATPase subunit of terminase [Human herpesvirus 3]
 gi|94481915|gb|ABF21616.1| putative ATPase subunit of terminase [Human herpesvirus 3]
 gi|94482211|gb|ABF21908.1| putative ATPase subunit of terminase [Human herpesvirus 3]
 gi|94482285|gb|ABF21981.1| putative ATPase subunit of terminase [Human herpesvirus 3]
 gi|94482359|gb|ABF22054.1| putative ATPase subunit of terminase [Human herpesvirus 3]
 gi|94482433|gb|ABF22127.1| putative ATPase subunit of terminase [Human herpesvirus 3]
 gi|94482507|gb|ABF22200.1| putative ATPase subunit of terminase [Human herpesvirus 3]
 gi|94482581|gb|ABF22273.1| putative ATPase subunit of terminase [Human herpesvirus 3]
          Length = 747

 Score = 39.3 bits (90), Expect = 1.6,   Method: Composition-based stats.
 Identities = 32/155 (20%), Positives = 54/155 (34%), Gaps = 28/155 (18%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSL 148
           GKT     L+  +M+T  GI V   A             + K    +        + +  
Sbjct: 272 GKTWFLVPLIALVMATFRGIKVGYTA------------HIRKATEPV-------FEGIKS 312

Query: 149 HPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGM------AIINDEASG 202
               W+       +  +S  +S    +YS      F   HNT G+       +  DEA+ 
Sbjct: 313 RLEQWFGANYVDHVKGESITFSFTDGSYSTAV---FASSHNTNGIRGQDFNLLFVDEANF 369

Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
                   I+GFL + N    ++ ++N  + S  F
Sbjct: 370 IRPDAVQTIVGFLNQTNCKIIFVSSTNTGKASTSF 404


>gi|56692599|ref|YP_164067.1| large terminase subunit [Pseudomonas phage B3]
 gi|33338625|gb|AAQ13949.1|AF232233_31 large terminase subunit [Pseudomonas phage B3]
          Length = 486

 Score = 39.3 bits (90), Expect = 1.7,   Method: Composition-based stats.
 Identities = 27/139 (19%), Positives = 49/139 (35%), Gaps = 7/139 (5%)

Query: 257 TVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDP-- 314
            ++G+D + +   I     D +  + E     P  D  +F+  ++I  A   +       
Sbjct: 221 EIQGMDEAQYFDFIRAGCADEESFQQEYMCN-PADDDVAFLEYDLIASAEYPQTANWQQP 279

Query: 315 -YAPLIMGCDIAEEGGDNTVVVLRR--GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371
               L  G DI  +  D TV+ +    G V+         ++R +  +        R + 
Sbjct: 280 EGGRLFAGVDIGRK-KDLTVLWILELLGDVLYTRHVERLQNMRKSAQEAILWPWFQRCER 338

Query: 372 IIIDANNTGARTCDYLEML 390
           I IDA   G    D  +  
Sbjct: 339 ICIDATGLGIGWADDAQDQ 357


>gi|262172263|ref|ZP_06039941.1| ATP-dependent RNA helicase SrmB [Vibrio mimicus MB-451]
 gi|261893339|gb|EEY39325.1| ATP-dependent RNA helicase SrmB [Vibrio mimicus MB-451]
          Length = 416

 Score = 39.3 bits (90), Expect = 1.7,   Method: Composition-based stats.
 Identities = 40/237 (16%), Positives = 68/237 (28%), Gaps = 34/237 (14%)

Query: 47  GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106
           GFS P   Q E +                +      SA  G GKT   A   L  +   P
Sbjct: 22  GFSRPTQVQAEAI------------PQALDGRDVLASAPTGTGKTAAFAIPALQYLLDFP 69

Query: 107 -----GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCS 161
                   ++ L  +         AE ++ L+     + F +     +    ++D+L  +
Sbjct: 70  RRKAGPARILILTPTRELAMQV--AEQAQALAKNTRLNIFTITGGVQYQE--HADILATT 125

Query: 162 LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNAN 221
             I       +                      +I DEA    D+     +  L+     
Sbjct: 126 QDI------VVATPGRLLEYIDAERFDCRAIEWLILDEADRMLDMGFGPTVDRLSAECRW 179

Query: 222 RFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSD 278
           R   +  +   L G+  E F   L       ID        P      I+++   +D
Sbjct: 180 RKQTLLFSAT-LEGRGVEGFTADL-LKNPAHIDAE-----PPRRERKKISQWYHRAD 229


>gi|159482689|ref|XP_001699400.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158272851|gb|EDO98646.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 231

 Score = 39.3 bits (90), Expect = 1.7,   Method: Composition-based stats.
 Identities = 12/70 (17%), Positives = 26/70 (37%), Gaps = 2/70 (2%)

Query: 52  RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVI 111
           R+W     + V  +    V           +    G+GKT + A ++L      P   ++
Sbjct: 7   RTWLYPTDQEVREYQFRMV--RGALFANTLVCLPTGLGKTLIAAVVILNFYRWFPDGKLV 64

Query: 112 CLANSETQLK 121
             A ++  ++
Sbjct: 65  FTAPTKPLVE 74


>gi|75763594|ref|ZP_00743293.1| Stage V sporulation protein AA [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|74488924|gb|EAO52441.1| Stage V sporulation protein AA [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
          Length = 206

 Score = 38.9 bits (89), Expect = 1.8,   Method: Composition-based stats.
 Identities = 16/86 (18%), Positives = 37/86 (43%), Gaps = 2/86 (2%)

Query: 300 NIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK 359
             I   +       P   + +G D+A+  GD++VV L +  ++  +    KT +     K
Sbjct: 3   QTIYIKMRNRLKVSPTYEVKLG-DVAQLAGDSSVVELLQNEIVYKITAHDKTHVVIDVMK 61

Query: 360 ISGLVEKYRPDAIIIDANNTGARTCD 385
           +  ++++ +   + I+   +G    D
Sbjct: 62  VIEIIQQ-KASHVQINLLGSGQTLVD 86


>gi|262163925|ref|ZP_06031664.1| ATP-dependent RNA helicase SrmB [Vibrio mimicus VM223]
 gi|262027453|gb|EEY46119.1| ATP-dependent RNA helicase SrmB [Vibrio mimicus VM223]
          Length = 416

 Score = 38.9 bits (89), Expect = 1.8,   Method: Composition-based stats.
 Identities = 40/237 (16%), Positives = 68/237 (28%), Gaps = 34/237 (14%)

Query: 47  GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106
           GFS P   Q E +                +      SA  G GKT   A   L  +   P
Sbjct: 22  GFSRPTQVQAEAI------------PQALDGRDVLASAPTGTGKTAAFAIPALQYLLDFP 69

Query: 107 -----GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCS 161
                   ++ L  +         AE ++ L+     + F +     +    ++D+L  +
Sbjct: 70  RRKAGPARILILTPTRELAMQV--AEQAQALAKNTRLNIFTITGGVQYQE--HADILATT 125

Query: 162 LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNAN 221
             I       +                      +I DEA    D+     +  L+     
Sbjct: 126 QDI------VVATPGRLLEYIDAERFDCRAIEWLILDEADRMLDMGFGPTVDRLSAECRW 179

Query: 222 RFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSD 278
           R   +  +   L G+  E F   L       ID        P      I+++   +D
Sbjct: 180 RKQTLLFSAT-LEGRGVEGFTADL-LKNPAHIDAE-----PPRRERKKISQWYHRAD 229


>gi|258620330|ref|ZP_05715368.1| Superfamily II DNA and RNA helicase [Vibrio mimicus VM573]
 gi|258624701|ref|ZP_05719635.1| Superfamily II DNA and RNA helicase [Vibrio mimicus VM603]
 gi|258582988|gb|EEW07803.1| Superfamily II DNA and RNA helicase [Vibrio mimicus VM603]
 gi|258587209|gb|EEW11920.1| Superfamily II DNA and RNA helicase [Vibrio mimicus VM573]
          Length = 416

 Score = 38.9 bits (89), Expect = 1.8,   Method: Composition-based stats.
 Identities = 40/237 (16%), Positives = 68/237 (28%), Gaps = 34/237 (14%)

Query: 47  GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106
           GFS P   Q E +                +      SA  G GKT   A   L  +   P
Sbjct: 22  GFSRPTQVQAEAI------------PQALDGRDVLASAPTGTGKTAAFAIPALQYLLDFP 69

Query: 107 -----GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCS 161
                   ++ L  +         AE ++ L+     + F +     +    ++D+L  +
Sbjct: 70  RRKAGPARILILTPTRELAMQV--AEQAQALAKNTRLNIFTITGGVQYQE--HADILATT 125

Query: 162 LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNAN 221
             I       +                      +I DEA    D+     +  L+     
Sbjct: 126 QDI------VVATPGRLLEYIDAERFDCRAIEWLILDEADRMLDMGFGPTVDRLSAECRW 179

Query: 222 RFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSD 278
           R   +  +   L G+  E F   L       ID        P      I+++   +D
Sbjct: 180 RKQTLLFSAT-LEGRGVEGFTADL-LKNPAHIDAE-----PPRRERKKISQWYHRAD 229


>gi|206580893|ref|YP_002240749.1| type I site-specific deoxyribonuclease, HsdR family [Klebsiella
           pneumoniae 342]
 gi|206569951|gb|ACI11727.1| type I site-specific deoxyribonuclease, HsdR family [Klebsiella
           pneumoniae 342]
          Length = 1031

 Score = 38.9 bits (89), Expect = 1.9,   Method: Composition-based stats.
 Identities = 50/330 (15%), Positives = 101/330 (30%), Gaps = 50/330 (15%)

Query: 86  RGIGKTTLNAWLVLWLMSTRPGISV-ICLANSE--TQLKTTLWA---EVSKWLSLLPNKH 139
           +G GK+    WL  W+    P   V I    +E   Q+++       E+           
Sbjct: 278 QGSGKSLTMVWLAKWIRENVPNSRVLIVTDRTELDEQIESVFMGVDEEI------YRTSS 331

Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRT--YSEERPDTFVGHHNTYGMAIIN 197
             ++ +   HP PW    L    G  S+   T       +E +            + +  
Sbjct: 332 GNDLIATLNHPNPWLICSLVHKFGRRSEAEDTAATDDFITELQQSLTKTFRAKGDLFVFV 391

Query: 198 DE-------------ASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKP 244
           DE              +  P+ + +G  G    +   +  +       + G +   +   
Sbjct: 392 DECHRTQSGKLHNAMTAILPEALFIGFTGTPLMKKDKKKSV------EVFGPYIHTYKFD 445

Query: 245 LDDWKRFQIDTR------TVEGIDPSFHEGIIARYGLD-SDVTRVEVCGQFPQQDIDSFI 297
                   +D R                +          S++ + ++  ++         
Sbjct: 446 EAVADGVVLDLRYEARDIDQYLTSEKKVDDWFEAKTRGLSNLAKTQLKQKWGSMQ-KLLS 504

Query: 298 PLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTN 357
             + +E+ +N         P +M     +  G+  +V        +    +S+TD     
Sbjct: 505 SKSRLEQIVNDILLDMDTRPRLM-----DGRGNAMLVCSSVYQACKVYEMFSQTD---LA 556

Query: 358 NKISGLVEKYRPDAIIIDANNTGARTCDYL 387
            K++ +V  +RPDA  I    TGA   + L
Sbjct: 557 GKVA-IVTSFRPDAASIKGEETGAGLTEKL 585


>gi|310722509|ref|YP_003969332.1| Dda DNA helicase [Aeromonas phage phiAS5]
 gi|306021352|gb|ADM79886.1| Dda DNA helicase [Aeromonas phage phiAS5]
          Length = 454

 Score = 38.9 bits (89), Expect = 1.9,   Method: Composition-based stats.
 Identities = 26/147 (17%), Positives = 46/147 (31%), Gaps = 27/147 (18%)

Query: 66  CLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLW 125
              S++    +     IS   G GK+ L   L+   +  +    VI  A +  Q K  L 
Sbjct: 17  QKRSIDAVLNDRSHITISGPAGSGKSFLTKILIK-KLIEKNNGGVILSAPT-HQAKIVL- 73

Query: 126 AEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFV 185
                                    + + +  +H  + I    Y  + R + + + D   
Sbjct: 74  ----------------------SKMSGYTASTIHSIMKIHPDTYEDV-REFKQSKSDK-A 109

Query: 186 GHHNTYGMAIINDEASGTPDVINLGIL 212
                    +I DEAS   + +   IL
Sbjct: 110 KKDLNEVRYLIVDEASMVDNDLFEIIL 136


>gi|145603324|ref|XP_369340.2| hypothetical protein MGG_06124 [Magnaporthe oryzae 70-15]
 gi|145011578|gb|EDJ96234.1| hypothetical protein MGG_06124 [Magnaporthe oryzae 70-15]
          Length = 1998

 Score = 38.9 bits (89), Expect = 1.9,   Method: Composition-based stats.
 Identities = 24/118 (20%), Positives = 41/118 (34%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    RPG  V+ +A         L  E V  W + L      +
Sbjct: 1176 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAP-----MKALVRERVKDWGARLAQPMGLK 1230

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    +    + I   + +  + R++         G+     + II DE
Sbjct: 1231 LVELTGDNTPDTRTIKDADVIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1281


>gi|209885731|ref|YP_002289588.1| phage DNA Packaging Protein [Oligotropha carboxidovorans OM5]
 gi|209873927|gb|ACI93723.1| phage DNA Packaging Protein [Oligotropha carboxidovorans OM5]
          Length = 434

 Score = 38.9 bits (89), Expect = 1.9,   Method: Composition-based stats.
 Identities = 68/419 (16%), Positives = 129/419 (30%), Gaps = 80/419 (19%)

Query: 84  AGRGIGKTTLNA------WLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPN 137
            GRG GKT   A       L L  ++ +P   +  +  +E  ++  +   VS  L++   
Sbjct: 52  GGRGAGKTRAGAEWIRAQALGLAPLAQQPAGRIALVGETEHDVREVMIEGVSGLLAVHRR 111

Query: 138 KHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIIN 197
                          W                  +   +S E P++  G           
Sbjct: 112 DE----------RPMWQPSRRRLEWKN-----GAVAHAFSAEDPESLRG---PQFACAWA 153

Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTS-NPRRLSGKFYEIFNKPLDDWKRFQIDTR 256
           DE +                 +  +F +     PR+L         +P    KR   D  
Sbjct: 154 DELAK-----WRYAEAAF---DMLQFGLRLGAQPRQLIT----TTPRPTALIKRLLNDES 201

Query: 257 TVE----------GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEAL 306
            V            + P+F + ++ARY   + + R E+ G+  ++  D+     +IE   
Sbjct: 202 CVTTRAATRSNALHLAPTFLQSVMARY-AGTRLGRQELDGELIEERPDALWSRGLIETC- 259

Query: 307 NREPCPDPYAPLIMGCD-IAEEGGDNTV-------VVLRRGPVIEHLFDWSKTDLRTTNN 358
            R     P   +++  D  A  G            V    G  +      ++        
Sbjct: 260 -RISEAPPLQRIVVAVDPPATSGKRADACGIVAAGVAADNGLYVLADETLTQAAPAAWAA 318

Query: 359 KISGLVEKYRPDAIIIDANNTGARTCDYLEMLG--YHVYRVLGQKRAVDLEFCRNRRTEL 416
           +   L  +   DA++++ N  G      +  +     V  V   +           R E 
Sbjct: 319 RAVALWRRLEADALVVEVNQGGEMVRAVIAQVDPSVPVQPVRALRGKW-------LRAE- 370

Query: 417 HVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYT 475
              +A   E    + H+G+   L+  +      +G   + S      +S D+ D L++ 
Sbjct: 371 --PIATLYEQGR-VRHAGVFAALED-EMCDFATSG---LSS-----GRSPDHLDALVWA 417


>gi|221117267|ref|XP_002154001.1| PREDICTED: similar to yeast Swi2/Snf2-Like family member (ssl-1),
           partial [Hydra magnipapillata]
          Length = 2164

 Score = 38.9 bits (89), Expect = 1.9,   Method: Composition-based stats.
 Identities = 28/153 (18%), Positives = 48/153 (31%), Gaps = 22/153 (14%)

Query: 87  GIGKTTLNAWLVLWLMSTRPGISVI--CLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144
           G+GKT +    +L  ++   G       +  +   L   L  E+ KW        +F  Q
Sbjct: 657 GLGKT-IQTIALLAHLACEEGCWGPHLIIVPTSVMLNWEL--ELKKWCPGFKILTYFGTQ 713

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204
                      +      G    +   +C T  +                II DEA    
Sbjct: 714 ----------KERKIKRAGWCKPNAFHVCITSYKLVIQDHQAFKRRKWKYIILDEAQNIK 763

Query: 205 D---VINLGILGFLTERNANRFWIMTSNPRRLS 234
           +        +L F    N++R  ++T  P + S
Sbjct: 764 NFKSQRWQTLLNF----NSHRRLLLTGTPLQNS 792


>gi|163868971|ref|YP_001610200.1| hypothetical protein Btr_1983 [Bartonella tribocorum CIP 105476]
 gi|161018647|emb|CAK02205.1| phage-related protein [Bartonella tribocorum CIP 105476]
          Length = 453

 Score = 38.9 bits (89), Expect = 1.9,   Method: Composition-based stats.
 Identities = 39/303 (12%), Positives = 88/303 (29%), Gaps = 27/303 (8%)

Query: 84  AGRGIGKT---TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140
            GRG GKT    L + +V +         +I  A               +      N+  
Sbjct: 39  GGRGSGKTRSFALMSAVVGYRHGMAGERGIILCA---------------RQFQNSLNESS 83

Query: 141 FEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
            E    ++   P+  D               +   ++    +               DEA
Sbjct: 84  LEEIKRAIEAYPFLQDYYEIGDKYIKSKDGRIAYVFAGLDRNIASIKSMGRVFLCWVDEA 143

Query: 201 SGTPDVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFNK-PLDDWKRFQIDTRT 257
               +     ++  L E     N    +T NP   +    + F      + K  +I+ R 
Sbjct: 144 EPVTETAWQTLIPTLREEGDDWNAELWVTWNPYHENAPVEKRFRNVDNPNIKGVEINWRD 203

Query: 258 VEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAP 317
                   +   ++      +       G + Q    ++    ++E  +       P  P
Sbjct: 204 NPKFPEKLNRDRLSDLQQRPEQYNHIWEGGYLQAVQGAYYQKCLLEAEMEGRITTVPRDP 263

Query: 318 ---LIMGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDA 371
              + +  DI   G   D T + + +       + D+ +   +  +  +  + ++    A
Sbjct: 264 LMQVRIFWDIGGTGAKADATALWVAQFVGREIRVLDYYEAQGQPLSEHVGWVFQRGYEKA 323

Query: 372 III 374
           +++
Sbjct: 324 LMV 326


>gi|255729652|ref|XP_002549751.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240132820|gb|EER32377.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 1162

 Score = 38.9 bits (89), Expect = 2.0,   Method: Composition-based stats.
 Identities = 10/46 (21%), Positives = 22/46 (47%), Gaps = 4/46 (8%)

Query: 80  GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE----TQLK 121
             ++   G+GKT + + ++L  +   P   +I +A +      Q+K
Sbjct: 107 VLVALPTGLGKTFIASTVMLNFLRWFPNSKIIFMAPTRPLVAQQIK 152


>gi|330911327|gb|EGH39837.1| phage terminase, large subunit [Escherichia coli AA86]
          Length = 555

 Score = 38.9 bits (89), Expect = 2.0,   Method: Composition-based stats.
 Identities = 26/152 (17%), Positives = 49/152 (32%), Gaps = 24/152 (15%)

Query: 178 EERPDTFVGHHNTYGMAIINDEASGTP--DVINLGILGFLTERNANRFWIMTSNP-RRLS 234
             RP    G        ++ DEA+     D +    +  LT   A    I T N    L 
Sbjct: 159 SSRPSNLRGLQGD----VVIDEAAFHESLDELLKAAM-ALTMWGARVRIISTHNGVDNLF 213

Query: 235 GKFYEIFNKPLDDWKRFQIDTRTV--------------EGIDPSFHEGIIARYGLDSDVT 280
            ++ +   +   D+   +I                   +   P   +        ++   
Sbjct: 214 NQYIQEAREGRKDYSVHRITLDDAIADGLYRRICYVTGQEWSPESEQKWRDDLYKNAPTR 273

Query: 281 RV--EVCGQFPQQDIDSFIPLNIIEEALNREP 310
               E  G  P++   ++IP  +IE A++R+ 
Sbjct: 274 EDADEEYGCIPKKSGGAYIPHALIEMAMSRDI 305


>gi|171690334|ref|XP_001910092.1| hypothetical protein [Podospora anserina S mat+]
 gi|170945115|emb|CAP71226.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1993

 Score = 38.9 bits (89), Expect = 2.0,   Method: Composition-based stats.
 Identities = 22/118 (18%), Positives = 38/118 (32%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W     PG  V+ +A         L  E V  W   L       
Sbjct: 1164 SPTGSGKTVAAELAMWWAFREHPGSKVVYIAP-----MKALVRERVKDWGDRLAKPLGLR 1218

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    +    + I   + +  + R++         G+     + +I DE
Sbjct: 1219 LVELTGDNTPDTRTIQDADIIITTPEKWDGISRSWQT------RGYVRKVSLVVI-DE 1269


>gi|297519140|ref|ZP_06937526.1| Terminase, ATPase subunit [Escherichia coli OP50]
          Length = 159

 Score = 38.9 bits (89), Expect = 2.2,   Method: Composition-based stats.
 Identities = 11/48 (22%), Positives = 17/48 (35%)

Query: 341 VIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLE 388
            I     W   D R   + I  L E+Y    I ID+        + ++
Sbjct: 11  RILERHQWRGMDFRAQADAIKKLTEQYNVTYIGIDSTGVDHGVYENVK 58


>gi|212545286|ref|XP_002152797.1| DEAD/DEAH box helicase, putative [Penicillium marneffei ATCC 18224]
 gi|210065766|gb|EEA19860.1| DEAD/DEAH box helicase, putative [Penicillium marneffei ATCC 18224]
          Length = 2022

 Score = 38.9 bits (89), Expect = 2.2,   Method: Composition-based stats.
 Identities = 27/155 (17%), Positives = 49/155 (31%), Gaps = 22/155 (14%)

Query: 55   QLEFMEVVDAHCLNSVNNPNPEVFKGA--------ISAGRGIGKTTLNAWLVLWLMSTRP 106
            Q   +E +        N    ++F           + +  G GKT      + W    RP
Sbjct: 1125 QNPILEEIYGQRFQFFNPMQTQLFHTLYHTSANVLLGSPTGSGKTVACELAMWWAFRERP 1184

Query: 107  GISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI- 164
            G  V+ +A         L  E V  W   +      ++  L+    P    +    + I 
Sbjct: 1185 GSKVVYIAP-----MKALVRERVQDWRKRITTAMGLKLVELTGDNTPDTRTIRDADIIIT 1239

Query: 165  DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
              + +  + R++         G+     + II DE
Sbjct: 1240 TPEKWDGISRSWQT------RGYVRQVSLVII-DE 1267


>gi|255933656|ref|XP_002558207.1| Pc12g14010 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211582826|emb|CAP81028.1| Pc12g14010 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 2009

 Score = 38.9 bits (89), Expect = 2.2,   Method: Composition-based stats.
 Identities = 21/118 (17%), Positives = 38/118 (32%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    +PG  V+ +A         L  E V  W   L  +   +
Sbjct: 1166 SPTGSGKTVAAELAMWWAFREKPGSKVVYIAP-----MKALVRERVQDWRKRLTRQMGLK 1220

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    +    + I   + +  + R++                  +I DE
Sbjct: 1221 LVELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQTRDYVR-------KVSLVIIDE 1271


>gi|92113525|ref|YP_573453.1| hypothetical protein Csal_1399 [Chromohalobacter salexigens DSM
           3043]
 gi|91796615|gb|ABE58754.1| protein of unknown function DUF264 [Chromohalobacter salexigens DSM
           3043]
          Length = 594

 Score = 38.5 bits (88), Expect = 2.3,   Method: Composition-based stats.
 Identities = 24/164 (14%), Positives = 44/164 (26%), Gaps = 22/164 (13%)

Query: 242 NKPLDDWKRF-QIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300
             P   W++   I+     G D    + +   Y    +     +  +F      +F  + 
Sbjct: 326 RGPDGQWRQIVTIEDAIAGGCDLFDLDQLRLEY--SDEEFANLLMCEFVDDSQSAFPMMT 383

Query: 301 IIEEALNREPCPDPYAP----------LIMGCDIAEEGGDN-----TVVVL--RRGPVIE 343
           +    ++       + P          + +G D A +  D       V+     R     
Sbjct: 384 MQRCMVDSWDIWRDWKPFAARPFGDKPVWLGYDPAGDNLDGDGAGLVVLAPAKNRNDRHR 443

Query: 344 HLFDWS--KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCD 385
            L        D       I  +  +Y    I ID N  G     
Sbjct: 444 ILEKHRIKGQDYEEQAGFIEQVTRRYNVQFIGIDINGMGEAVAQ 487


>gi|159164912|gb|ABV80241.2| mutant required to maintain repression 1 [Zea mays]
          Length = 1435

 Score = 38.5 bits (88), Expect = 2.3,   Method: Composition-based stats.
 Identities = 18/87 (20%), Positives = 29/87 (33%), Gaps = 15/87 (17%)

Query: 55  QLEFMEVVDAHCLNSV-NNPNPEVFKGAISAG----R--GIGKTTLNAWLVLWLMSTRPG 107
           Q E  E +  + +  +  +      K  +  G       G GKT L    +   M   P 
Sbjct: 852 QREAFEFMWTNLVGDIRLDEIKHGAKPDVVGGCVICHAPGTGKTRLAIVFIQTYMKVFPD 911

Query: 108 ISVICLANSETQLKTTLWA---EVSKW 131
              + +A      +  L+A   E  KW
Sbjct: 912 CRPVIIAP-----RGMLFAWDEEFKKW 933


>gi|159164911|gb|ABV80240.2| mutant required to maintain repression 1 [Zea mays]
          Length = 1435

 Score = 38.5 bits (88), Expect = 2.3,   Method: Composition-based stats.
 Identities = 18/87 (20%), Positives = 29/87 (33%), Gaps = 15/87 (17%)

Query: 55  QLEFMEVVDAHCLNSV-NNPNPEVFKGAISAG----R--GIGKTTLNAWLVLWLMSTRPG 107
           Q E  E +  + +  +  +      K  +  G       G GKT L    +   M   P 
Sbjct: 852 QREAFEFMWTNLVGDIRLDEIKHGAKPDVVGGCVICHAPGTGKTRLAIVFIQTYMKVFPD 911

Query: 108 ISVICLANSETQLKTTLWA---EVSKW 131
              + +A      +  L+A   E  KW
Sbjct: 912 CRPVIIAP-----RGMLFAWDEEFKKW 933


>gi|332701845|ref|ZP_08421933.1| hypothetical protein Desaf_0686 [Desulfovibrio africanus str.
           Walvis Bay]
 gi|332551994|gb|EGJ49038.1| hypothetical protein Desaf_0686 [Desulfovibrio africanus str.
           Walvis Bay]
          Length = 554

 Score = 38.5 bits (88), Expect = 2.3,   Method: Composition-based stats.
 Identities = 51/283 (18%), Positives = 87/283 (30%), Gaps = 68/283 (24%)

Query: 262 DPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN------REPCPDPY 315
              ++E I   YG    V R E+    P+      IP   IEEA+       R    D +
Sbjct: 257 KRKWYERIRNSYGPRVAVMREELDAI-PRDGGGQAIPGVWIEEAMREARPILRIALDDDF 315

Query: 316 A--------------------PLIMGCDIAEE---------GGDNTVVV--------LRR 338
           A                    PL+   D A E           D +VV         +RR
Sbjct: 316 AKLPEDSRRVWGSEWIDRHLKPLLARLDPAREHVFGQDFARHRDFSVVAPLEIGQTLIRR 375

Query: 339 GPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDY-LEMLGYHVYRV 397
            P +  + +            +   + ++R  A+  DA  +GA   +Y  +  G+     
Sbjct: 376 APFLLEMHNVPTRQQEQILWALIAALPRFRGGAM--DATGSGATLAEYTADKFGHERIHQ 433

Query: 398 LGQKRAVDLEFCRNRRTELHVKMADWLEFA--SLINHSGLIQNLKSLKSFIVPNTGELAI 455
           +   +A   E           K+ D  E     L   + +  +L++L+       G + +
Sbjct: 434 VMLSQAWYREHMP--------KLVDAFETGMIDLPRDADIESDLRALEEI----DGIIKL 481

Query: 456 ESKRVK------GAKSTDYSDGLMYT-FAENPPRSDMDFGRCP 491
              R +        +  D +       FA        D+   P
Sbjct: 482 PDIRKQDLKDAELKRHGDSAIAFALGWFASQGTAEAFDYRPVP 524


>gi|255683197|ref|YP_003084405.1| UL15 [Duck enteritis virus]
 gi|254840012|gb|ACT83557.1| UL15 [Anatid herpesvirus 1]
          Length = 739

 Score = 38.5 bits (88), Expect = 2.3,   Method: Composition-based stats.
 Identities = 28/153 (18%), Positives = 54/153 (35%), Gaps = 24/153 (15%)

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLAN----SETQLKTTLWAEVSKWLSLLPNKHWFEMQ 144
           GKT     L+   ++   GI +   A+    +E  +   + A + +W      +H     
Sbjct: 265 GKTWFIVPLIALALTKFRGIKIGYTAHIRKATEP-VFDEIDARIRRWFGNGRVEHI---- 319

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTP 204
                      + +  S    SK   T     S    ++  G    + +  + DEA+   
Sbjct: 320 ---------KGETISFSFQDGSKSTVTFA---SSHNTNSLRGQ--DFNLLFV-DEANFIR 364

Query: 205 DVINLGILGFLTERNANRFWIMTSNPRRLSGKF 237
                 I+GFL + N    ++ ++N  + S  F
Sbjct: 365 SDAVQTIVGFLNQTNCKIIFVSSTNTGKSSTSF 397


>gi|159164914|gb|ABV80243.2| mutant required to maintain repression 1 [Zea mays]
          Length = 1435

 Score = 38.5 bits (88), Expect = 2.3,   Method: Composition-based stats.
 Identities = 18/87 (20%), Positives = 29/87 (33%), Gaps = 15/87 (17%)

Query: 55  QLEFMEVVDAHCLNSV-NNPNPEVFKGAISAG----R--GIGKTTLNAWLVLWLMSTRPG 107
           Q E  E +  + +  +  +      K  +  G       G GKT L    +   M   P 
Sbjct: 852 QREAFEFMWTNLVGDIRLDEIKHGAKPDVVGGCVICHAPGTGKTRLAIVFIQTYMKVFPD 911

Query: 108 ISVICLANSETQLKTTLWA---EVSKW 131
              + +A      +  L+A   E  KW
Sbjct: 912 CRPVIIAP-----RGMLFAWDEEFKKW 933


>gi|159164908|gb|ABV80237.2| required to maintain repression 1 [Zea mays]
 gi|159164910|gb|ABV80239.2| required to maintain repression 1 [Zea mays]
          Length = 1435

 Score = 38.5 bits (88), Expect = 2.3,   Method: Composition-based stats.
 Identities = 18/87 (20%), Positives = 29/87 (33%), Gaps = 15/87 (17%)

Query: 55  QLEFMEVVDAHCLNSV-NNPNPEVFKGAISAG----R--GIGKTTLNAWLVLWLMSTRPG 107
           Q E  E +  + +  +  +      K  +  G       G GKT L    +   M   P 
Sbjct: 852 QREAFEFMWTNLVGDIRLDEIKHGAKPDVVGGCVICHAPGTGKTRLAIVFIQTYMKVFPD 911

Query: 108 ISVICLANSETQLKTTLWA---EVSKW 131
              + +A      +  L+A   E  KW
Sbjct: 912 CRPVIIAP-----RGMLFAWDEEFKKW 933


>gi|67524049|ref|XP_660086.1| hypothetical protein AN2482.2 [Aspergillus nidulans FGSC A4]
 gi|40744644|gb|EAA63800.1| hypothetical protein AN2482.2 [Aspergillus nidulans FGSC A4]
 gi|259487904|tpe|CBF86944.1| TPA: DEAD/DEAH box helicase, putative (AFU_orthologue; AFUA_4G03070)
            [Aspergillus nidulans FGSC A4]
          Length = 2015

 Score = 38.5 bits (88), Expect = 2.3,   Method: Composition-based stats.
 Identities = 26/151 (17%), Positives = 44/151 (29%), Gaps = 19/151 (12%)

Query: 56   LEFMEVVDAHCLNSVNNPNPEVFKGAIS-----AGRGIGKTTLNAWLVLWLMSTRPGISV 110
            LE +        N +      V     +     +  G GKT      + W    RPG  V
Sbjct: 1122 LEELYGQRFQYFNPMQTQLFHVLYHTAANVLLGSPTGSGKTVAAELAMWWAFRERPGSKV 1181

Query: 111  ICLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI-DSKH 168
            + +A         L  E V  W   L      ++  L+    P    +    + I   + 
Sbjct: 1182 VYIAP-----MKALVRERVMDWGRRLTAPMGLKLVELTGDNTPDTRTIRDADIIITTPEK 1236

Query: 169  YSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  + R++                  +I DE
Sbjct: 1237 WDGISRSWQTRDYVR-------KVSLVIIDE 1260


>gi|159164909|gb|ABV80238.2| required to maintain repression 1 [Zea mays]
          Length = 1435

 Score = 38.5 bits (88), Expect = 2.4,   Method: Composition-based stats.
 Identities = 18/87 (20%), Positives = 29/87 (33%), Gaps = 15/87 (17%)

Query: 55  QLEFMEVVDAHCLNSV-NNPNPEVFKGAISAG----R--GIGKTTLNAWLVLWLMSTRPG 107
           Q E  E +  + +  +  +      K  +  G       G GKT L    +   M   P 
Sbjct: 852 QREAFEFMWTNLVGDIRLDEIKHGAKPDVVGGCVICHAPGTGKTRLAIVFIQTYMKVFPD 911

Query: 108 ISVICLANSETQLKTTLWA---EVSKW 131
              + +A      +  L+A   E  KW
Sbjct: 912 CRPVIIAP-----RGMLFAWDEEFKKW 933


>gi|52079727|ref|YP_078518.1| putative phage terminase (large subunit) [Bacillus licheniformis
           ATCC 14580]
 gi|52785096|ref|YP_090925.1| XtmB [Bacillus licheniformis ATCC 14580]
 gi|319646468|ref|ZP_08000697.1| XtmB protein [Bacillus sp. BT1B_CT2]
 gi|52002938|gb|AAU22880.1| putative phage terminase (large subunit) [Bacillus licheniformis
           ATCC 14580]
 gi|52347598|gb|AAU40232.1| XtmB [Bacillus licheniformis ATCC 14580]
 gi|317391056|gb|EFV71854.1| XtmB protein [Bacillus sp. BT1B_CT2]
          Length = 432

 Score = 38.5 bits (88), Expect = 2.4,   Method: Composition-based stats.
 Identities = 38/207 (18%), Positives = 69/207 (33%), Gaps = 24/207 (11%)

Query: 194 AIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQI 253
            I  +E S         ++G L       + ++T+NP   S   Y  F K   + KRF +
Sbjct: 118 LIWIEECSEVKYEGFKELIGRLRHPYHRLYMMLTTNPVSQSNWTYRHFFKDERN-KRFIL 176

Query: 254 D---------------------TRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQD 292
           D                           +  S+ + +      D D+ R+   G+F    
Sbjct: 177 DDEVLYKKRVAVVGDTYYHHSTADDNLFLPKSYLKQLDDMKAYDPDLYRIARKGRFGVNG 236

Query: 293 IDSFIPLNIIE-EALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHL-FDWSK 350
                   ++E E + R+           G D   E   N VV     P  ++L   W  
Sbjct: 237 TRVLPQFEVMEHEEVMRQISAISNPLKRTGMDFGFEESYNAVVRAAVDPDKKYLYIYWEY 296

Query: 351 TDLRTTNNKISGLVEKYRPDAIIIDAN 377
              + T++K +  + ++     +I A+
Sbjct: 297 YKNKMTDDKTAEELHEFAVAKELIKAD 323


>gi|288931818|ref|YP_003435878.1| hypothetical protein Ferp_1452 [Ferroglobus placidus DSM 10642]
 gi|288894066|gb|ADC65603.1| protein of unknown function DUF699 ATPase putative [Ferroglobus
           placidus DSM 10642]
          Length = 763

 Score = 38.5 bits (88), Expect = 2.5,   Method: Composition-based stats.
 Identities = 31/157 (19%), Positives = 59/157 (37%), Gaps = 27/157 (17%)

Query: 65  HCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMS------TRPGISVICLANSET 118
               +  +   E     I+A RG GKT +   +  +L+S       RP + ++ +A +  
Sbjct: 218 EAFETFFDRKREKKAVVITANRGRGKTAVLGIVTPYLISRMNRVLKRP-VRILVVAPTPY 276

Query: 119 QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSE 178
            ++T  +  + K L     K + E +S         +D++       ++    + R    
Sbjct: 277 AVQTY-FKFLKKALVRQGMKEFKEKRS---------NDLVTVINSKWARVEYAVPRRAMV 326

Query: 179 ERPDTFVGHHNTYGMAIINDEASGTP-DVINLGILGF 214
           E+          Y   II DEA+G    V+   + G 
Sbjct: 327 EK---------DYADIIIVDEAAGIDVPVLWKIVEGA 354


>gi|240850562|ref|YP_002971962.1| phage terminase, large subunit [Bartonella grahamii as4aup]
 gi|240267685|gb|ACS51273.1| phage terminase, large subunit [Bartonella grahamii as4aup]
          Length = 441

 Score = 38.5 bits (88), Expect = 2.5,   Method: Composition-based stats.
 Identities = 25/182 (13%), Positives = 56/182 (30%), Gaps = 9/182 (4%)

Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFN-KPLDD 247
             +    DEA    +     ++  L E          +T NP R +    + F     + 
Sbjct: 122 RILLCWVDEAEPVTETAWQTLIPTLREEGEGWRAELWVTWNPLRENAPVEKRFRFSDNEA 181

Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII---EE 304
            KR +I+           +E  +       +  +    G + +    ++    ++   +E
Sbjct: 182 IKRVEINWSDNPKFPKILNEARLDDLRNRPETYKHIWEGDYLKAVQGAYYQKEMLAAEQE 241

Query: 305 ALNREPCPDPYAPLIMGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTNNKIS 361
                   DP   +    DI   G   D T + + +       + ++ +   +  +  I 
Sbjct: 242 GRIGRVARDPLMQIRAFWDIGGTGAKADATAIWIAQFVGREIRVLNYYEAQGQPLSEHIG 301

Query: 362 GL 363
            L
Sbjct: 302 WL 303


>gi|156543626|ref|XP_001604556.1| PREDICTED: hypothetical protein [Nasonia vitripennis]
          Length = 990

 Score = 38.5 bits (88), Expect = 2.5,   Method: Composition-based stats.
 Identities = 20/134 (14%), Positives = 42/134 (31%), Gaps = 15/134 (11%)

Query: 77  VFKGAISAGRGIGKTTLNAWLVLWLMSTRPGI-SVICLANSETQLKTTLWAEVSKWLSLL 135
            F   + A  G GKT +   + L ++  +     VI LA +          E++  +  +
Sbjct: 61  GFDLIVRAKSGTGKTAVFGIIALEMIDIKISSVQVIILAPT---------REIAIQIKEV 111

Query: 136 PNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI 195
                 E++ L +        +      + + H +       +   D        +    
Sbjct: 112 IASLGCEIKGLKVESFIGGVAMDIDRKKLSNCHIAIGAPGRVKHLIDKGY-LKMDHVRLF 170

Query: 196 INDEASGTPDVINL 209
           + DEA    D +  
Sbjct: 171 VLDEA----DKLME 180


>gi|242087829|ref|XP_002439747.1| hypothetical protein SORBIDRAFT_09g019410 [Sorghum bicolor]
 gi|241945032|gb|EES18177.1| hypothetical protein SORBIDRAFT_09g019410 [Sorghum bicolor]
          Length = 1535

 Score = 38.5 bits (88), Expect = 2.7,   Method: Composition-based stats.
 Identities = 18/87 (20%), Positives = 29/87 (33%), Gaps = 15/87 (17%)

Query: 55   QLEFMEVVDAHCLNSV-NNPNPEVFKGAISAG----R--GIGKTTLNAWLVLWLMSTRPG 107
            Q E  E +  + +  +  +      K  +  G       G GKT L    +   M   P 
Sbjct: 952  QREAFEFMWTNLVGGIRLDELKHGAKPDVVGGCVICHAPGTGKTRLAIVFIQTYMKVFPD 1011

Query: 108  ISVICLANSETQLKTTLWA---EVSKW 131
               + +A      +  L+A   E  KW
Sbjct: 1012 CRPVIIAP-----RGMLFAWDEEFKKW 1033


>gi|46949065|gb|AAT07420.1| UL89 DNA packaging protein [Macacine herpesvirus 3]
          Length = 671

 Score = 38.5 bits (88), Expect = 2.7,   Method: Composition-based stats.
 Identities = 16/77 (20%), Positives = 28/77 (36%), Gaps = 3/77 (3%)

Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220
            + ID K   +     S    ++  G        ++ DEA    +     ILGFL +   
Sbjct: 273 VISIDHKGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKEKAFNTILGFLAQNTT 329

Query: 221 NRFWIMTSNPRRLSGKF 237
              +I ++N    +  F
Sbjct: 330 KIIFISSTNTTSDATCF 346


>gi|261212229|ref|ZP_05926515.1| ATP-dependent RNA helicase SrmB [Vibrio sp. RC341]
 gi|260838837|gb|EEX65488.1| ATP-dependent RNA helicase SrmB [Vibrio sp. RC341]
          Length = 421

 Score = 38.5 bits (88), Expect = 2.9,   Method: Composition-based stats.
 Identities = 40/237 (16%), Positives = 69/237 (29%), Gaps = 34/237 (14%)

Query: 47  GFSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP 106
           GFS P   Q E +                +      SA  G GKT   A   L  +   P
Sbjct: 22  GFSRPTQVQAEAI------------PQALDGRDVLASAPTGTGKTAAFAIPALQYLLDFP 69

Query: 107 -----GISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCS 161
                   ++ L  +         AE ++ L+     + F +     +    ++D+L  +
Sbjct: 70  RRKAGPARILILTPTRELAMQV--AEQAQALAKNTRLNIFTITGGVQYQE--HADILATT 125

Query: 162 LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNAN 221
             I       +                      +I DEA    D+     +  L+     
Sbjct: 126 QDI------VVATPGRLLEYIDAERFDCRAIEWLILDEADRMLDMGFGPTVDRLSTECRW 179

Query: 222 RFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSD 278
           R   +  +   L G+  E F   L        D   V+   P      I+++   +D
Sbjct: 180 RKQTLLFSAT-LEGRGVEGFTADLLK------DPAHVDAEPPRRERKKISQWYHRAD 229


>gi|240851102|ref|YP_002972504.1| phage terminase large subunit [Bartonella grahamii as4aup]
 gi|240268225|gb|ACS51813.1| phage terminase large subunit [Bartonella grahamii as4aup]
          Length = 453

 Score = 38.5 bits (88), Expect = 2.9,   Method: Composition-based stats.
 Identities = 46/306 (15%), Positives = 94/306 (30%), Gaps = 33/306 (10%)

Query: 84  AGRGIGKT---TLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHW 140
            GRG GKT    L + +V +         +I  A               +      N+  
Sbjct: 39  GGRGSGKTRSFALMSAVVGYRHGMAGERGIILCA---------------RQFQNSLNESS 83

Query: 141 FEMQSLSLHPAPWYSDVLHCSLG-IDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI-IND 198
            E    ++   P+  D        I SK    +      +R        +   + +   D
Sbjct: 84  LEEIKRAIESYPFLQDYYDIGDKYIKSKDGRIVYVFAGLDR--NIASIKSMGRVFLCWVD 141

Query: 199 EASGTPDVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFNK-PLDDWKRFQIDT 255
           EA    +     ++  L E     N    +T NP   +    + F        K  +I+ 
Sbjct: 142 EAEPVTETAWQTLIPTLREEGKDWNAELWVTWNPCYENAPVEKRFRNVDNPHIKGAEINW 201

Query: 256 RTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPY 315
           R         +   +       +       G++ Q    ++    ++E  +       P 
Sbjct: 202 RDNPQFPEKLNRDRMDDLQQRPEQYNHIWEGEYLQAVQGAYYQKCLLEAEMEGRITTVPR 261

Query: 316 AP---LIMGCDIAEEG--GDNTVVVLRR--GPVIEHLFDWSKTDLRTTNNKISGLVEKYR 368
            P   + +  DI   G   D T + + +  G  I  L D+ +   +  +  +  + ++  
Sbjct: 262 DPLMQVRIFWDIGGTGAKADATALWVAQFIGREIRVL-DYYEAQGQPLSEHVGWVFQRGY 320

Query: 369 PDAIII 374
             A+++
Sbjct: 321 EKALMV 326


>gi|299532092|ref|ZP_07045486.1| mu-like prophage Flumu protein gp28 [Comamonas testosteroni S44]
 gi|298719754|gb|EFI60717.1| mu-like prophage Flumu protein gp28 [Comamonas testosteroni S44]
          Length = 470

 Score = 38.2 bits (87), Expect = 3.1,   Method: Composition-based stats.
 Identities = 33/240 (13%), Positives = 68/240 (28%), Gaps = 39/240 (16%)

Query: 268 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEA-------LNREPCPDPYAPLIM 320
             +     D++    E     P  D   F+   +I            R         L  
Sbjct: 232 DFVKNGAADAESFDQEYMCI-PADDDSKFLEYGLITACEYLGGTDWKRGLQGPFQGRLFC 290

Query: 321 GCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYR----PDAIIIDA 376
           G DI  +  D TV+ +     +  +F     +      K       +      D + ID+
Sbjct: 291 GVDIGRK-KDLTVLWVV--EQLGDVFYTRHVETMEKMRKSDQEKILWPWFAICDRVCIDS 347

Query: 377 NNTGARTCDYLEML--GYHVYRVLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSG 434
              G    D  +     + +  V    +  +      +      K+        +     
Sbjct: 348 TGLGIGWTDDAQDKFGEHRIEGVSFTGQVKEALAYPLKGAMEDRKIR-------IPEDPK 400

Query: 435 LIQNLKSLKSFIVPNTG--ELAIESKRVKGAKSTD-YSD---GLMYTF-AENPPRSDMDF 487
           +  +L+ ++  +  + G      ES       + D ++D    L     A N P +  ++
Sbjct: 401 IRADLRKVQK-VTTSAGNIRFVAES-------TPDGHADRFWALALALQAGNSPAAPFEY 452


>gi|262401641|ref|ZP_06078207.1| ATP-dependent RNA helicase SrmB [Vibrio sp. RC586]
 gi|262352058|gb|EEZ01188.1| ATP-dependent RNA helicase SrmB [Vibrio sp. RC586]
          Length = 416

 Score = 38.2 bits (87), Expect = 3.3,   Method: Composition-based stats.
 Identities = 34/222 (15%), Positives = 62/222 (27%), Gaps = 22/222 (9%)

Query: 62  VDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRP-----GISVICLANS 116
                         +      SA  G GKT   A   L  +   P        ++ L  +
Sbjct: 25  RPTQVQAEAIPQALDGRDVLASAPTGTGKTAAFAIPALQYLLDFPRRKPGPARILILTPT 84

Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTY 176
                    AE ++ L+     + F +     +    ++D+L  +  I       +    
Sbjct: 85  RELAMQV--AEQAQALAKNTRLNIFTITGGVQYQE--HADILATTQDI------VVATPG 134

Query: 177 SEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGK 236
                             +I DEA    D+     +  L+     R   +  +   L G+
Sbjct: 135 RLLEYIDAERFDCRAIEWLILDEADRMLDMGFGPTVDRLSTECRWRKQTLLFSAT-LEGR 193

Query: 237 FYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSD 278
             E F   L        D   V+   P      I+++   +D
Sbjct: 194 GVEGFTADLLK------DPAHVDAEPPRRERKKISQWYHRAD 229


>gi|91199577|emb|CAI77931.1| putative helicase [Streptomyces ambofaciens ATCC 23877]
 gi|96771624|emb|CAI78205.1| putative helicase [Streptomyces ambofaciens ATCC 23877]
 gi|117164172|emb|CAJ87711.1| putative helicase [Streptomyces ambofaciens ATCC 23877]
 gi|126347284|emb|CAJ88989.1| putative helicase [Streptomyces ambofaciens ATCC 23877]
          Length = 886

 Score = 38.2 bits (87), Expect = 3.3,   Method: Composition-based stats.
 Identities = 14/70 (20%), Positives = 28/70 (40%), Gaps = 4/70 (5%)

Query: 52  RSWQLEFMEVVDAHC-LNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISV 110
           R  Q+E  + +      ++ ++  PE  +G I +  G GKT + A      +    G  +
Sbjct: 5   REHQVEQKQSIREWVGFSARSSVPPEGMRGTIVSATGSGKTIMAAASA---LECFAGGRI 61

Query: 111 ICLANSETQL 120
           +    +   L
Sbjct: 62  LVTVPTLDLL 71


>gi|302381364|ref|YP_003817187.1| hypothetical protein Bresu_0249 [Brevundimonas subvibrioides ATCC
           15264]
 gi|302191992|gb|ADK99563.1| conserved hypothetical protein [Brevundimonas subvibrioides ATCC
           15264]
          Length = 556

 Score = 38.2 bits (87), Expect = 3.4,   Method: Composition-based stats.
 Identities = 41/239 (17%), Positives = 75/239 (31%), Gaps = 23/239 (9%)

Query: 170 STMCRTYSEERPDTFVGHHNTYGMAIINDEASG---------TPDVINLGILGFLTERNA 220
             + + +S E P+   G       A   DE                  L +L        
Sbjct: 235 GAVAQAFSAEDPEALRG---PQFAAAWADEFCAWPRGGRGGRGGPGATLALLRMGLRLGE 291

Query: 221 NRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVT 280
               ++T+ P +  G   ++  +P    +         + +   F EG+   YG      
Sbjct: 292 RPRLVVTTTP-KPIGALRDLRAEP-GLVQTHAATRDNADHLAAGFVEGLERLYGGTRKAA 349

Query: 281 RVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNT--VVVLR 337
             E+ G+  +Q   S     ++  A  R      +  +++  D     GG+    VV  R
Sbjct: 350 -QELEGRVVEQ-EGSLFTAEMMGRA--RGVLEGSFDRIVVAIDPTTTAGGNACGIVVAGR 405

Query: 338 RGPVIEHLFDWS--KTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHV 394
            G     L D S           +     E++   A++++ N  G      L+  G  V
Sbjct: 406 VGDRAHVLADRSVAGLGPDGWARRAVRAAEEFGAVALVVEVNQGGEMVRAVLKTAGCSV 464


>gi|312881427|ref|ZP_07741222.1| DNA-dependent helicase II [Vibrio caribbenthicus ATCC BAA-2122]
 gi|309370909|gb|EFP98366.1| DNA-dependent helicase II [Vibrio caribbenthicus ATCC BAA-2122]
          Length = 724

 Score = 38.2 bits (87), Expect = 3.5,   Method: Composition-based stats.
 Identities = 37/206 (17%), Positives = 72/206 (34%), Gaps = 20/206 (9%)

Query: 239 EIFNKP-----LDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDI 293
           + FN P     L  ++ +Q        +D +           D+   R     +F    +
Sbjct: 161 DTFNDPVTQTYLQLYRAYQEACDRAGLVDFAEILLRAQELLRDNKHIRQHYQTRFKHILV 220

Query: 294 DSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDL 353
           D F   N I+ A  R         +I+       G D+  +   RG  +E++    K  +
Sbjct: 221 DEFQDTNNIQYAWLRLMAGPDTHVMIV-------GDDDQSIYGWRGAKVENIE---KFTV 270

Query: 354 RTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLGYHVYRVLGQKRAVDLEFCRNRR 413
              +     L + YR    I+DA+N  A   +  E +G  ++        + +    N  
Sbjct: 271 EFPSVNTIRLEQNYRSTKTILDASN--ALIANNTERMGKELWTDGSAGEPISVYSAYNEL 328

Query: 414 TELHV---KMADWLEFASLINHSGLI 436
            E      K+ +W E   ++  + ++
Sbjct: 329 DEARFAVSKIKEWQEKGGVLTDTAML 354


>gi|302539315|ref|ZP_07291657.1| TtrA [Streptomyces sp. C]
 gi|302448210|gb|EFL20026.1| TtrA [Streptomyces sp. C]
          Length = 888

 Score = 38.2 bits (87), Expect = 3.5,   Method: Composition-based stats.
 Identities = 12/52 (23%), Positives = 20/52 (38%), Gaps = 3/52 (5%)

Query: 69  SVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
           S +   P+  +G I +  G GKT   A      +   PG  ++    +   L
Sbjct: 23  SRSPVPPQGTRGTIVSATGSGKTITAAAGA---LECFPGGRILVTVPTLDLL 71


>gi|307727814|ref|YP_003911027.1| SNF2-related protein [Burkholderia sp. CCGE1003]
 gi|307588339|gb|ADN61736.1| SNF2-related protein [Burkholderia sp. CCGE1003]
          Length = 1227

 Score = 38.2 bits (87), Expect = 3.5,   Method: Composition-based stats.
 Identities = 27/203 (13%), Positives = 51/203 (25%), Gaps = 25/203 (12%)

Query: 128 VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH 187
           V  W         F  +   L             +G      +T    + +++     G 
Sbjct: 739 VHNWREEARR---FAPELKVLVLNGPQRKERFEQIGEHELILTTYALLWRDQKV--LAG- 792

Query: 188 HNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD 247
           H  +   +I DEA    +         +   +A     +T  P           N   + 
Sbjct: 793 HEYH--LLILDEAQYVKNATTKAAQ-AIRGLSARHRLCLTGTPLE---------NHLGEL 840

Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
           W +F        G    F          + D  R  +  +  +     F+     +E   
Sbjct: 841 WSQFDFLLPGFLGTQKDFTRRWRNPIEKNHDGVRRSLLARRIRP----FMLRRRKDEVAK 896

Query: 308 REPCPDPYAPLIMGCDIAEEGGD 330
             P       ++   D+     D
Sbjct: 897 ELPAKT---TIVCSVDLEGAQRD 916


>gi|164659175|ref|XP_001730712.1| hypothetical protein MGL_2166 [Malassezia globosa CBS 7966]
 gi|159104609|gb|EDP43498.1| hypothetical protein MGL_2166 [Malassezia globosa CBS 7966]
          Length = 838

 Score = 38.2 bits (87), Expect = 3.5,   Method: Composition-based stats.
 Identities = 26/152 (17%), Positives = 48/152 (31%), Gaps = 15/152 (9%)

Query: 87  GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA-EVSKWLSLLPNKHWFEMQS 145
           G+GKT      ++ L+   P    + +A +   L+   W  E+ K+   L    W   Q 
Sbjct: 244 GMGKT----IQMISLLVADPKRPSLVVAPTVAILQ---WRNEMQKYAPGLRVVVWHGAQR 296

Query: 146 LSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEE----RPDTFVGHHNTYGMAIINDEAS 201
                     DV+  S  +    +       +      R  +    H      II DEA 
Sbjct: 297 SRDRDTLSTVDVVLTSYAVLESTFRRDRYGVTRNGRHVREQSL--LHAMKWRRIILDEAH 354

Query: 202 GTPDVINLGILGFLTERNANRFWIMTSNPRRL 233
              +  +           ++  W ++  P + 
Sbjct: 355 HIKERTSNTARSAFA-LQSDFKWCLSGTPLQN 385


>gi|289619624|emb|CBI53907.1| unnamed protein product [Sordaria macrospora]
          Length = 2051

 Score = 38.2 bits (87), Expect = 3.5,   Method: Composition-based stats.
 Identities = 29/152 (19%), Positives = 50/152 (32%), Gaps = 22/152 (14%)

Query: 58   FMEVVDAHCLNSVNNPNPEVFKGA--------ISAGRGIGKTTLNAWLVLWLMSTRPGIS 109
             +E + A      N    +VF           + +  G GKT      + W    RPG  
Sbjct: 1170 ALEEIYAQRFQYFNPMQTQVFHTLYHTPANVLLGSPTGSGKTVACELAMWWAFRERPGSK 1229

Query: 110  VICLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI-DSK 167
            V+ +A         L  E V  W + L      ++  L+    P    +    + I   +
Sbjct: 1230 VVYIAP-----MKALVRERVKDWGARLAKPLGLKLVELTGDNTPDTRTIQDADIIITTPE 1284

Query: 168  HYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
             +  + R++         G+     + II DE
Sbjct: 1285 KWDGISRSWQT------RGYVRKVSLVII-DE 1309


>gi|183600815|ref|ZP_02962308.1| hypothetical protein PROSTU_04416 [Providencia stuartii ATCC 25827]
 gi|188019600|gb|EDU57640.1| hypothetical protein PROSTU_04416 [Providencia stuartii ATCC 25827]
          Length = 413

 Score = 38.2 bits (87), Expect = 3.5,   Method: Composition-based stats.
 Identities = 33/200 (16%), Positives = 64/200 (32%), Gaps = 21/200 (10%)

Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGH-HN-------TYGMAIIND 198
           ++   PW SD          K+  T CR+ S      F G  HN          +    D
Sbjct: 79  AIRSVPWLSDFYELG----EKYIRTKCRSVSYV----FAGLRHNLDSIKSKARILIAWVD 130

Query: 199 EASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNK-PLDDWKRFQIDTRT 257
           EA    ++    +   + E  +   W+ T NP +      + F K P D+    +++   
Sbjct: 131 EAESVSEIAWTKLAPTVREAGSE-IWV-TWNPEKDGSATDKRFRKEPPDNAIIVEMNYDD 188

Query: 258 VEGIDPSFHEGIIARYGL-DSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYA 316
                    E  ++     D +       G + +      +    + ++   +       
Sbjct: 189 NPWFPSVLEEERLSDQSRLDPNTYAWIWEGAYLENSDKQVLANKYVVQSFP-DDLWKQAD 247

Query: 317 PLIMGCDIAEEGGDNTVVVL 336
            L+ G D       NT++ +
Sbjct: 248 RLLFGADFGFAKDPNTLIRM 267


>gi|71004784|ref|XP_757058.1| hypothetical protein UM00911.1 [Ustilago maydis 521]
 gi|46096862|gb|EAK82095.1| hypothetical protein UM00911.1 [Ustilago maydis 521]
          Length = 1490

 Score = 38.2 bits (87), Expect = 3.6,   Method: Composition-based stats.
 Identities = 12/39 (30%), Positives = 19/39 (48%), Gaps = 3/39 (7%)

Query: 87  GIGKTTLNAWLVLWLMSTRPGISVICLANSE---TQLKT 122
           G+GKT + A ++L      P   ++ LA +     Q KT
Sbjct: 304 GLGKTFIAAVVILNFFRWYPDGKILFLAPTRPLVDQQKT 342


>gi|319943331|ref|ZP_08017613.1| hypothetical protein HMPREF0551_0459 [Lautropia mirabilis ATCC
           51599]
 gi|319743146|gb|EFV95551.1| hypothetical protein HMPREF0551_0459 [Lautropia mirabilis ATCC
           51599]
          Length = 220

 Score = 38.2 bits (87), Expect = 3.7,   Method: Composition-based stats.
 Identities = 28/117 (23%), Positives = 37/117 (31%), Gaps = 14/117 (11%)

Query: 52  RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVI 111
            +WQ         H +             AI    G GKTTL A L     S  PG  V+
Sbjct: 30  TAWQASPRTAFVDHLMARAGTHAGRPAIIAIDGRSGSGKTTLTAALA----SVVPGAQVL 85

Query: 112 CLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLH--PAPWYSDVLHCSLGID 165
                   L   +W E + +W   L         + +L   P PW       S+ I 
Sbjct: 86  -------HLDDLIWNEPLYQWDQQLVAALSELHTTGALDLIPHPWREHGREGSIRIT 135


>gi|49475449|ref|YP_033490.1| terminase large subunit protein [Bartonella henselae str.
           Houston-1]
 gi|49475495|ref|YP_033536.1| terminase large subunit protein [Bartonella henselae str.
           Houston-1]
 gi|49238255|emb|CAF27468.1| Terminase large subunit protein [Bartonella henselae str.
           Houston-1]
 gi|49238301|emb|CAF27516.1| Terminase large subunit protein [Bartonella henselae str.
           Houston-1]
          Length = 340

 Score = 38.2 bits (87), Expect = 3.8,   Method: Composition-based stats.
 Identities = 26/186 (13%), Positives = 54/186 (29%), Gaps = 17/186 (9%)

Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFN-KPLDD 247
             +    DEA    +     ++  L E          +T NP R +      F     + 
Sbjct: 83  RILLCWVDEAEPVTETAWQTLIPTLREEGEGWRAELWVTWNPLRDNAPVERRFRFSNNEA 142

Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE--- 304
            KR +I+           +E  +       +  +    G +      ++    ++     
Sbjct: 143 IKRVEINWSDNPKFPKILNEARLDDLKNRPETYKHIWEGAYLTAIQGAYYQKEMLAAEQE 202

Query: 305 ----ALNREPCPDPYAPLIMGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTN 357
                + R+P     A      DI   G   D T + + +       + D+ +   +  +
Sbjct: 203 GRIGRVARDPLMQMRAFW----DIGGTGAKADATAIWIAQFVGREIRVLDYYEAQGQPLS 258

Query: 358 NKISGL 363
             I  L
Sbjct: 259 EHIGWL 264


>gi|327355898|gb|EGE84755.1| activating signal cointegrator 1 complex subunit 3 [Ajellomyces
            dermatitidis ATCC 18188]
          Length = 2024

 Score = 37.8 bits (86), Expect = 3.9,   Method: Composition-based stats.
 Identities = 27/152 (17%), Positives = 49/152 (32%), Gaps = 22/152 (14%)

Query: 58   FMEVVDAHCLNSVNNPNPEVFKGA--------ISAGRGIGKTTLNAWLVLWLMSTRPGIS 109
             +E + A      N    ++F           + +  G GKT      + W    +PG  
Sbjct: 1131 ILEEIYAQRFQFFNPMQTQIFHTLYHTPANVLLGSPTGSGKTVAAELAMWWAFREKPGSK 1190

Query: 110  VICLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI-DSK 167
            V+ +A         L  E V  W   L      ++  L+    P    +    + I   +
Sbjct: 1191 VVYIAP-----MKALVRERVHDWRRRLTAPMGLKLVELTGDNTPDTRTIRDADIIITTPE 1245

Query: 168  HYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
             +  + R++         G+     + II DE
Sbjct: 1246 KWDGISRSWQT------RGYVRQVSLVII-DE 1270


>gi|239609198|gb|EEQ86185.1| activating signal cointegrator 1 complex subunit 3 [Ajellomyces
            dermatitidis ER-3]
          Length = 2024

 Score = 37.8 bits (86), Expect = 3.9,   Method: Composition-based stats.
 Identities = 27/152 (17%), Positives = 49/152 (32%), Gaps = 22/152 (14%)

Query: 58   FMEVVDAHCLNSVNNPNPEVFKGA--------ISAGRGIGKTTLNAWLVLWLMSTRPGIS 109
             +E + A      N    ++F           + +  G GKT      + W    +PG  
Sbjct: 1131 ILEEIYAQRFQFFNPMQTQIFHTLYHTPANVLLGSPTGSGKTVAAELAMWWAFREKPGSK 1190

Query: 110  VICLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI-DSK 167
            V+ +A         L  E V  W   L      ++  L+    P    +    + I   +
Sbjct: 1191 VVYIAP-----MKALVRERVHDWRRRLTAPMGLKLVELTGDNTPDTRTIRDADIIITTPE 1245

Query: 168  HYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
             +  + R++         G+     + II DE
Sbjct: 1246 KWDGISRSWQT------RGYVRQVSLVII-DE 1270


>gi|261189015|ref|XP_002620920.1| activating signal cointegrator 1 complex subunit 3 [Ajellomyces
            dermatitidis SLH14081]
 gi|239591924|gb|EEQ74505.1| activating signal cointegrator 1 complex subunit 3 [Ajellomyces
            dermatitidis SLH14081]
          Length = 2024

 Score = 37.8 bits (86), Expect = 3.9,   Method: Composition-based stats.
 Identities = 27/152 (17%), Positives = 49/152 (32%), Gaps = 22/152 (14%)

Query: 58   FMEVVDAHCLNSVNNPNPEVFKGA--------ISAGRGIGKTTLNAWLVLWLMSTRPGIS 109
             +E + A      N    ++F           + +  G GKT      + W    +PG  
Sbjct: 1131 ILEEIYAQRFQFFNPMQTQIFHTLYHTPANVLLGSPTGSGKTVAAELAMWWAFREKPGSK 1190

Query: 110  VICLANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI-DSK 167
            V+ +A         L  E V  W   L      ++  L+    P    +    + I   +
Sbjct: 1191 VVYIAP-----MKALVRERVHDWRRRLTAPMGLKLVELTGDNTPDTRTIRDADIIITTPE 1245

Query: 168  HYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
             +  + R++         G+     + II DE
Sbjct: 1246 KWDGISRSWQT------RGYVRQVSLVII-DE 1270


>gi|85105138|ref|XP_961895.1| activating signal cointegrator 1 complex subunit 3 [Neurospora crassa
            OR74A]
 gi|28923479|gb|EAA32659.1| activating signal cointegrator 1 complex subunit 3 [Neurospora crassa
            OR74A]
          Length = 2066

 Score = 37.8 bits (86), Expect = 3.9,   Method: Composition-based stats.
 Identities = 24/118 (20%), Positives = 41/118 (34%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    RPG  V+ +A         L  E V  W + L      +
Sbjct: 1199 SPTGSGKTVACELAMWWAFRERPGSKVVYIAP-----MKALVRERVKDWGARLAKPLGLK 1253

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    +    + I   + +  + R++         G+     + II DE
Sbjct: 1254 LVELTGDNTPDTRTIQDADIIITTPEKWDGISRSWQT------RGYVRKVSLVII-DE 1304


>gi|18640523|ref|NP_570364.1| DNA packaging protein A [Synechococcus phage P60]
 gi|18478753|gb|AAL73302.1| DNA packaging protein A [Synechococcus phage P60]
          Length = 308

 Score = 37.8 bits (86), Expect = 4.0,   Method: Composition-based stats.
 Identities = 18/91 (19%), Positives = 32/91 (35%), Gaps = 5/91 (5%)

Query: 314 PYAPLIMGCDIAEEGGDNTV-VVLRRGP---VIEHLFDWSKTDLRTTNNKISGLVEKYRP 369
            Y   I+  D +  G D TV VVL +      +  L  +       T + I  L ++Y+ 
Sbjct: 77  DYDETIVSVDPSGRGTDETVAVVLSQANGYVFVRDLKAYRDGYSDATLSDIVRLGKRYKA 136

Query: 370 DAIIIDANNTGARTCDYLEMLGYHVYRVLGQ 400
             +++++N  G      L             
Sbjct: 137 SRLLVESN-FGDGMVCELFNRHIQQMGAGFS 166


>gi|213406229|ref|XP_002173886.1| ATP-dependent 3' to 5' DNA helicase [Schizosaccharomyces japonicus
           yFS275]
 gi|212001933|gb|EEB07593.1| ATP-dependent 3' to 5' DNA helicase [Schizosaccharomyces japonicus
           yFS275]
          Length = 812

 Score = 37.8 bits (86), Expect = 4.2,   Method: Composition-based stats.
 Identities = 20/116 (17%), Positives = 42/116 (36%), Gaps = 15/116 (12%)

Query: 87  GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSL 146
           G+GKT + A +++      P  ++  LA ++  L   + A +   L+ +P     E+   
Sbjct: 156 GLGKTFIAAVVMMNYYRWFPQSNIAFLAPTKPLLYQQMQACIH--LTGIPESSIVELNGE 213

Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAI--INDEA 200
                      L   L  D + +    +T + +             + +  + DEA
Sbjct: 214 V-------KPELRKQLFRDKRVFFVTPQTLNND----IQTEVCDPRLFVCLVFDEA 258


>gi|302894383|ref|XP_003046072.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256726999|gb|EEU40359.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 1970

 Score = 37.8 bits (86), Expect = 4.3,   Method: Composition-based stats.
 Identities = 23/118 (19%), Positives = 40/118 (33%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    RP   V+ +A         L  E V  W + L      +
Sbjct: 1158 SPTGSGKTVAAELAMWWAFRERPKSKVVYIAP-----MKALVRERVKDWGARLARPLGLK 1212

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    +    + I   + +  + R++         G+     + II DE
Sbjct: 1213 LVELTGDNTPDTRTIQDADVIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1263


>gi|194899456|ref|XP_001979275.1| GG24642 [Drosophila erecta]
 gi|190650978|gb|EDV48233.1| GG24642 [Drosophila erecta]
          Length = 1450

 Score = 37.8 bits (86), Expect = 4.6,   Method: Composition-based stats.
 Identities = 29/198 (14%), Positives = 60/198 (30%), Gaps = 20/198 (10%)

Query: 16  FDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPR--SWQLEFMEVVDAH-CLNSVNN 72
            D+ W D+     +   +H     E+    EG   P       E  ++   H  +   N 
Sbjct: 1   MDVNWIDDDDDLVAALAMHEEQKTEEADGTEGHPQPELSDEACEGFDMAAGHNWIYPNNL 60

Query: 73  PNPEVFKGAISAG----------RGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKT 122
           P     +  + +            G+GKT + A L+       P   ++ +A +   +  
Sbjct: 61  PLRSYQQTIVQSALFKNTLVVLPTGLGKTFIAAVLMYNFYRWYPKGKIVFMAPTRPLVSQ 120

Query: 123 TLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPD 182
               ++     ++P      +Q     P P  +++        +          + +   
Sbjct: 121 ----QIHASQKIMPFPSADTVQLTGQLPRPKRAELWDSKRVFFATPQVVHSDMLTADGGS 176

Query: 183 TFVGHHNTYGMAIINDEA 200
            F          I+ DEA
Sbjct: 177 NFP---FGSIKLIVVDEA 191


>gi|296083594|emb|CBI23583.3| unnamed protein product [Vitis vinifera]
          Length = 1287

 Score = 37.8 bits (86), Expect = 4.6,   Method: Composition-based stats.
 Identities = 28/148 (18%), Positives = 49/148 (33%), Gaps = 10/148 (6%)

Query: 87  GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSL 146
           G+GKT +   L+L     RPG       +S   L     A +S+W   L      E  S+
Sbjct: 634 GLGKTVMTIALIL----ARPGRR-----SSGGTLIVCPMALLSQWKDELETHSKPESISI 684

Query: 147 SLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDV 206
            +H     ++        D    +    T + +  +     H      ++ DEA      
Sbjct: 685 FIHYGGDRTNDPKVISEHDVVLTTYGVLTSAYKNDENSSIFHRVEWYRVVLDEAHTIKSS 744

Query: 207 INLGILGFLTERNANRFWIMTSNPRRLS 234
             L          ++  W +T  P + +
Sbjct: 745 KTLSAQAAFALP-SHCRWCLTGTPLQNN 771


>gi|242815191|ref|XP_002486521.1| DEAD/DEAH box helicase, putative [Talaromyces stipitatus ATCC 10500]
 gi|218714860|gb|EED14283.1| DEAD/DEAH box helicase, putative [Talaromyces stipitatus ATCC 10500]
          Length = 2030

 Score = 37.8 bits (86), Expect = 4.8,   Method: Composition-based stats.
 Identities = 24/118 (20%), Positives = 40/118 (33%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    RPG  V+ +A         L  E V  W   L      +
Sbjct: 1164 SPTGSGKTVACELAMWWAFRERPGSKVVYIAP-----MKALVRERVQDWRKRLTAAMGLK 1218

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    +    + I   + +  + R++         G+     + II DE
Sbjct: 1219 LVELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1269


>gi|238502669|ref|XP_002382568.1| DEAD/DEAH box helicase, putative [Aspergillus flavus NRRL3357]
 gi|220691378|gb|EED47726.1| DEAD/DEAH box helicase, putative [Aspergillus flavus NRRL3357]
          Length = 1997

 Score = 37.8 bits (86), Expect = 5.0,   Method: Composition-based stats.
 Identities = 21/118 (17%), Positives = 37/118 (31%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    +PG  V+ +A         L  E V  W   L      +
Sbjct: 1163 SPTGSGKTVAAELAMWWAFREKPGSKVVYIAP-----MKALVRERVHDWKKRLTGPMGLK 1217

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    +    + I   + +  + R++                  +I DE
Sbjct: 1218 LVELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQTRDYVR-------KVSLVIIDE 1268


>gi|169775993|ref|XP_001822463.1| helicase mug81 [Aspergillus oryzae RIB40]
 gi|83771198|dbj|BAE61330.1| unnamed protein product [Aspergillus oryzae]
          Length = 1998

 Score = 37.8 bits (86), Expect = 5.0,   Method: Composition-based stats.
 Identities = 21/118 (17%), Positives = 37/118 (31%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    +PG  V+ +A         L  E V  W   L      +
Sbjct: 1163 SPTGSGKTVAAELAMWWAFREKPGSKVVYIAP-----MKALVRERVHDWKKRLTGPMGLK 1217

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    +    + I   + +  + R++                  +I DE
Sbjct: 1218 LVELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQTRDYVR-------KVSLVIIDE 1268


>gi|20026680|ref|NP_612722.1| DNA packaging terminase subunit 1 [Panine herpesvirus 2]
 gi|19881108|gb|AAM00728.1|AF480884_80 DNA packaging protein UL89 [Panine herpesvirus 2]
          Length = 672

 Score = 37.4 bits (85), Expect = 5.0,   Method: Composition-based stats.
 Identities = 15/85 (17%), Positives = 29/85 (34%), Gaps = 3/85 (3%)

Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGIL 212
           +  +     + ID +   +     S    ++  G        ++ DEA          IL
Sbjct: 265 YLVENKDNVISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTIL 321

Query: 213 GFLTERNANRFWIMTSNPRRLSGKF 237
           GFL +      +I ++N    +  F
Sbjct: 322 GFLAQNTTKIIFISSTNTTSDATCF 346


>gi|73695754|gb|AAZ80628.1| rhUL89 [Macacine herpesvirus 3]
          Length = 671

 Score = 37.4 bits (85), Expect = 5.1,   Method: Composition-based stats.
 Identities = 16/77 (20%), Positives = 27/77 (35%), Gaps = 3/77 (3%)

Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220
            + ID K   +     S    ++  G        ++ DEA          ILGFL +   
Sbjct: 273 VISIDHKGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 329

Query: 221 NRFWIMTSNPRRLSGKF 237
              +I ++N    +  F
Sbjct: 330 KIIFISSTNTTSDATCF 346


>gi|222615621|gb|EEE51753.1| hypothetical protein OsJ_33185 [Oryza sativa Japonica Group]
          Length = 726

 Score = 37.4 bits (85), Expect = 5.2,   Method: Composition-based stats.
 Identities = 7/38 (18%), Positives = 16/38 (42%)

Query: 80  GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE 117
             ++   G+GKT + A ++       P   ++  A + 
Sbjct: 264 TLVALPTGLGKTFIAAVVMYNYFRWFPEGKIVFTAPTR 301


>gi|218185362|gb|EEC67789.1| hypothetical protein OsI_35346 [Oryza sativa Indica Group]
          Length = 648

 Score = 37.4 bits (85), Expect = 5.2,   Method: Composition-based stats.
 Identities = 7/38 (18%), Positives = 16/38 (42%)

Query: 80  GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE 117
             ++   G+GKT + A ++       P   ++  A + 
Sbjct: 186 TLVALPTGLGKTFIAAVVMYNYFRWFPEGKIVFTAPTR 223


>gi|62734194|gb|AAX96303.1| Similar to probable ATP-dependent RNA helicase - fission yeast
           (Schizosaccharomyces pombe) [Oryza sativa Japonica
           Group]
 gi|77548994|gb|ABA91791.1| Type III restriction enzyme, res subunit family protein, expressed
           [Oryza sativa Japonica Group]
          Length = 1488

 Score = 37.4 bits (85), Expect = 5.2,   Method: Composition-based stats.
 Identities = 7/38 (18%), Positives = 16/38 (42%)

Query: 80  GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSE 117
             ++   G+GKT + A ++       P   ++  A + 
Sbjct: 264 TLVALPTGLGKTFIAAVVMYNYFRWFPEGKIVFTAPTR 301


>gi|320035817|gb|EFW17757.1| DEAD/DEAH box helicase [Coccidioides posadasii str. Silveira]
          Length = 1970

 Score = 37.4 bits (85), Expect = 5.2,   Method: Composition-based stats.
 Identities = 23/118 (19%), Positives = 41/118 (34%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    +PG  V+ +A         L  E V  W   L      +
Sbjct: 1155 SPTGSGKTVAAELAMWWAFREKPGSKVVYIAP-----MKALVRERVQDWRRRLAMPLGLK 1209

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    + +  + I   + +  + R++         G+     + II DE
Sbjct: 1210 LVELTGDNTPDTRTIRNADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1260


>gi|303321375|ref|XP_003070682.1| activating signal cointegrator 1 complex subunit, putative
            [Coccidioides posadasii C735 delta SOWgp]
 gi|240110378|gb|EER28537.1| activating signal cointegrator 1 complex subunit, putative
            [Coccidioides posadasii C735 delta SOWgp]
          Length = 1970

 Score = 37.4 bits (85), Expect = 5.2,   Method: Composition-based stats.
 Identities = 23/118 (19%), Positives = 41/118 (34%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    +PG  V+ +A         L  E V  W   L      +
Sbjct: 1155 SPTGSGKTVAAELAMWWAFREKPGSKVVYIAP-----MKALVRERVQDWRRRLAMPLGLK 1209

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    + +  + I   + +  + R++         G+     + II DE
Sbjct: 1210 LVELTGDNTPDTRTIRNADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1260


>gi|119180556|ref|XP_001241737.1| hypothetical protein CIMG_08900 [Coccidioides immitis RS]
          Length = 1970

 Score = 37.4 bits (85), Expect = 5.2,   Method: Composition-based stats.
 Identities = 23/118 (19%), Positives = 41/118 (34%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    +PG  V+ +A         L  E V  W   L      +
Sbjct: 1155 SPTGSGKTVAAELAMWWAFREKPGSKVVYIAP-----MKALVRERVQDWRRRLAMPLGLK 1209

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    + +  + I   + +  + R++         G+     + II DE
Sbjct: 1210 LVELTGDNTPDTRTIRNADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1260


>gi|171058461|ref|YP_001790810.1| exodeoxyribonuclease V subunit alpha [Leptothrix cholodnii SP-6]
 gi|170775906|gb|ACB34045.1| exodeoxyribonuclease V, alpha subunit [Leptothrix cholodnii SP-6]
          Length = 739

 Score = 37.4 bits (85), Expect = 5.3,   Method: Composition-based stats.
 Identities = 27/162 (16%), Positives = 50/162 (30%), Gaps = 28/162 (17%)

Query: 48  FSAPRSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPG 107
           F  P +               S            I+ G G GKT   A L+  +M+  P 
Sbjct: 235 FGGPPA-------PDRFDWQRSACAIALRGRLALITGGPGTGKTYTVARLLALVMAVHPQ 287

Query: 108 I---SVICLANSET---QLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCS 161
                +   A +     +LK ++ + + +  + LP    + +    L  +     +L   
Sbjct: 288 PQALRIALAAPTGKAAARLKQSIDSALQQLAAALPGALDWGLLQQRLSQSLTLHKLLGAR 347

Query: 162 LGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGT 203
              D++ +    R             H      ++ DEAS  
Sbjct: 348 P--DTRRFGRDAR-------------HPLEVDLLVVDEASMV 374


>gi|331086511|ref|ZP_08335590.1| hypothetical protein HMPREF0987_01893 [Lachnospiraceae bacterium
           9_1_43BFAA]
 gi|330410569|gb|EGG89997.1| hypothetical protein HMPREF0987_01893 [Lachnospiraceae bacterium
           9_1_43BFAA]
          Length = 649

 Score = 37.4 bits (85), Expect = 5.3,   Method: Composition-based stats.
 Identities = 44/297 (14%), Positives = 86/297 (28%), Gaps = 58/297 (19%)

Query: 117 ETQLKTTLWAEVSKWLSLLPNKHWFEMQ---SLSLHPAPWYSDVLHCSLGIDSKHYSTMC 173
           +  L    W EV +W+  +  K     +      ++P  +                   C
Sbjct: 344 QGHLFAKSWNEVERWVEAIIEKGDPIQKERVERVIYPERFEHSFEEMMFTRKE------C 397

Query: 174 RTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRL 233
           R       +   G      + +      G  D+   GI   +T       +     P   
Sbjct: 398 RLSIYHLDENGTGR---DQLFV------GMEDLQEKGI--TITADQYRCVYSSLYLPNED 446

Query: 234 SGKFYEIFN-KPLDDWKRFQIDTRTVEGID------------------PSFHEGIIARYG 274
               Y IFN  P  D+K   +    V  ++                  P F E      G
Sbjct: 447 MNAVYSIFNDDPPADYKAHSLSVSDVVIMNQNGDMKAYFVDRFGFQELPDFVEERKKILG 506

Query: 275 LDSDVTRVEVC-------------GQFP-QQDIDSFIPLNIIEEALNREPCPDPYAPLIM 320
           ++SD+ + ++               +FP   ++   + L    EA  + P         +
Sbjct: 507 MESDIQKKDILEQTSCISFYAAECSEFPVLGEVHHDLTLPEALEAYEKIPAERMNGLKSV 566

Query: 321 GCDIAEEGG-----DNTVVVLRRGPVIEHLFDWSKTDLRTTNNKISGLVEKYRPDAI 372
           G ++ E G      D  V    +  +++ +  + +  L     K      + +P  +
Sbjct: 567 GFNLQEGGDYDGMMDLMVAGRSQREILDSIPFYRENKLVQEALKRVEQYIEEKPLNV 623


>gi|295688413|ref|YP_003592106.1| hypothetical protein Cseg_0983 [Caulobacter segnis ATCC 21756]
 gi|295430316|gb|ADG09488.1| protein of unknown function DUF264 [Caulobacter segnis ATCC 21756]
          Length = 445

 Score = 37.4 bits (85), Expect = 5.9,   Method: Composition-based stats.
 Identities = 71/442 (16%), Positives = 127/442 (28%), Gaps = 82/442 (18%)

Query: 56  LEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMS-TRPGISVICLA 114
           L  + + + H L   +     +F      GRG GKT   A    WL+     G  +  + 
Sbjct: 53  LRTLRIREDHQLPPPDPWVTWLFL----GGRGAGKTYAGAA---WLIEQATAGARLALVG 105

Query: 115 NSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCR 174
            +   ++  +          +      +  SL      W +                   
Sbjct: 106 PTFHDVREVM----------IEGPSGLKALSLPDEHPRWEASRRRLVWPN-----GATAY 150

Query: 175 TYSEERPDTFVG--HHNTYGMAIINDEASGTPDVINL-GILGFLTERNANRFWIMTSNPR 231
            +S E PD+  G   H         DE    P   +   +L F     A+   ++T+ P 
Sbjct: 151 AFSAEDPDSLRGPQFHAA-----WADEFCAWPKPGDTLAMLRFGLRLGADPRLVVTTTP- 204

Query: 232 RLSGKFYEIFNKPLDDWKRFQID------TRTVEGIDPSFHEGIIARYGLDSDVTRVEVC 285
                  +             +       +     + P+F   + + YG    +   E+ 
Sbjct: 205 -------KPHRALKVLMAEPGVSLTRAGTSANAGNLAPAFLRTLESLYGGT-RLAAQELD 256

Query: 286 GQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDI-AEEGGDNT--VVVLRRGPVI 342
           G   +           +         P     +++  D  A  GGD    VVV RR    
Sbjct: 257 GVIVE-TDGGLFRAEDLARCRA--ARPARLDRVVVAVDPPATAGGDACGIVVVGRRDDRA 313

Query: 343 EHLFD--WSKTDLRTTNNKISGLVEKYRPDAIIIDANNTGARTCDYLEMLG----YHVYR 396
             L D             +       +  DA++ +AN  G      L          + R
Sbjct: 314 FVLADETARGLSPAGWAARAVAAARAWSADALVAEANQGGDMVRSVLAQADPPCRVKLVR 373

Query: 397 VLGQKRAVDLEFCRNRRTELHVKMADWLEFASLINHSGLIQNLKSLKSFIVPNTGELAIE 456
               KRA         R E    +A   E   +++    +      +  +   +G+L   
Sbjct: 374 ASVGKRA---------RAE---PVAALYEQGRVLHCGSFVALE---EELMALGSGDLE-- 416

Query: 457 SKRVKGAKSTDYSDGLMYTFAE 478
                   S D +D L++  +E
Sbjct: 417 -------HSPDRADALVWAVSE 431


>gi|94694803|gb|ABF47048.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|94694815|gb|ABF47054.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|94694825|gb|ABF47059.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|94694853|gb|ABF47073.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|94694857|gb|ABF47075.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|222354519|gb|ACM48067.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
 gi|239909445|gb|ACS32392.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
          Length = 674

 Score = 37.4 bits (85), Expect = 6.0,   Method: Composition-based stats.
 Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%)

Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220
            + ID +   +     S    ++  G        ++ DEA          ILGFL +   
Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331

Query: 221 NRFWIMTSNPRRLSGKF 237
              +I ++N    +  F
Sbjct: 332 KIIFISSTNTTSDATCF 348


>gi|94694837|gb|ABF47065.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
          Length = 674

 Score = 37.4 bits (85), Expect = 6.0,   Method: Composition-based stats.
 Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%)

Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220
            + ID +   +     S    ++  G        ++ DEA          ILGFL +   
Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331

Query: 221 NRFWIMTSNPRRLSGKF 237
              +I ++N    +  F
Sbjct: 332 KIIFISSTNTTSDATCF 348


>gi|94694821|gb|ABF47057.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|94694847|gb|ABF47070.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|219879683|gb|ACL51158.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
          Length = 674

 Score = 37.4 bits (85), Expect = 6.0,   Method: Composition-based stats.
 Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%)

Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220
            + ID +   +     S    ++  G        ++ DEA          ILGFL +   
Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331

Query: 221 NRFWIMTSNPRRLSGKF 237
              +I ++N    +  F
Sbjct: 332 KIIFISSTNTTSDATCF 348


>gi|94694819|gb|ABF47056.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|94694829|gb|ABF47061.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
          Length = 674

 Score = 37.4 bits (85), Expect = 6.0,   Method: Composition-based stats.
 Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%)

Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220
            + ID +   +     S    ++  G        ++ DEA          ILGFL +   
Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331

Query: 221 NRFWIMTSNPRRLSGKF 237
              +I ++N    +  F
Sbjct: 332 KIIFISSTNTTSDATCF 348


>gi|94694807|gb|ABF47050.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|94694809|gb|ABF47051.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|94694823|gb|ABF47058.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|94694839|gb|ABF47066.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|242345696|gb|ACS92014.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
 gi|256557083|gb|ACU83739.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
          Length = 674

 Score = 37.4 bits (85), Expect = 6.0,   Method: Composition-based stats.
 Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%)

Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220
            + ID +   +     S    ++  G        ++ DEA          ILGFL +   
Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331

Query: 221 NRFWIMTSNPRRLSGKF 237
              +I ++N    +  F
Sbjct: 332 KIIFISSTNTTSDATCF 348


>gi|94694817|gb|ABF47055.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
          Length = 674

 Score = 37.4 bits (85), Expect = 6.0,   Method: Composition-based stats.
 Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%)

Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220
            + ID +   +     S    ++  G        ++ DEA          ILGFL +   
Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331

Query: 221 NRFWIMTSNPRRLSGKF 237
              +I ++N    +  F
Sbjct: 332 KIIFISSTNTTSDATCF 348


>gi|94694827|gb|ABF47060.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
          Length = 674

 Score = 37.4 bits (85), Expect = 6.0,   Method: Composition-based stats.
 Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%)

Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220
            + ID +   +     S    ++  G        ++ DEA          ILGFL +   
Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331

Query: 221 NRFWIMTSNPRRLSGKF 237
              +I ++N    +  F
Sbjct: 332 KIIFISSTNTTSDATCF 348


>gi|94694841|gb|ABF47067.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
          Length = 674

 Score = 37.4 bits (85), Expect = 6.0,   Method: Composition-based stats.
 Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%)

Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220
            + ID +   +     S    ++  G        ++ DEA          ILGFL +   
Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331

Query: 221 NRFWIMTSNPRRLSGKF 237
              +I ++N    +  F
Sbjct: 332 KIIFISSTNTTSDATCF 348


>gi|94694813|gb|ABF47053.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|254770949|gb|ACT81760.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
          Length = 674

 Score = 37.4 bits (85), Expect = 6.0,   Method: Composition-based stats.
 Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%)

Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220
            + ID +   +     S    ++  G        ++ DEA          ILGFL +   
Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331

Query: 221 NRFWIMTSNPRRLSGKF 237
              +I ++N    +  F
Sbjct: 332 KIIFISSTNTTSDATCF 348


>gi|94694843|gb|ABF47068.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
          Length = 674

 Score = 37.4 bits (85), Expect = 6.0,   Method: Composition-based stats.
 Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%)

Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220
            + ID +   +     S    ++  G        ++ DEA          ILGFL +   
Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331

Query: 221 NRFWIMTSNPRRLSGKF 237
              +I ++N    +  F
Sbjct: 332 KIIFISSTNTTSDATCF 348


>gi|52139262|ref|YP_081537.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
 gi|39842097|gb|AAR31641.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
 gi|44903295|gb|AAS48974.1| UL89 [Human herpesvirus 5]
 gi|94694805|gb|ABF47049.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|94694811|gb|ABF47052.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|94694845|gb|ABF47069.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|94694849|gb|ABF47071.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|94694851|gb|ABF47072.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|94694855|gb|ABF47074.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
 gi|157780097|gb|ABV71611.1| UL89 [Human herpesvirus 5]
 gi|242345862|gb|ACS92179.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
 gi|242554146|gb|ACS93421.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
 gi|254771115|gb|ACT81925.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
 gi|270311455|gb|ACZ72832.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
 gi|270355676|gb|ACZ79836.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
 gi|270355841|gb|ACZ80000.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
 gi|270356007|gb|ACZ80165.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
 gi|270356173|gb|ACZ80330.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
 gi|290564434|gb|ADD39135.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
 gi|294488421|gb|ADE88081.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
 gi|317160580|gb|ADV04406.1| DNA packaging terminase subunit 1 [Human herpesvirus 5]
          Length = 674

 Score = 37.4 bits (85), Expect = 6.0,   Method: Composition-based stats.
 Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%)

Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220
            + ID +   +     S    ++  G        ++ DEA          ILGFL +   
Sbjct: 275 VISIDHRGAKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331

Query: 221 NRFWIMTSNPRRLSGKF 237
              +I ++N    +  F
Sbjct: 332 KIIFISSTNTTSDATCF 348


>gi|262043664|ref|ZP_06016773.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
 gi|259039002|gb|EEW40164.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
          Length = 464

 Score = 37.4 bits (85), Expect = 6.4,   Method: Composition-based stats.
 Identities = 31/252 (12%), Positives = 72/252 (28%), Gaps = 39/252 (15%)

Query: 115 NSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCR 174
               Q++  +W  V+                       +  ++   +L  +         
Sbjct: 65  PQANQVRKAIWKAVN------------PRTGRLRIDEAFPHELRRKTLDNEMMIEFINGS 112

Query: 175 TYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLS 234
           T+     D +     +  + I+  E + +       +   L +     F++  S PR  +
Sbjct: 113 TWQAVGSDNYGALIGSGHVGIVFSEWALSNPSAWAFLRPILADNGGWAFFV--STPRGKN 170

Query: 235 GKFYEIFN---KPLDDWKRFQIDTRTVEGIDPS----FHEGIIARYGLDS--DVTRVEVC 285
             FY++F    K  D+W    +       I P         + A  G +    +   E  
Sbjct: 171 -HFYKMFQGGLKDPDNWFCDHLSADITLHIPPETLAQELREMQAERGEEEGQALFNQEYM 229

Query: 286 GQFPQQDIDSFIPLNIIEEALNREPCPDPYAP-------LIMGCDIAEEGGDNTVVVLRR 338
             +      ++    ++      +    P+ P         +G       GD T +   +
Sbjct: 230 CDWNAAIPGAYYSSILVGLEKGGQIGNVPWDPQYEVYTSWDLGI------GDATAIWFYQ 283

Query: 339 --GPVIEHLFDW 348
             G  +  +  +
Sbjct: 284 FIGKEVRVIDYY 295


>gi|321265233|ref|XP_003197333.1| member of the DEAH family of helicases; Mph1p [Cryptococcus gattii
           WM276]
 gi|317463812|gb|ADV25546.1| Member of the DEAH family of helicases, putative; Mph1p
           [Cryptococcus gattii WM276]
          Length = 1517

 Score = 37.0 bits (84), Expect = 6.6,   Method: Composition-based stats.
 Identities = 24/156 (15%), Positives = 55/156 (35%), Gaps = 12/156 (7%)

Query: 80  GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKH 139
             ++   G+GKT +   ++L      P   ++ LA +   +   +  E  +    +P++ 
Sbjct: 299 TLVALPTGLGKTFVAGVVMLNFYRWFPTGKIVFLAPTRPLVAQQI--EACQLSCGIPSRD 356

Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
              M           +      L  + + +    +T   +  +  V       + ++ DE
Sbjct: 357 AAVMTGEGG------ARKGRERLWEEKRVFYCTPQTLDNDLKNGAVDP--QDIVLVVLDE 408

Query: 200 A-SGTPDVINLGILGFLTERNAN-RFWIMTSNPRRL 233
           A   T +     I+ +LT  +   R   +T+ P   
Sbjct: 409 AHKATGNYAYTTIVAYLTAHHPYFRVLALTATPGAD 444


>gi|119716507|ref|YP_923472.1| type III restriction enzyme, res subunit [Nocardioides sp. JS614]
 gi|119537168|gb|ABL81785.1| type III restriction enzyme, res subunit [Nocardioides sp. JS614]
          Length = 558

 Score = 37.0 bits (84), Expect = 6.7,   Method: Composition-based stats.
 Identities = 20/105 (19%), Positives = 35/105 (33%), Gaps = 20/105 (19%)

Query: 38  WGEKGTPLEGFSAP---------RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGI 88
           W            P         R +QLE  E +         N    V         G+
Sbjct: 117 WAGPTLEAIYAKMPARVPSKYELRPYQLEAAERIQQDL--EDTNRALLVLAT------GL 168

Query: 89  GKTTLNAWLVLWLMSTRPGISVICLANSE---TQLKTTLWAEVSK 130
           GKT +   ++   + + P   ++ +A+ +    QL+  LW  + K
Sbjct: 169 GKTVVGGEVIRRHLESHPDARILVVAHMKELVEQLEKALWRHLDK 213


>gi|170086129|ref|XP_001874288.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164651840|gb|EDR16080.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 1307

 Score = 37.0 bits (84), Expect = 6.9,   Method: Composition-based stats.
 Identities = 9/47 (19%), Positives = 21/47 (44%)

Query: 80  GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA 126
             ++   G+GKT +   ++L      P   V+ +A ++  +   + A
Sbjct: 224 TLVALPTGLGKTFIAGAVMLNFYRWFPEGKVVFVAPTKPLVAQQIMA 270


>gi|260811155|ref|XP_002600288.1| hypothetical protein BRAFLDRAFT_118278 [Branchiostoma floridae]
 gi|229285574|gb|EEN56300.1| hypothetical protein BRAFLDRAFT_118278 [Branchiostoma floridae]
          Length = 275

 Score = 37.0 bits (84), Expect = 6.9,   Method: Composition-based stats.
 Identities = 19/129 (14%), Positives = 39/129 (30%), Gaps = 18/129 (13%)

Query: 188 HNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDD 247
           H+      + DE       +  G+L  L E  +N  +I   +  +L          P+  
Sbjct: 48  HSDRQHLFVFDEMETVYPSLAEGLLSLLEEDTSNTMFIFIWSTEKL----------PMGR 97

Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALN 307
           +   QI                I    +   +++++V     +    +F        A  
Sbjct: 98  YLLQQIS--------KGRSRESIREEEIQDLLSQLQVDTDSSEPSSTNFASTGKTFAATV 149

Query: 308 REPCPDPYA 316
           ++P   P  
Sbjct: 150 KKPSNIPEG 158


>gi|71018359|ref|XP_759410.1| hypothetical protein UM03263.1 [Ustilago maydis 521]
 gi|46098957|gb|EAK84190.1| hypothetical protein UM03263.1 [Ustilago maydis 521]
          Length = 1054

 Score = 37.0 bits (84), Expect = 6.9,   Method: Composition-based stats.
 Identities = 23/151 (15%), Positives = 47/151 (31%), Gaps = 12/151 (7%)

Query: 87  GIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWA-EVSKWLS-LLPNKHWFEMQ 144
           G+GKT      ++ LM +      + +A +   +    W  E+ ++    L    W    
Sbjct: 468 GMGKT----IQMISLMLSDRKKPCLVVAPT---VAIMQWRNEIEQYTEPKLKVLMWHGAN 520

Query: 145 SLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERP--DTFVGHHNTYGMAIINDEASG 202
                     +DV+  S  +    +      +  +          H  +   II DEA  
Sbjct: 521 RTQDLKELKAADVVLTSYAVLESSFRKQESGFRRKNEILKERSALHAVHWRRIILDEAHN 580

Query: 203 TPDVINLGILGFLTERNANRFWIMTSNPRRL 233
             +       G    +  +  W ++  P + 
Sbjct: 581 IKERSTNTAKGAFALQG-DFRWCLSGTPLQN 610


>gi|94694835|gb|ABF47064.1| DNA cleavage and packaging protein large subunit [Human herpesvirus
           5]
          Length = 674

 Score = 37.0 bits (84), Expect = 7.0,   Method: Composition-based stats.
 Identities = 15/77 (19%), Positives = 27/77 (35%), Gaps = 3/77 (3%)

Query: 161 SLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNA 220
            + ID +   +     S    ++  G        ++ DEA          ILGFL +   
Sbjct: 275 VISIDHRGPKSTALFASCYNTNSIRG---QNFHLLLVDEAHFIKKEAFNTILGFLAQNTT 331

Query: 221 NRFWIMTSNPRRLSGKF 237
              +I ++N    +  F
Sbjct: 332 KIIFISSTNTTSDATCF 348


>gi|49475696|ref|YP_033737.1| Phage related protein [Bartonella henselae str. Houston-1]
 gi|49238503|emb|CAF27734.1| Phage related protein [Bartonella henselae str. Houston-1]
          Length = 441

 Score = 37.0 bits (84), Expect = 7.1,   Method: Composition-based stats.
 Identities = 26/186 (13%), Positives = 54/186 (29%), Gaps = 17/186 (9%)

Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFN-KPLDD 247
             +    DEA    +     ++  L E          +T NP R +      F     + 
Sbjct: 122 RILLCWVDEAEPVTETAWQTLIPTLREEGEGWRAELWVTWNPLRDNAPVERRFRFSNNEA 181

Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE--- 304
            KR +I+           +E  +       +  +    G +      ++    ++     
Sbjct: 182 IKRVEINWSDNPKFPKILNEARLDDLKNRPETYKHIWEGAYLTAVQGAYYQKEMLAAEQE 241

Query: 305 ----ALNREPCPDPYAPLIMGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTN 357
                + R+P     A      DI   G   D T + + +       + D+ +   +  +
Sbjct: 242 GRIGRVARDPLMQMRAFW----DIGGTGAKADATAIWIAQFVGREIRVLDYYEAQGQPLS 297

Query: 358 NKISGL 363
             I  L
Sbjct: 298 EHIGWL 303


>gi|29826538|ref|NP_828844.1| putative helicase [Streptomyces avermitilis MA-4680]
 gi|29611336|dbj|BAC75379.1| putative helicase [Streptomyces avermitilis MA-4680]
          Length = 885

 Score = 37.0 bits (84), Expect = 7.6,   Method: Composition-based stats.
 Identities = 10/50 (20%), Positives = 19/50 (38%), Gaps = 3/50 (6%)

Query: 71  NNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120
           ++  P+  +G I +  G GKT   A      +    G  ++    +   L
Sbjct: 27  SSVPPQGARGTIVSATGSGKTFTAAACA---LECFSGGRILVTVPTLDLL 73


>gi|312880761|ref|ZP_07740561.1| UvrD/REP helicase [Aminomonas paucivorans DSM 12260]
 gi|310784052|gb|EFQ24450.1| UvrD/REP helicase [Aminomonas paucivorans DSM 12260]
          Length = 1200

 Score = 37.0 bits (84), Expect = 7.8,   Method: Composition-based stats.
 Identities = 29/231 (12%), Positives = 60/231 (25%), Gaps = 33/231 (14%)

Query: 53  SWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVIC 112
            W    +  +       V +P   V    + AG G GKT   +    WL+++ P   V  
Sbjct: 11  PWMERLLGDLRPEQRQGVISPRSLVV---VQAGAGTGKTHTLSSRFAWLLASDPTCRV-- 65

Query: 113 LANSETQLKTTLWAE-VSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYST 171
                 Q+ T  + E  ++ +         +     L   P     L  +     + Y +
Sbjct: 66  -----EQILTLTFTEKAAREMRDRIRCRLLQW----LEAEPEKLGHLRDAAARIDEGYIS 116

Query: 172 MCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDV-----INLGILGFLTERNANRFWIM 226
               ++              G+ +  D  S          +   + G     +   F  +
Sbjct: 117 TLHAFALRVI-------RESGLVLDLDPESRIASPCGEGALFEEMEGAFDRLDPAWFLRL 169

Query: 227 TSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDS 277
             +P      + +       D    ++                   +G   
Sbjct: 170 LEDP------WRDRCQDLFGDPAFPRLVNALSPRRLAELVREAAELHGSRD 214


>gi|321472411|gb|EFX83381.1| hypothetical protein DAPPUDRAFT_48010 [Daphnia pulex]
          Length = 657

 Score = 37.0 bits (84), Expect = 7.9,   Method: Composition-based stats.
 Identities = 25/168 (14%), Positives = 52/168 (30%), Gaps = 29/168 (17%)

Query: 36  FPWGEKGTPLEGFSAP-RSWQLEFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLN 94
           F  G   T     + P R +Q + +E    H                ++   G+GKT + 
Sbjct: 93  FDMGAGDTWFYPTNKPVRKYQRDIVETCLFH-------------NTLVTLPTGLGKTFIA 139

Query: 95  AWLVLWLMSTRPGISVICLANSETQLKTTLWA--EVSKWLSLLPNKHWFEMQSLSLHPAP 152
           A ++       P   +I +A ++  +   + A  E+   L L          S +     
Sbjct: 140 AVVMYNFFRWYPRGKIIFMAPTKPLVAQQIQACYEIMG-LPLDSTSEMTGAMSPADRKTQ 198

Query: 153 WYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEA 200
           W    +          +    +  + +   +      +    ++ DEA
Sbjct: 199 WREKRV----------FFLTPQILTND--ISRAAFPASEIKCLVLDEA 234


>gi|319408093|emb|CBI81746.1| phage related protein [Bartonella schoenbuchensis R1]
 gi|319408856|emb|CBI82513.1| phage related protein [Bartonella schoenbuchensis R1]
          Length = 444

 Score = 37.0 bits (84), Expect = 8.2,   Method: Composition-based stats.
 Identities = 25/182 (13%), Positives = 52/182 (28%), Gaps = 9/182 (4%)

Query: 191 YGMAIINDEASGTPDVINLGILGFLTERNA--NRFWIMTSNPRRLSGKFYEIFNKPLDD- 247
             +    DEA    +     ++  L E          +T NP R +      F    D  
Sbjct: 125 RILLCWVDEAEPVTETAWQTLIPTLREEGEGWRAELWVTWNPLRENAPVERRFRFTKDQN 184

Query: 248 WKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNII---EE 304
            K  +++               +       +       G + +    ++    ++   +E
Sbjct: 185 IKGVEVNWSDNPLFPQKLQRVRLDDLQNRPESYNHIWEGDYLKAVQGAYFQKEMLAAEQE 244

Query: 305 ALNREPCPDPYAPLIMGCDIAEEG--GDNTVVVLRR-GPVIEHLFDWSKTDLRTTNNKIS 361
                   DP  P+    DI   G   D T + + +       + D+ +   +  +  I 
Sbjct: 245 GRVGRVARDPLMPIRAFWDIGGTGAKADATAIWIAQFVGREIRVLDYYEAQGQPLSEHIG 304

Query: 362 GL 363
            L
Sbjct: 305 WL 306


>gi|322707444|gb|EFY99022.1| activating signal cointegrator 1 complex subunit 3 [Metarhizium
            anisopliae ARSEF 23]
          Length = 1969

 Score = 37.0 bits (84), Expect = 8.3,   Method: Composition-based stats.
 Identities = 23/118 (19%), Positives = 39/118 (33%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    RP   V+ +A         L  E V  W   L      +
Sbjct: 1159 SPTGSGKTVAAELAMWWAFRERPKSKVVYIAP-----MKALVRERVKDWGKRLAQPLGLK 1213

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    +    + I   + +  + R++         G+     + II DE
Sbjct: 1214 IVELTGDNTPDTRTIKDADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1264


>gi|322695748|gb|EFY87551.1| activating signal cointegrator 1 complex subunit 3 [Metarhizium
            acridum CQMa 102]
          Length = 1950

 Score = 37.0 bits (84), Expect = 8.3,   Method: Composition-based stats.
 Identities = 23/118 (19%), Positives = 39/118 (33%), Gaps = 14/118 (11%)

Query: 84   AGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAE-VSKWLSLLPNKHWFE 142
            +  G GKT      + W    RP   V+ +A         L  E V  W   L      +
Sbjct: 1140 SPTGSGKTVAAELAMWWAFRERPKSKVVYIAP-----MKALVRERVKDWGKRLAQPLGLK 1194

Query: 143  MQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDE 199
            +  L+    P    +    + I   + +  + R++         G+     + II DE
Sbjct: 1195 IVELTGDNTPDTRTIKDADIIITTPEKWDGISRSWQT------RGYVRQVSLVII-DE 1245


>gi|224586602|ref|YP_002640499.1| phage terminase, large subunit, pbsx family [Borrelia valaisiana
           VS116]
 gi|224497136|gb|ACN52769.1| phage terminase, large subunit, pbsx family [Borrelia valaisiana
           VS116]
          Length = 450

 Score = 36.6 bits (83), Expect = 9.0,   Method: Composition-based stats.
 Identities = 31/163 (19%), Positives = 52/163 (31%), Gaps = 13/163 (7%)

Query: 176 YSEERPDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSG 235
           Y  ++   F     +    I  +EA+         +L  L  R      I  +NP     
Sbjct: 148 YGGDKASDFERFRGSNSALIFVNEATTLHKQTLEEVLKRL--RCGQETIIFDTNPDNPEH 205

Query: 236 KFYEIFNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEV-CGQFPQQDID 294
            F   +   +  +  +   T     +   F E     Y  D    +  V  G++      
Sbjct: 206 YFKTDYIDNIHTFTTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASIDS 264

Query: 295 SFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE-GGDNTVVVL 336
            F  +NI ++ +   P        I   D A   GGDNT + +
Sbjct: 265 IFTQINITQDYVFSSP--------IAYLDPAFSVGGDNTALCV 299


>gi|71065561|ref|YP_264288.1| PBSX family phage terminase large subunit [Psychrobacter arcticus
           273-4]
 gi|71038546|gb|AAZ18854.1| phage terminase, large subunit, PBSX family [Psychrobacter arcticus
           273-4]
          Length = 421

 Score = 36.6 bits (83), Expect = 9.0,   Method: Composition-based stats.
 Identities = 27/195 (13%), Positives = 55/195 (28%), Gaps = 11/195 (5%)

Query: 134 LLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYS--EERPDTFVGHHNTY 191
              N+   E    ++    W +D               +   ++      D+  G   + 
Sbjct: 72  NSLNESSLEEIKQAIKSVSWLNDYYEIGEKYIRTKNRRVAYAFTGLRHNLDSIKGK--SR 129

Query: 192 GMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRF 251
            +    DEA    +     +L  + E ++   WI T NP        + F +   ++   
Sbjct: 130 ILLAWVDEAENVSEAAWRKLLPTVREDDSE-VWI-TWNPENKGSATDKRFRQVEHEFI-V 186

Query: 252 QIDTRTVEGIDPSFH-EGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREP 310
           +++             E +  +  LD    R    G + +           +     R P
Sbjct: 187 EMNHNDNPFFPDVLEQERLNDQENLDDATYRWIWEGAYLEASDAQIFNGKFVVREFERHP 246

Query: 311 CPDPYAPLIMGCDIA 325
             +   P   G D  
Sbjct: 247 TWN--GPYN-GLDFG 258


>gi|308476267|ref|XP_003100350.1| hypothetical protein CRE_22485 [Caenorhabditis remanei]
 gi|308265092|gb|EFP09045.1| hypothetical protein CRE_22485 [Caenorhabditis remanei]
          Length = 1870

 Score = 36.6 bits (83), Expect = 9.3,   Method: Composition-based stats.
 Identities = 27/144 (18%), Positives = 53/144 (36%), Gaps = 15/144 (10%)

Query: 57   EFMEVVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANS 116
            ++   + A    S+   +       + A  G GKT      +  L+   PG+ V+ +A  
Sbjct: 1031 DYFNPIQAQVFYSLYKTDKSAL---VGAPTGSGKTLCAELAMFRLLQDHPGMKVVYIAP- 1086

Query: 117  ETQLKTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGI-DSKHYSTMCRT 175
               LK+ +   V  W     N   + +  +S    P   ++   S+ I   + +  + R+
Sbjct: 1087 ---LKSLVRERVDDWKQKFENGMGYRVVEVSGDVTPDPQELQASSILITTPEKWDGISRS 1143

Query: 176  YSEERPDTFVGHHNTYGMAIINDE 199
            ++       VG        I+ DE
Sbjct: 1144 WATREYVRRVG-------LIVLDE 1160


>gi|312116003|ref|YP_004013599.1| hypothetical protein Rvan_3315 [Rhodomicrobium vannielii ATCC
           17100]
 gi|311221132|gb|ADP72500.1| hypothetical protein Rvan_3315 [Rhodomicrobium vannielii ATCC
           17100]
          Length = 466

 Score = 36.6 bits (83), Expect = 9.4,   Method: Composition-based stats.
 Identities = 72/420 (17%), Positives = 129/420 (30%), Gaps = 57/420 (13%)

Query: 82  ISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKHWF 141
           I AGRG GKT   A    W+ +   G      A   +  +  L AE +     +  +   
Sbjct: 61  ILAGRGAGKTRTGA---EWVRACVCGP-TPLSAGRYS--RFALVAETAADARDVIVEGPS 114

Query: 142 EMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEAS 201
            +  L++HP  +       S    +     +   Y+   PD   G  +    A   DE +
Sbjct: 115 GL--LAIHPRGFRP-KFEPSKRRLTWPNGAVAMLYNATEPDQLRGPQHD---AAWCDELA 168

Query: 202 G--TPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRTVE 259
                      +   L   +  R  I+T+ P R      E   K        +  T    
Sbjct: 169 KWRYARETWDMLQFGLRLGHDPRQ-IVTTTP-RPIAIIREFLGKEGHGVVLTRGSTYDNR 226

Query: 260 GIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEE-ALNREPCPDPYAPL 318
                 +   I R    + + R E+  +       +    +++++  + R      +  +
Sbjct: 227 ANLAQNYFNTIVRSYEGTRLGRQEINAELLDDVAGALWTRSLLDQHRIARGTPLPRFDRV 286

Query: 319 IMGCDIAE---EGGDNT------VVVLRRGPVIEHLFDW-SKTDLRTTNNKISGLVEKYR 368
           ++G D A      GD T      V  L        L D  ++        K     + Y 
Sbjct: 287 VVGIDPAARPSGAGDKTSETGIVVCGLGEDGRGYVLDDLSNRQGPMGWAQKAVAGFDLYE 346

Query: 369 PDAIIIDANNTGARTCDYLEML--GYHVYRVLGQKRAVDLEFCRNRRTE----LHVKMAD 422
            DA++++ N  GA     L  +  G  +  V   +           R E    L+ +   
Sbjct: 347 ADALVVEINQGGAMVETVLRAVRGGLPIRAVRATRGKT-------VRAEPIAALYAQGR- 398

Query: 423 WLEFASLINHSGLIQNLKSLKSFIVPNTGELAIESKRVKGAKSTDYSDGLMYTFAENPPR 482
            +     +    L   +     F +   G             + D  D L++  A+  PR
Sbjct: 399 -VSHVGALP--TLEDQMVQFTPFGIEGDG-------------AADRVDALVWALADLFPR 442


>gi|302756859|ref|XP_002961853.1| hypothetical protein SELMODRAFT_140315 [Selaginella moellendorffii]
 gi|300170512|gb|EFJ37113.1| hypothetical protein SELMODRAFT_140315 [Selaginella moellendorffii]
          Length = 1015

 Score = 36.6 bits (83), Expect = 9.6,   Method: Composition-based stats.
 Identities = 36/211 (17%), Positives = 64/211 (30%), Gaps = 24/211 (11%)

Query: 80  GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKH 139
            A++A RG GK+      +          ++   A S   LKT L+  V K    L  K 
Sbjct: 279 VALTASRGRGKSAALGLAIA-GAVAFGYSNIFVTAPSPENLKT-LFEFVCKGFDALEYKE 336

Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT--YGMAIIN 197
             +   +      +   V+  ++    +      +T    +P      H        +I 
Sbjct: 337 HIDYDLVQSTNPAFNKAVVRVNIFRQHR------QTIQYIQPQD----HAKLAQAELLII 386

Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRT 257
           DEA+  P  +   +LG            M S      G    +  K +   +  Q  + +
Sbjct: 387 DEAAAIPLPMVKALLG-------PYLVFMCSTVNGYEGTGRSLSLKLIQQLRS-QGKSES 438

Query: 258 VEGIDPSFHEGIIARYGLDSDV--TRVEVCG 286
              +          RYG    +     E+  
Sbjct: 439 APSVFREVELAEPIRYGAGDPIEGWLHELLC 469


>gi|302798078|ref|XP_002980799.1| hypothetical protein SELMODRAFT_113365 [Selaginella moellendorffii]
 gi|300151338|gb|EFJ17984.1| hypothetical protein SELMODRAFT_113365 [Selaginella moellendorffii]
          Length = 1015

 Score = 36.6 bits (83), Expect = 9.6,   Method: Composition-based stats.
 Identities = 36/211 (17%), Positives = 64/211 (30%), Gaps = 24/211 (11%)

Query: 80  GAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQLKTTLWAEVSKWLSLLPNKH 139
            A++A RG GK+      +          ++   A S   LKT L+  V K    L  K 
Sbjct: 279 VALTASRGRGKSAALGLAIA-GAVAFGYSNIFVTAPSPENLKT-LFEFVCKGFDALEYKE 336

Query: 140 WFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNT--YGMAIIN 197
             +   +      +   V+  ++    +      +T    +P      H        +I 
Sbjct: 337 HIDYDLVQSTNPAFNKAVVRVNIFRQHR------QTIQYIQPQD----HAKLAQAELLII 386

Query: 198 DEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIFNKPLDDWKRFQIDTRT 257
           DEA+  P  +   +LG            M S      G    +  K +   +  Q  + +
Sbjct: 387 DEAAAIPLPMVKALLG-------PYLVFMCSTVNGYEGTGRSLSLKLIQQLRS-QGKSES 438

Query: 258 VEGIDPSFHEGIIARYGLDSDV--TRVEVCG 286
              +          RYG    +     E+  
Sbjct: 439 APSVFREVELAEPIRYGAGDPIEGWLHELLC 469


>gi|281204972|gb|EFA79166.1| hypothetical protein PPL_07991 [Polysphondylium pallidum PN500]
          Length = 1587

 Score = 36.6 bits (83), Expect = 9.6,   Method: Composition-based stats.
 Identities = 42/197 (21%), Positives = 63/197 (31%), Gaps = 21/197 (10%)

Query: 82   ISAGRGIGKTTLNAWLVLWLMSTRPGI-----SVICLANSETQLKTTLW--AEVSKWLSL 134
            +    G GKT   A +VL +M T          +   A++ T +   L   AE+ K    
Sbjct: 1071 VVGPPGTGKTHFLALMVLIIMETLIRAEKKSYIIAITAHTHTAIDNLLVRIAELKKEYES 1130

Query: 135  LPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMA 194
                                S   H  +  D KH   +    S       +G      M 
Sbjct: 1131 FAGNALNFQIVKKESSKLSESLTSHNIVKYDKKHKFNLMCIGSTCWGLNTLGL--DLDML 1188

Query: 195  IINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEIF--NKPLDDWKRFQ 252
            II DEAS  P  +    LG           ++  +P++L       F   K L       
Sbjct: 1189 II-DEASQLPSPL--AALGLNAVNLEKSRVVVVGDPKQLGPVLKASFIVRKDLSV----- 1240

Query: 253  IDTRTVEGIDPSFHEGI 269
              +  +E  +P FH+ I
Sbjct: 1241 --SDKLEHQEPKFHKSI 1255


  Database: nr
    Posted date:  May 22, 2011 12:22 AM
  Number of letters in database: 999,999,966
  Number of sequences in database:  2,987,313
  
  Database: /data/usr2/db/fasta/nr.01
    Posted date:  May 22, 2011 12:30 AM
  Number of letters in database: 999,999,796
  Number of sequences in database:  2,903,041
  
  Database: /data/usr2/db/fasta/nr.02
    Posted date:  May 22, 2011 12:36 AM
  Number of letters in database: 999,999,281
  Number of sequences in database:  2,904,016
  
  Database: /data/usr2/db/fasta/nr.03
    Posted date:  May 22, 2011 12:41 AM
  Number of letters in database: 999,999,960
  Number of sequences in database:  2,935,328
  
  Database: /data/usr2/db/fasta/nr.04
    Posted date:  May 22, 2011 12:46 AM
  Number of letters in database: 842,794,627
  Number of sequences in database:  2,394,679
  
Lambda     K      H
   0.308    0.130    0.353 

Lambda     K      H
   0.267   0.0399    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,114,867,950
Number of Sequences: 14124377
Number of extensions: 360134319
Number of successful extensions: 985041
Number of sequences better than 10.0: 1255
Number of HSP's better than 10.0 without gapping: 457
Number of HSP's successfully gapped in prelim test: 935
Number of HSP's that attempted gapping in prelim test: 982625
Number of HSP's gapped (non-prelim): 1549
length of query: 511
length of database: 4,842,793,630
effective HSP length: 144
effective length of query: 367
effective length of database: 2,808,883,342
effective search space: 1030860186514
effective search space used: 1030860186514
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.2 bits)
S2: 83 (36.6 bits)