BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781187|ref|YP_003065600.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] (367 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781187|ref|YP_003065600.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] gi|254040864|gb|ACT57660.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] Length = 367 Score = 768 bits (1984), Expect = 0.0, Method: Compositional matrix adjust. Identities = 367/367 (100%), Positives = 367/367 (100%) Query: 1 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME Sbjct: 1 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 Query: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL Sbjct: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER Sbjct: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 Query: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI Sbjct: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 Query: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN Sbjct: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQEG 360 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQEG Sbjct: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQEG 360 Query: 361 CPVGSSI 367 CPVGSSI Sbjct: 361 CPVGSSI 367 >gi|315121940|ref|YP_004062429.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495342|gb|ADR51941.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 509 Score = 569 bits (1467), Expect = e-160, Method: Compositional matrix adjust. Identities = 264/359 (73%), Positives = 303/359 (84%) Query: 1 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 M R + T + EQEL E++ + LSF NFV+R FPW L +FS+P RWQL+FME Sbjct: 1 MTRELPTKIEHEQELMELMFSDDIKLSFTNFVLRLFPWSEANTSLANFSRPRRWQLDFME 60 Query: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 AVD C NV+N +P IFK A+SAGRGIGKTTLNAWMMLWLISTRPGMSI+C+ANSETQL Sbjct: 61 AVDTDCLFNVDNPDPKIFKGAVSAGRGIGKTTLNAWMMLWLISTRPGMSILCLANSETQL 120 Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 K+TLWAEVSKWLSMLP++HWFEMQSLSLHP+ WYAE LE++ GIDSKHYTITCRTYSEER Sbjct: 121 KSTLWAEVSKWLSMLPNKHWFEMQSLSLHPAVWYAEALEKNFGIDSKHYTITCRTYSEER 180 Query: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 PDTFVG HNT+GMA+FNDEASGTPD+IN SILGFFTE N NRFW+MTSN RRL GWFYDI Sbjct: 181 PDTFVGHHNTYGMAIFNDEASGTPDVINTSILGFFTENNANRFWVMTSNPRRLKGWFYDI 240 Query: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 FN+PLEDW+R+QIDTRTVEGID FHEGIISRYGLDSDV R+E+LGQFPQQ++N+FIP Sbjct: 241 FNVPLEDWQRFQIDTRTVEGIDPSFHEGIISRYGLDSDVTRVEVLGQFPQQDINSFIPFY 300 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 IEEA++RE I D YAPLIMGCDIAGEGGD TVVV RRG IEHIFDWS + ++++ Sbjct: 301 RIEEALNREPIKDPYAPLIMGCDIAGEGGDNTVVVLRRGTNIEHIFDWSGLAVNASSRK 359 >gi|315122902|ref|YP_004063391.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496304|gb|ADR52903.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 509 Score = 566 bits (1460), Expect = e-159, Method: Compositional matrix adjust. Identities = 262/359 (72%), Positives = 303/359 (84%) Query: 1 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 M R + T + EQEL E++ + LSF NFV+R FPW L +FS+P RWQL+FME Sbjct: 1 MTRELPTKIEHEQELMELMFSDDIKLSFTNFVLRLFPWSEANTSLANFSRPRRWQLDFME 60 Query: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 AVD C NV+N +P IFK A+SAGRGIGKTTLNAWMMLWLISTRPGMSI+C+ANSETQL Sbjct: 61 AVDTDCLFNVDNPDPKIFKGAVSAGRGIGKTTLNAWMMLWLISTRPGMSILCLANSETQL 120 Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 K+TLWAEVSKWLSMLP++HWFEMQSLSLHP+ WYAE LE++ GIDSKHYTITCRTYSEER Sbjct: 121 KSTLWAEVSKWLSMLPNKHWFEMQSLSLHPAVWYAEALEKNFGIDSKHYTITCRTYSEER 180 Query: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 PDTFVG HNT+GMA+FNDEASGTPD+IN SILGFFTE N NRFW+MTSN RRLNGWFYDI Sbjct: 181 PDTFVGHHNTYGMAIFNDEASGTPDVINTSILGFFTENNANRFWVMTSNPRRLNGWFYDI 240 Query: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 FN+PLEDW+R+QIDTRTVEGID FHE II+RYGLDSDV R+E+LGQFPQQ++N+FIP Sbjct: 241 FNVPLEDWQRFQIDTRTVEGIDPNFHENIIARYGLDSDVTRVEVLGQFPQQDINSFIPFY 300 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 IEEA++RE I D YAPL+MGCDIAGEGGD TVVV RRG IEHIFDWS + ++++ Sbjct: 301 RIEEALNREPIKDPYAPLVMGCDIAGEGGDNTVVVLRRGTNIEHIFDWSGLAVNVSSRK 359 >gi|317120722|gb|ADV02544.1| putative phage terminase large subunit [Liberibacter phage SC2] gi|317120783|gb|ADV02604.1| putative phage terminase large subunit [Candidatus Liberibacter asiaticus] Length = 516 Score = 553 bits (1426), Expect = e-155, Method: Compositional matrix adjust. Identities = 257/359 (71%), Positives = 302/359 (84%) Query: 1 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 M R + T+ + EQ+L +++ E LSF NFV+ FFPWG KG PLE FS P WQLEFME Sbjct: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 Query: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 VD HC ++VNN NP +FK AISAGRGIGKTTLNAW++LWL+STRPG+S+IC+ANSETQL Sbjct: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 K TLWAEVSKWLS+LP++HWFEMQSLSLHP+ WY+++L S+GIDSKHY+ CRTYSEER Sbjct: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 Query: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 PDTFVG HNT+GMA+ NDEASGTPD+IN ILGF TE N NRFWIMTSN RRL+G FY+I Sbjct: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 Query: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 FN PL+DWKR+QIDTRTVEGID FHEGII+RYGLDSDV R+E+ GQFPQQ++++FIP Sbjct: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPQQ 300 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 YI EA+ R AI D YAPLIMGCDIAGEG DKTVVV RRGNIIE IFDWS +LI+ TN++ Sbjct: 301 YIVEALERVAIPDPYAPLIMGCDIAGEGEDKTVVVLRRGNIIERIFDWSGELIEVTNRK 359 >gi|254781215|ref|YP_003065628.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] gi|254040892|gb|ACT57688.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] gi|317120680|gb|ADV02503.1| putative phage terminase large subunit [Liberibacter phage SC1] gi|317120824|gb|ADV02645.1| putative phage terminase large subunit [Candidatus Liberibacter asiaticus] Length = 511 Score = 545 bits (1403), Expect = e-153, Method: Compositional matrix adjust. Identities = 252/359 (70%), Positives = 299/359 (83%) Query: 1 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 M R + T+ + EQ+L +++ E LSF NFV+ FFPWG KG PLE FS P WQLEFME Sbjct: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 Query: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 VD HC ++VNN NP +FK AISAGRGIGKTTLNAW++LWL+STRPG+S+IC+ANSETQL Sbjct: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 K TLWAEVSKWLS+LP++HWFEMQSLSLHP+ WY+++L S+GIDSKHY+ CRTYSEER Sbjct: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 Query: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 PDTFVG HNT+GMA+ NDEASGTPD+IN ILGF TE N NRFWIMTSN RRL+G FY+I Sbjct: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 Query: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 FN PL+DWKR+QIDTRTVEGID FHEGII+RYGLDSDV R+E+ GQFPQQ++++FIP N Sbjct: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 IEEA++RE D YAPLIMGCDIA EGGD TVVV RRG +IEH+FDWS ++ TN + Sbjct: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK 359 >gi|302120432|gb|ADK92426.1| putative phage terminase large subunit [Candidatus Liberibacter asiaticus] Length = 255 Score = 418 bits (1074), Expect = e-115, Method: Compositional matrix adjust. Identities = 194/255 (76%), Positives = 224/255 (87%) Query: 88 IGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLS 147 IGKTTLNAW++LWL+S RPGMSIIC+ANSETQLK TLWAEVSKWLS+LP++HWFEMQSLS Sbjct: 1 IGKTTLNAWLVLWLMSIRPGMSIICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 60 Query: 148 LHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDII 207 LHP+ WY+++L S+GIDSKHY+ CRTYSEERPDTFVG HNT+GMA+ NDEASGTPD+I Sbjct: 61 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 120 Query: 208 NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHE 267 N ILGF TE N NRFWIMTSN RRL+G FY+IFN PL+DWKR+QIDTRTVEGID FHE Sbjct: 121 NLGILGFLTEQNANRFWIMTSNPRRLSGKFYEIFNRPLDDWKRFQIDTRTVEGIDPSFHE 180 Query: 268 GIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE 327 GII+RYGLDSDV R+E+ GQFPQQ++++FIP N IEEA++RE D YAPLIMGCDIA E Sbjct: 181 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE 240 Query: 328 GGDKTVVVFRRGNII 342 GGD TVVV RRG +I Sbjct: 241 GGDNTVVVLRRGPVI 255 >gi|167032754|ref|YP_001667985.1| putative phage terminase large subunit [Pseudomonas putida GB-1] gi|166859242|gb|ABY97649.1| putative phage terminase, large subunit [Pseudomonas putida GB-1] Length = 499 Score = 166 bits (420), Expect = 5e-39, Method: Compositional matrix adjust. Identities = 102/333 (30%), Positives = 162/333 (48%), Gaps = 22/333 (6%) Query: 12 EQELHEMLMHAECVLSFKN----FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCH 67 EQEL A + SF + +V+ FPWG G L + + P +WQ E +E++ Sbjct: 11 EQEL------ANDIASFSDDPLGYVLYAFPWGEAGGELANKTGPRKWQREVLESIGEQLR 64 Query: 68 SNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAE 127 + + I + A+++G GIGK+ L +W++ W + T + AN+E+QL+ W E Sbjct: 65 AGAKDRGEVI-REAVASGHGIGKSALVSWVIKWALDTEVDTRGVVTANTESQLRTKTWPE 123 Query: 128 VSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGP 187 V+KW + HWF++ +L + E K++ I +S+ + F G Sbjct: 124 VAKWNRLSITAHWFKLTGTALISTDPDHE----------KNWRIDAVPWSDTNTEAFAGL 173 Query: 188 HNT-HGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLE 246 HN + + DEAS D++ + G T+ + W N R +G F + F Sbjct: 174 HNEGKRILLIFDEASAIADLVWEVAEGALTDADTEIIWAAFGNPTRNSGRFRECFTKFKH 233 Query: 247 DWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAM 306 W+ Q+D+RTV+G + I+ YG DSD RI + G FP+ IP +++ EAM Sbjct: 234 RWRHRQVDSRTVDGTNKTQIAKWIADYGEDSDFVRIRVRGMFPRASDLQLIPTDWVAEAM 293 Query: 307 SREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRG 339 R+ + L L+ G DIA G D V+ FRRG Sbjct: 294 RRDGVYGLDDALVCGIDIARGGMDNNVIRFRRG 326 >gi|323156136|gb|EFZ42295.1| terminase large subunit [Escherichia coli EPECa14] Length = 491 Score = 153 bits (387), Expect = 4e-35, Method: Compositional matrix adjust. Identities = 93/313 (29%), Positives = 147/313 (46%), Gaps = 15/313 (4%) Query: 30 NFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIG 89 + + FPWG +G L H + P +WQ + + H + P + A+++G GIG Sbjct: 25 GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPATRYQPLML--ALASGHGIG 82 Query: 90 KTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLH 149 K+ + ++ W +ST ++ AN++ QL+ W E+ KW ++ + WF + +++ Sbjct: 83 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142 Query: 150 PSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIIN 208 + +G D K + +SE + F G HN + V DEAS D++ Sbjct: 143 SN---------DLGHD-KRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192 Query: 209 KSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEG 268 + G T+ + W+ N R G F + F WK QID+RTVEG + + Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252 Query: 269 IISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAG 326 + YG DSD +I + G FP FIP +EAM R A YAP+I+G D A Sbjct: 253 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAYAPVIIGVDPAY 312 Query: 327 EGGDKTVVVFRRG 339 G D V+ R+G Sbjct: 313 SGVDDAVIYLRQG 325 >gi|212710820|ref|ZP_03318948.1| hypothetical protein PROVALCAL_01888 [Providencia alcalifaciens DSM 30120] gi|212686517|gb|EEB46045.1| hypothetical protein PROVALCAL_01888 [Providencia alcalifaciens DSM 30120] Length = 493 Score = 152 bits (384), Expect = 8e-35, Method: Compositional matrix adjust. Identities = 100/345 (28%), Positives = 154/345 (44%), Gaps = 30/345 (8%) Query: 4 LISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVD 63 +I T EQ ++++ M LS+ + FPWG G LE+ S P +WQ E + + Sbjct: 1 MIETMSPEEQLINDIGMFTHDPLSY---ALYAFPWGEAGTELENASGPRQWQAEALNEIG 57 Query: 64 VHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNT 123 H + P + A ++G GIGK+ + ++ W + T ++ AN+E QL+ Sbjct: 58 EHLRNPETRHQP--LQLARASGHGIGKSAFISMIIKWGMDTCEDCKVVVTANTENQLRTK 115 Query: 124 LWAEVSKWLSMLPHRHWFEMQSLSL------HPSGWYAELLEQSMGIDSKHYTITCRTYS 177 W E++KW + + WF ++ H + W A+ + +S Sbjct: 116 TWPEIAKWQRLSITKDWFTCTKTAIYSNDPNHANAWRADAV----------------PWS 159 Query: 178 EERPDTFVGPHNTHGMAVFN-DEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGW 236 E + F G HN + DEAS D++ + G T+ N WI N R G Sbjct: 160 ENNTEAFAGLHNQGKRIILVFDEASNIADLVWEVAEGALTDENTEIIWIAFGNPTRNTGR 219 Query: 237 FYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNF 296 F + F WK QID+RTVEG + E I YG+D D ++ + G FP F Sbjct: 220 FRECFRKFKHRWKTKQIDSRTVEGTNKEQIEKWIQDYGVDDDFVKVRVRGIFPSTSEKQF 279 Query: 297 IPHNYIEEAMSREAI--DDLYAPLIMGCDIAGEGGDKTVVVFRRG 339 IP + AM R + +AP+I+G D A G D V+ R+G Sbjct: 280 IPTGLTDAAMKRTVTQAEVSHAPIILGVDPAYSGDDDAVIYLRQG 324 >gi|303328395|ref|ZP_07358832.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861389|gb|EFL84326.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 500 Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 94/313 (30%), Positives = 147/313 (46%), Gaps = 16/313 (5%) Query: 30 NFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIG 89 FV+ FPWG G ++ P WQ E + + + S ++ + A+S+G G+G Sbjct: 31 GFVLFAFPWG-GGALADYPDGPDVWQREILRGMGEQLSTGA--SAASVIREAVSSGHGVG 87 Query: 90 KTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLH 149 K+ L AW++LW +ST + AN+E QLK WAE++KW + +WF+ + + Sbjct: 88 KSALVAWIILWAMSTFSDTRGVVTANTENQLKGKTWAELAKWHRLCLCGYWFDCTATA-- 145 Query: 150 PSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNT-HGMAVFNDEASGTPDIIN 208 L+ G + K + + +SE + F G HN + + DEAS PD I Sbjct: 146 -------LISTQAGHE-KTWRVDMVAWSERNTEAFAGLHNKGRRVLLIFDEASAIPDAIW 197 Query: 209 KSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEG 268 + G T+ + W N R G F + F W ++D+RT D Sbjct: 198 EVSEGALTDADTEIIWCCFGNPTRNTGRFRECFGRYAHRWNTRRVDSRTAAMTDKNQLAQ 257 Query: 269 IISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLY--APLIMGCDIAG 326 + YG DSD R+ + G+FP+ FI + + EA R D Y AP I+G D+A Sbjct: 258 WVEDYGEDSDFVRVRVRGEFPRAGDRQFISSDIVHEARGRSLKPDQYSFAPRILGVDVAR 317 Query: 327 EGGDKTVVVFRRG 339 G D++V+ R+G Sbjct: 318 SGSDQSVITRRQG 330 >gi|268589373|ref|ZP_06123594.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] gi|291315400|gb|EFE55853.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] Length = 493 Score = 150 bits (380), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 99/345 (28%), Positives = 155/345 (44%), Gaps = 30/345 (8%) Query: 4 LISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVD 63 +I T EQ ++++ M LS+ + FPWG G LE+ + P +WQ E + + Sbjct: 1 MIDTMSPEEQLINDIGMFTHDPLSYALYA---FPWGEAGTELENANGPRQWQAEALNEIG 57 Query: 64 VHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNT 123 H + P + A ++G GIGK+ + ++ W + T ++ AN+E QL+ Sbjct: 58 EHLRNPETRHQP--LQLARASGHGIGKSAFISMIIKWGMDTCEDCKVVVTANTENQLRTK 115 Query: 124 LWAEVSKWLSMLPHRHWFEMQSLSL------HPSGWYAELLEQSMGIDSKHYTITCRTYS 177 W E++KW + + WF ++ H + W A+ + +S Sbjct: 116 TWPEIAKWQRLSITKDWFTYTKTAIYSNDPNHANAWRADAV----------------PWS 159 Query: 178 EERPDTFVGPHNT-HGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGW 236 E + F G HN + + DEAS D++ + G T+ N WI N R G Sbjct: 160 ENNTEAFAGLHNQGKRIILIFDEASNIADLVWEVAEGALTDENTEIIWIAFGNPTRNTGR 219 Query: 237 FYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNF 296 F + F WK QID+RTVEG + E I YG+D D ++ + G FP F Sbjct: 220 FRECFRKFKHRWKTKQIDSRTVEGTNKEQIEKWIQDYGVDDDFVKVRVRGIFPSTSEKQF 279 Query: 297 IPHNYIEEAMSREAI--DDLYAPLIMGCDIAGEGGDKTVVVFRRG 339 IP + AM R + +AP+I+G D A G D V+ R+G Sbjct: 280 IPTGLTDAAMKRTVTQAEVSHAPIIIGVDPAYSGDDDAVIYLRQG 324 >gi|330007152|ref|ZP_08305894.1| hypothetical protein HMPREF9538_03583 [Klebsiella sp. MS 92-3] gi|328535499|gb|EGF61959.1| hypothetical protein HMPREF9538_03583 [Klebsiella sp. MS 92-3] Length = 495 Score = 149 bits (377), Expect = 5e-34, Method: Compositional matrix adjust. Identities = 101/342 (29%), Positives = 157/342 (45%), Gaps = 23/342 (6%) Query: 7 TDQKL--EQELHEMLMHAECVLSFKN----FVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 TD L E++L E L+ + + SF + + + FPWG G L H S P +WQ + Sbjct: 2 TDAALSPEEQLKEQLI--DDIASFTHDPLGYALYAFPWGEDGTELAHASGPRQWQADAFR 59 Query: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 + H + P + A +G GIGK+ + ++ W +ST ++ AN++ QL Sbjct: 60 EIGEHLQNPATRHQPLMISRA--SGHGIGKSAFISMLINWAMSTCEDCKVVVTANTDNQL 117 Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 + W E+ KW ++ + WF + +++ + G D K + +SE Sbjct: 118 RTKTWPEIIKWSNLAITKEWFTCTATAMYSN---------DPGHD-KRWRADAIPWSEHN 167 Query: 181 PDTFVGPHNTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYD 239 + F G HN + V DEAS D++ + G T+ + W+ N R G F + Sbjct: 168 TEAFAGLHNERKRIVVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRE 227 Query: 240 IFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPH 299 F WK QID+RTVEG + + + YG DSD ++ + G FP FIP Sbjct: 228 CFRKYKHRWKCAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKVRVRGIFPDASELQFIPT 287 Query: 300 NYIEEAMSR--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRG 339 +EAM R A +AP I+G D A G D V+ R+G Sbjct: 288 GLTDEAMKRVVTAAQVAHAPRIIGVDPAYSGVDDAVIYLRQG 329 >gi|262043569|ref|ZP_06016682.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039103|gb|EEW40261.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 491 Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 91/312 (29%), Positives = 144/312 (46%), Gaps = 15/312 (4%) Query: 31 FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGK 90 + + FPWG G L H + P +WQ + + H + P + A ++G GIGK Sbjct: 26 YALYAFPWGEDGTELAHATGPRKWQADAFREIRDHLQNPATRHQPLML--ARASGHGIGK 83 Query: 91 TTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHP 150 + + ++ W +ST ++ AN++ QL+ W E+ KW ++ + WF + +++ Sbjct: 84 SAFISMLINWAMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKEWFTCTATAMYS 143 Query: 151 SGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIINK 209 + G D K + +SE + F G HN + V DEAS D++ + Sbjct: 144 N---------DPGHD-KRWRADAIPWSEHNTEAFAGLHNERKRIVVVFDEASNIADLVWE 193 Query: 210 SILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGI 269 G T+ + W+ N R G F + F WK QID+RTVEG + + Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRTVEGTNKQQLQKW 253 Query: 270 ISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAGE 327 + YG DSD ++ + G FP FIP +EAM R A+ +AP I+G D A Sbjct: 254 VDDYGEDSDFVKVRVRGIFPDASELQFIPTGLTDEAMKRVVTAVQVAHAPRIIGVDPAYS 313 Query: 328 GGDKTVVVFRRG 339 G D V+ R+G Sbjct: 314 GVDDAVIYLRQG 325 >gi|332344357|gb|AEE57691.1| terminase, large subunit [Escherichia coli UMNK88] Length = 491 Score = 148 bits (373), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 92/312 (29%), Positives = 145/312 (46%), Gaps = 15/312 (4%) Query: 31 FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGK 90 + + FPWG +G L H + P +WQ + + H + P + A ++G GIGK Sbjct: 26 YALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQNPATRHQPLML--ARASGHGIGK 83 Query: 91 TTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHP 150 + + ++ W +ST ++ AN++ QL+ W E+ KW ++ + WF + +++ Sbjct: 84 SAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMYS 143 Query: 151 SGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIINK 209 + G D K + +SE + F G HN + V DEAS D++ + Sbjct: 144 N---------DPGHD-KRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVWE 193 Query: 210 SILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGI 269 G T+ + W+ N R G F + F WK QID+RTVEG + + Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQKW 253 Query: 270 ISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAGE 327 + YG DSD +I + G FP FIP +EAM R A +AP+I+G D A Sbjct: 254 VDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAYS 313 Query: 328 GGDKTVVVFRRG 339 G D V+ R+G Sbjct: 314 GVDDAVIYLRQG 325 >gi|324008564|gb|EGB77783.1| hypothetical protein HMPREF9532_01752 [Escherichia coli MS 57-2] Length = 491 Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 92/312 (29%), Positives = 146/312 (46%), Gaps = 15/312 (4%) Query: 31 FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGK 90 + + FPWG +G L H + P +WQ + + H + P + A ++G GIGK Sbjct: 26 YALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIGK 83 Query: 91 TTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHP 150 + + ++ W +ST ++ AN++ QL+ W E+ KW ++ + WF + +++ Sbjct: 84 SAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMYS 143 Query: 151 SGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIINK 209 + +G D K + +SE + F G HN + V DEAS D++ + Sbjct: 144 N---------DLGHD-KRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVWE 193 Query: 210 SILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGI 269 G T+ + W+ N R G F + F WK QID+RTVEG + + Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQKW 253 Query: 270 ISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAGE 327 + YG DSD +I + G FP FIP +EAM R A +AP+I+G D A Sbjct: 254 VDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAYS 313 Query: 328 GGDKTVVVFRRG 339 G D V+ R+G Sbjct: 314 GVDDAVIYLRQG 325 >gi|327252187|gb|EGE63859.1| terminase large subunit [Escherichia coli STEC_7v] Length = 491 Score = 147 bits (371), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 92/312 (29%), Positives = 145/312 (46%), Gaps = 15/312 (4%) Query: 31 FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGK 90 + + FPWG +G L H + P +WQ + + H + P + A ++G GIGK Sbjct: 26 YALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPATRYQPLML--ARASGHGIGK 83 Query: 91 TTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHP 150 + + ++ W +ST ++ AN++ QL+ W E+ KW ++ + WF + +++ Sbjct: 84 SAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMYS 143 Query: 151 SGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIINK 209 + G D K + +SE + F G HN + V DEAS D++ + Sbjct: 144 N---------DPGHD-KRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVWE 193 Query: 210 SILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGI 269 G T+ + W+ N R G F + F WK QID+RTVEG + + Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQKW 253 Query: 270 ISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAGE 327 + YG DSD +I + G FP FIP +EAM R A +AP+I+G D A Sbjct: 254 VDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAYS 313 Query: 328 GGDKTVVVFRRG 339 G D V+ R+G Sbjct: 314 GVDDAVIYLRQG 325 >gi|300898423|ref|ZP_07116764.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357890|gb|EFJ73760.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 491 Score = 147 bits (370), Expect = 4e-33, Method: Compositional matrix adjust. Identities = 92/312 (29%), Positives = 145/312 (46%), Gaps = 15/312 (4%) Query: 31 FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGK 90 + + FPWG +G L H + P +WQ + + H + P + A ++G GIGK Sbjct: 26 YALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQNPETRYQPLML--ARASGHGIGK 83 Query: 91 TTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHP 150 + + ++ W +ST ++ AN++ QL+ W E+ KW ++ + WF + +++ Sbjct: 84 SAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMYS 143 Query: 151 SGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIINK 209 + G D K + +SE + F G HN + V DEAS D++ + Sbjct: 144 N---------DPGHD-KRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVWE 193 Query: 210 SILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGI 269 G T+ + W+ N R G F + F WK QID+RTVEG + + Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQKW 253 Query: 270 ISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAGE 327 + YG DSD +I + G FP FIP +EAM R A +AP+I+G D A Sbjct: 254 VDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAYS 313 Query: 328 GGDKTVVVFRRG 339 G D V+ R+G Sbjct: 314 GVDDAVIYLRQG 325 >gi|218700994|ref|YP_002408623.1| putative phage terminase, large subunit [Escherichia coli IAI39] gi|218370980|emb|CAR18807.1| putative phage terminase, large subunit [Escherichia coli IAI39] Length = 491 Score = 146 bits (368), Expect = 5e-33, Method: Compositional matrix adjust. Identities = 92/312 (29%), Positives = 145/312 (46%), Gaps = 15/312 (4%) Query: 31 FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGK 90 + + FPWG +G L H + P +WQ + + H + P + A ++G GIGK Sbjct: 26 YALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIGK 83 Query: 91 TTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHP 150 + + ++ W +ST ++ AN++ QL+ W E+ KW ++ + WF + +++ Sbjct: 84 SAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMYS 143 Query: 151 SGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIINK 209 + G D K + +SE + F G HN + V DEAS D++ + Sbjct: 144 N---------DPGHD-KRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVWE 193 Query: 210 SILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGI 269 G T+ + W+ N R G F + F WK QID+RTVEG + + Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQKW 253 Query: 270 ISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAGE 327 + YG DSD +I + G FP FIP +EAM R A +AP+I+G D A Sbjct: 254 VDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAYS 313 Query: 328 GGDKTVVVFRRG 339 G D V+ R+G Sbjct: 314 GVDDAVIYLRQG 325 >gi|331648179|ref|ZP_08349269.1| conserved hypothetical protein [Escherichia coli M605] gi|331043039|gb|EGI15179.1| conserved hypothetical protein [Escherichia coli M605] Length = 491 Score = 146 bits (368), Expect = 5e-33, Method: Compositional matrix adjust. Identities = 92/312 (29%), Positives = 145/312 (46%), Gaps = 15/312 (4%) Query: 31 FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGK 90 + + FPWG +G L H + P +WQ + + H + P + A ++G GIGK Sbjct: 26 YALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIGK 83 Query: 91 TTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHP 150 + + ++ W +ST ++ AN++ QL+ W E+ KW ++ + WF + +++ Sbjct: 84 SAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMYS 143 Query: 151 SGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIINK 209 + G D K + +SE + F G HN + V DEAS D++ + Sbjct: 144 N---------DPGHD-KRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVWE 193 Query: 210 SILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGI 269 G T+ + W+ N R G F + F WK QID+RTVEG + + Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQKW 253 Query: 270 ISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAGE 327 + YG DSD +I + G FP FIP +EAM R A +AP+I+G D A Sbjct: 254 VDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAYS 313 Query: 328 GGDKTVVVFRRG 339 G D V+ R+G Sbjct: 314 GVDDAVIYLRQG 325 >gi|298381721|ref|ZP_06991320.1| terminase large subunit protein [Escherichia coli FVEC1302] gi|301019339|ref|ZP_07183525.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|298279163|gb|EFI20677.1| terminase large subunit protein [Escherichia coli FVEC1302] gi|299882256|gb|EFI90467.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|323948690|gb|EGB44595.1| hypothetical protein ERKG_04913 [Escherichia coli H252] Length = 491 Score = 146 bits (368), Expect = 5e-33, Method: Compositional matrix adjust. Identities = 92/312 (29%), Positives = 145/312 (46%), Gaps = 15/312 (4%) Query: 31 FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGK 90 + + FPWG +G L H + P +WQ + + H + P + A ++G GIGK Sbjct: 26 YALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIGK 83 Query: 91 TTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHP 150 + + ++ W +ST ++ AN++ QL+ W E+ KW ++ + WF + +++ Sbjct: 84 SAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMYS 143 Query: 151 SGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIINK 209 + G D K + +SE + F G HN + V DEAS D++ + Sbjct: 144 N---------DPGHD-KRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVWE 193 Query: 210 SILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGI 269 G T+ + W+ N R G F + F WK QID+RTVEG + + Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQKW 253 Query: 270 ISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAGE 327 + YG DSD +I + G FP FIP +EAM R A +AP+I+G D A Sbjct: 254 VDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAYS 313 Query: 328 GGDKTVVVFRRG 339 G D V+ R+G Sbjct: 314 GVDDAVIYLRQG 325 >gi|294491573|gb|ADE90329.1| putative phage terminase, large subunit [Escherichia coli IHE3034] Length = 491 Score = 146 bits (368), Expect = 5e-33, Method: Compositional matrix adjust. Identities = 92/312 (29%), Positives = 145/312 (46%), Gaps = 15/312 (4%) Query: 31 FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGK 90 + + FPWG +G L H + P +WQ + + H + P + A ++G GIGK Sbjct: 26 YALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIGK 83 Query: 91 TTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHP 150 + + ++ W +ST ++ AN++ QL+ W E+ KW ++ + WF + +++ Sbjct: 84 SAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMYS 143 Query: 151 SGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIINK 209 + G D K + +SE + F G HN + V DEAS D++ + Sbjct: 144 N---------DPGHD-KRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVWE 193 Query: 210 SILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGI 269 G T+ + W+ N R G F + F WK QID+RTVEG + + Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQKW 253 Query: 270 ISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAGE 327 + YG DSD +I + G FP FIP +EAM R A +AP+I+G D A Sbjct: 254 VDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAYS 313 Query: 328 GGDKTVVVFRRG 339 G D V+ R+G Sbjct: 314 GVDDAVIYLRQG 325 >gi|301046412|ref|ZP_07193572.1| conserved hypothetical protein [Escherichia coli MS 185-1] gi|300301638|gb|EFJ58023.1| conserved hypothetical protein [Escherichia coli MS 185-1] Length = 491 Score = 146 bits (368), Expect = 6e-33, Method: Compositional matrix adjust. Identities = 92/312 (29%), Positives = 144/312 (46%), Gaps = 15/312 (4%) Query: 31 FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGK 90 + + FPWG G L H + P +WQ + + H + P + A ++G GIGK Sbjct: 26 YALYAFPWGEDGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIGK 83 Query: 91 TTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHP 150 + + ++ W +ST ++ AN++ QL+ W E+ KW ++ + WF + +++ Sbjct: 84 SAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMYS 143 Query: 151 SGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIINK 209 + G D K + +SE + F G HN + V DEAS D++ + Sbjct: 144 N---------DPGHD-KRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVWE 193 Query: 210 SILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGI 269 G T+ + W+ N R G F + F WK QID+RTVEG + + Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQKW 253 Query: 270 ISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAGE 327 + YG DSD +I + G FP FIP +EAM R A +AP+I+G D A Sbjct: 254 VDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAYS 313 Query: 328 GGDKTVVVFRRG 339 G D V+ R+G Sbjct: 314 GVDDAVIYLRQG 325 >gi|290968649|ref|ZP_06560187.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str. 28L] gi|290781302|gb|EFD93892.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str. 28L] Length = 487 Score = 145 bits (367), Expect = 7e-33, Method: Compositional matrix adjust. Identities = 98/325 (30%), Positives = 160/325 (49%), Gaps = 29/325 (8%) Query: 31 FVMRFFPWG---IKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRG 87 FV F W +KG+ P WQ++ ++ V S T + A ++G G Sbjct: 22 FVYFAFDWDSEELKGQ------NPQTWQIKTLKEVGEGL------SLSTALQHATASGHG 69 Query: 88 IGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLS 147 IGK+ L AW++LW ISTRP + AN+ TQL+ WAE+SKW + + +F + S + Sbjct: 70 IGKSALVAWLILWAISTRPDTRGVVTANTATQLETKTWAELSKWYHLFRGKKFFTLTSTA 129 Query: 148 LHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNT-HGMAVFNDEASGTPDI 206 + E E++ ID+ +++ +R ++F G HN + + + DEAS + Sbjct: 130 IFCR---QEGHERTWRIDAIPWSV-------DRTESFAGLHNQGNRLLLIFDEASAIDNK 179 Query: 207 INKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFH 266 I + G T+ + W++ N R G F+D F+ + W +ID+RTV+ + Sbjct: 180 IWEVAEGALTDKDTEILWLVFGNPTRSTGRFFDCFHKYKKSWITQKIDSRTVDISNKTQL 239 Query: 267 EGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDL---YAPLIMGCD 323 + I YG+DSD ++ +LG+FP FI + A R + +AP I+G D Sbjct: 240 QKWIQTYGIDSDFVKVRVLGEFPDTSDTQFISTAIVRTAWERRPLRTAEYDFAPCIIGMD 299 Query: 324 IAGEGGDKTVVVFRRGNIIEHIFDW 348 A GGD TV+ R+G E + ++ Sbjct: 300 PAWTGGDSTVIFLRQGFFSEKLAEY 324 >gi|30387381|ref|NP_848210.1| terminase large subunit [Enterobacteria phage epsilon15] gi|30266036|gb|AAO06065.1| terminase large subunit [Salmonella phage epsilon15] Length = 491 Score = 145 bits (367), Expect = 7e-33, Method: Compositional matrix adjust. Identities = 95/338 (28%), Positives = 155/338 (45%), Gaps = 21/338 (6%) Query: 5 ISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDV 64 IST+++L +++ A + + FPWG G L H + P +WQ + + Sbjct: 6 ISTEEQLVEDI------ASFTYDPLGYALYAFPWGEDGTELAHATGPRKWQADAFREIRD 59 Query: 65 HCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTL 124 H + P + A ++G GIGK+ + ++ W +ST ++ AN++ QL+ Sbjct: 60 HLQNPATRHQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKT 117 Query: 125 WAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTF 184 W E+ KW ++ + WF + +++ + G D K + +SE + F Sbjct: 118 WPEIIKWSNLAITKEWFTCTATAMYSN---------DPGHD-KRWRADAIPWSEHNTEAF 167 Query: 185 VGPHNTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI 243 G HN + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 168 AGLHNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRK 227 Query: 244 PLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIE 303 WK QID+RTVEG + + + YG +SD ++ + G FP FIP + Sbjct: 228 YKHRWKCAQIDSRTVEGTNKQQLQKWVDDYGEESDFVKVRVRGIFPDASELQFIPTGLTD 287 Query: 304 EAMSR--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRG 339 EAM R A +AP+I+G D A G D V+ R+G Sbjct: 288 EAMKRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQG 325 >gi|320175050|gb|EFW50163.1| terminase B protein, putative [Shigella dysenteriae CDC 74-1112] Length = 480 Score = 145 bits (367), Expect = 7e-33, Method: Compositional matrix adjust. Identities = 92/313 (29%), Positives = 145/313 (46%), Gaps = 15/313 (4%) Query: 30 NFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIG 89 + + FPWG +G L H + P +WQ + + H + P + A ++G GIG Sbjct: 14 GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 71 Query: 90 KTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLH 149 K+ + ++ W +ST ++ AN++ QL+ W E+ KW ++ + WF + +++ Sbjct: 72 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 131 Query: 150 PSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIIN 208 + G D K + +SE + F G HN + V DEAS D++ Sbjct: 132 SN---------DPGHD-KRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 181 Query: 209 KSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEG 268 + G T+ + W+ N R G F + F WK QID+RTVEG + + Sbjct: 182 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRTVEGTNKQQLQK 241 Query: 269 IISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAG 326 + YG DSD +I + G FP FIP +EAM R A +AP+I+G D A Sbjct: 242 WVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 301 Query: 327 EGGDKTVVVFRRG 339 G D V+ R+G Sbjct: 302 SGVDDAVIYLRQG 314 >gi|117624715|ref|YP_853628.1| putative phage terminase, large subunit [Escherichia coli APEC O1] gi|115513839|gb|ABJ01914.1| putative phage terminase, large subunit [Escherichia coli APEC O1] Length = 491 Score = 145 bits (365), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 91/312 (29%), Positives = 145/312 (46%), Gaps = 15/312 (4%) Query: 31 FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGK 90 + + FPWG +G L H + P +WQ + + H + P + A ++G GIGK Sbjct: 26 YALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIGK 83 Query: 91 TTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHP 150 + + ++ W +ST ++ AN++ QL+ W E+ KW ++ + WF + +++ Sbjct: 84 SAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMYS 143 Query: 151 SGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIINK 209 + G D K + +SE + F G HN + V DEAS D++ + Sbjct: 144 N---------DPGHD-KRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVWE 193 Query: 210 SILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGI 269 G T+ + W+ N R G F + F WK QID+RTVEG + + Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQKW 253 Query: 270 ISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAGE 327 + YG DSD +I + G FP FIP +EAM R A ++P+I+G D A Sbjct: 254 VDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHSPVIIGVDPAYS 313 Query: 328 GGDKTVVVFRRG 339 G D V+ R+G Sbjct: 314 GVDDAVIYLRQG 325 >gi|89152423|ref|YP_512256.1| putative terminase large subunit [Escherichia phage phiV10] gi|74055446|gb|AAZ95895.1| putative terminase large subunit [Escherichia phage phiV10] Length = 491 Score = 143 bits (361), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 91/313 (29%), Positives = 144/313 (46%), Gaps = 15/313 (4%) Query: 30 NFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIG 89 + + FPWG +G L H + P +WQ + + H + P + A ++G GIG Sbjct: 25 GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML--ARASGHGIG 82 Query: 90 KTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLH 149 K+ + ++ W +ST ++ AN++ QL+ W E+ KW ++ + WF + +++ Sbjct: 83 KSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWFTCTATAMY 142 Query: 150 PSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIIN 208 + G D K + +SE + F G HN + V DEAS D++ Sbjct: 143 SN---------DPGHD-KRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVW 192 Query: 209 KSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEG 268 + G T+ + W+ N R G F + F WK QID+RTVEG + + Sbjct: 193 EVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSRTVEGTNKQQLQK 252 Query: 269 IISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAG 326 + YG SD +I + G FP FIP +EAM R A +AP+I+G D A Sbjct: 253 WVDDYGEGSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAY 312 Query: 327 EGGDKTVVVFRRG 339 G D V+ R+G Sbjct: 313 SGVDDAVIYLRQG 325 >gi|309702815|emb|CBJ02146.1| putative terminase, large subunit [Escherichia coli ETEC H10407] Length = 493 Score = 143 bits (361), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 87/321 (27%), Positives = 148/321 (46%), Gaps = 15/321 (4%) Query: 31 FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGK 90 + + FPWG +G L H + P +WQ + + H + P + A ++G GIGK Sbjct: 26 YALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQNPATRHQPIML--ARASGHGIGK 83 Query: 91 TTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHP 150 + + ++ W +ST ++ AN++ QL+ W E+ KW ++ + WF + +++ Sbjct: 84 SAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKEWFTCTATAMYS 143 Query: 151 SGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIINK 209 + G D K + +SE + F G HN + V DEAS D++ + Sbjct: 144 N---------DPGHD-KRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVWE 193 Query: 210 SILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGI 269 G T+ + W+ N R G F + F WK QID+RTVEG + + Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRTVEGTNKEQLQKW 253 Query: 270 ISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAGE 327 + YG DSD ++ + G FP N FIP + A+ R +A +++G D + + Sbjct: 254 VDDYGEDSDFVKVRVRGIFPDASENQFIPSGLTQPAVGRVITPAQVQHAAVVLGVDPSHQ 313 Query: 328 GGDKTVVVFRRGNIIEHIFDW 348 G D V+ R+G + + +W Sbjct: 314 GKDPAVIYLRQGLHCKKLGEW 334 >gi|215487825|ref|YP_002330256.1| predicted terminase, large subunit [Escherichia coli O127:H6 str. E2348/69] gi|215265897|emb|CAS10306.1| predicted terminase, large subunit [Escherichia coli O127:H6 str. E2348/69] Length = 493 Score = 142 bits (359), Expect = 6e-32, Method: Compositional matrix adjust. Identities = 87/321 (27%), Positives = 147/321 (45%), Gaps = 15/321 (4%) Query: 31 FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGK 90 + + FPWG G L H + P +WQ + + H + P + A ++G GIGK Sbjct: 26 YALYAFPWGEDGTELAHATGPRKWQADAFREIRDHLQNPATRHQPLML--ARASGHGIGK 83 Query: 91 TTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHP 150 + + ++ W +ST ++ AN++ QL+ W E+ KW ++ + WF + +++ Sbjct: 84 SAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKEWFTCTATAMYS 143 Query: 151 SGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIINK 209 + G D K + +SE + F G HN + V DEAS D++ + Sbjct: 144 N---------DPGHD-KRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLVWE 193 Query: 210 SILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGI 269 G T+ + W+ N R G F + F WK QID+RTVEG + + Sbjct: 194 VAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRTVEGTNKQQLQKW 253 Query: 270 ISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAGE 327 + YG DSD ++ + G FP N FIP + A+ R +A +++G D + + Sbjct: 254 VDDYGEDSDFVKVRVRGIFPDASENQFIPSGLTQPAVGRVITPAQVQHAAVVLGVDPSHQ 313 Query: 328 GGDKTVVVFRRGNIIEHIFDW 348 G D V+ R+G + + +W Sbjct: 314 GKDPAVIYLRQGLHCKKLGEW 334 >gi|282848875|ref|ZP_06258265.1| conserved hypothetical protein [Veillonella parvula ATCC 17745] gi|282581380|gb|EFB86773.1| conserved hypothetical protein [Veillonella parvula ATCC 17745] Length = 483 Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 98/337 (29%), Positives = 157/337 (46%), Gaps = 17/337 (5%) Query: 14 ELHEMLMHAECVLSFK--NFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVN 71 E H+ L+ A L+ FV +PWG G PLE+ P WQ++ ++ D+ Sbjct: 2 EKHDELIEALGALTHDPLAFVYFAYPWGEPGTPLENMEGPDEWQIQILK--DIGEQLKKG 59 Query: 72 NSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKW 131 T + A+++G GIGK+ L +W++ + IST + AN+E QL+ W E+SKW Sbjct: 60 KDLQTAIQEAVASGHGIGKSALISWLIHFAISTHENTRGVVTANTEGQLRTKTWPELSKW 119 Query: 132 LSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNT- 190 +M + F + ++ S E K + I +S+ P++F G HN Sbjct: 120 HNMFIAKDLFTYTATAIFSSDKDYE----------KTWRIDAIPWSKNSPESFAGLHNQG 169 Query: 191 HGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKR 250 + + V DEAS D+I + G T+ N W N R +G F + F + W Sbjct: 170 NRILVLFDEASAIDDVIWEVTEGALTDANTEIIWCAFGNPTRNSGRFRECFRKYRKFWNT 229 Query: 251 YQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREA 310 YQID+RTV+ + E + YG DSD ++ + G FP FI ++A + Sbjct: 230 YQIDSRTVKISNKTKIEEWLEAYGEDSDFFKVRVRGVFPSASDLQFISTEIADKAQKQVY 289 Query: 311 IDDLYA--PLIMGCDIAGEGGDKTVVVFRRGNIIEHI 345 + P+I+G D A G D +V R+G ++ + Sbjct: 290 KPGQFEHLPVIIGVDPAWTGSDSLEIVMRQGYYMKSL 326 >gi|227355862|ref|ZP_03840255.1| phage terminase, large subunit [Proteus mirabilis ATCC 29906] gi|227164181|gb|EEI49078.1| phage terminase, large subunit [Proteus mirabilis ATCC 29906] Length = 494 Score = 139 bits (349), Expect = 8e-31, Method: Compositional matrix adjust. Identities = 96/344 (27%), Positives = 151/344 (43%), Gaps = 33/344 (9%) Query: 9 QKLEQELHEMLMHAECVLSFKN----FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDV 64 + L++ E L+ E + SF + + FPWG G LE ++ P +WQ E + + Sbjct: 3 EALQKSPEEQLI--EDIASFTHDPLGYAYYAFPWGEAGGELEEYNGPRQWQAEALNEIGE 60 Query: 65 HCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTL 124 H + P + A ++G GIGK+ + ++ W + T ++ AN+E QL+ Sbjct: 61 HLRNPKTRHQPLLL--ARASGHGIGKSAFISMIIKWGMDTCEDCKVVVTANTENQLRTKT 118 Query: 125 WAEVSKWLSMLPHRHWFEMQSLSL------HPSGWYAELLEQSMGIDSKHYTITCRTYSE 178 W E++KW + +WF ++ H + W A+ + +SE Sbjct: 119 WPEIAKWQRLSLTNNWFTCTKTAIYSNDPNHANAWRADAV----------------PWSE 162 Query: 179 ERPDTFVGPHNTHGMAVFN-DEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWF 237 + F G HN + DEAS D++ + G T+ WI N R G F Sbjct: 163 NNTEAFAGLHNKGKRIILVFDEASNIADLVWEVAEGALTDEGTEIIWIAFGNPTRNTGRF 222 Query: 238 YDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFI 297 + F W QID+RTVEG + + YG DSD ++ + G FP FI Sbjct: 223 RECFRKFKHRWNTKQIDSRTVEGSNKEQIKNWEEDYGEDSDFFKVRVRGVFPSASELQFI 282 Query: 298 PHNYIEEAMSR--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRG 339 P +EAM R + +AP+I+G D A G D V+ R+G Sbjct: 283 PTGLTDEAMKRIVTQAEVAHAPVIIGVDPAYSGIDDAVIYLRQG 326 >gi|304398406|ref|ZP_07380280.1| terminase, large subunit [Pantoea sp. aB] gi|304354272|gb|EFM18645.1| terminase, large subunit [Pantoea sp. aB] Length = 490 Score = 137 bits (345), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 85/318 (26%), Positives = 139/318 (43%), Gaps = 27/318 (8%) Query: 31 FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGK 90 + + FPWG +G L + P +WQ + + + H + P + A +G GIGK Sbjct: 25 YALYAFPWGEEGTDLAYSKGPRQWQEDAFKQIGAHLQNPDTRHQPLMIGRA--SGHGIGK 82 Query: 91 TTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL-- 148 + + ++ W + T ++ AN+E QL+ W E++KW + + WF + ++ Sbjct: 83 SAFISMLVKWGMDTCEDCKVVVTANTENQLRTKTWPEIAKWQRLSITQDWFTCTATAIYS 142 Query: 149 ----HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVF-NDEASGT 203 H W A+ + +SE + F G HN + DEAS Sbjct: 143 NDPSHAKSWRADAI----------------PWSENNTEAFAGLHNERKRIILIFDEASNI 186 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDS 263 D++ + G T+ N W+ N R G F + F WK QID+R+VEG + Sbjct: 187 ADLVWEVAEGALTDENTEIIWVAFGNPTRNTGRFRECFRKLRHRWKTAQIDSRSVEGTNK 246 Query: 264 GFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDD--LYAPLIMG 321 + + YG DSD ++ + G FP FIP + A+ R +A ++G Sbjct: 247 EQIQKWVDDYGEDSDFVKVRVRGLFPSASEAQFIPTGLTDAAVGRVITPGQVAHAATVIG 306 Query: 322 CDIAGEGGDKTVVVFRRG 339 D A +GGD V+ R+G Sbjct: 307 VDPAHQGGDPAVIYLRQG 324 >gi|332981151|ref|YP_004462592.1| hypothetical protein Mahau_0567 [Mahella australiensis 50-1 BON] gi|332698829|gb|AEE95770.1| hypothetical protein Mahau_0567 [Mahella australiensis 50-1 BON] Length = 461 Score = 134 bits (337), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 99/301 (32%), Positives = 146/301 (48%), Gaps = 50/301 (16%) Query: 49 SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGM 108 ++P WQ E ++A+ NP + A+ +G G+GKT L AW +LW + TRP Sbjct: 25 AEPDDWQAETLQAL---------ADNPRV---AVRSGHGVGKTALEAWALLWFLFTRPYP 72 Query: 109 SIICIANSETQLKNTLWAEVSKWLSMLPH-RHWFEMQSLSL----HPSGWYAELLEQSMG 163 I C A + QL + LWAE SKWL P + +FE Q + +P W+A Sbjct: 73 KIPCTAPTREQLHDILWAEASKWLERAPALKPYFEWQKTRIVQKQYPGRWFA-------- 124 Query: 164 IDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRF 223 T RT + +P+ G H H + + DEASG D I ++I G T + Sbjct: 125 --------TARTSN--KPENMAGFHEEHLLFII-DEASGIADNIFETIEGALTTSDAK-- 171 Query: 224 WIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSG-----FHEGIISRYGLDSD 278 +M N + +G F+D F +D Y TR V +DS + E + +Y DSD Sbjct: 172 LLMCGNPTKNSGVFHDAF---FKDRSLYW--TRKVSCLDSQRVTLEYAERLKRKYHEDSD 226 Query: 279 VARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRR 338 V R+ +LG+FP+ E + FI + +E A R+ D L +G D+A G D+TV+ R Sbjct: 227 VYRVRVLGEFPKAEPDTFISLDIVEAATMRDVEPD--GVLEIGVDVARFGDDETVLAARA 284 Query: 339 G 339 G Sbjct: 285 G 285 >gi|153810665|ref|ZP_01963333.1| hypothetical protein RUMOBE_01049 [Ruminococcus obeum ATCC 29174] gi|149833061|gb|EDM88143.1| hypothetical protein RUMOBE_01049 [Ruminococcus obeum ATCC 29174] Length = 469 Score = 134 bits (337), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 87/264 (32%), Positives = 136/264 (51%), Gaps = 17/264 (6%) Query: 81 AISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHW 140 ++ +G GIGK+ + AW ++W + T P I C A ++ QL + LWAE+SKW R+ Sbjct: 44 SVRSGHGIGKSAVEAWSVIWFMCTHPYPKIPCTAPTQHQLFDILWAEISKW-----KRNN 98 Query: 141 FEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEA 200 + S + W E L M ++ + RT S PD G H H + + DEA Sbjct: 99 KTLDSELI----WTKEKL--YMKGHAEEWFAVARTAST--PDALQGFHAEHMLYII-DEA 149 Query: 201 SGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEG 260 SG D I + +LG + P +M N +L+G+FYD N E + + ID R Sbjct: 150 SGVEDKIFEPVLGALS--TPGAKLLMCGNPTQLSGFFYDSHNKNREQYSTFHIDGRNSTR 207 Query: 261 IDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLI- 319 + F + II+ YG DSDV R+ + G FP E + +IP +E++++ E + +I Sbjct: 208 VSQEFVQTIINMYGEDSDVFRVRVAGDFPLAEDDIYIPLPLVEKSIATEYFPRRHPQIIH 267 Query: 320 MGCDIAGEGGDKTVVVFRRGNIIE 343 +GCD+A G DKTV+ +R ++ Sbjct: 268 IGCDVARFGTDKTVIGYRTDEKVQ 291 >gi|54302246|ref|YP_132239.1| terminase large subunit [Photobacterium profundum SS9] gi|46915667|emb|CAG22439.1| hypothetical protein PBPRB0566 [Photobacterium profundum SS9] Length = 513 Score = 133 bits (334), Expect = 5e-29, Method: Compositional matrix adjust. Identities = 97/324 (29%), Positives = 151/324 (46%), Gaps = 25/324 (7%) Query: 31 FVMRFFPWGIK------------GKPLEHFSQPHRWQLEFMEAV-DVHCHSNVNNSNPT- 76 FVM +PW + P W E + + +V ++ N +P Sbjct: 27 FVMYAYPWDTDPDLQIVKLPEPWASKYDSVYGPDAWFCEMCDQLQEVIRKNDFNGVDPVD 86 Query: 77 IFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLP 136 F +IS+G GIGK+ ++W++ +++STRP + +N+ QL+ W E+ KW L Sbjct: 87 AFLYSISSGHGIGKSCASSWLIHFVMSTRPNSKGVVTSNTSEQLRTKTWGELGKWTKKLI 146 Query: 137 HRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVF 196 ++HWF + + + ++ + E + +D++ TCR EE ++F G H + Sbjct: 147 NKHWFVYNNGKGNMNFYHKDYAE-TWRVDAQ----TCR---EENSESFAGLHCASSTPWY 198 Query: 197 -NDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDT 255 DEAS PD I + G T+ P FW + N R +G F + + + W R QID+ Sbjct: 199 LFDEASAVPDKIWEVAEGGLTDGEP--FWFVFGNPTRNSGRFRECWRRFRQRWNRKQIDS 256 Query: 256 RTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLY 315 TV+ + S YG DSD R+ + G FP N I +E AMSR A Sbjct: 257 STVQVTNKKKISEWESDYGEDSDFYRVRVKGVFPSASSNQKISGALLEAAMSRTAHVIPG 316 Query: 316 APLIMGCDIAGEGGDKTVVVFRRG 339 +P +M D+A GGD V FR G Sbjct: 317 SPRVMSLDVARGGGDNCVFRFRHG 340 >gi|332976102|gb|EGK12970.1| hypothetical protein HMPREF9374_1123 [Desmospora sp. 8437] Length = 462 Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 91/300 (30%), Positives = 144/300 (48%), Gaps = 42/300 (14%) Query: 49 SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGM 108 ++P WQ D+ + +N + A+ AG G+GKT AW +LW + TRP Sbjct: 31 AEPDEWQ-------DIALQALADNQ-----RVAVRAGHGVGKTATEAWAVLWFLLTRPFP 78 Query: 109 SIICIANSETQLKNTLWAEVSKWL----SMLPHRHWFEMQ-SLSLHPSGWYAELLEQSMG 163 I C A ++ QL + LW E++KWL + P+ W + + + + W+A Sbjct: 79 KIPCTAPTKPQLMDVLWPEIAKWLMNAPELAPYVEWQKTRVVMKQYEERWFA-------- 130 Query: 164 IDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRF 223 T RT + +P+ G H H + V DEASG + I ++I G T Sbjct: 131 --------TARTSN--KPENMAGFHEEHLLFVI-DEASGVDNAIFETIDGALTTAGSK-- 177 Query: 224 WIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIE 283 +M N R NG FYD F+ + + Y+I + + + +YG DSD+ R+ Sbjct: 178 LVMFGNPTRTNGVFYDAFHQDRDLYWTYKISCLDSKMASKDYARNMARKYGEDSDIYRVR 237 Query: 284 ILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNI 341 + G+FPQ + ++FIP +E+A R E ID+ L +G D+A G D+TV+ R G + Sbjct: 238 VQGEFPQGDPDSFIPLELVEDARVRDLEWIDE--DELHIGVDVARFGSDETVLAARIGPV 295 >gi|257883493|ref|ZP_05663146.1| conserved hypothetical protein [Enterococcus faecium 1,231,502] gi|294614775|ref|ZP_06694675.1| hypothetical protein EfmE1636_0865 [Enterococcus faecium E1636] gi|294622490|ref|ZP_06701512.1| conserved hypothetical protein [Enterococcus faecium U0317] gi|257819151|gb|EEV46479.1| conserved hypothetical protein [Enterococcus faecium 1,231,502] gi|291592387|gb|EFF23996.1| hypothetical protein EfmE1636_0865 [Enterococcus faecium E1636] gi|291598037|gb|EFF29147.1| conserved hypothetical protein [Enterococcus faecium U0317] Length = 471 Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 86/307 (28%), Positives = 151/307 (49%), Gaps = 21/307 (6%) Query: 35 FFPWGIKGKPLEHF-SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTL 93 F P+ G ++++ +P + + + NV N K ++ +G+G+GKT L Sbjct: 5 FIPFADIGSAIDYYYDKPVAFCQDILHLNPDEWQENVLNDLAEFSKVSVRSGQGVGKTAL 64 Query: 94 NAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGW 153 A +LW ++ RP +I A + QL + LWAEV+KWL+ ++ + ++ G Sbjct: 65 EAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLNDSLIKNLLKWTKTKIYMVG- 123 Query: 154 YAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILG 213 DS+ + T RT + +P+ G H H M + DEASG D I ++ILG Sbjct: 124 -----------DSERWFATARTAT--KPENMQGFHEDH-MLIVVDEASGVSDPIMEAILG 169 Query: 214 FFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRY 273 + + N+ +M N + G FYD N + ++ +++ + + + E I+ +Y Sbjct: 170 TLSGFD-NKL-LMCGNPNNIEGVFYDSHNSDRDKYRVHKVSSYDSKRTNKDNIEMILKKY 227 Query: 274 GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLI---MGCDIAGEGGD 330 G +SDVAR+ I G+FP+ +++FI +E A ++ D L +G D+A G D Sbjct: 228 GKESDVARVRIFGEFPKGALDSFISLETVELATEKQISDSLVNKTTVAHIGVDVARYGDD 287 Query: 331 KTVVVFR 337 T++ R Sbjct: 288 STILFPR 294 >gi|261208032|ref|ZP_05922709.1| conserved hypothetical protein [Enterococcus faecium TC 6] gi|289567088|ref|ZP_06447483.1| conserved hypothetical protein [Enterococcus faecium D344SRF] gi|260077749|gb|EEW65463.1| conserved hypothetical protein [Enterococcus faecium TC 6] gi|289161103|gb|EFD09008.1| conserved hypothetical protein [Enterococcus faecium D344SRF] Length = 471 Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 86/307 (28%), Positives = 151/307 (49%), Gaps = 21/307 (6%) Query: 35 FFPWGIKGKPLEHF-SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTL 93 F P+ G ++++ +P + + + NV N K ++ +G+G+GKT L Sbjct: 5 FIPFADIGAAIDYYYDKPVAFCQDILHLNPDEWQENVLNDLAEFSKVSVRSGQGVGKTAL 64 Query: 94 NAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGW 153 A +LW ++ RP +I A + QL + LWAEV+KWL+ ++ + ++ G Sbjct: 65 EAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLNDSLIKNLLKWTKTKIYMVG- 123 Query: 154 YAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILG 213 DS+ + T RT + +P+ G H H M + DEASG D I ++ILG Sbjct: 124 -----------DSERWFATARTAT--KPENMQGFHEDH-MLIVVDEASGVSDPIMEAILG 169 Query: 214 FFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRY 273 + + N+ +M N + G FYD N + ++ +++ + + + E I+ +Y Sbjct: 170 TLSGFD-NKL-LMCGNPNNIEGVFYDSHNSDRDKYRVHKVSSYDSKRTNKDNIEMILKKY 227 Query: 274 GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLI---MGCDIAGEGGD 330 G +SDVAR+ I G+FP+ +++FI +E A ++ D L +G D+A G D Sbjct: 228 GKESDVARVRIFGEFPKGALDSFISLETVELATEKQISDSLVNKTTVAHIGVDVARYGDD 287 Query: 331 KTVVVFR 337 T++ R Sbjct: 288 STILFPR 294 >gi|160940775|ref|ZP_02088117.1| hypothetical protein CLOBOL_05669 [Clostridium bolteae ATCC BAA-613] gi|158436295|gb|EDP14062.1| hypothetical protein CLOBOL_05669 [Clostridium bolteae ATCC BAA-613] Length = 484 Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 82/262 (31%), Positives = 127/262 (48%), Gaps = 33/262 (12%) Query: 81 AISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRH- 139 ++ +G GIGK+ + AW ++W + TRP I C A +E QL + LWAE+SKW+ P Sbjct: 44 SVRSGHGIGKSAVEAWSVIWYMCTRPFPKIPCTAPTEHQLMDVLWAEISKWMRNNPALRD 103 Query: 140 ---WF-EMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAV 195 W E + HP W+A RT + P+ G H H + + Sbjct: 104 DLIWTKEKLYMQGHPEEWFA----------------VPRTATN--PEALQGFHAEHVLYI 145 Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDT 255 DEASG D + + +LG T + +M N RL G+FYD + E + +D Sbjct: 146 I-DEASGVSDKVFEPVLGAMT--GEDAKLLMMGNPTRLAGFFYDSHHRNREQYSAIHVDG 202 Query: 256 RTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLY 315 R + + F + II +G DSDV R+ + GQFP+ ++ I + EEA + + +Y Sbjct: 203 RDSQHVSRTFVQKIIDMFGEDSDVFRVRVAGQFPKSTPDSLIAMEWCEEAANLQ----VY 258 Query: 316 AP---LIMGCDIAGEGGDKTVV 334 AP + +G D+A G D + + Sbjct: 259 APGGQIDIGVDVARYGDDSSAL 280 >gi|319956916|ref|YP_004168179.1| hypothetical protein Nitsa_1177 [Nitratifractor salsuginis DSM 16511] gi|319419320|gb|ADV46430.1| hypothetical protein Nitsa_1177 [Nitratifractor salsuginis DSM 16511] Length = 462 Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 96/331 (29%), Positives = 153/331 (46%), Gaps = 42/331 (12%) Query: 42 GKPLEHF------SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNA 95 K LE F ++P + Q++ + A+D K +I +G G GKTTL A Sbjct: 13 AKSLEFFVRVILKAKPTKQQMKAIRAIDQGKK-----------KISIRSGHGTGKTTLLA 61 Query: 96 WMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYA 155 W++LW R I A + QL + L E+ KW +P ++ E+ Sbjct: 62 WIVLWWGLGREDAKIPMTAPTGHQLYDLLMPEIRKWREKMPVQYQNEV------------ 109 Query: 156 ELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFF 215 E+ + + + ++ + RT +++P+ G H T+ +A DEASG P +I + G Sbjct: 110 EVKTEKIDFANGNFAVP-RTARKDQPEALQGFHATN-LAFIIDEASGIPQVIFEVAEGAM 167 Query: 216 TELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGL 275 T + IM +N R G+FYD + W+ +Q + E + + E +YG Sbjct: 168 T--GESTLVIMAANPTRTEGYFYDSHHKNRWQWECFQFNAEESENVSKEWIEEKKRQYGE 225 Query: 276 DSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVV 335 DSDV R+ I G+FP+Q N +++A +RE +DD A + G D+A G DK+V+ Sbjct: 226 DSDVYRVRIKGEFPRQSSNAVFSLQEVDDATTREIVDDSGAE-VWGLDVADFGDDKSVLA 284 Query: 336 FRRGNIIEHIF--------DWSAKLIQETNQ 358 R+G I D + LI E NQ Sbjct: 285 KRKGKHFHEITARSGLTLPDLAGWLIYEYNQ 315 >gi|228950291|ref|ZP_04112468.1| hypothetical protein bthur0007_63570 [Bacillus thuringiensis serovar monterrey BGSC 4AJ1] gi|228809453|gb|EEM55897.1| hypothetical protein bthur0007_63570 [Bacillus thuringiensis serovar monterrey BGSC 4AJ1] Length = 495 Score = 120 bits (300), Expect = 4e-25, Method: Compositional matrix adjust. Identities = 85/311 (27%), Positives = 136/311 (43%), Gaps = 54/311 (17%) Query: 50 QPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMS 109 +P WQ E + + H H +V +G+G+GKT + +W+ +W + RP Sbjct: 40 EPDPWQKEVLNDIANHSHVSVR------------SGQGVGKTAMESWICIWFLCCRPYPK 87 Query: 110 IICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSG----WYAELLEQSMGID 165 IIC A ++ QL + LWAE++KWL+ + + ++ G W+A Sbjct: 88 IICTAPTKQQLYDVLWAEIAKWLNSSQVKDLLKWTKTKIYMKGFEDRWFA---------- 137 Query: 166 SKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWI 225 T +T + RP+ G H + M DEASG D I ++ILG + F Sbjct: 138 ------TAKTAT--RPENMQGFHEDY-MLFIADEASGIADDIMEAILGTLSGSENKLF-- 186 Query: 226 MTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL 285 M N + +G F+D N +K +++ + E + +YG SDV R+ + Sbjct: 187 MCGNPTKTSGVFFDSHNKDRALYKSHKVSSADSPRTSKKNIEMLKKKYGEGSDVYRVRVE 246 Query: 286 GQFPQQEVNNFIPHNYIEEAMSREAIDDLY-----------------APLIMGCDIAGEG 328 G+FP+ E + FI E A RE A + +GCD+A G Sbjct: 247 GEFPRGEADAFISLETAEAARMREVYKVEVIENEEEESTVKEIIPDTAVVEIGCDVARFG 306 Query: 329 GDKTVVVFRRG 339 D+T++ RRG Sbjct: 307 SDETIIATRRG 317 >gi|228968731|ref|ZP_04129698.1| hypothetical protein bthur0004_54930 [Bacillus thuringiensis serovar sotto str. T04001] gi|228790961|gb|EEM38595.1| hypothetical protein bthur0004_54930 [Bacillus thuringiensis serovar sotto str. T04001] Length = 459 Score = 120 bits (300), Expect = 4e-25, Method: Compositional matrix adjust. Identities = 82/282 (29%), Positives = 138/282 (48%), Gaps = 26/282 (9%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 K ++ +G+G+GKT L + +++W + RP +IC A ++ QL LWAE++KWL + Sbjct: 39 KVSVRSGQGVGKTGLESVVVIWFLCCRPNPKVICTAPTKEQLFTVLWAEIAKWLEGSAVK 98 Query: 139 HWFEMQSLSLHPSG----WYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMA 194 + + ++ G W+A T RT + +P+ G H + M Sbjct: 99 NLLKWTKTRVYMIGSEERWFA----------------TARTAT--KPENMQGFHEDY-ML 139 Query: 195 VFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQID 254 DEASG D I ++ILG + F + N R +G FYD N + +K +++ Sbjct: 140 FVCDEASGIADPIMEAILGTLSGAENKLF--LCGNPTRTSGVFYDSHNRDRDLYKIHKVS 197 Query: 255 TRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDL 314 + E + +YG SDV R+ +LG+FP+ E + FIP +E+A S + ++ Sbjct: 198 SLDSPRTSKDNIEVLKKKYGEGSDVWRVRVLGEFPKAEADAFIPLEIVEQAASCK-VEPT 256 Query: 315 YAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQET 356 L +G D+A G D+TV+ R GN + + + + ET Sbjct: 257 GETLDLGVDVARFGDDETVIAPRIGNKVFKLLNHYKQDTMET 298 >gi|228911519|ref|ZP_04075310.1| hypothetical protein bthur0013_56490 [Bacillus thuringiensis IBL 200] gi|228848128|gb|EEM92991.1| hypothetical protein bthur0013_56490 [Bacillus thuringiensis IBL 200] Length = 459 Score = 120 bits (300), Expect = 4e-25, Method: Compositional matrix adjust. Identities = 82/282 (29%), Positives = 138/282 (48%), Gaps = 26/282 (9%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 K ++ +G+G+GKT L + +++W + RP +IC A ++ QL LWAE++KWL + Sbjct: 39 KVSVRSGQGVGKTGLESVVVIWFLCCRPNPKVICTAPTKEQLFTVLWAEIAKWLEGSAVK 98 Query: 139 HWFEMQSLSLHPSG----WYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMA 194 + + ++ G W+A T RT + +P+ G H + M Sbjct: 99 NLLKWTKTRVYMIGSEERWFA----------------TARTAT--KPENMQGFHEDY-ML 139 Query: 195 VFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQID 254 DEASG D I ++ILG + F + N R +G FYD N + +K +++ Sbjct: 140 FVCDEASGIADPIMEAILGTLSGAENKLF--LCGNPTRTSGVFYDSHNRDRDLYKIHKVS 197 Query: 255 TRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDL 314 + E + +YG SDV R+ +LG+FP+ E + FIP +E+A S + ++ Sbjct: 198 SLDSPRTSKDNIEVLKKKYGEGSDVWRVRVLGEFPKAEADAFIPLEIVEQAASCK-VEPT 256 Query: 315 YAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQET 356 L +G D+A G D+TV+ R GN + + + + ET Sbjct: 257 GETLDLGVDVARFGDDETVIAPRIGNKVFKLLNHYKQDTMET 298 >gi|150390341|ref|YP_001320390.1| hypothetical protein Amet_2579 [Alkaliphilus metalliredigens QYMF] gi|149950203|gb|ABR48731.1| conserved hypothetical protein [Alkaliphilus metalliredigens QYMF] Length = 469 Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust. Identities = 86/280 (30%), Positives = 136/280 (48%), Gaps = 21/280 (7%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 K ++ +G+G+GKT L + + W + TRP +I A + QL + LWAE+SKWLS Sbjct: 44 KVSVRSGQGVGKTGLESIAITWYLCTRPFPKVIATAPTRQQLYDVLWAEISKWLSKSKVD 103 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFND 198 ++ +G+ + + T RT RP+ G H + + V D Sbjct: 104 KLLRWTKTKIYMNGF------------EERWWATARTAV--RPENMQGFHEDYMLFVV-D 148 Query: 199 EASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTV 258 EASG D I ++ILG T N+ ++ N + +G FYD N + +K +++ + Sbjct: 149 EASGVADPIMEAILGTLTGYE-NKL-LLCGNPTKTSGTFYDSHNRDRDTYKSHKVSSMDS 206 Query: 259 EGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAP- 317 E + +YG DSDV R+ +LG FP+ E ++ I E+A E + D+ Sbjct: 207 PRTSKENIEMLKKKYGADSDVFRVRVLGDFPKGEADSLISLEVTEQAA--ETVVDISNAY 264 Query: 318 -LIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQET 356 L +G DIA G DKT++ R GN + + +S K ET Sbjct: 265 TLNIGADIARFGDDKTIIAPRIGNRVLDLQQYSKKDTMET 304 >gi|323486060|ref|ZP_08091391.1| hypothetical protein HMPREF9474_03142 [Clostridium symbiosum WAL-14163] gi|323400627|gb|EGA92994.1| hypothetical protein HMPREF9474_03142 [Clostridium symbiosum WAL-14163] Length = 476 Score = 119 bits (298), Expect = 7e-25, Method: Compositional matrix adjust. Identities = 82/265 (30%), Positives = 132/265 (49%), Gaps = 17/265 (6%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 K AI +G+G+GKT + A +LW + P I+ A ++ QL + LW+EVSKW+S P Sbjct: 52 KVAIKSGQGVGKTGMEAVALLWFLCCYPYPRIVATAPTKQQLHDVLWSEVSKWMSKSP-- 109 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFND 198 + S L + Y ++ + K + RT + +P+ G H + M D Sbjct: 110 ----LLSDILKWTKTYIYMVG-----NEKRWFAVARTAT--KPENMQGFHEDN-MLFIVD 157 Query: 199 EASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTV 258 EASG D I ++ILG + N +M N R +G FYD FN+ ++ + + + Sbjct: 158 EASGVADPIMEAILGTLS--GANNKLLMCGNPTRTSGTFYDAFNVDRSIYRCHTVSSADS 215 Query: 259 EGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPL 318 + + E +I +YG DS+V + + G+FP+QE + FI + +E + DD+ Sbjct: 216 KRTNKQNIESLIRKYGKDSNVVLVRVFGEFPKQEDDVFIALSIVEHCCMLDLPDDVPIKR 275 Query: 319 I-MGCDIAGEGGDKTVVVFRRGNII 342 I G D+A G D+TV+ G I Sbjct: 276 ISFGVDVARYGSDETVIAKNVGGRI 300 >gi|282598712|ref|YP_003358792.1| putative phage terminase B protein [Enterococcus phage phiEf11] gi|300860603|ref|ZP_07106690.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11] gi|307292389|ref|ZP_07572245.1| hypothetical protein HMPREF9509_02682 [Enterococcus faecalis TX0411] gi|258598082|gb|ACV83339.1| putative phage terminase B protein [Enterococcus phage phiEf11] gi|300849642|gb|EFK77392.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11] gi|306496518|gb|EFM66079.1| hypothetical protein HMPREF9509_02682 [Enterococcus faecalis TX0411] gi|315146097|gb|EFT90113.1| conserved hypothetical protein [Enterococcus faecalis TX2141] Length = 484 Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 79/264 (29%), Positives = 133/264 (50%), Gaps = 20/264 (7%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 K ++ +G+G+GKT L A +LW ++ RP +I A + QL + LWAEV+KWL+ + Sbjct: 50 KVSVRSGQGVGKTALEAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLNNSLIK 109 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFND 198 + ++ G DS+ + T RT + +P+ G H H M + D Sbjct: 110 DLLKWTKTKIYMVG------------DSERWFATARTAT--KPENMQGFHEDH-MLIVVD 154 Query: 199 EASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTV 258 EASG D I ++ILG + + N+ +M N + G FYD N + ++ +++ + Sbjct: 155 EASGVADPIMEAILGTLSGFD-NKL-LMCGNPNNIEGVFYDSHNTDRDKYRTHKVSSYDS 212 Query: 259 EGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPL 318 + + + +I +YG +SDVAR+ I G+FP+ +++FI +E A D + Sbjct: 213 KRTNKENIQMLIDKYGENSDVARVRIYGEFPKGALDSFISLEIVEFAKDINISDSELKHV 272 Query: 319 I---MGCDIAGEGGDKTVVVFRRG 339 +G D+A G D T+V R G Sbjct: 273 REGHIGVDVARFGDDSTIVFPRIG 296 >gi|266623290|ref|ZP_06116225.1| putative terminase B protein [Clostridium hathewayi DSM 13479] gi|288864932|gb|EFC97230.1| putative terminase B protein [Clostridium hathewayi DSM 13479] Length = 484 Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 73/259 (28%), Positives = 130/259 (50%), Gaps = 27/259 (10%) Query: 81 AISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRH- 139 ++ +G G+GK+ + +W ++W + TRP I C A ++ QL + LWAE+SKWL P Sbjct: 44 SVRSGHGVGKSAVESWSVIWFLCTRPFPKIPCTAPTQHQLYDILWAEISKWLRNNPELKN 103 Query: 140 ---WFEMQS-LSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAV 195 W + + ++ +P W+A RT + P+ G H H + + Sbjct: 104 DIIWTQQRVYMNGYPEEWFA----------------VPRTATN--PEALQGFHAEHVLYI 145 Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDT 255 DEASG D + + +LG T + +M N RL+G+F+D + ++ ID Sbjct: 146 I-DEASGVSDKVFEPVLGAMT--GEDAKLLMMGNPTRLSGFFFDSHHKSRSEYSAMHIDG 202 Query: 256 RTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLY 315 R + ++ F + II+ +G+DSDV R+ + GQFP+ ++ I ++ E A + + + Sbjct: 203 RDSQHVNQKFVQKIINMFGMDSDVFRVRVAGQFPKSTPDSLIMMDWCEAATQLKP-ETVR 261 Query: 316 APLIMGCDIAGEGGDKTVV 334 + +G D+A G D + + Sbjct: 262 NRVDIGVDVARYGDDSSAL 280 >gi|150016512|ref|YP_001308766.1| hypothetical protein Cbei_1636 [Clostridium beijerinckii NCIMB 8052] gi|149902977|gb|ABR33810.1| conserved hypothetical protein [Clostridium beijerinckii NCIMB 8052] Length = 470 Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 87/288 (30%), Positives = 145/288 (50%), Gaps = 36/288 (12%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 K ++ +G+G+GKT L + ++ W + TRP +I A + QL + LWAE+SKWL+ Sbjct: 44 KVSVRSGQGVGKTGLESIVVTWYLCTRPFPKVIATAPTRQQLYDVLWAEISKWLASSKIE 103 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFND 198 + E ++ G+ S+ + T +T + RP+ G H + + V D Sbjct: 104 NLLEWTKTKIYMKGY------------SERWWATAKTAT--RPENMQGFHEDYMLFVV-D 148 Query: 199 EASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDT--- 255 EASG D I ++ILG T N+ +M N R +G FYD N + +K +++ + Sbjct: 149 EASGVADPIMEAILGTLTGYE-NKL-LMCGNPTRTSGTFYDSHNRDRDLYKTFKVSSLES 206 Query: 256 -RT----VEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREA 310 RT +E + +HEG SDV R+ + G+FP+ E ++ I Y E A + Sbjct: 207 PRTSKDNIEMLKRKYHEG--------SDVWRVRVEGEFPKGESDSLISLEYAETA-TITK 257 Query: 311 IDDLYA--PLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQET 356 I++++ L +G DIA G D++V+ R GN + + ++ K ET Sbjct: 258 INNIHNNFTLHIGADIARFGNDESVIAPRIGNKVFDLLTYTKKDTMET 305 >gi|209901239|ref|YP_002290878.1| putative terminase B [Clostridium phage phiCD27] gi|199612120|gb|ACH91293.1| putative terminase B [Clostridium phage phiCD27] Length = 469 Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 86/287 (29%), Positives = 140/287 (48%), Gaps = 35/287 (12%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 K +I +G+G+GKT L + +W +STRP ++ A + QL + LWAE++KWLS Sbjct: 44 KVSIRSGQGVGKTGLESIATVWYLSTRPFPKVVATAPTRQQLYDVLWAEIAKWLSNSKVE 103 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFND 198 E ++ G+ + + T RT +P+ G H + + V D Sbjct: 104 KLLEWTKTKVYMKGF------------EERWWATARTAV--KPENMQGFHEDYMLFVV-D 148 Query: 199 EASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDT--- 255 EASG D I ++ILG + N+ ++ N R +G FYD N + +K +++ + Sbjct: 149 EASGVADPIMEAILGTLSGAE-NKL-LLCGNPTRTSGTFYDSHNRDRDLYKTFKVSSLDS 206 Query: 256 -RT----VEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREA 310 RT +E + +HEG SD R+ +LG+FP+ E ++ I +E + RE Sbjct: 207 PRTSKDNIEMLKRKYHEG--------SDPWRVRVLGEFPKGESDSLISLEAVETSTIREV 258 Query: 311 -IDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQET 356 I + Y L +G DIA G D+T++ R G + + +S K ET Sbjct: 259 NISNDYI-LNIGADIARYGDDETIIAPRIGGKVFDLLTYSKKDTMET 304 >gi|253578914|ref|ZP_04856185.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA] gi|251849857|gb|EES77816.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39BFAA] Length = 473 Score = 109 bits (273), Expect = 6e-22, Method: Compositional matrix adjust. Identities = 91/293 (31%), Positives = 149/293 (50%), Gaps = 41/293 (13%) Query: 50 QPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMS 109 +P WQ + A D+ +NP K +I +G+G+GKT L A + LW ++ P Sbjct: 17 EPDEWQAQ--AARDLA-------ANP---KVSIKSGQGVGKTGLEAAVFLWFVTCFPHPR 64 Query: 110 IICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHY 169 I+ A ++ QL + LW+E+SKW+S E+ S+ L + Y ++ + K + Sbjct: 65 IVATAPTKQQLHDVLWSEISKWMSK------SELLSILLKWTKTYVYMVGE-----EKRW 113 Query: 170 TITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSN 229 RT + +P+ G H + M DEASG D I ++ILG + N N+ ++ N Sbjct: 114 FGVARTAT--KPENMQGFHEDN-MLFIVDEASGVADPIMEAILGTLSGAN-NKL-LLCGN 168 Query: 230 TRRLNGWFYDIFNIPLEDWKRYQI----DTRT-VEGIDSGFHEGIISRYGLDSDVARIEI 284 + +G FYD +K + + TRT E IDS ++ +YG DS+V R+ + Sbjct: 169 PTKTSGTFYDSHTRDRALYKCHTVSSMDSTRTNKENIDS-----LVRKYGWDSNVVRVRV 223 Query: 285 LGQFPQQEVNNFIPHNYIEEAMSR-EAIDDLYAP--LIMGCDIAGEGGDKTVV 334 G+FP QE + FIP + IE+ S+ +DD + +G D+A G D+T++ Sbjct: 224 RGEFPNQEDDVFIPLSLIEQCSSKLLELDDADGMQFVSLGVDVARFGDDETII 276 >gi|255282256|ref|ZP_05346811.1| conserved hypothetical protein [Bryantella formatexigens DSM 14469] gi|255267204|gb|EET60409.1| conserved hypothetical protein [Bryantella formatexigens DSM 14469] Length = 506 Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust. Identities = 76/289 (26%), Positives = 131/289 (45%), Gaps = 33/289 (11%) Query: 50 QPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMS 109 +P WQ + + +D+ S V A+ +G+G+GKT + A +LW +S Sbjct: 49 EPDEWQRDAL--MDLAEESRV----------AVKSGQGVGKTGIEAVAVLWFLSCFRYAR 96 Query: 110 IICIANSETQLKNTLWAEVSKWLSMLPH-RHWFEMQSLSLHPSGWYAELLEQSMGIDSKH 168 ++ A + QL + LW+E++KW P + ++ G+ K Sbjct: 97 VVATAPTRQQLHDVLWSEIAKWQERSPLLKAILRWTKTYVYVKGY------------EKR 144 Query: 169 YTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTS 228 + RT + +P+ G H + M DEASG D I +++LG + N +M Sbjct: 145 WFAVARTAT--KPENMQGFHEDN-MLFIVDEASGVADPIMEAVLGTLS--GGNNKLLMCG 199 Query: 229 NTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQF 288 N R G FYD F + + + + D + +I +YG DS++ R+ + G F Sbjct: 200 NPTRTTGTFYDAFTKDRSIFACHTVSSLDSSRTDKNNIDALIRKYGEDSNLVRVRVKGLF 259 Query: 289 PQQEVNNFIPHNYIEEAMSRE---AIDDLYAPLIMGCDIAGEGGDKTVV 334 P+Q+ + FI I++ SR+ A +I+G D+A G D+TV+ Sbjct: 260 PKQDDDVFISQELIDQCTSRQYELPESRGMAQVILGVDVARYGNDETVI 308 >gi|308069786|ref|YP_003871391.1| hypothetical protein PPE_03030 [Paenibacillus polymyxa E681] gi|305859065|gb|ADM70853.1| Conserved hypothetical protein [Paenibacillus polymyxa E681] Length = 452 Score = 99.8 bits (247), Expect = 7e-19, Method: Compositional matrix adjust. Identities = 80/260 (30%), Positives = 121/260 (46%), Gaps = 28/260 (10%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 + ++ +G+G+GKT L A LW +S P +IC A + QL + LWAE++KW S P Sbjct: 22 RVSVRSGQGVGKTGLEAATALWFLSCFPYPKVICTAPTRQQLHDVLWAEINKWQSKSP-- 79 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFND 198 + L + Y + E+ + T RT + +P+ G H + M D Sbjct: 80 --VLKRILKWTKTKIYMKNYEE-------RWFATARTAT--KPENMQGLHEDY-MLFIVD 127 Query: 199 EASGTPDIINKSILGFFT-ELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRT 257 EASG D I ++ILG + E N +M N + +G FYD N D+K TR Sbjct: 128 EASGVADPIMEAILGTLSGEFNK---ILMCGNPTKTSGVFYDSHNKDRADYK-----TRK 179 Query: 258 VEGIDSGFHEG-----IISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAID 312 V +DS + +YG SDV R+ + G+FP+ + FI E A ++ Sbjct: 180 VSCLDSPRTSKDNIAMLKRKYGEGSDVWRVRVEGEFPRGGSDTFISLEVAEFAAKEVKLE 239 Query: 313 DLYAPLIMGCDIAGEGGDKT 332 L +G D+A G D+T Sbjct: 240 PTGDMLTIGVDVARFGDDET 259 >gi|289578588|ref|YP_003477215.1| hypothetical protein Thit_1395 [Thermoanaerobacter italicus Ab9] gi|289528301|gb|ADD02653.1| conserved hypothetical protein [Thermoanaerobacter italicus Ab9] Length = 460 Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 82/318 (25%), Positives = 140/318 (44%), Gaps = 52/318 (16%) Query: 40 IKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMML 99 +KG P E Q E ++AV H + A+ A G+GKT + AW+ L Sbjct: 27 LKGDPWEK-------QEEILKAVRDHK------------RVAVRACHGVGKTKVAAWVAL 67 Query: 100 WLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLE 159 W + T +I A + Q++N LW E+ + S P G ++L+ Sbjct: 68 WFLYTHHNSKVITTAPTWHQVENLLWREIHA------------AHAASRIPLG--GKVLQ 113 Query: 160 QSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELN 219 + + + + + T ++P+ F G H H + + DEASG + GF T + Sbjct: 114 TQIELGEQWFALGLST---DKPERFQGFHAEHILLIV-DEASGVEQYTFDAAEGFLTSIG 169 Query: 220 PNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSG-----------FHEG 268 ++ N +L+G FY+ F PL + + I + +G + E Sbjct: 170 AK--LLLIGNPTQLSGEFYNAFRSPL--YHKIHISAFDSPNLKAGKIVRPYLVTPEWVED 225 Query: 269 IISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEG 328 ++G DS + +LG+FP+Q + IP +IE A R + + P+ +G D+A G Sbjct: 226 KRLKWGEDSPLWYSRVLGEFPEQGNDTLIPLAWIEAAQQRWHMTEAGEPVEIGADVARYG 285 Query: 329 GDKTVVVFRRGNIIEHIF 346 D TV++ RRG+ E ++ Sbjct: 286 TDTTVIMLRRGDKAEIVY 303 >gi|167767949|ref|ZP_02440002.1| hypothetical protein CLOSS21_02492 [Clostridium sp. SS2/1] gi|167710278|gb|EDS20857.1| hypothetical protein CLOSS21_02492 [Clostridium sp. SS2/1] gi|291560988|emb|CBL39788.1| hypothetical protein CL2_30180 [butyrate-producing bacterium SSC/2] Length = 473 Score = 96.3 bits (238), Expect = 7e-18, Method: Compositional matrix adjust. Identities = 84/302 (27%), Positives = 133/302 (44%), Gaps = 43/302 (14%) Query: 48 FSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPG 107 F P WQ E A+ +NS K I +G+G+GKT A +LW +S Sbjct: 30 FFYPDEWQKEAAFALR-------DNS-----KVTIKSGQGVGKTGFEAATLLWFLSCFEN 77 Query: 108 MSIICIANSETQLKNTLWAEVSKWLSMLPH----RHWFEMQ-SLSLHPSGWYAELLEQSM 162 ++ A + QL + LWAEVSKW S P W + + S+ WYA Sbjct: 78 ARVVATAPTLHQLNDVLWAEVSKWQSKSPLLKEILQWTKTKISMIGSKERWYA------- 130 Query: 163 GIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNR 222 RT + P+ G H + M DEASG D I ++ILG T N Sbjct: 131 ---------VARTATT--PENMQGFHEDN-MLFIVDEASGVADPIMEAILGTLT--GSNN 176 Query: 223 FWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARI 282 ++ N + +G FYD + + +++ + + + +I +YG +S+V R+ Sbjct: 177 KLLLCGNPTKASGTFYDSHTSDRKLYYCITVNSAESKRTNKDNIDSLIRKYGEESNVVRV 236 Query: 283 EILGQFPQQEVNNFIPHNYIEEAMSREAI--DDLYAPLIMGCDIAGEGGDKTVVVFRRGN 340 + G FP+Q+ + ++P +E ++ E I D+ +G D+A G D TV+ N Sbjct: 237 RVKGLFPKQDDDVYMPLEMLEASIILEEIPPADI---CTLGVDVARFGDDDTVIARNMNN 293 Query: 341 II 342 I Sbjct: 294 KI 295 >gi|315122636|ref|YP_004063125.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496038|gb|ADR52637.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 301 Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 49/145 (33%), Positives = 78/145 (53%), Gaps = 11/145 (7%) Query: 26 LSFKNFVMRFFPWGIKGKPLEHFSQPHRWQ----LEFMEAVDVHCHSNVNNSNPTIFKCA 81 L+F ++ R WG +G PL + P WQ LE E ++ + + +FK A Sbjct: 29 LAFTKYMYR---WGEEGTPLANCKGPRAWQTEVFLELAEFIEKNKEAKRLGKPLQVFKLA 85 Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWF 141 I++ RGIGKT L AW+ W +STR G +++ ANS+ Q K T +AE+ +W S+ + H+F Sbjct: 86 IASARGIGKTALVAWITYWFLSTRIGCTVVISANSDDQCKTTSFAEIRRWHSLAKNAHFF 145 Query: 142 EMQSLSLHPSG----WYAELLEQSM 162 E +G W AE + +++ Sbjct: 146 EANIAEALLAGGCSPWQAEPVAKTL 170 >gi|307308936|ref|ZP_07588619.1| hypothetical protein SinmeBDRAFT_4503 [Sinorhizobium meliloti BL225C] gi|306900570|gb|EFN31183.1| hypothetical protein SinmeBDRAFT_4503 [Sinorhizobium meliloti BL225C] Length = 472 Score = 83.2 bits (204), Expect = 6e-14, Method: Compositional matrix adjust. Identities = 75/295 (25%), Positives = 129/295 (43%), Gaps = 30/295 (10%) Query: 65 HCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTL 124 +C + NN T+ G GKT ++A + W + + + A SE+ +K+ + Sbjct: 40 YCEAFKNNQTITV-----KGSSGWGKTFISAISLWWSLIVFDPVKVTIFAPSESTIKSGI 94 Query: 125 WAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSM-GIDSKHYTITC----RTYSEE 179 W E +Q L + + + EL E S I K TC R S++ Sbjct: 95 WNE---------------LQVLYSNMAPLFRELFEVSATKIFRKSRGETCWAEYRLVSKD 139 Query: 180 RPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYD 239 G H+ + + V DEASG D+I L P ++ SN + +G+F+ Sbjct: 140 NIAAARGFHSKNNI-VIADEASGIEDVIFTGALLNVLNDGPGAKVVLVSNPDKASGFFFK 198 Query: 240 IFNIP--LEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL-GQFPQQEVNNF 296 + P +DW + R G E YG + + ++ G+FP +V+ Sbjct: 199 TWRDPELSKDWIKVHGSIRDKPNYTPGEEERFARLYGGVTSRDYLTLVEGEFPLSDVDGL 258 Query: 297 IPHNYIEEAMS-REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 I +++EA++ ++AI + AP+I G D AG G DK+V+ R N++ +W+ Sbjct: 259 ISREFLDEAVTNKDAIPNPKAPIIWGLDPAGAGKDKSVLAIRHDNVLRGFEEWAG 313 >gi|262316909|emb|CBA18135.1| putative terminase B [Paenibacillus phage phiBP] Length = 248 Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 59/208 (28%), Positives = 100/208 (48%), Gaps = 16/208 (7%) Query: 81 AISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHW 140 ++ +G+G+GKT L A + LW + P ++C A + QL + LWAE+SKW S P Sbjct: 57 SVRSGQGVGKTALEAAISLWFLCCFPFPRVVCTAPTRQQLNDVLWAEISKWQSQSP---- 112 Query: 141 FEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEA 200 + L + Y + E+ + T RT + +P+ G H + M DEA Sbjct: 113 ILKRILKWTKTKIYMKNYEE-------RWFATARTAT--KPENMQGFHEDY-MLFIVDEA 162 Query: 201 SGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEG 260 SG D I +I G + + N+ + M N + +G+F+D N ++ +++ Sbjct: 163 SGVDDRIMAAIFGTLSG-DYNKLF-MCGNPTKTSGFFFDSHNRDRAIYRTHRVSCLDSPR 220 Query: 261 IDSGFHEGIISRYGLDSDVARIEILGQF 288 E + ++YG SDV R+ +LG+F Sbjct: 221 TSKENIEMLKAKYGEGSDVWRVRVLGEF 248 >gi|83593922|ref|YP_427674.1| hypothetical protein Rru_A2590 [Rhodospirillum rubrum ATCC 11170] gi|83576836|gb|ABC23387.1| hypothetical protein Rru_A2590 [Rhodospirillum rubrum ATCC 11170] Length = 505 Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 73/300 (24%), Positives = 122/300 (40%), Gaps = 39/300 (13%) Query: 75 PTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSM 134 P K + AG G+GKTT A + W + C A + +QL+ LW+E+++ Sbjct: 34 PAGAKVTVRAGHGVGKTTATAAAIWWHLECFDYSKTPCTAPTASQLEQILWSELARLRRR 93 Query: 135 LPHRHWFEMQSLSLHPSGWYAELLEQSMGI------DSKHYTITCRTYSEERPDTFVGPH 188 Q L P+ E L G + + + RT ++PD G H Sbjct: 94 ----ADARAQGTGL-PAALRLEALFAVSGRAIADRGTPREWFVVARTARRDQPDALQGFH 148 Query: 189 ----------------NTHGMAVF--NDEASGTPDIINKSILGFFTELNPNRFWIMTSNT 230 + G A+ +EASG PD + + G + +P +M N Sbjct: 149 ASDIDLEAGAGPRLSAKSGGAALMFVIEEASGVPDAVFEVAEGALS--SPGARLLMVGNP 206 Query: 231 RRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQ 290 R G+F + ++ +D G+ G++ +YG +S+V R+ G FP+ Sbjct: 207 TRNTGFFARSHKRDRASFTALRLRCADSPLVDPGYRAGLVRKYGAESNVVRVRADGAFPR 266 Query: 291 QEVNNFIPHNYIEEAM-----SREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHI 345 Q+ + I E A+ +R A +D +G D+A G D+TV + R G ++ I Sbjct: 267 QDDDVLIALETAEAALARPLPARMATEDERR---LGVDVARFGDDRTVFLLRIGPVVGAI 323 >gi|332980681|ref|YP_004462122.1| hypothetical protein Mahau_0077 [Mahella australiensis 50-1 BON] gi|332698359|gb|AEE95300.1| hypothetical protein Mahau_0077 [Mahella australiensis 50-1 BON] Length = 486 Score = 73.6 bits (179), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 85/351 (24%), Positives = 140/351 (39%), Gaps = 80/351 (22%) Query: 49 SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGM 108 ++P + Q++ + AV NP + A+ + G GK+ + ++LW + + Sbjct: 28 TRPWKKQIDIISAV---------RDNP---RTAVRSCHGAGKSFIAGQVILWFLYSFYPS 75 Query: 109 SIICIANSETQLKNTLWAEVSKWL---------SMLPHRHWFEMQSLSLHPSGWYAELLE 159 ++ A + Q++ +W EV ++LP R E+Q + WYA L Sbjct: 76 IVLSTAPTWRQVEKLIWKEVRASYRRSKVPLGGNLLPKRP--EIQIIQ---DEWYAVGL- 129 Query: 160 QSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELN 219 S PD F G H + + V DEA+G P+ I ++I G T + Sbjct: 130 -----------------STNEPDRFQGFHEENILVVV-DEAAGVPEEIFEAIEGVLTSEH 171 Query: 220 PNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDS-GFHEGII-------- 270 ++ N + G FY+ F P W+ I T + G E I Sbjct: 172 AR--LLLLGNPTSVGGTFYNAFRTP--GWENISISAFTTPNFTAFGITEDDIINKTWESK 227 Query: 271 --------------------SRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREA 310 R+G +S + +LGQFP + + IP +IE AM+R Sbjct: 228 ITNSLPNPKLITPAWVADKYRRWGPNSPAYQARVLGQFPSEGEDTLIPLAWIEAAMARWE 287 Query: 311 IDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQEGC 361 P+ +G D+A G DKTV+ RRG + + ++ + ET GC Sbjct: 288 DTPEGEPIEIGVDVARFGSDKTVIAARRGQKVLPLNVYAKQDTMET--VGC 336 >gi|269119479|ref|YP_003307656.1| hypothetical protein Sterm_0853 [Sebaldella termitidis ATCC 33386] gi|268613357|gb|ACZ07725.1| hypothetical protein Sterm_0853 [Sebaldella termitidis ATCC 33386] Length = 499 Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 77/310 (24%), Positives = 128/310 (41%), Gaps = 52/310 (16%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVS--------K 130 + ++ AG GK++L + + + TRP +I A + QLK WAEV+ K Sbjct: 47 RLSVPAGHSTGKSSLAGGLTTYWLITRPKSRVIVTAPTYRQLKTIYWAEVNKIYNRSKLK 106 Query: 131 WLSMLP-------------HRHWFEMQSLSLHPSGWYA------ELLEQSM---GI---- 164 L++ R WF + + P G E++EQ M GI Sbjct: 107 QLNLFEINDKIMRINDKDLKREWFALPVTASTPEGMQGQHGDKTEVIEQIMKHLGIEEIG 166 Query: 165 DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNP-NRF 223 D + I + E+ + + + V DE+SG + I + + G T+ + F Sbjct: 167 DDETIEIVSQILRGEKQIEGLTKEDKEKLLVMVDESSGVKNEIFEVLEG--TDYDKLVLF 224 Query: 224 WIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHE-----GIISRYGLDSD 278 MT NT G+FY+ P K Y++ T+ +S F + + YG DS+ Sbjct: 225 GNMTKNT----GYFYESVYNP--KSKFYKV---TMSSYNSPFMKKEQIHDLEETYGPDSN 275 Query: 279 VARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIA-GEGGDKTVVVFR 337 V R+ + G+ P N+ N I+ A R Y + +G D+ G GGD + + + Sbjct: 276 VVRVRLKGEAPDGNENSIFSSNKIDSAFQRSLSLSEYETIKLGVDVGKGSGGDSSTIYEK 335 Query: 338 RGNIIEHIFD 347 + N + D Sbjct: 336 KDNRVRKKLD 345 >gi|315649222|ref|ZP_07902312.1| hypothetical protein PVOR_28644 [Paenibacillus vortex V453] gi|315275441|gb|EFU38799.1| hypothetical protein PVOR_28644 [Paenibacillus vortex V453] Length = 189 Score = 70.5 bits (171), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 51/163 (31%), Positives = 82/163 (50%), Gaps = 25/163 (15%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWL--SMLP 136 + ++ +G+G+GKT L A +++W + RP ++C A ++ QL + LW EVSKWL SM+ Sbjct: 47 RTSVRSGQGVGKTGLEAALVIWFLCCRPNPKVVCTAPTKQQLHDVLWTEVSKWLENSMVK 106 Query: 137 H-RHWFEMQSLSL-HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMA 194 + W + + + H W+A T RT + +P+ G H + M Sbjct: 107 NLLKWTKTKVYMIGHEQRWFA----------------TARTAN--KPENMQGFHEDY-ML 147 Query: 195 VFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWF 237 DEASG D I ++ILG + N+ +M N R +G F Sbjct: 148 FIVDEASGVSDPIMEAILGTLSGAE-NKL-LMCGNPTRTSGVF 188 >gi|304399103|ref|ZP_07380971.1| DNA packaging protein [Pantoea sp. aB] gi|304353343|gb|EFM17722.1| DNA packaging protein [Pantoea sp. aB] Length = 503 Score = 70.5 bits (171), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 72/316 (22%), Positives = 137/316 (43%), Gaps = 32/316 (10%) Query: 28 FKNFVMRF-FPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGR 86 +++ V+R+ + W + +E F WQ E + +N+ T + +++G Sbjct: 16 WRDMVIRYRYNWALA--VVELFGMIPTWQQEEI----------MNSVQETGSQTTVTSGH 63 Query: 87 GIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSL 146 G GK++L A M+L + P +I +AN Q+K ++ V + + RH + Sbjct: 64 GTGKSSLTAMMLLIYMIMYPDARVIIVANKIGQVKTGVFKYVKTYWANAARRHPWLQNYF 123 Query: 147 SLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDI 206 +L + +Y + GI + + C+ Y + G H H + + DEASG D Sbjct: 124 TLTDTMFYE---KSRKGI----WEVLCKGYRLGNEEALAGEHAAHILLIL-DEASGISDK 175 Query: 207 INKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF-------NIPLEDWKRYQIDTRTVE 259 + G TE + NR +M+ TR +G+FYD + P W +++ Sbjct: 176 AIAIMRGALTEED-NRMLMMSQPTRP-SGYFYDSHHSLARHPDNPNGFWNAIVLNSEEAP 233 Query: 260 GIDSGF-HEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPL 318 + F E ++ G DS +++LG+FP+ + + + A R+ + Sbjct: 234 HVTLKFIREKLVEYGGRDSLEYMVKVLGRFPRNVSGYLLGRDECDRAARRKVYLEKGWGW 293 Query: 319 IMGCDIAGEGGDKTVV 334 + D+ G G DK+++ Sbjct: 294 VATADV-GNGRDKSIL 308 >gi|48697461|ref|YP_024846.1| Pas60 [Actinoplanes phage phiAsp2] gi|47679679|gb|AAT36808.1| Pas60 [Actinoplanes phage phiAsp2] Length = 492 Score = 70.1 bits (170), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 76/309 (24%), Positives = 127/309 (41%), Gaps = 31/309 (10%) Query: 50 QPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMS 109 P W + ++ + ++ P + A+ G+GK+ A ++ W +TR M Sbjct: 22 SPTAWAADCLDVRLAGYQGEILDAVPRERRVAVRGPHGLGKSFSGAILVNWFATTRDLMG 81 Query: 110 ----IICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGID 165 II A++ L+ LW E+ KW + +L P ELL+ + + Sbjct: 82 KDWKIITTASAWRHLEVYLWPEIHKWAGRI------NFVALGRAPYNPRTELLDLRLKL- 134 Query: 166 SKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFT----ELNPN 221 H T + +P+ G H + + DEA P SI G F+ ++ N Sbjct: 135 -THGAATA--VASNQPERIEGAHAEELLYLL-DEAKIVPPATWDSIEGAFSNAGVDVADN 190 Query: 222 RFWIMTSNTRRLNGWFYDIFNIP--LEDW--KRYQIDTRTVEG-IDSGFHEGIISRYGLD 276 + S +G FYDI EDW + ++ G I + + S++G D Sbjct: 191 AYAFAMSTPGAPSGRFYDIHRRAPGYEDWWTRHVTLEEAIASGRISRAWADQRRSQWGSD 250 Query: 277 SDVARIEILGQFPQQEVNNFIPHNYIEEAM------SREAIDDLYAPLIMGCDIAGEGGD 330 S V +LG+F + ++ IP ++E A+ R+ PL G D+ G GGD Sbjct: 251 SAVFHNRVLGEFHASDEDSVIPLAWLEAAIERWHEWDRQGRPSPGGPLWTGVDV-GRGGD 309 Query: 331 KTVVVFRRG 339 +TV+ R G Sbjct: 310 ETVLAARDG 318 >gi|322656964|gb|EFY53248.1| DNA packaging protein [Salmonella enterica subsp. enterica serovar Montevideo str. CASC_09SCPH15965] Length = 411 Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 63/264 (23%), Positives = 116/264 (43%), Gaps = 19/264 (7%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 + +++G G GK++L A ++L + P +I +AN Q+K ++ V ++ + R Sbjct: 56 RTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQYWANAVKR 115 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFND 198 H + L + +Y GI + + C+ Y + G H H + + D Sbjct: 116 HGWLQTYFVLSDTMFYE---RSRKGI----WEVLCKGYRLGNEEALAGEHAAHLLLIL-D 167 Query: 199 EASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF-------NIPLEDWKRY 251 EASG D + G TE + NR +M S R +G+FYD + P W Sbjct: 168 EASGISDKAIGVMTGALTEED-NRM-LMLSQPTRPSGYFYDSHHSQAKTPDNPKGIWTAI 225 Query: 252 QIDTRTVEGIDSGFHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREA 310 +++ + F + + Y G DS +++LGQFP++ + + + A R+ Sbjct: 226 VLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECDRAARRKV 285 Query: 311 IDDLYAPLIMGCDIAGEGGDKTVV 334 + + + D+ G G DK+V+ Sbjct: 286 LLEKNWGWVATADV-GNGRDKSVL 308 >gi|323179619|gb|EFZ65182.1| terminase B protein [Escherichia coli 1180] Length = 453 Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 63/264 (23%), Positives = 116/264 (43%), Gaps = 19/264 (7%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 + +++G G GK++L A ++L + P +I +AN Q+K ++ V ++ + R Sbjct: 7 RTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQYWANAVKR 66 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFND 198 H + L + +Y GI + + C+ Y + G H H + + D Sbjct: 67 HGWLQTYFVLSDTMFYE---RSRKGI----WEVLCKGYRLGNEEALAGEHAAHLLLIL-D 118 Query: 199 EASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF-------NIPLEDWKRY 251 EASG D + G TE + NR +M S R +G+FYD + P W Sbjct: 119 EASGISDKAIGVMTGALTEED-NRM-LMLSQPTRPSGYFYDSHHSQAKTPDNPKGIWTAI 176 Query: 252 QIDTRTVEGIDSGFHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREA 310 +++ + F + + Y G DS +++LGQFP++ + + + A R+ Sbjct: 177 VLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECDRAARRKV 236 Query: 311 IDDLYAPLIMGCDIAGEGGDKTVV 334 + + + D+ G G DK+V+ Sbjct: 237 LLEKNWGWVATADV-GNGRDKSVL 259 >gi|56266666|gb|AAV84947.1| DNA pacase B subunit [Enterobacteria phage D6] Length = 502 Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 63/264 (23%), Positives = 116/264 (43%), Gaps = 19/264 (7%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 + +++G G GK++L A ++L + P +I +AN Q+K ++ V ++ + R Sbjct: 56 RTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQYWANAVKR 115 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFND 198 H + L + +Y GI + + C+ Y + G H H + + D Sbjct: 116 HGWLQTYFVLSDTMFYE---RSRKGI----WEVLCKGYRLGNEEALAGEHAAHLLLIL-D 167 Query: 199 EASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF-------NIPLEDWKRY 251 EASG D + G TE + NR +M S R +G+FYD + P W Sbjct: 168 EASGISDKAIGVMTGALTEED-NRM-LMLSQPTRPSGYFYDSHHSQAKTPDNPKGIWTAI 225 Query: 252 QIDTRTVEGIDSGFHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREA 310 +++ + F + + Y G DS +++LGQFP++ + + + A R+ Sbjct: 226 VLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECDRAARRKV 285 Query: 311 IDDLYAPLIMGCDIAGEGGDKTVV 334 + + + D+ G G DK+V+ Sbjct: 286 LLEKNWGWVATADV-GNGRDKSVL 308 >gi|323948959|gb|EGB44853.1| terminase B protein [Escherichia coli H252] Length = 502 Score = 67.8 bits (164), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 62/264 (23%), Positives = 116/264 (43%), Gaps = 19/264 (7%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 + +++G G GK++L A ++L + P +I +AN Q+K ++ V ++ + R Sbjct: 56 RTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQYWANAVKR 115 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFND 198 H + L + +Y GI + + C+ Y + G H H + + D Sbjct: 116 HGWLQTYFVLSDTMFYE---RSRKGI----WEVLCKGYRLGNEEALAGEHAAHLLLIL-D 167 Query: 199 EASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF-------NIPLEDWKRY 251 EASG D + G TE + NR +M S R +G+FYD + P W Sbjct: 168 EASGISDKAIGVMTGALTEED-NRM-LMLSQPTRPSGYFYDSHHSRAKTPDNPKGIWTAI 225 Query: 252 QIDTRTVEGIDSGF-HEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREA 310 +++ + F E ++ G DS +++LGQFP++ + + + + R+ Sbjct: 226 VLNSEESPFVTPQFIKEKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECDRSARRKV 285 Query: 311 IDDLYAPLIMGCDIAGEGGDKTVV 334 + + + D+ G G DK+V+ Sbjct: 286 LLEKNWGWVATADV-GNGRDKSVL 308 >gi|228924410|ref|ZP_04087639.1| hypothetical protein bthur0011_53510 [Bacillus thuringiensis serovar huazhongensis BGSC 4BD1] gi|228835241|gb|EEM80653.1| hypothetical protein bthur0011_53510 [Bacillus thuringiensis serovar huazhongensis BGSC 4BD1] Length = 293 Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 39/131 (29%), Positives = 67/131 (51%), Gaps = 1/131 (0%) Query: 226 MTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL 285 + N R +G FYD N + +K +++ + E + +YG SDV R+ +L Sbjct: 3 LCGNPTRTSGVFYDSHNRDRDLYKIHKVSSLDSPRTSKDNIEVLKKKYGEGSDVWRVRVL 62 Query: 286 GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHI 345 G+FP+ E + FIP +E+A S + ++ L +G D+A G D+TV+ R GN + + Sbjct: 63 GEFPKAEADAFIPLEIVEQAASCK-VEPTGETLDLGVDVARFGDDETVIAPRIGNKVFKL 121 Query: 346 FDWSAKLIQET 356 + + ET Sbjct: 122 LNHYKQDTMET 132 >gi|216906085|ref|YP_002333619.1| terminase [Abalone shriveling syndrome-associated virus] gi|216263178|gb|ACJ72002.1| terminase [Abalone shriveling syndrome-associated virus] Length = 507 Score = 60.1 bits (144), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 83/313 (26%), Positives = 128/313 (40%), Gaps = 35/313 (11%) Query: 54 WQLEFMEAVDVHCHSNVNNSNPTIFKCAI--SAGRGIGKTTLNAWMMLWLISTRPGMSII 111 WQLE VD NS+ F CAI S G G GKT L+ + +W PG Sbjct: 51 WQLEI---VDYIAKFFRKNSDEKHFVCAIAVSGGNGTGKTKLSKALNIWRFCCHPGSRQF 107 Query: 112 CIANSETQLKNT----LWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSK 167 + NSE Q K T L +SK LS + ++S + + S A+ E D Sbjct: 108 ILTNSERQTKRTGFTMLVRRISKLLSCIA-----ALESSAYYYSPAVADKPEVRTN-DMW 161 Query: 168 HYTITCRTYSEERPDTFVGPHNTHGMAVFN-DEASGTPDIINKSILGFFTELNPNRFWIM 226 T ++ +E G H H M F+ DE++ D + +++ +T+ Sbjct: 162 DVTYLLQSSTEA---ALSGLH--HPMMTFSFDESTYFNDHVWQALENMWTQ--GQVLCFC 214 Query: 227 TSN-TRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID-------SGFHEGIISRYGLDSD 278 T N + N +F +FN L + TR V ++ I YG Sbjct: 215 TGNPSHDNNNYFARLFNKSLHKKDSLWL-TRCVSLLELPLKYRNDARARYIEEHYGKTHP 273 Query: 279 VARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDD-LYAPLIMGCD--IAGEGGDKTVVV 335 +LGQFP++ N I EAM RE ++ ++ P+IMG D I+ G + + Sbjct: 274 RYIASVLGQFPKKNTCNPFDITAISEAMEREVREEFIHHPVIMGIDVSISANNGSASAIC 333 Query: 336 FRRGNIIEHIFDW 348 R G + + ++ Sbjct: 334 VREGTAVRVLREY 346 >gi|260871239|ref|YP_003238019.1| DNA packaging protein [Escherichia coli O111:H- str. 11128] gi|257767818|dbj|BAI39311.1| DNA packaging protein [Escherichia coli O111:H- str. 11128] Length = 494 Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 62/268 (23%), Positives = 121/268 (45%), Gaps = 27/268 (10%) Query: 80 CAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEV-SKWLSMLPHR 138 ++++G G GK+ + + + + I PG +I +AN Q+ + ++ + S W + + Sbjct: 52 TSVTSGHGTGKSDMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRF 111 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTI---TCRTYSEERPDTFVGPHNTHGMAV 195 W + S + E+ + + +TI +CR+ +EE G H H + + Sbjct: 112 PWLSKYFILTETS--FFEVTGKGV------WTILIKSCRSGNEE---ALAGEHADHLLYI 160 Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI-------PLEDW 248 DEASG D I G T + NR +++ TR +G+FYD + P + Sbjct: 161 I-DEASGVSDKAFSVITGALTGKD-NRILLLSQPTRP-SGYFYDSHHRLAIRPGNPDGLF 217 Query: 249 KRYQIDTRTVEGIDSGFHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 +++ +D+ F ++ Y G D+ + I++ G+FP+ + + + +E A Sbjct: 218 TAIILNSEESPLVDAKFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATR 277 Query: 308 REAIDDLYAPLIMGCDIA-GEGGDKTVV 334 R+ + D+A G G DK+V+ Sbjct: 278 RKVKIAKGWGWVACVDVAGGTGRDKSVI 305 >gi|331649955|ref|ZP_08351031.1| terminase B protein (PACase B protein) (DNA packaging B protein) [Escherichia coli M605] gi|331041212|gb|EGI13366.1| terminase B protein (PACase B protein) (DNA packaging B protein) [Escherichia coli M605] Length = 494 Score = 57.8 bits (138), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 62/268 (23%), Positives = 120/268 (44%), Gaps = 27/268 (10%) Query: 80 CAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEV-SKWLSMLPHR 138 ++++G G GK+ + + + + I PG +I +AN Q+ + ++ + S W + + Sbjct: 52 TSVTSGHGTGKSDMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRF 111 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTI---TCRTYSEERPDTFVGPHNTHGMAV 195 W + S + E+ + + +TI +CR +EE G H H + + Sbjct: 112 PWLSKYFILTETS--FFEVTGKGV------WTILIKSCRPGNEE---ALAGEHADHLLYI 160 Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI-------PLEDW 248 DEASG D I G T + NR +++ TR +G+FYD + P + Sbjct: 161 I-DEASGVSDKAFSVITGALTGKD-NRILLLSQPTRP-SGYFYDSHHRLAIRPGNPDGLF 217 Query: 249 KRYQIDTRTVEGIDSGFHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 +++ +D+ F ++ Y G D+ + I++ G+FP+ + + + +E A Sbjct: 218 TAIILNSEESPLVDAKFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATR 277 Query: 308 REAIDDLYAPLIMGCDIA-GEGGDKTVV 334 R+ + D+A G G DK+V+ Sbjct: 278 RKVKIAKGWGWVACVDVAGGTGRDKSVI 305 >gi|46401730|ref|YP_006576.1| PacB [Enterobacteria phage P1] gi|301646767|ref|ZP_07246623.1| putative terminase B protein [Escherichia coli MS 146-1] gi|129547|sp|P27753|TERL_BPP1 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging protein B; AltName: Full=PACase B protein; AltName: Full=Terminase B protein; AltName: Full=Terminase large subunit gi|68597607|sp|Q5XLR0|TERL_BPP7 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging protein B; AltName: Full=PACase B protein; AltName: Full=Terminase B protein; AltName: Full=Terminase large subunit gi|33323612|gb|AAQ07582.1|AF503408_106 PacB [Enterobacteria phage P7] gi|215636|gb|AAA21724.1| pacB [Enterobacteria phage P1] gi|33338757|gb|AAQ14080.1| PacB [Enterobacteria phage P1] gi|33338866|gb|AAQ14188.1| PacB [Enterobacteria phage P1] gi|54112354|gb|AAV28854.1| PacB [Enterobacteria phage P7] gi|301075042|gb|EFK89848.1| putative terminase B protein [Escherichia coli MS 146-1] Length = 494 Score = 57.8 bits (138), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 62/268 (23%), Positives = 120/268 (44%), Gaps = 27/268 (10%) Query: 80 CAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEV-SKWLSMLPHR 138 ++++G G GK+ + + + + I PG +I +AN Q+ + ++ + S W + + Sbjct: 52 TSVTSGHGTGKSDMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRF 111 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTI---TCRTYSEERPDTFVGPHNTHGMAV 195 W + S + E+ + + +TI +CR +EE G H H + + Sbjct: 112 PWLSKYFILTETS--FFEVTGKGV------WTILIKSCRPGNEE---ALAGEHADHLLYI 160 Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI-------PLEDW 248 DEASG D I G T + NR +++ TR +G+FYD + P + Sbjct: 161 I-DEASGVSDKAFSVITGALTGKD-NRILLLSQPTRP-SGYFYDSHHRLAIRPGNPDGLF 217 Query: 249 KRYQIDTRTVEGIDSGFHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 +++ +D+ F ++ Y G D+ + I++ G+FP+ + + + +E A Sbjct: 218 TAIILNSEESPLVDAKFIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATR 277 Query: 308 REAIDDLYAPLIMGCDIA-GEGGDKTVV 334 R+ + D+A G G DK+V+ Sbjct: 278 RKVKIAKGWGWVACVDVAGGTGRDKSVI 305 >gi|161789175|ref|YP_001595730.1| PacB [Vibrio sp. 0908] gi|161761461|gb|ABX77106.1| PacB [Vibrio sp. 0908] Length = 572 Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 46/163 (28%), Positives = 75/163 (46%), Gaps = 11/163 (6%) Query: 70 VNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVS 129 +N P + ++++G G GK+ L A + L I T P + ANS Q+ N +++ + Sbjct: 53 INALTPVGARVSVASGHGTGKSHLTAALCLHFIITHPESLCMLTANSLDQVTNVVFSYIK 112 Query: 130 K-WLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPH 188 + W+ + + W E Q + +YA+ + I K TC +EE G H Sbjct: 113 RCWVKICQRQPWLE-QYFVITAKSFYAKGYKGVWQIFGK----TCSKGNEE---GLAGQH 164 Query: 189 NTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTR 231 M V DEASG D + + G TE N N+ +++ TR Sbjct: 165 RRDYMVVV-DEASGVSDRAFEVLRGALTEDN-NKMLLISQFTR 205 >gi|56266643|gb|AAV84926.1| DNA pacase B subunit [Enterobacteria phage phiW39] Length = 494 Score = 53.5 bits (127), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 59/266 (22%), Positives = 114/266 (42%), Gaps = 21/266 (7%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVS-KWLSMLPH 137 K ++S+G G GK+ + + M++ I PG I +AN Q+ ++ + W + Sbjct: 51 KTSVSSGHGTGKSDMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSR 110 Query: 138 RHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFN 197 W L + +Y E+ + + +T+ + + + G H H + + Sbjct: 111 FPWL-ADYFVLTETAFY-EITGKGV------WTVVPKGFRLGSEEALAGEHADHLLYII- 161 Query: 198 DEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI-------PLEDWKR 250 DEASG D I G T + NR +++ TR +G+FYD + P + Sbjct: 162 DEASGVSDRAFGIITGALTGQD-NRILLLSQPTRP-SGYFYDTHHKLAKRPGNPDGVYTA 219 Query: 251 YQIDTRTVEGIDSGFHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSRE 309 +++ + F + ++ Y G D+ + I++ G FP+ + + + +E A R+ Sbjct: 220 ITLNSEESPLVTPAFIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRK 279 Query: 310 AIDDLYAPLIMGCDIA-GEGGDKTVV 334 + D+A G G DK+V+ Sbjct: 280 VKIAKGWGWLACVDVAGGTGRDKSVI 305 >gi|324111095|gb|EGC05081.1| terminase B protein [Escherichia fergusonii B253] Length = 494 Score = 53.5 bits (127), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 59/266 (22%), Positives = 114/266 (42%), Gaps = 21/266 (7%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVS-KWLSMLPH 137 K ++S+G G GK+ + + M++ I PG I +AN Q+ ++ + W + Sbjct: 51 KTSVSSGHGTGKSDMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSR 110 Query: 138 RHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFN 197 W L + +Y E+ + + +T+ + + + G H H + + Sbjct: 111 FPWL-ADYFVLTETAFY-EVTGKGV------WTVVPKGFRLGSEEALAGEHADHLLYII- 161 Query: 198 DEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI-------PLEDWKR 250 DEASG D I G T + NR +++ TR +G+FYD + P + Sbjct: 162 DEASGVSDRAFGIITGALTGQD-NRILLLSQPTRP-SGYFYDTHHKLAKRPGNPDGVYTA 219 Query: 251 YQIDTRTVEGIDSGFHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSRE 309 +++ + F + ++ Y G D+ + I++ G FP+ + + + +E A R+ Sbjct: 220 ITLNSEESPLVTPAFIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRK 279 Query: 310 AIDDLYAPLIMGCDIA-GEGGDKTVV 334 + D+A G G DK+V+ Sbjct: 280 VKIAKGWGWLACVDVAGGTGRDKSVI 305 >gi|312964323|ref|ZP_07778627.1| terminase B protein [Escherichia coli 2362-75] gi|331655801|ref|ZP_08356790.1| terminase B protein (PACase B protein) (DNA packaging B protein) [Escherichia coli M718] gi|312291036|gb|EFR18910.1| terminase B protein [Escherichia coli 2362-75] gi|323186470|gb|EFZ71817.1| terminase B protein [Escherichia coli 1357] gi|323969205|gb|EGB64507.1| terminase B protein [Escherichia coli TA007] gi|325495624|gb|EGC93488.1| DNA pacase B subunit [Escherichia fergusonii ECD227] gi|331046575|gb|EGI18664.1| terminase B protein (PACase B protein) (DNA packaging B protein) [Escherichia coli M718] Length = 494 Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 59/266 (22%), Positives = 114/266 (42%), Gaps = 21/266 (7%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVS-KWLSMLPH 137 K ++S+G G GK+ + + M++ I PG I +AN Q+ ++ + W + Sbjct: 51 KTSVSSGHGTGKSDMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSR 110 Query: 138 RHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFN 197 W L + +Y E+ + + +T+ + + + G H H + + Sbjct: 111 FPWL-ADYFVLTETAFY-EVTGKGV------WTVVPKGFRLGSEEALAGEHADHLLYII- 161 Query: 198 DEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI-------PLEDWKR 250 DEASG D I G T + NR +++ TR +G+FYD + P + Sbjct: 162 DEASGVSDRAFGIITGALTGQD-NRILLLSQPTRP-SGYFYDTHHKLAKRPGNPDGVYTA 219 Query: 251 YQIDTRTVEGIDSGFHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSRE 309 +++ + F + ++ Y G D+ + I++ G FP+ + + + +E A R+ Sbjct: 220 ITLNSEESPLVTPAFIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRK 279 Query: 310 AIDDLYAPLIMGCDIA-GEGGDKTVV 334 + D+A G G DK+V+ Sbjct: 280 VKIAKGWGWLACVDVAGGTGRDKSVI 305 >gi|257459276|ref|ZP_05624390.1| phosphatase, Ppx/GppA family [Campylobacter gracilis RM3268] gi|257443289|gb|EEV18418.1| phosphatase, Ppx/GppA family [Campylobacter gracilis RM3268] Length = 431 Score = 53.1 bits (126), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 67/275 (24%), Positives = 108/275 (39%), Gaps = 35/275 (12%) Query: 80 CAISAGR--GIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPH 137 C I GR G K T NA + WL+ G I+ + LK L LP Sbjct: 26 CTIEKGRRFGFTKGTANACIE-WLLE---GQKILWVDTIAANLKRYFERYFLPELRQLPK 81 Query: 138 RHW-FEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVF 196 W + Q L G Y + S ERP+ G + + Sbjct: 82 ELWNWNAQDKQLKICGGYLDF------------------RSAERPENIEG--FGYDTVIL 121 Query: 197 NDEA--SGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED---WKRY 251 N+ P + + +I + NPN + + N F+D+ + + W+ + Sbjct: 122 NEAGIILKDPYLWDNAISPMLLD-NPNSRAFIGGVPKGKNK-FFDLAQRGMRNEKGWRNF 179 Query: 252 QIDTRTVEGIDSGFHEGIISRYG-LDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREA 310 Q + + + +++ G DSDVAR EI G+F N+ IE A ++ Sbjct: 180 QFSSYDNPLLQKEEIDRLVAELGGADSDVARQEIFGEFLDTTSNSVFSLAAIEAAFRKQR 239 Query: 311 IDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHI 345 D AP+I D+A EG D++V+ R+G+ +E + Sbjct: 240 YFDAGAPVIWALDVAREGDDESVLCKRQGDSVEPL 274 >gi|168467778|ref|ZP_02701615.1| DNA pacase B subunit [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|195629119|gb|EDX48493.1| DNA pacase B subunit [Salmonella enterica subsp. enterica serovar Newport str. SL317] Length = 494 Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 60/265 (22%), Positives = 115/265 (43%), Gaps = 19/265 (7%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 K ++S+G G GK+ + + M++ I PG I +AN Q+ ++ + S R Sbjct: 51 KTSVSSGHGTGKSDMTSIMIMLFIIMFPGARAIIVANKIQQVMTGIFKYLKINWSTATSR 110 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFND 198 + + L + +Y E+ + + +T+ + + + G H H + + D Sbjct: 111 FPWLAEYFVLTDTSFY-EITSKGV------WTVVPKGFRLGNEEALAGEHADHLLYII-D 162 Query: 199 EASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI-------PLEDWKRY 251 EASG D + G T + NR +++ TR +G+FYD + P + Sbjct: 163 EASGVSDKAFGIMTGALTGKD-NRILLLSQPTRP-SGYFYDTHHKLAKRPGNPNGIYTAI 220 Query: 252 QIDTRTVEGIDSGFHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREA 310 +++ + F + ++ Y G DS + I++ G FP+ + + + +E A R+ Sbjct: 221 TLNSEESPLVTPEFIKMKLAEYGGRDSPMYLIKVRGLFPKTQDGFLLGRDEVERASRRKV 280 Query: 311 IDDLYAPLIMGCDIA-GEGGDKTVV 334 I D+A G G DK+V+ Sbjct: 281 KIAKGWGWIACVDVAGGTGRDKSVI 305 >gi|320103661|ref|YP_004179252.1| hypothetical protein Isop_2123 [Isosphaera pallida ATCC 43644] gi|319750943|gb|ADV62703.1| hypothetical protein Isop_2123 [Isosphaera pallida ATCC 43644] Length = 553 Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 68/290 (23%), Positives = 109/290 (37%), Gaps = 33/290 (11%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWF 141 ++ G +GK+ L A + LW + T PG ++ A S+ L L+ E+ K L+ R Sbjct: 68 VATGNAVGKSYLAAGLTLWWLYTHPGSLVVATAPSQGLLGTVLFRELQKALAA-SRRRGL 126 Query: 142 EMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEAS 201 + + + L G C + + G H+ M V DEAS Sbjct: 127 GLPGMVVGSDRGTPFSLRVGPGRRLAAEGWGCLGIATRGVERLAGRHHADLMVVV-DEAS 185 Query: 202 GTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDT------ 255 G + T LNP + ++ N F+ + L + I Sbjct: 186 G----VQPEAWEALTSLNPRKLFV-CGNPLTPGTVFHKLHQRGLTEASDPSIPDHARGVA 240 Query: 256 --------------RTVEGI-DSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 R+ G+ D GF ++G S + + G FP V+ I Sbjct: 241 LTIPSTASPDINLERSPRGLADRGFIREAERQWGRGSPLWLSHVEGVFPTVAVHALIEPG 300 Query: 301 YIEEAMSREAIDDLYAP---LIMGCDI-AGEGGDKTVVVFR-RGNIIEHI 345 ++++A S E P ++GCD+ AG G D+T +V R G I E I Sbjct: 301 WLDQAASLERSQTYENPPGQPVLGCDLAAGVGADRTAIVVRDEGGIRELI 350 >gi|226940459|ref|YP_002795533.1| Terminase large subunit [Laribacter hongkongensis HLHK9] gi|226715386|gb|ACO74524.1| Terminase large subunit [Laribacter hongkongensis HLHK9] Length = 272 Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 32/94 (34%), Positives = 44/94 (46%), Gaps = 2/94 (2%) Query: 248 WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 W QID+RTVEG + YG +SD ++ + G FP FI + A Sbjct: 14 WVARQIDSRTVEGTNKEQIAKWAEDYGEESDFFKVRVRGMFPSMSARQFISETDVSAAYG 73 Query: 308 REAIDD--LYAPLIMGCDIAGEGGDKTVVVFRRG 339 R + YAP I+ D A EG D+ V+ R+G Sbjct: 74 RALRPEQYQYAPKILTVDPAWEGDDEFVIGLRQG 107 >gi|148653111|ref|YP_001280204.1| hypothetical protein PsycPRwf_1309 [Psychrobacter sp. PRwf-1] gi|148572195|gb|ABQ94254.1| hypothetical protein PsycPRwf_1309 [Psychrobacter sp. PRwf-1] Length = 520 Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 43/166 (25%), Positives = 74/166 (44%), Gaps = 18/166 (10%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 + ++++G G GK+ + LW + P ++ A QL+ +W E++ L L + Sbjct: 57 RTSVASGHGTGKSRSAGIIALWHLLFYPESVMLFTAPQIGQLRTVVWKEINICLQRLRNN 116 Query: 139 HWFEMQSLSLHPSGWYAE---LLEQSMGIDSKHYT--ITCRTYSEERPDTFVGPHNTHGM 193 GW A+ +L + + I T + +T + +P G H H M Sbjct: 117 KAL----------GWLADYVVVLAEKIYIKGFKDTWFVFAKTAPKHQPTNIAGQHGDHYM 166 Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYD 239 V+ DEA G D + + +G T N NR ++TS + G+FYD Sbjct: 167 -VWADEACGIDDAVMEVAIGALTHEN-NRA-VLTSQPAKNTGFFYD 209 >gi|332974843|gb|EGK11758.1| hypothetical protein HMPREF9373_1714 [Psychrobacter sp. 1501(2011)] Length = 520 Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 43/166 (25%), Positives = 74/166 (44%), Gaps = 18/166 (10%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 + ++++G G GK+ + LW + P ++ A QL+ +W E++ L L + Sbjct: 57 RTSVASGHGTGKSRSAGIIALWHLLFYPESVMLFTAPQIGQLRTVVWKEINICLQRLRNN 116 Query: 139 HWFEMQSLSLHPSGWYAE---LLEQSMGIDSKHYT--ITCRTYSEERPDTFVGPHNTHGM 193 GW A+ +L + + I T + +T + +P G H H M Sbjct: 117 KAL----------GWLADYVVVLAEKIYIKGFKDTWFVFAKTAPKHQPTNIAGQHGDHYM 166 Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYD 239 V+ DEA G D + + +G T N NR ++TS + G+FYD Sbjct: 167 -VWADEACGIDDAVMEVAIGALTHEN-NRA-VLTSQPAKNTGFFYD 209 >gi|226227228|ref|YP_002761334.1| hypothetical protein GAU_1822 [Gemmatimonas aurantiaca T-27] gi|226090419|dbj|BAH38864.1| hypothetical protein [Gemmatimonas aurantiaca T-27] Length = 549 Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust. Identities = 64/286 (22%), Positives = 108/286 (37%), Gaps = 47/286 (16%) Query: 80 CAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRH 139 A+++G G GKT L A ++LW I+ P +A Q + +W EV+ RH Sbjct: 70 VAVASGTGTGKTFLEAVLLLWWIAVEPDSIATTVATKADQQEKGIWREVA--------RH 121 Query: 140 WFEMQSLSLHPSGWYAEL---LEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVF 196 W Q + P L +E G + IT + E + V + + + Sbjct: 122 WPRFQ--ACFPEAELTTLRIRMEPWRGDAWGAWGITAAPKAGEESSSAVQGLHAKRLLIL 179 Query: 197 NDEASGTPDIINKSILGFFT--------------ELNP-NRFWIMTSNTRRLNGWFYDIF 241 DE G P + +++ T + +P +F + T+R+ Sbjct: 180 VDETPGVPQPVMTALVNTATGEENVIAAFGNPDYQADPLGQF----AETKRVTA-----I 230 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISR---YGLDSDVARIEILGQFPQQEVNNFIP 298 I D + + G + I +R YG++S V + + G P+Q + I Sbjct: 231 RISALDHPNVVLGVERIPGAATRLS--IATREDKYGVESGVYQSRVRGIAPEQSASALIH 288 Query: 299 HNYIEEAMSR-EAIDD---LYAPLIMGCDIA-GEGGDKTVVVFRRG 339 + A R E++ P +G D+A E GDK V +G Sbjct: 289 LAWCVAAADRAESVQHAALALGPKALGVDVAQSENGDKAAVAMGQG 334 >gi|189460514|ref|ZP_03009299.1| hypothetical protein BACCOP_01155 [Bacteroides coprocola DSM 17136] gi|189432758|gb|EDV01743.1| hypothetical protein BACCOP_01155 [Bacteroides coprocola DSM 17136] Length = 556 Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust. Identities = 61/268 (22%), Positives = 102/268 (38%), Gaps = 51/268 (19%) Query: 109 SIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKH 168 + A ++ Q+KN + E+S+ + R + L+ + + ++ Sbjct: 124 KVALTAPTDRQVKNIMMPEISRLFNRAKARGVELIGKLNAY-----------DIRTNNDE 172 Query: 169 YTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILG-------FFTELNPN 221 + +T E + + G H H M V EA+G D +I G NPN Sbjct: 173 WFLTGFKADEHNHEAWSGFHAVHTMFVVT-EATGIGDDTFAAIEGNLQGDSRILLVFNPN 231 Query: 222 RFWIMTSNTRRLNGWF------------------------YDIFNIPLEDWKRYQIDTRT 257 + + +++ + W YD LE+W Sbjct: 232 KTVGYAAKSQKGDRWHKYRLNSLTAPNIASKKIIIPGQVDYDWVLDKLENWCEKISPDEI 291 Query: 258 VEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR----EAIDD 313 + +D EG R D+ R ++LG FP+ + + IP ++EEA R + + Sbjct: 292 ISEMDDFEFEGQWYR---PEDLFRKKVLGLFPKVDEDTLIPRQWLEEAHERWKQAKGREP 348 Query: 314 LYAPL-IMGCDIAGEGGDKTVVVFRRGN 340 L A L I+G D+AG G D T V RR N Sbjct: 349 LRADLNILGVDVAGMGRDATCYVLRRDN 376 >gi|283956317|ref|ZP_06373797.1| terminase B protein, putative [Campylobacter jejuni subsp. jejuni 1336] gi|283792037|gb|EFC30826.1| terminase B protein, putative [Campylobacter jejuni subsp. jejuni 1336] Length = 430 Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust. Identities = 42/142 (29%), Positives = 65/142 (45%), Gaps = 14/142 (9%) Query: 237 FYDIFNIPLED--WKRYQIDTRTVEGI-DSGFHEGIISRYGLDSDVARIEILGQFPQQEV 293 FY++ L D WK +Q + + + E I G DS+V + EI G+F Sbjct: 164 FYELCRKELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGEDSEVVKQEIYGEFIDSSS 223 Query: 294 NNFIPHNYIEEAMSREA--IDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA- 350 IE AMS+ + I+ + I G D+A G DK+V+ R+G I++ I +S Sbjct: 224 AELFALTEIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSVLAKRKGFIVDEIKKYSQL 283 Query: 351 -------KLIQETNQ-EGCPVG 364 +++ E NQ E P G Sbjct: 284 GTMELANRILAEYNQSEDKPKG 305 >gi|154175204|ref|YP_001409090.1| Ppx/GppA family phosphatase [Campylobacter curvus 525.92] gi|112803006|gb|EAU00350.1| phosphatase, Ppx/GppA family [Campylobacter curvus 525.92] Length = 433 Score = 45.1 bits (105), Expect = 0.019, Method: Compositional matrix adjust. Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 9/105 (8%) Query: 246 EDWKRYQIDT-----RTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 +DW +QI + E ID E I G+DSDV + EI G+F N P + Sbjct: 174 KDWVNFQISSFENPLLRKEEID----ELIAELGGVDSDVVKQEIYGEFLDTTTNALFPLS 229 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHI 345 IE A + + A I G D+A +G D++V+ R G ++++ Sbjct: 230 QIEAAFGKVRAYEPNAVQIWGLDVARDGDDESVLCVREGYHVKNL 274 >gi|226940437|ref|YP_002795511.1| Terminase large subunit [Laribacter hongkongensis HLHK9] gi|226715364|gb|ACO74502.1| Terminase large subunit [Laribacter hongkongensis HLHK9] Length = 133 Score = 44.7 bits (104), Expect = 0.026, Method: Compositional matrix adjust. Identities = 35/129 (27%), Positives = 51/129 (39%), Gaps = 23/129 (17%) Query: 111 ICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL------HPSGWYAELLEQSMGI 164 + AN++TQL+ EV KW + HWF+ QS S+ H W A+ + Sbjct: 1 MITANTDTQLRTKTSPEVGKWQRLSITSHWFDPQSASIAARDKEHAKTWRADFV------ 54 Query: 165 DSKHYTITCRTYSEERPDTFVGPHNT-HGMAVFNDEASGTPDIINKSILGFFTELNPNRF 223 +SE + F G HN + + DEAS D + + G T+ Sbjct: 55 ----------PWSEHNTEAFAGLHNKGKRIVLIFDEASAIADKVWEVAEGALTDEETEII 104 Query: 224 WIMTSNTRR 232 WI N R Sbjct: 105 WIAFGNPTR 113 >gi|153951273|ref|YP_001397540.1| putative terminase B protein [Campylobacter jejuni subsp. doylei 269.97] gi|153951467|ref|YP_001398214.1| putative terminase B protein [Campylobacter jejuni subsp. doylei 269.97] gi|152938719|gb|ABS43460.1| putative terminase B protein [Campylobacter jejuni subsp. doylei 269.97] gi|152938913|gb|ABS43654.1| putative terminase B protein [Campylobacter jejuni subsp. doylei 269.97] Length = 430 Score = 43.5 bits (101), Expect = 0.053, Method: Compositional matrix adjust. Identities = 35/118 (29%), Positives = 54/118 (45%), Gaps = 5/118 (4%) Query: 237 FYDIFNIPLED--WKRYQIDTRTVEGI-DSGFHEGIISRYGLDSDVARIEILGQFPQQEV 293 FY++ L D WK +Q + + + E I G SDV R EI G+F Sbjct: 164 FYELCRKELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGESSDVVRQEIYGEFIDSSS 223 Query: 294 NNFIPHNYIEEAMSREAI--DDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS 349 + IE AMS+ + + I G D+A G DK+V+ R+G +I+ + +S Sbjct: 224 AELFSLSGIENAMSKNSFSTQKMQGENIWGLDVARYGDDKSVLAKRKGFVIDELKKYS 281 >gi|282880015|ref|ZP_06288737.1| hypothetical protein HMPREF9019_0946 [Prevotella timonensis CRIS 5C-B1] gi|281306129|gb|EFA98167.1| hypothetical protein HMPREF9019_0946 [Prevotella timonensis CRIS 5C-B1] Length = 459 Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust. Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 9/76 (11%) Query: 277 SDVARIEILGQFPQQEVNNFIPHNYIEEAMSR-------EAIDDLYAPLIMGCDIAGEGG 329 +D+ RI++LG FP+ + IP ++E A R + + YA + G D+AG G Sbjct: 221 NDLFRIKVLGLFPKASEDTLIPFEWLELAHDRWKKLNAEDFVPRKYARV--GIDVAGMGR 278 Query: 330 DKTVVVFRRGNIIEHI 345 D + V R GN + I Sbjct: 279 DSSCFVLRYGNYVPEI 294 >gi|57237579|ref|YP_178593.1| terminase B protein, putative [Campylobacter jejuni RM1221] gi|57166383|gb|AAW35162.1| terminase B protein, putative [Campylobacter jejuni RM1221] Length = 430 Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust. Identities = 40/142 (28%), Positives = 64/142 (45%), Gaps = 14/142 (9%) Query: 237 FYDIFNIPLED--WKRYQIDTRTVEGIDSGFHEGIISRYGLD-SDVARIEILGQFPQQEV 293 FY++ L D WK +Q + + + +I G + S+V + EI G+F Sbjct: 164 FYELCRKELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGEGSEVVKQEIYGEFIDSSS 223 Query: 294 NNFIPHNYIEEAMSREA--IDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA- 350 + IE AMS+ + I+ + I G D+A G DK+ + R+G +I I +S Sbjct: 224 AELFSLSEIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSALAKRKGFVIYEIKKYSQL 283 Query: 351 -------KLIQETNQ-EGCPVG 364 K++ E NQ E P G Sbjct: 284 GTIELANKILAEYNQSEDKPKG 305 >gi|315929403|gb|EFV08605.1| phosphatase, Ppx/GppA family [Campylobacter jejuni subsp. jejuni 305] Length = 430 Score = 42.4 bits (98), Expect = 0.13, Method: Compositional matrix adjust. Identities = 40/142 (28%), Positives = 64/142 (45%), Gaps = 14/142 (9%) Query: 237 FYDIFNIPLED--WKRYQIDTRTVEGIDSGFHEGIISRYGLD-SDVARIEILGQFPQQEV 293 FY++ L D WK +Q + + + +I G + S+V + EI G+F Sbjct: 164 FYELCRKELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGEGSEVVKQEIYGEFIDSSS 223 Query: 294 NNFIPHNYIEEAMSREA--IDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA- 350 + IE AMS+ + I+ + I G D+A G DK+ + R+G +I I +S Sbjct: 224 AELFSLSEIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSALAKRKGFVIYEIKKYSQL 283 Query: 351 -------KLIQETNQ-EGCPVG 364 K++ E NQ E P G Sbjct: 284 GTIELANKILAEYNQSEDKPKG 305 >gi|298387330|ref|ZP_06996883.1| conserved hypothetical protein [Bacteroides sp. 1_1_14] gi|298259999|gb|EFI02870.1| conserved hypothetical protein [Bacteroides sp. 1_1_14] Length = 500 Score = 41.6 bits (96), Expect = 0.17, Method: Compositional matrix adjust. Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 3/72 (4%) Query: 277 SDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAP---LIMGCDIAGEGGDKTV 333 +D+ RI++ G FP+ + IP+ +IE A R + Y P +G D+AG G D +V Sbjct: 264 NDLFRIKVRGMFPKVAEDVLIPYEWIEIANKRWQENHPYRPRKSCKLGVDVAGMGRDNSV 323 Query: 334 VVFRRGNIIEHI 345 R GN + Sbjct: 324 FCPRYGNYVSQF 335 >gi|292670767|ref|ZP_06604193.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] gi|292647388|gb|EFF65360.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] Length = 442 Score = 41.2 bits (95), Expect = 0.28, Method: Compositional matrix adjust. Identities = 35/144 (24%), Positives = 60/144 (41%), Gaps = 28/144 (19%) Query: 221 NRFWIMTSNTRRLNGWFYDIFN------IPLEDWKRYQIDTRTVEGIDSGFHEGIISRYG 274 N+F+ M + + GW+ I+ +P E+ K Q +E Sbjct: 168 NQFYEMYQHAEKSAGWYSCIYRTDETGVLPAEELKDMQAQMTEME--------------- 212 Query: 275 LDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDD--LYAPLIMGCDIAGEGGDKT 332 R E+L F + IP + + A +R DD L P+I+G D+A G D+T Sbjct: 213 -----IRQELLCDFTASASDVVIPIDLVTAAANRLLKDDDVLGQPVILGVDVARFGDDRT 267 Query: 333 VVVFRRGNIIEHIFDWSAKLIQET 356 V+ R+G ++ + ++ ET Sbjct: 268 VLCVRQGLWLKEVRTFTGLSTMET 291 >gi|225574768|ref|ZP_03783378.1| hypothetical protein RUMHYD_02845 [Blautia hydrogenotrophica DSM 10507] gi|225037968|gb|EEG48214.1| hypothetical protein RUMHYD_02845 [Blautia hydrogenotrophica DSM 10507] Length = 428 Score = 40.4 bits (93), Expect = 0.40, Method: Compositional matrix adjust. Identities = 68/319 (21%), Positives = 137/319 (42%), Gaps = 36/319 (11%) Query: 66 CHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLW 125 C V+ S SAG G K+ A L + G ++ICI S+ +++ + Sbjct: 10 CFREVDRSQKRYIVMKGSAGSG--KSVDTAQNYLLRLMQDKGRNLICIRKSDITNRDSTY 67 Query: 126 AEVS----KWLSMLPHRHWFEMQ---SLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSE 178 AE++ + R+W Q SL+ P+G +++ + + + + + T+ Sbjct: 68 AELTGAAYRIFGDQVDRYWNIKQSPLSLTFRPNG--NQIIFRGVNDEKQREKLKSITFQR 125 Query: 179 ER-PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFW--IMTSNTRRLNG 235 + D ++ A F +II+ + G EL P++F+ MT N N Sbjct: 126 GKLTDVWIEEATEITQADF--------EIIDDRLRG---ELPPDQFYQIRMTFNPVNKNH 174 Query: 236 WFYDI-FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVN 294 W + F+ P + + ID+ +H + R +D + +I LG + EV Sbjct: 175 WIKKVFFDTPDSNVLTHHSTYLDNRFIDAAYHARMARRKEVDPEGYQIYGLGNWG--EVG 232 Query: 295 NFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGN---IIEHIFDW--- 348 I HN+ E +S + +DD Y + +G D + +++ + + I++ I+ + Sbjct: 233 GLILHNWAVENIS-QNLDD-YDDIAIGQDFGFNHANAILLLGMKDDNIYILQEIYVFEKE 290 Query: 349 SAKLIQETNQEGCPVGSSI 367 +A++I ++G P+ ++ Sbjct: 291 TAEIIPLAIKDGIPIKRTM 309 >gi|291334627|gb|ADD94276.1| hypothetical protein Syncc9605_0456 [uncultured phage MedDCM-OCT-S04-C231] Length = 320 Score = 40.4 bits (93), Expect = 0.41, Method: Compositional matrix adjust. Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 8/92 (8%) Query: 236 WFYDIFNIPLED----WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQ 291 WFYD++ ED W+R+ T +EG + HE +R LD+ R E F + Sbjct: 96 WFYDLWCYVPEDETGEWQRWSYTT--IEGGNVSKHEVEAARAQLDNRTFRQEFEASF--E 151 Query: 292 EVNNFIPHNYIEEAMSREAIDDLYAPLIMGCD 323 + + ++ +E +S+EA D PL++G D Sbjct: 152 NLTGLVAISFSDENISQEAKDISIQPLLLGVD 183 >gi|294085818|ref|YP_003552578.1| hypothetical protein SAR116_2251 [Candidatus Puniceispirillum marinum IMCC1322] gi|292665393|gb|ADE40494.1| protein of unknown function DUF264 [Candidatus Puniceispirillum marinum IMCC1322] Length = 454 Score = 40.4 bits (93), Expect = 0.45, Method: Compositional matrix adjust. Identities = 64/280 (22%), Positives = 101/280 (36%), Gaps = 51/280 (18%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPH---R 138 + AGRG GKT A + WL + I + + + + S LS+ P+ Sbjct: 80 LMAGRGFGKTRAGAEWIRWLAQSGRARRIALVGETFDDARQVMVEGASGILSVCPNWARP 139 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFND 198 W Q + PSG R YS + P+ GP +G A D Sbjct: 140 AWRAGQRTLIWPSG------------------TIARCYSADDPEQLRGPEFDYGWA---D 178 Query: 199 EASG--TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTR 256 E + P + +L +P + + T R W D+ ED Q +R Sbjct: 179 EIAKWRYPSAWDNLMLALRIGKSPQ---CIATTTPRPVRWLADLAAA--EDTVLVQGASR 233 Query: 257 -TVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLY 315 + F + R+G DS +AR QE+ + N + R I L+ Sbjct: 234 ENAANLSPAFMAAMHRRFG-DSYLAR---------QELEGIMMSNLPDALWCRNDILRLH 283 Query: 316 APL---------IMGCDIAGEGGDKTVVVFRRGNIIEHIF 346 P+ ++G D A GGD+T ++ + HI+ Sbjct: 284 RPMPKRHRFIRIVIGVDPAMGGGDETGIITAGKDQDGHIW 323 >gi|134287454|ref|YP_001109621.1| hypothetical protein Bcep1808_7700 [Burkholderia vietnamiensis G4] gi|134131876|gb|ABO60570.1| hypothetical protein Bcep1808_7700 [Burkholderia vietnamiensis G4] Length = 509 Score = 39.7 bits (91), Expect = 0.83, Method: Compositional matrix adjust. Identities = 57/268 (21%), Positives = 111/268 (41%), Gaps = 29/268 (10%) Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLW---AEVSKWLSML 135 + ++S+G G GKT+ A + LW + + I A + + + +W A++S +S Sbjct: 54 RTSVSSGHGTGKTSGFAIIALWHLLCYYLSNTILTAPKISTVSDGVWKEFADLSTKISNG 113 Query: 136 PHR---HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG 192 P +F ++S ++ G+ ++ + ++ P+ G H Sbjct: 114 PQSWIWEYFVIESERVYVRGY------------KLNWFVIAKSAPRGSPENLAGAHRDW- 160 Query: 193 MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED----W 248 + DEASG PD I G T+ NR + + TR +G+FY+ + W Sbjct: 161 LLWLADEASGIPDDNFGVITGSLTD-ERNRMCLASQPTRS-SGFFYETHHALSRAEGGPW 218 Query: 249 KRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR 308 ++ + + F +Y + +I++ G+FP+ + IE + R Sbjct: 219 NNLVFNSEFSPIVSAKFIAEKKLQY--TEEEYQIKVQGRFPENSSKYLVGPQAIEACVGR 276 Query: 309 EAID-DLYAPLIMGCDIAGEG-GDKTVV 334 I D + ++ D+ G G D+TV+ Sbjct: 277 TVIKPDEHWGWLLPVDVGGGGWRDETVM 304 >gi|226479018|emb|CAX73004.1| Cell division control protein 42 homolog precursor [Schistosoma japonicum] Length = 98 Score = 38.9 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 43 KPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTL 93 KP++ P L+ + + H H N+NN NPTI KC + +GKT+L Sbjct: 2 KPIDGGFSPELPHLKKVRPQNTHGH-NINNENPTIVKCILIGDEQVGKTSL 51 >gi|153806881|ref|ZP_01959549.1| hypothetical protein BACCAC_01156 [Bacteroides caccae ATCC 43185] gi|149131558|gb|EDM22764.1| hypothetical protein BACCAC_01156 [Bacteroides caccae ATCC 43185] Length = 513 Score = 38.5 bits (88), Expect = 1.8, Method: Compositional matrix adjust. Identities = 23/71 (32%), Positives = 39/71 (54%), Gaps = 5/71 (7%) Query: 277 SDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAP---LIMGCDIAGEGGDK 331 +D+ R+++LG FP+ + IP+ +IE A +E + P +G D+AG G D Sbjct: 275 NDLFRVKVLGMFPKVSEDVLIPYEWIEIANRNWQELQASGFIPAKSCKLGVDVAGMGRDN 334 Query: 332 TVVVFRRGNII 342 +V+ R GN + Sbjct: 335 SVLCPRYGNYV 345 >gi|291334534|gb|ADD94186.1| hypothetical protein Syncc9605_0456 [uncultured phage MedDCM-OCT-S04-C1220] gi|291335526|gb|ADD95137.1| hypothetical protein Syncc9605_0456 [uncultured phage MedDCM-OCT-S04-C491] gi|291335665|gb|ADD95272.1| hypothetical protein Syncc9605_0456 [uncultured phage MedDCM-OCT-S04-C846] Length = 354 Score = 38.1 bits (87), Expect = 2.2, Method: Compositional matrix adjust. Identities = 26/92 (28%), Positives = 46/92 (50%), Gaps = 8/92 (8%) Query: 236 WFYDIFNIPLED----WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQ 291 WFYD++ +D W+R+ T ++G + HE +R LD+ R E F + Sbjct: 96 WFYDLWCYVPDDETNEWQRWSYTT--IDGGNVSKHEVEAARAQLDTRTFRQEFEASF--E 151 Query: 292 EVNNFIPHNYIEEAMSREAIDDLYAPLIMGCD 323 + + ++ +E +S+EA D PL++G D Sbjct: 152 NLTGLVAISFSDENISQEAKDISIQPLLLGVD 183 >gi|225155389|ref|ZP_03723881.1| hypothetical protein ObacDRAFT_9437 [Opitutaceae bacterium TAV2] gi|224803845|gb|EEG22076.1| hypothetical protein ObacDRAFT_9437 [Opitutaceae bacterium TAV2] Length = 479 Score = 38.1 bits (87), Expect = 2.4, Method: Compositional matrix adjust. Identities = 34/117 (29%), Positives = 52/117 (44%), Gaps = 5/117 (4%) Query: 233 LNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQE 292 L G F+D F+ + + ++Q I F E + ++YG DSD+ R ILGQ P+ Sbjct: 183 LFGRFHDAFS--QDRFAQFQAGIADCPHITPEFIEAMRAQYGEDSDIYRSMILGQRPKGN 240 Query: 293 VNNF-IPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDW 348 F +P E S + + CD A E D+ V+ R GN + + W Sbjct: 241 ETGFVVPFVDYERCESNPPVWQEGTKQVF-CDFA-ETSDECVIAKRDGNRLSIVDAW 295 >gi|186682890|ref|YP_001866086.1| hypothetical protein Npun_R2589 [Nostoc punctiforme PCC 73102] gi|186465342|gb|ACC81143.1| hypothetical protein Npun_R2589 [Nostoc punctiforme PCC 73102] Length = 543 Score = 38.1 bits (87), Expect = 2.4, Method: Compositional matrix adjust. Identities = 36/131 (27%), Positives = 59/131 (45%), Gaps = 18/131 (13%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWF 141 + A G GK+ + + ++++ + G++I A SE Q+K LWAE+ K Sbjct: 64 VKAAHGTGKSFIASLLVIYFLFCVGGVAITT-APSEDQVKWILWAELRK----------- 111 Query: 142 EMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEAS 201 + L G ++++ + IT R YSE ++F G H + + DEA Sbjct: 112 -IHGLHKTKLGGRCDIMQLLFSETVYAFGITSRDYSE---NSFQGQHRQKQL-LIEDEAD 166 Query: 202 G-TPDIINKSI 211 G TP I N I Sbjct: 167 GITPQIDNGFI 177 >gi|76156436|gb|AAX27647.2| SJCHGC05167 protein [Schistosoma japonicum] Length = 206 Score = 37.7 bits (86), Expect = 3.2, Method: Compositional matrix adjust. Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 43 KPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTL 93 KP++ P L+ + + H H N+NN NPTI KC + +GKT+L Sbjct: 2 KPIDGGFSPELPHLKKVRPQNTHGH-NINNENPTIVKCILIGDEQVGKTSL 51 >gi|29841054|gb|AAP06067.1| similar to NM_021205 CDC42-like GTPase; novel Ras family protein; Wrch-1; Ryu GTPase in Homo sapiens [Schistosoma japonicum] Length = 187 Score = 37.4 bits (85), Expect = 3.5, Method: Compositional matrix adjust. Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 1/51 (1%) Query: 43 KPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTL 93 KP++ P L+ + + H H N+NN NPTI KC + +GKT+L Sbjct: 2 KPIDGGFSPELPHLKKVRPQNTHGH-NINNENPTIVKCILIGDEQVGKTSL 51 >gi|119386463|ref|YP_917518.1| PBSX family phage terminase large subunit [Paracoccus denitrificans PD1222] gi|119377058|gb|ABL71822.1| phage terminase, large subunit, PBSX family [Paracoccus denitrificans PD1222] Length = 441 Score = 37.0 bits (84), Expect = 5.1, Method: Compositional matrix adjust. Identities = 17/55 (30%), Positives = 29/55 (52%) Query: 286 GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGN 340 G + + FI + EAM+R+ + L++G D+A G D++V+ RRG Sbjct: 214 GDYEAESDMQFIGGGLVREAMARQPFSQIGDELVLGVDVARFGDDRSVIWARRGR 268 >gi|291337121|gb|ADD96636.1| hypothetical protein Syncc9605_0456 [uncultured organism MedDCM-OCT-S12-C92] Length = 354 Score = 35.8 bits (81), Expect = 9.8, Method: Compositional matrix adjust. Identities = 25/92 (27%), Positives = 45/92 (48%), Gaps = 8/92 (8%) Query: 236 WFYDIFNIPLED----WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQ 291 WFYD++ +D W+R+ T ++G + HE +R LD+ R E F + Sbjct: 96 WFYDLWCYVPDDETNEWQRWSYTT--IDGGNVSKHEVEAARAQLDTRTFRQEFEASF--E 151 Query: 292 EVNNFIPHNYIEEAMSREAIDDLYAPLIMGCD 323 + + ++ ++ +S EA D PL++G D Sbjct: 152 NLTGLVAISFSDDNISTEAKDISIQPLLLGVD 183 Searching..................................................done Results from round 2 >gi|254781187|ref|YP_003065600.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] gi|254040864|gb|ACT57660.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] Length = 367 Score = 521 bits (1341), Expect = e-146, Method: Composition-based stats. Identities = 367/367 (100%), Positives = 367/367 (100%) Query: 1 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME Sbjct: 1 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 Query: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL Sbjct: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER Sbjct: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 Query: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI Sbjct: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 Query: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN Sbjct: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQEG 360 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQEG Sbjct: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQEG 360 Query: 361 CPVGSSI 367 CPVGSSI Sbjct: 361 CPVGSSI 367 >gi|254781215|ref|YP_003065628.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] gi|254040892|gb|ACT57688.1| putative phage terminase, large subunit [Candidatus Liberibacter asiaticus str. psy62] gi|317120680|gb|ADV02503.1| putative phage terminase large subunit [Liberibacter phage SC1] gi|317120824|gb|ADV02645.1| putative phage terminase large subunit [Candidatus Liberibacter asiaticus] Length = 511 Score = 512 bits (1319), Expect = e-143, Method: Composition-based stats. Identities = 252/359 (70%), Positives = 299/359 (83%) Query: 1 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 M R + T+ + EQ+L +++ E LSF NFV+ FFPWG KG PLE FS P WQLEFME Sbjct: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 Query: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 VD HC ++VNN NP +FK AISAGRGIGKTTLNAW++LWL+STRPG+S+IC+ANSETQL Sbjct: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 K TLWAEVSKWLS+LP++HWFEMQSLSLHP+ WY+++L S+GIDSKHY+ CRTYSEER Sbjct: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 Query: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 PDTFVG HNT+GMA+ NDEASGTPD+IN ILGF TE N NRFWIMTSN RRL+G FY+I Sbjct: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 Query: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 FN PL+DWKR+QIDTRTVEGID FHEGII+RYGLDSDV R+E+ GQFPQQ++++FIP N Sbjct: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLN 300 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 IEEA++RE D YAPLIMGCDIA EGGD TVVV RRG +IEH+FDWS ++ TN + Sbjct: 301 IIEEALNREPCPDPYAPLIMGCDIAEEGGDNTVVVLRRGPVIEHLFDWSKTDLRTTNNK 359 >gi|317120722|gb|ADV02544.1| putative phage terminase large subunit [Liberibacter phage SC2] gi|317120783|gb|ADV02604.1| putative phage terminase large subunit [Candidatus Liberibacter asiaticus] Length = 516 Score = 507 bits (1306), Expect = e-142, Method: Composition-based stats. Identities = 257/359 (71%), Positives = 302/359 (84%) Query: 1 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 M R + T+ + EQ+L +++ E LSF NFV+ FFPWG KG PLE FS P WQLEFME Sbjct: 1 MSRELPTNPETEQKLFDLMWSDEIKLSFSNFVLHFFPWGEKGTPLEGFSAPRSWQLEFME 60 Query: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 VD HC ++VNN NP +FK AISAGRGIGKTTLNAW++LWL+STRPG+S+IC+ANSETQL Sbjct: 61 VVDAHCLNSVNNPNPEVFKGAISAGRGIGKTTLNAWLVLWLMSTRPGISVICLANSETQL 120 Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 K TLWAEVSKWLS+LP++HWFEMQSLSLHP+ WY+++L S+GIDSKHY+ CRTYSEER Sbjct: 121 KTTLWAEVSKWLSLLPNKHWFEMQSLSLHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEER 180 Query: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 PDTFVG HNT+GMA+ NDEASGTPD+IN ILGF TE N NRFWIMTSN RRL+G FY+I Sbjct: 181 PDTFVGHHNTYGMAIINDEASGTPDVINLGILGFLTERNANRFWIMTSNPRRLSGKFYEI 240 Query: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 FN PL+DWKR+QIDTRTVEGID FHEGII+RYGLDSDV R+E+ GQFPQQ++++FIP Sbjct: 241 FNKPLDDWKRFQIDTRTVEGIDPSFHEGIIARYGLDSDVTRVEVCGQFPQQDIDSFIPQQ 300 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 YI EA+ R AI D YAPLIMGCDIAGEG DKTVVV RRGNIIE IFDWS +LI+ TN++ Sbjct: 301 YIVEALERVAIPDPYAPLIMGCDIAGEGEDKTVVVLRRGNIIERIFDWSGELIEVTNRK 359 >gi|315122902|ref|YP_004063391.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496304|gb|ADR52903.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 509 Score = 480 bits (1234), Expect = e-133, Method: Composition-based stats. Identities = 262/359 (72%), Positives = 303/359 (84%) Query: 1 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 M R + T + EQEL E++ + LSF NFV+R FPW L +FS+P RWQL+FME Sbjct: 1 MTRELPTKIEHEQELMELMFSDDIKLSFTNFVLRLFPWSEANTSLANFSRPRRWQLDFME 60 Query: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 AVD C NV+N +P IFK A+SAGRGIGKTTLNAWMMLWLISTRPGMSI+C+ANSETQL Sbjct: 61 AVDTDCLFNVDNPDPKIFKGAVSAGRGIGKTTLNAWMMLWLISTRPGMSILCLANSETQL 120 Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 K+TLWAEVSKWLSMLP++HWFEMQSLSLHP+ WYAE LE++ GIDSKHYTITCRTYSEER Sbjct: 121 KSTLWAEVSKWLSMLPNKHWFEMQSLSLHPAVWYAEALEKNFGIDSKHYTITCRTYSEER 180 Query: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 PDTFVG HNT+GMA+FNDEASGTPD+IN SILGFFTE N NRFW+MTSN RRLNGWFYDI Sbjct: 181 PDTFVGHHNTYGMAIFNDEASGTPDVINTSILGFFTENNANRFWVMTSNPRRLNGWFYDI 240 Query: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 FN+PLEDW+R+QIDTRTVEGID FHE II+RYGLDSDV R+E+LGQFPQQ++N+FIP Sbjct: 241 FNVPLEDWQRFQIDTRTVEGIDPNFHENIIARYGLDSDVTRVEVLGQFPQQDINSFIPFY 300 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 IEEA++RE I D YAPL+MGCDIAGEGGD TVVV RRG IEHIFDWS + ++++ Sbjct: 301 RIEEALNREPIKDPYAPLVMGCDIAGEGGDNTVVVLRRGTNIEHIFDWSGLAVNVSSRK 359 >gi|315121940|ref|YP_004062429.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495342|gb|ADR51941.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 509 Score = 479 bits (1232), Expect = e-133, Method: Composition-based stats. Identities = 264/359 (73%), Positives = 303/359 (84%) Query: 1 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 M R + T + EQEL E++ + LSF NFV+R FPW L +FS+P RWQL+FME Sbjct: 1 MTRELPTKIEHEQELMELMFSDDIKLSFTNFVLRLFPWSEANTSLANFSRPRRWQLDFME 60 Query: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 AVD C NV+N +P IFK A+SAGRGIGKTTLNAWMMLWLISTRPGMSI+C+ANSETQL Sbjct: 61 AVDTDCLFNVDNPDPKIFKGAVSAGRGIGKTTLNAWMMLWLISTRPGMSILCLANSETQL 120 Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 K+TLWAEVSKWLSMLP++HWFEMQSLSLHP+ WYAE LE++ GIDSKHYTITCRTYSEER Sbjct: 121 KSTLWAEVSKWLSMLPNKHWFEMQSLSLHPAVWYAEALEKNFGIDSKHYTITCRTYSEER 180 Query: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 PDTFVG HNT+GMA+FNDEASGTPD+IN SILGFFTE N NRFW+MTSN RRL GWFYDI Sbjct: 181 PDTFVGHHNTYGMAIFNDEASGTPDVINTSILGFFTENNANRFWVMTSNPRRLKGWFYDI 240 Query: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 FN+PLEDW+R+QIDTRTVEGID FHEGIISRYGLDSDV R+E+LGQFPQQ++N+FIP Sbjct: 241 FNVPLEDWQRFQIDTRTVEGIDPSFHEGIISRYGLDSDVTRVEVLGQFPQQDINSFIPFY 300 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 IEEA++RE I D YAPLIMGCDIAGEGGD TVVV RRG IEHIFDWS + ++++ Sbjct: 301 RIEEALNREPIKDPYAPLIMGCDIAGEGGDNTVVVLRRGTNIEHIFDWSGLAVNASSRK 359 >gi|212710820|ref|ZP_03318948.1| hypothetical protein PROVALCAL_01888 [Providencia alcalifaciens DSM 30120] gi|212686517|gb|EEB46045.1| hypothetical protein PROVALCAL_01888 [Providencia alcalifaciens DSM 30120] Length = 493 Score = 395 bits (1014), Expect = e-108, Method: Composition-based stats. Identities = 98/355 (27%), Positives = 157/355 (44%), Gaps = 20/355 (5%) Query: 4 LISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVD 63 +I T EQ ++++ M LS + + FPWG G LE+ S P +WQ E + + Sbjct: 1 MIETMSPEEQLINDIGMFTHDPLS---YALYAFPWGEAGTELENASGPRQWQAEALNEIG 57 Query: 64 VHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNT 123 H + P + A ++G GIGK+ + ++ W + T ++ AN+E QL+ Sbjct: 58 EHLRNPETRHQP--LQLARASGHGIGKSAFISMIIKWGMDTCEDCKVVVTANTENQLRTK 115 Query: 124 LWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDT 183 W E++KW + + WF +++ + + + +SE + Sbjct: 116 TWPEIAKWQRLSITKDWFTCTKTAIYSNDP----------NHANAWRADAVPWSENNTEA 165 Query: 184 FVGPHNTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN 242 F G HN + + DEAS D++ + G T+ N WI N R G F + F Sbjct: 166 FAGLHNQGKRIILVFDEASNIADLVWEVAEGALTDENTEIIWIAFGNPTRNTGRFRECFR 225 Query: 243 IPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYI 302 WK QID+RTVEG + E I YG+D D ++ + G FP FIP Sbjct: 226 KFKHRWKTKQIDSRTVEGTNKEQIEKWIQDYGVDDDFVKVRVRGIFPSTSEKQFIPTGLT 285 Query: 303 EEAMSREAIDDL--YAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 + AM R +AP+I+G D A G D V+ R+G + + W+ + Sbjct: 286 DAAMKRTVTQAEVSHAPIILGVDPAYSGDDDAVIYLRQGLHSKCL--WTGSKTID 338 >gi|268589373|ref|ZP_06123594.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] gi|291315400|gb|EFE55853.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] Length = 493 Score = 393 bits (1009), Expect = e-107, Method: Composition-based stats. Identities = 97/355 (27%), Positives = 157/355 (44%), Gaps = 20/355 (5%) Query: 4 LISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVD 63 +I T EQ ++++ M LS + + FPWG G LE+ + P +WQ E + + Sbjct: 1 MIDTMSPEEQLINDIGMFTHDPLS---YALYAFPWGEAGTELENANGPRQWQAEALNEIG 57 Query: 64 VHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNT 123 H + P + A ++G GIGK+ + ++ W + T ++ AN+E QL+ Sbjct: 58 EHLRNPETRHQP--LQLARASGHGIGKSAFISMIIKWGMDTCEDCKVVVTANTENQLRTK 115 Query: 124 LWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDT 183 W E++KW + + WF +++ + + + +SE + Sbjct: 116 TWPEIAKWQRLSITKDWFTYTKTAIYSNDP----------NHANAWRADAVPWSENNTEA 165 Query: 184 FVGPHNTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN 242 F G HN + + DEAS D++ + G T+ N WI N R G F + F Sbjct: 166 FAGLHNQGKRIILIFDEASNIADLVWEVAEGALTDENTEIIWIAFGNPTRNTGRFRECFR 225 Query: 243 IPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYI 302 WK QID+RTVEG + E I YG+D D ++ + G FP FIP Sbjct: 226 KFKHRWKTKQIDSRTVEGTNKEQIEKWIQDYGVDDDFVKVRVRGIFPSTSEKQFIPTGLT 285 Query: 303 EEAMSREAIDDL--YAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 + AM R +AP+I+G D A G D V+ R+G + + W+ + Sbjct: 286 DAAMKRTVTQAEVSHAPIIIGVDPAYSGDDDAVIYLRQGLHSKCL--WTGSKTID 338 >gi|215487825|ref|YP_002330256.1| predicted terminase, large subunit [Escherichia coli O127:H6 str. E2348/69] gi|215265897|emb|CAS10306.1| predicted terminase, large subunit [Escherichia coli O127:H6 str. E2348/69] Length = 493 Score = 390 bits (1001), Expect = e-106, Method: Composition-based stats. Identities = 88/346 (25%), Positives = 152/346 (43%), Gaps = 18/346 (5%) Query: 6 STDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVH 65 EQ + ++ L + + FPWG G L H + P +WQ + + H Sbjct: 4 EAMSPEEQLVEDIASFTYDPL---GYALYAFPWGEDGTELAHATGPRKWQADAFREIRDH 60 Query: 66 CHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLW 125 + P + A ++G GIGK+ + ++ W +ST ++ AN++ QL+ W Sbjct: 61 LQNPATRHQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTW 118 Query: 126 AEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFV 185 E+ KW ++ + WF + +++ + + K + +SE + F Sbjct: 119 PEIIKWSNLAITKEWFTCTATAMYSNDPGHD----------KRWRADAIPWSEHNTEAFA 168 Query: 186 GPHNTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIP 244 G HN + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 169 GLHNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKY 228 Query: 245 LEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEE 304 WK QID+RTVEG + + + YG DSD ++ + G FP N FIP + Sbjct: 229 KHRWKCAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKVRVRGIFPDASENQFIPSGLTQP 288 Query: 305 AMSR--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDW 348 A+ R +A +++G D + +G D V+ R+G + + +W Sbjct: 289 AVGRVITPAQVQHAAVVLGVDPSHQGKDPAVIYLRQGLHCKKLGEW 334 >gi|330007152|ref|ZP_08305894.1| hypothetical protein HMPREF9538_03583 [Klebsiella sp. MS 92-3] gi|328535499|gb|EGF61959.1| hypothetical protein HMPREF9538_03583 [Klebsiella sp. MS 92-3] Length = 495 Score = 390 bits (1001), Expect = e-106, Method: Composition-based stats. Identities = 99/356 (27%), Positives = 161/356 (45%), Gaps = 25/356 (7%) Query: 7 TDQKL--EQELHEMLMHAECVLSFKN----FVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 TD L E++L E L+ + + SF + + + FPWG G L H S P +WQ + Sbjct: 2 TDAALSPEEQLKEQLI--DDIASFTHDPLGYALYAFPWGEDGTELAHASGPRQWQADAFR 59 Query: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 + H + P + + ++G GIGK+ + ++ W +ST ++ AN++ QL Sbjct: 60 EIGEHLQNPATRHQPLMI--SRASGHGIGKSAFISMLINWAMSTCEDCKVVVTANTDNQL 117 Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 + W E+ KW ++ + WF + +++ + + K + +SE Sbjct: 118 RTKTWPEIIKWSNLAITKEWFTCTATAMYSNDPGHD----------KRWRADAIPWSEHN 167 Query: 181 PDTFVGPHNTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYD 239 + F G HN + V DEAS D++ + G T+ + W+ N R G F + Sbjct: 168 TEAFAGLHNERKRIVVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRE 227 Query: 240 IFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPH 299 F WK QID+RTVEG + + + YG DSD ++ + G FP FIP Sbjct: 228 CFRKYKHRWKCAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKVRVRGIFPDASELQFIPT 287 Query: 300 NYIEEAMSR--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 +EAM R A +AP I+G D A G D V+ R+G + + W+ Sbjct: 288 GLTDEAMKRVVTAAQVAHAPRIIGVDPAYSGVDDAVIYLRQGLHSKVL--WTGNKT 341 >gi|309702815|emb|CBJ02146.1| putative terminase, large subunit [Escherichia coli ETEC H10407] Length = 493 Score = 389 bits (998), Expect = e-106, Method: Composition-based stats. Identities = 88/344 (25%), Positives = 153/344 (44%), Gaps = 18/344 (5%) Query: 8 DQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCH 67 EQ + ++ L + + FPWG +G L H + P +WQ + + H Sbjct: 6 MSPEEQLVEDIAGFTYDPL---GYALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQ 62 Query: 68 SNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAE 127 + P + A ++G GIGK+ + ++ W +ST ++ AN++ QL+ W E Sbjct: 63 NPATRHQPIML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGP 187 + KW ++ + WF + +++ + + K + +SE + F G Sbjct: 121 IIKWSNLAITKEWFTCTATAMYSNDPGHD----------KRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLE 246 HN + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAM 306 WK QID+RTVEG + + + YG DSD ++ + G FP N FIP + A+ Sbjct: 231 RWKCAQIDSRTVEGTNKEQLQKWVDDYGEDSDFVKVRVRGIFPDASENQFIPSGLTQPAV 290 Query: 307 SR--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDW 348 R +A +++G D + +G D V+ R+G + + +W Sbjct: 291 GRVITPAQVQHAAVVLGVDPSHQGKDPAVIYLRQGLHCKKLGEW 334 >gi|262043569|ref|ZP_06016682.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039103|gb|EEW40261.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 491 Score = 389 bits (998), Expect = e-106, Method: Composition-based stats. Identities = 93/348 (26%), Positives = 153/348 (43%), Gaps = 20/348 (5%) Query: 9 QKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHS 68 EQ + ++ L + + FPWG G L H + P +WQ + + H + Sbjct: 7 SPEEQLIDDIASFTHDPL---GYALYAFPWGEDGTELAHATGPRKWQADAFREIRDHLQN 63 Query: 69 NVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEV 128 P + A ++G GIGK+ + ++ W +ST ++ AN++ QL+ W E+ Sbjct: 64 PATRHQPLML--ARASGHGIGKSAFISMLINWAMSTCEDCKVVVTANTDNQLRTKTWPEI 121 Query: 129 SKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPH 188 KW ++ + WF + +++ + + K + +SE + F G H Sbjct: 122 IKWSNLAITKEWFTCTATAMYSNDPGHD----------KRWRADAIPWSEHNTEAFAGLH 171 Query: 189 NTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED 247 N + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 172 NERKRIVVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHR 231 Query: 248 WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 WK QID+RTVEG + + + YG DSD ++ + G FP FIP +EAM Sbjct: 232 WKCAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKVRVRGIFPDASELQFIPTGLTDEAMK 291 Query: 308 R--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 R A+ +AP I+G D A G D V+ R+G + + W+ Sbjct: 292 RVVTAVQVAHAPRIIGVDPAYSGVDDAVIYLRQGLHSKVL--WTGNKT 337 >gi|323156136|gb|EFZ42295.1| terminase large subunit [Escherichia coli EPECa14] Length = 491 Score = 389 bits (998), Expect = e-106, Method: Composition-based stats. Identities = 95/348 (27%), Positives = 155/348 (44%), Gaps = 20/348 (5%) Query: 9 QKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHS 68 EQ + ++ L + + FPWG +G L H + P +WQ + + H + Sbjct: 7 SPEEQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQN 63 Query: 69 NVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEV 128 P + A+++G GIGK+ + ++ W +ST ++ AN++ QL+ W E+ Sbjct: 64 PATRYQPLML--ALASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEI 121 Query: 129 SKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPH 188 KW ++ + WF + +++ + + K + +SE + F G H Sbjct: 122 IKWSNLAITKDWFTCTATAMYSNDLGHD----------KRWRADAIPWSEHNTEAFAGLH 171 Query: 189 NTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED 247 N + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 172 NERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHR 231 Query: 248 WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 WK QID+RTVEG + + + YG DSD +I + G FP FIP +EAM Sbjct: 232 WKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMK 291 Query: 308 R--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 R A YAP+I+G D A G D V+ R+G + + W+ Sbjct: 292 RVVTAAQVAYAPVIIGVDPAYSGVDDAVIYLRQGLHSKVL--WTGNKT 337 >gi|332344357|gb|AEE57691.1| terminase, large subunit [Escherichia coli UMNK88] Length = 491 Score = 388 bits (997), Expect = e-106, Method: Composition-based stats. Identities = 94/349 (26%), Positives = 154/349 (44%), Gaps = 20/349 (5%) Query: 8 DQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCH 67 EQ + ++ L + + FPWG +G L H + P +WQ + + H Sbjct: 6 MSPEEQLVEDIASFTYDPL---GYALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQ 62 Query: 68 SNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAE 127 + P + A ++G GIGK+ + ++ W +ST ++ AN++ QL+ W E Sbjct: 63 NPATRHQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 128 VSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGP 187 + KW ++ + WF + +++ + + K + +SE + F G Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSNDPGHD----------KRWRADAIPWSEHNTEAFAGL 170 Query: 188 HNTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLE 246 HN + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 171 HNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKH 230 Query: 247 DWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAM 306 WK QID+RTVEG + + + YG DSD +I + G FP FIP +EAM Sbjct: 231 RWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAM 290 Query: 307 SR--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 R A +AP+I+G D A G D V+ R+G + + W+ Sbjct: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVL--WTGNKT 337 >gi|327252187|gb|EGE63859.1| terminase large subunit [Escherichia coli STEC_7v] Length = 491 Score = 388 bits (996), Expect = e-106, Method: Composition-based stats. Identities = 94/348 (27%), Positives = 154/348 (44%), Gaps = 20/348 (5%) Query: 9 QKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHS 68 EQ + ++ L + + FPWG +G L H + P +WQ + + H + Sbjct: 7 SPEEQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQN 63 Query: 69 NVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEV 128 P + A ++G GIGK+ + ++ W +ST ++ AN++ QL+ W E+ Sbjct: 64 PATRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEI 121 Query: 129 SKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPH 188 KW ++ + WF + +++ + + K + +SE + F G H Sbjct: 122 IKWSNLAITKDWFTCTATAMYSNDPGHD----------KRWRADAIPWSEHNTEAFAGLH 171 Query: 189 NTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED 247 N + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 172 NERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHR 231 Query: 248 WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 WK QID+RTVEG + + + YG DSD +I + G FP FIP +EAM Sbjct: 232 WKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMK 291 Query: 308 R--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 R A +AP+I+G D A G D V+ R+G + + W+ Sbjct: 292 RVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVL--WTGNKT 337 >gi|301046412|ref|ZP_07193572.1| conserved hypothetical protein [Escherichia coli MS 185-1] gi|300301638|gb|EFJ58023.1| conserved hypothetical protein [Escherichia coli MS 185-1] Length = 491 Score = 387 bits (995), Expect = e-105, Method: Composition-based stats. Identities = 94/352 (26%), Positives = 155/352 (44%), Gaps = 20/352 (5%) Query: 5 ISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDV 64 ++ EQ + ++ L + + FPWG G L H + P +WQ + + Sbjct: 3 VAAMSPEEQLVEDIASFTYDPL---GYALYAFPWGEDGTELAHATGPRQWQADAFREIRD 59 Query: 65 HCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTL 124 H + P + A ++G GIGK+ + ++ W +ST ++ AN++ QL+ Sbjct: 60 HLQNPETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKT 117 Query: 125 WAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTF 184 W E+ KW ++ + WF + +++ + + K + +SE + F Sbjct: 118 WPEIIKWSNLAITKDWFTCTATAMYSNDPGHD----------KRWRADAIPWSEHNTEAF 167 Query: 185 VGPHNTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI 243 G HN + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 168 AGLHNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRK 227 Query: 244 PLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIE 303 WK QID+RTVEG + + + YG DSD +I + G FP FIP + Sbjct: 228 YKHRWKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTD 287 Query: 304 EAMSR--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 EAM R A +AP+I+G D A G D V+ R+G + + W+ Sbjct: 288 EAMKRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVL--WTGNKT 337 >gi|331648179|ref|ZP_08349269.1| conserved hypothetical protein [Escherichia coli M605] gi|331043039|gb|EGI15179.1| conserved hypothetical protein [Escherichia coli M605] Length = 491 Score = 387 bits (995), Expect = e-105, Method: Composition-based stats. Identities = 94/348 (27%), Positives = 154/348 (44%), Gaps = 20/348 (5%) Query: 9 QKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHS 68 EQ + ++ L + + FPWG +G L H + P +WQ + + H + Sbjct: 7 SPEEQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQN 63 Query: 69 NVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEV 128 P + A ++G GIGK+ + ++ W +ST ++ AN++ QL+ W E+ Sbjct: 64 PETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEI 121 Query: 129 SKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPH 188 KW ++ + WF + +++ + + K + +SE + F G H Sbjct: 122 IKWSNLAITKDWFTCTATAMYSNDPGHD----------KRWRADAIPWSEHNTEAFAGLH 171 Query: 189 NTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED 247 N + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 172 NERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHR 231 Query: 248 WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 WK QID+RTVEG + + + YG DSD +I + G FP FIP +EAM Sbjct: 232 WKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMK 291 Query: 308 R--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 R A +AP+I+G D A G D V+ R+G + + W+ Sbjct: 292 RVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVL--WTGNKT 337 >gi|300898423|ref|ZP_07116764.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357890|gb|EFJ73760.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 491 Score = 387 bits (995), Expect = e-105, Method: Composition-based stats. Identities = 94/348 (27%), Positives = 154/348 (44%), Gaps = 20/348 (5%) Query: 9 QKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHS 68 EQ + ++ L + + FPWG +G L H + P +WQ + + H + Sbjct: 7 SPEEQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRKWQADAFREIRDHLQN 63 Query: 69 NVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEV 128 P + A ++G GIGK+ + ++ W +ST ++ AN++ QL+ W E+ Sbjct: 64 PETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEI 121 Query: 129 SKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPH 188 KW ++ + WF + +++ + + K + +SE + F G H Sbjct: 122 IKWSNLAITKDWFTCTATAMYSNDPGHD----------KRWRADAIPWSEHNTEAFAGLH 171 Query: 189 NTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED 247 N + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 172 NERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHR 231 Query: 248 WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 WK QID+RTVEG + + + YG DSD +I + G FP FIP +EAM Sbjct: 232 WKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMK 291 Query: 308 R--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 R A +AP+I+G D A G D V+ R+G + + W+ Sbjct: 292 RVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVL--WTGNKT 337 >gi|298381721|ref|ZP_06991320.1| terminase large subunit protein [Escherichia coli FVEC1302] gi|301019339|ref|ZP_07183525.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|298279163|gb|EFI20677.1| terminase large subunit protein [Escherichia coli FVEC1302] gi|299882256|gb|EFI90467.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|323948690|gb|EGB44595.1| hypothetical protein ERKG_04913 [Escherichia coli H252] Length = 491 Score = 387 bits (995), Expect = e-105, Method: Composition-based stats. Identities = 94/348 (27%), Positives = 154/348 (44%), Gaps = 20/348 (5%) Query: 9 QKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHS 68 EQ + ++ L + + FPWG +G L H + P +WQ + + H + Sbjct: 7 SPEEQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQN 63 Query: 69 NVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEV 128 P + A ++G GIGK+ + ++ W +ST ++ AN++ QL+ W E+ Sbjct: 64 PETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEI 121 Query: 129 SKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPH 188 KW ++ + WF + +++ + + K + +SE + F G H Sbjct: 122 IKWSNLAITKDWFTCTATAMYSNDPGHD----------KRWRADAIPWSEHNTEAFAGLH 171 Query: 189 NTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED 247 N + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 172 NERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHR 231 Query: 248 WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 WK QID+RTVEG + + + YG DSD +I + G FP FIP +EAM Sbjct: 232 WKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMK 291 Query: 308 R--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 R A +AP+I+G D A G D V+ R+G + + W+ Sbjct: 292 RVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVL--WTGNKT 337 >gi|294491573|gb|ADE90329.1| putative phage terminase, large subunit [Escherichia coli IHE3034] Length = 491 Score = 387 bits (994), Expect = e-105, Method: Composition-based stats. Identities = 94/348 (27%), Positives = 154/348 (44%), Gaps = 20/348 (5%) Query: 9 QKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHS 68 EQ + ++ L + + FPWG +G L H + P +WQ + + H + Sbjct: 7 SPEEQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQN 63 Query: 69 NVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEV 128 P + A ++G GIGK+ + ++ W +ST ++ AN++ QL+ W E+ Sbjct: 64 PETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEI 121 Query: 129 SKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPH 188 KW ++ + WF + +++ + + K + +SE + F G H Sbjct: 122 IKWSNLAITKDWFTCTATAMYSNDPGHD----------KRWRADAIPWSEHNTEAFAGLH 171 Query: 189 NTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED 247 N + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 172 NERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHR 231 Query: 248 WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 WK QID+RTVEG + + + YG DSD +I + G FP FIP +EAM Sbjct: 232 WKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMK 291 Query: 308 R--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 R A +AP+I+G D A G D V+ R+G + + W+ Sbjct: 292 RVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVL--WTGNKT 337 >gi|324008564|gb|EGB77783.1| hypothetical protein HMPREF9532_01752 [Escherichia coli MS 57-2] Length = 491 Score = 387 bits (994), Expect = e-105, Method: Composition-based stats. Identities = 94/348 (27%), Positives = 154/348 (44%), Gaps = 20/348 (5%) Query: 9 QKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHS 68 EQ + ++ L + + FPWG +G L H + P +WQ + + H + Sbjct: 7 SPEEQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQN 63 Query: 69 NVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEV 128 P + A ++G GIGK+ + ++ W +ST ++ AN++ QL+ W E+ Sbjct: 64 PETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEI 121 Query: 129 SKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPH 188 KW ++ + WF + +++ + + K + +SE + F G H Sbjct: 122 IKWSNLAITKDWFTCTATAMYSNDLGHD----------KRWRADAIPWSEHNTEAFAGLH 171 Query: 189 NTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED 247 N + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 172 NERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHR 231 Query: 248 WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 WK QID+RTVEG + + + YG DSD +I + G FP FIP +EAM Sbjct: 232 WKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMK 291 Query: 308 R--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 R A +AP+I+G D A G D V+ R+G + + W+ Sbjct: 292 RVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVL--WTGNKT 337 >gi|30387381|ref|NP_848210.1| terminase large subunit [Enterobacteria phage epsilon15] gi|30266036|gb|AAO06065.1| terminase large subunit [Salmonella phage epsilon15] Length = 491 Score = 387 bits (994), Expect = e-105, Method: Composition-based stats. Identities = 94/352 (26%), Positives = 158/352 (44%), Gaps = 23/352 (6%) Query: 5 ISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDV 64 IST+++L +++ A + + FPWG G L H + P +WQ + + Sbjct: 6 ISTEEQLVEDI------ASFTYDPLGYALYAFPWGEDGTELAHATGPRKWQADAFREIRD 59 Query: 65 HCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTL 124 H + P + A ++G GIGK+ + ++ W +ST ++ AN++ QL+ Sbjct: 60 HLQNPATRHQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKT 117 Query: 125 WAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTF 184 W E+ KW ++ + WF + +++ + + K + +SE + F Sbjct: 118 WPEIIKWSNLAITKEWFTCTATAMYSNDPGHD----------KRWRADAIPWSEHNTEAF 167 Query: 185 VGPHNTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI 243 G HN + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 168 AGLHNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRK 227 Query: 244 PLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIE 303 WK QID+RTVEG + + + YG +SD ++ + G FP FIP + Sbjct: 228 YKHRWKCAQIDSRTVEGTNKQQLQKWVDDYGEESDFVKVRVRGIFPDASELQFIPTGLTD 287 Query: 304 EAMSR--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 EAM R A +AP+I+G D A G D V+ R+G + + W+ Sbjct: 288 EAMKRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVL--WTGNKT 337 >gi|218700994|ref|YP_002408623.1| putative phage terminase, large subunit [Escherichia coli IAI39] gi|218370980|emb|CAR18807.1| putative phage terminase, large subunit [Escherichia coli IAI39] Length = 491 Score = 387 bits (993), Expect = e-105, Method: Composition-based stats. Identities = 94/348 (27%), Positives = 154/348 (44%), Gaps = 20/348 (5%) Query: 9 QKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHS 68 EQ + ++ L + + FPWG +G L H + P +WQ + + H + Sbjct: 7 SPEEQLIDDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQN 63 Query: 69 NVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEV 128 P + A ++G GIGK+ + ++ W +ST ++ AN++ QL+ W E+ Sbjct: 64 PETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEI 121 Query: 129 SKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPH 188 KW ++ + WF + +++ + + K + +SE + F G H Sbjct: 122 IKWSNLAITKDWFTCTATAMYSNDPGHD----------KRWRADAIPWSEHNTEAFAGLH 171 Query: 189 NTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED 247 N + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 172 NERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHR 231 Query: 248 WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 WK QID+RTVEG + + + YG DSD +I + G FP FIP +EAM Sbjct: 232 WKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMK 291 Query: 308 R--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 R A +AP+I+G D A G D V+ R+G + + W+ Sbjct: 292 RVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVL--WTGNKT 337 >gi|117624715|ref|YP_853628.1| putative phage terminase, large subunit [Escherichia coli APEC O1] gi|115513839|gb|ABJ01914.1| putative phage terminase, large subunit [Escherichia coli APEC O1] Length = 491 Score = 386 bits (992), Expect = e-105, Method: Composition-based stats. Identities = 93/348 (26%), Positives = 154/348 (44%), Gaps = 20/348 (5%) Query: 9 QKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHS 68 EQ + ++ L + + FPWG +G L H + P +WQ + + H + Sbjct: 7 SPEEQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQN 63 Query: 69 NVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEV 128 P + A ++G GIGK+ + ++ W +ST ++ AN++ QL+ W E+ Sbjct: 64 PETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEI 121 Query: 129 SKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPH 188 KW ++ + WF + +++ + + K + +SE + F G H Sbjct: 122 IKWSNLAITKDWFTCTATAMYSNDPGHD----------KRWRADAIPWSEHNTEAFAGLH 171 Query: 189 NTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED 247 N + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 172 NERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHR 231 Query: 248 WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 WK QID+RTVEG + + + YG DSD +I + G FP FIP +EAM Sbjct: 232 WKTAQIDSRTVEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMK 291 Query: 308 R--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 R A ++P+I+G D A G D V+ R+G + + W+ Sbjct: 292 RVVTAAQVAHSPVIIGVDPAYSGVDDAVIYLRQGLHSKVL--WTGNKT 337 >gi|89152423|ref|YP_512256.1| putative terminase large subunit [Escherichia phage phiV10] gi|74055446|gb|AAZ95895.1| putative terminase large subunit [Escherichia phage phiV10] Length = 491 Score = 386 bits (990), Expect = e-105, Method: Composition-based stats. Identities = 93/348 (26%), Positives = 153/348 (43%), Gaps = 20/348 (5%) Query: 9 QKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHS 68 EQ + ++ L + + FPWG +G L H + P +WQ + + H + Sbjct: 7 SPEEQLIEDIAGFTHDPL---GYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQN 63 Query: 69 NVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEV 128 P + A ++G GIGK+ + ++ W +ST ++ AN++ QL+ W E+ Sbjct: 64 PETRYQPLML--ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEI 121 Query: 129 SKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPH 188 KW ++ + WF + +++ + + K + +SE + F G H Sbjct: 122 IKWSNLAITKDWFTCTATAMYSNDPGHD----------KRWRADAIPWSEHNTEAFAGLH 171 Query: 189 NTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED 247 N + V DEAS D++ + G T+ + W+ N R G F + F Sbjct: 172 NERKRIIVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHR 231 Query: 248 WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 WK QID+RTVEG + + + YG SD +I + G FP FIP +EAM Sbjct: 232 WKTAQIDSRTVEGTNKQQLQKWVDDYGEGSDFVKIRVRGIFPDASELQFIPTGLTDEAMK 291 Query: 308 R--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 R A +AP+I+G D A G D V+ R+G + + W+ Sbjct: 292 RVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVL--WTGNKT 337 >gi|320175050|gb|EFW50163.1| terminase B protein, putative [Shigella dysenteriae CDC 74-1112] Length = 480 Score = 383 bits (984), Expect = e-104, Method: Composition-based stats. Identities = 93/338 (27%), Positives = 152/338 (44%), Gaps = 21/338 (6%) Query: 23 ECVLSFKN----FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIF 78 E + F + + + FPWG +G L H + P +WQ + + H + P + Sbjct: 3 EDIAGFTHDPLGYALYAFPWGEEGTELAHATGPRQWQADAFREIRDHLQNPETRYQPLML 62 Query: 79 KCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 A ++G GIGK+ + ++ W +ST ++ AN++ QL+ W E+ KW ++ + Sbjct: 63 --ARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITK 120 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG-MAVFN 197 WF + +++ + + K + +SE + F G HN + V Sbjct: 121 DWFTCTATAMYSNDPGHD----------KRWRADAIPWSEHNTEAFAGLHNERKRIIVVF 170 Query: 198 DEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRT 257 DEAS D++ + G T+ + W+ N R G F + F WK QID+RT Sbjct: 171 DEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSRT 230 Query: 258 VEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLY 315 VEG + + + YG DSD +I + G FP FIP +EAM R A + Sbjct: 231 VEGTNKQQLQKWVDDYGEDSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVAH 290 Query: 316 APLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 AP+I+G D A G D V+ R+G + + W+ Sbjct: 291 APVIIGVDPAYSGVDDAVIYLRQGLHSKVL--WTGNKT 326 >gi|227355862|ref|ZP_03840255.1| phage terminase, large subunit [Proteus mirabilis ATCC 29906] gi|227164181|gb|EEI49078.1| phage terminase, large subunit [Proteus mirabilis ATCC 29906] Length = 494 Score = 380 bits (976), Expect = e-103, Method: Composition-based stats. Identities = 94/352 (26%), Positives = 154/352 (43%), Gaps = 23/352 (6%) Query: 9 QKLEQELHEMLMHAECVLSFKN----FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDV 64 + L++ E L+ E + SF + + FPWG G LE ++ P +WQ E + + Sbjct: 3 EALQKSPEEQLI--EDIASFTHDPLGYAYYAFPWGEAGGELEEYNGPRQWQAEALNEIGE 60 Query: 65 HCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTL 124 H + P + A ++G GIGK+ + ++ W + T ++ AN+E QL+ Sbjct: 61 HLRNPKTRHQPLLL--ARASGHGIGKSAFISMIIKWGMDTCEDCKVVVTANTENQLRTKT 118 Query: 125 WAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTF 184 W E++KW + +WF +++ + + + +SE + F Sbjct: 119 WPEIAKWQRLSLTNNWFTCTKTAIYSNDP----------NHANAWRADAVPWSENNTEAF 168 Query: 185 VGPHNTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI 243 G HN + + DEAS D++ + G T+ WI N R G F + F Sbjct: 169 AGLHNKGKRIILVFDEASNIADLVWEVAEGALTDEGTEIIWIAFGNPTRNTGRFRECFRK 228 Query: 244 PLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIE 303 W QID+RTVEG + + YG DSD ++ + G FP FIP + Sbjct: 229 FKHRWNTKQIDSRTVEGSNKEQIKNWEEDYGEDSDFFKVRVRGVFPSASELQFIPTGLTD 288 Query: 304 EAMSR--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 EAM R + +AP+I+G D A G D V+ R+G + + W+ Sbjct: 289 EAMKRIVTQAEVAHAPVIIGVDPAYSGIDDAVIYLRQGLFSKCL--WTGFKT 338 >gi|304398406|ref|ZP_07380280.1| terminase, large subunit [Pantoea sp. aB] gi|304354272|gb|EFM18645.1| terminase, large subunit [Pantoea sp. aB] Length = 490 Score = 379 bits (973), Expect = e-103, Method: Composition-based stats. Identities = 87/348 (25%), Positives = 153/348 (43%), Gaps = 19/348 (5%) Query: 5 ISTDQKLE-QELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVD 63 +S+ LE Q + ++ + + FPWG +G L + P +WQ + + + Sbjct: 1 MSSAADLEIQLIEDIGAFTHDPF---GYALYAFPWGEEGTDLAYSKGPRQWQEDAFKQIG 57 Query: 64 VHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNT 123 H + P + A +G GIGK+ + ++ W + T ++ AN+E QL+ Sbjct: 58 AHLQNPDTRHQPLMIGRA--SGHGIGKSAFISMLVKWGMDTCEDCKVVVTANTENQLRTK 115 Query: 124 LWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDT 183 W E++KW + + WF + +++ + +K + +SE + Sbjct: 116 TWPEIAKWQRLSITQDWFTCTATAIYSNDP----------SHAKSWRADAIPWSENNTEA 165 Query: 184 FVGPHNTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN 242 F G HN + + DEAS D++ + G T+ N W+ N R G F + F Sbjct: 166 FAGLHNERKRIILIFDEASNIADLVWEVAEGALTDENTEIIWVAFGNPTRNTGRFRECFR 225 Query: 243 IPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYI 302 WK QID+R+VEG + + + YG DSD ++ + G FP FIP Sbjct: 226 KLRHRWKTAQIDSRSVEGTNKEQIQKWVDDYGEDSDFVKVRVRGLFPSASEAQFIPTGLT 285 Query: 303 EEAMSREAID--DLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDW 348 + A+ R +A ++G D A +GGD V+ R+G + + ++ Sbjct: 286 DAAVGRVITPGQVAHAATVIGVDPAHQGGDPAVIYLRQGLHTKKLGEY 333 >gi|228911519|ref|ZP_04075310.1| hypothetical protein bthur0013_56490 [Bacillus thuringiensis IBL 200] gi|228848128|gb|EEM92991.1| hypothetical protein bthur0013_56490 [Bacillus thuringiensis IBL 200] Length = 459 Score = 368 bits (945), Expect = e-100, Method: Composition-based stats. Identities = 81/310 (26%), Positives = 141/310 (45%), Gaps = 18/310 (5%) Query: 47 HFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRP 106 ++ P + + + V K ++ +G+G+GKT L + +++W + RP Sbjct: 7 YWDDPVAFAEDMLGFYPDEWQRKVLMDLAQSPKVSVRSGQGVGKTGLESVVVIWFLCCRP 66 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 +IC A ++ QL LWAE++KWL ++ + ++ G Sbjct: 67 NPKVICTAPTKEQLFTVLWAEIAKWLEGSAVKNLLKWTKTRVYMIG------------SE 114 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIM 226 + + T RT + +P+ G H + M DEASG D I ++ILG + + Sbjct: 115 ERWFATARTAT--KPENMQGFHEDY-MLFVCDEASGIADPIMEAILGTLS--GAENKLFL 169 Query: 227 TSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILG 286 N R +G FYD N + +K +++ + E + +YG SDV R+ +LG Sbjct: 170 CGNPTRTSGVFYDSHNRDRDLYKIHKVSSLDSPRTSKDNIEVLKKKYGEGSDVWRVRVLG 229 Query: 287 QFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIF 346 +FP+ E + FIP +E+A S + L +G D+A G D+TV+ R GN + + Sbjct: 230 EFPKAEADAFIPLEIVEQAASCKVEPT-GETLDLGVDVARFGDDETVIAPRIGNKVFKLL 288 Query: 347 DWSAKLIQET 356 + + ET Sbjct: 289 NHYKQDTMET 298 >gi|228968731|ref|ZP_04129698.1| hypothetical protein bthur0004_54930 [Bacillus thuringiensis serovar sotto str. T04001] gi|228790961|gb|EEM38595.1| hypothetical protein bthur0004_54930 [Bacillus thuringiensis serovar sotto str. T04001] Length = 459 Score = 368 bits (945), Expect = e-100, Method: Composition-based stats. Identities = 81/310 (26%), Positives = 141/310 (45%), Gaps = 18/310 (5%) Query: 47 HFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRP 106 ++ P + + + V K ++ +G+G+GKT L + +++W + RP Sbjct: 7 YWDDPVAFAEDMLGFYPDEWQRKVLMDLAQSPKVSVRSGQGVGKTGLESVVVIWFLCCRP 66 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 +IC A ++ QL LWAE++KWL ++ + ++ G Sbjct: 67 NPKVICTAPTKEQLFTVLWAEIAKWLEGSAVKNLLKWTKTRVYMIG------------SE 114 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIM 226 + + T RT + +P+ G H + M DEASG D I ++ILG + + Sbjct: 115 ERWFATARTAT--KPENMQGFHEDY-MLFVCDEASGIADPIMEAILGTLS--GAENKLFL 169 Query: 227 TSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILG 286 N R +G FYD N + +K +++ + E + +YG SDV R+ +LG Sbjct: 170 CGNPTRTSGVFYDSHNRDRDLYKIHKVSSLDSPRTSKDNIEVLKKKYGEGSDVWRVRVLG 229 Query: 287 QFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIF 346 +FP+ E + FIP +E+A S + L +G D+A G D+TV+ R GN + + Sbjct: 230 EFPKAEADAFIPLEIVEQAASCKVEPT-GETLDLGVDVARFGDDETVIAPRIGNKVFKLL 288 Query: 347 DWSAKLIQET 356 + + ET Sbjct: 289 NHYKQDTMET 298 >gi|302120432|gb|ADK92426.1| putative phage terminase large subunit [Candidatus Liberibacter asiaticus] Length = 255 Score = 366 bits (939), Expect = 3e-99, Method: Composition-based stats. Identities = 194/255 (76%), Positives = 224/255 (87%) Query: 88 IGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLS 147 IGKTTLNAW++LWL+S RPGMSIIC+ANSETQLK TLWAEVSKWLS+LP++HWFEMQSLS Sbjct: 1 IGKTTLNAWLVLWLMSIRPGMSIICLANSETQLKTTLWAEVSKWLSLLPNKHWFEMQSLS 60 Query: 148 LHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDII 207 LHP+ WY+++L S+GIDSKHY+ CRTYSEERPDTFVG HNT+GMA+ NDEASGTPD+I Sbjct: 61 LHPAPWYSDVLHCSLGIDSKHYSTMCRTYSEERPDTFVGHHNTYGMAIINDEASGTPDVI 120 Query: 208 NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHE 267 N ILGF TE N NRFWIMTSN RRL+G FY+IFN PL+DWKR+QIDTRTVEGID FHE Sbjct: 121 NLGILGFLTEQNANRFWIMTSNPRRLSGKFYEIFNRPLDDWKRFQIDTRTVEGIDPSFHE 180 Query: 268 GIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE 327 GII+RYGLDSDV R+E+ GQFPQQ++++FIP N IEEA++RE D YAPLIMGCDIA E Sbjct: 181 GIIARYGLDSDVTRVEVCGQFPQQDIDSFIPLNIIEEALNREPCPDPYAPLIMGCDIAEE 240 Query: 328 GGDKTVVVFRRGNII 342 GGD TVVV RRG +I Sbjct: 241 GGDNTVVVLRRGPVI 255 >gi|282848875|ref|ZP_06258265.1| conserved hypothetical protein [Veillonella parvula ATCC 17745] gi|282581380|gb|EFB86773.1| conserved hypothetical protein [Veillonella parvula ATCC 17745] Length = 483 Score = 362 bits (929), Expect = 4e-98, Method: Composition-based stats. Identities = 97/344 (28%), Positives = 156/344 (45%), Gaps = 17/344 (4%) Query: 14 ELHEMLMHAECVLSFK--NFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVN 71 E H+ L+ A L+ FV +PWG G PLE+ P WQ++ ++ + Sbjct: 2 EKHDELIEALGALTHDPLAFVYFAYPWGEPGTPLENMEGPDEWQIQILKDIGEQLK--KG 59 Query: 72 NSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKW 131 T + A+++G GIGK+ L +W++ + IST + AN+E QL+ W E+SKW Sbjct: 60 KDLQTAIQEAVASGHGIGKSALISWLIHFAISTHENTRGVVTANTEGQLRTKTWPELSKW 119 Query: 132 LSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNT- 190 +M + F + ++ S E K + I +S+ P++F G HN Sbjct: 120 HNMFIAKDLFTYTATAIFSSDKDYE----------KTWRIDAIPWSKNSPESFAGLHNQG 169 Query: 191 HGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKR 250 + + V DEAS D+I + G T+ N W N R +G F + F + W Sbjct: 170 NRILVLFDEASAIDDVIWEVTEGALTDANTEIIWCAFGNPTRNSGRFRECFRKYRKFWNT 229 Query: 251 YQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREA 310 YQID+RTV+ + E + YG DSD ++ + G FP FI ++A + Sbjct: 230 YQIDSRTVKISNKTKIEEWLEAYGEDSDFFKVRVRGVFPSASDLQFISTEIADKAQKQVY 289 Query: 311 IDD--LYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 + P+I+G D A G D +V R+G ++ + Sbjct: 290 KPGQFEHLPVIIGVDPAWTGSDSLEIVMRQGYYMKSLASIPKND 333 >gi|150390341|ref|YP_001320390.1| hypothetical protein Amet_2579 [Alkaliphilus metalliredigens QYMF] gi|149950203|gb|ABR48731.1| conserved hypothetical protein [Alkaliphilus metalliredigens QYMF] Length = 469 Score = 360 bits (925), Expect = 2e-97, Method: Composition-based stats. Identities = 87/313 (27%), Positives = 138/313 (44%), Gaps = 18/313 (5%) Query: 45 LEHFSQPHRWQLE-FMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLIS 103 L+++ W E + + V K ++ +G+G+GKT L + + W + Sbjct: 9 LDNYWDNPVWFAEDMLGFYPDPWQAKVLMDLAQHPKVSVRSGQGVGKTGLESIAITWYLC 68 Query: 104 TRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMG 163 TRP +I A + QL + LWAE+SKWLS ++ +G+ Sbjct: 69 TRPFPKVIATAPTRQQLYDVLWAEISKWLSKSKVDKLLRWTKTKIYMNGF---------- 118 Query: 164 IDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRF 223 + + T RT RP+ G H + M DEASG D I ++ILG T Sbjct: 119 --EERWWATARTAV--RPENMQGFHEDY-MLFVVDEASGVADPIMEAILGTLTGY--ENK 171 Query: 224 WIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIE 283 ++ N + +G FYD N + +K +++ + E + +YG DSDV R+ Sbjct: 172 LLLCGNPTKTSGTFYDSHNRDRDTYKSHKVSSMDSPRTSKENIEMLKKKYGADSDVFRVR 231 Query: 284 ILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIE 343 +LG FP+ E ++ I E+A L +G DIA G DKT++ R GN + Sbjct: 232 VLGDFPKGEADSLISLEVTEQAAETVVDISNAYTLNIGADIARFGDDKTIIAPRIGNRVL 291 Query: 344 HIFDWSAKLIQET 356 + +S K ET Sbjct: 292 DLQQYSKKDTMET 304 >gi|167032754|ref|YP_001667985.1| putative phage terminase large subunit [Pseudomonas putida GB-1] gi|166859242|gb|ABY97649.1| putative phage terminase, large subunit [Pseudomonas putida GB-1] Length = 499 Score = 360 bits (924), Expect = 2e-97, Method: Composition-based stats. Identities = 98/337 (29%), Positives = 161/337 (47%), Gaps = 16/337 (4%) Query: 8 DQKLEQELHEMLMHAECVLSFKN----FVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVD 63 +E+ A + SF + +V+ FPWG G L + + P +WQ E +E++ Sbjct: 1 MNASNREIDYEQELANDIASFSDDPLGYVLYAFPWGEAGGELANKTGPRKWQREVLESIG 60 Query: 64 VHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNT 123 + + + + A+++G GIGK+ L +W++ W + T + AN+E+QL+ Sbjct: 61 EQLRAGAKDRGE-VIREAVASGHGIGKSALVSWVIKWALDTEVDTRGVVTANTESQLRTK 119 Query: 124 LWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDT 183 W EV+KW + HWF++ +L + E K++ I +S+ + Sbjct: 120 TWPEVAKWNRLSITAHWFKLTGTALISTDPDHE----------KNWRIDAVPWSDTNTEA 169 Query: 184 FVGPHNTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN 242 F G HN + + DEAS D++ + G T+ + W N R +G F + F Sbjct: 170 FAGLHNEGKRILLIFDEASAIADLVWEVAEGALTDADTEIIWAAFGNPTRNSGRFRECFT 229 Query: 243 IPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYI 302 W+ Q+D+RTV+G + I+ YG DSD RI + G FP+ IP +++ Sbjct: 230 KFKHRWRHRQVDSRTVDGTNKTQIAKWIADYGEDSDFVRIRVRGMFPRASDLQLIPTDWV 289 Query: 303 EEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRG 339 EAM R+ + L L+ G DIA G D V+ FRRG Sbjct: 290 AEAMRRDGVYGLDDALVCGIDIARGGMDNNVIRFRRG 326 >gi|282598712|ref|YP_003358792.1| putative phage terminase B protein [Enterococcus phage phiEf11] gi|300860603|ref|ZP_07106690.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11] gi|307292389|ref|ZP_07572245.1| hypothetical protein HMPREF9509_02682 [Enterococcus faecalis TX0411] gi|258598082|gb|ACV83339.1| putative phage terminase B protein [Enterococcus phage phiEf11] gi|300849642|gb|EFK77392.1| conserved hypothetical protein [Enterococcus faecalis TUSoD Ef11] gi|306496518|gb|EFM66079.1| hypothetical protein HMPREF9509_02682 [Enterococcus faecalis TX0411] gi|315146097|gb|EFT90113.1| conserved hypothetical protein [Enterococcus faecalis TX2141] Length = 484 Score = 359 bits (921), Expect = 5e-97, Method: Composition-based stats. Identities = 85/327 (25%), Positives = 151/327 (46%), Gaps = 21/327 (6%) Query: 35 FFPWGIKGKPLEHF-SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTL 93 F P+ G ++++ +P + + + V + K ++ +G+G+GKT L Sbjct: 5 FIPFADIGAAIDYYYDKPVAFCQDILHLDPDEWQDKVLDDLAKFPKVSVRSGQGVGKTAL 64 Query: 94 NAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGW 153 A +LW ++ RP +I A + QL + LWAEV+KWL+ + + ++ G Sbjct: 65 EAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLNNSLIKDLLKWTKTKIYMVG- 123 Query: 154 YAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILG 213 DS+ + T RT + +P+ G H H M + DEASG D I ++ILG Sbjct: 124 -----------DSERWFATARTAT--KPENMQGFHEDH-MLIVVDEASGVADPIMEAILG 169 Query: 214 FFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRY 273 + + +M N + G FYD N + ++ +++ + + + + +I +Y Sbjct: 170 TLSGFD--NKLLMCGNPNNIEGVFYDSHNTDRDKYRTHKVSSYDSKRTNKENIQMLIDKY 227 Query: 274 GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPL---IMGCDIAGEGGD 330 G +SDVAR+ I G+FP+ +++FI +E A D + +G D+A G D Sbjct: 228 GENSDVARVRIYGEFPKGALDSFISLEIVEFAKDINISDSELKHVREGHIGVDVARFGDD 287 Query: 331 KTVVVFRRGNIIEHIFDWSAKLIQETN 357 T+V R G +S + +T Sbjct: 288 STIVFPRIGAKALPFEKYSKQDTMQTT 314 >gi|303328395|ref|ZP_07358832.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861389|gb|EFL84326.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 500 Score = 356 bits (914), Expect = 3e-96, Method: Composition-based stats. Identities = 93/332 (28%), Positives = 148/332 (44%), Gaps = 16/332 (4%) Query: 25 VLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISA 84 FV+ FPWG G ++ P WQ E + + + + + ++ + A+S+ Sbjct: 26 AADPLGFVLFAFPWG-GGALADYPDGPDVWQREILRGMGEQLSTGASAA--SVIREAVSS 82 Query: 85 GRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQ 144 G G+GK+ L AW++LW +ST + AN+E QLK WAE++KW + +WF+ Sbjct: 83 GHGVGKSALVAWIILWAMSTFSDTRGVVTANTENQLKGKTWAELAKWHRLCLCGYWFDCT 142 Query: 145 SLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNT-HGMAVFNDEASGT 203 + +L + E K + + +SE + F G HN + + DEAS Sbjct: 143 ATALISTQAGHE----------KTWRVDMVAWSERNTEAFAGLHNKGRRVLLIFDEASAI 192 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDS 263 PD I + G T+ + W N R G F + F W ++D+RT D Sbjct: 193 PDAIWEVSEGALTDADTEIIWCCFGNPTRNTGRFRECFGRYAHRWNTRRVDSRTAAMTDK 252 Query: 264 GFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLY--APLIMG 321 + YG DSD R+ + G+FP+ FI + + EA R D Y AP I+G Sbjct: 253 NQLAQWVEDYGEDSDFVRVRVRGEFPRAGDRQFISSDIVHEARGRSLKPDQYSFAPRILG 312 Query: 322 CDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 D+A G D++V+ R+G + Sbjct: 313 VDVARSGSDQSVITRRQGLACLEQRKFRGLDT 344 >gi|209901239|ref|YP_002290878.1| putative terminase B [Clostridium phage phiCD27] gi|199612120|gb|ACH91293.1| putative terminase B [Clostridium phage phiCD27] Length = 469 Score = 354 bits (909), Expect = 1e-95, Method: Composition-based stats. Identities = 82/310 (26%), Positives = 139/310 (44%), Gaps = 17/310 (5%) Query: 47 HFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRP 106 ++ P + + + S+V + K +I +G+G+GKT L + +W +STRP Sbjct: 12 YWDNPVWFAEDMLNFKADKWQSDVLMALAQTPKVSIRSGQGVGKTGLESIATVWYLSTRP 71 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 ++ A + QL + LWAE++KWLS E ++ G+ Sbjct: 72 FPKVVATAPTRQQLYDVLWAEIAKWLSNSKVEKLLEWTKTKVYMKGF------------E 119 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIM 226 + + T RT +P+ G H + M DEASG D I ++ILG + ++ Sbjct: 120 ERWWATARTAV--KPENMQGFHEDY-MLFVVDEASGVADPIMEAILGTLS--GAENKLLL 174 Query: 227 TSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILG 286 N R +G FYD N + +K +++ + E + +Y SD R+ +LG Sbjct: 175 CGNPTRTSGTFYDSHNRDRDLYKTFKVSSLDSPRTSKDNIEMLKRKYHEGSDPWRVRVLG 234 Query: 287 QFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIF 346 +FP+ E ++ I +E + RE L +G DIA G D+T++ R G + + Sbjct: 235 EFPKGESDSLISLEAVETSTIREVNISNDYILNIGADIARYGDDETIIAPRIGGKVFDLL 294 Query: 347 DWSAKLIQET 356 +S K ET Sbjct: 295 TYSKKDTMET 304 >gi|257883493|ref|ZP_05663146.1| conserved hypothetical protein [Enterococcus faecium 1,231,502] gi|294614775|ref|ZP_06694675.1| hypothetical protein EfmE1636_0865 [Enterococcus faecium E1636] gi|294622490|ref|ZP_06701512.1| conserved hypothetical protein [Enterococcus faecium U0317] gi|257819151|gb|EEV46479.1| conserved hypothetical protein [Enterococcus faecium 1,231,502] gi|291592387|gb|EFF23996.1| hypothetical protein EfmE1636_0865 [Enterococcus faecium E1636] gi|291598037|gb|EFF29147.1| conserved hypothetical protein [Enterococcus faecium U0317] Length = 471 Score = 353 bits (906), Expect = 2e-95, Method: Composition-based stats. Identities = 88/327 (26%), Positives = 154/327 (47%), Gaps = 21/327 (6%) Query: 35 FFPWGIKGKPLEHF-SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTL 93 F P+ G ++++ +P + + + NV N K ++ +G+G+GKT L Sbjct: 5 FIPFADIGSAIDYYYDKPVAFCQDILHLNPDEWQENVLNDLAEFSKVSVRSGQGVGKTAL 64 Query: 94 NAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGW 153 A +LW ++ RP +I A + QL + LWAEV+KWL+ ++ + ++ G Sbjct: 65 EAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLNDSLIKNLLKWTKTKIYMVG- 123 Query: 154 YAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILG 213 DS+ + T RT + +P+ G H H M + DEASG D I ++ILG Sbjct: 124 -----------DSERWFATARTAT--KPENMQGFHEDH-MLIVVDEASGVSDPIMEAILG 169 Query: 214 FFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRY 273 + + +M N + G FYD N + ++ +++ + + + E I+ +Y Sbjct: 170 TLSGFD--NKLLMCGNPNNIEGVFYDSHNSDRDKYRVHKVSSYDSKRTNKDNIEMILKKY 227 Query: 274 GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPL---IMGCDIAGEGGD 330 G +SDVAR+ I G+FP+ +++FI +E A ++ D L +G D+A G D Sbjct: 228 GKESDVARVRIFGEFPKGALDSFISLETVELATEKQISDSLVNKTTVAHIGVDVARYGDD 287 Query: 331 KTVVVFRRGNIIEHIFDWSAKLIQETN 357 T++ R +S + ET Sbjct: 288 STILFPRIATRALEYEKYSKRSTMETT 314 >gi|228950291|ref|ZP_04112468.1| hypothetical protein bthur0007_63570 [Bacillus thuringiensis serovar monterrey BGSC 4AJ1] gi|228809453|gb|EEM55897.1| hypothetical protein bthur0007_63570 [Bacillus thuringiensis serovar monterrey BGSC 4AJ1] Length = 495 Score = 353 bits (905), Expect = 3e-95, Method: Composition-based stats. Identities = 85/373 (22%), Positives = 157/373 (42%), Gaps = 62/373 (16%) Query: 1 MPRLISTDQKLEQELHEML--MHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEF 58 + ++T++++ Q++ L ++ + ++F ++ +P WQ E Sbjct: 3 VSNDVTTEEEVLQDIITQLLEIYVDDPVAFVEDILEV--------------EPDPWQKEV 48 Query: 59 MEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSET 118 + + H H ++ +G+G+GKT + +W+ +W + RP IIC A ++ Sbjct: 49 LNDIANHSH------------VSVRSGQGVGKTAMESWICIWFLCCRPYPKIICTAPTKQ 96 Query: 119 QLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSE 178 QL + LWAE++KWL+ + + ++ G+ + T +T + Sbjct: 97 QLYDVLWAEIAKWLNSSQVKDLLKWTKTKIYMKGF------------EDRWFATAKTAT- 143 Query: 179 ERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFY 238 RP+ G H + M DEASG D I ++ILG + M N + +G F+ Sbjct: 144 -RPENMQGFHEDY-MLFIADEASGIADDIMEAILGTLS--GSENKLFMCGNPTKTSGVFF 199 Query: 239 DIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIP 298 D N +K +++ + E + +YG SDV R+ + G+FP+ E + FI Sbjct: 200 DSHNKDRALYKSHKVSSADSPRTSKKNIEMLKKKYGEGSDVYRVRVEGEFPRGEADAFIS 259 Query: 299 HNYIEEAMSREAIDDLY-----------------APLIMGCDIAGEGGDKTVVVFRRGNI 341 E A RE A + +GCD+A G D+T++ RRG Sbjct: 260 LETAEAARMREVYKVEVIENEEEESTVKEIIPDTAVVEIGCDVARFGSDETIIATRRGWK 319 Query: 342 IEHIFDWSAKLIQ 354 + + + Sbjct: 320 VLPLQVHHQRDTM 332 >gi|261208032|ref|ZP_05922709.1| conserved hypothetical protein [Enterococcus faecium TC 6] gi|289567088|ref|ZP_06447483.1| conserved hypothetical protein [Enterococcus faecium D344SRF] gi|260077749|gb|EEW65463.1| conserved hypothetical protein [Enterococcus faecium TC 6] gi|289161103|gb|EFD09008.1| conserved hypothetical protein [Enterococcus faecium D344SRF] Length = 471 Score = 353 bits (905), Expect = 3e-95, Method: Composition-based stats. Identities = 88/327 (26%), Positives = 154/327 (47%), Gaps = 21/327 (6%) Query: 35 FFPWGIKGKPLEHF-SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTL 93 F P+ G ++++ +P + + + NV N K ++ +G+G+GKT L Sbjct: 5 FIPFADIGAAIDYYYDKPVAFCQDILHLNPDEWQENVLNDLAEFSKVSVRSGQGVGKTAL 64 Query: 94 NAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGW 153 A +LW ++ RP +I A + QL + LWAEV+KWL+ ++ + ++ G Sbjct: 65 EAGAILWFLTCRPYAKVIATAPTMKQLYDVLWAEVAKWLNDSLIKNLLKWTKTKIYMVG- 123 Query: 154 YAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILG 213 DS+ + T RT + +P+ G H H M + DEASG D I ++ILG Sbjct: 124 -----------DSERWFATARTAT--KPENMQGFHEDH-MLIVVDEASGVSDPIMEAILG 169 Query: 214 FFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRY 273 + + +M N + G FYD N + ++ +++ + + + E I+ +Y Sbjct: 170 TLSGFD--NKLLMCGNPNNIEGVFYDSHNSDRDKYRVHKVSSYDSKRTNKDNIEMILKKY 227 Query: 274 GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPL---IMGCDIAGEGGD 330 G +SDVAR+ I G+FP+ +++FI +E A ++ D L +G D+A G D Sbjct: 228 GKESDVARVRIFGEFPKGALDSFISLETVELATEKQISDSLVNKTTVAHIGVDVARYGDD 287 Query: 331 KTVVVFRRGNIIEHIFDWSAKLIQETN 357 T++ R +S + ET Sbjct: 288 STILFPRIATRALEYEKYSKRSTMETT 314 >gi|150016512|ref|YP_001308766.1| hypothetical protein Cbei_1636 [Clostridium beijerinckii NCIMB 8052] gi|149902977|gb|ABR33810.1| conserved hypothetical protein [Clostridium beijerinckii NCIMB 8052] Length = 470 Score = 352 bits (904), Expect = 4e-95, Method: Composition-based stats. Identities = 85/312 (27%), Positives = 142/312 (45%), Gaps = 18/312 (5%) Query: 47 HFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRP 106 ++ P + + M S V + K ++ +G+G+GKT L + ++ W + TRP Sbjct: 12 YWDNPVWFAEDMMNFHADKWQSEVLMALAQSPKVSVRSGQGVGKTGLESIVVTWYLCTRP 71 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 +I A + QL + LWAE+SKWL+ + E ++ G+ S Sbjct: 72 FPKVIATAPTRQQLYDVLWAEISKWLASSKIENLLEWTKTKIYMKGY------------S 119 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIM 226 + + T +T + RP+ G H + M DEASG D I ++ILG T +M Sbjct: 120 ERWWATAKTAT--RPENMQGFHEDY-MLFVVDEASGVADPIMEAILGTLTGY--ENKLLM 174 Query: 227 TSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILG 286 N R +G FYD N + +K +++ + E + +Y SDV R+ + G Sbjct: 175 CGNPTRTSGTFYDSHNRDRDLYKTFKVSSLESPRTSKDNIEMLKRKYHEGSDVWRVRVEG 234 Query: 287 QFPQQEVNNFIPHNYIEEAMSREAIDDLYA-PLIMGCDIAGEGGDKTVVVFRRGNIIEHI 345 +FP+ E ++ I Y E A + + L +G DIA G D++V+ R GN + + Sbjct: 235 EFPKGESDSLISLEYAETATITKINNIHNNFTLHIGADIARFGNDESVIAPRIGNKVFDL 294 Query: 346 FDWSAKLIQETN 357 ++ K ET Sbjct: 295 LTYTKKDTMETT 306 >gi|332981151|ref|YP_004462592.1| hypothetical protein Mahau_0567 [Mahella australiensis 50-1 BON] gi|332698829|gb|AEE95770.1| hypothetical protein Mahau_0567 [Mahella australiensis 50-1 BON] Length = 461 Score = 344 bits (883), Expect = 9e-93, Method: Composition-based stats. Identities = 88/309 (28%), Positives = 144/309 (46%), Gaps = 32/309 (10%) Query: 49 SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGM 108 ++P WQ E ++A+ + + A+ +G G+GKT L AW +LW + TRP Sbjct: 25 AEPDDWQAETLQALADN------------PRVAVRSGHGVGKTALEAWALLWFLFTRPYP 72 Query: 109 SIICIANSETQLKNTLWAEVSKWLSMLPH-RHWFEMQSLSLHPSGWYAELLEQSMGIDSK 167 I C A + QL + LWAE SKWL P + +FE Q + + Sbjct: 73 KIPCTAPTREQLHDILWAEASKWLERAPALKPYFEWQKTRIVQKQY------------PG 120 Query: 168 HYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMT 227 + T RT +P+ G H H + + DEASG D I ++I G T + +M Sbjct: 121 RWFATARTS--NKPENMAGFHEEHLLFII-DEASGIADNIFETIEGALTTSD--AKLLMC 175 Query: 228 SNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ 287 N + +G F+D F + ++ + + + E + +Y DSDV R+ +LG+ Sbjct: 176 GNPTKNSGVFHDAFFKDRSLYWTRKVSCLDSQRVTLEYAERLKRKYHEDSDVYRVRVLGE 235 Query: 288 FPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFD 347 FP+ E + FI + +E A R+ D L +G D+A G D+TV+ R G + ++ Sbjct: 236 FPKAEPDTFISLDIVEAATMRDVEPD--GVLEIGVDVARFGDDETVLAARAGLKLVYLKA 293 Query: 348 WSAKLIQET 356 ++ + T Sbjct: 294 YTKQDTMTT 302 >gi|290968649|ref|ZP_06560187.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str. 28L] gi|290781302|gb|EFD93892.1| conserved hypothetical protein [Megasphaera genomosp. type_1 str. 28L] Length = 487 Score = 334 bits (857), Expect = 1e-89, Method: Composition-based stats. Identities = 94/344 (27%), Positives = 158/344 (45%), Gaps = 23/344 (6%) Query: 13 QELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNN 72 +++ + FV F W + ++ P WQ++ ++ V Sbjct: 4 EDIELLQALGSLASDPVAFVYFAFDWDSEELKGQN---PQTWQIKTLKEVGEGL------ 54 Query: 73 SNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWL 132 S T + A ++G GIGK+ L AW++LW ISTRP + AN+ TQL+ WAE+SKW Sbjct: 55 SLSTALQHATASGHGIGKSALVAWLILWAISTRPDTRGVVTANTATQLETKTWAELSKWY 114 Query: 133 SMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNT-H 191 + + +F + S ++ + + I +S +R ++F G HN + Sbjct: 115 HLFRGKKFFTLTSTAIF----------CRQEGHERTWRIDAIPWSVDRTESFAGLHNQGN 164 Query: 192 GMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRY 251 + + DEAS + I + G T+ + W++ N R G F+D F+ + W Sbjct: 165 RLLLIFDEASAIDNKIWEVAEGALTDKDTEILWLVFGNPTRSTGRFFDCFHKYKKSWITQ 224 Query: 252 QIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAI 311 +ID+RTV+ + + I YG+DSD ++ +LG+FP FI + A R + Sbjct: 225 KIDSRTVDISNKTQLQKWIQTYGIDSDFVKVRVLGEFPDTSDTQFISTAIVRTAWERRPL 284 Query: 312 DDL---YAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 +AP I+G D A GGD TV+ R+G E + ++ Sbjct: 285 RTAEYDFAPCIIGMDPAWTGGDSTVIFLRQGFFSEKLAEYKQND 328 >gi|308069786|ref|YP_003871391.1| hypothetical protein PPE_03030 [Paenibacillus polymyxa E681] gi|305859065|gb|ADM70853.1| Conserved hypothetical protein [Paenibacillus polymyxa E681] Length = 452 Score = 333 bits (853), Expect = 4e-89, Method: Composition-based stats. Identities = 77/307 (25%), Positives = 125/307 (40%), Gaps = 30/307 (9%) Query: 51 PHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSI 110 P WQ + + + + ++ +G+G+GKT L A LW +S P + Sbjct: 6 PDDWQASTLMDLANN------------PRVSVRSGQGVGKTGLEAATALWFLSCFPYPKV 53 Query: 111 ICIANSETQLKNTLWAEVSKWLSMLPH-RHWFEMQSLSLHPSGWYAELLEQSMGIDSKHY 169 IC A + QL + LWAE++KW S P + + ++ + + + Sbjct: 54 ICTAPTRQQLHDVLWAEINKWQSKSPVLKRILKWTKTKIYMKNY------------EERW 101 Query: 170 TITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSN 229 T RT + +P+ G H + M DEASG D I ++ILG + +M N Sbjct: 102 FATARTAT--KPENMQGLHEDY-MLFIVDEASGVADPIMEAILGTLSGEFNKI--LMCGN 156 Query: 230 TRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFP 289 + +G FYD N D+K ++ + +YG SDV R+ + G+FP Sbjct: 157 PTKTSGVFYDSHNKDRADYKTRKVSCLDSPRTSKDNIAMLKRKYGEGSDVWRVRVEGEFP 216 Query: 290 QQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS 349 + + FI E A ++ L +G D+A G D+T + G I Sbjct: 217 RGGSDTFISLEVAEFAAKEVKLEPTGDMLTIGVDVARFGDDETSMFAGIGPRIVGEHHHF 276 Query: 350 AKLIQET 356 K T Sbjct: 277 KKGTMVT 283 >gi|153810665|ref|ZP_01963333.1| hypothetical protein RUMOBE_01049 [Ruminococcus obeum ATCC 29174] gi|149833061|gb|EDM88143.1| hypothetical protein RUMOBE_01049 [Ruminococcus obeum ATCC 29174] Length = 469 Score = 330 bits (847), Expect = 2e-88, Method: Composition-based stats. Identities = 88/314 (28%), Positives = 144/314 (45%), Gaps = 19/314 (6%) Query: 45 LEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLIST 104 L + + P + + ++A + S ++ +G GIGK+ + AW ++W + T Sbjct: 8 LYYANHPVEFVQDILKADPDPEQKKILRSLVENQMTSVRSGHGIGKSAVEAWSVIWFMCT 67 Query: 105 RPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHW-FEMQSLSLHPSGWYAELLEQSMG 163 P I C A ++ QL + LWAE+SKW L+ G Sbjct: 68 HPYPKIPCTAPTQHQLFDILWAEISKWKRNNKTLDSELIWTKEKLYMKG----------- 116 Query: 164 IDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRF 223 ++ + RT S PD G H H + + DEASG D I + +LG + P Sbjct: 117 -HAEEWFAVARTAST--PDALQGFHAEHMLYII-DEASGVEDKIFEPVLGALST--PGAK 170 Query: 224 WIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIE 283 +M N +L+G+FYD N E + + ID R + F + II+ YG DSDV R+ Sbjct: 171 LLMCGNPTQLSGFFYDSHNKNREQYSTFHIDGRNSTRVSQEFVQTIINMYGEDSDVFRVR 230 Query: 284 ILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLI-MGCDIAGEGGDKTVVVFRRGNII 342 + G FP E + +IP +E++++ E + +I +GCD+A G DKTV+ +R + Sbjct: 231 VAGDFPLAEDDIYIPLPLVEKSIATEYFPRRHPQIIHIGCDVARFGTDKTVIGYRTDEKV 290 Query: 343 EHIFDWSAKLIQET 356 + + +T Sbjct: 291 QFFKKRVGQDTMKT 304 >gi|323486060|ref|ZP_08091391.1| hypothetical protein HMPREF9474_03142 [Clostridium symbiosum WAL-14163] gi|323400627|gb|EGA92994.1| hypothetical protein HMPREF9474_03142 [Clostridium symbiosum WAL-14163] Length = 476 Score = 327 bits (837), Expect = 2e-87, Method: Composition-based stats. Identities = 83/312 (26%), Positives = 138/312 (44%), Gaps = 19/312 (6%) Query: 47 HFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRP 106 + P + E + K AI +G+G+GKT + A +LW + P Sbjct: 20 YRKNPVLFAQEVLLFEPDDWQKQALMDLAESPKVAIKSGQGVGKTGMEAVALLWFLCCYP 79 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPH-RHWFEMQSLSLHPSGWYAELLEQSMGID 165 I+ A ++ QL + LW+EVSKW+S P + ++ G + Sbjct: 80 YPRIVATAPTKQQLHDVLWSEVSKWMSKSPLLSDILKWTKTYIYMVG------------N 127 Query: 166 SKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWI 225 K + RT + +P+ G H + M DEASG D I ++ILG + N + Sbjct: 128 EKRWFAVARTAT--KPENMQGFHEDN-MLFIVDEASGVADPIMEAILGTLS--GANNKLL 182 Query: 226 MTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL 285 M N R +G FYD FN+ ++ + + + + + E +I +YG DS+V + + Sbjct: 183 MCGNPTRTSGTFYDAFNVDRSIYRCHTVSSADSKRTNKQNIESLIRKYGKDSNVVLVRVF 242 Query: 286 GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLI-MGCDIAGEGGDKTVVVFRRGNIIEH 344 G+FP+QE + FI + +E + DD+ I G D+A G D+TV+ G I Sbjct: 243 GEFPKQEDDVFIALSIVEHCCMLDLPDDVPIKRISFGVDVARYGSDETVIAKNVGGRITL 302 Query: 345 IFDWSAKLIQET 356 + + + T Sbjct: 303 PVSFRGQSLMTT 314 >gi|54302246|ref|YP_132239.1| terminase large subunit [Photobacterium profundum SS9] gi|46915667|emb|CAG22439.1| hypothetical protein PBPRB0566 [Photobacterium profundum SS9] Length = 513 Score = 327 bits (837), Expect = 2e-87, Method: Composition-based stats. Identities = 97/354 (27%), Positives = 155/354 (43%), Gaps = 29/354 (8%) Query: 1 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIK------------GKPLEHF 48 M + + + Q ++ + L FVM +PW + Sbjct: 1 MAKKEEINYEH-QLAIDIGGFYDDPL---GFVMYAYPWDTDPDLQIVKLPEPWASKYDSV 56 Query: 49 SQPHRWQLEFMEAV-DVHCHSNVNNSNPT-IFKCAISAGRGIGKTTLNAWMMLWLISTRP 106 P W E + + +V ++ N +P F +IS+G GIGK+ ++W++ +++STRP Sbjct: 57 YGPDAWFCEMCDQLQEVIRKNDFNGVDPVDAFLYSISSGHGIGKSCASSWLIHFVMSTRP 116 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 + +N+ QL+ W E+ KW L ++HWF + + + ++ + E Sbjct: 117 NSKGVVTSNTSEQLRTKTWGELGKWTKKLINKHWFVYNNGKGNMNFYHKDYAE------- 169 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAV-FNDEASGTPDIINKSILGFFTELNPNRFWI 225 + + +T EE ++F G H DEAS PD I + G T+ P FW Sbjct: 170 -TWRVDAQTCREENSESFAGLHCASSTPWYLFDEASAVPDKIWEVAEGGLTDGEP--FWF 226 Query: 226 MTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL 285 + N R +G F + + + W R QID+ TV+ + S YG DSD R+ + Sbjct: 227 VFGNPTRNSGRFRECWRRFRQRWNRKQIDSSTVQVTNKKKISEWESDYGEDSDFYRVRVK 286 Query: 286 GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRG 339 G FP N I +E AMSR A +P +M D+A GGD V FR G Sbjct: 287 GVFPSASSNQKISGALLEAAMSRTAHVIPGSPRVMSLDVARGGGDNCVFRFRHG 340 >gi|255282256|ref|ZP_05346811.1| conserved hypothetical protein [Bryantella formatexigens DSM 14469] gi|255267204|gb|EET60409.1| conserved hypothetical protein [Bryantella formatexigens DSM 14469] Length = 506 Score = 324 bits (830), Expect = 1e-86, Method: Composition-based stats. Identities = 75/311 (24%), Positives = 135/311 (43%), Gaps = 33/311 (10%) Query: 50 QPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMS 109 +P WQ + + + + A+ +G+G+GKT + A +LW +S Sbjct: 49 EPDEWQRDALMDLAE------------ESRVAVKSGQGVGKTGIEAVAVLWFLSCFRYAR 96 Query: 110 IICIANSETQLKNTLWAEVSKWLSMLP-HRHWFEMQSLSLHPSGWYAELLEQSMGIDSKH 168 ++ A + QL + LW+E++KW P + ++ G+ K Sbjct: 97 VVATAPTRQQLHDVLWSEIAKWQERSPLLKAILRWTKTYVYVKGY------------EKR 144 Query: 169 YTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTS 228 + RT + +P+ G H + M DEASG D I +++LG + N +M Sbjct: 145 WFAVARTAT--KPENMQGFHEDN-MLFIVDEASGVADPIMEAVLGTLS--GGNNKLLMCG 199 Query: 229 NTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQF 288 N R G FYD F + + + + D + +I +YG DS++ R+ + G F Sbjct: 200 NPTRTTGTFYDAFTKDRSIFACHTVSSLDSSRTDKNNIDALIRKYGEDSNLVRVRVKGLF 259 Query: 289 PQQEVNNFIPHNYIEEAMSREAIDDL---YAPLIMGCDIAGEGGDKTVVVFRRGNIIEHI 345 P+Q+ + FI I++ SR+ A +I+G D+A G D+TV+ I+ + Sbjct: 260 PKQDDDVFISQELIDQCTSRQYELPESRGMAQVILGVDVARYGNDETVIYRNFKGRIKMV 319 Query: 346 FDWSAKLIQET 356 + + + T Sbjct: 320 RNRRGQNLMAT 330 >gi|332976102|gb|EGK12970.1| hypothetical protein HMPREF9374_1123 [Desmospora sp. 8437] Length = 462 Score = 324 bits (830), Expect = 2e-86, Method: Composition-based stats. Identities = 89/313 (28%), Positives = 142/313 (45%), Gaps = 31/313 (9%) Query: 49 SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGM 108 ++P WQ D+ + +N + A+ AG G+GKT AW +LW + TRP Sbjct: 31 AEPDEWQ-------DIALQALADNQ-----RVAVRAGHGVGKTATEAWAVLWFLLTRPFP 78 Query: 109 SIICIANSETQLKNTLWAEVSKWLSMLPH-RHWFEMQSLSLHPSGWYAELLEQSMGIDSK 167 I C A ++ QL + LW E++KWL P + E Q + + + Sbjct: 79 KIPCTAPTKPQLMDVLWPEIAKWLMNAPELAPYVEWQKTRVVMKQY------------EE 126 Query: 168 HYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMT 227 + T RT +P+ G H H + V DEASG + I ++I G T +M Sbjct: 127 RWFATARTS--NKPENMAGFHEEHLLFVI-DEASGVDNAIFETIDGALTTAG--SKLVMF 181 Query: 228 SNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ 287 N R NG FYD F+ + + Y+I + + + +YG DSD+ R+ + G+ Sbjct: 182 GNPTRTNGVFYDAFHQDRDLYWTYKISCLDSKMASKDYARNMARKYGEDSDIYRVRVQGE 241 Query: 288 FPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFD 347 FPQ + ++FIP +E+A R+ L +G D+A G D+TV+ R G + + Sbjct: 242 FPQGDPDSFIPLELVEDARVRDLEWIDEDELHIGVDVARFGSDETVLAARIGPVAFRLDR 301 Query: 348 WSAKLIQETNQEG 360 + + T G Sbjct: 302 YGGR-TPTTETVG 313 >gi|160940775|ref|ZP_02088117.1| hypothetical protein CLOBOL_05669 [Clostridium bolteae ATCC BAA-613] gi|158436295|gb|EDP14062.1| hypothetical protein CLOBOL_05669 [Clostridium bolteae ATCC BAA-613] Length = 484 Score = 323 bits (829), Expect = 2e-86, Method: Composition-based stats. Identities = 80/300 (26%), Positives = 131/300 (43%), Gaps = 19/300 (6%) Query: 43 KPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI 102 L + P + + + A ++ S ++ +G GIGK+ + AW ++W + Sbjct: 6 AVLFYADNPIYFVEDVIRAKPDEKQRDILRSLRDYPMTSVRSGHGIGKSAVEAWSVIWYM 65 Query: 103 STRPGMSIICIANSETQLKNTLWAEVSKWLSMLPH-RHWFEMQSLSLHPSGWYAELLEQS 161 TRP I C A +E QL + LWAE+SKW+ P R L+ G Sbjct: 66 CTRPFPKIPCTAPTEHQLMDVLWAEISKWMRNNPALRDDLIWTKEKLYMQG--------- 116 Query: 162 MGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPN 221 + + RT + P+ G H H + + DEASG D + + +LG T + Sbjct: 117 ---HPEEWFAVPRTAT--NPEALQGFHAEHVLYII-DEASGVSDKVFEPVLGAMTGED-- 168 Query: 222 RFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVAR 281 +M N RL G+FYD + E + +D R + + F + II +G DSDV R Sbjct: 169 AKLLMMGNPTRLAGFFYDSHHRNREQYSAIHVDGRDSQHVSRTFVQKIIDMFGEDSDVFR 228 Query: 282 IEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNI 341 + + GQFP+ ++ I + EEA + + + +G D+A G D + + Sbjct: 229 VRVAGQFPKSTPDSLIAMEWCEEAANLQVY-APGGQIDIGVDVARYGDDSSALYPLIDKK 287 >gi|253578914|ref|ZP_04856185.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA] gi|251849857|gb|EES77816.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39BFAA] Length = 473 Score = 322 bits (826), Expect = 4e-86, Method: Composition-based stats. Identities = 78/311 (25%), Positives = 141/311 (45%), Gaps = 19/311 (6%) Query: 49 SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGM 108 P + E + + K +I +G+G+GKT L A + LW ++ P Sbjct: 4 DDPVMFFREVLNFEPDEWQAQAARDLAANPKVSIKSGQGVGKTGLEAAVFLWFVTCFPHP 63 Query: 109 SIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKH 168 I+ A ++ QL + LW+E+SKW+S E+ S+ L + Y ++ + K Sbjct: 64 RIVATAPTKQQLHDVLWSEISKWMSKS------ELLSILLKWTKTYVYMVGE-----EKR 112 Query: 169 YTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTS 228 + RT + +P+ G H + M DEASG D I ++ILG + N ++ Sbjct: 113 WFGVARTAT--KPENMQGFHEDN-MLFIVDEASGVADPIMEAILGTLS--GANNKLLLCG 167 Query: 229 NTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQF 288 N + +G FYD +K + + + + + ++ +YG DS+V R+ + G+F Sbjct: 168 NPTKTSGTFYDSHTRDRALYKCHTVSSMDSTRTNKENIDSLVRKYGWDSNVVRVRVRGEF 227 Query: 289 PQQEVNNFIPHNYIEEAMSREAIDDLYAP---LIMGCDIAGEGGDKTVVVFRRGNIIEHI 345 P QE + FIP + IE+ S+ D + +G D+A G D+T++ + + Sbjct: 228 PNQEDDVFIPLSLIEQCSSKLLELDDADGMQFVSLGVDVARFGDDETIIYRNYHGHCKIV 287 Query: 346 FDWSAKLIQET 356 + + + T Sbjct: 288 RNRRGQNLMAT 298 >gi|167767949|ref|ZP_02440002.1| hypothetical protein CLOSS21_02492 [Clostridium sp. SS2/1] gi|167710278|gb|EDS20857.1| hypothetical protein CLOSS21_02492 [Clostridium sp. SS2/1] gi|291560988|emb|CBL39788.1| hypothetical protein CL2_30180 [butyrate-producing bacterium SSC/2] Length = 473 Score = 318 bits (816), Expect = 7e-85, Method: Composition-based stats. Identities = 78/310 (25%), Positives = 131/310 (42%), Gaps = 31/310 (10%) Query: 48 FSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPG 107 F P WQ E A+ + K I +G+G+GKT A +LW +S Sbjct: 30 FFYPDEWQKEAAFALRDN------------SKVTIKSGQGVGKTGFEAATLLWFLSCFEN 77 Query: 108 MSIICIANSETQLKNTLWAEVSKWLSMLP-HRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 ++ A + QL + LWAEVSKW S P + + + G Sbjct: 78 ARVVATAPTLHQLNDVLWAEVSKWQSKSPLLKEILQWTKTKISMIG------------SK 125 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIM 226 + + RT + P+ G H + M DEASG D I ++ILG T N ++ Sbjct: 126 ERWYAVARTATT--PENMQGFHEDN-MLFIVDEASGVADPIMEAILGTLT--GSNNKLLL 180 Query: 227 TSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILG 286 N + +G FYD + + +++ + + + +I +YG +S+V R+ + G Sbjct: 181 CGNPTKASGTFYDSHTSDRKLYYCITVNSAESKRTNKDNIDSLIRKYGEESNVVRVRVKG 240 Query: 287 QFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIF 346 FP+Q+ + ++P +E ++ E I +G D+A G D TV+ N I Sbjct: 241 LFPKQDDDVYMPLEMLEASIILEEIPPADI-CTLGVDVARFGDDDTVIARNMNNKITLEK 299 Query: 347 DWSAKLIQET 356 + + +T Sbjct: 300 IRHGQDLMKT 309 >gi|266623290|ref|ZP_06116225.1| putative terminase B protein [Clostridium hathewayi DSM 13479] gi|288864932|gb|EFC97230.1| putative terminase B protein [Clostridium hathewayi DSM 13479] Length = 484 Score = 318 bits (815), Expect = 9e-85, Method: Composition-based stats. Identities = 73/297 (24%), Positives = 139/297 (46%), Gaps = 19/297 (6%) Query: 41 KGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLW 100 L + +P + + + ++ S ++ +G G+GK+ + +W ++W Sbjct: 4 DDAVLFYADEPIYFVEDIIRVTPDQKQRDILRSLRDYPMTSVRSGHGVGKSAVESWSVIW 63 Query: 101 LISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPH-RHWFEMQSLSLHPSGWYAELLE 159 + TRP I C A ++ QL + LWAE+SKWL P ++ ++ +G+ Sbjct: 64 FLCTRPFPKIPCTAPTQHQLYDILWAEISKWLRNNPELKNDIIWTQQRVYMNGY------ 117 Query: 160 QSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELN 219 + + RT + P+ G H H + + DEASG D + + +LG T + Sbjct: 118 ------PEEWFAVPRTAT--NPEALQGFHAEHVLYII-DEASGVSDKVFEPVLGAMTGED 168 Query: 220 PNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDV 279 +M N RL+G+F+D + ++ ID R + ++ F + II+ +G+DSDV Sbjct: 169 --AKLLMMGNPTRLSGFFFDSHHKSRSEYSAMHIDGRDSQHVNQKFVQKIINMFGMDSDV 226 Query: 280 ARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVF 336 R+ + GQFP+ ++ I ++ E A + + + + +G D+A G D + + Sbjct: 227 FRVRVAGQFPKSTPDSLIMMDWCEAATQLKP-ETVRNRVDIGVDVARYGDDSSALYP 282 >gi|319956916|ref|YP_004168179.1| hypothetical protein Nitsa_1177 [Nitratifractor salsuginis DSM 16511] gi|319419320|gb|ADV46430.1| hypothetical protein Nitsa_1177 [Nitratifractor salsuginis DSM 16511] Length = 462 Score = 313 bits (801), Expect = 4e-83, Method: Composition-based stats. Identities = 95/331 (28%), Positives = 148/331 (44%), Gaps = 42/331 (12%) Query: 42 GKPLEHF------SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNA 95 K LE F ++P + Q++ + A+D K +I +G G GKTTL A Sbjct: 13 AKSLEFFVRVILKAKPTKQQMKAIRAIDQGKK-----------KISIRSGHGTGKTTLLA 61 Query: 96 WMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYA 155 W++LW R I A + QL + L E+ KW +P ++ E++ + Sbjct: 62 WIVLWWGLGREDAKIPMTAPTGHQLYDLLMPEIRKWREKMPVQYQNEVEVKTEKID---- 117 Query: 156 ELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFF 215 + RT +++P+ G H T+ +A DEASG P +I + G Sbjct: 118 ---------FANGNFAVPRTARKDQPEALQGFHATN-LAFIIDEASGIPQVIFEVAEGAM 167 Query: 216 TELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGL 275 T + IM +N R G+FYD + W+ +Q + E + + E +YG Sbjct: 168 TGEST--LVIMAANPTRTEGYFYDSHHKNRWQWECFQFNAEESENVSKEWIEEKKRQYGE 225 Query: 276 DSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVV 335 DSDV R+ I G+FP+Q N +++A +RE +DD A + G D+A G DK+V+ Sbjct: 226 DSDVYRVRIKGEFPRQSSNAVFSLQEVDDATTREIVDDSGAEV-WGLDVADFGDDKSVLA 284 Query: 336 FRRGNIIEHIF--------DWSAKLIQETNQ 358 R+G I D + LI E NQ Sbjct: 285 KRKGKHFHEITARSGLTLPDLAGWLIYEYNQ 315 >gi|289578588|ref|YP_003477215.1| hypothetical protein Thit_1395 [Thermoanaerobacter italicus Ab9] gi|289528301|gb|ADD02653.1| conserved hypothetical protein [Thermoanaerobacter italicus Ab9] Length = 460 Score = 288 bits (738), Expect = 7e-76, Method: Composition-based stats. Identities = 76/320 (23%), Positives = 133/320 (41%), Gaps = 45/320 (14%) Query: 51 PHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSI 110 P Q E ++AV H + A+ A G+GKT + AW+ LW + T + Sbjct: 31 PWEKQEEILKAVRDHK------------RVAVRACHGVGKTKVAAWVALWFLYTHHNSKV 78 Query: 111 ICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYT 170 I A + Q++N LW E+ + ++L+ + + + + Sbjct: 79 ITTAPTWHQVENLLWREIHAAHAASRI--------------PLGGKVLQTQIELGEQWF- 123 Query: 171 ITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNT 230 S ++P+ F G H H + + DEASG + GF T + ++ N Sbjct: 124 --ALGLSTDKPERFQGFHAEHILLIV-DEASGVEQYTFDAAEGFLTSIG--AKLLLIGNP 178 Query: 231 RRLNGWFYDIFNIPLEDWKRYQIDTRTVEG-----------IDSGFHEGIISRYGLDSDV 279 +L+G FY+ F PL + + I + + E ++G DS + Sbjct: 179 TQLSGEFYNAFRSPL--YHKIHISAFDSPNLKAGKIVRPYLVTPEWVEDKRLKWGEDSPL 236 Query: 280 ARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRG 339 +LG+FP+Q + IP +IE A R + + P+ +G D+A G D TV++ RRG Sbjct: 237 WYSRVLGEFPEQGNDTLIPLAWIEAAQQRWHMTEAGEPVEIGADVARYGTDTTVIMLRRG 296 Query: 340 NIIEHIFDWSAKLIQETNQE 359 + E ++ + E + Sbjct: 297 DKAEIVYQLRGQDTMEVTGK 316 >gi|304399103|ref|ZP_07380971.1| DNA packaging protein [Pantoea sp. aB] gi|304353343|gb|EFM17722.1| DNA packaging protein [Pantoea sp. aB] Length = 503 Score = 275 bits (703), Expect = 9e-72, Method: Composition-based stats. Identities = 68/316 (21%), Positives = 132/316 (41%), Gaps = 32/316 (10%) Query: 28 FKNFVMRF-FPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGR 86 +++ V+R+ + W + +E F WQ E + +N+ T + +++G Sbjct: 16 WRDMVIRYRYNWALA--VVELFGMIPTWQQEEI----------MNSVQETGSQTTVTSGH 63 Query: 87 GIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSL 146 G GK++L A M+L + P +I +AN Q+K ++ V + + RH + Sbjct: 64 GTGKSSLTAMMLLIYMIMYPDARVIIVANKIGQVKTGVFKYVKTYWANAARRHPWLQNYF 123 Query: 147 SLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDI 206 +L + +Y + + + + C+ Y + G H H + + DEASG D Sbjct: 124 TLTDTMFYEKSRKGI-------WEVLCKGYRLGNEEALAGEHAAHILLIL-DEASGISDK 175 Query: 207 INKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-------IPLEDWKRYQIDTRTVE 259 + G TE + +M S R +G+FYD + P W +++ Sbjct: 176 AIAIMRGALTEED--NRMLMMSQPTRPSGYFYDSHHSLARHPDNPNGFWNAIVLNSEEAP 233 Query: 260 GIDSGFHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPL 318 + F + Y G DS +++LG+FP+ + + + A R+ + Sbjct: 234 HVTLKFIREKLVEYGGRDSLEYMVKVLGRFPRNVSGYLLGRDECDRAARRKVYLEKGWGW 293 Query: 319 IMGCDIAGEGGDKTVV 334 + D+ G G DK+++ Sbjct: 294 VATADV-GNGRDKSIL 308 >gi|332980681|ref|YP_004462122.1| hypothetical protein Mahau_0077 [Mahella australiensis 50-1 BON] gi|332698359|gb|AEE95300.1| hypothetical protein Mahau_0077 [Mahella australiensis 50-1 BON] Length = 486 Score = 273 bits (697), Expect = 3e-71, Method: Composition-based stats. Identities = 75/342 (21%), Positives = 128/342 (37%), Gaps = 62/342 (18%) Query: 49 SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGM 108 ++P + Q++ + AV + + A+ + G GK+ + ++LW + + Sbjct: 28 TRPWKKQIDIISAVRDN------------PRTAVRSCHGAGKSFIAGQVILWFLYSFYPS 75 Query: 109 SIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKH 168 ++ A + Q++ +W EV LL + I Sbjct: 76 IVLSTAPTWRQVEKLIWKEVRASYRRSKV--------------PLGGNLLPKRPEIQIIQ 121 Query: 169 YTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTS 228 S PD F G H + + V DEA+G P+ I ++I G T + ++ Sbjct: 122 DEWYAVGLSTNEPDRFQGFHEEN-ILVVVDEAAGVPEEIFEAIEGVLTSEH--ARLLLLG 178 Query: 229 NTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEG---------------------------- 260 N + G FY+ F P W+ I T Sbjct: 179 NPTSVGGTFYNAFRTP--GWENISISAFTTPNFTAFGITEDDIINKTWESKITNSLPNPK 236 Query: 261 -IDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLI 319 I + R+G +S + +LGQFP + + IP +IE AM+R P+ Sbjct: 237 LITPAWVADKYRRWGPNSPAYQARVLGQFPSEGEDTLIPLAWIEAAMARWEDTPEGEPIE 296 Query: 320 MGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQEGC 361 +G D+A G DKTV+ RRG + + ++ + ET GC Sbjct: 297 IGVDVARFGSDKTVIAARRGQKVLPLNVYAKQDTMET--VGC 336 >gi|312964323|ref|ZP_07778627.1| terminase B protein [Escherichia coli 2362-75] gi|331655801|ref|ZP_08356790.1| terminase B protein (PACase B protein) (DNA packaging B protein) [Escherichia coli M718] gi|312291036|gb|EFR18910.1| terminase B protein [Escherichia coli 2362-75] gi|323186470|gb|EFZ71817.1| terminase B protein [Escherichia coli 1357] gi|323969205|gb|EGB64507.1| terminase B protein [Escherichia coli TA007] gi|325495624|gb|EGC93488.1| DNA pacase B subunit [Escherichia fergusonii ECD227] gi|331046575|gb|EGI18664.1| terminase B protein (PACase B protein) (DNA packaging B protein) [Escherichia coli M718] Length = 494 Score = 271 bits (694), Expect = 1e-70, Method: Composition-based stats. Identities = 61/312 (19%), Positives = 122/312 (39%), Gaps = 31/312 (9%) Query: 32 VMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKT 91 + + W L F + WQ + + + + K ++S+G G GK+ Sbjct: 16 ALYRYDWIAAADVL--FGKTPTWQQDLI----------IESVQEQGSKTSVSSGHGTGKS 63 Query: 92 TLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPS 151 + + M++ I PG I +AN Q+ ++ + + R + L + Sbjct: 64 DMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSRFPWLADYFVLTET 123 Query: 152 GWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSI 211 +Y E+ + + +T+ + + + G H H + + DEASG D I Sbjct: 124 AFY-EVTGKGV------WTVVPKGFRLGSEEALAGEHADHLLYII-DEASGVSDRAFGII 175 Query: 212 LGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI-------PLEDWKRYQIDTRTVEGIDSG 264 G T + + S R +G+FYD + P + +++ + Sbjct: 176 TGALTGQDNRILLL--SQPTRPSGYFYDTHHKLAKRPGNPDGVYTAITLNSEESPLVTPA 233 Query: 265 FHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCD 323 F + ++ Y G D+ + I++ G FP+ + + + +E A R+ + D Sbjct: 234 FIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRKVKIAKGWGWLACVD 293 Query: 324 IA-GEGGDKTVV 334 +A G G DK+V+ Sbjct: 294 VAGGTGRDKSVI 305 >gi|324111095|gb|EGC05081.1| terminase B protein [Escherichia fergusonii B253] Length = 494 Score = 271 bits (693), Expect = 1e-70, Method: Composition-based stats. Identities = 61/312 (19%), Positives = 122/312 (39%), Gaps = 31/312 (9%) Query: 32 VMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKT 91 + + W L F + WQ + + + + K ++S+G G GK+ Sbjct: 16 ALYRYDWIAAADVL--FGKTPTWQQDLI----------IESVQEQGSKTSVSSGHGTGKS 63 Query: 92 TLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPS 151 + + M++ I PG I +AN Q+ ++ + + R + L + Sbjct: 64 DMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSRFPWLADYFVLTET 123 Query: 152 GWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSI 211 +Y E+ + + +T+ + + + G H H + + DEASG D I Sbjct: 124 AFY-EVTGKGV------WTVVPKGFRLGSEEALAGEHADHLLYII-DEASGVSDRAFGII 175 Query: 212 LGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI-------PLEDWKRYQIDTRTVEGIDSG 264 G T + + S R +G+FYD + P + +++ + Sbjct: 176 TGALTGQDNRILLL--SQPTRPSGYFYDTHHKLAKRPGNPDGVYTAITLNSEESPLVTPA 233 Query: 265 FHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCD 323 F + ++ Y G D+ + I++ G FP+ + + + +E A R+ + D Sbjct: 234 FIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRKVKIAKGWGWLACVD 293 Query: 324 IA-GEGGDKTVV 334 +A G G DK+V+ Sbjct: 294 VAGGTGRDKSVI 305 >gi|56266643|gb|AAV84926.1| DNA pacase B subunit [Enterobacteria phage phiW39] Length = 494 Score = 271 bits (693), Expect = 1e-70, Method: Composition-based stats. Identities = 61/312 (19%), Positives = 122/312 (39%), Gaps = 31/312 (9%) Query: 32 VMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKT 91 + + W L F + WQ + + + + K ++S+G G GK+ Sbjct: 16 ALYRYDWIAAADVL--FGKTPTWQQDLI----------IESVQEQGSKTSVSSGHGTGKS 63 Query: 92 TLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPS 151 + + M++ I PG I +AN Q+ ++ + + R + L + Sbjct: 64 DMTSIMIMLFIIMYPGARAIIVANKIQQVMTGIFKYIKINWATATSRFPWLADYFVLTET 123 Query: 152 GWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSI 211 +Y E+ + + +T+ + + + G H H + + DEASG D I Sbjct: 124 AFY-EITGKGV------WTVVPKGFRLGSEEALAGEHADHLLYII-DEASGVSDRAFGII 175 Query: 212 LGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI-------PLEDWKRYQIDTRTVEGIDSG 264 G T + + S R +G+FYD + P + +++ + Sbjct: 176 TGALTGQDNRILLL--SQPTRPSGYFYDTHHKLAKRPGNPDGVYTAITLNSEESPLVTPA 233 Query: 265 FHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCD 323 F + ++ Y G D+ + I++ G FP+ + + + +E A R+ + D Sbjct: 234 FIKMKLAEYGGRDNPMYMIKVRGLFPKSQDGFLLGRDEVERATRRKVKIAKGWGWLACVD 293 Query: 324 IA-GEGGDKTVV 334 +A G G DK+V+ Sbjct: 294 VAGGTGRDKSVI 305 >gi|168467778|ref|ZP_02701615.1| DNA pacase B subunit [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|195629119|gb|EDX48493.1| DNA pacase B subunit [Salmonella enterica subsp. enterica serovar Newport str. SL317] Length = 494 Score = 271 bits (692), Expect = 2e-70, Method: Composition-based stats. Identities = 62/310 (20%), Positives = 122/310 (39%), Gaps = 31/310 (10%) Query: 34 RFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTL 93 + W + F + WQ + + + + K ++S+G G GK+ + Sbjct: 18 YRYDWIAAADVM--FGKTPTWQQDQI----------IESVQEPGSKTSVSSGHGTGKSDM 65 Query: 94 NAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGW 153 + M++ I PG I +AN Q+ ++ + S R + + L + + Sbjct: 66 TSIMIMLFIIMFPGARAIIVANKIQQVMTGIFKYLKINWSTATSRFPWLAEYFVLTDTSF 125 Query: 154 YAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILG 213 Y E+ + + +T+ + + + G H H + + DEASG D + G Sbjct: 126 Y-EITSKGV------WTVVPKGFRLGNEEALAGEHADHLLYII-DEASGVSDKAFGIMTG 177 Query: 214 FFTELNPNRFWIMTSNTRRLNGWFYDIFNI-------PLEDWKRYQIDTRTVEGIDSGFH 266 T + + S R +G+FYD + P + +++ + F Sbjct: 178 ALTGKDNRILLL--SQPTRPSGYFYDTHHKLAKRPGNPNGIYTAITLNSEESPLVTPEFI 235 Query: 267 EGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIA 325 + ++ Y G DS + I++ G FP+ + + + +E A R+ I D+A Sbjct: 236 KMKLAEYGGRDSPMYLIKVRGLFPKTQDGFLLGRDEVERASRRKVKIAKGWGWIACVDVA 295 Query: 326 -GEGGDKTVV 334 G G DK+V+ Sbjct: 296 GGTGRDKSVI 305 >gi|262316909|emb|CBA18135.1| putative terminase B [Paenibacillus phage phiBP] Length = 248 Score = 270 bits (690), Expect = 2e-70, Method: Composition-based stats. Identities = 59/243 (24%), Positives = 107/243 (44%), Gaps = 18/243 (7%) Query: 47 HFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRP 106 + P + E + +V++ ++ +G+G+GKT L A + LW + P Sbjct: 23 YRKSPKTFFKEILNFSPDKWQESVSDDIAKYRFVSVRSGQGVGKTALEAAISLWFLCCFP 82 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPH-RHWFEMQSLSLHPSGWYAELLEQSMGID 165 ++C A + QL + LWAE+SKW S P + + ++ + Sbjct: 83 FPRVVCTAPTRQQLNDVLWAEISKWQSQSPILKRILKWTKTKIYMKNY------------ 130 Query: 166 SKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWI 225 + + T RT + +P+ G H + M DEASG D I +I G + + N+ Sbjct: 131 EERWFATARTAT--KPENMQGFHEDY-MLFIVDEASGVDDRIMAAIFGTLSG-DYNK-LF 185 Query: 226 MTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL 285 M N + +G+F+D N ++ +++ E + ++YG SDV R+ +L Sbjct: 186 MCGNPTKTSGFFFDSHNRDRAIYRTHRVSCLDSPRTSKENIEMLKAKYGEGSDVWRVRVL 245 Query: 286 GQF 288 G+F Sbjct: 246 GEF 248 >gi|83593922|ref|YP_427674.1| hypothetical protein Rru_A2590 [Rhodospirillum rubrum ATCC 11170] gi|83576836|gb|ABC23387.1| hypothetical protein Rru_A2590 [Rhodospirillum rubrum ATCC 11170] Length = 505 Score = 262 bits (670), Expect = 5e-68, Method: Composition-based stats. Identities = 70/326 (21%), Positives = 122/326 (37%), Gaps = 34/326 (10%) Query: 51 PHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSI 110 P Q + A+ P K + AG G+GKTT A + W + Sbjct: 21 PTAQQAGLLSAI-----------APAGAKVTVRAGHGVGKTTATAAAIWWHLECFDYSKT 69 Query: 111 ICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGI-DSKHY 169 C A + +QL+ LW+E+++ R +L +A + + Sbjct: 70 PCTAPTASQLEQILWSELARLRRRADARAQGTGLPAALRLEALFAVSGRAIADRGTPREW 129 Query: 170 TITCRTYSEERPDTFVGPHNTHG------------------MAVFNDEASGTPDIINKSI 211 + RT ++PD G H + + +EASG PD + + Sbjct: 130 FVVARTARRDQPDALQGFHASDIDLEAGAGPRLSAKSGGAALMFVIEEASGVPDAVFEVA 189 Query: 212 LGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIIS 271 G + P +M N R G+F + ++ +D G+ G++ Sbjct: 190 EGALSS--PGARLLMVGNPTRNTGFFARSHKRDRASFTALRLRCADSPLVDPGYRAGLVR 247 Query: 272 RYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAGEGG 329 +YG +S+V R+ G FP+Q+ + I E A++R A +G D+A G Sbjct: 248 KYGAESNVVRVRADGAFPRQDDDVLIALETAEAALARPLPARMATEDERRLGVDVARFGD 307 Query: 330 DKTVVVFRRGNIIEHIFDWSAKLIQE 355 D+TV + R G ++ I + + Sbjct: 308 DRTVFLLRIGPVVGAIEVTAGRDTMA 333 >gi|322656964|gb|EFY53248.1| DNA packaging protein [Salmonella enterica subsp. enterica serovar Montevideo str. CASC_09SCPH15965] Length = 411 Score = 262 bits (669), Expect = 6e-68, Method: Composition-based stats. Identities = 60/271 (22%), Positives = 114/271 (42%), Gaps = 19/271 (7%) Query: 72 NSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKW 131 + T + +++G G GK++L A ++L + P +I +AN Q+K ++ V ++ Sbjct: 49 SVQETGSRTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQY 108 Query: 132 LSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTH 191 + RH + L + +Y + + C+ Y + G H H Sbjct: 109 WANAVKRHGWLQTYFVLSDTMFYE-------RSRKGIWEVLCKGYRLGNEEALAGEHAAH 161 Query: 192 GMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-------IP 244 + + DEASG D + G TE + +M S R +G+FYD + P Sbjct: 162 LLLIL-DEASGISDKAIGVMTGALTEED--NRMLMLSQPTRPSGYFYDSHHSQAKTPDNP 218 Query: 245 LEDWKRYQIDTRTVEGIDSGFHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIE 303 W +++ + F + + Y G DS +++LGQFP++ + + + Sbjct: 219 KGIWTAIVLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECD 278 Query: 304 EAMSREAIDDLYAPLIMGCDIAGEGGDKTVV 334 A R+ + + + D+ G G DK+V+ Sbjct: 279 RAARRKVLLEKNWGWVATADV-GNGRDKSVL 308 >gi|56266666|gb|AAV84947.1| DNA pacase B subunit [Enterobacteria phage D6] Length = 502 Score = 262 bits (669), Expect = 6e-68, Method: Composition-based stats. Identities = 60/271 (22%), Positives = 114/271 (42%), Gaps = 19/271 (7%) Query: 72 NSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKW 131 + T + +++G G GK++L A ++L + P +I +AN Q+K ++ V ++ Sbjct: 49 SVQETGSRTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQY 108 Query: 132 LSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTH 191 + RH + L + +Y + + C+ Y + G H H Sbjct: 109 WANAVKRHGWLQTYFVLSDTMFYE-------RSRKGIWEVLCKGYRLGNEEALAGEHAAH 161 Query: 192 GMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-------IP 244 + + DEASG D + G TE + +M S R +G+FYD + P Sbjct: 162 LLLIL-DEASGISDKAIGVMTGALTEED--NRMLMLSQPTRPSGYFYDSHHSQAKTPDNP 218 Query: 245 LEDWKRYQIDTRTVEGIDSGFHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIE 303 W +++ + F + + Y G DS +++LGQFP++ + + + Sbjct: 219 KGIWTAIVLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECD 278 Query: 304 EAMSREAIDDLYAPLIMGCDIAGEGGDKTVV 334 A R+ + + + D+ G G DK+V+ Sbjct: 279 RAARRKVLLEKNWGWVATADV-GNGRDKSVL 308 >gi|323179619|gb|EFZ65182.1| terminase B protein [Escherichia coli 1180] Length = 453 Score = 262 bits (669), Expect = 8e-68, Method: Composition-based stats. Identities = 60/269 (22%), Positives = 113/269 (42%), Gaps = 19/269 (7%) Query: 74 NPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLS 133 T + +++G G GK++L A ++L + P +I +AN Q+K ++ V ++ + Sbjct: 2 QETGSRTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQYWA 61 Query: 134 MLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGM 193 RH + L + +Y + + C+ Y + G H H + Sbjct: 62 NAVKRHGWLQTYFVLSDTMFYE-------RSRKGIWEVLCKGYRLGNEEALAGEHAAHLL 114 Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-------IPLE 246 + DEASG D + G TE + +M S R +G+FYD + P Sbjct: 115 LIL-DEASGISDKAIGVMTGALTEED--NRMLMLSQPTRPSGYFYDSHHSQAKTPDNPKG 171 Query: 247 DWKRYQIDTRTVEGIDSGFHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEA 305 W +++ + F + + Y G DS +++LGQFP++ + + + A Sbjct: 172 IWTAIVLNSEESPFVTPQFIKQKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECDRA 231 Query: 306 MSREAIDDLYAPLIMGCDIAGEGGDKTVV 334 R+ + + + D+ G G DK+V+ Sbjct: 232 ARRKVLLEKNWGWVATADV-GNGRDKSVL 259 >gi|260871239|ref|YP_003238019.1| DNA packaging protein [Escherichia coli O111:H- str. 11128] gi|257767818|dbj|BAI39311.1| DNA packaging protein [Escherichia coli O111:H- str. 11128] Length = 494 Score = 261 bits (667), Expect = 1e-67, Method: Composition-based stats. Identities = 59/312 (18%), Positives = 126/312 (40%), Gaps = 31/312 (9%) Query: 32 VMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKT 91 + + W L F + WQ + + + ++ ++++G G GK+ Sbjct: 16 ALYRYDWIAAADVL--FGKTPTWQQDEI----------IESTQQDGSWTSVTSGHGTGKS 63 Query: 92 TLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPS 151 + + + + I PG +I +AN Q+ + ++ + + R + + L + Sbjct: 64 DMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFPWLSKYFILTET 123 Query: 152 GWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSI 211 ++ E+ + + +TI ++ + G H H + + DEASG D I Sbjct: 124 SFF-EVTGKGV------WTILIKSCRSGNEEALAGEHADHLLYII-DEASGVSDKAFSVI 175 Query: 212 LGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI-------PLEDWKRYQIDTRTVEGIDSG 264 G T + + S R +G+FYD + P + +++ +D+ Sbjct: 176 TGALTGKDNRILLL--SQPTRPSGYFYDSHHRLAIRPGNPDGLFTAIILNSEESPLVDAK 233 Query: 265 FHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCD 323 F ++ Y G D+ + I++ G+FP+ + + + +E A R+ + D Sbjct: 234 FIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVKIAKGWGWVACVD 293 Query: 324 IA-GEGGDKTVV 334 +A G G DK+V+ Sbjct: 294 VAGGTGRDKSVI 305 >gi|46401730|ref|YP_006576.1| PacB [Enterobacteria phage P1] gi|301646767|ref|ZP_07246623.1| putative terminase B protein [Escherichia coli MS 146-1] gi|129547|sp|P27753|TERL_BPP1 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging protein B; AltName: Full=PACase B protein; AltName: Full=Terminase B protein; AltName: Full=Terminase large subunit gi|68597607|sp|Q5XLR0|TERL_BPP7 RecName: Full=Large terminase protein; AltName: Full=DNA-packaging protein B; AltName: Full=PACase B protein; AltName: Full=Terminase B protein; AltName: Full=Terminase large subunit gi|33323612|gb|AAQ07582.1|AF503408_106 PacB [Enterobacteria phage P7] gi|215636|gb|AAA21724.1| pacB [Enterobacteria phage P1] gi|33338757|gb|AAQ14080.1| PacB [Enterobacteria phage P1] gi|33338866|gb|AAQ14188.1| PacB [Enterobacteria phage P1] gi|54112354|gb|AAV28854.1| PacB [Enterobacteria phage P7] gi|301075042|gb|EFK89848.1| putative terminase B protein [Escherichia coli MS 146-1] Length = 494 Score = 260 bits (665), Expect = 2e-67, Method: Composition-based stats. Identities = 59/312 (18%), Positives = 126/312 (40%), Gaps = 31/312 (9%) Query: 32 VMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKT 91 + + W L F + WQ + + + ++ ++++G G GK+ Sbjct: 16 ALYRYDWIAAADVL--FGKTPTWQQDEI----------IESTQQDGSWTSVTSGHGTGKS 63 Query: 92 TLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPS 151 + + + + I PG +I +AN Q+ + ++ + + R + + L + Sbjct: 64 DMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFPWLSKYFILTET 123 Query: 152 GWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSI 211 ++ E+ + + +TI ++ + G H H + + DEASG D I Sbjct: 124 SFF-EVTGKGV------WTILIKSCRPGNEEALAGEHADHLLYII-DEASGVSDKAFSVI 175 Query: 212 LGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI-------PLEDWKRYQIDTRTVEGIDSG 264 G T + + S R +G+FYD + P + +++ +D+ Sbjct: 176 TGALTGKDNRILLL--SQPTRPSGYFYDSHHRLAIRPGNPDGLFTAIILNSEESPLVDAK 233 Query: 265 FHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCD 323 F ++ Y G D+ + I++ G+FP+ + + + +E A R+ + D Sbjct: 234 FIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVKIAKGWGWVACVD 293 Query: 324 IA-GEGGDKTVV 334 +A G G DK+V+ Sbjct: 294 VAGGTGRDKSVI 305 >gi|331649955|ref|ZP_08351031.1| terminase B protein (PACase B protein) (DNA packaging B protein) [Escherichia coli M605] gi|331041212|gb|EGI13366.1| terminase B protein (PACase B protein) (DNA packaging B protein) [Escherichia coli M605] Length = 494 Score = 260 bits (665), Expect = 2e-67, Method: Composition-based stats. Identities = 59/312 (18%), Positives = 126/312 (40%), Gaps = 31/312 (9%) Query: 32 VMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKT 91 + + W L F + WQ + + + ++ ++++G G GK+ Sbjct: 16 ALYRYDWIAAADVL--FGKTPTWQQDEI----------IESTQQDGSWTSVTSGHGTGKS 63 Query: 92 TLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPS 151 + + + + I PG +I +AN Q+ + ++ + + R + + L + Sbjct: 64 DMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFPWLSKYFILTET 123 Query: 152 GWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSI 211 ++ E+ + + +TI ++ + G H H + + DEASG D I Sbjct: 124 SFF-EVTGKGV------WTILIKSCRPGNEEALAGEHADHLLYII-DEASGVSDKAFSVI 175 Query: 212 LGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI-------PLEDWKRYQIDTRTVEGIDSG 264 G T + + S R +G+FYD + P + +++ +D+ Sbjct: 176 TGALTGKDNRILLL--SQPTRPSGYFYDSHHRLAIRPGNPDGLFTAIILNSEESPLVDAK 233 Query: 265 FHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCD 323 F ++ Y G D+ + I++ G+FP+ + + + +E A R+ + D Sbjct: 234 FIRAKLAEYGGRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVKIAKGWGWVACVD 293 Query: 324 IA-GEGGDKTVV 334 +A G G DK+V+ Sbjct: 294 VAGGTGRDKSVI 305 >gi|323948959|gb|EGB44853.1| terminase B protein [Escherichia coli H252] Length = 502 Score = 260 bits (663), Expect = 3e-67, Method: Composition-based stats. Identities = 59/271 (21%), Positives = 114/271 (42%), Gaps = 19/271 (7%) Query: 72 NSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKW 131 + T + +++G G GK++L A ++L + P +I +AN Q+K ++ V ++ Sbjct: 49 SVQETGSRTTVTSGHGTGKSSLTAMLLLIFMILFPDARVIIVANKIGQVKTGVFKYVKQY 108 Query: 132 LSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTH 191 + RH + L + +Y + + C+ Y + G H H Sbjct: 109 WANAVKRHGWLQTYFVLSDTMFYE-------RSRKGIWEVLCKGYRLGNEEALAGEHAAH 161 Query: 192 GMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-------IP 244 + + DEASG D + G TE + +M S R +G+FYD + P Sbjct: 162 LLLIL-DEASGISDKAIGVMTGALTEED--NRMLMLSQPTRPSGYFYDSHHSRAKTPDNP 218 Query: 245 LEDWKRYQIDTRTVEGIDSGFHEGIISRY-GLDSDVARIEILGQFPQQEVNNFIPHNYIE 303 W +++ + F + + Y G DS +++LGQFP++ + + + Sbjct: 219 KGIWTAIVLNSEESPFVTPQFIKEKLLEYGGRDSIEYMVKVLGQFPREINGYLLGRDECD 278 Query: 304 EAMSREAIDDLYAPLIMGCDIAGEGGDKTVV 334 + R+ + + + D+ G G DK+V+ Sbjct: 279 RSARRKVLLEKNWGWVATADV-GNGRDKSVL 308 >gi|48697461|ref|YP_024846.1| Pas60 [Actinoplanes phage phiAsp2] gi|47679679|gb|AAT36808.1| Pas60 [Actinoplanes phage phiAsp2] Length = 492 Score = 239 bits (610), Expect = 5e-61, Method: Composition-based stats. Identities = 74/327 (22%), Positives = 122/327 (37%), Gaps = 32/327 (9%) Query: 49 SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGM 108 P W + ++ + ++ P + A+ G+GK+ A ++ W +TR M Sbjct: 21 DSPTAWAADCLDVRLAGYQGEILDAVPRERRVAVRGPHGLGKSFSGAILVNWFATTRDLM 80 Query: 109 ----SIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGI 164 II A++ L+ LW E+ KW + +L P ELL+ + + Sbjct: 81 GKDWKIITTASAWRHLEVYLWPEIHKWAGRI------NFVALGRAPYNPRTELLDLRLKL 134 Query: 165 DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELN----P 220 T +P+ G H + + DEA P SI G F+ Sbjct: 135 THGAATAVA----SNQPERIEGAHAEELLYLL-DEAKIVPPATWDSIEGAFSNAGVDVAD 189 Query: 221 NRFWIMTSNTRRLNGWFYDIFNI--PLEDWKRYQIDTRT---VEGIDSGFHEGIISRYGL 275 N + S +G FYDI EDW + I + + S++G Sbjct: 190 NAYAFAMSTPGAPSGRFYDIHRRAPGYEDWWTRHVTLEEAIASGRISRAWADQRRSQWGS 249 Query: 276 DSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREA------IDDLYAPLIMGCDIAGEGG 329 DS V +LG+F + ++ IP ++E A+ R PL G D+ GG Sbjct: 250 DSAVFHNRVLGEFHASDEDSVIPLAWLEAAIERWHEWDRQGRPSPGGPLWTGVDVGR-GG 308 Query: 330 DKTVVVFRRGNIIEHIFDWSAKLIQET 356 D+TV+ R G + + + T Sbjct: 309 DETVLAARDGWAV-TLETNRRRDTMAT 334 >gi|269119479|ref|YP_003307656.1| hypothetical protein Sterm_0853 [Sebaldella termitidis ATCC 33386] gi|268613357|gb|ACZ07725.1| hypothetical protein Sterm_0853 [Sebaldella termitidis ATCC 33386] Length = 499 Score = 232 bits (591), Expect = 8e-59, Method: Composition-based stats. Identities = 67/333 (20%), Positives = 125/333 (37%), Gaps = 41/333 (12%) Query: 51 PHRWQLEFMEA-VDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMS 109 P + + + + V + + ++ AG GK++L + + + TRP Sbjct: 18 PVNFFKDILNFHFLSEDQTRVLQAFNEYRRLSVPAGHSTGKSSLAGGLTTYWLITRPKSR 77 Query: 110 IICIANSETQLKNTLWAEVSKWLSMLP---------------------HRHWFEMQSLSL 148 +I A + QLK WAEV+K + R WF + + Sbjct: 78 VIVTAPTYRQLKTIYWAEVNKIYNRSKLKQLNLFEINDKIMRINDKDLKREWFALPVTAS 137 Query: 149 HPSGWYA------ELLEQSMGI-------DSKHYTITCRTYSEERPDTFVGPHNTHGMAV 195 P G E++EQ M D + I + E+ + + + V Sbjct: 138 TPEGMQGQHGDKTEVIEQIMKHLGIEEIGDDETIEIVSQILRGEKQIEGLTKEDKEKLLV 197 Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDT 255 DE+SG + I + + G T+ + ++ N + G+FY+ P + + + + Sbjct: 198 MVDESSGVKNEIFEVLEG--TDYD---KLVLFGNMTKNTGYFYESVYNPKSKFYKVTMSS 252 Query: 256 RTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLY 315 + + YG DS+V R+ + G+ P N+ N I+ A R Y Sbjct: 253 YNSPFMKKEQIHDLEETYGPDSNVVRVRLKGEAPDGNENSIFSSNKIDSAFQRSLSLSEY 312 Query: 316 APLIMGCDIA-GEGGDKTVVVFRRGNIIEHIFD 347 + +G D+ G GGD + + ++ N + D Sbjct: 313 ETIKLGVDVGKGSGGDSSTIYEKKDNRVRKKLD 345 >gi|307308936|ref|ZP_07588619.1| hypothetical protein SinmeBDRAFT_4503 [Sinorhizobium meliloti BL225C] gi|306900570|gb|EFN31183.1| hypothetical protein SinmeBDRAFT_4503 [Sinorhizobium meliloti BL225C] Length = 472 Score = 224 bits (572), Expect = 1e-56, Method: Composition-based stats. Identities = 74/296 (25%), Positives = 126/296 (42%), Gaps = 30/296 (10%) Query: 64 VHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNT 123 +C + NN + G GKT ++A + W + + + A SE+ +K+ Sbjct: 39 EYCEAFKNNQT-----ITVKGSSGWGKTFISAISLWWSLIVFDPVKVTIFAPSESTIKSG 93 Query: 124 LWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQS-MGIDSKHYTITC----RTYSE 178 +W E+ Q L + + + EL E S I K TC R S+ Sbjct: 94 IWNEL---------------QVLYSNMAPLFRELFEVSATKIFRKSRGETCWAEYRLVSK 138 Query: 179 ERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFY 238 + G H+ + + V DEASG D+I L P ++ SN + +G+F+ Sbjct: 139 DNIAAARGFHSKNNI-VIADEASGIEDVIFTGALLNVLNDGPGAKVVLVSNPDKASGFFF 197 Query: 239 DIFNIP--LEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIE-ILGQFPQQEVNN 295 + P +DW + R G E YG + + + G+FP +V+ Sbjct: 198 KTWRDPELSKDWIKVHGSIRDKPNYTPGEEERFARLYGGVTSRDYLTLVEGEFPLSDVDG 257 Query: 296 FIPHNYIEEA-MSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 I +++EA +++AI + AP+I G D AG G DK+V+ R N++ +W+ Sbjct: 258 LISREFLDEAVTNKDAIPNPKAPIIWGLDPAGAGKDKSVLAIRHDNVLRGFEEWAG 313 >gi|216906085|ref|YP_002333619.1| terminase [Abalone shriveling syndrome-associated virus] gi|216263178|gb|ACJ72002.1| terminase [Abalone shriveling syndrome-associated virus] Length = 507 Score = 217 bits (553), Expect = 2e-54, Method: Composition-based stats. Identities = 80/313 (25%), Positives = 125/313 (39%), Gaps = 35/313 (11%) Query: 54 WQLEFMEAVDVHCHSNVNNSNPTIFKCA--ISAGRGIGKTTLNAWMMLWLISTRPGMSII 111 WQLE VD NS+ F CA +S G G GKT L+ + +W PG Sbjct: 51 WQLEI---VDYIAKFFRKNSDEKHFVCAIAVSGGNGTGKTKLSKALNIWRFCCHPGSRQF 107 Query: 112 CIANSETQLK----NTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSK 167 + NSE Q K L +SK LS + ++S + + S A+ E D Sbjct: 108 ILTNSERQTKRTGFTMLVRRISKLLSCIA-----ALESSAYYYSPAVADKPEVRTN-DMW 161 Query: 168 HYTITCRTYSEERPDTFVGPHNTHGMAVF-NDEASGTPDIINKSILGFFTELNPNRFWIM 226 T ++ +E G H H M F DE++ D + +++ +T+ Sbjct: 162 DVTYLLQSSTEA---ALSGLH--HPMMTFSFDESTYFNDHVWQALENMWTQ--GQVLCFC 214 Query: 227 TSNT-RRLNGWFYDIFNIPLEDWKRYQIDTRTVEGI-------DSGFHEGIISRYGLDSD 278 T N N +F +FN L + TR V + + I YG Sbjct: 215 TGNPSHDNNNYFARLFNKSLHKKDSLWL-TRCVSLLELPLKYRNDARARYIEEHYGKTHP 273 Query: 279 VARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDD-LYAPLIMGCDI--AGEGGDKTVVV 335 +LGQFP++ N I EAM RE ++ ++ P+IMG D+ + G + + Sbjct: 274 RYIASVLGQFPKKNTCNPFDITAISEAMEREVREEFIHHPVIMGIDVSISANNGSASAIC 333 Query: 336 FRRGNIIEHIFDW 348 R G + + ++ Sbjct: 334 VREGTAVRVLREY 346 >gi|161789175|ref|YP_001595730.1| PacB [Vibrio sp. 0908] gi|161761461|gb|ABX77106.1| PacB [Vibrio sp. 0908] Length = 572 Score = 215 bits (547), Expect = 1e-53, Method: Composition-based stats. Identities = 65/290 (22%), Positives = 113/290 (38%), Gaps = 27/290 (9%) Query: 50 QPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMS 109 +P Q+E + A+ P + ++++G G GK+ L A + L I T P Sbjct: 44 EPSFQQIEVINAL-----------TPVGARVSVASGHGTGKSHLTAALCLHFIITHPESL 92 Query: 110 IICIANSETQLKNTLWAEVSK-WLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKH 168 + ANS Q+ N +++ + + W+ + + W E Q + +YA+ + I K Sbjct: 93 CMLTANSLDQVTNVVFSYIKRCWVKICQRQPWLE-QYFVITAKSFYAKGYKGVWQIFGK- 150 Query: 169 YTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTS 228 T S+ + G H M V DEASG D + + G TE N N+ +++ Sbjct: 151 ------TCSKGNEEGLAGQHRRDYMVVV-DEASGVSDRAFEVLRGALTEDN-NKMLLISQ 202 Query: 229 NTRRLNGWFYDIFNI--PLEDWKRYQIDTRTVEGIDSGFHEGIISRYGL-DSDVARIEIL 285 R G F D + +++ ++ F YG S I +L Sbjct: 203 F-TRPTGHFADSQMELAEQGLYTAITLNSEMSPFVNLKFIREKRIEYGGVTSPEYGIRVL 261 Query: 286 GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIA-GEGGDKTVV 334 G P I + +++ + D+A GEG D +V+ Sbjct: 262 GVCPDDASGFLISRSLVDKGFEAVIEFADEWGWVAVADVAGGEGRDSSVL 311 >gi|332974843|gb|EGK11758.1| hypothetical protein HMPREF9373_1714 [Psychrobacter sp. 1501(2011)] Length = 520 Score = 213 bits (542), Expect = 4e-53, Method: Composition-based stats. Identities = 58/292 (19%), Positives = 110/292 (37%), Gaps = 30/292 (10%) Query: 53 RWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIIC 112 WQ E + + + ++++G G GK+ + LW + P ++ Sbjct: 41 TWQQELL----------FKSIVVPGSRTSVASGHGTGKSRSAGIIALWHLLFYPESVMLF 90 Query: 113 IANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSK--HYT 170 A QL+ +W E++ L L + + Y +L + + I + Sbjct: 91 TAPQIGQLRTVVWKEINICLQRLRNNKALGWLAD-------YVVVLAEKIYIKGFKDTWF 143 Query: 171 ITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNT 230 + +T + +P G H H M V+ DEA G D + + +G T N ++TS Sbjct: 144 VFAKTAPKHQPTNIAGQHGDHYM-VWADEACGIDDAVMEVAIGALTHE--NNRAVLTSQP 200 Query: 231 RRLNGWFYDIFNIPLE----DWKRYQIDTRTVEGIDSGFHEGIISRYG-LDSDVARIEIL 285 + G+FYD + W + + + + +YG +S I I Sbjct: 201 AKNTGFFYDTHHKLSHYNGGKWIALEFNGEMSPIVSKEKLIEALYQYGSRNSPGYLIRIR 260 Query: 286 GQFPQQEVNNFIPHNYIE--EAMSREAIDDLYAPLIMGCDIAG-EGGDKTVV 334 G+FP+ + + E +A + +I+ D+ G G D +V+ Sbjct: 261 GKFPELKGEYLLTRTDYENMKAHPCVIKEGDKWGIIVTVDVGGDVGRDSSVI 312 >gi|148653111|ref|YP_001280204.1| hypothetical protein PsycPRwf_1309 [Psychrobacter sp. PRwf-1] gi|148572195|gb|ABQ94254.1| hypothetical protein PsycPRwf_1309 [Psychrobacter sp. PRwf-1] Length = 520 Score = 211 bits (536), Expect = 2e-52, Method: Composition-based stats. Identities = 57/292 (19%), Positives = 110/292 (37%), Gaps = 30/292 (10%) Query: 53 RWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIIC 112 WQ E + + + ++++G G GK+ + LW + P ++ Sbjct: 41 TWQQELL----------FKSIVVPGSRTSVASGHGTGKSRSAGIIALWHLLFYPESVMLF 90 Query: 113 IANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSK--HYT 170 A QL+ +W E++ L L + + Y +L + + I + Sbjct: 91 TAPQIGQLRTVVWKEINICLQRLRNNKALGWLAD-------YVVVLAEKIYIKGFKDTWF 143 Query: 171 ITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNT 230 + +T + +P G H H M V+ DEA G D + + +G T N ++TS Sbjct: 144 VFAKTAPKHQPTNIAGQHGDHYM-VWADEACGIDDAVMEVAIGALTHE--NNRAVLTSQP 200 Query: 231 RRLNGWFYDIFNIPLE----DWKRYQIDTRTVEGIDSGFHEGIISRYG-LDSDVARIEIL 285 + G+FYD + W + + + + +YG +S I I Sbjct: 201 AKNTGFFYDTHHKLSHHNGGKWTALEFNGEMSPIVSKDKLIEALYQYGSRNSPGYLIRIR 260 Query: 286 GQFPQQEVNNFIPHNYIEEAMSREAIDDLY--APLIMGCDIAG-EGGDKTVV 334 G+FP+ + + E + + + +I+ D+ G G D +V+ Sbjct: 261 GKFPELKGEYLLTRTDYENMKQQPCVIEEGDKWGIIVAVDVGGDVGRDSSVI 312 >gi|315649222|ref|ZP_07902312.1| hypothetical protein PVOR_28644 [Paenibacillus vortex V453] gi|315275441|gb|EFU38799.1| hypothetical protein PVOR_28644 [Paenibacillus vortex V453] Length = 189 Score = 205 bits (521), Expect = 1e-50, Method: Composition-based stats. Identities = 52/202 (25%), Positives = 83/202 (41%), Gaps = 31/202 (15%) Query: 38 WGIKGKPLEHFSQ--PHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNA 95 W E P WQ + M V + ++ +G+G+GKT L A Sbjct: 16 WDDPVAFAEDMMGFDPDDWQCDVMMDV------------TQFPRTSVRSGQGVGKTGLEA 63 Query: 96 WMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYA 155 +++W + RP ++C A ++ QL + LW EVSKWL ++ + ++ G Sbjct: 64 ALVIWFLCCRPNPKVVCTAPTKQQLHDVLWTEVSKWLENSMVKNLLKWTKTKVYMIG--- 120 Query: 156 ELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFF 215 + + T RT +P+ G H + M DEASG D I ++ILG Sbjct: 121 ---------HEQRWFATARTA--NKPENMQGFHEDY-MLFIVDEASGVSDPIMEAILGTL 168 Query: 216 TELNPNRFWIMTSNTRRLNGWF 237 + +M N R +G F Sbjct: 169 S--GAENKLLMCGNPTRTSGVF 188 >gi|320103661|ref|YP_004179252.1| hypothetical protein Isop_2123 [Isosphaera pallida ATCC 43644] gi|319750943|gb|ADV62703.1| hypothetical protein Isop_2123 [Isosphaera pallida ATCC 43644] Length = 553 Score = 199 bits (506), Expect = 5e-49, Method: Composition-based stats. Identities = 64/295 (21%), Positives = 104/295 (35%), Gaps = 32/295 (10%) Query: 76 TIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSML 135 ++ G +GK+ L A + LW + T PG ++ A S+ L L+ E+ K L+ Sbjct: 62 RARSVVVATGNAVGKSYLAAGLTLWWLYTHPGSLVVATAPSQGLLGTVLFRELQKALA-A 120 Query: 136 PHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAV 195 R + + + L G C + + G H+ M V Sbjct: 121 SRRRGLGLPGMVVGSDRGTPFSLRVGPGRRLAAEGWGCLGIATRGVERLAGRHHADLMVV 180 Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK------ 249 DEASG + T LNP R + N F+ + L + Sbjct: 181 V-DEASGVQPEAWE----ALTSLNP-RKLFVCGNPLTPGTVFHKLHQRGLTEASDPSIPD 234 Query: 250 -----RYQIDTRTVEGID----------SGFHEGIISRYGLDSDVARIEILGQFPQQEVN 294 I + I+ GF ++G S + + G FP V+ Sbjct: 235 HARGVALTIPSTASPDINLERSPRGLADRGFIREAERQWGRGSPLWLSHVEGVFPTVAVH 294 Query: 295 NFIPHNYIEEAMSREAIDDLYAP---LIMGCDI-AGEGGDKTVVVFRRGNIIEHI 345 I ++++A S E P ++GCD+ AG G D+T +V R I + Sbjct: 295 ALIEPGWLDQAASLERSQTYENPPGQPVLGCDLAAGVGADRTAIVVRDEGGIREL 349 >gi|134287454|ref|YP_001109621.1| hypothetical protein Bcep1808_7700 [Burkholderia vietnamiensis G4] gi|134131876|gb|ABO60570.1| hypothetical protein Bcep1808_7700 [Burkholderia vietnamiensis G4] Length = 509 Score = 197 bits (500), Expect = 2e-48, Method: Composition-based stats. Identities = 53/282 (18%), Positives = 110/282 (39%), Gaps = 19/282 (6%) Query: 59 MEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSET 118 ++A H ++ + + ++S+G G GKT+ A + LW + + I A + Sbjct: 34 LKAPTHHQIQMFDSVSKQGSRTSVSSGHGTGKTSGFAIIALWHLLCYYLSNTILTAPKIS 93 Query: 119 QLKNTLWAEVSKWLSMLPHRHW-FEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYS 177 + + +W E + + + + + + + Y + ++ + ++ Sbjct: 94 TVSDGVWKEFADLSTKISNGPQSWIWEYFVIESERVYVRGYKL-------NWFVIAKSAP 146 Query: 178 EERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWF 237 P+ G H + DEASG PD I G T+ NR + + R +G+F Sbjct: 147 RGSPENLAGAHRD-WLLWLADEASGIPDDNFGVITGSLTDE-RNRMCLASQ-PTRSSGFF 203 Query: 238 YDIFNIPLED----WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEV 293 Y+ + W ++ + + F +Y + +I++ G+FP+ Sbjct: 204 YETHHALSRAEGGPWNNLVFNSEFSPIVSAKFIAEKKLQYTEE--EYQIKVQGRFPENSS 261 Query: 294 NNFIPHNYIEEAMSREAI-DDLYAPLIMGCDIAGEG-GDKTV 333 + IE + R I D + ++ D+ G G D+TV Sbjct: 262 KYLVGPQAIEACVGRTVIKPDEHWGWLLPVDVGGGGWRDETV 303 >gi|299769795|ref|YP_003731821.1| hypothetical protein AOLE_07785 [Acinetobacter sp. DR1] gi|298699883|gb|ADI90448.1| hypothetical protein AOLE_07785 [Acinetobacter sp. DR1] Length = 668 Score = 194 bits (494), Expect = 1e-47, Method: Composition-based stats. Identities = 58/259 (22%), Positives = 94/259 (36%), Gaps = 18/259 (6%) Query: 86 RGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHW-FEMQ 144 GKT + LW + ++ A QLK +W E+S L+ L + Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEISINLARLKQGPLAWLAD 267 Query: 145 SLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTP 204 + Y + ++ + + +T + +P G H + M V+ DEASG Sbjct: 268 YVGYQSELVYIKGYKEK-------WYVFAKTAPKHQPTNLAGNHGDNYM-VWVDEASGVD 319 Query: 205 DIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED----WKRYQIDTRTVEG 260 D + G T + +MTS R G FY+ + W + Sbjct: 320 DAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESPL 377 Query: 261 IDSGFHEGIISRYG-LDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYA-PL 318 + E +YG D +I +LG+FP I EE +I D + Sbjct: 378 VSKQSLEEQRQKYGSRDDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFGY 437 Query: 319 IMGCDIAGE-GGDKTVVVF 336 I+ D+ G G D +V+V Sbjct: 438 IITVDVGGGVGRDDSVIVI 456 >gi|228924410|ref|ZP_04087639.1| hypothetical protein bthur0011_53510 [Bacillus thuringiensis serovar huazhongensis BGSC 4BD1] gi|228835241|gb|EEM80653.1| hypothetical protein bthur0011_53510 [Bacillus thuringiensis serovar huazhongensis BGSC 4BD1] Length = 293 Score = 194 bits (493), Expect = 2e-47, Method: Composition-based stats. Identities = 39/132 (29%), Positives = 65/132 (49%), Gaps = 1/132 (0%) Query: 225 IMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI 284 + N R +G FYD N + +K +++ + E + +YG SDV R+ + Sbjct: 2 FLCGNPTRTSGVFYDSHNRDRDLYKIHKVSSLDSPRTSKDNIEVLKKKYGEGSDVWRVRV 61 Query: 285 LGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEH 344 LG+FP+ E + FIP +E+A S + L +G D+A G D+TV+ R GN + Sbjct: 62 LGEFPKAEADAFIPLEIVEQAASCKVEPT-GETLDLGVDVARFGDDETVIAPRIGNKVFK 120 Query: 345 IFDWSAKLIQET 356 + + + ET Sbjct: 121 LLNHYKQDTMET 132 >gi|323516996|gb|ADX91377.1| hypothetical protein ABTW07_0941 [Acinetobacter baumannii TCDC-AB0715] gi|323518424|gb|ADX92805.1| hypothetical protein ABTW07_2381 [Acinetobacter baumannii TCDC-AB0715] Length = 663 Score = 192 bits (488), Expect = 7e-47, Method: Composition-based stats. Identities = 57/259 (22%), Positives = 94/259 (36%), Gaps = 18/259 (6%) Query: 86 RGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHW-FEMQ 144 GKT + LW + ++ A QLK +W E+S L+ L + Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEISINLARLKQGPLAWLAD 267 Query: 145 SLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTP 204 + Y + ++ + + +T + +P G H + M V+ DEASG Sbjct: 268 YVGYQSELVYIKGYKEK-------WYVFAKTAPKHQPTNLAGNHGDNYM-VWVDEASGVD 319 Query: 205 DIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED----WKRYQIDTRTVEG 260 D + G T + +MTS R G FY+ + W + Sbjct: 320 DAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESPL 377 Query: 261 IDSGFHEGIISRYGLDSDV-ARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYA-PL 318 + E +YG D +I +LG+FP I EE +I D + Sbjct: 378 VSKQSLEEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFGY 437 Query: 319 IMGCDIAGE-GGDKTVVVF 336 ++ D+ G G D +V+V Sbjct: 438 VITVDVGGGVGRDDSVIVV 456 >gi|213156231|ref|YP_002318651.1| phage terminase [Acinetobacter baumannii AB0057] gi|301346399|ref|ZP_07227140.1| phage terminase [Acinetobacter baumannii AB056] gi|301594275|ref|ZP_07239283.1| phage terminase [Acinetobacter baumannii AB059] gi|213055391|gb|ACJ40293.1| phage terminase [Acinetobacter baumannii AB0057] Length = 663 Score = 190 bits (482), Expect = 3e-46, Method: Composition-based stats. Identities = 56/259 (21%), Positives = 94/259 (36%), Gaps = 18/259 (6%) Query: 86 RGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHW-FEMQ 144 GKT + LW + ++ A QLK +W E+S L+ L + Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEISINLARLKQGPLAWLAD 267 Query: 145 SLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTP 204 + Y + ++ + + +T + +P G H + M V+ DEASG Sbjct: 268 YVGYQSELVYIKGYKEK-------WYVFAKTAPKHQPTNLAGNHGDNYM-VWVDEASGVD 319 Query: 205 DIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED----WKRYQIDTRTVEG 260 D + G T + +MTS R G FY+ + W + Sbjct: 320 DAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESPL 377 Query: 261 IDSGFHEGIISRYGLDSDV-ARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYA-PL 318 + + +YG D +I +LG+FP I EE +I D + Sbjct: 378 VSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFGY 437 Query: 319 IMGCDIAGE-GGDKTVVVF 336 ++ D+ G G D +V+V Sbjct: 438 VITVDVGGGVGRDDSVIVV 456 >gi|257459276|ref|ZP_05624390.1| phosphatase, Ppx/GppA family [Campylobacter gracilis RM3268] gi|257443289|gb|EEV18418.1| phosphatase, Ppx/GppA family [Campylobacter gracilis RM3268] Length = 431 Score = 190 bits (482), Expect = 3e-46, Method: Composition-based stats. Identities = 69/289 (23%), Positives = 112/289 (38%), Gaps = 35/289 (12%) Query: 80 CAISAGR--GIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPH 137 C I GR G K T NA + WL+ G I+ + LK L LP Sbjct: 26 CTIEKGRRFGFTKGTANACIE-WLL---EGQKILWVDTIAANLKRYFERYFLPELRQLPK 81 Query: 138 RHW-FEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVF 196 W + Q L G Y + S ERP+ G + + Sbjct: 82 ELWNWNAQDKQLKICGGYLDF------------------RSAERPENIEGF--GYDTVIL 121 Query: 197 NDEA--SGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED---WKRY 251 N+ P + + +I + NPN + + N F+D+ + + W+ + Sbjct: 122 NEAGIILKDPYLWDNAISPMLLD-NPNSRAFIGGVPKGKNK-FFDLAQRGMRNEKGWRNF 179 Query: 252 QIDTRTVEGIDSGFHEGIISRYG-LDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREA 310 Q + + + +++ G DSDVAR EI G+F N+ IE A ++ Sbjct: 180 QFSSYDNPLLQKEEIDRLVAELGGADSDVARQEIFGEFLDTTSNSVFSLAAIEAAFRKQR 239 Query: 311 IDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 D AP+I D+A EG D++V+ R+G+ +E + + E +E Sbjct: 240 YFDAGAPVIWALDVAREGDDESVLCKRQGDSVEPLKPYRIASTSELARE 288 >gi|332852816|ref|ZP_08434408.1| intein splicing region-containing protein [Acinetobacter baumannii 6013150] gi|332871045|ref|ZP_08439658.1| intein splicing region-containing protein [Acinetobacter baumannii 6013113] gi|332729027|gb|EGJ60377.1| intein splicing region-containing protein [Acinetobacter baumannii 6013150] gi|332731805|gb|EGJ63085.1| intein splicing region-containing protein [Acinetobacter baumannii 6013113] Length = 663 Score = 190 bits (482), Expect = 3e-46, Method: Composition-based stats. Identities = 56/259 (21%), Positives = 94/259 (36%), Gaps = 18/259 (6%) Query: 86 RGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHW-FEMQ 144 GKT + LW + ++ A QLK +W E+S L+ L + Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEISINLARLKQGPLAWLAD 267 Query: 145 SLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTP 204 + Y + ++ + + +T + +P G H + M V+ DEASG Sbjct: 268 YVGYQSELVYIKGYKEK-------WYVFAKTAPKHQPTNLAGNHGDNYM-VWVDEASGVD 319 Query: 205 DIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED----WKRYQIDTRTVEG 260 D + G T + +MTS R G FY+ + W + Sbjct: 320 DAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESPL 377 Query: 261 IDSGFHEGIISRYGLDSDV-ARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYA-PL 318 + + +YG D +I +LG+FP I EE +I D + Sbjct: 378 VSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFGY 437 Query: 319 IMGCDIAGE-GGDKTVVVF 336 ++ D+ G G D +V+V Sbjct: 438 VITVDVGGGVGRDDSVIVV 456 >gi|184158505|ref|YP_001846844.1| hypothetical protein ACICU_02185 [Acinetobacter baumannii ACICU] gi|183210099|gb|ACC57497.1| hypothetical protein ACICU_02185 [Acinetobacter baumannii ACICU] Length = 663 Score = 190 bits (482), Expect = 3e-46, Method: Composition-based stats. Identities = 56/259 (21%), Positives = 94/259 (36%), Gaps = 18/259 (6%) Query: 86 RGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHW-FEMQ 144 GKT + LW + ++ A QLK +W E+S L+ L + Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEISINLARLKQGPLAWLAD 267 Query: 145 SLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTP 204 + Y + ++ + + +T + +P G H + M V+ DEASG Sbjct: 268 YVGYQSELVYIKGYKEK-------WYVFAKTAPKHQPTNLAGNHGDNYM-VWVDEASGVD 319 Query: 205 DIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED----WKRYQIDTRTVEG 260 D + G T + +MTS R G FY+ + W + Sbjct: 320 DAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESPL 377 Query: 261 IDSGFHEGIISRYGLDSDV-ARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYA-PL 318 + + +YG D +I +LG+FP I EE +I D + Sbjct: 378 VSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFGY 437 Query: 319 IMGCDIAGE-GGDKTVVVF 336 ++ D+ G G D +V+V Sbjct: 438 VITVDVGGGVGRDDSVIVV 456 >gi|260551382|ref|ZP_05825582.1| phage terminase [Acinetobacter sp. RUH2624] gi|260405545|gb|EEW99037.1| phage terminase [Acinetobacter sp. RUH2624] Length = 663 Score = 190 bits (482), Expect = 4e-46, Method: Composition-based stats. Identities = 56/259 (21%), Positives = 94/259 (36%), Gaps = 18/259 (6%) Query: 86 RGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHW-FEMQ 144 GKT + LW + ++ A QLK +W E+S L+ L + Sbjct: 208 HNTGKTASAGIVALWHLLFFDESIMMFTAPQIGQLKKQVWKEISINLARLKQGPLAWLAD 267 Query: 145 SLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTP 204 + Y + ++ + + +T + +P G H + M V+ DEASG Sbjct: 268 YVGYQSELVYIKGYKEK-------WYVFAKTAPKHQPTNLAGNHGDNYM-VWVDEASGVD 319 Query: 205 DIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED----WKRYQIDTRTVEG 260 D + G T + +MTS R G FY+ + W + Sbjct: 320 DAVLDVAFGALTHEDNRA--VMTSQPTRNAGMFYETHHKLSHRAGGVWIALTFNGEESPL 377 Query: 261 IDSGFHEGIISRYGLDSDV-ARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYA-PL 318 + + +YG D +I +LG+FP I EE +I D + Sbjct: 378 VSEQSLQEQRQKYGSREDAQYKIRVLGEFPDLSDEFLITKRQTEEMYVGASIFDDHQFGY 437 Query: 319 IMGCDIAGE-GGDKTVVVF 336 ++ D+ G G D +V+V Sbjct: 438 VITVDVGGGVGRDDSVIVV 456 >gi|256392042|ref|YP_003113606.1| hypothetical protein Caci_2856 [Catenulispora acidiphila DSM 44928] gi|256358268|gb|ACU71765.1| conserved hypothetical protein [Catenulispora acidiphila DSM 44928] Length = 484 Score = 162 bits (410), Expect = 8e-38, Method: Composition-based stats. Identities = 58/313 (18%), Positives = 111/313 (35%), Gaps = 39/313 (12%) Query: 47 HFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRP 106 + + P RW + + ++ S A+ + G GK+ + + + W + T P Sbjct: 24 YLADPARWVDDKLGEYLWSRQVDIATSVRDQRLTAVQSCHGTGKSFVASRLTAWWLDTHP 83 Query: 107 G--MSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGI 164 ++ A + Q+K LWAE++K + R ++ + W + + G Sbjct: 84 PGEAFVVTTAPTGDQVKAILWAEINKAFAKAEARG--TPLPGRINETDWKYDKFLVAFGR 141 Query: 165 DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFW 224 Y P F G H + + + DEA G + L T ++ Sbjct: 142 KPSDY----------NPHAFQGIHAKYVLVIL-DEACGISKQFWTAALAIATGVHCRILA 190 Query: 225 IMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEG--------------IDSGFHEGII 270 I N F + W +I R + + + Sbjct: 191 I--GNPDDPGSHFAQVCKSDR--WNMIKIAARDTPNFTGEEVPDDLADMLVSQAYVLDMA 246 Query: 271 SRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAI----DDLYAPLIMGCDIAG 326 +G +S + ++ +FP + + + + A +RE + D P+ +G D+ G Sbjct: 247 EEFGPESPIYLSKVDAEFPSDASDGVVRLSKL-MACTREPVHPYAPDRLVPVELGVDL-G 304 Query: 327 EGGDKTVVVFRRG 339 GGD+T + RRG Sbjct: 305 AGGDETCIRERRG 317 >gi|154175204|ref|YP_001409090.1| Ppx/GppA family phosphatase [Campylobacter curvus 525.92] gi|112803006|gb|EAU00350.1| phosphatase, Ppx/GppA family [Campylobacter curvus 525.92] Length = 433 Score = 159 bits (402), Expect = 6e-37, Method: Composition-based stats. Identities = 69/318 (21%), Positives = 110/318 (34%), Gaps = 50/318 (15%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGR--GIGKTTLNAWMMLWLISTRPGMS 109 WQ E I GR G K NA + WLI G Sbjct: 11 TDWQREVFFKNKAKF-------------TTIEKGRRSGFTKGMANACIE-WLI---EGKK 53 Query: 110 IICIANSETQLKNTLWAEVSKWLSMLPHRHW-FEMQSLSLHPSGWYAELLEQSMGIDSKH 168 I+ + L+ L LP W F Q L Y ++ Sbjct: 54 ILWVDTVTANLQRYFERYFVPELKQLPADMWKFHAQDKKLTVGEGYLDM----------- 102 Query: 169 YTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT---PDIINKSILGFFTELNPNRFWI 225 S ERP+ G V +EA + + +I + PN Sbjct: 103 -------RSAERPENIEGFGYD---VVILNEAGIILKNSYLWDNAIRPMLLDY-PNSRAF 151 Query: 226 MTSNTRRLNGWFYDIFNIPL---EDWKRYQIDTRTVEGIDSGFHEGIISRYGL-DSDVAR 281 + + N F+D+ + + +DW +QI + + + +I+ G DSDV + Sbjct: 152 IGGVPKGKN-RFFDLASRGMRNEKDWVNFQISSFENPLLRKEEIDELIAELGGVDSDVVK 210 Query: 282 IEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNI 341 EI G+F N P + IE A + + A I G D+A +G D++V+ R G Sbjct: 211 QEIYGEFLDTTTNALFPLSQIEAAFGKVRAYEPNAVQIWGLDVARDGDDESVLCVREGYH 270 Query: 342 IEHIFDWSAKLIQETNQE 359 ++++ + E +E Sbjct: 271 VKNLEGFRIASTTELARE 288 >gi|189460514|ref|ZP_03009299.1| hypothetical protein BACCOP_01155 [Bacteroides coprocola DSM 17136] gi|189432758|gb|EDV01743.1| hypothetical protein BACCOP_01155 [Bacteroides coprocola DSM 17136] Length = 556 Score = 157 bits (397), Expect = 2e-36, Method: Composition-based stats. Identities = 64/345 (18%), Positives = 117/345 (33%), Gaps = 69/345 (20%) Query: 56 LEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRP--------- 106 E + + +S + ++++G GK + A + + P Sbjct: 57 REALGVTLDKEQQEILSSVQYNRRTSVASGTARGKDFVAACAAICFLYLTPRWRKNSLGE 116 Query: 107 -----GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQS 161 + A ++ Q+KN + E+S+ + R + L+ + Sbjct: 117 IELVENTKVALTAPTDRQVKNIMMPEISRLFNRAKARGVELIGKLNAYD----------- 165 Query: 162 MGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPN 221 + ++ + +T E + + G H H M V EA+G D +I G + Sbjct: 166 IRTNNDEWFLTGFKADEHNHEAWSGFHAVHTMFVVT-EATGIGDDTFAAIEGNL--QGDS 222 Query: 222 RFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFH--------------- 266 R ++ N + G+ W +Y++++ T I S Sbjct: 223 RILLVF-NPNKTVGYAAKSQKGDR--WHKYRLNSLTAPNIASKKIIIPGQVDYDWVLDKL 279 Query: 267 EGIISRYGLDS------------------DVARIEILGQFPQQEVNNFIPHNYIEEAMSR 308 E + D D+ R ++LG FP+ + + IP ++EEA R Sbjct: 280 ENWCEKISPDEIISEMDDFEFEGQWYRPEDLFRKKVLGLFPKVDEDTLIPRQWLEEAHER 339 Query: 309 EAIDDLYAPL-----IMGCDIAGEGGDKTVVVFRRGNIIEHIFDW 348 PL I+G D+AG G D T V RR N + Sbjct: 340 WKQAKGREPLRADLNILGVDVAGMGRDATCYVLRRDNWVASFDTH 384 >gi|298387330|ref|ZP_06996883.1| conserved hypothetical protein [Bacteroides sp. 1_1_14] gi|298259999|gb|EFI02870.1| conserved hypothetical protein [Bacteroides sp. 1_1_14] Length = 500 Score = 155 bits (392), Expect = 9e-36, Method: Composition-based stats. Identities = 64/347 (18%), Positives = 117/347 (33%), Gaps = 69/347 (19%) Query: 53 RWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRP------ 106 + + + A V S A+++G GK + A L + P Sbjct: 18 AFASDVLRANLDEEQKAVLRSVQKNPMTALASGTSRGKDFVAACAALCFMYLTPEWDDDG 77 Query: 107 ----GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSM 162 I A S+ Q++N + EV + L+ + Sbjct: 78 NLIRNTKIALSAPSQRQVENIMTPEVRRLFRNAGILP---------------GRLVANDI 122 Query: 163 GIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNR 222 D + Y +T + + + G H + M V EASG + I +I G N Sbjct: 123 RTDYEEYFLTGFKADNKNQEVWSGFHAANVMFVIT-EASGVSETIFSAIEGNLQG---NS 178 Query: 223 FWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDS-----------GFHEGIIS 271 ++ N G+ + + ++++D+ + + + E + Sbjct: 179 RLLLVFNPNITTGYAANAMKSDR--FAKFRLDSLNATNVTAKREIIPGQVNYEWVEDKVK 236 Query: 272 RY-----------GLD-----------SDVARIEILGQFPQQEVNNFIPHNYIEEAMSRE 309 + G +D+ RI++ G FP+ + IP+ +IE A R Sbjct: 237 HWCTPITKEEYNEGEGDFLFENNLYRPNDLFRIKVRGMFPKVAEDVLIPYEWIEIANKRW 296 Query: 310 AIDDLYAP---LIMGCDIAGEGGDKTVVVFRRGNII--EHIFDWSAK 351 + Y P +G D+AG G D +V R GN + +F + K Sbjct: 297 QENHPYRPRKSCKLGVDVAGMGRDNSVFCPRYGNYVSQFDVFQSAGK 343 >gi|153806881|ref|ZP_01959549.1| hypothetical protein BACCAC_01156 [Bacteroides caccae ATCC 43185] gi|149131558|gb|EDM22764.1| hypothetical protein BACCAC_01156 [Bacteroides caccae ATCC 43185] Length = 513 Score = 154 bits (390), Expect = 2e-35, Method: Composition-based stats. Identities = 54/335 (16%), Positives = 108/335 (32%), Gaps = 64/335 (19%) Query: 56 LEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRP--------- 106 + + A + S A+++G GK + A L + P Sbjct: 27 RDALCARLDREQQAIIESVQHNPMTAVASGTARGKDFVAACASLCFMYLTPRFNEKGVLV 86 Query: 107 -GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGID 165 + A + Q+KN + E+ + + + F L+ + D Sbjct: 87 GNTKVAMTAPTGRQVKNIMTPEIRRLIRAARTKFPFCCPG----------RLVADDIRTD 136 Query: 166 SKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWI 225 + + +T + +++ G H + M V EASG +I+ +I G N + Sbjct: 137 YEEWFLTGFKADDNATESWSGFHAANTMFVIT-EASGISEIVYNAIEGNLQG---NSRML 192 Query: 226 MTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHE------------------ 267 + N G+ + ++++ + E + Sbjct: 193 IVFNPNITTGYAARAMKSDR--FAKFRLSSLNAENVVKKQIVIPGQVDYEWVKDKVINWC 250 Query: 268 ---------------GIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAID 312 + +D+ R+++LG FP+ + IP+ +IE A Sbjct: 251 SPIQQTDFNEGEGDFNWEGKLYRPNDLFRVKVLGMFPKVSEDVLIPYEWIEIANRNWQEL 310 Query: 313 D-----LYAPLIMGCDIAGEGGDKTVVVFRRGNII 342 +G D+AG G D +V+ R GN + Sbjct: 311 QASGFIPAKSCKLGVDVAGMGRDNSVLCPRYGNYV 345 >gi|111222161|ref|YP_712955.1| hypothetical protein FRAAL2741 [Frankia alni ACN14a] gi|111149693|emb|CAJ61385.1| hypothetical protein FRAAL2741 [Frankia alni ACN14a] Length = 535 Score = 149 bits (375), Expect = 7e-34, Method: Composition-based stats. Identities = 64/327 (19%), Positives = 113/327 (34%), Gaps = 48/327 (14%) Query: 47 HFSQPHRWQLEFMEAVDV-HCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTR 105 + +P RW + + V + + N+ K A+ + GK+ + A + + T Sbjct: 52 YRDEPVRWARDRLGGVHLWSKQQEIINALRVHRKVAVPSCHDAGKSFVAAAAVAHWLDTH 111 Query: 106 PG--MSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMG 163 P I A + Q++ LW E+ + + + + + Sbjct: 112 PPGSAFAITTAPTFPQVRAILWREIRRLSRL---------------MNPPLGRVNQTEWL 156 Query: 164 IDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRF 223 ID + + F G H + + V DEA G P + + T N N Sbjct: 157 IDDDLVAFGRKPA-DHDEGGFQGIHAQYPLVVL-DEAGGIPQQLWIAADSIAT--NENAR 212 Query: 224 WIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVE--------------GIDSGFHEGI 269 + N +F + +P W I + + E Sbjct: 213 ILAIGNPDDPTSYFAQVCELP--SWHVITIPAAETPAFTGEQIPDDLRQALLSRAWAEEK 270 Query: 270 ISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYA-----PLIMGCDI 324 +G D+ V ++L QFP+ I + A R D+ + P+ +G D+ Sbjct: 271 RREWGEDNPVYISKVLAQFPKDVAWKVI--KASDVAKRRIGRDEPWPASKLRPVCLGVDV 328 Query: 325 AGEGGDKTVVVFRRGNIIEHIFDWSAK 351 GEG D TVV RRG ++ +W A+ Sbjct: 329 -GEGRDWTVVRERRG--VQAGREWQAR 352 >gi|282880015|ref|ZP_06288737.1| hypothetical protein HMPREF9019_0946 [Prevotella timonensis CRIS 5C-B1] gi|281306129|gb|EFA98167.1| hypothetical protein HMPREF9019_0946 [Prevotella timonensis CRIS 5C-B1] Length = 459 Score = 147 bits (372), Expect = 2e-33, Method: Composition-based stats. Identities = 55/314 (17%), Positives = 107/314 (34%), Gaps = 69/314 (21%) Query: 80 CAISAGRGIGKTTLNAWMMLWLISTRP----------GMSIICIANSETQLKNTLWAEVS 129 A+++G GK + A + + P I A + Q N + EV+ Sbjct: 2 VAVASGTSRGKDFVAACAAMCFMYLTPRWNINHRLIQNTKIAMTAPTGRQCINIMIPEVA 61 Query: 130 KWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHN 189 + +L + ++ + +T S++ + + G H Sbjct: 62 RLFRNASVLP---------------GRMLSDGIRTNNAEWFLTAFKASDDNTEAWSGFHA 106 Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 + M V EASG + +I G N ++ N G+ +K Sbjct: 107 VNTMFVVT-EASGVSETTFNAIEGNLQG---NSRLLLVFNPNVTTGYAAKAMKSSR--FK 160 Query: 250 RYQIDTRTVEGI-----------DSGFHEGIISRY-----------GLD----------- 276 ++++++ E + D + + + + G Sbjct: 161 KFRLNSLNAENVIKKKNVIPGQVDYEWVKDKVHNWCELIQKEDFNNGEGDFMFEDSFYRP 220 Query: 277 SDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPL-----IMGCDIAGEGGDK 331 +D+ RI++LG FP+ + IP ++E A R + + +G D+AG G D Sbjct: 221 NDLFRIKVLGLFPKASEDTLIPFEWLELAHDRWKKLNAEDFVPRKYARVGIDVAGMGRDS 280 Query: 332 TVVVFRRGNIIEHI 345 + V R GN + I Sbjct: 281 SCFVLRYGNYVPEI 294 >gi|226227228|ref|YP_002761334.1| hypothetical protein GAU_1822 [Gemmatimonas aurantiaca T-27] gi|226090419|dbj|BAH38864.1| hypothetical protein [Gemmatimonas aurantiaca T-27] Length = 549 Score = 147 bits (370), Expect = 3e-33, Method: Composition-based stats. Identities = 57/287 (19%), Positives = 96/287 (33%), Gaps = 37/287 (12%) Query: 80 CAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSK-WLSMLPHR 138 A+++G G GKT L A ++LW I+ P +A Q + +W EV++ W Sbjct: 70 VAVASGTGTGKTFLEAVLLLWWIAVEPDSIATTVATKADQQEKGIWREVARHWPRFQACF 129 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRT-YSEERPDTFVGPHNTHGMAVFN 197 E+ +L + W + + IT EE G H + + Sbjct: 130 PEAELTTLRIRMEPWRGDAWGA--------WGITAAPKAGEESSSAVQGLHAK-RLLILV 180 Query: 198 DEASGTPDIINKSILGFFTELNPNRFWIMTSNT---RRLNGWFYDIFNIPLEDWKRYQID 254 DE G P + +++ T N G F + + +I Sbjct: 181 DETPGVPQPVMTALVNTATGEENVIAAF--GNPDYQADPLGQFAET-----KRVTAIRIS 233 Query: 255 TRTVEGI-----------DSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIE 303 + +YG++S V + + G P+Q + I + Sbjct: 234 ALDHPNVVLGVERIPGAATRLSIATREDKYGVESGVYQSRVRGIAPEQSASALIHLAWCV 293 Query: 304 EAMSREAIDDLYA----PLIMGCDIAG-EGGDKTVVVFRRGNIIEHI 345 A R A P +G D+A E GDK V +G + + Sbjct: 294 AAADRAESVQHAALALGPKALGVDVAQSENGDKAAVAMGQGARLLSV 340 >gi|315122636|ref|YP_004063125.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496038|gb|ADR52637.1| putative phage terminase, large subunit [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 301 Score = 141 bits (355), Expect = 2e-31, Method: Composition-based stats. Identities = 54/166 (32%), Positives = 85/166 (51%), Gaps = 12/166 (7%) Query: 5 ISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQ----LEFME 60 I D L Q + + L+F ++ R WG +G PL + P WQ LE E Sbjct: 9 IEYDTALLQNVLSPAIAGN-PLAFTKYMYR---WGEEGTPLANCKGPRAWQTEVFLELAE 64 Query: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 ++ + + +FK AI++ RGIGKT L AW+ W +STR G +++ ANS+ Q Sbjct: 65 FIEKNKEAKRLGKPLQVFKLAIASARGIGKTALVAWITYWFLSTRIGCTVVISANSDDQC 124 Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQS----LSLHPSGWYAELLEQSM 162 K T +AE+ +W S+ + H+FE L+ S W AE + +++ Sbjct: 125 KTTSFAEIRRWHSLAKNAHFFEANIAEALLAGGCSPWQAEPVAKTL 170 >gi|294789575|ref|ZP_06754810.1| putative terminase B protein [Simonsiella muelleri ATCC 29453] gi|294482512|gb|EFG30204.1| putative terminase B protein [Simonsiella muelleri ATCC 29453] Length = 516 Score = 139 bits (350), Expect = 6e-31, Method: Composition-based stats. Identities = 49/274 (17%), Positives = 104/274 (37%), Gaps = 26/274 (9%) Query: 78 FKCAISAGRGIGKTTLNAWMMLWLISTRP----------GMSIICIANSETQLKNTLWAE 127 K ++ +G G GKT + LW + P G + A + Q+ + +W E Sbjct: 48 AKVSVVSGTGTGKTMSFGRIALWHLLCFPVAKYDGKIEIGSNTYIGAPAIKQVGDGVWKE 107 Query: 128 VSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGP 187 ++ + + + + +++ + IT + + + G Sbjct: 108 ITDAVQAMRANRATAWLAEYIVVQAERVYIIDYKA-----TWFITKFAMQQGQSVSIAGK 162 Query: 188 HNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI---- 243 H + + + DEA+G D + I G T+ ++ S + G+FY+ + Sbjct: 163 HRFYQLIII-DEAAGVSDEHYEVINGTQTQGGNRT--LLASQGVKQGGFFYETHHKLNKE 219 Query: 244 PLEDWKRYQIDTRTVEGIDSGFHEGIISR-YGLDSDVARIEILGQFPQQEVNNFIPHNYI 302 +W + + + + E + + G ++ R+ +LG+F + E N + I Sbjct: 220 NGGNWTALCFSSENSPFVTTEWLENVALQAGGKNTTEYRVRVLGKFAENEHENLLTRAQI 279 Query: 303 EEAMSREAIDDLYAP--LIMGCDI-AGEGGDKTV 333 E + I + P ++ D+ AGE D +V Sbjct: 280 EPRIDTLPIIEKGEPFGWLLLVDVGAGEYRDDSV 313 >gi|283956317|ref|ZP_06373797.1| terminase B protein, putative [Campylobacter jejuni subsp. jejuni 1336] gi|283792037|gb|EFC30826.1| terminase B protein, putative [Campylobacter jejuni subsp. jejuni 1336] Length = 430 Score = 136 bits (342), Expect = 6e-30, Method: Composition-based stats. Identities = 63/301 (20%), Positives = 114/301 (37%), Gaps = 37/301 (12%) Query: 70 VNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVS 129 ++ NP ++ GR +G T +A ++ + G +++ + + L+N + Sbjct: 17 FDDKNPRFI--TVAKGRRLGFTRGSAKFVIENLLL--GQNVLWVDTIQANLQNYYELYFT 72 Query: 130 KWLSMLPHRHW-FEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPH 188 L LP + + +Q L +G S ER + G Sbjct: 73 PELKNLPKDFYSWSVQDKKLIING------------------AVLHMRSAERSENIEGF- 113 Query: 189 NTHGMAVFNDEA-----SGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI 243 + + + N+ S + +I + NP I+ + N FY++ Sbjct: 114 -GYDLVILNEAGIILKGSKGEYLWYNAIRPMLLD-NPKSRAIIGGVPKGKN-LFYELCRK 170 Query: 244 PLED--WKRYQIDTRTVEGIDSGFHEGIISR-YGLDSDVARIEILGQFPQQEVNNFIPHN 300 L D WK +Q + + + +I G DS+V + EI G+F Sbjct: 171 ELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGEDSEVVKQEIYGEFIDSSSAELFALT 230 Query: 301 YIEEAMSREA--IDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQ 358 IE AMS+ + I+ + I G D+A G DK+V+ R+G I++ I +S E Sbjct: 231 EIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSVLAKRKGFIVDEIKKYSQLGTMELAN 290 Query: 359 E 359 Sbjct: 291 R 291 >gi|315929403|gb|EFV08605.1| phosphatase, Ppx/GppA family [Campylobacter jejuni subsp. jejuni 305] Length = 430 Score = 132 bits (331), Expect = 1e-28, Method: Composition-based stats. Identities = 61/301 (20%), Positives = 113/301 (37%), Gaps = 37/301 (12%) Query: 70 VNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVS 129 ++ NP ++ GR +G T +A ++ + G +++ + + L+N + Sbjct: 17 FDDKNPRFI--TVAKGRRLGFTRGSAKFVIENLLL--GQNVLWVDTIQANLQNYYELYFT 72 Query: 130 KWLSMLPHRHW-FEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPH 188 L LP + + +Q L +G S ER + G Sbjct: 73 PELKNLPKDFYSWSVQDKKLIING------------------AVLHMRSAERSENIEGF- 113 Query: 189 NTHGMAVFNDEA-----SGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI 243 + + + N+ S + +I + NP I+ + N FY++ Sbjct: 114 -GYDLVILNEAGIILKGSKGEYLWYNAIRPMLLD-NPKSRAIIGGVPKGKN-LFYELCRK 170 Query: 244 PLED--WKRYQIDTRTVEGIDSGFHEGIISR-YGLDSDVARIEILGQFPQQEVNNFIPHN 300 L D WK +Q + + + +I G S+V + EI G+F + Sbjct: 171 ELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGEGSEVVKQEIYGEFIDSSSAELFSLS 230 Query: 301 YIEEAMSREA--IDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQ 358 IE AMS+ + I+ + I G D+A G DK+ + R+G +I I +S E Sbjct: 231 EIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSALAKRKGFVIYEIKKYSQLGTIELAN 290 Query: 359 E 359 + Sbjct: 291 K 291 >gi|57237579|ref|YP_178593.1| terminase B protein, putative [Campylobacter jejuni RM1221] gi|57166383|gb|AAW35162.1| terminase B protein, putative [Campylobacter jejuni RM1221] Length = 430 Score = 132 bits (331), Expect = 1e-28, Method: Composition-based stats. Identities = 61/301 (20%), Positives = 113/301 (37%), Gaps = 37/301 (12%) Query: 70 VNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVS 129 ++ NP ++ GR +G T +A ++ + G +++ + + L+N + Sbjct: 17 FDDKNPRFI--TVAKGRRLGFTRGSAKFVIENLLL--GQNVLWVDTIQANLQNYYELYFT 72 Query: 130 KWLSMLPHRHW-FEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPH 188 L LP + + +Q L +G S ER + G Sbjct: 73 PELKNLPKDFYSWSVQDKKLIING------------------AVLHMRSAERSENIEGF- 113 Query: 189 NTHGMAVFNDEA-----SGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI 243 + + + N+ S + +I + NP I+ + N FY++ Sbjct: 114 -GYDLVILNEAGIILKGSKGEYLWYNAIRPMLLD-NPKSRAIIGGVPKGKN-LFYELCRK 170 Query: 244 PLED--WKRYQIDTRTVEGIDSGFHEGIISR-YGLDSDVARIEILGQFPQQEVNNFIPHN 300 L D WK +Q + + + +I G S+V + EI G+F + Sbjct: 171 ELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGEGSEVVKQEIYGEFIDSSSAELFSLS 230 Query: 301 YIEEAMSREA--IDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQ 358 IE AMS+ + I+ + I G D+A G DK+ + R+G +I I +S E Sbjct: 231 EIENAMSKNSFSIEKMQGENIWGLDVARYGDDKSALAKRKGFVIYEIKKYSQLGTIELAN 290 Query: 359 E 359 + Sbjct: 291 K 291 >gi|153951273|ref|YP_001397540.1| putative terminase B protein [Campylobacter jejuni subsp. doylei 269.97] gi|153951467|ref|YP_001398214.1| putative terminase B protein [Campylobacter jejuni subsp. doylei 269.97] gi|152938719|gb|ABS43460.1| putative terminase B protein [Campylobacter jejuni subsp. doylei 269.97] gi|152938913|gb|ABS43654.1| putative terminase B protein [Campylobacter jejuni subsp. doylei 269.97] Length = 430 Score = 131 bits (330), Expect = 1e-28, Method: Composition-based stats. Identities = 56/265 (21%), Positives = 96/265 (36%), Gaps = 33/265 (12%) Query: 106 PGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHW-FEMQSLSLHPSGWYAELLEQSMGI 164 G +++ + + L+N + L LP + + +Q L +G Sbjct: 49 EGKNVLWVDTIQANLQNYYELYFTPELKNLPKDFYSWSVQDKKLIING------------ 96 Query: 165 DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPD-----IINKSILGFFTELN 219 S ER + G + + + N+ D + SI + N Sbjct: 97 ------AVLHMRSAERSENIEGF--AYDLVILNEAGIILKDSKGGYLWYNSIRPMLLD-N 147 Query: 220 PNRFWIMTSNTRRLNGWFYDIFNIPLED--WKRYQIDTRTVEGIDSGFHEGIISR-YGLD 276 P I+ + N FY++ L D WK +Q + + + +I G Sbjct: 148 PKSRAIIGGVPKGKN-LFYELCRKELSDKNWKHFQFSSYDNPFLKEEQIKELIEEVGGES 206 Query: 277 SDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLY--APLIMGCDIAGEGGDKTVV 334 SDV R EI G+F + IE AMS+ + I G D+A G DK+V+ Sbjct: 207 SDVVRQEIYGEFIDSSSAELFSLSGIENAMSKNSFSTQKMQGENIWGLDVARYGDDKSVL 266 Query: 335 VFRRGNIIEHIFDWSAKLIQETNQE 359 R+G +I+ + +S E + Sbjct: 267 AKRKGFVIDELKKYSQLGTIELANK 291 >gi|159897183|ref|YP_001543430.1| hypothetical protein Haur_0654 [Herpetosiphon aurantiacus ATCC 23779] gi|159890222|gb|ABX03302.1| conserved hypothetical protein [Herpetosiphon aurantiacus ATCC 23779] Length = 472 Score = 129 bits (324), Expect = 6e-28, Method: Composition-based stats. Identities = 56/349 (16%), Positives = 99/349 (28%), Gaps = 63/349 (18%) Query: 45 LEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTI-FKCAISAGRGIGKTTLNAWMMLWLIS 103 L + P + E + V + S T ++ + A +GKT L ++ W Sbjct: 2 LPYAHDPVAYAREVLGEVWWTKQELIARSLLTPPYRTLVKACHKVGKTHLGGGLVNWWYD 61 Query: 104 TRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMG 163 + ++ A ++ Q+++ LW EV A Sbjct: 62 SFDPGLVLTTAPTDRQVRDLLWKEVRMQRR-------------------GRAGFTGPKSP 102 Query: 164 IDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRF 223 ++ + D+F G H+ H DEA G + ++ F E Sbjct: 103 RLESTPDHFAHGFTAKDGDSFQGHHSPH-TLFIFDEAVGVASVFWETAESMFNEGGA--- 158 Query: 224 WIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEG----------------------- 260 W+ N + Y W + Sbjct: 159 WLAIFNPTDTSSQAY--AEELSGGWHVISMSVLEHPNILAELQGLPPPFPSAIRLSRVDT 216 Query: 261 --------IDSGFHEG-----IISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 + + + +A +LG++P Q NN + A S Sbjct: 217 LLKKWCRALSPEEPKRATDIHWRDAWYRPGPIAEARLLGRWPSQATNNVWSDGAFQVAES 276 Query: 308 REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQET 356 + P +GCD+A G D T + RRG + + ET Sbjct: 277 L-LLPASDEPCELGCDVARYGDDFTEIHVRRGGHSLYHEAANGWSTVET 324 >gi|226940459|ref|YP_002795533.1| Terminase large subunit [Laribacter hongkongensis HLHK9] gi|226715386|gb|ACO74524.1| Terminase large subunit [Laribacter hongkongensis HLHK9] Length = 272 Score = 125 bits (313), Expect = 1e-26, Method: Composition-based stats. Identities = 32/116 (27%), Positives = 46/116 (39%), Gaps = 2/116 (1%) Query: 239 DIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIP 298 W QID+RTVEG + YG +SD ++ + G FP FI Sbjct: 5 KCGRRFRHRWVARQIDSRTVEGTNKEQIAKWAEDYGEESDFFKVRVRGMFPSMSARQFIS 64 Query: 299 HNYIEEAMSREAIDD--LYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 + A R + YAP I+ D A EG D+ V+ R+G + + Sbjct: 65 ETDVSAAYGRALRPEQYQYAPKILTVDPAWEGDDEFVIGLRQGLSFRVLHTMAKND 120 >gi|225155389|ref|ZP_03723881.1| hypothetical protein ObacDRAFT_9437 [Opitutaceae bacterium TAV2] gi|224803845|gb|EEG22076.1| hypothetical protein ObacDRAFT_9437 [Opitutaceae bacterium TAV2] Length = 479 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 59/313 (18%), Positives = 115/313 (36%), Gaps = 35/313 (11%) Query: 42 GKPLEHFSQ--PHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTT-LNAWMM 98 G P H + P + + ++ + + S + + G GKT+ + + Sbjct: 12 GTPAPHAEKLNPITFAVAVLKLRIYSWQAKIMASVWSGKPTVAATPNGAGKTSVIIVALA 71 Query: 99 LWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELL 158 L L+ PG +++ + + + + ++A ++ + ++ ++ Sbjct: 72 LTLLHEFPGATVVLTSATYRAVCDQIFASLAVHQAKFSA---WKWNDTEIN--------- 119 Query: 159 EQSMGIDSKHYTITCRTYSEERPDTFVGPHN--THGMAVFNDEASGTPDIINKSILGFFT 216 D + I ++ +R F G H + + DEA D I + Sbjct: 120 ------DGQGGRII--GFATDRGGRFEGFHAYPGRPLLIILDEAKSIADDIFVAA----- 166 Query: 217 ELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLD 276 + + S+ L G F+D F+ + ++Q I F E + ++YG D Sbjct: 167 DRCQPTMLLYISSWGGLFGRFHDAFSQDR--FAQFQAGIADCPHITPEFIEAMRAQYGED 224 Query: 277 SDVARIEILGQFPQQEVNNF-IPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVV 335 SD+ R ILGQ P+ F +P E S + + CD A D+ V+ Sbjct: 225 SDIYRSMILGQRPKGNETGFVVPFVDYERCESNPPVWQEGTKQVF-CDFAET-SDECVIA 282 Query: 336 FRRGNIIEHIFDW 348 R GN + + W Sbjct: 283 KRDGNRLSIVDAW 295 >gi|226940437|ref|YP_002795511.1| Terminase large subunit [Laribacter hongkongensis HLHK9] gi|226715364|gb|ACO74502.1| Terminase large subunit [Laribacter hongkongensis HLHK9] Length = 133 Score = 115 bits (288), Expect = 1e-23, Method: Composition-based stats. Identities = 34/126 (26%), Positives = 50/126 (39%), Gaps = 11/126 (8%) Query: 111 ICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYT 170 + AN++TQL+ EV KW + HWF+ QS S+ +K + Sbjct: 1 MITANTDTQLRTKTSPEVGKWQRLSITSHWFDPQSASI----------AARDKEHAKTWR 50 Query: 171 ITCRTYSEERPDTFVGPHNTHG-MAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSN 229 +SE + F G HN + + DEAS D + + G T+ WI N Sbjct: 51 ADFVPWSEHNTEAFAGLHNKGKRIVLIFDEASAIADKVWEVAEGALTDEETEIIWIAFGN 110 Query: 230 TRRLNG 235 R G Sbjct: 111 PTRNIG 116 >gi|168704975|ref|ZP_02737252.1| hypothetical protein GobsU_35915 [Gemmata obscuriglobus UQM 2246] Length = 519 Score = 115 bits (287), Expect = 1e-23, Method: Composition-based stats. Identities = 50/356 (14%), Positives = 103/356 (28%), Gaps = 68/356 (19%) Query: 47 HFSQPHRWQLEFMEAVDVHCHSNVNNSNPTI-FKCAISAGRGIGKTTLNAWMMLWLISTR 105 + + P + + ++ + + ++ + A +GK+ L ++ W TR Sbjct: 30 YRTDPAGYARDILKVKWWAKQVEIAEALCKPPYRVLVKASHSVGKSHLAGGLVNWWYDTR 89 Query: 106 PGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGID 165 + A ++ Q+K+ LW EV + P +M L P+ + Sbjct: 90 FPGVCLTTAPTDRQVKDVLWKEVRRQRRKRPGFVGPKMPRLESDPTHF------------ 137 Query: 166 SKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWI 225 ++ +F G H + DEA G ++ W+ Sbjct: 138 -------AHGFTARDATSFQGQHEASILL-IFDEAVGIDGDFWEAAESMCQGAEYG--WL 187 Query: 226 MTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSG---------------FHEGII 270 N Y + W I I + + + Sbjct: 188 AIFNPTDTTSRAY-LEEQAGSRWTVIDIPATEHPNIAAELVARPPEYPSAVRLNWLRDRL 246 Query: 271 SRYGL------------------DSDVAR-------IEILGQFPQQEVNNFIPHNYIEEA 305 ++ S +L ++P + + + Sbjct: 247 EQWAERIEPGDATPTDIQFPNPDGSPQWWRPGPLADARLLARWPASGCGVWSDPVW--RS 304 Query: 306 MSREAIDDLYAPLI--MGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 + R A D + + +GCD+A G D T + R GN+ H + + T + Sbjct: 305 VERAAPDPVPERWLPQIGCDVARFGEDWTELHVRCGNVSLHHEAHNGWDTKRTTER 360 >gi|186682890|ref|YP_001866086.1| hypothetical protein Npun_R2589 [Nostoc punctiforme PCC 73102] gi|186465342|gb|ACC81143.1| hypothetical protein Npun_R2589 [Nostoc punctiforme PCC 73102] Length = 543 Score = 114 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 55/375 (14%), Positives = 112/375 (29%), Gaps = 73/375 (19%) Query: 46 EHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTR 105 ++ P + + + + + S + A G GK+ + + ++++ + Sbjct: 28 QYADDPVGFFKNELGIELTNEQTIIAESVRDRPITNVKAAHGTGKSFIASLLVIYFLFC- 86 Query: 106 PGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGID 165 G I A SE Q+K LWAE+ K + + + L + Sbjct: 87 VGGVAITTAPSEDQVKWILWAELRKIHGLHKTKLGGRCDIMQL---------------LF 131 Query: 166 SKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWI 225 S+ T + ++F G H + DEA G I+ + T + + Sbjct: 132 SETVYAFGITSRDYSENSFQGQHRQKQLL-IEDEADGITPQIDNGFIACLTGSD--NRGL 188 Query: 226 MTSNTRRLNGWFYD--------------IFN----------------------------- 242 N F Sbjct: 189 RIGNPVDPQSQFAKTCKLDKRCLTVSAFSHPNVSWAYELCADGVYRLKPEVAEHIINEDG 248 Query: 243 --IPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 P ++W R I + E + S + ++G++ + + I Sbjct: 249 EIKPQQEWPPEFPRDRIPGAISIDWIERVRREKFETSAYWKGRVMGEYAEDAADGIILLT 308 Query: 301 YIEEAMSREAIDDLYA-------PLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDW-SAKL 352 +++A S + Y P +G D+ G+GGD + RG ++ + + Sbjct: 309 LLKQARSLYDQNPQYWDAIAKRYPWRLGLDV-GDGGDPHALALLRGPVLYEVQIHPTKGD 367 Query: 353 IQETNQEGCPVGSSI 367 + +T + S I Sbjct: 368 LLDTERAADIAASQI 382 >gi|282598783|ref|YP_003359102.1| putative large subunit terminase [Clavibacter phage CMP1] gi|262212571|gb|ACY35907.1| putative large subunit terminase [Clavibacter phage CMP1] Length = 872 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 56/287 (19%), Positives = 93/287 (32%), Gaps = 32/287 (11%) Query: 91 TTLNAWMMLWLISTRPG--MSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL 148 T L ++ W +S P S++ A Q+ ++ + ++ R Q L Sbjct: 424 TRLAGDLVTWFVSVFPPEETSVMVSAPIREQIDVMMFRYLRDNYNLAIERE----QPLIG 479 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIIN 208 + W Q K + R +F G H+ H +AV DEA G P+ + Sbjct: 480 EITKW---PYWQVGAPLDKKLVMPKRPADGNLISSFQGIHDGH-VAVVLDEAGGLPEDLY 535 Query: 209 KSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLE--DWKRYQIDTRTVEGIDSGFH 266 T + I N + N F++ F + W R+ I Sbjct: 536 IGANAVTTNFHARILAI--GNPDKRNTPFHERFTDTEKFSSWNRFTIGAEDTPNFTGEKI 593 Query: 267 -------EGIISRYGLDS-----------DVARIEILGQFPQQEVNNFIPHNYIEEAMSR 308 E + S V ++ G FP+ + F + I S Sbjct: 594 YEDPAKDEDVKKHLVQVSWAVEMRKSARPSVVAAKVDGNFPESDDTTFFDQSVINRGYST 653 Query: 309 EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 E + MG DI+ +G D++V G I +W+ E Sbjct: 654 EIEPESTDFKYMGVDISYQGEDQSVAYINHGGQIRIADEWNRFDGAE 700 >gi|284162607|ref|YP_003401230.1| hypothetical protein Arcpr_1511 [Archaeoglobus profundus DSM 5631] gi|284012604|gb|ADB58557.1| protein of unknown function DUF264 [Archaeoglobus profundus DSM 5631] Length = 435 Score = 108 bits (269), Expect = 2e-21, Method: Composition-based stats. Identities = 61/272 (22%), Positives = 104/272 (38%), Gaps = 42/272 (15%) Query: 80 CAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRH 139 + AGR GKT A ++ T PG IA S Q N ++ ++ ++LS Sbjct: 42 ITVVAGRRFGKTECMAVSAIYYALTNPGSIQFVIAPSYDQ-SNIMFGQIVQFLSKSI--- 97 Query: 140 WFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 ++ + ++ + + I ++ S +P+ G H H + DE Sbjct: 98 -LGCMIRRIYKTPFHHIIFKNDSVIHAR---------SASKPEFLRG-HKAHR--IILDE 144 Query: 200 ASGTPD-IINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI----PLEDWKRYQID 254 A+ PD +I+ I + N + WI N FYD + D+ Y+ Sbjct: 145 AAFIPDDVISNIIEPMLADYNGS--WIKIGTPFGKN-HFYDTYLKGQSPDFPDYSSYRFP 201 Query: 255 TRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNF----I------PHNYIEE 304 + I F E YG +S + R E L +F + + F I I+ Sbjct: 202 STVNPHISHEFIEKKKREYGENSIIFRTEYLAEFVEDQNAVFRWADIQKNVDNSIELIDS 261 Query: 305 AMSREAIDDLYAPLIMGCDIAGEGGDKTVVVF 336 A +++ ++GCD+A D TV+V Sbjct: 262 A------ENVSKQYVIGCDLAKY-QDYTVIVV 286 >gi|320091491|gb|ADW08983.1| terminase-like protein [Clavibacter phage CN77] Length = 414 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 69/214 (32%), Gaps = 30/214 (14%) Query: 158 LEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTE 217 ++ G ++ R ++ TF G V DEA G P + T Sbjct: 12 YKKMDGSGNEAIAFGKRPTDQDIVSTFQGT-RKLRTFVALDEAGGVPPELFTGAEAVMTG 70 Query: 218 LNPNRFWIMTSNTRRLNGWFYDIFNIP--LEDWKRYQIDTRTVE---------------- 259 + I N F+ IF +P +++W + I + Sbjct: 71 QDSKIVAI--GNPDSRGTEFHRIFTVPALMDEWNTFTISAYDLPTVTGEVVYPDHPEKQE 128 Query: 260 -----GIDSGFHEGIISRY---GLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAI 311 + + + G ++LG+FP + N F P I+ I Sbjct: 129 RMLKGLTSLDWIQHKERVWKVGGKPDGRFLAKVLGEFPGETDNAFFPQEAIDRGND-TTI 187 Query: 312 DDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHI 345 D +IMG D+A G D +VV +G + Sbjct: 188 DKPEKGIIMGVDLARMGDDDSVVYTNQGGRVRLF 221 >gi|292670767|ref|ZP_06604193.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] gi|292647388|gb|EFF65360.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] Length = 442 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 46/249 (18%), Positives = 86/249 (34%), Gaps = 25/249 (10%) Query: 113 IANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTIT 172 +A Q K W + + + +P R ++ S Y EL + I Sbjct: 63 VAPYRNQAKRVAWEYLKYYTNPIPGR--------VVNESELYIELPTRHARSPGARLYII 114 Query: 173 CRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINK-SILGFFTELNPNRFWIMTSNTR 231 + PD G + V DE + + I + + + + Sbjct: 115 G----ADHPDALRGIYLDG---VILDEYADIKPELWGGVIRPAL--ADRQGWAVFIGTPK 165 Query: 232 RLNGWFYDI--FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFP 289 N FY++ W T + + + + ++ R E+L F Sbjct: 166 GQN-QFYEMYQHAEKSAGWYSCIYRTDETGVLPAEELKDMQAQMTEM--EIRQELLCDFT 222 Query: 290 QQEVNNFIPHNYIEEAMSREAIDDL--YAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFD 347 + IP + + A +R DD P+I+G D+A G D+TV+ R+G ++ + Sbjct: 223 ASASDVVIPIDLVTAAANRLLKDDDVLGQPVILGVDVARFGDDRTVLCVRQGLWLKEVRT 282 Query: 348 WSAKLIQET 356 ++ ET Sbjct: 283 FTGLSTMET 291 >gi|303243859|ref|ZP_07330199.1| protein of unknown function DUF264 [Methanothermococcus okinawensis IH1] gi|302485795|gb|EFL48719.1| protein of unknown function DUF264 [Methanothermococcus okinawensis IH1] Length = 445 Score = 92.1 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 53/261 (20%), Positives = 92/261 (35%), Gaps = 29/261 (11%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWF 141 ++AGR GK+ L A+++++L ST+ IA + ++ E+ K++ + Sbjct: 56 VAAGRRFGKSKLMAFLLIFLCSTQKNKKYAVIAPFYANAR-IIFRELKKYIEKS---NVL 111 Query: 142 EMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEAS 201 + S + A + ID + S + P + G V DEA+ Sbjct: 112 SRLVKRMVESPYMAIEFKTGCTIDFR---------SADNPTSIRG---ESYHLVILDEAA 159 Query: 202 GTPDIINK-SILGFFTELNPNRFWIMTSNTRRLNGWFYDIF---NIPLEDWKRYQIDTRT 257 D + K I + + I T N FY+ F ++ T T Sbjct: 160 FIKDDVVKYVIKPLLLDYDAPLIEISTPNGH---NHFYESFLMGKNKQNRHISFRFPTWT 216 Query: 258 VEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAID----D 313 + E I G DS V + E +F YI++ + + Sbjct: 217 NPFLPKNAIEEIKQEVGEDSPVWKQEYCAEFIDNNE-AVFNWEYIQQCIDGTIKLLKSGE 275 Query: 314 LYAPLIMGCDIAGEGGDKTVV 334 +MG D+A D TV+ Sbjct: 276 SGHQYVMGVDLAKF-EDYTVI 295 >gi|85716479|ref|ZP_01047450.1| prophage MuMc02, terminase, ATPase subunit, putative [Nitrobacter sp. Nb-311A] gi|85696668|gb|EAQ34555.1| prophage MuMc02, terminase, ATPase subunit, putative [Nitrobacter sp. Nb-311A] Length = 250 Score = 87.9 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 47/262 (17%), Positives = 77/262 (29%), Gaps = 38/262 (14%) Query: 51 PHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSI 110 P WQ E + N + C+ + GKTT+ A M L G + Sbjct: 24 PDPWQAELLR----------LNPKRALLLCSRQS----GKTTVTALMALHRAIYETGALV 69 Query: 111 ICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYT 170 + ++ S Q L ++ K L + + EL S Sbjct: 70 VIVSPSNRQSGEML-RQIKKLHGSLKGAPEL------VGDAVLKVELANGS--------R 114 Query: 171 ITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNT 230 I +E+ G V DEAS D + ++ +T Sbjct: 115 IIALPGTEKTIRGIAG-----VSLVIIDEASRVDDELLAAVRPMLATRADGSLIALT-TP 168 Query: 231 RRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQ 290 G+FY+ ++ + W R ++ I F + G E F Sbjct: 169 AGKRGFFYEAWHSDDQTWHRVRVAASDCPRISKEFLADELRSLGP--ARYSEEYELAFVD 226 Query: 291 QEVNNFIPHNYIEEAMSREAID 312 + + P IE A + E Sbjct: 227 -DAASAFPTAVIERAFTTEVEP 247 >gi|261402679|ref|YP_003246903.1| protein of unknown function DUF264 [Methanocaldococcus vulcanius M7] gi|261369672|gb|ACX72421.1| protein of unknown function DUF264 [Methanocaldococcus vulcanius M7] Length = 437 Score = 87.5 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 48/263 (18%), Positives = 91/263 (34%), Gaps = 33/263 (12%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWF 141 ++AGR GK+ L +++++L T+ IA + ++ E+ ++ Sbjct: 50 VAAGRRFGKSKLMCFLLIFLSCTQKDKKFAVIAPYYANAR-IIFKELRTYIEKNKT---L 105 Query: 142 EMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEAS 201 + + S + + ID + S + P + G V DEA+ Sbjct: 106 QKLVKRITESPYMVIEFKTGCIIDFR---------SADNPTSIRG---ESYHLVILDEAA 153 Query: 202 GTPDIINK-SILGFFTELNPNRFWIMTSNTRRLNGWFYDIF-----NIPLEDWKRYQIDT 255 D + K I + + I T N FY+ F ++ T Sbjct: 154 FIKDDVVKYVIKPLLIDYDAPLIEISTPNGH---NHFYESFLMGENRQNRHI--SFRFPT 208 Query: 256 RTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAID--- 312 + + E I +G DS V + E +F + + YI++ + Sbjct: 209 WSNPFLPKSVIEEIKREFGEDSLVWKQEFCAEFID-DQDAVFKWEYIQQCIDSNIELLTV 267 Query: 313 -DLYAPLIMGCDIAGEGGDKTVV 334 + +MG D+A D TV+ Sbjct: 268 GEKGHRYVMGVDLAKY-QDYTVI 289 >gi|327191373|gb|EGE58399.1| prophage MuMc02, terminase, ATPase subunit, putative [Rhizobium etli CNPAF512] Length = 248 Score = 85.9 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 46/264 (17%), Positives = 89/264 (33%), Gaps = 42/264 (15%) Query: 50 QPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMS 109 +P WQ + A N ++ C+ + GK+T+ A++++ P Sbjct: 22 EPDPWQANLLRA----------NPRRSMLLCSRQS----GKSTVAAFLVIQTALFVPAAQ 67 Query: 110 IICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHY 169 I+ ++ ++ Q N L+ + +LS LP +S S Sbjct: 68 IVVVSPTQRQ-SNELFRTIVGFLSRLPGAPRPTAESKQGTE--------------LSNGA 112 Query: 170 TITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSN 229 + +E+ G V DEA+ D + ++ P+ + + Sbjct: 113 RVLSLPGTEKTIRGIAGVD-----LVVMDEAARVEDALLTAVRPMMATK-PDARLVALTT 166 Query: 230 TRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYG--LDSDVARIEILGQ 287 GWFY+ + W+R ++ I F + + G S+ + + Sbjct: 167 PAGKRGWFYEAWVSDDPSWERVRVPASACPRITQQFLDEELKALGAIKFSEEYGL----E 222 Query: 288 FPQQEVNNFIPHNYIEEAMSREAI 311 F E P IE A ++E Sbjct: 223 FHDPEE-AVFPLAIIEAAFTQEVR 245 >gi|260906962|ref|ZP_05915284.1| hypothetical protein BlinB_16637 [Brevibacterium linens BL2] Length = 249 Score = 80.9 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 42/257 (16%), Positives = 76/257 (29%), Gaps = 40/257 (15%) Query: 51 PHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSI 110 P WQ + + + R +GKTT A+ L PG + Sbjct: 24 PELWQERLLRT--------------QEARVLVLCARQVGKTTATAYKALHAAMFNPGRDV 69 Query: 111 ICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYT 170 + ++ S+ Q + R + + P + E + S Sbjct: 70 LIVSPSQRQ------------SDEMLRRVASLYRGMKEAPKLSRSNTSEMGLSNGS---R 114 Query: 171 ITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNT 230 + SE F G + DEAS D + S+L + S Sbjct: 115 VVSLPGSEGGIRGFAG-----VKLLILDEASRVDDDVFASVLPMVASDGQ---MVALSTP 166 Query: 231 RRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQ 290 GWF+++ W+R+++ + + + G S V + L +F Sbjct: 167 WGRRGWFHELHQETRNGWERHKVTVYESDQYTPPRIAEVKASLG--SFVFSSDYLCEFGD 224 Query: 291 QEVNNFIPHNYIEEAMS 307 + + A S Sbjct: 225 TDS-QLFSTENVRAAFS 240 >gi|212703250|ref|ZP_03311378.1| hypothetical protein DESPIG_01292 [Desulfovibrio piger ATCC 29098] gi|212673294|gb|EEB33777.1| hypothetical protein DESPIG_01292 [Desulfovibrio piger ATCC 29098] Length = 330 Score = 76.7 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 61/168 (36%), Gaps = 15/168 (8%) Query: 198 DEASGTPDIIN-KSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED-------WK 249 DE + + + + + + +I + N F +++ + W Sbjct: 2 DEVAQMKPEVWGEVVQPALADRRGSAVFI--GTPKGAN-LFAELYQRGMAAQAQGDAAWC 58 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSRE 309 + + + + E + L + R E+L F + IP + EA +R+ Sbjct: 59 ALSYPVTSTDVLPAEDVERLRRE--LSDNAFRQEMLCDFTASSDDILIPLPDVLEAEARQ 116 Query: 310 AI--DDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 D P+I+G D+A G D +V+V R+G ++ + Sbjct: 117 LAWDDVGGMPVILGVDVARFGADSSVIVRRQGLKVDGPVVMRGLDNMQ 164 >gi|218290759|ref|ZP_03494841.1| protein of unknown function DUF264 [Alicyclobacillus acidocaldarius LAA1] gi|218239297|gb|EED06496.1| protein of unknown function DUF264 [Alicyclobacillus acidocaldarius LAA1] Length = 422 Score = 75.5 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 51/295 (17%), Positives = 94/295 (31%), Gaps = 42/295 (14%) Query: 49 SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGM 108 S+P QL + H + + F+ A + GR GKT A + PG Sbjct: 7 SEPTSKQLR-LRLYTPHSGQVALHRSTARFRVA-TCGRRWGKTYACANEIAKWAWEHPGA 64 Query: 109 SIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKH 168 +A + Q + + + ++ + + + Sbjct: 65 MTWWVAPTYRQ----------------------TLTAYRIITRNFHGAIEKATTTHMRIE 102 Query: 169 YTITCRT--YSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSIL-GFFTELNPNRFWI 225 + T S E D G + DEA+ P ++ L ++ + Sbjct: 103 WKSGSITEFRSTENFDALRG---EGLDFLVVDEAAMVPKEAWEAALRPTLSDKAGRAIIV 159 Query: 226 MTSNTRRLNGWFYDIFNIPLE----DWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVAR 281 S + N WFY ++ + +W+ ++ T I E + L SDV R Sbjct: 160 --STPKGRN-WFYHVWARGQDPAFPEWESFRFPTLANPYIPPEEVEEARTT--LPSDVFR 214 Query: 282 IEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVF 336 E +F + F + +E ++G D+A D +V+V Sbjct: 215 QEYEAEFLEDSAGVF--RGIRDCISGQEEEPQPGRRYVVGWDVAKH-QDFSVLVV 266 >gi|159904490|ref|YP_001548152.1| hypothetical protein MmarC6_0096 [Methanococcus maripaludis C6] gi|159885983|gb|ABX00920.1| protein of unknown function DUF264 [Methanococcus maripaludis C6] Length = 505 Score = 75.2 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 47/311 (15%), Positives = 87/311 (27%), Gaps = 57/311 (18%) Query: 55 QLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIA 114 Q E EA+D + I+ GR GKT + + S G S++ +A Sbjct: 65 QEEIAEAID----------SEMYDVITINIGRRGGKTEVMGGVGPKFCSKYRGFSVLVVA 114 Query: 115 NSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCR 174 Q K ++ ++ + L + + + Sbjct: 115 PVYNQAKT-MYKKIKRGLESNKESRQLVKPKKEGFKESPFPLITFYNGSTIEFK------ 167 Query: 175 TYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINK-SILGFFTELNPNRFWIMTSNTRRL 233 S E PD + + DEA+ D I + + + S Sbjct: 168 --SAETPDNLR---SEGYDLIIVDEAAFVDDEIISAVLEPMLMDSGG--ILVKISTPWGT 220 Query: 234 NGWFYDIFNI----------------PLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDS 277 FYD + +K ++ + + F G G D+ Sbjct: 221 GNHFYDSYIKGELQAKMLEEGEGIPEDELRYKSFKFPSWVNPYLSKRFLMGKKKDLGEDN 280 Query: 278 DVARIEILGQFPQQEVNNFIPHNYIEEAMS-----------REAIDDLYA---PLIMGCD 323 V E +F ++ +++ +S + D ++G D Sbjct: 281 PVWLQEYCAEF-IEDDTTVFSTAHVQACLSDAFETHYKTENLIYLIDEGERNKEYVIGLD 339 Query: 324 IAGEGGDKTVV 334 +A D TV Sbjct: 340 LAKHN-DYTVF 349 >gi|116624478|ref|YP_826634.1| hypothetical protein Acid_5400 [Candidatus Solibacter usitatus Ellin6076] gi|116227640|gb|ABJ86349.1| hypothetical protein Acid_5400 [Candidatus Solibacter usitatus Ellin6076] Length = 260 Score = 74.0 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 75/261 (28%), Gaps = 29/261 (11%) Query: 53 RWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIIC 112 W + + V ++ + ++ R GK+T+ A + G I Sbjct: 25 EWARRALGFEADAAQARVLDTRSK--RVLLNCTRQWGKSTVTAARAVHEAVKNAGSLTIA 82 Query: 113 IANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTIT 172 + + Q + + + SG + S + Sbjct: 83 VTPTARQTGEFV-------------------RKAATFASGLEMRVKGDGHNEMSLAFPNG 123 Query: 173 CRTYSEERPDT-FVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTR 231 R + G + DEAS D + ++ ++ W+M S Sbjct: 124 SRIVGLPGTEATVRGFSA--VTLLLIDEASRVGDDLYMAMRPML-AVSAGTLWLM-STPH 179 Query: 232 RLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQ 291 G+FY+ + E W+R + + + E G + R E +F + Sbjct: 180 GKRGFFYEAWANGGETWERVSVKAEDCPRFKAEYLEEERQVMGER--IYRQEYCCEF-GE 236 Query: 292 EVNNFIPHNYIEEAMSREAID 312 + IE A S E Sbjct: 237 TSGAVFDRDLIEAAFSDEVTP 257 >gi|229844502|ref|ZP_04464642.1| predicted phage terminase large subunit [Haemophilus influenzae 6P18H1] gi|229812751|gb|EEP48440.1| predicted phage terminase large subunit [Haemophilus influenzae 6P18H1] Length = 452 Score = 74.0 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 46/275 (16%), Positives = 95/275 (34%), Gaps = 28/275 (10%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GK+ A ++ T+P + ++C E+ K +S + Sbjct: 27 GGRGSGKSFSIARALVLRAYTQP-IRVLCC------------REIQKSISDSVIQM-LAD 72 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 Q L ++ Q +G + +T + + G V+ +E Sbjct: 73 QIEMLGLQAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQIDTRTVEGID 262 ++ E ++ N + + Y F P E K ++ + Sbjct: 128 SKESWDVLIPTIREDGSQII--VSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFI--PHNYIEEAMS--REAIDDLYAPL 318 E ++ D ++ R G+ P + + I P +I+ A+ ++ Sbjct: 186 KELMEDMVQMRERDYELYRHVYEGE-PVADSDKVIIKPL-WIDAAVDAHKKLGFVAAGRK 243 Query: 319 IMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 I+G D+A EG D F G+++ + +W + + Sbjct: 244 IIGFDVADEGSDANANAFVHGSVVLRMDEWHGEDV 278 >gi|329122215|ref|ZP_08250807.1| phage terminase large subunit [Haemophilus aegyptius ATCC 11116] gi|327474100|gb|EGF19511.1| phage terminase large subunit [Haemophilus aegyptius ATCC 11116] Length = 452 Score = 73.6 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 46/275 (16%), Positives = 94/275 (34%), Gaps = 28/275 (10%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GK+ A ++ T+P + ++C E+ K +S + Sbjct: 27 GGRGSGKSFSIARALVLRAYTQP-IRVLCC------------REIQKSISDSVIQM-LAD 72 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 Q L ++ Q +G + +T + + G V+ +E Sbjct: 73 QIEMLGLQAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQIDTRTVEGID 262 ++ E ++ N + + Y F P E K ++ + Sbjct: 128 SKESWDVLIPTIREDGSQII--VSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFI--PHNYIEEAMS--REAIDDLYAPL 318 E + D ++ R G+ P + + I P +I+ A+ ++ Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGE-PVADSDKVIIKPL-WIDAAVDAHKKLGFVAAGRK 243 Query: 319 IMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 I+G D+A EG D F G+++ + +W + + Sbjct: 244 IIGFDVADEGSDANANAFVHGSVVLRMDEWRGEDV 278 >gi|303257560|ref|ZP_07343572.1| putative terminase B protein [Burkholderiales bacterium 1_1_47] gi|302859530|gb|EFL82609.1| putative terminase B protein [Burkholderiales bacterium 1_1_47] Length = 330 Score = 73.2 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 35/170 (20%), Positives = 56/170 (32%), Gaps = 13/170 (7%) Query: 195 VFNDEASGTPDIIN-KSILGFFTELNPNRFWIMTSNTRRLNGW--FYD----IFNIPLED 247 V DE + + + I + +I + +N + YD + + D Sbjct: 6 VVIDEVAQIKPTLWGEVIRPALADRKGWAAFI--GTPKGINLFSQLYDQALNLMSKGDPD 63 Query: 248 WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 W ID + + R E L F + N IP + I A + Sbjct: 64 WIAMLYSVEQTHVIDEKELAALKVEMSE--NEFRQEFLCDFSAAQDNGLIPIDDIRAAAN 121 Query: 308 REAIDDLY--APLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 + + Y APLI G D+A G D +V+ RRG + Sbjct: 122 KFYRESEYMGAPLIYGIDVARFGSDASVIFKRRGLVAFEPIVIRKFDNMA 171 >gi|119386463|ref|YP_917518.1| PBSX family phage terminase large subunit [Paracoccus denitrificans PD1222] gi|119377058|gb|ABL71822.1| phage terminase, large subunit, PBSX family [Paracoccus denitrificans PD1222] Length = 441 Score = 71.7 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 51/266 (19%), Positives = 87/266 (32%), Gaps = 34/266 (12%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GK+ A M+ T PG+S IC+ +V K L + E Sbjct: 26 GGRGSGKSWDRAMHMIVRHLTEPGLSSICL------------RDVQKSLDQSVFKLLVE- 72 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 + L + + + + I +E + +A + + A+ Sbjct: 73 TAARLGVAEAIRPVESDRIIRTPGNGIIAFNGMNEFNAENIKSL-EGFDIAWWEEAATAG 131 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTR----------RLNGWFYDIFNIPLEDWKRYQI 253 + + + W T N R R + F D + +W Sbjct: 132 QGPL-DMLRPTLRKPGSQ-IWF-TYNPRLRSDPVDVMMRQDARFADSRTVVEANW----- 183 Query: 254 DTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDD 313 R E + D R G + + FI + EAM+R+ Sbjct: 184 --RDNPFRGPELEEERLLDLAGDEARYRHIWEGDYEAESDMQFIGGGLVREAMARQPFSQ 241 Query: 314 LYAPLIMGCDIAGEGGDKTVVVFRRG 339 + L++G D+A G D++V+ RRG Sbjct: 242 IGDELVLGVDVARFGDDRSVIWARRG 267 >gi|150021340|ref|YP_001306694.1| hypothetical protein Tmel_1462 [Thermosipho melanesiensis BI429] gi|149793861|gb|ABR31309.1| protein of unknown function DUF264 [Thermosipho melanesiensis BI429] Length = 421 Score = 71.3 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 50/264 (18%), Positives = 87/264 (32%), Gaps = 39/264 (14%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWF 141 I AGR GKT A + + + P +I S Q K + Sbjct: 39 ICAGRRFGKTNYVAGKIFYYATIHPKSRVIVGGPSLDQAKIY---------------YDL 83 Query: 142 EMQSLSLHPSGWYAELLEQS--MGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 +++ L P + + + S I K+ + + G V E Sbjct: 84 LTEAIELSPLKGFVKKTKDSPFPTIYLKNGSSITVRSTAHNGKYLRG---RKVNLVVLTE 140 Query: 200 ASGTPDIINK-SILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKR---YQIDT 255 A+ D + + I +L+ I+ S +N +FY+ + L++ K + Sbjct: 141 AAFIKDSVYEQVITPM--KLDTGAPVILESTPNGMN-YFYEEYQRGLKNKKHTISFHATV 197 Query: 256 RTVEGIDSGFHEGIISR---YGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAID 312 +D E ++ Y V R E L +F + F P + EA + Sbjct: 198 YDNPFLDQEEIENAKAKTPDY-----VWRQEYLAEFVD-DDTVFFPWKILVEAFEDYKPE 251 Query: 313 DLYAPLI--MGCDIAGEGGDKTVV 334 +G D+A D TV+ Sbjct: 252 GYKDGRKYSIGVDLAKY-RDYTVI 274 >gi|294508906|ref|YP_003566117.1| hypothetical protein PSR_11004 [Salinibacter ruber M8] gi|294342043|emb|CBH22709.1| conserved hypothetical protein [Salinibacter ruber M8] Length = 255 Score = 70.5 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 46/260 (17%), Positives = 84/260 (32%), Gaps = 40/260 (15%) Query: 51 PHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSI 110 P WQ + + + + CA + GKTT +A + L + Sbjct: 8 PDPWQEALLTS----------DWERALLNCARQS----GKTTASAALALETALEATDSLV 53 Query: 111 ICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYT 170 + +A + Q K L V + QS E + + + K T Sbjct: 54 LILAPARRQSKEFL-RSVRSLYRDAAPDGGLDKQS----ELRLRLENESRIIALPGKEGT 108 Query: 171 ITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNT 230 + R Y+ + V DEA+ PD + + S Sbjct: 109 V--RGYTAD--------------LVIADEAARVPDAAYVATRPMLAVTGGRFVGL--STP 150 Query: 231 RRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQ 290 GWFY+ + P ++W++ ++ + + F E G R E + +F Sbjct: 151 AGQRGWFYEAWTDPGQEWEQVKVTGQDCPRMTEAFLEQERREMGDWQ--FRSEYMCEFTD 208 Query: 291 QEVNNFIPHNYIEEAMSREA 310 E + +IE +++ E Sbjct: 209 TE-DQLFATEHIESSLTSEV 227 >gi|149174861|ref|ZP_01853485.1| hypothetical protein PM8797T_10814 [Planctomyces maris DSM 8797] gi|148846198|gb|EDL60537.1| hypothetical protein PM8797T_10814 [Planctomyces maris DSM 8797] Length = 568 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 70/225 (31%), Gaps = 54/225 (24%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSII 111 WQ + +E++ + TI + + G GK II Sbjct: 57 DDWQWDILESL----------FDLTIRRVFVKGNTGCGKGAAAGIACCTYFHIWNDAKII 106 Query: 112 CIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTI 171 +S + + EV KW + + ++ + + + ++ L Sbjct: 107 ITRDSVRTAQKIAFGEVDKWWRKMRFKPPGKLLTSGVFDNNQHSISL------------- 153 Query: 172 TCRTYSEERPDTFVGPHNTHGMAVFNDEAS--GTPDIINKSILGFFTELNPNRFWIMTSN 229 + + + F G H+ H + + DEA+ D + + ++ SN Sbjct: 154 ----ANPQHIEGFRGAHSPH-VFFWFDEATAPNLEDKYKLANTQA-------KKFLALSN 201 Query: 230 TRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYG 274 L+G F D F ++ + II +YG Sbjct: 202 PSTLSGTFRDSF-----------------PVVNPDKTQTIIDQYG 229 >gi|328952976|ref|YP_004370310.1| hypothetical protein Desac_1270 [Desulfobacca acetoxidans DSM 11109] gi|328453300|gb|AEB09129.1| hypothetical protein Desac_1270 [Desulfobacca acetoxidans DSM 11109] Length = 466 Score = 69.4 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 51/293 (17%), Positives = 90/293 (30%), Gaps = 50/293 (17%) Query: 51 PHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSI 110 P WQ +F+ V+ + C+ + GK+T A + L PG I Sbjct: 27 PDPWQQDFL----------VSRPEQALLLCSRQS----GKSTSAAALALHEALFHPGALI 72 Query: 111 ICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYT 170 + ++ S Q L+ + + LPH + Sbjct: 73 LLLSPSLRQ-SQELFRKAAGLYQRLPHAP------------------AACRTSALRLEFD 113 Query: 171 ITCRTYSE-ERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSN 229 R S + +T G + + DEA+ PD + ++ S Sbjct: 114 HGSRIISLPGQEETIRGFSEVRLLVI--DEAALVPDELYYAVRPML--AVSRGRLTALST 169 Query: 230 TRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFP 289 GWFY + + W+RY I I + F L + R E +F Sbjct: 170 PAGKRGWFYHCYTEGGDQWQRYTIPATQCPRISADFLAAEQRS--LPAAWFRAEYFCEF- 226 Query: 290 QQEVNNFIPHNYIEEAMSREAID--------DLYAPLIMGCDIAGEGGDKTVV 334 + N P + ++ A + +G D+ G+ D + + Sbjct: 227 GEAANQLFPAHLLQTAQCSQVSPLFAEITPSPPTGTFFIGLDL-GQSQDYSAL 278 >gi|302339289|ref|YP_003804495.1| hypothetical protein Spirs_2798 [Spirochaeta smaragdinae DSM 11293] gi|301636474|gb|ADK81901.1| conserved hypothetical protein [Spirochaeta smaragdinae DSM 11293] Length = 295 Score = 67.8 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 45/253 (17%), Positives = 78/253 (30%), Gaps = 45/253 (17%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL 148 GK+T+ A G II ++ + Q K ++ F S Sbjct: 57 GKSTVIAAKAAHKAKFFSGSLIILVSPALRQSK-----------ELMRKVEDFIALDKSF 105 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIIN 208 P+ E Q I SE+ G + DEAS PD + Sbjct: 106 PPAS---EEDNQLTKEFKNRSRIVALPGSEKTIRGLSG-----PTLIIIDEASRIPDELY 157 Query: 209 KSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFH-- 266 K+I + ++ + G FYD ++ W + ++ R + G Sbjct: 158 KAIRPMMAGADTE--LVLMTTPFGKRGVFYDAWSRSK-RWTKIEVVGRDILGRFPNEQVY 214 Query: 267 ------EGIISRYGLDSDV--------------ARIEILGQFPQQEVNNFIPHNYIEEAM 306 +GI + Y V R E G+F +++ + A+ Sbjct: 215 AQLRRKDGIKACYSPRHSVEFLGEELEEMGEWWYRQEYGGEFMDP-IDSVFNMEDVRAAI 273 Query: 307 SREAIDDLYAPLI 319 + +AP+I Sbjct: 274 INDTPAISFAPII 286 >gi|116625333|ref|YP_827489.1| hypothetical protein Acid_6278 [Candidatus Solibacter usitatus Ellin6076] gi|116228495|gb|ABJ87204.1| hypothetical protein Acid_6278 [Candidatus Solibacter usitatus Ellin6076] Length = 260 Score = 67.8 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 63/221 (28%), Gaps = 27/221 (12%) Query: 88 IGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLS 147 GK+T+ A + T+ I ++ + Q + + Sbjct: 58 WGKSTVTAARAVHEAVTKADSLTIAVSPTARQTGEFV-------------------RKAE 98 Query: 148 LHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDT-FVGPHNTHGMAVFNDEASGTPDI 206 ++ S + R + G + DEAS D Sbjct: 99 AFAGMLKMKVKGDGSNEMSLAFPNGSRIVGLPGTEATVRGFSA--VALLLVDEASRVEDD 156 Query: 207 INKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFH 266 + ++ ++ W+M S G+FY+ + W+R + + + Sbjct: 157 LYMAMRPML-AVSGGTLWLM-STPWGKRGFFYEAWANGGPTWERVSVKAEDCPRFGAEYL 214 Query: 267 EGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 E G + R E +F + + IE A S Sbjct: 215 EEERRVMGER--IYRQEYCCEFGESSS-AVFDRDLIEAAFS 252 >gi|307308946|ref|ZP_07588629.1| hypothetical protein SinmeBDRAFT_4513 [Sinorhizobium meliloti BL225C] gi|306900580|gb|EFN31193.1| hypothetical protein SinmeBDRAFT_4513 [Sinorhizobium meliloti BL225C] Length = 408 Score = 67.8 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 67/200 (33%), Gaps = 20/200 (10%) Query: 88 IGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLS 147 GKT + A + W + + + SE+ +KN +W+ + + + S Sbjct: 208 WGKTYVAAIAVWWSLVCFDDVKVTIFGPSESLIKNGMWSNLQALHARMA----------S 257 Query: 148 LHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDII 207 + S + R S + G H + VF D+A G +++ Sbjct: 258 SFKDLFDVSATRVSRKTAAPSCFAEYRLVSADNASAARGIHAVNN-FVFVDDADGVSEVV 316 Query: 208 NKSILGFFTELNPNRFWI--MTSN--TRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDS 263 ++ + NP + M +N + ++FN L + + D Sbjct: 317 IAYLMNIMIDPNPKLCLLSTMFANETPKLETVTEAELFNEALSSLRAM-VSGEV--RTDP 373 Query: 264 GFHEGIISRYGLDSDVARIE 283 + E I RY L++ Sbjct: 374 VWLEAI--RYQLENAEYLAR 391 >gi|289581321|ref|YP_003479787.1| hypothetical protein Nmag_1649 [Natrialba magadii ATCC 43099] gi|289530874|gb|ADD05225.1| hypothetical protein Nmag_1649 [Natrialba magadii ATCC 43099] Length = 602 Score = 65.9 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 43/366 (11%), Positives = 101/366 (27%), Gaps = 83/366 (22%) Query: 49 SQPHRWQLEFMEAVD----VHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLIST 104 + W + +E + + + + G+GK+ + A + + ++ Sbjct: 22 AGDETWLEDAIEDYLGITVTGAQAQICRGIAANERLLVVTANGLGKSYILAAITIVWLTV 81 Query: 105 RPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGI 164 R + +E ++K T + + + P + + + I Sbjct: 82 RYPACSFATSGTERKMKRTY------------CKPVENLHGDARVPLPGEYKSRPERIEI 129 Query: 165 DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG--TPDIINKSILGFFTELNPNR 222 D + ++ + G H + +A+ +EA + ++ T+ Sbjct: 130 DGEPEHFFEAASPQDAGE-LEGVHAAYTLAII-EEADKKDVDAEVLDAMKSLVTDEQDRI 187 Query: 223 FWIMTSNTRRLNGWFY---DIFNIPLEDWKRYQIDTRTVEGIDSG--------------- 264 I + + Y D + P W+ + + + Sbjct: 188 IAIA-NPPKDETNSIYPILDEQDDPTSKWEVLEFSSFDSHNVQVELGNVDDEKVDGLASL 246 Query: 265 -FHEGIISRYG--------------------------------LDSDVARI--------E 283 + Y D+ R Sbjct: 247 HKIQDDWEDYNKEPWPGAETARTLSAPKLDADGNPVFSHSDALEDNPEFRTDLDQRWYRR 306 Query: 284 ILGQFPQQ--EVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNI 341 G P N + + A R+ + P G D+A +GGD+T V+ G++ Sbjct: 307 RAGIIPPGGASKNRPFTIDDVNAAWGRDWQP-VGRPQATGIDVARDGGDRTPVISVDGDV 365 Query: 342 IEHIFD 347 +E ++ Sbjct: 366 LEVRYE 371 >gi|260580755|ref|ZP_05848581.1| phage terminase large subunit [Haemophilus influenzae RdAW] gi|260092572|gb|EEW76509.1| phage terminase large subunit [Haemophilus influenzae RdAW] Length = 447 Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 45/280 (16%), Positives = 89/280 (31%), Gaps = 26/280 (9%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GK+ A ++ P + ++C E+ K +S + Sbjct: 27 GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQM-LAD 72 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 Q L ++ Q +G + +T + + G V+ +E Sbjct: 73 QIEMLSLQAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQIDTRTVEGID 262 ++ E ++ N + + Y F P E K ++ + Sbjct: 128 SKESWDILIPTIREDGSQII--VSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNYIEEAMS--REAIDDLYAPLI 319 E + D ++ R G+ P + + I +IE A+ + Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGE-PVADSDLAIIKPVWIEYAVDAHLKLGFTAKGMKK 244 Query: 320 MGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 +G D+A EG D F G+++ I W + ++ Sbjct: 245 VGFDVADEGADSNANAFVHGSVVLDIEVWKNGDVIDSANR 284 >gi|315426011|dbj|BAJ47659.1| prophage MuMc02, terminase, ATPase subunit [Candidatus Caldiarchaeum subterraneum] Length = 439 Score = 65.1 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 59/276 (21%), Positives = 99/276 (35%), Gaps = 23/276 (8%) Query: 62 VDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQ-L 120 + +H +P+ F+ I RG G T A T P +I+ I+ S Q L Sbjct: 19 IRLHPWQKRFIDDPSRFRI-ILKHRGAGATFTIAAEACAEALTHPASTILLISYSLRQSL 77 Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 + ++ V LS L ++ S+ + A +E G Sbjct: 78 E--IFRHVRTILSRLENKRLKHGHSIYRLAAKIGARTVELGNGSR--------IISLPNN 127 Query: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 P++ G AV+ DEA+ N FT + N + S + GWF++ Sbjct: 128 PESLRGYRAD---AVYVDEAAFFRGDTNLKTAIMFTTVARNGRVTLVSTPKGKRGWFHEA 184 Query: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDV-ARIEILGQFPQQEVNNFIPH 299 + W ++ + I E + S + R E++ +F EVN FIP+ Sbjct: 185 WTTDNT-WSKHLVKLGDSPHITMHDLEELRKTM---SPLEWRQEMMCEFLD-EVNAFIPY 239 Query: 300 NYIEEAMSR-EAIDDLYAPLIMGCDIAGEGGDKTVV 334 I E + + + +G D D TV+ Sbjct: 240 EKILECVEDYVPARVVGGRVYVGVDFGRF-RDSTVI 274 >gi|309379923|emb|CBX21334.1| unnamed protein product [Neisseria lactamica Y92-1009] Length = 449 Score = 64.4 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 42/279 (15%), Positives = 86/279 (30%), Gaps = 21/279 (7%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GK+ A + + +S R G I+C E L ++ E Sbjct: 20 GGRGSGKSYFLAELAV-EVSRRIGTVILCA------------REFQGSLDDSVYQLLIET 66 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 + + + + + + + G + +EA Sbjct: 67 IERLGYTEEFDILKSTITHKGTGAKFVFYGIKNNVTKIKSIQG-----VGVCWVEEAEAV 121 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGI-D 262 ++ W+ + L+ + P +D + + D Sbjct: 122 TKNSWDVLIPSIRGDKNAEIWVSFNPKNILDDTYRRFIVHPPQDSIVLKANYDINPHFAD 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + ++ D D+ R LG+ I ++IE A+ + I+ Sbjct: 182 TPLLADMLECKERDEDLYRHIWLGEPVADSELAIIKPSWIEAAIDAHEKLGFQAAGKRIL 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 G D+A EG D V R G+++ + W + + + + Sbjct: 242 GFDVADEGDDANATVLRHGSVVTDMRQWRGQDVIYSADK 280 >gi|148826888|ref|YP_001291641.1| phage terminase large subunit [Haemophilus influenzae PittGG] gi|148718130|gb|ABQ99257.1| predicted phage terminase large subunit [Haemophilus influenzae PittGG] Length = 366 Score = 64.4 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 45/280 (16%), Positives = 89/280 (31%), Gaps = 26/280 (9%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GK+ A ++ P + ++C E+ K +S + Sbjct: 27 GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQM-LAD 72 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 Q L ++ Q +G + +T + + G V+ +E Sbjct: 73 QIEMLGLRAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQIDTRTVEGID 262 ++ E ++ N + + Y F P E K ++ + Sbjct: 128 SKESWDILIPTIREDGSQII--VSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNYIEEAMS--REAIDDLYAPLI 319 E + D ++ R G+ P + + I +IE A+ + Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGE-PVADSDLAIIKPVWIEYAVDAHLKLGFTAKGMKK 244 Query: 320 MGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 +G D+A EG D F G+++ I W + ++ Sbjct: 245 VGFDVADEGADSNANAFVHGSVVLDIEVWKNGDVIDSANR 284 >gi|187476925|ref|YP_784949.1| phage terminase large subunit [Bordetella avium 197N] gi|115421511|emb|CAJ48020.1| Putative phage terminase large subunit [Bordetella avium 197N] Length = 512 Score = 64.0 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 20/74 (27%), Positives = 32/74 (43%), Gaps = 4/74 (5%) Query: 284 ILGQFPQQEVN---NFIPHNYIEEAMSREAIDDLYAPLI-MGCDIAGEGGDKTVVVFRRG 339 + G F + IP ++E A +R D AP+ +G D+A G DKT++ R G Sbjct: 277 LYGDFNAGIEDDPWQVIPTAWVEAAQARWKRPDRLAPMDSLGLDVARGGRDKTILARRHG 336 Query: 340 NIIEHIFDWSAKLI 353 + + K Sbjct: 337 WWFDEPLVYPGKDT 350 >gi|319776448|ref|YP_004138936.1| phage terminase large subunit [Haemophilus influenzae F3047] gi|319897217|ref|YP_004135412.1| phage terminase large subunit [Haemophilus influenzae F3031] gi|329123931|ref|ZP_08252483.1| phage terminase large subunit [Haemophilus aegyptius ATCC 11116] gi|317432721|emb|CBY81084.1| predicted phage terminase large subunit [Haemophilus influenzae F3031] gi|317451039|emb|CBY87270.1| predicted phage terminase large subunit [Haemophilus influenzae F3047] gi|327468126|gb|EGF13613.1| phage terminase large subunit [Haemophilus aegyptius ATCC 11116] Length = 447 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 44/280 (15%), Positives = 89/280 (31%), Gaps = 26/280 (9%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GK+ A ++ P + ++C E+ K +S + Sbjct: 27 GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQM-LAD 72 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 Q L ++ Q +G + +T + + G V+ +E Sbjct: 73 QIEMLGLQAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQIDTRTVEGID 262 ++ E ++ N + + Y F P E K ++ + Sbjct: 128 SKESWDILIPTIREDGSQII--VSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNYIEEAMS--REAIDDLYAPLI 319 E + D ++ R G+ P + + I +IE A+ + Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGE-PVADSDLAIIKPVWIESAVDAHLKLGFTTKGMKK 244 Query: 320 MGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 +G D+A EG D F G+++ + W + ++ Sbjct: 245 VGFDVADEGADANANAFVHGSVVLGVEVWKNGDVIDSANR 284 >gi|145629503|ref|ZP_01785301.1| predicted phage terminase large subunit [Haemophilus influenzae 22.1-21] gi|145641440|ref|ZP_01797019.1| predicted phage terminase large subunit [Haemophilus influenzae R3021] gi|144978346|gb|EDJ88110.1| predicted phage terminase large subunit [Haemophilus influenzae 22.1-21] gi|145273983|gb|EDK13850.1| predicted phage terminase large subunit [Haemophilus influenzae 22.4-21] gi|309750959|gb|ADO80943.1| Probable bacteriophage terminase, large subunit [Haemophilus influenzae R2866] Length = 447 Score = 63.6 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 44/280 (15%), Positives = 89/280 (31%), Gaps = 26/280 (9%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GK+ A ++ P + ++C E+ K +S + Sbjct: 27 GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQM-LAD 72 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 Q L ++ Q +G + +T + + G V+ +E Sbjct: 73 QVEMLGLQDFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQIDTRTVEGID 262 ++ E ++ N + + Y F P E K ++ + Sbjct: 128 SKESWDILIPTIREDGSQII--VSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNYIEEAMS--REAIDDLYAPLI 319 E + D ++ R G+ P + + I +IE A+ + Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGE-PVADSDLAIIKPVWIESAVDAHLKLGFTTKGMKK 244 Query: 320 MGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 +G D+A EG D F G+++ + W + ++ Sbjct: 245 VGFDVADEGADANANAFVHGSVVLGVEVWKNGDVIDSANR 284 >gi|68250076|ref|YP_249188.1| phage terminase large subunit [Haemophilus influenzae 86-028NP] gi|68058275|gb|AAX88528.1| predicted phage terminase large subunit [Haemophilus influenzae 86-028NP] Length = 447 Score = 62.4 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 45/280 (16%), Positives = 89/280 (31%), Gaps = 26/280 (9%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GK+ A ++ P + ++C E+ K +S + Sbjct: 27 GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQM-LAD 72 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 Q L ++ Q +G + +T + + G V+ +E Sbjct: 73 QIEMLGLQNFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQIDTRTVEGID 262 ++ E ++ N + + Y F P E K ++ + Sbjct: 128 SKESWDILIPTIREDGSQII--VSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNYIEEAMS--REAIDDLYAPLI 319 E + D ++ R G+ P + + I +IE A+ + Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGE-PVADSDLAIIKPVWIECAVDAHLKLGFTAKGMKK 244 Query: 320 MGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 +G D+A EG D F G+++ I W + ++ Sbjct: 245 VGFDVADEGADSNDNAFVHGSVVLDIEVWKNGDVIDSANR 284 >gi|329119006|ref|ZP_08247700.1| phage terminase large subunit [Neisseria bacilliformis ATCC BAA-1200] gi|327464879|gb|EGF11170.1| phage terminase large subunit [Neisseria bacilliformis ATCC BAA-1200] Length = 449 Score = 62.4 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 41/279 (14%), Positives = 86/279 (30%), Gaps = 21/279 (7%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GK+ A + + ++ R G I+C E L ++ E Sbjct: 20 GGRGSGKSYFLAELAV-EVARRIGTVILCA------------REFQGSLDDSVYQLLTET 66 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 + + + + + + + G + +EA Sbjct: 67 IARLGYTQEFEILKSSIRHKGTGAKFVFYGVKNNITKIKSIQG-----VGICWVEEAEAV 121 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGI-D 262 ++ W+ + L+ + P +D + + D Sbjct: 122 TKNSWDVLIPSIRGDKNAEIWVSFNPKNILDDTYQRFIVHPPKDSIVLKANYDINPHFAD 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + ++ D D+ R LG+ I ++IE A+ + I+ Sbjct: 182 TPLLADMLECKERDEDLYRHIWLGEPVADSELAIIKPSWIEAAIDAHEKLGFSAAGRRIL 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 G D+A EG D V R G+++ + W + + + + Sbjct: 242 GFDVADEGDDANATVLRHGSVVTDMQQWRGQDVIYSADK 280 >gi|41179386|ref|NP_958694.1| Bbp25 [Bordetella phage BPP-1] gi|45569518|ref|NP_996587.1| hypothetical protein BMP-1p24 [Bordetella phage BMP-1] gi|45580769|ref|NP_996635.1| hypothetical protein BIP-1p24 [Bordetella phage BIP-1] gi|40950125|gb|AAR97691.1| Bbp25 [Bordetella phage BPP-1] Length = 533 Score = 62.4 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 18/74 (24%), Positives = 30/74 (40%), Gaps = 4/74 (5%) Query: 284 ILGQFPQQEVN---NFIPHNYIEEAMSREAIDDLYAPLI-MGCDIAGEGGDKTVVVFRRG 339 + G F + IP ++E A +R D AP+ +G D+A G D T++ R Sbjct: 298 LYGDFNAGIEDDPWQVIPTAWVEAAQARWKRPDRLAPMDSLGVDVARGGRDNTILARRHA 357 Query: 340 NIIEHIFDWSAKLI 353 + + K Sbjct: 358 MWFDVPLTYPGKDT 371 >gi|301170180|emb|CBW29784.1| predicted phage terminase large subunit [Haemophilus influenzae 10810] Length = 447 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 44/269 (16%), Positives = 85/269 (31%), Gaps = 26/269 (9%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GK+ A ++ P + ++C E+ K +S + Sbjct: 27 GGRGSGKSFSIARALVLRAYQSP-VRVLCC------------REIQKSISDSVIQM-LAD 72 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 Q L ++ Q + + +T + + G V+ +E Sbjct: 73 QVEMLGLQDFFDVQKTQIIEQNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEGENV 127 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQIDTRTVEGID 262 ++ E ++ N + + Y F P E K ++ + Sbjct: 128 SKESWDILIPTIREDGSQII--VSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFP 185 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNYIEEAMS--REAIDDLYAPLI 319 E + D ++ R G+ P + + I +IE A+ + Sbjct: 186 KELMEDMEQMRERDYELYRHVYEGE-PVADSDLAIIKPVWIESAVDAHLKLGFTTKGMKK 244 Query: 320 MGCDIAGEGGDKTVVVFRRGNIIEHIFDW 348 +G D+A EG D F G+++ I W Sbjct: 245 VGFDVADEGADSNANAFVHGSVVLDIEVW 273 >gi|261381054|ref|ZP_05985627.1| phage terminase, large subunit, PBSX family [Neisseria subflava NJ9703] gi|284796087|gb|EFC51434.1| phage terminase, large subunit, PBSX family [Neisseria subflava NJ9703] Length = 450 Score = 61.7 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 62/167 (37%), Gaps = 5/167 (2%) Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQID 254 + +EA D ++ + W+ T N + + Y F P +D ++ Sbjct: 117 WIEEAENVSDESWNILIPTIRKAGSE-IWL-TWNPKNILDPTYQRFVVNPPDDMVDIVVN 174 Query: 255 TRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAID 312 + S D D+ R LG+ + I +I+ A+ + Sbjct: 175 YTDNIYLPEVLRLEAESCKARDYDLYRHIWLGEPVADSELSVIKPKWIDAAIDSHIKLGF 234 Query: 313 DLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 + I+G D+A EG D + + R G+++ + +W + + + + Sbjct: 235 EATGQRILGFDVADEGDDASATILRHGSVVIDMDEWRGQDVIYSADK 281 >gi|157265496|ref|YP_001468054.1| phage terminase large subunit [Thermus phage P74-26] gi|156905391|gb|ABU97034.1| phage terminase large subunit [Thermus phage P74-26] Length = 485 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 67/194 (34%), Gaps = 10/194 (5%) Query: 50 QPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMS 109 +P E + H ++ S + A GR GK+ + ++ + RPG Sbjct: 5 RPSDKFFELLGYKPHHVQLAIHRSTAK-RRVACL-GRQSGKSEAASVEAVFELFARPGSQ 62 Query: 110 IICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHY 169 IA + Q + V K + E+Q +K Sbjct: 63 GWIIAPTYDQAEIIFGRVVEKVERLAEVFPATEVQLQRRRLRLLVHHYDRPVNAPGAKRV 122 Query: 170 TITC-RTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIIN-KSILGFFTELNPNRFWIMT 227 + R S +RPD G V DEA+ P + ++I + + + ++ Sbjct: 123 ATSEFRGKSADRPDNLRGATLD---FVILDEAAMIPFSVWSEAIEPTLSVRDG--WALII 177 Query: 228 SNTRRLNGWFYDIF 241 S + LN WFY+ F Sbjct: 178 STPKGLN-WFYEFF 190 >gi|157265379|ref|YP_001467938.1| terminase large subunit [Thermus phage P23-45] gi|156905274|gb|ABU96918.1| terminase large subunit [Thermus phage P23-45] Length = 485 Score = 61.3 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 39/194 (20%), Positives = 67/194 (34%), Gaps = 10/194 (5%) Query: 50 QPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMS 109 +P E + H ++ S + A GR GK+ + ++ + RPG Sbjct: 5 RPSDKFFELLGYKPHHVQLAIHRSTAK-RRVACL-GRQSGKSEAASVEAVFELFARPGSQ 62 Query: 110 IICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHY 169 IA + Q + V K + E+Q +K Sbjct: 63 GWIIAPTYDQAEIIFGRVVEKVERLAEVFPATEVQLQRRRLRLLVHHYDRPVNAPGAKRV 122 Query: 170 TITC-RTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIIN-KSILGFFTELNPNRFWIMT 227 + R S +RPD G V DEA+ P + ++I + + + ++ Sbjct: 123 ATSEFRGKSADRPDNLRGATLD---FVILDEAAMIPFSVWSEAIEPTLSVRDG--WALII 177 Query: 228 SNTRRLNGWFYDIF 241 S + LN WFY+ F Sbjct: 178 STPKGLN-WFYEFF 190 >gi|319789040|ref|YP_004150673.1| protein of unknown function DUF264 [Thermovibrio ammonificans HB-1] gi|317113542|gb|ADU96032.1| protein of unknown function DUF264 [Thermovibrio ammonificans HB-1] Length = 419 Score = 60.1 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 53/319 (16%), Positives = 114/319 (35%), Gaps = 51/319 (15%) Query: 53 RWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIIC 112 +Q+E ++ +D H S I R GK+ + ++ +T+P +I+ Sbjct: 6 PYQIEIVKGIDSHKFSV------------IKMARQTGKSFVVSYWATRRATTKPNHAIVV 53 Query: 113 IANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTIT 172 ++ +E Q K + + ++++ L ++ + + + ++ + + Sbjct: 54 VSPTERQSK------------LFVDKVKLHIKAMRLTGVKFFEDTELKKLEVNFPNGSQI 101 Query: 173 CRTYSEERPDTFVGPHNTHGMAVFNDEASGTPD--IINKSILGFFTELNPNRFWIMTSNT 230 PD G V DE + + + +++ T + + S Sbjct: 102 --IALPANPDGIRGFSGD----VIMDEVAFFKNWQEVYRAVFPIITRK-KDYKLVAISTP 154 Query: 231 RRLNGWFYDIF----NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILG 286 N FY ++ N P I +G+ E + + D R E L Sbjct: 155 FGKNDLFYYLWSISENNPKWFRYSLNIFEAVAKGLKVDVEE--LRAGIKNEDAWRTEYLV 212 Query: 287 QFPQQEVNNFIPHNYIEEAMSREA------IDDLYAPLIMGCDIAGEGGDKTVVVF--RR 338 +F E + +P+ I++ + I +L L G D+ D TV+ + Sbjct: 213 EFID-EADAVLPYELIQKCEMPKEELLVEDIKELKGELYCGVDVGRR-KDLTVITLLEKL 270 Query: 339 GNI--IEHIFDWSAKLIQE 355 G++ + I + S K +E Sbjct: 271 GDVLYVRRIEELSKKPFRE 289 >gi|67920466|ref|ZP_00513986.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501] gi|67857950|gb|EAM53189.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501] Length = 244 Score = 59.4 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 35/217 (16%), Positives = 67/217 (30%), Gaps = 39/217 (17%) Query: 74 NPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMS---------------IICIANSET 118 +P F+ + GR GK+ L + +I ++ + Sbjct: 18 DPQKFQVLV-CGRRFGKSHLQ--VTKHVIDCLMFPKLMPGYNVKQQTMETAVLVGMPTLK 74 Query: 119 QLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSE 178 Q + LW + K L P+ ++ G +++ + ++ + + Sbjct: 75 QARKILWKPLVKTLENCPYVDKISRSDYTIRFKGNRPDIILAGLNDNAGDRARGLKLWR- 133 Query: 179 ERPDTFVGPHNTHGMAVFNDEASGT-PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWF 237 V DE P +I+ I+ + P+ + T + N Sbjct: 134 ----------------VCIDEVQDVRPSVIDAVIIPAMADT-PHSRALFTGTPKGKNNHL 176 Query: 238 YDIFN--IPLEDWKRYQIDTRTVEGIDSGFHEGIISR 272 Y++F +DWK Y T T I E R Sbjct: 177 YNLFTMERDNDDWKSYNFPTWTNPLISKDEVERARKR 213 >gi|16273317|ref|NP_439561.1| terminase large subunit-like protein [Haemophilus influenzae Rd KW20] gi|1175785|sp|P44184|Y1410_HAEIN RecName: Full=Uncharacterized protein HI_1410 gi|1574247|gb|AAC23058.1| predicted coding region HI1410 [Haemophilus influenzae Rd KW20] Length = 394 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 37/240 (15%), Positives = 77/240 (32%), Gaps = 13/240 (5%) Query: 124 LWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDT 183 ++ E+ K +S + Q L ++ Q +G + +T + + Sbjct: 1 MFREIQKSISDSVIQM-LADQIEMLSLQAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKS 59 Query: 184 FVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN- 242 G V+ +E ++ E ++ N + + Y F Sbjct: 60 MTGID-----VVWVEEGENVSKESWDILIPTIREDGSQII--VSFNPKNILDDTYQRFVI 112 Query: 243 IPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNY 301 P E K ++ + E + D ++ R G+ P + + I + Sbjct: 113 HPPERCKSVLVNWQDNPYFPKELMEDMEQMRERDYELYRHVYEGE-PVADSDLAIIKPVW 171 Query: 302 IEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 IE A+ + +G D+A EG D F G+++ I W + ++ Sbjct: 172 IEYAVDAHLKLGFTAKGMKKVGFDVADEGADSNANAFVHGSVVLDIEVWKNGDVIDSANR 231 >gi|149408318|ref|YP_001294421.1| conserved hypothetical protein ORF004 [Pseudomonas phage F8] gi|219523873|ref|YP_002455934.1| terminase large subunit [Pseudomonas phage PB1] gi|190333469|gb|ACE73724.1| terminase large subunit [Pseudomonas phage PB1] Length = 460 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 32/161 (19%), Positives = 57/161 (35%), Gaps = 7/161 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + I + N WI+ N + + Y F P +D Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWIIF-NPNEVTDFVYQNFVVKPPKDAFVKM 172 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQ-QEVNNFIPHNYIEEAMS--RE 309 I+ + + I Y D D A I G P+ + I +I A+ ++ Sbjct: 173 INWNENPFLSETMLKVIHEAYERDKDQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 + +G D+A +G D GN+I + +W Sbjct: 232 LGWEPAGSKRIGFDVADDGEDANATTLMHGNVIMEVDEWDG 272 >gi|307251380|ref|ZP_07533296.1| hypothetical protein appser4_21360 [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|306856621|gb|EFM88761.1| hypothetical protein appser4_21360 [Actinobacillus pleuropneumoniae serovar 4 str. M62] Length = 384 Score = 59.0 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 74/223 (33%), Gaps = 12/223 (5%) Query: 141 FEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEA 200 E Q L+ ++ Q +G + +T + + G V+ +E Sbjct: 2 LEDQIEILNLKPFFEVQKTQIIGRNGSRFTFAGLKTNITSIKSMTGID-----VVWVEEG 56 Query: 201 SGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQIDTRTVE 259 ++ E ++ N + L Y F P E ++ + Sbjct: 57 ENVSKESWDVLIPTIREDGSQII--VSFNPKNLLDDTYQRFVINPPERCCSVLVNWQDNP 114 Query: 260 GIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNYIEEAMS--REAIDDLYA 316 E + D ++ R GQ P + + I +IE+A+ ++ Sbjct: 115 YFPKELMEDMKQMKERDFELYRHVYEGQ-PVADSDLAIIKPLWIEKAVDAHKKLGFTASG 173 Query: 317 PLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 ++G D+A EG D F G+++ + +W + ++ Sbjct: 174 RKVVGFDVADEGIDANANCFAHGSVVLQVDEWRGDDVIQSAHR 216 >gi|269941618|emb|CBI50024.1| phage protein [Staphylococcus aureus subsp. aureus TW20] Length = 599 Score = 58.6 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 51/302 (16%), Positives = 84/302 (27%), Gaps = 67/302 (22%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 A RG+GKT L+A L PG II A +++Q N L ++ LS L HR + Sbjct: 82 ASRGLGKTFLSAVYCLTRCILYPGTKIIITAPTKSQGINVLEKIENELLSPLIHREIESI 141 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 + + P + + + D G H + + V DE Sbjct: 142 NTGNQKPMIAF---------HNGSWIRVVASN------DNARG-HRANLLLV--DEFVKV 183 Query: 204 P-DIINKSILGFFTELNPNRFWIMTS---NTRRLNGWFYDIFNIPLEDWKRYQIDTRTVE 259 D+I+ T F R N Y W + + T + Sbjct: 184 DEDLIDTVFKKMLTSQREPAFLHKAKYKNYPREENTQMYLSSAWMKSHWAYDSMRSFTKQ 243 Query: 260 GIDSGFHEGIISR------------------------------------------YGLDS 277 + + + S +G Sbjct: 244 MLKKKSEDDLKSFVCHIPYYTGVMEKLYSHKQMKAEAQAEGFNKMKFAMEMEAVWWGETE 303 Query: 278 DVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGG---DKTVV 334 F ++ F P + +A I + ++ D+A GG D +V Sbjct: 304 SAFFNFNTIDFNRKLSQAFYPKEVLVQADINNPIKEPKEKRLLAVDVARMGGNSNDASVF 363 Query: 335 VF 336 Sbjct: 364 SL 365 >gi|294663744|gb|ADF29298.1| terminase [Pseudomonas phage JG024] Length = 460 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 31/161 (19%), Positives = 57/161 (35%), Gaps = 7/161 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + I + N WI+ N + + Y F P +D Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWIIF-NPNEVTDFVYQNFVVKPPKDSCVKM 172 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQ-QEVNNFIPHNYIEEAMS--RE 309 I+ + + I Y D + A I G P+ + I +I A+ ++ Sbjct: 173 INWNENPFLSETMLKVIHEAYERDREQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 + +G D+A +G D GN+I + +W Sbjct: 232 LGWEPAGSKRIGFDVADDGDDANATTLMHGNVIMEVDEWDG 272 >gi|291334706|gb|ADD94352.1| hypothetical protein Ddes_0719 [uncultured phage MedDCM-OCT-S04-C890] Length = 311 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 54/177 (30%), Gaps = 26/177 (14%) Query: 102 ISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQS 161 +S IA + Q K+ W + ++ + +P+ + E + P+G LL Sbjct: 1 MSKLKNPRFAYIAPTFKQAKSIAWDYMKQFTAKIPNTKFNETELRVDLPNGSRITLLG-- 58 Query: 162 MGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIIN-KSILGFFTELNP 220 E D G + DE + + + I ++ Sbjct: 59 ----------------AENSDGLRGIYLDGC---VIDEYANIDGKLFAEIIRPALSDRKG 99 Query: 221 NRFWIMTSNTRRLNGWFYDI--FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGL 275 + + +N FYD+ EDW Y+ + +D E G Sbjct: 100 --YCVFIGTPAGMNNNFYDLYQHANGAEDWFNYKAKASDTKIVDPEELEKAKEVMGE 154 >gi|190890121|ref|YP_001976663.1| hypothetical protein RHECIAT_CH0000492 [Rhizobium etli CIAT 652] gi|190695400|gb|ACE89485.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 465 Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 42/276 (15%), Positives = 85/276 (30%), Gaps = 27/276 (9%) Query: 85 GRGIGKTTLNAWMMLWLISTRPG---------MSIICIANSETQLKNTLWAEVSKWLSML 135 GR GK+ A + ++L +++ IA Q + L V L + Sbjct: 68 GRRGGKSFTMALIAVFLACFFDYRQYLAPGERATVLVIATDRRQARVIL-RYVRAMLDNI 126 Query: 136 PHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAV 195 P +Q++ + +L + + R Y+ + Sbjct: 127 P-----LLQAMVERDTADSFDLDNSTTIEVGTASFRSTRGYT------YAAVLCDELAFW 175 Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDT 255 D+A+ I +I + PN + S+ G +D F + Sbjct: 176 RTDDAAEPDYAILDAIRPGMASI-PNSMLLCASSPHARRGALWDAFKRFWGKDDAPLVWR 234 Query: 256 RTVEGIDSGFHEGII-SRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDL 314 ++ + ++ D A E +F + ++ F+ +E+ +SR + Sbjct: 235 AATREMNPTISQSVVDRALERDHASAMAEYGAEF-RSDIEQFVNIEVVEDCVSRGVYERA 293 Query: 315 YAPLI---MGCDIAGEGGDKTVVVFRRGNIIEHIFD 347 P I D +G D + +I D Sbjct: 294 PLPNIRYRAFVDPSGGSNDSMTLAIGHKEGERNILD 329 >gi|198242430|ref|YP_002214959.1| hypothetical protein SeD_A1100 [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|193876434|gb|ACF24836.1| ORF11 [Salmonella enterica subsp. enterica serovar Dublin] gi|197936946|gb|ACH74279.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|326622711|gb|EGE29056.1| hypothetical protein SD3246_1075 [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 423 Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 47/252 (18%), Positives = 79/252 (31%), Gaps = 38/252 (15%) Query: 58 FMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTT-LNAWMMLWLISTRPGMSIICIANS 116 +E + H +P K I AGR GKTT L W M + A S Sbjct: 6 VIEFLPFHAGQKKIYRSPAKRKV-IRAGRRFGKTTMLEQAGGNWAA---RQMRVGWFAPS 61 Query: 117 ETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTY 176 L LP + S + + + +G + + Sbjct: 62 YKIL--------------LPSFKTIRDLLKPITISSSKTDSIIELIGGGLVEF------W 101 Query: 177 SEERPDTFVGPHNTHGMAVFNDEAS----GTPDIINKSILGFFTELNPNRFWIMTSNTRR 232 + + PD G + + DE S G DI ++I + + + +M + Sbjct: 102 TLDNPD--AGRSRKYHKVII-DEGSLVKKGMRDIWEQAIEPTLLDFDGDA--VMAGTPKG 156 Query: 233 L--NGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQ 290 + +FY N W+ + T I+ II G V + E +F Sbjct: 157 VDDENFFYQACNDKSMGWEEHHAPTAANPTINPAALARIID--GRPPLVVQQEYNAEFVD 214 Query: 291 QEVNNFIPHNYI 302 NF +++ Sbjct: 215 WRGQNFFKLDWL 226 >gi|291334530|gb|ADD94183.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201] gi|291334650|gb|ADD94297.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695] Length = 223 Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 51/177 (28%), Gaps = 26/177 (14%) Query: 102 ISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQS 161 + IA + Q K+ W + ++ +P + E + P+G LL Sbjct: 1 MCPHKNPRFAYIAPTFKQAKSIAWDYMKQFTDKIPSTKFNETELRVDLPNGARITLLG-- 58 Query: 162 MGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIIN-KSILGFFTELNP 220 E D G + DE + + + I ++ Sbjct: 59 ----------------AENSDGLRGIYLDGC---VIDEYANIDGKLFAEIIRPALSDRKG 99 Query: 221 NRFWIMTSNTRRLNGWFYDI--FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGL 275 + + +N FYD+ EDW Y+ + +D + G Sbjct: 100 --YCVFIGTPAGMNNNFYDLYQHANGAEDWFNYKAKASETKIVDQEELDKAKEVMGE 154 >gi|221196218|ref|ZP_03569265.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221202891|ref|ZP_03575910.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221176825|gb|EEE09253.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221182772|gb|EEE15172.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 424 Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 34/243 (13%), Positives = 70/243 (28%), Gaps = 37/243 (15%) Query: 65 HCHSNVNNSNPTIFKCAISAGRGIGKTTL-NAWMMLWLISTRPGMSIICIANSETQLKNT 123 + + + + I GR GKTTL W G+ + + Sbjct: 12 AKQAEIGRAFNESRRVVIRCGRRFGKTTLLERCASKWA---YNGLKVGWFGPTYK----- 63 Query: 124 LWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDT 183 L++ ++ ++ +++E + G + +T+ D Sbjct: 64 --------LNLPTYKRILRTVQPVVYSKSKIDQVIELNSGGCIEFWTL---------QDE 106 Query: 184 FVGPHNTHGMAVFNDEASGTPD---IINK-SILGFFTELNPNRFWIMTSNTR--RLNGWF 237 G + + DE S P I + +I + + M + +F Sbjct: 107 DAGRSRFYDRVII-DEGSLVPKGLRSIWEQAIAPTLLDRKGHAI--MAGTPKGIDPENFF 163 Query: 238 YDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFI 297 Y+ W+ + T + +D + Y + V + E L F F Sbjct: 164 YEACTDKTLGWREFHAPTASNPMLDPEAVARLKDEY--PALVYQQEYLADFVDWNGAAFF 221 Query: 298 PHN 300 Sbjct: 222 SEE 224 >gi|291334416|gb|ADD94071.1| hypothetical protein GobsU_33659 [uncultured phage MedDCM-OCT-S04-C1035] gi|291334470|gb|ADD94124.1| hypothetical protein GobsU_33659 [uncultured phage MedDCM-OCT-S04-C1161] Length = 223 Score = 57.8 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 54/177 (30%), Gaps = 26/177 (14%) Query: 102 ISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQS 161 +S IA + Q K+ W + ++ + +P+ + E + P+G LL Sbjct: 1 MSKLKNPRFAYIAPTFKQAKSIAWDYMKQFTAKIPNTKFNETELRVDLPNGSRITLLG-- 58 Query: 162 MGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIIN-KSILGFFTELNP 220 E D G + DE + + + I ++ Sbjct: 59 ----------------AENSDGLRGIYLDGC---VIDEYANIDGKLFAEIIRPALSDRKG 99 Query: 221 NRFWIMTSNTRRLNGWFYDI--FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGL 275 + + +N FYD+ EDW Y+ + +D E G Sbjct: 100 --YCVFIGTPAGMNNNFYDLYQHANGAEDWFNYKAKASDTKIVDPEELEKAKEVMGE 154 >gi|291336431|gb|ADD95986.1| hypothetical protein Ddes_0719 [uncultured organism MedDCM-OCT-S04-C1073] Length = 311 Score = 57.8 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 52/172 (30%), Gaps = 26/172 (15%) Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 IA + Q K+ W + ++ + +P+ + E + P+G LL Sbjct: 6 NPRYAYIAPTFKQAKSIAWDYMKQFTAKIPNTKFNETELRVDLPNGSRITLLG------- 58 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIIN-KSILGFFTELNPNRFWI 225 E D G + DE + + + I ++ + + Sbjct: 59 -----------AENSDGLRGIYLDGC---VIDEYANIDGKLFAEIIRPALSDRKG--YCV 102 Query: 226 MTSNTRRLNGWFYDI--FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGL 275 +N FYD+ EDW Y+ + +D E G Sbjct: 103 FIGTPAGMNNNFYDLYQHANGAEDWFNYKAKASDTKIVDPEELEKAKEVMGE 154 >gi|262276634|ref|ZP_06054439.1| P-loop protein [alpha proteobacterium HIMB114] gi|262225214|gb|EEY75661.1| P-loop protein [alpha proteobacterium HIMB114] Length = 409 Score = 57.8 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 40/251 (15%), Positives = 78/251 (31%), Gaps = 30/251 (11%) Query: 78 FKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPH 137 F+ I+ GR GKT L +L I ++ + K +W ++ K + L Sbjct: 17 FRVLIT-GRRFGKTHLCLVEILRQARHCDNGKIFYVSPTYRMSKEIMWKQIKKLVKEL-- 73 Query: 138 RHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFN 197 W + E + I + +++ D G + Sbjct: 74 --------------RWDKYINETELTIVLVNNCQISLKGADKSADNLRGV---GLNFLVL 116 Query: 198 DEASGTPDIIN-KSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLE---DWKRYQI 253 DE + P+ + + ++ N + + W YD+F +WK ++ Sbjct: 117 DEFADIPEEAWTEVLRPTISDKYANGKVLFVGTPKGYGNWSYDMFQRGQAGDPEWKSWKY 176 Query: 254 DTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAID- 312 T ++ E S R E F + + A + + + Sbjct: 177 TTIEGGQVEPHEIEQAKKDLDARS--FRQEYEASFETYAGVVYYNF---DRAKNVKPVPY 231 Query: 313 DLYAPLIMGCD 323 D A + +G D Sbjct: 232 DQNAVIHIGMD 242 >gi|57867562|ref|YP_189190.1| prophage, terminase, ATPase subunit [Staphylococcus epidermidis RP62A] gi|57638220|gb|AAW55008.1| prophage, terminase, ATPase subunit, putative [Staphylococcus epidermidis RP62A phage SP-beta] Length = 599 Score = 57.4 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 51/302 (16%), Positives = 84/302 (27%), Gaps = 67/302 (22%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 A RG+GKT L+A L PG II A +++Q N L ++ LS L HR + Sbjct: 82 ASRGLGKTFLSAVYCLTRCILYPGTKIIITAPTKSQGINVLEKIENELLSPLIHREIESI 141 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 + + P + + + D G H + + V DE Sbjct: 142 NTGNQKPMIAF---------HNGSWIRVVASN------DNARG-HRANLLLV--DEFVKV 183 Query: 204 P-DIINKSILGFFTELNPNRFWIMTS---NTRRLNGWFYDIFNIPLEDWKRYQIDTRTVE 259 D+I+ T F R N Y W + + T + Sbjct: 184 DEDLIDTVFKKMLTSQREPAFLHKAKYKNYPREENTQMYLSSAWMKSHWAYDSMRSFTRQ 243 Query: 260 GIDSGFHEGIISR------------------------------------------YGLDS 277 + + + S +G Sbjct: 244 MLKKKSEDDLKSFVCHIPYYTGVMEKLYSHKQMKAEAQAEGFNKMKFAMEMEAVWWGETE 303 Query: 278 DVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGG---DKTVV 334 F ++ F P + +A I + ++ D+A GG D +V Sbjct: 304 SAFFNFNTIDFNRKLSQAFYPKEVLVQADINNPIKEPKEKRLLAVDVARMGGNSNDASVF 363 Query: 335 VF 336 Sbjct: 364 SL 365 >gi|218457805|ref|YP_002418810.1| terminase, large subunit [Pseudomonas phage SN] gi|218379073|emb|CAT99652.1| terminase, large subunit [Pseudomonas phage SN] Length = 460 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 31/161 (19%), Positives = 57/161 (35%), Gaps = 7/161 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + I + N WI+ N + + Y F P +D Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWIIF-NPNEVTDFVYQNFVVKPPKDSCVKM 172 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQ-QEVNNFIPHNYIEEAMS--RE 309 I+ + + I Y D + A I G P+ + I +I A+ ++ Sbjct: 173 INWNENPFLSETMLKVIHEAYERDREQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 + +G D+A +G D GN+I + +W Sbjct: 232 LGWEPAGSKRIGFDVADDGEDANATTLMHGNVIMEVDEWDG 272 >gi|218148543|ref|YP_002364311.1| terminase, large subunit [Pseudomonas phage 14-1] gi|218059739|emb|CAU13815.1| terminase, large subunit [Pseudomonas phage 14-1] Length = 460 Score = 56.3 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 31/161 (19%), Positives = 57/161 (35%), Gaps = 7/161 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + I + N WI+ N + + Y F P +D Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWIIF-NPNEVTDFVYQNFVVKPPKDSCVKM 172 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQ-QEVNNFIPHNYIEEAMS--RE 309 I+ + + I Y D + A I G P+ + I +I A+ ++ Sbjct: 173 INWNENPFLSETMLKVIHEAYERDREQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 + +G D+A +G D GN+I + +W Sbjct: 232 LGWEPAGSKRIGFDVADDGEDANATTLMHGNVIMEVDEWDG 272 >gi|197261331|ref|YP_002154147.1| putative terminase, large subunit [Pseudomonas phage LBL3] gi|197244421|emb|CAR31156.1| putative terminase, large subunit [Pseudomonas phage LBL3] Length = 460 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 31/161 (19%), Positives = 57/161 (35%), Gaps = 7/161 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + I + N WI+ N + + Y F P +D Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWIIF-NPNEVTDFVYQNFVVKPPKDSCVKM 172 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQ-QEVNNFIPHNYIEEAMS--RE 309 I+ + + I Y D + A I G P+ + I +I A+ ++ Sbjct: 173 INWNENPFLSETMLKVIHEAYERDREQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 + +G D+A +G D GN+I + +W Sbjct: 232 LGWEPAGSKRIGFDVADDGEDANATTLMHGNVIMEVDEWDG 272 >gi|218296139|ref|ZP_03496908.1| protein of unknown function DUF264 [Thermus aquaticus Y51MC23] gi|218243516|gb|EED10045.1| protein of unknown function DUF264 [Thermus aquaticus Y51MC23] Length = 426 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 36/183 (19%), Positives = 69/183 (37%), Gaps = 16/183 (8%) Query: 186 GPHNTHGMAVFNDEASGTPD---IINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN 242 G + +A+ DEA+ P + ++IL + WI ++ R FY+++N Sbjct: 112 GRGRAYDLAII-DEAAFAPSLARVWEEAILPTLLDR-LGSAWIASTPKGRNA--FYELWN 167 Query: 243 IPLED--WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHN 300 + L+D W + + + + + + R EIL ++ E F + Sbjct: 168 LTLDDPAWAHFHEPSHRNPFLSQEELARMAATMTRE--RYRQEILAEWVDAEGRVF-SED 224 Query: 301 YIEEAMSREAIDDL--YAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFD--WSAKLIQET 356 +E A+ + +D G D+A V V R G +E + W T Sbjct: 225 ALEAALLLQGPEDPRPGERYAAGVDLARSQDYTAVAVLRLGAQLELVRVERWRGLSYTLT 284 Query: 357 NQE 359 ++ Sbjct: 285 ARK 287 >gi|197261421|ref|YP_002154236.1| putative terminase, large subunit [Pseudomonas phage LMA2] gi|197244511|emb|CAR31245.1| putative terminase, large subunit [Pseudomonas phage LMA2] Length = 460 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 32/161 (19%), Positives = 57/161 (35%), Gaps = 7/161 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + I + N WI+ N + + Y F P +D Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWIIF-NPNEVTDFVYQNFVVKPPKDSCVKM 172 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQ-QEVNNFIPHNYIEEAMS--RE 309 I+ + + I Y D + A I G P+ + I +I A+ ++ Sbjct: 173 INWNENPFLSETMLKVIHEAYERDREQAE-HIYGGIPKTGGDKSVINLKFILAAIDAHKK 231 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 + +G D+A +G D GNII + +W Sbjct: 232 LGWEPAGSKRIGFDVADDGEDANATTLMHGNIIMEVDEWDG 272 >gi|159044464|ref|YP_001533258.1| hypothetical protein Dshi_1915 [Dinoroseobacter shibae DFL 12] gi|157912224|gb|ABV93657.1| hypothetical protein Dshi_1915 [Dinoroseobacter shibae DFL 12] Length = 260 Score = 56.3 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 55/293 (18%), Positives = 95/293 (32%), Gaps = 62/293 (21%) Query: 36 FPWGIK-------GKPLEHFSQ--PHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGR 86 PW L H+ P WQ+E + A+ GR Sbjct: 7 IPWAEDLERRLDPVSRLTHWMGHAPDPWQVEAF--------------TTRATEVALRVGR 52 Query: 87 GIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSL 146 GKT++ A + + P +C+A +E Q K + E+ + ++Q Sbjct: 53 QSGKTSVLAARAVEELHV-PESLTLCVAPAERQAK-IIAREIGR-----------QLQRT 99 Query: 147 SLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEAS----- 201 SL + LE + G + + DT G + + DE + Sbjct: 100 SLVINRPTQTELEIANGA-----RVIALPSTS---DTIRGFPAVSCLII--DECAFLQGD 149 Query: 202 -GTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF--NIPLEDWKRYQIDTRTV 258 G D+I S+L TE +S N +F +F P + R + + Sbjct: 150 GGGEDLI-SSVLPMLTEDGQ---VFFSSTPAGKNNYFARLFLDAKPGDGIHRIVVRGTDI 205 Query: 259 EGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAI 311 + + E + R EIL + + + + IE+A S+ Sbjct: 206 PRL-ADKVERMRRTLSATK--FRQEILVEM-LADGQAYFDLSIIEQATSKTEK 254 >gi|169633984|ref|YP_001707720.1| putative bacteriophage protein; putative prophage terminase large subunit [Acinetobacter baumannii SDF] gi|169152776|emb|CAP01795.1| putative bacteriophage protein; putative prophage terminase large subunit [Acinetobacter baumannii] Length = 552 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 41/245 (16%), Positives = 81/245 (33%), Gaps = 29/245 (11%) Query: 128 VSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGP 187 K+ M + L P G+ ++ + M I + T + + G Sbjct: 155 FHKFRDMFSKMPQW------LKPKGFVEKVHDNYMRIINPDNGATITGEAGDNI----GR 204 Query: 188 HNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED 247 M DE + +++ ++ N N I S + F+ + Sbjct: 205 GGRTTMYFL-DEWAFVEQ--QEAVDAAISQ-NTNVH-IKGSTPNGIGDRFHQ--DRFSGR 257 Query: 248 WKRYQIDTRTVE--GIDSGFHEGIISRYGL------DSDVARIEILGQFPQQEVNNFIPH 299 + + + R ++ +I + D V E+ + IP Sbjct: 258 YAVFTMPWRDNPDKNWTVTYNGKVIYPWYEKQLATLDDVVLAQEVDINYAASVEGVLIPS 317 Query: 300 NYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAK--LIQE 355 +++ A+ ++ + I G D+A EG DK R G ++ ++ WS K I Sbjct: 318 TWVQAAIDAHKKLQIEPTGDRIGGLDVADEGKDKNSFAARHGVVMTYLATWSGKGDDIFG 377 Query: 356 TNQEG 360 T Q+ Sbjct: 378 TTQKA 382 >gi|329849103|ref|ZP_08264131.1| phage terminase, large subunit, PBSX family [Asticcacaulis biprosthecum C19] gi|328844166|gb|EGF93735.1| phage terminase, large subunit, PBSX family [Asticcacaulis biprosthecum C19] Length = 430 Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 46/292 (15%), Positives = 88/292 (30%), Gaps = 31/292 (10%) Query: 58 FMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSE 117 +E + + F+ A GRG K+ A ++ PG ++ + + Sbjct: 24 ILEPIPAYRFLTKKPLGSFRFRAAY-GGRGAAKSWEFANAAIYHSLNTPGARVVFVREIQ 82 Query: 118 TQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYS 177 L ++ + V L F + H AE+L + + Sbjct: 83 GSLADSAFTLVRNRLEAYGLEGAFRQANGRFHHVENGAEILFLGL-------------WR 129 Query: 178 EERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSN--TRRLNG 235 +P+ +EAS ++ + W + + Sbjct: 130 GNKPEGIKSL--EGATLTIWEEASEGRQRSLDVLIPTVLRTPQSELWCLWNPMLPTDPVD 187 Query: 236 WFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL----GQFPQQ 291 F+ P + I R + F E + + LD + G + Sbjct: 188 RFFRGDVEPQK-----TICRRVNWDSNPHFPEALREQMALDRKKDPLRAAWIWDGAYMPS 242 Query: 292 EVNNFIPHNYIEEAM--SREAIDDLYAPLIMGCDIAGEGGDKT--VVVFRRG 339 N ++ A R+ + + +++G D AG GGD+ VV R G Sbjct: 243 AQNALWTRELLDRAWVQGRDKVMEAVGRVVVGVDPAGGGGDEVGIVVAGRYG 294 >gi|241763591|ref|ZP_04761642.1| phage terminase large subunit [Acidovorax delafieldii 2AN] gi|241367184|gb|EER61538.1| phage terminase large subunit [Acidovorax delafieldii 2AN] Length = 521 Score = 55.1 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 16/60 (26%), Positives = 24/60 (40%), Gaps = 4/60 (6%) Query: 284 ILGQFPQQEVN---NFIPHNYIEEAMSRE-AIDDLYAPLIMGCDIAGEGGDKTVVVFRRG 339 + G F + IP +++ A +R D ++G D A G DKT V R Sbjct: 276 LRGDFSAGAADPAWQLIPTEWVKAAQARWQPRQDKGPMTVLGLDPARGGTDKTSVARRHD 335 >gi|300907068|ref|ZP_07124735.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 84-1] gi|301304068|ref|ZP_07210185.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 124-1] gi|300401186|gb|EFJ84724.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 84-1] gi|300840675|gb|EFK68435.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 124-1] gi|315257729|gb|EFU37697.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 85-1] Length = 440 Score = 54.7 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 62/163 (38%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P ED + Sbjct: 96 VLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEDTLIRK 153 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D D + G P+ + + I ++IE A+ + Sbjct: 154 INYDENPFLSDTMLKVIEAAKRRDPDGFKHVYEGV-PESDDDAAIIKLSWIEAAVDAHKV 212 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 + +G D+A G DK V+R G+++ +W AK Sbjct: 213 LNFEPSGRKRIGFDVADSGADKCANVYRHGSVVYWADEWKAKE 255 >gi|294085818|ref|YP_003552578.1| hypothetical protein SAR116_2251 [Candidatus Puniceispirillum marinum IMCC1322] gi|292665393|gb|ADE40494.1| protein of unknown function DUF264 [Candidatus Puniceispirillum marinum IMCC1322] Length = 454 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 52/253 (20%), Positives = 86/253 (33%), Gaps = 25/253 (9%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 AGRG GKT A + WL + I + + + + S LS+ P+ Sbjct: 82 AGRGFGKTRAGAEWIRWLAQSGRARRIALVGETFDDARQVMVEGASGILSVCPN------ 135 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 W R YS + P+ GP +G + DE + Sbjct: 136 ---------WARPAWRAGQRTLIWPSGTIARCYSADDPEQLRGPEFDYG---WADEIAKW 183 Query: 204 PDI-INKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRT-VEGI 261 +++ + + I T+ R + W D+ ED Q +R + Sbjct: 184 RYPSAWDNLMLAL-RIGKSPQCIATTTPRPVR-WLADLA--AAEDTVLVQGASRENAANL 239 Query: 262 DSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMG 321 F + R+G DS +AR E+ G + N I + +++G Sbjct: 240 SPAFMAAMHRRFG-DSYLARQELEGIMMSNLPDALWCRNDILRLHRPMPKRHRFIRIVIG 298 Query: 322 CDIAGEGGDKTVV 334 D A GGD+T + Sbjct: 299 VDPAMGGGDETGI 311 >gi|145638997|ref|ZP_01794605.1| terminase large subunit-like protein [Haemophilus influenzae PittII] gi|145271969|gb|EDK11878.1| terminase large subunit-like protein [Haemophilus influenzae PittII] Length = 379 Score = 54.4 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 56/170 (32%), Gaps = 7/170 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 V+ +E ++ E ++ N + + Y F P E K Sbjct: 50 VVWVEEGENVSKESWDILIPTIREDGSQII--VSFNPKNILDDTYQRFVIHPPERCKSVL 107 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNYIEEAMS--RE 309 ++ + E + D ++ R G+ P + + I +IE A+ + Sbjct: 108 VNWQDNPYFPKELMEDMEQMRERDYELYRHVYEGE-PVADSDLAIIKPVWIESAVDAHLK 166 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 +G D+A EG D F G+++ + W + ++ Sbjct: 167 LGFTTKGMKKVGFDVADEGADANANAFVHGSVVLGVEVWKNGDVIDSANR 216 >gi|194434997|ref|ZP_03067239.1| phage terminase, large subunit, pbsx family [Shigella dysenteriae 1012] gi|194416779|gb|EDX32906.1| phage terminase, large subunit, pbsx family [Shigella dysenteriae 1012] gi|323166781|gb|EFZ52535.1| phage terminase, large subunit, PBSX family [Shigella sonnei 53G] Length = 447 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 61/163 (37%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P E + Sbjct: 102 VLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEGTLIRK 159 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D D + G P+ + + I ++IE A+ + Sbjct: 160 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGV-PESDDDAAIIKLSWIEAAVDAHKT 218 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 + +G D+A G DK V+R G+++ +W AK Sbjct: 219 LNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKE 261 >gi|188492395|ref|ZP_02999665.1| phage terminase large subunit [Escherichia coli 53638] gi|188487594|gb|EDU62697.1| phage terminase large subunit [Escherichia coli 53638] Length = 467 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 61/163 (37%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P E + Sbjct: 122 VLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEGTLIRK 179 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D D + G P+ + + I ++IE A+ + Sbjct: 180 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGV-PESDDDAAIIKLSWIEAAVDAHKT 238 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 + +G D+A G DK V+R G+++ +W AK Sbjct: 239 LNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKE 281 >gi|16760783|ref|NP_456400.1| bacteriophage protein [Salmonella enterica subsp. enterica serovar Typhi str. CT18] gi|25512494|pir||AE0735 probable bacteriophage protein STY2040 [imported] - Salmonella enterica subsp. enterica serovar Typhi (strain CT18) gi|16503080|emb|CAD05583.1| putative bacteriophage protein [Salmonella enterica subsp. enterica serovar Typhi] Length = 467 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 61/163 (37%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P E + Sbjct: 122 VLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEGTLIRK 179 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D D + G P+ + + I ++IE A+ + Sbjct: 180 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGV-PESDDDAAIIKLSWIEAAVDAHKT 238 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 + +G D+A G DK V+R G+++ +W AK Sbjct: 239 LNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKE 281 >gi|74311301|ref|YP_309720.1| putative bacteriophage protein [Shigella sonnei Ss046] gi|73854778|gb|AAZ87485.1| putative bacteriophage protein [Shigella sonnei Ss046] Length = 473 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 61/163 (37%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P E + Sbjct: 128 VLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEGTLIRK 185 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D D + G P+ + + I ++IE A+ + Sbjct: 186 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGV-PESDDDAAIIKLSWIEAAVDAHKT 244 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 + +G D+A G DK V+R G+++ +W AK Sbjct: 245 LNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKE 287 >gi|324012808|gb|EGB82027.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 60-1] Length = 441 Score = 53.6 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 61/163 (37%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P E + Sbjct: 96 VLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEGTLIRK 153 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D D + G P+ + + I ++IE A+ + Sbjct: 154 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGV-PESDDDAAIIKLSWIEAAVDAHKT 212 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 + +G D+A G DK V+R G+++ +W AK Sbjct: 213 LNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKE 255 >gi|323175059|gb|EFZ60673.1| phage terminase large subunit [Escherichia coli LT-68] Length = 399 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 61/163 (37%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P E + Sbjct: 54 VLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEGTLIRK 111 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D D + G P+ + + I ++IE A+ + Sbjct: 112 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGV-PESDDDAAIIKLSWIEAAVDAHKT 170 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 + +G D+A G DK V+R G+++ +W AK Sbjct: 171 LNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKE 213 >gi|163735142|ref|ZP_02142578.1| hypothetical protein RLO149_23000 [Roseobacter litoralis Och 149] gi|161391600|gb|EDQ15933.1| hypothetical protein RLO149_23000 [Roseobacter litoralis Och 149] Length = 267 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 63/193 (32%), Gaps = 35/193 (18%) Query: 51 PHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSI 110 P WQ M + + + + AG+ + K P + Sbjct: 30 PDPWQRSLMNSTSDVIMVLASRRSGKSTTVGVMAGQELAK---------------PDHQV 74 Query: 111 ICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYT 170 I ++ + Q L+A+++ F + ++L + E + S + Sbjct: 75 IILSPTLAQ-SQLLFAKIA-----------FTWEKMALPIETRRRTMTELHLKNGS---S 119 Query: 171 ITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNT 230 + C ++ + G +G+ DEA+ PD + + + N + + Sbjct: 120 VVCVPAGQDG-EGARGYGVKNGILA-FDEAAFIPDKVFGA---TLSIAEDNAKTVFITTP 174 Query: 231 RRLNGWFYDIFNI 243 +G Y+++ Sbjct: 175 GGKSGKAYEMWTN 187 >gi|119869106|ref|YP_939058.1| phage terminase [Mycobacterium sp. KMS] gi|119695195|gb|ABL92268.1| phage Terminase [Mycobacterium sp. KMS] Length = 489 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 50/326 (15%), Positives = 85/326 (26%), Gaps = 71/326 (21%) Query: 41 KGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLW 100 KG +P WQ++ + V V P RG GKTTL+A ++L+ Sbjct: 41 KGTGAREVFRPREWQMDIVRDVLDSGARTVGLMMP----------RGQGKTTLSAAILLY 90 Query: 101 LISTR-PGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLE 159 + TR G +++ A E Q S+ +Q S Y + Sbjct: 91 IFFTRGEGANVVLFAVDERQ------------ASLAFRVAARMVQLSEDLSSRCYVYADK 138 Query: 160 QSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELN 219 + + Y + + + G +A DEA + + Sbjct: 139 LVLPLTDSTYQVMPASAA-----AAEGL---DYVACLCDEAGVINRDVFEVAQLA-QGKR 189 Query: 220 PNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSD- 278 I D L + D + F +G D Sbjct: 190 ERSVLIAIGTPGPDPN---DQVLADLRAYAAEHPD--DKSLVWREFSAAGFEDHGADCPH 244 Query: 279 ------------------------------VARIEILGQFPQQEVNNFIPHNYIEEAMSR 308 R + QF F+P E + Sbjct: 245 CWELANPALDDFLHRDALHALLPPKTREATFRRARLC-QFSTDTDGAFLPAGVWEGLSTS 303 Query: 309 EAIDDLYAPLIMGCDIAGEGGDKTVV 334 + +++ D + G D T + Sbjct: 304 SPVP-PGVDVVLALDGSYNG-DTTAL 327 >gi|326804661|ref|YP_004327532.1| Gp17 terminase subunit for DNA packaging, nuclease and ATPase [Salmonella phage Vi01] gi|301795311|emb|CBW38029.1| Gp17 terminase subunit for DNA packaging, nuclease and ATPase [Salmonella phage Vi01] Length = 736 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 48/269 (17%), Positives = 83/269 (30%), Gaps = 51/269 (18%) Query: 91 TTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHP 150 TT+ A +LW I +AN E Q L + K LP + Sbjct: 269 TTVVAAFLLWYAMFHSDKEIAVLANKEKQAIEIL-DRIRKAYQDLPFFLQQGCEKFGSTL 327 Query: 151 SGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPD--IIN 208 + E + + +I R+ S ++ DE + + Sbjct: 328 IEF--ENGSKIYAYATSSDSIRGRSVS----------------LLYVDEVAFIENDFEFW 369 Query: 209 KSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI--PLE-DWKRYQIDT---RTVEGI- 261 +S + +R I+TS + G FYDI P + + + V Sbjct: 370 ESTFPAIASADTSR-CILTSTPKGQRGLFYDIVTKADPRHPQYNDFHLTEVPWYKVPAYT 428 Query: 262 -DSGFHEGIISRYGLDSDVARIEILGQFP---QQEVNNFIPHNYIEEAMSREAIDD---- 313 D + +R G +F + V + IP +++ S+ + Sbjct: 429 KDPDWETKQRARLGD------ARFDQEFGIKFRGSVGSLIPAKCLDKMTSKLYREPNEFT 482 Query: 314 ----LYAPLIMGCDIAGEG----GDKTVV 334 Y P + IA G GD +V+ Sbjct: 483 KIYKEYDPQRIYFGIADTGKGVEGDYSVL 511 >gi|326783331|ref|YP_004323723.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage Syn33] gi|310005278|gb|ADO99667.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage Syn33] Length = 549 Score = 53.2 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 43/267 (16%), Positives = 86/267 (32%), Gaps = 43/267 (16%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL 148 GK+T+ +LW + P +++ +AN E+ L + +Q L Sbjct: 85 GKSTIVTAYLLWYVLFNPNVNVAILANKAA-----TAREMLGRLQLSYENLPKWLQQGIL 139 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDII- 207 + EL S + + R S +F DE + P+ I Sbjct: 140 QWNRGSLELENGSKILAASTSASAVRGMSFN--------------VIFLDEFAFVPNHIA 185 Query: 208 ---NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN---IPLEDWKRYQIDTRTVEGI 261 S+ + + I+ S +N FY +++ ++ ++ V G Sbjct: 186 DQFFSSVYPTVSS-GKSTKVIIISTPHGMN-MFYKLWHDAEQGKNEYLPTEVHWSQVPGR 243 Query: 262 DSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIP------HNYIEEAMSRE-----A 310 D+ + E I ++E +F V+ I Y++ + Sbjct: 244 DAAWKEQTIKNTSEQQ--FKVEFECEF-LGSVDTLISPSKLRTMPYVDPVAQNKGLAIYE 300 Query: 311 IDDLYAPLIMGCDIAGE-GGDKTVVVF 336 + I+ D++ G D + V Sbjct: 301 RVEAEHNYIITVDVSRGIGNDYSAFVV 327 >gi|293396491|ref|ZP_06640767.1| phage terminase large subunit [Serratia odorifera DSM 4582] gi|291420755|gb|EFE94008.1| phage terminase large subunit [Serratia odorifera DSM 4582] Length = 430 Score = 53.2 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 61/163 (37%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++N+EA + + + + W + N R + + F P D + Sbjct: 80 VLWNEEAHAMTEAQWEVLEPTIRKEGSEC-WFLF-NPRLTTDFVWRNFVVAPPPDTLVRK 137 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNYIEEAMS--RE 309 I+ + I + D+++ LG P+ + + I ++IE A+ + Sbjct: 138 INYDENPFLSRTIMNVIEAAKARDAEMFEHVYLGM-PRTDDDEAIIKLSWIEAAVDAHKA 196 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 + +G D+A G DK V+ G++ +W A+ Sbjct: 197 LNIEPAGHRRVGFDVADSGADKCANVYAHGSVALWADEWKARE 239 >gi|282599341|ref|YP_003358653.1| Gp17 terminase DNA packaging enzyme large subunit [Shigella phage phiSboM-AG3] gi|226973647|gb|ACO94400.1| Gp17 terminase DNA packaging enzyme large subunit [Shigella phage phiSboM-AG3] Length = 736 Score = 52.8 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 53/297 (17%), Positives = 92/297 (30%), Gaps = 57/297 (19%) Query: 91 TTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHP 150 TT+ A +LW I +AN E Q L + K LP + Sbjct: 269 TTVVAAFLLWYAMFHSDKEIAVLANKEKQAIEIL-DRIRKAYQDLPFFLQQGCEKFGSTL 327 Query: 151 SGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPD--IIN 208 + E + + +I R+ S ++ DE + + Sbjct: 328 IEF--ENGSKIYAYATSSDSIRGRSVS----------------LLYVDEVAFIENDFEFW 369 Query: 209 KSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIP------LEDWKRYQIDTRTVEGI- 261 +S + +R I+TS + G FYDI D+K ++ V Sbjct: 370 ESTFPAIASADTSR-CILTSTPKGQRGLFYDIVTKANPEHPQYNDFKLTEVPWYRVPTYT 428 Query: 262 -DSGFHEGIISRYGLDSDVARIEILGQFP---QQEVNNFIPHNYIEEAMSREAIDD---- 313 D + ++ G +F + V + IP +++ S+ + Sbjct: 429 KDPNWESKQRAKLGD------ARFDQEFGIKFRGSVGSLIPAKCLDKMTSKLYQEPNEFT 482 Query: 314 ----LYAPLIMGCDIAGEG----GDKTVVVFRRGNIIEHIFDWSAKLIQETNQEGCP 362 Y P + IA G GD +V+ I I D+ K+ + P Sbjct: 483 KIYHDYDPKRIYMGIADTGKGVEGDYSVLT------ILDITDYPHKIAAKYRNNTIP 533 >gi|332091158|gb|EGI96248.1| phage terminase large subunit [Shigella dysenteriae 155-74] Length = 346 Score = 52.8 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 61/163 (37%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P E + Sbjct: 1 MLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEGTLIRK 58 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D D + G P+ + + I ++IE A+ + Sbjct: 59 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGV-PESDDDAAIIKLSWIEAAVDAHKT 117 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 + +G D+A G DK V+R G+++ +W AK Sbjct: 118 LNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKE 160 >gi|262067933|ref|ZP_06027545.1| putative protein splicing site [Fusobacterium periodonticum ATCC 33693] gi|291378336|gb|EFE85854.1| putative protein splicing site [Fusobacterium periodonticum ATCC 33693] Length = 832 Score = 52.4 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 40/254 (15%), Positives = 79/254 (31%), Gaps = 32/254 (12%) Query: 98 MLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAEL 157 +L P II ANS L ++ +R F + Y Sbjct: 353 ILHFAFNNPNKKIIVAANSLN-LITEIF-----------NRMEFLLTGSKSAYKTSYTRK 400 Query: 158 LEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTE 217 S I + T + + G V+ DEA+ + + ++ F Sbjct: 401 RSPSEKIVLINGTQINGFTTGTDGSSIRGQSADR---VYIDEAAYVTEQAYQVLM-AFKL 456 Query: 218 LNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDS 277 NPN +++ S L F + + W+ + + + + + + + Sbjct: 457 DNPNVVFVVFSTPTALETNFRK-WCLVDPAWREFHYPSSILPNFEENDGPELRNSLTEEG 515 Query: 278 DVARIEILGQFPQQEV---------NNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE- 327 ++E+ +F + + N+ + Y E E I+ + +G D Sbjct: 516 --YKLEVEAEFSEGDSKVFKTENIKNSLYQYKYCE--FREELINPEKWKITIGVDYNEFK 571 Query: 328 -GGDKTVVVFRRGN 340 G V+ GN Sbjct: 572 NGSQICVLGLYCGN 585 >gi|99080642|ref|YP_612796.1| hypothetical protein TM1040_0801 [Ruegeria sp. TM1040] gi|99036922|gb|ABF63534.1| hypothetical protein TM1040_0801 [Ruegeria sp. TM1040] Length = 416 Score = 52.4 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 47/284 (16%), Positives = 85/284 (29%), Gaps = 24/284 (8%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 G G GKT + + P A + +++T W V + Sbjct: 27 GGFGSGKTYVGCLDLGLFAGQHPKTVQGYFAPTYRDIRDTFWPTVDE-----------AA 75 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 SL A+ + S + T CR S + P VG + D S Sbjct: 76 HSLGFTTKVKSADKEVEFYRGRSYYGTTICR--SMDDPGGIVGFKIARALVDEIDILSKD 133 Query: 204 -PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWF-YDIFNI-PLEDWKRYQIDTRTVE- 259 + I+ + P + T F YD F P ++ Q T E Sbjct: 134 KAQAAWRKIIARMRLVLPGVVNGIGVTTTPEGFRFVYDSFKREPKSNYSMVQASTYENEA 193 Query: 260 GIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFI-PHNYIEEAMSREAIDDLYAPL 318 + + ++ Y + + + ++G+F ++ + PL Sbjct: 194 FLPPDYISTLLEDYPEE--LIKAYLMGEFVNLTSGTVYRSYDRLRH--RSTQSIQPREPL 249 Query: 319 IMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA-KLIQETNQEGC 361 +G D G +VV +RG + + + + C Sbjct: 250 HIGQDF-NVGNMASVVFVQRGEDWHAVDELQGLQDTPHLIEVLC 292 >gi|83943081|ref|ZP_00955541.1| hypothetical protein EE36_12908 [Sulfitobacter sp. EE-36] gi|83846089|gb|EAP83966.1| hypothetical protein EE36_12908 [Sulfitobacter sp. EE-36] Length = 259 Score = 52.4 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 64/222 (28%), Gaps = 38/222 (17%) Query: 43 KPLEHF----SQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMM 98 P+E F +P WQ++ + + SN +GR GK+T + Sbjct: 20 DPVERFRLAIGEPDAWQVDLLR--------SDPRSNEADRMILALSGRQSGKSTTAGGLG 71 Query: 99 LWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELL 158 G ++I A S Q L+ + ++ + P Q+ + + Sbjct: 72 --YDDFSRGKTVILTAPSLRQ-STELFRRILEYKNTDPFCPPIVRQTQTELEAHPRHGGR 128 Query: 159 EQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTEL 218 + + +T T + DEA D + E Sbjct: 129 IIVVPATDQARGMTADT-------------------IIADEACFLDDDALTAFFPMRKET 169 Query: 219 NPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEG 260 + S G+FY+ + +R + + Sbjct: 170 G---RIFLLSTPNMRQGYFYETWTSAKRV-RRITARSIDIPR 207 >gi|86372240|gb|ABC95184.1| GP17-terminase [Stenotrophomonas phage Smp14] Length = 536 Score = 52.0 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 45/266 (16%), Positives = 85/266 (31%), Gaps = 44/266 (16%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL 148 GKTT+ A ++LW I +AN Q + E+ L ++ + MQ Sbjct: 92 GKTTVVAAILLWYAIFNEEYRIAILANKGDQSR-----EILARLQLMYEELPWFMQVGVS 146 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDII- 207 + +L +S + + R S ++ DE + + + Sbjct: 147 VWNKGNIKLGNRSEVFTAATGGSSIRGKSVN--------------LMYLDEFAFVENDVD 192 Query: 208 -NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTR---TVEGIDS 263 S T I+TS +N FY I+ Y + D Sbjct: 193 FYTSTYPVVTS-GTKTKVIITSTPNGMN-LFYKIWTDSTNGKNNYVHNEAFWHDHPKRDQ 250 Query: 264 GFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLY-------- 315 + + + E L +F Q + + +E+ ++ I +L Sbjct: 251 AWKDEQLRNMSERQ--FEQEFLCKF-QGSSDTLLSPAKLEQLTYQDHIRELGGNRDFKIY 307 Query: 316 ------APLIMGCDIA-GEGGDKTVV 334 A ++ D++ G G D +V+ Sbjct: 308 EDPIKDASYVVTVDVSEGIGKDYSVI 333 >gi|312126991|ref|YP_003991865.1| hypothetical protein Calhy_0759 [Caldicellulosiruptor hydrothermalis 108] gi|311777010|gb|ADQ06496.1| conserved hypothetical protein [Caldicellulosiruptor hydrothermalis 108] Length = 444 Score = 52.0 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 48/261 (18%), Positives = 76/261 (29%), Gaps = 34/261 (13%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 AGR GK+T+ ++ +T+ A S Q K + E + + Sbjct: 54 AGRRFGKSTVTLIDVVHECATKTKQVWYITAPSIDQAK-IYFQEFE---QRAANNSLLDA 109 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 S + L I + + G EA+ Sbjct: 110 LVKDFKWSPFPEITLINGSKILGR--------STSRNGVYLRGKGADGVAIT---EAAFI 158 Query: 204 PDIIN-KSILGFFTELNPNRFWIMTSNTRRLN-GWFYDIFNIPLED----WKRYQIDTRT 257 D + I + N T + Y +F L D +K + Sbjct: 159 KDKVYHDVIRAMVLDRNGVLRL----ETTPNGMNYVYKLFQEGLNDSTGYYKSFHATVYD 214 Query: 258 VEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFI-PHNYIEEAM---SREAIDD 313 E +D E I RIE L +F E ++FI P N + E + Sbjct: 215 NERLDREELERIRREIPE--LAWRIEYLAEF--VEDDSFIFPWNLLCEVFDDYELKKEPQ 270 Query: 314 LYAPLIMGCDIAGEGGDKTVV 334 +G D+A D TV+ Sbjct: 271 NGHRYSIGVDLAKY-QDYTVI 290 >gi|114320225|ref|YP_741908.1| hypothetical protein Mlg_1066 [Alkalilimnicola ehrlichii MLHE-1] gi|114226619|gb|ABI56418.1| hypothetical protein Mlg_1066 [Alkalilimnicola ehrlichii MLHE-1] Length = 463 Score = 52.0 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 43/322 (13%), Positives = 96/322 (29%), Gaps = 36/322 (11%) Query: 38 WGIKGKPLEHFSQ-P-HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNA 95 W L F P + + A+ + + + GK+ A Sbjct: 24 WAAWRALLSGFYGLPLDDAEAQHWHALTDRESAPQSAHDELWLVVGRRG----GKSNAAA 79 Query: 96 WMMLWLISTRPGMSIIC---IANSE------TQLKNTLWAEVSKWLSMLPHRHWFEMQSL 146 + ++ + + +A + Q ++ + +S + P ++ L Sbjct: 80 LLAVYEACFKDHRDALAPGEVATTRVMAADRAQARSV-FRYISGLMHANPM-----LERL 133 Query: 147 SLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDI 206 + EL +++ T R Y+ F +D+++ Sbjct: 134 IVREDRESIELSNRAVIEVGTASFRTTRGYT------FAAVIADEVAFWRSDDSANPDSE 187 Query: 207 INKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFH 266 I ++ LN + S+ G ++ + + ++ Sbjct: 188 IIAAVRPGLATLNGKLIAL--SSPYARRGELWENYRRHYGKASPILVAQAPSRTMNPSLP 245 Query: 267 EGII-SRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLI---MGC 322 E ++ D A E L +F + +V F+ +E A ++ Y + Sbjct: 246 ERVVTEAMERDPASAAAEYLAEF-RTDVETFLQREVVEAATRPTPLELPYNKRVTYTAFV 304 Query: 323 DIAGEGGDK--TVVVFRRGNII 342 D AG G D+ + R G + Sbjct: 305 DPAGGGADEFTAAIGHREGERV 326 >gi|78212008|ref|YP_380787.1| hypothetical protein Syncc9605_0456 [Synechococcus sp. CC9605] gi|78196467|gb|ABB34232.1| conserved hypothetical protein [Synechococcus sp. CC9605] Length = 414 Score = 52.0 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 40/254 (15%), Positives = 90/254 (35%), Gaps = 39/254 (15%) Query: 82 ISAGRGIGKTTLN-AWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHW 140 +++GR GKT + W++ + T G + +A + Q K W ++ Sbjct: 25 VNSGRRFGKTRMALTWLLEGALLT-SGSRMWFLAPTRVQAKQIAWRDLK----------- 72 Query: 141 FEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEA 200 + P W +++ E ++ I+ ++ + + + D+ G DE Sbjct: 73 ------EMVPGSWASQVRESTLTIELRNGSHI-QLAGADYADSLRGQRADR---FAIDEY 122 Query: 201 SGTPDI--INKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPL--EDWKRYQIDTR 256 D+ + ++ L + + I +S + +++ E W R+ + Sbjct: 123 CYIRDLQEMWQAALLPMLGTSDDGSVIFSSTPAGGGTFSAELWERAETAEGWARWNFPSV 182 Query: 257 TVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDL-- 314 + + E +D + R E G + + + A +++ I D Sbjct: 183 AGGWVKPEYVEQARQT--MDPSLWRQEFFG-----SIESL--LGAVYPAFNQQNISDTVD 233 Query: 315 -YAPLIMGCDIAGE 327 PL++GCD Sbjct: 234 NGGPLLVGCDFNRS 247 >gi|284008456|emb|CBA74928.1| phage terminase large subunit [Arsenophonus nasoniae] Length = 477 Score = 52.0 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 45/310 (14%), Positives = 80/310 (25%), Gaps = 73/310 (23%) Query: 84 AGRGIGKTTLNAWMMLW--------LISTRPGMSIICIANSETQLKNTLWAEVSKWLSML 135 GRG KT A + L + R M+ I E + L AE+ Sbjct: 21 GGRGGMKTVSFAKIALITASINKRRFLCLREFMNSI-----EDSVHAVLQAEIETLRLQN 75 Query: 136 PHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAV 195 R Y +L I SKH Sbjct: 76 RFRILDNCIKGINDSIFKYGQLARNIASIKSKHDFDVA---------------------- 113 Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED-------- 247 + +EA + ++ + W N +G Y F P +D Sbjct: 114 WVEEAETVSEKSLDILIPTIRKPGSE-LWFSF-NPAEEDGAVYKRFVKPYKDIIDDKGYY 171 Query: 248 ------------WKRYQIDTR---TVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQE 292 + E + ++ + YG + D Sbjct: 172 EDDDLYVGKVSYLDNPWLPEELKNDAEKMKRDNYKKWLHVYGGECDANY----------- 220 Query: 293 VNNFIPHNYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 + I +++ A+ + ++ D A G D+ + R G ++E WS Sbjct: 221 DDAIIQPEWVDAAIDAHIKLGFKPKGIRVITFDPADSGQDEKALSKRYGVLVEDCVSWSE 280 Query: 351 KLIQETNQEG 360 + + + Sbjct: 281 GDVADATIKA 290 >gi|103487487|ref|YP_617048.1| hypothetical protein Sala_2004 [Sphingopyxis alaskensis RB2256] gi|98977564|gb|ABF53715.1| protein of unknown function DUF264 [Sphingopyxis alaskensis RB2256] Length = 436 Score = 52.0 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 53/264 (20%), Positives = 87/264 (32%), Gaps = 31/264 (11%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 AGRG GKT A + T PG I +A S L E Sbjct: 56 AGRGFGKTRTGAEWVRAFAETTPGARIALVAAS--------------LLEARQVMVEGES 101 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 L++ P E E S+ + YS PD+ GP + A + DE + Sbjct: 102 GLLAIAPDHLRPE-YESSLRRLTWPNGAVATLYSAVEPDSLRGPEHD---AAWCDEIAKW 157 Query: 204 P--DIINKSILGFFT-ELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEG 260 P + ++ T + + T+ R + I + + R Sbjct: 158 PKGEAAWDNL--MLTMRIGARPQVVATTTPRCVPLVRRLIQERGVATTRGRTASNR--RN 213 Query: 261 IDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIM 320 + + + + YG + + R E+ G+ + + IE +A +++ Sbjct: 214 LSVQWLATMDAIYG-GTRLGRQELDGELLEDVEDALWTRALIERCRVDAGSIGKFARVVI 272 Query: 321 GCD-IAGEGGDKTVVV----FRRG 339 G D A GGD +V R G Sbjct: 273 GVDPPASAGGDACGIVVAALLRDG 296 >gi|61806303|ref|YP_214662.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-SSM4] gi|61563847|gb|AAX46902.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-SSM4] Length = 550 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 50/345 (14%), Positives = 107/345 (31%), Gaps = 59/345 (17%) Query: 11 LEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNV 70 ++++ E + A+ + F +R P + ++ +Q + + H + Sbjct: 24 TKKQVAEYMKCAQDPVYFIRKYIRIVSLDEGVIPFDMYN----FQEDMVTKFHQHRFNIA 79 Query: 71 NNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSK 130 + GK+T+ +LW + +++ +AN E+ Sbjct: 80 KLPRQS------------GKSTIVTAYLLWYVLFNANVNVAILANKAP-----TAREMLG 122 Query: 131 WLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNT 190 L + MQ L + EL S + S R S Sbjct: 123 RLQLSYENLPKWMQQGILGWNKGSLELENGSKILASSTSASAVRGMSFN----------- 171 Query: 191 HGMAVFNDEASGTPDII----NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN---I 243 +F DE + P+ I S+ + + I+ S +N FY +++ Sbjct: 172 ---IIFLDEFAFVPNHIAEQFFASVYPTISS-GKSTKVIIISTPHGMN-QFYKLWHDAER 226 Query: 244 PLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIE 303 ++ ++ V G D + + I R+E +F V+ I + + Sbjct: 227 GANNYVATEVHWSQVPGRDDKWKQQTIENTSE--AQFRVEFECEF-LGSVDTLITPSKLR 283 Query: 304 EAMSREAIDD-----------LYAPLIMGCDIAGE-GGDKTVVVF 336 ++ I + I+ D++ G D + Sbjct: 284 IMPYKDPIQENRGLAVYEHVQENHNYIITVDVSRGVGNDYSAFCV 328 >gi|68249883|ref|YP_248995.1| phage terminase large subunit [Haemophilus influenzae 86-028NP] gi|68058082|gb|AAX88335.1| predicted phage terminase large subunit [Haemophilus influenzae 86-028NP] Length = 438 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 40/273 (14%), Positives = 100/273 (36%), Gaps = 23/273 (8%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GK+ A +++ I+ R + + C + + +++ ++ + L + FE+ Sbjct: 12 GGRGSGKSWGVAQLLI-EIAVRTKVRVFCGRELQNSMSDSVIKLIADTIEDLGYLEEFEV 70 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 Q +++ +E + + + + + G V+ +EA Sbjct: 71 QRNAIYCLKTGSEFMFYGIKNNP------------NKIKSLEGID-----LVWIEEAENV 113 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDS 263 + ++ + W+ + L+ + P ++ +I+ Sbjct: 114 SNESWDILIPTIRKERSE-IWVTFNPKNILDPTYQRFVIAPPKNSFVRKINYDENPYFPE 172 Query: 264 GFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNYIEEAMS--REAIDDLYAPLIM 320 + D ++ R LG+ P + + I +IE A+ ++ I+ Sbjct: 173 TLRLEMEECKERDYELYRHIWLGE-PVADSDKVIIKPVWIECAVDAHKKLGFLPAGRKIV 231 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 G D+A +G D F G+++ + +W + + Sbjct: 232 GFDVADDGVDSNANAFVHGSVVLRVDEWRGEDV 264 >gi|218296727|ref|ZP_03497433.1| protein of unknown function DUF264 [Thermus aquaticus Y51MC23] gi|218242816|gb|EED09350.1| protein of unknown function DUF264 [Thermus aquaticus Y51MC23] Length = 425 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 46/260 (17%), Positives = 87/260 (33%), Gaps = 22/260 (8%) Query: 88 IGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLS 147 GK+ G + + ++ E Q S+ L+ H M+ + Sbjct: 28 TGKSFALTLEAALHAVEHRGSTWVLLSAGERQ---------SRELAEKAKAHLDAMKQVG 78 Query: 148 LHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEAS--GTPD 205 + E E ++ + ++ + P T G V DE + + Sbjct: 79 TLMESRFFEGGESVTQLEIRLPNLSRLIFLPANPRTARGYTGN----VVLDEFAFHQDSE 134 Query: 206 IINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGF 265 I ++ T P+ + S G F++++ W R+++ Sbjct: 135 AIWAAMYPIITRR-PDLKIRVMSTPNGPRGKFWELWEKGGPAWSRHKVTIYDAVAQGLPV 193 Query: 266 HEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAP--LIMGCD 323 + D + + E L +F E F+P + I EA +RE + P +G D Sbjct: 194 DPEELRAGLADDFIWQQEYLCEFLSAEE-AFLPWSLILEAEAREDPRGPWNPDQAYLGVD 252 Query: 324 IAGEGGDKTVVVF--RRGNI 341 + D TV V R G++ Sbjct: 253 VGRH-RDLTVFVVLERVGDV 271 >gi|319775727|ref|YP_004138215.1| phage terminase large subunit [Haemophilus influenzae F3047] gi|319896735|ref|YP_004134928.1| phage terminase large subunit [Haemophilus influenzae F3031] gi|317432237|emb|CBY80589.1| predicted phage terminase large subunit [Haemophilus influenzae F3031] gi|317450318|emb|CBY86534.1| predicted phage terminase large subunit [Haemophilus influenzae F3047] Length = 449 Score = 51.7 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 40/273 (14%), Positives = 100/273 (36%), Gaps = 23/273 (8%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GK+ A +++ I+ R + + C + + +++ ++ + L + FE+ Sbjct: 23 GGRGSGKSWGVAQLLV-EIAVRTKVRVFCGRELQNSMSDSVIKLIADTIEDLGYLEDFEV 81 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 Q +++ +E + + + + + G V+ +EA Sbjct: 82 QRNAIYCLKTGSEFMFYGIKNNP------------NKIKSLEGID-----LVWIEEAENV 124 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDS 263 + ++ + W+ + L+ + P ++ +I+ Sbjct: 125 SNESWDILIPTIRKERSE-IWVTFNPKNILDPTYQRFVIAPPKNSFVRKINYDENPYFPE 183 Query: 264 GFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNYIEEAMS--REAIDDLYAPLIM 320 + D ++ R LG+ P + + I +IE A+ ++ I+ Sbjct: 184 TLRLEMEECKERDYELYRHIWLGE-PVADSDKVIIKPVWIECAVDAHKKLGFLPAGRKIV 242 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 G D+A +G D F G+++ + +W + + Sbjct: 243 GFDVADDGVDSNANAFVHGSVVLRVDEWRGEDV 275 >gi|326782863|ref|YP_004323261.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-RSM4] gi|310004122|gb|ADO98516.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-RSM4] Length = 547 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 46/274 (16%), Positives = 86/274 (31%), Gaps = 57/274 (20%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL 148 GK+T+ +LW + +++ +AN E+ + L + MQ L Sbjct: 83 GKSTIVTAYLLWYVLFNANVNVAILANKAA-----TAREMLQRLQLSYENLPNWMQQGIL 137 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDII- 207 + EL S + + R S +F DE + P+ I Sbjct: 138 QWNRGSLELENGSKIMAASTSASAVRGMSFN--------------VIFLDEFAFIPNHIA 183 Query: 208 ---NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN---IPLEDWKRYQIDTRTVEGI 261 S+ + + I+ S +N FY +++ ++ ++ V G Sbjct: 184 DQFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDAERGTNEYVPTEVHWSEVPGR 241 Query: 262 DSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPL--- 318 D + E I R+E +F V+ I A S+ I + P+ Sbjct: 242 DDVWKEQTIKNTSE--SQFRVEFECEF-LGSVDTLI-------APSKLRIMPYHDPITSN 291 Query: 319 ---------------IMGCDIAGE-GGDKTVVVF 336 I+ D++ G D + Sbjct: 292 RGLAVYEQVIPEHNYIITVDVSRGVGNDYSAFCV 325 >gi|145629819|ref|ZP_01785613.1| predicted phage terminase large subunit [Haemophilus influenzae 22.1-21] gi|148827544|ref|YP_001292297.1| hypothetical protein CGSHiGG_04845 [Haemophilus influenzae PittGG] gi|144977965|gb|EDJ87753.1| predicted phage terminase large subunit [Haemophilus influenzae 22.1-21] gi|148718786|gb|ABQ99913.1| hypothetical protein CGSHiGG_04845 [Haemophilus influenzae PittGG] Length = 449 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 40/273 (14%), Positives = 100/273 (36%), Gaps = 23/273 (8%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GK+ A +++ I+ R + + C + + +++ ++ + L + FE+ Sbjct: 23 GGRGSGKSWGVAQLLV-EIAVRTKVRVFCGRELQNSMSDSVIKLIADTIEDLGYLEEFEV 81 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 Q +++ +E + + + + + G V+ +EA Sbjct: 82 QRNAIYCLKTGSEFMFYGIKNNP------------NKIKSLEGID-----LVWIEEAENV 124 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDS 263 + ++ + W+ + L+ + P ++ +I+ Sbjct: 125 SNESWDILIPTIRKERSE-IWVTFNPKNILDPTYQRFVIAPPKNSFVRKINYDENPYFPE 183 Query: 264 GFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNYIEEAMS--REAIDDLYAPLIM 320 + D ++ R LG+ P + + I +IE A+ ++ I+ Sbjct: 184 TLRLEMEECKERDYELYRHIWLGE-PVADSDKVIIKPVWIECAVDAHKKLGFLPAGRKIV 242 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 G D+A +G D F G+++ + +W + + Sbjct: 243 GFDVADDGVDSNANAFVHGSVVLRVDEWHGEDV 275 >gi|326784324|ref|YP_004324722.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-SSM5] gi|310003555|gb|ADO97951.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-SSM5] Length = 549 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 45/267 (16%), Positives = 88/267 (32%), Gaps = 47/267 (17%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL 148 GK+T+ +LW + P +++ +AN E+ + L + +Q L Sbjct: 85 GKSTIVTSYLLWYVLFNPNVNVAILANKAA-----TAREMLQRLQLSYENLPKWLQQGIL 139 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDII- 207 + EL S + + R S +F DE + P+ I Sbjct: 140 QWNRGSLELENGSKIMAASTSASAVRGMSFN--------------VIFLDEFAFIPNHIA 185 Query: 208 ---NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN---IPLEDWKRYQIDTRTVEGI 261 S+ + + I+ S +N FY +++ ++ ++ V G Sbjct: 186 DQFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDAERGSNEYVPTEVHWSEVPGR 243 Query: 262 DSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIE-------------EAMSR 308 D + E I R+E +F V+ I + + A+ Sbjct: 244 DEVWKEQTIKNTSEQQ--FRVEFECEF-LGSVDTLISPSKLRIMPYHEPMNQNRGLAVFE 300 Query: 309 EAIDDLYAPLIMGCDIAGE-GGDKTVV 334 +AI + I+ D++ G D + Sbjct: 301 QAIPE--HNYILTVDVSRGVGNDYSAF 325 >gi|260583110|ref|ZP_05850891.1| phage terminase large subunit [Haemophilus influenzae NT127] gi|260093822|gb|EEW77729.1| phage terminase large subunit [Haemophilus influenzae NT127] Length = 445 Score = 51.3 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 40/273 (14%), Positives = 100/273 (36%), Gaps = 23/273 (8%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GK+ A +++ I+ R + + C + + +++ ++ + L + FE+ Sbjct: 19 GGRGSGKSWGVAQLLV-EIAVRTKVRVFCGRELQNSMSDSVIKLIADTIEDLGYLEEFEV 77 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 Q +++ +E + + + + + G V+ +EA Sbjct: 78 QRNAIYCLKTGSEFMFYGIKNNP------------NKIKSLEGID-----LVWIEEAENV 120 Query: 204 PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDS 263 + ++ + W+ + L+ + P ++ +I+ Sbjct: 121 SNESWDILIPTIRKERSE-IWVTFNPKNILDPTYQRFVIAPPKNSFVRKINYDENPYFPE 179 Query: 264 GFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNYIEEAMS--REAIDDLYAPLIM 320 + D ++ R LG+ P + + I +IE A+ ++ I+ Sbjct: 180 TLRLEMEECKERDYELYRHIWLGE-PVADSDKVIIKPVWIECAVDAHKKLGFLPAGRKIV 238 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 G D+A +G D F G+++ + +W + + Sbjct: 239 GFDVADDGVDSNANAFVHGSVVLRVDEWHGEDV 271 >gi|326782611|ref|YP_004323017.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-SM1] gi|310002825|gb|ADO97224.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-SM1] Length = 549 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 43/269 (15%), Positives = 87/269 (32%), Gaps = 47/269 (17%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL 148 GK+T+ +LW + +++ +AN E+ + L + +Q L Sbjct: 85 GKSTIVTSYLLWYVLFNDNVNVAILANKAA-----TAREMLQRLQLSYENLPKWLQQGIL 139 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDII- 207 + EL S + + R S +F DE + P+ I Sbjct: 140 QWNRGSLELENGSKIMAASTSASAVRGMSFN--------------VIFLDEFAFIPNHIA 185 Query: 208 ---NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN---IPLEDWKRYQIDTRTVEGI 261 S+ + + I+ S +N FY +++ ++ ++ V G Sbjct: 186 DQFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDAERGTNEYIPTEVHWSEVPGR 243 Query: 262 DSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIE-------------EAMSR 308 D + E I R+E +F V+ I + + A+ Sbjct: 244 DDVWKEQTIKNTSEQQ--FRVEFECEF-LGSVDTLISPSKLRIMPYHDPMKENRGLAIFE 300 Query: 309 EAIDDLYAPLIMGCDIAGE-GGDKTVVVF 336 ++I D ++ D++ G D + Sbjct: 301 QSIPD--HNYVITVDVSRGVGNDYSAFCV 327 >gi|126011061|ref|YP_001039811.1| TerL-like protein [Burkholderia ambifaria phage BcepF1] gi|119712637|gb|ABL96858.1| TerL-like protein [Burkholderia ambifaria phage BcepF1] Length = 459 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 53/163 (32%), Gaps = 11/163 (6%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + I W++ N + + Y F P D Q Sbjct: 115 ILWLEEAQYLTEEQWNVINPTIRREGSQ-IWLIW-NPDQYTDFIYQNFVVNPPADCLSKQ 172 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQ-QEVNNFIPHNYIEEAMSREAI 311 I+ + + I Y D + G P+ I Y+ A+ +A Sbjct: 173 INWTENPFLSDTMLKVIYDEYQRD-PKLAEHVYGGAPKMGGDKAIIQLQYVLAAI--DAH 229 Query: 312 DDLYAPLI----MGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 L + G DIA +G D +V GN++ +W Sbjct: 230 KKLGWKIEGSKRTGFDIADDGDDANAIVDAIGNVVVWAEEWDG 272 >gi|326783550|ref|YP_004323947.1| terminase DNA packaging enzyme large subunit [Synechococcus phage Syn19] gi|310005053|gb|ADO99443.1| terminase DNA packaging enzyme large subunit [Synechococcus phage Syn19] Length = 549 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 40/267 (14%), Positives = 87/267 (32%), Gaps = 43/267 (16%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL 148 GK+T+ +LW + +++ +AN E+ + L + +Q L Sbjct: 85 GKSTIVTSYLLWYVLFNQNVNVAILANKAA-----TSREMLQRLQLSYENLPKWLQQGIL 139 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDII- 207 + EL S + + + R S +F DE + P+ I Sbjct: 140 QWNRGSLELENGSKIMAASTSSSAVRGMSFN--------------VIFLDEFAFVPNHIA 185 Query: 208 ---NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN---IPLEDWKRYQIDTRTVEGI 261 S+ + + I+ S +N FY +++ ++ ++ V G Sbjct: 186 DQFFSSVYPTISS-GQSTKVIIISTPHGMN-MFYKLWHDAERSKNEYIPTEVHWSEVPGR 243 Query: 262 DSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIE-----------EAMSREA 310 D+ + E I+ ++E +F V+ I + + + ++ Sbjct: 244 DAKWKEQTIANTSEQQ--FKVEFECEF-LGSVDTLISPSKLRVMPYHDPIAQNKGLAVYK 300 Query: 311 IDDLYAPLIMGCDIAG-EGGDKTVVVF 336 + I+ D+A D + Sbjct: 301 RAEPDHNYIITVDVARGTSNDYSAFCV 327 >gi|291336835|gb|ADD96368.1| phage terminase large subunit [uncultured organism MedDCM-OCT-S09-C20] Length = 454 Score = 50.9 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 46/330 (13%), Positives = 104/330 (31%), Gaps = 38/330 (11%) Query: 40 IKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNN---SNPTIFKCAISAGRGIGKTTLNAW 96 +P+E P + + A + + + + G G GK+ A Sbjct: 14 EPKRPVERAIDPGA--ADALRAKILADCLPAQREFLDDESHRILSYIGGFGSGKSFALAA 71 Query: 97 MMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAE 156 +++L PG +++ + ++ L + L + + E Sbjct: 72 KLIFLGLRNPGGTLMACEPTFPMIRTVLVPAIDMALDQ--------WDIEYSYRASPQPE 123 Query: 157 LLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE--ASGT--PDIINKSIL 212 S+ + + TI C++ + + + A DE S + +L Sbjct: 124 Y---SINLPTGPVTIYCQSA-----ENYQRIRGQNICAAVWDECDTSPVDTAQKAGEMLL 175 Query: 213 GFFTELNPNRFWIMTSNTRRLNG--WFYDIF-NIPLEDWKRYQIDTRTVEGIDSGFHEGI 269 N+ + + G W Y F D + ++ T+ + + F + Sbjct: 176 ARMRTGELNQLAVAS----TPEGFRWAYRTFVENDGPDKRLIRVRTQDNPHLPADFIPSL 231 Query: 270 ISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS-REAIDDLYAPLIMGCDIAGEG 328 Y S + + + G F + P + +++ + + +G D+ G Sbjct: 232 ERNY--PSQLIQAYLEGHFVNLASCSLYP--EFDRSLNYCDTQPTENDTIWIGVDL-NVG 286 Query: 329 GDKTVVVFRRGNIIEHIFDWSAKLIQETNQ 358 T + RRG+ + + Q+ Q Sbjct: 287 NCVTQHLVRRGDEFHFFAEKVYRDTQQIAQ 316 >gi|16082806|ref|NP_395360.1| hypothetical protein YPMT1.24c [Yersinia pestis CO92] gi|31795361|ref|NP_857813.1| hypothetical protein Y1030 [Yersinia pestis KIM] gi|40787951|ref|NP_857660.2| hypothetical protein YPKMT021 [Yersinia pestis KIM] gi|45478613|ref|NP_995469.1| hypothetical protein YP_pMT025 [Yersinia pestis biovar Microtus str. 91001] gi|52788073|ref|YP_093901.1| hypothetical protein pG8786_021 [Yersinia pestis] gi|108793557|ref|YP_636707.1| hypothetical protein YPA_MT0025 [Yersinia pestis Antiqua] gi|108793757|ref|YP_636595.1| hypothetical protein YPN_MT0025 [Yersinia pestis Nepal516] gi|145597216|ref|YP_001154679.1| hypothetical protein YPDSF_4052 [Yersinia pestis Pestoides F] gi|149192775|ref|YP_001294006.1| hypothetical protein YPE_4292 [Yersinia pestis CA88-4125] gi|162417876|ref|YP_001604588.1| hypothetical protein YpAngola_0076 [Yersinia pestis Angola] gi|165939469|ref|ZP_02228016.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. IP275] gi|166214433|ref|ZP_02240468.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167402343|ref|ZP_02307808.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167422791|ref|ZP_02314544.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167466683|ref|ZP_02331387.1| hypothetical protein YpesF_02065 [Yersinia pestis FV-1] gi|229896952|ref|ZP_04512111.1| hypothetical protein YPS_4795 [Yersinia pestis Pestoides A] gi|229897756|ref|ZP_04512911.1| hypothetical protein YPH_4790 [Yersinia pestis biovar Orientalis str. PEXU2] gi|229900293|ref|ZP_04515428.1| hypothetical protein YPF_4819 [Yersinia pestis biovar Orientalis str. India 195] gi|229904817|ref|ZP_04519927.1| hypothetical protein YP516_4657 [Yersinia pestis Nepal516] gi|270491004|ref|ZP_06208077.1| phage terminase, large subunit, PBSX family [Yersinia pestis KIM D27] gi|294502015|ref|YP_003565752.1| hypothetical protein YPZ3_pMT0023 [Yersinia pestis Z176003] gi|3883031|gb|AAC82691.1| unknown [Yersinia pestis KIM 10] gi|5834709|emb|CAB55206.1| hypothetical protein YPMT1.24c [Yersinia pestis CO92] gi|45357266|gb|AAS58660.1| hypothetical protein YP_pMT025 [Yersinia pestis biovar Microtus str. 91001] gi|52538002|emb|CAG27427.1| hypothetical protein [Yersinia pestis] gi|108777821|gb|ABG20339.1| hypothetical protein YPN_MT0025 [Yersinia pestis Nepal516] gi|108782104|gb|ABG16161.1| hypothetical protein YPA_MT0025 [Yersinia pestis Antiqua] gi|145212984|gb|ABP42389.1| hypothetical protein YPDSF_4052 [Yersinia pestis Pestoides F] gi|148872433|gb|ABR14922.1| hypothetical protein YPMT1.24c [Yersinia pestis CA88-4125] gi|162350848|gb|ABX84797.1| conserved hypothetical protein [Yersinia pestis Angola] gi|165912657|gb|EDR31287.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. IP275] gi|166204381|gb|EDR48861.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. B42003004] gi|166958284|gb|EDR55305.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167048235|gb|EDR59643.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|229678132|gb|EEO74238.1| hypothetical protein YP516_4657 [Yersinia pestis Nepal516] gi|229686652|gb|EEO78733.1| hypothetical protein YPF_4819 [Yersinia pestis biovar Orientalis str. India 195] gi|229693337|gb|EEO83387.1| hypothetical protein YPH_4790 [Yersinia pestis biovar Orientalis str. PEXU2] gi|229699988|gb|EEO88028.1| hypothetical protein YPS_4795 [Yersinia pestis Pestoides A] gi|262363909|gb|ACY60628.1| hypothetical protein YPD4_pMT0023 [Yersinia pestis D106004] gi|262364065|gb|ACY64401.1| hypothetical protein YPD8_pMT0023 [Yersinia pestis D182038] gi|270334985|gb|EFA45763.1| phage terminase, large subunit, PBSX family [Yersinia pestis KIM D27] gi|294352486|gb|ADE66542.1| hypothetical protein YPZ3_pMT0023 [Yersinia pestis Z176003] gi|320017547|gb|ADW01117.1| hypothetical protein YPC_4788 [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 418 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 39/229 (17%), Positives = 75/229 (32%), Gaps = 37/229 (16%) Query: 59 MEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSET 118 + V +H +P FK + AGR GK+ L+ ++ + + +A + Sbjct: 7 LSLVQLHSGQMQVFQSPHRFKV-VCAGRRWGKSRLSISTIIRAAAKEKKQRVWYVAPTYQ 65 Query: 119 QLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSE 178 + LW ++ + P W + + +M I K+ + + Sbjct: 66 MARQILWDDLQ-----------------EVLPRKWVRKKNDTTMTIVLKNGSEIALKGA- 107 Query: 179 ERPDTFVG--PHNTHGMAVFNDEASGT-PDIINKSILGFFTELNPNRFWIMTSNTRRLNG 235 ++PDT G H V DE PD K + + ++ + + Sbjct: 108 DKPDTLRGVALH-----FVVLDEFQDMKPDTWYKVLRPTLSSTRGGA--LIIGTPKGFS- 159 Query: 236 WFYDIF----NIP---LEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDS 277 F+ ++ N WK +Q T + S E + S Sbjct: 160 EFHKLWTIGQNKDLQRKGQWKSWQFVTADSPFVPSAEIEAAKNDMDPKS 208 >gi|293604595|ref|ZP_06686998.1| phage terminase large subunit [Achromobacter piechaudii ATCC 43553] gi|292817011|gb|EFF76089.1| phage terminase large subunit [Achromobacter piechaudii ATCC 43553] Length = 463 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 33/209 (15%), Positives = 61/209 (29%), Gaps = 10/209 (4%) Query: 147 SLHPSGWYAEL-LEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPD 205 + +GW E + S + + G + +E G + Sbjct: 87 KIEAAGWRDEFDIGVSTIRHKLTGSEFLFYGLARNIEEIKG--TEGVDVCWIEEGEGLTE 144 Query: 206 IINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRY--QIDTRTVEGIDS 263 I + + + N + F L I+ + + Sbjct: 145 EQWSIIDPTIRKEGAEVWVLW--NPHLITD-FVQAKLPALLGADCIIRHINYPDNPFLSA 201 Query: 264 GFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIMG 321 D D R LGQ + + I ++IE A+ + +L +G Sbjct: 202 TAKRKAERLKEADPDAYRHIYLGQPLSSDDASVIKFHWIEAAVDAHLKLGIELGGARTVG 261 Query: 322 CDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 D+A G DK G I + + +W+A Sbjct: 262 YDVADSGADKNACSVFDGAICDELDEWAA 290 >gi|18466735|ref|NP_569542.1| hypothetical protein HCM2.0070c [Salmonella enterica subsp. enterica serovar Typhi str. CT18] gi|16506051|emb|CAD09937.1| hypothetical protein [Salmonella enterica subsp. enterica serovar Typhi str. CT18] Length = 418 Score = 50.5 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 39/229 (17%), Positives = 75/229 (32%), Gaps = 37/229 (16%) Query: 59 MEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSET 118 + V +H +P FK + AGR GK+ L+ ++ + + +A + Sbjct: 7 LSLVQLHSGQMQVFQSPHRFKV-VCAGRRWGKSRLSISTIIRAAAKEKKQRVWYVAPTYQ 65 Query: 119 QLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSE 178 + LW ++ + P W + + +M I K+ + + Sbjct: 66 MARQILWDDLQ-----------------EVLPRKWVRKKNDTTMTIVLKNGSEIALKGA- 107 Query: 179 ERPDTFVG--PHNTHGMAVFNDEASGT-PDIINKSILGFFTELNPNRFWIMTSNTRRLNG 235 ++PDT G H V DE PD K + + ++ + + Sbjct: 108 DKPDTLRGVALH-----FVVLDEFQDMKPDTWYKVLRPTLSSTRGGA--LIIGTPKGFS- 159 Query: 236 WFYDIF----NIP---LEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDS 277 F+ ++ N WK +Q T + S E + S Sbjct: 160 EFHKLWTIGQNKDLQRKGQWKSWQFVTADSPFVPSAEIEAAKNDMDPKS 208 >gi|161525001|ref|YP_001580013.1| hypothetical protein Bmul_1828 [Burkholderia multivorans ATCC 17616] gi|189350256|ref|YP_001945884.1| bacteriophage TerL protein [Burkholderia multivorans ATCC 17616] gi|160342430|gb|ABX15516.1| conserved hypothetical protein [Burkholderia multivorans ATCC 17616] gi|189334278|dbj|BAG43348.1| bacteriophage TerL protein [Burkholderia multivorans ATCC 17616] Length = 531 Score = 50.5 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 36/199 (18%), Positives = 61/199 (30%), Gaps = 29/199 (14%) Query: 186 GPHNTHGMAVFNDEASGTP---------------DIINKSI---LGFFTELNPNRFWIM- 226 G H H +F D S I+++S + + + Sbjct: 166 GTHAPHMRIIFPDTGSVITGESGDGIGRGDRASFYIVDESAFLERPQLVDASLSATTNCR 225 Query: 227 --TSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI 284 S + F K + R D ++ + LD V EI Sbjct: 226 QDISTPNGMGNSFAQ--RRHSGKVKVFTFHWRDDPRKDDAWYAKQCAE--LDPVVVAQEI 281 Query: 285 LGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNII 342 + IP +++ A+ + + G D+A EG DK R G ++ Sbjct: 282 DINYAASVEGVVIPSAWVQAAIGAHLKLGIEPSGTRRGGLDVADEGKDKNAFAGRYGFLL 341 Query: 343 EHIFDWSAK--LIQETNQE 359 + WS K I ET ++ Sbjct: 342 NFLRSWSGKGGDIYETVEK 360 >gi|152982949|ref|YP_001353896.1| hypothetical protein mma_2206 [Janthinobacterium sp. Marseille] gi|151283026|gb|ABR91436.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille] Length = 436 Score = 50.1 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 40/243 (16%), Positives = 72/243 (29%), Gaps = 33/243 (13%) Query: 113 IANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTIT 172 +A Q K+ W V ++ +++P E + +P+G +L Sbjct: 62 VAPFYRQAKSVAWDYVKRFSAVIPGISINESELRIDYPNGSRIQLFG------------- 108 Query: 173 CRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINK-SILGFFTELNPNRFWIMTSNTR 231 + D G V DE + I + + ++ + Sbjct: 109 -----ADNADALRGLFFDG---VVADEYGDWKPSVWGYVIRPALADRGG--WAVIIGTPK 158 Query: 232 RLNGWFYDIFNIP--LEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFP 289 N F++I+ EDW I + E + D+ R E+ F Sbjct: 159 GRN-QFWEIYQHAGVNEDWLCLTIRASESGLLPPKEIEALQLELTEDA--WRQEMECDFD 215 Query: 290 QQEVNNFIPHNYIEEAMSREAIDDLYAP---LIMGCDIAGEGGDKTVVVFRRGNIIEHIF 346 + DDLY P + D+ G D + F+ G + I Sbjct: 216 AALPGAIFGKEIWQAEQDGRVKDDLYDPELKVHAVLDL-GFTDDTAIWWFQVGKELRIID 274 Query: 347 DWS 349 +S Sbjct: 275 CYS 277 >gi|158337379|ref|YP_001518554.1| hypothetical protein AM1_4258 [Acaryochloris marina MBIC11017] gi|158307620|gb|ABW29237.1| conserved domain protein [Acaryochloris marina MBIC11017] Length = 476 Score = 50.1 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 30/174 (17%), Positives = 59/174 (33%), Gaps = 30/174 (17%) Query: 195 VFNDEASGTPDI--INKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRY- 251 + DEA+ ++ + +++ + I+ S +G F+D N Sbjct: 171 ILFDEAAFQTNLKLSLSAATPAMSQVGSDARIILCSTPNGASGHFFDTLNGFDNCVSDIE 230 Query: 252 QIDTRTVEGIDSGFHEG--------IISRYGLDSDVARIEILGQF--PQQEVNNFIPHNY 301 +I + + ++ E S YG D+ ++ P+ ++ + Sbjct: 231 RIRSGELPPVNKWQREDGNIAIAIHWKSVYG-DNPSYLEDLEKSLSLPKAQIAQEYDLSL 289 Query: 302 IEE-------AMSREAIDDLYAP-------LIMGCDIAGEGGDK--TVVVFRRG 339 E A+ R A Y P +G D AG G D +V + + G Sbjct: 290 TESSSVVFSFAVVRAAATGEYEPQFTEDELYYVGVDPAGSGADYFCSVFLKKTG 343 >gi|163716617|gb|ABY40529.1| putative TerL [Burkholderia phage Bups phi1] Length = 531 Score = 50.1 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 57/188 (30%), Gaps = 27/188 (14%) Query: 186 GPHNTHGMAVFNDEASGTP---------------DIINKSI---LGFFTELNPNRFWIM- 226 G H H +F D S ++++S + + + Sbjct: 166 GTHAPHMRIIFPDTGSVITGESGDGIGRGDRASFYVVDESAFLERPQLVDASLSATTNCR 225 Query: 227 --TSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI 284 S + F K + R D ++ ++ LD V EI Sbjct: 226 QDISTPNGMGNSFAQ--RRHSGKIKVFTFHWRDDPRKDDAWYAKQVAE--LDPVVVAQEI 281 Query: 285 LGQFPQQEVNNFIPHNYIEEAMSREAID--DLYAPLIMGCDIAGEGGDKTVVVFRRGNII 342 + IP +++ A+ + G D+A EG DK R G ++ Sbjct: 282 DINYAASVEGVVIPSAWVQAALGAHVKLGIEPSGTRRGGLDVADEGKDKNAFAGRYGFLL 341 Query: 343 EHIFDWSA 350 EH+ WS Sbjct: 342 EHLESWSG 349 >gi|304360765|ref|YP_003856886.1| gp8 [Mycobacterium phage Angelica] gi|302858349|gb|ADL71097.1| gp8 [Mycobacterium phage Angelica] Length = 473 Score = 49.7 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 52/332 (15%), Positives = 100/332 (30%), Gaps = 57/332 (17%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSII 111 +WQ + + V + ++ +F +I R GKT ++ PG ++I Sbjct: 43 DQWQDDLGKLVCAKRSDGLYAAD--MFAMSI--PRQTGKTYFLGAIVFAFCKMNPGTTVI 98 Query: 112 CIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTI 171 A+ + AE K + L R L++H G ++ +T Sbjct: 99 WTAH-----RTRTAAETFKSMQALAKREQIAPHILNVH----------TGNGKEAVLFTN 143 Query: 172 TCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSN-- 229 R R F G + DEA + ++ T +PN + Sbjct: 144 GSRILFGAREKGF-GRGFAKVDVLIFDEAQILSENAMDDMIPA-TNASPNGLILFAGTPP 201 Query: 230 -TRRLNGWF---------------------YDIFNIPLEDWKRYQIDTRTVEGIDSGFHE 267 F D + P E+ +++ + Sbjct: 202 KPTDPGEVFTNLRMDALNGESDDVAYVEISADENDDPDEESTWRKMNPSYPHRTSARAIR 261 Query: 268 GIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYA-----PLIMGC 322 + DS R E +G + + V+ I+ + R+ D L P +G Sbjct: 262 RMRKALSWDS--FRREAMGIWDKISVHA----QVIKAGLWRDLADPLGPEPGAKPASLGV 315 Query: 323 DIAGEGGDKTVVVFRRGNIIEHIFD-WSAKLI 353 D++ G + + + H+ W+ Sbjct: 316 DMSHGGAISIGGCWLIDDELRHVEQVWAGTDT 347 >gi|167725769|ref|ZP_02409005.1| hypothetical protein BpseD_42528 [Burkholderia pseudomallei DM98] Length = 517 Score = 49.7 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 57/188 (30%), Gaps = 27/188 (14%) Query: 186 GPHNTHGMAVFNDEASGTP---------------DIINKSI---LGFFTELNPNRFWIM- 226 G H H +F D S ++++S + + + Sbjct: 152 GTHAPHMRIIFPDTGSVITGESGDGIGRGDRASFYVVDESAFLERPQLVDASLSATTNCR 211 Query: 227 --TSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI 284 S + F K + R D ++ ++ LD V EI Sbjct: 212 QDISTPNGMGNSFAQ--RRHSGKIKVFTFHWRDDPRKDDAWYAKQVAE--LDPVVVAQEI 267 Query: 285 LGQFPQQEVNNFIPHNYIEEAMSREAID--DLYAPLIMGCDIAGEGGDKTVVVFRRGNII 342 + IP +++ A+ + G D+A EG DK R G ++ Sbjct: 268 DINYAASVEGVVIPSAWVQAALGAHVKLGIEPSGTRRGGLDVADEGKDKNAFAGRYGFLL 327 Query: 343 EHIFDWSA 350 EH+ WS Sbjct: 328 EHLESWSG 335 >gi|161614489|ref|YP_001588454.1| hypothetical protein SPAB_02238 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] gi|161363853|gb|ABX67621.1| hypothetical protein SPAB_02238 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] Length = 441 Score = 49.7 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 30/166 (18%), Positives = 59/166 (35%), Gaps = 13/166 (7%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P ED + Sbjct: 96 VLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEDTLIRK 153 Query: 253 IDTRTVEGIDS---GFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS- 307 I+ + + RY + + P+ + + I ++IE A+ Sbjct: 154 INYDENPFLSDTMLKVIDAARRRYPEG----FVHVYEGVPESDDDAAIIKLSWIEAAVDA 209 Query: 308 -REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 + +G D+A G DK V+R G++I +W AK Sbjct: 210 HKVLDFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKE 255 >gi|326782381|ref|YP_004322781.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-ShM2] gi|310003329|gb|ADO97726.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-ShM2] Length = 362 Score = 49.7 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 36/256 (14%), Positives = 86/256 (33%), Gaps = 42/256 (16%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL 148 GK+T+ +LW + +++ +AN + L + + + Q + + Sbjct: 85 GKSTIVTSYLLWYVIFNDNVNVAILANKAATSREML----QRLQRSYENLPKWLQQGI-V 139 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDII- 207 + EL S + + + R S +F DE + P+ I Sbjct: 140 QWNRGSLELENGSKIMAASTSSSAVRGMSFN--------------VIFLDEFAFVPNHIA 185 Query: 208 ---NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN---IPLEDWKRYQIDTRTVEGI 261 S+ + + I+ S +N FY +++ ++ ++ V G Sbjct: 186 DEFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDSERKKNEYISTEVHWSEVPGR 243 Query: 262 DSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAI-----DDLYA 316 D+ + I+ ++E +F V+ I + + + + + +Y Sbjct: 244 DAKWKAQTIANTSEQQ--FKVEFECEF-LGSVDTLISPSKLRTMVYNDPLVQNKGLSIYE 300 Query: 317 PL------IMGCDIAG 326 + ++ D+A Sbjct: 301 HVQKDHNYVITVDVAR 316 >gi|61806000|ref|YP_214360.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-SSM2] gi|61374509|gb|AAX44506.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-SSM2] gi|265525210|gb|ACY76007.1| terminase large subunit gp17 [Prochlorococcus phage P-SSM2] Length = 547 Score = 49.7 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 49/266 (18%), Positives = 86/266 (32%), Gaps = 43/266 (16%) Query: 88 IGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLS 147 GK+T +L +++ +AN + ++ L L + MQ Sbjct: 82 TGKSTTCISYLLHYAVFNDNVNVAVLANKASTARDLLGR-----LQLAYENLPRWMQQGI 136 Query: 148 LHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDII 207 + + EL S + + R S +F DE + P+ I Sbjct: 137 ISWNKGSLELENGSKISANSTSSSAVRGGSYN--------------VIFLDEFAFIPNHI 182 Query: 208 ----NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN---IPLEDWKRYQIDTRTVEG 260 S+ T + I+ S R +N FY +++ ++ + V G Sbjct: 183 ADDFFASVYPTITS-GQSTKVIIVSTPRGMN-HFYRMWHDSEKGKSEYVATDVHWSEVPG 240 Query: 261 IDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYI-----EEAMSREAIDDLY 315 D + E I+ +IE +F VN I + E +R A D+Y Sbjct: 241 RDEEWKEQTIANTSEQQ--FKIEFECEF-LGSVNTLINPAKLRNLVYEAPKTRNAGLDIY 297 Query: 316 APL------IMGCDIAGE-GGDKTVV 334 I+ D+A G D + Sbjct: 298 ETPVKEHNYIITVDVARGLGNDYSAF 323 >gi|310815629|ref|YP_003963593.1| Putative large terminase [Ketogulonicigenium vulgare Y25] gi|308754364|gb|ADO42293.1| Putative large terminase [Ketogulonicigenium vulgare Y25] Length = 427 Score = 49.7 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 49/297 (16%), Positives = 90/297 (30%), Gaps = 49/297 (16%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGM---------SIICIANSETQLKNTLWAEVSKWL 132 I GRG GKT A W+ S G + IA + Q + + S + Sbjct: 36 IMGGRGAGKTRAGA---EWVRSMVEGPRPDTPGRAKRVGLIAQTMDQAREVMVFGDSGLM 92 Query: 133 SMLP---HRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHN 189 + P W +++ P+G R +S P+ GP Sbjct: 93 ACCPPARRPEWIAGRAMLRWPNG------------------AEARLFSAHDPEALRGPQF 134 Query: 190 THGMAVFNDEASG--TPDIINKSI-LGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLE 246 A++ DE + + +G +P T G F Sbjct: 135 D---AIWADEVAKWRLAQEAWDMLVMGLRLGDDPRA----CLTTTPRGGPFLRKLLAQSG 187 Query: 247 DWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAM 306 + + GF + + + S + R E+ G + P + ++ A+ Sbjct: 188 TVMTHAPTRANRANLAPGFVAAVEAMF-EGSHLGRQELDGLLVDEAEGTLWPQHLLDAAL 246 Query: 307 SREAIDDLYAPLIMGCDI---AGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQEG 360 R+A +++ D G D ++ DW +I++ +G Sbjct: 247 QRQA--PPLDRIVVAVDPPVTGHAGSDACGIIVAGVEQRGAPTDWRLWVIEDATVQG 301 >gi|166012063|ref|ZP_02232961.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. E1979001] gi|167427125|ref|ZP_02318878.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|2996304|gb|AAC13184.1| P-loop protein [Yersinia pestis KIM 10] gi|165988997|gb|EDR41298.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. E1979001] gi|167053876|gb|EDR63708.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis str. K1973002] Length = 402 Score = 49.7 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 37/215 (17%), Positives = 71/215 (33%), Gaps = 37/215 (17%) Query: 73 SNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWL 132 +P FK + AGR GK+ L+ ++ + + +A + + LW ++ Sbjct: 5 QSPHRFKV-VCAGRRWGKSRLSISTIIRAAAKEKKQRVWYVAPTYQMARQILWDDLQ--- 60 Query: 133 SMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVG--PHNT 190 + P W + + +M I K+ + + ++PDT G H Sbjct: 61 --------------EVLPRKWVRKKNDTTMTIVLKNGSEIALKGA-DKPDTLRGVALH-- 103 Query: 191 HGMAVFNDEASGT-PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF----NIP- 244 V DE PD K + + ++ + + F+ ++ N Sbjct: 104 ---FVVLDEFQDMKPDTWYKVLRPTLSSTRGGA--LIIGTPKGFS-EFHKLWTIGQNKDL 157 Query: 245 --LEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDS 277 WK +Q T + S E + S Sbjct: 158 QRKGQWKSWQFVTADSPFVPSAEIEAAKNDMDPKS 192 >gi|148256282|ref|YP_001240867.1| hypothetical protein BBta_4946 [Bradyrhizobium sp. BTAi1] gi|146408455|gb|ABQ36961.1| hypothetical protein BBta_4946 [Bradyrhizobium sp. BTAi1] Length = 482 Score = 49.3 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 50/234 (21%), Positives = 87/234 (37%), Gaps = 22/234 (9%) Query: 105 RPGMS--IICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSM 162 RPG ++C+A Q + L ++ ++ L+ + AE E S Sbjct: 114 RPGERALVMCLACDRAQARIIL---------NYIRSYFTDLPLLAGMVTRETAEGFELSN 164 Query: 163 GIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDI-INKSILGFFTELNPN 221 G+D T + R RP +A + DE + PD + ++I L N Sbjct: 165 GVDVAVATNSFRAVR-GRPILLAVL---DEVAFWRDENTAKPDEELYRAITPAMATL-SN 219 Query: 222 RFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISR-YGLDSDVA 280 I S+ R +G Y F + ++ + II R D A Sbjct: 220 SMIIGISSPYRKSGLLYKKFKSHFGKDGDVLVIQAPTRTLNPTIPQEIIDRALAEDPAAA 279 Query: 281 RIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMG---CDIAGEGGDK 331 E +G+F + ++ ++P IE A+ + + P+ + CD +G GD Sbjct: 280 SAEWMGEF-RDDIGGWLPLEVIESAVDQGVMVRPPQPIHIYRSFCDPSGARGDS 332 >gi|294650848|ref|ZP_06728195.1| bacteriophage terminase large subunit TerL [Acinetobacter haemolyticus ATCC 19194] gi|292823266|gb|EFF82122.1| bacteriophage terminase large subunit TerL [Acinetobacter haemolyticus ATCC 19194] Length = 552 Score = 49.3 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 39/245 (15%), Positives = 78/245 (31%), Gaps = 29/245 (11%) Query: 128 VSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGP 187 K+ M + + P+G+ ++ + M I + T + + G Sbjct: 155 FHKFRDMFSKMPDW------MKPTGFVEKVHDNYMRIINPDNGATITGEAGDNI----GR 204 Query: 188 HNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED 247 M DE + +++ ++ N N I S + F+ + Sbjct: 205 GGRTTMYFL-DEWAFVER--QEAVDAAISQ-NTNVH-IKGSTPNGIGDRFHQ--DRFSGR 257 Query: 248 WKRYQIDTRTVEG------IDSGFHEGI-ISRYGLDSDVARI-EILGQFPQQEVNNFIPH 299 + + + R + + DV E+ + IP Sbjct: 258 YAVFSMPWRANPDKNWTVEYNGKQIHPWYEKQLATLDDVVLAQEVDINYAASVEGVLIPS 317 Query: 300 NYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAK--LIQE 355 +++ A+ + + I G D+A EG DK R G ++ ++ WS K I Sbjct: 318 TWVQLAIDAHIKLGIEPTGDRIAGLDVADEGKDKNSFASRHGIVMTYLDTWSGKGDDIFG 377 Query: 356 TNQEG 360 T Q+ Sbjct: 378 TTQKA 382 >gi|158300801|ref|XP_320633.4| AGAP011893-PA [Anopheles gambiae str. PEST] gi|157013336|gb|EAA00145.5| AGAP011893-PA [Anopheles gambiae str. PEST] Length = 607 Score = 49.3 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 48/283 (16%), Positives = 93/283 (32%), Gaps = 28/283 (9%) Query: 3 RLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAV 62 R + + L + +++ A +S +F FP KP ++ W + Sbjct: 151 RSLKIEFPLNRLQYKLEYTALVHMSRLDFSSILFPKIESAKP-TTPAKTFDWFQSCIAEN 209 Query: 63 DVHCHSNVN--NSNPTIFKCAISAGRGIGKTT--LNAWMMLWLISTRPGMSIICIANS-- 116 + + N N + G GKT + A + +W + RP I+ A S Sbjct: 210 EQQTQAIKNIVNRTAYPAPYILFGPPGTGKTCTIVEAVLQIWKM--RPKSRILVTATSNY 267 Query: 117 ------ETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQ-SMGIDSKHY 169 + LK ++ ++ S R M + S + + E +M + Sbjct: 268 ACNELAKRLLKYVTVNDLFRYFSQTSQRDINGMDLKVVQVSNMHYGIYETPAMQDFVQTR 327 Query: 170 TITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFF-TELNPNRF---WI 225 + C + R G + +F DE ++ +G T+ NR + Sbjct: 328 ILVCTVMTSGRLLQL-GVDRSMYDYIFIDECGSCRELSALVPIGCVGTDTTNNRLQASVV 386 Query: 226 MTSNTRRLNGWFYDIFNIPLED-----W--KRYQIDTRTVEGI 261 + + +L FYD D W + + R + + Sbjct: 387 LAGDPLQLGPQFYDAELRAKGDPTITHWAVNWHHLPNRKLPML 429 >gi|238027169|ref|YP_002911400.1| hypothetical protein bglu_1g15550 [Burkholderia glumae BGR1] gi|237876363|gb|ACR28696.1| Hypothetical protein bglu_1g15550 [Burkholderia glumae BGR1] Length = 531 Score = 49.3 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 56/188 (29%), Gaps = 27/188 (14%) Query: 186 GPHNTHGMAVFNDEASGTP---------------DIINKSI---LGFFTELNPNRFWIM- 226 G H H +F D S ++++S + + + Sbjct: 166 GTHAPHMRIIFPDTGSVITGESGDGIGRGDRASFYVVDESAFLERPQLVDASLSATTNCR 225 Query: 227 --TSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI 284 S + F K + R D ++ ++ LD V EI Sbjct: 226 QDISTPNGMGNSFAQ--RRHSGKIKVFTFHWRDDPRKDDAWYAKQVAE--LDPVVVAQEI 281 Query: 285 LGQFPQQEVNNFIPHNYIEEAMSREAID--DLYAPLIMGCDIAGEGGDKTVVVFRRGNII 342 + IP +++ A+ G D+A EG DK R G ++ Sbjct: 282 DINYAASVEGVVIPSAWVQAALGAHVKLGISPSGARRGGLDVADEGKDKNAFAGRYGFLL 341 Query: 343 EHIFDWSA 350 EH+ WS Sbjct: 342 EHLESWSG 349 >gi|213161040|ref|ZP_03346750.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. E00-7866] Length = 421 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 60/163 (36%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P ED + Sbjct: 76 VLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEDTLIRK 133 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D + + P+ + + I ++IE A+ + Sbjct: 134 INYDENPFLSDTMLKVIDAARRRD-PEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKV 192 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 +G D+A G DK V+R G++I +W AK Sbjct: 193 LDFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKE 235 >gi|213029404|ref|ZP_03343851.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. 404ty] Length = 282 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 60/163 (36%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P ED + Sbjct: 75 VLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEDTLIRK 132 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D + + P+ + + I ++IE A+ + Sbjct: 133 INYDENPFLSDTMLKVIDAARRRD-PEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKV 191 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 +G D+A G DK V+R G++I +W AK Sbjct: 192 LDFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKE 234 >gi|16759908|ref|NP_455525.1| prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. CT18] gi|29142320|ref|NP_805662.1| prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. Ty2] gi|213583175|ref|ZP_03365001.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. E98-0664] gi|213647535|ref|ZP_03377588.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. J185] gi|213855100|ref|ZP_03383340.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. M223] gi|25512685|pir||AF0621 probable prophage terminase large chain STY1047 [imported] - Salmonella enterica subsp. enterica serovar Typhi (strain CT18) gi|16502201|emb|CAD05440.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi] gi|29137950|gb|AAO69511.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. Ty2] Length = 467 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 60/163 (36%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P ED + Sbjct: 122 VLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEDTLIRK 179 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D + + P+ + + I ++IE A+ + Sbjct: 180 INYDENPFLSDTMLKVIDAARRRD-PEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKV 238 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 +G D+A G DK V+R G++I +W AK Sbjct: 239 LDFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKE 281 >gi|213423381|ref|ZP_03356369.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. E01-6750] Length = 414 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 60/163 (36%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P ED + Sbjct: 69 VLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEDTLIRK 126 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D + + P+ + + I ++IE A+ + Sbjct: 127 INYDENPFLSDTMLKVIDAARRRD-PEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKV 185 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 +G D+A G DK V+R G++I +W AK Sbjct: 186 LDFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKE 228 >gi|169344384|ref|ZP_02865357.1| phage terminase, large subunit, pbsx family [Clostridium perfringens C str. JGS1495] gi|169297509|gb|EDS79616.1| phage terminase, large subunit, pbsx family [Clostridium perfringens C str. JGS1495] Length = 415 Score = 49.0 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 48/284 (16%), Positives = 95/284 (33%), Gaps = 33/284 (11%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 G G GK+ M++ PG + + + LK +++A L W Sbjct: 31 GGGGSGKSHFVVQKMIYKYLKYPGRKCLVVRKVNSTLKESIFA-----LFRSVLSDWQIY 85 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 ++ + EL +S+ I E+ + G + + +E + Sbjct: 86 DECKINKTDLTIELPNKSLFIFKGIDD-------PEKIKSIAGIDD-----IVVEECTEI 133 Query: 204 PDIINKSILGFFTELNP-NRFWIMTSNTRRLNGWFYDIFNI---PLEDWKRYQIDTRTVE 259 + + NP N+ +M N + W Y + +D + + Sbjct: 134 DEFDFDQLNLRLRSKNPYNQIHVMF-NPVSKSNWVYKRWFKNGYDTKDTIVLHTTYKNNK 192 Query: 260 GIDSGFHEGIISRYGLDSDVA-RIEILGQFPQQEVNNFIPHNYIEEAMSREAID--DLYA 316 + + + ++ + D+ V RI LG+F ++ I N+ EE+ + I + Sbjct: 193 FLPKDYIDSLL-KLEKDNPVYFRIYALGEF--ATLDKLIYTNWKEESFDYKEILKNNRNT 249 Query: 317 PLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQEG 360 I D G D T V + I + E ++G Sbjct: 250 KAIFSLDF-GYTNDPTAFVCSIIDKINKKL----WIFDEFQEKG 288 >gi|296141561|ref|YP_003648804.1| terminase [Tsukamurella paurometabola DSM 20162] gi|296029695|gb|ADG80465.1| Terminase [Tsukamurella paurometabola DSM 20162] Length = 489 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 51/359 (14%), Positives = 98/359 (27%), Gaps = 68/359 (18%) Query: 20 MHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFK 79 + +E L+F + +R KG + WQ++ V + Sbjct: 22 VESERFLAFADKFLRV----PKGTGAKGKLHLRDWQVDVARDV----------LDSGART 67 Query: 80 CAISAGRGIGKTTLNAWMMLW-LISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHR 138 I RG GKTTLNA + L+ + G ++ +A E Q L+ R Sbjct: 68 VGIMFPRGQGKTTLNAAIALYRFFTGGEGANVCVVAVDERQAG----------LAFSAAR 117 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFND 198 E+ + + + + + C S P G + D Sbjct: 118 RMVELNEELSARCQIFKD----RLYLPTTDSVFQCLPAS---PTALEGL---DYVLALVD 167 Query: 199 EASGTPDIINKSILGFFTELNPNRFWIMTSNTRRL-------NGWFYDIFNIPLE--DWK 249 EA + + + + + Y + + W+ Sbjct: 168 EAGVVNRDVFEVVQLA-QGKREKSVLVAIGTPGPNLDDQVLLSLRDYHLEHPDDASLRWR 226 Query: 250 RYQIDTRTV---------EGIDSGFHEGIISRY--------GLDSDVARIEILGQFPQQE 292 + E + + + +S R + QF Sbjct: 227 EFSAAGFEDHPVDCTHCWELANPALDDFLHRDALVALLPPKTRESTFRRARLC-QFAADT 285 Query: 293 VNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVF---RRGNIIEHIFDW 348 +F+P E + E + L A +++ D D T ++ + W Sbjct: 286 EGSFLPAGVWEGLSTGEPVP-LGAEVVIALD-GSFSDDTTALLLGTVAAAPHFHPLRVW 342 >gi|85058727|ref|YP_454429.1| phage terminase large subunit [Sodalis glossinidius str. 'morsitans'] gi|84779247|dbj|BAE74024.1| phage terminase large subunit [Sodalis glossinidius str. 'morsitans'] Length = 456 Score = 48.6 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/164 (14%), Positives = 54/164 (32%), Gaps = 5/164 (3%) Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDT 255 + +EA ++ + W+ + L+ + PL+D + Sbjct: 116 WVEEAEAVTKESWDILIPTIRKPGSE-IWVSFNPKNILDDTYQRFVVNPLDDICLLTVHY 174 Query: 256 RTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNYIEEAMS--REAID 312 + D D+ G+ P + + I +I A+ Sbjct: 175 TDNPHFPEVLRLEMEECKCKDYDLYLHIWEGE-PVADSDLAIIKPLWIAAAVDAHITLGF 233 Query: 313 DLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQET 356 + +G D+A EG D ++ G+++ H+ W+ + ++ Sbjct: 234 EPAGKKRIGFDVADEGEDSNALILSHGSVVMHLETWNKGDVIQS 277 >gi|89071120|ref|ZP_01158320.1| Putative large terminase [Oceanicola granulosus HTCC2516] gi|89043331|gb|EAR49553.1| Putative large terminase [Oceanicola granulosus HTCC2516] Length = 444 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 45/284 (15%), Positives = 84/284 (29%), Gaps = 26/284 (9%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWF 141 I GRG GKT A W+ + G + + L E ++ R Sbjct: 58 ILGGRGAGKTRAGA---EWVRAQVEGPR--ATDPGRAR-RVALVGE-----TIDQAREVM 106 Query: 142 EMQSLSLHPSGWYAELLEQSMGIDSKHY--TITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 L E G + + +S P+ GP A + DE Sbjct: 107 VFGDSGLLACAPPDRRPEWIAGRRLLVWPNGAQAQLFSAHDPEALRGPQFD---AAWVDE 163 Query: 200 AS--GTPDIINKSILGFF-TELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTR 256 + + + +P + T R + + + Sbjct: 164 LAKWKKAEEAWDMLQLALRLGDDPR---CCVTTTPRPTALMRALLERD-GTARTHAPTEA 219 Query: 257 TVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYA 316 + F + RY S + R E+ G + I A + + + DL+ Sbjct: 220 NAANLARAFLAEVRRRY-AGSPLGRQELDGVMLSEIEGALWSAGAI-AAANCDVVPDLHR 277 Query: 317 PLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQEG 360 +++ D + GGD +V +W A ++++ + G Sbjct: 278 -VVVAVDPSAGGGDVCGIVVAGACYDGGADNWRAWVLEDASVAG 320 >gi|289805729|ref|ZP_06536358.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. AG3] Length = 257 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 60/163 (36%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P ED + Sbjct: 82 VLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEDTLIRK 139 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D + + P+ + + I ++IE A+ + Sbjct: 140 INYDENPFLSDTMLKVIDAARRRD-PEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKV 198 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 +G D+A G DK V+R G++I +W AK Sbjct: 199 LDFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKE 241 >gi|213618708|ref|ZP_03372534.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. E98-2068] Length = 282 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 60/163 (36%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P ED + Sbjct: 122 VLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEDTLIRK 179 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D + + P+ + + I ++IE A+ + Sbjct: 180 INYDENPFLSDTMLKVIDAARRRD-PEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKV 238 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 +G D+A G DK V+R G++I +W AK Sbjct: 239 LDFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKE 281 >gi|170748408|ref|YP_001754668.1| hypothetical protein Mrad2831_1990 [Methylobacterium radiotolerans JCM 2831] gi|170654930|gb|ACB23985.1| conserved hypothetical protein [Methylobacterium radiotolerans JCM 2831] Length = 478 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 62/195 (31%), Gaps = 22/195 (11%) Query: 153 WYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPD-IINKSI 211 +E + G+D + +T +T G +A ++ E S PD +I ++ Sbjct: 144 PTSETIRLLSGVDIEVRPANYKTIRG---ETLAGCLADE-VAFWHLENSANPDTLILDAV 199 Query: 212 LGFF-TELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQI-----DTRTV-EGIDSG 264 T P + S+ G Y + ++T+ +D Sbjct: 200 RPGLATTGGP---LCVLSSPYARKGELYRTHQRDFGPSGDPAVLVLRAPSQTMNPSLDPA 256 Query: 265 FHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS---REAIDDLYAPLIMG 321 + Y D A E +F + +V FI ++ M+ E Sbjct: 257 VVK---RAYTRDPAAASAEYGAEF-RADVEAFISLEAVQACMAGDLLERAPAPGLTYQAF 312 Query: 322 CDIAGEGGDKTVVVF 336 CD +G G D + Sbjct: 313 CDPSGGGADSMTLAI 327 >gi|304360860|ref|YP_003856980.1| gp8 [Mycobacterium phage CrimD] gi|302858609|gb|ADL71354.1| gp8 [Mycobacterium phage CrimD] Length = 473 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 52/329 (15%), Positives = 99/329 (30%), Gaps = 51/329 (15%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSII 111 +WQ + + V + ++ +F +I R GKT ++ L PG ++I Sbjct: 43 DQWQDDLGKLVCAKRSDGLYAAD--MFAMSI--PRQTGKTYFLGAIVFALCKMTPGTTVI 98 Query: 112 CIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTI 171 A+ + AE K + L R L++H G ++ +T Sbjct: 99 WTAH-----RTRTAAETFKSMQALAKREQIAPHILNVH----------TGNGKEAVLFTN 143 Query: 172 TCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSN-- 229 R R F G + DEA + ++ T +PN + Sbjct: 144 GSRILFGAREKGF-GRGFAKVDVLIFDEAQILSENAMDDMVPA-TNASPNGLILFAGTPP 201 Query: 230 -TRRLNGWF---------------------YDIFNIPLEDWKRYQIDTRTVEGIDSGFHE 267 F D + P E+ +++ + Sbjct: 202 KPTDPGEVFTNLRLDAINGESDDVAYVEISADENDDPDEESTWRKMNPSYPHRTSARAIR 261 Query: 268 GIISRYGLDSDVARIEILGQFPQQEVN-NFI-PHNYIEEAMSREAIDDLYAPLIMGCDIA 325 + DS R E +G + + V+ I P + + A P +G D++ Sbjct: 262 RMRKALSWDS--FRREAMGIWDKISVHAQVIKPSLWRDLADPLGPEPGAK-PASLGVDMS 318 Query: 326 GEGGDKTVVVFRRGNIIEHIFD-WSAKLI 353 G + + + H+ W+ Sbjct: 319 HGGAISIGGCWLIDDELRHVEQVWAGTDT 347 >gi|168699883|ref|ZP_02732160.1| hypothetical protein GobsU_10183 [Gemmata obscuriglobus UQM 2246] Length = 205 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 22/107 (20%), Positives = 34/107 (31%), Gaps = 14/107 (13%) Query: 179 ERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFY 238 + + VG V DE S D + KS+ + S GWF+ Sbjct: 63 DSQEGVVGFSA--PRLVVIDEGSRVSDELYKSVRPMLAVSKGQ--LLTLSTPFGNQGWFF 118 Query: 239 DIFNIPLED----------WKRYQIDTRTVEGIDSGFHEGIISRYGL 275 DI++ E W+R + + I F E + G Sbjct: 119 DIWDDSAEGLKRRSKLHEPWQRTAVPASQIPRITPEFLEDERAELGE 165 >gi|213426918|ref|ZP_03359668.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. E02-1180] Length = 374 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 60/163 (36%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P ED + Sbjct: 29 VLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEDTLIRK 86 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D + + P+ + + I ++IE A+ + Sbjct: 87 INYDENPFLSDTMLKVIDAARRRD-PEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKV 145 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 +G D+A G DK V+R G++I +W AK Sbjct: 146 LDFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKE 188 >gi|317120885|ref|YP_004100888.1| hypothetical protein Tmar_0036 [Thermaerobacter marianensis DSM 12885] gi|315590865|gb|ADU50161.1| hypothetical protein Tmar_0036 [Thermaerobacter marianensis DSM 12885] Length = 410 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 59/263 (22%), Positives = 95/263 (36%), Gaps = 36/263 (13%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWF 141 I AGRG GKT A + + I + + +++ + S LS+ P Sbjct: 36 ILAGRGFGKTRTGAEWVREQVERHGRRRIAIVGRTAADVRDVMVEGESGILSISP----- 90 Query: 142 EMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE-- 199 W+ + E S + YS + PD GP + A + DE Sbjct: 91 ----------PWFRPVYEPSKRRLTWPNGAIATLYSADEPDLLRGPQHD---AAWADELA 137 Query: 200 ASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVE 259 A P+ + + G +P ++ + T R D+ N P T E Sbjct: 138 AWRRPEAWDNLMFGLRLGPDPR---VVVTTTPRPVKLIRDLLNDP----TCVVTRGSTYE 190 Query: 260 ---GIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYA 316 + F E IISRY + + R E+ G+ I+E REA + + Sbjct: 191 NAANLAPAFLEQIISRY-EGTRLGRQELYGEVLDDVPGALWQRKRIDELRVREAPELVR- 248 Query: 317 PLIMGCDIA---GEGGDKTVVVF 336 +++ D A EG D+T +V Sbjct: 249 -VVVAIDPAVTSEEGSDETGIVV 270 >gi|289829424|ref|ZP_06547036.1| putative prophage terminase large subunit [Salmonella enterica subsp. enterica serovar Typhi str. E98-3139] Length = 346 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 60/163 (36%), Gaps = 7/163 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P ED + Sbjct: 1 MLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEDTLIRK 58 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D + + P+ + + I ++IE A+ + Sbjct: 59 INYDENPFLSDTMLKVIDAARRRD-PEGFVHVYEGVPESDDDAAIIKLSWIEAAVDAHKV 117 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 +G D+A G DK V+R G++I +W AK Sbjct: 118 LDFGPEGRKRIGFDVADSGADKCANVYRYGSVIYWADEWKAKE 160 >gi|297566322|ref|YP_003685294.1| hypothetical protein Mesil_1911 [Meiothermus silvanus DSM 9946] gi|296850771|gb|ADH63786.1| protein of unknown function DUF264 [Meiothermus silvanus DSM 9946] Length = 427 Score = 48.2 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 49/264 (18%), Positives = 96/264 (36%), Gaps = 29/264 (10%) Query: 88 IGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLS 147 +GK+ + + P + ++ E Q SK L+ RH +Q ++ Sbjct: 32 VGKSFAASLEAVLDCVAHPRSLWVFLSRGERQ---------SKELAEKAQRHLEAIQVVA 82 Query: 148 -LHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEAS--GTP 204 ++ + AE + + + + I+ PDT G V DE + Sbjct: 83 EMYDEPFDAESTQTVIRLPNGSRIISL----PANPDTARGYSGN----VLLDEFALHKDS 134 Query: 205 DIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN--IPLEDWKRYQIDTRTVEGID 262 I ++ T + + S + G FY+I+ + W R+++D Sbjct: 135 REIWGALYPTIT-RSKRYRLRVLSTPKGQQGKFYEIWQPEPGGDLWSRHRVDIYDAVQQG 193 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDD--LYAPLIM 320 + + D + + E L +F + ++P+ I S +A D L L + Sbjct: 194 LEVDPEELRKGLKDPVLWQQEYLLEFVDEAS-AWLPYELITSCESSQARTDGALEGDLYL 252 Query: 321 GCDIAGEGGDKTVV--VFRRGNII 342 G DI D +V+ R G+++ Sbjct: 253 GMDIGRH-RDLSVIWVAERVGDVL 275 >gi|211731737|gb|ACJ10086.1| terminase [Bacteriophage APSE-5] Length = 469 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 47/304 (15%), Positives = 83/304 (27%), Gaps = 75/304 (24%) Query: 84 AGRGIGKTTLNAWMMLW--------LISTRPGMSIICIANSETQLKNTLWAEVSKWLSML 135 GRG KT A + L + R M+ I E + L AEV L + Sbjct: 12 GGRGGMKTVSFAKIALITASMHKRRFLCLREFMNSI-----EDSVHAVLQAEVET-LGLQ 65 Query: 136 PHRHWFEMQSLSLHPSGW-YAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMA 194 ++ S + Y +L I SKH Sbjct: 66 NRFRILNTYIEGINDSIFKYGQLARNIASIKSKHDFDVA--------------------- 104 Query: 195 VFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDW------ 248 + +EA + +++ + W N +G Y F P ++ Sbjct: 105 -WVEEAETVSEKSLDTLIPTIRKPGSE-LWFSF-NPAEEDGAVYKRFVKPYKELIDTQGY 161 Query: 249 --------------KRYQIDTR---TVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQ 291 + + + ++ YG + D Sbjct: 162 YEDDDLYVGKVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANY---------- 211 Query: 292 EVNNFIPHNYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS 349 + I ++E A+ + ++ D A G D+ + R G +IE WS Sbjct: 212 -EDALIQPEWVEAAIDAHIKLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWS 270 Query: 350 AKLI 353 + Sbjct: 271 EGDV 274 >gi|156564098|ref|YP_001429607.1| terminase large subunit [Bacillus phage 0305phi8-36] gi|154622795|gb|ABS83675.1| terminase large subunit [Bacillus phage 0305phi8-36] Length = 635 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 27/133 (20%), Positives = 48/133 (36%), Gaps = 20/133 (15%) Query: 18 MLMHAECVLSFKNFVMRFFPW----GIKGKP-----LEHFSQPHRWQLEFMEAVDVHCHS 68 L E + + ++R W G K L +P W E ++ Sbjct: 18 QLWETE----YDDLIVRTKKWARSTGEKFTEEELHYLAILDKPKFWAAETLKWFCRDYQE 73 Query: 69 NVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMS------IICIANSETQLKN 122 + + + GR +GKT M+LW T+P I+ IA E Q+ + Sbjct: 74 PMLQEMADSKRTVLRLGRRLGKTETMCIMILWHAFTQPNKGPNNQYDILIIAPYEEQV-D 132 Query: 123 TLWAEVSKWLSML 135 ++ +S+ + M Sbjct: 133 LIFKRLSQLIDMS 145 >gi|326784094|ref|YP_004324487.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage Syn1] gi|310004826|gb|ADO99217.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage Syn1] Length = 550 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 48/267 (17%), Positives = 94/267 (35%), Gaps = 45/267 (16%) Query: 88 IGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLS 147 GK+T +L + +++ +AN + ++ L A ++ LP W +Q Sbjct: 84 TGKSTTVVSYLLHYLIFNDSVNVGILANKASTARDLL-ARLATAYENLPK--W--IQQGV 138 Query: 148 LHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDII 207 + + EL S + + R S +F DE + P+ I Sbjct: 139 VVWNKGNIELENGSKILAASTSASAVRGMSFN--------------IIFLDEFAFVPNHI 184 Query: 208 ----NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN---IPLEDWKRYQIDTRTVEG 260 S+ T + I+ S + +N FY ++ D+ +++ V G Sbjct: 185 ADSFFASVYPTITS-GKSTKVIIISTPQGMN-HFYKMWQDAVNGRNDYTYHEVHWSQVPG 242 Query: 261 IDSGFHEGIISRYGLDSDV-ARIEILGQFPQQEVNNFIPHNYI-----EEAMSREAIDDL 314 D+ + E I S E +F V+ I + + +E ++R D+ Sbjct: 243 RDAKWKEETIKN---TSQRQFTQEFECEF-LGSVDTLISASKLKALAFDEPITRNKGLDI 298 Query: 315 YAPL------IMGCDIAGE-GGDKTVV 334 Y ++ D++ GGD + Sbjct: 299 YEKPKDKNEYLLTVDVSRGIGGDYSAF 325 >gi|326784562|ref|YP_004324947.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-SSM7] gi|310004595|gb|ADO98987.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-SSM7] Length = 550 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 39/259 (15%), Positives = 87/259 (33%), Gaps = 46/259 (17%) Query: 88 IGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLS 147 GK+T+ +LW + + +++ +AN E+ + L + +Q Sbjct: 85 TGKSTIVTSYLLWYVLFKANVNVAILANKAA-----TSREMLQRLQLSYENLPKWLQQGI 139 Query: 148 LHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDII 207 L + EL S + + + R S +F DE + P+ I Sbjct: 140 LQWNRGSLELENGSKIMAASTSSSAVRGMSFN--------------VIFLDEFAFVPNHI 185 Query: 208 ----NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN---IPLEDWKRYQIDTRTVEG 260 S+ + + I+ S +N FY +++ ++ ++ V G Sbjct: 186 ADQFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDAERGKNEYIPTEVHWSAVPG 243 Query: 261 IDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEE-------------AMS 307 D+ + + I+ ++E +F V+ I + + A+ Sbjct: 244 RDAAWKDQTIANTSEQQ--FKVEFECEF-LGSVDTLISPSKLRTMPYEDPIIQNRGLAVY 300 Query: 308 REAIDDLYAPLIMGCDIAG 326 ++ + I+ D+A Sbjct: 301 KQV--EKDHNYIVTVDVAR 317 >gi|226940436|ref|YP_002795510.1| Terminase large subunit [Laribacter hongkongensis HLHK9] gi|226715363|gb|ACO74501.1| Terminase large subunit [Laribacter hongkongensis HLHK9] Length = 93 Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 20/78 (25%), Positives = 32/78 (41%), Gaps = 10/78 (12%) Query: 14 ELHEMLMH--AECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVN 71 ++ + L+ AEC + + + WG LE + P WQ E M + H + Sbjct: 3 DIDDELIELAAECATDPLRWALHAYDWGR--GELEGVTGPRAWQREVMSDIGNHLKNPAT 60 Query: 72 NSNPTIFKCAISAGRGIG 89 + A AGRG+G Sbjct: 61 RFS------AFDAGRGLG 72 >gi|9633565|ref|NP_050979.1| P18 [Acyrthosiphon pisum bacteriophage APSE-1] gi|6118013|gb|AAF03961.1|AF157835_18 P18 [Endosymbiont phage APSE-1] Length = 469 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 51/183 (27%), Gaps = 38/183 (20%) Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDW------- 248 + +EA + S++ + W N +G Y F P ++ Sbjct: 105 WVEEAETVSEKSLDSLIPTIRKPGSE-LWFSF-NPAEEDGAVYKRFVKPYKELIDTQGYY 162 Query: 249 -------------KRYQIDTR---TVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQE 292 + + + ++ YG + D Sbjct: 163 EDDDLYVGKVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANY----------- 211 Query: 293 VNNFIPHNYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 + I ++E A+ + ++ D A G D+ + R G +IE WS Sbjct: 212 EDALIQPEWVEAAIDAHIKLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSE 271 Query: 351 KLI 353 + Sbjct: 272 GDV 274 >gi|203288482|ref|YP_002223299.1| bsr protein [Borrelia duttonii Ly] gi|201084467|gb|ACH94050.1| bsr protein [Borrelia duttonii Ly] Length = 450 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 48/294 (16%), Positives = 91/294 (30%), Gaps = 51/294 (17%) Query: 53 RWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTR------- 105 + Q + + A++ + + K +S G GKT L + T Sbjct: 47 KKQRKVLSAIEKNNQN----------KVILSGGIASGKTFLA---CYLFLKTLLKNRHRY 93 Query: 106 -PGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGI 164 + + NS+ L+ + + K +M ++ + + + Y E+ + + Sbjct: 94 SHDTNNFILGNSQKALEINVTGQFKKLANM------LKIPFVPKYSNTSYFEINSLRVNL 147 Query: 165 DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFW 224 Y ++ F ++ ++ +EA+ K L + P Sbjct: 148 -----------YGGDKIRDFERFRGSNSAVIYVNEATTLHKETLKEALKRL-RIKPEFIV 195 Query: 225 IMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI 284 T N +F + + Y T E I F + Y D + + Sbjct: 196 FDT-NPDHPEHYFKTDYIDNNTVYSTYNFTTYDNEEISKEFIKTQEELY-KDFPTYKASV 253 Query: 285 -LGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAG-EGGDKTVVVF 336 LG++ F N IE D + I D A GGD T + Sbjct: 254 LLGEWVANNDAIFRNINIIE--------DYEFKSPIAYLDPAYSSGGDNTSLCV 299 >gi|48697520|ref|YP_024878.1| gp33 TerL [Burkholderia phage BcepB1A] gi|47717490|gb|AAT37736.1| gp33 TerL [Burkholderia phage BcepB1A] Length = 532 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/158 (18%), Positives = 55/158 (34%), Gaps = 10/158 (6%) Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDT 255 F DEA+ + +++ R I + N LN F + K + Sbjct: 203 FVDEAAHLENA--QAVDTALAATTNCRIDISSVN--GLNNPFAE--KRFSGRVKVKTMHW 256 Query: 256 RTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAID--D 313 R D +++ ++ ++ V EI + IP +I+ A+ + Sbjct: 257 RDDPRKDDEWYKKQKQKF--NALVVAQEIDIDYSASAEGVLIPLEWIDAAIDADVKLGLT 314 Query: 314 LYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAK 351 + D+A EG D R G +++ WS K Sbjct: 315 VTGQRFSSLDVADEGKDMNAFGSRLGIRMDYAESWSGK 352 >gi|168704532|ref|ZP_02736809.1| hypothetical protein GobsU_33659 [Gemmata obscuriglobus UQM 2246] Length = 209 Score = 47.8 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 22/107 (20%), Positives = 34/107 (31%), Gaps = 14/107 (13%) Query: 179 ERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFY 238 + + VG V DE S D + KS+ + S GWF+ Sbjct: 63 DSQEGVVGFSA--PRLVVIDEGSRVSDELYKSVRPMLAVSKGQ--LLTLSTPFGNQGWFF 118 Query: 239 DIFNIPLED----------WKRYQIDTRTVEGIDSGFHEGIISRYGL 275 DI++ E W+R + + I F E + G Sbjct: 119 DIWDDSAEGLKRRAKLHEPWQRTAVPASQIPRITPEFLEDERAELGE 165 >gi|118590957|ref|ZP_01548357.1| hypothetical protein SIAM614_19891 [Stappia aggregata IAM 12614] gi|118436479|gb|EAV43120.1| hypothetical protein SIAM614_19891 [Stappia aggregata IAM 12614] Length = 526 Score = 47.4 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 21/87 (24%), Positives = 34/87 (39%), Gaps = 9/87 (10%) Query: 286 GQFPQQEVN---NFIPHNYIEEAMSR--EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGN 340 G F + IP ++++ A R + ID ++ D+A G D+TV+ G Sbjct: 287 GDFLAARQDHEWQVIPSDWVDLAFERYDQGIDRDEPMTVLAVDVAQGGKDRTVLQPLHGR 346 Query: 341 IIEHIFDWSAKLIQETNQEGCPVGSSI 367 E ++G VGS I Sbjct: 347 RFETNIVRKGTDT----KDGADVGSLI 369 >gi|211731806|gb|ACJ10127.1| terminase [Bacteriophage APSE-3] Length = 469 Score = 47.4 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 47/304 (15%), Positives = 83/304 (27%), Gaps = 75/304 (24%) Query: 84 AGRGIGKTTLNAWMMLW--------LISTRPGMSIICIANSETQLKNTLWAEVSKWLSML 135 GRG KT A + L + R M+ I E + L AEV L + Sbjct: 12 GGRGGMKTVSFAKIALITASMHKRRFLCLREFMNSI-----EDSVHAVLQAEVET-LGLQ 65 Query: 136 PHRHWFEMQSLSLHPSGW-YAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMA 194 ++ S + Y +L I SKH Sbjct: 66 NRFRILNTYIEGINDSIFKYGQLARNIASIKSKHDFDVA--------------------- 104 Query: 195 VFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPL--------- 245 + +EA + +++ + W N +G Y F P Sbjct: 105 -WVEEAETVSEKSLDTLIPTIRKPGSE-LWFSF-NPAEEDGAVYKRFVKPYKAIIDKQGY 161 Query: 246 ----EDW-------KRYQIDTR---TVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQ 291 + + + + + ++ YG + D Sbjct: 162 YEDDDLYVGKVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANY---------- 211 Query: 292 EVNNFIPHNYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS 349 + I ++E A+ + ++ D A G D+ + R G +IE WS Sbjct: 212 -EDALIQPEWVEAAIDAHIKLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWS 270 Query: 350 AKLI 353 + Sbjct: 271 EGDV 274 >gi|212499721|ref|YP_002308529.1| terminase [Bacteriophage APSE-2] gi|238898754|ref|YP_002924436.1| APSE-2 prophage; terminase [Bacteriophage APSE-2] gi|211731690|gb|ACJ10178.1| terminase [Bacteriophage APSE-2] gi|229466514|gb|ACQ68288.1| APSE-2 prophage; terminase [Bacteriophage APSE-2] Length = 469 Score = 47.4 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 47/304 (15%), Positives = 83/304 (27%), Gaps = 75/304 (24%) Query: 84 AGRGIGKTTLNAWMMLW--------LISTRPGMSIICIANSETQLKNTLWAEVSKWLSML 135 GRG KT A + L + R M+ I E + L AEV L + Sbjct: 12 GGRGGMKTVSFAKIALITASMHKRRFLCLREFMNSI-----EDSVHAVLQAEVET-LGLQ 65 Query: 136 PHRHWFEMQSLSLHPSGW-YAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMA 194 ++ S + Y +L I SKH Sbjct: 66 NRFRILNTYIEGINDSIFKYGQLARNIASIKSKHDFDVA--------------------- 104 Query: 195 VFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPL--------- 245 + +EA + +++ + W N +G Y F P Sbjct: 105 -WVEEAETVSEKSLDTLIPTIRKPGSE-LWFSF-NPAEEDGAVYKRFVKPYKAIIDKQGY 161 Query: 246 ----EDW-------KRYQIDTR---TVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQ 291 + + + + + ++ YG + D Sbjct: 162 YEDDDLYVGKVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANY---------- 211 Query: 292 EVNNFIPHNYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS 349 + I ++E A+ + ++ D A G D+ + R G +IE WS Sbjct: 212 -EDALIQPEWVEAAIDAHIKLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWS 270 Query: 350 AKLI 353 + Sbjct: 271 EGDV 274 >gi|221316874|ref|YP_002527821.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 72a] gi|226246930|ref|YP_002776267.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 29805] gi|221237339|gb|ACM10180.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 72a] gi|226201508|gb|ACO38105.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 29805] Length = 450 Score = 47.4 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 31/157 (19%), Positives = 51/157 (32%), Gaps = 16/157 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 +E +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNVETFKTYNFTTYDNVFLSKGFIETQEKLY-KDIPAYKARVLLGEWLASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF 336 E+ M I D A GGD T + Sbjct: 271 ITEDYMFTSP--------IAYLDPAFSVGGDNTALCV 299 >gi|62327097|ref|YP_223885.1| putative large subunit terminase [Lactobacillus phage phiJL-1] gi|37930114|gb|AAP74512.1| putative large subunit terminase [Lactobacillus phage phiJL-1] Length = 440 Score = 47.4 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 49/292 (16%), Positives = 95/292 (32%), Gaps = 34/292 (11%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 RG GK+ A ++ I P ++ + T K++ +A + K + F+ Sbjct: 41 KGSRGSGKSYATAAKVIIDIMMYPYVNWLVTRQYATTQKDSTFATIRKVAHSMGVLDLFK 100 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 L + +Q+ + +P G + +EA Sbjct: 101 FTKSPLEIT------YKQTGQKVFFRGMDDPLKITSIQP--VTGFICRR----WCEEAYE 148 Query: 203 TP-----DIINKSILGFFTELNPNRFWIMTSNT----RRLNGWFYDIFNIPLEDWKRYQI 253 D + +S+ G ++T N L F+D + Sbjct: 149 LKSLDAFDTVEESMRGEL-PPGGFYQTVITFNPWSDRHWLKHEFFDDKTK-RNHSRAITT 206 Query: 254 DTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDD 313 + + +++ + + + + + AR+ +LG++ E F R+ D Sbjct: 207 TYKDNDHLNADYVDSLKEMLVRNPNRARVAVLGEWGIAEGLVFDGLFE-----QRDFSYD 261 Query: 314 LYA--PLIMGCDIAGEGGDKTV---VVFRRGNIIEHIFDWSAKLIQETNQEG 360 A P +G D G D T + + N I +I+D K TNQ Sbjct: 262 EIANLPKSVGLDF-GFKHDPTAGEFIAVDQDNRIVYIYDEFYKQHLLTNQIA 312 >gi|113200627|ref|YP_717790.1| terminase large subunit [Synechococcus phage syn9] gi|76574526|gb|ABA47091.1| terminase large subunit [Synechococcus phage syn9] Length = 549 Score = 47.4 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 38/256 (14%), Positives = 83/256 (32%), Gaps = 42/256 (16%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL 148 GK+T+ +LW + +++ +AN E+ + L + +Q L Sbjct: 85 GKSTIVTSYLLWYVLFNANVNVAILANKAA-----TAREMLQRLQLSYENLPKWLQQGIL 139 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDII- 207 + EL S + + R S +F DE + P+ + Sbjct: 140 QWNRGSLELENGSKILAASTSASAVRGMSFN--------------VIFLDEFAFVPNHVA 185 Query: 208 ---NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN---IPLEDWKRYQIDTRTVEGI 261 S+ + + I+ S +N FY +++ ++ ++ V G Sbjct: 186 DQFFSSVYPTISS-GKSTKVIIISTPHGMN-MFYKLWHDAERKANEYIPTEVHWSEVPGR 243 Query: 262 DSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDD-------- 313 D+ + E I R+E +F V+ I + + + + I + Sbjct: 244 DAAWKEQTIKNTSEQQ--FRVEFECEF-LGSVDTLISPSKLRTMVYGDPIAEKNGLSMYE 300 Query: 314 ---LYAPLIMGCDIAG 326 ++ D++ Sbjct: 301 KTIQGHTYVITADVSR 316 >gi|238790716|ref|ZP_04634478.1| Gp33 TerL [Yersinia frederiksenii ATCC 33641] gi|238721211|gb|EEQ12889.1| Gp33 TerL [Yersinia frederiksenii ATCC 33641] Length = 538 Score = 47.4 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 24/141 (17%), Positives = 47/141 (33%), Gaps = 16/141 (11%) Query: 228 SNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ 287 S + F + K + R D +++ + ++ + + + Sbjct: 229 STPNGMANSFAE--RRHSGKIKVFTFHWRDDPRKDDAWYQKQVE------NLDPVTVAQE 280 Query: 288 ----FPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNI 341 + IP +++ A++ + DIA EG D R G + Sbjct: 281 IDINYSASVEGVLIPSAWVQAAINAHEVLGIVPTGQRLGALDIADEGKDTNSFAGRHGFL 340 Query: 342 IEHIFDWSAK--LIQETNQEG 360 +E I +WS K I T Q+ Sbjct: 341 LESIEEWSGKGDDIFGTVQKA 361 >gi|224796473|ref|YP_002641230.1| phage terminase, large subunit, pbsx family [Borrelia spielmanii A14S] gi|224497687|gb|ACN53304.1| phage terminase, large subunit, pbsx family [Borrelia spielmanii A14S] Length = 450 Score = 47.0 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 45/313 (14%), Positives = 89/313 (28%), Gaps = 46/313 (14%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI----STRP- 106 Q E + ++ H K S G GKT L +++++ + S Sbjct: 46 TTKQKEVLFDIESHK----------YSKVIFSGGIASGKTFLASYLLIKKLIENKSFYEQ 95 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 + I NS + L ++ K + + + ++ + I Sbjct: 96 DTNNFIIGNSISLLMTNTIKQIEKICRL------LGIDYQKKKSGQSFCKIAGFELNIYG 149 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIM 226 D F + ++ +EA+ +L I Sbjct: 150 GK-----------NRDAFSKIRGGNSAIIYVNEATVIHKETLLEVLKRL--RKGKSIIIF 196 Query: 227 TSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL- 285 +N +F + + +K Y T + F E Y S + +L Sbjct: 197 DTNPESPAHFFKTDYIENTDVFKTYNFTTYDNPLNSADFIETQEKLY-KHSPAYKARVLY 255 Query: 286 GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVFRRGNIIEH 344 G++ ++ + D + IM D A GGD T + E Sbjct: 256 GEW-IVNESSLFNEMIFNQ-------DYEFKSPIMYIDPAFSVGGDNTAICVLE-RTFEK 306 Query: 345 IFDWSAKLIQETN 357 + + + + N Sbjct: 307 FYAYIYQDQKPVN 319 >gi|221316998|ref|YP_002533177.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 72a] gi|221237630|gb|ACM10461.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 72a] Length = 450 Score = 47.0 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 54/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++A I D A GGD T + R + Sbjct: 271 ITD--------DYVFASPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|224593667|ref|YP_002641021.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi CA-11.2a] gi|224554694|gb|ACN56072.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi CA-11.2a] Length = 450 Score = 47.0 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y+ T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYKFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERIDEK 306 >gi|211731785|gb|ACJ10115.1| terminase [Bacteriophage APSE-7] Length = 469 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 51/183 (27%), Gaps = 38/183 (20%) Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPL---------- 245 + +EA + +++ + W N +G Y F P Sbjct: 105 WVEEAETVSEKSLDTLISTIRKPGSE-LWFSF-NPSEEDGAVYQRFVKPYKAIIDKKGYY 162 Query: 246 ---EDW-------KRYQIDTR---TVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQE 292 + + + + + ++ YG + D Sbjct: 163 EDDDLYVGNVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANY----------- 211 Query: 293 VNNFIPHNYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 + I +++ A+ + ++ D A G D+ + R G +IE WS Sbjct: 212 DDALIQPEWVDAAIDAHIKLGFPPRGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSE 271 Query: 351 KLI 353 + Sbjct: 272 GDV 274 >gi|154247076|ref|YP_001418034.1| hypothetical protein Xaut_3147 [Xanthobacter autotrophicus Py2] gi|154161161|gb|ABS68377.1| protein of unknown function DUF264 [Xanthobacter autotrophicus Py2] Length = 416 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 48/272 (17%), Positives = 87/272 (31%), Gaps = 49/272 (18%) Query: 82 ISAGRGIGKTTLNA-WMMLWLI-----STRPGMSIICIANSETQLKNTLWAEVSKWLSML 135 + GRG GKT A W+ + + RP I +A + ++ + VS L++ Sbjct: 31 VLGGRGAGKTRAGAEWVRGLALGRPPFAGRPVGRIALVAETMADVREVMVEGVSGLLAVH 90 Query: 136 PHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAV 195 P + + +S E P++ GP A Sbjct: 91 PRAERPRWEPTR---------------RRLEWANGAVAQGFSAEDPESLRGPQFA---AA 132 Query: 196 FNDEASGTPDIINKSILGFF------TELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 + DE + K F L ++T+ R + L D Sbjct: 133 WLDELAK-----WKRAEATFDMLQFGLRLGAQPRQMVTTTPRPTA-----LLRRLLADPS 182 Query: 250 RYQIDTRTVEG---IDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAM 306 RT + + F +++RYG + + R E+ G+ + + +E Sbjct: 183 TAVTRARTADNAFHLAPSFLGQVLTRYG-GTRLGRQELDGELIEDRADALFSRPALEA-- 239 Query: 307 SREAIDDLYAPLIMGCDI---AGEGGDKTVVV 335 REA +++ D + G D +V Sbjct: 240 LREAQVPPLTRIVVAVDPPASSRAGADACGIV 271 >gi|225575978|ref|YP_002724813.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] gi|225576296|ref|YP_002725339.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] gi|225547342|gb|ACN93326.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] gi|225547454|gb|ACN93434.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] Length = 450 Score = 46.7 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 54/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 +K Y+ T + GF E Y D + + LG++ + F N Sbjct: 212 IDNTATFKTYKFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 I+ D ++ I D A GGD T + R + Sbjct: 271 IIQ--------DYVFTSPIAYLDPAFSIGGDNTALCVMERVDDK 306 >gi|238027628|ref|YP_002911859.1| Bbp25 [Burkholderia glumae BGR1] gi|237876822|gb|ACR29155.1| Bbp25 [Burkholderia glumae BGR1] Length = 486 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 16/62 (25%), Positives = 26/62 (41%), Gaps = 5/62 (8%) Query: 284 ILGQFPQQEVN---NFIPHNYIEEAMSRE-AIDDLYAPLI-MGCDIAGEGGDKTVVVFRR 338 + G F + IP ++ A R A P+ +G D+A G D+++ R Sbjct: 264 LYGDFAAGREDDPWQVIPSEWVRLAQERWRARSRPRIPMTALGVDVARGGQDQSIYTPRY 323 Query: 339 GN 340 GN Sbjct: 324 GN 325 >gi|255321082|ref|ZP_05362250.1| gp33 TerL [Acinetobacter radioresistens SK82] gi|262379515|ref|ZP_06072671.1| bacteriophage TerL protein [Acinetobacter radioresistens SH164] gi|255301852|gb|EET81101.1| gp33 TerL [Acinetobacter radioresistens SK82] gi|262298972|gb|EEY86885.1| bacteriophage TerL protein [Acinetobacter radioresistens SH164] Length = 558 Score = 46.7 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 35/225 (15%), Positives = 71/225 (31%), Gaps = 23/225 (10%) Query: 148 LHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDII 207 + P G+ ++ + M I + T + + G M DE + Sbjct: 169 MKPKGFIEKVHDNYMRIINPDNGATVTGEAGDNI----GRGGRTTMYFL-DEWAFVER-- 221 Query: 208 NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHE 267 +++ ++ N N I S + F+ + + + + R + Sbjct: 222 QEAVDAAISQ-NTNVH-IKGSTPNGIGDKFHQ--DRFSGRYAVFTMAWRDNPDKNWQVEL 277 Query: 268 GIISRYGL--------DSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAP 317 Y D V E+ + IP +++ A+ + + Sbjct: 278 DGKLIYPWYEKQLATLDDIVLAQEVDIDYAASVEGVLIPSAWVQAAVDAHIKLGIEPSGE 337 Query: 318 LIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS--AKLIQETNQEG 360 D+A EG DK R G +++++ WS I T Q+ Sbjct: 338 RNGALDVADEGKDKNSFAARHGIVLQYLDTWSGIGDDIFGTTQKA 382 >gi|195942579|ref|ZP_03087961.1| hypothetical protein Bbur8_07059 [Borrelia burgdorferi 80a] Length = 450 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 54/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHRQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y+ T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYKFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|203288734|ref|YP_002223670.1| bsr protein [Borrelia duttonii Ly] gi|201084584|gb|ACH94162.1| bsr protein [Borrelia duttonii Ly] Length = 330 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 32/157 (20%), Positives = 51/157 (32%), Gaps = 16/157 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ ++ +EA+ K +L + P T N +F + Sbjct: 35 ERFRG---SNSAVIYVNEATTLHKETLKEVLKRL-RMKPEFIIFDT-NPDHPEHYFKTDY 89 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + Y T E I F + Y D + + LG++ F N Sbjct: 90 IDNNTVYSTYNFTTYDNETISKEFIKTQEEIY-KDLPTYKASVLLGEWVANNDAIFRNIN 148 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAG-EGGDKTVVVF 336 IE D + I D A GGD TV+ Sbjct: 149 IIE--------DYEFKSPIAYLDPAYSSGGDNTVLCV 177 >gi|219723016|ref|YP_002474442.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|219692691|gb|ACL33908.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] Length = 450 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNMATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|226246851|ref|YP_002776184.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 29805] gi|226202003|gb|ACO38584.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 29805] Length = 450 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNMATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|203288843|ref|YP_002223837.1| bsr protein [Borrelia duttonii Ly] gi|201084394|gb|ACH93979.1| bsr protein [Borrelia duttonii Ly] Length = 450 Score = 46.3 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 49/294 (16%), Positives = 91/294 (30%), Gaps = 51/294 (17%) Query: 53 RWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLIST-------- 104 + Q + + A++ + + K +S G GKT L + T Sbjct: 47 KKQRKVLSAIEKNNQN----------KVILSGGIASGKTFLA---CYLFLKTLLKNRHLY 93 Query: 105 RPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGI 164 R G + + NS+ L E++ + ++ + + + Y E+ + + Sbjct: 94 RKGTNNFILGNSQKAL------EINVIEQFEDLANMLKIPFVPKYSNRSYFEIDSLRVNL 147 Query: 165 DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFW 224 Y ++ F ++ ++ +EA+ K L + P Sbjct: 148 -----------YGGDKIRDFKRFRGSNSAVIYVNEATTLHKETLKEALKRL-RIKPEFIV 195 Query: 225 IMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI 284 T N +F + + Y T E I F + Y D + + Sbjct: 196 FDT-NPDHPEHYFKTDYIDKNTVYSTYNFTTYDNEEISKEFIKTQEELY-KDFPTYKASV 253 Query: 285 -LGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAG-EGGDKTVVVF 336 LG++ F N IE D + I D A GGD T + Sbjct: 254 LLGEWVANNDAIFRNINIIE--------DYEFKSPIAYLDPAYSSGGDNTSLCV 299 >gi|85059798|ref|YP_455500.1| phage terminase large subunit [Sodalis glossinidius str. 'morsitans'] gi|84780318|dbj|BAE75095.1| phage terminase large subunit [Sodalis glossinidius str. 'morsitans'] Length = 483 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 54/167 (32%), Gaps = 5/167 (2%) Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDT 255 + +EA ++ + W+ + L+ + PL+D + Sbjct: 116 WVEEAEAVTKESWDILIPTIRKPGSE-IWVSFNPKNILDDTYQRFVVNPLDDICLLTVHY 174 Query: 256 RTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNYIEEAMS--REAID 312 + D D+ G+ P + + I +I A+ Sbjct: 175 TDNPHFPEVLRLEMEECKCKDYDLYLHIWEGE-PVADSDLAIIKPLWIAAAVDAHMTLGF 233 Query: 313 DLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQE 359 D +G D+A EG D + F +G+++ + +W + ++ Sbjct: 234 DAVGEKRLGFDVADEGEDCNALCFVQGSVVLDLDEWHRGDVIASSNR 280 >gi|239502629|ref|ZP_04661939.1| hypothetical protein AbauAB_09982 [Acinetobacter baumannii AB900] Length = 414 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 46/263 (17%), Positives = 89/263 (33%), Gaps = 40/263 (15%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWF 141 + AGR GKT+L+ +++ S +P I +A + K +W ++ Sbjct: 26 VVAGRRWGKTSLSRTLII-SKSRKPRQRIWYVAPTYRMAKQIMWKDL------------- 71 Query: 142 EMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEAS 201 + P W ++ S+ I+ + T+ +++ PD+ G + DE Sbjct: 72 ----IEAIPRKWVVKINHSSLSIELVNGTLIELKGADD-PDSLRGVGID---FLVLDEFQ 123 Query: 202 GTPDIIN-KSILGFFTELNPNRFWIMTSNTRRLNGWF--YDIFNIPLE----DWKRYQID 254 + + + + +I + N + Y P + W+ +Q Sbjct: 124 DISEEAWTQCLRPTLASTGGHAIFI--GTPKAYNQLYTVYMQGQDPKKVKAGQWQSWQFP 181 Query: 255 TRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDL 314 T T I E + S + E L F + P + E + D Sbjct: 182 TITSPFIPESEIEAARADMDEKS--FKQEFLASFETMSGRVYYPFDRKEHVG--KYPFDP 237 Query: 315 YAPLIMGCDIAGEGGD--KTVVV 335 P+ +G D D TV++ Sbjct: 238 KLPIWIGMD---FNIDPMSTVIM 257 >gi|315655961|ref|ZP_07908859.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 51333] gi|315490025|gb|EFU79652.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 51333] Length = 460 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 46/280 (16%), Positives = 93/280 (33%), Gaps = 29/280 (10%) Query: 65 HCHSNVNNSNPTIFKCA--ISAGRGIGKTTLNAWMML-WLISTRPGMSIICIANSETQLK 121 H H+ + PT + GRG GKT A ++ W PG I +A E+ ++ Sbjct: 41 HHHARASQHPPTGAWTEWLLMTGRGWGKTRTAAELVRDWA--KNPGTQIAVVAKKESLVR 98 Query: 122 NTLWA-EVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 + + + S L ++P + + ++ E Sbjct: 99 SICFEHKTSGLLHVIPKSDQARFNASGGSGRFFLQLKNGSTIYGFG-----------AEV 147 Query: 181 PDTFVGPHNTHGMAVFNDE-ASGTPDIINKSILGFFTE--LNPNRFWIMTSNTRRLNGWF 237 PD G + DE A+ + + + +P+ ++++ + L Sbjct: 148 PDNLRGFAFDKA---WFDEFAAWNKQTAQEVYDMMWYDLRESPSPQMVISTTPKPLKHV- 203 Query: 238 YDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFI 297 D+ + P R + + + E + YG + + R E+ G+ + Sbjct: 204 RDLVSKPGVVITRGHTKD-NLPNLSAIALEKLERDYGK-TRLGRQELAGELIESIEGALW 261 Query: 298 PHNYIEEAMSREAIDDLYAPLIMGCDIA---GEGGDKTVV 334 ++ + R +++G D A EG D T Sbjct: 262 DVTMFQDPVFRPDTMPPLEDIVVGVDPAVRSSEGADMTAF 301 >gi|87201130|ref|YP_498387.1| hypothetical protein Saro_3118 [Novosphingobium aromaticivorans DSM 12444] gi|87136811|gb|ABD27553.1| protein of unknown function DUF264 [Novosphingobium aromaticivorans DSM 12444] Length = 440 Score = 46.3 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 51/269 (18%), Positives = 88/269 (32%), Gaps = 35/269 (13%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWF 141 + AGRG GKT L A + + P I + S + ++ + Sbjct: 57 VMAGRGFGKTRLGAEWVRKIAEEDPEARIALVGASLHEARSVMVE--------------- 101 Query: 142 EMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE-- 199 L + W + E S+ YS P++ GP ++H + DE Sbjct: 102 GESGLLSIDAPWRRPVFESSVRRLVWPNGAQAFLYSAGEPESLRGPQHSHA---WCDEIA 158 Query: 200 ----ASGTPDIINKSIL-GFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQID 254 S ++L G +P T L D +D + Sbjct: 159 KWDNGSNRAMATWDNLLMGLRLGRDPRLVATTTPRPVPLVARIMD----EGDDVVVTRGS 214 Query: 255 TRTVE-GIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDD 313 T + + F E + +G + + R E+LG+ + V IE A RE Sbjct: 215 TFENQDNLPRRFVEAMRRTFGGTT-LGRQELLGEMIEDLVGALWSRALIENA--REDAAP 271 Query: 314 LYAPLIMGCDI--AGEGGDKTVVVFRRGN 340 +++G D + G ++V G+ Sbjct: 272 AMTRVVVGVDPPASAHGDACGIIVCGIGD 300 >gi|224020497|ref|YP_002601287.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 64b] gi|223929730|gb|ACN24438.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 64b] Length = 450 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 INNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYIFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|114569469|ref|YP_756149.1| hypothetical protein Mmar10_0918 [Maricaulis maris MCS10] gi|114339931|gb|ABI65211.1| protein of unknown function DUF264 [Maricaulis maris MCS10] Length = 450 Score = 46.3 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 43/260 (16%), Positives = 74/260 (28%), Gaps = 28/260 (10%) Query: 84 AGRGIGKTTLNA-WMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GKT A W+ + T I + + ++ + Sbjct: 67 GGRGAGKTRAGAEWVRHRALRTV--SRIALVGPTFNDVREVM------------IEGPSG 112 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 ++ L E + + S +S E D GP + + DE + Sbjct: 113 LKHLGSAMERPRYEASRKRLVFPSGSQAY---AFSAEDADGLRGPQFDYA---WGDEFAA 166 Query: 203 TPDI---INKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVE 259 PD ++ +G P T ++ +Q Sbjct: 167 WPDPQRVLDTLRMGVRLGGAPRILLTTTPRPIPALKALVKAWDPRGPIRVTHQPTAANAA 226 Query: 260 GIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLI 319 + GF E + + YG S + R E+ G IE A ++ Sbjct: 227 NLAPGFVEALNAAYG-GSMLGRQEVEGLLIDDPDGALWTRPKIEAARLAAGQMPELDRIV 285 Query: 320 MGCDIAGEGG---DKTVVVF 336 + D GG D+ +V Sbjct: 286 VALDPPATGGPRSDECGIVV 305 >gi|226315790|ref|YP_002776047.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 29805] gi|226201663|gb|ACO38256.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 29805] Length = 450 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 54/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKPY-KDIPLYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|11497124|ref|NP_051248.1| hypothetical protein BB_S45 [Borrelia burgdorferi B31] gi|223987739|ref|YP_002601211.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 64b] gi|6382145|gb|AAF07462.1|AE001576_21 conserved hypothetical protein [Borrelia burgdorferi B31] gi|223929452|gb|ACN24166.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 64b] Length = 450 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 50/304 (16%), Positives = 97/304 (31%), Gaps = 42/304 (13%) Query: 47 HFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLIS--- 103 +F + QL ++ +V NN IF I++ GKT L ++ L + Sbjct: 36 NFDKFEEKQL-TLKQKNVIKSIKKNNEKKIIFSGGIAS----GKTYLACYLFLKSLIENK 90 Query: 104 --TRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQS 161 + I NS+ ++ + + K + ++ + H + Y + Sbjct: 91 KLYSSDTNNFIIGNSQRSVEVNVLGQFEKLCKL------LKIPYIPRHTNNSYILIDSLR 144 Query: 162 MGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPN 221 + + + F G ++ +F +EA+ + +L Sbjct: 145 INLYGGDKASDF--------ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQE 192 Query: 222 RFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVAR 281 T N +F + + +K Y T + GF E Y D + Sbjct: 193 TIIFDT-NPDHPEHYFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYK 250 Query: 282 IEI-LGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--R 337 + LG++ + F N + D ++ I D A GGD T + R Sbjct: 251 ARVLLGEWIASTDSIFTQINITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMER 302 Query: 338 RGNI 341 + Sbjct: 303 VDDK 306 >gi|326783799|ref|YP_004324193.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-SSM7] gi|310003811|gb|ADO98206.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-SSM7] Length = 552 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 45/257 (17%), Positives = 80/257 (31%), Gaps = 43/257 (16%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL 148 GK+T +L ++I +AN ++ L + LP W MQ + Sbjct: 86 GKSTTVVSYLLHYAIFNDSVTIGILANKAQTARDLL-GRLQIAYENLPK--W--MQQGII 140 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE----ASGTP 204 + EL +S I + R S +F DE A+ Sbjct: 141 AWNKGSMELENKSKIIAASTSASAVRGMSFN--------------IIFLDEFAFVANHLA 186 Query: 205 DIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI---PLEDWKRYQIDTRTVEGI 261 D S+ + + I+ S R +N FY +++ ++ + V G Sbjct: 187 DDFFSSVYPTISS-GKSTKVIIVSTPRGMN-HFYRLWHDAELGRNEYVTTDVHWSEVPGR 244 Query: 262 DSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAI---------- 311 D + E I R+E +F V+ I + ++ + E I Sbjct: 245 DEAWKEQTIKNTSE--AQFRVEFECEF-LGSVDTLIAPSKLKTMVYDEPINTGKRGGEIY 301 Query: 312 --DDLYAPLIMGCDIAG 326 + D+A Sbjct: 302 QNPIEKHNYSITVDVAR 318 >gi|226246889|ref|YP_002776229.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi Bol26] gi|226202275|gb|ACO37943.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi Bol26] Length = 450 Score = 45.9 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 49/304 (16%), Positives = 96/304 (31%), Gaps = 42/304 (13%) Query: 47 HFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLIS--- 103 +F + QL ++ +V NN I I++ GKT L ++ L + Sbjct: 36 NFDKFEEKQL-TLKQKNVIKSIKKNNEKKIILSGGIAS----GKTYLACYLFLKSLIENK 90 Query: 104 --TRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQS 161 + I NS+ ++ + + K + ++ + H + Y + Sbjct: 91 KLYSSDTNNFIIGNSQRSVEVNVLGQFEKLCKL------LKIPYIPRHTNNLYILIDSLR 144 Query: 162 MGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPN 221 + + + F G ++ +F +EA+ + +L Sbjct: 145 INLYGGDKASDF--------ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQE 192 Query: 222 RFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVAR 281 T N +F + + +K Y T + GF E Y D + Sbjct: 193 TIIFDT-NPDHPEHYFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYK 250 Query: 282 IEI-LGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--R 337 + LG++ + F N + D ++ I D A GGD T + R Sbjct: 251 ARVLLGEWIASTDSIFTQINITD--------DYIFTSPIAYLDPAFSVGGDNTALCVMER 302 Query: 338 RGNI 341 + Sbjct: 303 VDDK 306 >gi|218555117|ref|YP_002388030.1| hypothetical protein ECIAI1_2647 [Escherichia coli IAI1] gi|218361885|emb|CAQ99485.1| conserved hypothetical protein from bacteriophage origin [Escherichia coli IAI1] Length = 540 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 58/175 (33%), Gaps = 13/175 (7%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 DEA+ + I ++ R + + N + F Sbjct: 195 DRTTLYLVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMANPFAQ--KRHGGKIP 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMS- 307 + R D ++ + +D+ V + L + IP +++ A+ Sbjct: 249 VFTFHWRDDPRKDEEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDA 306 Query: 308 -REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS--AKLIQETNQE 359 + + D+A EG DK R G ++E++ +WS I ++ ++ Sbjct: 307 HIKLGIQPTGKRLGAMDVADEGRDKNAFSTRHGFLMENVREWSGVGSDIYQSVEK 361 >gi|330910791|gb|EGH39301.1| phage terminase, large subunit [Escherichia coli AA86] Length = 540 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 58/175 (33%), Gaps = 13/175 (7%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 DEA+ + I ++ R + + N + F Sbjct: 195 DRTTLYLVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMANPFAQ--KRHGGKIP 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMS- 307 + R D ++ + +D+ V + L + IP +++ A+ Sbjct: 249 VFTFHWRDDPRKDEEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDA 306 Query: 308 -REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS--AKLIQETNQE 359 + + D+A EG DK R G ++E++ +WS I ++ ++ Sbjct: 307 HIKLGIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEK 361 >gi|203288609|ref|YP_002223516.1| bsr protein [Borrelia duttonii Ly] gi|201084316|gb|ACH93904.1| bsr protein [Borrelia duttonii Ly] Length = 450 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 47/291 (16%), Positives = 95/291 (32%), Gaps = 49/291 (16%) Query: 55 QLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLIST-----RPGMS 109 Q E + +D + S + IF IS+ GKT L +++++ L+ + Sbjct: 49 QKEVLRDIDNNFCSKI------IFNGGISS----GKTFLASYLLIKLLIINRDHYHKDTN 98 Query: 110 IICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHY 169 + +S L ++ K S+L + LL+ S + Sbjct: 99 NFIVGSSIGTLLANTLKQIEKICSLLNIEY-----------------LLKDSRQVTCTIA 141 Query: 170 TITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSN 229 +T Y + D+F ++ V+ +EA+ I+ + P T N Sbjct: 142 GLTLNIYGGKNIDSFTKIRGSNSALVYVNEATLMHKETLLEIMKRLRQK-PGIIIFDT-N 199 Query: 230 TRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQF 288 +F + + ++ Y + F + + Y + + + LG++ Sbjct: 200 PDHPAHYFKVDYIDNRDVYRTYNFNIYDNPLNSKDFIKTQEAIY-KNLSAYKARVLLGEW 258 Query: 289 PQQEVNNFIPHNYIEEAMSREAIDDLY--APLIMGCDIAGE-GGDKTVVVF 336 I+ + ++ Y IM D A G D T + Sbjct: 259 ----------TASIDSCFNEVILNCEYTFKSPIMYIDPAFSVGMDNTAICV 299 >gi|224983831|ref|YP_002641150.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi WI91-23] gi|224554243|gb|ACN55633.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi WI91-23] Length = 450 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 54/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y+ T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYKFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|11497063|ref|NP_051203.1| hypothetical protein BB_P42 [Borrelia burgdorferi B31] gi|6382084|gb|AAF07402.1|AE001575_3 conserved hypothetical protein [Borrelia burgdorferi B31] Length = 450 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 54/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y+ T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYKFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|11497292|ref|NP_051420.1| hypothetical protein BB_L43 [Borrelia burgdorferi B31] gi|6382313|gb|AAF07626.1|AE001580_11 conserved hypothetical protein [Borrelia burgdorferi B31] Length = 450 Score = 45.9 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 54/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y+ T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYKFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|163792602|ref|ZP_02186579.1| hypothetical protein BAL199_17183 [alpha proteobacterium BAL199] gi|159182307|gb|EDP66816.1| hypothetical protein BAL199_17183 [alpha proteobacterium BAL199] Length = 422 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 51/260 (19%), Positives = 87/260 (33%), Gaps = 28/260 (10%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWF 141 I AGRG GKT A + L + I +A + ++ + + +L Sbjct: 45 ILAGRGFGKTRTGAEWVRGLAESGRARRIALVAETAADARDVM---IEGESGLLAC---- 97 Query: 142 EMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEAS 201 + W E S + ++S + PD GP A + DE + Sbjct: 98 --------CAPWGRPKYEPSKRRVTWPNGAIATSFSADDPDQLRGPQFD---AAWADEIA 146 Query: 202 G--TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVE 259 + +LG +P + T+ + W + P Sbjct: 147 KWRYEAAWDNLMLGLRLGADP--RCVATTTP-KPRAWLARLMADP-GTVVTRGATRENAG 202 Query: 260 GIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLI 319 + GF + I++RY + + R EI G+F + IE A + A +I Sbjct: 203 NLAPGFLDQILARY-AGTRLGRQEIDGEFLTEIPGALWTRTLIEGARALPGAVPGLARII 261 Query: 320 MGCDIA---GEGGDKTVVVF 336 + D A G D+T +V Sbjct: 262 VAVDPAVTSGSDSDETGIVV 281 >gi|254160843|ref|YP_003043951.1| hypothetical protein ECB_00733 [Escherichia coli B str. REL606] gi|253972744|gb|ACT38415.1| conserved hypothetical protein [Escherichia coli B str. REL606] Length = 540 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 58/175 (33%), Gaps = 13/175 (7%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 DEA+ + I ++ R + + N + F Sbjct: 195 DRTTLYLVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMANPFAQ--KRHGGKIP 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMS- 307 + R D ++ + +D+ V + L + IP +++ A+ Sbjct: 249 VFTFHWRDDPRKDEEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDA 306 Query: 308 -REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS--AKLIQETNQE 359 + + D+A EG DK R G ++E++ +WS I ++ ++ Sbjct: 307 HIKLGIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEK 361 >gi|268589862|ref|ZP_06124083.1| phage terminase, large subunit, PBSX family [Providencia rettgeri DSM 1131] gi|291314845|gb|EFE55298.1| phage terminase, large subunit, PBSX family [Providencia rettgeri DSM 1131] Length = 470 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 33/267 (12%), Positives = 75/267 (28%), Gaps = 21/267 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + G + +EA Sbjct: 68 TIEREGYNNEFEIQRTMIKHLGTGAEFMFYGIKNNPTKIKSLEGVD-----VCWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + N W+ + L+ + P +D + Sbjct: 123 VTKESWDILIPTIRKPNSE-IWVSFNPKNILDDTYQRFVVNPPDDICLLTANYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + + + R LG+ I ++E A ++ +I Sbjct: 182 DVLRLEMEECKRKNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAIIA 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFD 347 D + GGD R G++++ I + Sbjct: 242 THDPSDVGGDAKGYAMRHGSVVKRISE 268 >gi|194430118|ref|ZP_03062621.1| gp33 TerL [Escherichia coli B171] gi|215487586|ref|YP_002330017.1| predicted terminase, large subunit [Escherichia coli O127:H6 str. E2348/69] gi|260845222|ref|YP_003223000.1| putative terminase large subunit [Escherichia coli O103:H2 str. 12009] gi|194411828|gb|EDX28147.1| gp33 TerL [Escherichia coli B171] gi|215265658|emb|CAS10061.1| predicted terminase, large subunit [Escherichia coli O127:H6 str. E2348/69] gi|257760369|dbj|BAI31866.1| predicted terminase large subunit [Escherichia coli O103:H2 str. 12009] gi|309702924|emb|CBJ02255.1| putative phage gp33 TerL [Escherichia coli ETEC H10407] gi|323159191|gb|EFZ45181.1| gp33 TerL [Escherichia coli E128010] Length = 540 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 58/175 (33%), Gaps = 13/175 (7%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 DEA+ + I ++ R + + N + F Sbjct: 195 DRTTLYLVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMANPFAQ--KRHGGKIP 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMS- 307 + R D ++ + +D+ V + L + IP +++ A+ Sbjct: 249 VFTFHWRDDPRKDEEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDA 306 Query: 308 -REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS--AKLIQETNQE 359 + + D+A EG DK R G ++E++ +WS I ++ ++ Sbjct: 307 HIKLGIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEK 361 >gi|312147626|gb|ADQ30287.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] Length = 450 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATALHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|291283815|ref|YP_003500633.1| hypothetical protein G2583_3121 [Escherichia coli O55:H7 str. CB9615] gi|290763688|gb|ADD57649.1| hypothetical protein G2583_3121 [Escherichia coli O55:H7 str. CB9615] Length = 540 Score = 45.9 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 58/175 (33%), Gaps = 13/175 (7%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 DEA+ + I ++ R + + N + F Sbjct: 195 DRTTLYLVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMANPFAQ--KRHGGKIP 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMS- 307 + R D ++ + +D+ V + L + IP +++ A+ Sbjct: 249 VFTFHWRDDPRKDEEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDA 306 Query: 308 -REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS--AKLIQETNQE 359 + + D+A EG DK R G ++E++ +WS I ++ ++ Sbjct: 307 HIKLGIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEK 361 >gi|297848822|ref|XP_002892292.1| hypothetical protein ARALYDRAFT_470549 [Arabidopsis lyrata subsp. lyrata] gi|297338134|gb|EFH68551.1| hypothetical protein ARALYDRAFT_470549 [Arabidopsis lyrata subsp. lyrata] Length = 1406 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 51/158 (32%), Gaps = 15/158 (9%) Query: 41 KGKPLEHFSQPHRW----QLEFMEAVDVHCHSNVNNSNPTIFKCAISAG-----R--GIG 89 +G + Q E E + + + + F+ + G G G Sbjct: 805 EGTVWDKIPGVKSQMYPHQQEGFEFIWKNLAGTILLNELKDFENSDETGGCIMSHAPGTG 864 Query: 90 KTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWA-EVSKWLSMLPHRHWFEMQSLSL 148 KT L + + P + IA + L WA E KW +P + + Sbjct: 865 KTRLTIIFLQAYLQCFPDCKPVIIAPASLLL---TWAEEFKKWNISIPFHNLSSLDFTGK 921 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVG 186 S L++++ S + + YS + + +G Sbjct: 922 ESSAALGLLMQKNATARSNNEIRMVKIYSWIKSKSILG 959 >gi|226246703|ref|YP_002776000.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi Bol26] gi|226202392|gb|ACO38050.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi Bol26] Length = 450 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 49/307 (15%), Positives = 97/307 (31%), Gaps = 42/307 (13%) Query: 44 PLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLIS 103 L +F + QL ++ +V NN I I++ GKT L ++ L + Sbjct: 33 SLINFDKFEEKQL-TLKQKNVIKSIKKNNEKKIILSGGIAS----GKTYLACYLFLKSLI 87 Query: 104 -----TRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELL 158 + I NS+ ++ + + K + ++ + + + Y + Sbjct: 88 ENKKLYSSDTNNFIIGNSQRSVEVNVLGQFEKLCKL------LKIPYIPRYTNNSYILID 141 Query: 159 EQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTEL 218 + + + F G ++ +F +EA+ + +L Sbjct: 142 SLRINLYGGDKASDF--------ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RC 189 Query: 219 NPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSD 278 T N +F + + +K Y T + GF E Y D Sbjct: 190 GQETIIFDT-NPDHPEHYFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIP 247 Query: 279 VARIEI-LGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF 336 + + LG++ + F N + D ++ I D A GGD T + Sbjct: 248 SYKARVLLGEWIASTDSIFTQINITD--------DYVFTSPIAYLDPAFSVGGDNTALCV 299 Query: 337 --RRGNI 341 R + Sbjct: 300 MERVDDK 306 >gi|324114526|gb|EGC08494.1| hypothetical protein ERIG_00518 [Escherichia fergusonii B253] Length = 540 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 58/175 (33%), Gaps = 13/175 (7%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 DEA+ + I ++ R + + N + F Sbjct: 195 DRTTLYLVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMANPFAQ--KRHGGKIP 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMS- 307 + R D ++ + +D+ V + L + IP +++ A+ Sbjct: 249 VFTFHWRDDPRKDEEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDA 306 Query: 308 -REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS--AKLIQETNQE 359 + + D+A EG DK R G ++E++ +WS I ++ ++ Sbjct: 307 HIKLGIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEK 361 >gi|300824951|ref|ZP_07105051.1| conserved hypothetical protein [Escherichia coli MS 119-7] gi|300522580|gb|EFK43649.1| conserved hypothetical protein [Escherichia coli MS 119-7] Length = 540 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 58/175 (33%), Gaps = 13/175 (7%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 DEA+ + I ++ R + + N + F Sbjct: 195 DRTTLYLVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMANPFAQ--KRHGGKIP 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMS- 307 + R D ++ + +D+ V + L + IP +++ A+ Sbjct: 249 VFTFHWRDDPRKDEEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDA 306 Query: 308 -REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS--AKLIQETNQE 359 + + D+A EG DK R G ++E++ +WS I ++ ++ Sbjct: 307 HIKLGIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEK 361 >gi|297520464|ref|ZP_06938850.1| hypothetical protein EcolOP_22727 [Escherichia coli OP50] Length = 313 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 20/109 (18%), Positives = 43/109 (39%), Gaps = 7/109 (6%) Query: 256 RTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMS--REAID 312 R D ++ + +D+ V + L + IP +++ A+ + Sbjct: 28 RDDPRKDEEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAHIKLGI 85 Query: 313 DLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS--AKLIQETNQE 359 + D+A EG DK R G ++E++ +WS I ++ ++ Sbjct: 86 QPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEK 134 >gi|195942433|ref|ZP_03087815.1| hypothetical protein Bbur8_06259 [Borrelia burgdorferi 80a] Length = 450 Score = 45.5 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 54/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPLYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|219723069|ref|YP_002474484.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|219693000|gb|ACL34209.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|312147710|gb|ADQ30370.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] Length = 450 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHRQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 INNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|255929035|ref|YP_003097347.1| DNA terminase packaging enzyme large subunit [Synechococcus phage S-RSM4] gi|255705321|emb|CAR63310.1| DNA terminase packaging enzyme large subunit [Synechococcus phage S-RSM4] Length = 550 Score = 45.5 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 52/344 (15%), Positives = 99/344 (28%), Gaps = 61/344 (17%) Query: 11 LEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNV 70 ++++ E + + F ++ P + + +Q E + D H + Sbjct: 24 TKKQIDEWIKCKNDPIYFAMNYIQIISLDEGLVPFKMYD----FQKEILR--DFHENRFN 77 Query: 71 NNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSK 130 P GK+T +L+ ++I +AN + E+ Sbjct: 78 IAKLPRQ----------TGKSTTVVAYLLYYAIFYDSVNIGILANKAS-----TARELLG 122 Query: 131 WLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNT 190 L + MQ L + EL S + + R S Sbjct: 123 RLQLAYENLPKWMQHGILVWNKGNVELENGSKILAASTSASAVRGMSFN----------- 171 Query: 191 HGMAVFNDEASGTPDII----NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF---NI 243 +F DE + P+ + S+ T + I+ S +N FY ++ Sbjct: 172 ---ILFLDEFAFVPNHVAEQFFASVYPTITS-GKSTKVIIISTPNGMN-HFYKMWEDARR 226 Query: 244 PLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDV-ARIEILGQFPQQEVNNFIPHNYI 302 D+ ++ V G D+ + E I S E F + I + Sbjct: 227 GKNDYVTNEVHWSQVPGRDAKWKEETIKN---TSPRQFAQEFECDF-LGSADTLISPAKL 282 Query: 303 E-----------EAMSREAIDDLYAPLIMGCDIAGE-GGDKTVV 334 + + I+ D+A GGD + Sbjct: 283 QNIPFHDPIQSNAGLDVYERVQKDHEYIITVDVARGIGGDYSAF 326 >gi|111074104|ref|YP_709233.1| hypothetical protein BAPKO_4029 [Borrelia afzelii PKo] gi|110891215|gb|ABH02376.1| hypothetical protein BAPKO_4029 [Borrelia afzelii PKo] Length = 450 Score = 45.5 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F + Sbjct: 212 IDNVATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDSIFTQID 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITQ--------DYVFTSPIAYLDPAFSIGGDNTALCVMERIDDK 306 >gi|312148837|gb|ADQ31485.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] Length = 450 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|312148805|gb|ADQ31454.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] Length = 450 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|312147637|gb|ADQ30298.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] Length = 450 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|312147604|gb|ADQ30266.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] Length = 450 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|224590670|ref|YP_002640676.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi WI91-23] gi|224553765|gb|ACN55167.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi WI91-23] Length = 450 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|224983785|ref|YP_002641105.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi WI91-23] gi|224553986|gb|ACN55383.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi WI91-23] Length = 450 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|195942842|ref|ZP_03088224.1| hypothetical protein Bbur8_08565 [Borrelia burgdorferi 80a] gi|312150044|gb|ADQ30103.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi N40] Length = 450 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|225622041|ref|YP_002724986.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 94a] gi|225546350|gb|ACN92359.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 94a] Length = 450 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|225576422|ref|YP_002725451.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 118a] gi|225547005|gb|ACN92996.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 118a] Length = 450 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|226322171|ref|ZP_03797692.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi Bol26] gi|226232426|gb|EEH31184.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi Bol26] Length = 450 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|224022662|ref|YP_002606275.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 64b] gi|224593632|ref|YP_002640950.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi CA-11.2a] gi|223929246|gb|ACN23964.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 64b] gi|224554688|gb|ACN56067.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi CA-11.2a] Length = 450 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|219723193|ref|YP_002474612.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|224591572|ref|YP_002640899.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi CA-11.2a] gi|219693035|gb|ACL34243.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|224554907|gb|ACN56281.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi CA-11.2a] Length = 450 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|322420465|ref|YP_004199688.1| hypothetical protein GM18_2968 [Geobacter sp. M18] gi|320126852|gb|ADW14412.1| hypothetical protein GM18_2968 [Geobacter sp. M18] Length = 507 Score = 45.1 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 60/204 (29%), Gaps = 13/204 (6%) Query: 85 GRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQ 144 GR +GK+ + + L T G + A + L + E+ L P Sbjct: 55 GRDVGKSIVLSTDALHYAFTTRGGQGLIAAPHQGHLDTIIE-EIEFQLDTNPDLMNSIAL 113 Query: 145 SLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTP 204 + P+ S Y Y D F H V+ DE + Sbjct: 114 TKYGKPNIHRKPYFRLEFTNGSVLYFRPAGAYG----DAFRSLHVGR---VWVDEGAWLT 166 Query: 205 DIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSG 264 + K++ + T N R ++ + + ++ + Sbjct: 167 ERAWKALRQCL-KAGGTLRIYSTPNGLRDTTYYRLT---SSDQFHVFRWPSWLNPLWTED 222 Query: 265 FHEGIISRYG-LDSDVARIEILGQ 287 ++ YG DS + E+ G+ Sbjct: 223 REAELLEFYGGRDSSGWQHEVAGE 246 >gi|191172603|ref|ZP_03034142.1| gp33 TerL [Escherichia coli F11] gi|190907076|gb|EDV66676.1| gp33 TerL [Escherichia coli F11] gi|324014340|gb|EGB83559.1| hypothetical protein HMPREF9533_01599 [Escherichia coli MS 60-1] Length = 540 Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 53/164 (32%), Gaps = 11/164 (6%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 DEA+ + I ++ R + + N + F Sbjct: 195 DRTTLYLVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMANPFAQ--KRHGGKIP 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMS- 307 + R D ++ + +D+ V + L + IP +++ A+ Sbjct: 249 VFTFHWRDDPRKDEEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDA 306 Query: 308 -REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 + + D+A EG DK R G ++E++ +WS Sbjct: 307 HIKLGIQPTGKRLGAMDVADEGRDKNAFSTRHGFLLENVREWSG 350 >gi|203288341|ref|YP_002223391.1| bsr protein [Borrelia recurrentis A1] gi|201085561|gb|ACH95134.1| bsr protein [Borrelia recurrentis A1] Length = 412 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 47/291 (16%), Positives = 93/291 (31%), Gaps = 49/291 (16%) Query: 55 QLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLIST-----RPGMS 109 Q E + +D + S + IF IS+ GKT L +++++ L+ + Sbjct: 11 QKEVLRDIDNNFCSKI------IFNGGISS----GKTFLASYLLIKLLIINRDNYHKDTN 60 Query: 110 IICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHY 169 +S L ++ K S+L + LL+ S + Sbjct: 61 NFIFGSSIGTLLANTLKQIEKICSLLNIEY-----------------LLKDSRQVTCTIA 103 Query: 170 TITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSN 229 +T Y + D+F ++ V+ +EA+ I+ + P T N Sbjct: 104 GLTLNIYGGKNIDSFTKIRGSNSALVYVNEATLMHKETLLEIMKRLRQK-PGIIIFDT-N 161 Query: 230 TRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQF 288 +F + + ++ Y F + + Y + + + LG++ Sbjct: 162 PDHPAHYFKVDYIDNRDVYRTYNFSIYDNPLNSKDFIKTQEAIY-KNLSAYKARVLLGEW 220 Query: 289 PQQEVNNFIPHNYIEEAMSREAIDDLY--APLIMGCDIAGE-GGDKTVVVF 336 I+ + ++ Y IM D A G D T + Sbjct: 221 ----------TASIDSCFNEVILNCEYTFKSPIMYIDPAFSVGMDNTAICV 261 >gi|226315871|ref|YP_002776346.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi Bol26] gi|226202080|gb|ACO37753.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi Bol26] Length = 450 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 49/304 (16%), Positives = 95/304 (31%), Gaps = 42/304 (13%) Query: 47 HFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTR- 105 +F + QL ++ +V NN I I++ GKT L ++ L + Sbjct: 36 NFDKFEEKQL-TLKQKNVIKSIKKNNEKKIILSGGIAS----GKTYLACYLFLKSLIANK 90 Query: 106 ----PGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQS 161 + I NS+ ++ + + K ++ + H + Y + Sbjct: 91 NLYSSDTNNFIIGNSQRSVEVNVLGQFEKLCKR------LKIPYIPRHTNNSYILIDSLR 144 Query: 162 MGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPN 221 + + + F G ++ +F +EA+ + +L Sbjct: 145 INLYGGDKASDF--------ERFRG---SNSALIFVNEATTLHRQTLEEVLKRL-RCGQE 192 Query: 222 RFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVAR 281 T N +F + + +K Y T + GF E Y D + Sbjct: 193 TIIFDT-NPDHPEHYFKTDYIDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYK 250 Query: 282 IEI-LGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--R 337 + LG++ + F N + D ++ I D A GGD T + R Sbjct: 251 ARVLLGEWIASTDSIFTQINITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMER 302 Query: 338 RGNI 341 + Sbjct: 303 VDDK 306 >gi|58532911|ref|YP_195134.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-PM2] gi|58331378|emb|CAF34164.1| terminase DNA packaging enzyme large subunit [Synechococcus phage S-PM2] Length = 548 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 55/343 (16%), Positives = 110/343 (32%), Gaps = 59/343 (17%) Query: 11 LEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNV 70 ++++ E + A + F ++ P + +Q E + + H + Sbjct: 23 TKEQVKEWIKCANDPVYFTKNYVKIVSLDEGLVPFKM----WDFQEELI--MKFHKNRFN 76 Query: 71 NNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSK 130 P GK+T +L + ++I +AN + ++ L A ++ Sbjct: 77 IAKLPRQ----------TGKSTTVVSYLLHYLIFNDNVNIGILANKASTARDLL-ARLAT 125 Query: 131 WLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNT 190 LP W +Q + + EL S + + R S Sbjct: 126 AYENLPK--W--IQQGVVVWNKGNIELENGSKILAASTSASAVRGMSFN----------- 170 Query: 191 HGMAVFNDEASGTPDII----NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF---NI 243 +F DE + P+ I S+ T + I+ S + +N FY ++ Sbjct: 171 ---IIFLDEFAFVPNHIADSFFASVYPTITS-GKSTKVIIISTPQGMN-HFYKMWVDATN 225 Query: 244 PLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIE 303 + +++ V G D + E I E +F V+ I + ++ Sbjct: 226 GRNGYTFHEVHWSQVPGRDEKWKEETIKNTSERQ--FTQEFECEF-LGSVDTLIAASKLK 282 Query: 304 E-----AMSREAIDDLYAPL------IMGCDIAGE-GGDKTVV 334 + R D+Y +M D++ GGD + Sbjct: 283 ALVFNDPIKRNKGLDIYEEPKEKSEYLMTVDVSRGIGGDYSAF 325 >gi|302343251|ref|YP_003807780.1| hypothetical protein Deba_1821 [Desulfarculus baarsii DSM 2075] gi|301639864|gb|ADK85186.1| conserved hypothetical protein [Desulfarculus baarsii DSM 2075] Length = 507 Score = 45.1 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 67/204 (32%), Gaps = 13/204 (6%) Query: 85 GRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQ 144 GR +GK+ + + L T G + A + L + E+ L P M Sbjct: 55 GRDVGKSIVLSTDALHYAFTTRGGQGLIAAPHQGHLDTIIE-EIEFQLDTNPD----LMN 109 Query: 145 SLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTP 204 S++L G + ++ + ++ + D F H V+ DE + Sbjct: 110 SIALTKYGKPKIHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLT 166 Query: 205 DIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSG 264 + K++ + T N R ++ + + ++ + Sbjct: 167 ERAWKALRQCL-KAGGTLRIYSTPNGLRDTTYYRLT---SSDQFHVFRWPSWLNPLWTED 222 Query: 265 FHEGIISRYG-LDSDVARIEILGQ 287 ++ YG DS + E+ G+ Sbjct: 223 REAELLEFYGGRDSSGWQHEVAGE 246 >gi|116751218|ref|YP_847905.1| hypothetical protein Sfum_3801 [Syntrophobacter fumaroxidans MPOB] gi|116700282|gb|ABK19470.1| conserved hypothetical protein [Syntrophobacter fumaroxidans MPOB] Length = 507 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 67/204 (32%), Gaps = 13/204 (6%) Query: 85 GRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQ 144 GR +GK+ + + L T G + A + L + E+ L P M Sbjct: 55 GRDVGKSIVLSTDALHYAFTTRGGQGLIAAPHQGHLDTIIE-EIEFQLDSNPD----LMN 109 Query: 145 SLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTP 204 S++L G + ++ + ++ + D F H V+ DE + Sbjct: 110 SIALTKYGKPKIHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLT 166 Query: 205 DIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSG 264 + K++ + T N R ++ + + ++ + Sbjct: 167 ERAWKALRQCL-KAGGTLRIYSTPNGLRDTTYYRLT---SSDQFHVFRWPSWLNPLWTED 222 Query: 265 FHEGIISRYG-LDSDVARIEILGQ 287 ++ YG DS + E+ G+ Sbjct: 223 REAELLEFYGGRDSSGWQHEVAGE 246 >gi|194445851|ref|YP_002040314.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Newport str. SL254] gi|194404514|gb|ACF64736.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Newport str. SL254] Length = 540 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 61/176 (34%), Gaps = 15/176 (8%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 DEA+ + I ++ R + + N + F Sbjct: 195 DRTTLYLVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMANPFAQ--KRHGGKIP 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMS- 307 + R+ D ++ + +D+ V + L + IP ++++ A+ Sbjct: 249 VFTFHWRSDPRKDDEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSDWVQAAVDA 306 Query: 308 --REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS--AKLIQETNQE 359 R I L D+A EG DK R G ++E++ +WS I ++ ++ Sbjct: 307 HIRLGIQPTGKRLGA-MDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEK 361 >gi|62181180|ref|YP_217597.1| hypothetical protein SC2610 [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|62128813|gb|AAX66516.1| orf, partial conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|322715669|gb|EFZ07240.1| hypothetical protein SCA50_2790 [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 540 Score = 45.1 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 61/176 (34%), Gaps = 15/176 (8%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 DEA+ + I ++ R + + N + F Sbjct: 195 DRTTLYLVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMANPFAQ--KRHGGKIP 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMS- 307 + R+ D ++ + +D+ V + L + IP ++++ A+ Sbjct: 249 VFTFHWRSDPRKDDEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSDWVQAAVDA 306 Query: 308 --REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS--AKLIQETNQE 359 R I L D+A EG DK R G ++E++ +WS I ++ ++ Sbjct: 307 HIRLGIQPTGKRLGA-MDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEK 361 >gi|291336011|gb|ADD95601.1| large terminase protein [uncultured phage MedDCM-OCT-S09-C7] Length = 526 Score = 45.1 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 42/274 (15%), Positives = 93/274 (33%), Gaps = 48/274 (17%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAE-VSKWLSMLPHRHW 140 + A R GK+ + +LW + P +++ +AN K + E +++ ++ML + Sbjct: 80 VLASRQSGKSITSCAYLLWFLLFNPEVTVAVLAN-----KGAIAREMIARMVTMLESVPF 134 Query: 141 FEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEA 200 F + + + E S + + + + R S ++ DE Sbjct: 135 FLQPGVKI-LNKGSIEFANDSKVVAAATSSSSIRGLSIN--------------LLYLDEF 179 Query: 201 SGTPDI-INKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED---WKRYQIDTR 256 + D + + I+TS + FY I+ + D +K + I+ Sbjct: 180 AFVDDAETFYTATYPVVTSGKDSKVIITSTANGVGNMFYKIYESAVHDQSEYKHFLINWF 239 Query: 257 TVEGIDSGFHE---------GIISRYG------LDSDVARIEILGQFPQQEVNNFIPHNY 301 V G D + + YG ++ + +LG ++ ++ Sbjct: 240 DVPGRDEEWKKETIANTSEAQFEQEYGNSFLGTGNTLINSNTLLGLMSKE-------PDW 292 Query: 302 IEEAMSREAIDDLYAPLIMGCDIA-GEGGDKTVV 334 ++ + I D++ G G D + Sbjct: 293 NKDGVKVYEKPKEGHTYITTVDVSKGRGIDYSTF 326 >gi|332884414|gb|EGK04674.1| hypothetical protein HMPREF9456_03377 [Dysgonomonas mossii DSM 22836] Length = 450 Score = 44.7 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 47/152 (30%), Gaps = 14/152 (9%) Query: 197 NDEASGTPDIINKSILGFFTELNPNRFWI--MTS--NTRRL--NGWFYDIFNIPL--EDW 248 DE S + ++ I M N + FY + +D Sbjct: 133 IDENSQITEKCWNIVMSRIRHDVAKNGLIPKMFGACNPTKNFVYNRFYKPHRDGILPDDK 192 Query: 249 KRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL-GQFPQQEVNNFIPHNYIEEAMS 307 Q +D + E + + ++R +L G++ + + + ++ Y + Sbjct: 193 AFIQALVTDNPFVDKFYIENLKNL----DPISRARLLDGEW-EYDDDPYVLMQYEKIVDL 247 Query: 308 REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRG 339 P M D+A G D T + G Sbjct: 248 FTNSHVSGGPRYMTIDVARLGKDDTTIRIWEG 279 >gi|94497317|ref|ZP_01303888.1| hypothetical protein SKA58_07183 [Sphingomonas sp. SKA58] gi|94423180|gb|EAT08210.1| hypothetical protein SKA58_07183 [Sphingomonas sp. SKA58] Length = 437 Score = 44.7 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 48/259 (18%), Positives = 87/259 (33%), Gaps = 30/259 (11%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 AGRG GKT A + + P I + S + ++ + S L++ PH Sbjct: 58 AGRGFGKTRAGAEWVRGIAEADPAARIALVGASLGEARSVMVEGESGLLAIAPH------ 111 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE---- 199 W ++ + + P+ GP +HG + DE Sbjct: 112 ---------WARPAYAPALRRLTWPNGAVAMLFGAADPEGLRGPQFSHG---WADEIAKW 159 Query: 200 ASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVE 259 ASG + ++G +P T L + +D + T E Sbjct: 160 ASGEA-AWHNLMMGMRLGRDPRVLVTTTPRPVPLV---RSLVARDGDDVVVTRGRTADNE 215 Query: 260 -GIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPL 318 + GF + + YG + + R E+ G+ ++ IE+ + + + Sbjct: 216 ANLAPGFVAAMTAGYG-GTRLGRQELDGELIEEVEGALWTRALIEQC-RVVHVPGVLTRV 273 Query: 319 IMGCD-IAGEGGDKTVVVF 336 ++ D A GGD +V Sbjct: 274 VVAVDPPASVGGDACGIVV 292 >gi|224535035|ref|ZP_03675589.1| phage terminase, large subunit, pbsx family [Borrelia spielmanii A14S] gi|224513696|gb|EEF84036.1| phage terminase, large subunit, pbsx family [Borrelia spielmanii A14S] Length = 379 Score = 44.7 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITQ--------DYVFTSPIAYLDPAFSIGGDNTALCVMERIDDK 306 >gi|216968428|ref|YP_002333693.1| phage terminase, large subunit, pbsx family [Borrelia afzelii ACA-1] gi|216752682|gb|ACJ73366.1| phage terminase, large subunit, pbsx family [Borrelia afzelii ACA-1] Length = 450 Score = 44.7 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITQ--------DYVFTSPIAYLDPAFSIGGDNTALCVMERIDDK 306 >gi|78356952|ref|YP_388401.1| hypothetical protein Dde_1909 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219357|gb|ABB38706.1| hypothetical protein Dde_1909 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 507 Score = 44.7 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 67/204 (32%), Gaps = 13/204 (6%) Query: 85 GRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQ 144 GR +GK+ + + L T G + A + L + E+ L P M Sbjct: 55 GRDVGKSIVLSTDALHYAFTTRGGQGLIAAPHQGHLDTIIE-EIEFQLDTNPD----LMN 109 Query: 145 SLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTP 204 S++L G + ++ + ++ + D F H V+ DE + Sbjct: 110 SIALTKYGKPKIHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLT 166 Query: 205 DIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSG 264 + K++ + T N R ++ E + ++ + Sbjct: 167 ERAWKALRQCL-KAGGTLRIYSTPNGLRDTTYYRLT---SSEQFHVFRWPSWLNPLWTED 222 Query: 265 FHEGIISRYG-LDSDVARIEILGQ 287 ++ YG DS + E+ G+ Sbjct: 223 REAELLEFYGGRDSSGWQHEVAGE 246 >gi|300088757|ref|YP_003759279.1| hypothetical protein Dehly_1680 [Dehalogenimonas lykanthroporepellens BL-DC-9] gi|299528490|gb|ADJ26958.1| conserved hypothetical protein [Dehalogenimonas lykanthroporepellens BL-DC-9] Length = 507 Score = 44.7 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 67/204 (32%), Gaps = 13/204 (6%) Query: 85 GRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQ 144 GR +GK+ + + L T G + A + L + E+ L P M Sbjct: 55 GRDVGKSIVLSTDALHYAFTTRGGQGLIAAPHQGHLDTIIE-EIEFQLDSNPD----LMN 109 Query: 145 SLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTP 204 S++L G + ++ + ++ + D F H V+ DE + Sbjct: 110 SIALTKYGKPKIHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLT 166 Query: 205 DIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSG 264 + K++ + T N R ++ E + ++ + Sbjct: 167 ERAWKALRQCL-KAGGTLRIYSTPNGLRDTTYYRLT---SSEQFHVFRWPSWLNPLWTED 222 Query: 265 FHEGIISRYG-LDSDVARIEILGQ 287 ++ YG DS + E+ G+ Sbjct: 223 REAELLEFYGGRDSSGWQHEVAGE 246 >gi|216997755|ref|YP_002333847.1| phage terminase, large subunit, pbsx family protein [Borrelia afzelii ACA-1] gi|216752400|gb|ACJ73182.1| phage terminase, large subunit, pbsx family protein [Borrelia afzelii ACA-1] Length = 450 Score = 44.7 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDSIFAQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITQ--------DYVFTSPIAYLDPAFSIGGDNTALCVMERIDDK 306 >gi|78355964|ref|YP_387413.1| hypothetical protein Dde_0917 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78218369|gb|ABB37718.1| hypothetical protein Dde_0917 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 507 Score = 44.7 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 67/204 (32%), Gaps = 13/204 (6%) Query: 85 GRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQ 144 GR +GK+ + + L T G + A + L + E+ L P M Sbjct: 55 GRDVGKSIVLSTDALHYAFTTRGGQGLVAAPHQGHLDTIIE-EIEFQLDTNPD----LMN 109 Query: 145 SLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTP 204 S++L G + ++ + ++ + D F H V+ DE + Sbjct: 110 SIALTKYGKPKIHRKPYFRLEFTNGSVLYFRPAGAYGDAFRSLHVGR---VWVDEGAWLT 166 Query: 205 DIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSG 264 + K++ + T N R ++ E + ++ + Sbjct: 167 ERAWKALRQCL-KAGGTLRIYSTPNGLRDTTYYRLT---SSEQFHVFRWPSWLNPLWTED 222 Query: 265 FHEGIISRYG-LDSDVARIEILGQ 287 ++ YG DS + E+ G+ Sbjct: 223 REAELLEFYGGRDSSGWQHEVAGE 246 >gi|195942518|ref|ZP_03087900.1| hypothetical protein Bbur8_06704 [Borrelia burgdorferi 80a] gi|312149990|gb|ADQ30051.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi N40] Length = 450 Score = 44.7 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 30/157 (19%), Positives = 50/157 (31%), Gaps = 16/157 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 +E +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNVETFKTYNFTTYDNVFLSKGFIETQEKLY-KDIPAYKARVLLGEWLASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF 336 E+ M I D GGD T + Sbjct: 271 ITEDYMFTSP--------IAYLDPTFSVGGDNTALCV 299 >gi|216969097|ref|YP_002333737.1| PBSX family phage termninase large subunit [Borrelia afzelii ACA-1] gi|216753027|gb|ACJ73621.1| phage terminase, large subunit, PBSX family [Borrelia afzelii ACA-1] Length = 450 Score = 44.7 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYDFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITQ--------DYVFTSPIAYLDPAFSIGGDNTALCVMERIDDK 306 >gi|211731761|gb|ACJ10100.1| terminase [Bacteriophage APSE-4] Length = 469 Score = 44.3 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 50/183 (27%), Gaps = 38/183 (20%) Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLED-------- 247 + +EA + +++ + W N +G Y F P + Sbjct: 105 WVEEAETVSEKSLDTLIPTIRKPGSE-LWFSF-NPAEEDGAVYRRFVKPYKAIIDKQGYY 162 Query: 248 ------------WKRYQIDTR---TVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQE 292 + + + ++ YG + D Sbjct: 163 EDDEVYVGKVSYLDNPWLPAELKNDAQKMKRENYKKWRHVYGGECDANY----------- 211 Query: 293 VNNFIPHNYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 + I +++ A+ + ++ D A G D+ + R G +IE WS Sbjct: 212 GDALIQPEWVDAAIDAHIKLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSE 271 Query: 351 KLI 353 + Sbjct: 272 GDV 274 >gi|218781804|ref|YP_002433122.1| hypothetical protein Dalk_3968 [Desulfatibacillum alkenivorans AK-01] gi|218763188|gb|ACL05654.1| protein of unknown function DUF264 [Desulfatibacillum alkenivorans AK-01] Length = 443 Score = 44.3 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 60/202 (29%), Gaps = 34/202 (16%) Query: 79 KCAISAGRG-IGKTTLNAWMMLWLISTR----PGMSIICIANSETQLKNTLWAEVSKWLS 133 + ++ GKT A + ++ P IA Q K+ +W + K+ Sbjct: 37 RFSVLVCHRRFGKT--VAAVNELIMKACQNPLPAPRYAYIAPLYKQAKSVVWDYLKKFAG 94 Query: 134 MLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGM 193 + + H + +L + IT + PD G + + Sbjct: 95 AI--------NGTTFHETELRCDLPNGA--------RITLLGA--DNPDRLRGIYLDGAV 136 Query: 194 AVFNDEASGTPDIIN-KSILGFFTELNPNRFWIMTSNTRRLNGWFYDI--FNIPLEDWKR 250 DE + P+ + + I ++ W M T R + FYD+ F DW Sbjct: 137 L---DEMAQMPERVWGEIIRPALSD---RLGWAMFIGTPRGHNAFYDLYQFARSDPDWFC 190 Query: 251 YQIDTRTVEGIDSGFHEGIISR 272 + + Sbjct: 191 AMYRASETGIVGRDELDAAKKE 212 >gi|224582844|ref|YP_002636642.1| hypothetical protein SPC_1035 [Salmonella enterica subsp. enterica serovar Paratyphi C strain RKS4594] gi|224467371|gb|ACN45201.1| hypothetical protein SPC_1035 [Salmonella enterica subsp. enterica serovar Paratyphi C strain RKS4594] Length = 540 Score = 44.3 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 61/176 (34%), Gaps = 15/176 (8%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 DEA+ + I ++ R + + N + F Sbjct: 195 DRTTLYLVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMANPFAQ--KRHGGKIP 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMS- 307 + R+ D ++ + +D+ V + L + IP ++++ A+ Sbjct: 249 VFTFHWRSDPRKDDEWYRRECEK--IDNPVVVAQELDLNYSASAEGILIPSDWVQAAVDA 306 Query: 308 --REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS--AKLIQETNQE 359 R I L D+A EG DK R G ++E++ +WS I ++ ++ Sbjct: 307 HIRLGIQPTGKRLGA-MDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEK 361 >gi|308173233|ref|YP_003919938.1| PBSX terminase large subunit [Bacillus amyloliquefaciens DSM 7] gi|307606097|emb|CBI42468.1| PBSX terminase (large subunit)) [Bacillus amyloliquefaciens DSM 7] gi|328553846|gb|AEB24338.1| PBSX terminase (large subunit) [Bacillus amyloliquefaciens TA208] gi|328911299|gb|AEB62895.1| PBSX terminase (large subunit) [Bacillus amyloliquefaciens LL3] Length = 432 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 59/198 (29%), Gaps = 30/198 (15%) Query: 179 ERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFY 238 + P H H ++ +E S K ++G + I T+N + W Y Sbjct: 105 DNPAKLKSIH--HISLIWIEECSEVKYEGFKELIGRLRHPELSLHMICTTNPVGTSNWTY 162 Query: 239 DIFNIPLEDWKR--------------------YQIDTRTVEGIDSGFHEGI--ISRYGLD 276 F + + + + + + + + +Y D Sbjct: 163 RHFFRDEQKKRFVLDDHTLYEKGTVVKGDTYYHHSTACDNLFLLKSYIKQLDSLRQY--D 220 Query: 277 SDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPL-IMGCDIAGEGGDKTVVV 335 D+ RI GQF + F +E E I + PL G D E V+ Sbjct: 221 PDLYRIARKGQFGVNGIRVFPQFQVMEHTEVTERIAAIRRPLFRTGMDFGFEESYNAVIR 280 Query: 336 FRRGNIIEHIF---DWSA 350 + ++ ++ Sbjct: 281 LAVDPDKKELYIFWEYYK 298 >gi|312201565|gb|ADQ44863.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 297] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|312201416|gb|ADQ44721.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 297] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|312201279|gb|ADQ44587.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 297] gi|312201518|gb|ADQ44817.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 297] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|312201145|gb|ADQ44458.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 297] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|312148787|gb|ADQ31437.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|312147565|gb|ADQ30229.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|224022952|ref|YP_002606442.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 64b] gi|223929838|gb|ACN24543.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 64b] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|224022912|ref|YP_002606399.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 64b] gi|223929322|gb|ACN24038.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 64b] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|221316912|ref|YP_002533066.1| PBSX family phage terminase large subunit [Borrelia burgdorferi 72a] gi|221237378|gb|ACM10217.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 72a] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|219722941|ref|YP_002474367.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|219692617|gb|ACL33836.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|219723152|ref|YP_002474571.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|219692773|gb|ACL33988.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|224022879|ref|YP_002606358.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 64b] gi|224590757|ref|YP_002640761.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi WI91-23] gi|224593734|ref|YP_002641063.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi WI91-23] gi|226246755|ref|YP_002776089.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 29805] gi|223929807|gb|ACN24513.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 64b] gi|224553954|gb|ACN55352.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi WI91-23] gi|224554038|gb|ACN55434.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi WI91-23] gi|226201931|gb|ACO38514.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 29805] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|195942876|ref|ZP_03088258.1| hypothetical protein Bbur8_08745 [Borrelia burgdorferi 80a] gi|312149906|gb|ADQ29970.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi N40] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|195942593|ref|ZP_03087975.1| hypothetical protein Bbur8_07129 [Borrelia burgdorferi 80a] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|225576150|ref|YP_002725083.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 94a] gi|225546143|gb|ACN92158.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 94a] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|225575886|ref|YP_002724729.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 94a] gi|225546289|gb|ACN92300.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 94a] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|221316807|ref|YP_002527718.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 72a] gi|225576280|ref|YP_002725297.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 118a] gi|221237285|gb|ACM10136.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 72a] gi|225547220|gb|ACN93206.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 118a] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|56561122|ref|YP_161529.1| hypothetical protein BGP243 [Borrelia garinii PBi] gi|226322231|ref|ZP_03797750.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi Bol26] gi|52696759|gb|AAU86094.1| hypothetical protein BGP243 [Borrelia garinii PBi] gi|226232381|gb|EEH31141.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi Bol26] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|11497214|ref|NP_051333.1| hypothetical protein BB_M42 [Borrelia burgdorferi B31] gi|223987696|ref|YP_002601254.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 64b] gi|225575916|ref|YP_002724772.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 118a] gi|225576096|ref|YP_002724941.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 94a] gi|6382235|gb|AAF07550.1|AE001578_21 conserved hypothetical protein [Borrelia burgdorferi B31] gi|223929409|gb|ACN24123.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 64b] gi|225546099|gb|ACN92115.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 94a] gi|225546556|gb|ACN92560.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 118a] Length = 450 Score = 44.3 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|167993618|ref|ZP_02574712.1| gp33 TerL [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|205328294|gb|EDZ15058.1| gp33 TerL [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] Length = 539 Score = 44.3 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 56/165 (33%), Gaps = 13/165 (7%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 DEA+ + I ++ R + + N + F Sbjct: 195 DRTTLYLVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMANPFAQ--KRHGGKIP 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMS- 307 + R+ D ++ + +D+ V + L + IP ++++ A+ Sbjct: 249 VFTFHWRSDPRKDDEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSDWVQAAVDA 306 Query: 308 --REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 R I L D+A EG DK R G ++E++ +WS Sbjct: 307 HIRLGIQPTGKRLGA-MDVADEGRDKNAFSTRHGFLLENVREWSG 350 >gi|225575989|ref|YP_002724899.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 118a] gi|225546587|gb|ACN92590.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 118a] Length = 450 Score = 44.3 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNISTFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|319950568|ref|ZP_08024477.1| hypothetical protein ES5_13328 [Dietzia cinnamea P4] gi|319435762|gb|EFV90973.1| hypothetical protein ES5_13328 [Dietzia cinnamea P4] Length = 536 Score = 44.3 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 52/172 (30%), Gaps = 6/172 (3%) Query: 56 LEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIAN 115 + +A+ S + + + + I G G GKT L + +R G + I Sbjct: 200 ADIADAL-TREQSVLLRAVDALPRVEIRGGAGSGKTYL--ALEQARRLSRDGQRVALICY 256 Query: 116 SETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRT 175 S L + L + W + E +L + + + Sbjct: 257 SHG-LASYLRRVTNGWKRRERPAYVGEFHALGVEWGAPAGPDERIRSAESVRWWEEELPR 315 Query: 176 YSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMT 227 E D G AV DEA D + ++G + R I+ Sbjct: 316 LMSELADGLPG--GRRFDAVIVDEAQDFADSWWQPVIGALRDRENGRLMIVG 365 >gi|145335142|ref|NP_172040.2| chr31 (chromatin remodeling 31); ATP binding / DNA binding / helicase/ nucleic acid binding [Arabidopsis thaliana] gi|332189724|gb|AEE27845.1| chromatin remodeling 31 [Arabidopsis thaliana] Length = 1410 Score = 44.3 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 51/158 (32%), Gaps = 15/158 (9%) Query: 41 KGKPLEHFSQPHRW----QLEFMEAVDVHCHSNVNNSNPTIFKCAISAG-----R--GIG 89 +G + Q E E + + + + F+ + G G G Sbjct: 809 EGTVWDKIPGVKSQMYPHQQEGFEFIWKNLAGTIMLNELKDFENSDETGGCIMSHAPGTG 868 Query: 90 KTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWA-EVSKWLSMLPHRHWFEMQSLSL 148 KT L + + P + IA + L WA E KW +P + + Sbjct: 869 KTRLTIIFLQAYLQCFPDCKPVIIAPASLLL---TWAEEFKKWNISIPFHNLSSLDFTGK 925 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVG 186 S L++++ S + + YS + + +G Sbjct: 926 ENSAALGLLMQKNATARSNNEIRMVKIYSWIKSKSILG 963 >gi|110740804|dbj|BAE98499.1| hypothetical protein [Arabidopsis thaliana] Length = 1410 Score = 44.3 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 51/158 (32%), Gaps = 15/158 (9%) Query: 41 KGKPLEHFSQPHRW----QLEFMEAVDVHCHSNVNNSNPTIFKCAISAG-----R--GIG 89 +G + Q E E + + + + F+ + G G G Sbjct: 809 EGTVWDKIPGVKSQMYPHQQEGFEFIWKNLAGTIMLNELKDFENSDETGGCIMSHAPGTG 868 Query: 90 KTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWA-EVSKWLSMLPHRHWFEMQSLSL 148 KT L + + P + IA + L WA E KW +P + + Sbjct: 869 KTRLTIIFLQAYLQCFPDCKPVIIAPASLLL---TWAEEFKKWNISIPFHNLSSLDFTGK 925 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVG 186 S L++++ S + + YS + + +G Sbjct: 926 ENSAALGLLMQKNATARSNNEIRMVKIYSWIKSKSILG 963 >gi|8778726|gb|AAF79734.1|AC005106_15 T25N20.14 [Arabidopsis thaliana] Length = 1465 Score = 44.3 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 51/158 (32%), Gaps = 15/158 (9%) Query: 41 KGKPLEHFSQPHRW----QLEFMEAVDVHCHSNVNNSNPTIFKCAISAG-----R--GIG 89 +G + Q E E + + + + F+ + G G G Sbjct: 864 EGTVWDKIPGVKSQMYPHQQEGFEFIWKNLAGTIMLNELKDFENSDETGGCIMSHAPGTG 923 Query: 90 KTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWA-EVSKWLSMLPHRHWFEMQSLSL 148 KT L + + P + IA + L WA E KW +P + + Sbjct: 924 KTRLTIIFLQAYLQCFPDCKPVIIAPASLLL---TWAEEFKKWNISIPFHNLSSLDFTGK 980 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVG 186 S L++++ S + + YS + + +G Sbjct: 981 ENSAALGLLMQKNATARSNNEIRMVKIYSWIKSKSILG 1018 >gi|224591489|ref|YP_002640832.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi CA-11.2a] gi|224554623|gb|ACN56003.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi CA-11.2a] Length = 450 Score = 44.3 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKIDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|221641598|ref|YP_002527783.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 72a] gi|225622087|ref|YP_002725040.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 118a] gi|221237550|gb|ACM10383.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 72a] gi|225546885|gb|ACN92880.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 118a] Length = 450 Score = 44.3 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKIDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|219872451|ref|YP_002476937.1| phage terminase, large subunit, pbsx family [Borrelia garinii PBr] gi|219694305|gb|ACL34832.1| phage terminase, large subunit, pbsx family [Borrelia garinii PBr] Length = 450 Score = 44.3 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 45/308 (14%), Positives = 88/308 (28%), Gaps = 46/308 (14%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI----STRP- 106 Q E + ++ H K S G GKT L +++++ + S Sbjct: 46 TAKQKEVLFDIESH----------DYSKVIFSGGIASGKTFLASYLLIKKLIENKSFYEK 95 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 + I NS L ++ K + + + ++ + I Sbjct: 96 DTNNFIIGNSIGLLMTNTIKQIEK------ICGFLGIDYQKKKSGESFCKIAGLELNIYG 149 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIM 226 D+F + ++ +EA+ ++ I Sbjct: 150 GK-----------NRDSFSKIRGGNSAIIYVNEATVIHKETLLEVIKRL--RKGKAIIIF 196 Query: 227 TSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL- 285 +N +F F + +K Y T + F E Y + +L Sbjct: 197 DTNPEGPTHFFKTDFIENKDVFKTYNFTTYDNPLNSADFIETQKKLY-KHLPAYKARVLY 255 Query: 286 GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVFRRGNIIEH 344 G++ E F E +++ + IM D A GGD T + E Sbjct: 256 GEWILNESTLF-----NEMIFNQDY---EFKSPIMYIDPAFSVGGDNTAICVLE-RAFEK 306 Query: 345 IFDWSAKL 352 + + + Sbjct: 307 FYAYIYQD 314 >gi|225576365|ref|YP_002725382.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 118a] gi|225546718|gb|ACN92719.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 118a] Length = 450 Score = 44.3 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 52/164 (31%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDGK 306 >gi|260557981|ref|ZP_05830193.1| phage terminase large subunit [Acinetobacter baumannii ATCC 19606] gi|260408491|gb|EEX01797.1| phage terminase large subunit [Acinetobacter baumannii ATCC 19606] Length = 529 Score = 44.3 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 18/81 (22%), Positives = 28/81 (34%), Gaps = 11/81 (13%) Query: 284 ILGQFPQQEVN---NFIPHNYIEEAMSREAIDDLYAPLI--------MGCDIAGEGGDKT 332 + G F + IP ++E A +R + L G D+A GGD T Sbjct: 289 LYGDFGAGIEDDPWQVIPTEWVEAAQARWKPLEDMRILHRGDFKMDSYGLDVARGGGDNT 348 Query: 333 VVVFRRGNIIEHIFDWSAKLI 353 + R G ++ K Sbjct: 349 IGFARYGYWYDNPNVLEGKDS 369 >gi|11497347|ref|NP_051454.1| hypothetical protein BBN43 [Borrelia burgdorferi B31] gi|6382368|gb|AAF07680.1|AE001581_22 conserved hypothetical protein [Borrelia burgdorferi B31] Length = 450 Score = 44.3 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKIYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYIFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|225621691|ref|YP_002724049.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] gi|225547649|gb|ACN93626.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] Length = 450 Score = 44.3 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 45/293 (15%), Positives = 85/293 (29%), Gaps = 47/293 (16%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI----STRP- 106 Q E + ++ H K S G GKT L +++++ + S Sbjct: 46 TDKQKEVLFDIESH----------DYSKVIFSGGIASGKTFLASYLLVKKLIENKSFYEQ 95 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 + I NS L ++ K S+ + + ++ + I Sbjct: 96 DTNNFIIGNSIGLLMTNTVKQIEKICSL------LGIDYEKKKSGQSFCKIAGLKLNIYG 149 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT-PDIINKSILGFFTELNPNRFWI 225 D F + ++ +EA+ + + + I Sbjct: 150 GK-----------NRDAFSKIRGGNSAIIYVNEATVIHKETLLEVIK--RLRKGKEIIIF 196 Query: 226 MTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL 285 T N +F + + +K Y T + F + Y R +L Sbjct: 197 DT-NPESPAHYFKTDYIENTDVFKTYTFTTYDNPLNSADFIQTQEKLY-RRFPAYRARVL 254 Query: 286 -GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF 336 G++ E F E +++ + IM D A GGD T + Sbjct: 255 YGEWILNESTLF-----NEMIFNQDY---EFKSPIMYIDPAFSVGGDNTAICV 299 >gi|224590701|ref|YP_002640718.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi CA-11.2a] gi|224554531|gb|ACN55913.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi CA-11.2a] Length = 450 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHRQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|218202781|ref|YP_002364699.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi ZS7] gi|218164309|gb|ACK74373.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi ZS7] Length = 450 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHRQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|118470437|ref|YP_885678.1| hypothetical protein MSMEG_1288 [Mycobacterium smegmatis str. MC2 155] gi|118171724|gb|ABK72620.1| conserved hypothetical protein [Mycobacterium smegmatis str. MC2 155] Length = 549 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 44/151 (29%), Gaps = 5/151 (3%) Query: 67 HSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWA 126 + + N+ + + + G G GKT L M + G + + S L + L Sbjct: 209 QAVILNAARLLNRIEVRGGAGSGKTFL--AMEQARRLAQDGQRVALVCYSHG-LASYLER 265 Query: 127 EVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVG 186 + W + E +L + + + + E Sbjct: 266 VTATWPRRQQPAYVGEFHALGVQWGAPEGPDEALRTEQTVQFWEHDLPSQMTELAAQLEP 325 Query: 187 PHNTHGMAVFNDEASGTPDIINKSILGFFTE 217 H AV DEA D +LG + Sbjct: 326 GHRFD--AVVVDEAQDFADAWWDPLLGALHD 354 >gi|291563675|emb|CBL42491.1| phage uncharacterized protein (putative large terminase), C-terminal domain [butyrate-producing bacterium SS3/4] Length = 544 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 46/274 (16%), Positives = 96/274 (35%), Gaps = 38/274 (13%) Query: 6 STDQKLEQE--LHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVD 63 TD++L LH+ ++ A F+++++ W + P + F P E M V Sbjct: 59 ETDKELRSLFMLHKKVLLAAAPFDFESYLLYV-EWERE--PDKKFYVPR---REVMHPVV 112 Query: 64 VHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNT 123 +++ + IS G GK+TL + + W++ P + A+S ++ Sbjct: 113 QAMQDLIDDRLDLL---TISMPPGTGKSTLGIFFLSWVMGRFPDSQSLASAHSGMLTRSF 169 Query: 124 LWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDT 183 + + + + + + ++ + + T+TCR + + Sbjct: 170 ---YDGVYQIITDSEYLWADVFPGVKMAATNSKEETIDLHKKHRFSTLTCRAINA----S 222 Query: 184 FVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNI 243 G + +D SG I ++ ++ W +N D+ + Sbjct: 223 LTGATRCDKILYADDLCSG--------IEEAMSKERLDKLWSAYTN---------DLKSR 265 Query: 244 PLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDS 277 E K I TR + ++YG DS Sbjct: 266 KKEGAKEIHIATRWSVH---DVIGRLENQYGGDS 296 >gi|163849591|ref|YP_001637634.1| diguanylate cyclase [Methylobacterium extorquens PA1] gi|163661196|gb|ABY28563.1| diguanylate cyclase [Methylobacterium extorquens PA1] Length = 1428 Score = 44.0 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 38/256 (14%), Positives = 71/256 (27%), Gaps = 33/256 (12%) Query: 115 NSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCR 174 + Q T W E+ + + + + A L+ + G + Sbjct: 669 PDDRQRVTTTWREIFASQAAGSFEFRALCRDGAYRWTLTRAVPLKDASGQVQEWVGTDGD 728 Query: 175 TYSEER-PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRL 233 + + + + +A+ T D I LG T + + + Sbjct: 729 IHESRQASEAIRLQEERYRLAML-----ATQDAIWDWDLGADTAEWSDGAYRLFG----- 778 Query: 234 NGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVAR-IEILGQFPQQE 292 + D W + +I E + + I S+ SD R G + + Sbjct: 779 ---YDDAERADTGAWWKSKIHPDDRERVTTSIKHIIESQEHRWSDEYRFARADGSYAEVT 835 Query: 293 VNNF------------------IPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVV 334 F I A R + + L L G +A E KT + Sbjct: 836 DCGFVIRDTEGQALRMVGALRDISEQRRANAALRASEERLRLALQAGRMVAWERDLKTGL 895 Query: 335 VFRRGNIIEHIFDWSA 350 R N ++ + S Sbjct: 896 ATRSDNALQLLGIGSG 911 >gi|219723105|ref|YP_002474527.1| phage terminase, large subunit, pbsx family [Borrelia garinii PBr] gi|219694031|gb|ACL34563.1| phage terminase, large subunit, pbsx family [Borrelia garinii PBr] Length = 450 Score = 44.0 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L + +N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRLRCAQETIIF--DTNPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + + Y+ T + F + Y D + + LG++ + F N Sbjct: 212 IDNVATFNTYKFTTYDNVLLSKEFIKTQEKLY-KDIPAYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITQ--------DYVFTSPIAYLDPAFSIGGDNTALCVMDRVDDK 306 >gi|332759085|gb|EGJ89395.1| gp33 TerL [Shigella flexneri 4343-70] Length = 519 Score = 44.0 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 20/114 (17%), Positives = 44/114 (38%), Gaps = 7/114 (6%) Query: 251 YQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMS-- 307 + R D ++ + +D+ V + L + IP +++ A+ Sbjct: 229 FTFHWRDDPRKDEEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAH 286 Query: 308 REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS--AKLIQETNQE 359 + + D+A EG DK R G ++E++ +WS I ++ ++ Sbjct: 287 IKLGIQPTGKRLGAMDVADEGRDKNSFSTRHGFLLENVREWSGVGSDIYQSVEK 340 >gi|313760829|gb|ADR79391.1| terminase [APSE phage Eptesicus fuscus/P5/IT/USA/2009] Length = 394 Score = 44.0 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 54/169 (31%), Gaps = 24/169 (14%) Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDT 255 + +EA + +++ + W N +G Y F P +K ID Sbjct: 44 WVEEAETVSEKSLDTLIPTIRKPGSE-LWFSF-NPAEEDGAVYRRFVKP---YKAI-ID- 96 Query: 256 RTVEGIDSGFHEGIISRYGL----DSDVARIEILGQFPQQE-----VNNFIPHNYIEEAM 306 G++E G D+ E+ + E + I ++E A Sbjct: 97 ------KQGYYEDDEVYVGKVSYLDNPWLPAELKNDAQKGECDANYEDALIQPEWVEAAT 150 Query: 307 S--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLI 353 + ++ D A G D+ + R G +IE WS + Sbjct: 151 DAHIKLGFKPSGIRVVTFDPADSGQDEKALSKRYGVLIEDCVSWSEGDV 199 >gi|333006277|gb|EGK25786.1| gp33 TerL [Shigella flexneri K-218] Length = 540 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 20/114 (17%), Positives = 44/114 (38%), Gaps = 7/114 (6%) Query: 251 YQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMS-- 307 + R D ++ + +D+ V + L + IP +++ A+ Sbjct: 250 FTFHWRDDPRKDEEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSEWVQAAVDAH 307 Query: 308 REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS--AKLIQETNQE 359 + + D+A EG DK R G ++E++ +WS I ++ ++ Sbjct: 308 IKLGIQPTGKRLGAMDVADEGRDKNSFSTRHGFLLENVREWSGVGSDIYQSVEK 361 >gi|203288918|ref|YP_002223912.1| bsr protein [Borrelia duttonii Ly] gi|201084425|gb|ACH94009.1| bsr protein [Borrelia duttonii Ly] Length = 399 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 49/278 (17%), Positives = 85/278 (30%), Gaps = 45/278 (16%) Query: 69 NVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLIST--------RPGMSIICIANSETQL 120 NN N I I++ GKT L + T R G + + NS+ L Sbjct: 6 EKNNQNKVILSGGIAS----GKTFLA---CYLFLKTLLKNRHLYRKGTNNFILGNSQKAL 58 Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 E++ + ++ + + + Y E+ + + Y ++ Sbjct: 59 ------EINVIEQFEDLANMLKIPFVPKYSNRSYFEIDSLRVNL-----------YGGDK 101 Query: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 F ++ ++ +EA+ K L + P T N +F Sbjct: 102 IRDFKRFRGSNSAVIYVNEATTLHKETLKEALKRL-RIKPEFIVFDT-NPDHPEHYFKTD 159 Query: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPH 299 + + Y T E I F + Y D + + LG++ F Sbjct: 160 YIDKNTIYSTYNFTTYDNEEISKEFIKTQEELY-KDFPTYKASVLLGEWVANNDAIFRNI 218 Query: 300 NYIEEAMSREAIDDLYAPLIMGCDIAG-EGGDKTVVVF 336 N IE D + I D A GGD T + Sbjct: 219 NIIE--------DYDFKSPIAYLDPAYSSGGDNTSLCV 248 >gi|328882738|emb|CCA55977.1| DNA or RNA helicases of superfamily II [Streptomyces venezuelae ATCC 10712] Length = 597 Score = 44.0 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 41/202 (20%), Positives = 57/202 (28%), Gaps = 49/202 (24%) Query: 37 PWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAW 96 PWG GK WQ ME P F A++ G GKTT Sbjct: 20 PWGTAGKL-------RAWQQGAME--------KYIQEQPRDF-LAVATP-GAGKTTFALT 62 Query: 97 MMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAE 156 + WL+ I +A +E + K + R ++ Sbjct: 63 LASWLLHHHVVQQITVVAPTEH---------LKKQWAAAAARIGIKLD------------ 101 Query: 157 LLEQSMGIDSKHYTITCRTYSEERPDTFVGPH----NTHGMAVFNDEA--SGTPDIINKS 210 + S G SK Y TY+ H V DE +G ++ Sbjct: 102 -PDYSAGPLSKEYDGVAITYAGVGVRPM--LHRNRSEQRKTLVILDEIHHAGDSKSWGEA 158 Query: 211 ILGFFTELNPNRFWIMTSNTRR 232 L F R +T R Sbjct: 159 CLEAF--EPATRRLALTGTPFR 178 >gi|254776419|ref|ZP_05217935.1| phage terminase [Mycobacterium avium subsp. avium ATCC 25291] Length = 491 Score = 43.6 bits (101), Expect = 0.044, Method: Composition-based stats. Identities = 46/329 (13%), Positives = 94/329 (28%), Gaps = 62/329 (18%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRP-GMSI 110 WQ+ + + +P AI RG+GKT + A + L+ + P G I Sbjct: 51 RPWQMGMLR--------PFLDPDPRPLVGAIMGPRGLGKTGIFAALGLYELFCGPDGNEI 102 Query: 111 ICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYT 170 +A E L P E+ A + + + K T Sbjct: 103 PIVAVDERMAGRLL----------KPAAQMVELND----ELAARAVVYRDRIEVPGKRST 148 Query: 171 ITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSIL-------GFF-------- 215 +T +R + DE ++L G Sbjct: 149 LTALPAEAKRIEGL-----GTWTLALADELGEIDPDTWSTLLLGAGKLDGAMALGIGTPP 203 Query: 216 ---TELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDT-RTVEGIDSGFHEGIIS 271 T + + +N FY+ + ++ + + +E + + + Sbjct: 204 NRETSVLTDLREACRANPDDRTMAFYEF---SADGFEHHPVSCVHCLELANPQLDDLLSR 260 Query: 272 RYGLDSDVARIEILGQFPQQEVNNFIPHNY---IEEAMSREAIDDLYAPLIMGCDI--AG 326 + + + G++ ++ + + N ++ P+ G D+ A Sbjct: 261 D--RATALLKQTTEGEYRRKRLCQVVTTNESPFVDA--DTWDGLKAPHPVPDGADVVIAL 316 Query: 327 EG---GDKTVVVFRRGNIIEHIFDWSAKL 352 +G D T +V + H A Sbjct: 317 DGSLKDDSTALVVGTVGKVPHFDRLDAWE 345 >gi|247553003|gb|ACS94840.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] Length = 450 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 46/293 (15%), Positives = 89/293 (30%), Gaps = 47/293 (16%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI----STRP- 106 Q E + ++ NN + IF I++ GKT L +++++ + S Sbjct: 46 TAKQKEVLFDIES------NNYSKVIFSGGIAS----GKTFLASYLLVKKLIENKSFYEQ 95 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 + I NS L ++ K S+ + + ++ + I Sbjct: 96 DTNNFIIGNSIGLLMTNTVKQIEKICSL------LGIDYEKKKSGQSFCKIAGLKLNIYG 149 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT-PDIINKSILGFFTELNPNRFWI 225 D F + ++ +EA+ + + + I Sbjct: 150 GK-----------NRDAFSKIRGGNSAIIYVNEATVIHKETLLEVIK--RLRKGKEIIIF 196 Query: 226 MTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL 285 T N +F + + +K Y T + F + Y R +L Sbjct: 197 DT-NPESPAHYFKTDYIENTDVFKTYNFTTYDNPLNSADFIQTQEKLY-RRFPAYRARVL 254 Query: 286 -GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF 336 G++ E F E +++ + IM D A GGD T + Sbjct: 255 YGEWILNESTLF-----NEMIFNQDY---EFKSPIMYIDPAFSVGGDNTAICV 299 >gi|219807285|ref|YP_002477581.1| phage terminase, pbsx family protein [Borrelia burgdorferi 156a] gi|224797061|ref|YP_002642778.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 64b] gi|225571759|ref|YP_002724342.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 118a] gi|219692550|gb|ACL33771.1| phage terminase, pbsx family protein [Borrelia burgdorferi 156a] gi|223929616|gb|ACN24327.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 64b] gi|225547179|gb|ACN93166.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 118a] Length = 450 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 46/293 (15%), Positives = 89/293 (30%), Gaps = 47/293 (16%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI----STRP- 106 Q E + ++ NN + IF I++ GKT L +++++ + S Sbjct: 46 TAKQKEVLFDIES------NNYSKVIFSGGIAS----GKTFLASYLLVKKLIENKSFYEQ 95 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 + I NS L ++ K S+ + + ++ + I Sbjct: 96 DTNNFIIGNSIGLLMTNTVKQIEKICSL------LGIDYEKKKSGQSFCKIAGLKLNIYG 149 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT-PDIINKSILGFFTELNPNRFWI 225 D F + ++ +EA+ + + + I Sbjct: 150 GK-----------NRDAFSKIRGGNSAIIYVNEATVIHKETLLEVIK--RLRKGKEIIIF 196 Query: 226 MTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL 285 T N +F + + +K Y T + F + Y R +L Sbjct: 197 DT-NPESPAHYFKTDYIENTDVFKTYNFTTYDNPLNSADFIQTQEKLY-RRFPAYRARVL 254 Query: 286 -GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF 336 G++ E F E +++ + IM D A GGD T + Sbjct: 255 YGEWILNESTLF-----NEMIFNQDY---EFKSPIMYIDPAFSVGGDNTAICV 299 >gi|9631142|ref|NP_047924.1| gp33 [Streptomyces phage phiC31] gi|3947452|emb|CAA07103.1| gp33 [Streptomyces phage phiC31] Length = 519 Score = 43.6 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 44/315 (13%), Positives = 91/315 (28%), Gaps = 31/315 (9%) Query: 53 RWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPG---MS 109 WQ E + V + R GK+T+ A +ML+ + G Sbjct: 51 PWQRELLIDAYVLTQDTFGRWRRKHRTVVVCVARKNGKSTIAAAIMLYHLIADRGDAQRQ 110 Query: 110 IICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHY 169 II AN Q + + +K + + Y + + + D+ Sbjct: 111 IIAAANDRNQARMVF--DSAKQMVNASPKLAAVCDVQRDVIR--YKDNTYRVVSADAGRQ 166 Query: 170 TITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSN 229 F + A + I + + + Sbjct: 167 QGLNPAAVSLDEYAFSKHSDLFDALTLGSAARN--QPMFLIISTAGPDPDGPFAALCEQG 224 Query: 230 TRRLNGW------FYDIF---------NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYG 274 R +G FY + ++ + W+ + + ++ + R Sbjct: 225 ERVNSGEADDPTLFYRSWGPKLGETVDHLDPDVWRACN-PSYDI--LNPDDFKAAAQRST 281 Query: 275 LDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVV 334 S RI L QF + ++PH + + + + +++G D + +G +V Sbjct: 282 EAS--FRIYRLSQFVRGAS-TWLPHGLWDSLAADDDPLEPGDEVVLGFDGSWKGDSTALV 338 Query: 335 VFR-RGNIIEHIFDW 348 R R + + W Sbjct: 339 ACRIRDLKVFVLGHW 353 >gi|323940932|gb|EGB37119.1| hypothetical protein ERDG_02336 [Escherichia coli E482] Length = 443 Score = 43.6 bits (101), Expect = 0.048, Method: Composition-based stats. Identities = 62/298 (20%), Positives = 94/298 (31%), Gaps = 58/298 (19%) Query: 81 AISAGRGIGKTT-LNAWMMLWLIST--RPGMSI-------ICIAN--SETQLKNTLWAEV 128 A+ GR GKT L++ + + S RPGM I I A ++ + L E+ Sbjct: 27 AVRCGRRWGKTFMLSSAAVTYATSQFRRPGMDIELGGRVGIFTAEYRQYQEIYDKLE-EI 85 Query: 129 SKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPH 188 +LP + F Q L LL+ ID + + G Sbjct: 86 -----LLPLKKSFSRQEKRL--------LLKNGGKIDFW--------VTNDNKLAGRGRE 124 Query: 189 NTHGMAVFNDEASGTPDI-----INK-SILGFFTELNPNRFWIMTSNTRRLNGWFYD-IF 241 + DEA+ T I SI + T + +FY Sbjct: 125 YE---IILIDEAAFTKSPEMLKEIWPKSIKPTLLTTKGRAYVFSTPDGVDEENFFYAICH 181 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIP-HN 300 N L + + T + + E R D V R E L +F + Sbjct: 182 NKDLG-FHEHHAPTSSNPFVPPEELEK--ERQNNDPRVFRQEFLAEFVDWSAASLFDVRK 238 Query: 301 YIEEAMSREA--IDDLYAPLIMGCDIAGEGG---DKTVVVF-----RRGNIIEHIFDW 348 + E + ++ + D A +GG D T VV+ R G I DW Sbjct: 239 WFEGENQDQPVDYPEMCQAVFAVMDTAVKGGTDHDGTAVVYYAVDTRPGIQRLTILDW 296 >gi|330791351|ref|XP_003283757.1| hypothetical protein DICPUDRAFT_147464 [Dictyostelium purpureum] gi|325086380|gb|EGC39771.1| hypothetical protein DICPUDRAFT_147464 [Dictyostelium purpureum] Length = 1580 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 53/174 (30%), Gaps = 19/174 (10%) Query: 67 HSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTR-----PGMSIICIANSETQLK 121 N+ NS + G GKT A ++L + + P I + + + Sbjct: 1079 QENIFNSIIKRRLQLVRGPPGTGKTHFLALIVLIFMESYKRLGKPF-RIAITSFTHNAID 1137 Query: 122 NTLWA------EVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRT 175 N L E S + + F+ Q+ L + +H+ + + Sbjct: 1138 NLLIRIASLKKEYSTSVGQDINFPLFKKQTKLSEDLKLNKIQLFDKKEFEREHFCVGATS 1197 Query: 176 YSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSN 229 +S D + + DEAS I +I + R + N Sbjct: 1198 WSLSNMD------YENFDLLIIDEASQLSSYI-GAIPFSRLNKDTGRVIVCGDN 1244 >gi|195942125|ref|ZP_03087507.1| hypothetical protein Bbur8_04585 [Borrelia burgdorferi 80a] Length = 450 Score = 43.6 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 49/304 (16%), Positives = 96/304 (31%), Gaps = 42/304 (13%) Query: 47 HFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTR- 105 +F + QL ++ +V NN I I++ GKT L ++ L + Sbjct: 36 NFDKFEEKQL-TLKQKNVIKSIKKNNEKKIILSGGIAS----GKTYLACYLFLKSLIANK 90 Query: 106 ----PGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQS 161 + I NS+ ++ + + K + ++ + H + Y + Sbjct: 91 NLYSSDTNNFIIGNSQRSVEVNVLGQFEKLCKL------LKIPYIPRHTNNSYILIDSLR 144 Query: 162 MGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPN 221 + + + F G ++ +F +EA+ + +L Sbjct: 145 INLYGGDKASDF--------ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQE 192 Query: 222 RFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVAR 281 T N +F + + +K Y T + GF E Y D + Sbjct: 193 TIIFDT-NPDHPEHYFKTDYIDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYK 250 Query: 282 IEI-LGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--R 337 + LG++ + F N + D ++ I D A GGD T + R Sbjct: 251 ARVLLGEWIASTDSIFTQINITD--------DYVFTSSIAYLDPAFSVGGDNTALCVMER 302 Query: 338 RGNI 341 + Sbjct: 303 VDDK 306 >gi|308071887|emb|CBW54808.1| putative DNA maturase B [Pantoea phage LIMElight] Length = 614 Score = 43.6 bits (101), Expect = 0.050, Method: Composition-based stats. Identities = 33/230 (14%), Positives = 71/230 (30%), Gaps = 45/230 (19%) Query: 2 PRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEA 61 PR I +D++ E + + E F++F G F Q + + Sbjct: 25 PRTIPSDKRTELAMMLAITFKE----FRDFAY-------VGMRFLGFELTDM-QADIADY 72 Query: 62 VDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQ-- 119 + + ++A RG K+TL A +W + ++ ++ E Q Sbjct: 73 MQYGPRKKM-----------VAAQRGEAKSTLAALYSVWRLIQDQRCRVLILSGGEQQAS 121 Query: 120 -LKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSE 178 + + + W P W + S + + A + + K ++ C + Sbjct: 122 EVATLVIRLIETW----PLLCWLKADSTRGDRTSYTAYDVHCDLKPLDKSPSVACIGVTA 177 Query: 179 ERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTS 228 + G PD I ++ T+ + ++ Sbjct: 178 ----SLQGKRADLL----------IPDDI-ETTKNGMTQTEREKLLTVSK 212 >gi|219723219|ref|YP_002474654.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|219692798|gb|ACL34012.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|312148753|gb|ADQ31404.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi JD1] Length = 450 Score = 43.6 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 52/164 (31%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 D ++ I D A GGD T + R + Sbjct: 271 IT--------ADYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|224984406|ref|YP_002641809.1| phage terminase, large subunit, pbsx family [Borrelia valaisiana VS116] gi|224497005|gb|ACN52640.1| phage terminase, large subunit, pbsx family [Borrelia valaisiana VS116] Length = 450 Score = 43.6 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 44/297 (14%), Positives = 83/297 (27%), Gaps = 55/297 (18%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLIS-----TRP 106 Q E + ++ H K S G GKT L +++++ + Sbjct: 46 TAKQKEVLFDIESH----------DYSKVIFSGGIASGKTFLASYLLIKKLIENKSLYER 95 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 + I NS L ++ K + + + ++ + I Sbjct: 96 DTNNFIIGNSIGLLMTNTIKQIEK------ICGFLGIDYQKKKSGESFCKIAGLELNIYG 149 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIM 226 D F + ++ +EA+ ++ I Sbjct: 150 GR-----------NRDAFSKIRGGNSAIIYVNEATVIHKETLLEVIKRL--RKGKSIIIF 196 Query: 227 TSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL- 285 +N +F + + +K Y T + F E Y + +L Sbjct: 197 DTNPESPAHYFKTDYIENTDVFKTYNFTTYDNPLNSADFIETQEKLY-KHFPAYKARVLY 255 Query: 286 GQFPQQEVNNFIPHNYI--EEAMSREAIDD---LYAPLIMGCDIAGE-GGDKTVVVF 336 G+ +I E A+ E I + + IM D A GGD T + Sbjct: 256 GE-------------WILNESALFNEMIFNQDYEFKSPIMYIDPAFSVGGDNTAICV 299 >gi|82776052|ref|YP_402399.1| putative bacteriophage protein [Shigella dysenteriae Sd197] gi|81240200|gb|ABB60910.1| putative bacteriophage protein [Shigella dysenteriae Sd197] Length = 272 Score = 43.2 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 24/144 (16%), Positives = 50/144 (34%), Gaps = 7/144 (4%) Query: 194 AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDWKRYQ 252 ++ +EA + K + + W + N + + + F P E + Sbjct: 131 VLWLEEAHALTEYQWKILEPTIRKEGSEC-WFIF-NPGLVTDFVWRNFVVDPPEGTLIRK 188 Query: 253 IDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNN-FIPHNYIEEAMS--RE 309 I+ + + I + D D + G P+ + + I ++IE A+ + Sbjct: 189 INYDENPFLSDTMLKVIDAARRRDPDGFKHVYEGV-PESDDDAAIIKLSWIEAAVDAHKT 247 Query: 310 AIDDLYAPLIMGCDIAGEGGDKTV 333 + +G D+A G DK Sbjct: 248 LNFEPSGRKRIGFDVADSGTDKCA 271 >gi|120402158|ref|YP_951987.1| hypothetical protein Mvan_1146 [Mycobacterium vanbaalenii PYR-1] gi|119954976|gb|ABM11981.1| conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1] Length = 551 Score = 43.2 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 44/171 (25%), Gaps = 5/171 (2%) Query: 57 EFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANS 116 E + S + ++ + + I G G GKT L M R G + + S Sbjct: 199 EDAADILTEHQSVILDAIRLLNRVEIRGGAGSGKTFL--AMEQARRLARDGQRVALVCYS 256 Query: 117 ETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTY 176 L + L + W + E L + + Sbjct: 257 HG-LASYLERITAAWNRRQQPAYVGEFHDLGKRWGAPAGPDESLRTEQTVQFWEHDLPAQ 315 Query: 177 SEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMT 227 E A+ DEA D +L + ++ T Sbjct: 316 MTEL--AMQLDDGQRFDAIVVDEAQDFADAWWDPLLAALKDDETGGLYVFT 364 >gi|154489097|ref|ZP_02029946.1| hypothetical protein BIFADO_02409 [Bifidobacterium adolescentis L2-32] gi|154083234|gb|EDN82279.1| hypothetical protein BIFADO_02409 [Bifidobacterium adolescentis L2-32] Length = 1055 Score = 43.2 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 41/235 (17%), Positives = 68/235 (28%), Gaps = 32/235 (13%) Query: 1 MPRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFME 60 + I + + L + FK + P KP+E SQ Q M+ Sbjct: 194 LSEEIESQISESKPLTD-AWLKLYEEDFKKYA----PQRPNRKPIEKTSQSQTIQPNAMQ 248 Query: 61 AVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 + + I + G GKT L+A+ + Q+ Sbjct: 249 V--EALMNLAQLRKQGESRAIIVSATGTGKTYLSAFDV-------------------RQV 287 Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 K +++ + + Q + P S D K+ T +T S R Sbjct: 288 KPNRMLYIAQ-QEQILKKAEESFQKVLGCPKSELGLFSGGSKESDRKYVFATVQTMS--R 344 Query: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNG 235 P+T + DE +S PN MT+ R +G Sbjct: 345 PETLAQFDADEFDYILVDE---VHHAAAESYKRVIDHFQPNFMLGMTATPERTDG 396 >gi|323137496|ref|ZP_08072573.1| hypothetical protein Met49242DRAFT_1961 [Methylocystis sp. ATCC 49242] gi|322397122|gb|EFX99646.1| hypothetical protein Met49242DRAFT_1961 [Methylocystis sp. ATCC 49242] Length = 323 Score = 43.2 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 47/260 (18%), Positives = 88/260 (33%), Gaps = 48/260 (18%) Query: 85 GRGIGKTTLNAWMMLWLISTRPG---------MSIICIANSETQLKNTL------WAEVS 129 GR GK ++ + ++ W + G +C+A + Q + L +AE+ Sbjct: 67 GRRAGKDSIASAIVTWSAAMFDGADRLRPGERALCLCLACDKDQARIVLSYVRAYFAELE 126 Query: 130 KWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHN 189 +M+ L S + + + TI C E Sbjct: 127 PLRAMVTRE-----TKDGLELSNGVDIYVGVNDFRAVRGRTILCAVLDE----------- 170 Query: 190 THGMAVFNDEASGTPDI-INKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN----IP 244 +A + DE S +PD+ + +++ L P I S+ R G + Sbjct: 171 ---IAYWRDENSASPDLELYRALKPGMATL-PEAMLIGISSPYRRAGLLHAKHRQAYGRD 226 Query: 245 LEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEE 304 + +D + ++ D AR E L +F + +++ F+ + IE Sbjct: 227 GDTLVIRAPSAVMNPTLDQAEIDQAMA---EDPAAARAEWLAEF-RDDISGFLGLDLIEA 282 Query: 305 AMSREAIDDL----YAPLIM 320 A+ + YAP IM Sbjct: 283 AVDPTIVTRPPRGCYAPWIM 302 >gi|168820654|ref|ZP_02832654.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|205342611|gb|EDZ29375.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] Length = 539 Score = 43.2 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 55/165 (33%), Gaps = 11/165 (6%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 F DEA+ + I ++ R + + N +N F Sbjct: 195 DRTTLYFVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMNNPFAQ--KRHSGKIS 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDV-ARIEILGQFPQQEVNNFIPHNYIEEAMS- 307 + R+ D ++ + +D+ + E+ + IP +++ A+ Sbjct: 249 VFTFHWRSDPRKDDEWYRKECEK--IDNPIIVAQELDLNYQASAEGILIPSEWVQAAVDA 306 Query: 308 -REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAK 351 + + D+A EG DK R G ++ + +WS K Sbjct: 307 HIKLGIQPSGQRLGAMDVADEGRDKNACSLRYGFLLSDVQEWSGK 351 >gi|224022826|ref|YP_002606317.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 64b] gi|223929278|gb|ACN23995.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 64b] Length = 450 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 52/164 (31%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 D ++ I D A GGD T + R + Sbjct: 271 ITN--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|224020463|ref|YP_002601168.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 64b] gi|223929158|gb|ACN23879.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 64b] Length = 450 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 52/164 (31%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 D ++ I D A GGD T + R + Sbjct: 271 ITN--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|219869985|ref|YP_002474251.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] gi|219692877|gb|ACL34089.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi 156a] Length = 450 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 52/164 (31%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 D ++ I D A GGD T + R + Sbjct: 271 ITN--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|195942413|ref|ZP_03087795.1| hypothetical protein Bbur8_06149 [Borrelia burgdorferi 80a] gi|312201120|gb|ADQ44434.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 297] gi|312201339|gb|ADQ44646.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 297] Length = 450 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 52/164 (31%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 D ++ I D A GGD T + R + Sbjct: 271 ITN--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|219499179|ref|YP_002455350.1| putative phage terminase, pbsx family protein [Borrelia burgdorferi ZS7] gi|218164189|gb|ACK74256.1| putative phage terminase, pbsx family protein [Borrelia burgdorferi ZS7] Length = 450 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 46/293 (15%), Positives = 89/293 (30%), Gaps = 47/293 (16%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI----STRP- 106 Q E + ++ NN + IF I++ GKT L +++++ + S Sbjct: 46 TAKQKEVLFDIES------NNYSKVIFSGGIAS----GKTFLASYLLVKKLIENKSFYEQ 95 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 + I NS L ++ K S+ + + ++ + I Sbjct: 96 DTNNFIIGNSIGLLMTNTVKQIEKICSL------LGIDYEKKKSGQSFCKIAGLKLNIYG 149 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT-PDIINKSILGFFTELNPNRFWI 225 D F + ++ +EA+ + + + I Sbjct: 150 GK-----------NRDAFSKIRGGNSAIIYVNEATVIHRETLLEVIK--RLRKGKEIIIF 196 Query: 226 MTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL 285 T N +F + + +K Y T + F + Y R +L Sbjct: 197 DT-NPESPAHYFKTDYIENTDVFKTYNFTTYDNPLNSADFIQAQEKLY-RRFPAYRARVL 254 Query: 286 -GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF 336 G++ E F E +++ + IM D A GGD T + Sbjct: 255 YGEWILNESTLF-----NEMIFNQDY---EFKSPIMYIDPAFSVGGDNTAICV 299 >gi|11497404|ref|NP_051512.1| hypothetical protein BB_Q50 [Borrelia burgdorferi B31] gi|6382425|gb|AAF07735.1|AE001584_32 conserved hypothetical protein [Borrelia burgdorferi B31] Length = 450 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 52/164 (31%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 D ++ I D A GGD T + R + Sbjct: 271 ITN--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|11497152|ref|NP_051291.1| hypothetical protein BB_R45 [Borrelia burgdorferi B31] gi|6382173|gb|AAF07489.1|AE001577_3 conserved hypothetical protein [Borrelia burgdorferi B31] Length = 450 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 52/164 (31%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 D ++ I D A GGD T + R + Sbjct: 271 ITN--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|11497247|ref|NP_051377.1| hypothetical protein BB_O44 [Borrelia burgdorferi B31] gi|6382268|gb|AAF07582.1|AE001579_11 conserved hypothetical protein [Borrelia burgdorferi B31] Length = 450 Score = 43.2 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 52/164 (31%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 D ++ I D A GGD T + R + Sbjct: 271 ITN--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|323699495|ref|ZP_08111407.1| protein of unknown function DUF264 [Desulfovibrio sp. ND132] gi|323459427|gb|EGB15292.1| protein of unknown function DUF264 [Desulfovibrio desulfuricans ND132] Length = 428 Score = 43.2 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 42/144 (29%), Gaps = 22/144 (15%) Query: 105 RPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGI 164 R IA Q K +W E+ ++ L E Sbjct: 50 RDDWRGAYIAPLYRQAKTVVWDELKRYC------------GFGLDGCTVKFNETELRADF 97 Query: 165 DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIIN-KSILGFFTELNPNRF 223 D+ R + PD+ G + V DE + P + + I ++ + Sbjct: 98 DNGSRI---RLFGANNPDSLRGMYLDG---VVFDEVAQMPLRVWTEVIRPALSD---RKG 148 Query: 224 WIMTSNTRRLNGWFYDIFNIPLED 247 W M T R Y+I+ D Sbjct: 149 WAMFIGTPRGKNALYEIWEKGKTD 172 >gi|247553170|gb|ACS94899.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 297] Length = 450 Score = 42.8 bits (99), Expect = 0.075, Method: Composition-based stats. Identities = 46/293 (15%), Positives = 89/293 (30%), Gaps = 47/293 (16%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI----STRP- 106 Q E + ++ NN + IF I++ GKT L +++++ + S Sbjct: 46 TAKQKEVLFDIES------NNYSKVIFSGGIAS----GKTFLASYLLVKKLIENKSFYEQ 95 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 + I NS L ++ K S+ + + ++ + I Sbjct: 96 DTNNFIIGNSIGLLMTNTVKQIEKICSL------LGIDYEKKKSGQSFCKIAGLKLNIYG 149 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT-PDIINKSILGFFTELNPNRFWI 225 D F + ++ +EA+ + + + I Sbjct: 150 GK-----------NRDAFSKIRGGNSAIIYVNEATVIHRETLLEVIK--RLRKGKEIIIF 196 Query: 226 MTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL 285 T N +F + + +K Y T + F + Y R +L Sbjct: 197 DT-NPESPAHYFKTDYIENTDVFKTYNFTTYDNPLNSADFIQTQEKLY-RRFPAYRARVL 254 Query: 286 -GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF 336 G++ E F E +++ + IM D A GGD T + Sbjct: 255 YGEWILNESTLF-----NEMIFNQDY---EFKSPIMYIDPAFSVGGDNTAICV 299 >gi|11496928|ref|NP_045704.1| hypothetical protein BBA31 [Borrelia burgdorferi B31] gi|195942693|ref|ZP_03088075.1| hypothetical protein Bbur8_07694 [Borrelia burgdorferi 80a] gi|224796822|ref|YP_002642504.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 72a] gi|224796893|ref|YP_002642572.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi CA-11.2a] gi|225573840|ref|YP_002724449.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 94a] gi|226234883|ref|YP_002775758.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi Bol26] gi|2690260|gb|AAC66261.1| conserved hypothetical protein [Borrelia burgdorferi B31] gi|221237191|gb|ACM10059.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 72a] gi|224554443|gb|ACN55827.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi CA-11.2a] gi|225546432|gb|ACN92439.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 94a] gi|226202357|gb|ACO38016.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi Bol26] gi|247552767|gb|ACS94776.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi N40] Length = 450 Score = 42.8 bits (99), Expect = 0.075, Method: Composition-based stats. Identities = 46/293 (15%), Positives = 89/293 (30%), Gaps = 47/293 (16%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI----STRP- 106 Q E + ++ NN + IF I++ GKT L +++++ + S Sbjct: 46 TAKQKEVLFDIES------NNYSKVIFSGGIAS----GKTFLASYLLVKKLIENKSFYEQ 95 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 + I NS L ++ K S+ + + ++ + I Sbjct: 96 DTNNFIIGNSIGLLMTNTVKQIEKICSL------LGIDYEKKKSGQSFCKIAGLKLNIYG 149 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT-PDIINKSILGFFTELNPNRFWI 225 D F + ++ +EA+ + + + I Sbjct: 150 GK-----------NRDAFSKIRGGNSAIIYVNEATVIHRETLLEVIK--RLRKGKEIIIF 196 Query: 226 MTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL 285 T N +F + + +K Y T + F + Y R +L Sbjct: 197 DT-NPESPAHYFKTDYIENTDVFKTYNFTTYDNPLNSADFIQTQEKLY-RRFPAYRARVL 254 Query: 286 -GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF 336 G++ E F E +++ + IM D A GGD T + Sbjct: 255 YGEWILNESTLF-----NEMIFNQDY---EFKSPIMYIDPAFSVGGDNTAICV 299 >gi|56560912|ref|YP_161331.1| hypothetical protein BGP046 [Borrelia garinii PBi] gi|52696553|gb|AAU85896.1| hypothetical protein BGP046 [Borrelia garinii PBi] Length = 450 Score = 42.8 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSTLIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y + + + LG++ + F N Sbjct: 212 IDNVATFKTYNFTTYDNVLLSKGFIETQEKLY-KEIPTYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITQ--------DYVFTSPIAYLDPAFSIGGDNTALCVMDRVDDK 306 >gi|295836865|ref|ZP_06823798.1| DNA or RNA helicase, superfamily II [Streptomyces sp. SPB74] gi|197699526|gb|EDY46459.1| DNA or RNA helicase, superfamily II [Streptomyces sp. SPB74] Length = 596 Score = 42.8 bits (99), Expect = 0.078, Method: Composition-based stats. Identities = 41/202 (20%), Positives = 57/202 (28%), Gaps = 49/202 (24%) Query: 37 PWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAW 96 PWG GK WQ ME P F A++ G GKTT Sbjct: 20 PWGTAGKL-------RAWQQGAME--------RYVQEQPRDF-LAVATP-GAGKTTFALT 62 Query: 97 MMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAE 156 + WL+ I +A +E + K + R ++ Sbjct: 63 LASWLLHHHVVQQITVVAPTEH---------LKKQWAEAAARIGIKLD------------ 101 Query: 157 LLEQSMGIDSKHYTITCRTYSEERPDTFVGPH----NTHGMAVFNDEA--SGTPDIINKS 210 + S G SK Y TY+ H V DE +G ++ Sbjct: 102 -PDYSAGPVSKEYVGVAVTYAGVGVRPM--LHRNRVEQRKTLVILDEIHHAGDSKSWGEA 158 Query: 211 ILGFFTELNPNRFWIMTSNTRR 232 L F R +T R Sbjct: 159 CLEAF--EPATRRLALTGTPFR 178 >gi|224591529|ref|YP_002640858.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi WI91-23] gi|224554111|gb|ACN55505.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi WI91-23] Length = 450 Score = 42.8 bits (99), Expect = 0.083, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 52/164 (31%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ VF +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALVFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 D ++ I D A GGD T + R + Sbjct: 271 ITN--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|318064508|gb|ADV36483.1| phage terminase large subunit [Edwardsiella phage eiDWF] gi|318064606|gb|ADV36532.1| phage terminase large subunit [Edwardsiella phage eiMSLS] Length = 460 Score = 42.8 bits (99), Expect = 0.084, Method: Composition-based stats. Identities = 45/253 (17%), Positives = 88/253 (34%), Gaps = 36/253 (14%) Query: 43 KPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI 102 P++ + W+++ + H +N++ I +G G GKT A + L Sbjct: 26 APVKKERKSRTWRIKTLP----HQRGLINDTTTKILGLC--SGFGGGKTWSAARKAVQLA 79 Query: 103 STRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSM 162 PG I + L ++ E+ K L+ + F Q H Sbjct: 80 ILNPGCDGIITEPTIPLLVKIMYPELEKALNEAGIKWKFNKQDKIYHC------------ 127 Query: 163 GIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE-ASGTPDIINKS---ILGFFTEL 218 I + I C S E +G + DE + PDI ++ +LG Sbjct: 128 RIAGQMTRIICD--SMENYTRLIGVNAA---WCVCDEFDTTKPDIAMEAYRKLLGRLRTG 182 Query: 219 NPNRFWIMTSNTRRLNGW--FYDIF-NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGL 275 N + I++ G+ Y IF + + + + T + + + + ++Y Sbjct: 183 NVRQMVIVS----TPEGFRAMYQIFISEADDQKRLIKARTTDNHYLPQDYIDTLRAQYPP 238 Query: 276 DSDVARIEILGQF 288 + + + G+F Sbjct: 239 E--LIEAYLNGEF 249 >gi|318064394|gb|ADV36428.1| phage terminase large subunit [Edwardsiella phage eiAU] Length = 460 Score = 42.8 bits (99), Expect = 0.084, Method: Composition-based stats. Identities = 45/253 (17%), Positives = 88/253 (34%), Gaps = 36/253 (14%) Query: 43 KPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI 102 P++ + W+++ + H +N++ I +G G GKT A + L Sbjct: 26 APVKKERKSRTWRIKTLP----HQRGLINDTTTKILGLC--SGFGGGKTWSAARKAVQLA 79 Query: 103 STRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSM 162 PG I + L ++ E+ K L+ + F Q H Sbjct: 80 ILNPGCDGIITEPTIPLLVKIMYPELEKALNEAGIKWKFNKQDKIYHC------------ 127 Query: 163 GIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE-ASGTPDIINKS---ILGFFTEL 218 I + I C S E +G + DE + PDI ++ +LG Sbjct: 128 RIAGQMTRIICD--SMENYTRLIGVNAA---WCVCDEFDTTKPDIAMEAYRKLLGRLRTG 182 Query: 219 NPNRFWIMTSNTRRLNGW--FYDIF-NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGL 275 N + I++ G+ Y IF + + + + T + + + + ++Y Sbjct: 183 NVRQMVIVS----TPEGFRAMYQIFISEADDQKRLIKARTTDNHYLPQDYIDTLRAQYPP 238 Query: 276 DSDVARIEILGQF 288 + + + G+F Sbjct: 239 E--LIEAYLNGEF 249 >gi|168260952|ref|ZP_02682925.1| phage terminase, large subunit, pbsx family [Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066] gi|205349913|gb|EDZ36544.1| phage terminase, large subunit, pbsx family [Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066] Length = 471 Score = 42.8 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 77/275 (28%), Gaps = 21/275 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + + G + +EA Sbjct: 68 TIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGID-----ICWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + W+ + L+ + P +D ++ Sbjct: 123 VTKESWDILIPTI-RKTFSEIWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + + + R LG+ I ++E A ++ ++ Sbjct: 182 EVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVS 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 D + G D R G++++ I + I E Sbjct: 242 AHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINE 276 >gi|317152051|ref|YP_004120099.1| hypothetical protein Daes_0328 [Desulfovibrio aespoeensis Aspo-2] gi|316942302|gb|ADU61353.1| protein of unknown function DUF264 [Desulfovibrio aespoeensis Aspo-2] Length = 428 Score = 42.8 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 38/260 (14%), Positives = 64/260 (24%), Gaps = 43/260 (16%) Query: 103 STRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSM 162 ++R IA Q K +W E+ ++ + + L Sbjct: 48 TSRTDWRGAYIAPLYKQAKTVVWDELKRYCGLGLDGCTVKFNETEL-------------- 93 Query: 163 GIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIIN-KSILGFFTELNPN 221 R + E PD+ G + DE + P + + I ++ Sbjct: 94 -RADFDNGARIRLFGAENPDSLRGMYLDGA---VFDEVAQMPHRVWTEVIRPALSDRMGW 149 Query: 222 RFWIMTSNTRRLNGWFYDIFNIPLE--DWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDV 279 +I R N Y ++ DW I+ G + Sbjct: 150 AMFI--GTPRGKNA-LYRLWQDARRDPDWFAAMYRASQTGIIEPGELAAAAREMSPE--E 204 Query: 280 ARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAP-------LIMGCDIAGEGGDKT 332 E F + E + P +G D T Sbjct: 205 YEQEFECSFTAAIRGAYFSALIGEADKGGRITKVPHDPSLPVHTAWDLGM------SDST 258 Query: 333 VVVF---RRGNIIEHIFDWS 349 + F R GN I D+ Sbjct: 259 AIWFVQARPGNA-FAIVDYY 277 >gi|317153313|ref|YP_004121361.1| hypothetical protein Daes_1602 [Desulfovibrio aespoeensis Aspo-2] gi|316943564|gb|ADU62615.1| hypothetical protein Daes_1602 [Desulfovibrio aespoeensis Aspo-2] Length = 507 Score = 42.8 bits (99), Expect = 0.097, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 62/205 (30%), Gaps = 15/205 (7%) Query: 85 GRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQ 144 GR +GK+ + + L T G + A + L + + E+ L P Sbjct: 55 GRDVGKSIVLSTDALHYAFTTRGGQGLIAAPHQGHLDSIIE-EIEYQLDTNPDLMNSIAV 113 Query: 145 SLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTP 204 + P S Y Y D+F H V+ DE + Sbjct: 114 TKYGKPKIHRKPYFRLEFTNGSVLYFRPAGAYG----DSFRSLHVGR---VWVDEGAWLT 166 Query: 205 DIINKSILGFFTELNPNRFWIMTSNTRRL-NGWFYDIFNIPLEDWKRYQIDTRTVEGIDS 263 + K++ + S L + +Y + + + ++ + Sbjct: 167 ERAWKALRQCLKTGG---ILRIYSTPNGLRDTTYYRL--TSSDQFHVFRWPSWLNPLWTE 221 Query: 264 GFHEGIISRYG-LDSDVARIEILGQ 287 ++ YG DS + E+ G+ Sbjct: 222 DRESELLEFYGGRDSSGWQHEVAGE 246 >gi|238765385|ref|ZP_04626308.1| Gp33 TerL [Yersinia kristensenii ATCC 33638] gi|238696377|gb|EEP89171.1| Gp33 TerL [Yersinia kristensenii ATCC 33638] Length = 501 Score = 42.4 bits (98), Expect = 0.098, Method: Composition-based stats. Identities = 19/104 (18%), Positives = 40/104 (38%), Gaps = 5/104 (4%) Query: 251 YQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMSRE 309 + R+ D ++ + +D+ V + L + IP +++ A+ Sbjct: 213 FTFHWRSDPRKDDEWYRKECEK--IDNPVVVAQELDLNYQASAEGILIPSEWVQAAIDAH 270 Query: 310 AIDD--LYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAK 351 D + D+A EG DK R G +++ + +WS + Sbjct: 271 IHLDIQPSGARLGAMDVADEGRDKNGFAIRYGFLLQDVKEWSGE 314 >gi|51557524|ref|YP_068358.1| DNA packaging terminase subunit 1 [Suid herpesvirus 1] gi|40253983|tpg|DAA02178.1| TPA_exp: UL15 protein [Suid herpesvirus 1] Length = 735 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 50/152 (32%), Gaps = 22/152 (14%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLW---AEVSKWLSMLPHRHWFEMQS 145 GKT ++ ++T G+ + A+ + A + +W H Sbjct: 277 GKTWFLVPLIALALATFRGIRVGYTAHIRKATEPVFEEIHARLRRWCRDARVDHVKGENI 336 Query: 146 LSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPD 205 P G + ++ S + G +F DEA+ Sbjct: 337 TVTFPDGARSTIVF----------------ASSHNTNGIRGQDFN---LLFVDEANFIRP 377 Query: 206 IINKSILGFFTELNPNRFWIMTSNTRRLNGWF 237 ++ILGF + + ++ ++NT + + F Sbjct: 378 DAVQTILGFMNQASCKIIFVSSTNTGKASTSF 409 >gi|28395422|gb|AAO38880.1| UL15 [Suid herpesvirus 1] Length = 753 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 50/152 (32%), Gaps = 22/152 (14%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLW---AEVSKWLSMLPHRHWFEMQS 145 GKT ++ ++T G+ + A+ + A + +W H Sbjct: 293 GKTWFLVPLIALALATFRGIRVGYTAHIRKATEPVFEEIHARLRRWCRDARVDHVKGENI 352 Query: 146 LSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPD 205 P G + ++ S + G +F DEA+ Sbjct: 353 TVTFPDGARSTIVF----------------ASSHNTNGIRGQDFN---LLFVDEANFIRP 393 Query: 206 IINKSILGFFTELNPNRFWIMTSNTRRLNGWF 237 ++ILGF + + ++ ++NT + + F Sbjct: 394 DAVQTILGFMNQASCKIIFVSSTNTGKASTSF 425 >gi|238581544|ref|XP_002389644.1| hypothetical protein MPER_11197 [Moniliophthora perniciosa FA553] gi|215452133|gb|EEB90574.1| hypothetical protein MPER_11197 [Moniliophthora perniciosa FA553] Length = 633 Score = 42.4 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 54/159 (33%), Gaps = 18/159 (11%) Query: 87 GIGKTTLNAWMMLWLISTRPGMSIICIANS-----------ETQLKNTLWAEVSKWLSML 135 G GKT +L L+S P I+ A S + ++ L+ + Sbjct: 480 GTGKTVTAVEAILQLLSANPNARILACAPSNSAADLIAMRLRSLGESGLFRAYAPSRDRE 539 Query: 136 PHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAV 195 H + + +G ++ L M K + T +G H + Sbjct: 540 QVPHEL-LPFTYQNATGHFSVPLLSRM----KRFRAVVTTCVSANIIAGIGIPRGHYTHI 594 Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLN 234 F DEA + + ++ T + N +++ + ++L Sbjct: 595 FVDEAGQATEP--EVMIAIKTMADMNTNVVLSGDPKQLG 631 >gi|323139470|ref|ZP_08074518.1| hypothetical protein Met49242DRAFT_3906 [Methylocystis sp. ATCC 49242] gi|322395272|gb|EFX97825.1| hypothetical protein Met49242DRAFT_3906 [Methylocystis sp. ATCC 49242] Length = 439 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 47/259 (18%), Positives = 85/259 (32%), Gaps = 37/259 (14%) Query: 82 ISAGRGIGKTTLNA-WMMLWLI-----STRPGMSIICIANSETQLKNTLWAEVSKWLSML 135 I GRG GKT A W+ + TRP I I + ++ + VS L++ Sbjct: 54 ILGGRGAGKTRAGAEWVKGLALGRPHFCTRPVSRIALIGETAADVREVMIEGVSGLLAIH 113 Query: 136 PHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVG--PHNTHGM 193 R +S + + +S E P++ G H Sbjct: 114 GKRDRPRWESSR---------------RRLVWDSGVVAQAFSAEDPESLRGPQFHAA--- 155 Query: 194 AVFNDEASG--TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRY 251 + DE + + + R + T+ R D+ P R Sbjct: 156 --WCDELAKWRYARETWDMLQFGLRLGDWPRQLVTTTP--RPTPLLKDLIAHPATVLTRA 211 Query: 252 QIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAI 311 + + F E ++++Y + + R E+ G+ ++ + + IE SR A Sbjct: 212 -LTRENAANLAPSFLESVVAQY-AGTRLGRQELDGEIVEERKDALWTRDLIEA--SRVAD 267 Query: 312 DDLYAPLIMGCD-IAGEGG 329 A +++ D A G Sbjct: 268 APRLARIVVAVDPPASFGK 286 >gi|298708865|emb|CBJ30823.1| conserved unknown protein [Ectocarpus siliculosus] Length = 1899 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 34/181 (18%), Positives = 68/181 (37%), Gaps = 22/181 (12%) Query: 71 NNSNPTIFKCAISAGRG-----------IGKTTLNAWMMLWLISTRPGMSIICIANSETQ 119 N+S T + ++ G GKT +L L+ RP I+ + S+T Sbjct: 755 NDSQRTAVRDIVTGAHGQVPYIIFGPPGTGKTCTVIESILQLVKLRPECRILAVGPSDTS 814 Query: 120 LKNTLWAEVSKWLSM--LPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCR--- 174 + + +S+ +S L +W++ + +HP+ + + G+ TIT + Sbjct: 815 -ADVICERLSRHMSRDQLVRINWWQRLTAGVHPNILSYCPQDSNRGMFVPPSTITHQVVV 873 Query: 175 -TYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRL 233 T + +G + +F DE+S ++ L I+ + R+L Sbjct: 874 CTCGTAGMLSVLGVDENYFTHIFVDESSN----AMETELLVPLSYAGRAQIILCGDPRQL 929 Query: 234 N 234 Sbjct: 930 G 930 >gi|329888629|ref|ZP_08267227.1| phage DNA packaging protein [Brevundimonas diminuta ATCC 11568] gi|328847185|gb|EGF96747.1| phage DNA packaging protein [Brevundimonas diminuta ATCC 11568] Length = 411 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 35/259 (13%), Positives = 73/259 (28%), Gaps = 19/259 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 A G G+ + +W ++ +I + L+ E+ L+ + Sbjct: 23 RAAHG-GRGSAKSWSVV-------DAAIFHTVTTPR-LRVVFLREIMANLTESSLELVRK 73 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 ++ E+ G+ + + +P+ +EA Sbjct: 74 RLEHFGLLGSYFREVNGTFQGLGGQKIMFIG-LWKGGKPEGIKSLEGAG--LTILEEAQE 130 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNT---RRLNGWFYDIFNIPLEDWKRYQIDTRTVE 259 +L + W + N F+ P +I+ Sbjct: 131 VRQASLDVLLPTILRTAISELWAIW-NPRLDTDPIDVFFRGPVKPKGA-IVRKINYDQNP 188 Query: 260 GIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAP 317 E + + D A LG + ++EA R A + + Sbjct: 189 HFPDALRELMELDFSKDKLRAAWIWLGGYMPSVQGAIWNREGLDEAWREGRHAPEGSWGR 248 Query: 318 LIMGCDIAGEGGDKTVVVF 336 +++G D +G G D +VV Sbjct: 249 VVVGVDPSGGGDDVGIVVA 267 >gi|302521533|ref|ZP_07273875.1| involving differentiation [Streptomyces sp. SPB78] gi|333024829|ref|ZP_08452893.1| putative differentiation protein [Streptomyces sp. Tu6071] gi|302430428|gb|EFL02244.1| involving differentiation [Streptomyces sp. SPB78] gi|332744681|gb|EGJ75122.1| putative differentiation protein [Streptomyces sp. Tu6071] Length = 596 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 41/202 (20%), Positives = 57/202 (28%), Gaps = 49/202 (24%) Query: 37 PWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAW 96 PWG GK WQ ME P F A++ G GKTT Sbjct: 20 PWGTAGKL-------RAWQEGAME--------RYVQEQPRDF-LAVATP-GAGKTTFALT 62 Query: 97 MMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAE 156 + WL+ I +A +E + K + R ++ Sbjct: 63 LASWLLHHHVVQQITVVAPTEH---------LKKQWAEAAARIGIKLD------------ 101 Query: 157 LLEQSMGIDSKHYTITCRTYSEERPDTFVGPH----NTHGMAVFNDEA--SGTPDIINKS 210 + S G SK Y TY+ H V DE +G ++ Sbjct: 102 -PDYSAGPVSKEYVGVAVTYAGVGVRPM--LHRNRVEQRKTLVILDEIHHAGDSKSWGEA 158 Query: 211 ILGFFTELNPNRFWIMTSNTRR 232 L F R +T R Sbjct: 159 CLEAF--EPATRRLALTGTPFR 178 >gi|3318666|gb|AAC26153.1| BBA31 homolog [Borrelia burgdorferi 297] Length = 450 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + + G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERYRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|227500282|ref|ZP_03930349.1| terminase [Anaerococcus tetradius ATCC 35098] gi|227217568|gb|EEI82880.1| terminase [Anaerococcus tetradius ATCC 35098] Length = 466 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 40/302 (13%), Positives = 91/302 (30%), Gaps = 47/302 (15%) Query: 50 QPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMS 109 P+ WQ + ++ + + + + GKT + LW + G + Sbjct: 35 SPYPWQEKLIKDIFAVNDDGLWTHSKFGYAVPRRN----GKTEIVYMAELWFLM--DGKN 88 Query: 110 IICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHY 169 II A+ + + + ++ K+L + + +S+ ++ + + Sbjct: 89 IIHTAHRISTSHS-SFKKLKKYLEKMGLVDKVDFKSIKAK--------GQEMIELIKTGG 139 Query: 170 TITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSN 229 I RT +E G + V DEA + ++ T+ + + + Sbjct: 140 VIQFRTRTETG-----GLGEGFDLLVI-DEAQEYTEGQESALKYTVTDSDNPMILMCGTP 193 Query: 230 TRRLNG--WFYD------IFNIPLEDWKRYQIDTRTVE-------GIDSG-----FHEGI 269 ++G F W + + T + + Sbjct: 194 PTLVSGGTVFSKYRDLILSGGKNHNGWAEWSVSEMTNPYDIDAWYKTNPSMGYKLRERAV 253 Query: 270 ISRYGLDSDVARIEILGQFPQQEVNNFIP-HNYIEEAMSREAIDDLYAPLIMGCDIAGEG 328 G D I+ LG + + + I ++ + + + L L +G G Sbjct: 254 EEEIGPDETDFNIQRLGYWVKYNQKSVISKLDW--DRLKLTRLPSLVGKLHVGI---KYG 308 Query: 329 GD 330 D Sbjct: 309 ND 310 >gi|323186590|gb|EFZ71927.1| gp33 TerL protein [Escherichia coli 1357] Length = 503 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 31/224 (13%), Positives = 67/224 (29%), Gaps = 21/224 (9%) Query: 136 PHRHWFEMQSLSLHP-----SGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNT 190 P +++ + W + M ++ + + + Sbjct: 141 PKALFWKARKFVETLPVEFRGSWSEKKHAPYMRVEFPETGAVIKGEAGDNIGR-----GD 195 Query: 191 HGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKR 250 F DEA+ + I ++ R + + N +N F Sbjct: 196 RTTLYFVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMNNPFAQ--KRHSGKIPV 249 Query: 251 YQIDTRTVEGIDSGFHEGIISRYGLDSDV-ARIEILGQFPQQEVNNFIPHNYIEEAMSRE 309 + R+ D ++ + +D+ V E+ + IP +++ A+ Sbjct: 250 FTFHWRSDPRKDDEWYHKECEK--IDNPVIVAQELDLNYQASAEGILIPSEWVQAAVDAH 307 Query: 310 AIDD--LYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAK 351 + D+A EG DK R G ++ + +WS K Sbjct: 308 IRLGIQPGGQRLGAMDVADEGRDKNACSLRYGILLNDVQEWSGK 351 >gi|226246423|ref|YP_002775825.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 29805] gi|226201818|gb|ACO38403.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 29805] Length = 450 Score = 42.4 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 46/293 (15%), Positives = 89/293 (30%), Gaps = 47/293 (16%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI----STRP- 106 Q E + ++ NN + IF I++ GKT L +++++ + S Sbjct: 46 TAKQKEVLFDIES------NNYSKVIFSGGIAS----GKTFLASYLLVKKLIENKSFYEQ 95 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 + I NS L ++ K S+ + + ++ + I Sbjct: 96 DTNNFIIGNSIGLLMTNTVKQIEKICSL------LGIDYEKKKSGQSFCKIAGLKLNIYG 149 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT-PDIINKSILGFFTELNPNRFWI 225 D F + ++ +EA+ + + + I Sbjct: 150 GK-----------NRDAFSKIRGGNSAIIYVNEATVIHRETLLEVIK--RLRKGKEIIIF 196 Query: 226 MTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL 285 T N +F + + +K Y T + F + Y R +L Sbjct: 197 DT-NPESPAHYFKTDYIENTDVFKTYNFTTYDNPLNSTDFIQTQEKLY-RRFPAYRARVL 254 Query: 286 -GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF 336 G++ E F E +++ + IM D A GGD T + Sbjct: 255 YGEWILNESTLF-----NEMIFNQDY---EFKSPIMYIDPAFSVGGDNTAICV 299 >gi|224796679|ref|YP_002641707.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi WI91-23] gi|224553883|gb|ACN55283.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi WI91-23] Length = 450 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 45/293 (15%), Positives = 89/293 (30%), Gaps = 47/293 (16%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI----STRP- 106 Q E + ++ NN + IF I++ GKT L +++++ + S Sbjct: 46 TAKQKEVLFDIES------NNYSKVIFSGGIAS----GKTFLASYLLVKKLIENKSFYEQ 95 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 + I NS L ++ K S+ + + ++ + I Sbjct: 96 DTNNFIIGNSIGLLMTNTVKQIEKICSL------LGIDYEKKKSGQSFCKIAGLKLNIYG 149 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT-PDIINKSILGFFTELNPNRFWI 225 D F + ++ +EA+ + + + I Sbjct: 150 GK-----------NRDAFSKIRGGNSAIIYVNEATVIHRETLLEVIK--RLRKGKEIIIF 196 Query: 226 MTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL 285 T N +F + + +K Y T + F + Y R +L Sbjct: 197 DT-NPESPAHYFKTDYIENTDVFKTYNFTTYDNPLNSADFIQTQEKLY-RRFPAYRARVL 254 Query: 286 -GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF 336 G++ E + E +++ + IM D A GGD T + Sbjct: 255 YGEWILNE-----SMLFNEMIFNQDY---EFKSPIMYIDPAFSVGGDNTAICV 299 >gi|218780689|ref|YP_002432007.1| exodeoxyribonuclease V, alpha subunit [Desulfatibacillum alkenivorans AK-01] gi|218762073|gb|ACL04539.1| exodeoxyribonuclease V, alpha subunit [Desulfatibacillum alkenivorans AK-01] Length = 589 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 26/145 (17%), Positives = 45/145 (31%), Gaps = 20/145 (13%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPG--MSIICIANSETQLKNTLWAEVSKWLSMLPHRH 139 IS G G GKTT+ A ++ L+S G SI A + L + + K LS L Sbjct: 160 ISGGPGTGKTTIAARIIRLLLSLADGRAPSIAITAPTGKAAARLLES-LGKELSRLGVPP 218 Query: 140 WFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 + + + + + ++ + P + DE Sbjct: 219 G---------MEDAIPKRAKTIHRLMGARFNSSQFIHNADNPINAD--------ILIVDE 261 Query: 200 ASGTPDIINKSILGFFTELNPNRFW 224 AS + +L + Sbjct: 262 ASMVELSLMARLLEALPDHGKLILL 286 >gi|188494674|ref|ZP_03001944.1| gp33 TerL [Escherichia coli 53638] gi|188489873|gb|EDU64976.1| gp33 TerL [Escherichia coli 53638] Length = 539 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 53/165 (32%), Gaps = 11/165 (6%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 DEA+ + I ++ R + + N +N F Sbjct: 195 DRTTLYLVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMNNPFAQ--KRHSGKIP 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDV-ARIEILGQFPQQEVNNFIPHNYIEEAMSR 308 + R+ D ++ + +D+ V E+ + IP +++ A+ Sbjct: 249 VFTFHWRSDPRKDDEWYHKECEK--IDNPVIVAQELDLNYQASAEGILIPSEWVQAAVDA 306 Query: 309 EAIDD--LYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAK 351 + D+A EG DK R G ++ + +WS K Sbjct: 307 HIRLGIQPGGQRLGAMDVADEGRDKNACSLRYGILLNDVQEWSGK 351 >gi|320010525|gb|ADW05375.1| type III restriction protein res subunit [Streptomyces flavogriseus ATCC 33331] Length = 593 Score = 42.4 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 56/202 (27%), Gaps = 49/202 (24%) Query: 37 PWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAW 96 PWG GK WQ ME P F A++ G GKTT Sbjct: 18 PWGTAGKL-------RAWQQGAME--------KYIQEQPRDF-LAVATP-GAGKTTFALT 60 Query: 97 MMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAE 156 + WL+ I +A +E + K + R ++ Sbjct: 61 LASWLLHHHVVQQITVVAPTEH---------LKKQWAEAAARIGIKLD------------ 99 Query: 157 LLEQSMGIDSKHYTITCRTYSEERPDTFVGPH----NTHGMAVFNDEA--SGTPDIINKS 210 + S G SK Y TY+ H V DE +G ++ Sbjct: 100 -PDYSAGPVSKEYHGVAITYAGVGVRPM--LHRNRCEQRKTLVILDEIHHAGDSKSWGEA 156 Query: 211 ILGFFTELNPNRFWIMTSNTRR 232 F R +T R Sbjct: 157 CQEAF--DPATRRLALTGTPFR 176 >gi|322835667|ref|YP_004215693.1| terminase large subunit [Rahnella sp. Y9602] gi|321170868|gb|ADW76566.1| terminase large subunit [Rahnella sp. Y9602] Length = 539 Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 32/215 (14%), Positives = 73/215 (33%), Gaps = 20/215 (9%) Query: 152 GWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSI 211 GW A+ M ++ + + + F DEA+ + I Sbjct: 162 GWSAKKHAPYMRVEFPTTGAVLKGEAGDNIGR-----GDRTTLYFVDEAAFLQRPL--LI 214 Query: 212 LGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPL-EDWKRYQIDTRTVEGIDSGFHEGII 270 ++ R + + N + F + + + R+ D ++ Sbjct: 215 EASLSQTTRCRIDLSSVN--GMANPFAQKRHGGRIPVFTFHW---RSDPRKDEAWYAKEC 269 Query: 271 SRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIMGCDIAGE 327 ++ +D+ V + L + IP+ +I A++ + + D+A E Sbjct: 270 AK--IDNPVVVAQELDLNYSASAEGVLIPNEWIRAAINAHIKLGIQPTGKRLGAMDVADE 327 Query: 328 GGDKTVVVFRRGNIIEHIFDWS--AKLIQETNQEG 360 G DK R G ++ + +WS I +++++ Sbjct: 328 GRDKNAFSARYGFLLTEVEEWSGVGSDIYKSSEKA 362 >gi|13242438|ref|NP_077457.1| DNA packaging terminase subunit 1 [Cercopithecine herpesvirus 9] gi|11036590|gb|AAG27219.1|AF275348_40 unknown [Cercopithecine herpesvirus 9] Length = 745 Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 27/149 (18%), Positives = 51/149 (34%), Gaps = 16/149 (10%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL 148 GKT ++ L+ST G+ + A+ + + E+ WF + + Sbjct: 271 GKTWFIVSLIALLMSTFRGIKVGYTAHIRKATEPV-FEEIK-----ARLEQWFGTERIE- 323 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIIN 208 E S T S + G +F DEA+ Sbjct: 324 ------HVKGESITFSFSDGCCSTAVFSSSHNTNGIRGQTFN---LLFVDEANFIRPDAV 374 Query: 209 KSILGFFTELNPNRFWIMTSNTRRLNGWF 237 ++I+GF + N ++ ++NT + + F Sbjct: 375 QTIVGFLNQTNCKIIFVSSTNTGKASTSF 403 >gi|307544683|ref|YP_003897162.1| hypothetical protein HELO_2093 [Halomonas elongata DSM 2581] gi|307216707|emb|CBV41977.1| K06909 [Halomonas elongata DSM 2581] Length = 531 Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 21/137 (15%), Positives = 46/137 (33%), Gaps = 8/137 (5%) Query: 228 SNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ 287 S + F + R D ++ + + EI Sbjct: 229 STPNGMGNPFAQ--RRHSGKISVFTFHWRDDPRKDDAWYAKQVDELDPVT--VAQEIDIN 284 Query: 288 FPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHI 345 + IP +++ A+ ++ ++ + D+A EG D+ R G +++ + Sbjct: 285 YSASVEGVLIPSAWVQAAVDAHKKLGIEITGERLGALDVADEGKDQNAYAGRHGILLDLV 344 Query: 346 FDWSAK--LIQETNQEG 360 +W+ K I T Q+ Sbjct: 345 DEWTGKGSDIFGTVQKA 361 >gi|332088044|gb|EGI93169.1| gp33 TerL [Shigella boydii 5216-82] Length = 539 Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 54/165 (32%), Gaps = 11/165 (6%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 F DEA+ + I ++ R + + N +N F Sbjct: 195 DRTTLYFVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMNNPFAQ--KRHSGKIP 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDV-ARIEILGQFPQQEVNNFIPHNYIEEAMSR 308 + R+ D ++ + +D+ V E+ + IP +++ A+ Sbjct: 249 VFTFHWRSDPRKDDEWYHKECEK--IDNPVIVAQELDLNYQASAEGILIPSEWVQAAVDA 306 Query: 309 EAIDD--LYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAK 351 + D+A EG DK R G ++ + +WS K Sbjct: 307 HIRLGIQPGGQRLGAMDVADEGRDKNACSLRYGILLNDVQEWSGK 351 >gi|307940746|gb|ADN95987.1| polyprotein [Chionodraco hamatus] Length = 2968 Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 14/69 (20%), Positives = 26/69 (37%), Gaps = 7/69 (10%) Query: 55 QLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWM------MLWLISTRPGM 108 Q+ + C ++ NP F I+ G G GK+ L + +L + P Sbjct: 2284 QMSIFYQIRQWCLDKISGKNPDPFHVFITGGAGTGKSHLIKALQYETTRLLSPLCDHPDS 2343 Query: 109 S-IICIANS 116 ++ A + Sbjct: 2344 VCVLLTAPT 2352 >gi|323173153|gb|EFZ58784.1| gp33 TerL protein [Escherichia coli LT-68] Length = 539 Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 54/165 (32%), Gaps = 11/165 (6%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 F DEA+ + I ++ R + + N +N F Sbjct: 195 DRTTLYFVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMNNPFAQ--KRHSGKIP 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDV-ARIEILGQFPQQEVNNFIPHNYIEEAMSR 308 + R+ D ++ + +D+ V E+ + IP +++ A+ Sbjct: 249 VFTFHWRSDPRKDDEWYHKECEK--IDNPVIVAQELDLNYQASAEGILIPSEWVQAAVDA 306 Query: 309 EAIDD--LYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAK 351 + D+A EG DK R G ++ + +WS K Sbjct: 307 HIRLGIQPGGQRLGAMDVADEGRDKNACSLRYGILLNDVQEWSGK 351 >gi|331650684|ref|ZP_08351739.1| conserved hypothetical protein [Escherichia coli M605] gi|331040472|gb|EGI12647.1| conserved hypothetical protein [Escherichia coli M605] Length = 414 Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 58/176 (32%), Gaps = 15/176 (8%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 DEA+ + I ++ R + + N + F Sbjct: 141 DRTTLYLVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMANPFAQ--KRHGGKIP 194 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQ-FPQQEVNNFIPHNYIEE---A 305 + R D ++ + +D+ V + L + IP +++ A Sbjct: 195 VFTFHWRDDPRKDEEWYRRECEK--IDNPVVVAQELDLNYSASAEGVLIPSEWVQATVDA 252 Query: 306 MSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWS--AKLIQETNQE 359 + I L D+A EG DK R G ++E++ +WS I ++ ++ Sbjct: 253 HIKLGIQPTGKRLGA-MDVADEGRDKNAFSTRHGFLLENVREWSGVGSDIYQSVEK 307 >gi|225576048|ref|YP_002724855.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 118a] gi|225546646|gb|ACN92649.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi 118a] Length = 450 Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 52/164 (31%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 D ++ I D A GGD T + R + Sbjct: 271 ITN--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|216996657|ref|YP_002333778.1| phage terminase, large subunit, PBSX family [Borrelia afzelii ACA-1] gi|216752579|gb|ACJ73283.1| phage terminase, large subunit, PBSX family [Borrelia afzelii ACA-1] Length = 450 Score = 42.0 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKIDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNVATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIAITDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITQ--------DYVFTSPIAYLDPAFSIGGDNTALCVMERIDDK 306 >gi|226305996|ref|YP_002765956.1| hypothetical protein RER_25090 [Rhodococcus erythropolis PR4] gi|226185113|dbj|BAH33217.1| hypothetical protein RER_25090 [Rhodococcus erythropolis PR4] Length = 402 Score = 42.0 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 23/210 (10%), Positives = 61/210 (29%), Gaps = 20/210 (9%) Query: 63 DVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKN 122 +H + + FK + GR GKTT A + + A + Q ++ Sbjct: 3 RLHQSQRKIAESSSRFKV-LRCGRRFGKTTY-AVEEMKGACLFEPGPVAYFATTRDQARD 60 Query: 123 TLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPD 182 +WAE+ + +++ ++ L + + + + T R Sbjct: 61 IVWAELLE--NVIGTTNYVSHNEQRLEVTLRRPDGSLNRIRLFGWENIETARG------- 111 Query: 183 TFVGPHNTHGMAVF--NDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 + + V D + I + R M + + + + Sbjct: 112 ------KKYSLVVLDELDSMRAFEKQWREIIRATLADY-RGRALFMGTPKGYKSLYRLEK 164 Query: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGII 270 + +++ + + + + + Sbjct: 165 LSKTNANYEVFHFTSFDNPFLSVEELDEMR 194 >gi|225621767|ref|YP_002724125.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] gi|225547658|gb|ACN93635.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] Length = 450 Score = 42.0 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDNPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNTATFKTYNFTTYDNVLLGKGFIEPQEKLY-KDIPTYKARVLLGEWIASIDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D +++ I D A GGD T + R + Sbjct: 271 ITQ--------DYVFSSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|237704849|ref|ZP_04535330.1| terminase large subunit [Escherichia sp. 3_2_53FAA] gi|226901215|gb|EEH87474.1| terminase large subunit [Escherichia sp. 3_2_53FAA] gi|315288241|gb|EFU47640.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 110-3] Length = 471 Score = 42.0 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 77/275 (28%), Gaps = 21/275 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + + G + +EA Sbjct: 68 TIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGID-----ICWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + W+ + L+ + P +D ++ Sbjct: 123 VTKESWDILIPTIRKPFSE-IWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + + + R LG+ I ++E A ++ ++ Sbjct: 182 EVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVS 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 D + G D R G++++ I + I E Sbjct: 242 AHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINE 276 >gi|260868683|ref|YP_003235085.1| putative terminase large subunit [Escherichia coli O111:H- str. 11128] gi|293446697|ref|ZP_06663119.1| phage terminase large subunit [Escherichia coli B088] gi|257765039|dbj|BAI36534.1| putative terminase large subunit [Escherichia coli O111:H- str. 11128] gi|291323527|gb|EFE62955.1| phage terminase large subunit [Escherichia coli B088] gi|323177130|gb|EFZ62720.1| phage terminase, large subunit, PBSX family [Escherichia coli 1180] Length = 471 Score = 42.0 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 77/275 (28%), Gaps = 21/275 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + + G + +EA Sbjct: 68 TIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGID-----ICWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + W+ + L+ + P +D ++ Sbjct: 123 VTKESWDILIPTIRKPFSE-IWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + + + R LG+ I ++E A ++ ++ Sbjct: 182 EVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVS 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 D + G D R G++++ I + I E Sbjct: 242 AHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINE 276 >gi|211731828|gb|ACJ10140.1| terminase [Bacteriophage APSE-6] Length = 469 Score = 42.0 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 51/183 (27%), Gaps = 38/183 (20%) Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPL---------- 245 + +EA + +++ + + + N +G Y F P Sbjct: 105 WVEEAETVSEKSLDTLIPTIRKPGSELRF--SFNPAEEDGAVYKRFVKPYKAIIDKQGYY 162 Query: 246 ---EDW-------KRYQIDTR---TVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQE 292 + + + + + ++ YG + D Sbjct: 163 EDDDLYVGNVSYLDNPWLPVELKNDAQKMKRENYKKWRHVYGGECDANY----------- 211 Query: 293 VNNFIPHNYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSA 350 + I ++ A+ + ++ D AG G D+ + R G +IE W Sbjct: 212 EDALIQPEWVGAAIDAHIKLGFKPSGIRVVTFDPAGSGQDEKALSKRYGVLIEDCVSWLE 271 Query: 351 KLI 353 + Sbjct: 272 GDV 274 >gi|225621943|ref|YP_002724616.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] gi|225547242|gb|ACN93227.1| phage terminase, large subunit, pbsx family [Borrelia sp. SV1] Length = 450 Score = 42.0 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y + + + LG++ + F N Sbjct: 212 IDNVATFKTYNFTTYDNVLLSKGFIETQEKLY-KEIPTYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D ++ I D A GGD T + R + Sbjct: 271 ITQ--------DYVFTSPIAYLDPAFSIGGDNTALCVMERVDDK 306 >gi|317483571|ref|ZP_07942553.1| phage terminase [Bifidobacterium sp. 12_1_47BFAA] gi|316914997|gb|EFV36437.1| phage terminase [Bifidobacterium sp. 12_1_47BFAA] Length = 487 Score = 42.0 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 53/189 (28%), Gaps = 27/189 (14%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSII 111 RWQ + + ++ +I R GKT + +++ L + P +++I Sbjct: 42 DRWQQGLLTLILGRRADGTFAASVGGVVLSI--CRQTGKTFTVSSLVVILCTLIPNLTVI 99 Query: 112 CIA-------NSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGI 164 A N+ ++ + + + + G+ Sbjct: 100 WTAHHNRTNSNTFDHVRTLV------------RNPAL----IGYLDHSGRTDGVRGGNGM 143 Query: 165 DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFW 224 + + R F N + DEA + ++ T +PN Sbjct: 144 QEITFANGSKILFGARAQGFA-RGNDAVDIIVFDEAQILTEQAISDMVPA-TNTSPNALV 201 Query: 225 IMTSNTRRL 233 + R Sbjct: 202 LYIGTPPRP 210 >gi|326779045|ref|ZP_08238310.1| type III restriction protein res subunit [Streptomyces cf. griseus XylebKG-1] gi|326659378|gb|EGE44224.1| type III restriction protein res subunit [Streptomyces cf. griseus XylebKG-1] Length = 609 Score = 41.6 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 56/202 (27%), Gaps = 49/202 (24%) Query: 37 PWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAW 96 PWG GK WQ ME P F A++ G GKTT Sbjct: 34 PWGTAGKL-------RAWQQGAME--------RYVQEQPRDF-LAVATP-GAGKTTFALT 76 Query: 97 MMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAE 156 + WL+ I +A +E + K + R ++ Sbjct: 77 LASWLLHHHVVQQITVVAPTEH---------LKKQWAEAAARIGIKLD------------ 115 Query: 157 LLEQSMGIDSKHYTITCRTYSEERPDTFVGPH----NTHGMAVFNDEA--SGTPDIINKS 210 + S G SK Y TY+ H V DE +G ++ Sbjct: 116 -PDYSAGPLSKEYHGVAVTYAGVGVRPM--LHRNRCEQRKTLVILDEIHHAGDSKSWGEA 172 Query: 211 ILGFFTELNPNRFWIMTSNTRR 232 F R +T R Sbjct: 173 CQEAF--DPATRRLALTGTPFR 192 >gi|294492319|gb|ADE91075.1| phage terminase, large subunit, PBSX family [Escherichia coli IHE3034] Length = 471 Score = 41.6 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 77/275 (28%), Gaps = 21/275 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + + G + +EA Sbjct: 68 TIEREGYTAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGID-----ICWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + W+ + L+ + P +D ++ Sbjct: 123 VTKESWDILIPTIRKPFSE-IWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + + + R LG+ I ++E A ++ ++ Sbjct: 182 EVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVS 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 D + G D R G++++ I + I E Sbjct: 242 AHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINE 276 >gi|182438394|ref|YP_001826113.1| hypothetical protein SGR_4601 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178466910|dbj|BAG21430.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 609 Score = 41.6 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 56/202 (27%), Gaps = 49/202 (24%) Query: 37 PWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAW 96 PWG GK WQ ME P F A++ G GKTT Sbjct: 34 PWGTAGKL-------RAWQQGAME--------RYVQEQPRDF-LAVATP-GAGKTTFALT 76 Query: 97 MMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAE 156 + WL+ I +A +E + K + R ++ Sbjct: 77 LASWLLHHHVVQQITVVAPTEH---------LKKQWAEAAARIGIKLD------------ 115 Query: 157 LLEQSMGIDSKHYTITCRTYSEERPDTFVGPH----NTHGMAVFNDEA--SGTPDIINKS 210 + S G SK Y TY+ H V DE +G ++ Sbjct: 116 -PDYSAGPLSKEYHGVAVTYAGVGVRPM--LHRNRCEQRKTLVILDEIHHAGDSKSWGEA 172 Query: 211 ILGFFTELNPNRFWIMTSNTRR 232 F R +T R Sbjct: 173 CQEAF--DPATRRLALTGTPFR 192 >gi|168467237|ref|ZP_02701079.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|195630466|gb|EDX49092.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Newport str. SL317] Length = 539 Score = 41.6 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 55/165 (33%), Gaps = 11/165 (6%) Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWK 249 F DEA+ + I ++ R + + N +N F Sbjct: 195 DRTTLYFVDEAAFLQRPL--LIDAALSQTTRCRIDLSSVN--GMNNPFAQ--KRHSGKIP 248 Query: 250 RYQIDTRTVEGIDSGFHEGIISRYGLDSDV-ARIEILGQFPQQEVNNFIPHNYIEEAMS- 307 + R+ D ++ + +D+ + E+ + IP +++ A+ Sbjct: 249 VFTFHWRSDPRKDDEWYRKECEK--IDNPIIVAQELDLNYQASAEGILIPSEWVQAAVDA 306 Query: 308 -REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAK 351 + + D+A EG DK R G ++ + +WS K Sbjct: 307 HIKLGIQPSGQRLGAMDVADEGRDKNACSLRYGFLLSDVQEWSGK 351 >gi|320179507|gb|EFW54461.1| Phage terminase, large subunit [Shigella boydii ATCC 9905] Length = 539 Score = 41.6 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 24/63 (38%), Gaps = 2/63 (3%) Query: 291 QEVNNFIPHNYIEEAMSREAIDD--LYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDW 348 IP +++ A+ + D+A EG DK R G ++ + +W Sbjct: 289 SAEGILIPSEWVQAAVDAHIRLGIQPGGQRLGAMDVADEGRDKNACSLRYGILLNDVQEW 348 Query: 349 SAK 351 S K Sbjct: 349 SGK 351 >gi|222032743|emb|CAP75482.1| Terminase large subunit [Escherichia coli LF82] Length = 470 Score = 41.6 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 77/275 (28%), Gaps = 21/275 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + + G + +EA Sbjct: 68 TIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGID-----ICWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + W+ + L+ + P +D ++ Sbjct: 123 VTKESWDILIPTIRKPFSE-IWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + + + R LG+ I ++E A ++ ++ Sbjct: 182 EVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVS 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 D + G D R G++++ I + I E Sbjct: 242 AHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINE 276 >gi|325497784|gb|EGC95643.1| gene 2 protein [Escherichia fergusonii ECD227] Length = 470 Score = 41.6 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 77/275 (28%), Gaps = 21/275 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + + G + +EA Sbjct: 68 TIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGID-----ICWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + W+ + L+ + P +D ++ Sbjct: 123 VTKESWDILIPTIRKPFSE-IWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + + + R LG+ I ++E A ++ ++ Sbjct: 182 EVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVS 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 D + G D R G++++ I + I E Sbjct: 242 AHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINE 276 >gi|300897414|ref|ZP_07115839.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 198-1] gi|300358826|gb|EFJ74696.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 198-1] Length = 470 Score = 41.6 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 77/275 (28%), Gaps = 21/275 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + + G + +EA Sbjct: 68 TIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGID-----ICWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + W+ + L+ + P +D ++ Sbjct: 123 VTKESWDILIPTIRKPFSE-IWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + + + R LG+ I ++E A ++ ++ Sbjct: 182 EVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVS 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 D + G D R G++++ I + I E Sbjct: 242 AHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINE 276 >gi|41057280|ref|NP_958178.1| gene 2 protein [Enterobacteria phage Sf6] gi|191165541|ref|ZP_03027382.1| phage terminase, large subunit, pbsx family [Escherichia coli B7A] gi|218695968|ref|YP_002403635.1| Terminase large subunit [Escherichia coli 55989] gi|331678314|ref|ZP_08378989.1| phage terminase, large subunit, PBSX family [Escherichia coli H591] gi|33334159|gb|AAQ12192.1| gene 2 protein [Shigella phage Sf6] gi|190904464|gb|EDV64172.1| phage terminase, large subunit, pbsx family [Escherichia coli B7A] gi|218352700|emb|CAU98482.1| Terminase large subunit [Escherichia coli 55989] gi|324114096|gb|EGC08069.1| phage terminase large subunit [Escherichia fergusonii B253] gi|331074774|gb|EGI46094.1| phage terminase, large subunit, PBSX family [Escherichia coli H591] Length = 470 Score = 41.6 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 77/275 (28%), Gaps = 21/275 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + + G + +EA Sbjct: 68 TIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGID-----ICWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + W+ + L+ + P +D ++ Sbjct: 123 VTKESWDILIPTIRKPFSE-IWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + + + R LG+ I ++E A ++ ++ Sbjct: 182 EVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVS 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 D + G D R G++++ I + I E Sbjct: 242 AHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINE 276 >gi|91211665|ref|YP_541651.1| terminase large subunit [Escherichia coli UTI89] gi|117624554|ref|YP_853467.1| phage terminase large subunit [Escherichia coli APEC O1] gi|218559279|ref|YP_002392192.1| Terminase large subunit [Escherichia coli S88] gi|91073239|gb|ABE08120.1| terminase large subunit [Escherichia coli UTI89] gi|115513678|gb|ABJ01753.1| phage terminase large subunit [Escherichia coli APEC O1] gi|148566126|gb|ABQ88401.1| phage terminase large subunit [Enterobacteria phage CUS-3] gi|218366048|emb|CAR03793.1| Terminase large subunit [Escherichia coli S88] gi|307626097|gb|ADN70401.1| terminase large subunit [Escherichia coli UM146] gi|323948780|gb|EGB44679.1| phage terminase large subunit [Escherichia coli H252] Length = 471 Score = 41.6 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 77/275 (28%), Gaps = 21/275 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + + G + +EA Sbjct: 68 TIEREGYTAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGID-----ICWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + W+ + L+ + P +D ++ Sbjct: 123 VTKESWDILIPTIRKPFSE-IWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + + + R LG+ I ++E A ++ ++ Sbjct: 182 EVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVS 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 D + G D R G++++ I + I E Sbjct: 242 AHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINE 276 >gi|323936486|gb|EGB32774.1| phage terminase large [Escherichia coli E1520] Length = 470 Score = 41.6 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 77/275 (28%), Gaps = 21/275 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + + G + +EA Sbjct: 68 TIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGID-----ICWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + W+ + L+ + P +D ++ Sbjct: 123 VTKESWDILIPTIRKPFSE-IWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + + + R LG+ I ++E A ++ ++ Sbjct: 182 EVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVS 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 D + G D R G++++ I + I E Sbjct: 242 AHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINE 276 >gi|13559866|ref|NP_112076.1| terminase large subunit [Enterobacteria phage HK620] gi|13517602|gb|AAK28891.1|AF335538_43 terminase large subunit [Salmonella phage HK620] Length = 470 Score = 41.6 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 77/275 (28%), Gaps = 21/275 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + + G + +EA Sbjct: 68 TIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGID-----ICWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + W+ + L+ + P +D ++ Sbjct: 123 VTKESWDILIPTIRKPFSE-IWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + + + R LG+ I ++E A ++ ++ Sbjct: 182 EVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVS 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 D + G D R G++++ I + I E Sbjct: 242 AHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINE 276 >gi|110804738|ref|YP_688258.1| putative bacteriophage protein [Shigella flexneri 5 str. 8401] gi|110614286|gb|ABF02953.1| putative bacteriophage protein [Shigella flexneri 5 str. 8401] Length = 255 Score = 41.6 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 16/60 (26%), Positives = 28/60 (46%), Gaps = 2/60 (3%) Query: 295 NFIPHNYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 I ++IE A+ + + +G D+A G DK V+R G+++ +W AK Sbjct: 10 AIIKLSWIEAAVDAHKTLNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKE 69 >gi|195942758|ref|ZP_03088140.1| hypothetical protein Bbur8_08065 [Borrelia burgdorferi 80a] Length = 312 Score = 41.6 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 53/168 (31%), Gaps = 18/168 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNIIEHI 345 + D ++ I D A GD T + R + I Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVRGDNTALCVMERVDDQFRTI 310 >gi|225622132|ref|YP_002725127.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 94a] gi|225546387|gb|ACN92395.1| phage terminase, large subunit, PBSX family [Borrelia burgdorferi 94a] Length = 450 Score = 41.6 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 52/164 (31%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFVETQEKLY-KDIPSYQARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 D ++ I D A GGD T + R + Sbjct: 271 ITN--------DYVFTSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|219846951|ref|YP_002333526.2| DNA packaging terminase subunit 1 [Equid herpesvirus 9] gi|226423816|dbj|BAH02470.2| DNA packaging protein [Equid herpesvirus 9] Length = 734 Score = 41.6 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 53/152 (34%), Gaps = 22/152 (14%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSE---TQLKNTLWAEVSKWLSMLPHRHWFEMQS 145 GKT ++ ++T G+ I A+ + + + A + +W P H Sbjct: 264 GKTWFLVPLIALALATFKGIKIGYTAHIRKATEPVFDEIGARLRQWFGNSPVDHVKGENI 323 Query: 146 LSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPD 205 P G + ++ S + G +F DEA+ Sbjct: 324 SFSFPDGSKSTIVF----------------ASSHNTNGIRGQDFN---LLFVDEANFIRP 364 Query: 206 IINKSILGFFTELNPNRFWIMTSNTRRLNGWF 237 ++I+GF + N ++ ++NT + + F Sbjct: 365 EAVQTIIGFLNQTNCKIIFVSSTNTGKASTSF 396 >gi|9629774|ref|NP_045262.1| DNA packaging terminase subunit 1 [Equid herpesvirus 4] gi|2605992|gb|AAC59564.1| 47/44 [Equid herpesvirus 4] Length = 734 Score = 41.6 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 53/152 (34%), Gaps = 22/152 (14%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSE---TQLKNTLWAEVSKWLSMLPHRHWFEMQS 145 GKT ++ ++T G+ I A+ + + + A + +W P H Sbjct: 264 GKTWFLVPLIALALATFKGIKIGYTAHIRKATEPVFDEIGARLRQWFGNSPVDHVKGENI 323 Query: 146 LSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPD 205 P G + ++ S + G +F DEA+ Sbjct: 324 SFSFPDGSKSTIVF----------------ASSHNTNGIRGQDFN---LLFVDEANFIRP 364 Query: 206 IINKSILGFFTELNPNRFWIMTSNTRRLNGWF 237 ++I+GF + N ++ ++NT + + F Sbjct: 365 EAVQTIIGFLNQTNCKIIFVSSTNTGKASTSF 396 >gi|50313286|ref|YP_053090.1| DNA packaging terminase subunit 1 [Equid herpesvirus 1] gi|139648|sp|P28969|TRM3_EHV1B RecName: Full=Tripartite terminase subunit UL15 homolog; AltName: Full=DNA-packaging protein 44; AltName: Full=Terminase large subunit gi|59798996|sp|P84396|TRM3_EHV1V RecName: Full=Tripartite terminase subunit UL15 homolog; AltName: Full=DNA-packaging protein 44; AltName: Full=Terminase large subunit gi|42795172|gb|AAS45929.1| putative terminase [Equid herpesvirus 1] gi|49617029|gb|AAT67302.1| DNA packaging protein [Equid herpesvirus 1] Length = 734 Score = 41.6 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 53/152 (34%), Gaps = 22/152 (14%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSE---TQLKNTLWAEVSKWLSMLPHRHWFEMQS 145 GKT ++ ++T G+ I A+ + + + A + +W P H Sbjct: 264 GKTWFLVPLIALALATFKGIKIGYTAHIRKATEPVFDEIGARLRQWFGNSPVDHVKGENI 323 Query: 146 LSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPD 205 P G + ++ S + G +F DEA+ Sbjct: 324 SFSFPDGSKSTIVF----------------ASSHNTNGIRGQDFN---LLFVDEANFIRP 364 Query: 206 IINKSILGFFTELNPNRFWIMTSNTRRLNGWF 237 ++I+GF + N ++ ++NT + + F Sbjct: 365 EAVQTIIGFLNQTNCKIIFVSSTNTGKASTSF 396 >gi|262200363|ref|YP_003271571.1| NERD domain-containing protein [Gordonia bronchialis DSM 43247] gi|262083710|gb|ACY19678.1| NERD domain protein [Gordonia bronchialis DSM 43247] Length = 550 Score = 41.6 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 50/171 (29%), Gaps = 5/171 (2%) Query: 57 EFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANS 116 + + + + ++ + + I G G GKT L + +R G + I S Sbjct: 200 DATAEIITEQQAVILSAISKLSRVEIRGGAGSGKTFL--ALEQARRLSRAGQRVALICYS 257 Query: 117 ETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTY 176 L + S W + E L + ++ + T Sbjct: 258 HG-LASYFTRITSHWSRREQPAYIGEFHDLGITWGASAGPDESVRTQEAAEFWEHTLPHQ 316 Query: 177 SEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMT 227 E + H A+ DEA D +L + + ++ + Sbjct: 317 MVELAEALPPGHRFD--AIVIDEAQDFADDWWLPLLACLRDPGTSGIYLFS 365 >gi|167553969|ref|ZP_02347711.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] gi|205321713|gb|EDZ09552.1| gp33 TerL [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] Length = 539 Score = 41.3 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 25/63 (39%), Gaps = 2/63 (3%) Query: 291 QEVNNFIPHNYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDW 348 IP +++ A+ + + D+A EG DK R G ++ + +W Sbjct: 289 STEGILIPSEWVQAAVDAHIKLGIQPSGQRLGAMDVADEGRDKNACSLRYGFLLSDVQEW 348 Query: 349 SAK 351 S K Sbjct: 349 SGK 351 >gi|254485756|ref|ZP_05098961.1| phage DNA Packaging Protein [Roseobacter sp. GAI101] gi|214042625|gb|EEB83263.1| phage DNA Packaging Protein [Roseobacter sp. GAI101] Length = 452 Score = 41.3 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 44/272 (16%), Positives = 78/272 (28%), Gaps = 47/272 (17%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGM---------SIICIANSETQLKNTLWAEVSKWL 132 I GRG GKT A W+ S G + + + Q++ + S L Sbjct: 60 IMGGRGAGKTRAGA---EWVRSMVEGARPLDAGRCRRVALVGETIEQVREVMIFGDSGIL 116 Query: 133 SMLP--HRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNT 190 + P R +E L ++ P+ GP Sbjct: 117 ACSPADRRPDWEATRKRL-----------------VWPNGAVASVHTAHDPEGLRGPQFD 159 Query: 191 HGMAVFNDEAS--GTPDIINKSILGFF-TELNPNRFWIMTSNTRRLNGWFYDIFNIPLED 247 A + DE + + + +P + + T R + P Sbjct: 160 ---AAWVDELAKWKKAEETWDQLQFALRLGEDPR---VCVTTTPRNVDVLKKLLASPSTV 213 Query: 248 WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 + + F E + +RY + + R E+ G +E Sbjct: 214 -TTHAPTEANAANLAGSFLEEVRARY-RGTRLGRQELDGVLLADAEGALWTSEMLER--G 269 Query: 308 REAIDDLYAPLIMGCDI---AGEGGDKTVVVF 336 R + +++G D AG G D+ +V Sbjct: 270 RIEKLPTFDRIVVGVDPATTAGAGSDECGIVV 301 >gi|327400267|ref|YP_004341106.1| hypothetical protein Arcve_0358 [Archaeoglobus veneficus SNP6] gi|327315775|gb|AEA46391.1| protein of unknown function DUF699 ATPase [Archaeoglobus veneficus SNP6] Length = 807 Score = 41.3 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 23/141 (16%), Positives = 48/141 (34%), Gaps = 25/141 (17%) Query: 80 CAISAGRGIGKTTLNAWMMLWLIS-----TRPGMSIICIANSETQLKNTLWAEVSKWLSM 134 I+A RG GKT + + +LIS + + I+ +A + ++ + + K L Sbjct: 275 VVITADRGRGKTAVLGIVTPYLISRMHRVLKRPVRIMVVAPTPQAVQTY-FRFLKKALVR 333 Query: 135 LPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMA 194 Q + + L+ ++ + R E+ + Sbjct: 334 ---------QGMKNYKVKESNGLITVINSKFARVEYVVPRRAMIEK---------DYADI 375 Query: 195 VFNDEASGTPDII-NKSILGF 214 + DEA+G + + G Sbjct: 376 IIVDEAAGIDVPVLWQITEGA 396 >gi|168239626|ref|ZP_02664684.1| phage terminase, large subunit, pbsx family protein [Salmonella enterica subsp. enterica serovar Schwarzengrund str. SL480] gi|197287704|gb|EDY27095.1| phage terminase, large subunit, pbsx family protein [Salmonella enterica subsp. enterica serovar Schwarzengrund str. SL480] Length = 470 Score = 41.3 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 77/275 (28%), Gaps = 21/275 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + + G + +EA Sbjct: 68 TIEREGYAAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGID-----ICWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + W+ + L+ + P +D ++ Sbjct: 123 VTKESWDILVPTIRKPFSE-IWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + + + R LG+ I ++E A ++ ++ Sbjct: 182 EVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVS 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 D + G D R G++++ I + I E Sbjct: 242 AHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINE 276 >gi|324019922|gb|EGB89141.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 117-3] Length = 471 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 77/275 (28%), Gaps = 21/275 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + + G + +EA Sbjct: 68 TIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGID-----ICWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + W+ + L+ + P +D ++ Sbjct: 123 VTKESWDILIPTIRKPFSE-IWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + + + R LG+ I ++E A ++ ++ Sbjct: 182 EVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVS 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 D + G D R G++++ I + I E Sbjct: 242 AHDPSDTGPDAKGYASRHGSVVKCIAEGLLMDINE 276 >gi|323352542|gb|EGA85041.1| Kre33p [Saccharomyces cerevisiae VL3] Length = 966 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 29/156 (18%), Positives = 54/156 (34%), Gaps = 16/156 (10%) Query: 58 FMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSE 117 + +D +N F A++AGRG GK+ + + +I + S Sbjct: 173 ILSFIDAISEKTLN------FTVALTAGRGRGKSAALGISIA-AAVSHGYSNIFVTSPSP 225 Query: 118 TQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYS 177 LK L+ + K L ++ + + + ++ + D + +T Sbjct: 226 ENLKT-LFEFIFKGFDALGYQEHIDYDIIQSTNPDFNKAIVRVDIKRDHR------QTIQ 278 Query: 178 EERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILG 213 P V DEA+ P I K++LG Sbjct: 279 YIVPQDHQVLGQAE--LVVIDEAAAIPLPIVKNLLG 312 >gi|323335941|gb|EGA77219.1| Kre33p [Saccharomyces cerevisiae Vin13] Length = 961 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 29/156 (18%), Positives = 54/156 (34%), Gaps = 16/156 (10%) Query: 58 FMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSE 117 + +D +N F A++AGRG GK+ + + +I + S Sbjct: 173 ILSFIDAISEKTLN------FTVALTAGRGRGKSAALGISIA-AAVSHGYSNIFVTSPSP 225 Query: 118 TQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYS 177 LK L+ + K L ++ + + + ++ + D + +T Sbjct: 226 ENLKT-LFEFIFKGFDALGYQEHIDYDIIQSTNPDFNKAIVRVDIKRDHR------QTIQ 278 Query: 178 EERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILG 213 P V DEA+ P I K++LG Sbjct: 279 YIVPQDHQVLGQAE--LVVIDEAAAIPLPIVKNLLG 312 >gi|190409119|gb|EDV12384.1| hypothetical protein SCRG_03266 [Saccharomyces cerevisiae RM11-1a] Length = 1056 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 29/156 (18%), Positives = 54/156 (34%), Gaps = 16/156 (10%) Query: 58 FMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSE 117 + +D +N F A++AGRG GK+ + + +I + S Sbjct: 263 ILSFIDAISEKTLN------FTVALTAGRGRGKSAALGISIA-AAVSHGYSNIFVTSPSP 315 Query: 118 TQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYS 177 LK L+ + K L ++ + + + ++ + D + +T Sbjct: 316 ENLKT-LFEFIFKGFDALGYQEHIDYDIIQSTNPDFNKAIVRVDIKRDHR------QTIQ 368 Query: 178 EERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILG 213 P V DEA+ P I K++LG Sbjct: 369 YIVPQDHQVLGQAE--LVVIDEAAAIPLPIVKNLLG 402 >gi|151944405|gb|EDN62683.1| killer toxin resistant protein [Saccharomyces cerevisiae YJM789] gi|207341763|gb|EDZ69729.1| YNL132Wp-like protein [Saccharomyces cerevisiae AWRI1631] gi|256273837|gb|EEU08759.1| Kre33p [Saccharomyces cerevisiae JAY291] gi|259149229|emb|CAY82471.1| Kre33p [Saccharomyces cerevisiae EC1118] Length = 1056 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 29/156 (18%), Positives = 54/156 (34%), Gaps = 16/156 (10%) Query: 58 FMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSE 117 + +D +N F A++AGRG GK+ + + +I + S Sbjct: 263 ILSFIDAISEKTLN------FTVALTAGRGRGKSAALGISIA-AAVSHGYSNIFVTSPSP 315 Query: 118 TQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYS 177 LK L+ + K L ++ + + + ++ + D + +T Sbjct: 316 ENLKT-LFEFIFKGFDALGYQEHIDYDIIQSTNPDFNKAIVRVDIKRDHR------QTIQ 368 Query: 178 EERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILG 213 P V DEA+ P I K++LG Sbjct: 369 YIVPQDHQVLGQAE--LVVIDEAAAIPLPIVKNLLG 402 >gi|6324197|ref|NP_014267.1| Kre33p [Saccharomyces cerevisiae S288c] gi|1730777|sp|P53914|KRE33_YEAST RecName: Full=UPF0202 protein KRE33; AltName: Full=Killer toxin-resistance protein 33 gi|854505|emb|CAA86893.1| orf16 [Saccharomyces cerevisiae] gi|1302072|emb|CAA96014.1| unnamed protein product [Saccharomyces cerevisiae] gi|285814522|tpg|DAA10416.1| TPA: Kre33p [Saccharomyces cerevisiae S288c] Length = 1056 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 29/156 (18%), Positives = 54/156 (34%), Gaps = 16/156 (10%) Query: 58 FMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSE 117 + +D +N F A++AGRG GK+ + + +I + S Sbjct: 263 ILSFIDAISEKTLN------FTVALTAGRGRGKSAALGISIA-AAVSHGYSNIFVTSPSP 315 Query: 118 TQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYS 177 LK L+ + K L ++ + + + ++ + D + +T Sbjct: 316 ENLKT-LFEFIFKGFDALGYQEHIDYDIIQSTNPDFNKAIVRVDIKRDHR------QTIQ 368 Query: 178 EERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILG 213 P V DEA+ P I K++LG Sbjct: 369 YIVPQDHQVLGQAE--LVVIDEAAAIPLPIVKNLLG 402 >gi|24112089|ref|NP_706599.1| putative bacteriophage protein [Shigella flexneri 2a str. 301] gi|30062202|ref|NP_836373.1| putative bacteriophage protein [Shigella flexneri 2a str. 2457T] gi|24050918|gb|AAN42306.1| putative bacteriophage protein [Shigella flexneri 2a str. 301] gi|30040447|gb|AAP16179.1| putative bacteriophage protein [Shigella flexneri 2a str. 2457T] gi|281600053|gb|ADA73037.1| putative bacteriophage protein [Shigella flexneri 2002017] gi|332768291|gb|EGJ98476.1| hypothetical protein SF293071_0835 [Shigella flexneri 2930-71] Length = 179 Score = 41.3 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 16/60 (26%), Positives = 28/60 (46%), Gaps = 2/60 (3%) Query: 295 NFIPHNYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 I ++IE A+ + + +G D+A G DK V+R G+++ +W AK Sbjct: 10 AIIKLSWIEAAVDAHKTLNFEPSGRKRIGFDVADSGTDKCANVYRHGSVVFWADEWKAKE 69 >gi|288931818|ref|YP_003435878.1| hypothetical protein Ferp_1452 [Ferroglobus placidus DSM 10642] gi|288894066|gb|ADC65603.1| protein of unknown function DUF699 ATPase putative [Ferroglobus placidus DSM 10642] Length = 763 Score = 41.3 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 51/159 (32%), Gaps = 26/159 (16%) Query: 62 VDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLIS-----TRPGMSIICIANS 116 V + + I+A RG GKT + + +LIS + + I+ +A + Sbjct: 216 VLEAFETFFDRKREKKAVV-ITANRGRGKTAVLGIVTPYLISRMNRVLKRPVRILVVAPT 274 Query: 117 ETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTY 176 ++ K+L R Q + +L+ ++ R Sbjct: 275 PYAVQTYF-----KFLKKALVR-----QGMKEFKEKRSNDLVTVINSKWARVEYAVPRRA 324 Query: 177 SEERPDTFVGPHNTHGMAVFNDEASGTPDII-NKSILGF 214 E+ + + DEA+G + K + G Sbjct: 325 MVEK---------DYADIIIVDEAAGIDVPVLWKIVEGA 354 >gi|326782137|ref|YP_004322538.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-HM1] gi|310004344|gb|ADO98737.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-HM1] Length = 560 Score = 41.3 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 50/340 (14%), Positives = 101/340 (29%), Gaps = 72/340 (21%) Query: 12 EQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVN 71 ++++ E L E + F ++ P + +Q E +E+ + + Sbjct: 23 KEQIQEYLKCKEDPVYFARNYIKIISLDEGIVPFDM----WDFQEELIESFHENRFNIAK 78 Query: 72 NSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKW 131 GK+T +L I +++ +AN + ++ L + Sbjct: 79 LPRQ------------TGKSTTCVSYLLHYILFNDNVNVGILANKLSTARDLL----GRL 122 Query: 132 LSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTH 191 + Q + ++ + EL S + + R S Sbjct: 123 QLAYEQLPLWLQQGIVVY-NKGSMELENGSKILAASTSASAVRGMSFN------------ 169 Query: 192 GMAVFNDEASGTPDII----NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF---NIP 244 +F DE + P+ I S+ T + I+ S +N FY ++ Sbjct: 170 --IIFLDEFAFIPNHIAEQFFSSVYPTITS-GTSTKVIIISTPNGMN-HFYKLWVDAQKG 225 Query: 245 LEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPH--NYI 302 + ++ V G D+ + E I+ QF Q+ F+ I Sbjct: 226 RNGYAWSEVHWSKVPGRDAKWKEQTIANTSER----------QFTQEFDCEFLGSVDTLI 275 Query: 303 EEAMSREAIDD-----------LYAPL-----IMGCDIAG 326 A R D P+ I+ D++ Sbjct: 276 TAAKLRTLTYDDPLTTNGSLDVYENPVRDHDYIICVDVSR 315 >gi|254884963|ref|ZP_05257673.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] gi|254837756|gb|EET18065.1| conserved hypothetical protein [Bacteroides sp. 4_3_47FAA] Length = 566 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 62/196 (31%), Gaps = 27/196 (13%) Query: 82 ISAGRGIGKTTLN----AWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPH 137 + AGRG+ K+T+ ++ +W + PG + +AN+ LK+ + V K M+ Sbjct: 38 VIAGRGMSKSTVIQSRRSYRCIWEM---PGAPLAFVANTYANLKDNIMPAVQKGWEMMGL 94 Query: 138 RHWFEMQSLSLHPSGWYAE---LLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMA 194 P W A+ ++ S S + P G + Sbjct: 95 YEGVHYIRGKEPPVSWKAKCSIIVNDYRNCYSFWNGSVIFMGSLDNPSLLAG---KSVVH 151 Query: 195 VFNDEASGTPD----IINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKR 250 +F DE+ D + G + ++ + T + N DW Sbjct: 152 LFYDESKYDKDEKVNRAMPVLRGDSLTYGASHLFLGLTITTDMPDV-----NEGEYDWYF 206 Query: 251 YQIDTRTVEGIDSGFH 266 R +D Sbjct: 207 -----RYAPNMDPDRI 217 >gi|251778523|ref|ZP_04821443.1| phage terminase, large subunit, pbsx family [Clostridium botulinum E1 str. 'BoNT E Beluga'] gi|243082838|gb|EES48728.1| phage terminase, large subunit, pbsx family [Clostridium botulinum E1 str. 'BoNT E Beluga'] Length = 448 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 36/276 (13%), Positives = 80/276 (28%), Gaps = 40/276 (14%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 G G GK+ M+ P + I L+++++ Sbjct: 48 GGAGSGKSHFVVQKMILKYLEYPNRKCLVIRKVGNSLRDSIF------------------ 89 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMA-------VF 196 + S W+ + ++ T F G ++ + + Sbjct: 90 ELFKTVLSDWHL------LERCEIRDSLLSITLPNGSTFIFKGLDDSEKIKSIANIDDIV 143 Query: 197 NDEASGTPDIINKSILGFFTELNP-NRFWIMTSNTRRLNGWFYDI-FNIPLEDWKRYQID 254 +E + + N N+ +M N + W Y++ F ++ + Sbjct: 144 VEECTEIDKQEFSQLGLRLRSKNGYNQIHVMF-NPISKSNWVYEMWFQNGYDESDTMVLK 202 Query: 255 T--RTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNY--IEEAMSREA 310 T + + + + +I D RI LG+F ++ I N+ ++ + Sbjct: 203 TTYKDNKFLPYDYINALIKMKETDPVYYRIYALGEF--ASLDKLIYTNWEELDFDWRKLM 260 Query: 311 IDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIF 346 YA G D + + + I+ Sbjct: 261 QQRPYAKACFGLDFGYVNDPSAFIAMIVDEVNKEIY 296 >gi|53793591|ref|YP_112491.1| terminase large subunit [Flavobacterium phage 11b] gi|53748181|emb|CAH56642.1| terminase large subunit [Flavobacterium phage 11b] Length = 432 Score = 40.9 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 55/170 (32%), Gaps = 16/170 (9%) Query: 196 FNDEASGTPDIINKS----ILGFFTELNPNRFWIMTSNTRRLNGW--FY--DIFNIPLED 247 F DE + + I + + T N + + FY D D Sbjct: 126 FIDECNQITYKAWQIVKSRIRYKLNQYGIEPKMLGTCNPAKNWVYAQFYLKDKNGTLDND 185 Query: 248 WKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL-GQFP-QQEVNNFIPHNYIEEA 305 K Q + + + ++S +S + + G + + I + I+ Sbjct: 186 KKFIQALPTDNPHLPASYLTSLLSL-DENS---KQRLYYGNWEYDNDPAKLIDYEKIQNC 241 Query: 306 MSREAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 + I + + + DIA G DK V+ G + IF + I E Sbjct: 242 FTNTFIP--FGEMYISADIARFGSDKMVICVWSGFRVVEIFSMAKSSITE 289 >gi|18496890|ref|NP_569740.1| putative terminase gp4 [Mycobacterium phage TM4] gi|4336041|gb|AAD17572.1| putative terminase gp4 [Mycobacterium phage TM4] Length = 474 Score = 40.9 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 57/183 (31%), Gaps = 21/183 (11%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSII 111 WQ + + + + ++ +F +I R GKT L ++ L P ++I Sbjct: 41 DLWQDDLGKLICAKRDDGLYAAD--MFAMSI--PRQTGKTYLLGALVFALCIKTPNTTVI 96 Query: 112 CIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTI 171 A+ + AE + + L R L++H +L + + Sbjct: 97 WTAH-----RTRTAAETFRSMQGLAKRDKIAPHILNVHTGNGKEAVLFK----NGSRILF 147 Query: 172 TCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTR 231 R + G + DEA + ++ T PN ++ Sbjct: 148 GAR-------ERGFGRGFAGVDVLIFDEAQILTENAMDDMVPA-TNAAPNPLILLAGTPP 199 Query: 232 RLN 234 + Sbjct: 200 KPT 202 >gi|326783087|ref|YP_004323484.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-HM2] gi|310005505|gb|ADO99893.1| terminase DNA packaging enzyme large subunit [Prochlorococcus phage P-HM2] Length = 560 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 50/334 (14%), Positives = 104/334 (31%), Gaps = 58/334 (17%) Query: 11 LEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNV 70 + ++ E L E + F ++ P + +Q E +E+ H + Sbjct: 22 TKHQIQEYLKCKEDPVYFAMNYIKIISLDEGIVPFKM----WDFQQELIESFHEHRFNIA 77 Query: 71 NNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSK 130 GK+T +L I +++ +AN + ++ L S+ Sbjct: 78 KLPRQ------------TGKSTTCVSYLLHYILFNDNVNVGILANKLSTARDLL----SR 121 Query: 131 WLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNT 190 + Q + ++ + EL S + + R S Sbjct: 122 LQLAYEQLPLWIQQGIVVY-NKGSMELENGSKILAASTSASAVRGMSFN----------- 169 Query: 191 HGMAVFNDEASGTPDII----NKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF---NI 243 +F DE + P+ I S+ T + I+ S +N FY ++ Sbjct: 170 ---IIFLDEFAFIPNHIAEQFFSSVYPTITS-GTSTKVIIISTPNGMN-HFYKLWVDAQK 224 Query: 244 PLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYI- 302 + ++ V G D+ + E I+ E +F V+ I + + Sbjct: 225 GRNGYAWNEVHWSKVPGRDAKWKEQTIANTSERQ--FTQEFDCEF-LGSVDTLITASKLR 281 Query: 303 ----EEAMSREAIDDLYAPL------IMGCDIAG 326 ++ M+ D+Y I+ D++ Sbjct: 282 VLTYDDVMTTNGSLDIYEKPIDKHEYIITVDVSR 315 >gi|156847104|ref|XP_001646437.1| hypothetical protein Kpol_1048p9 [Vanderwaltozyma polyspora DSM 70294] gi|156117114|gb|EDO18579.1| hypothetical protein Kpol_1048p9 [Vanderwaltozyma polyspora DSM 70294] Length = 1055 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 53/156 (33%), Gaps = 16/156 (10%) Query: 58 FMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSE 117 + +D +N++ ++AGRG GK+ + + +I + S Sbjct: 263 ILSFIDAISEKTLNST------VTLTAGRGRGKSAALGISIA-AAVSHGYSNIFVTSPSP 315 Query: 118 TQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYS 177 LK L+ + K L ++ + + + ++ + D + +T Sbjct: 316 ENLKT-LFEFIFKAFDALGYQEHIDYDIIQSTNPQFNKAIVRVDIKRDHR------QTIQ 368 Query: 178 EERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILG 213 P V DEA+ P I K +LG Sbjct: 369 YIMPQDHQVLGQAE--LVVIDEAAAIPLPIVKKLLG 402 >gi|157159763|ref|YP_001457081.1| PBSX family phage terminase large subunit [Escherichia coli HS] gi|300935792|ref|ZP_07150755.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 21-1] gi|157065443|gb|ABV04698.1| phage terminase, large subunit, pbsx family [Escherichia coli HS] gi|300459025|gb|EFK22518.1| phage terminase, large subunit, PBSX family [Escherichia coli MS 21-1] Length = 471 Score = 40.9 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 30/267 (11%), Positives = 75/267 (28%), Gaps = 21/267 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + + G + +EA Sbjct: 68 TIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGID-----ICWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + W+ + L+ + P +D ++ Sbjct: 123 VTKESWDILIPTIRKPFSE-IWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + + + R LG+ I ++E A ++ ++ Sbjct: 182 EVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVS 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFD 347 D + G D R G++++ I + Sbjct: 242 AHDPSDTGPDAKGYASRHGSVVKRIAE 268 >gi|34365522|tpg|DAA01288.1| TPA_exp: replicase/helicase/endonuclease [Danio rerio] Length = 3007 Score = 40.9 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 23/131 (17%), Positives = 46/131 (35%), Gaps = 26/131 (19%) Query: 1 MPRLISTDQKLEQELHEMLMHAECVLSF----KNFVMR----FFPWGIKGKPLEHFSQPH 52 M + ++ E+ + ++ A ++ N + R + L F + Sbjct: 2248 MKDKLQQVEEHEEHIPDLASEANQKVAHLEKKNNIMCRRDGLALIRSLNDTQLSIFYEIR 2307 Query: 53 RWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTL------NAWMMLWLISTRP 106 +W L+ V NP+ I+ G G GK+ L A +L + P Sbjct: 2308 QWCLD-----------KVMGKNPSPVHLFITGGAGTGKSHLIKAIQYEAMRILSTVCRHP 2356 Query: 107 G-MSIICIANS 116 +S++ A + Sbjct: 2357 DNISVLLTAPT 2367 >gi|218964078|ref|YP_002455438.1| putative phage terminase, pbsx family protein [Borrelia afzelii ACA-1] gi|216752969|gb|ACJ73583.1| putative phage terminase, pbsx family protein [Borrelia afzelii ACA-1] Length = 450 Score = 40.9 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 44/292 (15%), Positives = 82/292 (28%), Gaps = 45/292 (15%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI----STRP- 106 Q E + ++ H T K S G GKT L +++++ + S Sbjct: 46 TTKQKEVLFDIESH----------TYSKVIFSGGIASGKTFLASYLLIKKLIENKSFYEQ 95 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 + I NS L ++ K + + + ++ + I Sbjct: 96 DTNNFIIGNSIGLLMTNTIKQIEKICGL------LGIDYQKKKSGQSFCKIAGLELNIYG 149 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIM 226 D F + ++ +EA+ ++ I Sbjct: 150 GK-----------NRDAFSKIRGGNSAIIYVNEATVIHKETLLEVMKRL--RKGKSIIIF 196 Query: 227 TSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL- 285 +N +F + + +K Y T + F E Y + +L Sbjct: 197 DTNPESPAHYFKTDYIENTDVFKTYNFTTYDNPLNSADFIETQEKLY-KHFPAYKARVLY 255 Query: 286 GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF 336 G++ E + + D + IM D A GGD T V Sbjct: 256 GEWVLNES-SLFNEMIFNQ-------DYEFKSPIMYIDPAFSVGGDNTAVCV 299 >gi|117621599|ref|YP_853855.1| hypothetical protein BAPKO_2028 [Borrelia afzelii PKo] gi|110890985|gb|ABH02150.1| hypothetical protein BAPKO_2028 [Borrelia afzelii PKo] Length = 450 Score = 40.9 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 44/292 (15%), Positives = 82/292 (28%), Gaps = 45/292 (15%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI----STRP- 106 Q E + ++ H T K S G GKT L +++++ + S Sbjct: 46 TTKQKEVLFDIESH----------TYSKVIFSGGIASGKTFLASYLLIKKLIENKSFYEQ 95 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 + I NS L ++ K + + + ++ + I Sbjct: 96 DTNNFIIGNSIGLLMTNTIKQIEKICGL------LGIDYQKKKSGQSFCKIAGLELNIYG 149 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIM 226 D F + ++ +EA+ ++ I Sbjct: 150 GK-----------NRDAFSKIRGGNSAIIYVNEATVIHKETLLEVMKRL--RKGKSIIIF 196 Query: 227 TSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL- 285 +N +F + + +K Y T + F E Y + +L Sbjct: 197 DTNPESPAHYFKTDYIENTDVFKTYNFTTYDNPLNSADFIETQEKLY-KHFPAYKARVLY 255 Query: 286 GQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF 336 G++ E + + D + IM D A GGD T V Sbjct: 256 GEWVLNES-SLFNEMIFNQ-------DYEFKSPIMYIDPAFSVGGDNTAVCV 299 >gi|299531659|ref|ZP_07045064.1| putative phage associated protein [Comamonas testosteroni S44] gi|298720375|gb|EFI61327.1| putative phage associated protein [Comamonas testosteroni S44] Length = 436 Score = 40.5 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 30/178 (16%), Positives = 64/178 (35%), Gaps = 23/178 (12%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GK+ A ++L + ++RP ++C E+ K + H+ + Sbjct: 39 GGRGGGKSWTVAAVLLVMAASRPL-RVLCT------------REIQKSIKQSVHQ-LLKD 84 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHY-TITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 L+ ++ L + GI+ + ++++ + +F G V+ +EA G Sbjct: 85 VITRLNLHAFFEVLETEVRGINGSLFLFSGLQSHTVDSIKSFEGCD-----IVWVEEAHG 139 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF-NIPLEDWKRYQIDTRTVE 259 +++ + + + N Y F P D +I+ R Sbjct: 140 VSKKSWDTLIPTIRKEGSEIWLTL--NPDMETDETYQRFIATPSPDTWVVEINWRDNP 195 >gi|256422889|ref|YP_003123542.1| hypothetical protein Cpin_3879 [Chitinophaga pinensis DSM 2588] gi|256037797|gb|ACU61341.1| hypothetical protein Cpin_3879 [Chitinophaga pinensis DSM 2588] Length = 471 Score = 40.5 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 27/137 (19%), Positives = 54/137 (39%), Gaps = 11/137 (8%) Query: 223 FWIMTSNTRRL--NGWFYDIFNIPL--EDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSD 278 +T N ++ + F+ F + K Q + ID G+ + ++S + Sbjct: 190 RIFVTLNPKKNWCHTVFWKPFKAGQLPDKVKFLQALVQDNPFIDPGYIDNLMS---ITDK 246 Query: 279 VARIEIL-GQFP-QQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVF 336 V + +L G F + N + ++ I + + E + + + DIA G DK+VV+ Sbjct: 247 VKKQRLLYGNFDYDDDDNALMEYDSINDIFTNEFVVE--GKKYITADIARFGSDKSVVMV 304 Query: 337 RRGNIIEHIFDWSAKLI 353 G + I + Sbjct: 305 WNGLRVVEIRKFEKMRT 321 >gi|260856407|ref|YP_003230298.1| putative terminase large subunit [Escherichia coli O26:H11 str. 11368] gi|257755056|dbj|BAI26558.1| putative terminase large subunit [Escherichia coli O26:H11 str. 11368] Length = 470 Score = 40.5 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 75/275 (27%), Gaps = 21/275 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + + G + +EA Sbjct: 68 TIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGID-----ICWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + W+ + L+ + P +D ++ Sbjct: 123 VTKESWDILIPTIRKPFSE-IWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDD--LYAPLIM 320 + + + R LG+ I ++E A ++ Sbjct: 182 EVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHTKLGWKAKGAVVS 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 D + G D R G++++ I + I E Sbjct: 242 AHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINE 276 >gi|158318502|ref|YP_001511010.1| helicase domain-containing protein [Frankia sp. EAN1pec] gi|158113907|gb|ABW16104.1| helicase domain protein [Frankia sp. EAN1pec] Length = 1143 Score = 40.5 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 23/125 (18%), Positives = 40/125 (32%), Gaps = 5/125 (4%) Query: 121 KNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEER 180 K + E+ +L + + +L S W L E ++ D+ Y R Sbjct: 284 KTYIAGELLHEAVILNRQKALVVAPATLRDSTWKPFLRETNLPADTVSYEELTRGMPAAG 343 Query: 181 PDTFVGPHNTHGMAVFNDEASGTPDIINKSILGF---FTELNPNRFWIMTSNTRRLNGWF 237 H V DEA + + T P R ++T+ + Sbjct: 344 QQGAALQHPDAYALVIVDEAHALRSLGTQRAEAMRLLLTGKVPKRLVLLTATPVNNS--L 401 Query: 238 YDIFN 242 YD++N Sbjct: 402 YDLYN 406 >gi|320590344|gb|EFX02787.1| dead deah box DNA helicase [Grosmannia clavigera kw1407] Length = 2423 Score = 40.5 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 51/165 (30%), Gaps = 23/165 (13%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 + G GKT M W RPG ++ IA + ++ + W L + Sbjct: 1194 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAPMKALVRE----RIKDWGRRLAGPAGLRL 1249 Query: 144 QSLSLHPSGWYAELLEQSMGI-DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEA-- 200 L+ + + E + + + + R++ + V DE Sbjct: 1250 VELTGDNTPDTRTIGEADVIVTTPEKWDGISRSWQTRG-------YVRKVSLVIIDEIHL 1302 Query: 201 -SGTPDIINKSI------LGFFTELNPNRFWI--MTSNTRRLNGW 236 +G I + I +G T + + +N L W Sbjct: 1303 LAGDRGPILEIIVSRMNYIGAATGSSVRLLGMSTACANATDLASW 1347 >gi|294677220|ref|YP_003577835.1| terminase-like family protein [Rhodobacter capsulatus SB 1003] gi|294476040|gb|ADE85428.1| terminase-like family protein [Rhodobacter capsulatus SB 1003] Length = 455 Score = 40.5 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 52/297 (17%), Positives = 96/297 (32%), Gaps = 45/297 (15%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGM---------SIICIANSETQLKNT-LWAEVSKW 131 I GRG GKT A W+ G + + + Q+++ ++ E S Sbjct: 62 IMGGRGAGKTRAGA---EWVRMQVEGAGPADAGPAHRVALVGETFDQVRDVMIFGE-SGI 117 Query: 132 LSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTH 191 L+ P E ++ T + YS + P+ GP Sbjct: 118 LACSPPDRRPEWEATK---------------RRLVWANGATAQAYSAQEPEALRGPQFD- 161 Query: 192 GMAVFNDEASGT--PDIINKSILGFF-TELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDW 248 A + DE + + + +P + ++T+ R G I N P Sbjct: 162 --AAWVDELAKWRRAEETWDMLQFALRLGKHPQQ--VITTTP-RNVGVLKAILNNPSTV- 215 Query: 249 KRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR 308 + + F + +RY + + R E+ G + +E R Sbjct: 216 VTHAPTEANRAYLAESFLAEVQARY-AGTRLGRQELEGVLLEDVEGALWTTAQLEG--LR 272 Query: 309 EAIDDLYAPLIMGCDIA---GEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQEGCP 362 A +++ D A G G D+ +V + DW A ++++ + G P Sbjct: 273 LASPPAMDRVVVALDPAVTGGAGSDECGIVVAGAVTRGPVQDWRAFVLEDASVRGRP 329 >gi|170023468|ref|YP_001719973.1| hypothetical protein YPK_1222 [Yersinia pseudotuberculosis YPIII] gi|169750002|gb|ACA67520.1| conserved hypothetical protein [Yersinia pseudotuberculosis YPIII] Length = 534 Score = 40.5 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 17/72 (23%), Positives = 27/72 (37%), Gaps = 4/72 (5%) Query: 291 QEVNNFIPHNYIEEAMS--REAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDW 348 IP +++ A+ + I D+A EG D R G +++ + W Sbjct: 289 AAEGILIPSEWVQAAIGAHTKLGITPSGARIGALDVADEGIDLNAFSSRTGVLLDRLKAW 348 Query: 349 SAK--LIQETNQ 358 S K I T Q Sbjct: 349 SGKGSDIYATTQ 360 >gi|312149784|gb|ADQ29854.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi N40] Length = 304 Score = 40.5 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 28/157 (17%), Positives = 50/157 (31%), Gaps = 16/157 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDHPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + +K Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIATFKTYNFTTYDNVLLSKGFIETQEKLY-KDIPSYKARVLLGEWIASTDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF 336 + D ++ I D A GD T + Sbjct: 271 ITD--------DYVFTSPIAYLDPAFSVRGDNTALCV 299 >gi|298381518|ref|ZP_06991117.1| phage terminase large subunit [Escherichia coli FVEC1302] gi|298278960|gb|EFI20474.1| phage terminase large subunit [Escherichia coli FVEC1302] Length = 470 Score = 40.5 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 31/275 (11%), Positives = 76/275 (27%), Gaps = 21/275 (7%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GK+ W I ++ A ++ E+ +S R + Sbjct: 21 KGGRGSGKS--------WAI-----ARLLVEAARRQPVRILCARELQNSISDSVIRLLED 67 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASG 202 + + + + + + + + + G + +EA Sbjct: 68 TIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGID-----ICWVEEAEA 122 Query: 203 TPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGID 262 ++ + W+ + L+ + P +D ++ Sbjct: 123 VTKESWDILIPTIRKPFSE-IWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFP 181 Query: 263 SGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS--REAIDDLYAPLIM 320 + + + R LG+ ++E A ++ ++ Sbjct: 182 EVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIFKREWLEAATDAHKKLGWKAKGAVVS 241 Query: 321 GCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKLIQE 355 D + G D R G++++ I + I E Sbjct: 242 AHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINE 276 >gi|33863356|ref|NP_894916.1| UvrD/REP helicase [Prochlorococcus marinus str. MIT 9313] gi|33640805|emb|CAE21260.1| similar to UvrD/REP helicase [Prochlorococcus marinus str. MIT 9313] Length = 576 Score = 40.5 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 31/154 (20%), Positives = 51/154 (33%), Gaps = 30/154 (19%) Query: 83 SAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 S G G GKT+ M+ ++ RPG+ I A + + A V K L +P Sbjct: 158 SGGPGTGKTSTIVQMLARAVTLRPGLKIGLAAPTGKAARRLEEA-VRKGLETIPPPQRQA 216 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGM---AVFNDE 199 + SL P L+ G G H H + + DE Sbjct: 217 LTSL---PCSTLHRWLQARPGGF--------------------GRHQQHPLMLDLLVIDE 253 Query: 200 ASGTPDIINKSILGFFTELNPNRFWIMTSNTRRL 233 S + +++L + +M + +L Sbjct: 254 MSMVELALMQALLNAL---PVDSQLVMIGDPDQL 284 >gi|332185581|ref|ZP_08387329.1| terminase-like family protein [Sphingomonas sp. S17] gi|332014559|gb|EGI56616.1| terminase-like family protein [Sphingomonas sp. S17] Length = 436 Score = 40.5 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 46/262 (17%), Positives = 85/262 (32%), Gaps = 37/262 (14%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWF 141 I AGRG GKT A + L PG I + + ++ + S L++ Sbjct: 60 IRAGRGFGKTRAGAEWVSALARDNPGARIALMGATLRDVERVMVRGESGLLAVARKGEAP 119 Query: 142 EMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVG--PHNT--HGMAVFN 197 + S+G YS P+ G H + + Sbjct: 120 KWIG---------------SLGQVHFTSGAIGFAYSAAAPEALRGPQHHAAWCDELGKWK 164 Query: 198 DEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRT 257 EA G +++ LG + ++T+ R + + + RT Sbjct: 165 GEA-GWDNLMMTLRLG------EHPRVLVTTTPRATP-----LMRKVMALPDCVETIGRT 212 Query: 258 VEG--IDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLY 315 + + F + ++S+YG D+ + R E+ G+ ++ R Sbjct: 213 SDNAHLPDSFQDAMLSQYG-DTRLGRQELDGEMVDDREGALWTRALLDR--QRVKTVPAL 269 Query: 316 APLIMGCD-IAGEGGDKTVVVF 336 +++G D A GD +V Sbjct: 270 DRVVVGVDPPATSSGDACGIVA 291 >gi|319762771|ref|YP_004126708.1| prophage mumc02, terminase, atpase subunit, putative [Alicycliphilus denitrificans BC] gi|317117332|gb|ADU99820.1| prophage MuMc02, terminase, ATPase subunit, putative [Alicycliphilus denitrificans BC] Length = 454 Score = 40.5 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 31/161 (19%), Positives = 51/161 (31%), Gaps = 14/161 (8%) Query: 175 TYSEERPDTFVGPHNTHGMAVFNDEASGTPD--IINKSILGFFTELNPNRFWIMTSNTRR 232 T PDT G V DE + D I K++ ++ P + S Sbjct: 113 TALPANPDTARGFSAN----VLLDEFAFHQDSRAIWKALFPVISK--PGLKLRVISTPNG 166 Query: 233 LNGWFYDIFNIPLEDWKRYQIDTRT-VEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQ 291 FYD+ + W R+ D V E + + D D+ E ++ + Sbjct: 167 KGNKFYDLMTGADDGWSRHTTDIYQAVADGLPRNIEEL-RKGAGDDDLWAQEFELKWLDE 225 Query: 292 EVNNFIPHNYIEEA---MSREAIDDLYAPLIMGCDIAGEGG 329 ++P I + + P +G DIA Sbjct: 226 AS-AWLPFELITACEHEAAGKPEHYQGGPCFVGVDIASRND 265 >gi|301092109|ref|XP_002896227.1| N-acetyltransferase 10 [Phytophthora infestans T30-4] gi|262094857|gb|EEY52909.1| N-acetyltransferase 10 [Phytophthora infestans T30-4] Length = 1102 Score = 40.5 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 57/164 (34%), Gaps = 19/164 (11%) Query: 55 QLEFMEAVDVHCHSNVNNSNPTIFKCAIS--AGRGIGKTTLNAWMMLWLISTRPGMSIIC 112 Q ++ + V + + ++ AGRG GK+ + +I Sbjct: 254 QARTLDQAKAIL-TFVEAVSEKTLRSTVALTAGRGRGKSAALGMSLA-GAVAYGYSNIFV 311 Query: 113 IANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTIT 172 A S LK + V K L ++ + + + + ++ ++ + + Sbjct: 312 TAPSPENLKTV-FEFVFKGFDALKYKEHLDYEIVQSTNPEFNHAVVRVNIFREHR----- 365 Query: 173 CRTYSEERPDTFVGPHNTHGM---AVFNDEASGTPDIINKSILG 213 +T +P H+ V DEA+ P + K++LG Sbjct: 366 -QTIQYIQPT-----HHEKLAQAELVAIDEAAAIPLPVVKNLLG 403 >gi|46949065|gb|AAT07420.1| UL89 DNA packaging protein [Macacine herpesvirus 3] Length = 671 Score = 40.1 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 31/91 (34%), Gaps = 7/91 (7%) Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVG--PHNTHGMAVFNDEASGTPDI 206 + E + + ID K T S ++ G H + DEA + Sbjct: 261 FAKDYVVENKDFVISIDHKGAKSTALFASCYNTNSIRGQNFH-----LLLVDEAHFIKEK 315 Query: 207 INKSILGFFTELNPNRFWIMTSNTRRLNGWF 237 +ILGF + +I ++NT F Sbjct: 316 AFNTILGFLAQNTTKIIFISSTNTTSDATCF 346 >gi|29366753|ref|NP_813693.1| gp33 [Streptomyces phage phiBT1] gi|29243073|emb|CAD80101.1| gp33 [Streptomyces phage phiBT1] Length = 527 Score = 40.1 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 40/310 (12%), Positives = 83/310 (26%), Gaps = 20/310 (6%) Query: 53 RWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPG---MS 109 WQ + + R GK+T+ A +ML+ + G Sbjct: 54 PWQRTLLIDAYELTQDTFGRWRRKHRTVVVCVARKNGKSTIAAAIMLYHLIADRGDAQRQ 113 Query: 110 IICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHY 169 +I AN Q + + +K + + Y + + + D+ Sbjct: 114 VIAAANDRNQARMVF--DSAKQMVNASPKLAAVCNVQRDVIR--YKDNTYRVVSADAGRQ 169 Query: 170 TITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSN 229 F + A + I + + + Sbjct: 170 QGLNPAAVSLDEYAFSKSSDLFDALTLGSAARN--QPMFLIISTAGPDPDGPFAALCEQG 227 Query: 230 TRRLNGW------FYDIFNIPLEDWKRY---QIDTRTVEGIDSGFHEGIISRYGLDSDV- 279 R +G FY + L + + ++ R D + + ++ Sbjct: 228 ERVNSGEADDPTLFYRSWGPKLGETVDHLDPEVWARCNPSYDILNPDDFKAAAQRSTEAS 287 Query: 280 ARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLIMGCDIAGEGGDKTVVVFR-R 338 RI L QF + + A + + ++ G D + +G +V R R Sbjct: 288 FRIYRLSQFVRGASTWLPHGLWDSLAADDDDPLEPGDEVVCGFDGSWKGDSTALVACRVR 347 Query: 339 GNIIEHIFDW 348 + + W Sbjct: 348 DLRVFVLGHW 357 >gi|84687436|ref|ZP_01015314.1| Putative large terminase [Maritimibacter alkaliphilus HTCC2654] gi|84664594|gb|EAQ11080.1| Putative large terminase [Rhodobacterales bacterium HTCC2654] Length = 426 Score = 40.1 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 45/272 (16%), Positives = 79/272 (29%), Gaps = 47/272 (17%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGM---------SIICIANSETQLKNTLWAEVSKWL 132 I GRG GKT A W+ + G + I + Q+++ + Sbjct: 33 ILGGRGAGKTRAGA---EWVRAQVEGPAPLSPGRAGRVALIGETFDQVRDVMV------- 82 Query: 133 SMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG 192 F + E + T ++S P+ GP Sbjct: 83 --------FGDSGIVACAPPDRRPAWEATKRRLVWPNGATATSFSASEPEGLRGPQFD-- 132 Query: 193 MAVFNDEASGTP--DIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKR 250 A + DE + D + L + ++T+ R + I L Sbjct: 133 -AAWADELAKWKKVDDAWDMLQFAL-RLGDHPRQVVTTTPRDVP-----ILRRLLTLSST 185 Query: 251 YQIDTRTV---EGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMS 307 T + F E I +RYG + + R E+ G F +E+ Sbjct: 186 VTTHAPTTANRANLAKSFLEEIEARYG-GTRLGRQELEGVLLDDREGAFWSTAMLEDC-- 242 Query: 308 REAIDDLYAPLIMGCDI---AGEGGDKTVVVF 336 R + +++ D G D+ +V Sbjct: 243 RIDGPPPLSRIVVAVDPPVTGHAGSDECGIVV 274 >gi|154488071|ref|ZP_02029188.1| hypothetical protein BIFADO_01641 [Bifidobacterium adolescentis L2-32] gi|154083544|gb|EDN82589.1| hypothetical protein BIFADO_01641 [Bifidobacterium adolescentis L2-32] Length = 477 Score = 40.1 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 39/231 (16%), Positives = 66/231 (28%), Gaps = 28/231 (12%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISA-GRGIGKTTLNAWMMLWLISTRPGMSI 110 WQ + V + + A+ + R GKT W+ + + PGM I Sbjct: 37 DPWQRQINRIVLA-----KSADGFWSARNAVLSIPRQTGKTYDIGWVAIHRAARTPGMRI 91 Query: 111 ICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYT 170 + A + +K+T S+ EM L G + + G + + Sbjct: 92 VWTAQHFSVIKDTFE-------SLCAIVLRPEMSGLVDPDHG-----ISLAAGKEEIRFR 139 Query: 171 ITCRTYSEERPD-TFVGPHNTHGMAVFNDEASGTPDIINKSILGFFT-ELNPNRFWIMTS 228 R + R G + DEA D S+L NP ++ T Sbjct: 140 NGSRIFFRARERGALRGV--KKIALLVIDEAQHLSDSAMASMLPTQNRAYNPQTIYMGTP 197 Query: 229 N-TRRLNGWFYDIFNIPLEDWKRYQI-----DTRTVEGIDSGFHEGIISRY 273 R F + + + R + +D Y Sbjct: 198 PGPRDNGEAFTRLRDKARAGRTHSTLYVEFAADRDADPLDREQWRKANPSY 248 >gi|308097723|gb|ADO14402.1| AB1gp31 [Acinetobacter phage AB1] Length = 313 Score = 40.1 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 20/104 (19%), Positives = 40/104 (38%), Gaps = 4/104 (3%) Query: 252 QIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVN-NFIPHNYIEEAMS--R 308 I+ + + I + D I P+ + + + I +++E A+ + Sbjct: 21 HINYNENPFLSQTALDVIADKKRRD-PEGFAHIYDGMPRADDDMSIIKASWVEAALDAHK 79 Query: 309 EAIDDLYAPLIMGCDIAGEGGDKTVVVFRRGNIIEHIFDWSAKL 352 D +G D+A G DK +V R+G + +W A+ Sbjct: 80 LLNLDDTGRSYLGFDVADAGKDKCALVHRKGIVAYWSDEWKARE 123 >gi|158425199|ref|YP_001526491.1| phage-related DNA maturase [Azorhizobium caulinodans ORS 571] gi|158332088|dbj|BAF89573.1| phage-related DNA maturase [Azorhizobium caulinodans ORS 571] Length = 569 Score = 40.1 bits (92), Expect = 0.61, Method: Composition-based stats. Identities = 22/114 (19%), Positives = 42/114 (36%), Gaps = 19/114 (16%) Query: 4 LISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEAVD 63 ++ST + L+ F+NF+ W G P P Q + + Sbjct: 1 MVSTKEHLKSSTR-FTDPDPLKADFRNFLYVV--WKHLGLP-----DPTPIQYD----IA 48 Query: 64 VHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSE 117 V+ S F+ G+GK+ + + + WL+ P I+ ++ S+ Sbjct: 49 VYLQHGPKRSIIEAFR-------GVGKSWVTSAFVCWLLYCNPDHKILVVSASK 95 >gi|116196286|ref|XP_001223955.1| hypothetical protein CHGG_04741 [Chaetomium globosum CBS 148.51] gi|88180654|gb|EAQ88122.1| hypothetical protein CHGG_04741 [Chaetomium globosum CBS 148.51] Length = 2013 Score = 40.1 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 53/165 (32%), Gaps = 23/165 (13%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 + G GKT M W RPG ++ IA + ++ V W + L ++ Sbjct: 1169 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAPMKALVRE----RVKDWGARLAKPLGLKL 1224 Query: 144 QSLSLHPSGWYAELLEQSMGI-DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEA-- 200 L+ + + + + I + + R++ + V DE Sbjct: 1225 VELTGDNTPDTRTIQDADIIITTPEKWDGISRSWQTRG-------YVRKVSLVIIDEIHL 1277 Query: 201 -SGTPDIINKSILG-----FFTELNPNRFWIM---TSNTRRLNGW 236 +G I + I+ + N R M +N L W Sbjct: 1278 LAGDRGPILEIIVSRMNYIASSTKNAVRLLGMSTACANATDLGNW 1322 >gi|83943173|ref|ZP_00955633.1| terminase, large subunit, putative [Sulfitobacter sp. EE-36] gi|83846181|gb|EAP84058.1| terminase, large subunit, putative [Sulfitobacter sp. EE-36] Length = 408 Score = 39.7 bits (91), Expect = 0.66, Method: Composition-based stats. Identities = 47/269 (17%), Positives = 83/269 (30%), Gaps = 41/269 (15%) Query: 82 ISAGRGIGKTTLNA-WMMLWLISTRPG-----MSIICIANSETQLKNTLWAEVSKWLSML 135 I GRG GKT A W+ + +RP + + + Q++ + S L+ Sbjct: 16 IMGGRGAGKTRAGAEWVRAQVEGSRPLDAGRCRRVALVGETIEQVREVMIFGDSGILACS 75 Query: 136 P--HRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGM 193 P R +E L ++ P+ GP Sbjct: 76 PADRRPDWEATRKRL-----------------VWPNGAVATVHTAHDPEGLRGPQFD--- 115 Query: 194 AVFNDEAS--GTPDIINKSILGFF-TELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKR 250 A + DE + + + +P + + T R G ++ P Sbjct: 116 AAWVDELAKWKKAEETWDQLQFALRLGEDPR---VCVTTTPRNVGVLKNLLASPSTV-TT 171 Query: 251 YQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREA 310 + + F E + +RY + + R E+ G IE R+ Sbjct: 172 HAPTEANAANLAGSFLEEVRARY-RGTRLGRQELDGVLLADAEGALWTSERIEAGRVRDV 230 Query: 311 IDDLYAPLIMGCDI---AGEGGDKTVVVF 336 L +++G D AG G D+ +V Sbjct: 231 --PLLDRIVVGLDPATTAGAGSDECGIVV 257 >gi|221148414|gb|ACL99813.1| BDRF1 [Human herpesvirus 4] Length = 690 Score = 39.7 bits (91), Expect = 0.67, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 55/149 (36%), Gaps = 15/149 (10%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL 148 GKT + ++ ++S + I +A+ + + + ++ E+ L+ E+ + Sbjct: 231 GKTWIVVAIISLILSNLSNVQIGYVAHQKH-VASAVFTEIIDTLTKSFDSKRVEVNKETS 289 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIIN 208 + ++ + T+ C T + H +F DEA+ Sbjct: 290 TITFRHSGKISS---------TVMCATCFNKNSIRGQTFH-----LLFVDEANFIKKEAL 335 Query: 209 KSILGFFTELNPNRFWIMTSNTRRLNGWF 237 +ILGF + + +I + N+ F Sbjct: 336 PAILGFMLQKDAKIIFISSVNSADQATSF 364 >gi|219873383|ref|YP_002477648.1| phage terminase, large subunit, pbsx family [Borrelia garinii Far04] gi|219694616|gb|ACL35135.1| phage terminase, large subunit, pbsx family [Borrelia garinii Far04] Length = 267 Score = 39.7 bits (91), Expect = 0.67, Method: Composition-based stats. Identities = 35/243 (14%), Positives = 72/243 (29%), Gaps = 38/243 (15%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI----STRP- 106 Q E + ++ H K S G GKT L +++++ + S Sbjct: 46 TAKQKEVLFDIESH----------DYSKVIFSGGIASGKTFLASYLLIKKLIENKSFYEK 95 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDS 166 + I NS L ++ K + + + ++ + I Sbjct: 96 DTNNFIIGNSIGLLMTNTIKQIEK------ICGFLGIDYQKKKSGESFCKIAGLELNIYG 149 Query: 167 KHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIM 226 D+F + ++ +EA+ +++L L + I+ Sbjct: 150 GK-----------NRDSFSKIRGGNSAIIYVNEATVIHK---ETLLEAIKRLRKGKAIII 195 Query: 227 T-SNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEIL 285 +N +F F + +K Y T + F E Y + +L Sbjct: 196 FDTNPESPTHFFKTDFIENKDVFKTYNFTTYDNPLNSADFIETQKKLY-KHLPAYKARVL 254 Query: 286 -GQ 287 G+ Sbjct: 255 YGE 257 >gi|123845631|sp|Q3KSR3|TRM3_EBVG RecName: Full=Tripartite terminase subunit UL15 homolog; AltName: Full=DNA-packaging protein BGRF1/BDRF1; AltName: Full=Terminase large subunit gi|64173286|gb|AAY41136.1| probable DNA packaging protein [Human herpesvirus 4] Length = 690 Score = 39.7 bits (91), Expect = 0.67, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 55/149 (36%), Gaps = 15/149 (10%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL 148 GKT + ++ ++S + I +A+ + + + ++ E+ L+ E+ + Sbjct: 231 GKTWIVVAIISLILSNLSNVQIGYVAHQKH-VASAVFTEIIDTLTKSFDSKRVEVNKETS 289 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIIN 208 + ++ + T+ C T + H +F DEA+ Sbjct: 290 TITFRHSGKISS---------TVMCATCFNKNSIRGQTFH-----LLFVDEANFIKKEAL 335 Query: 209 KSILGFFTELNPNRFWIMTSNTRRLNGWF 237 +ILGF + + +I + N+ F Sbjct: 336 PAILGFMLQKDAKIIFISSVNSADQATSF 364 >gi|82503246|ref|YP_401690.1| BGRF1/BDRF1 [Human herpesvirus 4] gi|139424519|ref|YP_001129485.1| BGRF1/BDRF1 [Human herpesvirus 4 type 2] gi|267408|sp|P03219|TRM3_EBVB9 RecName: Full=Tripartite terminase subunit UL15 homolog; AltName: Full=DNA-packaging protein BGRF1/BDRF1; AltName: Full=Terminase large subunit gi|254784086|sp|P0C744|TRM3_EBVA8 RecName: Full=Tripartite terminase subunit UL15 homolog; AltName: Full=DNA-packaging protein BGRF1/BDRF1; AltName: Full=Terminase large subunit gi|1632798|emb|CAA24834.1| probable DNA packaging protein [Human herpesvirus 4] gi|23893636|emb|CAD53440.1| BGRF1-BDRF1 protein [Human herpesvirus 4] gi|82703995|gb|ABB89264.1| BGRF1/BDRF1 [Human herpesvirus 4] Length = 690 Score = 39.7 bits (91), Expect = 0.67, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 55/149 (36%), Gaps = 15/149 (10%) Query: 89 GKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSL 148 GKT + ++ ++S + I +A+ + + + ++ E+ L+ E+ + Sbjct: 231 GKTWIVVAIISLILSNLSNVQIGYVAHQKH-VASAVFTEIIDTLTKSFDSKRVEVNKETS 289 Query: 149 HPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIIN 208 + ++ + T+ C T + H +F DEA+ Sbjct: 290 TITFRHSGKISS---------TVMCATCFNKNSIRGQTFH-----LLFVDEANFIKKEAL 335 Query: 209 KSILGFFTELNPNRFWIMTSNTRRLNGWF 237 +ILGF + + +I + N+ F Sbjct: 336 PAILGFMLQKDAKIIFISSVNSADQATSF 364 >gi|29826542|ref|NP_821176.1| hypothetical protein SAV_2 [Streptomyces avermitilis MA-4680] gi|29603638|dbj|BAC67711.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 77 Score = 39.7 bits (91), Expect = 0.69, Method: Composition-based stats. Identities = 10/47 (21%), Positives = 18/47 (38%), Gaps = 3/47 (6%) Query: 74 NPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQL 120 P + I + G GKT++ A ++ P I+ + L Sbjct: 2 PPQGARGTIVSATGSGKTSMAAAST---LNCFPEGRILVTVPTLDLL 45 >gi|15618661|ref|NP_224947.1| exodeoxyribonuclease V, Alpha [Chlamydophila pneumoniae CWL029] gi|4377059|gb|AAD18890.1| Exodeoxyribonuclease V, Alpha [Chlamydophila pneumoniae CWL029] Length = 493 Score = 39.7 bits (91), Expect = 0.70, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 70/214 (32%), Gaps = 28/214 (13%) Query: 60 EAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSE-- 117 ++ + + N +S G G GKT L A ++L L+ +P + I ++ + Sbjct: 130 SSILSEEQNFIFNKITQGCFSIVSGGPGTGKTFLAAQLILSLVKQQPKLRIAIVSPTGKA 189 Query: 118 -TQLKNTLWAE-VSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRT 175 + ++ L + + ++ H F + + L+++ + +T Sbjct: 190 TSHIRQILMKYNIFDDMVLMQTVHHFLQEYAYRRYNSIDVLLVDEGSMVTFDLLYSLVQT 249 Query: 176 YSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNT-RRLN 234 G + +S ILG +L P I N + L Sbjct: 250 --------LQGYEKDKKLYT----SSLI-------ILGDTNQLPP--IGIGVGNPLQDLI 288 Query: 235 GWFYDI--FNIPLEDWKRYQIDTRTVEGIDSGFH 266 G+F++ F K +D T + Sbjct: 289 GYFHENTFFLKTSHRAKTGVVDQLTQSVLRGEMI 322 >gi|15836285|ref|NP_300809.1| exodeoxyribonuclease V, alpha [Chlamydophila pneumoniae J138] gi|16752288|ref|NP_445657.1| exodeoxyribonuclease V, alpha subunit, putative [Chlamydophila pneumoniae AR39] gi|33242111|ref|NP_877052.1| exonuclease V alpha-subunit [Chlamydophila pneumoniae TW-183] gi|7190033|gb|AAF38887.1| exodeoxyribonuclease V, alpha subunit, putative [Chlamydophila pneumoniae AR39] gi|8979125|dbj|BAA98960.1| exodeoxyribonuclease V, alpha [Chlamydophila pneumoniae J138] gi|33236621|gb|AAP98709.1| exonuclease V alpha-subunit [Chlamydophila pneumoniae TW-183] Length = 493 Score = 39.7 bits (91), Expect = 0.70, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 70/214 (32%), Gaps = 28/214 (13%) Query: 60 EAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSE-- 117 ++ + + N +S G G GKT L A ++L L+ +P + I ++ + Sbjct: 130 SSILSEEQNFIFNKITQGCFSIVSGGPGTGKTFLAAQLILSLVKQQPKLRIAIVSPTGKA 189 Query: 118 -TQLKNTLWAE-VSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRT 175 + ++ L + + ++ H F + + L+++ + +T Sbjct: 190 TSHIRQILMKYNIFDDMVLMQTVHHFLQEYAYRRYNSIDVLLVDEGSMVTFDLLYSLVQT 249 Query: 176 YSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNT-RRLN 234 G + +S ILG +L P I N + L Sbjct: 250 --------LQGYEKDKKLYT----SSLI-------ILGDTNQLPP--IGIGVGNPLQDLI 288 Query: 235 GWFYDI--FNIPLEDWKRYQIDTRTVEGIDSGFH 266 G+F++ F K +D T + Sbjct: 289 GYFHENTFFLKTSHRAKTGVVDQLTQSVLRGEMI 322 >gi|308178069|ref|YP_003917475.1| type I restriction-modification system restriction subunit [Arthrobacter arilaitensis Re117] gi|307745532|emb|CBT76504.1| type I restriction-modification system restriction subunit [Arthrobacter arilaitensis Re117] Length = 1033 Score = 39.7 bits (91), Expect = 0.70, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 44/146 (30%), Gaps = 14/146 (9%) Query: 66 CHSNVNNSNPTIFKCAISAG------RGIGKTTLNAWMMLWLISTRPGMSI-ICIANSE- 117 H+ + A G +G GK+ W+ W++ T+ + + +E Sbjct: 248 RHNQYFGVQAAQDRIAKREGGIIWHTQGSGKSLTMVWLAKWILETQHDARVLVITDRTEL 307 Query: 118 -TQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHP--SGWYAELLEQSMGIDSKHYTITCR 174 Q+++ K + M + S P + + G K Sbjct: 308 DGQIEDGFSGVGEKIVRTQSGADMLAMLNTSNPPLMCSLVHKFRGTNDGARDKDAEDFAN 367 Query: 175 TYSEERPDTFVGPHNTHGMAVFNDEA 200 + P G + VF DEA Sbjct: 368 ELKTQIP---AGYTAKGNIFVFVDEA 390 >gi|281355726|ref|ZP_06242220.1| exodeoxyribonuclease V, alpha subunit [Victivallis vadensis ATCC BAA-548] gi|281318606|gb|EFB02626.1| exodeoxyribonuclease V, alpha subunit [Victivallis vadensis ATCC BAA-548] Length = 635 Score = 39.7 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 24/158 (15%), Positives = 49/158 (31%), Gaps = 28/158 (17%) Query: 80 CAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANS-ETQLKNTLWAEVSKWLSMLPHR 138 IS G G GKTT+ A ++ + P + + A + + Q + L Sbjct: 187 TVISGGPGTGKTTVVAALLALEFARAPELRVALCAPTGKAQAR----------LGEALRE 236 Query: 139 HWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGM---AV 195 ++ +LE + + T+ + H + + V Sbjct: 237 DGLKI----GTAEAIRRRILELAPSTIDRLIGSAPLTHRTK-------YHAGNPLPFDLV 285 Query: 196 FNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRL 233 DE+S + ++ P I+ + +L Sbjct: 286 IVDESSMVSLPLMARLMQAL---APETRLILLGDPNQL 320 >gi|124022672|ref|YP_001016979.1| exodeoxyribonuclease V 67 kD polypeptide [Prochlorococcus marinus str. MIT 9303] gi|123962958|gb|ABM77714.1| possible exodeoxyribonuclease V 67 kD polypeptide [Prochlorococcus marinus str. MIT 9303] Length = 576 Score = 39.7 bits (91), Expect = 0.75, Method: Composition-based stats. Identities = 36/182 (19%), Positives = 61/182 (33%), Gaps = 42/182 (23%) Query: 55 QLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIA 114 Q +EA+D H +S G G GKT+ M+ ++ RPG+ I A Sbjct: 142 QQAAVEAIDNH------------GVVLLSGGPGTGKTSTIVQMLARAVTLRPGLRIGLAA 189 Query: 115 NSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCR 174 + + A V K L +P + Q+L+ P L+ G Sbjct: 190 PTGKAARRLEEA-VRKGLEAIP---PTQRQALTSLPCSTLHRWLQARPGGF--------- 236 Query: 175 TYSEERPDTFVGPHNTHGM---AVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTR 231 G H H + + DE S + +++L + +M + Sbjct: 237 -----------GRHQQHPLMLDLLVIDEMSMVELSLMQALLSAL---PIDSQLVMIGDPD 282 Query: 232 RL 233 +L Sbjct: 283 QL 284 >gi|124009888|ref|ZP_01694555.1| hypothetical protein M23134_06477 [Microscilla marina ATCC 23134] gi|123984124|gb|EAY24490.1| hypothetical protein M23134_06477 [Microscilla marina ATCC 23134] Length = 539 Score = 39.7 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 46/262 (17%), Positives = 77/262 (29%), Gaps = 32/262 (12%) Query: 84 AGRGIGKTTLNAWMMLWLIST-RPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 AG+G GKT A ++ + + T PG+ AN++ QL ++ V L + E Sbjct: 36 AGQGAGKTH-GAGLISFRLITNFPGVFGFMGANTDMQLTDSTLYRVFLVWKDLGLEEYDE 94 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSE-----ERPDTFVGPHNTHGMAVFN 197 + G G K Y ++ + + Sbjct: 95 YLGQGDYVVGTQPPRHFSREGHAFKSYRNKISFWNGCVVFIGSLENYKAHDGKEFAWAIL 154 Query: 198 DEASGT-PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYD---------------IF 241 DE T + + + ILG + ++ T+ L YD IF Sbjct: 155 DETKDTREEAVQEVILGRLRQQG----LYISEATQALTSEEYDEGSPVVANLPFNPLYIF 210 Query: 242 NIPLED-WKRYQIDTRTVEGIDSGFH----EGIISRYGLDSDVARIEILGQFPQQEVNNF 296 P + W + E + + G DS I + +N+ Sbjct: 211 TSPAKVPWINDWFELSEYEEEIKAKIYNPPQYFKKKVGGDSKFVVISATHLNLKNLPSNY 270 Query: 297 IPHNYIEEAMSREAIDDLYAPL 318 I A R + PL Sbjct: 271 IEKQEANLASHRHGMLIYGDPL 292 >gi|269302541|gb|ACZ32641.1| putative exodeoxyribonuclease V, alpha subunit [Chlamydophila pneumoniae LPCoLN] Length = 493 Score = 39.3 bits (90), Expect = 0.84, Method: Composition-based stats. Identities = 34/214 (15%), Positives = 70/214 (32%), Gaps = 28/214 (13%) Query: 60 EAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSE-- 117 ++ + + N +S G G GKT L A ++L L+ +P + I ++ + Sbjct: 130 SSILSEEQNFIFNKITQGCFSIVSGGPGTGKTFLAAQLILSLVKQQPKLRIAIVSPTGKA 189 Query: 118 -TQLKNTLWAE-VSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRT 175 + ++ L + + ++ H F + + L+++ + +T Sbjct: 190 TSHIRQILMKYNIFDDMVLMQTVHHFLQEYAYRRYNSIDVLLVDEGSMVTFDLLYSLVQT 249 Query: 176 YSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNT-RRLN 234 G + +S ILG +L P I N + L Sbjct: 250 --------LQGYEKDKKLYT----SSLI-------ILGDTNQLPP--IGIGVGNPLQDLI 288 Query: 235 GWFYDI--FNIPLEDWKRYQIDTRTVEGIDSGFH 266 G+F++ F K +D T + Sbjct: 289 GYFHENTFFLKTSHRAKTGAVDQLTQSVLRGEMI 322 >gi|302412431|ref|XP_003004048.1| ATP-dependent DNA helicase MER3 [Verticillium albo-atrum VaMs.102] gi|261356624|gb|EEY19052.1| ATP-dependent DNA helicase MER3 [Verticillium albo-atrum VaMs.102] Length = 709 Score = 39.3 bits (90), Expect = 0.84, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 56/170 (32%), Gaps = 21/170 (12%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 + G GKT M W RPG ++ IA + ++ V W + L ++ Sbjct: 279 SPTGSGKTVAAELAMWWAFKERPGSKVVYIAPMKALVRE----RVKDWGARLAKPLGLKL 334 Query: 144 QSLSLHPSGWYAELLEQSMGI-DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEA-- 200 L+ + + + + I + + R++ + V DE Sbjct: 335 VELTGDNTPDTRTIKDADVIITTPEKWDGISRSWQTRG-------YVRQVSLVIIDEIHL 387 Query: 201 -SGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFN-IPLEDW 248 +G I + I+ N T N+ RL G N L +W Sbjct: 388 LAGDRGPILEIIVSRM-----NYIAASTKNSVRLLGMSTACANASDLGNW 432 >gi|290960848|ref|YP_003492030.1| phage terminase large subunit [Streptomyces scabiei 87.22] gi|260650374|emb|CBG73490.1| phage terminase (large subunit) [Streptomyces scabiei 87.22] Length = 598 Score = 39.3 bits (90), Expect = 0.86, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 62/203 (30%), Gaps = 29/203 (14%) Query: 51 PHRWQLEFMEAVDVHCHSNVNNSNPTIFKCA---ISAGRGIGKTTLNAWMMLWLIS--TR 105 P WQ+ ++ A +++ + + R GK+TL+ + ++L Sbjct: 101 PDPWQVAWIIAPVFGWVRFDADADMYVRIITDLYVDVPRKNGKSTLSGGLAIYLTCADGE 160 Query: 106 PGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGID 165 PG +I A ++ Q ++ + + P +L H + +++ G Sbjct: 161 PGAQVIAAATTKQQ-AGYVFTPIRQLAERAP--------ALKGHVKPYRGKIIHPKSGSY 211 Query: 166 SKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIIN-KSILGFFTELNPNRFW 224 + H + DE D + I T R Sbjct: 212 FEVIASVADA-----------QHGANLHGAVIDELHVHKDPEMVEVIE---TGTGSRRQP 257 Query: 225 IMTSNTRRLNGWFYDIFNIPLED 247 ++ T +G I+N Sbjct: 258 LIVIITTADSGKPETIYNRKRTR 280 >gi|168029927|ref|XP_001767476.1| predicted protein [Physcomitrella patens subsp. patens] gi|162681372|gb|EDQ67800.1| predicted protein [Physcomitrella patens subsp. patens] Length = 1075 Score = 39.3 bits (90), Expect = 0.86, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 47/134 (35%), Gaps = 10/134 (7%) Query: 80 CAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRH 139 A++A RG GK+ + +I A S LK L+ + K + ++ Sbjct: 279 VALTAARGRGKSAALGVAIA-GAVAFGYSNIFVTAPSPENLKT-LFEFIFKGFDAMEYKE 336 Query: 140 WFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 + + S + ++ + I +H ++ + DE Sbjct: 337 HIDYDLVESTNSAFNKAIV--RVNIFRQHRQTIQYIQPKDHEKLAQAE------LLVIDE 388 Query: 200 ASGTPDIINKSILG 213 A+ P I K++LG Sbjct: 389 AAAIPLPIVKALLG 402 >gi|310792137|gb|EFQ27664.1| Sec63 Brl domain-containing protein [Glomerella graminicola M1.001] Length = 1974 Score = 39.3 bits (90), Expect = 0.92, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 44/132 (33%), Gaps = 15/132 (11%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 + G GKT M W RPG ++ IA + ++ V W + L ++ Sbjct: 1153 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAPMKALVRE----RVKDWGARLARPLGLKL 1208 Query: 144 QSLSLHPSGWYAELLEQSMGI-DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEA-- 200 L+ + + + + I + + R++ + V DE Sbjct: 1209 VELTGDNTPDTRTIKDADIIITTPEKWDGISRSWQTRG-------YVRQVSLVIIDEIHL 1261 Query: 201 -SGTPDIINKSI 211 +G I + I Sbjct: 1262 LAGDRGPILEII 1273 >gi|154500994|ref|ZP_02039032.1| hypothetical protein BACCAP_04681 [Bacteroides capillosus ATCC 29799] gi|150270018|gb|EDM97537.1| hypothetical protein BACCAP_04681 [Bacteroides capillosus ATCC 29799] Length = 726 Score = 39.3 bits (90), Expect = 0.92, Method: Composition-based stats. Identities = 15/73 (20%), Positives = 26/73 (35%), Gaps = 1/73 (1%) Query: 46 EHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTR 105 + + R Q E M + + T C IS G G GK ++ ++ Sbjct: 313 DLDDEIDRQQAE-MGFTFAQEQRHAIRTALTSPICIISGGPGTGKASIQRAILNIYKKVF 371 Query: 106 PGMSIICIANSET 118 P ++C A + Sbjct: 372 PDSDVVCCAPTGR 384 >gi|254425155|ref|ZP_05038873.1| cyclic peptide transporter subfamily [Synechococcus sp. PCC 7335] gi|196192644|gb|EDX87608.1| cyclic peptide transporter subfamily [Synechococcus sp. PCC 7335] Length = 546 Score = 39.3 bits (90), Expect = 0.94, Method: Composition-based stats. Identities = 23/141 (16%), Positives = 49/141 (34%), Gaps = 14/141 (9%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANS---------ETQLKNTLWAEVSKWL 132 I G G GK+TL + I P + + N QL +T++++ + Sbjct: 360 IVGGNGSGKSTLAKLITSLYI---PDSGQLILDNEPITDINREWYRQLFSTVFSDYYLFE 416 Query: 133 SMLPHRHWFEMQSLSLHPSGWYAEL--LEQSMGIDSKHYTITCRTYSEERPDTFVGPHNT 190 ++ + + Y E LE+ + I + + T + + + + + Sbjct: 417 RLVSTEETSLEEVTPRSTAQNYLEKLQLEEKVSIQNGQLSTTALSQGQRKRLALLAAYLE 476 Query: 191 HGMAVFNDEASGTPDIINKSI 211 DE + D + + I Sbjct: 477 DRSLYLFDEWAADQDPVFREI 497 >gi|85702762|ref|ZP_01033866.1| Putative large terminase [Roseovarius sp. 217] gi|85671690|gb|EAQ26547.1| Putative large terminase [Roseovarius sp. 217] Length = 419 Score = 39.3 bits (90), Expect = 0.97, Method: Composition-based stats. Identities = 47/285 (16%), Positives = 86/285 (30%), Gaps = 43/285 (15%) Query: 82 ISAGRGIGKTTLNA-WMMLWLISTRPG-----MSIICIANSETQLKNT-LWAEVSKWLSM 134 I GRG GKT A W+ + +RP I + + Q++ ++ E S ++ Sbjct: 27 IMGGRGAGKTRAGAEWVRAQVEGSRPLDEGRCKRIALVGETIDQVREVMVFGE-SGIMAC 85 Query: 135 LPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMA 194 P + Q+ + YS P+ GP Sbjct: 86 SPPDRRPDWQATR---------------KRLIWPNGAVAQAYSAHDPEALRGPQFDGA-- 128 Query: 195 VFNDEASGT--PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQ 252 + DE + + + R + T T R G DI +P Sbjct: 129 -WVDELAKWKRARETWDMLQFGLRLGDAPRVCVTT--TPRNVGVLKDIVAVP----STVV 181 Query: 253 IDTRTVEG---IDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSRE 309 T + F + + +RY + + R E+ G + + +E A R Sbjct: 182 TSAPTEANRAYLAESFLDEVRARY-AGTRLGRQELDGLLIDEAEDALWTPAMLEAA--RV 238 Query: 310 AIDDLYAPLIMGCDI---AGEGGDKTVVVFRRGNIIEHIFDWSAK 351 + +++ D G D+ ++ + DW Sbjct: 239 ESLPEFDRVVVAVDPPVTGHAGSDECGIIMAGAITRGPVQDWRVW 283 >gi|225683146|gb|EEH21430.1| activating signal cointegrator 1 complex subunit 3 [Paracoccidioides brasiliensis Pb03] Length = 2011 Score = 39.3 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 38/117 (32%), Gaps = 12/117 (10%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 + G GKT M W RPG ++ IA + ++ V W L ++ Sbjct: 1163 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAPMKALVRE----RVHDWKRRLTVPMGLKL 1218 Query: 144 QSLSLHPSGWYAELLEQSMGI-DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 L+ + + + + I + + R++ + V DE Sbjct: 1219 VELTGDNTPDTKTIRDSDIIITTPEKWDGISRSWQTRG-------YVRQVSLVIIDE 1268 >gi|296129166|ref|YP_003636416.1| type III restriction protein res subunit [Cellulomonas flavigena DSM 20109] gi|296020981|gb|ADG74217.1| type III restriction protein res subunit [Cellulomonas flavigena DSM 20109] Length = 601 Score = 39.3 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 55/202 (27%), Gaps = 47/202 (23%) Query: 37 PWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAW 96 PWG G WQ E +E P F A++ G GKTT Sbjct: 34 PWGAAGSL-------RAWQAEAIELYRQ--------RGPRDF-LAVATP-GAGKTTFALR 76 Query: 97 MMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAE 156 + L+ + + +A +E + K + R + + G + Sbjct: 77 IATELLEAKVVRRVTVVAPTEH---------LKKQWADAAARVGIRLDPRFSNAQGRHGA 127 Query: 157 LLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE------ASGTPDIINKS 210 + ++ + + + V DE A D + ++ Sbjct: 128 GYDGVAVTYAQVASKPALHAARTTAER---------TLVILDEVHHGGDALSWGDAVREA 178 Query: 211 ILGFFTELNPNRFWIMTSNTRR 232 G R +T R Sbjct: 179 FEGA------TRRLALTGTPFR 194 >gi|226288385|gb|EEH43897.1| activating signal cointegrator 1 complex subunit 3 [Paracoccidioides brasiliensis Pb18] Length = 2011 Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 38/117 (32%), Gaps = 12/117 (10%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 + G GKT M W RPG ++ IA + ++ V W L ++ Sbjct: 1163 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAPMKALVRE----RVHDWKRRLTVPMGLKL 1218 Query: 144 QSLSLHPSGWYAELLEQSMGI-DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 L+ + + + + I + + R++ + V DE Sbjct: 1219 VELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQTRG-------YVRQVSLVIIDE 1268 >gi|295672069|ref|XP_002796581.1| activating signal cointegrator 1 complex subunit 3 [Paracoccidioides brasiliensis Pb01] gi|226283561|gb|EEH39127.1| activating signal cointegrator 1 complex subunit 3 [Paracoccidioides brasiliensis Pb01] Length = 2012 Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 38/117 (32%), Gaps = 12/117 (10%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 + G GKT M W RPG ++ IA + ++ V W L ++ Sbjct: 1163 SPTGSGKTVAAELAMWWAFRERPGSKVVYIAPMKALVRE----RVHDWKRRLTVPMGLKL 1218 Query: 144 QSLSLHPSGWYAELLEQSMGI-DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 L+ + + + + I + + R++ + V DE Sbjct: 1219 VELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQTRG-------YVRQVSLVIIDE 1268 >gi|291517493|emb|CBK71109.1| Phage terminase large subunit [Bifidobacterium longum subsp. longum F8] Length = 477 Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 65/230 (28%), Gaps = 26/230 (11%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSII 111 WQ + + ++ T+ R GKT W+ + + PGM I+ Sbjct: 37 DVWQRQINRIILAKSADGFWSARNTVLSI----PRQTGKTYDIGWVAIHRAARTPGMRIV 92 Query: 112 CIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTI 171 A + +K+T S+ EM L G + + G + + Sbjct: 93 WTAQHFSVIKDTFE-------SLCAIVLRPEMSGLVDPDHG-----ISLAAGKEEIRFRN 140 Query: 172 TCRTYSEERPD-TFVGPHNTHGMAVFNDEASGTPDIINKSILGFFT-ELNPNRFWIMTSN 229 R + R G + DEA D S+L NP ++ T Sbjct: 141 GSRIFFRARERGALRGV--KKIALLVIDEAQHLSDSAMASMLPTQNRAYNPQTIYMGTPP 198 Query: 230 -TRRLNGWFYDIFNIPLEDWKRYQI-----DTRTVEGIDSGFHEGIISRY 273 R F + + + R + +D Y Sbjct: 199 GPRDNGEAFTRLRDKTRAGRTHSTLYVEFAADRDADPLDREQWRKANPSY 248 >gi|108797804|ref|YP_638001.1| hypothetical protein Mmcs_0827 [Mycobacterium sp. MCS] gi|119866897|ref|YP_936849.1| hypothetical protein Mkms_0844 [Mycobacterium sp. KMS] gi|108768223|gb|ABG06945.1| conserved hypothetical protein [Mycobacterium sp. MCS] gi|119692986|gb|ABL90059.1| conserved hypothetical protein [Mycobacterium sp. KMS] Length = 563 Score = 38.9 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 53/189 (28%), Gaps = 17/189 (8%) Query: 56 LEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIAN 115 + +A+ H + + ++ + + + G G GKT L A ++ R G + + Sbjct: 212 EDAADALTEH-QAIILDAIRLLNRVEVRGGAGSGKTFL-AMEQARRLAQR-GQRVALVCY 268 Query: 116 SETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRT 175 S L + L W + E +L + + + Sbjct: 269 SHG-LASYLERICETWPRRQQPAYVGEFHALGVQWGAPEGPDEALRTEETVRFWEHDLPL 327 Query: 176 YSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFF-----------TELNPNRFW 224 + + H + DEA D +L T+ F Sbjct: 328 HMADLAAQLEPGHRFDS--IVVDEAQDFADAWWDPLLAALRDPVDGGLYVFTDEGQRVFN 385 Query: 225 IMTSNTRRL 233 + S L Sbjct: 386 RVGSPPVPL 394 >gi|273810556|ref|YP_003344937.1| gp2 [Sodalis phage SO-1] gi|258619841|gb|ACV84094.1| gp2 [Sodalis phage SO-1] Length = 461 Score = 38.9 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 44/246 (17%), Positives = 82/246 (33%), Gaps = 29/246 (11%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 +G G GK+ + A ++ L++ PG I + L ++ E+ K R F Sbjct: 58 SGFGGGKSWVAARKVIQLLTLNPGHDGIVTEPTIPLLVKIMYPELEKAFDEAGFRWKFNK 117 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 Q S+ + K + C S E +G + + DE T Sbjct: 118 QDKIY------------SVLVKGKWTRVICE--SMENYTRLIGVNAA---WIVADEFDTT 160 Query: 204 PDIINKSILGFFTE---LNPNRFWIMTSNTRRLNGW--FYDIFNIPLE-DWKRYQIDTRT 257 + + R +++ S G+ Y IF + + + + T Sbjct: 161 KQDVALAAYHKLLGRLRAGFVRQFVIVSTP---EGYRAMYQIFEVEKDSQKRLIRAKTTD 217 Query: 258 VEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAP 317 + + F + + S+Y +++ + G F EE S E + Sbjct: 218 NHHLPADFIDTLRSQY--PANLIDAYLNGLFVNLTSGAVYKMFNREENASTEEVQ-PEDT 274 Query: 318 LIMGCD 323 LI+G D Sbjct: 275 LIIGMD 280 >gi|295688413|ref|YP_003592106.1| hypothetical protein Cseg_0983 [Caulobacter segnis ATCC 21756] gi|295430316|gb|ADG09488.1| protein of unknown function DUF264 [Caulobacter segnis ATCC 21756] Length = 445 Score = 38.9 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 51/273 (18%), Positives = 83/273 (30%), Gaps = 40/273 (14%) Query: 84 AGRGIGKTTLNAWMMLWLIS-TRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFE 142 GRG GKT A WLI G + + + ++ + Sbjct: 77 GGRGAGKTYAGAA---WLIEQATAGARLALVGPTFHDVREVM------------IEGPSG 121 Query: 143 MQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVG--PHNTHGMAVFNDEA 200 +++LSL E + + T +S E PD+ G H + DE Sbjct: 122 LKALSLPDEHPRWEASRRRL---VWPNGATAYAFSAEDPDSLRGPQFHAA-----WADEF 173 Query: 201 SGTPDIINKSIL---GFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRT 257 P + + G +P T R + Sbjct: 174 CAWPKPGDTLAMLRFGLRLGADPRLVVTTTPKPHRALKVLM----AEPGVSLTRAGTSAN 229 Query: 258 VEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAP 317 + F + S YG + +A E+ G + + F + +R A D Sbjct: 230 AGNLAPAFLRTLESLYG-GTRLAAQELDGVIVETDGGLFRAEDLARCRAARPARLDR--- 285 Query: 318 LIMGCD-IAGEGGDKT--VVVFRRGNIIEHIFD 347 +++ D A GGD VVV RR + + D Sbjct: 286 VVVAVDPPATAGGDACGIVVVGRRDDRAFVLAD 318 >gi|148241989|ref|YP_001227146.1| hypothetical protein SynRCC307_0890 [Synechococcus sp. RCC307] gi|147850299|emb|CAK27793.1| Hypothetical protein SynRCC307_0890 [Synechococcus sp. RCC307] Length = 98 Score = 38.9 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 4/50 (8%) Query: 82 ISAGRGIGKTTL--NAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVS 129 + +GR GKT L A + L T+PG + +A S Q K+ WA++ Sbjct: 25 VFSGRRFGKTRLMLTAGVEL--CLTKPGAKVFHLAPSRKQAKDIAWADLK 72 >gi|145225752|ref|YP_001136430.1| hypothetical protein Mflv_5176 [Mycobacterium gilvum PYR-GCK] gi|145218238|gb|ABP47642.1| conserved hypothetical protein [Mycobacterium gilvum PYR-GCK] Length = 551 Score = 38.9 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 50/181 (27%), Gaps = 5/181 (2%) Query: 57 EFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANS 116 E V S + ++ + + I G G GKT L L ++++C + Sbjct: 199 EDAADVLTEQQSVILDAIKLLHRVEIRGGAGSGKTFLAMEQARRLARAGRRVALVCYS-- 256 Query: 117 ETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTY 176 L + L + W + E L K + + Sbjct: 257 -HGLASYLERITATWNRRHRPAYVGEFHDLGKQWGAPAGPDESVRNDETVKFWEHDLPSQ 315 Query: 177 SEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGW 236 H A+ DEA D +L + ++ + +R+ Sbjct: 316 MTRLATQLDPGHRFD--AIVVDEAQDFADAWWDPLLAALKDDETGGLYLFSDEGQRVFDR 373 Query: 237 F 237 F Sbjct: 374 F 374 >gi|124005744|ref|ZP_01690583.1| hypothetical protein M23134_03970 [Microscilla marina ATCC 23134] gi|123988812|gb|EAY28418.1| hypothetical protein M23134_03970 [Microscilla marina ATCC 23134] Length = 535 Score = 38.9 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 28/142 (19%), Positives = 45/142 (31%), Gaps = 6/142 (4%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 AG+G GKT + LIS PG+ AN++ QL ++ V L + E Sbjct: 33 AGQGAGKTHGAGLISFRLISNFPGVFGFMGANTDMQLTDSTLYRVFLVWKDLGLEEYDEH 92 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSE-----ERPDTFVGPHNTHGMAVFND 198 + G G K Y ++ + + D Sbjct: 93 TGEGDYVVGTQPPRHFSREGHAFKSYRNKISFWNGCVVFIGSLENYKAHDGKEFAWAILD 152 Query: 199 EASGT-PDIINKSILGFFTELN 219 E T + + + ILG + Sbjct: 153 ETKDTREEAVQEVILGRLRQQG 174 >gi|171690334|ref|XP_001910092.1| hypothetical protein [Podospora anserina S mat+] gi|170945115|emb|CAP71226.1| unnamed protein product [Podospora anserina S mat+] Length = 1993 Score = 38.9 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 50/165 (30%), Gaps = 23/165 (13%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 + G GKT M W PG ++ IA + ++ V W L + Sbjct: 1164 SPTGSGKTVAAELAMWWAFREHPGSKVVYIAPMKALVRE----RVKDWGDRLAKPLGLRL 1219 Query: 144 QSLSLHPSGWYAELLEQSMGI-DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEA-- 200 L+ + + + + I + + R++ + V DE Sbjct: 1220 VELTGDNTPDTRTIQDADIIITTPEKWDGISRSWQTRG-------YVRKVSLVVIDEIHL 1272 Query: 201 -SGTPDIINKSILG-----FFTELNPNRFWIM---TSNTRRLNGW 236 +G I + I+ + N R M +N L W Sbjct: 1273 LAGDRGPILEIIVSRMNYIAASTKNAVRLLGMSTACANATDLGNW 1317 >gi|23335598|ref|ZP_00120832.1| hypothetical protein Blon03000707 [Bifidobacterium longum DJO10A] gi|189440021|ref|YP_001955102.1| phage terminase large subunit [Bifidobacterium longum DJO10A] gi|189428456|gb|ACD98604.1| Phage terminase large subunit [Bifidobacterium longum DJO10A] Length = 477 Score = 38.9 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 38/230 (16%), Positives = 65/230 (28%), Gaps = 26/230 (11%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSII 111 WQ + + ++ T+ R GKT W+ + + PGM I+ Sbjct: 37 DVWQRQINRIILAKSADGFWSARNTVLSI----PRQTGKTYDIGWVAIHRAARTPGMRIV 92 Query: 112 CIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTI 171 A + +K+T S+ EM L G + + G + + Sbjct: 93 WTAQHFSVIKDTFE-------SLCAIVLRPEMSGLVDPDHG-----ISLAAGKEEIRFRN 140 Query: 172 TCRTYSEERPD-TFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTEL-NPNRFWIMTSN 229 R + R G + DEA D S+L NP ++ T Sbjct: 141 GSRIFFRARERGALRGV--KKIALLVIDEAQHLSDSAMASMLPTQNRAWNPQTIYMGTPP 198 Query: 230 -TRRLNGWFYDIFNIPLEDWKRYQI-----DTRTVEGIDSGFHEGIISRY 273 R F + + + R + +D Y Sbjct: 199 GPRDNGEAFTRLRDKARAGRTHSTLYVEFTADRDADPLDRQQWRKANPSY 248 >gi|117926000|ref|YP_866617.1| helicase domain-containing protein [Magnetococcus sp. MC-1] gi|117609756|gb|ABK45211.1| helicase domain protein [Magnetococcus sp. MC-1] Length = 1170 Score = 38.9 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 36/249 (14%), Positives = 69/249 (27%), Gaps = 53/249 (21%) Query: 37 PWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCA--------------- 81 PWG + ++++ D + +N P + + Sbjct: 67 PWGFDAPGPDFKLGVEAFRIQLAHLFDPMMAVHTSNVEPLPHQISAVYESMLPRQPLRYV 126 Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWF 141 ++ G GKT + ++ L+ I+ +A + V +W + + Sbjct: 127 LADDPGAGKTIMAGLLIRELLMRSDAKRILIVAPG---------SLVEQWQDEMYEKFGV 177 Query: 142 EMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGP--HNTHGMAVFNDE 199 E S + + S +H + R R D F H+ + + DE Sbjct: 178 EFTVFSRE-----LDQVSLSGNAFDEHDRLIARLDQLSRNDEFQEKLAHSEWDL-IIVDE 231 Query: 200 -----ASGTPDIINKS---ILGFFTELNPNRFWIMTSNTRR-------------LNGWFY 238 AS + ++ LG F +MT+ FY Sbjct: 232 AHKMSASYYGQKVKETKRFKLGKLLGSVSRHFLLMTATPHNGKETDFQLFLSLLDGDRFY 291 Query: 239 DIFNIPLED 247 F Sbjct: 292 GKFREGAHR 300 >gi|259418958|ref|ZP_05742875.1| phage DNA Packaging Protein [Silicibacter sp. TrichCH4B] gi|259345180|gb|EEW57034.1| phage DNA Packaging Protein [Silicibacter sp. TrichCH4B] Length = 478 Score = 38.9 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 50/298 (16%), Positives = 90/298 (30%), Gaps = 42/298 (14%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGM---------SIICIANSETQLKNTLWAEVSKWL 132 I GRG GKT A W+ S G + + + Q+++ + S L Sbjct: 86 ILGGRGAGKTRAGA---EWVRSEVEGAEPFGIGRARRMALVGETYDQVRDVMIHGDSGIL 142 Query: 133 SMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHG 192 + P E ++ T + +S P+ GP Sbjct: 143 ACSPPDRRPEWRA---------------GERRLVWPNGATAQAFSASDPEALRGPQFD-- 185 Query: 193 MAVFNDEASGT--PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKR 250 A + DE + + R + T+ R + P Sbjct: 186 -AAWVDELAKWRRAQDAWDMLQFALRLGAAPRVCVTTTP--RNVPLLKQLLESPSTV-TT 241 Query: 251 YQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREA 310 + + GF + +RYG S +AR E+ G +E+ R+ Sbjct: 242 HAPTEANRANLAPGFLTEVRARYG-GSRLARQELDGVMLADVDGALWTSGMLEQLQRRDR 300 Query: 311 IDDLYAPLIMGCDI---AGEGGDKTVVVFRRGNIIEHIFDWSAKLIQETNQEGC-PVG 364 +++ D A +G D ++ I +W A ++ + +G P G Sbjct: 301 --PPLDRIVVAVDPSVSAHKGSDACGIIVAGAQTQGPISEWRAYVLADHTVQGLGPTG 356 >gi|126433456|ref|YP_001069147.1| hypothetical protein Mjls_0847 [Mycobacterium sp. JLS] gi|126233256|gb|ABN96656.1| conserved hypothetical protein [Mycobacterium sp. JLS] Length = 549 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 47/162 (29%), Gaps = 6/162 (3%) Query: 56 LEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIAN 115 + +A+ H + + ++ + + + G G GKT L A ++ R G + + Sbjct: 198 EDASDALTEH-QAVILDAIRQLNRVEVRGGAGSGKTFL-AMEQARRLAQR-GQRVALVCY 254 Query: 116 SETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRT 175 S L + L W + E +L + + + Sbjct: 255 SHG-LASYLERIAETWPRRQQPAYVGEFHALGVQWGAPEGPDEAVRTEETVRFWEHDLPL 313 Query: 176 YSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTE 217 + H + DEA D +L + Sbjct: 314 QMADLAAQLEPGHRFDS--IVVDEAQDFADAWWDPLLAALRD 353 >gi|213402789|ref|XP_002172167.1| antiviral helicase SLH1 [Schizosaccharomyces japonicus yFS275] gi|212000214|gb|EEB05874.1| antiviral helicase SLH1 [Schizosaccharomyces japonicus yFS275] Length = 1949 Score = 38.6 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 27/154 (17%), Positives = 45/154 (29%), Gaps = 20/154 (12%) Query: 55 QLEFMEAVDVHCHSNVNNSNPTIFKCA--------ISAGRGIGKTTLNAWMMLWLISTRP 106 Q +E + S N F I A G GKT W P Sbjct: 1125 QNPVLEEICAKRFSFFNAVQSQFFHTVYHTPTNVFIGAPTGSGKTMAAELATWWAFREHP 1184 Query: 107 GMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGI-D 165 G ++ IA + +K + W + L M L+ S ++ + I Sbjct: 1185 GSKVVYIAPMKALVKE----RLKDWGARLVEPMHINMIELTGDTSPDSKTIMGADIIITT 1240 Query: 166 SKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 + + R + + + + V DE Sbjct: 1241 PEKWDGITRNWRTRK-------YVQNVSLVIIDE 1267 >gi|171681273|ref|XP_001905580.1| hypothetical protein [Podospora anserina S mat+] gi|170940595|emb|CAP65823.1| unnamed protein product [Podospora anserina S mat+] Length = 1721 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 26/125 (20%), Positives = 42/125 (33%), Gaps = 10/125 (8%) Query: 87 GIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSL 146 G GKT ++L L S P I+ A + + N L +S + P R E++ + Sbjct: 1332 GTGKTETILSIILSLQSHFPDSRILLTAPTHNAVDNVLRRYLSLNPTHPPLRISTEIRKV 1391 Query: 147 SLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFV-----G-----PHNTHGMAVF 196 S + + + + T + + V G N V Sbjct: 1392 SPDVTPYTLDAMAGIELNTLHSRAETTKAKKRVKAAKIVFSTCIGSSLGLLRNEMFDIVI 1451 Query: 197 NDEAS 201 DEAS Sbjct: 1452 IDEAS 1456 >gi|85709622|ref|ZP_01040687.1| Phage DNA Packaging Protein [Erythrobacter sp. NAP1] gi|85688332|gb|EAQ28336.1| Phage DNA Packaging Protein [Erythrobacter sp. NAP1] Length = 441 Score = 38.6 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 81/251 (32%), Gaps = 30/251 (11%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWF 141 I AGRG GKT A + + + I +++S + + + S L+ P Sbjct: 55 IMAGRGFGKTRAGAEWVRSIAESHSEARIALVSSSLAEARAVMVEGESGLLACSP----- 109 Query: 142 EMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEA- 200 E S+ YS P+ GP +H + DE Sbjct: 110 ----------PDRRPEFEPSLRRVRFPNGAEAHLYSAGEPEALRGPQFSHA---WCDEVG 156 Query: 201 ------SGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQID 254 S + ++G +P +T+ R + + + + Sbjct: 157 KWPISHSRATRAWDNLLMGLRLGDDPRIA--VTTTPRAVPLVQRLLKQETSQATAVTRGS 214 Query: 255 TRT-VEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDD 313 T + + F E I + S + R EI G+ + + +E++ EA Sbjct: 215 TYDNSANLPARFLEAIADEF-AGSQLGRQEIEGELIEDIEGALWSRSLLEQS-KEEAGPP 272 Query: 314 LYAPLIMGCDI 324 + +++G D Sbjct: 273 GFRRIVIGVDP 283 >gi|315654463|ref|ZP_07907371.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 51333] gi|315491498|gb|EFU81115.1| conserved hypothetical protein [Mobiluncus curtisii ATCC 51333] Length = 424 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 51/160 (31%), Gaps = 2/160 (1%) Query: 60 EAVDVHCHSNVNNSNPTIFKCAISAGRGIGKT-TLNAWMMLWLISTRPGMSIICIANSET 118 + + + + P + AI +G+GKT T W+ L P M + A++ Sbjct: 11 DYLPRYLDEELRELFPQLPAIAIDGAKGVGKTETAQRWVEHVLALDNPEMGQLIAADTVN 70 Query: 119 QLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSE 178 QL + +W P P+ + + H Sbjct: 71 QLTKYATTCIDEWQKYPPVWDAVRRLVDQQTPNRFLLTGSATPVSGVDTHSGAGRIASLR 130 Query: 179 ERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTEL 218 RP + +T + SG +I ++I G T+ Sbjct: 131 LRPLSLAERPSTSPRVFISRLFSGDAEISGETIFG-LTDY 169 >gi|67611038|ref|XP_667129.1| hypothetical protein [Cryptosporidium hominis TU502] gi|54658236|gb|EAL36904.1| hypothetical protein Chro.80234 [Cryptosporidium hominis] Length = 991 Score = 38.6 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 73/237 (30%), Gaps = 50/237 (21%) Query: 40 IKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMML 99 GK + + + Q ++ D+ N+N +++AGRG GK+ + L Sbjct: 263 EIGKVISNCITFDQAQT-VLKMADIIIQKNMNAI------ISLTAGRGRGKSAALG-LSL 314 Query: 100 WLISTRPGMSIICIANSETQLKNTL-WAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELL 158 ++ +I A S + + E+ FE+ S + Sbjct: 315 ACAVSQGFSNIFITAPSAENVLTVFEFIEIGLQSLGYLEHKHFELVRSKTIDSRVGGDFS 374 Query: 159 EQSMGIDSKHYTITCR-TYSEERPDTFVGPHNTHGMA-----VFNDEASGTPDIINKSIL 212 + + R T +P+ + + V DEA+ P I K L Sbjct: 375 HSVSRLIRVNIFKDHRQTIQYIKPE-------DYHLVSQAEIVVMDEAAAIPLPIVKKFL 427 Query: 213 G------------------FFT----------ELNPNRFWIMTSNTRRLNGWFYDIF 241 G + LN N ++SNT + +F + F Sbjct: 428 GNHLFIFSSTINGYEGTGRALSLKLINDLKKKSLNNNGNLPISSNTDNQSDYFVNSF 484 >gi|206895210|ref|YP_002247305.1| cobalt import ATP-binding protein CbiO 2 [Coprothermobacter proteolyticus DSM 5265] gi|206737827|gb|ACI16905.1| cobalt import ATP-binding protein CbiO 2 [Coprothermobacter proteolyticus DSM 5265] Length = 271 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 31/156 (19%), Positives = 58/156 (37%), Gaps = 12/156 (7%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSI----ICIANSETQLKNTLW---AEVSKWLSMLP 136 G G GKTTL + L+ T + I I A + Q+K + E S++ Sbjct: 34 GGNGAGKTTLARVIKGLLLPTSGKVLIDGMEISTAGRDYQIKVGIVFQNPENQIVASVVE 93 Query: 137 HRHWFEMQSLSLHPSGW--YAELLEQSMGI-DSKHYTITCRTYSEERPDTFVGPHNTHGM 193 F ++L L P E +++G+ D +H + + +++ G Sbjct: 94 EDVAFGPENLGLSPREIKERVESSLKTVGLWDLRHRPVHALSGGQKQRLAIAGILALRPS 153 Query: 194 AVFNDEASGTPDII--NKSILGFFTELNPNRFWIMT 227 + DEA+ D + + + + N +T Sbjct: 154 YILFDEATALLDPVGRREVLETALSLANSVGVLWIT 189 >gi|149203834|ref|ZP_01880803.1| Putative large terminase [Roseovarius sp. TM1035] gi|149142951|gb|EDM30993.1| Putative large terminase [Roseovarius sp. TM1035] Length = 419 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 47/283 (16%), Positives = 83/283 (29%), Gaps = 45/283 (15%) Query: 82 ISAGRGIGKTTLNAWMMLWLISTRPGM---------SIICIANSETQLKNT-LWAEVSKW 131 I GRG GKT A W+ S G I + + Q++ ++ E S Sbjct: 27 IMGGRGAGKTRAGA---EWVRSEVEGARPMDSGRCKRIALVGETIDQVREVMIFGE-SGI 82 Query: 132 LSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTH 191 L+ P E Q+ + +S P+ GP Sbjct: 83 LACSPPDRRPEWQATR---------------KRLIWPNGAVAQAFSAHDPEGLRGPQFDG 127 Query: 192 GMAVFNDEASGT--PDIINKSIL-GFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDW 248 + DE + + G P + + T R G DI P Sbjct: 128 A---WVDELAKWKRARETWDMLQFGLRLGEAPR---VCVTTTPRNVGVLKDILATPSTVT 181 Query: 249 KRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSR 308 + + F + + +RY + + R E+ G + + +E A R Sbjct: 182 SSAPTEA-NRAHLAESFLDEVRARY-AGTRLGRQELDGLLIDEAEDALWSPAMLEAA--R 237 Query: 309 EAIDDLYAPLIMGCDIAGEG---GDKTVVVFRRGNIIEHIFDW 348 + +++ D G D+ ++ + DW Sbjct: 238 VDTLPEFDRVVVAVDPPVSGHAASDECGIIVVGAITRGPVQDW 280 >gi|145493391|ref|XP_001432691.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124399805|emb|CAK65294.1| unnamed protein product [Paramecium tetraurelia] Length = 733 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 20/104 (19%), Positives = 38/104 (36%), Gaps = 8/104 (7%) Query: 28 FKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEA-VDVHCHSNVNNSNPTIFKCAISAGR 86 F F+ G + + ++ F+ W E + A + V+ N + AI+ Sbjct: 317 FDAFLEEQGDQGDEEQAVDKFTFNW-WCKEGIRANIAKIRQEFVDYHNLKPIRIAITGPP 375 Query: 87 GIGKTTLNAWMMLWLISTRPGMSIIC------IANSETQLKNTL 124 GIGK+T+ + + + + + QLK L Sbjct: 376 GIGKSTIANQISTYFSIPHITIKELIQEYLNQTSEEVEQLKTNL 419 >gi|145486706|ref|XP_001429359.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124396451|emb|CAK61961.1| unnamed protein product [Paramecium tetraurelia] Length = 733 Score = 38.6 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 20/104 (19%), Positives = 38/104 (36%), Gaps = 8/104 (7%) Query: 28 FKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEA-VDVHCHSNVNNSNPTIFKCAISAGR 86 F F+ G + + ++ F+ W E + A + V+ N + AI+ Sbjct: 317 FDAFLEEQGDQGDEEQAVDKFTFNW-WCKEGIRANIAKIRQEFVDYHNLKPIRIAITGPP 375 Query: 87 GIGKTTLNAWMMLWLISTRPGMSIIC------IANSETQLKNTL 124 GIGK+T+ + + + + + QLK L Sbjct: 376 GIGKSTIANQISTYFSIPHITIKELIQEYLNQTSEEVEQLKTNL 419 >gi|16127022|ref|NP_421586.1| hypothetical protein CC_2790 [Caulobacter crescentus CB15] gi|221235816|ref|YP_002518253.1| phage DNA packaging protein [Caulobacter crescentus NA1000] gi|13424390|gb|AAK24754.1| conserved hypothetical protein [Caulobacter crescentus CB15] gi|220964989|gb|ACL96345.1| phage DNA packaging protein [Caulobacter crescentus NA1000] Length = 567 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 47/272 (17%), Positives = 77/272 (28%), Gaps = 38/272 (13%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 GRG GKT A + W P ++I + ++ M+ + Sbjct: 199 GGRGAGKTFAGARWITWNALAYPSQALI--GPTLHDVREV----------MIEGPSGLKA 246 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVG--PHNTHGMAVFNDEAS 201 + W E S +S E P++ G H + DE Sbjct: 247 MGGPAYRPRW-----EASRRRLVWPNGAVAYAFSAEDPESLRGPQFHAA-----WADEFC 296 Query: 202 GTPDIINKSIL---GFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTV 258 P + G +P T R + Sbjct: 297 AWPKPAETLAMLRFGLRLGEDPRLVVTTTPKPHRA----LKTLMAEPGVALTRAGTSANA 352 Query: 259 EGIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPL 318 + F + S YG + +A E+ G + + F + +R A D + Sbjct: 353 GNLAPAFLRTLASLYG-GTRLAAQELDGVVVETDGGLFRAEDLARCRAARPARLDR---V 408 Query: 319 IMGCD-IAGEGGDKT--VVVFRRGNIIEHIFD 347 ++ D A GD VVV RR + + D Sbjct: 409 VVAVDPPATATGDACGIVVVGRRDDRAFVLAD 440 >gi|307294267|ref|ZP_07574111.1| hypothetical protein SphchDRAFT_1737 [Sphingobium chlorophenolicum L-1] gi|306880418|gb|EFN11635.1| hypothetical protein SphchDRAFT_1737 [Sphingobium chlorophenolicum L-1] Length = 438 Score = 38.2 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 39/258 (15%), Positives = 80/258 (31%), Gaps = 28/258 (10%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 AGRG GKT A + + P I + + + + + V +L W+ Sbjct: 59 AGRGFGKTRAGAEWVRSVAEGDPAARIALVGATLGEARAVM---VEGASGVLAVSPWWNR 115 Query: 144 QSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE---- 199 + ++ + ++ GP +HG + DE Sbjct: 116 PAFL------------PALRKLVWRNGAVATLFGAAEAESLRGPQFSHG---WADEIAKW 160 Query: 200 ASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIFNIPLEDWKRYQIDTRTVE 259 A G + ++G + P + T+ R + + + Sbjct: 161 AGGQA-AWDNLMMGMRLGIAP--RVLATTTPRPVALVRGLVERNGSDVVVTRGRSADNAS 217 Query: 260 GIDSGFHEGIISRYGLDSDVARIEILGQFPQQEVNNFIPHNYIEEAMSREAIDDLYAPLI 319 + GF + YG + + R E+ G+ ++ + +E + A ++ Sbjct: 218 HLADGFLAAMERNYG-GTRLGRQELDGELIEEVEGALWSRDLLERC-RVAHVRGTLARVV 275 Query: 320 MGCD-IAGEGGDKTVVVF 336 + D A GD +V Sbjct: 276 VAVDPPASVHGDACGIVV 293 >gi|302560409|ref|ZP_07312751.1| DNA/RNA helicase, superfamily II [Streptomyces griseoflavus Tu4000] gi|302478027|gb|EFL41120.1| DNA/RNA helicase, superfamily II [Streptomyces griseoflavus Tu4000] Length = 599 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 57/202 (28%), Gaps = 49/202 (24%) Query: 37 PWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAW 96 PWG GK WQ M+ + P F A++ G GKTT Sbjct: 23 PWGTAGKL-------RAWQQGAMD--------KYIQTQPRDF-LAVATP-GAGKTTFALT 65 Query: 97 MMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAE 156 + WL+ + +A +E + K S R ++ Sbjct: 66 LASWLLHHHVVQQVTVVAPTEH---------LKKQWSEAAARIGIKLD------------ 104 Query: 157 LLEQSMGIDSKHYTITCRTYSEERPDTFVGPH----NTHGMAVFNDEA--SGTPDIINKS 210 E S G K Y TY+ H V DE +G ++ Sbjct: 105 -PEYSAGPLGKDYDGVAVTYAGVGVRPM--LHRNRVEQRKTLVILDEIHHAGDSKSWGEA 161 Query: 211 ILGFFTELNPNRFWIMTSNTRR 232 L F R +T R Sbjct: 162 CLEAF--EPATRRLALTGTPFR 181 >gi|328773858|gb|EGF83895.1| hypothetical protein BATDEDRAFT_29142 [Batrachochytrium dendrobatidis JAM81] Length = 1016 Score = 38.2 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 48/134 (35%), Gaps = 10/134 (7%) Query: 80 CAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRH 139 ++A RG GK+ + + +I + S +K L+ + K L + Sbjct: 280 VTLTASRGRGKSASLGIAIA-SAISYGYSNIFITSPSPENIKT-LFEFIFKGFDALGYEE 337 Query: 140 WFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 + + + ++ + D + +T +P F G + V DE Sbjct: 338 HLDYDIIQSTNPAFQKSIVRVNFFRDHR------QTIQWIQPSDF-GILAQAELLVI-DE 389 Query: 200 ASGTPDIINKSILG 213 A+ P + K +LG Sbjct: 390 AAAIPLPVVKKLLG 403 >gi|224586602|ref|YP_002640499.1| phage terminase, large subunit, pbsx family [Borrelia valaisiana VS116] gi|224497136|gb|ACN52769.1| phage terminase, large subunit, pbsx family [Borrelia valaisiana VS116] Length = 450 Score = 38.2 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDNPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 + + Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIHTFTTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASIDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D +++ I D A GGD T + R + Sbjct: 271 ITQ--------DYVFSSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|255933656|ref|XP_002558207.1| Pc12g14010 [Penicillium chrysogenum Wisconsin 54-1255] gi|211582826|emb|CAP81028.1| Pc12g14010 [Penicillium chrysogenum Wisconsin 54-1255] Length = 2009 Score = 38.2 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 39/117 (33%), Gaps = 12/117 (10%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 + G GKT M W +PG ++ IA + ++ V W L + ++ Sbjct: 1166 SPTGSGKTVAAELAMWWAFREKPGSKVVYIAPMKALVRE----RVQDWRKRLTRQMGLKL 1221 Query: 144 QSLSLHPSGWYAELLEQSMGI-DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 L+ + + + + I + + R++ + V DE Sbjct: 1222 VELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQTRD-------YVRKVSLVIIDE 1271 >gi|308449036|ref|XP_003087834.1| hypothetical protein CRE_16583 [Caenorhabditis remanei] gi|308252534|gb|EFO96486.1| hypothetical protein CRE_16583 [Caenorhabditis remanei] Length = 411 Score = 37.8 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 28/183 (15%), Positives = 52/183 (28%), Gaps = 21/183 (11%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSII 111 WQ A+ S + + S R +GKT ++ L P +++I Sbjct: 47 DEWQAGLGRAMLAKRASGLYAAGIGGIII--SICRQVGKTFTIGSIIFALCIIFPKLTVI 104 Query: 112 CIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPS-GWYAELLEQSMGIDSKHYT 170 A+ S + + +Q + + + + G + Sbjct: 105 WTAH----------------HSRTSNETFESLQGFAQKRKVAPHIRQIRRVNGQQQITFK 148 Query: 171 ITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNT 230 R R F G V DEA + + ++ T + N I+ Sbjct: 149 NGSRIMFGARESGF-GRGFAGVDVVVADEAQILGNKALEDMVPA-TNASKNPLIILMGTP 206 Query: 231 RRL 233 R Sbjct: 207 PRP 209 >gi|319943331|ref|ZP_08017613.1| hypothetical protein HMPREF0551_0459 [Lautropia mirabilis ATCC 51599] gi|319743146|gb|EFV95551.1| hypothetical protein HMPREF0551_0459 [Lautropia mirabilis ATCC 51599] Length = 220 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 47/166 (28%), Gaps = 15/166 (9%) Query: 52 HRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSII 111 WQ A H + AI G GKTTL A + S PG ++ Sbjct: 30 TAWQASPRTAFVDHLMARAGTHAGRPAIIAIDGRSGSGKTTLTAALA----SVVPGAQVL 85 Query: 112 CIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLS---LHPSGWYAELLEQSMGIDSKH 168 L + +W E E+ + L P W E S+ I + Sbjct: 86 -------HLDDLIWNEPLYQWDQQLVAALSELHTTGALDLIPHPWREHGREGSIRITAGA 138 Query: 169 YTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGF 214 + + G + H D+ + DI G Sbjct: 139 PLVIVEG-TGAGLQAIRGLIDLHVWVQTGDDVTEHRDISRDIAEGT 183 >gi|329936550|ref|ZP_08286286.1| DNA or RNA helicases of superfamily II [Streptomyces griseoaurantiacus M045] gi|329304065|gb|EGG47947.1| DNA or RNA helicases of superfamily II [Streptomyces griseoaurantiacus M045] Length = 611 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 56/202 (27%), Gaps = 49/202 (24%) Query: 37 PWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAW 96 PWG GK WQ M+ P F A++ G GKTT Sbjct: 35 PWGTAGKL-------RAWQQGAMD--------RYLQQQPRDF-LAVATP-GAGKTTFALT 77 Query: 97 MMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAE 156 + WL+ + +A +E + K + R ++ Sbjct: 78 LASWLLHHHVVQQVTVVAPTEH---------LKKQWAEAAARIGIKLD------------ 116 Query: 157 LLEQSMGIDSKHYTITCRTYSEERPDTFVGPH----NTHGMAVFNDEA--SGTPDIINKS 210 E S G + Y TY+ H V DE +G ++ Sbjct: 117 -PEYSAGPLGREYDGVAVTYAGVGVRPM--LHRNRVEQRKTLVILDEIHHAGDSKSWGEA 173 Query: 211 ILGFFTELNPNRFWIMTSNTRR 232 L F R +T R Sbjct: 174 CLEAF--EPATRRLALTGTPFR 193 >gi|315446103|ref|YP_004078982.1| nuclease-like protein [Mycobacterium sp. Spyr1] gi|315264406|gb|ADU01148.1| nuclease-like protein [Mycobacterium sp. Spyr1] Length = 551 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 50/181 (27%), Gaps = 5/181 (2%) Query: 57 EFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANS 116 E V S + ++ + + I G G GKT L L ++++C + Sbjct: 199 EDAADVLTEQQSVILDAIKLLHRVEIRGGAGSGKTFLAMEQARRLARAGRRVALVCYS-- 256 Query: 117 ETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTY 176 L + L + W + E L + + + Sbjct: 257 -HGLASYLERITATWNRRHRPAYVGEFHDLGKQWGAPAGPDESVRNDETVRFWEHDLPSQ 315 Query: 177 SEERPDTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGW 236 H A+ DEA D +L + ++ + +R+ Sbjct: 316 MTRLATQLDPGHRFD--AIVVDEAQDFADAWWDPLLAALKDDETGGLYVFSDEGQRVFDR 373 Query: 237 F 237 F Sbjct: 374 F 374 >gi|218202744|ref|YP_002364661.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi ZS7] gi|218164272|gb|ACK74336.1| phage terminase, large subunit, pbsx family protein [Borrelia burgdorferi ZS7] Length = 450 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 54/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDNPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 ++ + Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIDTFTTYNFTTYDNVLLSKGFIETQEKLY-KDMPTYKARVLLGEWIASIDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D +++ I D A GGD T + R + Sbjct: 271 ITQ--------DYVFSSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|154335334|ref|XP_001563907.1| hypothetical protein [Leishmania braziliensis MHOM/BR/75/M2904] Length = 1080 Score = 37.8 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 27/134 (20%), Positives = 52/134 (38%), Gaps = 10/134 (7%) Query: 80 CAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRH 139 C ++AGRG GK+ M+ + +I+C A + ++ L+ + L L +R Sbjct: 305 CVVTAGRGRGKSAALGMMVA-GAIAQGYSNIMCTAPTPENVQT-LFEFAIRGLKELGYRE 362 Query: 140 WFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 + ++L + + ++ + + T + S F DE Sbjct: 363 RTDFEALQGVSEEFAKCFIRINVFREHRQ---TLQFVSATDTAKFAQAE-----VCVIDE 414 Query: 200 ASGTPDIINKSILG 213 A+ P + K ILG Sbjct: 415 AAALPLPLVKRILG 428 >gi|328854149|gb|EGG03283.1| hypothetical protein MELLADRAFT_49560 [Melampsora larici-populina 98AG31] Length = 1103 Score = 37.8 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 23/125 (18%), Positives = 39/125 (31%), Gaps = 10/125 (8%) Query: 80 CAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRH 139 A++A RG GK+ + +I + S LK + + K L + + Sbjct: 290 VALTAARGRGKSAALGLAIT-AAIAHSYSNIFVTSPSPENLKTV-FEFIFKGLDAIGYEE 347 Query: 140 WFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 + W+ + ID Y E + +G V DE Sbjct: 348 HLDYDIHQSTNPEWH----NCVVRIDIFRQHRQTIQYIEPQDYKVLGQAE----LVVIDE 399 Query: 200 ASGTP 204 A+ P Sbjct: 400 AAAIP 404 >gi|237794637|ref|YP_002862189.1| putative phage terminase, large subunit [Clostridium botulinum Ba4 str. 657] gi|229260548|gb|ACQ51581.1| putative phage terminase, large subunit [Clostridium botulinum Ba4 str. 657] Length = 543 Score = 37.8 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 30/78 (38%), Gaps = 5/78 (6%) Query: 85 GRGIGKTTLNAWMMLWLISTRPGMS---IICIANSETQLKNTL--WAEVSKWLSMLPHRH 139 GRG GK + + + ++ G + +ANSE Q K + EV L Sbjct: 94 GRGAGKNGFISALSWYFTTSFHGKRGYNVDIVANSEEQAKTSFDDVYEVIDDNKRLQKAF 153 Query: 140 WFEMQSLSLHPSGWYAEL 157 ++ + + + Y + Sbjct: 154 YYTKEKIVYKKTRSYLKF 171 >gi|322498208|emb|CBZ33283.1| unnamed protein product [Leishmania donovani BPK282A1] Length = 1065 Score = 37.8 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 46/136 (33%), Gaps = 14/136 (10%) Query: 80 CAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVS--KWLSMLPH 137 C ++AGRG GK+ + + +IIC A + ++ + K L Sbjct: 287 CVVTAGRGRGKSAALGMTIA-GAIAQGYSNIICTAPTPENVQTLFEFAIRGLKELGYRER 345 Query: 138 RHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFN 197 + +Q +S + + + + + T + + Sbjct: 346 TDFEALQGVSEEFAKCFIRINVFREHRQTVQFVSAADTAKFAQAE-----------LCVI 394 Query: 198 DEASGTPDIINKSILG 213 DEA+ P + K ILG Sbjct: 395 DEAAALPLTLVKRILG 410 >gi|146083626|ref|XP_001464793.1| hypothetical protein [Leishmania infantum JPCM5] gi|134068887|emb|CAM59821.1| conserved hypothetical protein [Leishmania infantum JPCM5] Length = 1065 Score = 37.8 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 46/136 (33%), Gaps = 14/136 (10%) Query: 80 CAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVS--KWLSMLPH 137 C ++AGRG GK+ + + +IIC A + ++ + K L Sbjct: 287 CVVTAGRGRGKSAALGMTIA-GAIAQGYSNIICTAPTPENVQTLFEFAIRGLKELGYRER 345 Query: 138 RHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFN 197 + +Q +S + + + + + T + + Sbjct: 346 TDFEALQGVSEEFAKCFIRINVFREHRQTVQFVSAADTAKFAQAE-----------LCVI 394 Query: 198 DEASGTPDIINKSILG 213 DEA+ P + K ILG Sbjct: 395 DEAAALPLTLVKRILG 410 >gi|5802839|gb|AAD51802.1|AF170560_1 SdrA [Streptomyces coelicolor A3(2)] Length = 597 Score = 37.8 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 57/202 (28%), Gaps = 49/202 (24%) Query: 37 PWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAW 96 PWG GK WQ ME P F A++ G GKTT Sbjct: 22 PWGTAGKL-------RAWQQGAME--------KYLQDQPRDF-LAVATP-GAGKTTFALT 64 Query: 97 MMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAE 156 + WL+ + +A +E + K + R ++ Sbjct: 65 LASWLLHHHVVQQVTVVAPTEH---------LKKQWAEAAARIGIKLD------------ 103 Query: 157 LLEQSMGIDSKHYTITCRTYSEERPDTFVGPH----NTHGMAVFNDEA--SGTPDIINKS 210 E S G S+ Y TY+ H V DE +G ++ Sbjct: 104 -PEYSAGPLSREYQGIAITYAGVGVRPM--LHRNRVEQRKTLVILDEIHHAGDSKSWGEA 160 Query: 211 ILGFFTELNPNRFWIMTSNTRR 232 L F R +T R Sbjct: 161 CLEAF--EPATRRLALTGTPFR 180 >gi|21221383|ref|NP_627162.1| hypothetical protein SCO2936 [Streptomyces coelicolor A3(2)] gi|256787436|ref|ZP_05525867.1| hypothetical protein SlivT_23354 [Streptomyces lividans TK24] gi|289771334|ref|ZP_06530712.1| hypothetical protein SSPG_04602 [Streptomyces lividans TK24] gi|5531385|emb|CAB51017.1| conserved hypothetical protein [Streptomyces coelicolor A3(2)] gi|289701533|gb|EFD68962.1| hypothetical protein SSPG_04602 [Streptomyces lividans TK24] Length = 598 Score = 37.8 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 40/202 (19%), Positives = 57/202 (28%), Gaps = 49/202 (24%) Query: 37 PWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAW 96 PWG GK WQ ME P F A++ G GKTT Sbjct: 22 PWGTAGKL-------RAWQQGAME--------KYLQDQPRDF-LAVATP-GAGKTTFALT 64 Query: 97 MMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAE 156 + WL+ + +A +E + K + R ++ Sbjct: 65 LASWLLHHHVVQQVTVVAPTEH---------LKKQWAEAAARIGIKLD------------ 103 Query: 157 LLEQSMGIDSKHYTITCRTYSEERPDTFVGPH----NTHGMAVFNDEA--SGTPDIINKS 210 E S G S+ Y TY+ H V DE +G ++ Sbjct: 104 -PEYSAGPLSREYQGIAITYAGVGVRPM--LHRNRVEQRKTLVILDEIHHAGDSKSWGEA 160 Query: 211 ILGFFTELNPNRFWIMTSNTRR 232 L F R +T R Sbjct: 161 CLEAF--EPATRRLALTGTPFR 180 >gi|50312271|ref|XP_456167.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140] gi|49645303|emb|CAG98875.1| KLLA0F24398p [Kluyveromyces lactis] Length = 1055 Score = 37.4 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 34/212 (16%), Positives = 66/212 (31%), Gaps = 30/212 (14%) Query: 2 PRLISTDQKLEQELHEMLMHAECVLSFKNFVMRFFPWGIKGKPLEHFSQPHRWQLEFMEA 61 P+ QEL E+ + E V L S+ + Sbjct: 221 PKDDEEISPKNQELKELKVSLEDV--------------QPAGSLVALSKTVNQAHAILTF 266 Query: 62 VDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLK 121 +D +N++ A++AGRG GK+ + + +I + S LK Sbjct: 267 IDAISEKTLNST------VALTAGRGRGKSAALGISIA-AAVSHGYSNIFVTSPSPENLK 319 Query: 122 NTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERP 181 L+ + K L ++ + + + ++ + + + Sbjct: 320 T-LFEFIFKGFDALGYQEHIDYDIIQSTNPSFNKAIVRVDIKREHRQTIQYIIPNDSH-- 376 Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILG 213 +G V DEA+ P + K +LG Sbjct: 377 --VLGQAE----LVVIDEAAAIPLPLVKKLLG 402 >gi|313233376|emb|CBY24491.1| unnamed protein product [Oikopleura dioica] Length = 985 Score = 37.4 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 50/157 (31%), Gaps = 31/157 (19%) Query: 55 QLEFMEAVDVHCHSNVNNSNPTIFKCAISA------GRGIGKTTLNAWMMLWLISTRPGM 108 Q EF+ D V + T + A+ + G GKT + A ++ + P Sbjct: 41 QREFVFPDD----FPVRSYQQTAARAALKSNCLVCLPTGAGKTLVAAAVIRNFLDWHPNS 96 Query: 109 SIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKH 168 I +A W L ++ + + PS + + + + Sbjct: 97 QAIFVA----------------WTKPLLNQQKEALTRDAGIPSSQSCVINGHTSAKNREE 140 Query: 169 YTITCRT--YSEERPDTFVGP---HNTHGMAVFNDEA 200 + TCR + + + G + V DEA Sbjct: 141 WYSTCRLICATPQTINNDAGKNLINMQRIKLVIVDEA 177 >gi|327355898|gb|EGE84755.1| activating signal cointegrator 1 complex subunit 3 [Ajellomyces dermatitidis ATCC 18188] Length = 2024 Score = 37.4 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 38/117 (32%), Gaps = 12/117 (10%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 + G GKT M W +PG ++ IA + ++ V W L ++ Sbjct: 1165 SPTGSGKTVAAELAMWWAFREKPGSKVVYIAPMKALVRE----RVHDWRRRLTAPMGLKL 1220 Query: 144 QSLSLHPSGWYAELLEQSMGI-DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 L+ + + + + I + + R++ + V DE Sbjct: 1221 VELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQTRG-------YVRQVSLVIIDE 1270 >gi|239609198|gb|EEQ86185.1| activating signal cointegrator 1 complex subunit 3 [Ajellomyces dermatitidis ER-3] Length = 2024 Score = 37.4 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 38/117 (32%), Gaps = 12/117 (10%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 + G GKT M W +PG ++ IA + ++ V W L ++ Sbjct: 1165 SPTGSGKTVAAELAMWWAFREKPGSKVVYIAPMKALVRE----RVHDWRRRLTAPMGLKL 1220 Query: 144 QSLSLHPSGWYAELLEQSMGI-DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 L+ + + + + I + + R++ + V DE Sbjct: 1221 VELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQTRG-------YVRQVSLVIIDE 1270 >gi|261189015|ref|XP_002620920.1| activating signal cointegrator 1 complex subunit 3 [Ajellomyces dermatitidis SLH14081] gi|239591924|gb|EEQ74505.1| activating signal cointegrator 1 complex subunit 3 [Ajellomyces dermatitidis SLH14081] Length = 2024 Score = 37.4 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 38/117 (32%), Gaps = 12/117 (10%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 + G GKT M W +PG ++ IA + ++ V W L ++ Sbjct: 1165 SPTGSGKTVAAELAMWWAFREKPGSKVVYIAPMKALVRE----RVHDWRRRLTAPMGLKL 1220 Query: 144 QSLSLHPSGWYAELLEQSMGI-DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 L+ + + + + I + + R++ + V DE Sbjct: 1221 VELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQTRG-------YVRQVSLVIIDE 1270 >gi|171058461|ref|YP_001790810.1| exodeoxyribonuclease V subunit alpha [Leptothrix cholodnii SP-6] gi|170775906|gb|ACB34045.1| exodeoxyribonuclease V, alpha subunit [Leptothrix cholodnii SP-6] Length = 739 Score = 37.4 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 51/167 (30%), Gaps = 38/167 (22%) Query: 48 FSQPHR-----WQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAWMMLWLI 102 F P WQ + I+ G G GKT A ++ ++ Sbjct: 235 FGGPPAPDRFDWQRSACAIALRGRLAL------------ITGGPGTGKTYTVARLLALVM 282 Query: 103 STRPGM---SIICIANS---ETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAE 156 + P I A + +LK ++ + + + + LP + + L S + Sbjct: 283 AVHPQPQALRIALAAPTGKAAARLKQSIDSALQQLAAALPGALDWGLLQQRLSQSLTLHK 342 Query: 157 LLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDEASGT 203 LL D++ + R H + DEAS Sbjct: 343 LLGARP--DTRRFGRDAR-------------HPLEVDLLVVDEASMV 374 >gi|226322130|ref|ZP_03797652.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi Bol26] gi|226232450|gb|EEH31207.1| phage terminase, large subunit, pbsx family [Borrelia burgdorferi Bol26] Length = 450 Score = 37.4 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 54/164 (32%), Gaps = 18/164 (10%) Query: 182 DTFVGPHNTHGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDIF 241 + F G ++ +F +EA+ + +L T N +F + Sbjct: 157 ERFRG---SNSALIFVNEATTLHKQTLEEVLKRL-RCGQETIIFDT-NPDNPEHYFKTDY 211 Query: 242 NIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDVARIEI-LGQFPQQEVNNFIPHN 300 ++ + Y T + GF E Y D + + LG++ + F N Sbjct: 212 IDNIDTFTTYNFTTYDNVLLSKGFIETQEKLY-KDIPTYKARVLLGEWIASIDSIFTQIN 270 Query: 301 YIEEAMSREAIDDLYAPLIMGCDIAGE-GGDKTVVVF--RRGNI 341 + D +++ I D A GGD T + R + Sbjct: 271 ITQ--------DYVFSSPIAYLDPAFSVGGDNTALCVMERVDDK 306 >gi|212545286|ref|XP_002152797.1| DEAD/DEAH box helicase, putative [Penicillium marneffei ATCC 18224] gi|210065766|gb|EEA19860.1| DEAD/DEAH box helicase, putative [Penicillium marneffei ATCC 18224] Length = 2022 Score = 37.4 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 19/117 (16%), Positives = 38/117 (32%), Gaps = 12/117 (10%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 + G GKT M W RPG ++ IA + ++ V W + ++ Sbjct: 1162 SPTGSGKTVACELAMWWAFRERPGSKVVYIAPMKALVRE----RVQDWRKRITTAMGLKL 1217 Query: 144 QSLSLHPSGWYAELLEQSMGI-DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 L+ + + + + I + + R++ + V DE Sbjct: 1218 VELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQTRG-------YVRQVSLVIIDE 1267 >gi|291453805|ref|ZP_06593195.1| hypothetical protein SSHG_04098 [Streptomyces albus J1074] gi|291356754|gb|EFE83656.1| hypothetical protein SSHG_04098 [Streptomyces albus J1074] Length = 593 Score = 37.4 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 57/202 (28%), Gaps = 49/202 (24%) Query: 37 PWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAW 96 PWG GK WQ M+ P F A++ G GKTT Sbjct: 18 PWGTAGKL-------RAWQQAAMD--------KYVQEQPRDF-LAVATP-GAGKTTFALT 60 Query: 97 MMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAE 156 + W++ + +A +E + K + R ++ Sbjct: 61 LASWMLHHHVVQQVTVVAPTEH---------LKKQWAEAAARIGIKLD------------ 99 Query: 157 LLEQSMGIDSKHYTITCRTYSEERPDTFVGPH----NTHGMAVFNDEA--SGTPDIINKS 210 E S G SK Y TY+ H V DE +G ++ Sbjct: 100 -PEYSAGPLSKEYQGVAVTYAGVGVRPM--LHRNRVEQRKTLVILDEIHHAGDSKSWGEA 156 Query: 211 ILGFFTELNPNRFWIMTSNTRR 232 L F R +T R Sbjct: 157 CLEAF--EPATRRLALTGTPFR 176 >gi|242815191|ref|XP_002486521.1| DEAD/DEAH box helicase, putative [Talaromyces stipitatus ATCC 10500] gi|218714860|gb|EED14283.1| DEAD/DEAH box helicase, putative [Talaromyces stipitatus ATCC 10500] Length = 2030 Score = 37.4 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 38/117 (32%), Gaps = 12/117 (10%) Query: 84 AGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEM 143 + G GKT M W RPG ++ IA + ++ V W L ++ Sbjct: 1164 SPTGSGKTVACELAMWWAFRERPGSKVVYIAPMKALVRE----RVQDWRKRLTAAMGLKL 1219 Query: 144 QSLSLHPSGWYAELLEQSMGI-DSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 L+ + + + + I + + R++ + V DE Sbjct: 1220 VELTGDNTPDTRTIRDADIIITTPEKWDGISRSWQTRG-------YVRQVSLVIIDE 1269 >gi|294629610|ref|ZP_06708170.1| conserved hypothetical protein [Streptomyces sp. e14] gi|292832943|gb|EFF91292.1| conserved hypothetical protein [Streptomyces sp. e14] Length = 596 Score = 37.4 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 56/202 (27%), Gaps = 49/202 (24%) Query: 37 PWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAW 96 PWG GK WQ ME P F A++ G GKTT Sbjct: 20 PWGTAGKL-------RAWQQGAME--------KYLQEQPRDF-LAVATP-GAGKTTFALT 62 Query: 97 MMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAE 156 + WL+ + +A +E + K + R ++ Sbjct: 63 LASWLLHHHVVQQVTVVAPTEH---------LKKQWAEAAARVGIKLD------------ 101 Query: 157 LLEQSMGIDSKHYTITCRTYSEERPDTFVGPH----NTHGMAVFNDEA--SGTPDIINKS 210 E S G + Y TY+ H V DE +G ++ Sbjct: 102 -PEYSAGPLGREYDGVAVTYAGVGVRPM--LHRNRVEQRKTLVILDEIHHAGDSKSWGEA 158 Query: 211 ILGFFTELNPNRFWIMTSNTRR 232 L F R +T R Sbjct: 159 CLEAF--EPATRRLALTGTPFR 178 >gi|209544598|ref|YP_002276827.1| hypothetical protein Gdia_2467 [Gluconacetobacter diazotrophicus PAl 5] gi|209532275|gb|ACI52212.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 491 Score = 37.4 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 40/199 (20%), Positives = 67/199 (33%), Gaps = 26/199 (13%) Query: 87 GIGKTTLNAW-MMLWLISTRPGMSII------CIANSETQLKNTLWAEVSKWLSMLPHRH 139 G GK++ W M+L + PG + I NS QL++T V +W + Sbjct: 32 GSGKSSGCVWEMVLRGLKQAPGPDGVRRSRWAVIRNSYRQLEDTTIRTVHQWFPPMQFGR 91 Query: 140 WFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 W PS + + D K I + +RPD + + +E Sbjct: 92 W--------KPSEHSYTINRLAAQGDEKPAEIELLFRALDRPDQVGNLLSLELTGAWINE 143 Query: 200 ASGTPDIINKSILGFF----TELNPNRFW---IMTSNTRRLNGWFYDIF----NIPLEDW 248 A P + +++ G + + W IM +N +Y F + + Sbjct: 144 AREVPWAVIEAVQGRVGRYPAKRDGGATWSGIIMDTNPPDAESEWYKFFEEKDHTDAVEA 203 Query: 249 KRYQIDTRTVEGIDSGFHE 267 I TVE F + Sbjct: 204 IAQVIPGMTVERYARIFKQ 222 >gi|260431843|ref|ZP_05785814.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] gi|260415671|gb|EEX08930.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] Length = 176 Score = 37.0 bits (84), Expect = 4.1, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 44/128 (34%), Gaps = 11/128 (8%) Query: 182 DTFVGPHNTHGMAVFNDEASGT-PDIINKSILGFFTELNPNRFWIMTSNTRRLNGWFYDI 240 + G + DEA PD + + R +++++ +G+FY+ Sbjct: 49 ENARGETAD---LIIGDEACFIQPDEALTAFFPM--RRSTGRIFLLSTPNGTRSGYFYET 103 Query: 241 FNIPLEDWKRYQIDTRTVEGIDSGFHEGIISRYGLDSDV-ARIEILGQFPQQEVNNFIPH 299 + +I R+++ I SD R E L ++ + + Sbjct: 104 WESDA---NVRRIRARSMDTTREDRLAQIEFDRRTMSDATFRREHLCEWVGAGE-SLLSW 159 Query: 300 NYIEEAMS 307 N +E AM Sbjct: 160 NTLERAMQ 167 >gi|331245260|ref|XP_003335267.1| nucleolus protein [Puccinia graminis f. sp. tritici CRL 75-36-700-3] gi|309314257|gb|EFP90848.1| nucleolus protein [Puccinia graminis f. sp. tritici CRL 75-36-700-3] Length = 1092 Score = 36.6 bits (83), Expect = 5.5, Method: Composition-based stats. Identities = 21/125 (16%), Positives = 42/125 (33%), Gaps = 10/125 (8%) Query: 80 CAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRH 139 ++A RG GK+ + + +I + S LK L+ + K L+ L + Sbjct: 291 VTLTASRGRGKSAALGMAIA-VAVAHSYSNIFVTSPSPENLKT-LFEFIFKSLTALGYEE 348 Query: 140 WFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGMAVFNDE 199 + W ++ + + + +T +P + V DE Sbjct: 349 HLDYNVHQSSNPEWKNCIVRVDIFRNHR------QTIQYIQPQDYKVLGQAE--LVVIDE 400 Query: 200 ASGTP 204 A+ P Sbjct: 401 AAAIP 405 >gi|251783038|ref|YP_002997341.1| terminase large subunit [Streptococcus dysgalactiae subsp. equisimilis GGS_124] gi|242391668|dbj|BAH82127.1| terminase large subunit [Streptococcus dysgalactiae subsp. equisimilis GGS_124] Length = 424 Score = 36.6 bits (83), Expect = 5.6, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 74/222 (33%), Gaps = 21/222 (9%) Query: 74 NPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLS 133 NP I A GRG GK++ A+++ LI P ++ +CI ++ L+ +++ ++ +S Sbjct: 22 NPKILNIACKGGRGSGKSSNIAFIISRLIIQYP-VNAVCIRKTDNTLEQSVYEQIKWAIS 80 Query: 134 MLPHRHWFEMQS----LSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHN 189 +F+ ++ P G Y K + ++ + Sbjct: 81 EQGLERYFKFNKSPLRITYIPRGNYIVFRGAQNPERIKSLKDSRFPFAIGWIEELAEFKT 140 Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWF---YDIFNIPLE 246 DE I N + G + +F+ + +R W Y+ P Sbjct: 141 E-------DE---VKTITNSLLRGELGDGLFYKFFYTYNPPKRKQSWVNKKYESQFQPKN 190 Query: 247 DWKRYQIDTR-TVEGIDSGFHEGIISRYGLDSDVARIEILGQ 287 + T I F + R E LG+ Sbjct: 191 TF--VHASTYKDNPFIAKEFIAEAEATRERSERRYRWEYLGE 230 >gi|297194112|ref|ZP_06911510.1| type III restriction enzyme, res subunit [Streptomyces pristinaespiralis ATCC 25486] gi|297152113|gb|EFH31533.1| type III restriction enzyme, res subunit [Streptomyces pristinaespiralis ATCC 25486] Length = 594 Score = 36.3 bits (82), Expect = 8.9, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 56/202 (27%), Gaps = 49/202 (24%) Query: 37 PWGIKGKPLEHFSQPHRWQLEFMEAVDVHCHSNVNNSNPTIFKCAISAGRGIGKTTLNAW 96 PWG K WQ ME P F A++ G GKTT Sbjct: 18 PWGTANKL-------RAWQQGAME--------KYLQEQPRDF-LAVATP-GAGKTTFALT 60 Query: 97 MMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLSMLPHRHWFEMQSLSLHPSGWYAE 156 + WL+ + +A +E + K + R ++ Sbjct: 61 LASWLLHHHVVQQVTVVAPTEH---------LKKQWAAAAARIGIKLD------------ 99 Query: 157 LLEQSMGIDSKHYTITCRTYSEERPDTFVGPH----NTHGMAVFNDEA--SGTPDIINKS 210 + S G SK Y TY+ H V DE +G ++ Sbjct: 100 -PDYSAGPLSKEYHGVAVTYAGVGVRPM--LHRNRCEQRKTLVILDEIHHAGDSKSWGEA 156 Query: 211 ILGFFTELNPNRFWIMTSNTRR 232 L F R +T R Sbjct: 157 CLEAF--EPATRRLALTGTPFR 176 >gi|94990333|ref|YP_598433.1| terminase large subunit [Streptococcus phage 10270.2] gi|94994256|ref|YP_602354.1| Terminase large subunit [Streptococcus phage 10750.2] gi|94543841|gb|ABF33889.1| Terminase large subunit [Streptococcus phage 10270.2] gi|94547764|gb|ABF37810.1| Terminase large subunit [Streptococcus phage 10750.2] Length = 432 Score = 35.9 bits (81), Expect = 9.4, Method: Composition-based stats. Identities = 39/222 (17%), Positives = 74/222 (33%), Gaps = 21/222 (9%) Query: 74 NPTIFKCAISAGRGIGKTTLNAWMMLWLISTRPGMSIICIANSETQLKNTLWAEVSKWLS 133 NP I A GRG GK++ A+++ LI P ++ +CI ++ L+ +++ ++ +S Sbjct: 30 NPQILNIACKGGRGSGKSSNIAFIISRLIIQYP-VNAVCIRKTDNTLEQSVYEQIKWAIS 88 Query: 134 MLPHRHWFEMQS----LSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHN 189 +F+ ++ P G Y K + ++ + Sbjct: 89 EQGLERYFKFNKSPLRITYIPRGNYIVFRGAQNPERIKSLKDSRFPFAIGWIEELAEFKT 148 Query: 190 THGMAVFNDEASGTPDIINKSILGFFTELNPNRFWIMTSNTRRLNGWF---YDIFNIPLE 246 DE I N + G + +F+ + +R W Y+ P Sbjct: 149 E-------DE---VKTITNSLLRGELGDGLFYKFFYTYNPPKRKQSWVNKKYESQFQPSN 198 Query: 247 DWKRYQIDTR-TVEGIDSGFHEGIISRYGLDSDVARIEILGQ 287 + T I F + R E LG+ Sbjct: 199 TF--VHASTYKDNPFIAKEFIAEAEATRERSERRYRWEYLGE 238 >gi|217970261|ref|YP_002355495.1| exodeoxyribonuclease V, subunit alpha [Thauera sp. MZ1T] gi|217507588|gb|ACK54599.1| exodeoxyribonuclease V, alpha subunit [Thauera sp. MZ1T] Length = 683 Score = 35.9 bits (81), Expect = 10.0, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 48/144 (33%), Gaps = 22/144 (15%) Query: 79 KCAI-SAGRGIGKTTLNAWMMLWLISTRPGM---SIICIANSETQLKNTLWAEVSKWLSM 134 + +I + G G GKT A ++ +++T P I A + K + Sbjct: 217 RLSILTGGPGTGKTYTAARLLALMLATHPAPERLRIALAAPT------------GKAAAR 264 Query: 135 LPHRHWFEMQSLSLHPSGWYAELLEQSMGIDSKHYTITCRTYSEERPDTFVGPHNTHGM- 193 L +Q+L G + +L + I + + F H + + Sbjct: 265 LRQAIDGSLQALQRSL-GGHLDLAALTRRIGAARTLHALLGARPD-TRRFR-HHAGNPLD 321 Query: 194 --AVFNDEASGTPDIINKSILGFF 215 V DEAS + ++L Sbjct: 322 VDVVIVDEASMVHLEMMAALLEAL 345 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.307 0.129 0.365 Lambda K H 0.267 0.0392 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 6,576,772,568 Number of Sequences: 14124377 Number of extensions: 258892913 Number of successful extensions: 679973 Number of sequences better than 10.0: 653 Number of HSP's better than 10.0 without gapping: 204 Number of HSP's successfully gapped in prelim test: 557 Number of HSP's that attempted gapping in prelim test: 678964 Number of HSP's gapped (non-prelim): 829 length of query: 367 length of database: 4,842,793,630 effective HSP length: 140 effective length of query: 227 effective length of database: 2,865,380,850 effective search space: 650441452950 effective search space used: 650441452950 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.1 bits) S2: 82 (36.3 bits)