BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781213|ref|YP_003065626.1| head-to-tail joining protein, putative [Candidatus Liberibacter asiaticus str. psy62] (556 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781213|ref|YP_003065626.1| head-to-tail joining protein, putative [Candidatus Liberibacter asiaticus str. psy62] gi|254040890|gb|ACT57686.1| head-to-tail joining protein, putative [Candidatus Liberibacter asiaticus str. psy62] gi|317120678|gb|ADV02501.1| putative phage-related head-to-tail joining protein [Liberibacter phage SC1] gi|317120822|gb|ADV02643.1| putative phage-related head-to-tail joining protein [Candidatus Liberibacter asiaticus] Length = 556 Score = 1163 bits (3008), Expect = 0.0, Method: Compositional matrix adjust. Identities = 556/556 (100%), Positives = 556/556 (100%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSL 60 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSL Sbjct: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSL 60 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG Sbjct: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT Sbjct: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS Sbjct: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 Query: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL Sbjct: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 Query: 301 HPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSL 360 HPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSL Sbjct: 301 HPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSL 360 Query: 361 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL 420 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL Sbjct: 361 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL 420 Query: 421 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD 480 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD Sbjct: 421 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD 480 Query: 481 RVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 RVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR Sbjct: 481 RVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 Query: 541 AMEKKLTHDMMENSYG 556 AMEKKLTHDMMENSYG Sbjct: 541 AMEKKLTHDMMENSYG 556 >gi|315121938|ref|YP_004062427.1| head-to-tail joining protein, putative [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495340|gb|ADR51939.1| head-to-tail joining protein, putative [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 555 Score = 815 bits (2106), Expect = 0.0, Method: Compositional matrix adjust. Identities = 397/551 (72%), Positives = 455/551 (82%) Query: 5 SAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSLLSSL 64 S K I+ F +LK+QR ELN MEELT LYPYK + RMWDTTGSEACIKLSSLLSSL Sbjct: 4 SIKKIKTCFEHLKSQREELNTRMEELTSLLYPYKQEPKSRMWDTTGSEACIKLSSLLSSL 63 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 ITPPGQKWHGL+E F +QAFLY+EDA +KK+R WCDQVTD LFGFRERSRSGFV CLQS Sbjct: 64 ITPPGQKWHGLSEPFFRHQAFLYEEDAGAKKIRGWCDQVTDVLFGFRERSRSGFVSCLQS 123 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVD 184 FYTS+VEFGTGCFY+EADVDE GLEEGIRYI+VPL++VY+SVNHQN VDS+YR F FT + Sbjct: 124 FYTSIVEFGTGCFYIEADVDETGLEEGIRYIAVPLADVYLSVNHQNEVDSIYRTFEFTAE 183 Query: 185 QIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDEN 244 QI KWG KVLS KMKS+ + E ++F IIHAVYPKSL +KKKDKGNK FHSKFV +DEN Sbjct: 184 QIGGKWGYKVLSDKMKSSYEKKEPDKFKIIHAVYPKSLAEKKKDKGNKNFHSKFVCIDEN 243 Query: 245 RFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPT 304 FFEEKQI T PYI+GRYRVRADEIYG+SPAMEALP IRRLNE NELAQ+ RLSLHP Sbjct: 244 VFFEEKQITTLPYIIGRYRVRADEIYGKSPAMEALPAIRRLNEISNELAQYARLSLHPAY 303 Query: 305 IAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLD 364 +A EAKQ F K YMNIGA+S++G++LFQP+Q GNPLP++EEL R++ SI SLFLLD Sbjct: 304 LAPPEAKQLEFKNKSRYMNIGAMSKDGKALFQPLQVGNPLPFYEELKRIQGSIHSLFLLD 363 Query: 365 LFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECE 424 LFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI RELDILD+Q NLPE Sbjct: 364 LFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIKRELDILDAQHNLPELT 423 Query: 425 GADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSR 484 D+ P LLKVEYTSPLFKYQQAESVAS LQG NTV+ELG KTG+P MDH+D D+VSR Sbjct: 424 DYDHSPFHLLKVEYTSPLFKYQQAESVASVLQGTNTVLELGAKTGNPEPMDHIDIDKVSR 483 Query: 485 FSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEK 544 F+LWA+ +PA LIRD EV+ R+ R+ Q M+ + QQ +Q + GAKA +A+EK Sbjct: 484 FALWASGSPAHLIRDVDEVKQRRKDRDDQMEAMQNRQDAQQQEQMGMEAGAKAVSKAIEK 543 Query: 545 KLTHDMMENSY 555 K+T+D+MENSY Sbjct: 544 KMTNDLMENSY 554 >gi|315122900|ref|YP_004063389.1| head-to-tail joining protein, putative [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496302|gb|ADR52901.1| head-to-tail joining protein, putative [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 555 Score = 814 bits (2102), Expect = 0.0, Method: Compositional matrix adjust. Identities = 394/551 (71%), Positives = 456/551 (82%) Query: 5 SAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSLLSSL 64 S K I+ F +LK+QR ELN MEELT LYPYK + RMWDTTGSEACIKLSSLLSSL Sbjct: 4 SIKKIKTCFEHLKSQREELNTRMEELTSLLYPYKQEPKSRMWDTTGSEACIKLSSLLSSL 63 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 ITPPGQKWHGL+E F +QAFLY+EDA +KK+R WCDQVTD LFGFRERSRSGFV CLQS Sbjct: 64 ITPPGQKWHGLSEPFFRHQAFLYEEDAGAKKIRGWCDQVTDVLFGFRERSRSGFVSCLQS 123 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVD 184 FYTS+VEFGTGCFY+EADVDE GLEEGIRYI+VPL++VY+SVNHQN VDS+YR F FT + Sbjct: 124 FYTSIVEFGTGCFYIEADVDETGLEEGIRYIAVPLADVYLSVNHQNEVDSIYRTFEFTAE 183 Query: 185 QIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDEN 244 QI KWG KVLS KMKS+ + E ++F IIHAVYPKSL +KKKDKGNK FHSKFV +DEN Sbjct: 184 QIGGKWGYKVLSDKMKSSYEKKEPDKFKIIHAVYPKSLAEKKKDKGNKNFHSKFVCIDEN 243 Query: 245 RFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPT 304 FFEEKQI T PYI+GRYRVRADEIYG+SPAMEALP IRRLNE NELAQ+ RLSLHP Sbjct: 244 VFFEEKQITTLPYIIGRYRVRADEIYGKSPAMEALPAIRRLNEISNELAQYARLSLHPAY 303 Query: 305 IAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLD 364 +A +EAKQ F +K ++N GA+S++G++LFQP+Q GNPLP++EEL R++ SI SLFLLD Sbjct: 304 LAPTEAKQLEFKIKSRHINTGAMSKDGKALFQPLQVGNPLPFYEELKRIQGSIHSLFLLD 363 Query: 365 LFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECE 424 LFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI RELDILD+Q NLPE Sbjct: 364 LFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIKRELDILDAQHNLPELT 423 Query: 425 GADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSR 484 D+ P LLKVEYTSPLFKYQQAESVAS LQG NTV+ELG KTG+P MDH+D D+VSR Sbjct: 424 DYDHSPFHLLKVEYTSPLFKYQQAESVASVLQGTNTVLELGAKTGNPEPMDHIDIDKVSR 483 Query: 485 FSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEK 544 F+LWA+ +PA LIRD EV+ R+ R+ Q M+ + QQ +Q + GAKA +A+EK Sbjct: 484 FALWASGSPAHLIRDVDEVKQRRKDRDDQMEAMQNRQDAQQQEQMGMEAGAKAVSKAIEK 543 Query: 545 KLTHDMMENSY 555 K+T+D+MENSY Sbjct: 544 KMTNDLMENSY 554 >gi|317120721|gb|ADV02543.1| putative phage-related head-to-tail joining protein [Liberibacter phage SC2] gi|317120782|gb|ADV02603.1| putative phage-related head-to-tail joining protein [Candidatus Liberibacter asiaticus] Length = 539 Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust. Identities = 219/545 (40%), Positives = 312/545 (57%), Gaps = 28/545 (5%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQ--LRMWDTTGSEACIKLSS 59 N+ K + RF LK QR E+ +E+ + PY+ A ++WDTT + A KL+S Sbjct: 14 NKEFIKKLIARFESLKAQRSEIEPIRQEIIDLVCPYRGKASEDKKIWDTTATSASDKLAS 73 Query: 60 LLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFV 119 LL +LITP G +WHGL +F ++ +K +RE CD LF RE SGF Sbjct: 74 LLHNLITPFGSRWHGLVAPDPQSGSFFASQE--NKLIREQCDHFVMELFAQRELPASGFN 131 Query: 120 GCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREF 179 CL+ FYT VV FG GCFY+ E G G+RYISVP+S++ S NH+NVVD+V+ EF Sbjct: 132 LCLKDFYTEVVLFGMGCFYVSER--EGG---GLRYISVPVSSIVCSANHENVVDTVFEEF 186 Query: 180 TFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFV 239 + T + + KWG LS KMK L R++ +++ AV+P DK+ D +G+ V Sbjct: 187 SLTPENVAKKWGYDALSDKMKEDLDRSDPQKYEFFQAVFP----DKEDD--YEGYKKVIV 240 Query: 240 SVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLS 299 S+DENR EE PYIVGRY +G SP +ALP+IRRLN ++ + + Sbjct: 241 SIDENRIIEEGYHRVMPYIVGRYEASPSNPFGYSPTHKALPSIRRLNALSASVSLYSEKA 300 Query: 300 LHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG-NPLPYHEELNRLKESIR 358 L+P + + + + F KP +N G + R+GR P G + P HEE+ RL+ IR Sbjct: 301 LNPAVLTSEDTRGKTFSTKPKTVNHGWMDRQGRPRAVPFFTGSDARPSHEEMQRLQMQIR 360 Query: 359 SLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDIL-DSQ 417 L+LLDLFQVL D+ASRSA ESMEKT EKG F+ ++GGLQ+EF+G+M+ RE+DIL Q Sbjct: 361 ELYLLDLFQVLADRASRSATESMEKTLEKGIFISAIVGGLQAEFVGSMVKREIDILYQDQ 420 Query: 418 GNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHM 477 G++ G D LKV YTSPL+KYQ+AE + +QG+ E+ TGDP+ + Sbjct: 421 GDI-RGLGKD------LKVSYTSPLYKYQKAEELNGIVQGIRVNAEIASMTGDPTPLMMF 473 Query: 478 DTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQR-EVQRRVMEEQHLQQQLQQTSQDIGAK 536 + +++ + P VL+ ED +Q+ E Q++ Q Q ++++ + GA Sbjct: 474 NPYLCGKYAADGSGVPEVLVLSE---EDTKQKLIEKQKQAEASQMKQLTMEESIKTGGAI 530 Query: 537 AAGRA 541 A RA Sbjct: 531 AQDRA 535 >gi|262043663|ref|ZP_06016772.1| hypothetical protein HMPREF0484_3791 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039001|gb|EEW40163.1| hypothetical protein HMPREF0484_3791 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 554 Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust. Identities = 135/466 (28%), Positives = 229/466 (49%), Gaps = 33/466 (7%) Query: 47 DTTGSEACIKLSSLLSSLITPPGQKWHGLA-ESFSAYQAFLYKEDARSKKVREWCDQVTD 105 D TG+ A K + + S+ITP QKWH L+ E F A ++V+ + +V D Sbjct: 65 DATGALALQKFGAAIESVITPRTQKWHTLSNERF-----------ANDEEVQRYFQEVRD 113 Query: 106 TLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS 165 LF R + F Y S FGTGC +++ + + G RY + L +Y + Sbjct: 114 ILFRLRYAPWANFASQSHEHYISSGAFGTGCTFVDNVIGK-----GPRYCTYHLREIYFT 168 Query: 166 VNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD- 224 N Q ++D V+R++ T Q + ++G++ L ++++ + +++F +H V P D Sbjct: 169 ENFQGMIDVVHRKYCMTARQAIQQFGEENLPQQVRTTARNDPSKQFNFLHRVEPNDKRDM 228 Query: 225 KKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRR 284 ++DK F S + ++ ++ +E + PY + RY E+YGRSPAM LP I+ Sbjct: 229 SRQDKEGMPFRSVHICMEGSKIVQEGGYWSQPYAISRYYTAPGEVYGRSPAMVVLPDIKL 288 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL 344 LNE + + ++++ PP + + + F + PG +N G ++R+G+ L P+ Sbjct: 289 LNEINRAIIEGAQMAVRPPMLLPEDGILQPFKMMPGALNFGGMNRDGKPLALPLNTATDF 348 Query: 345 PYHEELNRLK-ESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFI 403 L K ++I F + LFQ+L D +A E+M + +EKG + P G +Q+EF+ Sbjct: 349 SVAMTLAEQKRQTINDGFFITLFQILVDNPQMTATEAMLRAQEKGQLLAPTAGRIQAEFL 408 Query: 404 GAMISRELDILDSQGNLPECEGADNPPVSL------LKVEYTSPLFKYQQAESVASALQG 457 G +I RE+DI G LPE PP L +EYTSPL + Q +E + + Sbjct: 409 GTLILREIDIAYQNGLLPE------PPEQLKEIGGEYDIEYTSPLVRLQMSEEASGIMNV 462 Query: 458 VNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEV 503 VN +G D + ++ D RF A+ P +++ E+ Sbjct: 463 VNAAGTIG--QFDQNIARTLNGDAALRFIAKASGAPLQVVKTEDEM 506 >gi|48697195|ref|YP_024925.1| hypothetical protein BcepC6B_gp05 [Burkholderia phage BcepC6B] gi|47779001|gb|AAT38364.1| gp05 [Burkholderia phage BcepC6B] Length = 549 Score = 214 bits (544), Expect = 4e-53, Method: Compositional matrix adjust. Identities = 135/451 (29%), Positives = 224/451 (49%), Gaps = 31/451 (6%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV 103 +M+D+T A + + S+ITP Q WH L A V+ + V Sbjct: 61 KMFDSTAPLALRNFVAAMDSMITPATQLWHRLKTGNDALNEI--------ASVKAYLQGV 112 Query: 104 TDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVY 163 TLF R R + GFV + + Y S+ FG G +E DV + GI Y +VP+ ++ Sbjct: 113 VRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVGK-----GIVYRNVPMQRLW 167 Query: 164 MSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + N+ ++D + ++ T+ Q ++G + LS M+S L ++ + HAV P++ Sbjct: 168 FAENNSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADR 227 Query: 224 DKKK-DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 D +K D N F S ++ +R + TFP+ +GR+ V D++YG SPA +A+P + Sbjct: 228 DPRKLDGRNMQFASYWLDEGRDRIVQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDV 287 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 R N+ + + + PP +A + FDL+ G +N G L+ +G + +P+ G Sbjct: 288 RMANDMAKTNIRGAQKLVDPPLLANEDGVLDGFDLRSGALNWGGLNDKGEEMVKPLLTGK 347 Query: 343 PLPYHEELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 E + +++I F + LFQ+L D +A E +++ +EKG + P +G QSE Sbjct: 348 QAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQSE 407 Query: 402 FIGAMISRELDILDSQGNLPEC------EGADNPPVSLLKVEYTSPLFKYQQAESVASAL 455 +G MI+RE+DIL G LP+ GAD + VEY SPL K +A A+ L Sbjct: 408 LLGPMIAREVDILAEAGQLPDMPQELIDAGAD------VDVEYDSPLNKAMRAGEGAAIL 461 Query: 456 QGVNTVVELGVKTG-DPSCMDHMDTDRVSRF 485 Q + +LG+ + DP+ + R++R Sbjct: 462 QWLQ---QLGIVSQFDPAAAKVPNGARIARL 489 >gi|221213955|ref|ZP_03586928.1| conserved hypothetical protein [Burkholderia multivorans CGD1] gi|221166132|gb|EED98605.1| conserved hypothetical protein [Burkholderia multivorans CGD1] Length = 549 Score = 210 bits (535), Expect = 4e-52, Method: Compositional matrix adjust. Identities = 134/451 (29%), Positives = 217/451 (48%), Gaps = 31/451 (6%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV 103 RM+D+T A + + S+ITP Q WH L S A V+ + V Sbjct: 61 RMFDSTAPLALRNFVAAMDSMITPATQVWHRLKTSNDALNEV--------PSVKAYLQAV 112 Query: 104 TDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVY 163 LF R R + GF + + Y S+ FG G +E DV GI Y +VP+ ++ Sbjct: 113 VRALFAVRYRWQGGFTTQMGATYQSIGLFGPGALMIEHDVGH-----GIVYRNVPMQRLW 167 Query: 164 MSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + N+ ++D + + T+ Q ++G + LS M++AL R+ + T H V P++ Sbjct: 168 FAENNAGLIDKTHVLWRLTLRQAAQRFGRENLSPSMQTALERDPEKTHTFYHVVEPRADR 227 Query: 224 DKKK-DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 D +K D N F S ++ +R + TFP+ +GR+ V D++YG SPA +A+P I Sbjct: 228 DPRKLDGRNMRFGSYWLDEGRDRIIQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDI 287 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 R N+ + + + PP +A + FDL+ G +N G L G + +P+ G Sbjct: 288 RMANDMAKTNIRGAQKMVDPPLLASEDGVLEGFDLRSGSLNWGGLDERGNEMVKPLLTGK 347 Query: 343 PLPYHEELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 E ++ +++I F + LFQ+L D +A E +++ +EKG + P +G Q+E Sbjct: 348 QAQIGIEFSQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQAE 407 Query: 402 FIGAMISRELDILDSQGNLPEC------EGADNPPVSLLKVEYTSPLFKYQQAESVASAL 455 +G +I RE+DIL G P GAD + VEY SPL K +A A+ L Sbjct: 408 LLGPLIQREVDILAEAGQFPPMPQELIDAGAD------VDVEYDSPLNKAMRAGEGAAIL 461 Query: 456 QGVNTVVELGVKTG-DPSCMDHMDTDRVSRF 485 Q + +LGV DP+ ++ R+ + Sbjct: 462 QWLQ---QLGVVAQFDPNAAKLVNGHRIGKL 489 >gi|221201497|ref|ZP_03574536.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221207947|ref|ZP_03580953.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221172132|gb|EEE04573.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221178765|gb|EEE11173.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 549 Score = 210 bits (534), Expect = 6e-52, Method: Compositional matrix adjust. Identities = 127/415 (30%), Positives = 207/415 (49%), Gaps = 15/415 (3%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV 103 RM+D+T A + + S+ITP Q WH L S + E+A V+ + +V Sbjct: 61 RMFDSTAPLALRNFVAAMDSMITPATQLWHRLKASND-----VLNENA---AVKAYLQEV 112 Query: 104 TDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVY 163 LF R R + GFV + + Y SV FG G +E DV + GI Y +VP+ ++ Sbjct: 113 VRVLFAVRYRWQGGFVTQMGATYQSVGLFGPGALMIEHDVGQ-----GIVYRNVPMQRLW 167 Query: 164 MSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + N+ ++D + ++ T+ Q ++G + LS M+SAL R+ + H V P++ Sbjct: 168 FAENNAGIIDKTHVQWELTLRQAAQRFGRENLSPSMQSALERDPEKSAIFYHIVEPRADR 227 Query: 224 DKKK-DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 D +K D N F S ++ +R + TFP+ +GR+ V + YG SPA +A+P Sbjct: 228 DPRKLDGRNMRFGSYWLDEGRDRIIQNSGFRTFPFAIGRFYVGTGDAYGGSPACDAMPDT 287 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 R +N+ + + + PP + + FDL+ G +N G L +G + +P+ G Sbjct: 288 RMVNDMAKTNIRGAQKLVDPPLLVSEDGSLEGFDLRSGSLNWGGLDEKGNEMVKPLLMGK 347 Query: 343 PLPYHEELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 E + +++I F + LFQ+L D +A E +++ +EKG + P +G QSE Sbjct: 348 QAQIGIEFTQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQSE 407 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQ 456 +G +I RELDIL LPE + +++EY SPL K +A A+ LQ Sbjct: 408 LLGPLIERELDILAEAAQLPEMPRELINAGANVEIEYDSPLNKAMRAGESAATLQ 462 >gi|242279813|ref|YP_002991942.1| hypothetical protein Desal_2347 [Desulfovibrio salexigens DSM 2638] gi|242122707|gb|ACS80403.1| conserved hypothetical protein [Desulfovibrio salexigens DSM 2638] Length = 555 Score = 183 bits (464), Expect = 7e-44, Method: Compositional matrix adjust. Identities = 150/523 (28%), Positives = 242/523 (46%), Gaps = 56/523 (10%) Query: 16 LKNQRGELNYW---MEELTGFLYPYK--------NNAQLR---MWDTTGSEACIKLSSLL 61 L+ R E N W ++++ ++ P K N+ ++R + D+T + A L++ L Sbjct: 13 LQGLRQERNSWESHWQDISDYILPRKGVYDGHRPNDGRVRSGKIIDSTATRALRILAAGL 72 Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC 121 +T P + W L S ++ AR K VREW +V +T++ R +RS F C Sbjct: 73 QGGLTSPARPWFRLGIS--------DRDLARHKSVREWISKVENTMY--RALARSNFYSC 122 Query: 122 LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTF 181 + S YT + FGTG Y E D DE+G IR+ ++ ++ + Q VD+VYREF Sbjct: 123 IHSLYTELAGFGTGILYCEPD-DERG----IRFRTLTAGEYCLATDAQGRVDTVYREFKM 177 Query: 182 TVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD-KKKDKGNKGFHSKF-V 239 T Q+ ++G + L + + S+L N + F ++H V P+ D D N F S F + Sbjct: 178 TARQLEKRFGMQNLPATVHSSLNMNRDHWFDVLHVVQPRDEFDIALMDTMNMPFESVFLL 237 Query: 240 SVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLS 299 + E PY+ R+ A ++YGRSPAM+ L ++ L E Q L+ Sbjct: 238 NGHGGHVLSESGFMENPYMAPRWDTSAMDVYGRSPAMDVLADVKMLMEMSKSQIQAVHLT 297 Query: 300 LHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP--LPYHEELNRLKESI 357 L PP + V R +L PG N + + + P+ P ++ ++ +I Sbjct: 298 LRPP-MKVPSMYSRRLNLLPGGQN--PVEQNQQDSVSPLYQVRPDLAGVSNKIQDVRTAI 354 Query: 358 RSLFLLDLFQVLDDKASR--SAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILD 415 R F D+F ++ R +AAE E+ EK +GP+I +E + +I R IL Sbjct: 355 REGFYNDIFMMMAGTNRRTITAAEVAERHEEKLIQLGPVIERQHTELLDPLIDRVFGILM 414 Query: 416 SQGNLPEC----EGADNPPVSLLKVEYTSPLFKYQQ---AESVASALQGVNTVVELGVKT 468 G LPE EGAD +K++Y S L + Q+ +S+ S Q V + + Sbjct: 415 RSGQLPEAPSVLEGAD------IKIDYISVLAQAQKMVGTQSIQSLAQFVGNLAK----- 463 Query: 469 GDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQRE 511 +P +D +D DR P ++R EVE +R R+ Sbjct: 464 ANPEVLDKVDMDRAVDDYAELIGVPNGIVRSGDEVEKLRNMRK 506 >gi|42526662|ref|NP_971760.1| head-to-tail joining protein, putative [Treponema denticola ATCC 35405] gi|41816855|gb|AAS11641.1| head-to-tail joining protein, putative [Treponema denticola ATCC 35405] Length = 560 Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust. Identities = 125/469 (26%), Positives = 214/469 (45%), Gaps = 34/469 (7%) Query: 51 SEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 SE KL S L P W L S + + Y V++W +Q L+ Sbjct: 64 SEYLKKLVSGLMGYTISPNVTWLKL--SLNNTEMLEYA------GVKDWLEQSEKALY-- 113 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQN 170 E +R+ + F ++ FG G +DEK E IR++++ +Y++ N Sbjct: 114 EEFNRNNLYSQVSLFISNAASFGHGVML----IDEKK-ENSIRFLTIAEPEIYIAENEYG 168 Query: 171 VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALA--RNENERFTIIHAVYPKSLTDKKK- 227 +D+V+R F+ TV I++++G++ +S ++K+ + +N+ I+HAV P+ D+ K Sbjct: 169 DIDTVFRYFSMTVKNIIARFGEENVSEQIKNDAKDIKGKNKEIKILHAVLPRDDYDESKL 228 Query: 228 DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNE 287 D N F S ++ +D N EE PY V + YG SPA EA+P +R LN+ Sbjct: 229 DGKNMEFASYYIDMDNNTILEESGYYELPYSVFIWEKETSSAYGGSPAREAIPDMRLLNK 288 Query: 288 TVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYH 347 + +L PP + + GY + P+ G P Sbjct: 289 VEEARLKLAQLVSEPPMNVPDSMRGFESVVPAGY----NYYERPDMIMTPINIGANFPIT 344 Query: 348 -EELNRLKESIRSLFLLDLFQVLDDK-ASRSAAESMEKTREKGAFVGPLIGGLQSEFIGA 405 E + ++ +R F +D +L + A ++A E +E EK A + LI Q++ + Sbjct: 345 LETIQDIESRLRDKFHVDFMLMLQAQTAQKTATEVIELQGEKSALLSSLIVN-QNKALSE 403 Query: 406 MISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLF----KYQQAESVASALQGVNTV 461 ++ R L+I+ QG PE N ++L V++ PL +Y Q V ++L + Sbjct: 404 IVIRTLNIMYRQGRFPEPPNILNGSDAVLNVDFVGPLAQAQKRYHQTGGVQTSLAISQPI 463 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQR 510 +++ +P +D++DTD++ + L P IR+ EVE IRQQR Sbjct: 464 IQM-----NPEVLDYIDTDKLLKNVLDTNGFPQSAIREDDEVEKIRQQR 507 >gi|291334411|gb|ADD94066.1| hypothetical protein ALOHA_HF400048F7ctg1g11 [uncultured phage MedDCM-OCT-S04-C1035] Length = 467 Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 119/449 (26%), Positives = 212/449 (47%), Gaps = 41/449 (9%) Query: 64 LITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQ 123 ++T P W L F ++ + + W + T+ ++ ++S F + Sbjct: 1 MLTNPSTPWFSLK--------FKNEDMEGEDEAKLWLESATEVMYS--AFNQSNFQQEIF 50 Query: 124 SFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTV 183 Y ++ FGT ++E D DE L+ R+I+ +Y+S N + +D+V+R+F + Sbjct: 51 ELYHDLITFGTAAMFIEED-DEDNLKFSTRHIN----EIYISENEKGRIDTVFRKFRISA 105 Query: 184 DQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKS-LTDKKKDKGNKGFHSKFVSVD 242 + K+G+ +S+ + ++ E I+HAVYP+ KK+D N F S ++ D Sbjct: 106 RAAIRKFGN--VSNNIAVIAKKDPYEEVEILHAVYPRDDYNPKKQDTENMQFESIYLDAD 163 Query: 243 ENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 FP++V RY + EIYGRSPAM ALP ++ LNE + + + + P Sbjct: 164 SGEELSVSGFREFPFVVPRYLKASHEIYGRSPAMTALPDVKMLNEMSKTIIKSAQKQVDP 223 Query: 303 PTIAVSEAKQRNFDLKPGYMNIGALSREG-RSLFQPVQFG--NPLPYHEELNRLKESIRS 359 P + + PG +N R G R +P+ G N L + E R + SIR+ Sbjct: 224 PLLVPDDGFLLPVRTVPGGLN---FYRAGTRDRIEPLNIGANNTLGLNMEEQR-RNSIRN 279 Query: 360 LFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGN 419 F ++ ++ D +A E +++ EK +GP++G LQSE + +I R IL + N Sbjct: 280 AFYVNQL-MMQDGPQMTATEVIQRNEEKMRLLGPVLGRLQSELLKPLIDRSFAIL-MRRN 337 Query: 420 L----PE-CEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCM 474 L PE G D +++EY SPL K Q++ ++S ++ + +G + Sbjct: 338 LFAQPPEFLSGQD------IEIEYVSPLAKAQKSTELSSIMRAIEI---MGSLSNVAPVF 388 Query: 475 DHMDTDRVSRFSLWATNTPAVLIRDTAEV 503 DH++ D++ R P +++ +E+ Sbjct: 389 DHINMDKLVRHLTNIVGVPQKILKPQSEL 417 >gi|291336934|gb|ADD96462.1| hypothetical protein ALOHA_HF400048F7ctg1g11 [uncultured organism MedDCM-OCT-S09-C787] Length = 450 Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 115/446 (25%), Positives = 213/446 (47%), Gaps = 35/446 (7%) Query: 45 MWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVT 104 ++D + ++ L++ L ++T P W L F + + +EW + T Sbjct: 22 IFDGSPLQSVELLAASLHGMLTNPSTPWFSLR--------FKQNDMENEDEAKEWLEDAT 73 Query: 105 DTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYM 164 + ++ ++S F + Y ++ FGT ++E D DE L+ R+I+ +++ Sbjct: 74 EVMYS--AFNKSNFQQEIFELYHDLITFGTAAMFIEED-DEDILKFSTRHIN----EIFI 126 Query: 165 SVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD 224 + N + +D+V+R+F+ + ++ K+GD +S + + ++ E I+HAVYP+S D Sbjct: 127 AENDKGRIDTVFRKFSLSARAVMQKFGD--VSINIATKAKKDPYEEVEIMHAVYPRSDFD 184 Query: 225 -KKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 +K+DK N F S ++ + FP++V RY + EIYGRSPAM ALP ++ Sbjct: 185 PRKQDKENMPFESVYLDAESGDELSVSGFREFPFVVPRYLKASHEIYGRSPAMTALPDVK 244 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP 343 LNE + + + PP + + PG +N R + P Sbjct: 245 MLNEMSKTTIKSAQKQVDPPLLVPDDGFMLPVRTIPGGLNFYRAGTRDRIETLNIGANTP 304 Query: 344 LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFI 403 L + E R + SIR+ F ++ ++ +A E +++ EK +GP++G LQSE + Sbjct: 305 LGLNMEEQR-RNSIRNAFYVNQL-MMQSGPQMTATEVIQRNEEKMRLLGPVLGRLQSELL 362 Query: 404 GAMISRELDILDSQGNL----PE-CEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGV 458 +I R ++ + NL PE G D +++EY SPL K Q++ ++S ++ + Sbjct: 363 KPLIDRTFALI-LRKNLFRPAPEFLAGQD------IEIEYVSPLAKAQKSTELSSIMRAI 415 Query: 459 NTVVELGVKTGDPSCMDHMDTDRVSR 484 LG + DH++ D++ R Sbjct: 416 EI---LGSLSNVAPVFDHINMDKLVR 438 >gi|288959388|ref|YP_003449729.1| phage head-tail connector protein [Azospirillum sp. B510] gi|288911696|dbj|BAI73185.1| phage head-tail connector protein [Azospirillum sp. B510] Length = 535 Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust. Identities = 129/475 (27%), Positives = 208/475 (43%), Gaps = 37/475 (7%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV 103 R++D T A L++ L +IT P W + E + V+ W V Sbjct: 55 RLFDATAGMANNNLAAGLYGMITNPANSWFNIKHEID--------ELNEVQAVKLWMATV 106 Query: 104 TDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVY 163 + + F + Y + FGT FY++ ++ G G+ Y LS + Sbjct: 107 ERAMRQALAANGLAFYSRVFGLYLDLPAFGTAVFYID---EQPG--RGLWYSHRRLSECF 161 Query: 164 MSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENER-FTIIHAVYPKSL 222 +S N + +D+VYR+FT+T Q +WGD+ ++ A+ + E +R F +HAV P Sbjct: 162 VSENDREEIDTVYRDFTWTARQAQQRWGDRA-GREVAKAIEKGEPDRPFRWLHAVEPNPD 220 Query: 223 TDKKKDKGN-KGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPT 281 D +K K F S +V VD+ E PY V R+ YG S A+ A+ Sbjct: 221 FDPRKLGARFKPFRSVYVGVDDRHVVAEGGYDELPYQVPRWAPSDAGTYGDSAAVLAIAD 280 Query: 282 IRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG 341 I+ +N + ++ PP +A E R PG + G + G L +P+Q G Sbjct: 281 IKMVNAMGKTTIVGAQKAVDPPLLAPDEFSVRGLRTSPGGITYGGVDMGGNQLLKPLQTG 340 Query: 342 NPLPYHEELNRLKE-SIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQS 400 + EL + +IR F L ++ + R+A E ME EK + P +G +Q+ Sbjct: 341 ARVDLGLELEEQRRGAIREAFHWSLL-LMVQQPGRTATEVMEHQEEKLRLMAPHLGRIQA 399 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSL-----LKVEYTSPL---FKYQQAESVA 452 EF+ + R +L+ G LP PP L L+++Y SPL K + +V Sbjct: 400 EFLDPALGRVFSLLNRTGQLPP------PPDVLRQYPGLRLDYVSPLARAAKAAEGAAVI 453 Query: 453 SALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIR 507 L+ + + +L P MD+ DTD ++R A PA ++ D +VE +R Sbjct: 454 RTLEALGPIAQL-----RPEVMDNFDTDEIARGISDAYGLPAKMMLDPRQVEQMR 503 >gi|317152045|ref|YP_004120093.1| Bacteriophage head-to-tail connecting protein [Desulfovibrio aespoeensis Aspo-2] gi|316942296|gb|ADU61347.1| Bacteriophage head-to-tail connecting protein [Desulfovibrio aespoeensis Aspo-2] Length = 603 Score = 153 bits (387), Expect = 6e-35, Method: Compositional matrix adjust. Identities = 143/529 (27%), Positives = 220/529 (41%), Gaps = 53/529 (10%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-------------AQLRMWDTTGSE 52 A+ +Q RF L+ R EL+ ++ P KN+ R++D+T S Sbjct: 7 ARSLQTRFKGLEEARQPWLAAWRELSDYMLPRKNSFTGIDPGSTRGRSGDERIFDSTPSH 66 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 A L+S L L+T P W + + VR + Q + + Sbjct: 67 ALELLASSLGGLLTNPAMPWFDIRAR--------DPDQGDGAGVRTFLQQARERMIALFN 118 Query: 113 RSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV 172 +GF + Y V GT Y+EAD D +R+ + PL VY + + + V Sbjct: 119 TEDTGFQTNVHELYLDVALLGTAVMYVEADPDTV-----VRFCTRPLGEVYAAESARGAV 173 Query: 173 DSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK-DKGN 231 DSVYR +T + Q +WG S + + ++ I+HAV+P++ D + Sbjct: 174 DSVYRRYTLSARQTAREWG-AACSGETRRKAEERPDDTVEILHAVFPRTDRDPYGVGAAH 232 Query: 232 KGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNE 291 F S +V EE PY+V R+ A E YGR P AL R LN Sbjct: 233 FPFASVYVETGAEHVLEESGYLEMPYLVPRWAKAAGETYGRGPGQTALSDTRVLNAMART 292 Query: 292 LAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALS--REGRSLFQPVQFGNPLPYHEE 349 PP + + L P + G LS R G P + PLP + + Sbjct: 293 ALMAAEKMSDPPLMVPDDGF-----LGPVHSGPGGLSYYRAG----SPDRI-EPLPVNVD 342 Query: 350 L-------NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEF 402 L + +ESIR +FL D Q+ + + +A E++ + EK +GP++G LQ+EF Sbjct: 343 LAATETMMQQRRESIRRIFLGD--QLTPEGPAVTATEALIRQSEKMRVLGPVLGRLQAEF 400 Query: 403 IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 + +I R I+ G LP P ++V YTSP+ + Q+ E A L + Sbjct: 401 LSPLIRRVFRIMLRAGALPPFPQGFGP--DDIEVRYTSPVARAQK-EFEARGLSRTMEYL 457 Query: 463 ELGVKTGDP-SCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQR 510 V DP MD+ DTDR +R TP+ +R +V + R + Sbjct: 458 APLVGASDPFGIMDNFDTDRAARHVAELFGTPSDYLRPEKDVAETRAAK 506 >gi|167041083|gb|ABZ05844.1| hypothetical protein ALOHA_HF400048F7ctg1g11 [uncultured marine microorganism HF4000_48F7] Length = 552 Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust. Identities = 125/512 (24%), Positives = 232/512 (45%), Gaps = 53/512 (10%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-----------AQLRMWDTT 49 M+ +A +Q+ + LK++RG +++ + P + + + R++++T Sbjct: 1 MSSDAATLVQE-YEALKSERGNWENMWQDIAELMIPRRADFTNRYRAPGEQRRDRIYEST 59 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 A ++ +S L + +T W L +E ++++V+ W + T Sbjct: 60 AVRALVRGASGLHNTLTSSTVPWFALETE--------DRELMKNRQVQLWLEDATRRCNS 111 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 RS F +Y ++ FGTGC Y+ E G+ G + S L + Y++ Sbjct: 112 VFNAPRSMFHQSAHEYYLDLLAFGTGCMYV---TQEPGM--GPVFKSYFLGHTYIAEGKT 166 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDK 229 ++DSVYR F T + ++G+K+ +K+A + RF ++H V P+S + Sbjct: 167 GMIDSVYRRFDDTARSLYKQFGNKLPDEIVKAA-DKEPFRRFELLHIVRPRSNAPGGRTS 225 Query: 230 GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 K F S +V + + +E PYIV R++ + E+YGR P +EALP +R V Sbjct: 226 KQKPFLSVYVHAESRKVVQEGGFDEMPYIVSRWQKNSMEVYGRGPGIEALPDVR----MV 281 Query: 290 NELAQFGRLSLH----PPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLP 345 NE+ + G ++L PP + + PG +N + P+Q G + Sbjct: 282 NEMERVGLIALQKVVDPPLLVPDDGFLSPIRTTPGGLNYYRAGLGPQDRIAPLQTGGRVD 341 Query: 346 YHE-ELNRLKESIRSLFLLDLFQVLDDKASR------SAAESMEKTREKGAFVGPLIGGL 398 +E ++ +++ +I F LDL ++ A+ SA E + R++ +GP++ Sbjct: 342 LNEAKIGQVRAAIERTFYLDLLELPGPTAADGDVLRFSATEIAARQRDRLNILGPIVARQ 401 Query: 399 QSEFIGAMISRELDILDSQGNLPECEGADNPPVSLL----KVEYTSPLFKYQQAESVASA 454 ++EF+G ++ R L ++ LP PP LL KV Y++P+ Q+A +AS Sbjct: 402 EAEFLGPLVIRTLSVMLRAEMLPP------PPQVLLDADFKVSYSNPVAIAQRAGELASI 455 Query: 455 LQGVNTVVELGVKTGDPSCMDHMDTDRVSRFS 486 Q + +V DP+ + T RV+ + Sbjct: 456 SQLIQFLVPFA--QLDPTVIQRFQTGRVAELA 485 >gi|218886173|ref|YP_002435494.1| hypothetical protein DvMF_1072 [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218757127|gb|ACL08026.1| conserved hypothetical protein [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 595 Score = 150 bits (378), Expect = 7e-34, Method: Compositional matrix adjust. Identities = 128/486 (26%), Positives = 212/486 (43%), Gaps = 51/486 (10%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV 103 R+ D T + A L++ + +T P + W L + A DA S R W D V Sbjct: 74 RVIDATATRAVRILAAGMQGGLTSPARPWFRLRLADGA--------DAESGPARRWLDAV 125 Query: 104 TDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVY 163 L+ +RS F + YT + FG+ Y E D E R+ ++ Sbjct: 126 EQRLYW--ALARSNFYQASHALYTELAAFGSADLYQEVDP-----ERLTRFAALTCGEFS 178 Query: 164 MSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + + VD+V R T Q+ ++G+ LS+ + L + N ++H V P+++ Sbjct: 179 WACDAAGRVDTVARRMLMTARQLAERYGEAHLSTGTRRMLRKEPNRHVEVVHLVRPRAV- 237 Query: 224 DKKKDKGNKGFHSKFVSV------DENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAME 277 + G H F S+ E FP++ R+ V ++YGRSP M+ Sbjct: 238 --RTPGHGSGLHMPFESLVFEADGAAGDLLHEGGFEEFPHLAARWDVTGSDVYGRSPGMD 295 Query: 278 ALPTIRRLNETVNELAQFGRLSLH----PPTIAVSEAKQRNFDLKPGYMNIGALSREGRS 333 LP ++ L E+A+ L++H PP + KQR +L PG N A + Sbjct: 296 VLPDVKML----QEMARSQLLAIHKVVNPPMRVPTGFKQR-LNLIPGAQNYVAPGQP--E 348 Query: 334 LFQPVQFGNP--LPYHEELNRLKESIRSLFLLDLFQVL--DDKASRSAAESMEKTREKGA 389 P+ NP +++ +++++R F DLF + D +++ +AAE E+ +EK Sbjct: 349 AVAPLYQINPDIAAVTRKIDDVRKAVREGFFNDLFLMFTADGRSNVTAAEVAERGQEKLL 408 Query: 390 FVGPLIGGLQSEFIGAMISRELDILDSQG----NLPECEGADNPPVSLLKVEYTSPLFKY 445 +GP+I Q+E + +++R IL G N PE EG + ++VEY S L + Sbjct: 409 MLGPVIERHQTELLDPLLTRTYGILRRAGALPPNPPELEGLE------MRVEYVSALAQA 462 Query: 446 QQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVED 505 Q+ + S Q V L P +D +D D+ PA ++R AEV Sbjct: 463 QRLGAAQSIRQFAAEVTALSATA--PGVLDKIDFDQAVDELASIGGVPARVVRSDAEVLR 520 Query: 506 IRQQRE 511 +R +RE Sbjct: 521 LRAERE 526 >gi|298485985|ref|ZP_07004059.1| hypothetical protein PSA3335_1414 [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159462|gb|EFI00509.1| hypothetical protein PSA3335_1414 [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 533 Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 142/509 (27%), Positives = 219/509 (43%), Gaps = 42/509 (8%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSLLSSLITPPG 69 +D F++ RG + +E++T + + RM D T ++A LSS + S +TP Sbjct: 26 RDCFDHSYPIRGS-GFCIEQITAMEAQMR---KARMIDGTTTDAARILSSGIMSGLTPAN 81 Query: 70 QKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSV 129 W G+ S + R W D D L+ + S F T V Sbjct: 82 SLWFGM------------DVGQESDEERRWLDGSADILW--QNIHASNFDAAAFEGLTDV 127 Query: 130 VEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQN-VVDSVYREFTFTVDQIVS 188 V G Y++ D+ EKG G + P+++VY S + +D+VYR + T +Q V+ Sbjct: 128 VCAGWFALYIDQDM-EKG---GFTFDLWPIASVYCSASKAGGKIDTVYRTYKLTAEQAVN 183 Query: 189 KWGDKVLSSKMKSALARNENERFTIIHAVYPKS--LTDKKKDKGNKGFHSKFVSVDENRF 246 ++G+ LS + E IHA+YP++ + + K N S V V Sbjct: 184 EFGEDNLSETTRKLAKEKPQELVEFIHAIYPRTTHMVGARLAK-NMPVASCKVEVAAKTL 242 Query: 247 FEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIA 306 E P +V R+ + D +Y P +ALP R LNE G L++ IA Sbjct: 243 VSESGYHEMPVVVPRWMMIPDSVYAVGPVFDALPDSRTLNELCRMDLAAGDLAIAGMWIA 302 Query: 307 VSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE-ELNRLKESIRSLFLLDL 365 + +K G I + +P+Q G+ Y E ++ RL+ SIR + + D Sbjct: 303 EDDGVLNPRTVKVGPRKI--IVANSVDSMKPLQSGSNFQYAETKIARLQGSIRKILMADQ 360 Query: 366 FQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEG 425 Q D A +A E + +GP+ G LQ+E++ MI R I G L + Sbjct: 361 LQAQDGPA-MTATEVHVRVNLIRQLLGPVYGRLQTEYLQPMIERCFGIAYRAGVLGQA-- 417 Query: 426 ADNPPVSL----LKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 P SL V Y SPL + Q+ E V++ Q V L V DPS MD++D D Sbjct: 418 ----PESLAGRDFTVRYLSPLARSQKLEEVSAIDQFVQGA--LIVAQADPSVMDNIDMDE 471 Query: 482 VSRFSLWATNTPAVLIRDTAEVEDIRQQR 510 RF A P+ +IR A+ + +R+ R Sbjct: 472 AQRFKGEALGVPSSVIRSKADRDKLREDR 500 >gi|302339294|ref|YP_003804500.1| head-to-tail joining protein [Spirochaeta smaragdinae DSM 11293] gi|301636479|gb|ADK81906.1| head-to-tail joining protein, putative [Spirochaeta smaragdinae DSM 11293] Length = 560 Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 131/526 (24%), Positives = 237/526 (45%), Gaps = 44/526 (8%) Query: 3 QRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----NNAQLR-----MWDTTGSE 52 ++SA++I F LK +R +E+T ++P + N + ++D T Sbjct: 4 EKSAQEIIQTFEQLKQERSTWEDEYQEITEQIFPRRSVWTDNKGRASRSGGLIYDGTPIS 63 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 A L++ L + P +W L + E + + R+W + V + ++ E Sbjct: 64 ALNLLANGLVGYLVSPATRWFKLRPT--------QDELLQIRGARQWLEIVENLIYD--E 113 Query: 113 RSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV 172 +RS F + ++ G Y++ D+ + R+ +Y++ + + Sbjct: 114 FNRSNFYEEIVEYFRDGGSIGIATIYVQEDIGRRMANYSCRH----PKEIYIAEDRFGYI 169 Query: 173 DSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNK 232 D+V+R F T ++ ++G + LS +++ R+ ER IIHAVYP+ + +K KGN+ Sbjct: 170 DTVFRRFFPTAKELEEEFGREALSDGVQNLCERSPYERVEIIHAVYPRKKRNPRK-KGNR 228 Query: 233 G--FHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVN 290 F S +V N E+ PY+V R+ +DE+YGR P +AL ++RLN Sbjct: 229 DMKFASAYVEGGSNHKIRERGYERLPYVVWRWSTNSDEVYGRGPGYDALVDVKRLNRLSR 288 Query: 291 ELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEEL 350 ++ + ++++ PP +AV E + + P +N E PV + + L Sbjct: 289 DMLKQSQMAVDPP-LAVPEKMRGKVNWVPRGLNYYQNPNE-----VPVALNPGMQFQVGL 342 Query: 351 NR---LKESIRSLFLLDLFQVLDDKASR-SAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 +R +++ I F+ D F +L+ +A E ME+ EK A +G +IG + SEF+ + Sbjct: 343 DREQHMQQIIEKHFMTDFFLMLEQAPKEMTATEVMERQSEKAAVLGTVIGRISSEFLDPI 402 Query: 407 ISRELDILDSQGNL----PECEGADNPPVSLLKVEYTSPLFKYQQAESVA-SALQGVNTV 461 I DI L PE A ++++Y PL + Q+ V A Q +N V Sbjct: 403 IDITFDIAMKGKRLPPPPPEFAEAMYKTNGGIEIDYLGPLAQAQKKFHVTQGAQQSLNAV 462 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIR 507 + +P D ++ D+++ L A P I D +V+ IR Sbjct: 463 AP--IMQINPQVADLINWDQLTMEILHAYGMPQKAIVDLRDVQKIR 506 >gi|327252184|gb|EGE63856.1| bbp21 [Escherichia coli STEC_7v] Length = 559 Score = 140 bits (353), Expect = 6e-31, Method: Compositional matrix adjust. Identities = 128/523 (24%), Positives = 236/523 (45%), Gaps = 51/523 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D D+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDDDDI-----IRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDKKK- 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR-EGRSLFQPVQFGNP 343 L +Q + +PP IA + K + L PG +I + + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMIAPTSLKNQRASLLPG--DITYIDQITGQDGFRPAYLVNP 343 Query: 344 LPYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQ 399 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 344 STADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLN 403 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASAL 455 E + +I R ++ + LP PP ++ LKVEY S + + Q++ ++S Sbjct: 404 DECLNPLIDRAFSMMVRKNMLPP------PPDAMEGMPLKVEYISVMAQAQKSIGLSSLA 457 Query: 456 QGVNTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLI 497 VN + +L P +D ++ D+ + F+ + +P V++ Sbjct: 458 STVNFIGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIV 498 >gi|324008560|gb|EGB77779.1| hypothetical protein HMPREF9532_01747 [Escherichia coli MS 57-2] Length = 559 Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 127/523 (24%), Positives = 236/523 (45%), Gaps = 51/523 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDKKK- 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR-EGRSLFQPVQFGNP 343 L +Q + +PP +A + K + L PG +I + + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPG--DITYIDQITGQDGFRPAYLVNP 343 Query: 344 LPYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQ 399 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 344 STADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLN 403 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASAL 455 E + +I R ++ + LP PP ++ LKVEY S + + Q++ ++S Sbjct: 404 DECLNPLIDRSFSMMVRKNMLPP------PPDAMEGMPLKVEYISVMAQAQKSIGLSSLA 457 Query: 456 QGVNTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLI 497 VN + +L P +D ++ D+ + F+ + +P V++ Sbjct: 458 STVNFIGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIV 498 >gi|301046408|ref|ZP_07193568.1| conserved hypothetical protein [Escherichia coli MS 185-1] gi|300301634|gb|EFJ58019.1| conserved hypothetical protein [Escherichia coli MS 185-1] Length = 559 Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 127/523 (24%), Positives = 236/523 (45%), Gaps = 51/523 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEANRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDKKK- 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR-EGRSLFQPVQFGNP 343 L +Q + +PP +A + K + L PG +I + + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPG--DITYIDQITGQDGFRPAYLVNP 343 Query: 344 LPYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQ 399 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 344 STADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLN 403 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASAL 455 E + +I R ++ + LP PP ++ LKVEY S + + Q++ ++S Sbjct: 404 DECLNPLIDRAFSMMVRKNMLPP------PPDAMEGMPLKVEYISVMAQAQKSIGLSSLA 457 Query: 456 QGVNTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLI 497 VN + +L P +D ++ D+ + F+ + +P V++ Sbjct: 458 STVNFIGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIV 498 >gi|117624712|ref|YP_853625.1| putative tail protein [Escherichia coli APEC O1] gi|115513836|gb|ABJ01911.1| putative tail protein [Escherichia coli APEC O1] Length = 559 Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 127/523 (24%), Positives = 238/523 (45%), Gaps = 51/523 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDKKK- 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMN-IGALSREGRSLFQPVQFGNP 343 L +Q +PP +A + + ++ L PG + + L+ G+ +PV NP Sbjct: 286 LQLLQKRKSQIIDKVTNPPMVAPTTLRTQSVSLLPGGVTYVDQLT--GQEGLRPVYQVNP 343 Query: 344 --LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQ 399 ++ +++I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 344 NTADLISDIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLN 403 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASAL 455 E + +I R ++ + LP PP ++ LKVEY S + + Q++ ++S Sbjct: 404 DECLNPLIDRAFSMMVRKNMLPP------PPDAMEGMPLKVEYISVMAQAQKSIGLSSLA 457 Query: 456 QGVNTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLI 497 VN + +L G P +D ++ D+ + F+ + +P V++ Sbjct: 458 STVNFIGQLA--QGKPEALDKLNVDQAIDAFADMSGVSPTVIV 498 >gi|323156133|gb|EFZ42292.1| bbp21 [Escherichia coli EPECa14] Length = 559 Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 127/523 (24%), Positives = 236/523 (45%), Gaps = 51/523 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDKKK- 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIDVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMN-IGALSREGRSLFQPVQFGNP 343 L +Q + +PP +A + K + L PG + I ++ G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQIT--GQDGFRPAYLVNP 343 Query: 344 LPYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQ 399 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 344 STADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLN 403 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASAL 455 E + +I R ++ + LP PP ++ LKVEY S + + Q++ ++S Sbjct: 404 DECLNPLIDRAFSMMVRKNMLPP------PPDAMEGMPLKVEYISVMAQAQKSIGLSSLA 457 Query: 456 QGVNTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLI 497 VN + +L P +D ++ D+ + F+ + +P V++ Sbjct: 458 STVNFIGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIV 498 >gi|301019343|ref|ZP_07183529.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|299882260|gb|EFI90471.1| conserved hypothetical protein [Escherichia coli MS 196-1] Length = 559 Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 127/523 (24%), Positives = 236/523 (45%), Gaps = 51/523 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEANRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDKKK- 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR-EGRSLFQPVQFGNP 343 L +Q + +PP +A + K + L PG +I + + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPG--DITYIDQITGQDGFRPAYLVNP 343 Query: 344 LPYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQ 399 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 344 STADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLN 403 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASAL 455 E + +I R ++ + LP PP ++ LKVEY S + + Q++ ++S Sbjct: 404 DECLNPLIDRAFSMMVRKNMLPP------PPDAMEGIPLKVEYISVMAQAQKSIGLSSLA 457 Query: 456 QGVNTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLI 497 VN + +L P +D ++ D+ + F+ + +P V++ Sbjct: 458 STVNFIGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIV 498 >gi|331648176|ref|ZP_08349266.1| conserved hypothetical protein [Escherichia coli M605] gi|331043036|gb|EGI15176.1| conserved hypothetical protein [Escherichia coli M605] Length = 559 Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 127/523 (24%), Positives = 236/523 (45%), Gaps = 51/523 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDKKK- 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR-EGRSLFQPVQFGNP 343 L +Q + +PP +A + K + L PG +I + + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPG--DITYIDQITGQDGFRPAYLVNP 343 Query: 344 LPYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQ 399 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 344 STADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLN 403 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASAL 455 E + +I R ++ + LP PP ++ LKVEY S + + Q++ ++S Sbjct: 404 DECLNPLIDRAFSMMVRKNMLPP------PPDAMEGMPLKVEYISVMAQAQKSIGLSSLA 457 Query: 456 QGVNTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLI 497 VN + +L P +D ++ D+ + F+ + +P V++ Sbjct: 458 STVNFIGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIV 498 >gi|320175046|gb|EFW50159.1| putative tail protein [Shigella dysenteriae CDC 74-1112] Length = 559 Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 129/524 (24%), Positives = 236/524 (45%), Gaps = 53/524 (10%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 -FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNH 168 F E S L Y S+ + TG + D E+ IR + P+ + Y++ + Sbjct: 113 MFNE---SNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSP 164 Query: 169 QNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDKKK 227 + VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 165 RGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSK 224 Query: 228 -DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIR 283 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 225 LDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVK 284 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMN-IGALSREGRSLFQPVQFGN 342 L +Q + +PP +A + K + L PG + I ++ G+ F+P N Sbjct: 285 ALQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQIT--GQDGFRPAYLVN 342 Query: 343 PLPYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGL 398 P ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 343 PSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERL 402 Query: 399 QSEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASA 454 E + +I R ++ + LP PP ++ LKVEY S + + Q++ ++S Sbjct: 403 NDECLNPLIDRAFSMMVRKNMLPP------PPDAMEGMPLKVEYISVMAQAQKSIGLSSL 456 Query: 455 LQGVNTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLI 497 VN + +L P +D ++ D+ + F+ + +P V++ Sbjct: 457 ASTVNFIGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIV 498 >gi|323699782|ref|ZP_08111694.1| phage head-tail connector protein [Desulfovibrio sp. ND132] gi|323459714|gb|EGB15579.1| phage head-tail connector protein [Desulfovibrio desulfuricans ND132] Length = 579 Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 144/560 (25%), Positives = 238/560 (42%), Gaps = 53/560 (9%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN--------------AQLRMWDTTGS 51 A+ + RF+ L+ R +ELT ++ P KN+ R++D+T Sbjct: 7 ARSLLKRFSGLEEARRPWVSSWQELTEYMLPRKNSFAGPGGHTLGRGRAGDERIFDSTPL 66 Query: 52 EACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFR 111 A L+S L L+T P W ++ A K DA +VR + + + + Sbjct: 67 HALELLASSLGGLLTNPSLPWFDISVKDRA------KGDA--DEVRAFMQEARERMVAVF 118 Query: 112 ERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNV 171 +GF + Y V GT Y+EAD +R+ + PL V+++ + + Sbjct: 119 NSEDTGFQAHVHELYLDVALLGTAVMYVEADPTSV-----VRFSARPLGEVFVAESARGQ 173 Query: 172 VDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK-DKG 230 VD+VYR + T Q + +WG + R E E ++HAV+P+ D Sbjct: 174 VDTVYRRYEVTARQAIQEWGAACSDETRRKGEDRPE-EPVEVLHAVFPRMDRDPAGFGSA 232 Query: 231 NKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVN 290 + F S ++ V + EE PY+V R+ A E YGR P AL +R LN Sbjct: 233 HFPFASVYMEVKNSHVLEESGYLEMPYMVPRWAKAAGETYGRGPGQTALSDVRVLNAMAR 292 Query: 291 ELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEEL 350 PP + + PG ++ R PV + E + Sbjct: 293 TALMAAEKMSDPPLMVPDDGFLGPVRSGPGGLSYYRAGSTDRIEALPVNV-DLRAAEEMM 351 Query: 351 NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRE 410 N +ESI +FL D Q+ + + +A E++ + EK +GP++G LQ+EF+ +I R Sbjct: 352 NGRRESIGRIFLSD--QLAPEGPAVTATEAVIRQAEKMRVLGPVLGRLQTEFLSPLIRRV 409 Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQ---QAESVASALQGVNTVVELGVK 467 ++ G LP +P L+V YTS + + Q +A+ +A ++ ++ +V Sbjct: 410 FRVMLRGGALPPFPEGLSP--DDLEVRYTSSVTRAQKQYEAQGLAQVMEYLSPLVGGRDA 467 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQ 527 G MD+ DTDRV+R N P+ D + ED RV+E + +Q++ Sbjct: 468 FG---IMDNFDTDRVARHVAELFNIPS----DYLKSED---------RVVEGRTQKQRVA 511 Query: 528 QTSQDIGAKAAGRAMEKKLT 547 + Q A A+ K L+ Sbjct: 512 SSQQTASTVANAAAIAKTLS 531 >gi|294492610|gb|ADE91366.1| conserved hypothetical protein [Escherichia coli IHE3034] gi|323948685|gb|EGB44590.1| hypothetical protein ERKG_04908 [Escherichia coli H252] Length = 559 Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 127/524 (24%), Positives = 235/524 (44%), Gaps = 53/524 (10%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDKKK- 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR-EGRSLFQPVQFGNP 343 L +Q + +PP +A + K + L PG +I + + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPG--DITYIDQITGQDGFRPAYLVNP 343 Query: 344 LPYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQ 399 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 344 STADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLN 403 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSL-----LKVEYTSPLFKYQQAESVASA 454 E + +I R ++ + LP PP + LKVEY S + + Q++ ++S Sbjct: 404 DECLNPLIDRSFSMMVRKNMLP-------PPPDVMEGMPLKVEYISVMAQAQKSIGLSSL 456 Query: 455 LQGVNTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLI 497 VN + +L P +D ++ D+ + F+ + +P V++ Sbjct: 457 ASTVNFIGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIV 498 >gi|218700990|ref|YP_002408619.1| putative head-to-tail-joining protein [Escherichia coli IAI39] gi|218370976|emb|CAR18803.1| putative head-to-tail-joining protein [Escherichia coli IAI39] Length = 559 Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 127/523 (24%), Positives = 235/523 (44%), Gaps = 51/523 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 + S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NNSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDKKK- 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMN-IGALSREGRSLFQPVQFGNP 343 L +Q + +PP +A + K + L PG + I ++ G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQIT--GQDGFRPAYLVNP 343 Query: 344 LPYH--EELNRLKESIRSLFLLDLFQVLDDKASRS--AAESMEKTREKGAFVGPLIGGLQ 399 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 344 STADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLN 403 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASAL 455 E + +I R ++ + LP PP ++ LKVEY S + + Q++ ++S Sbjct: 404 DECLNPLIDRAFSMMVRKNMLPP------PPDAMEGMPLKVEYISVMAQAQKSIGLSSLA 457 Query: 456 QGVNTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLI 497 VN + +L P +D ++ D+ + F+ + +P V++ Sbjct: 458 STVNFIGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIV 498 >gi|298381718|ref|ZP_06991317.1| hypothetical protein ECFG_01455 [Escherichia coli FVEC1302] gi|298279160|gb|EFI20674.1| hypothetical protein ECFG_01455 [Escherichia coli FVEC1302] Length = 559 Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 127/523 (24%), Positives = 235/523 (44%), Gaps = 51/523 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 + S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NNSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDKKK- 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR-EGRSLFQPVQFGNP 343 L +Q + +PP +A + K + L PG +I + + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPG--DITYIDQITGQDGFRPAYLVNP 343 Query: 344 LPYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQ 399 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 344 STADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLN 403 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASAL 455 E + +I R ++ + LP PP ++ LKVEY S + + Q++ ++S Sbjct: 404 DECLNPLIDRAFSMMVRKNMLPP------PPDAMEGMPLKVEYISVMAQAQKSIGLSSLA 457 Query: 456 QGVNTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLI 497 VN + +L P +D ++ D+ + F+ + +P V++ Sbjct: 458 STVNFIGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIV 498 >gi|89152428|ref|YP_512261.1| putative head-to-tail-joining protein [Escherichia phage phiV10] gi|74055451|gb|AAZ95900.1| putative head-to-tail-joining protein [Escherichia phage phiV10] Length = 559 Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust. Identities = 130/525 (24%), Positives = 239/525 (45%), Gaps = 55/525 (10%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG A +D+ E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAM---AVLDDD--EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDKKK- 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR-EGRSLFQPVQFGNP 343 L +Q + +PP +A + K + L PG +I + + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPG--DITYIDQITGQDGFRPAYLVNP 343 Query: 344 LPYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQ 399 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 344 STADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLN 403 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSL-----LKVEYTSPLFKYQQAESVASA 454 E + +I R ++ + LP PP + LKVEY S + + Q++ ++S Sbjct: 404 DECLNPLIDRSFSMMVRKNMLP-------PPPDVMEGMPLKVEYISVMAQAQKSIGLSSL 456 Query: 455 LQGVNTVVELG-VKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLI 497 VN + +L VK P +D ++ D+ + F+ + +P V++ Sbjct: 457 ASTVNFIGQLAQVK---PEALDKLNVDQAIDAFADMSGVSPTVIV 498 >gi|30387383|ref|NP_848212.1| hypothetical protein epsilon15p04 [Enterobacteria phage epsilon15] gi|30266038|gb|AAO06067.1| 4 [Salmonella phage epsilon15] Length = 556 Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 136/528 (25%), Positives = 236/528 (44%), Gaps = 66/528 (12%) Query: 16 LKNQRGEL-NYWMEELTGFLYPY-----------KNNAQLRMWDTTGSEACIKLSSLLSS 63 LKN+R ++W++ L+ F+ P + ++ D TGS A LSS + S Sbjct: 16 LKNERTSFESHWLD-LSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMS 74 Query: 64 LITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV---TDTLFGFRERSRSGFVG 120 IT P + W LA + V+ W + V + +F ++S Sbjct: 75 GITSPARPWFKLATPDPDMMDY--------GPVKIWLEVVQRRMNEVF-----NKSNLYQ 121 Query: 121 CLQSFYTSVVEFGTGCF-YMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREF 179 L Y S+ FGTG ME D D IR + P+ + Y++ + + VD+ R+F Sbjct: 122 SLPVMYASLGTFGTGAMAVMEDDQDV------IRTMPFPIGSYYLANSPRGSVDTCIRQF 175 Query: 180 TFTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDK-KKDKGNKGFHSK 237 + TV Q+V ++G +S+ +K E + + H + P D K D NK + S Sbjct: 176 SMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITPNVNRDSGKMDSKNKPYRSV 235 Query: 238 FVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRRLNETVNELAQ 294 + D ++ E FP + R+ V +++Y S P M AL ++ L AQ Sbjct: 236 YFESGGDSDKLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQ 295 Query: 295 FGRLSLHPPTIAVSEAKQRNFDLKPG---YMNIGALSREGRSLFQPVQFGNP--LPYHEE 349 + +PP +A + K + L PG Y+++ + G+ F+P NP + Sbjct: 296 LIDKATNPPMVAPTSLKNQRVSLLPGDVTYLDVIS----GQDGFKPAYLVNPNTADLLAD 351 Query: 350 LNRLKESIRSLFLLDLFQVLDDKASRS--AAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 + +++I S + +DLF +L + +RS +E EK +GP++ L E + +I Sbjct: 352 IQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLI 411 Query: 408 SRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASALQGVNTVVE 463 R I+ + LPE PP L L++EY S + + Q++ + S Q V + + Sbjct: 412 DRVFSIMARKNMLPE------PPDVLQGMPLRIEYISVMAQAQKSIGLTSLSQTVGFIGQ 465 Query: 464 LGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLIRDTAEVEDIRQQR 510 L P +D +D D+ + FS + +P V++ +V+ IR++R Sbjct: 466 LA--QFKPEALDKLDVDQAIDAFSEMSGVSPTVIV-PQEQVQGIREER 510 >gi|300898427|ref|ZP_07116768.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357894|gb|EFJ73764.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 559 Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 126/524 (24%), Positives = 234/524 (44%), Gaps = 53/524 (10%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDKKK- 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEFGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR-EGRSLFQPVQFGNP 343 L +Q + +PP +A + K + L PG +I + + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPG--DITYIDQITGQDGFRPAYLVNP 343 Query: 344 LPYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQ 399 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 344 STADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLN 403 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSL-----LKVEYTSPLFKYQQAESVASA 454 E + +I R ++ + LP PP + LKVEY S + + Q++ ++S Sbjct: 404 DECLNPLIDRAFSMMVRKNMLP-------PPPDVMEGMPLKVEYISVMAQAQKSIGLSSL 456 Query: 455 LQGVNTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLI 497 VN + +L P +D ++ D+ + F+ + +P V++ Sbjct: 457 ASTVNFIGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIV 498 >gi|78357592|ref|YP_389041.1| hypothetical protein Dde_2550 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219997|gb|ABB39346.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 549 Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 137/535 (25%), Positives = 235/535 (43%), Gaps = 72/535 (13%) Query: 15 YLKNQRGELNY-WMEE---LTGFLY-----------PYKNNAQLRMWDTTGSEACIKLSS 59 Y+++QRGE + W E +TG Y P Q R+ D T + A L++ Sbjct: 15 YIESQRGEWDSRWREVADYVTGAGYGGGSWQEGTARPEGRRGQ-RIIDATATRALRVLAA 73 Query: 60 LLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFV 119 L +TPP + W L + S +VR W D V L+ + S F Sbjct: 74 GLQGGLTPPARPWFRLRLADRGLM--------ESAEVRRWLDDVEAALYA--ALAGSNFY 123 Query: 120 GCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREF 179 + +T++ +G+ YMEAD + +R+ VP + + + VD+V R F Sbjct: 124 QNSHALFTALAAYGSADMYMEADP-----QRVMRFCVVPHGDFAWACDAAGRVDTVVRRF 178 Query: 180 TFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK-KDKGNKGFHS-K 237 + T Q K+G LS ++ A ++ V P++ D + +D NK + S Sbjct: 179 SMTAAQAAQKYGSDRLSRTVRRLAAVQPYAPVALVQLVRPRARRDPRRQDSLNKPYESLT 238 Query: 238 FVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGR 297 + + + R A FP++ R+ V ++YG SP M+ LP ++ L E+A+ Sbjct: 239 WEAQEPRRLLHVSGYAEFPHLCARWEVNGGQLYGHSPVMDVLPDVKML----QEMARSQL 294 Query: 298 LSLH----PPTIAVSEAKQRNFDLKPG---YMNIG---ALSR--EGRSLFQPVQFGNPLP 345 L++H PP + KQR +L PG Y+N ALS + R Q V + Sbjct: 295 LAVHKVVNPPMRVPTGFKQR-LNLIPGAQNYVNPAQPDALSPLYQIRPDIQAVTY----- 348 Query: 346 YHEELNRLKESIRSLFLLDLFQVL--DDKASRSAAESMEKTREKGAFVGPLIGGLQSEFI 403 ++ ++ SIR ++F + + +++ +AAE ME+++EK +GP++ Q++ + Sbjct: 349 ---KIEDVRRSIREGLFTEMFLLFAGESRSNVTAAEIMERSQEKLLLLGPVVERHQTDIL 405 Query: 404 GAMISRELDILDSQGNLPEC----EGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVN 459 +I R +L G LP G D LKVEY S L + Q+ + Q Sbjct: 406 DPLIGRAFGLLARAGRLPPAPDVLAGRD------LKVEYVSALAQAQRLSAAQGVRQLAG 459 Query: 460 TVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQR 514 V P +D +D D+ PA ++R +V+ +R++R +++ Sbjct: 460 DVSRFAAMA--PEVLDKIDFDQAVDELASIAGAPAGIVRSDEDVQLLRRERALKQ 512 >gi|332344354|gb|AEE57688.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 559 Score = 133 bits (335), Expect = 6e-29, Method: Compositional matrix adjust. Identities = 126/523 (24%), Positives = 234/523 (44%), Gaps = 51/523 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + + + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFTIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDKKK- 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR-EGRSLFQPVQFGNP 343 L +Q + +PP +A K + L PG +I + + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPISLKNQRASLLPG--DITYIDQITGQDGFRPAYLVNP 343 Query: 344 LPYH--EELNRLKESIRSLFLLDLFQVLDDKASRS--AAESMEKTREKGAFVGPLIGGLQ 399 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 344 STADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLN 403 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASAL 455 E + +I R ++ + LP PP ++ LKVEY S + + Q++ ++S Sbjct: 404 DECLNPLIDRAFSMMVRKNMLPP------PPDAMEGMPLKVEYISVMAQAQKSIGLSSLA 457 Query: 456 QGVNTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLI 497 VN + +L P +D ++ D+ + F+ + +P V++ Sbjct: 458 STVNFIGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIV 498 >gi|215487822|ref|YP_002330253.1| predicted phage head-tail connector protein [Escherichia coli O127:H6 str. E2348/69] gi|215265894|emb|CAS10303.1| predicted phage head-tail connector protein [Escherichia coli O127:H6 str. E2348/69] Length = 556 Score = 133 bits (335), Expect = 6e-29, Method: Compositional matrix adjust. Identities = 134/527 (25%), Positives = 230/527 (43%), Gaps = 64/527 (12%) Query: 16 LKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 LKN+R +L+ F+ P + ++ D TGS A LSS + S Sbjct: 16 LKNERTSFESHWRDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMSG 75 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV---TDTLFGFRERSRSGFVGC 121 IT P + W LA + V+ W + V + +F ++S Sbjct: 76 ITSPARPWFKLATPDPDMMDY--------GPVKIWLEVVQRRMNEVF-----NKSNLYQS 122 Query: 122 LQSFYTSVVEFGTGCF-YMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 L Y S+ FGTG +E D D IR + P+ + Y++ + + VD+ R+F+ Sbjct: 123 LPVMYASLGTFGTGAMAVLEDDQDV------IRTMPFPIGSYYLANSPRGSVDTCIRQFS 176 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTII-HAVYPKSLTDK-KKDKGNKGFHSKF 238 TV Q+V ++G +S+ +K E + + H + P D K D NK + S + Sbjct: 177 MTVRQMVQEFGLDNVSTSVKGMWENGTYETWVKVNHCITPNVNRDSGKMDSKNKPYRSVY 236 Query: 239 VSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRRLNETVNELAQF 295 D ++ E FP + R+ V +++Y S P M AL ++ L AQ Sbjct: 237 FESGGDSDKLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQL 296 Query: 296 GRLSLHPPTIAVSEAKQRNFDLKPG---YMNIGALSREGRSLFQPVQFGNP--LPYHEEL 350 + +PP +A + K + L PG Y+++ G+ F+P NP ++ Sbjct: 297 IDKATNPPMVAPTSLKNQRVSLLPGDVTYLDV----LTGQDGFKPAYLVNPNTADLLADI 352 Query: 351 NRLKESIRSLFLLDLFQVLDDKASRS--AAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 +++I S + +DLF +L +RS +E EK +GP++ L E + +I Sbjct: 353 QDTRQTINSAYFVDLFMMLQKINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLID 412 Query: 409 RELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASALQGVNTVVEL 464 R I+ + LPE PP L L++EY S + + Q++ + S Q V + +L Sbjct: 413 RVFSIMARKNMLPE------PPDVLQGMPLRIEYISVMAQAQKSIGLTSLSQTVGFIGQL 466 Query: 465 GVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLIRDTAEVEDIRQQR 510 P +D +D D+ + FS + +P V++ +V+ IR++R Sbjct: 467 A--QFKPEALDKLDVDQAIDAFSEMSGVSPTVIV-PQEQVQGIREER 510 >gi|118590948|ref|ZP_01548348.1| hypothetical protein SIAM614_19846 [Stappia aggregata IAM 12614] gi|118436470|gb|EAV43111.1| hypothetical protein SIAM614_19846 [Stappia aggregata IAM 12614] Length = 567 Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 129/481 (26%), Positives = 213/481 (44%), Gaps = 39/481 (8%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLA--ESFSAYQAFLYKEDARSKKVREWCD 101 +++D T +L+S + SL P G WHG+ + F+ A S+ E+ + Sbjct: 66 KLYDPTAVWLLDRLASGIGSLTMPEGFPWHGVGFGDPFAP---------APSQADEEFFE 116 Query: 102 QVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGC-FYMEADVDEKGLEEGIRYISVPLS 160 V D LF R RSGF +S S V+ GTG F +E + + + Y VPL Sbjct: 117 LVRDHLFRVRYSGRSGFALANRSRLLSTVKLGTGVLFPVENEDSLADIRTPVHYRYVPLY 176 Query: 161 NVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMK--SALARNENERFTIIHAVY 218 +Y+ ++ Q +R T Q V ++ KV S K+K +A A+ +N +T +HA + Sbjct: 177 EIYLVIDAQGNDCGFFRVRTLKAWQAVKEYAGKV-SPKVKEDAADAKRKNTDYTFVHACF 235 Query: 219 PKS-----LTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRS 273 + TD +K + F S D +P ++ R+ YG Sbjct: 236 LREGGHAQATDTRKSR----FESIHFEEDSGHICRRGGFFEYPLVISRWDRDGLSPYGSP 291 Query: 274 PAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRS 333 P + + I+ L + ++ PP + A++R DL PG +N G + +GR Sbjct: 292 PQAKLMSDIKSLQSLARDGLIASSQAVRPPI--ATHAQERQLDLNPGRINPGLIDEQGRP 349 Query: 334 LFQP-VQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVG 392 LF+P + NP ++ ++E +R DL+Q L + R+A E+ + +E +G Sbjct: 350 LFRPMIDTVNPGAADAQIETIREKLRVGLYGDLWQTLLEGNGRTATEANIRRKEMADMIG 409 Query: 393 PLIGGLQSEFIGAMISRELDILDSQGNL-PECEGADNPPVSLLKVEYT----SPLFKYQQ 447 P + + A+ RE+ IL +G P A PP S+L+ + T +P+ + ++ Sbjct: 410 PFSTNIMAGN-EALFEREIGILGRRGAFAPGSPLA--PPQSVLEGDVTLTPTAPIDQMRE 466 Query: 448 AESVASALQGVNTVVELGVKTG-DPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDI 506 A A+ G LG+ G DPS +D D + + A PA L R EVE + Sbjct: 467 AGHF-EAIMGFQEY--LGIAAGADPSILDLHDREAEYDLTRRALGLPAKLRRRPEEVEAL 523 Query: 507 R 507 R Sbjct: 524 R 524 >gi|304398403|ref|ZP_07380277.1| phage head-tail connector protein [Pantoea sp. aB] gi|304354269|gb|EFM18642.1| phage head-tail connector protein [Pantoea sp. aB] Length = 553 Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 137/527 (25%), Positives = 237/527 (44%), Gaps = 57/527 (10%) Query: 9 IQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTTGSEACIKL 57 + + LK++R + +L+ ++ P N + D T + A L Sbjct: 10 LNKQLGLLKSERTTFDPHWRDLSDYISPRSSRFLVSDANRDNRRNTNIVDPTCTLAERTL 69 Query: 58 SSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV---TDTLFGFRERS 114 SS + S IT P + W L+ S A + + V+ W + V + +F + Sbjct: 70 SSGMMSGITSPARPWFTLSVSDPAMKDY--------GPVKVWLEDVQRRMNEVF-----N 116 Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDS 174 +S L Y + +GT + D E+ IR P+ + Y+S + + VD+ Sbjct: 117 KSNLYQSLPIVYAQLGTYGTAAMAILEDD-----EDIIRTYPFPIGSYYVSNSARLSVDT 171 Query: 175 VYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPK-SLTDKKKDKGNK 232 VYREF T Q+V ++G +S +K A E + +IHAVYP S K D NK Sbjct: 172 VYREFRMTTRQLVEQFGLDNVSETVKGQWATQNTESWHDVIHAVYPNVSRQTGKMDAKNK 231 Query: 233 GFHSK-FVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRRLNETVN 290 + S F +++ E FP + R+ V ++ YG + P M AL ++ L Sbjct: 232 RYKSVYFEKAGDDKVLRESGFDEFPILAPRWEVNGEDAYGSNCPGMTALGQVKALQLEQK 291 Query: 291 ELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMN-IGALSREGRSLFQPVQFGNPLPYHEE 349 +Q + +PP + S K + PG + + L+ G+ +P+ NP + Sbjct: 292 RKSQLIDKATNPPMVGPSSLKTQRVSQLPGAVTYVDQLT--GQDGLKPLYMVNP-NTADL 348 Query: 350 LNRLKES---IRSLFLLDLFQVLDDKASRS-AAESMEKTR-EKGAFVGPLIGGLQSEFIG 404 LN ++++ IRS + +DLF +L + +RS E++ + R EK +GP++ L EF+ Sbjct: 349 LNDIQDTRDIIRSAYFVDLFLMLQNINTRSMPVEAVNELREEKLLMLGPVLERLNDEFLD 408 Query: 405 AMISRELDILDSQGNLPECEGADNPPV---SLLKVEYTSPLFKYQQAESVASALQGVNTV 461 +I R I+ +G LP P V + L++EY S + + Q++ V S + V V Sbjct: 409 PLIDRAFAIMQRKGMLPPA-----PEVLQGTALRIEYISVMAQAQKSIGVNSMERFVGFV 463 Query: 462 VELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLIRDTAEVEDIR 507 G+ P +D +D D+ + + +P+V++ D EV+ IR Sbjct: 464 G--GMAQAKPEALDKLDIDKIIDSYGDSIGVSPSVIVPD-EEVQKIR 507 >gi|187736539|ref|YP_001878651.1| hypothetical protein Amuc_2060 [Akkermansia muciniphila ATCC BAA-835] gi|187426591|gb|ACD05870.1| hypothetical protein Amuc_2060 [Akkermansia muciniphila ATCC BAA-835] Length = 544 Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 137/552 (24%), Positives = 229/552 (41%), Gaps = 80/552 (14%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-----------AQLRMWDTT 49 M +R+A ++ + L QR W + L ++ P + N A RM DTT Sbjct: 1 MEERTA-ELNSVYKSLAAQRAPWETWWDRLRDYVLPRRLNREGEVSLPNRDAMDRMTDTT 59 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 EAC KL+S S ITP W +SA +D + W +Q ++ Sbjct: 60 AVEACQKLASGHMSYITPSHDVWF----KWSA------PDDRGGDEAEAWYNQCSE--IA 107 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 +E S S F + + V GTG + D + L + ++P + N + Sbjct: 108 LKELSVSNFYTEIHECFLDRVALGTGSLFTGTSSDGRLL-----FTNIPCGQFACAENAE 162 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT---IIHAVYPKSLTDKK 226 VD+ REFT+T Q S +G K L K + L R N T +H V P++ ++ Sbjct: 163 GRVDTYVREFTYTAHQARSMFGVKALGPKAREVLERGGNPYATTLRFLHVVRPRTRRSRR 222 Query: 227 KDKGNK-GFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIR-- 283 +++ + F S ++S+D+ EE FPY+V R+ YG +P P I+ Sbjct: 223 REQASHMPFESVYLSLDDQVIVEEGGYMEFPYLVTRFLKWGSGPYGLAPGRLVFPAIQQV 282 Query: 284 ----RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQ 339 R+ +T+ E+A F P I + DL+ G + ++ E SL P + Sbjct: 283 QFLNRILDTLGEVAAF-------PRILELANQIGEVDLRAGGRTV--ITPEAASLHLPRE 333 Query: 340 FGNPLPYHEELNRL---KESIRSLFLLDLFQVLD-DKASRSAAESMEKTREKGAFVGPLI 395 + Y ++RL +++IR + L + ++ + + +A E M + E+ P Sbjct: 334 WATQGKYDVGMDRLAQKQDAIRRAYYLPMLELWSGHRGNMTATEVMARENERVLMFSPSF 393 Query: 396 GGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLK------VEYTSPLFKYQ--- 446 S+ M +R +L G P PP ++L+ V P YQ Sbjct: 394 TLFVSDLYSTM-TRIFSLLFRMGKFP------RPPRAVLRVGRDGSVAVGEPRVVYQSKI 446 Query: 447 -------QAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRD 499 Q+E + +LQ +N +++ P DH+D D R S P ++R Sbjct: 447 ALVLRRLQSEGMDRSLQRLNMMMQAA-----PDLADHVDWDHCFRLSARVDGAPESMLRP 501 Query: 500 TAEVEDIRQQRE 511 A+V +R++RE Sbjct: 502 WADVRAMRKERE 513 >gi|212703348|ref|ZP_03311476.1| hypothetical protein DESPIG_01391 [Desulfovibrio piger ATCC 29098] gi|212673194|gb|EEB33677.1| hypothetical protein DESPIG_01391 [Desulfovibrio piger ATCC 29098] Length = 611 Score = 130 bits (327), Expect = 6e-28, Method: Compositional matrix adjust. Identities = 125/507 (24%), Positives = 221/507 (43%), Gaps = 53/507 (10%) Query: 47 DTTGSEACIKLSSLLSSLITPPGQKWHGLA------ESFSAYQAFLYKEDARSKKVREWC 100 D TG A L++ L +T P + W LA A Q +L + +AR + V + C Sbjct: 89 DATGILAMRTLAAGLQGGMTSPARPWFRLALDDPDLSRSHAGQRYLDEVEARMRVVLQRC 148 Query: 101 DQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLS 160 + F + + Y + FGT + AD L G R++ + Sbjct: 149 N----------------FYNAMHTIYAELGTFGTAFVFELAD-----LRHGFRFVPLCAG 187 Query: 161 NVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPK 220 + + VD+V+ ++ Q+V +G + L ++ A R ++R +IHAV P+ Sbjct: 188 QYVLDTDAARRVDTVFHRMHMSLRQMVQSFGPEALPENLRLAARRTPDQRHAVIHAVLPR 247 Query: 221 SLTDKKKDKGNKGFHSKFVSV--DENR-----FFEEKQIATFPYIVGRYRVRADEIYGRS 273 + +++ + H + SV E R +E FP R+ V A+++YGRS Sbjct: 248 T---ERRPRLAGPCHMPWASVYWLEGREGQVVPLKESGFMGFPGFGPRWDVAANDVYGRS 304 Query: 274 PAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMN-IGALSREGR 332 PAM+ALP R L + + ++ PP + + DL PG +N + +L + + Sbjct: 305 PAMDALPDCRMLQQMGITTLKAIHKAVDPPMSVHAGLRSVGLDLTPGGINFVDSLPGQNQ 364 Query: 333 SLFQPVQFGNP--LPYHEELNRLKESIRSLFLLDLFQ-VLDDKASRSAAESMEKTREKGA 389 + P+ P + +++ IR+ DLF+ +L+ ++ +A+E + EK Sbjct: 365 PVATPLLQVKPDLAQARSAMEAVQQQIRAGLYNDLFRLILEGRSKVTASEIAAREEEKLL 424 Query: 390 FVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVS--LLKVEYTSPLFKYQQ 447 +GP++ L E + +I R ++ + LP C P +S LKVE+ S L + Q+ Sbjct: 425 LIGPVLERLHDELLIPLIDRTFRLMLALDMLPPCP----PELSGRHLKVEFVSLLAQAQK 480 Query: 448 AESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIR 507 +++ Q + + L + P +D +D D + + P L R E +R Sbjct: 481 LVGISATDQYL--ALTLKAASAWPEALDSVDVDNLLDNYAESLGLPVNLTRPREERARLR 538 Query: 508 QQREVQRRVMEEQHLQQQLQQTSQDIG 534 RE R+ EQ L L Q + D+G Sbjct: 539 AGREEARQT--EQQL--ALLQKAADLG 561 >gi|187476929|ref|YP_784953.1| phage head-tail connector protein [Bordetella avium 197N] gi|115421515|emb|CAJ48024.1| Putative phage head-tail connector protein [Bordetella avium 197N] Length = 555 Score = 130 bits (326), Expect = 8e-28, Method: Compositional matrix adjust. Identities = 144/560 (25%), Positives = 252/560 (45%), Gaps = 55/560 (9%) Query: 3 QRSAKDIQDRFNYLKNQRGE-LNYWMEELTGFLYPY--------KNNAQLR---MWDTTG 50 Q K + R+ LK +R +++W +E++ +L P +N R + D TG Sbjct: 4 QTERKLLLSRWGQLKAERESWISHW-KEISDYLLPRSGRFFINDRNRGGKRHNNILDNTG 62 Query: 51 SEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 + A L++ + + +T P + W L S E S V+ W VT + Sbjct: 63 TRALRVLAAGMMAGMTSPARPWFRLTTSIP--------ELDESAAVKAWLANVTRLMLMV 114 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQN 170 +S + L S Y + FGT + D ++ IR+ ++ ++ ++Q Sbjct: 115 FAKSNT--YRALHSTYEELGLFGTASSIVLPD-----FKDVIRHHTLSAGEYAIAADNQG 167 Query: 171 VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTD-KKKD 228 VD++YREF TV Q+V ++G S+ +++ R E++ T+IHA+ P++ D K+D Sbjct: 168 RVDTLYREFQITVAQMVREFGKDKCSTTVRNLFDRGALEQWVTVIHAIEPRADRDPNKRD 227 Query: 229 KGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLN 286 N + S +V + DE R E +F + R+ + +IYG SPAMEAL +R+L Sbjct: 228 DRNMAWKSVYVELGADETRTLRESGYRSFRALCPRWALAGGDIYGNSPAMEALGDVRQLQ 287 Query: 287 ETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPG---YMNIGALSREGRSLFQ-PVQFGN 342 AQ +PP AK ++ PG Y+++ A + R+ F+ + + Sbjct: 288 HEQLRKAQGIDYKSNPPLQLPVSAKNQDISTVPGGLSYVDVAAPNGGIRTAFEVNLDLSH 347 Query: 343 PLPYHEELNRLKESIRSLFLLDLFQVLDDKAS--RSAAESMEKTREKGAFVGPLIGGLQS 400 L ++ ++E I++ F DLF +L + + +A E E+ EK +GP++ + + Sbjct: 348 LL---ADIVDVRERIKASFYADLFLMLANGTNPKMTATEVAERHEEKLLMLGPVLERMHN 404 Query: 401 EFIGAMISRELDILDSQGNLP----ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQ 456 E + +I + LP E +G D L VE+ S L + Q+A + S + Sbjct: 405 EILDPLIELTFQRMVEANILPPPPQEMQGVD------LNVEFVSMLAQAQRAIATNSVDR 458 Query: 457 GVNTVVELGVKTG-DPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRR 515 V LGV P +D + DR + LI +V IR+QR Q++ Sbjct: 459 FVGN---LGVVAKIKPEVLDKFNADRWADTYADMLGIDPELIVPGNQVALIRKQRAEQQQ 515 Query: 516 VMEEQHLQQQLQQTSQDIGA 535 ++ L Q T+ +G+ Sbjct: 516 AAQQAALLNQGADTAAKLGS 535 >gi|41179382|ref|NP_958690.1| Bbp21 [Bordetella phage BPP-1] gi|45569514|ref|NP_996583.1| hypothetical protein BMP-1p20 [Bordetella phage BMP-1] gi|45580765|ref|NP_996631.1| hypothetical protein BIP-1p20 [Bordetella phage BIP-1] gi|40950121|gb|AAR97687.1| Bbp21 [Bordetella phage BPP-1] Length = 555 Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 143/560 (25%), Positives = 248/560 (44%), Gaps = 55/560 (9%) Query: 3 QRSAKDIQDRFNYLKNQRGE-LNYWMEELTGFLYPY--------KNNAQLR---MWDTTG 50 Q K + R+ L+ +R +++W +E++ +L P +N + R + D TG Sbjct: 4 QTERKLLLSRWGQLRTERESWMSHW-KEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTG 62 Query: 51 SEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 + A L++ + + +T P + W L S E S V+ W VT + Sbjct: 63 TRALRVLAAGMMAGMTSPARPWFRLTTSIP--------ELDESAAVKAWLANVTRLMLMI 114 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQN 170 +S + L S Y + FGT + D D + + S+ ++ ++Q Sbjct: 115 FAKSNT--YRALHSMYEELGAFGTASSIVLPDFDAV-----VYHHSLTAGEYAIAADNQG 167 Query: 171 VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTD-KKKD 228 V+++YREF TV Q+V ++G S+ ++S R E++ T+IHA+ P++ D K+D Sbjct: 168 RVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRD 227 Query: 229 KGNKGFHSKFV--SVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLN 286 N + S + DE R E +F + R+ + +IYG SPAMEAL +R+L Sbjct: 228 DRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQ 287 Query: 287 ETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPG---YMNIGALSREGRSLFQ-PVQFGN 342 AQ +PP AK ++ PG Y++ A + R+ F+ + + Sbjct: 288 HEQLRKAQAIDYKSNPPLQLPVSAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSH 347 Query: 343 PLPYHEELNRLKESIRSLFLLDLFQVLDDKAS--RSAAESMEKTREKGAFVGPLIGGLQS 400 L ++ ++E I++ F DLF +L + + +A E E+ EK +GP++ + + Sbjct: 348 LL---ADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHN 404 Query: 401 EFIGAMISRELDILDSQGNLP----ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQ 456 E + +I + LP E +G D L VE+ S L + Q+A + S + Sbjct: 405 EILDPLIELTFQRMVEANILPPPPQEMQGVD------LNVEFVSMLAQAQRAIATNSVDR 458 Query: 457 GVNTVVELGVKTG-DPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRR 515 V LG G P +D D DR + LI +V IR+QR Q++ Sbjct: 459 FVGN---LGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQ 515 Query: 516 VMEEQHLQQQLQQTSQDIGA 535 ++ L Q T+ +G+ Sbjct: 516 AAQQAALLNQGADTAAKLGS 535 >gi|220903991|ref|YP_002479303.1| hypothetical protein Ddes_0717 [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] gi|219868290|gb|ACL48625.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] Length = 597 Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 121/477 (25%), Positives = 201/477 (42%), Gaps = 45/477 (9%) Query: 47 DTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKED-ARSKKVREWCDQVTD 105 D TG A L++ L +T P + W ++ L D ARS+ + W D+V Sbjct: 59 DATGILAMRTLAAGLQGGLTSPARPW---------FRLGLDDADLARSRPGQAWLDEVA- 108 Query: 106 TLFGFRERS---RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNV 162 R RS R F + + Y + FGT + AD +G R++ + Sbjct: 109 ----ARMRSVFHRCNFYNAMHTLYAELATFGTAFVFELADP-----RDGFRFMPLCAGEY 159 Query: 163 YMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSL 222 + + VD+V+R + ++ QIV +G L ++ A+ RN +ER +I AVYP+ Sbjct: 160 VLDCDAGRRVDTVFRRSSMSLRQIVQTFGPAALPESLREAVRRNADERRNVIQAVYPR-- 217 Query: 223 TDKKKDKGNKGFHSKFVSV-------DENRFFEEKQIATFPYIVGRYRVRADEIYGRSPA 275 D + H SV E FP R+ V +++YGRSPA Sbjct: 218 -DDRIHGILTASHMPVASVYWLEGRDGGEHALRESGFRHFPGFGPRWDVAGNDVYGRSPA 276 Query: 276 MEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMN-IGALSREGRSL 334 M+ALP R L + + ++ PP + + DL PG +N + + + Sbjct: 277 MDALPDCRMLQQMGITTLKAIHKAVDPPMSVSAGLRSVGLDLTPGGINYVDSAPGQSPQA 336 Query: 335 FQPVQFGNP--LPYHEELNRLKESIRSLFLLDLFQ-VLDDKASRSAAESMEKTREKGAFV 391 P+ NP + ++ IRS DLF+ +L+ ++ +A+E + EK + Sbjct: 337 ATPLLQVNPDLSTARRAMESVQNQIRSGLYNDLFKLILEGRSGVTASEIAAREEEKLVLI 396 Query: 392 GPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVS--LLKVEYTSPLFKYQQAE 449 GP++ L E ++ R + + LP C P +S LKVE+ S L + Q+ Sbjct: 397 GPVLERLHDELFIPLMDRTFECMRELDMLPPCP----PELSGRRLKVEFVSLLAQAQKLV 452 Query: 450 SVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDI 506 V++A Q + + L T P +D ++ D + + P L R E E + Sbjct: 453 GVSAADQYL--ALTLRASTAWPEALDTLNVDHLLDNYADSLGLPISLTRPPEEREQM 507 >gi|212710818|ref|ZP_03318946.1| hypothetical protein PROVALCAL_01886 [Providencia alcalifaciens DSM 30120] gi|212686515|gb|EEB46043.1| hypothetical protein PROVALCAL_01886 [Providencia alcalifaciens DSM 30120] Length = 550 Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 130/507 (25%), Positives = 223/507 (43%), Gaps = 62/507 (12%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTTGSEACI 55 +D+ + + LKN+R +EL + P + ++ D +++ Sbjct: 5 QDLLKQLSQLKNERQSFEPHWKELAEYTRPRSTRFSTSEVNRGDRRNTKIIDQEAAKSER 64 Query: 56 KLSSLLSSLITPPGQKWHGLAE------SFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 LSS + S IT P +KW LA ++S + +L E +Q + +F Sbjct: 65 TLSSGMMSGITSPARKWFRLATPDPDMMNYSPVKMWL-----------EVVEQRMNEVF- 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 +RS L Y+ + F T + D E IR + P+ + Y++ Sbjct: 113 ----NRSNIYQSLPQTYSDIGTFATSALAVLEDN-----ERVIRTVPFPIGSYYIANGPD 163 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSAL-ARNENERFTIIHAVYPK-SLTDKKK 227 VD+ +REF+ TV Q+V ++G +S ++KS + N ++ T+IH+VYP + K Sbjct: 164 LTVDTCFREFSMTVRQLVMEFGLDNVSEQVKSMWDSGNYSQWITVIHSVYPNLNRISGKL 223 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + + D +R E FP + R+ V +++YG S P M AL +++ Sbjct: 224 DAKNKLFKSVYFEIGGDSDRVLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMIALGSVKA 283 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPG---YMNIGALSREGRSLFQPVQFG 341 L AQ +PP A + K + L PG Y+ + + + +FQ Sbjct: 284 LQLLQRRKAQQIDKVTNPPMQAPASIKNQRISLVPGGITYLPMAGADQMIKPIFQVQADI 343 Query: 342 NPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRS--AAESMEKTREKGAFVGPLIGGLQ 399 N L ++ + I+ + DLF +L + +RS +E EK +GP++ L Sbjct: 344 NGL--IADIGDTRNQIKEAYFSDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLQRLD 401 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASAL 455 SE + +I+R I+ + LP PP + LKVEY S + + Q++ V S Sbjct: 402 SELLDKLINRTFAIMARKNLLPV------PPEEMQGMQLKVEYISVMAQAQKSVGVNSVE 455 Query: 456 QGVNTVVELGVKTGDPSCMDHMDTDRV 482 + V V G+ P +D ++TD + Sbjct: 456 RFVGFVG--GLAKLKPEALDKLNTDEI 480 >gi|268589375|ref|ZP_06123596.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] gi|291315402|gb|EFE55855.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] Length = 550 Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust. Identities = 131/507 (25%), Positives = 226/507 (44%), Gaps = 62/507 (12%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPYK---NNAQL--------RMWDTTGSEACI 55 +D+ + + LKN+R +EL + P N +++ ++ D +++ Sbjct: 5 QDLLKQLSQLKNERQSFEPHWKELAEYTRPRSTRFNTSEVNRGDRRNTKIIDQEAAKSER 64 Query: 56 KLSSLLSSLITPPGQKWHGLAE------SFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 LSS + S IT P +KW LA ++S + +L E +Q + +F Sbjct: 65 TLSSGMMSGITSPARKWFRLATPDPDMMNYSPVKMWL-----------EVVEQRMNEVF- 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 +RS L Y+ + F T + D E IR + P+ + Y++ Sbjct: 113 ----NRSNIYQSLPQTYSDIGTFATSALAVLEDN-----ERVIRTVPFPIGSYYIANGPD 163 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSAL-ARNENERFTIIHAVYPK-SLTDKKK 227 VD+ +REF+ TV Q+V ++G +S ++KS + N ++ T+IH+VYP + K Sbjct: 164 LTVDTCFREFSMTVRQLVMEFGLDKVSEQVKSLWDSGNYSQWITVIHSVYPNLNRISGKL 223 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + + D R E FP + R+ V +++YG S P M AL +++ Sbjct: 224 DAKNKLFKSVYFEMGGDSERVLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMIALGSVKA 283 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPG---YMNIGALSREGRSLFQPVQFG 341 L AQ +PP A + K + L PG Y+ + + + +FQ Sbjct: 284 LQLLQRRKAQQIDKVTNPPMQAPASIKNQRISLVPGGITYLPMAGADQMIKPIFQVQADI 343 Query: 342 NPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRS--AAESMEKTREKGAFVGPLIGGLQ 399 N L ++ + I+ + DLF +L + +RS +E EK +GP++ L Sbjct: 344 NGL--IADIGDTRNQIKEAYFSDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLQRLD 401 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASAL 455 SE + +I+R I+ + LP PP + LKVEY S + + Q++ V+S Sbjct: 402 SELLDKLINRTFAIMARKNLLPV------PPEEMQGMQLKVEYISVMAQAQKSVGVSSIE 455 Query: 456 QGVNTVVELGVKTGDPSCMDHMDTDRV 482 + V V G+ P +D ++TD + Sbjct: 456 RFVGFVG--GLAQMKPEALDKLNTDEM 480 >gi|309702812|emb|CBJ02143.1| putative phage protein [Escherichia coli ETEC H10407] Length = 559 Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 126/514 (24%), Positives = 223/514 (43%), Gaps = 63/514 (12%) Query: 16 LKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 LK++R +L+ F+ P + ++ D TGS A LSS + S Sbjct: 16 LKSERTSFESHWRDLSDFINPRGSRFLTSDVNRDDRRNTKIIDPTGSMAQRILSSGMMSG 75 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV---TDTLFGFRERSRSGFVGC 121 IT P + W LA + V+ W + V + +F ++S Sbjct: 76 ITSPARPWFKLATPDPDMMDY--------GPVKVWLEVVQRRMNEVF-----NKSNLYQS 122 Query: 122 LQSFYTSVVEFGTGCF-YMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 L Y S+ FGT +E D D IR + P+ Y++ + + VD+ +R+F+ Sbjct: 123 LPVMYASLGTFGTAAMAVLEDDQDV------IRTMPFPIGCYYLANSPRGSVDTSFRQFS 176 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDK-KKDKGNKGFHSKF 238 TV Q+V ++G +SS ++ E + + H + P D K D NK F S + Sbjct: 177 MTVRQLVQEFGLDNVSSSVQGMWQNGTYETWIEVNHCITPNVNRDTGKMDSKNKPFRSVY 236 Query: 239 VSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRRLNETVNELAQF 295 D ++ E FP + R+ V +++Y S P M AL ++ L AQ Sbjct: 237 FETGGDADKLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQL 296 Query: 296 GRLSLHPPTIAVSEAKQRNFDLKPG---YMNIGALSREGRSLFQPVQFGNP--LPYHEEL 350 + +PP +A + K + L PG Y+++ G+ F+P NP ++ Sbjct: 297 IDKATNPPMVAPTSLKTQRVSLLPGDVTYLDV----LSGQDGFKPAYLVNPNTADLLADI 352 Query: 351 NRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 +++I S + +DLF +L + +RS +E EK +GP++ L E + +I Sbjct: 353 QDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLID 412 Query: 409 RELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASALQGVNTVVEL 464 R ++ + LP PP ++ LKVEY S + + Q++ ++S VN + +L Sbjct: 413 RAFSMMVRKNMLPP------PPDAMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQL 466 Query: 465 GVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLI 497 P +D ++ D+ + F+ + +P V++ Sbjct: 467 A--QAKPEALDKLNVDQAIDAFADMSGVSPTVIV 498 >gi|290968647|ref|ZP_06560185.1| hypothetical protein HMPREF0889_0287 [Megasphaera genomosp. type_1 str. 28L] gi|290781300|gb|EFD93890.1| hypothetical protein HMPREF0889_0287 [Megasphaera genomosp. type_1 str. 28L] Length = 577 Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 122/481 (25%), Positives = 220/481 (45%), Gaps = 59/481 (12%) Query: 45 MWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVT 104 +++ ++A ++ + S +TPP +KW F+ A L ++ + E C+ + Sbjct: 76 IYNGITAQARDTFAAGIQSGLTPPSRKWF----RFAPTDASLDNNIDVARVLDERCEIME 131 Query: 105 DTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYM 164 L S+S F + S Y + FG + AD E+G+ +++ + + Sbjct: 132 GVL------SQSNFYNVIHSAYKEL-PFGQSPVGVFAD------EKGVYFVNYTIGTYAL 178 Query: 165 SVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARN--ENERFTIIHAVYPKSL 222 + Q +++ R+ + QIVS +GD V++ ++ A+ N + +T+ VYP Sbjct: 179 GADGQGRINTFARKVKMSAAQIVSLYGDSVVTDSVREAVKANGGHEDYYTVCWLVYPNP- 237 Query: 223 TDKKKDKGNKGFHS-KFVSV------DENRFFEEKQIATFPYIVGRYRVRADEIYGRSPA 275 K K G H KF+SV D N K + V RY V+ + YG PA Sbjct: 238 ----KAKPTGGNHDMKFLSVHWLEGSDPNSLLAAKGFEEWAIPVARYNVKGIDAYGIGPA 293 Query: 276 MEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYM--------NIGAL 327 +ALP R L + + A LS+ PP + +E + R +L PG N+ ++ Sbjct: 294 WDALPESRMLQKMEYDGAIALELSIKPPLVGPAELQGR-INLFPGAYTPSINPNDNVHSI 352 Query: 328 SREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLD--DKASRSAAESMEKTR 385 G L N L ++ ++++ I+ ++ DLF +L+ ++ +A E M + + Sbjct: 353 YSGGLDL-------NSL--QAKITQIEDRIKRIYSTDLFLMLNELNRGQMTAQEVMARNQ 403 Query: 386 EKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSP 441 EK A +GP+I LQ+EF+ +I R ++L+ P D+ +L +K+EY SP Sbjct: 404 EKMAQLGPVIERLQNEFLSDIIERVYNLLERNQVFPPL--PDDVQQTLQGQEIKIEYLSP 461 Query: 442 LFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTA 501 L + Q+ + + QGV+ V +L DP+ + ++ D+ L P+ +IR Sbjct: 462 LAQAQKMSGLTAIEQGVSFVGQLA--QLDPNVILRVNFDKAVENYLDKLGVPSTMIRTED 519 Query: 502 E 502 E Sbjct: 520 E 520 >gi|262043566|ref|ZP_06016679.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039100|gb|EEW40258.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 560 Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 126/512 (24%), Positives = 221/512 (43%), Gaps = 59/512 (11%) Query: 16 LKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 L N R + EL+ F+ P + ++ D T + A LSS + S Sbjct: 17 LTNDRSSFDPHWRELSDFINPRGSRFLVTDVNRDDRRNTKIVDPTATLAARTLSSGMMSG 76 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV---TDTLFGFRERSRSGFVGC 121 IT P + W LA + V+ W + V + +F ++S Sbjct: 77 ITSPARPWFKLATPDPDMMDY--------GPVKLWLEVVQRRMNEVF-----NKSNIYQS 123 Query: 122 LQSFYTSVVEFGTGCF-YMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 L Y S+ + TG +E D D IR + P+ + YM+ + + VD+ +R+F+ Sbjct: 124 LPLLYASLGNYSTGAMAVLEDDSDV------IRTMMFPIGSYYMANSARGSVDTCFRKFS 177 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDKKK-DKGNKGFHSKF 238 TV Q+V ++G +S +K E + +IHAVYP D K + NK S + Sbjct: 178 MTVRQLVMEFGLNNVSDSVKGMWDSGNYESWIEVIHAVYPNIDRDTAKLNSKNKPVKSVY 237 Query: 239 VSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRRLNETVNELAQF 295 V D ++ E FP + R+ V +++YG S P M AL ++ L +Q Sbjct: 238 YEVGGDSDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMIALGQVKALQLEQKRKSQL 297 Query: 296 GRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR-EGRSLFQPVQFGNP--LPYHEELNR 352 + +PP + S + + L PG +I + + G+ F+P NP ++ Sbjct: 298 IDKATNPPMVGPSSLRNQRVSLLPG--DITYIDQVTGQDGFKPAYLVNPNTADLLADIQD 355 Query: 353 LKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQSEFIGAMISRE 410 ++ I S + +DLF +L + +RS +E EK +GP++ L E + +I R Sbjct: 356 TRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRT 415 Query: 411 LDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 I+ + LP PP L L++EY S + + Q++ ++S V + +L Sbjct: 416 FSIMARKNLLPP------PPDVLQGMPLRIEYISVMAQAQKSIGLSSLSSTVGFIGQLA- 468 Query: 467 KTGDPSCMDHMDTDR-VSRFSLWATNTPAVLI 497 P +D ++ D+ + F+ + +P V++ Sbjct: 469 -QAKPEALDKLNVDQAIDAFAEMSGVSPTVIV 499 >gi|144899435|emb|CAM76299.1| head-to-tail joining protein [Magnetospirillum gryphiswaldense MSR-1] Length = 502 Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 124/485 (25%), Positives = 203/485 (41%), Gaps = 64/485 (13%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV 103 R++D T ++A +L++ L S +TPP +W GL ++A ++V D+V Sbjct: 62 RLFDGTAADAVDQLAASLLSELTPPWAQWFGLTAGPDL-------DEAERQQVAPLLDKV 114 Query: 104 TDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVY 163 L +RS F + Y VV GT C E + G R+ +VPL+ Sbjct: 115 GAILQSHFDRS--NFAVEMHQCYLDVVTGGTACLLFEEA--QPGEASAFRFTAVPLAQAV 170 Query: 164 MSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + +DS +R T+ + ++ L + + RF +I AV P Sbjct: 171 LEEGPDGKLDSSFRRSELTLAALRQRFPAAQLDPSLIRRGEEDPQARFAVIEAVIP---- 226 Query: 224 DKKKDKGNKGFHSKFVSV------DENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAME 277 N+ H + ++ D+ E + P+I R+ EIYGRSP M+ Sbjct: 227 -------NQRGHYDYAAILEDATDDDEALLAEGRFGQSPFINFRWLKAPGEIYGRSPVMK 279 Query: 278 ALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQ---------RNFDLKPGYMNIGALS 328 ALP I+ N+ V L L TIAV+ Q N L PG + A+ Sbjct: 280 ALPDIKTANKVVE-------LVLKNATIAVTGIWQADDDGVLNPANIKLIPGTIIPKAVG 332 Query: 329 REGRSLFQPVQFGNPLPYHE-ELNRLKESIRSLFLLD-LFQVLDDKASRSAAESMEKTRE 386 G QP++ + L+ L+ IR L D L Q D +A E +E++ + Sbjct: 333 SAG---LQPLESPGRFDISQLVLDDLRGRIRHALLADKLGQA--DNPKMTATEVLERSAD 387 Query: 387 KGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPE--CEGADNPPVSLLKVEYTSPLFK 444 +G G LQSE + +I R + IL +G +P +G L++++Y SPL + Sbjct: 388 MARLLGATYGRLQSELLTPLILRAVTILRRRGEIPPLLVDG------HLVELQYRSPLAQ 441 Query: 445 YQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 Q + L ++ + +LG P+ M +D +++ A N PA L+ E Sbjct: 442 SQAQRDAHNVLSWLSALAQLG-----PAGMAVVDPAAAAQWLGRAFNIPADLMVAPQNPE 496 Query: 505 DIRQQ 509 ++ Q Sbjct: 497 NVHVQ 501 >gi|291334466|gb|ADD94120.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1161] Length = 330 Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 84/330 (25%), Positives = 156/330 (47%), Gaps = 28/330 (8%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLR----------MWDTTGSEACI 55 AK++ R++ LK+QR +E+ ++ P K + ++D + ++ Sbjct: 7 AKNLLKRYDRLKSQRQNWESHWQEVADYMQPRKADVTKTRSKGDKRTELIFDGSPLQSVE 66 Query: 56 KLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSR 115 L++ L ++T P W L F ++ + + W + TD ++ +R Sbjct: 67 LLAASLHGMLTNPSTPWFTLR--------FKDEDIDNEDEAKLWLEASTDAMYT--AFNR 116 Query: 116 SGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSV 175 S F + Y ++ FGT ++E D DE ++ R+I+ V+++ N + +D+V Sbjct: 117 SNFQQEIFELYHDLITFGTAAMFIEED-DEDIIKFSTRHIN----EVFIAENDKGRIDTV 171 Query: 176 YREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD-KKKDKGNKGF 234 +R+F+ + ++ K+GD +S + + ++ E I+HAVYP+S D +K+DK N F Sbjct: 172 FRKFSLSARAVMQKFGD--VSINIATKAKKDPYEEVEIMHAVYPRSDFDPRKQDKENMPF 229 Query: 235 HSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQ 294 S ++ + FP++V RY + EIYGRSPAM ALP ++ LNE + Sbjct: 230 ESVYLDAESGDELSVSGFREFPFVVPRYLKASHEIYGRSPAMTALPDVKMLNEMSKTTIK 289 Query: 295 FGRLSLHPPTIAVSEAKQRNFDLKPGYMNI 324 + + PP + + PG +N Sbjct: 290 SAQKQVDPPLLVPDDGFMLPVRTIPGGLNF 319 >gi|330007155|ref|ZP_08305897.1| hypothetical protein HMPREF9538_03586 [Klebsiella sp. MS 92-3] gi|328535502|gb|EGF61962.1| hypothetical protein HMPREF9538_03586 [Klebsiella sp. MS 92-3] Length = 559 Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 135/527 (25%), Positives = 225/527 (42%), Gaps = 70/527 (13%) Query: 16 LKNQRGELNYWMEELTGFLYPY--------KNNA---QLRMWDTTGSEACIKLSSLLSSL 64 LKN+R EL F+ P +NN R+ D T S+A L S + S Sbjct: 17 LKNERTSFEEHWRELAEFIDPRSTRFLTTERNNGSKRNTRIVDPTASKAARTLQSGMLSG 76 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 IT P + W LA + V+ W D V + +RS L Sbjct: 77 ITSPTRPWFKLATPDPEMMQY--------GPVKRWLDVVMTRMNDVM--NRSNVYQSLPI 126 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVD 184 Y + FGT + D E+ IR +P+ + Y+S +H+ VD+ YR F+ T Sbjct: 127 IYRHLGVFGTAAMAVLEDD-----EDVIRTHPLPIGSYYLSNSHRLSVDTTYRVFSMTAR 181 Query: 185 QIVSKWGDKVLSSKMKSALAR-NENERFTIIHAVYPK-SLTDKKKDKGNKGFHSKF--VS 240 QIV ++G +S+ ++ A N F ++H P + K + NK F S + +S Sbjct: 182 QIVMQFGLDNVSNAVRGAWDNANYEAWFDVVHLTEPNIDRVNGKLNSRNKAFKSVYFELS 241 Query: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRRLN-ETVNELAQFGRL 298 D ++ E P + R+ + +++YG + P M AL T + L E + + +L Sbjct: 242 GDGDKLLREAGFDEPPILSPRWEINGEDVYGSNCPGMMALGTGKALQLEQIRKANAIDKL 301 Query: 299 SLHPPTIAVSEAKQRNFDLKPG---YMNIGALSREGRSLFQPVQFGNPLPYHEELNRL-- 353 ++PP +A + K + +L PG Y++ + L +P +P +LN + Sbjct: 302 -VNPPMVAPTGLKNKLINLAPGGVTYVD----EVDATKLVRPAYAVSP-----QLNDMLG 351 Query: 354 -----KESIRSLFLLDLFQVLDDKASRS----AAESMEKTREKGAFVGPLIGGLQSEFIG 404 ++ I + F DLF + +RS A +M+ EK +GP++ L EF+ Sbjct: 352 SIADDRQMIEACFFSDLFNLFSTINTRSMPVEAVAAMQD--EKLLQLGPVLERLNDEFLD 409 Query: 405 AMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASALQGVNT 460 + R +I+ + PE PP L LKVEY S L + Q++ ++S + V Sbjct: 410 PFVDRTFNIMARRNLFPE------PPEELQGTPLKVEYVSILAQAQKSIGISSVERFVGF 463 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIR 507 V L +P+ +D ++ D+ PA ++ EV+ R Sbjct: 464 VGNLA--KANPAALDKLNIDQTIDEYGNMLGVPATIVNSDDEVQATR 508 >gi|226940462|ref|YP_002795536.1| Bbp21 [Laribacter hongkongensis HLHK9] gi|226715389|gb|ACO74527.1| Bbp21 [Laribacter hongkongensis HLHK9] Length = 555 Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 126/502 (25%), Positives = 218/502 (43%), Gaps = 55/502 (10%) Query: 7 KDIQDRFNYLKNQRGE-LNYWMEELTGFLYPYK-----------NNAQLRMWDTTGSEAC 54 K + R+ LK +R +++W E++ +L P N ++D TG+ A Sbjct: 8 KRVSARWEALKKERSSWMSHW-SEISDYLLPRSGRFFVEDRNKGNKRHKNIYDNTGTRAL 66 Query: 55 IKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERS 114 L++ + + +T P + W L S S V+ W VT + +S Sbjct: 67 RVLAAGMMAGMTSPARPWFRLTTSDPQLD--------ESAAVKAWLADVTRIMQMVFAKS 118 Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDS 174 + L S Y + FGT + D + G+ I + + ++ +++ V++ Sbjct: 119 NT--YRALHSCYEELGAFGTAGTIVLPDFN--GV---IHHHVLTAGEFAIAADYRGQVNT 171 Query: 175 VYREFTFTVDQIVSKWGDKVLSSKMKSALAR-NENERFTIIHAVYPKSLTDK-KKDKGNK 232 +YREF TV Q+V ++G S+ ++ R +E T+IHA+ P++ K ++D N Sbjct: 172 LYREFQMTVGQMVGEFGLSACSATVQRLHERWCLDEWITVIHAIEPRTDRHKGRQDARNM 231 Query: 233 GFHSKFVSVD--ENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVN 290 + S + E + E FP + R+ +IYG SPAME+L I++L Sbjct: 232 AWRSVYFEPGNREGQVLRESGFREFPALCPRWSTSGGDIYGNSPAMESLGDIKQLQHEQL 291 Query: 291 ELAQFGRLSLHPPTIAVSEAKQRNFDLKPG---YMNIGALSREGRSLFQPVQFGNPLPY- 346 Q PP S + R+ D PG +++ G + RS F + G L + Sbjct: 292 RKGQVIDYKTKPPLQVPSSMRARDIDTLPGGVSFVDAGTPNGGIRSAF---EVGLDLSHL 348 Query: 347 HEELNRLKESIRSLFLLDLFQVLDDKAS--RSAAESMEKTREKGAFVGPLIGGLQSEFIG 404 ++ ++E I+ F DLF +L + ++ +A E E+ EK +GP++ L +E + Sbjct: 349 LADIQDVRERIKGSFYADLFLMLANGSNPQMTATEVAERHEEKLLMLGPVLERLHNEILD 408 Query: 405 AMISRELDILDSQGNLP----ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 +I + G +P E +G D L VE+ S L + Q+A + S + V Sbjct: 409 PLIEMTFSRMVEAGIVPPPPEELQGVD------LNVEFVSMLAQAQRAIATNSVDRFVGN 462 Query: 461 VVELGVKTG-DPSCMDHMDTDR 481 LG G P +D D DR Sbjct: 463 ---LGAVAGIKPEVLDKFDADR 481 >gi|303328393|ref|ZP_07358830.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861387|gb|EFL84324.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 567 Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust. Identities = 117/470 (24%), Positives = 192/470 (40%), Gaps = 55/470 (11%) Query: 38 KNNAQLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKV- 96 KN + D+TG A L++ + +T P + W GL L D+ + Sbjct: 50 KNLLNPEVVDSTGIYALRTLAAGMQGGMTSPARPWFGLR---------LEGGDSGDGGIT 100 Query: 97 -REWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYI 155 R W D+V + + S F G + Y + FGT C + AD+ G + Sbjct: 101 ARAWIDEVVERMRTILHTSN--FYGVIYQAYAQLAAFGTACVFERADM------SGFTFD 152 Query: 156 SVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSAL--ARNENERFTI 213 + V+ VD+V R+ T Q+ ++G+ L +K++L A N R + Sbjct: 153 CCQAGTFVLDVDAGGRVDTVMRKIWLTARQMAQEFGEDALPDMVKTSLNNASMGNVRHAV 212 Query: 214 IHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRF---------FEEKQIATFPYIVGRYRV 264 HAVYP+ +++ N G F SV R E +FP+ R+ V Sbjct: 213 FHAVYPRREPGLRRETIN-GARRPFASVYWMRGMSGAGGYHPLRESGFDSFPFFGVRWNV 271 Query: 265 RADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMN- 323 + ++YG SPAM+ +P R L + + + PP +E + DL PG +N Sbjct: 272 LSGDVYGTSPAMDTMPDCRMLQQMAKTTLKGVHKMVDPPVNVAAELQSVGVDLTPGGVNY 331 Query: 324 IGALSREGRSL-----FQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASR--S 376 + + G ++ QP + ++KE + + DLF++L R + Sbjct: 332 VSMMGNNGAAVTPVLKVQPDVAAAQAMIQQVQQQIKEGLYN----DLFRMLLGTNRRQIT 387 Query: 377 AAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSL--- 433 A E + EK +GP++ L E +I R ++D LP P L Sbjct: 388 ATEVDAREAEKMILIGPVLERLHDELFIPLIDRTFALMDKFNALPPV------PEELAGR 441 Query: 434 -LKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRV 482 LKVE+ S L + Q+ S Q + + G DPS +D ++ DR+ Sbjct: 442 GLKVEFISTLAQAQKLVSTGGIQQLLAFIG--GAAQVDPSVLDALNGDRL 489 >gi|332160969|ref|YP_004297546.1| hypothetical protein YE105_C1347 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665199|gb|ADZ41843.1| Hypothetical phage protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330862125|emb|CBX72289.1| hypothetical protein YEW_AK02260 [Yersinia enterocolitica W22703] Length = 534 Score = 114 bits (284), Expect = 5e-23, Method: Compositional matrix adjust. Identities = 118/473 (24%), Positives = 197/473 (41%), Gaps = 40/473 (8%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGL-AESFSAYQAFLYKEDARSKKVREWCDQ 102 R+ D T +++ L+S L S +TP +W L +E+ S +D RS W Sbjct: 57 RLLDGTATDSARILASALMSGMTPANAQWLDLGSENLS--------DDERS-----WLS- 102 Query: 103 VTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNV 162 T + + F VV G Y VDE + G + PL+ V Sbjct: 103 -TCATLTWENIHAANFDAEGYEANIDVVCAGWFALY----VDEDTEQGGYTFNQWPLAQV 157 Query: 163 YMSVNHQN-VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPK- 220 +++ + ++ VV++VYR + T +Q V ++G +S K++ A + +++F IHA++P+ Sbjct: 158 FVASSRRDGVVNTVYRCYQLTAEQAVKEFGRDNVSHKIQDAANKKPDDKFEFIHAIFPRD 217 Query: 221 SLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALP 280 + N F S V V E + E FP V R+ YG P +ALP Sbjct: 218 GYIGNARLAKNLPFASFNVEVAEKKVVRESGYHEFPVCVPRWMKIPGTPYGVGPVYDALP 277 Query: 281 TIRRLNETVNELAQFGRLSLHPPTIAVSEA--KQRNFDLKPGYMNIGALSREGRSLFQPV 338 + LNET L++ IA + R ++ P + + + L Sbjct: 278 DCKELNETKRMEKAAQDLAIAGMWIAEDDGVLNPRTVNVGPRKIIVANSVNSMKPLLTGA 337 Query: 339 QFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGL 398 F E RL+ IR + + D Q D A +A E + +GP+ G Sbjct: 338 DFNVAFTAEE---RLQAQIRKILMADQLQPQDGPA-MTATEVHVRVALIRQLLGPVYGRF 393 Query: 399 QSEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASA 454 Q+E++ ++ R I G P+ P S+ + Y SPL + Q+ E V + Sbjct: 394 QAEYLQPLVERCFGIAFRAGVFPQM------PESMAQANFNIRYISPLARAQKLEDVTAI 447 Query: 455 LQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIR 507 + + +L +P +D+MD D +R A PA ++R A+V +R Sbjct: 448 ERLGANIAQLAAI--NPEVIDNMDADAAARVVSDALGVPAKVLRSAADVTALR 498 >gi|83313332|ref|YP_423596.1| hypothetical protein amb4233 [Magnetospirillum magneticum AMB-1] gi|82948173|dbj|BAE53037.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 545 Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust. Identities = 116/465 (24%), Positives = 192/465 (41%), Gaps = 49/465 (10%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV 103 R++D T + +L++ L S +TPP +W GLA +A + ++ + E V Sbjct: 78 RLFDGTAPDCVDQLAASLLSELTPPWAQWFGLAAGDQMPEA----DRDQAAPLLERIAAV 133 Query: 104 TDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVY 163 + F RS F + Y V GT E G R+ SVPL V Sbjct: 134 MQSHF-----DRSNFAIEMHQCYLDAVTGGTASLMFEEA--PPGEPSAFRFTSVPLGQVV 186 Query: 164 MSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + +D +R +V + +++ VL ++ A A + + R ++ AV P Sbjct: 187 LEEGPAGRLDVTFRRSELSVAALKARFPRAVLPREVIKAAADDPDLRLGVVEAVVPV--- 243 Query: 224 DKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 +G + + + Q ++ P++ R+ E+YGRSP M+ALP I+ Sbjct: 244 -----RGGYSYAAVLDDDGSDLVLGRGQFSSSPFLNFRWLKAPGEVYGRSPVMKALPDIK 298 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQ---------RNFDLKPGYMNIGALSREG-RS 333 N+ V L L TIAV+ Q N L PG + A+ G + Sbjct: 299 TANKVVE-------LVLKNATIAVTGIWQADDDGVLNPANIKLVPGTIIPKAVGSAGLQP 351 Query: 334 LFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGP 393 L P +F L+ L+ IR + D A +A E +++ + +G Sbjct: 352 LTAPGRFDT---SQLVLDDLRGRIRHALMGDKLSQPASPA-LTATEVLQRADDMARLLGA 407 Query: 394 LIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVAS 453 G LQSE + +I R + IL +G +P + D + ++Y SPL + Q + Sbjct: 408 TYGRLQSELLTPLILRAIHILRRRGEIPPLQ-VDG---RTIDLQYRSPLAQNQGRRDARN 463 Query: 454 ALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIR 498 L + + LG PS + +D+D +R+ A N P+ LIR Sbjct: 464 VLNWLGALSSLG-----PSALATVDSDAAARWLARAFNVPSELIR 503 >gi|225158777|ref|ZP_03725094.1| hypothetical protein ObacDRAFT_8203 [Opitutaceae bacterium TAV2] gi|224802612|gb|EEG20867.1| hypothetical protein ObacDRAFT_8203 [Opitutaceae bacterium TAV2] Length = 562 Score = 113 bits (282), Expect = 9e-23, Method: Compositional matrix adjust. Identities = 119/462 (25%), Positives = 200/462 (43%), Gaps = 51/462 (11%) Query: 45 MWDTTGSE-ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV 103 ++D+T +E A + + LLSSL+ P G+ W FSA S V EW D Sbjct: 61 IYDSTANESALVYAAGLLSSLV-PAGELWF----RFSA-------RPGASAPVVEWFDDC 108 Query: 104 TDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGI-RYISVPLSNV 162 T S F + + + F + E +G G+ + +VP+ Sbjct: 109 THRAAA--ALHASNFYLGIHEDFMDMAGFSIASLFCEEGAALRGQRGGLLNFTNVPVGTF 166 Query: 163 YMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSAL----ARNENERFTIIHAVY 218 + + + +VD+V+REF FT Q KWG+ LS M AL A + ++RF IIHAVY Sbjct: 167 VIEEDAEGLVDTVFREFRFTARQCAQKWGEDKLSKPMLDALNSKTASDRDKRFQIIHAVY 226 Query: 219 PKSLTDKKKDKG---NKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPA 275 P+ D K+ G + S +V EE P V R +EIYGR P Sbjct: 227 PRR--DGKQGPGIGKKRPIASVYVDKQAIHVIEEGGFYEMPIAVARLLRGNNEIYGRGPG 284 Query: 276 MEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPG---YMNIGALSREGR 332 + +P I+ +N +L ++PP +A ++ R D +PG Y + + + Sbjct: 285 DQVMPEIKLVNRMERDLLLSLEQQVNPPWLAPQDSSWRP-DNRPGGVFYWDASNPNNKPE 343 Query: 333 SLFQPVQF--GNPLPYHEELNRLKESIRSLFLLDLFQVLDD----KASRSAAESMEKTRE 386 L + G+ + LN +E IR + +D+F++L + K ++A E + +E Sbjct: 344 RLRDTARLDIGDKV-----LNDKREVIRRAWFVDMFKMLSNPDAMKRDKTAFEVAQLMQE 398 Query: 387 KGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL---PECEGADNPPVSL-LKVEYTSPL 442 K P+ + E + ++ R +IL G P EG SL +++Y S + Sbjct: 399 KLVLFHPMFARITQEKLNPVLERVFNILMRAGIFAPPPMAEGE-----SLEYEIDYVSKI 453 Query: 443 FKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSR 484 +A + Q ++ + G+ T DP+ ++ + +R Sbjct: 454 ALAIKAAQNGALAQMMDLIG--GMATFDPTVALVINWKKAAR 493 >gi|85059164|ref|YP_454866.1| hypothetical protein SG1186 [Sodalis glossinidius str. 'morsitans'] gi|84779684|dbj|BAE74461.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 541 Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 133/534 (24%), Positives = 219/534 (41%), Gaps = 63/534 (11%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKN--------NAQ------LRMW 46 M++ + K I R + LK+ R E + YP + +AQ ++ Sbjct: 1 MDELAVKLIT-RADTLKSHRQRHESVWRECYDYTYPLRGAGFSADVLDAQSAKSKVAKLL 59 Query: 47 DTTGSEACIKLSSLLSSLITPPGQKWHGL-AESFSAYQAFLYKEDARSKKVREW---CDQ 102 D T +++ L+S L S +TP +W L +ES +DA++ W C Sbjct: 60 DGTATDSARMLASALMSGMTPANAQWLNLDSESLP--------DDAKA-----WLSGCAT 106 Query: 103 VTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNV 162 + G+ L VV G Y +DE E G + PLS Sbjct: 107 LVWENIHAANFDAEGYEANL-----DVVCAGWFVLY----IDENREEGGYMFQQWPLSQC 157 Query: 163 YM-SVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYP-K 220 Y+ S +VD++YR + T +Q ++++G+ +S K++ A +++F +HA++P K Sbjct: 158 YVASTRKDGIVDTIYRCYQMTAEQAIAEFGEAGVSEKIRRAAKDKPDDKFDFLHAIFPRK 217 Query: 221 SLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALP 280 + + + F S V R E FP V R+ + YG P +ALP Sbjct: 218 NYVVNARLAKHLRFASFHVERQGKRIVRESGYHEFPVCVPRWMKISGGAYGIGPVYDALP 277 Query: 281 TIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQF 340 + LNET L++ IA + + +K G I + + +P+ Sbjct: 278 DCKELNETKRMEKAAQDLAISGMWIAEDDGVINPYSVKVGPRRI--IVASSVNSMKPLLT 335 Query: 341 GNPLPYHEEL---NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGG 397 G +H +RL+ SIR + + D Q D A +A E + +GP+ G Sbjct: 336 GAD--FHVAFTAEDRLQASIRKIMMADQLQPQDGPA-MTATEVHVRVALIRQLLGPVYGR 392 Query: 398 LQSEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVAS 453 Q+E++ ++ R I G P PP S+ V Y SPL + Q+ E V + Sbjct: 393 FQAEYLQPLVERCFGIAFRAGVFPA------PPDSMQTAHFNVRYISPLARAQKLEDVTA 446 Query: 454 ALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIR 507 + V +L + P +D +DTD R A PA +IR A+V +R Sbjct: 447 IERLGANVAQLSQVS--PEVVDLVDTDEAMRVVADALGVPAKVIRSAADVTSLR 498 >gi|303257564|ref|ZP_07343576.1| conserved hypothetical protein [Burkholderiales bacterium 1_1_47] gi|302859534|gb|EFL82613.1| conserved hypothetical protein [Burkholderiales bacterium 1_1_47] Length = 548 Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 111/474 (23%), Positives = 213/474 (44%), Gaps = 38/474 (8%) Query: 92 RSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEG 151 ++ V+EW +V D L + S++ L Y + FGT C ++ E+ Sbjct: 94 KNPAVKEWMTKVQDLLLLYF--SKAECYNALHQSYLELPVFGTACTIVKPHP-----EQL 146 Query: 152 IRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF 211 I ++ + +++ + VD++YR + T Q+V +WG + +++ ++ A ++ RF Sbjct: 147 ISLQNLTIGEYWLAEDDYGKVDTMYRRLSLTAKQMVQQWGFEAVNNDVRQAFEKDPFTRF 206 Query: 212 TIIHAVYPK-SLTDKKKDKGNKGFHSKFVSVD-ENRFFEEKQIATFPYIVGRYRVRADEI 269 +IHA+ P+ K+D N + S + +++ E FP + R+ + Sbjct: 207 NVIHAIEPRIERNPDKRDNKNMPWQSVYFQEGVQDKVLSESGFRNFPALCPRWMTSGGSV 266 Query: 270 YGRSPAMEALP---TIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGA 326 YGR P +AL +++RL+ + EL +G PP + S K + KPG + Sbjct: 267 YGRGPGAKALSAQKSLQRLHLRLAELVDYGT---RPPILYPSTLKDQLSQFKPGG-RVAV 322 Query: 327 LSREG---RSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKAS---RSAAES 380 +E RS+++ +P + ++ I+ +F +++FQ++ A+ R+A E Sbjct: 323 NPQEAPIIRSMWE--VRTDPQAMLALIQSTRQDIQRIFFVNVFQMIAATANQTDRTATEV 380 Query: 381 MEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKV 436 +EK +GP++ L +E + +++ + LPE P L L + Sbjct: 381 QALEQEKVMMLGPVLERLHTELLDPLVTNAFGFMVEYNMLPEV------PEELYGRELSI 434 Query: 437 EYTSPLFKYQQAESVASALQGVNTVVELGVKTG-DPSCMDHMDTDRVSRFSLWATNTPAV 495 EY S L +A+ ASA V T ++G+ +P +D +D D P Sbjct: 435 EYVSVL---AEAQKNASANGIVRTAQQIGLLAQINPQAVDKLDVDATIDQLADMNGVPPS 491 Query: 496 LIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHD 549 LI +V IRQQR Q++ + QQ + +D+G A + +++ + + Sbjct: 492 LIVTGQKVALIRQQRAEQQQAQMQAAQLQQAMTSLKDLGQAADSQGLQEAFSEE 545 >gi|288957023|ref|YP_003447364.1| hypothetical protein AZL_001820 [Azospirillum sp. B510] gi|288909331|dbj|BAI70820.1| hypothetical protein AZL_001820 [Azospirillum sp. B510] Length = 534 Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 106/432 (24%), Positives = 187/432 (43%), Gaps = 43/432 (9%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV 103 R++D T +A +L++ L S +TPP +W G F E R + + + Sbjct: 68 RLFDGTAPDAVEQLAASLLSELTPPWSRWFG----FRPGPDLTGAERDRIAPLLDRAAGI 123 Query: 104 TDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVY 163 F RS F + + +V GT ME G +R+ +VPL++ Sbjct: 124 IQAHF-----DRSNFAVEVHQAFLDLVTVGTASLLMEEAA--PGAVSSLRFTAVPLADAV 176 Query: 164 MSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + +D+ +R T+ QI+ ++ L +++ A + + RF ++ AV P Sbjct: 177 LEEGPDGRLDATFRRSEATLAQILQRFPGAGLPDELRRRAAEDPDHRFPLVEAVVPDGAA 236 Query: 224 DKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 + + G + + + + A P++ R+ E YGRSP M+ALP I+ Sbjct: 237 YRWGVVLDSGLA-------DPSWLAQGRFAQSPFVNFRWLKAPGETYGRSPVMKALPDIK 289 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFD---------LKPGYMNIGALSREGRS- 333 N+ V L L +IAV+ Q + D L PG + A+ G + Sbjct: 290 TANKVVE-------LVLKNASIAVTGIWQADDDGVLNPSTIRLVPGTIIPKAVGSAGLTP 342 Query: 334 LFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGP 393 L P +F L+ L+ IR L+D + D A +A E +E++ E +G Sbjct: 343 LANPGRFDV---SQLVLDDLRGRIRHALLVDRLGPV-DSARMTATEVLERSVEMARLLGA 398 Query: 394 LIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVAS 453 G LQ+E + ++ R + IL +G +P+ D L+++++ SPL + Q V + Sbjct: 399 TYGRLQAELMTPLLLRAVSILRRRGEIPDIT-VDG---RLVELQHRSPLAQAQAQRDVQA 454 Query: 454 ALQGVNTVVELG 465 L+ +++V LG Sbjct: 455 TLRWLDSVKALG 466 >gi|46581008|ref|YP_011816.1| hypothetical protein DVU2604 [Desulfovibrio vulgaris str. Hildenborough] gi|46450429|gb|AAS97076.1| conserved hypothetical protein [Desulfovibrio vulgaris str. Hildenborough] gi|311234693|gb|ADP87547.1| hypothetical protein Deval_2404 [Desulfovibrio vulgaris RCH1] Length = 569 Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust. Identities = 122/491 (24%), Positives = 208/491 (42%), Gaps = 56/491 (11%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDAR-SKKVREWCDQ 102 R+ D T + A L++ + +T P + W ++ L ED + R W D Sbjct: 56 RIIDGTATRAVRILAAGMQGGLTSPARPW---------FRLRLADEDMEEAGPERRWLDV 106 Query: 103 VTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNV 162 V L+ +RS F + YT + FG+ Y EAD + +R+ + + Sbjct: 107 VERRLYA--ALARSNFYAAVHGLYTELAAFGSADMYHEADP-----QRVMRFSCLACGDF 159 Query: 163 YMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPK-- 220 + + VD+V R + Q+ ++G+ LS +++ L R+ ++H V P+ Sbjct: 160 AWACDAAGRVDTVVRRLRMSARQMAQRYGEARLSRRVRRMLRRDPERSVPLVHMVRPRVR 219 Query: 221 ---SLTDKKKDKGNKGFHSKFVSV-----DENRFFEEKQIATFPYIVGRYRVRADEIYGR 272 K G G + + S+ E FP++ R+ V +IYGR Sbjct: 220 RNAGEAGKTASGGLGGVNMPWQSLTWETEGAEGLLHEGGFEEFPHLAARWDVAGGDIYGR 279 Query: 273 SPAMEALPTIRRLNETVNELAQFGRLSLH----PPTIAVSEAKQRNFDLKPGYMNIGALS 328 SP M+ LP ++ L E+A+ L++H PP S KQR +L PG N Sbjct: 280 SPGMDVLPDVKML----QEMARSQLLAIHKVVNPPMRVPSGFKQR-LNLIPGGQNY-VTP 333 Query: 329 REGRSLFQPVQFGNP--LPYHEELNRLKESIRSLFLLDLFQVL--DDKASRSAAESMEKT 384 +G S+ Q NP ++ ++ ++R F DLF + + +++ +AAE +E+ Sbjct: 334 GQGESVGPLYQI-NPDIGAVTHKMEDVRRAVREGFFNDLFLMFTAEGRSNITAAEVLERG 392 Query: 385 REKGAFVGPLIGGLQSEFIGAMISRELDI----LDSQGNLPECEGADNPPVSLLKVEYTS 440 EK +GP+I QSE + ++ R I PE G ++VEY S Sbjct: 393 EEKLLMLGPVIERHQSELLDPLLERTYGILRRGGLLPPPPPELAGRS------MRVEYVS 446 Query: 441 PLFKYQQAESVASALQGVNTVVEL-GVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRD 499 L + Q+ + + + + V L GV P +D +D ++ PA ++R Sbjct: 447 ALAQAQRVVTAQAIRRFASDVSALAGVA---PQVLDKVDFEQAVDELAAIAGVPARVVRS 503 Query: 500 TAEVEDIRQQR 510 AEV +R R Sbjct: 504 DAEVATLRAAR 514 >gi|120601696|ref|YP_966096.1| hypothetical protein Dvul_0646 [Desulfovibrio vulgaris DP4] gi|120561925|gb|ABM27669.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4] Length = 569 Score = 110 bits (274), Expect = 8e-22, Method: Compositional matrix adjust. Identities = 122/491 (24%), Positives = 208/491 (42%), Gaps = 56/491 (11%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDAR-SKKVREWCDQ 102 R+ D T + A L++ + +T P + W ++ L ED + R W D Sbjct: 56 RIIDGTATRAVRILAAGMQGGLTSPARPW---------FRLRLADEDMEEAGPERRWLDV 106 Query: 103 VTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNV 162 V L+ +RS F + YT + FG+ Y EAD + +R+ + + Sbjct: 107 VERRLYA--ALARSNFYAAVHGLYTELAAFGSADMYHEADP-----QRVMRFSCLACGDF 159 Query: 163 YMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPK-- 220 + + VD+V R + Q+ ++G+ LS +++ L R+ ++H V P+ Sbjct: 160 AWACDAAGRVDTVVRRLRMSARQMAQRYGEARLSRRVRRMLRRDPERSVPLVHMVRPRVR 219 Query: 221 ---SLTDKKKDKGNKGFHSKFVSV-----DENRFFEEKQIATFPYIVGRYRVRADEIYGR 272 K G G + + S+ E FP++ R+ V +IYGR Sbjct: 220 RNAGEAGKTASGGLGGVNMPWQSLTWETEGAEGLLHEGGFEEFPHLAARWDVAGGDIYGR 279 Query: 273 SPAMEALPTIRRLNETVNELAQFGRLSLH----PPTIAVSEAKQRNFDLKPGYMNIGALS 328 SP M+ LP ++ L E+A+ L++H PP S KQR +L PG N Sbjct: 280 SPGMDVLPDVKML----QEMARSQLLAIHKVVNPPMRVPSGFKQR-LNLIPGGQNY-VTP 333 Query: 329 REGRSLFQPVQFGNP--LPYHEELNRLKESIRSLFLLDLFQVL--DDKASRSAAESMEKT 384 +G S+ Q NP ++ ++ ++R F DLF + + +++ +AAE +E+ Sbjct: 334 GQGESVGPLYQI-NPDIGAVTHKMEDVRRAVREGFFNDLFLMFTAEGRSNITAAEVLERG 392 Query: 385 REKGAFVGPLIGGLQSEFIGAMISRELDI----LDSQGNLPECEGADNPPVSLLKVEYTS 440 EK +GP+I QSE + ++ R I PE G ++VEY S Sbjct: 393 EEKLLMLGPVIERHQSELLDPLLERTYGILRRGGLLPPPPPELAGRS------MRVEYVS 446 Query: 441 PLFKYQQAESVASALQGVNTVVEL-GVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRD 499 L + Q+ + + + + V L GV P +D +D ++ PA ++R Sbjct: 447 ALAQAQRVVTAQAIRRFASDVSALAGVA---PQVLDKVDFEQAVDELAAIAGVPARVVRS 503 Query: 500 TAEVEDIRQQR 510 AEV +R R Sbjct: 504 DAEVATLRAAR 514 >gi|209966578|ref|YP_002299493.1| hypothetical protein RC1_3320 [Rhodospirillum centenum SW] gi|209960044|gb|ACJ00681.1| conserved hypothetical protein [Rhodospirillum centenum SW] Length = 521 Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 121/451 (26%), Positives = 195/451 (43%), Gaps = 53/451 (11%) Query: 59 SLLSSLITPPGQKWHGLAES--FSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRS 116 SLL+ L TPP +W GLA SA + L V ++ + L +RS Sbjct: 86 SLLAQL-TPPWSRWAGLAPGPDLSAAERAL---------VAPLLERASADLQAHLDRSN- 134 Query: 117 GFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVY 176 F + VV GTGC +E G +R+ +VPL+++ + + +D+V+ Sbjct: 135 -FAVEAHQAFLDVVTGGTGCLLVEEA--PPGAPSALRFTAVPLADLVLEEGAEGRLDTVF 191 Query: 177 REFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHS 236 R T T+ Q+ +++G L ++ A + + R ++ AV P G + Sbjct: 192 RRLTPTLAQLAARFGTDALPGALRRRAAADPDARAAVVEAVLPDP-------GGGACRWA 244 Query: 237 KFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFG 296 + D E + A P+I R+ E+YGRSP M+ALP IR N+ V Sbjct: 245 VALEDDPPVLLAEGRFAEPPFIAFRWMKAPGEVYGRSPVMKALPDIRTANKVVE------ 298 Query: 297 RLSLHPPTIAVSEAKQRNFD---------LKPGYMNIGALSREGRS-LFQPVQFGNPLPY 346 L L ++AV+ Q + D L PG + A+ G + L P +F Sbjct: 299 -LVLKNASVAVTGIWQADDDGVLNPGTIRLVPGAIIPKAVGSAGLTPLASPGRFDV---S 354 Query: 347 HEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 L+ L+ IR L D + +A E +E++ E +G G LQSE + + Sbjct: 355 QLVLDDLRAHIRHALLADRLGPVQGP-RMTATEVLERSAEMARMLGATYGRLQSELLVPL 413 Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 + R L +L +G +P+ AD L+ V+ SPL + QQ + L+ + +V LG Sbjct: 414 VRRCLSLLRRRGAVPDL-AADG---RLVAVQILSPLARAQQRRDAEAVLRWLESVTGLGD 469 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAVLI 497 + M +D + +RF A PA L+ Sbjct: 470 -----AAMRAVDLEACARFLADAAGVPAALL 495 >gi|239787361|emb|CAX83837.1| Head-to-tail joining protein [uncultured bacterium] Length = 524 Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 132/533 (24%), Positives = 214/533 (40%), Gaps = 83/533 (15%) Query: 1 MNQRSAKDIQ----DRFNYLKNQRGELNYW---MEELTGFLYPYKNNAQL---------- 43 MN ++ D Q RF + +R N W +E F P + L Sbjct: 1 MNGQNDPDAQRVVLKRFEKARERR---NVWEGHWQECYDFALPSRGGPLLSSQPGAKRTD 57 Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV 103 R++D T + +L++ L + +TPP +W GLA A +E + V E Sbjct: 58 RLFDGTAPDCVDQLAASLLAQLTPPWAQWFGLA----AGPDLTPEEREVAAPVLEKAGAA 113 Query: 104 TDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVY 163 + F RS F + Y +V GT E G R+ ++PL+ + Sbjct: 114 LQSHF-----DRSNFAIEMHQCYLDLVTAGTASLLFEEA--PLGSASAFRFTAIPLAQLA 166 Query: 164 MSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPK--- 220 + + + +D+ +R T+ I ++ L M + + RF ++ AV P+ Sbjct: 167 LEESVEGRLDTTFRSSEMTISAIRERFPKAQLPESMGRKSKDDADARFKVVEAVLPERHG 226 Query: 221 ----SLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAM 276 ++ D + G + ++ E RF P+I R+ E+YGRSP M Sbjct: 227 YAYHAILDGEGTGGAE-------TLAEGRF------EMSPFINFRWLKAPGEVYGRSPVM 273 Query: 277 EALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQ---------RNFDLKPGYMNIGAL 327 ++LP I+ N+ V L L TIAV+ Q N L PG + A+ Sbjct: 274 KSLPDIKTANKVVE-------LVLKNATIAVTGIWQADDDGVLNPANIKLVPGTIIPKAV 326 Query: 328 SREGRS-LFQPVQFGNPLPYHEELNRLKESIRSLFLLD-LFQVLDDKASRSAAESMEKTR 385 G + L P +F L L++ I L D L Q+ D + +A E +E++ Sbjct: 327 GSAGLTPLETPGRFDI---SQLMLTDLRQRISHALLADRLGQI--DAPNMTATEVLERSA 381 Query: 386 EKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKY 445 E +G G LQSE + ++ R + IL +G +P D + L+ Y SPL Sbjct: 382 EMARLLGATYGRLQSELLTPLVMRAVAILKRRGEIPGLS-IDGHQIELI---YKSPLANE 437 Query: 446 QQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIR 498 + E + LQ + V+ G P +D +R+ A N PA L+R Sbjct: 438 RGREDAKNTLQWLTAVMSFG-----PPANQVVDLGAAARWLAKALNVPAELLR 485 >gi|85059667|ref|YP_455369.1| hypothetical protein SG1689 [Sodalis glossinidius str. 'morsitans'] gi|84780187|dbj|BAE74964.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 517 Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 131/530 (24%), Positives = 216/530 (40%), Gaps = 55/530 (10%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKN--------NAQ------LRMW 46 M++ + K I R + LK+ R E + YP + +AQ ++ Sbjct: 1 MDELAVKLIT-RADALKSHRQRHESVWSECYDYTYPLRGAGFSADVLDAQSAKSKVAKLL 59 Query: 47 DTTGSEACIKLSSLLSSLITPPGQKWHGL-AESFSAYQAFLYKEDARSKKVREWCDQVTD 105 D T +++ L+S L S +TP +W L ES L ED + W Sbjct: 60 DGTATDSARMLASALMSGMTPANAQWLNLDCES-------LADED------KAWLSTCAT 106 Query: 106 TLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYM- 164 ++ + G ++ VV G Y +DE E G + PLS Y+ Sbjct: 107 LVWENIHAANFDAEGYEENL--DVVCAGWFVLY----IDENREEGGYTFQQWPLSQCYVA 160 Query: 165 SVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD 224 S +VD++YR + T +Q ++++G+ +S K++ A +++F +HA++P++ Sbjct: 161 STRKDGIVDTIYRCYQMTAEQAIAEFGEAGVSEKIRRAARDKPDDKFDFLHAIFPRTNYG 220 Query: 225 KKKDKGNK-GFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 F S V R E FP V R+ YG P +ALP + Sbjct: 221 VNACLAKHLRFASFHVERQGKRIVRESGYHEFPVCVPRWMKIPGGAYGIGPVYDALPDCK 280 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP 343 LNET L++ I+ + + +K G I S +P+ G Sbjct: 281 ELNETKRMEKAAQDLAISGMWISEDDGVINPYSVKVGPRRIIVASSVNS--MKPLLTGAD 338 Query: 344 --LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 + + E +RL+ SIR + + D Q D A +A E + +GP+ G Q+E Sbjct: 339 FQVAFTAE-DRLQASIRKIMMADQLQPQDGPA-MTATEVHVRVALIRQLLGPVYGRFQAE 396 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASALQG 457 ++ ++ R I G P PP S+ V Y SPL + Q+ E V + + Sbjct: 397 YLQPLVERCFGIAFRAGVFPP------PPDSMQTAHFNVLYISPLARAQKLEDVTAVERL 450 Query: 458 VNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIR 507 V +L + P +D +DTD +R A PA +IR A+V +R Sbjct: 451 GANVAQLSQVS--PEVVDLVDTDEATRVVADALGVPAKVIRSAADVTSLR 498 >gi|262043408|ref|ZP_06016533.1| hypothetical protein HMPREF0484_3551 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039234|gb|EEW40380.1| hypothetical protein HMPREF0484_3551 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 515 Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust. Identities = 118/475 (24%), Positives = 185/475 (38%), Gaps = 50/475 (10%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGL-AESFSAYQAFLYKEDARSKKVREWCDQ 102 R+ D T +++ L+S L S +TP +W L +ES +DA + W Sbjct: 57 RLLDGTATDSARMLASALMSGMTPANAQWLNLDSESLP--------DDAAA-----WLS- 102 Query: 103 VTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNV 162 T + + F VV G Y +DE E G + PL+ Sbjct: 103 -TCATLVWENIHAANFDAEGYEANLDVVCAGWFALY----IDEDREEGGFSFQQWPLAQC 157 Query: 163 YM-SVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPK- 220 Y+ S +VD++YR + T +Q + ++G +S K+ A A+ +++F +H ++P+ Sbjct: 158 YVTSTRRDGIVDTIYRRYQLTAEQAIKEFGADKVSKKISDAAAKKPDDKFEFLHCIFPRE 217 Query: 221 SLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALP 280 + + N F S V V E FP V R+ YG P +ALP Sbjct: 218 NYVVNARLAKNLRFASYNVEVSGKLIVRESGYHEFPCCVPRWMKIPGTPYGIGPVYDALP 277 Query: 281 TIRRLNETVNELAQFGRLSLHPPTIAVSEA--KQRNFDLKPGYMNIGALSREGRSLFQPV 338 + LNET L++ IA + R + P + + + L Sbjct: 278 DCKELNETKRMEKAAQDLAIAGMWIAEDDGVLNPRTVKVGPRRIIVANSVDSMKPLLTGA 337 Query: 339 QFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGL 398 F E RL+ SIR + + D Q D A +A E + +GP+ G Sbjct: 338 DFNVAFTAEE---RLQASIRKIMMADQLQPQDGPA-MTATEVHVRVALIRQLLGPVYGRF 393 Query: 399 QSEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASA 454 Q+E++ ++ R + G P P SL V Y SPL + QQ Sbjct: 394 QAEYLQPLVERCFGLAFRAGVFPPA------PESLQNANFNVRYISPLARAQQ------- 440 Query: 455 LQGVNTVVELGVKTGD-----PSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 L+ V + LG + P D +DTD +R A PA +IR + VE Sbjct: 441 LENVTAIERLGANVANLAQVSPDVTDLVDTDEATRVIADALGVPAKVIRSSDAVE 495 >gi|282848877|ref|ZP_06258267.1| hypothetical protein HMPREF1035_1386 [Veillonella parvula ATCC 17745] gi|282581382|gb|EFB86775.1| hypothetical protein HMPREF1035_1386 [Veillonella parvula ATCC 17745] Length = 575 Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 122/478 (25%), Positives = 207/478 (43%), Gaps = 50/478 (10%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLA-ESFSAYQAFLYKEDARSKKVREWCDQ 102 ++ + E+C +S + S +TPP +KW L E+ A + +V E D+ Sbjct: 71 KILNPVAWESCQIFASGVMSGLTPPSRKWFKLTMENIDV---------AANSQVAELLDE 121 Query: 103 VTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNV 162 + L+ ++S F + Y + G + AD E G+R+ S P+ Sbjct: 122 REEILYAVL--AKSNFYSVVHQVYMEL-PMGQAPMGIFADS-----ESGVRFTSYPIGTY 173 Query: 163 YMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNEN---ERFTIIHAVYP 219 +S N + +V+ R++ TVDQIV ++G + +K+ + N N + FT+ V P Sbjct: 174 AISTNSKEIVNIFGRKYKMTVDQIVEQFGYENCPDNIKN-IYDNGNSLQQSFTVNWLVEP 232 Query: 220 KSLTDKKKDKGNKGFHSKF----VSVDENRF---FEEKQIATFPYIVGRYRVRADEIYGR 272 K + N + S + + DE + FEE +P + R+ YG+ Sbjct: 233 NKDRKDKLGRRNMPYSSIYWVEGSNSDEVLYHGGFEE-----WPIPIARHTSMDLNGYGK 287 Query: 273 SPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGR 332 A A P + L + + L + PP A S+ +L PG G EG+ Sbjct: 288 GAAWFAQPDSQMLQKLEFDYLTAVELGVKPPMQAPSDVIS-TVNLYPG----GITEIEGQ 342 Query: 333 SLFQP---VQFGNPLPYHEELNRLKESIRSLFLLDLFQVLD--DKASRSAAESMEKTREK 387 +P VQ N ++ ++SI+ + DLF +LD DK +A E ME+T+EK Sbjct: 343 HKVEPMFAVQ-SNLQDIQNKIAVTEDSIKRAYSADLFLMLDQIDKGQMTAREVMERTQEK 401 Query: 388 GAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGA---DNPPVSLLKVEYTSPLFK 444 +GP++ L SEF+ +I R +LD G P E D +K+EY SPL + Sbjct: 402 LQQLGPVVERLLSEFLNPIIERVYAVLDRAGVFPPVEDEELLDQLNGQEVKIEYISPLAQ 461 Query: 445 YQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAE 502 Q+ S+ + Q ++ L +P+ ++ + + + PA +IR E Sbjct: 462 AQKMSSLVNIEQYFAFIMSLA--QANPNIVNKFNFEEAANTYGVNLGVPAKIIRSDDE 517 >gi|227355860|ref|ZP_03840253.1| tail protein [Proteus mirabilis ATCC 29906] gi|227164179|gb|EEI49076.1| tail protein [Proteus mirabilis ATCC 29906] Length = 554 Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 115/437 (26%), Positives = 196/437 (44%), Gaps = 44/437 (10%) Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 + S IT P + W LA + K E +Q + +F +RS Sbjct: 72 MMSGITSPARPWFRLATPDPDLMDY-----GPVKLWLETTEQRMNEVF-----NRSNLYQ 121 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 L Y + FGT + D + IR + PL + Y++ + VD YR+FT Sbjct: 122 SLPLMYGDLGTFGTAAMAVVEDS-----QRIIRTVHFPLGSYYIANSPSLSVDVCYRKFT 176 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPK-SLTDKKKDKGNKGFHSKF 238 TV Q+V ++G +S +KS ++ ++ ++HAVYP K + +K F S + Sbjct: 177 MTVRQLVMEFGVDSVSDTVKSMWNSSQYSQWIEVVHAVYPNLERQTGKLEAKHKPFKSVY 236 Query: 239 VSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRRLNETVNELAQF 295 + V D + E FP + R+ V +++YG S P M AL + L AQ Sbjct: 237 LEVAGDHEKVLRESGYDEFPIMAPRWEVNGEDVYGSSCPGMLALGGTKALQLMQKRKAQM 296 Query: 296 GRLSLHPPTIAVSEAKQRNFDLKPG---YMNIGALSREGRSLF--QPVQFGNPLPYHEEL 350 +PP + K + + PG Y++ + + +++F QPV L E++ Sbjct: 297 IDKLTNPPLQVPASLKNQRVNTIPGGINYLDEANPTNKIQTIFDVQPVALKALL---EDV 353 Query: 351 NRLKESIRSLFLLDLFQVLDDKASRSAA-ESMEKTREKGAF-VGPLIGGLQSEFIGAMIS 408 ++ I + + +DLF+++ +RS E++ + RE+ +GP++ L SE + +I+ Sbjct: 354 QDTRQLIDTAYFVDLFRMMQMVNTRSMPIEAVVEMREEKLLQLGPVLQRLDSELLDKLIN 413 Query: 409 RELDILDSQGNLP----ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVEL 464 R IL ++ LP E +G D LKVEY S + + Q++ V S + V L Sbjct: 414 RTFSILVNKNLLPVAPDEMQGMD------LKVEYISVMAQAQKSIGVGSIERFAGFVGNL 467 Query: 465 G-VKTGDPSCMDHMDTD 480 VK P +D ++ D Sbjct: 468 AKVK---PEALDKLNAD 481 >gi|23015763|ref|ZP_00055531.1| hypothetical protein Magn03010200 [Magnetospirillum magnetotacticum MS-1] Length = 543 Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust. Identities = 118/475 (24%), Positives = 193/475 (40%), Gaps = 61/475 (12%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV 103 R++D T + +L++ L S +TPP +W GL +A E + + E V Sbjct: 78 RLFDGTAPDCVDQLAASLLSELTPPWAQWFGLTAGDQMPEA----ERDQVAPLLERVAAV 133 Query: 104 TDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVY 163 + F RS F + Y V GT E G R+ SVPL V Sbjct: 134 MQSHF-----DRSNFAIEMHQCYLDAVTGGTASLLFEEAA--PGEASAFRFTSVPLGQVV 186 Query: 164 MSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + +D +R +V + +++ VLS + A A + + R ++ AV P Sbjct: 187 LEEGPAGRLDVTFRRSEMSVAALKARFARAVLSGHLIKAAADDPDLRLGVVEAVIPV--- 243 Query: 224 DKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 +G + + + ++ P++ R+ E+YGRSP M+ALP I+ Sbjct: 244 -----RGGYSYAAVLDDESSDVVLGRGSFSSSPFLNFRWLKAPGEVYGRSPVMKALPDIK 298 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQ---------RNFDLKPGYMNIGALSREG-RS 333 N+ V L L TIAV+ Q N L PG + A+ G + Sbjct: 299 TANKVVE-------LVLKNATIAVTGIWQADDDGVLNPANIKLVPGTIIPKAVGSAGLQP 351 Query: 334 LFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRS--AAESMEKTREKGAFV 391 L P +F L+ L+ IR + D L AS S A E ++++ + + Sbjct: 352 LTAPGRFDT---SQLVLDDLRGRIRHALMGD---KLSQPASPSLTATEVLQRSDDMARLL 405 Query: 392 GPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVS----LLKVEYTSPLFKYQQ 447 G G LQSE + +I R + IL +G + PP+S + ++Y SPL + Q Sbjct: 406 GATYGRLQSELLTPLIMRAIHILRRRGEI--------PPLSVDGRVFDLQYRSPLAQNQG 457 Query: 448 AESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAE 502 + L + + LG P+ + +D +R+ A N P+ L+R +E Sbjct: 458 RRDARNVLSWLGALSSLG-----PAALATVDAAAAARWLGRAFNVPSELVRPASE 507 >gi|295096867|emb|CBK85957.1| Bacteriophage head to tail connecting protein [Enterobacter cloacae subsp. cloacae NCTC 9394] Length = 541 Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 130/531 (24%), Positives = 214/531 (40%), Gaps = 57/531 (10%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKN--------NAQ------LRMW 46 M++ + K I+ R + LK R + E + YP + +AQ ++ Sbjct: 1 MDELAVKLIK-RSDTLKANRQQHESVWRECYDYTYPLRGAGFSDEVLDAQSAKHKVAKLL 59 Query: 47 DTTGSEACIKLSSLLSSLITPPGQKWHGL-AESFSAYQAFLYKEDARSKKVREWCDQVTD 105 D T +++ L+S L S +TP +W L +ES +DA++ W + Sbjct: 60 DGTATDSARMLASALMSGMTPANAQWLNLDSESLP--------DDAKA-----WLSECAT 106 Query: 106 TLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYM- 164 ++ + F VV G Y +DE E G + PL+ Y+ Sbjct: 107 LVW--ENIHAANFDAEGYEANLDVVCAGWFVLY----IDEDREEGGYTFQQWPLAQCYVT 160 Query: 165 SVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKS--L 222 S +VD++YR + T +Q + ++G +S K++ A + +++F +H ++P+ + Sbjct: 161 STRKDGIVDTIYRRYQLTAEQAIKEFGADKVSEKIRDAAKKKADDKFDFLHCIFPRETYM 220 Query: 223 TDKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 D + K N F S V V + E FP V R+ YG P +ALP Sbjct: 221 VDARLAK-NMRFASYNVDVSNKQIVRESGYHEFPCCVPRWMKIPGGSYGIGPVYDALPDC 279 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEA--KQRNFDLKPGYMNIGALSREGRSLFQPVQF 340 + LNET L++ IA + R + P + + + L F Sbjct: 280 KELNETKRMEKAAQDLAISGMWIAEDDGVLNPRTVKVGPRRIIVANSVDSMKPLLTGSDF 339 Query: 341 GNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQS 400 E RL+ SIR + + D Q D A +A E + +GP+ G Q+ Sbjct: 340 SVAFTAEE---RLQASIRKIMMADQLQPQDGPA-MTATEVHVRVALIRQLLGPVYGRFQA 395 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESVASALQ 456 E++ ++ R I G PP SL V Y SPL + Q+ E V + + Sbjct: 396 EYLQLLVVRCFGIAFRAGIFSP------PPESLQNANFNVRYISPLARAQKLEDVTAIER 449 Query: 457 GVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIR 507 V L + D +D +DTD +R A PA +IR + V D+R Sbjct: 450 LGANVANLAGISQD--VVDLIDTDEATRVVADALGVPAKVIRSSDAVADLR 498 >gi|169795385|ref|YP_001713178.1| putative phage related protein [Acinetobacter baumannii AYE] gi|169148312|emb|CAM86177.1| conserved hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 547 Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 116/483 (24%), Positives = 201/483 (41%), Gaps = 51/483 (10%) Query: 45 MWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVT 104 + D+T SEA L S + S TP W F A + + A +W D+V Sbjct: 57 LLDSTLSEATQLLVSSIISGTTPANALW------FKAVPNGV-DDPAELTDGEKWLDEVC 109 Query: 105 DTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYM 164 F +R + + + V G G Y + D G G + + + Y+ Sbjct: 110 Q--FIWRNIHGANYDSEIFDLVLDCVVAGWGVMYADVDRHAGG---GYVFQTWDIGQCYL 164 Query: 165 SVNHQN-VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + Q+ VD++YRE+ T+ +V+++G+ +S K+++ + + ++ V P+ Sbjct: 165 ASTRQDQKVDTLYREYEMTMAALVNEYGENKVSEKVRNTYKSKPDCKVKVLWVVEPRKTG 224 Query: 224 DKKKDK----GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEAL 279 K D+ F S V VDE E FP+++ R+R +YG AL Sbjct: 225 YIKGDRQLMPKEMPFASYHVEVDEKIILRETGYNEFPFVIPRFRKIPHSVYGTGQVSIAL 284 Query: 280 PTIRRLNETVNELAQFGRLSLHPPTIAVSEA--KQRNFDLKPGYM----NIGALSR--EG 331 P + N+ + + + +S V + R L G + ++ +L R +G Sbjct: 285 PDAKTANKLMRDTLRSAEISTLGMYAGVDDGTFNPRTVRLGGGKIIVVNDVNSLKRIDDG 344 Query: 332 RSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFV 391 + Q G L H L+ +IR + D Q D A +A E + + Sbjct: 345 KGY----QVGVDLLAH-----LQGAIRKKMMADQLQPADGPAM-TATEVHVRVDLIRQQL 394 Query: 392 GPLIGGLQSEFIGAMISRELDILDSQGNL---PECEGADNPPVSLLKVEYTSPLFKYQQA 448 GPL G Q+E + ++ R + G + PE N L ++ S L + QQ Sbjct: 395 GPLYGRWQAELLTPLLERTFGLAYRAGVIGEAPEEMQGRN-----LSFKFISALARSQQL 449 Query: 449 ESVASA---LQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVED 505 E V + LQG+++V EL DPS +D++D D V++ S P ++R +++ Sbjct: 450 EEVTAIERFLQGLSSVAEL-----DPSILDNVDMDAVAQVSGMGLGVPTAILRTQDQIDA 504 Query: 506 IRQ 508 IR+ Sbjct: 505 IRK 507 >gi|294648400|ref|ZP_06725899.1| phage protein [Acinetobacter haemolyticus ATCC 19194] gi|292825705|gb|EFF84409.1| phage protein [Acinetobacter haemolyticus ATCC 19194] Length = 558 Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 114/497 (22%), Positives = 204/497 (41%), Gaps = 65/497 (13%) Query: 38 KNNAQLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVR 97 + A+ ++DTT E L S + S T P W +++ D S+ Sbjct: 51 RKQARTDLFDTTSVEGIQLLVSSIVSGTTSPVSIW---------FKSVPSGVDTPSQLTE 101 Query: 98 --EWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYI 155 +W QV F FR S F + F T +V G Y + + EKG G + Sbjct: 102 GEQWLSQVDQ--FLFRNIHASNFDSEVTDFLTDLVVAGWAVLYADTN-REKG---GFTFN 155 Query: 156 SVPLSNVYMSVNHQN-VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTII 214 + + N Y+S N ++D++YREF + +QIVS++G +S K+++AL + +++FT++ Sbjct: 156 TWSIGNCYISSTQANGLIDTIYREFELSAEQIVSEFGIDNVSDKVRTALEKKPDQKFTLV 215 Query: 215 HAVYPKSLTDKKKDKGNKG--------FHSKFVSVDENRFFEEKQIATFPYIVGRYRVRA 266 A++P+ D K KG +G F S + +E FP +V R++ Sbjct: 216 QAIFPR---DSKLIKGEEGKRVSTSMPFASYTIEAQSKHILKESGFEEFPCVVSRFKKIP 272 Query: 267 DEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGA 326 D YG + + N+ + Q L+L IA Q + ++ P + I Sbjct: 273 DSHYGLGMGSMVISDAKTANQIMKLSLQTAELNLGGLWIA-----QNDGNINPHTLRIRP 327 Query: 327 LSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFL-LDLFQVLDDKASR---------- 375 + N + + + RL S+ L LD Q K R Sbjct: 328 ---------NAIIAANTV---DSIKRLDTGSASVGLGLDFLQHFQAKIKRTLMSDQLTPQ 375 Query: 376 -----SAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPP 430 +A E + + +G + +QSE++ ++ R + G LP + Sbjct: 376 GSSPLTATEIQARVQVYRNQLGSIFSRMQSEYLQVLLERTWGLAMRSGVLPPAP-EELMQ 434 Query: 431 VSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWAT 490 S + + +P+ Q+ E V +A+Q + V + D + MD+++ D + + A Sbjct: 435 ASRISFNFINPMAASQKLEWV-TAIQNLMLNVSQMAQI-DQTVMDNLNLDAMVQVMADAL 492 Query: 491 NTPAVLIRDTAEVEDIR 507 + P IR E+ ++R Sbjct: 493 SVPVEAIRTDEEIAELR 509 >gi|293609619|ref|ZP_06691921.1| predicted protein [Acinetobacter sp. SH024] gi|292828071|gb|EFF86434.1| predicted protein [Acinetobacter sp. SH024] Length = 547 Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 113/483 (23%), Positives = 201/483 (41%), Gaps = 51/483 (10%) Query: 45 MWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVT 104 + D+T SEA L S + S TP W F A + + A + +W D+V Sbjct: 57 LLDSTLSEATQLLVSSIISGTTPANALW------FKAVPNGV-DDPAELTEGEKWLDEVC 109 Query: 105 DTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYM 164 F +R + + + V G G Y + D G G + + + Y+ Sbjct: 110 Q--FIWRNIHGANYDSEIFDLVLDCVVAGWGVMYADVDRHAGG---GYVFQTWDIGQCYL 164 Query: 165 SVNHQN-VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + Q+ VD++YRE+ T+ +V+++G+ +S K+++ + + ++ V P+ Sbjct: 165 ASTRQDQKVDTLYREYEMTMAALVNEYGENKVSEKVRNTYKSKPDCKVKVLWVVEPRKTG 224 Query: 224 DKKKDK----GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEAL 279 K D+ F S V VDE E FP+++ R+R + +YG AL Sbjct: 225 YIKGDRQLMPKEMPFASYHVEVDEKNVLRETGYNEFPFVIPRFRKIPNSVYGTGQVSIAL 284 Query: 280 PTIRRLNETVNELAQFGRLSLHPPTIAVSEA--KQRNFDLKPGYM----NIGALSR--EG 331 P + N+ + + + +S V + R L G + ++ +L R +G Sbjct: 285 PDAKTANKLMRDTLRSAEISTLGMYAGVDDGTFNPRTVRLGGGKIIVVNDVNSLKRIDDG 344 Query: 332 RSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFV 391 + Q G L H L+ +IR + D Q D A +A E + + Sbjct: 345 KGY----QVGVDLLAH-----LQGAIRKKMMADQLQPADGPAM-TATEVHVRVDLIRQQL 394 Query: 392 GPLIGGLQSEFIGAMISRELDILDSQGNL---PECEGADNPPVSLLKVEYTSPLFKYQQA 448 GPL G Q+E + ++ R + G + PE N L ++ S L + QQ Sbjct: 395 GPLYGRWQAELLTPLLERTFGLAYRAGVIGEAPEEMQGRN-----LSFKFISALARSQQL 449 Query: 449 ESVASA---LQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVED 505 E V + L G++ V ++ DPS +D++D D V++ S P ++R +++ Sbjct: 450 EEVTAIERFLAGMSNVAQI-----DPSILDNVDMDAVAQVSGMGLGVPTAILRTQDQIDA 504 Query: 506 IRQ 508 IR+ Sbjct: 505 IRK 507 >gi|332875224|ref|ZP_08443057.1| hypothetical protein HMPREF0022_02690 [Acinetobacter baumannii 6014059] gi|332736668|gb|EGJ67662.1| hypothetical protein HMPREF0022_02690 [Acinetobacter baumannii 6014059] Length = 547 Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 113/483 (23%), Positives = 201/483 (41%), Gaps = 51/483 (10%) Query: 45 MWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVT 104 + D+T SEA L S + S TP W F A + + A + +W D+V Sbjct: 57 LLDSTLSEATQLLVSSIISGTTPANALW------FKAVPNGV-DDPAELTEGEKWLDEVC 109 Query: 105 DTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYM 164 F +R + + + V G G Y + D G G + + + Y+ Sbjct: 110 Q--FIWRNIHGANYDSEIFDLVLDCVVAGWGVMYADVDRHAGG---GYVFQTWDIGQCYL 164 Query: 165 SVNHQN-VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + Q+ VD++YRE+ T+ +V+++G+ +S K+++ + + ++ V P+ Sbjct: 165 ASTRQDQKVDTLYREYEMTMAALVNEYGENKVSEKVRNTYKSKPDCKVKVLWVVEPRKTG 224 Query: 224 DKKKDK----GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEAL 279 K D+ F S V VDE E FP+++ R+R + +YG AL Sbjct: 225 YIKGDRQLMPKEMPFASYHVEVDEKIVLRETGYNEFPFVIPRFRKIPNSVYGTGQVSIAL 284 Query: 280 PTIRRLNETVNELAQFGRLSLHPPTIAVSEA--KQRNFDLKPGYM----NIGALSR--EG 331 P + N+ + + + +S V + R L G + ++ +L R +G Sbjct: 285 PDAKTANKLMRDTLRSAEISTLGMYAGVDDGTFNPRTVRLGGGKIIVVNDVNSLKRIDDG 344 Query: 332 RSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFV 391 + Q G L H L+ +IR + D Q D A +A E + + Sbjct: 345 KGY----QVGVDLLAH-----LQGAIRKKMMADQLQPADGPAM-TATEVHVRVDLIRQQL 394 Query: 392 GPLIGGLQSEFIGAMISRELDILDSQGNL---PECEGADNPPVSLLKVEYTSPLFKYQQA 448 GPL G Q+E + ++ R + G + PE N L ++ S L + QQ Sbjct: 395 GPLYGRWQAELLTPLLERTFGLAYRAGVIGEAPEEMQGRN-----LSFKFISALARSQQL 449 Query: 449 ESVASA---LQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVED 505 E V + L G++ V ++ DPS +D++D D V++ S P ++R +++ Sbjct: 450 EEVTAIERFLAGMSNVAQI-----DPSILDNVDMDAVAQVSGMGLGVPTAILRTQDQIDA 504 Query: 506 IRQ 508 IR+ Sbjct: 505 IRK 507 >gi|254251745|ref|ZP_04945063.1| hypothetical protein BDAG_00942 [Burkholderia dolosa AUO158] gi|124894354|gb|EAY68234.1| hypothetical protein BDAG_00942 [Burkholderia dolosa AUO158] Length = 539 Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 120/512 (23%), Positives = 216/512 (42%), Gaps = 52/512 (10%) Query: 45 MWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVT 104 ++D+T ++A L + + S +TP W F + + W D + Sbjct: 59 IFDSTATDAKRTLEASIMSGMTPANSLW------------FTMTVNGADDEGERWLDSAS 106 Query: 105 DTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYM 164 + L+ + + F +V + G F + +DE G+ + P++ VY Sbjct: 107 EVLW--QNIHSANFD---SEAADAVADGMAGWFALY--IDENRDAGGLYFEHWPMAGVYC 159 Query: 165 -SVNHQNVVDSVYREFTFTVDQIVSKW---GDKVLSSKMKSALARNENERFTIIHAVYPK 220 S VD V+R + T +Q V ++ GD + + A + E E + A+YP+ Sbjct: 160 ASSKPGGTVDIVFRCYQLTAEQCVREFNRRGDSLPQEIVDKAKNKPE-ELVDLCQAIYPR 218 Query: 221 SLTDKKKDKG-NKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEAL 279 + + N S + ++ + E P +V R++ + +YG P ++AL Sbjct: 219 DVHMVGALRAKNMPIASVTFACNQKQVIRESGYHEMPVVVARWKKIPNSVYGVGPLLDAL 278 Query: 280 PTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIG---ALSREGRSLFQ 336 P IR LN+ V ++ L L + ++E + L P + +G + + Sbjct: 279 PDIRTLNDIVK--LEYANLDLAVSGMWIAE---DDGVLNPRTVKVGPRKVIVANSVDSMK 333 Query: 337 PVQFGNPLPYHE-ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLI 395 P+Q + E + +L+ IR + D Q D A +A E + +GP+ Sbjct: 334 PLQPASNFQLAETRIEKLQGQIRKTLMADQLQPQDGPA-MTATEVHVRVDLIRQLLGPIY 392 Query: 396 GGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYTSPLFKYQQAESV 451 G LQ+E++ +I+R + G P PP SL V+Y SPL + Q+ E V Sbjct: 393 GRLQAEYLQPLIARCFGLAYRAGVFPP------PPDSLGGRNFSVQYQSPLARAQKLEEV 446 Query: 452 ASA--LQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQ 509 ++ L G TV+ VK P +D++D D R + P ++R + +V RQQ Sbjct: 447 SAIERLMGDVTVIA-QVK---PEALDNIDGDEAVRLTAKNLGVPDSIVRTSDQVTQYRQQ 502 Query: 510 REVQRRVMEEQHLQQQLQ-QTSQDIGAKAAGR 540 ++ ++Q L ++Q + IG+ AA R Sbjct: 503 KQAAAAQQQQQQLGMEVQGDVMKSIGSAAASR 534 >gi|54302247|ref|YP_132240.1| putative head-tail connector protein [Photobacterium profundum SS9] gi|46915668|emb|CAG22440.1| hypothetical protein PBPRB0567 [Photobacterium profundum SS9] Length = 552 Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 108/460 (23%), Positives = 192/460 (41%), Gaps = 42/460 (9%) Query: 96 VREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYI 155 VR + D D + G + S F + S + ++ + E D +R+ Sbjct: 98 VRLYLDTCADLILGML--ASSNFYNVVPSMFMDLLTYSGSSVGFEKDP-----LTVMRFY 150 Query: 156 SVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT-II 214 P+ + + + + V + R+ + V Q+V K+G +S +KSA + + T I Sbjct: 151 PNPIGSYRLGIGPRQNVSTHGRKVEYRVSQVVEKFGLDNVSQSIKSAYRSGKYNQLTEIR 210 Query: 215 HAVY------PKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADE 268 H V+ P++ + +K + + + D N F FP++ R+ V ++ Sbjct: 211 HLVFDNPDFVPRAFSAVRKPICSIWYDP---ADDRNPFLRRSGFDEFPFVTPRWEVIGND 267 Query: 269 IYGR-SPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGAL 327 YG P M AL +I+ L + + + L PP + S K L PG + Sbjct: 268 TYGSFGPGMLALGSIKGLQKDQRDKYEAQDKMLKPPMVGPSSLKNNPRSLLPGAVTF-VD 326 Query: 328 SREGRSLFQPV-QFGNPLPYH-EELNRLKESIRSLFLLDLFQVLDD--KASRSAAESMEK 383 +++G+ F P Q PL Y E + + I S F DLF + D K++ +A E + Sbjct: 327 NQQGQQGFTPAFQTNFPLNYQLESIRDTRAIIDSAFFKDLFLAVIDIGKSNTTATEIAAR 386 Query: 384 TREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSL----LKVEYT 439 EK +GP++ E + ++S ++ +G LPE PP L + +EY Sbjct: 387 KEEKLLMLGPVLNRFNEEGLDPIVSASFYEMNRRGMLPE------PPPELDGVDVNIEYV 440 Query: 440 SPLFKYQQAESVASALQGVNTVVEL-GVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIR 498 L + Q+A ++S + V + L GV+ +D +D D V T T ++ Sbjct: 441 GLLQQAQKAVGISSIERTVGFIGNLAGVRQ---DVLDKVDFDSVVDIYTDITGTTPRILF 497 Query: 499 DTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAA 538 + +V+ R R+ ++Q Q GA+AA Sbjct: 498 NEQQVKATRDA-----RIQQQQREQMAAMAAPAKDGAEAA 532 >gi|167032756|ref|YP_001667987.1| putative tail protein [Pseudomonas putida GB-1] gi|166859244|gb|ABY97651.1| putative tail protein [Pseudomonas putida GB-1] Length = 564 Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 112/535 (20%), Positives = 211/535 (39%), Gaps = 59/535 (11%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQL-----------RMWDTTGSEACIKLS 58 + R + LK +R + +E++ F+ P ++ ++ + + A + Sbjct: 11 EKRLSALKTERSSWDTNAKEISDFILPMRSRVMCDDTNRGDRRNNKIINNRATMASRTTA 70 Query: 59 SLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREW---CDQVTDTLFGFRERSR 115 S + S IT P + W LA A F V+ W C Q +F R Sbjct: 71 SGMMSGITSPARPWFNLAPVARAIMEF--------GPVKSWFYECTQRMRDVF-----LR 117 Query: 116 SGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSV 175 S L + Y + FGTGC +++ D IR + Y+S ++ Sbjct: 118 SNLYQVLPTCYQEMATFGTGCIWVDEHPDTV-----IRCEAFTWGEYYISNGADGRAAAI 172 Query: 176 YREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFH 235 YREF +TV+Q+V ++G + LS K+ N ++F ++ G++ Sbjct: 173 YREFKWTVNQLVQEFGVEALSPSSKALYENNNGDQFISCAQRVELNMNANPDRAGSRNLP 232 Query: 236 SKFVSVDE----NRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNE 291 ++ + + E++ FP + R+ + YG P L ++ L + Sbjct: 233 FSALTWEAGAPGDMVLEDRGYHEFPAMAVRWESMPGDAYGTGPGRICLGDVKALQLYERQ 292 Query: 292 LAQFGRLSLHPPTIAVSEAKQRNFDLKPG---YMNIGALSREGRSLFQPVQFGNPLPYHE 348 A+ +PP A E K + PG Y+ + + ++QP P Sbjct: 293 AARMTETGANPPLQAPVELKGQPSSTIPGGVTYVPMVGGQNQMAPIYQP-NAAWLSPIQA 351 Query: 349 ELNRLKESIRSLFLLDLFQVLDD-KASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 ++ + I F +DLF ++ R+A E + EK +GP++ + E + +I Sbjct: 352 KIQEHEGRINEAFFVDLFLMVSQLDTVRTATEIAARKEEKMLMLGPVLERINDELLDPLI 411 Query: 408 SRELDILDSQGNLPECEGADNPPV-------------SLLKVEYTSPLFKYQQAESVASA 454 R +I+ Q ++P G + S ++ EY S L + Q++++V Sbjct: 412 DRTFNIMLRQ-SIPIWAGIIDGDPLLPPPPEELINANSEIQAEYVSILAQAQKSQNVL-G 469 Query: 455 LQGVNTVVELGVKTGD-PSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQ 508 L+ T+ G +G P +D +++D++ A ++R EV IR+ Sbjct: 470 LERFATLA--GNLSGAFPEVLDKVNSDQLIEEYADAIGVIPTVVRGADEVAAIRE 522 >gi|291334523|gb|ADD94176.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201] gi|291334657|gb|ADD94304.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695] gi|291334711|gb|ADD94357.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C890] gi|291336437|gb|ADD95992.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073] Length = 193 Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 56/187 (29%), Positives = 98/187 (52%), Gaps = 8/187 (4%) Query: 138 YMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSS 197 ++E D DE L+ R+I+ ++++ N + +D+V+R+F+ + ++ K+GD +S Sbjct: 2 FIEED-DEDILKFSTRHIN----EIFIAENDKGRIDTVFRKFSLSARAVMQKFGD--VSI 54 Query: 198 KMKSALARNENERFTIIHAVYPKSLTD-KKKDKGNKGFHSKFVSVDENRFFEEKQIATFP 256 + + ++ E I+HAVYP+S D +K+DK N F S ++ + FP Sbjct: 55 NIATKAKKDPYEEVEIMHAVYPRSDFDPRKQDKENMPFESVYLDAESGDELSVSGFREFP 114 Query: 257 YIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFD 316 ++V RY + EIYGRSPAM ALP ++ LNE + + + PP + + Sbjct: 115 FVVPRYLKASHEIYGRSPAMTALPDVKMLNEMSKTTIKSAQKQVDPPLLVPDDGFMLPVR 174 Query: 317 LKPGYMN 323 PG +N Sbjct: 175 TIPGGLN 181 >gi|48696640|ref|YP_024419.1| hypothetical protein VP2p04 [Vibrio phage VP2] gi|48696684|ref|YP_024978.1| hypothetical protein VP5_gp03 [Vibrio phage VP5] gi|40806147|gb|AAR92065.1| hypothetical protein [Vibrio phage VP5] gi|40950038|gb|AAR97629.1| hypothetical protein [Vibrio phage VP2] Length = 547 Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 123/545 (22%), Positives = 214/545 (39%), Gaps = 80/545 (14%) Query: 9 IQDRFNYLKNQRGELNYWMEELTGFLYPYKN--------------NAQLRMWDTTGSEAC 54 I R ++LK R + + + ++ P ++ N ++D+T + Sbjct: 6 IVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGL 65 Query: 55 IKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERS 114 LSS L +T P KW LA F KE + R+W + T ++ + S Sbjct: 66 ETLSSSLHGSLTSPATKWFELA--------FRDKELNSDDECRKWLENATHDVYSALQDS 117 Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDS 174 F Y + +G E D DE+G + + S P+ + Y + + V + Sbjct: 118 --NFNLEANETYIDLCGYGNAIMVEEEDEDEEG---SVVFQSSPIQDSYFEEDSRGQVVN 172 Query: 175 VYREFTFTVDQIVSKWGDK------VLSSKMKSALARNENERFTIIHAVYPKSLTDKKKD 228 YR F +T QI ++GD+ + +K S A + E + Y DKK++ Sbjct: 173 FYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRY-----DKKQN 227 Query: 229 KG--------NKGFHSKFVSVDEN-RFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEAL 279 + + F K++ + + EE P R+R A +G P+ AL Sbjct: 228 RNAGTVLAPTERPFGKKWILKEGAVQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLAL 287 Query: 280 PTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQ 339 P + N V EL + P I V+E G ++ L G ++ + ++ Sbjct: 288 PDVLTANRYV-ELVLRSSEKVIDPAIMVTER---------GLISDIDLGASGLTVVRDME 337 Query: 340 FGNPLPYHE-------ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVG 392 P +L L+ ++R ++ +D Q + D + +A E + +G Sbjct: 338 SMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQLQ-MKDSPAMTATEVQVRYELMQRLLG 396 Query: 393 PLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLK-------VEYTSPLFKY 445 P +G L+++F+ MI R +I G L E P LL+ + YT PL + Sbjct: 397 PTLGRLENDFLSPMIQRTFNIRFRAGKLGEL------PSKLLESGKAAMDIVYTGPLSRA 450 Query: 446 QQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVED 505 Q+ + AS + + +L +P +D D D + R P L+R A+V Sbjct: 451 QKIDQAASIERWAGSTAQLA--EINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTS 508 Query: 506 IRQQR 510 IR+ R Sbjct: 509 IRKNR 513 >gi|260557979|ref|ZP_05830191.1| Bbp21 [Acinetobacter baumannii ATCC 19606] gi|260408489|gb|EEX01795.1| Bbp21 [Acinetobacter baumannii ATCC 19606] Length = 555 Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 104/463 (22%), Positives = 189/463 (40%), Gaps = 43/463 (9%) Query: 37 YKNNAQLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKV 96 + +A ++ D TG ++ L++ + S P +KW L + + Q + +V Sbjct: 41 HDRSAWSKIVDNTGKDSLKTLAAGMVSGTCSPSRKWFTLQAADESLQ--------KDIEV 92 Query: 97 REWCDQVTDTLF-GFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYI 155 R+W V D + F S+S + Y FG G A E G + + Sbjct: 93 RQWLKAVEDACYVAF---SKSNVYRTVHHIYMQEGAFGIGA----ALAPEHGRNSKAQLM 145 Query: 156 S-VPLS--NVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALA-RNENERF 211 +PL+ ++ + N + VYR+F T +V +G +S +K+A +N + F Sbjct: 146 DLIPLTFGEFAITTDEFNKPNGVYRKFKLTSINMVKYFGLDNVSDAIKNAFENKNYEQEF 205 Query: 212 TIIHAVYPKSLTDKKKDKGNKGFHSKFVS-VDENRFFEEKQIATFPYIVGRYRVRADEIY 270 + HA+Y + + K N F S + ++ E + F I GR+ V + ++Y Sbjct: 206 EVCHAIYER-VDAKGYGPKNMPFASIYYEPSSSDKLLRESGLMGFQVICGRWTVSSSDVY 264 Query: 271 GRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSRE 330 G PA + + +R L + ++A + PP + K + P G + Sbjct: 265 GEGPASDCIGDLRALQKGHQQIAVGVDYQVRPPLLLPDYLKGHERETLPN----GIAFYQ 320 Query: 331 GRSLFQPVQFGNPLPYHEELN-------RLKESIRSLFLLDLFQVLD--DKASRSAAESM 381 Q Q L +LN + +E ++ F DLF +LD DK +A E Sbjct: 321 ASPTSQVAQVQAMLNVQFDLNGVMAQIAQCQERVKRAFHTDLFMMLDAFDKGKMTATEVY 380 Query: 382 EKTREKGAFVGPLIGGLQSEFIGAMISRELD-ILDSQGNLPEC--EGADNPPVSLLKVEY 438 E+ EK +GP++ E + ++ ++ +L + L + E N V + V Sbjct: 381 ERKSEKMLMLGPVVERQIDELLRPLVEICVERVLANSEYLRQIAPEAIQNADVEINFVSI 440 Query: 439 TSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 + K + + AL + V ++ DP +D +DTD+ Sbjct: 441 LALAQKSSGSAILERALAMIGQVAQV-----DPQVLDKVDTDK 478 >gi|296537022|ref|ZP_06899017.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] gi|296262651|gb|EFH09281.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] Length = 368 Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 87/350 (24%), Positives = 142/350 (40%), Gaps = 33/350 (9%) Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDS 174 RS F + + +V GTG +E G +R+ +VPL + +D+ Sbjct: 36 RSNFAVEMHQAFLDLVVAGTGVLLVEEA--PPGALSALRFTAVPLREAVLEEGESGRLDT 93 Query: 175 VYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGF 234 +YR I +++ VL + + E R ++ AV+P ++G + Sbjct: 94 IYRAMALEAAAIAARYPGAVLPPGLGAGSPAQEAPRHRVVEAVWP--------ERGGSAY 145 Query: 235 HSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQ 294 + E + P+I R+ E YGR P M+ALP IR N+ V Sbjct: 146 LAVLEHDGRAWPLAEGRFQDSPFIAFRWLKAPGEAYGRGPVMKALPDIRTANKVVE---- 201 Query: 295 FGRLSLHPPTIAVSEAKQRNFD--LKPGYMNI--GAL--SREGRSLFQPVQF-GNPLPYH 347 L L +IA + Q D L P + + GA+ G S P+ GN Sbjct: 202 ---LVLKNASIAATGIWQAEDDGVLNPATVRLVPGAIIPKAPGSSGLTPLAAPGNFDVSQ 258 Query: 348 EELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 L+ L+ IR+ L D A+ +A E +E++ + +G G LQ+E + +I Sbjct: 259 LVLDDLRGRIRAALLADRLGP-PGTAAMTATEVLERSAQTARLLGATYGRLQAELLTPLI 317 Query: 408 SRELDILDSQGNLPE--CEGADNPPVSLLKVEYTSPLFKYQQAESVASAL 455 R L IL +G +P +G + ++ Y SPL + Q A+ L Sbjct: 318 GRCLSILRRRGEVPPLLLDGREA------RLTYHSPLARVQGRSDAANTL 361 >gi|292670769|ref|ZP_06604195.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] gi|292647390|gb|EFF65362.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] Length = 567 Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 113/485 (23%), Positives = 190/485 (39%), Gaps = 64/485 (13%) Query: 45 MWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVT 104 + D EA K ++ L S +T P + W L KE A V+ W ++ Sbjct: 69 LLDPYPMEASGKCAAGLHSGLTSPSRPWFALG--------LQDKELAEYHTVKLWLEECQ 120 Query: 105 DTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYM 164 D L G ++S L + + +FGTG + D + G+ Sbjct: 121 DVLMGIY--AKSNIYNMLLNIEAELTQFGTGAALLLEDFNT-----GVWARPYTCGEYAG 173 Query: 165 SVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSAL-ARNENERFTIIHAVYPKSLT 223 +V+ + V R+F Q+V ++G+ V+S +++A A+N + F + L Sbjct: 174 NVDARGRVVQFARKFKLNAWQMVDEFGEDVVSDAVRNAYRAKNLKDYFPVTM------LI 227 Query: 224 DKKKD---KGNKGFHSKFVSVDENRFFEEKQIATF---------PYIVGRYRVRADEIYG 271 +K D N + K+ S +FE+ Q F P+++ R+ V A+ IYG Sbjct: 228 EKNADYNPDSNALLNFKYKSY----YFEDSQTDVFLKVSGYHEVPFLMPRWTVIANGIYG 283 Query: 272 RSPAMEALPTIRRLN--ETVNELAQFGRLSLH---PPTIAVSEAKQRNF-----DLKPGY 321 P AL +L E +N RL H P I S + N L P Sbjct: 284 VGPGHNALGNCMQLQKIEKINM-----RLLEHRSDPALIVPSSVGKVNRLPGKETLVPDS 338 Query: 322 MNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVL--DDKASRSAAE 379 M G R L++ G+ + + ++ I + F DLF +L D +A E Sbjct: 339 MINGI-----RPLYEAT--GDRGEVMQTIQYKQQQIGAAFYNDLFVMLAQQDNPQMTARE 391 Query: 380 SMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYT 439 E+ EK + P++ + +E + + R +I G LP +K E+ Sbjct: 392 VAERHEEKLLMLSPVLEQMHNEVLAPLTRRAFEICYRNGLLPPLPEELRGQEGSIKAEFI 451 Query: 440 SPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRD 499 S L + Q+A V + + + P MD++D D R + TP ++RD Sbjct: 452 SLLAQAQKA--VGTNAMEKTLAIAGNLMGASPEIMDNLDLDAAIREHAQMSGTPETIMRD 509 Query: 500 TAEVE 504 +V+ Sbjct: 510 EQDVQ 514 >gi|325971684|ref|YP_004247875.1| hypothetical protein SpiBuddy_1857 [Spirochaeta sp. Buddy] gi|324026922|gb|ADY13681.1| hypothetical protein SpiBuddy_1857 [Spirochaeta sp. Buddy] Length = 571 Score = 76.6 bits (187), Expect = 9e-12, Method: Compositional matrix adjust. Identities = 84/371 (22%), Positives = 152/371 (40%), Gaps = 37/371 (9%) Query: 161 NVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPK 220 + ++ N +D+++ FT T + ++ DK + ++ + + A+YP+ Sbjct: 183 DFWIDKNANGKIDTIFIRFTMTSADALDRFKDKTPPNILRDVETDAGHNEHEFVLAIYPR 242 Query: 221 SLTDKKKDKGNKGFHSKFVSVD----ENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAM 276 +K K F +V E+ EE FP V + YG M Sbjct: 243 KKLRSEKGKVLISTEKPFAAVTYYPVEDCIVEESGYDDFPVAVHVFEQDGTSAYGMGLVM 302 Query: 277 EALPTIRRLN-------ETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR 329 + L ++RLN ETV ++A+ P +++ E+ + F PG N Sbjct: 303 KYLTELKRLNSMSRDHLETVQKVAK--------PPMSIPESLKGRFSGDPGARNYMGNMD 354 Query: 330 EGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVL--DDKASRSAAESMEKTREK 387 + Q VQ L +E+ L+E I LF DLF L DK +A ++ E+ Sbjct: 355 AKPEIIQTVQDIGWL--SQEITELEEKIGRLFFNDLFNYLMRQDKV-LTATQTQAIKSEE 411 Query: 388 GAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKV-------EYTS 440 A + ++G Q I ++ R I+ LP+ PP LL++ + Sbjct: 412 LALLASILGTTQYMKINPIVKRVFRIMVKGNRLPK------PPKELLRIKNALMRIDLDG 465 Query: 441 PLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDT 500 PL K + ++ LQ ++ + +D+++TD R + A P ++R+ Sbjct: 466 PLAKNVKMFAMQDGLQASLEWMQALHAMQMTNTLDNINTDIFVRKAFIAAGMPQSVLREL 525 Query: 501 AEVEDIRQQRE 511 EVE +R+Q++ Sbjct: 526 GEVEQMRKQKQ 536 >gi|46580131|ref|YP_010939.1| hypothetical protein DVU1721 [Desulfovibrio vulgaris str. Hildenborough] gi|46449547|gb|AAS96198.1| hypothetical protein DVU_1721 [Desulfovibrio vulgaris str. Hildenborough] gi|311233876|gb|ADP86730.1| hypothetical protein Deval_1575 [Desulfovibrio vulgaris RCH1] Length = 550 Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 109/484 (22%), Positives = 191/484 (39%), Gaps = 67/484 (13%) Query: 96 VREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEF-GTGCFYMEADVDEKGLEEGIRY 154 R W D V ++ S G Q+ + +EF G + D + L R+ Sbjct: 100 ARAWLDTVEASI-----NSVLRACGFYQAIHACNMEFLAFGPLLLFQDNSQGAL---CRF 151 Query: 155 ISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTII 214 S + ++++ +D+V R T Q+ ++G L+ L N+ + Sbjct: 152 ESCTVGTWAVALDADGGLDTVVRRLKLTARQMEQRFGRDRLTPATVKLLETNKGHERVEV 211 Query: 215 HAVY-PKSLTDKKK-DKGNKGFHS-KFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYG 271 V P++ + D N F S + + + E PY Y D +YG Sbjct: 212 VHVVRPRTERQHGRIDARNMPFASYMYEATGADDVLSESGYHEMPYFFAAYDDTLD-LYG 270 Query: 272 RSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREG 331 +P + LP +++L E + + ++PPT + KQR ++ PG N Sbjct: 271 SAPGDDCLPDVKQLQELEKQKLVGLQKVINPPTRKPASFKQR-LNVNPGGENA------- 322 Query: 332 RSLFQPVQFGNPL---PYHE---ELNRLKESIRSL-----------FLLDLFQVLDDKAS 374 V G+P P +E +LN+++E I ++ + D+ L K Sbjct: 323 ------VSGGDPHGIGPLYEVRIDLNQVREEIATVVDRIRQTTMASYFADMPLELRPK-D 375 Query: 375 RSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPP---- 430 + E +E+ RE+ +GP + +++ + +I R +LD G LP PP Sbjct: 376 MTYGEYLERKRERLQLMGPSLEAYEAKVLTPVIFRTFALLDRAGMLPP------PPDALG 429 Query: 431 -VSLLKVEYTSPL---FKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFS 486 V+++ + Y SPL + AES + L V + E DP +D +D D+ Sbjct: 430 EVAVVDISYISPLAQALRQTGAESTRALLMDVMQLAE-----ADPGVLDKVDMDQAVDEL 484 Query: 487 LWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKL 546 P ++R +V +RQQR+ + + E Q Q+ Q + A R L Sbjct: 485 AKGIGAPGRVVRSDEDVAAMRQQRD-EAKAREAQ--AQEAITAMQGLAKVAGTRTGPGTL 541 Query: 547 THDM 550 HD+ Sbjct: 542 AHDL 545 >gi|119386466|ref|YP_917521.1| putative head-tail connector protein [Paracoccus denitrificans PD1222] gi|119377061|gb|ABL71825.1| putative head-tail connector protein [Paracoccus denitrificans PD1222] Length = 558 Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 109/477 (22%), Positives = 187/477 (39%), Gaps = 52/477 (10%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV 103 R+ D T A L + L S +T P + W L S D +V++W +V Sbjct: 59 RILDNTAQMALRTLRAGLMSGVTSPSRPWFRLGLRGST-------ADEAEFEVKDWLHEV 111 Query: 104 TDTLFGFRERSR-SGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNV 162 ++ E R S L + Y + +GT + D E+ +R ++ + Sbjct: 112 QRRMY---EVMRGSNIYRMLDTTYGDLGLYGTAANLVVPD-----FEDVVRGHNLQVGRF 163 Query: 163 YMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNEN-ERFTIIHAVYPKS 221 + + V ++YRE V IV WG +S ++ A E + FTI H + ++ Sbjct: 164 RLGEDGNGRVIALYRELKMPVRGIVETWGLDAVSQSVRRAWDTGEYYQTFTICHMIDKRA 223 Query: 222 LTDKKK-DKGNKGFHSKFVSVD--ENRFFEEKQIATFPYIVGRY-RVRADEIYGRSPAME 277 D K + + S + +D +F + P + R+ +V + SP M Sbjct: 224 DGDPKAMQSSGRPWASIYWEMDAPSGQFLQIGGHRVKPLLAPRWEQVEGEAWSASSPGMV 283 Query: 278 ALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQP 337 AL R L + + A + +PP I + F PG A Q Sbjct: 284 ALGDARSLQVSQEQKAIAIQKMHNPPLIGGAVQGGMFFKNVPGGFTAMAT--------QD 335 Query: 338 VQFGNPLPYHE----------ELNRLKESIRSLFLLDLFQV----LDDKASRSAAESMEK 383 + G P +E ++ + + F DLFQ+ LD ++ +A E E+ Sbjct: 336 LSTGGIRPAYEVRPDIQGLIIDIQESQRRVEVAFYKDLFQMTALALDGRSQITAREIAER 395 Query: 384 TREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPEC-EGADNPPVSLLKVEYTSPL 442 EK +GP++ L E + +I + LPE EG P+ KVEY S L Sbjct: 396 HEEKLMALGPVLESLDHELLQPLIEATFAYMQEADILPEAPEGIVGNPI---KVEYISLL 452 Query: 443 FKYQQAESVASALQGVNTVVELG-VKTGDPSCMDHMDTDRVSR-FSLWATNTPAVLI 497 + Q+A + + + + L +K P +D +D +++ R F+ P +L+ Sbjct: 453 AQAQKAIGIGAIERTIGFAGTLAQIK---PDVIDMIDGEQMMREFADQVGGPPGILL 506 >gi|303327895|ref|ZP_07358334.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861721|gb|EFL84656.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 554 Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 105/522 (20%), Positives = 200/522 (38%), Gaps = 64/522 (12%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPYK------NNAQLR---MWDTTGSEACIKL 57 K+++ +L++ R + EL + P + + LR +++ + A K Sbjct: 9 KEVKQLVGHLESLRAKRLAQQRELGRLILPSRGLFQGEDTESLRESNLFNPAANRALRKA 68 Query: 58 SSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSG 117 ++ ++ ITP G W AFL + D + E+ D V + L S G Sbjct: 69 AAGMTQAITPAGNPWF--------KHAFLLRRDREATGGNEYVDTVDNMLRTVL--SAGG 118 Query: 118 FVGCLQSFYTSVVEFGT---GCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDS 174 F + SF ++ FG GC E+ RY +++ +D+ Sbjct: 119 FYRAIHSFNKELLGFGCALLGC--------EESPRTVARYFCQTCGTYCAALDEDGNLDA 170 Query: 175 VYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD-KKKDKGNKG 233 V R T ++ ++G+ LS + L ++ + + H V ++ D ++ D+ N Sbjct: 171 VARRLLMTPRELARRFGEDRLSDVSRQKLKKDSYDPVAVRHVVQRRTARDPERADRSNMP 230 Query: 234 FHSKFVSVDENRFFEEKQIATFPYIVGRYRV---------RADEIYGRSPAMEALPTIRR 284 + S ++EE A F VG +R A +YG P EAL + Sbjct: 231 WGSW--------WYEEGGAADF-LDVGGFRSMPFFFTVWEEARGVYGTGPGDEALADQKG 281 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNP 343 + E G + P + + D PG + G + V FG Sbjct: 282 I-EGWELRKAVGVEKMIDPVLVSQGPLKAYVDTSPGAVIPSGGFGADSLKPLYEVNFGPA 340 Query: 344 LPY-HEELNRLKESIRSLFLLDLFQVLD---DKASRSAAESMEKTREKGAFVGPLIGGLQ 399 + + EE++++ + + + ++F + A + E M++ R +GP + G + Sbjct: 341 VQHVQEEISQISLRLEDVMMANIFASMSLETRPAGMTMTEYMDRRRRSAELMGPTVSGYE 400 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFK-YQQAESVASALQGV 458 + ++ +L+ G LP +P S L V Y SP+ + +Q+ +VA + Sbjct: 401 PRILSPVLENTFGLLEEYGLLPGPPDGLSPFAS-LNVSYQSPMAQMLEQSGAVA-----I 454 Query: 459 NTVVELGVKT--GDPSCMDHMDTDRVSRFSLWATNTPAVLIR 498 ++ EL P D +D ++ PA ++R Sbjct: 455 QSLFELAAPMLRAVPDLADKIDFEQAIDELAQRLGVPASVVR 496 >gi|13186164|emb|CAC33475.1| hypothetical protein [Legionella pneumophila] Length = 519 Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 87/406 (21%), Positives = 169/406 (41%), Gaps = 47/406 (11%) Query: 30 LTGFLYPYKN-NAQLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYK 88 L GFL P + NA + +D T A +L+ + + P GQ+W F+ F Sbjct: 70 LAGFLTPGQQYNADI--YDLTLPIAHKRLADKMLMNMVPQGQQWV----KFTPGDEFGEP 123 Query: 89 EDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSV------VEFGTGCFYMEAD 142 ++ + ++TD F +RS +FY +V V TG Sbjct: 124 GTPLYQRALDATQRMTDHFFKIIDRS---------NFYLAVGESLQDVLISTGII----A 170 Query: 143 VDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYRE-FTFTVDQIVSKWGDKVLSSKMKS 201 ++E + +RY +VP + V + + VD+++R+ + ++ I S W ++ Sbjct: 171 INEGNRKRPVRYEAVPPAQVMFQGDAEGQVDAIFRDWYQVRIENIKSMWPKAEVAK---- 226 Query: 202 ALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGR 261 L + ++ I + +K+ + V E+ +++P++V R Sbjct: 227 -LNKKPEDKVDIWECAWIDYEAPEKER------YQYVVMTSSKDVLLEQSNSSWPWVVYR 279 Query: 262 YRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEA--KQRNFDLKP 319 R EI GR P++ A PT +N+ + + +P +A S++ Q+ F +P Sbjct: 280 MRRLTGEIRGRGPSLSAYPTAATINQALEDELVAAAFQANPMYMAASDSAFNQQTFTPRP 339 Query: 320 GYMNIGALSREGRSLFQPVQFGNPLPYHEEL-NRLKESIRS-LFLLDLFQVLDDKASRSA 377 G + + +G +P + + ++ L N ++ I L+ L V + +R+A Sbjct: 340 GSI-VPVQMVQGEWPIKPFEQSGNIQFNALLVNDFRQQINELLYAFPLGAV--NSPTRTA 396 Query: 378 AESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPEC 423 E+ + E ++ LQ+EF +I R L +++ LPE Sbjct: 397 TEAEIRYTENLESFSAMVPRLQNEFFIPVIQRTLWVINKV--LPET 440 >gi|253583086|ref|ZP_04860294.1| predicted protein [Fusobacterium varium ATCC 27725] gi|251834978|gb|EES63531.1| predicted protein [Fusobacterium varium ATCC 27725] Length = 517 Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 63/254 (24%), Positives = 111/254 (43%), Gaps = 22/254 (8%) Query: 131 EFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYR-EFTFTVDQIVSK 189 E GTGC+ E EK R+ VPL+ + + + Q+ + V+R F +++ I S Sbjct: 142 ELGTGCWKYEEQNSEKV---PFRHQYVPLNELLFNEDLQHRPNIVFRYNFKYSLWDIRSL 198 Query: 190 WGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDE--NRFF 247 + LS NENE T+I V P + TD +++ DE + Sbjct: 199 YKKADLSC----YDGINENEEVTVIECVMPVAETDT----------FEWILFDERMDNVL 244 Query: 248 EEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAV 307 K PY + R+ V + ++GR + L RL N A+ + PP + V Sbjct: 245 YRKIYNYNPYTIFRFTVMPNNVWGRGLGVTCLDYYERLCYCENLRARQSIRIVEPPLLLV 304 Query: 308 SEAKQRN-FDLKPGYMNIGALSREGRSLFQPVQ-FGNPLPYHEELNRLKESIRSLFLLDL 365 + + + FDL P +N G G++ P+ G LP +++ R + I+++ + Sbjct: 305 GDKRLIDGFDLDPNGLNWGGDGITGQANAVPMNTTGTLLPLDQDIQRYTQVIQAIHFNNP 364 Query: 366 FQVLDDKASRSAAE 379 ++++ +R AE Sbjct: 365 MGSVENRTTRGNAE 378 >gi|212703247|ref|ZP_03311375.1| hypothetical protein DESPIG_01289 [Desulfovibrio piger ATCC 29098] gi|212673291|gb|EEB33774.1| hypothetical protein DESPIG_01289 [Desulfovibrio piger ATCC 29098] Length = 552 Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 101/506 (19%), Positives = 189/506 (37%), Gaps = 45/506 (8%) Query: 56 KLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSR 115 K ++ ++ ITP W FL + D E+ D V + + Sbjct: 65 KAAAGMTQAITPASSPWF--------RHQFLDRADREVTGGNEYVDVVDARIRAVL--AA 114 Query: 116 SGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSV 175 GF + +F ++ FG C + D + + R+ ++++ + V Sbjct: 115 GGFYSAIHAFNRELLGFG--CALLSCDASARTVA---RFACQTCGTYAVALDEDRTLSCV 169 Query: 176 YREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK-DKGNKGF 234 R T ++ ++G L + L ++ V + D ++ D N F Sbjct: 170 VRRLRMTPVEMSRRFGRDRLCEATRQKLESQPYAPIEVVQVVRKREERDPERGDNRNMPF 229 Query: 235 HSKFVSVDE--NRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNEL 292 S F D+ E + P+ + A +YG P +AL + + Sbjct: 230 AS-FWYEDQGGTELLRESGFRSMPFFFSTWE-DARGVYGTGPGDDALADQKGIEAWEKRK 287 Query: 293 AQFGRLSLHPPTIAVSEAKQRNFDLKPG-------YMNIGALSREGRSLFQPVQFGNPL- 344 A + + PP +A K R+ PG Y AL R L++ V FG + Sbjct: 288 AVGIEMMIQPPLLAPGTLK-RHVRAMPGSVISDTAYGQSNAL----RPLYE-VNFGPAVG 341 Query: 345 PYHEELNRLKESIRSLFLLDLFQVLD---DKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 +E+ ++ + + ++F + A + E M++ R +GP + + Sbjct: 342 AVQQEIEQISMRLEDVMKANIFANMSLETRPAGMTMTEYMDRRRRAAELMGPTVSSYEPR 401 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + I R +LD +G LP +P + L V Y SP+ + + + S Q ++ V Sbjct: 402 VLTLCIERVYQLLDEEGLLPPPPQGLSP-WATLNVSYQSPMAQMLEQAAAVSIGQFMDQV 460 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQH 521 P+ +D +D D++ PA +IR +V IRQQRE ++ Sbjct: 461 GPWA--QSQPTILDKLDLDQMVDELAQRLGVPASIIRSDEQVAAIRQQREQAAAAQQQAA 518 Query: 522 LQQQLQQTSQDIG-----AKAAGRAM 542 ++ Q+ ++ +G AG+ M Sbjct: 519 MEVQMMESMAKMGNVKTEGTVAGKVM 544 >gi|307946242|ref|ZP_07661577.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307769906|gb|EFO29132.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 519 Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 75/333 (22%), Positives = 141/333 (42%), Gaps = 29/333 (8%) Query: 155 ISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALA-RNENERFTI 213 ISVP+ + + N + +++ + +V + W + +K L + E E Sbjct: 152 ISVPIEELLIENGPNNRISAIFWKRKMSVRVLQDTWPEGKFGENLKKLLKEKPEGEIDVN 211 Query: 214 IHAVY-PKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGR 272 + V+ PK + NK V +E+R T P++ RY E YGR Sbjct: 212 VDTVWVPKERRWRMIVWCNK--QETAVFQNESR--------TCPWLFARYFRVPGEAYGR 261 Query: 273 SPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGA---LSR 329 P M A+PTI+ LN Q +++ V + F+ + GA ++R Sbjct: 262 GPVMLAMPTIKTLNTAARLQLQAAAIAMLGIYTTVDDGV---FNPDLASLEPGAFWKVAR 318 Query: 330 EGRSLFQPV-QFGNPL--PYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTRE 386 G +L + +F +P + LN ++ +++ ++D D A RSA E +E+ + Sbjct: 319 NGGALGPSINRFPDPRLDLSNLVLNDMRMGVKAT-MMDQSLPADGAAVRSATEILERVKR 377 Query: 387 KGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVS--LLKVEYTSPLFK 444 + G L E + + R ++I ++G + +D P+ L++V SPL Sbjct: 378 LASDHLGAYGRLVKEIVIPAVKRAMEIAYNKGLI-----SDEIPIDQLLVRVRVKSPLAL 432 Query: 445 YQQAESVASALQGVNTVVELGVKTGDPSCMDHM 477 ++A+ V +Q + V+ +G G P + + Sbjct: 433 AREAQRVEKVIQWLQMVISIGAAVGQPGFLQQI 465 >gi|157828579|ref|YP_001494821.1| hypothetical protein A1G_03995 [Rickettsia rickettsii str. 'Sheila Smith'] gi|157801060|gb|ABV76313.1| hypothetical protein A1G_03995 [Rickettsia rickettsii str. 'Sheila Smith'] Length = 111 Score = 43.9 bits (102), Expect = 0.069, Method: Compositional matrix adjust. Identities = 28/111 (25%), Positives = 52/111 (46%), Gaps = 9/111 (8%) Query: 175 VYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNK-- 232 +YR F+ + +KW D + K LA+N +E I+H V P+S + K K Sbjct: 1 MYRLFSMPIKAASAKWPD---FADFKERLAKNPDETVKILHIVSPQSENQRGKGGKGKGL 57 Query: 233 ----GFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEAL 279 + S+++ + E + + + FP+ V + ++YG +PA A+ Sbjct: 58 MTTLAYSSEYIYLSEQKIISQSGYSYFPFFVTLWIKGEGQVYGYAPAHHAI 108 >gi|165933293|ref|YP_001650082.1| hypothetical protein RrIowa_0838 [Rickettsia rickettsii str. Iowa] gi|165908380|gb|ABY72676.1| hypothetical protein RrIowa_0838 [Rickettsia rickettsii str. Iowa] Length = 111 Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust. Identities = 28/111 (25%), Positives = 51/111 (45%), Gaps = 9/111 (8%) Query: 175 VYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNK-- 232 +YR F+ + +KW D + K LA+N +E I+H V P+S + K K Sbjct: 1 MYRLFSMPIKAASAKWPD---FADFKERLAKNPDETVKILHIVSPQSENQRGKGGKGKGL 57 Query: 233 ----GFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEAL 279 + S+++ + E + + FP+ V + ++YG +PA A+ Sbjct: 58 MTTLAYSSEYIYLSEQKIISQSGYLYFPFFVTLWIKGEGQVYGYAPAHHAI 108 >gi|259419010|ref|ZP_05742927.1| hypothetical protein SCH4B_4395 [Silicibacter sp. TrichCH4B] gi|259345232|gb|EEW57086.1| hypothetical protein SCH4B_4395 [Silicibacter sp. TrichCH4B] Length = 506 Score = 41.2 bits (95), Expect = 0.42, Method: Compositional matrix adjust. Identities = 72/332 (21%), Positives = 129/332 (38%), Gaps = 47/332 (14%) Query: 143 VDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSA 202 VD L I + +VP+ +Y++ + D +R F + + D ++ Sbjct: 136 VDRPTLNGAINFEAVPIPQLYVTPGPLGIEDR-FRRQRFHYRNLKVLFPDAKFPRAIEDK 194 Query: 203 LARNENERFTIIHAVYPKSLTDKKKD--KGNKGFHSKFVSVDENRFFEEKQIATFPYIVG 260 + ++ N ++H + ++ D + + K + +D++ I +VG Sbjct: 195 IKKSSNALAVVVHGFW-RTFEDVENPVWRHEIRVDGKPIGLDKDV----GSIGAVNLVVG 249 Query: 261 RYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPG 320 R+ A +GR P + LP R+ +E V + +L PP + DL G Sbjct: 250 RFNPYAGSAWGRGPGRKLLPVFRQYDELVRMNMEGLDRTLDPPFTYPHDGM---LDLSQG 306 Query: 321 YMN-IGALSREG-RSLFQPVQFGNPLPY---HEELNRLKESIRSLFLLDLFQV------- 368 N +G + G + QPV FG L Y EE +L++ IR F + Q Sbjct: 307 LENGVGYPTMPGTKDALQPVLFGT-LDYGFFSEE--KLEQKIRDGFYREKEQAGKTPPSA 363 Query: 369 -----LDDKASRSAAESMEKT-REKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPE 422 ++K R A KT RE G + + L+ + G++ EL ++DS Sbjct: 364 SQYIGQENKQVRRMARPATKTWREFGVGLLSRVEWLERQPGGSLEGAELPLIDS------ 417 Query: 423 CEGADNPPVSLLKVEYTSPLFKYQQAESVASA 454 ++ SPL + Q + V +A Sbjct: 418 ---------GVVNARPISPLERAQAMQDVTTA 440 >gi|291294768|ref|YP_003506166.1| NAD-dependent epimerase/dehydratase [Meiothermus ruber DSM 1279] gi|290469727|gb|ADD27146.1| NAD-dependent epimerase/dehydratase [Meiothermus ruber DSM 1279] Length = 501 Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust. Identities = 27/86 (31%), Positives = 44/86 (51%), Gaps = 10/86 (11%) Query: 409 RELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKT 468 R LD+L G P +P + +++ ++ +Q + V A++GV+TVV LG Sbjct: 168 RLLDLL-LFGKEPIAHVLHHPNLEIIQADF-------RQVDKVVEAMRGVDTVVHLGGLV 219 Query: 469 GDPSCMDHMDTDRVSRFSLWATNTPA 494 GDP+C +D + +L AT T A Sbjct: 220 GDPACA--LDENLTIEINLVATRTIA 243 >gi|198418843|ref|XP_002122505.1| PREDICTED: similar to myosin VIIA [Ciona intestinalis] Length = 631 Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust. Identities = 31/97 (31%), Positives = 48/97 (49%), Gaps = 21/97 (21%) Query: 84 AFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQ--SFYTSVVEFGTGCFYMEA 141 A LY +D K ++WC V +TL E+SRSG CL+ V+ + T F A Sbjct: 458 ASLYGDD----KGKKWCQMVYNTLKALAEKSRSG--ACLEPIEIMQQVIRYATIAFV--A 509 Query: 142 DVDE-------KGLEEGIRYISVPLSNVYMSVNHQNV 171 + + K + EG R PL+N+ + +NH+N+ Sbjct: 510 NFTKSFRLSTFKSITEGGR----PLTNLTLQLNHENL 542 >gi|291334263|gb|ADD93926.1| hypothetical protein [uncultured marine bacterium MedDCM-OCT-S08-C235] Length = 130 Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust. Identities = 30/118 (25%), Positives = 56/118 (47%), Gaps = 13/118 (11%) Query: 371 DKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQG--NLPECEGADN 428 ++ SA E E+ + +G G LQ+E + ++ R + IL QG N+P G + Sbjct: 6 NRTPMSATEVAERMADLSRQIGSSFGRLQAEMVTPVLQRVIHILKKQGRINIPTVNGRE- 64 Query: 429 PPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVEL-GVKTGDPSCMDHMDTDRVSRF 485 +K++ TSPL + Q + + G N +EL G + G +D++ +++ Sbjct: 65 -----IKIQSTSPLAQAQANQDI----NGFNRFLELVGARFGPQLINLLVDSNEATKY 113 >gi|317402178|gb|EFV82769.1| ferrochelatase [Achromobacter xylosoxidans C54] Length = 363 Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust. Identities = 33/100 (33%), Positives = 49/100 (49%), Gaps = 17/100 (17%) Query: 440 SPLFKY--QQAESVASALQ--GVNTVVELGVKTGDPSCMDHMDTDR---------VSRFS 486 SPL Y +QAE V +AL GV VVELG++ G+PS D + R V + Sbjct: 104 SPLMVYSRRQAEGVQAALSAAGVEAVVELGMRYGNPSIPDAISRLRAQGCERILTVPLYP 163 Query: 487 LWATNTPAVLI----RDTAEVEDIRQQREVQRRVMEEQHL 522 +A +T A ++ R A + D + R ++R E +L Sbjct: 164 QYAASTTATVVDAVTRHAARLRDQPEMRFIKRFHQEPLYL 203 >gi|320033090|gb|EFW15039.1| fatty acid synthase beta subunit [Coccidioides posadasii str. Silveira] Length = 1334 Score = 37.4 bits (85), Expect = 6.1, Method: Compositional matrix adjust. Identities = 24/81 (29%), Positives = 41/81 (50%), Gaps = 3/81 (3%) Query: 268 EIYGRSPAMEALPTIRRLNETV-NELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMN--I 324 E+ G++ T+R N+TV + + G++ L PT + + ++ + N I Sbjct: 731 ELLGQTLTFRLQSTVRFKNKTVFHSVETMGQVLLELPTKEIIQVASVEYEAGTSHGNPVI 790 Query: 325 GALSREGRSLFQPVQFGNPLP 345 L R G+S+ QPV F NP+P Sbjct: 791 DYLQRHGQSIEQPVHFENPIP 811 >gi|293402283|ref|ZP_06646421.1| putative thioredoxin [Erysipelotrichaceae bacterium 5_2_54FAA] gi|291304390|gb|EFE45641.1| putative thioredoxin [Erysipelotrichaceae bacterium 5_2_54FAA] Length = 603 Score = 37.0 bits (84), Expect = 8.1, Method: Compositional matrix adjust. Identities = 29/95 (30%), Positives = 44/95 (46%), Gaps = 10/95 (10%) Query: 177 REFTFTVDQIV-------SKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDK 229 RE F VD + + W +KVL K L ER A++P L + +D+ Sbjct: 485 REKVFNVDTDLFEPVNSFADWNEKVLKVNDKPVLVLFGAERCVHCKALHP-VLEEALQDE 543 Query: 230 GNKGFHSKFVSVDENR-FFEEKQIATFPYIVGRYR 263 N FH ++V+VDEN+ + + P +V YR Sbjct: 544 FNSSFHIRYVNVDENKDIVDACHVQGIP-VVAIYR 577 Searching..................................................done Results from round 2 >gi|254781213|ref|YP_003065626.1| head-to-tail joining protein, putative [Candidatus Liberibacter asiaticus str. psy62] gi|254040890|gb|ACT57686.1| head-to-tail joining protein, putative [Candidatus Liberibacter asiaticus str. psy62] gi|317120678|gb|ADV02501.1| putative phage-related head-to-tail joining protein [Liberibacter phage SC1] gi|317120822|gb|ADV02643.1| putative phage-related head-to-tail joining protein [Candidatus Liberibacter asiaticus] Length = 556 Score = 690 bits (1780), Expect = 0.0, Method: Composition-based stats. Identities = 556/556 (100%), Positives = 556/556 (100%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSL 60 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSL Sbjct: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSL 60 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG Sbjct: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT Sbjct: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS Sbjct: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 Query: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL Sbjct: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 Query: 301 HPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSL 360 HPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSL Sbjct: 301 HPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSL 360 Query: 361 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL 420 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL Sbjct: 361 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL 420 Query: 421 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD 480 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD Sbjct: 421 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD 480 Query: 481 RVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 RVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR Sbjct: 481 RVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 Query: 541 AMEKKLTHDMMENSYG 556 AMEKKLTHDMMENSYG Sbjct: 541 AMEKKLTHDMMENSYG 556 >gi|315122900|ref|YP_004063389.1| head-to-tail joining protein, putative [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496302|gb|ADR52901.1| head-to-tail joining protein, putative [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 555 Score = 613 bits (1580), Expect = e-173, Method: Composition-based stats. Identities = 396/555 (71%), Positives = 458/555 (82%), Gaps = 1/555 (0%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSL 60 MN S K I+ F +LK+QR ELN MEELT LYPYK + RMWDTTGSEACIKLSSL Sbjct: 1 MNN-SIKKIKTCFEHLKSQREELNTRMEELTSLLYPYKQEPKSRMWDTTGSEACIKLSSL 59 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 LSSLITPPGQKWHGL+E F +QAFLY+EDA +KK+R WCDQVTD LFGFRERSRSGFV Sbjct: 60 LSSLITPPGQKWHGLSEPFFRHQAFLYEEDAGAKKIRGWCDQVTDVLFGFRERSRSGFVS 119 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 CLQSFYTS+VEFGTGCFY+EADVDE GLEEGIRYI+VPL++VY+SVNHQN VDS+YR F Sbjct: 120 CLQSFYTSIVEFGTGCFYIEADVDETGLEEGIRYIAVPLADVYLSVNHQNEVDSIYRTFE 179 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 FT +QI KWG KVLS KMKS+ + E ++F IIHAVYPKSL +KKKDKGNK FHSKFV Sbjct: 180 FTAEQIGGKWGYKVLSDKMKSSYEKKEPDKFKIIHAVYPKSLAEKKKDKGNKNFHSKFVC 239 Query: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 +DEN FFEEKQI T PYI+GRYRVRADEIYG+SPAMEALP IRRLNE NELAQ+ RLSL Sbjct: 240 IDENVFFEEKQITTLPYIIGRYRVRADEIYGKSPAMEALPAIRRLNEISNELAQYARLSL 299 Query: 301 HPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSL 360 HP +A +EAKQ F +K ++N GA+S++G++LFQP+Q GNPLP++EEL R++ SI SL Sbjct: 300 HPAYLAPTEAKQLEFKIKSRHINTGAMSKDGKALFQPLQVGNPLPFYEELKRIQGSIHSL 359 Query: 361 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL 420 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI RELDILD+Q NL Sbjct: 360 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIKRELDILDAQHNL 419 Query: 421 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD 480 PE D+ P LLKVEYTSPLFKYQQAESVAS LQG NTV+ELG KTG+P MDH+D D Sbjct: 420 PELTDYDHSPFHLLKVEYTSPLFKYQQAESVASVLQGTNTVLELGAKTGNPEPMDHIDID 479 Query: 481 RVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 +VSRF+LWA+ +PA LIRD EV+ R+ R+ Q M+ + QQ +Q + GAKA + Sbjct: 480 KVSRFALWASGSPAHLIRDVDEVKQRRKDRDDQMEAMQNRQDAQQQEQMGMEAGAKAVSK 539 Query: 541 AMEKKLTHDMMENSY 555 A+EKK+T+D+MENSY Sbjct: 540 AIEKKMTNDLMENSY 554 >gi|315121938|ref|YP_004062427.1| head-to-tail joining protein, putative [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495340|gb|ADR51939.1| head-to-tail joining protein, putative [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 555 Score = 612 bits (1579), Expect = e-173, Method: Composition-based stats. Identities = 399/555 (71%), Positives = 457/555 (82%), Gaps = 1/555 (0%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSL 60 MN S K I+ F +LK+QR ELN MEELT LYPYK + RMWDTTGSEACIKLSSL Sbjct: 1 MNN-SIKKIKTCFEHLKSQREELNTRMEELTSLLYPYKQEPKSRMWDTTGSEACIKLSSL 59 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 LSSLITPPGQKWHGL+E F +QAFLY+EDA +KK+R WCDQVTD LFGFRERSRSGFV Sbjct: 60 LSSLITPPGQKWHGLSEPFFRHQAFLYEEDAGAKKIRGWCDQVTDVLFGFRERSRSGFVS 119 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 CLQSFYTS+VEFGTGCFY+EADVDE GLEEGIRYI+VPL++VY+SVNHQN VDS+YR F Sbjct: 120 CLQSFYTSIVEFGTGCFYIEADVDETGLEEGIRYIAVPLADVYLSVNHQNEVDSIYRTFE 179 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 FT +QI KWG KVLS KMKS+ + E ++F IIHAVYPKSL +KKKDKGNK FHSKFV Sbjct: 180 FTAEQIGGKWGYKVLSDKMKSSYEKKEPDKFKIIHAVYPKSLAEKKKDKGNKNFHSKFVC 239 Query: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 +DEN FFEEKQI T PYI+GRYRVRADEIYG+SPAMEALP IRRLNE NELAQ+ RLSL Sbjct: 240 IDENVFFEEKQITTLPYIIGRYRVRADEIYGKSPAMEALPAIRRLNEISNELAQYARLSL 299 Query: 301 HPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSL 360 HP +A EAKQ F K YMNIGA+S++G++LFQP+Q GNPLP++EEL R++ SI SL Sbjct: 300 HPAYLAPPEAKQLEFKNKSRYMNIGAMSKDGKALFQPLQVGNPLPFYEELKRIQGSIHSL 359 Query: 361 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL 420 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI RELDILD+Q NL Sbjct: 360 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIKRELDILDAQHNL 419 Query: 421 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD 480 PE D+ P LLKVEYTSPLFKYQQAESVAS LQG NTV+ELG KTG+P MDH+D D Sbjct: 420 PELTDYDHSPFHLLKVEYTSPLFKYQQAESVASVLQGTNTVLELGAKTGNPEPMDHIDID 479 Query: 481 RVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 +VSRF+LWA+ +PA LIRD EV+ R+ R+ Q M+ + QQ +Q + GAKA + Sbjct: 480 KVSRFALWASGSPAHLIRDVDEVKQRRKDRDDQMEAMQNRQDAQQQEQMGMEAGAKAVSK 539 Query: 541 AMEKKLTHDMMENSY 555 A+EKK+T+D+MENSY Sbjct: 540 AIEKKMTNDLMENSY 554 >gi|327252184|gb|EGE63856.1| bbp21 [Escherichia coli STEC_7v] Length = 559 Score = 558 bits (1439), Expect = e-157, Method: Composition-based stats. Identities = 132/559 (23%), Positives = 240/559 (42%), Gaps = 40/559 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D D + IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDDD-----DIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL 344 L +Q + +PP IA + K + L PG + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMIAPTSLKNQRASLLPGDITY-IDQITGQDGFRPAYLVNPS 344 Query: 345 PYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQ 520 + +L P +D ++ D+ + +I +VE RQQR Q++ + Sbjct: 463 IGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMM 520 Query: 521 HLQQQLQQTSQDIGAKAAG 539 + Q ++ + Sbjct: 521 EMGMAAAQGAKTLSEAKTS 539 >gi|301019343|ref|ZP_07183529.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|299882260|gb|EFI90471.1| conserved hypothetical protein [Escherichia coli MS 196-1] Length = 559 Score = 557 bits (1436), Expect = e-156, Method: Composition-based stats. Identities = 131/559 (23%), Positives = 240/559 (42%), Gaps = 40/559 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEANRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL 344 L +Q + +PP +A + K + L PG + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITY-IDQITGQDGFRPAYLVNPS 344 Query: 345 PYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGIP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQ 520 + +L P +D ++ D+ + +I +VE RQQR Q++ + Sbjct: 463 IGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMM 520 Query: 521 HLQQQLQQTSQDIGAKAAG 539 + Q ++ + Sbjct: 521 AVGMAAAQGAKTLSEAKTS 539 >gi|218700990|ref|YP_002408619.1| putative head-to-tail-joining protein [Escherichia coli IAI39] gi|218370976|emb|CAR18803.1| putative head-to-tail-joining protein [Escherichia coli IAI39] Length = 559 Score = 557 bits (1435), Expect = e-156, Method: Composition-based stats. Identities = 131/559 (23%), Positives = 239/559 (42%), Gaps = 40/559 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 + S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NNSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL 344 L +Q + +PP +A + K + L PG + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITY-IDQITGQDGFRPAYLVNPS 344 Query: 345 PYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQ 520 + +L P +D ++ D+ + +I +VE RQQR Q++ + Sbjct: 463 IGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMM 520 Query: 521 HLQQQLQQTSQDIGAKAAG 539 + Q ++ + Sbjct: 521 AMGMVAAQGAKTLSEAKTS 539 >gi|117624712|ref|YP_853625.1| putative tail protein [Escherichia coli APEC O1] gi|115513836|gb|ABJ01911.1| putative tail protein [Escherichia coli APEC O1] Length = 559 Score = 556 bits (1433), Expect = e-156, Method: Composition-based stats. Identities = 125/524 (23%), Positives = 229/524 (43%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL 344 L +Q +PP +A + + ++ L PG + G+ +PV NP Sbjct: 286 LQLLQKRKSQIIDKVTNPPMVAPTTLRTQSVSLLPGGVTY-VDQLTGQEGLRPVYQVNPN 344 Query: 345 PYHE--ELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ +++I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLISDIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L G P +D ++ D+ + +I +VE Sbjct: 463 IGQLA--QGKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|294492610|gb|ADE91366.1| conserved hypothetical protein [Escherichia coli IHE3034] gi|323948685|gb|EGB44590.1| hypothetical protein ERKG_04908 [Escherichia coli H252] Length = 559 Score = 556 bits (1433), Expect = e-156, Method: Composition-based stats. Identities = 124/524 (23%), Positives = 226/524 (43%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL 344 L +Q + +PP +A + K + L PG + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITY-IDQITGQDGFRPAYLVNPS 344 Query: 345 PYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRSFSMMVRKNMLPPPPDVMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|331648176|ref|ZP_08349266.1| conserved hypothetical protein [Escherichia coli M605] gi|331043036|gb|EGI15176.1| conserved hypothetical protein [Escherichia coli M605] Length = 559 Score = 556 bits (1432), Expect = e-156, Method: Composition-based stats. Identities = 125/524 (23%), Positives = 227/524 (43%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL 344 L +Q + +PP +A + K + L PG + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITY-IDQITGQDGFRPAYLVNPS 344 Query: 345 PYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|323156133|gb|EFZ42292.1| bbp21 [Escherichia coli EPECa14] Length = 559 Score = 556 bits (1432), Expect = e-156, Method: Composition-based stats. Identities = 125/524 (23%), Positives = 227/524 (43%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIDVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL 344 L +Q + +PP +A + K + L PG + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITY-IDQITGQDGFRPAYLVNPS 344 Query: 345 PYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|300898427|ref|ZP_07116768.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357894|gb|EFJ73764.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 559 Score = 556 bits (1432), Expect = e-156, Method: Composition-based stats. Identities = 123/524 (23%), Positives = 225/524 (42%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEFGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL 344 L +Q + +PP +A + K + L PG + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITY-IDQITGQDGFRPAYLVNPS 344 Query: 345 PYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDVMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|324008560|gb|EGB77779.1| hypothetical protein HMPREF9532_01747 [Escherichia coli MS 57-2] Length = 559 Score = 556 bits (1432), Expect = e-156, Method: Composition-based stats. Identities = 125/524 (23%), Positives = 227/524 (43%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL 344 L +Q + +PP +A + K + L PG + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITY-IDQITGQDGFRPAYLVNPS 344 Query: 345 PYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRSFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|298381718|ref|ZP_06991317.1| hypothetical protein ECFG_01455 [Escherichia coli FVEC1302] gi|298279160|gb|EFI20674.1| hypothetical protein ECFG_01455 [Escherichia coli FVEC1302] Length = 559 Score = 556 bits (1432), Expect = e-156, Method: Composition-based stats. Identities = 125/524 (23%), Positives = 226/524 (43%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 + S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NNSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL 344 L +Q + +PP +A + K + L PG + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITY-IDQITGQDGFRPAYLVNPS 344 Query: 345 PYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|301046408|ref|ZP_07193568.1| conserved hypothetical protein [Escherichia coli MS 185-1] gi|300301634|gb|EFJ58019.1| conserved hypothetical protein [Escherichia coli MS 185-1] Length = 559 Score = 555 bits (1431), Expect = e-156, Method: Composition-based stats. Identities = 125/524 (23%), Positives = 227/524 (43%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEANRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL 344 L +Q + +PP +A + K + L PG + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITY-IDQITGQDGFRPAYLVNPS 344 Query: 345 PYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|89152428|ref|YP_512261.1| putative head-to-tail-joining protein [Escherichia phage phiV10] gi|74055451|gb|AAZ95900.1| putative head-to-tail-joining protein [Escherichia phage phiV10] Length = 559 Score = 554 bits (1428), Expect = e-155, Method: Composition-based stats. Identities = 124/524 (23%), Positives = 226/524 (43%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLDDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL 344 L +Q + +PP +A + K + L PG + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITY-IDQITGQDGFRPAYLVNPS 344 Query: 345 PYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRSFSMMVRKNMLPPPPDVMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLA--QVKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|320175046|gb|EFW50159.1| putative tail protein [Shigella dysenteriae CDC 74-1112] Length = 559 Score = 554 bits (1428), Expect = e-155, Method: Composition-based stats. Identities = 125/524 (23%), Positives = 226/524 (43%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 + S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 MF--NESNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL 344 L +Q + +PP +A + K + L PG + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITY-IDQITGQDGFRPAYLVNPS 344 Query: 345 PYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|332344354|gb|AEE57688.1| conserved hypothetical protein [Escherichia coli UMNK88] Length = 559 Score = 553 bits (1424), Expect = e-155, Method: Composition-based stats. Identities = 124/524 (23%), Positives = 225/524 (42%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDY--------GPVKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + + + Y++ + + Sbjct: 113 MF--NKSNLYQSLPQLYGSLGTYSTGAMAVLEDD-----EDIIRTMPFTIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E++ ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL 344 L +Q + +PP +A K + L PG + G+ F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPISLKNQRASLLPGDITY-IDQITGQDGFRPAYLVNPS 344 Query: 345 PYH--EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|215487822|ref|YP_002330253.1| predicted phage head-tail connector protein [Escherichia coli O127:H6 str. E2348/69] gi|215265894|emb|CAS10303.1| predicted phage head-tail connector protein [Escherichia coli O127:H6 str. E2348/69] Length = 556 Score = 549 bits (1414), Expect = e-154, Method: Composition-based stats. Identities = 128/559 (22%), Positives = 230/559 (41%), Gaps = 40/559 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + LKN+R +L+ F+ P + ++ D T Sbjct: 1 MAETEKERLLKQLAQLKNERTSFESHWRDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPT 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 GS A LSS + S IT P + W LA + V+ W + V + Sbjct: 61 GSMAQRILSSGMMSGITSPARPWFKLATPDPDMMDY--------GPVKIWLEVVQRRMNE 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ FGTG + D ++ IR + P+ + Y++ + + Sbjct: 113 VF--NKSNLYQSLPVMYASLGTFGTGAMAVLEDD-----QDVIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTD-KKK 227 VD+ R+F+ TV Q+V ++G +S+ +K E + H + P D K Sbjct: 166 GSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVKVNHCITPNVNRDSGKM 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK + S + D ++ E FP + R+ V +++Y S P M AL ++ Sbjct: 226 DSKNKPYRSVYFESGGDSDKLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L AQ + +PP +A + K + L PG + + G+ F+P NP Sbjct: 286 LQVEQKRKAQLIDKATNPPMVAPTSLKNQRVSLLPGDVTYLDV-LTGQDGFKPAYLVNPN 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ +++I S + +DLF +L +RS +E EK +GP++ L Sbjct: 345 TADLLADIQDTRQTINSAYFVDLFMMLQKINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R I+ + LPE L++EY S + + Q++ + S Q V Sbjct: 405 EALNPLIDRVFSIMARKNMLPEPPDVLQGMP--LRIEYISVMAQAQKSIGLTSLSQTVGF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQ 520 + +L P +D +D D+ + +I +V+ IR++R Q + + Sbjct: 463 IGQLA--QFKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQAQAAQAM 520 Query: 521 HLQQQLQQTSQDIGAKAAG 539 + Q Q ++ + Sbjct: 521 AMGQAAAQGAKTLSETQTS 539 >gi|242279813|ref|YP_002991942.1| hypothetical protein Desal_2347 [Desulfovibrio salexigens DSM 2638] gi|242122707|gb|ACS80403.1| conserved hypothetical protein [Desulfovibrio salexigens DSM 2638] Length = 555 Score = 548 bits (1413), Expect = e-154, Method: Composition-based stats. Identities = 142/529 (26%), Positives = 228/529 (43%), Gaps = 39/529 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK--------NNAQLR---MWDTT 49 M R L+ +R ++++ ++ P K N+ ++R + D+T Sbjct: 1 MRHIENNQYLRRLQGLRQERNSWESHWQDISDYILPRKGVYDGHRPNDGRVRSGKIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 + A L++ L +T P + W L S ++ AR K VREW +V +T++ Sbjct: 61 ATRALRILAAGLQGGLTSPARPWFRLGISD--------RDLARHKSVREWISKVENTMY- 111 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 R +RS F C+ S YT + FGTG Y E D E GIR+ ++ ++ + Q Sbjct: 112 -RALARSNFYSCIHSLYTELAGFGTGILYCEPDD-----ERGIRFRTLTAGEYCLATDAQ 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK-KD 228 VD+VYREF T Q+ ++G + L + + S+L N + F ++H V P+ D D Sbjct: 166 GRVDTVYREFKMTARQLEKRFGMQNLPATVHSSLNMNRDHWFDVLHVVQPRDEFDIALMD 225 Query: 229 KGNKGFHSKFVSVD-ENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNE 287 N F S F+ E PY+ R+ A ++YGRSPAM+ L ++ L E Sbjct: 226 TMNMPFESVFLLNGHGGHVLSESGFMENPYMAPRWDTSAMDVYGRSPAMDVLADVKMLME 285 Query: 288 TVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP--LP 345 Q L+L PP R +L PG N + + + P+ P Sbjct: 286 MSKSQIQAVHLTLRPPMKVP-SMYSRRLNLLPGGQNP--VEQNQQDSVSPLYQVRPDLAG 342 Query: 346 YHEELNRLKESIRSLFLLDLFQVLDDKASR--SAAESMEKTREKGAFVGPLIGGLQSEFI 403 ++ ++ +IR F D+F ++ R +AAE E+ EK +GP+I +E + Sbjct: 343 VSNKIQDVRTAIREGFYNDIFMMMAGTNRRTITAAEVAERHEEKLIQLGPVIERQHTELL 402 Query: 404 GAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVE 463 +I R IL G LPE +K++Y S L + Q+ S V Sbjct: 403 DPLIDRVFGILMRSGQLPEAPSVLEGAD--IKIDYISVLAQAQKMVGTQSIQSLAQFVGN 460 Query: 464 LGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREV 512 L +P +D +D DR P ++R EVE +R R+ Sbjct: 461 LA--KANPEVLDKVDMDRAVDDYAELIGVPNGIVRSGDEVEKLRNMRKD 507 >gi|30387383|ref|NP_848212.1| hypothetical protein epsilon15p04 [Enterobacteria phage epsilon15] gi|30266038|gb|AAO06067.1| 4 [Salmonella phage epsilon15] Length = 556 Score = 548 bits (1412), Expect = e-154, Method: Composition-based stats. Identities = 128/559 (22%), Positives = 231/559 (41%), Gaps = 40/559 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + LKN+R +L+ F+ P + ++ D T Sbjct: 1 MAETEKERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPT 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 GS A LSS + S IT P + W LA + V+ W + V + Sbjct: 61 GSMAQRILSSGMMSGITSPARPWFKLATPDPDMMDY--------GPVKIWLEVVQRRMNE 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ FGTG + D ++ IR + P+ + Y++ + + Sbjct: 113 VF--NKSNLYQSLPVMYASLGTFGTGAMAVMEDD-----QDVIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTD-KKK 227 VD+ R+F+ TV Q+V ++G +S+ +K E + H + P D K Sbjct: 166 GSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITPNVNRDSGKM 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK + S + D ++ E FP + R+ V +++Y S P M AL ++ Sbjct: 226 DSKNKPYRSVYFESGGDSDKLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L AQ + +PP +A + K + L PG + + G+ F+P NP Sbjct: 286 LQVEQKRKAQLIDKATNPPMVAPTSLKNQRVSLLPGDVTYLDV-ISGQDGFKPAYLVNPN 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ +++I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R I+ + LPE L++EY S + + Q++ + S Q V Sbjct: 405 EALNPLIDRVFSIMARKNMLPEPPDVLQGMP--LRIEYISVMAQAQKSIGLTSLSQTVGF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQ 520 + +L P +D +D D+ + +I +V+ IR++R Q + + Sbjct: 463 IGQLA--QFKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQAQAAQAM 520 Query: 521 HLQQQLQQTSQDIGAKAAG 539 + Q Q ++ + Sbjct: 521 AMGQAAAQGAKTLSETQTS 539 >gi|187476929|ref|YP_784953.1| phage head-tail connector protein [Bordetella avium 197N] gi|115421515|emb|CAJ48024.1| Putative phage head-tail connector protein [Bordetella avium 197N] Length = 555 Score = 547 bits (1409), Expect = e-153, Method: Composition-based stats. Identities = 132/557 (23%), Positives = 234/557 (42%), Gaps = 37/557 (6%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLR---MWDTTG 50 Q K + R+ LK +R +E++ +L P +N R + D TG Sbjct: 3 EQTERKLLLSRWGQLKAERESWISHWKEISDYLLPRSGRFFINDRNRGGKRHNNILDNTG 62 Query: 51 SEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 + A L++ + + +T P + W L S E S V+ W VT + Sbjct: 63 TRALRVLAAGMMAGMTSPARPWFRLTTSIP--------ELDESAAVKAWLANVTRLMLMV 114 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQN 170 ++S L S Y + FGT + D ++ IR+ ++ ++ ++Q Sbjct: 115 F--AKSNTYRALHSTYEELGLFGTASSIVLPDF-----KDVIRHHTLSAGEYAIAADNQG 167 Query: 171 VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNE-NERFTIIHAVYPKSLTDK-KKD 228 VD++YREF TV Q+V ++G S+ +++ R + T+IHA+ P++ D K+D Sbjct: 168 RVDTLYREFQITVAQMVREFGKDKCSTTVRNLFDRGALEQWVTVIHAIEPRADRDPNKRD 227 Query: 229 KGNKGFHSKFVS--VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLN 286 N + S +V DE R E +F + R+ + +IYG SPAMEAL +R+L Sbjct: 228 DRNMAWKSVYVELGADETRTLRESGYRSFRALCPRWALAGGDIYGNSPAMEALGDVRQLQ 287 Query: 287 ETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG-NPLP 345 AQ +PP AK ++ PG ++ ++ + + + Sbjct: 288 HEQLRKAQGIDYKSNPPLQLPVSAKNQDISTVPGGLSYVDVAAPNGGIRTAFEVNLDLSH 347 Query: 346 YHEELNRLKESIRSLFLLDLFQVLDDK--ASRSAAESMEKTREKGAFVGPLIGGLQSEFI 403 ++ ++E I++ F DLF +L + +A E E+ EK +GP++ + +E + Sbjct: 348 LLADIVDVRERIKASFYADLFLMLANGTNPKMTATEVAERHEEKLLMLGPVLERMHNEIL 407 Query: 404 GAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVE 463 +I + LP L VE+ S L + Q+A + S + V + Sbjct: 408 DPLIELTFQRMVEANILPPPPQEMQGVD--LNVEFVSMLAQAQRAIATNSVDRFVGNLG- 464 Query: 464 LGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQ 523 V P +D + DR + LI +V IR+QR Q++ ++ L Sbjct: 465 -VVAKIKPEVLDKFNADRWADTYADMLGIDPELIVPGNQVALIRKQRAEQQQAAQQAALL 523 Query: 524 QQLQQTSQDIGAKAAGR 540 Q T+ +G+ + Sbjct: 524 NQGADTAAKLGSVDTSK 540 >gi|309702812|emb|CBJ02143.1| putative phage protein [Escherichia coli ETEC H10407] Length = 559 Score = 541 bits (1393), Expect = e-151, Method: Composition-based stats. Identities = 122/524 (23%), Positives = 218/524 (41%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + LK++R +L+ F+ P + ++ D T Sbjct: 1 MAETEKERLLKQLAQLKSERTSFESHWRDLSDFINPRGSRFLTSDVNRDDRRNTKIIDPT 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 GS A LSS + S IT P + W LA + V+ W + V + Sbjct: 61 GSMAQRILSSGMMSGITSPARPWFKLATPDPDMMDY--------GPVKVWLEVVQRRMNE 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ FGT + D ++ IR + P+ Y++ + + Sbjct: 113 VF--NKSNLYQSLPVMYASLGTFGTAAMAVLEDD-----QDVIRTMPFPIGCYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTD-KKK 227 VD+ +R+F+ TV Q+V ++G +SS ++ E + + H + P D K Sbjct: 166 GSVDTSFRQFSMTVRQLVQEFGLDNVSSSVQGMWQNGTYETWIEVNHCITPNVNRDTGKM 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + D ++ E FP + R+ V +++Y S P M AL ++ Sbjct: 226 DSKNKPFRSVYFETGGDADKLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L AQ + +PP +A + K + L PG + + G+ F+P NP Sbjct: 286 LQVEQKRKAQLIDKATNPPMVAPTSLKTQRVSLLPGDVTYLDV-LSGQDGFKPAYLVNPN 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ +++I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLA--QAKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|41179382|ref|NP_958690.1| Bbp21 [Bordetella phage BPP-1] gi|45569514|ref|NP_996583.1| hypothetical protein BMP-1p20 [Bordetella phage BMP-1] gi|45580765|ref|NP_996631.1| hypothetical protein BIP-1p20 [Bordetella phage BIP-1] gi|40950121|gb|AAR97687.1| Bbp21 [Bordetella phage BPP-1] Length = 555 Score = 539 bits (1389), Expect = e-151, Method: Composition-based stats. Identities = 131/557 (23%), Positives = 231/557 (41%), Gaps = 37/557 (6%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLR---MWDTTG 50 Q K + R+ L+ +R +E++ +L P +N + R + D TG Sbjct: 3 EQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTG 62 Query: 51 SEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 + A L++ + + +T P + W L S E S V+ W VT + Sbjct: 63 TRALRVLAAGMMAGMTSPARPWFRLTTSIP--------ELDESAAVKAWLANVTRLMLMI 114 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQN 170 ++S L S Y + FGT + D D + + S+ ++ ++Q Sbjct: 115 F--AKSNTYRALHSMYEELGAFGTASSIVLPDFDA-----VVYHHSLTAGEYAIAADNQG 167 Query: 171 VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNE-NERFTIIHAVYPKSLTDK-KKD 228 V+++YREF TV Q+V ++G S+ ++S R + T+IHA+ P++ D K+D Sbjct: 168 RVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRD 227 Query: 229 KGNKGFHSKFVS--VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLN 286 N + S + DE R E +F + R+ + +IYG SPAMEAL +R+L Sbjct: 228 DRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQ 287 Query: 287 ETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG-NPLP 345 AQ +PP AK ++ PG ++ + + + + Sbjct: 288 HEQLRKAQAIDYKSNPPLQLPVSAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSH 347 Query: 346 YHEELNRLKESIRSLFLLDLFQVLDDK--ASRSAAESMEKTREKGAFVGPLIGGLQSEFI 403 ++ ++E I++ F DLF +L + +A E E+ EK +GP++ + +E + Sbjct: 348 LLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEIL 407 Query: 404 GAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVE 463 +I + LP L VE+ S L + Q+A + S + V + Sbjct: 408 DPLIELTFQRMVEANILPPPPQEMQGVD--LNVEFVSMLAQAQRAIATNSVDRFVGNLG- 464 Query: 464 LGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQ 523 V P +D D DR + LI +V IR+QR Q++ ++ L Sbjct: 465 -AVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALL 523 Query: 524 QQLQQTSQDIGAKAAGR 540 Q T+ +G+ + Sbjct: 524 NQGADTAAKLGSVDTSK 540 >gi|262043566|ref|ZP_06016679.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039100|gb|EEW40258.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 560 Score = 536 bits (1382), Expect = e-150, Method: Composition-based stats. Identities = 120/523 (22%), Positives = 214/523 (40%), Gaps = 40/523 (7%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTTG 50 + + +Q + L N R + EL+ F+ P + ++ D T Sbjct: 3 AETLKEQLQKQQAQLTNDRSSFDPHWRELSDFINPRGSRFLVTDVNRDDRRNTKIVDPTA 62 Query: 51 SEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 + A LSS + S IT P + W LA + V+ W + V + Sbjct: 63 TLAARTLSSGMMSGITSPARPWFKLATPDPDMMDY--------GPVKLWLEVVQRRMNEV 114 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQN 170 ++S L Y S+ + TG + D + IR + P+ + YM+ + + Sbjct: 115 F--NKSNIYQSLPLLYASLGNYSTGAMAVLEDDS-----DVIRTMMFPIGSYYMANSARG 167 Query: 171 VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF-TIIHAVYPKSLTDK-KKD 228 VD+ +R+F+ TV Q+V ++G +S +K E + +IHAVYP D K + Sbjct: 168 SVDTCFRKFSMTVRQLVMEFGLNNVSDSVKGMWDSGNYESWIEVIHAVYPNIDRDTAKLN 227 Query: 229 KGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRRL 285 NK S + V D ++ E FP + R+ V +++YG S P M AL ++ L Sbjct: 228 SKNKPVKSVYYEVGGDSDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMIALGQVKAL 287 Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP-- 343 +Q + +PP + S + + L PG + G+ F+P NP Sbjct: 288 QLEQKRKSQLIDKATNPPMVGPSSLRNQRVSLLPGDITY-IDQVTGQDGFKPAYLVNPNT 346 Query: 344 LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQSE 401 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L E Sbjct: 347 ADLLADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDE 406 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + +I R I+ + LP L++EY S + + Q++ ++S V + Sbjct: 407 CLNPLIDRTFSIMARKNLLPPPPDVLQGMP--LRIEYISVMAQAQKSIGLSSLSSTVGFI 464 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 +L P +D ++ D+ + +I +VE Sbjct: 465 GQLA--QAKPEALDKLNVDQAIDAFAEMSGVSPTVIVPQEQVE 505 >gi|304398403|ref|ZP_07380277.1| phage head-tail connector protein [Pantoea sp. aB] gi|304354269|gb|EFM18642.1| phage head-tail connector protein [Pantoea sp. aB] Length = 553 Score = 536 bits (1381), Expect = e-150, Method: Composition-based stats. Identities = 133/554 (24%), Positives = 233/554 (42%), Gaps = 39/554 (7%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTTG 50 + + + + LK++R + +L+ ++ P N + D T Sbjct: 3 EETLKQRLNKQLGLLKSERTTFDPHWRDLSDYISPRSSRFLVSDANRDNRRNTNIVDPTC 62 Query: 51 SEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 + A LSS + S IT P + W L+ S A + + V+ W + V + Sbjct: 63 TLAERTLSSGMMSGITSPARPWFTLSVSDPAMKDY--------GPVKVWLEDVQRRMNEV 114 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQN 170 ++S L Y + +GT + D E+ IR P+ + Y+S + + Sbjct: 115 F--NKSNLYQSLPIVYAQLGTYGTAAMAILEDD-----EDIIRTYPFPIGSYYVSNSARL 167 Query: 171 VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALAR-NENERFTIIHAVYPK-SLTDKKKD 228 VD+VYREF T Q+V ++G +S +K A N +IHAVYP S K D Sbjct: 168 SVDTVYREFRMTTRQLVEQFGLDNVSETVKGQWATQNTESWHDVIHAVYPNVSRQTGKMD 227 Query: 229 KGNKGFHSKFVS-VDENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRRLN 286 NK + S + +++ E FP + R+ V ++ YG + P M AL ++ L Sbjct: 228 AKNKRYKSVYFEKAGDDKVLRESGFDEFPILAPRWEVNGEDAYGSNCPGMTALGQVKALQ 287 Query: 287 ETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP--L 344 +Q + +PP + S K + PG + G+ +P+ NP Sbjct: 288 LEQKRKSQLIDKATNPPMVGPSSLKTQRVSQLPGAVTY-VDQLTGQDGLKPLYMVNPNTA 346 Query: 345 PYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQSEF 402 ++ ++ IRS + +DLF +L + +RS E EK +GP++ L EF Sbjct: 347 DLLNDIQDTRDIIRSAYFVDLFLMLQNINTRSMPVEAVNELREEKLLMLGPVLERLNDEF 406 Query: 403 IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 + +I R I+ +G LP + L++EY S + + Q++ V S + V V Sbjct: 407 LDPLIDRAFAIMQRKGMLPPAPEVL--QGTALRIEYISVMAQAQKSIGVNSMERFVGFVG 464 Query: 463 ELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHL 522 G+ P +D +D D++ + +I EV+ IRQQR Q + ++ + Sbjct: 465 --GMAQAKPEALDKLDIDKIIDSYGDSIGVSPSVIVPDEEVQKIRQQRAEQIQQQQQMQM 522 Query: 523 QQQLQQTSQDIGAK 536 Q +++D+ Sbjct: 523 AQAAVASAKDLSQA 536 >gi|226940462|ref|YP_002795536.1| Bbp21 [Laribacter hongkongensis HLHK9] gi|226715389|gb|ACO74527.1| Bbp21 [Laribacter hongkongensis HLHK9] Length = 555 Score = 530 bits (1366), Expect = e-148, Method: Composition-based stats. Identities = 128/559 (22%), Positives = 226/559 (40%), Gaps = 38/559 (6%) Query: 1 MNQRSA-KDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLR---MWDT 48 M+ S K + R+ LK +R E++ +L P +N R ++D Sbjct: 1 MDGPSIQKRVSARWEALKKERSSWMSHWSEISDYLLPRSGRFFVEDRNKGNKRHKNIYDN 60 Query: 49 TGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLF 108 TG+ A L++ + + +T P + W L S + S V+ W VT + Sbjct: 61 TGTRALRVLAAGMMAGMTSPARPWFRLTTSDP--------QLDESAAVKAWLADVTRIMQ 112 Query: 109 GFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNH 168 ++S L S Y + FGT + D + I + + ++ ++ Sbjct: 113 MVF--AKSNTYRALHSCYEELGAFGTAGTIVLPDFN-----GVIHHHVLTAGEFAIAADY 165 Query: 169 QNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALAR-NENERFTIIHAVYPKSLTDK-K 226 + V+++YREF TV Q+V ++G S+ ++ R +E T+IHA+ P++ K + Sbjct: 166 RGQVNTLYREFQMTVGQMVGEFGLSACSATVQRLHERWCLDEWITVIHAIEPRTDRHKGR 225 Query: 227 KDKGNKGFHSKFVSVD--ENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRR 284 +D N + S + E + E FP + R+ +IYG SPAME+L I++ Sbjct: 226 QDARNMAWRSVYFEPGNREGQVLRESGFREFPALCPRWSTSGGDIYGNSPAMESLGDIKQ 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG-NP 343 L Q PP S + R+ D PG ++ + + G + Sbjct: 286 LQHEQLRKGQVIDYKTKPPLQVPSSMRARDIDTLPGGVSFVDAGTPNGGIRSAFEVGLDL 345 Query: 344 LPYHEELNRLKESIRSLFLLDLFQVLDDK--ASRSAAESMEKTREKGAFVGPLIGGLQSE 401 ++ ++E I+ F DLF +L + +A E E+ EK +GP++ L +E Sbjct: 346 SHLLADIQDVRERIKGSFYADLFLMLANGSNPQMTATEVAERHEEKLLMLGPVLERLHNE 405 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + +I + G +P L VE+ S L + Q+A + S + V + Sbjct: 406 ILDPLIEMTFSRMVEAGIVPPPPEELQGVD--LNVEFVSMLAQAQRAIATNSVDRFVGNL 463 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQH 521 V P +D D DR + LI V IRQQR ++ ++ Sbjct: 464 G--AVAGIKPEVLDKFDADRWADAYADMLGIDPELIVPGDRVALIRQQRAQAQQAQQQAA 521 Query: 522 LQQQLQQTSQDIGAKAAGR 540 + Q +Q +G+ + Sbjct: 522 MLQMGADAAQKLGSVDTSQ 540 >gi|212710818|ref|ZP_03318946.1| hypothetical protein PROVALCAL_01886 [Providencia alcalifaciens DSM 30120] gi|212686515|gb|EEB46043.1| hypothetical protein PROVALCAL_01886 [Providencia alcalifaciens DSM 30120] Length = 550 Score = 516 bits (1330), Expect = e-144, Method: Composition-based stats. Identities = 132/551 (23%), Positives = 233/551 (42%), Gaps = 40/551 (7%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTTGSEAC 54 +D+ + + LKN+R +EL + P + ++ D +++ Sbjct: 4 KQDLLKQLSQLKNERQSFEPHWKELAEYTRPRSTRFSTSEVNRGDRRNTKIIDQEAAKSE 63 Query: 55 IKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERS 114 LSS + S IT P +KW LA + V+ W + V + + Sbjct: 64 RTLSSGMMSGITSPARKWFRLATPDPDMMNY--------SPVKMWLEVVEQRMNEVF--N 113 Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDS 174 RS L Y+ + F T + D E IR + P+ + Y++ VD+ Sbjct: 114 RSNIYQSLPQTYSDIGTFATSALAVLEDN-----ERVIRTVPFPIGSYYIANGPDLTVDT 168 Query: 175 VYREFTFTVDQIVSKWGDKVLSSKMKSALARNEN-ERFTIIHAVYPKSLT-DKKKDKGNK 232 +REF+ TV Q+V ++G +S ++KS + T+IH+VYP K D NK Sbjct: 169 CFREFSMTVRQLVMEFGLDNVSEQVKSMWDSGNYSQWITVIHSVYPNLNRISGKLDAKNK 228 Query: 233 GFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRRLNETV 289 F S + + D +R E FP + R+ V +++YG S P M AL +++ L Sbjct: 229 LFKSVYFEIGGDSDRVLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMIALGSVKALQLLQ 288 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQ--FGNPLPYH 347 AQ +PP A + K + L PG + ++ + + +P+ + Sbjct: 289 RRKAQQIDKVTNPPMQAPASIKNQRISLVPGGITYLPMAGADQ-MIKPIFQVQADINGLI 347 Query: 348 EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQSEFIGA 405 ++ + I+ + DLF +L + +RS +E EK +GP++ L SE + Sbjct: 348 ADIGDTRNQIKEAYFSDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLQRLDSELLDK 407 Query: 406 MISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELG 465 +I+R I+ + LP LKVEY S + + Q++ V S + V V G Sbjct: 408 LINRTFAIMARKNLLPVPPEEMQGMQ--LKVEYISVMAQAQKSVGVNSVERFVGFVG--G 463 Query: 466 VKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQ 525 + P +D ++TD + + ++ +V IRQQR Q++ M++ + Q+ Sbjct: 464 LAKLKPEALDKLNTDEIIDNYAESIGISPTIVSSNDQVAAIRQQRAEQQQQMQQMQMAQE 523 Query: 526 LQQTSQDIGAK 536 +Q +G Sbjct: 524 AVAGAQALGNT 534 >gi|268589375|ref|ZP_06123596.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] gi|291315402|gb|EFE55855.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] Length = 550 Score = 516 bits (1329), Expect = e-144, Method: Composition-based stats. Identities = 132/551 (23%), Positives = 233/551 (42%), Gaps = 40/551 (7%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTTGSEAC 54 +D+ + + LKN+R +EL + P + ++ D +++ Sbjct: 4 KQDLLKQLSQLKNERQSFEPHWKELAEYTRPRSTRFNTSEVNRGDRRNTKIIDQEAAKSE 63 Query: 55 IKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERS 114 LSS + S IT P +KW LA + V+ W + V + + Sbjct: 64 RTLSSGMMSGITSPARKWFRLATPDPDMMNY--------SPVKMWLEVVEQRMNEVF--N 113 Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDS 174 RS L Y+ + F T + D E IR + P+ + Y++ VD+ Sbjct: 114 RSNIYQSLPQTYSDIGTFATSALAVLEDN-----ERVIRTVPFPIGSYYIANGPDLTVDT 168 Query: 175 VYREFTFTVDQIVSKWGDKVLSSKMKSALARNEN-ERFTIIHAVYPKSLT-DKKKDKGNK 232 +REF+ TV Q+V ++G +S ++KS + T+IH+VYP K D NK Sbjct: 169 CFREFSMTVRQLVMEFGLDKVSEQVKSLWDSGNYSQWITVIHSVYPNLNRISGKLDAKNK 228 Query: 233 GFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRRLNETV 289 F S + + D R E FP + R+ V +++YG S P M AL +++ L Sbjct: 229 LFKSVYFEMGGDSERVLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMIALGSVKALQLLQ 288 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQ--FGNPLPYH 347 AQ +PP A + K + L PG + ++ + + +P+ + Sbjct: 289 RRKAQQIDKVTNPPMQAPASIKNQRISLVPGGITYLPMAGADQ-MIKPIFQVQADINGLI 347 Query: 348 EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQSEFIGA 405 ++ + I+ + DLF +L + +RS +E EK +GP++ L SE + Sbjct: 348 ADIGDTRNQIKEAYFSDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLQRLDSELLDK 407 Query: 406 MISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELG 465 +I+R I+ + LP LKVEY S + + Q++ V+S + V V G Sbjct: 408 LINRTFAIMARKNLLPVPPEEMQGMQ--LKVEYISVMAQAQKSVGVSSIERFVGFVG--G 463 Query: 466 VKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQ 525 + P +D ++TD + + ++ +V IRQQR Q++ M++ + Q+ Sbjct: 464 LAQMKPEALDKLNTDEMIDNYAESIGVSPTIVSSNDQVAAIRQQRAEQQQQMQQMQMAQE 523 Query: 526 LQQTSQDIGAK 536 +Q +G Sbjct: 524 AISGAQALGNT 534 >gi|85059667|ref|YP_455369.1| hypothetical protein SG1689 [Sodalis glossinidius str. 'morsitans'] gi|84780187|dbj|BAE74964.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 517 Score = 512 bits (1319), Expect = e-143, Method: Composition-based stats. Identities = 117/526 (22%), Positives = 204/526 (38%), Gaps = 45/526 (8%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNA--------------QLRMW 46 M++ + K I R + LK+ R E + YP + ++ Sbjct: 1 MDELAVKLI-TRADALKSHRQRHESVWSECYDYTYPLRGAGFSADVLDAQSAKSKVAKLL 59 Query: 47 DTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDT 106 D T +++ L+S L S +TP +W L ++ + + + W Sbjct: 60 DGTATDSARMLASALMSGMTPANAQWLNL------------DCESLADEDKAWLSTCATL 107 Query: 107 LFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS- 165 ++ + F VV G Y+ DE E G + PLS Y++ Sbjct: 108 VW--ENIHAANFDAEGYEENLDVVCAGWFVLYI----DENREEGGYTFQQWPLSQCYVAS 161 Query: 166 VNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK 225 +VD++YR + T +Q ++++G+ +S K++ A +++F +HA++P++ Sbjct: 162 TRKDGIVDTIYRCYQMTAEQAIAEFGEAGVSEKIRRAARDKPDDKFDFLHAIFPRTNYGV 221 Query: 226 KK-DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRR 284 + F S V R E FP V R+ YG P +ALP + Sbjct: 222 NACLAKHLRFASFHVERQGKRIVRESGYHEFPVCVPRWMKIPGGAYGIGPVYDALPDCKE 281 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREGRSLFQPVQFGN 342 LNET L++ I+ + + P + + + + L F Sbjct: 282 LNETKRMEKAAQDLAISGMWISEDDGVINPYSVKVGPRRIIVASSVNSMKPLLTGADFQV 341 Query: 343 PLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEF 402 +RL+ SIR + + D Q D A +A E + +GP+ G Q+E+ Sbjct: 342 AFTAE---DRLQASIRKIMMADQLQPQDGPA-MTATEVHVRVALIRQLLGPVYGRFQAEY 397 Query: 403 IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 + ++ R I G P D+ + V Y SPL + Q+ E V + + V Sbjct: 398 LQPLVERCFGIAFRAGVFPPPP--DSMQTAHFNVLYISPLARAQKLEDVTAVERLGANVA 455 Query: 463 ELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQ 508 +L P +D +DTD +R A PA +IR A+V +R Sbjct: 456 QL--SQVSPEVVDLVDTDEATRVVADALGVPAKVIRSAADVTSLRD 499 >gi|330007155|ref|ZP_08305897.1| hypothetical protein HMPREF9538_03586 [Klebsiella sp. MS 92-3] gi|328535502|gb|EGF61962.1| hypothetical protein HMPREF9538_03586 [Klebsiella sp. MS 92-3] Length = 559 Score = 509 bits (1310), Expect = e-142, Method: Composition-based stats. Identities = 134/570 (23%), Positives = 228/570 (40%), Gaps = 42/570 (7%) Query: 1 MNQRSAKD-IQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDT 48 M + S K LKN+R EL F+ P + R+ D Sbjct: 1 MAELSPKQHYLKHLGQLKNERTSFEEHWRELAEFIDPRSTRFLTTERNNGSKRNTRIVDP 60 Query: 49 TGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLF 108 T S+A L S + S IT P + W LA + V+ W D V + Sbjct: 61 TASKAARTLQSGMLSGITSPTRPWFKLATPDPEMMQY--------GPVKRWLDVVMTRMN 112 Query: 109 GFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNH 168 +RS L Y + FGT + D E+ IR +P+ + Y+S +H Sbjct: 113 DVM--NRSNVYQSLPIIYRHLGVFGTAAMAVLEDD-----EDVIRTHPLPIGSYYLSNSH 165 Query: 169 QNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLT-DKK 226 + VD+ YR F+ T QIV ++G +S+ ++ A E F ++H P + K Sbjct: 166 RLSVDTTYRVFSMTARQIVMQFGLDNVSNAVRGAWDNANYEAWFDVVHLTEPNIDRVNGK 225 Query: 227 KDKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIR 283 + NK F S + + D ++ E P + R+ + +++YG + P M AL T + Sbjct: 226 LNSRNKAFKSVYFELSGDGDKLLREAGFDEPPILSPRWEINGEDVYGSNCPGMMALGTGK 285 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP 343 L A ++PP +A + K + +L PG + + L +P +P Sbjct: 286 ALQLEQIRKANAIDKLVNPPMVAPTGLKNKLINLAPGGVTY-VDEVDATKLVRPAYAVSP 344 Query: 344 LPYHE--ELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQ 399 + ++ I + F DLF + +RS EK +GP++ L Sbjct: 345 QLNDMLGSIADDRQMIEACFFSDLFNLFSTINTRSMPVEAVAAMQDEKLLQLGPVLERLN 404 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVN 459 EF+ + R +I+ + PE LKVEY S L + Q++ ++S + V Sbjct: 405 DEFLDPFVDRTFNIMARRNLFPEPPEELQGTP--LKVEYVSILAQAQKSIGISSVERFVG 462 Query: 460 TVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEE 519 V L +P+ +D ++ D+ PA ++ EV+ R+QR + + Sbjct: 463 FVGNLA--KANPAALDKLNIDQTIDEYGNMLGVPATIVNSDDEVQATREQRAQMEQQQQM 520 Query: 520 QHLQQQLQQTSQDIG-AKAAGRAMEKKLTH 548 + QQ T++ + A ++ K L+ Sbjct: 521 MAMAQQAGATAKTLSDTNTADPSLLKTLSD 550 >gi|85059164|ref|YP_454866.1| hypothetical protein SG1186 [Sodalis glossinidius str. 'morsitans'] gi|84779684|dbj|BAE74461.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 541 Score = 508 bits (1307), Expect = e-141, Method: Composition-based stats. Identities = 118/526 (22%), Positives = 200/526 (38%), Gaps = 45/526 (8%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNA--------------QLRMW 46 M++ + K I R + LK+ R E + YP + ++ Sbjct: 1 MDELAVKLI-TRADTLKSHRQRHESVWRECYDYTYPLRGAGFSADVLDAQSAKSKVAKLL 59 Query: 47 DTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDT 106 D T +++ L+S L S +TP +W L + W Sbjct: 60 DGTATDSARMLASALMSGMTPANAQWLNLDSESLP------------DDAKAWLSGCATL 107 Query: 107 LFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS- 165 ++ + F VV G Y+ DE E G + PLS Y++ Sbjct: 108 VW--ENIHAANFDAEGYEANLDVVCAGWFVLYI----DENREEGGYMFQQWPLSQCYVAS 161 Query: 166 VNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD- 224 +VD++YR + T +Q ++++G+ +S K++ A +++F +HA++P+ Sbjct: 162 TRKDGIVDTIYRCYQMTAEQAIAEFGEAGVSEKIRRAAKDKPDDKFDFLHAIFPRKNYVV 221 Query: 225 KKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRR 284 + + F S V R E FP V R+ + YG P +ALP + Sbjct: 222 NARLAKHLRFASFHVERQGKRIVRESGYHEFPVCVPRWMKISGGAYGIGPVYDALPDCKE 281 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREGRSLFQPVQFGN 342 LNET L++ IA + + P + + + + L F Sbjct: 282 LNETKRMEKAAQDLAISGMWIAEDDGVINPYSVKVGPRRIIVASSVNSMKPLLTGADFHV 341 Query: 343 PLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEF 402 +RL+ SIR + + D Q D A +A E + +GP+ G Q+E+ Sbjct: 342 AFTAE---DRLQASIRKIMMADQLQPQDGPA-MTATEVHVRVALIRQLLGPVYGRFQAEY 397 Query: 403 IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 + ++ R I G P D+ + V Y SPL + Q+ E V + + V Sbjct: 398 LQPLVERCFGIAFRAGVFPAPP--DSMQTAHFNVRYISPLARAQKLEDVTAIERLGANVA 455 Query: 463 ELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQ 508 +L P +D +DTD R A PA +IR A+V +R Sbjct: 456 QL--SQVSPEVVDLVDTDEAMRVVADALGVPAKVIRSAADVTSLRD 499 >gi|332160969|ref|YP_004297546.1| hypothetical protein YE105_C1347 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665199|gb|ADZ41843.1| Hypothetical phage protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330862125|emb|CBX72289.1| hypothetical protein YEW_AK02260 [Yersinia enterocolitica W22703] Length = 534 Score = 507 bits (1305), Expect = e-141, Method: Composition-based stats. Identities = 119/526 (22%), Positives = 206/526 (39%), Gaps = 45/526 (8%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNA--------------QLRMW 46 M+ +A+ + R + LK R E + YP + + R+ Sbjct: 1 MDDTAARLV-KRVSSLKAARQLHESVWRECYDYTYPLRGSGFSTEVLDAQSAKSKVARLL 59 Query: 47 DTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDT 106 D T +++ L+S L S +TP +W L + S R W Sbjct: 60 DGTATDSARILASALMSGMTPANAQWLDLGS------------ENLSDDERSWLSTC--A 105 Query: 107 LFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSV 166 + + F VV G Y++ D + G + PL+ V+++ Sbjct: 106 TLTWENIHAANFDAEGYEANIDVVCAGWFALYVDED----TEQGGYTFNQWPLAQVFVAS 161 Query: 167 NHQ-NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKS-LTD 224 + + VV++VYR + T +Q V ++G +S K++ A + +++F IHA++P+ Sbjct: 162 SRRDGVVNTVYRCYQLTAEQAVKEFGRDNVSHKIQDAANKKPDDKFEFIHAIFPRDGYIG 221 Query: 225 KKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRR 284 + N F S V V E + E FP V R+ YG P +ALP + Sbjct: 222 NARLAKNLPFASFNVEVAEKKVVRESGYHEFPVCVPRWMKIPGTPYGVGPVYDALPDCKE 281 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEA--KQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 LNET L++ IA + R ++ P + + + L F Sbjct: 282 LNETKRMEKAAQDLAIAGMWIAEDDGVLNPRTVNVGPRKIIVANSVNSMKPLLTGADFNV 341 Query: 343 PLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEF 402 E RL+ IR + + D Q D A +A E + +GP+ G Q+E+ Sbjct: 342 AFTAEE---RLQAQIRKILMADQLQPQDGPA-MTATEVHVRVALIRQLLGPVYGRFQAEY 397 Query: 403 IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 + ++ R I G P+ + + + Y SPL + Q+ E V + + + Sbjct: 398 LQPLVERCFGIAFRAGVFPQMPESMA--QANFNIRYISPLARAQKLEDVTAIERLGANIA 455 Query: 463 ELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQ 508 +L +P +D+MD D +R A PA ++R A+V +R Sbjct: 456 QLAAI--NPEVIDNMDADAAARVVSDALGVPAKVLRSAADVTALRD 499 >gi|262043408|ref|ZP_06016533.1| hypothetical protein HMPREF0484_3551 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039234|gb|EEW40380.1| hypothetical protein HMPREF0484_3551 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 515 Score = 501 bits (1291), Expect = e-140, Method: Composition-based stats. Identities = 118/522 (22%), Positives = 192/522 (36%), Gaps = 45/522 (8%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQ--------------LRMW 46 M++ + K + R + LK R E + YP + R+ Sbjct: 1 MDELAVKLV-KRADTLKANRQVHESVWRECYDYTYPLRGAGLSDEVLDAQSAKSKVARLL 59 Query: 47 DTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDT 106 D T +++ L+S L S +TP +W L W Sbjct: 60 DGTATDSARMLASALMSGMTPANAQWLNLDSESLP------------DDAAAWLSTCATL 107 Query: 107 LFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYM-S 165 ++ + F VV G Y++ D E G + PL+ Y+ S Sbjct: 108 VW--ENIHAANFDAEGYEANLDVVCAGWFALYIDED----REEGGFSFQQWPLAQCYVTS 161 Query: 166 VNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD- 224 +VD++YR + T +Q + ++G +S K+ A A+ +++F +H ++P+ Sbjct: 162 TRRDGIVDTIYRRYQLTAEQAIKEFGADKVSKKISDAAAKKPDDKFEFLHCIFPRENYVV 221 Query: 225 KKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRR 284 + N F S V V E FP V R+ YG P +ALP + Sbjct: 222 NARLAKNLRFASYNVEVSGKLIVRESGYHEFPCCVPRWMKIPGTPYGIGPVYDALPDCKE 281 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEA--KQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 LNET L++ IA + R + P + + + L F Sbjct: 282 LNETKRMEKAAQDLAIAGMWIAEDDGVLNPRTVKVGPRRIIVANSVDSMKPLLTGADFNV 341 Query: 343 PLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEF 402 E RL+ SIR + + D Q D A +A E + +GP+ G Q+E+ Sbjct: 342 AFTAEE---RLQASIRKIMMADQLQPQDGPA-MTATEVHVRVALIRQLLGPVYGRFQAEY 397 Query: 403 IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 + ++ R + G P + + V Y SPL + QQ E+V + + V Sbjct: 398 LQPLVERCFGLAFRAGVFPPAPESL--QNANFNVRYISPLARAQQLENVTAIERLGANVA 455 Query: 463 ELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 L P D +DTD +R A PA +IR + VE Sbjct: 456 NLA--QVSPDVTDLVDTDEATRVIADALGVPAKVIRSSDAVE 495 >gi|218886173|ref|YP_002435494.1| hypothetical protein DvMF_1072 [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218757127|gb|ACL08026.1| conserved hypothetical protein [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 595 Score = 501 bits (1290), Expect = e-139, Method: Composition-based stats. Identities = 128/548 (23%), Positives = 220/548 (40%), Gaps = 59/548 (10%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-------------------- 40 M + +D ++ ++L+ QR ++ ++ P + Sbjct: 1 MTSQRLRDAREAVDFLERQRSPWEEAWRDIAAYVLPRRGRMHGRDPLGASAPGAVGGSSG 60 Query: 41 ----------AQLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKED 90 R+ D T + A L++ + +T P + W L + A D Sbjct: 61 VSGTHRSTDMRGGRVIDATATRAVRILAAGMQGGLTSPARPWFRLRLADGA--------D 112 Query: 91 ARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEE 150 A S R W D V L+ +RS F + YT + FG+ Y E D E Sbjct: 113 AESGPARRWLDAVEQRLYW--ALARSNFYQASHALYTELAAFGSADLYQEVDP-----ER 165 Query: 151 GIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENER 210 R+ ++ + + VD+V R T Q+ ++G+ LS+ + L + N Sbjct: 166 LTRFAALTCGEFSWACDAAGRVDTVARRMLMTARQLAERYGEAHLSTGTRRMLRKEPNRH 225 Query: 211 FTIIHAVYPKSLTDKKKDKG-NKGFHSKFVSVDE--NRFFEEKQIATFPYIVGRYRVRAD 267 ++H V P+++ G + F S D E FP++ R+ V Sbjct: 226 VEVVHLVRPRAVRTPGHGSGLHMPFESLVFEADGAAGDLLHEGGFEEFPHLAARWDVTGS 285 Query: 268 EIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGAL 327 ++YGRSP M+ LP ++ L E ++PP + KQR +L PG N A Sbjct: 286 DVYGRSPGMDVLPDVKMLQEMARSQLLAIHKVVNPPMRVPTGFKQR-LNLIPGAQNYVAP 344 Query: 328 SREGRSLFQPVQFGNP--LPYHEELNRLKESIRSLFLLDLFQVL--DDKASRSAAESMEK 383 + P+ NP +++ +++++R F DLF + D +++ +AAE E+ Sbjct: 345 GQ--PEAVAPLYQINPDIAAVTRKIDDVRKAVREGFFNDLFLMFTADGRSNVTAAEVAER 402 Query: 384 TREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLF 443 +EK +GP+I Q+E + +++R IL G LP ++VEY S L Sbjct: 403 GQEKLLMLGPVIERHQTELLDPLLTRTYGILRRAGALPPNPPELEGLE--MRVEYVSALA 460 Query: 444 KYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEV 503 + Q+ + S Q V L P +D +D D+ PA ++R AEV Sbjct: 461 QAQRLGAAQSIRQFAAEVTALSATA--PGVLDKIDFDQAVDELASIGGVPARVVRSDAEV 518 Query: 504 EDIRQQRE 511 +R +RE Sbjct: 519 LRLRAERE 526 >gi|212703348|ref|ZP_03311476.1| hypothetical protein DESPIG_01391 [Desulfovibrio piger ATCC 29098] gi|212673194|gb|EEB33677.1| hypothetical protein DESPIG_01391 [Desulfovibrio piger ATCC 29098] Length = 611 Score = 488 bits (1257), Expect = e-136, Method: Composition-based stats. Identities = 121/566 (21%), Positives = 226/566 (39%), Gaps = 37/566 (6%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNA----------QLRMWDTTGSEACI 55 + R+ L +R + E L P + + D TG A Sbjct: 38 VPALARRYRALLERRSPWDTAWESLAEHFLPTRFRTDDSLDDRPLLNRSLVDATGILAMR 97 Query: 56 KLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSR 115 L++ L +T P + W LA + +RS + + D+V + + R Sbjct: 98 TLAAGLQGGMTSPARPWFRLALDDP--------DLSRSHAGQRYLDEVEARMRVVLQ--R 147 Query: 116 SGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSV 175 F + + Y + FGT + AD L G R++ + + + VD+V Sbjct: 148 CNFYNAMHTIYAELGTFGTAFVFELAD-----LRHGFRFVPLCAGQYVLDTDAARRVDTV 202 Query: 176 YREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK-KDKGNKGF 234 + ++ Q+V +G + L ++ A R ++R +IHAV P++ + + + Sbjct: 203 FHRMHMSLRQMVQSFGPEALPENLRLAARRTPDQRHAVIHAVLPRTERRPRLAGPCHMPW 262 Query: 235 HSKFVSVDEN---RFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNE 291 S + +E FP R+ V A+++YGRSPAM+ALP R L + Sbjct: 263 ASVYWLEGREGQVVPLKESGFMGFPGFGPRWDVAANDVYGRSPAMDALPDCRMLQQMGIT 322 Query: 292 LAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMN-IGALSREGRSLFQPVQFGNPLPYHEE- 349 + ++ PP + + DL PG +N + +L + + + P+ P Sbjct: 323 TLKAIHKAVDPPMSVHAGLRSVGLDLTPGGINFVDSLPGQNQPVATPLLQVKPDLAQARS 382 Query: 350 -LNRLKESIRSLFLLDLF-QVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 + +++ IR+ DLF +L+ ++ +A+E + EK +GP++ L E + +I Sbjct: 383 AMEAVQQQIRAGLYNDLFRLILEGRSKVTASEIAAREEEKLLLIGPVLERLHDELLIPLI 442 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 R ++ + LP C + LKVE+ S L + Q+ +++ Q + + L Sbjct: 443 DRTFRLMLALDMLPPCPPELSG--RHLKVEFVSLLAQAQKLVGISATDQYL--ALTLKAA 498 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQ 527 + P +D +D D + + P L R E +R RE R+ ++ L Q+ Sbjct: 499 SAWPEALDSVDVDNLLDNYAESLGLPVNLTRPREERARLRAGREEARQTEQQLALLQKAA 558 Query: 528 QTSQDIGAKAAGRAMEKKLTHDMMEN 553 + EK ++ N Sbjct: 559 DLGHTLADSDLTVEGEKSSVLQVLAN 584 >gi|295096867|emb|CBK85957.1| Bacteriophage head to tail connecting protein [Enterobacter cloacae subsp. cloacae NCTC 9394] Length = 541 Score = 488 bits (1256), Expect = e-135, Method: Composition-based stats. Identities = 117/526 (22%), Positives = 198/526 (37%), Gaps = 45/526 (8%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNA--------------QLRMW 46 M++ + K I R + LK R + E + YP + ++ Sbjct: 1 MDELAVKLI-KRSDTLKANRQQHESVWRECYDYTYPLRGAGFSDEVLDAQSAKHKVAKLL 59 Query: 47 DTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDT 106 D T +++ L+S L S +TP +W L + W + Sbjct: 60 DGTATDSARMLASALMSGMTPANAQWLNLDSESLP------------DDAKAWLSECATL 107 Query: 107 LFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYM-S 165 ++ + F VV G Y++ D E G + PL+ Y+ S Sbjct: 108 VW--ENIHAANFDAEGYEANLDVVCAGWFVLYIDED----REEGGYTFQQWPLAQCYVTS 161 Query: 166 VNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD- 224 +VD++YR + T +Q + ++G +S K++ A + +++F +H ++P+ Sbjct: 162 TRKDGIVDTIYRRYQLTAEQAIKEFGADKVSEKIRDAAKKKADDKFDFLHCIFPRETYMV 221 Query: 225 KKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRR 284 + N F S V V + E FP V R+ YG P +ALP + Sbjct: 222 DARLAKNMRFASYNVDVSNKQIVRESGYHEFPCCVPRWMKIPGGSYGIGPVYDALPDCKE 281 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEA--KQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 LNET L++ IA + R + P + + + L F Sbjct: 282 LNETKRMEKAAQDLAISGMWIAEDDGVLNPRTVKVGPRRIIVANSVDSMKPLLTGSDFSV 341 Query: 343 PLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEF 402 E RL+ SIR + + D Q D A +A E + +GP+ G Q+E+ Sbjct: 342 AFTAEE---RLQASIRKIMMADQLQPQDGPA-MTATEVHVRVALIRQLLGPVYGRFQAEY 397 Query: 403 IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 + ++ R I G + + V Y SPL + Q+ E V + + V Sbjct: 398 LQLLVVRCFGIAFRAGIFSPPPESL--QNANFNVRYISPLARAQKLEDVTAIERLGANVA 455 Query: 463 ELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQ 508 L + D +D +DTD +R A PA +IR + V D+R Sbjct: 456 NLAGISQD--VVDLIDTDEATRVVADALGVPAKVIRSSDAVADLRD 499 >gi|303257564|ref|ZP_07343576.1| conserved hypothetical protein [Burkholderiales bacterium 1_1_47] gi|302859534|gb|EFL82613.1| conserved hypothetical protein [Burkholderiales bacterium 1_1_47] Length = 548 Score = 486 bits (1252), Expect = e-135, Method: Composition-based stats. Identities = 114/560 (20%), Positives = 222/560 (39%), Gaps = 35/560 (6%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYP-----------YKNNAQLRMWDTTGSEAC 54 K I RF LK +R ++ + P + ++ D + Sbjct: 5 IKLINQRFESLKQERSSWEDLWRDIRDYCLPDLGCFPGEDATQGSKRYRKILDAEAIDCA 64 Query: 55 IKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERS 114 L++ L ++ P + W L + + ++ V+EW +V D L S Sbjct: 65 DVLAAGLLGGVSSPSRPWLRLTT--------MDPDLDKNPAVKEWMTKVQDLL--LLYFS 114 Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDS 174 ++ L Y + FGT C ++ E+ I ++ + +++ + VD+ Sbjct: 115 KAECYNALHQSYLELPVFGTACTIVKPHP-----EQLISLQNLTIGEYWLAEDDYGKVDT 169 Query: 175 VYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK-KKDKGNKG 233 +YR + T Q+V +WG + +++ ++ A ++ RF +IHA+ P+ + K+D N Sbjct: 170 MYRRLSLTAKQMVQQWGFEAVNNDVRQAFEKDPFTRFNVIHAIEPRIERNPDKRDNKNMP 229 Query: 234 FHSKFVSVD-ENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNEL 292 + S + +++ E FP + R+ +YGR P +AL + L L Sbjct: 230 WQSVYFQEGVQDKVLSESGFRNFPALCPRWMTSGGSVYGRGPGAKALSAQKSLQRLHLRL 289 Query: 293 AQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNR 352 A+ PP + S K + KPG +P + Sbjct: 290 AELVDYGTRPPILYPSTLKDQLSQFKPGGRVAVNPQEAPIIRSMWEVRTDPQAMLALIQS 349 Query: 353 LKESIRSLFLLDLFQVLDDKAS---RSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISR 409 ++ I+ +F +++FQ++ A+ R+A E +EK +GP++ L +E + +++ Sbjct: 350 TRQDIQRIFFVNVFQMIAATANQTDRTATEVQALEQEKVMMLGPVLERLHTELLDPLVTN 409 Query: 410 ELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTG 469 + LPE L +EY S L + Q+ S ++ + L Sbjct: 410 AFGFMVEYNMLPEVPEELYGRE--LSIEYVSVLAEAQKNASANGIVRTAQQIGLLA--QI 465 Query: 470 DPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQT 529 +P +D +D D P LI +V IRQQR Q++ + QQ + Sbjct: 466 NPQAVDKLDVDATIDQLADMNGVPPSLIVTGQKVALIRQQRAEQQQAQMQAAQLQQAMTS 525 Query: 530 SQDIGAKAAGRAMEKKLTHD 549 +D+G A + +++ + + Sbjct: 526 LKDLGQAADSQGLQEAFSEE 545 >gi|298485985|ref|ZP_07004059.1| hypothetical protein PSA3335_1414 [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159462|gb|EFI00509.1| hypothetical protein PSA3335_1414 [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 533 Score = 486 bits (1251), Expect = e-135, Method: Composition-based stats. Identities = 136/523 (26%), Positives = 212/523 (40%), Gaps = 42/523 (8%) Query: 5 SAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNA--------------QLRMWDTTG 50 +A I + LK+ R + YP + + + RM D T Sbjct: 3 TAAQICKTLSTLKSLRSPHESVWRDCFDHSYPIRGSGFCIEQITAMEAQMRKARMIDGTT 62 Query: 51 SEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 ++A LSS + S +TP W G+ S + R W D D L+ Sbjct: 63 TDAARILSSGIMSGLTPANSLWFGM------------DVGQESDEERRWLDGSADILW-- 108 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVN-HQ 169 + S F T VV G Y++ D+ + G + P+++VY S + Sbjct: 109 QNIHASNFDAAAFEGLTDVVCAGWFALYIDQDM----EKGGFTFDLWPIASVYCSASKAG 164 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD-KKKD 228 +D+VYR + T +Q V+++G+ LS + E IHA+YP++ + Sbjct: 165 GKIDTVYRTYKLTAEQAVNEFGEDNLSETTRKLAKEKPQELVEFIHAIYPRTTHMVGARL 224 Query: 229 KGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 N S V V E P +V R+ + D +Y P +ALP R LNE Sbjct: 225 AKNMPVASCKVEVAAKTLVSESGYHEMPVVVPRWMMIPDSVYAVGPVFDALPDSRTLNEL 284 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE 348 G L++ IA + +K G I + +P+Q G+ Y E Sbjct: 285 CRMDLAAGDLAIAGMWIAEDDGVLNPRTVKVGPRKI--IVANSVDSMKPLQSGSNFQYAE 342 Query: 349 -ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 ++ RL+ SIR + + D Q D A +A E + +GP+ G LQ+E++ MI Sbjct: 343 TKIARLQGSIRKILMADQLQAQDGPA-MTATEVHVRVNLIRQLLGPVYGRLQTEYLQPMI 401 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 R I G L + + V Y SPL + Q+ E V++ Q V L V Sbjct: 402 ERCFGIAYRAGVLGQAPESLAGRD--FTVRYLSPLARSQKLEEVSAIDQFVQ--GALIVA 457 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQR 510 DPS MD++D D RF A P+ +IR A+ + +R+ R Sbjct: 458 QADPSVMDNIDMDEAQRFKGEALGVPSSVIRSKADRDKLREDR 500 >gi|227355860|ref|ZP_03840253.1| tail protein [Proteus mirabilis ATCC 29906] gi|227164179|gb|EEI49076.1| tail protein [Proteus mirabilis ATCC 29906] Length = 554 Score = 484 bits (1247), Expect = e-134, Method: Composition-based stats. Identities = 128/539 (23%), Positives = 221/539 (41%), Gaps = 41/539 (7%) Query: 17 KNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTTGSEACIKLSSLLSSLI 65 + +R EL+ F P + ++ D T S A LSS + S I Sbjct: 17 ETERSSFEPHWRELSDFTRPRSTRFTASDVNRGDRRNSKIIDPTASLASSVLSSGMMSGI 76 Query: 66 TPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSF 125 T P + W LA + V+ W + + +RS L Sbjct: 77 TSPARPWFRLATPDPDLMDY--------GPVKLWLETTEQRMNEVF--NRSNLYQSLPLM 126 Query: 126 YTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQ 185 Y + FGT + D + IR + PL + Y++ + VD YR+FT TV Q Sbjct: 127 YGDLGTFGTAAMAVVEDS-----QRIIRTVHFPLGSYYIANSPSLSVDVCYRKFTMTVRQ 181 Query: 186 IVSKWGDKVLSSKMKSALARNEN-ERFTIIHAVYPKSLTDK-KKDKGNKGFHSKFVSVDE 243 +V ++G +S +KS ++ + ++HAVYP K + +K F S ++ V Sbjct: 182 LVMEFGVDSVSDTVKSMWNSSQYSQWIEVVHAVYPNLERQTGKLEAKHKPFKSVYLEVAG 241 Query: 244 NR--FFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRRLNETVNELAQFGRLSL 300 + E FP + R+ V +++YG S P M AL + L AQ Sbjct: 242 DHEKVLRESGYDEFPIMAPRWEVNGEDVYGSSCPGMLALGGTKALQLMQKRKAQMIDKLT 301 Query: 301 HPPTIAVSEAKQRNFDLKPGYMNI---GALSREGRSLFQPVQFGNPLPYHEELNRLKESI 357 +PP + K + + PG +N + + +++F VQ E++ ++ I Sbjct: 302 NPPLQVPASLKNQRVNTIPGGINYLDEANPTNKIQTIF-DVQPVALKALLEDVQDTRQLI 360 Query: 358 RSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILD 415 + + +DLF+++ +RS +E EK +GP++ L SE + +I+R IL Sbjct: 361 DTAYFVDLFRMMQMVNTRSMPIEAVVEMREEKLLQLGPVLQRLDSELLDKLINRTFSILV 420 Query: 416 SQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMD 475 ++ LP D LKVEY S + + Q++ V S + V L P +D Sbjct: 421 NKNLLPVAP--DEMQGMDLKVEYISVMAQAQKSIGVGSIERFAGFVGNLAKV--KPEALD 476 Query: 476 HMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIG 534 ++ D A ++ +V+ IRQQR+ Q++ M + + Q ++ + Sbjct: 477 KLNADDAIDNYASAIGVSPTIVATNEQVQAIRQQRQAQQQQMAQMQMAQSAIDGAKTLS 535 >gi|291336934|gb|ADD96462.1| hypothetical protein ALOHA_HF400048F7ctg1g11 [uncultured organism MedDCM-OCT-S09-C787] Length = 450 Score = 479 bits (1232), Expect = e-133, Method: Composition-based stats. Identities = 106/462 (22%), Positives = 208/462 (45%), Gaps = 28/462 (6%) Query: 38 KNNAQLR---MWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSK 94 ++ R ++D + ++ L++ L ++T P W L F + Sbjct: 12 RSKGDKRTELIFDGSPLQSVELLAASLHGMLTNPSTPWFSLR--------FKQNDMENED 63 Query: 95 KVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRY 154 + +EW + T+ ++ ++S F + Y ++ FGT ++E D E+ +++ Sbjct: 64 EAKEWLEDATEVMYSAF--NKSNFQQEIFELYHDLITFGTAAMFIEEDD-----EDILKF 116 Query: 155 ISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTII 214 + ++ ++++ N + +D+V+R+F+ + ++ K+GD +S + + ++ E I+ Sbjct: 117 STRHINEIFIAENDKGRIDTVFRKFSLSARAVMQKFGD--VSINIATKAKKDPYEEVEIM 174 Query: 215 HAVYPKSLTDKKK-DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRS 273 HAVYP+S D +K DK N F S ++ + FP++V RY + EIYGRS Sbjct: 175 HAVYPRSDFDPRKQDKENMPFESVYLDAESGDELSVSGFREFPFVVPRYLKASHEIYGRS 234 Query: 274 PAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRS 333 PAM ALP ++ LNE + + + PP + + PG +N R Sbjct: 235 PAMTALPDVKMLNEMSKTTIKSAQKQVDPPLLVPDDGFMLPVRTIPGGLNFYRAGTRDRI 294 Query: 334 LFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGP 393 + PL + E R + SIR+ F ++ + +A E +++ EK +GP Sbjct: 295 ETLNIGANTPLGLNMEEQR-RNSIRNAFYVNQLMMQSG-PQMTATEVIQRNEEKMRLLGP 352 Query: 394 LIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVAS 453 ++G LQSE + +I R ++ + +++EY SPL K Q++ ++S Sbjct: 353 VLGRLQSELLKPLIDRTFALILRKNLFRPAPEFLAGQD--IEIEYVSPLAKAQKSTELSS 410 Query: 454 ALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAV 495 ++ + + L DH++ D++ R P Sbjct: 411 IMRAIEILGSLSNVA---PVFDHINMDKLVRHLADIVGVPQK 449 >gi|220903991|ref|YP_002479303.1| hypothetical protein Ddes_0717 [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] gi|219868290|gb|ACL48625.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] Length = 597 Score = 478 bits (1231), Expect = e-133, Method: Composition-based stats. Identities = 122/551 (22%), Positives = 213/551 (38%), Gaps = 40/551 (7%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLR-------------MWDTTGSE 52 + R+ L +R + + L P + + + + D TG Sbjct: 5 IPVLARRYQALLRRRMPWDTAWQSLADHFLPTRCRLRPQGGGAEEGPMLNSGLVDATGIL 64 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 A L++ L +T P + W L + ARS+ + W D+V + Sbjct: 65 AMRTLAAGLQGGLTSPARPWFRLG--------LDDADLARSRPGQAWLDEVAARMRSVF- 115 Query: 113 RSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV 172 R F + + Y + FGT + AD +G R++ + + + V Sbjct: 116 -HRCNFYNAMHTLYAELATFGTAFVFELADP-----RDGFRFMPLCAGEYVLDCDAGRRV 169 Query: 173 DSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK-KDKGN 231 D+V+R + ++ QIV +G L ++ A+ RN +ER +I AVYP+ + Sbjct: 170 DTVFRRSSMSLRQIVQTFGPAALPESLREAVRRNADERRNVIQAVYPRDDRIHGILTASH 229 Query: 232 KGFHSKFVSVDEN---RFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 S + + E FP R+ V +++YGRSPAM+ALP R L + Sbjct: 230 MPVASVYWLEGRDGGEHALRESGFRHFPGFGPRWDVAGNDVYGRSPAMDALPDCRMLQQM 289 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNI-GALSREGRSLFQPVQFGNPLPYH 347 + ++ PP + + DL PG +N + + P+ NP Sbjct: 290 GITTLKAIHKAVDPPMSVSAGLRSVGLDLTPGGINYVDSAPGQSPQAATPLLQVNPDLST 349 Query: 348 EE--LNRLKESIRSLFLLDLF-QVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIG 404 + ++ IRS DLF +L+ ++ +A+E + EK +GP++ L E Sbjct: 350 ARRAMESVQNQIRSGLYNDLFKLILEGRSGVTASEIAAREEEKLVLIGPVLERLHDELFI 409 Query: 405 AMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVEL 464 ++ R + + LP C + LKVE+ S L + Q+ V++A Q + + L Sbjct: 410 PLMDRTFECMRELDMLPPCPPELSG--RRLKVEFVSLLAQAQKLVGVSAADQYL--ALTL 465 Query: 465 GVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQ 524 T P +D ++ D + + P L R E E +R R R + Sbjct: 466 RASTAWPEALDTLNVDHLLDNYADSLGLPISLTRPPEEREQMRAARAEAARGAALADSLK 525 Query: 525 QLQQTSQDIGA 535 Q Q + Sbjct: 526 QGVDLVQQLAK 536 >gi|317152045|ref|YP_004120093.1| Bacteriophage head-to-tail connecting protein [Desulfovibrio aespoeensis Aspo-2] gi|316942296|gb|ADU61347.1| Bacteriophage head-to-tail connecting protein [Desulfovibrio aespoeensis Aspo-2] Length = 603 Score = 476 bits (1225), Expect = e-132, Method: Composition-based stats. Identities = 130/526 (24%), Positives = 210/526 (39%), Gaps = 33/526 (6%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-------------AQLRMWDT 48 + A+ +Q RF L+ R EL+ ++ P KN+ R++D+ Sbjct: 3 AKELARSLQTRFKGLEEARQPWLAAWRELSDYMLPRKNSFTGIDPGSTRGRSGDERIFDS 62 Query: 49 TGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLF 108 T S A L+S L L+T P W + + VR + Q + + Sbjct: 63 TPSHALELLASSLGGLLTNPAMPWFDIRARDP--------DQGDGAGVRTFLQQARERMI 114 Query: 109 GFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNH 168 +GF + Y V GT Y+EAD D +R+ + PL VY + + Sbjct: 115 ALFNTEDTGFQTNVHELYLDVALLGTAVMYVEADPD-----TVVRFCTRPLGEVYAAESA 169 Query: 169 QNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK- 227 + VDSVYR +T + Q +WG S + + ++ I+HAV+P++ D Sbjct: 170 RGAVDSVYRRYTLSARQTAREWG-AACSGETRRKAEERPDDTVEILHAVFPRTDRDPYGV 228 Query: 228 DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNE 287 + F S +V EE PY+V R+ A E YGR P AL R LN Sbjct: 229 GAAHFPFASVYVETGAEHVLEESGYLEMPYLVPRWAKAAGETYGRGPGQTALSDTRVLNA 288 Query: 288 TVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYH 347 PP + + PG ++ R PV + Sbjct: 289 MARTALMAAEKMSDPPLMVPDDGFLGPVHSGPGGLSYYRAGSPDRIEPLPVN-VDLAATE 347 Query: 348 EELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 + + +ESIR +FL D + + +A E++ + EK +GP++G LQ+EF+ +I Sbjct: 348 TMMQQRRESIRRIFLGDQLTP--EGPAVTATEALIRQSEKMRVLGPVLGRLQAEFLSPLI 405 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 R I+ G LP P ++V YTSP+ + Q+ + + + L Sbjct: 406 RRVFRIMLRAGALPPFPQGFGPDD--IEVRYTSPVARAQKEFEARGLSRTMEYLAPLVGA 463 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQ 513 + MD+ DTDR +R TP+ +R +V + R + Sbjct: 464 SDPFGIMDNFDTDRAARHVAELFGTPSDYLRPEKDVAETRAAKGRA 509 >gi|167032756|ref|YP_001667987.1| putative tail protein [Pseudomonas putida GB-1] gi|166859244|gb|ABY97651.1| putative tail protein [Pseudomonas putida GB-1] Length = 564 Score = 475 bits (1222), Expect = e-131, Method: Composition-based stats. Identities = 109/535 (20%), Positives = 203/535 (37%), Gaps = 51/535 (9%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTTGSEAC 54 K + R + LK +R + +E++ F+ P + + ++ + + A Sbjct: 7 RKLAEKRLSALKTERSSWDTNAKEISDFILPMRSRVMCDDTNRGDRRNNKIINNRATMAS 66 Query: 55 IKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERS 114 +S + S IT P + W LA A F V+ W + T + Sbjct: 67 RTTASGMMSGITSPARPWFNLAPVARAIMEF--------GPVKSWFYECTQRMRDVFL-- 116 Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDS 174 RS L + Y + FGTGC +++ D IR + Y+S + Sbjct: 117 RSNLYQVLPTCYQEMATFGTGCIWVDEHPD-----TVIRCEAFTWGEYYISNGADGRAAA 171 Query: 175 VYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTII-HAVYPKSLTDK-KKDKGNK 232 +YREF +TV+Q+V ++G + LS K+ N ++F V + + N Sbjct: 172 IYREFKWTVNQLVQEFGVEALSPSSKALYENNNGDQFISCAQRVELNMNANPDRAGSRNL 231 Query: 233 GFHSKFVSVD--ENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVN 290 F + + E++ FP + R+ + YG P L ++ L Sbjct: 232 PFSALTWEAGAPGDMVLEDRGYHEFPAMAVRWESMPGDAYGTGPGRICLGDVKALQLYER 291 Query: 291 ELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGAL---SREGRSLFQPVQFGNPLPYH 347 + A+ +PP A E K + PG + + + ++QP P Sbjct: 292 QAARMTETGANPPLQAPVELKGQPSSTIPGGVTYVPMVGGQNQMAPIYQP-NAAWLSPIQ 350 Query: 348 EELNRLKESIRSLFLLDLFQVLDD-KASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 ++ + I F +DLF ++ R+A E + EK +GP++ + E + + Sbjct: 351 AKIQEHEGRINEAFFVDLFLMVSQLDTVRTATEIAARKEEKMLMLGPVLERINDELLDPL 410 Query: 407 ISRELDILDSQGNLPECEGADNPPV-------------SLLKVEYTSPLFKYQQAESVAS 453 I R +I+ Q ++P G + S ++ EY S L + Q++++V Sbjct: 411 IDRTFNIMLRQ-SIPIWAGIIDGDPLLPPPPEELINANSEIQAEYVSILAQAQKSQNVLG 469 Query: 454 ALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQ 508 + L P +D +++D++ A ++R EV IR+ Sbjct: 470 LERFATLAGNLSGAF--PEVLDKVNSDQLIEEYADAIGVIPTVVRGADEVAAIRE 522 >gi|78357592|ref|YP_389041.1| hypothetical protein Dde_2550 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219997|gb|ABB39346.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 549 Score = 472 bits (1215), Expect = e-131, Method: Composition-based stats. Identities = 125/547 (22%), Positives = 229/547 (41%), Gaps = 44/547 (8%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLY---------------PYKNNAQLRM 45 M+ + ++ + Y+++QRGE + E+ ++ P Q R+ Sbjct: 1 MSISTLEEARGAAAYIESQRGEWDSRWREVADYVTGAGYGGGSWQEGTARPEGRRGQ-RI 59 Query: 46 WDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTD 105 D T + A L++ L +TPP + W L + S +VR W D V Sbjct: 60 IDATATRALRVLAAGLQGGLTPPARPWFRLR--------LADRGLMESAEVRRWLDDVEA 111 Query: 106 TLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS 165 L+ + S F + +T++ +G+ YMEAD + +R+ VP + + Sbjct: 112 ALYA--ALAGSNFYQNSHALFTALAAYGSADMYMEADP-----QRVMRFCVVPHGDFAWA 164 Query: 166 VNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK 225 + VD+V R F+ T Q K+G LS ++ A ++ V P++ D Sbjct: 165 CDAAGRVDTVVRRFSMTAAQAAQKYGSDRLSRTVRRLAAVQPYAPVALVQLVRPRARRDP 224 Query: 226 K-KDKGNKGFHS-KFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 + +D NK + S + + + R A FP++ R+ V ++YG SP M+ LP ++ Sbjct: 225 RRQDSLNKPYESLTWEAQEPRRLLHVSGYAEFPHLCARWEVNGGQLYGHSPVMDVLPDVK 284 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP 343 L E ++PP + KQR +L PG N ++ P+ P Sbjct: 285 MLQEMARSQLLAVHKVVNPPMRVPTGFKQR-LNLIPGAQNY--VNPAQPDALSPLYQIRP 341 Query: 344 --LPYHEELNRLKESIRSLFLLDLFQVLDD--KASRSAAESMEKTREKGAFVGPLIGGLQ 399 ++ ++ SIR ++F + +++ +AAE ME+++EK +GP++ Q Sbjct: 342 DIQAVTYKIEDVRRSIREGLFTEMFLLFAGESRSNVTAAEIMERSQEKLLLLGPVVERHQ 401 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVN 459 ++ + +I R +L G LP LKVEY S L + Q+ + Q Sbjct: 402 TDILDPLIGRAFGLLARAGRLPPAPDVLAGRD--LKVEYVSALAQAQRLSAAQGVRQLAG 459 Query: 460 TVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEE 519 V P +D +D D+ PA ++R +V+ +R++R +++ Sbjct: 460 DVSRFAAMA--PEVLDKIDFDQAVDELASIAGAPAGIVRSDEDVQLLRRERALKQAEQAG 517 Query: 520 QHLQQQL 526 + L + Sbjct: 518 RALLESA 524 >gi|221213955|ref|ZP_03586928.1| conserved hypothetical protein [Burkholderia multivorans CGD1] gi|221166132|gb|EED98605.1| conserved hypothetical protein [Burkholderia multivorans CGD1] Length = 549 Score = 470 bits (1209), Expect = e-130, Method: Composition-based stats. Identities = 138/521 (26%), Positives = 229/521 (43%), Gaps = 34/521 (6%) Query: 1 MNQRSAKDIQDR---FNYLKNQRGELNYWMEELTGFLYPYKNN--------------AQL 43 M AK ++ +K +R ++ F+ P + Sbjct: 1 MTNDDAKLLEALNADHGRMKEKRQSYEAVWNDVIDFMMPRLDKFGQMPRPDSEKGRERSQ 60 Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV 103 RM+D+T A + + S+ITP Q WH L S A V+ + V Sbjct: 61 RMFDSTAPLALRNFVAAMDSMITPATQVWHRLKTSNDA--------LNEVPSVKAYLQAV 112 Query: 104 TDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVY 163 LF R R + GF + + Y S+ FG G +E DV GI Y +VP+ ++ Sbjct: 113 VRALFAVRYRWQGGFTTQMGATYQSIGLFGPGALMIEHDVG-----HGIVYRNVPMQRLW 167 Query: 164 MSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + N+ ++D + + T+ Q ++G + LS M++AL R+ + T H V P++ Sbjct: 168 FAENNAGLIDKTHVLWRLTLRQAAQRFGRENLSPSMQTALERDPEKTHTFYHVVEPRADR 227 Query: 224 DKKK-DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 D +K D N F S ++ +R + TFP+ +GR+ V D++YG SPA +A+P I Sbjct: 228 DPRKLDGRNMRFGSYWLDEGRDRIIQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDI 287 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 R N+ + + + PP +A + FDL+ G +N G L G + +P+ G Sbjct: 288 RMANDMAKTNIRGAQKMVDPPLLASEDGVLEGFDLRSGSLNWGGLDERGNEMVKPLLTGK 347 Query: 343 PLPYHEEL-NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 E +++I F + LFQ+L D +A E +++ +EKG + P +G Q+E Sbjct: 348 QAQIGIEFSQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQAE 407 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 +G +I RE+DIL G P + + VEY SPL K +A A+ LQ + + Sbjct: 408 LLGPLIQREVDILAEAGQFPPMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQL 467 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAE 502 V DP+ ++ R+ + P + E Sbjct: 468 G--VVAQFDPNAAKLVNGHRIGKLLADFGGVPVEALNTDEE 506 >gi|323699782|ref|ZP_08111694.1| phage head-tail connector protein [Desulfovibrio sp. ND132] gi|323459714|gb|EGB15579.1| phage head-tail connector protein [Desulfovibrio desulfuricans ND132] Length = 579 Score = 468 bits (1205), Expect = e-129, Method: Composition-based stats. Identities = 135/564 (23%), Positives = 228/564 (40%), Gaps = 48/564 (8%) Query: 1 MNQRS-AKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN--------------AQLRM 45 M++ A+ + RF+ L+ R +ELT ++ P KN+ R+ Sbjct: 1 MDRTELARSLLKRFSGLEEARRPWVSSWQELTEYMLPRKNSFAGPGGHTLGRGRAGDERI 60 Query: 46 WDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTD 105 +D+T A L+S L L+T P W ++ A + +VR + + + Sbjct: 61 FDSTPLHALELLASSLGGLLTNPSLPWFDISVKDRAKGD--------ADEVRAFMQEARE 112 Query: 106 TLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS 165 + +GF + Y V GT Y+EAD +R+ + PL V+++ Sbjct: 113 RMVAVFNSEDTGFQAHVHELYLDVALLGTAVMYVEADPT-----SVVRFSARPLGEVFVA 167 Query: 166 VNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK 225 + + VD+VYR + T Q + +WG S + + E ++HAV+P+ D Sbjct: 168 ESARGQVDTVYRRYEVTARQAIQEWG-AACSDETRRKGEDRPEEPVEVLHAVFPRMDRDP 226 Query: 226 KK-DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRR 284 + F S ++ V + EE PY+V R+ A E YGR P AL +R Sbjct: 227 AGFGSAHFPFASVYMEVKNSHVLEESGYLEMPYMVPRWAKAAGETYGRGPGQTALSDVRV 286 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL 344 LN PP + + PG ++ R PV + Sbjct: 287 LNAMARTALMAAEKMSDPPLMVPDDGFLGPVRSGPGGLSYYRAGSTDRIEALPVN-VDLR 345 Query: 345 PYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIG 404 E +N +ESI +FL D + + +A E++ + EK +GP++G LQ+EF+ Sbjct: 346 AAEEMMNGRRESIGRIFLSDQLAP--EGPAVTATEAVIRQAEKMRVLGPVLGRLQTEFLS 403 Query: 405 AMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVEL 464 +I R ++ G LP +P L+V YTS + + Q+ Q + + L Sbjct: 404 PLIRRVFRVMLRGGALPPFPEGLSPDD--LEVRYTSSVTRAQKQYEAQGLAQVMEYLSPL 461 Query: 465 GVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQ 524 MD+ DTDRV+R N P+ ++ RV+E + +Q Sbjct: 462 VGGRDAFGIMDNFDTDRVARHVAELFNIPSDYLKSED-------------RVVEGRTQKQ 508 Query: 525 QLQQTSQDIGAKAAGRAMEKKLTH 548 ++ + Q A A+ K L+ Sbjct: 509 RVASSQQTASTVANAAAIAKTLSE 532 >gi|317120721|gb|ADV02543.1| putative phage-related head-to-tail joining protein [Liberibacter phage SC2] gi|317120782|gb|ADV02603.1| putative phage-related head-to-tail joining protein [Candidatus Liberibacter asiaticus] Length = 539 Score = 466 bits (1200), Expect = e-129, Method: Composition-based stats. Identities = 209/543 (38%), Positives = 301/543 (55%), Gaps = 24/543 (4%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNA--QLRMWDTTGSEACIKLSS 59 N+ K + RF LK QR E+ +E+ + PY+ A ++WDTT + A KL+S Sbjct: 14 NKEFIKKLIARFESLKAQRSEIEPIRQEIIDLVCPYRGKASEDKKIWDTTATSASDKLAS 73 Query: 60 LLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFV 119 LL +LITP G +WHGL +F + +K +RE CD LF RE SGF Sbjct: 74 LLHNLITPFGSRWHGLVAPDPQSGSFFASQ--ENKLIREQCDHFVMELFAQRELPASGFN 131 Query: 120 GCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREF 179 CL+ FYT VV FG GCFY+ G+RYISVP+S++ S NH+NVVD+V+ EF Sbjct: 132 LCLKDFYTEVVLFGMGCFYVSE-----REGGGLRYISVPVSSIVCSANHENVVDTVFEEF 186 Query: 180 TFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFV 239 + T + + KWG LS KMK L R++ +++ AV+P D +G+ V Sbjct: 187 SLTPENVAKKWGYDALSDKMKEDLDRSDPQKYEFFQAVFPDKEDD------YEGYKKVIV 240 Query: 240 SVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLS 299 S+DENR EE PYIVGRY +G SP +ALP+IRRLN ++ + + Sbjct: 241 SIDENRIIEEGYHRVMPYIVGRYEASPSNPFGYSPTHKALPSIRRLNALSASVSLYSEKA 300 Query: 300 LHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG-NPLPYHEELNRLKESIR 358 L+P + + + + F KP +N G + R+GR P G + P HEE+ RL+ IR Sbjct: 301 LNPAVLTSEDTRGKTFSTKPKTVNHGWMDRQGRPRAVPFFTGSDARPSHEEMQRLQMQIR 360 Query: 359 SLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQG 418 L+LLDLFQVL D+ASRSA ESMEKT EKG F+ ++GGLQ+EF+G+M+ RE+DIL Sbjct: 361 ELYLLDLFQVLADRASRSATESMEKTLEKGIFISAIVGGLQAEFVGSMVKREIDILY--- 417 Query: 419 NLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMD 478 + +G LKV YTSPL+KYQ+AE + +QG+ E+ TGDP+ + + Sbjct: 418 ---QDQGDIRGLGKDLKVSYTSPLYKYQKAEELNGIVQGIRVNAEIASMTGDPTPLMMFN 474 Query: 479 TDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAA 538 +++ + P VL+ + + + E Q++ Q Q ++++ + GA A Sbjct: 475 PYLCGKYAADGSGVPEVLVLSEEDTKQ--KLIEKQKQAEASQMKQLTMEESIKTGGAIAQ 532 Query: 539 GRA 541 RA Sbjct: 533 DRA 535 >gi|46581008|ref|YP_011816.1| hypothetical protein DVU2604 [Desulfovibrio vulgaris str. Hildenborough] gi|46450429|gb|AAS97076.1| conserved hypothetical protein [Desulfovibrio vulgaris str. Hildenborough] gi|311234693|gb|ADP87547.1| hypothetical protein Deval_2404 [Desulfovibrio vulgaris RCH1] Length = 569 Score = 466 bits (1198), Expect = e-129, Method: Composition-based stats. Identities = 115/531 (21%), Positives = 210/531 (39%), Gaps = 49/531 (9%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPY-------------KNNAQLRMWDTTGSEA 53 ++ +D + ++ +R E+ F+ P R+ D T + A Sbjct: 6 REARDAASCVERERRVWEPLWREVEDFVLPRCIDSPRRADEAGDTARRGPRIIDGTATRA 65 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 L++ + +T P + W L ++ + R W D V L+ Sbjct: 66 VRILAAGMQGGLTSPARPWFRLR--------LADEDMEEAGPERRWLDVVERRLYA--AL 115 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 +RS F + YT + FG+ Y EAD + +R+ + + + + VD Sbjct: 116 ARSNFYAAVHGLYTELAAFGSADMYHEADP-----QRVMRFSCLACGDFAWACDAAGRVD 170 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKG--- 230 +V R + Q+ ++G+ LS +++ L R+ ++H V P+ + + Sbjct: 171 TVVRRLRMSARQMAQRYGEARLSRRVRRMLRRDPERSVPLVHMVRPRVRRNAGEAGKTAS 230 Query: 231 ------NKGFHS-KFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 N + S + + E FP++ R+ V +IYGRSP M+ LP ++ Sbjct: 231 GGLGGVNMPWQSLTWETEGAEGLLHEGGFEEFPHLAARWDVAGGDIYGRSPGMDVLPDVK 290 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP 343 L E ++PP S KQR +L PG N + P+ NP Sbjct: 291 MLQEMARSQLLAIHKVVNPPMRVPSGFKQR-LNLIPGGQNYVTPGQG--ESVGPLYQINP 347 Query: 344 --LPYHEELNRLKESIRSLFLLDLFQVL--DDKASRSAAESMEKTREKGAFVGPLIGGLQ 399 ++ ++ ++R F DLF + + +++ +AAE +E+ EK +GP+I Q Sbjct: 348 DIGAVTHKMEDVRRAVREGFFNDLFLMFTAEGRSNITAAEVLERGEEKLLMLGPVIERHQ 407 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVN 459 SE + ++ R IL G ++VEY S L + Q+ + + + + Sbjct: 408 SELLDPLLERTYGILRRGGL--LPPPPPELAGRSMRVEYVSALAQAQRVVTAQAIRRFAS 465 Query: 460 TVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQR 510 V L P +D +D ++ PA ++R AEV +R R Sbjct: 466 DVSALAGVA--PQVLDKVDFEQAVDELAAIAGVPARVVRSDAEVATLRAAR 514 >gi|120601696|ref|YP_966096.1| hypothetical protein Dvul_0646 [Desulfovibrio vulgaris DP4] gi|120561925|gb|ABM27669.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4] Length = 569 Score = 465 bits (1197), Expect = e-129, Method: Composition-based stats. Identities = 115/531 (21%), Positives = 210/531 (39%), Gaps = 49/531 (9%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPY-------------KNNAQLRMWDTTGSEA 53 ++ +D + ++ +R E+ F+ P R+ D T + A Sbjct: 6 REARDAASCVERERRVWEPLWREVEDFVLPRCIDSPRRADEAGDTARRGPRIIDGTATRA 65 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 L++ + +T P + W L ++ + R W D V L+ Sbjct: 66 VRILAAGMQGGLTSPARPWFRLR--------LADEDMEEAGPERRWLDVVERRLYA--AL 115 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 +RS F + YT + FG+ Y EAD + +R+ + + + + VD Sbjct: 116 ARSNFYAAVHGLYTELAAFGSADMYHEADP-----QRVMRFSCLACGDFAWACDAAGRVD 170 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKG--- 230 +V R + Q+ ++G+ LS +++ L R+ ++H V P+ + + Sbjct: 171 TVVRRLRMSARQMAQRYGEARLSRRVRRMLRRDPERSVPLVHMVRPRVRRNAGEAGKTAS 230 Query: 231 ------NKGFHS-KFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 N + S + + E FP++ R+ V +IYGRSP M+ LP ++ Sbjct: 231 GGLGGVNMPWQSLTWETEGAEGLLHEGGFEEFPHLAARWDVAGGDIYGRSPGMDVLPDVK 290 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP 343 L E ++PP S KQR +L PG N + P+ NP Sbjct: 291 MLQEMARSQLLAIHKVVNPPMRVPSGFKQR-LNLIPGGQNYVTPGQG--ESVGPLYQINP 347 Query: 344 --LPYHEELNRLKESIRSLFLLDLFQVL--DDKASRSAAESMEKTREKGAFVGPLIGGLQ 399 ++ ++ ++R F DLF + + +++ +AAE +E+ EK +GP+I Q Sbjct: 348 DIGAVTHKMEDVRRAVREGFFNDLFLMFTAEGRSNITAAEVLERGEEKLLMLGPVIERHQ 407 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVN 459 SE + ++ R IL G ++VEY S L + Q+ + + + + Sbjct: 408 SELLDPLLERTYGILRRGGL--LPPPPPELAGRSMRVEYVSALAQAQRVVTAQAIRRFAS 465 Query: 460 TVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQR 510 V L P +D +D ++ PA ++R AEV +R R Sbjct: 466 DVSALAGVA--PQVLDKVDFEQAVDELAAIAGVPARVVRSDAEVATLRAAR 514 >gi|48697195|ref|YP_024925.1| hypothetical protein BcepC6B_gp05 [Burkholderia phage BcepC6B] gi|47779001|gb|AAT38364.1| gp05 [Burkholderia phage BcepC6B] Length = 549 Score = 460 bits (1183), Expect = e-127, Method: Composition-based stats. Identities = 140/520 (26%), Positives = 235/520 (45%), Gaps = 34/520 (6%) Query: 1 MNQRSAKDIQDR---FNYLKNQRGELNYWMEELTGFLYPYKNN--------------AQL 43 M AK +Q +K +R ++ +L P + Sbjct: 1 MTNDDAKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQ 60 Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV 103 +M+D+T A + + S+ITP Q WH L A V+ + V Sbjct: 61 KMFDSTAPLALRNFVAAMDSMITPATQLWHRLKTGNDA--------LNEIASVKAYLQGV 112 Query: 104 TDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVY 163 TLF R R + GFV + + Y S+ FG G +E DV + GI Y +VP+ ++ Sbjct: 113 VRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVGK-----GIVYRNVPMQRLW 167 Query: 164 MSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + N+ ++D + ++ T+ Q ++G + LS M+S L ++ + HAV P++ Sbjct: 168 FAENNSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADR 227 Query: 224 DKKK-DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 D +K D N F S ++ +R + TFP+ +GR+ V D++YG SPA +A+P + Sbjct: 228 DPRKLDGRNMQFASYWLDEGRDRIVQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDV 287 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 R N+ + + + PP +A + FDL+ G +N G L+ +G + +P+ G Sbjct: 288 RMANDMAKTNIRGAQKLVDPPLLANEDGVLDGFDLRSGALNWGGLNDKGEEMVKPLLTGK 347 Query: 343 PLPYHEEL-NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 E +++I F + LFQ+L D +A E +++ +EKG + P +G QSE Sbjct: 348 QAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQSE 407 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 +G MI+RE+DIL G LP+ + + VEY SPL K +A A+ LQ + + Sbjct: 408 LLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQL 467 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTA 501 V DP+ + R++R P + Sbjct: 468 G--IVSQFDPAAAKVPNGARIARLLADYGGVPVEAMSTDE 505 >gi|254251745|ref|ZP_04945063.1| hypothetical protein BDAG_00942 [Burkholderia dolosa AUO158] gi|124894354|gb|EAY68234.1| hypothetical protein BDAG_00942 [Burkholderia dolosa AUO158] Length = 539 Score = 459 bits (1180), Expect = e-127, Method: Composition-based stats. Identities = 110/563 (19%), Positives = 213/563 (37%), Gaps = 46/563 (8%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLR--------------MW 46 M + + R +K++R E P + + ++ Sbjct: 1 MIDSLGETLAKRLETMKSKRQVHELVWRECFMLTDPVRASGLDGPQMDANQIAQAVALIF 60 Query: 47 DTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDT 106 D+T ++A L + + S +TP W + + + W D ++ Sbjct: 61 DSTATDAKRTLEASIMSGMTPANSLWFTMT------------VNGADDEGERWLDSASEV 108 Query: 107 LFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSV 166 L+ + + F G Y+ DE G+ + P++ VY + Sbjct: 109 LW--QNIHSANFDSEAADAVAD-GMAGWFALYI----DENRDAGGLYFEHWPMAGVYCAS 161 Query: 167 N-HQNVVDSVYREFTFTVDQIVSKWGD--KVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + VD V+R + T +Q V ++ L ++ E + A+YP+ + Sbjct: 162 SKPGGTVDIVFRCYQLTAEQCVREFNRRGDSLPQEIVDKAKNKPEELVDLCQAIYPRDVH 221 Query: 224 DKKKDK-GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 + N S + ++ + E P +V R++ + +YG P ++ALP I Sbjct: 222 MVGALRAKNMPIASVTFACNQKQVIRESGYHEMPVVVARWKKIPNSVYGVGPLLDALPDI 281 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 R LN+ V L++ IA + +K G + + +P+Q + Sbjct: 282 RTLNDIVKLEYANLDLAVSGMWIAEDDGVLNPRTVKVGPRKV--IVANSVDSMKPLQPAS 339 Query: 343 PLPYHEE-LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 E + +L+ IR + D Q D A +A E + +GP+ G LQ+E Sbjct: 340 NFQLAETRIEKLQGQIRKTLMADQLQPQDGPA-MTATEVHVRVDLIRQLLGPIYGRLQAE 398 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 ++ +I+R + G P + V+Y SPL + Q+ E V++ + + V Sbjct: 399 YLQPLIARCFGLAYRAGVFPPPPDSLGG--RNFSVQYQSPLARAQKLEEVSAIERLMGDV 456 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQH 521 + P +D++D D R + P ++R + +V RQQ++ ++Q Sbjct: 457 TVIA--QVKPEALDNIDGDEAVRLTAKNLGVPDSIVRTSDQVTQYRQQKQAAAAQQQQQQ 514 Query: 522 LQQQLQ-QTSQDIGAKAAGRAME 543 L ++Q + IG+ AA R + Sbjct: 515 LGMEVQGDVMKSIGSAAASRMVA 537 >gi|291334411|gb|ADD94066.1| hypothetical protein ALOHA_HF400048F7ctg1g11 [uncultured phage MedDCM-OCT-S04-C1035] Length = 467 Score = 456 bits (1172), Expect = e-126, Method: Composition-based stats. Identities = 116/483 (24%), Positives = 219/483 (45%), Gaps = 29/483 (6%) Query: 64 LITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQ 123 ++T P W L F ++ + + W + T+ ++ ++S F + Sbjct: 1 MLTNPSTPWFSLK--------FKNEDMEGEDEAKLWLESATEVMYSAF--NQSNFQQEIF 50 Query: 124 SFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTV 183 Y ++ FGT ++E D DE L+ R+I+ +Y+S N + +D+V+R+F + Sbjct: 51 ELYHDLITFGTAAMFIEED-DEDNLKFSTRHIN----EIYISENEKGRIDTVFRKFRISA 105 Query: 184 DQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK-DKGNKGFHSKFVSVD 242 + K+G +S+ + ++ E I+HAVYP+ + KK D N F S ++ D Sbjct: 106 RAAIRKFG--NVSNNIAVIAKKDPYEEVEILHAVYPRDDYNPKKQDTENMQFESIYLDAD 163 Query: 243 ENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 FP++V RY + EIYGRSPAM ALP ++ LNE + + + + P Sbjct: 164 SGEELSVSGFREFPFVVPRYLKASHEIYGRSPAMTALPDVKMLNEMSKTIIKSAQKQVDP 223 Query: 303 PTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEEL-NRLKESIRSLF 361 P + + PG +N R +P+ G + + + SIR+ F Sbjct: 224 PLLVPDDGFLLPVRTVPGGLNFYRAGT--RDRIEPLNIGANNTLGLNMEEQRRNSIRNAF 281 Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 ++ ++ D +A E +++ EK +GP++G LQSE + +I R IL + Sbjct: 282 YVNQL-MMQDGPQMTATEVIQRNEEKMRLLGPVLGRLQSELLKPLIDRSFAILMRRNLFA 340 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 + + +++EY SPL K Q++ ++S ++ + + L DH++ D+ Sbjct: 341 QPPEFLSGQD--IEIEYVSPLAKAQKSTELSSIMRAIEIMGSLSNVA---PVFDHINMDK 395 Query: 482 VSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRA 541 + R P +++ +E+ RQ + Q+ M++ QQL + + A +A Sbjct: 396 LVRHLTNIVGVPQKILKPQSELNAERQAQAQQQEQMQQMQQVQQLAEAGGKVAPLA--KA 453 Query: 542 MEK 544 + + Sbjct: 454 LPE 456 >gi|221201497|ref|ZP_03574536.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221207947|ref|ZP_03580953.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221172132|gb|EEE04573.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221178765|gb|EEE11173.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 549 Score = 454 bits (1168), Expect = e-125, Method: Composition-based stats. Identities = 143/555 (25%), Positives = 240/555 (43%), Gaps = 34/555 (6%) Query: 1 MNQRSAKDIQDR---FNYLKNQRGELNYWMEELTGFLYPYKNN--------------AQL 43 M AK ++ +K +R ++ FL P + Sbjct: 1 MTNDDAKLLEALNADHGRMKEKRQSYEATWNDVIDFLMPRLDKFGQLPRPDSEKGRERSQ 60 Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV 103 RM+D+T A + + S+ITP Q WH L S + V+ + +V Sbjct: 61 RMFDSTAPLALRNFVAAMDSMITPATQLWHRLKASN--------DVLNENAAVKAYLQEV 112 Query: 104 TDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVY 163 LF R R + GFV + + Y SV FG G +E DV + GI Y +VP+ ++ Sbjct: 113 VRVLFAVRYRWQGGFVTQMGATYQSVGLFGPGALMIEHDVGQ-----GIVYRNVPMQRLW 167 Query: 164 MSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + N+ ++D + ++ T+ Q ++G + LS M+SAL R+ + H V P++ Sbjct: 168 FAENNAGIIDKTHVQWELTLRQAAQRFGRENLSPSMQSALERDPEKSAIFYHIVEPRADR 227 Query: 224 DKKK-DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 D +K D N F S ++ +R + TFP+ +GR+ V + YG SPA +A+P Sbjct: 228 DPRKLDGRNMRFGSYWLDEGRDRIIQNSGFRTFPFAIGRFYVGTGDAYGGSPACDAMPDT 287 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 R +N+ + + + PP + + FDL+ G +N G L +G + +P+ G Sbjct: 288 RMVNDMAKTNIRGAQKLVDPPLLVSEDGSLEGFDLRSGSLNWGGLDEKGNEMVKPLLMGK 347 Query: 343 PLPYHEEL-NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 E +++I F + LFQ+L D +A E +++ +EKG + P +G QSE Sbjct: 348 QAQIGIEFTQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQSE 407 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 +G +I RELDIL LPE + +++EY SPL K +A A+ LQ + + Sbjct: 408 LLGPLIERELDILAEAAQLPEMPRELINAGANVEIEYDSPLNKAMRAGESAATLQWLQQL 467 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQH 521 V D M + R++R A P + E++ +V + Sbjct: 468 S--VVAQFDLRAMKAPNGLRIARMLADAGGVPVEAMNTDEELQAQEAAEAQAMQVQQALA 525 Query: 522 LQQQLQQTSQDIGAK 536 +D+ Sbjct: 526 AAPVAAGAIKDLSDA 540 >gi|262043663|ref|ZP_06016772.1| hypothetical protein HMPREF0484_3791 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039001|gb|EEW40163.1| hypothetical protein HMPREF0484_3791 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 554 Score = 453 bits (1166), Expect = e-125, Method: Composition-based stats. Identities = 133/514 (25%), Positives = 236/514 (45%), Gaps = 29/514 (5%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK----------NNAQLRMWDTTGS 51 I ++ R +E+ + P + D TG+ Sbjct: 10 ESERIGRILREQKSMETDRSVFEQHWQEIAERILPRSAEFKGTRQKGGKRTEKAIDATGA 69 Query: 52 EACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFR 111 A K + + S+ITP QKWH L+ + A ++V+ + +V D LF R Sbjct: 70 LALQKFGAAIESVITPRTQKWHTLS----------NERFANDEEVQRYFQEVRDILFRLR 119 Query: 112 ERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNV 171 + F Y S FGTGC +++ + + G RY + L +Y + N Q + Sbjct: 120 YAPWANFASQSHEHYISSGAFGTGCTFVDNVIGK-----GPRYCTYHLREIYFTENFQGM 174 Query: 172 VDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD-KKKDKG 230 +D V+R++ T Q + ++G++ L ++++ + +++F +H V P D ++DK Sbjct: 175 IDVVHRKYCMTARQAIQQFGEENLPQQVRTTARNDPSKQFNFLHRVEPNDKRDMSRQDKE 234 Query: 231 NKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVN 290 F S + ++ ++ +E + PY + RY E+YGRSPAM LP I+ LNE Sbjct: 235 GMPFRSVHICMEGSKIVQEGGYWSQPYAISRYYTAPGEVYGRSPAMVVLPDIKLLNEINR 294 Query: 291 ELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEEL 350 + + ++++ PP + + + F + PG +N G ++R+G+ L P+ L Sbjct: 295 AIIEGAQMAVRPPMLLPEDGILQPFKMMPGALNFGGMNRDGKPLALPLNTATDFSVAMTL 354 Query: 351 -NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISR 409 + +++I F + LFQ+L D +A E+M + +EKG + P G +Q+EF+G +I R Sbjct: 355 AEQKRQTINDGFFITLFQILVDNPQMTATEAMLRAQEKGQLLAPTAGRIQAEFLGTLILR 414 Query: 410 ELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTG 469 E+DI G LPE +EYTSPL + Q +E + + VN +G Sbjct: 415 EIDIAYQNGLLPEPPEQLKEIGGEYDIEYTSPLVRLQMSEEASGIMNVVNAAGTIG--QF 472 Query: 470 DPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEV 503 D + ++ D RF A+ P +++ E+ Sbjct: 473 DQNIARTLNGDAALRFIAKASGAPLQVVKTEDEM 506 >gi|303328393|ref|ZP_07358830.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861387|gb|EFL84324.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 567 Score = 452 bits (1163), Expect = e-125, Method: Composition-based stats. Identities = 117/521 (22%), Positives = 199/521 (38%), Gaps = 46/521 (8%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYP-------------YKNNAQLRMWDTTGSEA 53 K + R+ L +R ++L P KN + D+TG A Sbjct: 6 KKLHQRWEMLVEKRRPWISTWKDLAALYLPTGYRDADDGNARGGKNLLNPEVVDSTGIYA 65 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 L++ + +T P + W GL R W D+V + + Sbjct: 66 LRTLAAGMQGGMTSPARPWFGLRLE-------GGDSGDGGITARAWIDEVVERMRTIL-- 116 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 S F G + Y + FGT C + AD+ G + + V+ VD Sbjct: 117 HTSNFYGVIYQAYAQLAAFGTACVFERADM------SGFTFDCCQAGTFVLDVDAGGRVD 170 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSAL--ARNENERFTIIHAVYPKSLTDKKKDKGN 231 +V R+ T Q+ ++G+ L +K++L A N R + HAVYP+ +++ N Sbjct: 171 TVMRKIWLTARQMAQEFGEDALPDMVKTSLNNASMGNVRHAVFHAVYPRREPGLRRETIN 230 Query: 232 ---KGFHSKFVS-----VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 + F S + E +FP+ R+ V + ++YG SPAM+ +P R Sbjct: 231 GARRPFASVYWMRGMSGAGGYHPLRESGFDSFPFFGVRWNVLSGDVYGTSPAMDTMPDCR 290 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP 343 L + + + PP +E + DL PG +N ++ + PV P Sbjct: 291 MLQQMAKTTLKGVHKMVDPPVNVAAELQSVGVDLTPGGVNYVSMMGNNGAAVTPVLKVQP 350 Query: 344 --LPYHEELNRLKESIRSLFLLDLFQVLDDKASR--SAAESMEKTREKGAFVGPLIGGLQ 399 + ++++ I+ DLF++L R +A E + EK +GP++ L Sbjct: 351 DVAAAQAMIQQVQQQIKEGLYNDLFRMLLGTNRRQITATEVDAREAEKMILIGPVLERLH 410 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVN 459 E +I R ++D LP LKVE+ S L + Q+ S Q + Sbjct: 411 DELFIPLIDRTFALMDKFNALPPVPEELAG--RGLKVEFISTLAQAQKLVSTGGIQQLLA 468 Query: 460 TVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDT 500 + G DPS +D ++ DR+ A ++R Sbjct: 469 FIG--GAAQVDPSVLDALNGDRLVDKYNEYLGVDAGVLRPQ 507 >gi|302339294|ref|YP_003804500.1| head-to-tail joining protein [Spirochaeta smaragdinae DSM 11293] gi|301636479|gb|ADK81906.1| head-to-tail joining protein, putative [Spirochaeta smaragdinae DSM 11293] Length = 560 Score = 448 bits (1152), Expect = e-123, Method: Composition-based stats. Identities = 125/526 (23%), Positives = 231/526 (43%), Gaps = 42/526 (7%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----NNAQLR-----MWDTTGS 51 ++SA++I F LK +R +E+T ++P + N + ++D T Sbjct: 3 EEKSAQEIIQTFEQLKQERSTWEDEYQEITEQIFPRRSVWTDNKGRASRSGGLIYDGTPI 62 Query: 52 EACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFR 111 A L++ L + P +W L E + + R+W + V + ++ Sbjct: 63 SALNLLANGLVGYLVSPATRWFKLRP--------TQDELLQIRGARQWLEIVENLIYD-- 112 Query: 112 ERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNV 171 E +RS F + ++ G Y++ D+ + R+ +Y++ + Sbjct: 113 EFNRSNFYEEIVEYFRDGGSIGIATIYVQEDIGRRMANYSCRH----PKEIYIAEDRFGY 168 Query: 172 VDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK-DKG 230 +D+V+R F T ++ ++G + LS +++ R+ ER IIHAVYP+ + +K Sbjct: 169 IDTVFRRFFPTAKELEEEFGREALSDGVQNLCERSPYERVEIIHAVYPRKKRNPRKKGNR 228 Query: 231 NKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVN 290 + F S +V N E+ PY+V R+ +DE+YGR P +AL ++RLN Sbjct: 229 DMKFASAYVEGGSNHKIRERGYERLPYVVWRWSTNSDEVYGRGPGYDALVDVKRLNRLSR 288 Query: 291 ELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEEL 350 ++ + ++++ PP + + + + P +N E PV + + L Sbjct: 289 DMLKQSQMAVDPPLAVPEKMRGK-VNWVPRGLNYYQNPNE-----VPVALNPGMQFQVGL 342 Query: 351 NR---LKESIRSLFLLDLFQVLDDKAS-RSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 +R +++ I F+ D F +L+ +A E ME+ EK A +G +IG + SEF+ + Sbjct: 343 DREQHMQQIIEKHFMTDFFLMLEQAPKEMTATEVMERQSEKAAVLGTVIGRISSEFLDPI 402 Query: 407 ISRELDILDSQGNL----PECEGADNPPVSLLKVEYTSPLFKYQQAESV-ASALQGVNTV 461 I DI L PE A ++++Y PL + Q+ V A Q +N V Sbjct: 403 IDITFDIAMKGKRLPPPPPEFAEAMYKTNGGIEIDYLGPLAQAQKKFHVTQGAQQSLNAV 462 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIR 507 + +P D ++ D+++ L A P I D +V+ IR Sbjct: 463 API--MQINPQVADLINWDQLTMEILHAYGMPQKAIVDLRDVQKIR 506 >gi|288959388|ref|YP_003449729.1| phage head-tail connector protein [Azospirillum sp. B510] gi|288911696|dbj|BAI73185.1| phage head-tail connector protein [Azospirillum sp. B510] Length = 535 Score = 447 bits (1151), Expect = e-123, Method: Composition-based stats. Identities = 135/552 (24%), Positives = 220/552 (39%), Gaps = 30/552 (5%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M A++I R L R EL ++ P + R++D T Sbjct: 1 MADARAEEIIRRRESLAALRSPWEGVWSELGEYVRPLRTGFAGGPPQSGAKPSSRLFDAT 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 A L++ L +IT P W + E + V+ W V + Sbjct: 61 AGMANNNLAAGLYGMITNPANSWFNIKHEI--------DELNEVQAVKLWMATVERAMRQ 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 + F + Y + FGT FY++ G+ Y LS ++S N + Sbjct: 113 ALAANGLAFYSRVFGLYLDLPAFGTAVFYIDEQPG-----RGLWYSHRRLSECFVSENDR 167 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK-D 228 +D+VYR+FT+T Q +WGD+ K+ + F +HAV P D +K Sbjct: 168 EEIDTVYRDFTWTARQAQQRWGDRAGREVAKAIEKGEPDRPFRWLHAVEPNPDFDPRKLG 227 Query: 229 KGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 K F S +V VD+ E PY V R+ YG S A+ A+ I+ +N Sbjct: 228 ARFKPFRSVYVGVDDRHVVAEGGYDELPYQVPRWAPSDAGTYGDSAAVLAIADIKMVNAM 287 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE 348 + ++ PP +A E R PG + G + G L +P+Q G + Sbjct: 288 GKTTIVGAQKAVDPPLLAPDEFSVRGLRTSPGGITYGGVDMGGNQLLKPLQTGARVDLGL 347 Query: 349 EL-NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 EL + + +IR F L ++ R+A E ME EK + P +G +Q+EF+ + Sbjct: 348 ELEEQRRGAIREAFHWSLLLMVQQ-PGRTATEVMEHQEEKLRLMAPHLGRIQAEFLDPAL 406 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 R +L+ G LP L+++Y SPL + +A A+ ++ + + + Sbjct: 407 GRVFSLLNRTGQLPPPPDVLR-QYPGLRLDYVSPLARAAKAAEGAAVIRTLEALGPIA-- 463 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQ 527 P MD+ DTD ++R A PA ++ D +VE +R R Q++ Sbjct: 464 QLRPEVMDNFDTDEIARGISDAYGLPAKMMLDPRQVEQMRSARAQQQQQAVALEQSAVAA 523 Query: 528 QTSQDIGAKAAG 539 +D+ A A Sbjct: 524 GALKDMSAAGAA 535 >gi|167041083|gb|ABZ05844.1| hypothetical protein ALOHA_HF400048F7ctg1g11 [uncultured marine microorganism HF4000_48F7] Length = 552 Score = 445 bits (1145), Expect = e-123, Method: Composition-based stats. Identities = 115/517 (22%), Positives = 222/517 (42%), Gaps = 37/517 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-----------AQLRMWDTT 49 M+ +A +Q+ + LK++RG +++ + P + + + R++++T Sbjct: 1 MSSDAATLVQE-YEALKSERGNWENMWQDIAELMIPRRADFTNRYRAPGEQRRDRIYEST 59 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 A ++ +S L + +T W L +E ++++V+ W + T Sbjct: 60 AVRALVRGASGLHNTLTSSTVPWFALETED--------RELMKNRQVQLWLEDATRRCNS 111 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 RS F +Y ++ FGTGC Y+ + G + S L + Y++ Sbjct: 112 VFNAPRSMFHQSAHEYYLDLLAFGTGCMYVTQEPG-----MGPVFKSYFLGHTYIAEGKT 166 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDK 229 ++DSVYR F T + ++G L ++ A + RF ++H V P+S + Sbjct: 167 GMIDSVYRRFDDTARSLYKQFG-NKLPDEIVKAADKEPFRRFELLHIVRPRSNAPGGRTS 225 Query: 230 GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 K F S +V + + +E PYIV R++ + E+YGR P +EALP +R +NE Sbjct: 226 KQKPFLSVYVHAESRKVVQEGGFDEMPYIVSRWQKNSMEVYGRGPGIEALPDVRMVNEME 285 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE- 348 + + PP + + PG +N + P+Q G + +E Sbjct: 286 RVGLIALQKVVDPPLLVPDDGFLSPIRTTPGGLNYYRAGLGPQDRIAPLQTGGRVDLNEA 345 Query: 349 ELNRLKESIRSLFLLDLFQVLDDKASR------SAAESMEKTREKGAFVGPLIGGLQSEF 402 ++ +++ +I F LDL ++ A+ SA E + R++ +GP++ ++EF Sbjct: 346 KIGQVRAAIERTFYLDLLELPGPTAADGDVLRFSATEIAARQRDRLNILGPIVARQEAEF 405 Query: 403 IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 +G ++ R L ++ LP + KV Y++P+ Q+A +AS Q + +V Sbjct: 406 LGPLVIRTLSVMLRAEMLPPPPQVLL--DADFKVSYSNPVAIAQRAGELASISQLIQFLV 463 Query: 463 ELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRD 499 DP+ + T RV+ + + + Sbjct: 464 PFA--QLDPTVIQRFQTGRVAELAAEILKVSPSVFKS 498 >gi|54302247|ref|YP_132240.1| putative head-tail connector protein [Photobacterium profundum SS9] gi|46915668|emb|CAG22440.1| hypothetical protein PBPRB0567 [Photobacterium profundum SS9] Length = 552 Score = 444 bits (1142), Expect = e-122, Method: Composition-based stats. Identities = 120/567 (21%), Positives = 216/567 (38%), Gaps = 46/567 (8%) Query: 3 QRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-----------AQLRMWDTTGS 51 + + F L + EL ++ P + + D + + Sbjct: 2 KTIRQQCDSIFQGLDSDYAPWESHYRELANYIQPRRQRFSKDSVNRGGAHNSNIIDPSAT 61 Query: 52 EACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFR 111 A + + S IT P KW L K+ + VR + D D + G Sbjct: 62 LAMRVAAGGMYSGITNPVTKWLRL--------NVEDKDLNKYHIVRLYLDTCADLILGML 113 Query: 112 ERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNV 171 + S F + S + ++ + E D +R+ P+ + + + + Sbjct: 114 --ASSNFYNVVPSMFMDLLTYSGSSVGFEKDP-----LTVMRFYPNPIGSYRLGIGPRQN 166 Query: 172 VDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT-IIHAVYPKSLTDKKK-DK 229 V + R+ + V Q+V K+G +S +KSA + + T I H V+ + Sbjct: 167 VSTHGRKVEYRVSQVVEKFGLDNVSQSIKSAYRSGKYNQLTEIRHLVFDNPDFVPRAFSA 226 Query: 230 GNKGFHSKFVS--VDENRFFEEKQIATFPYIVGRYRVRADEIYGR-SPAMEALPTIRRLN 286 K S + D N F FP++ R+ V ++ YG P M AL +I+ L Sbjct: 227 VRKPICSIWYDPADDRNPFLRRSGFDEFPFVTPRWEVIGNDTYGSFGPGMLALGSIKGLQ 286 Query: 287 ETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPV-QFGNPLP 345 + + + L PP + S K L PG + +++G+ F P Q PL Sbjct: 287 KDQRDKYEAQDKMLKPPMVGPSSLKNNPRSLLPGAVTF-VDNQQGQQGFTPAFQTNFPLN 345 Query: 346 YHEE-LNRLKESIRSLFLLDLFQVLDD--KASRSAAESMEKTREKGAFVGPLIGGLQSEF 402 Y E + + I S F DLF + D K++ +A E + EK +GP++ E Sbjct: 346 YQLESIRDTRAIIDSAFFKDLFLAVIDIGKSNTTATEIAARKEEKLLMLGPVLNRFNEEG 405 Query: 403 IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 + ++S ++ +G LPE + + +EY L + Q+A ++S + V + Sbjct: 406 LDPIVSASFYEMNRRGMLPEPPPELDGVD--VNIEYVGLLQQAQKAVGISSIERTVGFIG 463 Query: 463 ELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHL 522 L D +D +D D V T T ++ + +V+ R R + ++Q Sbjct: 464 NLAGVRQD--VLDKVDFDSVVDIYTDITGTTPRILFNEQQVKATRDAR-----IQQQQRE 516 Query: 523 QQQLQQTSQDIGAKAAGRAMEKKLTHD 549 Q GA+AA + + + T + Sbjct: 517 QMAAMAAPAKDGAEAA-KLLSETRTDE 542 >gi|293609619|ref|ZP_06691921.1| predicted protein [Acinetobacter sp. SH024] gi|292828071|gb|EFF86434.1| predicted protein [Acinetobacter sp. SH024] Length = 547 Score = 443 bits (1139), Expect = e-122, Method: Composition-based stats. Identities = 121/570 (21%), Positives = 226/570 (39%), Gaps = 47/570 (8%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-------------AQLRMWD 47 M++ A+ + R + LK R L E + P + + + D Sbjct: 1 MSELVAR-LCKRLSELKAARNRLEPHWSECYRYAAPERQQSFIGDDVTDTRKTQRAELLD 59 Query: 48 TTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTL 107 +T SEA L S + S TP W + + A + +W D+V Sbjct: 60 STLSEATQLLVSSIISGTTPANALWFKAVPN-------GVDDPAELTEGEKWLDEVCQ-- 110 Query: 108 FGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVN 167 F +R + + + V G G Y + D G G + + + Y++ Sbjct: 111 FIWRNIHGANYDSEIFDLVLDCVVAGWGVMYADVDRHAGG---GYVFQTWDIGQCYLAST 167 Query: 168 HQN-VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK 226 Q+ VD++YRE+ T+ +V+++G+ +S K+++ + + ++ V P+ K Sbjct: 168 RQDQKVDTLYREYEMTMAALVNEYGENKVSEKVRNTYKSKPDCKVKVLWVVEPRKTGYIK 227 Query: 227 KDK----GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 D+ F S V VDE E FP+++ R+R + +YG ALP Sbjct: 228 GDRQLMPKEMPFASYHVEVDEKNVLRETGYNEFPFVIPRFRKIPNSVYGTGQVSIALPDA 287 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEA--KQRNFDLKPGYMNIGALSREGRSLFQPVQF 340 + N+ + + + +S V + R L G + + + + Sbjct: 288 KTANKLMRDTLRSAEISTLGMYAGVDDGTFNPRTVRLGGGKIIVVNDVNS----LKRIDD 343 Query: 341 GNPLPYHEE-LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQ 399 G + L L+ +IR + D Q D A +A E + +GPL G Q Sbjct: 344 GKGYQVGVDLLAHLQGAIRKKMMADQLQPADGPA-MTATEVHVRVDLIRQQLGPLYGRWQ 402 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVN 459 +E + ++ R + G + E L ++ S L + QQ E V + + + Sbjct: 403 AELLTPLLERTFGLAYRAGVIGEAPEEM--QGRNLSFKFISALARSQQLEEVTAIERFLA 460 Query: 460 TVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEE 519 + + DPS +D++D D V++ S P ++R +++ IR+QR+ ++ + Sbjct: 461 GMSNVA--QIDPSILDNVDMDAVAQVSGMGLGVPTAILRTQDQIDAIRKQRQEAQQQAAQ 518 Query: 520 QHLQQQLQQTSQDIGAKAAGRAMEKKLTHD 549 Q +Q L Q + A G+ +E +LT + Sbjct: 519 QEQEQALAQPLAN----AVGKGLESELTSE 544 >gi|332875224|ref|ZP_08443057.1| hypothetical protein HMPREF0022_02690 [Acinetobacter baumannii 6014059] gi|332736668|gb|EGJ67662.1| hypothetical protein HMPREF0022_02690 [Acinetobacter baumannii 6014059] Length = 547 Score = 442 bits (1138), Expect = e-122, Method: Composition-based stats. Identities = 121/570 (21%), Positives = 226/570 (39%), Gaps = 47/570 (8%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-------------AQLRMWD 47 M++ A+ + R + LK R L E + P + + + D Sbjct: 1 MSELVAR-LCKRLSELKAARNRLEPHWSECYRYAAPERQQSFIGDDVTDTRKTQRAELLD 59 Query: 48 TTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTL 107 +T SEA L S + S TP W + + A + +W D+V Sbjct: 60 STLSEATQLLVSSIISGTTPANALWFKAVPN-------GVDDPAELTEGEKWLDEVCQ-- 110 Query: 108 FGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVN 167 F +R + + + V G G Y + D G G + + + Y++ Sbjct: 111 FIWRNIHGANYDSEIFDLVLDCVVAGWGVMYADVDRHAGG---GYVFQTWDIGQCYLAST 167 Query: 168 HQN-VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK 226 Q+ VD++YRE+ T+ +V+++G+ +S K+++ + + ++ V P+ K Sbjct: 168 RQDQKVDTLYREYEMTMAALVNEYGENKVSEKVRNTYKSKPDCKVKVLWVVEPRKTGYIK 227 Query: 227 KDK----GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 D+ F S V VDE E FP+++ R+R + +YG ALP Sbjct: 228 GDRQLMPKEMPFASYHVEVDEKIVLRETGYNEFPFVIPRFRKIPNSVYGTGQVSIALPDA 287 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEA--KQRNFDLKPGYMNIGALSREGRSLFQPVQF 340 + N+ + + + +S V + R L G + + + + Sbjct: 288 KTANKLMRDTLRSAEISTLGMYAGVDDGTFNPRTVRLGGGKIIVVNDVNS----LKRIDD 343 Query: 341 GNPLPYHEE-LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQ 399 G + L L+ +IR + D Q D A +A E + +GPL G Q Sbjct: 344 GKGYQVGVDLLAHLQGAIRKKMMADQLQPADGPA-MTATEVHVRVDLIRQQLGPLYGRWQ 402 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVN 459 +E + ++ R + G + E L ++ S L + QQ E V + + + Sbjct: 403 AELLTPLLERTFGLAYRAGVIGEAPEEM--QGRNLSFKFISALARSQQLEEVTAIERFLA 460 Query: 460 TVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEE 519 + + DPS +D++D D V++ S P ++R +++ IR+QR+ ++ + Sbjct: 461 GMSNVA--QIDPSILDNVDMDAVAQVSGMGLGVPTAILRTQDQIDAIRKQRQEAQQQAAQ 518 Query: 520 QHLQQQLQQTSQDIGAKAAGRAMEKKLTHD 549 Q +Q L Q + A G+ +E +LT + Sbjct: 519 QEQEQALAQPLAN----AVGKGLESELTSE 544 >gi|169795385|ref|YP_001713178.1| putative phage related protein [Acinetobacter baumannii AYE] gi|169148312|emb|CAM86177.1| conserved hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 547 Score = 442 bits (1136), Expect = e-122, Method: Composition-based stats. Identities = 121/570 (21%), Positives = 224/570 (39%), Gaps = 47/570 (8%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-------------AQLRMWD 47 M++ A+ + R + LK R L E + P + + + D Sbjct: 1 MSELVAR-LCKRLSELKAARNRLEPHWSECYRYAAPERQQSFIGDDVTDTRKTQRAELLD 59 Query: 48 TTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTL 107 +T SEA L S + S TP W + + A +W D+V Sbjct: 60 STLSEATQLLVSSIISGTTPANALWFKAVPN-------GVDDPAELTDGEKWLDEVCQ-- 110 Query: 108 FGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVN 167 F +R + + + V G G Y + D G G + + + Y++ Sbjct: 111 FIWRNIHGANYDSEIFDLVLDCVVAGWGVMYADVDRHAGG---GYVFQTWDIGQCYLAST 167 Query: 168 HQN-VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK 226 Q+ VD++YRE+ T+ +V+++G+ +S K+++ + + ++ V P+ K Sbjct: 168 RQDQKVDTLYREYEMTMAALVNEYGENKVSEKVRNTYKSKPDCKVKVLWVVEPRKTGYIK 227 Query: 227 KDK----GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 D+ F S V VDE E FP+++ R+R +YG ALP Sbjct: 228 GDRQLMPKEMPFASYHVEVDEKIILRETGYNEFPFVIPRFRKIPHSVYGTGQVSIALPDA 287 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEA--KQRNFDLKPGYMNIGALSREGRSLFQPVQF 340 + N+ + + + +S V + R L G + + + + Sbjct: 288 KTANKLMRDTLRSAEISTLGMYAGVDDGTFNPRTVRLGGGKIIVVNDVNS----LKRIDD 343 Query: 341 GNPLPYHEE-LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQ 399 G + L L+ +IR + D Q D A +A E + +GPL G Q Sbjct: 344 GKGYQVGVDLLAHLQGAIRKKMMADQLQPADGPA-MTATEVHVRVDLIRQQLGPLYGRWQ 402 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVN 459 +E + ++ R + G + E L ++ S L + QQ E V + + + Sbjct: 403 AELLTPLLERTFGLAYRAGVIGEAPEEM--QGRNLSFKFISALARSQQLEEVTAIERFLQ 460 Query: 460 TVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEE 519 + + DPS +D++D D V++ S P ++R +++ IR+QR+ ++ + Sbjct: 461 GLSSVA--ELDPSILDNVDMDAVAQVSGMGLGVPTAILRTQDQIDAIRKQRQEAQQQAAQ 518 Query: 520 QHLQQQLQQTSQDIGAKAAGRAMEKKLTHD 549 Q +Q L Q + A G+ +E +LT + Sbjct: 519 QEQEQALAQPLAN----AVGKGLESELTSE 544 >gi|282848877|ref|ZP_06258267.1| hypothetical protein HMPREF1035_1386 [Veillonella parvula ATCC 17745] gi|282581382|gb|EFB86775.1| hypothetical protein HMPREF1035_1386 [Veillonella parvula ATCC 17745] Length = 575 Score = 440 bits (1132), Expect = e-121, Method: Composition-based stats. Identities = 116/517 (22%), Positives = 209/517 (40%), Gaps = 44/517 (8%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN----------AQLRMWDTTGSEACIK 56 ++ +F+ L N + + L + P+ ++ + E+C Sbjct: 24 TKLRKKFSQLFNAQQRYVNKWKHLRDYQLPFIGQFDGEEDQSEPYNGKILNPVAWESCQI 83 Query: 57 LSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRS 116 +S + S +TPP +KW L + A + +V E D+ + L+ ++S Sbjct: 84 FASGVMSGLTPPSRKWFKLT--------MENIDVAANSQVAELLDEREEILYAVL--AKS 133 Query: 117 GFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVY 176 F + Y + G + AD E G+R+ S P+ +S N + +V+ Sbjct: 134 NFYSVVHQVYMEL-PMGQAPMGIFADS-----ESGVRFTSYPIGTYAISTNSKEIVNIFG 187 Query: 177 REFTFTVDQIVSKWGDKVLSSKMKSALARNE--NERFTIIHAVYPKSLTDKKKDKGNKGF 234 R++ TVDQIV ++G + +K+ + FT+ V P K + N + Sbjct: 188 RKYKMTVDQIVEQFGYENCPDNIKNIYDNGNSLQQSFTVNWLVEPNKDRKDKLGRRNMPY 247 Query: 235 HSKFVSVDE--NRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNEL 292 S + + +P + R+ YG+ A A P + L + + Sbjct: 248 SSIYWVEGSNSDEVLYHGGFEEWPIPIARHTSMDLNGYGKGAAWFAQPDSQMLQKLEFDY 307 Query: 293 AQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG--NPLPYHEEL 350 L + PP A S+ +L PG + EG+ +P+ N ++ Sbjct: 308 LTAVELGVKPPMQAPSDVIS-TVNLYPGGIT----EIEGQHKVEPMFAVQSNLQDIQNKI 362 Query: 351 NRLKESIRSLFLLDLFQVLD--DKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 ++SI+ + DLF +LD DK +A E ME+T+EK +GP++ L SEF+ +I Sbjct: 363 AVTEDSIKRAYSADLFLMLDQIDKGQMTAREVMERTQEKLQQLGPVVERLLSEFLNPIIE 422 Query: 409 RELDILDSQGNLPECEGA---DNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELG 465 R +LD G P E D +K+EY SPL + Q+ S+ + Q ++ L Sbjct: 423 RVYAVLDRAGVFPPVEDEELLDQLNGQEVKIEYISPLAQAQKMSSLVNIEQYFAFIMSLA 482 Query: 466 VKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAE 502 +P+ ++ + + + PA +IR E Sbjct: 483 --QANPNIVNKFNFEEAANTYGVNLGVPAKIIRSDDE 517 >gi|292670769|ref|ZP_06604195.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] gi|292647390|gb|EFF65362.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] Length = 567 Score = 434 bits (1117), Expect = e-119, Method: Composition-based stats. Identities = 101/523 (19%), Positives = 201/523 (38%), Gaps = 40/523 (7%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLR------------MWDTT 49 + + + ++ + +R + ++L+ ++ P + + D Sbjct: 14 DSDAIRRKKNLVTQMMTERTQFESTWKQLSKYINPTRGRFDDEDKTQDGRRRDYFLLDPY 73 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 EA K ++ L S +T P + W L KE A V+ W ++ D L G Sbjct: 74 PMEASGKCAAGLHSGLTSPSRPWFALG--------LQDKELAEYHTVKLWLEECQDVLMG 125 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L + + +FGTG + D + G+ +V+ + Sbjct: 126 IY--AKSNIYNMLLNIEAELTQFGTGAALLLEDFN-----TGVWARPYTCGEYAGNVDAR 178 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSAL-ARNENERFTIIHAVYPKSLTDKKKD 228 V R+F Q+V ++G+ V+S +++A A+N + F + + + + + Sbjct: 179 GRVVQFARKFKLNAWQMVDEFGEDVVSDAVRNAYRAKNLKDYFPVTMLIEKNADYNPDSN 238 Query: 229 KG-NKGFHSKFVSVDE-NRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLN 286 N + S + + + F + P+++ R+ V A+ IYG P AL +L Sbjct: 239 ALLNFKYKSYYFEDSQTDVFLKVSGYHEVPFLMPRWTVIANGIYGVGPGHNALGNCMQLQ 298 Query: 287 ETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPG--YMNIGALSREGRSLFQPVQFGNPL 344 + + P I S + + PG + ++ R L++ G+ Sbjct: 299 KIEKINMRLLEHRSDPALIVPSSVGK--VNRLPGKETLVPDSMINGIRPLYEA--TGDRG 354 Query: 345 PYHEELNRLKESIRSLFLLDLFQVL--DDKASRSAAESMEKTREKGAFVGPLIGGLQSEF 402 + + ++ I + F DLF +L D +A E E+ EK + P++ + +E Sbjct: 355 EVMQTIQYKQQQIGAAFYNDLFVMLAQQDNPQMTAREVAERHEEKLLMLSPVLEQMHNEV 414 Query: 403 IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 + + R +I G LP +K E+ S L + Q+A + + + Sbjct: 415 LAPLTRRAFEICYRNGLLPPLPEELRGQEGSIKAEFISLLAQAQKAVGTNAMEKTLAIAG 474 Query: 463 ELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVED 505 L P MD++D D R + TP ++RD +V+ Sbjct: 475 NL--MGASPEIMDNLDLDAAIREHAQMSGTPETIMRDEQDVQK 515 >gi|83313332|ref|YP_423596.1| hypothetical protein amb4233 [Magnetospirillum magneticum AMB-1] gi|82948173|dbj|BAE53037.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 545 Score = 431 bits (1109), Expect = e-118, Method: Composition-based stats. Identities = 112/503 (22%), Positives = 201/503 (39%), Gaps = 45/503 (8%) Query: 9 IQDRFNYLKNQRGELNYWMEELTGFLYPYKN----------NAQLRMWDTTGSEACIKLS 58 + R+ K +R +E + P ++ R++D T + +L+ Sbjct: 33 LLRRYRKAKERRSTWESHWQECYDYALPLRDGMFHSSVPGERKADRLFDGTAPDCVDQLA 92 Query: 59 SLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGF 118 + L S +TPP +W GLA +A + ++ + E V + F RS F Sbjct: 93 ASLLSELTPPWAQWFGLAAGDQMPEA----DRDQAAPLLERIAAVMQSHF-----DRSNF 143 Query: 119 VGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYRE 178 + Y V GT E G R+ SVPL V + +D +R Sbjct: 144 AIEMHQCYLDAVTGGTASLMFEEAP--PGEPSAFRFTSVPLGQVVLEEGPAGRLDVTFRR 201 Query: 179 FTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKF 238 +V + +++ VL ++ A A + + R ++ AV P +G + + Sbjct: 202 SELSVAALKARFPRAVLPREVIKAAADDPDLRLGVVEAVVPV--------RGGYSYAAVL 253 Query: 239 VSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRL 298 + Q ++ P++ R+ E+YGRSP M+ALP I+ N+ V + + + Sbjct: 254 DDDGSDLVLGRGQFSSSPFLNFRWLKAPGEVYGRSPVMKALPDIKTANKVVELVLKNATI 313 Query: 299 SLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREG-RSLFQPVQFGNPLPYHEELNRLKE 355 ++ A + L PG + A+ G + L P +F L+ L+ Sbjct: 314 AVTGIWQADDDGVLNPANIKLVPGTIIPKAVGSAGLQPLTAPGRFDT---SQLVLDDLRG 370 Query: 356 SIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILD 415 IR + D + +A E +++ + +G G LQSE + +I R + IL Sbjct: 371 RIRHALMGDKL-SQPASPALTATEVLQRADDMARLLGATYGRLQSELLTPLILRAIHILR 429 Query: 416 SQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMD 475 +G +P + + ++Y SPL + Q + L + + LG PS + Sbjct: 430 RRGEIPPLQ----VDGRTIDLQYRSPLAQNQGRRDARNVLNWLGALSSLG-----PSALA 480 Query: 476 HMDTDRVSRFSLWATNTPAVLIR 498 +D+D +R+ A N P+ LIR Sbjct: 481 TVDSDAAARWLARAFNVPSELIR 503 >gi|144899435|emb|CAM76299.1| head-to-tail joining protein [Magnetospirillum gryphiswaldense MSR-1] Length = 502 Score = 427 bits (1097), Expect = e-117, Method: Composition-based stats. Identities = 118/516 (22%), Positives = 208/516 (40%), Gaps = 46/516 (8%) Query: 9 IQDRFNYLKNQRGELNYWMEELTGFLYPYKN----------NAQLRMWDTTGSEACIKLS 58 ++ R+ K +R +E + P ++ R++D T ++A +L+ Sbjct: 17 LRQRYRKAKERRATWEAHWQECYDYALPLRDAVLHQPNPGEKKGDRLFDGTAADAVDQLA 76 Query: 59 SLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGF 118 + L S +TPP +W GL ++A ++V D+V L + RS F Sbjct: 77 ASLLSELTPPWAQWFGLTAG-------PDLDEAERQQVAPLLDKVGAILQSHFD--RSNF 127 Query: 119 VGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYRE 178 + Y VV GT C E + G R+ +VPL+ + +DS +R Sbjct: 128 AVEMHQCYLDVVTGGTACLLFEEA--QPGEASAFRFTAVPLAQAVLEEGPDGKLDSSFRR 185 Query: 179 FTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKF 238 T+ + ++ L + + RF +I AV P +G+ + + Sbjct: 186 SELTLAALRQRFPAAQLDPSLIRRGEEDPQARFAVIEAVIPN-------QRGHYDYAAIL 238 Query: 239 VSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFG 296 D+ E + P+I R+ EIYGRSP M+ALP I+ N+ V + + Sbjct: 239 EDATDDDEALLAEGRFGQSPFINFRWLKAPGEIYGRSPVMKALPDIKTANKVVELVLKNA 298 Query: 297 RLSLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREGRSLFQPVQFGNPLPY-HEELNRL 353 +++ A + L PG + A+ G QP++ L+ L Sbjct: 299 TIAVTGIWQADDDGVLNPANIKLIPGTIIPKAVGSAG---LQPLESPGRFDISQLVLDDL 355 Query: 354 KESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDI 413 + IR L D D +A E +E++ + +G G LQSE + +I R + I Sbjct: 356 RGRIRHALLADKLG-QADNPKMTATEVLERSADMARLLGATYGRLQSELLTPLILRAVTI 414 Query: 414 LDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSC 473 L +G +P L++++Y SPL + Q + L ++ + +LG P+ Sbjct: 415 LRRRGEIPPL----LVDGHLVELQYRSPLAQSQAQRDAHNVLSWLSALAQLG-----PAG 465 Query: 474 MDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQ 509 M +D +++ A N PA L+ E++ Q Sbjct: 466 MAVVDPAAAAQWLGRAFNIPADLMVAPQNPENVHVQ 501 >gi|48696640|ref|YP_024419.1| hypothetical protein VP2p04 [Vibrio phage VP2] gi|48696684|ref|YP_024978.1| hypothetical protein VP5_gp03 [Vibrio phage VP5] gi|40806147|gb|AAR92065.1| hypothetical protein [Vibrio phage VP5] gi|40950038|gb|AAR97629.1| hypothetical protein [Vibrio phage VP2] Length = 547 Score = 426 bits (1095), Expect = e-117, Method: Composition-based stats. Identities = 112/557 (20%), Positives = 213/557 (38%), Gaps = 44/557 (7%) Query: 8 DIQDRFNYLKNQRGELNYWMEELTGFLYPYKN--------------NAQLRMWDTTGSEA 53 I R ++LK R + + + ++ P ++ N ++D+T + Sbjct: 5 KIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDG 64 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 LSS L +T P KW LA F KE + R+W + T ++ + Sbjct: 65 LETLSSSLHGSLTSPATKWFELA--------FRDKELNSDDECRKWLENATHDVYSALQD 116 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 S F Y + +G E D DE+G + + S P+ + Y + + V Sbjct: 117 --SNFNLEANETYIDLCGYGNAIMVEEEDEDEEG---SVVFQSSPIQDSYFEEDSRGQVV 171 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE---RFTIIHAVYPKSLTDKKKDKG 230 + YR F +T QI ++GD+ + N+ + ++ V+ + + ++ G Sbjct: 172 NFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAG 231 Query: 231 ------NKGFHSKFVSVDEN-RFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 + F K++ + + EE P R+R A +G P+ ALP + Sbjct: 232 TVLAPTERPFGKKWILKEGAVQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVL 291 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP 343 N V + + + P + + DL + + + +F Sbjct: 292 TANRYVELVLRSSEKVIDPAIMVTERGLISDIDLGASGLTVVRDMESMKPFESRARFDV- 350 Query: 344 LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFI 403 +L L+ ++R ++ +D Q+ D + +A E + +GP +G L+++F+ Sbjct: 351 --SSIQLTDLRSAVRRIYYVDQLQM-KDSPAMTATEVQVRYELMQRLLGPTLGRLENDFL 407 Query: 404 GAMISRELDILDSQGNLPECEGA-DNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 MI R +I G L E + + + YT PL + Q+ + AS + + Sbjct: 408 SPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTA 467 Query: 463 ELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHL 522 +L +P +D D D + R P L+R A+V IR+ R ++ E+ + Sbjct: 468 QLA--EINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQTQQKAEQAAI 525 Query: 523 QQQLQQTSQDIGAKAAG 539 + + G A Sbjct: 526 AEAEGNAMEAQGKGQAA 542 >gi|290968647|ref|ZP_06560185.1| hypothetical protein HMPREF0889_0287 [Megasphaera genomosp. type_1 str. 28L] gi|290781300|gb|EFD93890.1| hypothetical protein HMPREF0889_0287 [Megasphaera genomosp. type_1 str. 28L] Length = 577 Score = 425 bits (1093), Expect = e-117, Method: Composition-based stats. Identities = 111/516 (21%), Positives = 215/516 (41%), Gaps = 40/516 (7%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNA-----------QLRMWDTTGSEAC 54 + + L Q+ + +++ + PY +++ ++A Sbjct: 26 KQSCVKMLDSLFKQQQKYIPLWKDIRNYELPYDGELGDDVIGAPAMHDEEIYNGITAQAR 85 Query: 55 IKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERS 114 ++ + S +TPP +KW F+ A L ++ + E C+ + L S Sbjct: 86 DTFAAGIQSGLTPPSRKWFR----FAPTDASLDNNIDVARVLDERCEIMEGVL------S 135 Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDS 174 +S F + S Y + FG + AD E+G+ +++ + + + Q +++ Sbjct: 136 QSNFYNVIHSAYKEL-PFGQSPVGVFAD------EKGVYFVNYTIGTYALGADGQGRINT 188 Query: 175 VYREFTFTVDQIVSKWGDKVLSSKMKSALARNE--NERFTIIHAVYPKSLTDKKKDKGNK 232 R+ + QIVS +GD V++ ++ A+ N + +T+ VYP + Sbjct: 189 FARKVKMSAAQIVSLYGDSVVTDSVREAVKANGGHEDYYTVCWLVYPNPKAKPTGGNHDM 248 Query: 233 GFHSKFVSVDEN--RFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVN 290 F S + K + V RY V+ + YG PA +ALP R L + Sbjct: 249 KFLSVHWLEGSDPNSLLAAKGFEEWAIPVARYNVKGIDAYGIGPAWDALPESRMLQKMEY 308 Query: 291 ELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEEL 350 + A LS+ PP + +E + R +L PG + + ++ Sbjct: 309 DGAIALELSIKPPLVGPAELQGR-INLFPGAYTPSINPNDNVHSIYSGGL-DLNSLQAKI 366 Query: 351 NRLKESIRSLFLLDLFQVLD--DKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 ++++ I+ ++ DLF +L+ ++ +A E M + +EK A +GP+I LQ+EF+ +I Sbjct: 367 TQIEDRIKRIYSTDLFLMLNELNRGQMTAQEVMARNQEKMAQLGPVIERLQNEFLSDIIE 426 Query: 409 RELDILDSQGNLPECEG--ADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 R ++L+ P +K+EY SPL + Q+ + + QGV+ V +L Sbjct: 427 RVYNLLERNQVFPPLPDDVQQTLQGQEIKIEYLSPLAQAQKMSGLTAIEQGVSFVGQLA- 485 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAE 502 DP+ + ++ D+ L P+ +IR E Sbjct: 486 -QLDPNVILRVNFDKAVENYLDKLGVPSTMIRTEDE 520 >gi|23015763|ref|ZP_00055531.1| hypothetical protein Magn03010200 [Magnetospirillum magnetotacticum MS-1] Length = 543 Score = 420 bits (1079), Expect = e-115, Method: Composition-based stats. Identities = 111/510 (21%), Positives = 200/510 (39%), Gaps = 45/510 (8%) Query: 9 IQDRFNYLKNQRGELNYWMEELTGFLYPYKN----------NAQLRMWDTTGSEACIKLS 58 + R+ K +R +E + P ++ R++D T + +L+ Sbjct: 33 LLRRYRKAKERRSTWESHWQECYDYALPLRDGMFHAGVPGERKADRLFDGTAPDCVDQLA 92 Query: 59 SLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGF 118 + L S +TPP +W GL +A E + + E V + F RS F Sbjct: 93 ASLLSELTPPWAQWFGLTAGDQMPEA----ERDQVAPLLERVAAVMQSHF-----DRSNF 143 Query: 119 VGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYRE 178 + Y V GT E G R+ SVPL V + +D +R Sbjct: 144 AIEMHQCYLDAVTGGTASLLFEEAA--PGEASAFRFTSVPLGQVVLEEGPAGRLDVTFRR 201 Query: 179 FTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKF 238 +V + +++ VLS + A A + + R ++ AV P +G + + Sbjct: 202 SEMSVAALKARFARAVLSGHLIKAAADDPDLRLGVVEAVIPV--------RGGYSYAAVL 253 Query: 239 VSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRL 298 + ++ P++ R+ E+YGRSP M+ALP I+ N+ V + + + Sbjct: 254 DDESSDVVLGRGSFSSSPFLNFRWLKAPGEVYGRSPVMKALPDIKTANKVVELVLKNATI 313 Query: 299 SLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREG-RSLFQPVQFGNPLPYHEELNRLKE 355 ++ A + L PG + A+ G + L P +F L+ L+ Sbjct: 314 AVTGIWQADDDGVLNPANIKLVPGTIIPKAVGSAGLQPLTAPGRFDT---SQLVLDDLRG 370 Query: 356 SIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILD 415 IR + D S +A E ++++ + +G G LQSE + +I R + IL Sbjct: 371 RIRHALMGDKL-SQPASPSLTATEVLQRSDDMARLLGATYGRLQSELLTPLIMRAIHILR 429 Query: 416 SQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMD 475 +G +P + + ++Y SPL + Q + L + + LG P+ + Sbjct: 430 RRGEIPPL----SVDGRVFDLQYRSPLAQNQGRRDARNVLSWLGALSSLG-----PAALA 480 Query: 476 HMDTDRVSRFSLWATNTPAVLIRDTAEVED 505 +D +R+ A N P+ L+R +E + Sbjct: 481 TVDAAAAARWLGRAFNVPSELVRPASEQQA 510 >gi|42526662|ref|NP_971760.1| head-to-tail joining protein, putative [Treponema denticola ATCC 35405] gi|41816855|gb|AAS11641.1| head-to-tail joining protein, putative [Treponema denticola ATCC 35405] Length = 560 Score = 417 bits (1073), Expect = e-114, Method: Composition-based stats. Identities = 126/525 (24%), Positives = 227/525 (43%), Gaps = 48/525 (9%) Query: 8 DIQDRFNYLKNQRGELNYWMEELTGFL---------------YPYKNNAQLRMWDTTGSE 52 DI+ F+ LK++R +++ ++ P ++ + SE Sbjct: 13 DIKGLFDILKDKRSMHEAEWQDVCTYIGSNVFDWSENKEEIKRPKRHTGR-------PSE 65 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 KL S L P W L+ + + E V++W +Q L+ E Sbjct: 66 YLKKLVSGLMGYTISPNVTWLKLSLNNT--------EMLEYAGVKDWLEQSEKALY--EE 115 Query: 113 RSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV 172 +R+ + F ++ FG G ++ E IR++++ +Y++ N + Sbjct: 116 FNRNNLYSQVSLFISNAASFGHGVMLIDE-----KKENSIRFLTIAEPEIYIAENEYGDI 170 Query: 173 DSVYREFTFTVDQIVSKWGDKVLSSKMKSALA--RNENERFTIIHAVYPKSLTDK-KKDK 229 D+V+R F+ TV I++++G++ +S ++K+ + +N+ I+HAV P+ D+ K D Sbjct: 171 DTVFRYFSMTVKNIIARFGEENVSEQIKNDAKDIKGKNKEIKILHAVLPRDDYDESKLDG 230 Query: 230 GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 N F S ++ +D N EE PY V + YG SPA EA+P +R LN+ Sbjct: 231 KNMEFASYYIDMDNNTILEESGYYELPYSVFIWEKETSSAYGGSPAREAIPDMRLLNKVE 290 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYH-E 348 + +L PP + + P N + P+ G P E Sbjct: 291 EARLKLAQLVSEPPMNVPDSMRGFE-SVVPAGYNYY---ERPDMIMTPINIGANFPITLE 346 Query: 349 ELNRLKESIRSLFLLDLFQVLD-DKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 + ++ +R F +D +L A ++A E +E EK A + LI Q++ + ++ Sbjct: 347 TIQDIESRLRDKFHVDFMLMLQAQTAQKTATEVIELQGEKSALLSSLIVN-QNKALSEIV 405 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 R L+I+ QG PE N ++L V++ PL + Q+ +Q + + + Sbjct: 406 IRTLNIMYRQGRFPEPPNILNGSDAVLNVDFVGPLAQAQKRYHQTGGVQTSLAISQP-II 464 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREV 512 +P +D++DTD++ + L P IR+ EVE IRQQR Sbjct: 465 QMNPEVLDYIDTDKLLKNVLDTNGFPQSAIREDDEVEKIRQQRAE 509 >gi|294648400|ref|ZP_06725899.1| phage protein [Acinetobacter haemolyticus ATCC 19194] gi|292825705|gb|EFF84409.1| phage protein [Acinetobacter haemolyticus ATCC 19194] Length = 558 Score = 417 bits (1071), Expect = e-114, Method: Composition-based stats. Identities = 118/572 (20%), Positives = 225/572 (39%), Gaps = 46/572 (8%) Query: 5 SAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN----------------AQLRMWDT 48 +A+ + R + LK+ R + ++ + P + A+ ++DT Sbjct: 2 NAQQLLKRLSQLKSDRIKHEAHWKDCYKYCAPERQQSFADASATALEQERKQARTDLFDT 61 Query: 49 TGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLF 108 T E L S + S T P W S + L + +W QV LF Sbjct: 62 TSVEGIQLLVSSIVSGTTSPVSIWFKSVPSGVDTPSQL-------TEGEQWLSQVDQFLF 114 Query: 109 GFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVN- 167 R S F + F T +V G Y D + G + + + N Y+S Sbjct: 115 --RNIHASNFDSEVTDFLTDLVVAGWAVLY----ADTNREKGGFTFNTWSIGNCYISSTQ 168 Query: 168 HQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT---- 223 ++D++YREF + +QIVS++G +S K+++AL + +++FT++ A++P+ Sbjct: 169 ANGLIDTIYREFELSAEQIVSEFGIDNVSDKVRTALEKKPDQKFTLVQAIFPRDSKLIKG 228 Query: 224 -DKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 + K+ + F S + +E FP +V R++ D YG + Sbjct: 229 EEGKRVSTSMPFASYTIEAQSKHILKESGFEEFPCVVSRFKKIPDSHYGLGMGSMVISDA 288 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREGRSLFQPVQF 340 + N+ + Q L+L IA ++ ++P + + L Sbjct: 289 KTANQIMKLSLQTAELNLGGLWIAQNDGNINPHTLRIRPNAIIAANTVDSIKRLDTGSAS 348 Query: 341 GNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQS 400 + L + I+ + D + +A E + + +G + +QS Sbjct: 349 VGLG--LDFLQHFQAKIKRTLMSDQLTP-QGSSPLTATEIQARVQVYRNQLGSIFSRMQS 405 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E++ ++ R + G LP S + + +P+ Q+ E V + + Sbjct: 406 EYLQVLLERTWGLAMRSGVLPPAPEEL-MQASRISFNFINPMAASQKLEWVTAIQNLMLN 464 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQ 520 V ++ D + MD+++ D + + A + P IR E+ ++RQ ++ Q++ M+EQ Sbjct: 465 VSQMA--QIDQTVMDNLNLDAMVQVMADALSVPVEAIRTDEEIAELRQAKQEQQQAMQEQ 522 Query: 521 HLQQQLQ-QTSQDIGAKAAGRAMEKKLTHDMM 551 QQ L Q Q A +A K +T D + Sbjct: 523 QQQQALMSQVGQTGLDIAKDQA--KNMTPDQL 552 >gi|46580131|ref|YP_010939.1| hypothetical protein DVU1721 [Desulfovibrio vulgaris str. Hildenborough] gi|46449547|gb|AAS96198.1| hypothetical protein DVU_1721 [Desulfovibrio vulgaris str. Hildenborough] gi|311233876|gb|ADP86730.1| hypothetical protein Deval_1575 [Desulfovibrio vulgaris RCH1] Length = 550 Score = 415 bits (1066), Expect = e-113, Method: Composition-based stats. Identities = 105/569 (18%), Positives = 213/569 (37%), Gaps = 43/569 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKN------------NAQLRMWDT 48 M K++ + +++ R +++ +L P + + + + Sbjct: 1 MRSALLKELSEVAEHVEGLRKRREAQWRDISEWLMPMRGIYEGQDGADVIASRGKGLLNR 60 Query: 49 TGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLF 108 G+ A ++ ++ +TP W + R W D V ++ Sbjct: 61 EGTRALKVAATGMTGGMTPAALPWFRWSLRD--------DVQNERTGARAWLDTVEASIN 112 Query: 109 GFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNH 168 GF + + + FG + D + L R+ S + ++++ Sbjct: 113 SVLR--ACGFYQAIHACNMEFLAFG--PLLLFQDNSQGAL---CRFESCTVGTWAVALDA 165 Query: 169 QNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVY-PKSLTD-KK 226 +D+V R T Q+ ++G L+ L N+ + V P++ + Sbjct: 166 DGGLDTVVRRLKLTARQMEQRFGRDRLTPATVKLLETNKGHERVEVVHVVRPRTERQHGR 225 Query: 227 KDKGNKGFHSK-FVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRL 285 D N F S + + + E PY Y ++YG +P + LP +++L Sbjct: 226 IDARNMPFASYMYEATGADDVLSESGYHEMPYFFAAYD-DTLDLYGSAPGDDCLPDVKQL 284 Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNI--GALSREGRSLFQPVQFGNP 343 E + + ++PPT + KQR ++ PG N G L++ N Sbjct: 285 QELEKQKLVGLQKVINPPTRKPASFKQR-LNVNPGGENAVSGGDPHGIGPLYEVRIDLNQ 343 Query: 344 L--PYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 + ++R++++ + + D+ L K + E +E+ RE+ +GP + +++ Sbjct: 344 VREEIATVVDRIRQTTMASYFADMPLELRPK-DMTYGEYLERKRERLQLMGPSLEAYEAK 402 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + +I R +LD G LP A V+++ + Y SPL + + S + V Sbjct: 403 VLTPVIFRTFALLDRAGMLPPPPDALG-EVAVVDISYISPLAQALRQTGAESTRALLMDV 461 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQH 521 ++L DP +D +D D+ P ++R +V +RQQR+ + + Sbjct: 462 MQLA--EADPGVLDKVDMDQAVDELAKGIGAPGRVVRSDEDVAAMRQQRDEAKAREAQA- 518 Query: 522 LQQQLQQTSQDIGAKAAGRAMEKKLTHDM 550 Q+ Q + A R L HD+ Sbjct: 519 --QEAITAMQGLAKVAGTRTGPGTLAHDL 545 >gi|239787361|emb|CAX83837.1| Head-to-tail joining protein [uncultured bacterium] Length = 524 Score = 413 bits (1061), Expect = e-113, Method: Composition-based stats. Identities = 114/517 (22%), Positives = 199/517 (38%), Gaps = 51/517 (9%) Query: 1 MNQRSAKD----IQDRFNYLKNQRGELNYWMEELTGFLYPYKN----------NAQLRMW 46 MN ++ D + RF + +R +E F P + R++ Sbjct: 1 MNGQNDPDAQRVVLKRFEKARERRNVWEGHWQECYDFALPSRGGPLLSSQPGAKRTDRLF 60 Query: 47 DTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDT 106 D T + +L++ L + +TPP +W GLA A +E + V E + Sbjct: 61 DGTAPDCVDQLAASLLAQLTPPWAQWFGLA----AGPDLTPEEREVAAPVLEKAGAALQS 116 Query: 107 LFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSV 166 F RS F + Y +V GT E G R+ ++PL+ + + Sbjct: 117 HF-----DRSNFAIEMHQCYLDLVTAGTASLLFEEAP--LGSASAFRFTAIPLAQLALEE 169 Query: 167 NHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK 226 + + +D+ +R T+ I ++ L M + + RF ++ AV P Sbjct: 170 SVEGRLDTTFRSSEMTISAIRERFPKAQLPESMGRKSKDDADARFKVVEAVLP------- 222 Query: 227 KDKGNKGFHSKF--VSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRR 284 ++ +H+ E + P+I R+ E+YGRSP M++LP I+ Sbjct: 223 -ERHGYAYHAILDGEGTGGAETLAEGRFEMSPFINFRWLKAPGEVYGRSPVMKSLPDIKT 281 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREGRSLFQPVQFGN 342 N+ V + + +++ A + L PG + A+ G P++ Sbjct: 282 ANKVVELVLKNATIAVTGIWQADDDGVLNPANIKLVPGTIIPKAVGSAG---LTPLETPG 338 Query: 343 PLPY-HEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 L L++ I L D D + +A E +E++ E +G G LQSE Sbjct: 339 RFDISQLMLTDLRQRISHALLADRLG-QIDAPNMTATEVLERSAEMARLLGATYGRLQSE 397 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + ++ R + IL +G +P D + L+ Y SPL + E + LQ + V Sbjct: 398 LLTPLVMRAVAILKRRGEIPGLS-IDGHQIELI---YKSPLANERGREDAKNTLQWLTAV 453 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIR 498 + G P +D +R+ A N PA L+R Sbjct: 454 MSFG-----PPANQVVDLGAAARWLAKALNVPAELLR 485 >gi|119386466|ref|YP_917521.1| putative head-tail connector protein [Paracoccus denitrificans PD1222] gi|119377061|gb|ABL71825.1| putative head-tail connector protein [Paracoccus denitrificans PD1222] Length = 558 Score = 411 bits (1056), Expect = e-112, Method: Composition-based stats. Identities = 107/528 (20%), Positives = 191/528 (36%), Gaps = 40/528 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-----------AQLRMWDTT 49 +NQ+ K + R + + EL + P + R+ D T Sbjct: 5 VNQQLRKTLDYRRQAMNQEFDYWQGHFRELRDAIQPTRGRFEASERRSDSSINKRILDNT 64 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 A L + L S +T P + W L S D +V++W +V ++ Sbjct: 65 AQMALRTLRAGLMSGVTSPSRPWFRLGLRGST-------ADEAEFEVKDWLHEVQRRMYE 117 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 S L + Y + +GT + D E+ +R ++ + + + Sbjct: 118 VMR--GSNIYRMLDTTYGDLGLYGTAANLVVPDF-----EDVVRGHNLQVGRFRLGEDGN 170 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNEN-ERFTIIHAVYPKSLTDKK-K 227 V ++YRE V IV WG +S ++ A E + FTI H + ++ D K Sbjct: 171 GRVIALYRELKMPVRGIVETWGLDAVSQSVRRAWDTGEYYQTFTICHMIDKRADGDPKAM 230 Query: 228 DKGNKGFHSKFVSVD--ENRFFEEKQIATFPYIVGRYRVRADEIY-GRSPAMEALPTIRR 284 + + S + +D +F + P + R+ E + SP M AL R Sbjct: 231 QSSGRPWASIYWEMDAPSGQFLQIGGHRVKPLLAPRWEQVEGEAWSASSPGMVALGDARS 290 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L + + A + +PP I + F PG A +P P Sbjct: 291 LQVSQEQKAIAIQKMHNPPLIGGAVQGGMFFKNVPGGFTAMATQDLSTGGIRPAYEVRPD 350 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQV----LDDKASRSAAESMEKTREKGAFVGPLIGGL 398 ++ + + F DLFQ+ LD ++ +A E E+ EK +GP++ L Sbjct: 351 IQGLIIDIQESQRRVEVAFYKDLFQMTALALDGRSQITAREIAERHEEKLMALGPVLESL 410 Query: 399 QSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGV 458 E + +I + LPE +KVEY S L + Q+A + + + + Sbjct: 411 DHELLQPLIEATFAYMQEADILPEAPEGIVGNP--IKVEYISLLAQAQKAIGIGAIERTI 468 Query: 459 NTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDI 506 L P +D +D +++ R P ++ E+ ++ Sbjct: 469 GFAGTLA--QIKPDVIDMIDGEQMMREFADQVGGPPGILLSPDELREV 514 >gi|260557979|ref|ZP_05830191.1| Bbp21 [Acinetobacter baumannii ATCC 19606] gi|260408489|gb|EEX01795.1| Bbp21 [Acinetobacter baumannii ATCC 19606] Length = 555 Score = 409 bits (1052), Expect = e-112, Method: Composition-based stats. Identities = 101/522 (19%), Positives = 203/522 (38%), Gaps = 35/522 (6%) Query: 9 IQDRFNYLKNQRGE-LNYWMEELTGFLYP-----------YKNNAQLRMWDTTGSEACIK 56 ++ RF+ + R ++ + EL + P + +A ++ D TG ++ Sbjct: 1 MKKRFDAVWQLRVNDMDDYCAELALHVLPAAIKTIKNQEKHDRSAWSKIVDNTGKDSLKT 60 Query: 57 LSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRS 116 L++ + S P +KW L + + Q + +VR+W V D + S+S Sbjct: 61 LAAGMVSGTCSPSRKWFTLQAADESLQ--------KDIEVRQWLKAVEDACYVAF--SKS 110 Query: 117 GFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVY 176 + Y FG G + + I + ++ + N + VY Sbjct: 111 NVYRTVHHIYMQEGAFGIGAALAPEH-GRNSKAQLMDLIPLTFGEFAITTDEFNKPNGVY 169 Query: 177 REFTFTVDQIVSKWGDKVLSSKMKSALA-RNENERFTIIHAVYPKSLTDKKKDKGNKGFH 235 R+F T +V +G +S +K+A +N + F + HA+Y + K N F Sbjct: 170 RKFKLTSINMVKYFGLDNVSDAIKNAFENKNYEQEFEVCHAIYERVDA-KGYGPKNMPFA 228 Query: 236 SKFVS-VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQ 294 S + ++ E + F I GR+ V + ++YG PA + + +R L + ++A Sbjct: 229 SIYYEPSSSDKLLRESGLMGFQVICGRWTVSSSDVYGEGPASDCIGDLRALQKGHQQIAV 288 Query: 295 FGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYH---EELN 351 + PP + K + P + S + + ++ Sbjct: 289 GVDYQVRPPLLLPDYLKGHERETLPNGIAFYQASPTSQVAQVQAMLNVQFDLNGVMAQIA 348 Query: 352 RLKESIRSLFLLDLFQVLD--DKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISR 409 + +E ++ F DLF +LD DK +A E E+ EK +GP++ E + ++ Sbjct: 349 QCQERVKRAFHTDLFMMLDAFDKGKMTATEVYERKSEKMLMLGPVVERQIDELLRPLVEI 408 Query: 410 ELD-ILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKT 468 ++ +L + L + + + +++ + S L Q++ A + + + ++ Sbjct: 409 CVERVLANSEYLRQIA-PEAIQNADVEINFVSILALAQKSSGSAILERALAMIGQVA--Q 465 Query: 469 GDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQR 510 DP +D +DTD+ + R V+ IR R Sbjct: 466 VDPQVLDKVDTDKFMDEYAEINGVSPDIFRPQRIVDQIRSDR 507 >gi|225158777|ref|ZP_03725094.1| hypothetical protein ObacDRAFT_8203 [Opitutaceae bacterium TAV2] gi|224802612|gb|EEG20867.1| hypothetical protein ObacDRAFT_8203 [Opitutaceae bacterium TAV2] Length = 562 Score = 400 bits (1027), Expect = e-109, Method: Composition-based stats. Identities = 118/563 (20%), Positives = 221/563 (39%), Gaps = 46/563 (8%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN------------AQLRMWDTTGSEA 53 A+D+ R+ +++ + ++ P K + ++D+T +E+ Sbjct: 10 AEDLIGRYEAGLSRQANWRSRWHDAARYILPSKGDILSMGDKHGGEAQTTDIYDSTANES 69 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 + ++ L S + P G+ W + S V EW D T Sbjct: 70 ALVYAAGLLSSLVPAGELWFRFSAR-----------PGASAPVVEWFDDCTHR--AAAAL 116 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGI-RYISVPLSNVYMSVNHQNVV 172 S F + + + F + E +G G+ + +VP+ + + + +V Sbjct: 117 HASNFYLGIHEDFMDMAGFSIASLFCEEGAALRGQRGGLLNFTNVPVGTFVIEEDAEGLV 176 Query: 173 DSVYREFTFTVDQIVSKWGDKVLSSKMKSAL----ARNENERFTIIHAVYPKSLTDKKKD 228 D+V+REF FT Q KWG+ LS M AL A + ++RF IIHAVYP+ D K+ Sbjct: 177 DTVFREFRFTARQCAQKWGEDKLSKPMLDALNSKTASDRDKRFQIIHAVYPR--RDGKQG 234 Query: 229 ---KGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRL 285 + S +V EE P V R +EIYGR P + +P I+ + Sbjct: 235 PGIGKKRPIASVYVDKQAIHVIEEGGFYEMPIAVARLLRGNNEIYGRGPGDQVMPEIKLV 294 Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLP 345 N +L ++PP +A ++ R D +PG + S + Sbjct: 295 NRMERDLLLSLEQQVNPPWLAPQDSSWRP-DNRPGGVFYWDASNPNNKPERLRDTARLDI 353 Query: 346 YHEELNRLKESIRSLFLLDLFQVLDDKASR----SAAESMEKTREKGAFVGPLIGGLQSE 401 + LN +E IR + +D+F++L + + +A E + +EK P+ + E Sbjct: 354 GDKVLNDKREVIRRAWFVDMFKMLSNPDAMKRDKTAFEVAQLMQEKLVLFHPMFARITQE 413 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + ++ R +IL G A+ + +++Y S + +A + Q ++ + Sbjct: 414 KLNPVLERVFNILMRAGIFAPPPMAEGESLE-YEIDYVSKIALAIKAAQNGALAQMMDLI 472 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDI---RQQREVQRRVME 518 G+ T DP+ ++ + +R + P EV ++ + Q ++ + Sbjct: 473 G--GMATFDPTVALVINWKKAARGVARNSGLPQEWQNSEEEVAEMMQAQAQANQAAQLEQ 530 Query: 519 EQHLQQQLQQTSQDIGAKAAGRA 541 Q +Q +G +A A Sbjct: 531 MASAANQAAGAAQKLGPQAQQAA 553 >gi|212703247|ref|ZP_03311375.1| hypothetical protein DESPIG_01289 [Desulfovibrio piger ATCC 29098] gi|212673291|gb|EEB33774.1| hypothetical protein DESPIG_01289 [Desulfovibrio piger ATCC 29098] Length = 552 Score = 397 bits (1020), Expect = e-108, Method: Composition-based stats. Identities = 100/572 (17%), Positives = 202/572 (35%), Gaps = 39/572 (6%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN---------AQLRMWDTTGS 51 M + K+++ +L+ R + E+ + P + + + Sbjct: 1 MAAPTLKELKQLVAHLEGLRSKRLAQQWEIGKLILPSRGLFQGEETECLRDANLLNPAAQ 60 Query: 52 EACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFR 111 A K ++ ++ ITP W FL + D E+ D V + Sbjct: 61 RALGKAAAGMTQAITPASSPWFR--------HQFLDRADREVTGGNEYVDVVDARIRAVL 112 Query: 112 ERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNV 171 + GF + +F ++ FG +A R+ ++++ Sbjct: 113 --AAGGFYSAIHAFNRELLGFGCALLSCDA-----SARTVARFACQTCGTYAVALDEDRT 165 Query: 172 VDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK-KKDKG 230 + V R T ++ ++G L + L ++ V + D + D Sbjct: 166 LSCVVRRLRMTPVEMSRRFGRDRLCEATRQKLESQPYAPIEVVQVVRKREERDPERGDNR 225 Query: 231 NKGFHSKFVSVDEN-RFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 N F S + E + P+ + A +YG P +AL + + Sbjct: 226 NMPFASFWYEDQGGTELLRESGFRSMPFFFSTWE-DARGVYGTGPGDDALADQKGIEAWE 284 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLP---- 345 A + + PP +A K R+ PG + + +L +P+ N P Sbjct: 285 KRKAVGIEMMIQPPLLAPGTLK-RHVRAMPGSVISDTAYGQSNAL-RPLYEVNFGPAVGA 342 Query: 346 YHEELNRLKESIRSLFLLDLFQVLD---DKASRSAAESMEKTREKGAFVGPLIGGLQSEF 402 +E+ ++ + + ++F + A + E M++ R +GP + + Sbjct: 343 VQQEIEQISMRLEDVMKANIFANMSLETRPAGMTMTEYMDRRRRAAELMGPTVSSYEPRV 402 Query: 403 IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 + I R +LD +G LP + P + L V Y SP+ + + + S Q ++ V Sbjct: 403 LTLCIERVYQLLDEEGLLPPPPQGLS-PWATLNVSYQSPMAQMLEQAAAVSIGQFMDQVG 461 Query: 463 ELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHL 522 P+ +D +D D++ PA +IR +V IRQQRE ++ + Sbjct: 462 PWA--QSQPTILDKLDLDQMVDELAQRLGVPASIIRSDEQVAAIRQQREQAAAAQQQAAM 519 Query: 523 QQQLQQTSQDIGAKAAGRAMEKKLTHDMMENS 554 + Q+ ++ +G + K+ E++ Sbjct: 520 EVQMMESMAKMGNVKTEGTVAGKVMGSPQEDN 551 >gi|288957023|ref|YP_003447364.1| hypothetical protein AZL_001820 [Azospirillum sp. B510] gi|288909331|dbj|BAI70820.1| hypothetical protein AZL_001820 [Azospirillum sp. B510] Length = 534 Score = 389 bits (1000), Expect = e-106, Method: Composition-based stats. Identities = 111/503 (22%), Positives = 204/503 (40%), Gaps = 44/503 (8%) Query: 9 IQDRFNYLKNQRGELNYWMEELTGFLYPYKN----------NAQLRMWDTTGSEACIKLS 58 + DR+ + +RG ++ P R++D T +A +L+ Sbjct: 23 LLDRYRGARERRGVWESHWQDCYDHALPNGRPFHGGGTAGERRVNRLFDGTAPDAVEQLA 82 Query: 59 SLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGF 118 + L S +TPP +W G F E R + + + F RS F Sbjct: 83 ASLLSELTPPWSRWFG----FRPGPDLTGAERDRIAPLLDRAAGIIQAHF-----DRSNF 133 Query: 119 VGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYRE 178 + + +V GT ME G +R+ +VPL++ + +D+ +R Sbjct: 134 AVEVHQAFLDLVTVGTASLLMEEAA--PGAVSSLRFTAVPLADAVLEEGPDGRLDATFRR 191 Query: 179 FTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKF 238 T+ QI+ ++ L +++ A + + RF ++ AV P + + G Sbjct: 192 SEATLAQILQRFPGAGLPDELRRRAAEDPDHRFPLVEAVVPDGAAYRWGVVLDSGLA--- 248 Query: 239 VSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRL 298 + + + + A P++ R+ E YGRSP M+ALP I+ N+ V + + + Sbjct: 249 ----DPSWLAQGRFAQSPFVNFRWLKAPGETYGRSPVMKALPDIKTANKVVELVLKNASI 304 Query: 299 SLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREGR-SLFQPVQFGNPLPYHEELNRLKE 355 ++ A + L PG + A+ G L P +F L+ L+ Sbjct: 305 AVTGIWQADDDGVLNPSTIRLVPGTIIPKAVGSAGLTPLANPGRFDV---SQLVLDDLRG 361 Query: 356 SIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILD 415 IR L+D + D A +A E +E++ E +G G LQ+E + ++ R + IL Sbjct: 362 RIRHALLVDRLGPV-DSARMTATEVLERSVEMARLLGATYGRLQAELMTPLLLRAVSILR 420 Query: 416 SQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMD 475 +G +P+ L+++++ SPL + Q V + L+ +++V LG P Sbjct: 421 RRGEIPDIT----VDGRLVELQHRSPLAQAQAQRDVQATLRWLDSVKALG-----PEAEA 471 Query: 476 HMDTDRVSRFSLWATNTPAVLIR 498 +D + + A PA L+R Sbjct: 472 VVDAAATAHWLGEAFGVPAKLMR 494 >gi|303327895|ref|ZP_07358334.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861721|gb|EFL84656.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 554 Score = 386 bits (991), Expect = e-105, Method: Composition-based stats. Identities = 99/566 (17%), Positives = 204/566 (36%), Gaps = 40/566 (7%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN---------AQLRMWDTTGSEACIKL 57 K+++ +L++ R + EL + P + + +++ + A K Sbjct: 9 KEVKQLVGHLESLRAKRLAQQRELGRLILPSRGLFQGEDTESLRESNLFNPAANRALRKA 68 Query: 58 SSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSG 117 ++ ++ ITP G W AFL + D + E+ D V + L S G Sbjct: 69 AAGMTQAITPAGNPWFK--------HAFLLRRDREATGGNEYVDTVDNMLRTVL--SAGG 118 Query: 118 FVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYR 177 F + SF ++ FG E RY +++ +D+V R Sbjct: 119 FYRAIHSFNKELLGFGCALLGCEESP-----RTVARYFCQTCGTYCAALDEDGNLDAVAR 173 Query: 178 EFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK-KKDKGNKGFHS 236 T ++ ++G+ LS + L ++ + + H V ++ D + D+ N + S Sbjct: 174 RLLMTPRELARRFGEDRLSDVSRQKLKKDSYDPVAVRHVVQRRTARDPERADRSNMPWGS 233 Query: 237 KFVSVDE-NRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQF 295 + F + + P+ + A +YG P EAL + + A Sbjct: 234 WWYEEGGAADFLDVGGFRSMPFFFTVWE-EARGVYGTGPGDEALADQKGIEGWELRKAVG 292 Query: 296 GRLSLHPPTIAVSEAKQRNFDLKPGYMNI-GALSREGRSLFQPVQFGNPLP-YHEELNRL 353 + P + + D PG + G + V FG + EE++++ Sbjct: 293 VEKMID-PVLVSQGPLKAYVDTSPGAVIPSGGFGADSLKPLYEVNFGPAVQHVQEEISQI 351 Query: 354 KESIRSLFLLDLFQVLD---DKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRE 410 + + + ++F + A + E M++ R +GP + G + + ++ Sbjct: 352 SLRLEDVMMANIFASMSLETRPAGMTMTEYMDRRRRSAELMGPTVSGYEPRILSPVLENT 411 Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGD 470 +L+ G LP +P S L V Y SP+ + + + + Sbjct: 412 FGLLEEYGLLPGPPDGLSPFAS-LNVSYQSPMAQMLEQSGAVAIQSLFELAAPM--LRAV 468 Query: 471 PSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTS 530 P D +D ++ PA ++R V +RQQR + ++Q + ++ Q Sbjct: 469 PDLADKIDFEQAIDELAQRLGVPASVVRSDETVAAMRQQRAEAQAAQQQQMAEARMLQQV 528 Query: 531 QDIGAKAAGRAMEKKLTHDMMENSYG 556 +G + + +++ + G Sbjct: 529 AALGNVKT----QGTVAGEVLGTTQG 550 >gi|187736539|ref|YP_001878651.1| hypothetical protein Amuc_2060 [Akkermansia muciniphila ATCC BAA-835] gi|187426591|gb|ACD05870.1| hypothetical protein Amuc_2060 [Akkermansia muciniphila ATCC BAA-835] Length = 544 Score = 374 bits (960), Expect = e-101, Method: Composition-based stats. Identities = 129/544 (23%), Positives = 224/544 (41%), Gaps = 56/544 (10%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M +R+A ++ + L QR W + L ++ P + +A RM DTT Sbjct: 1 MEERTA-ELNSVYKSLAAQRAPWETWWDRLRDYVLPRRLNREGEVSLPNRDAMDRMTDTT 59 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 EAC KL+S S ITP W + +D + W +Q ++ Sbjct: 60 AVEACQKLASGHMSYITPSHDVWFKWSAP----------DDRGGDEAEAWYNQCSEI--A 107 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 +E S S F + + V GTG + D + L + ++P + N + Sbjct: 108 LKELSVSNFYTEIHECFLDRVALGTGSLFTGTSSDGRLL-----FTNIPCGQFACAENAE 162 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFT---IIHAVYPKSLTDKK 226 VD+ REFT+T Q S +G K L K + L R N T +H V P++ ++ Sbjct: 163 GRVDTYVREFTYTAHQARSMFGVKALGPKAREVLERGGNPYATTLRFLHVVRPRTRRSRR 222 Query: 227 KD-KGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRL 285 ++ + F S ++S+D+ EE FPY+V R+ YG +P P I+++ Sbjct: 223 REQASHMPFESVYLSLDDQVIVEEGGYMEFPYLVTRFLKWGSGPYGLAPGRLVFPAIQQV 282 Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLP 345 L G ++ P I + DL+ G + ++ E SL P ++ Sbjct: 283 QFLNRILDTLGEVAAF-PRILELANQIGEVDLRAGGRTV--ITPEAASLHLPREWATQGK 339 Query: 346 YHEELNRL---KESIRSLFLLDLFQVLDD-KASRSAAESMEKTREKGAFVGPLIGGLQSE 401 Y ++RL +++IR + L + ++ + + +A E M + E+ P S+ Sbjct: 340 YDVGMDRLAQKQDAIRRAYYLPMLELWSGHRGNMTATEVMARENERVLMFSPSFTLFVSD 399 Query: 402 FIGAMISRELDILDSQGNLPECEGAD-------NPPVSLLKVEYTSPLF---KYQQAESV 451 M +R +L G P A + V +V Y S + + Q+E + Sbjct: 400 LYSTM-TRIFSLLFRMGKFPRPPRAVLRVGRDGSVAVGEPRVVYQSKIALVLRRLQSEGM 458 Query: 452 ASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQRE 511 +LQ +N +++ P DH+D D R S P ++R A+V +R++RE Sbjct: 459 DRSLQRLNMMMQAA-----PDLADHVDWDHCFRLSARVDGAPESMLRPWADVRAMRKERE 513 Query: 512 VQRR 515 ++ Sbjct: 514 DLQQ 517 >gi|291334466|gb|ADD94120.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1161] Length = 330 Score = 358 bits (920), Expect = 1e-96, Method: Composition-based stats. Identities = 84/336 (25%), Positives = 158/336 (47%), Gaps = 29/336 (8%) Query: 1 MNQRS-AKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-------KNNAQLR---MWDTT 49 M Q AK++ R++ LK+QR +E+ ++ P ++ R ++D + Sbjct: 1 MAQTDKAKNLLKRYDRLKSQRQNWESHWQEVADYMQPRKADVTKTRSKGDKRTELIFDGS 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 ++ L++ L ++T P W L F ++ + + W + TD ++ Sbjct: 61 PLQSVELLAASLHGMLTNPSTPWFTLR--------FKDEDIDNEDEAKLWLEASTDAMYT 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 +RS F + Y ++ FGT ++E D E+ I++ + ++ V+++ N + Sbjct: 113 AF--NRSNFQQEIFELYHDLITFGTAAMFIEEDD-----EDIIKFSTRHINEVFIAENDK 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK-D 228 +D+V+R+F+ + ++ K+GD +S + + ++ E I+HAVYP+S D +K D Sbjct: 166 GRIDTVFRKFSLSARAVMQKFGD--VSINIATKAKKDPYEEVEIMHAVYPRSDFDPRKQD 223 Query: 229 KGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 K N F S ++ + FP++V RY + EIYGRSPAM ALP ++ LNE Sbjct: 224 KENMPFESVYLDAESGDELSVSGFREFPFVVPRYLKASHEIYGRSPAMTALPDVKMLNEM 283 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNI 324 + + + PP + + PG +N Sbjct: 284 SKTTIKSAQKQVDPPLLVPDDGFMLPVRTIPGGLNF 319 >gi|209966578|ref|YP_002299493.1| hypothetical protein RC1_3320 [Rhodospirillum centenum SW] gi|209960044|gb|ACJ00681.1| conserved hypothetical protein [Rhodospirillum centenum SW] Length = 521 Score = 347 bits (890), Expect = 3e-93, Method: Composition-based stats. Identities = 115/488 (23%), Positives = 198/488 (40%), Gaps = 44/488 (9%) Query: 23 LNYWMEELTGFLYP------YKNNAQLR----MWDTTGSEACIKLSSLLSSLITPPGQKW 72 ++ + P R ++D T ++A +L++ L + +TPP +W Sbjct: 39 WEPLWQDCYDHVLPQNARFTRDAGPGERRGELLFDGTAADAADQLAASLLAQLTPPWSRW 98 Query: 73 HGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEF 132 GLA A V ++ + L RS F + VV Sbjct: 99 AGLAPG-------PDLSAAERALVAPLLERASADLQA--HLDRSNFAVEAHQAFLDVVTG 149 Query: 133 GTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGD 192 GTGC +E G +R+ +VPL+++ + + +D+V+R T T+ Q+ +++G Sbjct: 150 GTGCLLVEEAP--PGAPSALRFTAVPLADLVLEEGAEGRLDTVFRRLTPTLAQLAARFGT 207 Query: 193 KVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQI 252 L ++ A + + R ++ AV P G + + D E + Sbjct: 208 DALPGALRRRAAADPDARAAVVEAVLPDP-------GGGACRWAVALEDDPPVLLAEGRF 260 Query: 253 ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQ 312 A P+I R+ E+YGRSP M+ALP IR N+ V + + +++ A + Sbjct: 261 AEPPFIAFRWMKAPGEVYGRSPVMKALPDIRTANKVVELVLKNASVAVTGIWQADDDGVL 320 Query: 313 RN--FDLKPGYMNIGALSREGR-SLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVL 369 L PG + A+ G L P +F L+ L+ IR L D + Sbjct: 321 NPGTIRLVPGAIIPKAVGSAGLTPLASPGRFDV---SQLVLDDLRAHIRHALLADRLGPV 377 Query: 370 DDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNP 429 +A E +E++ E +G G LQSE + ++ R L +L +G +P+ Sbjct: 378 QG-PRMTATEVLERSAEMARMLGATYGRLQSELLVPLVRRCLSLLRRRGAVPDLAA---- 432 Query: 430 PVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWA 489 L+ V+ SPL + QQ + L+ + +V LG M +D + +RF A Sbjct: 433 DGRLVAVQILSPLARAQQRRDAEAVLRWLESVTGLGDA-----AMRAVDLEACARFLADA 487 Query: 490 TNTPAVLI 497 PA L+ Sbjct: 488 AGVPAALL 495 >gi|118590948|ref|ZP_01548348.1| hypothetical protein SIAM614_19846 [Stappia aggregata IAM 12614] gi|118436470|gb|EAV43111.1| hypothetical protein SIAM614_19846 [Stappia aggregata IAM 12614] Length = 567 Score = 340 bits (873), Expect = 3e-91, Method: Composition-based stats. Identities = 119/534 (22%), Positives = 211/534 (39%), Gaps = 47/534 (8%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYP------------------------YKNNA 41 D++ + +R + ++ + P + Sbjct: 4 VDDLKTELQSARAERQWVEADWQDYVTYTAPDMERAFNRPGGVSARDGMSALRGSAARDR 63 Query: 42 QLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCD 101 +++D T +L+S + SL P G WHG+ A S+ E+ + Sbjct: 64 SRKLYDPTAVWLLDRLASGIGSLTMPEGFPWHGVGFGDPFAP-------APSQADEEFFE 116 Query: 102 QVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFY-MEADVDEKGLEEGIRYISVPLS 160 V D LF R RSGF +S S V+ GTG + +E + + + Y VPL Sbjct: 117 LVRDHLFRVRYSGRSGFALANRSRLLSTVKLGTGVLFPVENEDSLADIRTPVHYRYVPLY 176 Query: 161 NVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKS--ALARNENERFTIIHAVY 218 +Y+ ++ Q +R T Q V ++ K +S K+K A A+ +N +T +HA + Sbjct: 177 EIYLVIDAQGNDCGFFRVRTLKAWQAVKEYAGK-VSPKVKEDAADAKRKNTDYTFVHACF 235 Query: 219 PKSLTDKKK-DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAME 277 + + D F S D +P ++ R+ YG P + Sbjct: 236 LREGGHAQATDTRKSRFESIHFEEDSGHICRRGGFFEYPLVISRWDRDGLSPYGSPPQAK 295 Query: 278 ALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQP 337 + I+ L + ++ PP + A++R DL PG +N G + +GR LF+P Sbjct: 296 LMSDIKSLQSLARDGLIASSQAVRPPI--ATHAQERQLDLNPGRINPGLIDEQGRPLFRP 353 Query: 338 V-QFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIG 396 + NP ++ ++E +R DL+Q L + R+A E+ + +E +GP Sbjct: 354 MIDTVNPGAADAQIETIREKLRVGLYGDLWQTLLEGNGRTATEANIRRKEMADMIGPFST 413 Query: 397 GLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEY----TSPLFKYQQAESVA 452 + + A+ RE+ IL +G PP S+L+ + T+P+ + ++A Sbjct: 414 NIMAGN-EALFEREIGILGRRGAFAPGS-PLAPPQSVLEGDVTLTPTAPIDQMREAGHFE 471 Query: 453 SALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDI 506 + + + DPS +D D + + A PA L R EVE + Sbjct: 472 AIMGFQEYLG--IAAGADPSILDLHDREAEYDLTRRALGLPAKLRRRPEEVEAL 523 >gi|325971684|ref|YP_004247875.1| hypothetical protein SpiBuddy_1857 [Spirochaeta sp. Buddy] gi|324026922|gb|ADY13681.1| hypothetical protein SpiBuddy_1857 [Spirochaeta sp. Buddy] Length = 571 Score = 336 bits (861), Expect = 6e-90, Method: Composition-based stats. Identities = 101/522 (19%), Positives = 198/522 (37%), Gaps = 30/522 (5%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQL--------RMWDTTGSEACIKL 57 AK I +++ LK R + E F+ N ++++T+G A Sbjct: 31 AKAIAAKWSRLKTLRQKTEALRWEACAFVQHRMNEFSDSNNPIKPVKLYNTSGILALDTF 90 Query: 58 SSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSG 117 + + P +W L + + ++ ++ + +F E +++ Sbjct: 91 INGYHGNLITPSMRWFKLTLTGENF-----EDSDTIHGANDYMEISETQMFA--ELNKTN 143 Query: 118 FVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYR 177 F + V GT ++ DV+ + ++ + ++ N +D+++ Sbjct: 144 FYPLDKLATKDAVVQGTSAEWVYDDVESGT----CVFETIAPWDFWIDKNANGKIDTIFI 199 Query: 178 EFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDK----GNKG 233 FT T + ++ DK + ++ + + A+YP+ +K K K Sbjct: 200 RFTMTSADALDRFKDKTPPNILRDVETDAGHNEHEFVLAIYPRKKLRSEKGKVLISTEKP 259 Query: 234 FHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 F + E+ EE FP V + YG M+ L ++RLN + Sbjct: 260 FAAVTYYPVEDCIVEESGYDDFPVAVHVFEQDGTSAYGMGLVMKYLTELKRLNSMSRDHL 319 Query: 294 QFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRL 353 + + PP K R F PG N + Q VQ L +E+ L Sbjct: 320 ETVQKVAKPPMSIPESLKGR-FSGDPGARNYMGNMDAKPEIIQTVQDIGWL--SQEITEL 376 Query: 354 KESIRSLFLLDLF--QVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISREL 411 +E I LF DLF + DK +A ++ E+ A + ++G Q I ++ R Sbjct: 377 EEKIGRLFFNDLFNYLMRQDK-VLTATQTQAIKSEELALLASILGTTQYMKINPIVKRVF 435 Query: 412 DILDSQGNLP-ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGD 470 I+ LP + +L++++ PL K + ++ LQ ++ Sbjct: 436 RIMVKGNRLPKPPKELLRIKNALMRIDLDGPLAKNVKMFAMQDGLQASLEWMQALHAMQM 495 Query: 471 PSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREV 512 + +D+++TD R + A P ++R+ EVE +R+Q++ Sbjct: 496 TNTLDNINTDIFVRKAFIAAGMPQSVLRELGEVEQMRKQKQA 537 >gi|296537022|ref|ZP_06899017.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] gi|296262651|gb|EFH09281.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] Length = 368 Score = 286 bits (731), Expect = 8e-75, Method: Composition-based stats. Identities = 81/354 (22%), Positives = 135/354 (38%), Gaps = 21/354 (5%) Query: 112 ERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNV 171 RS F + + +V GTG +E G +R+ +VPL + Sbjct: 33 HLDRSNFAVEMHQAFLDLVVAGTGVLLVEEAP--PGALSALRFTAVPLREAVLEEGESGR 90 Query: 172 VDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGN 231 +D++YR I +++ VL + + E R ++ AV+P ++G Sbjct: 91 LDTIYRAMALEAAAIAARYPGAVLPPGLGAGSPAQEAPRHRVVEAVWP--------ERGG 142 Query: 232 KGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNE 291 + + E + P+I R+ E YGR P M+ALP IR N+ V Sbjct: 143 SAYLAVLEHDGRAWPLAEGRFQDSPFIAFRWLKAPGEAYGRGPVMKALPDIRTANKVVEL 202 Query: 292 LAQFGRLSLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREGR-SLFQPVQFGNPLPYHE 348 + + ++ A + L PG + A G L P F Sbjct: 203 VLKNASIAATGIWQAEDDGVLNPATVRLVPGAIIPKAPGSSGLTPLAAPGNFDV---SQL 259 Query: 349 ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 L+ L+ IR+ L D A+ +A E +E++ + +G G LQ+E + +I Sbjct: 260 VLDDLRGRIRAALLADRLGP-PGTAAMTATEVLERSAQTARLLGATYGRLQAELLTPLIG 318 Query: 409 RELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 R L IL +G +P ++ Y SPL + Q A+ L + V Sbjct: 319 RCLSILRRRGEVPPL----LLDGREARLTYHSPLARVQGRSDAANTLLFLQAVA 368 >gi|13186164|emb|CAC33475.1| hypothetical protein [Legionella pneumophila] Length = 519 Score = 250 bits (639), Expect = 4e-64, Method: Composition-based stats. Identities = 89/461 (19%), Positives = 179/461 (38%), Gaps = 40/461 (8%) Query: 30 LTGFLYPYKNNAQLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKE 89 L GFL P + ++D T A +L+ + + P GQ+W F+ F Sbjct: 70 LAGFLTPGQQY-NADIYDLTLPIAHKRLADKMLMNMVPQGQQW----VKFTPGDEFGEPG 124 Query: 90 DARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLE 149 ++ + ++TD F RS F + V TG ++E + Sbjct: 125 TPLYQRALDATQRMTDHFFKI--IDRSNFYLAVGESLQD-VLISTGII----AINEGNRK 177 Query: 150 EGIRYISVPLSNVYMSVNHQNVVDSVYRE-FTFTVDQIVSKWGDKVLSSKMKSALARNEN 208 +RY +VP + V + + VD+++R+ + ++ I S W + + L + Sbjct: 178 RPVRYEAVPPAQVMFQGDAEGQVDAIFRDWYQVRIENIKSMWPKAEV-----AKLNKKPE 232 Query: 209 ERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADE 268 ++ I + +K+ + V E+ +++P++V R R E Sbjct: 233 DKVDIWECAWIDYEAPEKER------YQYVVMTSSKDVLLEQSNSSWPWVVYRMRRLTGE 286 Query: 269 IYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSE--AKQRNFDLKPGYMNIGA 326 I GR P++ A PT +N+ + + +P +A S+ Q+ F +PG + + Sbjct: 287 IRGRGPSLSAYPTAATINQALEDELVAAAFQANPMYMAASDSAFNQQTFTPRPGSI-VPV 345 Query: 327 LSREGRSLFQPVQFGNPLPYHEEL-NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTR 385 +G +P + + ++ L N ++ I L + +R+A E+ + Sbjct: 346 QMVQGEWPIKPFEQSGNIQFNALLVNDFRQQINELLYA-FPLGAVNSPTRTATEAEIRYT 404 Query: 386 EKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPE--CEGADNPPVSLLKVE------ 437 E ++ LQ+EF +I R L +++ LPE D+ ++ V+ Sbjct: 405 ENLESFSAMVPRLQNEFFIPVIQRTLWVINK--VLPETFANIPDDIRNKMISVDGQILGL 462 Query: 438 -YTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHM 477 + +PL + A+ L L + + +D + Sbjct: 463 SFDTPLMTAKGQVKTAALLGFYQAAASLLGQEAATASLDPV 503 >gi|307946242|ref|ZP_07661577.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307769906|gb|EFO29132.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 519 Score = 239 bits (609), Expect = 1e-60, Method: Composition-based stats. Identities = 84/506 (16%), Positives = 171/506 (33%), Gaps = 39/506 (7%) Query: 9 IQDRFNYLKNQRGELNYWMEELTGFLYPYK------NNAQLRM---WDTTGSEACIKLSS 59 ++ R N + +R ++E + P++ R+ +D T ++ + + Sbjct: 7 LKKRRNGAQRERDAFQPLLDEAYQYAIPFRKSAAKTGKGDKRVNDVFDHTAIDSAFRFAG 66 Query: 60 LLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFV 119 + + P GQ L ++ K+ + ++ + F + F Sbjct: 67 KVQQDLWPAGQDNFELEPGPVVL------DENERDKMSKQLAPISKIVQAFFDD--GDFD 118 Query: 120 GCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREF 179 + G G + E+ ISVP+ + + N + +++ + Sbjct: 119 MAFHEMALDL-SAGNGAMLLNP-PGPDEPEKLWEPISVPIEELLIENGPNNRISAIFWKR 176 Query: 180 TFTVDQIVSKWGDKVLSSKMKSALARNENERFTI-IHAVY-PKSLTDKKKDKGNKGFHSK 237 +V + W + +K L + + V+ PK + NK + Sbjct: 177 KMSVRVLQDTWPEGKFGENLKKLLKEKPEGEIDVNVDTVWVPKERRWRMIVWCNKQETAV 236 Query: 238 FVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGR 297 F + T P++ RY E YGR P M A+PTI+ LN Q Sbjct: 237 FQNES----------RTCPWLFARYFRVPGEAYGRGPVMLAMPTIKTLNTAARLQLQAAA 286 Query: 298 LSLHPPTIAVSEAKQRNF--DLKPGY-MNIGALSREGRSLFQPVQFGNPLPYHEELNRLK 354 +++ V + L+PG + + LN ++ Sbjct: 287 IAMLGIYTTVDDGVFNPDLASLEPGAFWKVARNGGALGPSINRFPDPRLDLSNLVLNDMR 346 Query: 355 ESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDIL 414 +++ + D D A RSA E +E+ + + G L E + + R ++I Sbjct: 347 MGVKATMM-DQSLPADGAAVRSATEILERVKRLASDHLGAYGRLVKEIVIPAVKRAMEIA 405 Query: 415 DSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCM 474 ++G + L++V SPL ++A+ V +Q + V+ +G G P + Sbjct: 406 YNKGLI---SDEIPIDQLLVRVRVKSPLALAREAQRVEKVIQWLQMVISIGAAVGQPGFL 462 Query: 475 DHM-DTDRVSRFSLWATNTPAVLIRD 499 + + P + I Sbjct: 463 QQIAKVETALTQIGRDLGVPEMFIVS 488 >gi|291334523|gb|ADD94176.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201] gi|291334657|gb|ADD94304.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695] gi|291334711|gb|ADD94357.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C890] gi|291336437|gb|ADD95992.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073] Length = 193 Score = 225 bits (573), Expect = 2e-56, Method: Composition-based stats. Identities = 52/189 (27%), Positives = 97/189 (51%), Gaps = 8/189 (4%) Query: 137 FYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLS 196 ++E D E+ +++ + ++ ++++ N + +D+V+R+F+ + ++ K+GD +S Sbjct: 1 MFIEEDD-----EDILKFSTRHINEIFIAENDKGRIDTVFRKFSLSARAVMQKFGD--VS 53 Query: 197 SKMKSALARNENERFTIIHAVYPKSLTDKKK-DKGNKGFHSKFVSVDENRFFEEKQIATF 255 + + ++ E I+HAVYP+S D +K DK N F S ++ + F Sbjct: 54 INIATKAKKDPYEEVEIMHAVYPRSDFDPRKQDKENMPFESVYLDAESGDELSVSGFREF 113 Query: 256 PYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNF 315 P++V RY + EIYGRSPAM ALP ++ LNE + + + PP + + Sbjct: 114 PFVVPRYLKASHEIYGRSPAMTALPDVKMLNEMSKTTIKSAQKQVDPPLLVPDDGFMLPV 173 Query: 316 DLKPGYMNI 324 PG +N Sbjct: 174 RTIPGGLNF 182 >gi|253583086|ref|ZP_04860294.1| predicted protein [Fusobacterium varium ATCC 27725] gi|251834978|gb|EES63531.1| predicted protein [Fusobacterium varium ATCC 27725] Length = 517 Score = 213 bits (543), Expect = 5e-53, Method: Composition-based stats. Identities = 97/523 (18%), Positives = 191/523 (36%), Gaps = 48/523 (9%) Query: 20 RGELNYWMEELTGFLYPYKNNAQLRM--------WDTTGSEACIKLSSLLSSLITPPGQK 71 + ++ E+ + P + ++ +++ S+A + +S + +K Sbjct: 23 KSKIEPLYNEILAYTDPMNSVTTSKLEGTLEGTYVNSSISDAQTSFKNFISYALFGIKKK 82 Query: 72 WHG--LAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSV 129 W + + A + + + +E D TD +F S + + T Sbjct: 83 WAKSDVIKPLLAKKYQGQELIDMIQSYKEKLDVQTDEIFD--YILASNYEKEIGRALTDW 140 Query: 130 VEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYR-EFTFTVDQIVS 188 E GTGC+ E EK R+ VPL+ + + + Q+ + V+R F +++ I S Sbjct: 141 GELGTGCWKYEEQNSEKV---PFRHQYVPLNELLFNEDLQHRPNIVFRYNFKYSLWDIRS 197 Query: 189 KWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFE 248 + LS NENE T+I V P + TD F + Sbjct: 198 LYKKADLSC----YDGINENEEVTVIECVMPVAETDT--------FEWILFDERMDNVLY 245 Query: 249 EKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVS 308 K PY + R+ V + ++GR + L RL N A+ + PP + V Sbjct: 246 RKIYNYNPYTIFRFTVMPNNVWGRGLGVTCLDYYERLCYCENLRARQSIRIVEPPLLLVG 305 Query: 309 EAKQRN-FDLKPGYMNIGALSREGRSLFQPVQ-FGNPLPYHEELNRLKESIRSLFLLDLF 366 + + + FDL P +N G G++ P+ G LP +++ R + I+++ + Sbjct: 306 DKRLIDGFDLDPNGLNWGGDGITGQANAVPMNTTGTLLPLDQDIQRYTQVIQAIHFNNPM 365 Query: 367 QVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGA 426 ++++ +R AE + + L E + ++ IL + + + + Sbjct: 366 GSVENRTTRGNAEMGYRMQLFNQKFSDATSNLYDEVLIPTFAKPKQILQDKNIVKKIDE- 424 Query: 427 DNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSC-MDHMDTDRVSRF 485 + ++ + L + E + + TV + P ++ D F Sbjct: 425 ----DKYFQAKFVNLLTETVDMEEIQKLSTYIQTV-----QGFYPEVRTATLNKDNTLNF 475 Query: 486 SLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQ 528 P L ++QR+ +M +Q LQ Q Sbjct: 476 IADTFTVPVYL-------RATKEQRQESEEMMMKQALQMQAVA 511 >gi|256845624|ref|ZP_05551082.1| predicted protein [Fusobacterium sp. 3_1_36A2] gi|256719183|gb|EEU32738.1| predicted protein [Fusobacterium sp. 3_1_36A2] Length = 550 Score = 173 bits (439), Expect = 6e-41, Method: Composition-based stats. Identities = 85/548 (15%), Positives = 203/548 (37%), Gaps = 36/548 (6%) Query: 5 SAKDIQDRFNYLKNQRGELNYWMEELTGFL---YPYKNNA-----QLRMWDTTGSEACIK 56 + + ++ F+ KN + ++ E+ + + K++ R ++ ++ Sbjct: 6 TREKLEYYFDNAKNYKEDIRGLYNEVYEYTDVNFSIKDSGTVEKQSKRGVESVILKSQNF 65 Query: 57 LSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSK----KVREWCDQVTDTLFGFRE 112 L + + S I +W + + A++ + ++ ++ + + +DT++ Sbjct: 66 LCNFIMSSIFSKSGRWATVKVNQEAFKKLSGVDGEAAEGLSNEINKVLENNSDTVY--FT 123 Query: 113 RSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV 172 + + ++ GTG + D Y L N+Y+ ++ Sbjct: 124 NDNTNYYTETSKALLDCIKVGTGIRKIIELKDNTKC---FTYAYQNLDNIYILEDNLGKP 180 Query: 173 DSVYREF-TFTVDQIVSKWGDKVLSSKMKSALARNE-NERFTIIHAVYPKSLTDKKKDKG 230 + +++ + ++ I +G L L ++ E+ II V D K Sbjct: 181 NIIFKVYVEKNLNDINDLFG--HLPITTPKGLNEDKLEEKINIIECVVGVFDEDTSTYKY 238 Query: 231 NKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVN 290 G ++ E ++ PY V R+++ + +G +E L + L + Sbjct: 239 YHGLFTEAFEE----MLYEGELNYNPYTVFRWKINSSNPWGIGIGLENLDLFKELKDLKE 294 Query: 291 ELAQFGRLSLHPPTI--AVSEAKQRNFDLKPGYMNIGALSREGRS-LFQPVQFG-NPLPY 346 + + + PP ++ + LK N G G +P+ G N LP Sbjct: 295 KRKKHADKIVSPPLNFYGSTDLINK-VSLKANAKNYGGSGIGGDKYGVEPINIGTNLLPV 353 Query: 347 HEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 +++ ++K+ IR +F+ + D +RSA E + + +E + Sbjct: 354 EKDIEQVKQEIREVFMSQPLGDVSDTKNRSATEMSLRHEMFRKEFSGTYELINTELLEPT 413 Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 I+D +G L E +S +++Y + L + ++ V + + T+ ++ Sbjct: 414 FMNAYYIMDGKGLLNTTEDESYINIS--QIQYINELTRNAGSDEVINTINFYMTLSQVVP 471 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQL 526 +T D + ++ P ++ E++ + Q++ + ME+ L Q+ Sbjct: 472 ETQRQFI---FKIDELIDWASKKMRVPLDVLNSKEEIKQLIAQQQELEQ-MEKMALIQEG 527 Query: 527 QQTSQDIG 534 QD+G Sbjct: 528 IGKRQDVG 535 >gi|291335391|gb|ADD95005.1| head tail connector protein [uncultured phage MedDCM-OCT-S04-C24] Length = 526 Score = 135 bits (339), Expect = 3e-29, Method: Composition-based stats. Identities = 73/502 (14%), Positives = 160/502 (31%), Gaps = 59/502 (11%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPY-------KNNAQLRM---WDTTGSEACIKLSS 59 + R++ L + R + + + PY L++ W +TG++ + L+S Sbjct: 4 KQRYDRLSSSRSQFLNAARQASELTIPYLIREDEHTTKGALKLTTPWQSTGAKGVVTLAS 63 Query: 60 LLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFV 119 L + PP + L + L E ++ ++ T+ + SG Sbjct: 64 KLMLALLPPQTSFFKLQVNDVNLPDELGPEIRS--ELDLSFAKIERTV--MESIAESGDR 119 Query: 120 GCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREF 179 + +V G +M D + PL+ + + V + + Sbjct: 120 VVVHQALKHLVVAGNALIFMSKDGLKL----------YPLNRYVVDRDGNGNVIEIVTKE 169 Query: 180 TFTVDQIVSKWGD--KVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSK 237 T + I + + + +E H K N+ Sbjct: 170 TISKKLIKKFYPEYEDKAQDSVVDDGHIPNDECVIYTHV----------KLDNNRW---V 216 Query: 238 FVSVDENRFFEEK----QIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 + E + + P++V R+ E+YGR E L ++ L + Sbjct: 217 WHQELEGKILPKSMGKAPFDANPWLVLRFNHVDGEVYGRGRVEEFLGDLKSLEALSQAIV 276 Query: 294 QFGRLSLHPPTIAVSEAKQRNFDL---KPGYMNIGALSREGRSLFQPVQFGNPLPYHEEL 350 + + + + L G + G G + Q + + ++ + Sbjct: 277 EGSAAAAKVVFTVSPSSTTKPQTLAKAGNGAIIQGRPEDIG--VVQVGKTADFSTAYQMI 334 Query: 351 NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRE 410 L + + F L + D +A E E +G L L EF+ ++R+ Sbjct: 335 GSLTQRLNEAF---LILNVRDSERTTAEEVRMTQLELEQQLGGLFSLLTVEFLVPYLNRK 391 Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGD 470 L++ G++P +++ + L + Q ES+A Q + + + Sbjct: 392 LNVAQKTGDIPRLPQGGIVRPTIVAG--INALGRGQDRESLA---QFLTVIAQTMGPDA- 445 Query: 471 PSCMDHMDTDRVSRFSLWATNT 492 +++ D V + ++ Sbjct: 446 --IAQYINPDEVIKRLAASSGI 465 >gi|259419010|ref|ZP_05742927.1| hypothetical protein SCH4B_4395 [Silicibacter sp. TrichCH4B] gi|259345232|gb|EEW57086.1| hypothetical protein SCH4B_4395 [Silicibacter sp. TrichCH4B] Length = 506 Score = 128 bits (321), Expect = 3e-27, Method: Composition-based stats. Identities = 81/520 (15%), Positives = 166/520 (31%), Gaps = 55/520 (10%) Query: 8 DIQDRFNYLKNQRGEL-NYWMEELTGFLYPYKNNAQL----------RMWDTTGSEACIK 56 + RF+ K+ R + E+ F + + ++ T E + Sbjct: 4 EFDRRFSVAKSHRKQHVEEDGREVYKFCFNGREREWDNNSSYKDEPEEIFVETPGEVAEE 63 Query: 57 LSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRS 116 S L S +TP W + + +++ + + + S Sbjct: 64 FSGDLFSTMTPENSPWSEFEAGNAVDEDDEAAAKEELEELEKAISKSLRS---------S 114 Query: 117 GFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVY 176 + + + V G ++ D L I + +VP+ +Y++ + D + Sbjct: 115 NYYDEGPTAFQDAVV-GNVAMWV----DRPTLNGAINFEAVPIPQLYVTPGPLGIEDR-F 168 Query: 177 REFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVY--------PKSLTDKKKD 228 R F + + D ++ + ++ N ++H + P + + D Sbjct: 169 RRQRFHYRNLKVLFPDAKFPRAIEDKIKKSSNALAVVVHGFWRTFEDVENPVWRHEIRVD 228 Query: 229 KGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 G S+ +VGR+ A +GR P + LP R+ +E Sbjct: 229 GKPIGLDKDVGSIGAVNL-----------VVGRFNPYAGSAWGRGPGRKLLPVFRQYDEL 277 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE 348 V + +L PP + + + + QPV FG Sbjct: 278 VRMNMEGLDRTLDPPFTYPHDGMLDLSQGLENGVGYPTMPGT-KDALQPVLFGTLDYGFF 336 Query: 349 ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 +L++ IR F + + K SA++ + + ++ + EF ++S Sbjct: 337 SEEKLEQKIRDGFYRE--KEQAGKTPPSASQYIGQENKQVRRMARPATKTWREFGVGLLS 394 Query: 409 RELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKT 468 R + G E ++ SPL + Q + V +A + + E Sbjct: 395 RVEWLERQPGGSLEGAELPLIDSGVVNARPISPLERAQAMQDVTTADMIIGMINERLGPE 454 Query: 469 GDPSCMDHMDTDRVSRFSLWATNTPAVLI--RDTAEVEDI 506 + DT R + L ++ R AE+E + Sbjct: 455 QAAMLIKGTDTYRKIKEVLK-----DQIVEFRSEAEIEAL 489 >gi|194100448|ref|YP_002003821.1| gp8 [Klebsiella phage K11] gi|193201387|gb|ACF15865.1| gp8 [Klebsiella phage K11] Length = 535 Score = 123 bits (310), Expect = 6e-26, Method: Composition-based stats. Identities = 84/549 (15%), Positives = 163/549 (29%), Gaps = 51/549 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSE 52 + + + + ++ LKN R E + P +NA W + G+ Sbjct: 6 LEGFAEEGAKAVYDRLKNDRQPYETRAESCAQYTIPSLFPKDSDNASTDYTTPWQSVGAR 65 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 L+S L + P Q W L S + L + +K V E V + + E Sbjct: 66 GLNNLASKLMLALF-PMQSWMKLTISEYEAKNLLGDAEGLAK-VDEGLSMVERIIMNYIE 123 Query: 113 RSRSGFVGC-LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNV 171 S L + G Y+ + + + R L++ + + Sbjct: 124 ---SNSYRVTLFECLKQLCVAGNALLYL-PEPEGYTPMKLYR-----LNSYVVQRDAFGN 174 Query: 172 VDSVYREFTFTVDQIV-SKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKG 230 V + T+D+I + + V S + + E+ + VY D Sbjct: 175 VLQIV-----TLDKIAFNALPEDVRSQVEAAQGEQKEDAEVDVYTHVYLNESGDG----- 224 Query: 231 NKGFHSKFVSVDENRF-FEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNE 287 +SK+ V E E + PYI R E YGRS E L ++ L Sbjct: 225 ----YSKYEEVAEAVVPGSEAEYPLEECPYIPVRMVRIDGESYGRSYVEEYLGDLKSLEN 280 Query: 288 TVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYH 347 + + ++ + + L R+ F ++ Sbjct: 281 LQESIVKMAMITAKVIGLVDPAGITQVRRLTAAQSGAFVPGRKQDIEFLQLEKSGDFTVA 340 Query: 348 EELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 + ++ ++ + F+L+ V +A E E +G + L E + Sbjct: 341 KNVSDTIEARLSYAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPL 399 Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 + L L + +PE P +E + + + + + L Sbjct: 400 VRVLLKQLQATQQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCIAAWSALKA 453 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTP-AVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQ 525 GD D ++ + A A ++ + + Q+ Q + Q Sbjct: 454 LEGD----DDLNLANLKLRIANAIGLDTAGMLLTQEQKNALMAQQGAQIATQQGAAALGQ 509 Query: 526 LQQTSQDIG 534 Sbjct: 510 GMAAQATAS 518 >gi|326536937|ref|YP_004306344.1| head-tail connector protein [Pseudomonas phage phiIBB-PF7A] gi|318054513|gb|ADV35689.1| head-tail connector protein [Pseudomonas phage phiIBB-PF7A] Length = 535 Score = 118 bits (296), Expect = 2e-24, Method: Composition-based stats. Identities = 79/566 (13%), Positives = 161/566 (28%), Gaps = 57/566 (10%) Query: 3 QRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEAC 54 + + + ++ LK+ R E P + + G+ Sbjct: 7 GLAEEGAKAVYDRLKSDRAPYETRAENCAKVTIPSLFPKESDNSSTNYTTPYQAVGARGV 66 Query: 55 IKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERS 114 L++ + + P + W L S + + + V + V L + E + Sbjct: 67 NNLAAKVHMALF-PLEPWMKLKVSEWQAKQLV-TDPEELAMVEQGLSMVERILMSYMEAN 124 Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDS 174 + L +V G GC Y+ + +G L N + + V Sbjct: 125 S--YRTTLHELIRQLVIAGAGCLYL---PPPESSSQGSPMKLYTLHNHVVQRDAFGNV-- 177 Query: 175 VYREFTFTVDQI--VSKWGDKVLSSKMKSAL--ARNENERFTIIHAVYPKSLTDKKKDKG 230 QI + + L +++ L +E + VY D Sbjct: 178 ---------LQICTLDRVAFAALPEDVRTKLDGEHKPDEEIEVYTHVY--------LDDE 220 Query: 231 NKGFHSKFVSVDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 + + S E + Q P++ R+ R E YGRS E + L Sbjct: 221 SGDYLSYQEIDGEEVEGTDGQYPREAMPWVAVRWTKRDGEHYGRSHVEEYQGDLDSLENL 280 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE 348 + +F ++ + + L R+ F + + Sbjct: 281 HEAMIKFSMIASKVVGLVNPNGITQVRRLTKAQTGAFVPGRKADIEFLQLDKAADFSVAK 340 Query: 349 ELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 + +++ + +F+L+ V + +A E RE +G + L E +I Sbjct: 341 SVADAIEQRLSYVFMLN-SAVQRNGERVTAEEIRYVARELEDTLGGVYSILSQELQLPII 399 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 L+ L + +P+ P VE L + Q + + LQ + V L Sbjct: 400 RILLNQLQATQQIPDMPKEAVEPTVSTGVE---ALGRGQDLDKMTQFLQALQLVAPLEND 456 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRRVMEEQHLQQQL 526 ++ + A L+ E + Q++ + Sbjct: 457 QD-------LNITTIKLRLANAMGLDTSGLLLTQEE----KAQKQAEMMAQTGGENLAGA 505 Query: 527 QQTSQDIGAKAAGRAMEKKLTHDMME 552 M+ + M+ Sbjct: 506 AGAGAGAMMTQDPDTMQDAMATAGMD 531 >gi|61806424|ref|YP_214201.1| T7-like head-to-tail connector [Prochlorococcus phage P-SSP7] gi|61374349|gb|AAX44203.1| T7-like head-to-tail connector [Prochlorococcus phage P-SSP7] gi|265525461|gb|ACY76227.1| head-tail connector protein [Prochlorococcus phage P-SSP7] Length = 522 Score = 118 bits (296), Expect = 3e-24, Method: Composition-based stats. Identities = 67/505 (13%), Positives = 157/505 (31%), Gaps = 63/505 (12%) Query: 8 DIQDRFNYLKNQRGELNYWMEELTGFLYPY--KNNAQLRM--------WDTTGSEACIKL 57 ++R+N L R E + PY ++ R W + G++ C+ L Sbjct: 2 KARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLTVPWQSVGAKCCVTL 61 Query: 58 SSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSG 117 ++ L + PP + L L + ++ ++ + + S Sbjct: 62 AAKLMLAVLPPQTSFFKLQVRDDKLGEELDPQIRS--ELDLSFSKMERMIMD--YIAASN 117 Query: 118 FVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYR 177 + ++ G +M D + PL+ ++ + V + Sbjct: 118 DRVAVHQALKHLIVGGNALIFMGKDG----------LKTFPLTRYVINRDGDGNVLEIVT 167 Query: 178 EFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSK 237 + + + + + ++ + + N++ T K DK + + Sbjct: 168 KELISRKVLDIELPEPKPNTGIDESSTTNDDVTI----------YTYVKLDKSSGRW--V 215 Query: 238 FVSVDENRFFEEKQI----ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 + ++ + + P++ R+ E YGR E L ++ L+ L Sbjct: 216 WHQEAFDKIIPDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLI 275 Query: 294 QFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR--EGRS----LFQPVQFGNPLPYH 347 + + + + KP + +GR + Q + + Sbjct: 276 EGAAAASKVVFLVSPSS-----TTKPATIAKAGNGAIVQGRPEDVAVIQVGKTADFSTAA 330 Query: 348 EELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 +++ + FL+ + + +A E E +G + L EF+ + Sbjct: 331 NMATAIEKRLLEAFLV---MNVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYL 387 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 +R L +L +P+ P V + L + Q ES+ + V + Sbjct: 388 NRTLLVLQRSNQIPKLPKDIVRPTI---VAGVNALGRGQDRESLTA------FVGTIAQT 438 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNT 492 G + M +++ + A Sbjct: 439 LGPEALMQYLNPLEAIKRLAAAQGI 463 >gi|326633070|ref|YP_004306681.1| predicted head to tail joining protein [Salmonella phage Vi06] gi|301170543|emb|CBV65231.1| predicted head to tail joining protein [Salmonella phage Vi06] Length = 536 Score = 118 bits (295), Expect = 3e-24, Method: Composition-based stats. Identities = 96/567 (16%), Positives = 165/567 (29%), Gaps = 77/567 (13%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSE 52 + + AK + + LKN R + + P +NA W G+ Sbjct: 8 LAEDGAKSV---YERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYTTPWQAVGAR 64 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 L+S L + P Q W L S + L D +K V E V + + E Sbjct: 65 GLNNLASKLMLALF-PMQTWMRLTISEYEAKQLLSDPDGLAK-VDEGLSMVERIIMNYIE 122 Query: 113 RSRSGFVGC-LQSFYTSVVEFGTGCFYM-EADVDEKGLEEGIRYISVPLSNVYMSVNHQN 170 S L +V G Y+ E D + R LS+ + + Sbjct: 123 ---SNSYRVTLFEALKQLVVAGNVLLYLPEPDGSNYNPMKLYR-----LSSYVVQRDAFG 174 Query: 171 VVDSVYREFTF-TVDQIVSKWGDKVLSSKMKSALA-----RNENERFTIIHAVYPKSLTD 224 V T DQI +G L ++ A+ + +E + +Y + Sbjct: 175 NV------LQMVTRDQIA--FG--ALPEDVRKAVEGQGGDKKPDEVIDVYTHIYLDEESG 224 Query: 225 KKKDKGNKGFHSKFVSVDENRF-FEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPT 281 + + + + PYI R E YGRS E L Sbjct: 225 EYLR---------YEEAEGMEVQGSDGSYPKEACPYIPIRMVRLDGESYGRSYIEEYLGD 275 Query: 282 IRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG 341 +R L + + +S + + L R F ++ Sbjct: 276 LRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQ 335 Query: 342 NPLPYHEELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQS 400 + ++ ++ + F+L+ V +A E E +G + L Sbjct: 336 ADFTVAKSVSDAIEARLSFAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQ 394 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E ++ L L + +PE P +E + + + + V Sbjct: 395 ELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCVAA 448 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQ 520 + DP ++ + A I T E +Q Sbjct: 449 WAAMAPMRDDPD----INLAMIKLRIANAIGIDTSGILLTEE--------------QRQQ 490 Query: 521 HLQQQLQQTSQDIGAKAAGRAMEKKLT 547 + QQ Q D GA A G+ M + T Sbjct: 491 KMAQQSMQLGMDSGAAALGQGMAAQAT 517 >gi|310005679|gb|ADP00067.1| head-tail connector protein [Cyanophage 9515-10a] Length = 534 Score = 117 bits (293), Expect = 5e-24, Method: Composition-based stats. Identities = 82/566 (14%), Positives = 171/566 (30%), Gaps = 77/566 (13%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYP---YKNN------AQLRMWDTTGSEACIKL 57 K+ + R+N L R + E P +N+ W + G++ + L Sbjct: 2 KNARQRYNKLSTDREQFLNVAYECAELTIPTLLMRNDKPPAYAQFKTPWQSVGAKGVVTL 61 Query: 58 SSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSG 117 +S L + PP + L S + E ++ ++ + S Sbjct: 62 ASKLMLGLLPPSTSFFKLQLDDSKLGIEIPPE--AKSEMDLSFAKIERQIMDAIAASTDR 119 Query: 118 FVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYR 177 + S +V G YM + PL+ + + V + Sbjct: 120 --VQIFSAIKHLVVTGNALLYMGKQGMKM----------YPLNRYVVERDGNGDVIEIVT 167 Query: 178 EFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSK 237 + + ++ ++ + N ++ + V K Sbjct: 168 KEKVS-RDLI---PIELNDDSVVDDDTNNADKDVDVYTCV--------KLGAKG-----W 210 Query: 238 FVSVDENRFF---EEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNEL 292 + + + E + P++ R+ E YGRS E L ++ L + L Sbjct: 211 YWHQEVHDILIPGSEGKAPKDKNPFLPLRFVTVDGEDYGRSRVEEFLGDLKSLEALMQAL 270 Query: 293 AQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR--EGRS----LFQPVQFGNPLPY 346 + + + KPG + +GR + Q + + Sbjct: 271 VEGSAAAAKVVFTVSPSSV-----TKPGTLANAGNGAIIQGRPDDIGVIQVGKTADFRTA 325 Query: 347 HEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 E +N L++ + F L + +A E E +G L L +EF+ Sbjct: 326 FELVNTLEKRLSEAF---LILNVRQSERTTAEEVRMTQMELEQQLGGLFSLLTTEFLIPY 382 Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 ++R++ L +P+ P V + L + Q +++ V V + Sbjct: 383 LNRKMHSLTLAKKIPKIPKNVVNPTI---VAGINALGRGQDRDAL------VQFVTTIAQ 433 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQL 526 G + +++ D + A +V ++ + E ++ Q Sbjct: 434 TMGPEALAQYINPDEAIKRLAAAQGI---------DVLNLVKSMEELDAQKQQAQQQAMQ 484 Query: 527 QQTSQDIGAKAAGRAMEKKLTHDMME 552 Q G A M+ ++ME Sbjct: 485 QNLMGQAGQLAGAPLMDPSKNPEVME 510 >gi|38424264|gb|AAR19412.1| head-tail connector protein [uncultured cyanophage] Length = 517 Score = 117 bits (293), Expect = 6e-24, Method: Composition-based stats. Identities = 69/517 (13%), Positives = 155/517 (29%), Gaps = 63/517 (12%) Query: 8 DIQDRFNYLKNQRGELNYWMEELTGFLYPY--KNNAQLRM--------WDTTGSEACIKL 57 + + R++ L ++R + + + PY + + + + W + G++ + L Sbjct: 2 NAKTRYDELSSERTQFLDEARQASELTLPYLIRGHEETYIGMKQLKTPWQSVGAKGVVTL 61 Query: 58 SSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSG 117 +S L + PP + L S + ++ +V T+ + S Sbjct: 62 ASKLMLALLPPQTSFFKLQLDESQIGEEFGPDIKS--ELDLSFAKVERTI--LENIAASD 117 Query: 118 FVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYR 177 + +V G +M D + PL+ + + V + Sbjct: 118 DRVAVHQALQHLVVAGNALIFMGKDGLKV----------FPLNRYVVERDGNGNVLEIVT 167 Query: 178 EFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSK 237 + + + + + + +E H + Sbjct: 168 KERISKKLLAEEMPEYE--EPVNEDSNFRPDECDVYTHVRRENNRV-------------V 212 Query: 238 FVSVDENRFFEEKQ----IATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 + + + I P++ R+ E YGR + + ++ L L Sbjct: 213 WHQEVHGKVLPKSISKAPIDANPWLPLRFNTVDGEAYGRGRVGQFIGDLKSLEALSQALV 272 Query: 294 QFGRLSLHPPTIAVSEAKQRNFDLKP---GYMNIGALSREGR-SLFQPVQFGNPLPYHEE 349 + + + + + L G + G G + + FG + Sbjct: 273 EGSAAAAKVVFVVAPSSTTKPATLASAGNGAIVSGRPDDIGVIQVGKTADFGTAFQMTQV 332 Query: 350 LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISR 409 R + F L + +A E E +G L L EF+ ++R Sbjct: 333 YER---RLSEAF---LILNPRNAERVTAEEVRMTQLELEQQLGGLFSLLTVEFLVPYLNR 386 Query: 410 ELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTG 469 +L + + +P P + V + L + Q A S+A Q + T+ + Sbjct: 387 KLSVAQKRNEIPRIPKGIVKPTIVAGV---NALGRGQDAISLA---QFLQTIAQTMGPEA 440 Query: 470 DPSCMDHMDTDRVSRFSLWATNTPA-VLIRDTAEVED 505 +++ V + A L+R E++ Sbjct: 441 ---IAQYINPTEVVKRLAAAQGIDILNLVRSMEELQA 474 >gi|326424990|ref|YP_004286212.1| virion structural protein [Pseudomonas phage phi15] gi|325048394|emb|CBZ42007.1| virion structural protein [Pseudomonas phage phi15] Length = 533 Score = 117 bits (292), Expect = 7e-24, Method: Composition-based stats. Identities = 82/520 (15%), Positives = 153/520 (29%), Gaps = 56/520 (10%) Query: 3 QRSAKDIQDRFNYLKNQRGELNYWMEELTGF----LYP-YKNNAQLRM---WDTTGSEAC 54 + + + ++ LK R E L+P +NA W G+ Sbjct: 7 GLAEEGAKATYDRLKTDRSPYETRAENCAKVTIGSLFPAESDNASTNYATPWQAVGARGV 66 Query: 55 IKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERS 114 LS+ + + P + W L S + L + + V V + + E + Sbjct: 67 NNLSAKVHLALF-PLEPWMKLKVSEWQAKQMLGNPEDLAA-VEAGLSMVERVMMSYMEAN 124 Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDS 174 + L +V G Y+ +G + + N + V Sbjct: 125 S--YRTTLHELIRQLVVAGNALLYLPNPEGTQGSPMKM----YTMHNYVCQRDSFGNV-- 176 Query: 175 VYREFTFTVDQIV--SKWGDKVLSSKMKSALA--RNENERFTIIHAVYPKSLTDKKKDKG 230 QIV K L ++S L R +E + VY +D Sbjct: 177 ---------LQIVTLDKVAFAALPEDVRSKLDGDRTPDEEVEVYTHVY--------RDDE 219 Query: 231 NKGFHSKFVSVDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 + F S E + Q P+I R+ R E YGRS E L ++ L Sbjct: 220 SGDFLSYQEVDGEEIEGTDGQYPVDAMPWIAVRWTKRDGEHYGRSHVEEYLGDLQSLENL 279 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE 348 + +F ++ + + L R+ F ++ + Sbjct: 280 SEAMIKFSMIASKVIGLVNPNGVTQVRRLTSAQTGAFVPGRKADIEFLQLEKAADFNIAK 339 Query: 349 ELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 + ++ + +F+L+ V +A E RE +G + L E ++ Sbjct: 340 AVADNIESRLSYVFMLN-SAVQRGGERVTAEEIRYVARELEDTLGGVYSILSQELQLPIV 398 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQA-ESVASALQGVNTVVELGV 466 L+ L + +P+ P S + + + LQ +N + + Sbjct: 399 RILLNQLQATQQIPDLPTEAVEPT-------VSTGAEALGRGQDLDKMLQFLNALTMVTP 451 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVED 505 D ++ + A LI E Sbjct: 452 LENDQD----LNVKTLKLRIAQAIGVDTTNLILTEDEKAQ 487 >gi|189427230|ref|YP_001949780.1| gp8 [Salmonella phage phiSG-JL2] gi|189085883|gb|ACD75698.1| gp8 [Salmonella phage phiSG-JL2] Length = 535 Score = 116 bits (290), Expect = 1e-23, Method: Composition-based stats. Identities = 84/526 (15%), Positives = 154/526 (29%), Gaps = 64/526 (12%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSEACIKLSSLL 61 + ++ L N R E + P +N W G+ L+S L Sbjct: 15 KATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKL 74 Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC 121 + P Q W L S + + D +K V E V + + E S Sbjct: 75 MLALF-PMQSWMKLTISEYEAKQLVGDPDGLAK-VDEGLSMVERIIMNYIE---SNSYRV 129 Query: 122 -LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 L ++ G Y+ + R LS+ + + V + Sbjct: 130 TLFECLKQLIVAGNALLYLPEPEGSYNPMKLYR-----LSSYVVQRDAYGNVLQIV---- 180 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENER-----FTIIHAVYPKSLTDKKKDKGNKGFH 235 T DQI +G L ++SA+ + E+ + VY + Sbjct: 181 -TRDQIA--FG--ALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGD---------- 225 Query: 236 SKFVSVDENRFFEEKQ------IATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 ++ +E E PYI R E YGRS E L +R L Sbjct: 226 --YLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQ 283 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPLPYHE 348 + + +S + + L + RE Q + + Sbjct: 284 EAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGDFVPGRREDIDFLQLEKQADFTVAKA 343 Query: 349 ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 ++++ + F+L+ F V +A E E +G + L E ++ Sbjct: 344 VSDQIEARLSYAFMLN-FAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVR 402 Query: 409 RELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKT 468 L L + +PE P +E + + + + ++ L Sbjct: 403 VLLKQLQATSQIPELPKEAGEPTISTGLEAIG------RGQDLDKLERCISAWAALAPMQ 456 Query: 469 GDPSCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQ 513 GDP ++ + A ++ + + + Q Q Sbjct: 457 GDPD----INLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQ 498 >gi|119637774|ref|YP_919010.1| Head-to-tail joining protein [Yersinia phage Berlin] gi|194100496|ref|YP_002003341.1| gp8 [Yersinia phage Yepe2] gi|119391805|emb|CAJ70678.1| hypothetical protein [Yersinia phage Berlin] gi|193201229|gb|ACF15710.1| gp8 [Yersinia phage Yepe2] Length = 535 Score = 116 bits (290), Expect = 1e-23, Method: Composition-based stats. Identities = 90/532 (16%), Positives = 160/532 (30%), Gaps = 64/532 (12%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSEACIKLSSLL 61 + ++ LKN R E + P +NA W G+ L+S L Sbjct: 16 KAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLASKL 75 Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC 121 + P Q W L S + + + A KV E V L + E S Sbjct: 76 MLALF-PMQTWMKLTISEFEAKQLVA-QPAELAKVEEGLSMVERILMNYIE---SNSYRV 130 Query: 122 -LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 L +V G Y+ + R LS+ + + Sbjct: 131 TLFETLKQLVVAGNALLYIPEPEGTYNPMKLYR-----LSSYVVQRDAFG---------- 175 Query: 181 FTVDQIV--SKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKF 238 TV QIV K L +++++ ++ + + VY D++ + + Sbjct: 176 -TVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHIYLDEESGE--------Y 226 Query: 239 VSVDENRFFEEKQIAT------FPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNEL 292 + +E E + PYI R E YGRS E L +R L + Sbjct: 227 LKYEEIDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAI 286 Query: 293 AQFGRLSLHPPTIA-VSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELN 351 + +S + + Q K + + E S Q + + Sbjct: 287 VKMSMISAKVIGLVNPAGITQVRRLTKAQTGDFVSGRPEDISFLQLEKAADFSVARAVSE 346 Query: 352 RLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISREL 411 +++ + F+L+ V +A E E +G + L E M+ L Sbjct: 347 QIEGRLSYAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLL 405 Query: 412 DILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDP 471 L + +PE P +E L + Q + + + L GDP Sbjct: 406 KQLQATNQIPELPKEAVEPTISTGME---ALGRGQ---DLDKLERCIAAWSALAPMQGDP 459 Query: 472 SCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRRVMEEQHL 522 ++ + A +++ E +Q+E+ Sbjct: 460 D----INIATIKLRIANAIGIDTSGILKTPEE-----KQQEMAEAAQGTAMQ 502 >gi|212671411|ref|YP_002308410.1| head-to-tail joining protein [Kluyvera phage Kvp1] gi|211997255|gb|ACJ14572.1| head-to-tail joining protein [Kluyvera phage Kvp1] Length = 535 Score = 115 bits (288), Expect = 2e-23, Method: Composition-based stats. Identities = 89/534 (16%), Positives = 157/534 (29%), Gaps = 65/534 (12%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSEACIKLSSLL 61 + ++ LKN R E + P +NA W G+ L+S L Sbjct: 16 KAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLASKL 75 Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC 121 + P Q W L S + + + A KV E V L + E S Sbjct: 76 MLALF-PMQTWMKLTISEFEAKQLVA-QPAELAKVEEGLSMVERILMNYIE---SNSYRV 130 Query: 122 -LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 L +V G Y+ + R LS+ + + Sbjct: 131 TLFETLKQLVVAGNALLYIPEPEGTYNPMKLYR-----LSSYVVQRDAFG---------- 175 Query: 181 FTVDQIV--SKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKF 238 TV QIV K L ++++L + + VY D++ + + Sbjct: 176 -TVLQIVTLDKTAYAALPEDVRNSLDSGTEHKGDEMIDVYTHIYLDEESGE--------Y 226 Query: 239 VSVDENRFFEEKQIAT------FPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNEL 292 + +E E PYI R E YGRS E L +R L + Sbjct: 227 LKYEEIDGVEVDGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAI 286 Query: 293 AQFGRLSLHPPTIA-VSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELN 351 + +S + + Q K + + E S Q + + Sbjct: 287 VKMSMISAKVIGLVNPAGITQVRRLTKAQTGDFVSGRPEDISFLQLEKAADFSVAKAVSE 346 Query: 352 RLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISREL 411 +++ + F+L+ V +A E E +G + L E M+ L Sbjct: 347 QIEGRLSYAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLL 405 Query: 412 DILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDP 471 L + +PE P +E L + Q + + + L DP Sbjct: 406 KQLQATNQIPELPKEAVEPTISTGME---ALGRGQ---DLDKLERCIAAWSALAPMQNDP 459 Query: 472 SCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRRVMEEQHLQQ 524 ++ + A +++ E +++ + L+ Sbjct: 460 D----INIATIKLRIANAIGIDTSGILKTPEE------KQQEMAEAAQGTALEN 503 >gi|310005857|gb|ADP00242.1| head-tail connector protein [Cyanophage Syn26] Length = 521 Score = 115 bits (288), Expect = 2e-23, Method: Composition-based stats. Identities = 71/499 (14%), Positives = 149/499 (29%), Gaps = 51/499 (10%) Query: 8 DIQDRFNYLKNQRGELNYWMEELTGFLYPY--KNNAQLRM--------WDTTGSEACIKL 57 + ++++N L + R + + + PY ++ R W + G++ + L Sbjct: 2 NAREKYNQLSSARRQFLDKAVQCSELTLPYLIDDDISSRPNHKSLAVPWQSVGAKCVVTL 61 Query: 58 SSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSG 117 ++ L + PP + L L + + S Sbjct: 62 AAKLMLAVLPPQTSFFKLQVRDDKLGQELDPQIRSELD----LSFAKMERMIMEYIAASN 117 Query: 118 FVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYR 177 + ++ G YM D + PL+ + + V + Sbjct: 118 DRVAIHQALKHLIVGGNALIYMHKDG----------LKTFPLTRYVVERDGDGNVLCIVT 167 Query: 178 EFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSK 237 + + + + + +S + +E ++ V ++ KD G +H + Sbjct: 168 KELISRKVLDIELPEPEPNSVV--------DESHSVADDVTIYTMVKLDKDSGRWVWHQE 219 Query: 238 FVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGR 297 P++ R+ E YGR E L ++ L+ L + Sbjct: 220 AFDKIIPDTRSTAPKKASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQALIEGAA 279 Query: 298 LSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR--EGRSLFQPVQFGNPLPYHEELNRLKE 355 + + + KP + +GR V + + Sbjct: 280 AASKVIFLVSPSS-----TTKPATIAKAGNGAIVQGRPEDVAVIQVGKTADFATAANMAQ 334 Query: 356 SIRSLFLLDLFQVLDDKASR-SAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDIL 414 I L + A R +A E E +G + L EF+ ++R L +L Sbjct: 335 GIEKRMLEAFLVMNVRNAERVTAEEVRLTQLELEQQLGGIFSLLTVEFLIPYLNRTLLVL 394 Query: 415 DSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSC- 473 +P+ P + V + L + Q ES+ Q + T+ + T P Sbjct: 395 QRSNQIPKLPKDIVRPTIVAGV---NALGRGQDRESLT---QFIGTIAQ----TLGPEAL 444 Query: 474 MDHMDTDRVSRFSLWATNT 492 M +++ + A Sbjct: 445 MQYINPQEAIKRLAAAQGI 463 >gi|312436374|gb|ADQ83183.1| head to tail joining protein [Yersinia phage Yep-phi] Length = 535 Score = 115 bits (287), Expect = 2e-23, Method: Composition-based stats. Identities = 90/532 (16%), Positives = 161/532 (30%), Gaps = 64/532 (12%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSEACIKLSSLL 61 + ++ LKN R E + P +NA W G+ L+S L Sbjct: 16 KAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLASKL 75 Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC 121 + P Q W L S + + + A KV E V L + E S Sbjct: 76 MLALF-PMQTWMKLTISEFEAKQLVA-QPAELAKVEEGLSMVERILMNYIE---SNSYRV 130 Query: 122 -LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 L +V G Y+ + R LS+ + + Sbjct: 131 TLFETLKQLVVAGNALLYIPEPEGTYNPMKLYR-----LSSYVVQRDAFG---------- 175 Query: 181 FTVDQIV--SKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKF 238 TV QIV K L +++++ ++ + + VY D++ + + Sbjct: 176 -TVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHIYLDEESGE--------Y 226 Query: 239 VSVDENRFFEEKQIAT------FPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNEL 292 + +E E + PYI R E YGRS E L +R L + Sbjct: 227 LKYEEIDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAI 286 Query: 293 AQFGRLSLHPPTIA-VSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELN 351 + +S + + Q K + + E S Q + + Sbjct: 287 VKMSMISAKVIGLVNPAGITQVRRLTKAQTGDFVSGRPEDISFLQLEKAADFSVARAVSE 346 Query: 352 RLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISREL 411 +++ + F+L+ V +A E E +G + L E M+ L Sbjct: 347 QIEGRLSYAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLL 405 Query: 412 DILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDP 471 L + +PE P +E L + Q + + ++ L GDP Sbjct: 406 KQLQATNQIPELPKEAVEPTISTGME---ALGRGQ---DLDKLERCISAWSALAPMQGDP 459 Query: 472 SCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRRVMEEQHL 522 ++ + A +++ E +Q+E+ Sbjct: 460 D----INIATIKLRIANAIGIDTSGILKTPEE-----KQQEMAEAAQGTAMQ 502 >gi|29366727|ref|NP_813772.1| head-tail connector protein [Pseudomonas phage gh-1] gi|29243586|gb|AAO73165.1|AF493143_26 head-tail connector protein [Pseudomonas phage gh-1] Length = 543 Score = 115 bits (287), Expect = 3e-23, Method: Composition-based stats. Identities = 90/571 (15%), Positives = 172/571 (30%), Gaps = 67/571 (11%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY---KNNAQLRMWDTTGSEA----- 53 + + + + LKN R E P K++ TT +A Sbjct: 7 EGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTTPWQAVGARG 66 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 LS+ + + P Q W L S + + + ++ V + V L + E Sbjct: 67 LNNLSAKVMLALF-PLQSWMKLKVSEWQAKQLV-SDPSQLAVVEQGLGMVERILMSYMEA 124 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 + + L + GT Y+ ++ L N + + V Sbjct: 125 NS--YRVTLFELIRQLALAGTALIYLPPPDASSNSYNPMKL--YTLHNHVVQRDAFGNV- 179 Query: 174 SVYREFTFTVDQIV--SKWGDKVLSSKMKSAL----ARNENERFTIIHAVYPKSLTDKKK 227 QIV K L ++++L + + +Y + Sbjct: 180 ----------LQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTHIYIDDESGD-- 227 Query: 228 DKGNKGFHSKFVSVDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRL 285 F S + Q P+I R+ R E YGRS E L + L Sbjct: 228 ------FLSYQEIEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSL 281 Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLP 345 + +F +S + + L R+ F ++ Sbjct: 282 ESLNEAMIKFAMISSKVVGLVNPNGITQVRRLVKAQTGDFVAGRKADIEFLQLEKTADFT 341 Query: 346 YHEELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIG 404 + + ++ + +F+L+ V +A E E +G + L E Sbjct: 342 VAKSVADAIEARLSYVFMLN-SAVQRSGERVTAEEIRYVASELEDTLGGVYSILSQELQL 400 Query: 405 AMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVEL 464 ++ L+ L + +P P E L + Q + Q +N V + Sbjct: 401 PIVRVLLNQLQATQQIPNLPQEAVEPTVTTGAE---ALGRGQ---DLDKLTQFLNAVATV 454 Query: 465 GVKTGDPSCMDHMDTDRVSRFSLWATNTP-AVLIRDTAEVEDIRQQREVQRRVMEEQHLQ 523 GDP ++ + + A A L+ AE + + ++ L+ Sbjct: 455 SQLNGDPD----LNVNNIKLRLANAIGIDTAGLLLTEAE----------KAQAQSQEMLK 500 Query: 524 QQLQQTSQDIGAKAAGRAMEKKLTHDMMENS 554 Q + IG+ A +A + + ME++ Sbjct: 501 QGGLNAAAGIGSGVAAQA---TASPEAMESA 528 >gi|18640510|ref|NP_570351.1| head-tail connector protein [Synechococcus phage P60] gi|18478740|gb|AAL73289.1| head-tail connector protein [Synechococcus phage P60] Length = 555 Score = 115 bits (287), Expect = 3e-23, Method: Composition-based stats. Identities = 78/554 (14%), Positives = 161/554 (29%), Gaps = 71/554 (12%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLL 61 Q ++ L+ R + + PY + W + GS+ L+S L Sbjct: 6 QAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQGGYLPTPWQSVGSKGVNVLASKL 65 Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC 121 + P + L + + E ARS ++ ++ + S Sbjct: 66 MLSLFPVNTSFFKLQINDAEIDNLGMDEQARS-EIDLSLSRIERIVTQDIAESSDRVHLE 124 Query: 122 LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTF 181 + ++ G Y + PL +S + + V + E Sbjct: 125 M--AMKHLIVTGNALLY----------QGKKNLKLYPLDRFVVSRDGEGNVMEIVTEEQI 172 Query: 182 TVDQIVSKW----GDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSK 237 + ++ G + + T + + K K N Sbjct: 173 DRSLLPEEFQKVGGLEGAPDS-NAVGEDGPKMGVT--------APGGRDKGKSNDALVYT 223 Query: 238 FVSVDENRF------------FEEKQ--IATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 +V + + P+I R+ + E YGR E + ++ Sbjct: 224 YVCRKDGQVKWHQECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLK 283 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRS----LFQPVQ 339 L + + S + A + +L + +GR + Q + Sbjct: 284 SLEALSQAMVEGSAASAKVVFMVSPSATTKPQNL---ALAANGAIIQGRPDDVSVVQANK 340 Query: 340 FGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQ 399 + E + +L++ I F L + +A E +E +G + L Sbjct: 341 AADFRTVLEMIQKLEQRISDAF---LMLQVRQSERTTATEVQATVQELNEQIGGIYSNLT 397 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVN 459 +E + ++R+L +L Q LP+ P + Q + Sbjct: 398 TELLQPYLARKLHLLQKQRKLPQLPKDLVQPT---------VVAGLWGVGRGQDKQQLME 448 Query: 460 TVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPA-VLIRDTAEVEDIRQQREVQRRVME 518 + L G M +++ + A LI E ++Q + Q++ M Sbjct: 449 FITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINSP---ETMKQLGDQQKQDMV 505 Query: 519 EQHLQQQLQQTSQD 532 + L Q Q ++ Sbjct: 506 QASLINQAGQLAKT 519 >gi|17570823|ref|NP_523332.1| head-to-tail joining protein [Enterobacteria phage T3] gi|138413|sp|P20323|VHTJ_BPT3 RecName: Full=Head-to-tail joining protein gi|15714|emb|CAA35152.1| 8 [Enterobacteria phage T3] gi|17384307|emb|CAC86295.1| head-to-tail joining protein [Enterobacteria phage T3] Length = 535 Score = 114 bits (285), Expect = 5e-23, Method: Composition-based stats. Identities = 87/558 (15%), Positives = 159/558 (28%), Gaps = 64/558 (11%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSEACIKLSSLL 61 + ++ L N R E + P +N W G+ L+S L Sbjct: 15 KATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKL 74 Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC 121 + P Q W L S + + D +K V E V + + E S Sbjct: 75 MLALF-PMQSWMKLTISEYEAKQLVGDPDGLAK-VDEGLSMVERIIMNYIE---SNSYRV 129 Query: 122 -LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 L ++ G Y+ + R LS+ + + V + Sbjct: 130 TLFECLKQLIVAGNALLYLPEPEGSYNPMKLYR-----LSSYVVQRDAYGNVLQIV---- 180 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENER-----FTIIHAVYPKSLTDKKKDKGNKGFH 235 T DQI +G L ++SA+ ++ E+ + VY + Sbjct: 181 -TRDQIA--FG--ALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLDEESGD---------- 225 Query: 236 SKFVSVDENRFFEEKQ------IATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 ++ +E E PYI R E YGRS E L +R L Sbjct: 226 --YLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQ 283 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPLPYHE 348 + + +S + + L + RE Q + + Sbjct: 284 EAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGDFVPGRREDIDFLQLEKQADFTVAKA 343 Query: 349 ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 ++++ + F+L+ V +A E E +G + L E ++ Sbjct: 344 VSDQIEARLSYAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVR 402 Query: 409 RELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKT 468 L L + +PE P +E + + + + ++ L Sbjct: 403 VLLKQLQATSQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCISAWAALAPMQ 456 Query: 469 GDPSCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQ 527 GDP ++ + A ++ + + + Q Q V Sbjct: 457 GDPD----INLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGVENAAAAGGAGV 512 Query: 528 QTSQDIGAKAAGRAMEKK 545 +A A K Sbjct: 513 GALATSSPEAMQGAAAKA 530 >gi|326536132|ref|YP_004300566.1| gp8 [Enterobacteria phage 285P] gi|256861521|gb|ACV32477.1| gp8 [Enterobacteria phage 285P] Length = 535 Score = 113 bits (284), Expect = 6e-23, Method: Composition-based stats. Identities = 88/534 (16%), Positives = 158/534 (29%), Gaps = 65/534 (12%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSEACIKLSSLL 61 + ++ LKN R E + P +NA W G+ L+S L Sbjct: 16 KAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLASKL 75 Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC 121 + P Q W L S + + + A KV E V L + E S Sbjct: 76 MLALF-PMQTWMKLTISEFEAKQLVA-QPAELAKVEEGLSMVERILMNYIE---SNSYRV 130 Query: 122 -LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 L +V G Y+ + R LS+ + + Sbjct: 131 TLFETLKQLVVAGNALLYIPEPEGTYNPMKLYR-----LSSYVVQRDAFG---------- 175 Query: 181 FTVDQIV--SKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKF 238 TV QIV K L +++++ + + + VY D++ + + Sbjct: 176 -TVLQIVTLDKTAYAALPEDVRNSMDSGQEHKGDEMIDVYTHIYLDEESGE--------Y 226 Query: 239 VSVDENRFFEEKQIAT------FPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNEL 292 + +E E PYI R E YGRS E L +R L + Sbjct: 227 LKYEEIDGVEVDGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAI 286 Query: 293 AQFGRLSLHPPTIA-VSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELN 351 + +S + + Q K + + E S Q + + Sbjct: 287 VKMSMISAKVIGLVNPAGITQVRRLTKAQTGDFVSGRPEDISFLQLEKAADFSVAKAVSE 346 Query: 352 RLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISREL 411 +++ + F+L+ V +A E E +G + L E M+ L Sbjct: 347 QIEGRLSYAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLL 405 Query: 412 DILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDP 471 L + +PE P +E L + Q + + + L DP Sbjct: 406 KQLQATNQIPELPKEAVEPTISTGME---ALGRGQ---DLDKLERCIAAWSALAPMQNDP 459 Query: 472 SCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRRVMEEQHLQQ 524 ++ + A +++ E +++ + L+ Sbjct: 460 D----INIATIKLRIANAIGIDTSGILKTPEE------KQQEMAEAAQGTALEN 503 >gi|194100286|ref|YP_002003484.1| gp8 [Enterobacteria phage BA14] gi|193201281|gb|ACF15761.1| gp8 [Enterobacteria phage BA14] Length = 535 Score = 113 bits (283), Expect = 8e-23, Method: Composition-based stats. Identities = 89/527 (16%), Positives = 158/527 (29%), Gaps = 64/527 (12%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSEACIKLSSLL 61 + ++ LKN R E + P +NA W G+ L+S L Sbjct: 16 KAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLASKL 75 Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC 121 + P Q W L S + + + A KV E V L + E S Sbjct: 76 MLALF-PMQTWMKLTISEFEAKQLVA-QPAELAKVEEGLSMVERILMNYIE---SNSYRV 130 Query: 122 -LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 L +V G Y+ + R LS+ + + Sbjct: 131 TLFETLKQLVVAGNALLYIPEPEGTYNPMKLYR-----LSSYVVQRDAFG---------- 175 Query: 181 FTVDQIV--SKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKF 238 TV QIV K L +++++ + + + VY D++ + + Sbjct: 176 -TVLQIVTLDKTAYAALPEDVRNSMDSGQEHKGDEMIDVYTHIYLDEESGE--------Y 226 Query: 239 VSVDENRFFEEKQIAT------FPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNEL 292 + +E E + PYI R E YGRS E L +R L + Sbjct: 227 LKYEEIDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAI 286 Query: 293 AQFGRLSLHPPTIA-VSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELN 351 + +S + + Q K + + E S Q + + Sbjct: 287 VKMSMISAKVIGLVNPAGITQVRRLTKAQTGDFVSGRPEDISFLQLEKAADFSVAKAVSE 346 Query: 352 RLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISREL 411 +++ + F+L+ V +A E E +G + L E M+ L Sbjct: 347 QIEGRLSYAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLL 405 Query: 412 DILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDP 471 L + +PE P +E L + Q + + + L DP Sbjct: 406 KQLQATNQIPELPKEAVEPTISTGME---ALGRGQ---DLDKLERCIAAWSALAPMQNDP 459 Query: 472 SCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRRVM 517 ++ + A +++ E +Q+E+ Sbjct: 460 D----INIATIKLRIANAIGIDTSGILKTPEE-----KQQEMAESAQ 497 >gi|291335893|gb|ADD95488.1| T7-like head to tail connector [uncultured phage MedDCM-OCT-S08-C41] Length = 527 Score = 113 bits (282), Expect = 1e-22, Method: Composition-based stats. Identities = 75/508 (14%), Positives = 161/508 (31%), Gaps = 72/508 (14%) Query: 8 DIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRM----------WDTTGSEACIKL 57 ++R++ L + R + E + P+ LR+ W + G+++ + L Sbjct: 3 KAKERYSQLSSDRHQFLDIAVECSELTLPHLITDDLRVRQNHKRLTTPWQSVGAKSVVTL 62 Query: 58 SSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSG 117 ++ L + PP + L L E ++ ++ + S Sbjct: 63 AAKLMLALLPPQTSFFKLQVRDDQLGEELPMEVRS--ELDLSFSKMERMVMDKIAASSDR 120 Query: 118 FVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYR 177 + ++ G +M D + PL+ +S + V + Sbjct: 121 --VVVHQALKHLIVGGNALIFMGKDG----------LKNFPLNRFVVSRDGNGYVCEIV- 167 Query: 178 EFTFTVDQIVSK--WGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFH 235 ++V++ G + + N +E + V + + N G+ Sbjct: 168 -----TKELVNRKLLGIDPMPDPHTVSGKGNNDEDAEVYTYV---------RRQDNGGW- 212 Query: 236 SKFVSVDENRFFEEKQI----ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNE 291 + +++ + + P++V R+ E YGR E L +R L Sbjct: 213 -VWHQEVDDKIIDGSRSTAPKDASPWLVLRFNAVDGEDYGRGRVEEFLGDLRSLEALSQA 271 Query: 292 LAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR--EGRS----LFQPVQFGNPLP 345 L + + + A KP + +GR + Q + + Sbjct: 272 LIEGSAAAAKVVFLVNPAA-----TTKPSTIAKAGNGAIVQGRPEDVSVVQVGKTADFGT 326 Query: 346 YHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGA 405 + +++ + F L + +A E E +G L L EF+ Sbjct: 327 ASQMAQQIERRLGEAF---LLLNIRQSERTTAEEVRLTQLELEQQLGGLFSLLTVEFLKP 383 Query: 406 MISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELG 465 ++R L ++ G LP+ P + V + L + Q ES+ + + T+ + Sbjct: 384 YLARTLMVMQRSGQLPKIPREYVQPQIVAGV---NALGRGQDRESLTA---FIGTIAQ-- 435 Query: 466 VKTGDPSCMDH-MDTDRVSRFSLWATNT 492 T P + +D + A Sbjct: 436 --TLGPEALMKYIDASEAIKRLAAAQGI 461 >gi|9634032|ref|NP_052106.1| head-to-tail joining protein [Yersinia phage phiYeO3-12] gi|6599023|emb|CAB63627.1| head-to-tail joining protein [Yersinia phage phiYeO3-12] Length = 535 Score = 112 bits (279), Expect = 2e-22, Method: Composition-based stats. Identities = 83/526 (15%), Positives = 153/526 (29%), Gaps = 64/526 (12%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSEACIKLSSLL 61 + ++ L N R E + P +N W G+ L+S L Sbjct: 15 KATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKL 74 Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC 121 + P Q W L S + + D +K V E V + + E S Sbjct: 75 MLALF-PMQSWMKLTISEYEAKQLVGDPDGLAK-VDEGLSMVERIIMNYIE---SNSYRV 129 Query: 122 -LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 L ++ G Y+ + R LS+ + + V + Sbjct: 130 TLFECLKQLIVAGNALLYLPEPEGSYNPMKLYR-----LSSYVVQRDAYGNVLQIV---- 180 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENER-----FTIIHAVYPKSLTDKKKDKGNKGFH 235 T DQI +G L ++SA+ + E+ + VY + Sbjct: 181 -TRDQIA--FG--ALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGD---------- 225 Query: 236 SKFVSVDENRFFEEKQ------IATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 ++ +E E PYI R E YGRS E L +R L Sbjct: 226 --YLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQ 283 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPLPYHE 348 + + +S + + L + RE Q + + Sbjct: 284 EAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGDFVPGRREDIDFLQLEKQADFTVAKA 343 Query: 349 ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 ++++ + F+L+ V +A E E +G + L E ++ Sbjct: 344 VSDQIEARLSYAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVR 402 Query: 409 RELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKT 468 L L + +PE P +E + + + + ++ L Sbjct: 403 VLLKQLQATSQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCISAWAALAPMQ 456 Query: 469 GDPSCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQ 513 GDP ++ + A ++ + + + Q Q Sbjct: 457 GDPD----INLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQ 498 >gi|26989003|ref|NP_744428.1| head-to-tail joining protein [Pseudomonas putida KT2440] gi|24983824|gb|AAN67892.1|AE016421_4 head-to-tail joining protein [Pseudomonas putida KT2440] Length = 524 Score = 112 bits (279), Expect = 2e-22, Method: Composition-based stats. Identities = 78/563 (13%), Positives = 174/563 (30%), Gaps = 71/563 (12%) Query: 3 QRSAKDIQDRFNYLKNQRGELNYWMEELTGF-----LYPYKNNAQLRMWDT---TGSEAC 54 + + L R + + + + P + + + + + Sbjct: 7 EPERGLAASLYAKLAPDRETFLQRARDCSKYSIPTLIPPAGHASGTKFYTPWQAVAARGV 66 Query: 55 IKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERS 114 L + L + PP + L E + L V+ ++ + E + Sbjct: 67 NNLGAKLLMALLPPNSPFFRL-EIDEFTEEKLTSNPQMHADVQAGLAKIERAVQTEIETT 125 Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV-D 173 ++ G G Y+ + G+++ PL + + V D Sbjct: 126 A--IRVTGFELLKHLIVGGNGLVYL-------PQQGGMKF--YPLDRYVVRRDPMGNVLD 174 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALA---------RNENERFTIIHAVYPKSLTD 224 V + + VL + +S + R+ N+ +I + K T Sbjct: 175 IV----------VKEEVSLAVLPEEARSLVEPGDDSGDTPRDHNKNVSIYTHITLKGETW 224 Query: 225 KKKDKGNKGFHSKFVSVDENRFFEEKQI---ATFPYIVGRYRVRADEIYGRSPAMEALPT 281 + V + ++ R+ E YGRS E L Sbjct: 225 N-----------VYQEVKGQIVPGSRGTYPKDKCAWLPIRFVKIDGENYGRSYVEEYLGD 273 Query: 282 IRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLK--PGYMNIGALSREGRSLFQPVQ 339 I+ L + + S + + +L P + ++ + ++L Q + Sbjct: 274 IKSLEGLSQAIVEGSAASAKVLFLVNPNGVTSSSELAEAPNGEFVDGVASDVQAL-QLQK 332 Query: 340 FGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQ 399 G+ E +N + E + F+L+ + + +A E E A +G + L Sbjct: 333 SGDFRVALETINTITERLEFAFMLN-SAIQRNGERVTAEEIRYMAGELEAALGGVYSILS 391 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVN 459 EF +++R + + + LPE P + +E + + Q ++ Sbjct: 392 QEFQLPLVNRIMFSMQRRKKLPELPKGTVSPTIVTGMEALG------RGNDLTKLDQFIS 445 Query: 460 TVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRRVME 518 T++++ P ++ A L++ EV+ +QQ+++Q+ + Sbjct: 446 TIMQI------PDAASRINWGNYMTRRATALGIDTDGLVKTDQEVQQEQQQQQMQQAMQS 499 Query: 519 EQHLQQQLQQTSQDIGAKAAGRA 541 Q + G +A Sbjct: 500 GVAPAVQAAGRMMEKGQPDGSQA 522 >gi|310005702|gb|ADP00089.1| head-tail connector protein [Cyanophage NATL1A-7] Length = 543 Score = 108 bits (270), Expect = 2e-21, Method: Composition-based stats. Identities = 75/515 (14%), Positives = 156/515 (30%), Gaps = 56/515 (10%) Query: 8 DIQDRFNYLKNQRGELNYWMEELTGFLYPY-------KNNAQLRM---WDTTGSEACIKL 57 +DR+ L R + + E + PY ++ W + G+++ + L Sbjct: 2 KARDRYAQLTRGRTQFLHTAVECSRLTLPYLVQEDLSSRPEHQKLHTPWQSVGAKSVVNL 61 Query: 58 SSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSG 117 ++ L + PP + L + + S S Sbjct: 62 AAKLMLALLPPQTSFFKLQIQDNKIGVEFDPKIRSEMD----LSFAKMERMVMDYISASN 117 Query: 118 FVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYR 177 + ++ G +M D + PL+ + + + + Sbjct: 118 DRVVVHQALKHLIVSGNALIFMGKDG----------LKNYPLNRYVCNRDGNGNICEIVT 167 Query: 178 EFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSK 237 + + + + +S + +++ ++ Y + + + + F + Sbjct: 168 KELISRKILGQDLPVPLPNSPGEDGYKTGSDDQDVEVYT-YVRLDDNGRWVWHQEAFDNI 226 Query: 238 FVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGR 297 T P++V R+ E YGR E L IR L L + Sbjct: 227 LPGSRSTAPK-----NTSPWLVLRFNTVDGEDYGRGRVEEFLGDIRSLEGLSQSLVEGSA 281 Query: 298 LSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR--EGRS----LFQPVQFGNPLPYHEELN 351 + + + KP + +GR + Q + + E++ Sbjct: 282 AASKVVFLVSPSS-----TTKPKTIADAGNGAIVQGRPDDVGVIQVGKTADFRTAQEQMM 336 Query: 352 RLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISREL 411 +L++ I FL+ + +A E E +G L L EF+ ++R L Sbjct: 337 QLEKRINEAFLV---LNVRQSERTTAEEVRLTQMELEQQLGGLFSLLTVEFLEPYLNRTL 393 Query: 412 DILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDP 471 IL +P+ P + V + L + Q S ++ T+ + T P Sbjct: 394 HILQRNKEIPKIPKESVRPQIIAGV---NALGRGQ---DEESLIRFAQTLSQ----TVGP 443 Query: 472 SCMDH-MDTDRVSRFSLWATNTPA-VLIRDTAEVE 504 M +D + A A LI+ + Sbjct: 444 EMMVKYLDPGEYVKRLAAAQGIDALNLIKSPETMA 478 >gi|37956836|gb|AAP34103.1| gene 8 [Enterobacteria phage T7] gi|37956889|gb|AAP34155.1| gene 8 [Enterobacteria phage T7] Length = 536 Score = 108 bits (270), Expect = 3e-21, Method: Composition-based stats. Identities = 89/551 (16%), Positives = 157/551 (28%), Gaps = 75/551 (13%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSE 52 + + AK + + LKN R + + P +NA W G+ Sbjct: 8 LAEDGAKSV---YERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGAR 64 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 L+S L + P Q W L S + L D +K V E V + + E Sbjct: 65 GLNNLASKLMLALF-PMQTWMRLTISEYEAKQLLSDPDGLAK-VDEGLSMVERIIMNYIE 122 Query: 113 RSRSGFVGC-LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNV 171 S L +V G Y+ + LS+ + + Sbjct: 123 ---SNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKL----YRLSSYVVQRDAFGN 175 Query: 172 VDSVYREFTF-TVDQIVSKWGDKVLSSKMKSALA-----RNENERFTIIHAVYPKSLTDK 225 V T DQI +G L ++ A+ + +E + +Y + + Sbjct: 176 V------LQMVTRDQIA--FG--ALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGE 225 Query: 226 KKDKGNKGFHSKFVSVDENRF-FEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 + V++ + PYI R E YGRS E L + Sbjct: 226 YLR---------YEEVEDMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDL 276 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 R L + + +S + + L R F ++ Sbjct: 277 RSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQA 336 Query: 343 PLPYHEELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 + ++ ++ + F+L+ V +A E E +G + L E Sbjct: 337 DFTVAKAVSDAIEARLSFAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQE 395 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 ++ L L + +PE P +E + + + + V Sbjct: 396 LQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCVTAW 449 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQH 521 L DP ++ + A I T E ++Q Sbjct: 450 AALAPMRNDPD----INLAMIKLRIANAIGIDTSGILLTEE--------------QKQQK 491 Query: 522 LQQQLQQTSQD 532 + QQ Q D Sbjct: 492 MAQQSMQMGMD 502 >gi|194100395|ref|YP_002003970.1| gp8 [Enterobacteria phage 13a] gi|193201442|gb|ACF15919.1| gp8 [Enterobacteria phage 13a] Length = 536 Score = 108 bits (269), Expect = 3e-21, Method: Composition-based stats. Identities = 89/551 (16%), Positives = 156/551 (28%), Gaps = 75/551 (13%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSE 52 + + AK + + LKN R + + P +NA W G+ Sbjct: 8 LAEEGAKSV---YERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGAR 64 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 L+S L + P Q W L S + L D +K V E V + + E Sbjct: 65 GLNNLASKLMLALF-PMQTWMRLTISEYEAKQLLSDPDGLAK-VDEGLSMVERIIMNYIE 122 Query: 113 RSRSGFVGC-LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNV 171 S L +V G Y+ + LS+ + + Sbjct: 123 ---SNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKL----YRLSSYVVQRDAFGN 175 Query: 172 VDSVYREFTF-TVDQIVSKWGDKVLSSKMKSALA-----RNENERFTIIHAVYPKSLTDK 225 V T DQI +G L ++ A+ + +E + +Y + + Sbjct: 176 V------LQMVTRDQIA--FG--ALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGE 225 Query: 226 KKDKGNKGFHSKFVSVDENRF-FEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 + V+ + PYI R E YGRS E L + Sbjct: 226 YIR---------YEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDL 276 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 R L + + +S + + L R F ++ Sbjct: 277 RSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQA 336 Query: 343 PLPYHEELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 + ++ ++ + F+L+ V +A E E +G + L E Sbjct: 337 DFTVAKAVSDAIEARLSFAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQE 395 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 ++ L L + +PE P +E + + + + V Sbjct: 396 LQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCVAAW 449 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQH 521 L DP ++ + A I T E ++Q Sbjct: 450 AALAPMRDDPD----INLAMIKLRIANAIGIDTSGILLTEE--------------QKQQK 491 Query: 522 LQQQLQQTSQD 532 + QQ Q D Sbjct: 492 MAQQSMQMGMD 502 >gi|9627467|ref|NP_041995.1| head-tail connector protein [Enterobacteria phage T7] gi|138414|sp|P03728|VHTJ_BPT7 RecName: Full=Head-to-tail joining protein gi|15602|emb|CAA24425.1| unnamed protein product [Enterobacteria phage T7] gi|37956678|gb|AAP33948.1| gene 8 [Enterobacteria phage T7] gi|265524999|gb|ACY75862.1| head-to-tail joining protein [Enterobacteria phage T7] Length = 536 Score = 108 bits (269), Expect = 4e-21, Method: Composition-based stats. Identities = 89/551 (16%), Positives = 156/551 (28%), Gaps = 75/551 (13%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSE 52 + + AK + + LKN R + + P +NA W G+ Sbjct: 8 LAEDGAKSV---YERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGAR 64 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 L+S L + P Q W L S + L D +K V E V + + E Sbjct: 65 GLNNLASKLMLALF-PMQTWMRLTISEYEAKQLLSDPDGLAK-VDEGLSMVERIIMNYIE 122 Query: 113 RSRSGFVGC-LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNV 171 S L +V G Y+ + LS+ + + Sbjct: 123 ---SNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKL----YRLSSYVVQRDAFGN 175 Query: 172 VDSVYREFTF-TVDQIVSKWGDKVLSSKMKSALA-----RNENERFTIIHAVYPKSLTDK 225 V T DQI +G L ++ A+ + +E + +Y + + Sbjct: 176 V------LQMVTRDQIA--FG--ALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGE 225 Query: 226 KKDKGNKGFHSKFVSVDENRF-FEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 + V+ + PYI R E YGRS E L + Sbjct: 226 YLR---------YEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDL 276 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 R L + + +S + + L R F ++ Sbjct: 277 RSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQA 336 Query: 343 PLPYHEELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 + ++ ++ + F+L+ V +A E E +G + L E Sbjct: 337 DFTVAKAVSDAIEARLSFAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQE 395 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 ++ L L + +PE P +E + + + + V Sbjct: 396 LQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCVTAW 449 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQH 521 L DP ++ + A I T E ++Q Sbjct: 450 AALAPMRDDPD----INLAMIKLRIANAIGIDTSGILLTEE--------------QKQQK 491 Query: 522 LQQQLQQTSQD 532 + QQ Q D Sbjct: 492 MAQQSMQMGMD 502 >gi|148724480|ref|YP_001285446.1| head to tail connector [Cyanophage Syn5] gi|145588125|gb|ABP87944.1| head to tail connector [Synechococcus phage Syn5] Length = 542 Score = 107 bits (267), Expect = 5e-21, Method: Composition-based stats. Identities = 77/544 (14%), Positives = 165/544 (30%), Gaps = 42/544 (7%) Query: 8 DIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSS 59 Q R++ ++ R + PY + + + GS+ LSS Sbjct: 4 LAQARYSAMRADREDFLDMARRCAALTLPYLLTEDGHASGGRLQQPYQSLGSKGVNALSS 63 Query: 60 LLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFV 119 L + P + L + + + ++ ++ + S Sbjct: 64 KLMLSLFPIQTSFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDR-- 121 Query: 120 GCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREF 179 L + ++ TG + A + PL + + V + Sbjct: 122 VQLTAAMKHLIV--TGNVLVFAGKKTLKV--------YPLDRYVIERDGDGNVIEIITRE 171 Query: 180 TFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK----KKDKGNKGFH 235 + +++ + L S + +F + ++ + K G +H Sbjct: 172 LVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCKLVDGQHRWH 231 Query: 236 SKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQF 295 + + + P++ R+ V E YGR E + L+ L + Sbjct: 232 QECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEG 291 Query: 296 GRLSLHPPTIAVSEAKQRNFDL-KPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLK 354 + + A + L + G I E S+ Q + + E + L Sbjct: 292 SAAAAKVVFMVSPSATTKPQSLARAGTGAIIQGRAEDVSVVQANKGADFRTVQEMIRDLS 351 Query: 355 ESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDIL 414 + I F L + +A E E E + + G L E + ++R+L ++ Sbjct: 352 QRISDAF---LILNVRQSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKLHLM 408 Query: 415 DSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCM 474 LP P + L + E A+ ++ + TV + P + Sbjct: 409 QRSKQLPSLPKGLVMPT------VVAGLGGVGRGEDRAALIEFMQTVGQ----AMGPEAL 458 Query: 475 DH-MDTDRVSRFSLWATNTPA-VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQD 532 +D + A+ L++ + + + ++ Q++ M + Q Q Sbjct: 459 QQFIDPTEFLKRLAAASGIDTLNLVKSPETMAN--EAQQAQQQQMTASLMGQAGQLAKSP 516 Query: 533 IGAK 536 IG K Sbjct: 517 IGEK 520 >gi|37956731|gb|AAP34000.1| gene 8 [Enterobacteria phage T7] gi|37956781|gb|AAP34049.1| gene 8 [Enterobacteria phage T7] Length = 536 Score = 107 bits (267), Expect = 5e-21, Method: Composition-based stats. Identities = 89/551 (16%), Positives = 156/551 (28%), Gaps = 75/551 (13%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSE 52 + + AK + + LKN R + + P +NA W G+ Sbjct: 8 LAEDGAKSV---YERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGAR 64 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 L+S L + P Q W L S + L D +K V E V + + E Sbjct: 65 GLNNLASKLMLALF-PMQTWMRLTISEYEAKQLLSDPDGLAK-VDEGLSMVERIIMNYIE 122 Query: 113 RSRSGFVGC-LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNV 171 S L +V G Y+ + LS+ + + Sbjct: 123 ---SNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKL----YRLSSYVVQRDAFGN 175 Query: 172 VDSVYREFTF-TVDQIVSKWGDKVLSSKMKSALA-----RNENERFTIIHAVYPKSLTDK 225 V T DQI +G L ++ A+ + +E + +Y + + Sbjct: 176 V------LQMVTRDQIA--FG--ALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGE 225 Query: 226 KKDKGNKGFHSKFVSVDENRF-FEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 + V+ + PYI R E YGRS E L + Sbjct: 226 YLR---------YEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDL 276 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 R L + + +S + + L R F ++ Sbjct: 277 RSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQA 336 Query: 343 PLPYHEELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 + ++ ++ + F+L+ V +A E E +G + L E Sbjct: 337 DFTVAKAVSDAIEARLSFAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQE 395 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 ++ L L + +PE P +E + + + + V Sbjct: 396 LQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCVTAW 449 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQH 521 L DP ++ + A I T E ++Q Sbjct: 450 AALAPMRDDPD----INLAMIKLRIANAIGIDTSGILLTEE--------------QKQQK 491 Query: 522 LQQQLQQTSQD 532 + QQ Q D Sbjct: 492 MVQQSMQMGMD 502 >gi|30387485|ref|NP_848294.1| head-to-tail joining protein [Yersinia pestis phage phiA1122] gi|30314122|gb|AAP20530.1| head-to-tail joining protein [Yersinia pestis phage phiA1122] Length = 536 Score = 106 bits (265), Expect = 8e-21, Method: Composition-based stats. Identities = 89/551 (16%), Positives = 156/551 (28%), Gaps = 75/551 (13%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSE 52 + + AK + + LKN R + + P +NA W G+ Sbjct: 8 LAEDGAKSV---YERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGAR 64 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 L+S L + P Q W L S + L D +K V E V + + E Sbjct: 65 GLNNLASKLMLALF-PMQTWMRLTISEYEAKQLLSDPDGLAK-VDEGLSMVERIIMNYIE 122 Query: 113 RSRSGFVGC-LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNV 171 S L +V G Y+ + LS+ + + Sbjct: 123 ---SNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKL----YRLSSYVVQRDAFGN 175 Query: 172 VDSVYREFTF-TVDQIVSKWGDKVLSSKMKSALA-----RNENERFTIIHAVYPKSLTDK 225 V T DQI +G L ++ A+ + +E + +Y + + Sbjct: 176 V------LQMVTRDQIA--FG--ALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASGE 225 Query: 226 KKDKGNKGFHSKFVSVDENRF-FEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 + V+ + PYI R E YGRS E L + Sbjct: 226 YLR---------YEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDL 276 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 R L + + +S + + L R F ++ Sbjct: 277 RSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQA 336 Query: 343 PLPYHEELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 + ++ ++ + F+L+ V +A E E +G + L E Sbjct: 337 DFTVAKAVSDAIEARLSFAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQE 395 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 ++ L L + +PE P +E + + + + V Sbjct: 396 LQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCVTAW 449 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQH 521 L DP ++ + A I T E ++Q Sbjct: 450 AALAPMRDDPD----INLAMIKLRIANAIGIDTSGILLTEE--------------QKQQK 491 Query: 522 LQQQLQQTSQD 532 + QQ Q D Sbjct: 492 MAQQSMQMGMD 502 >gi|77118196|ref|YP_338118.1| head to tail connector [Enterobacteria phage K1F] gi|72527940|gb|AAZ72992.1| head to tail connector [Enterobacteria phage K1F] gi|83308148|emb|CAJ29381.1| gp8 protein [Enterobacteria phage K1F] Length = 522 Score = 106 bits (265), Expect = 9e-21, Method: Composition-based stats. Identities = 92/562 (16%), Positives = 170/562 (30%), Gaps = 64/562 (11%) Query: 1 MNQR---SAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTT 49 M +R +A+ + ++ LKN R + P + W Sbjct: 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAV 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ L++ L + P W L S + +A ++ V E V L Sbjct: 61 GARCLNNLAAKLMLALFPQS-PWMRLTVSEYEAKTLSQDSEAAAR-VDEGLAMVERVLMA 118 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 + E + F L ++ G Y+ E+G +R L + + + Sbjct: 119 YMETNS--FRVPLFEALKQLIVSGNCLLYIPE--PEQGTYSPMR--MYRLVSYVVQRDAF 172 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDK 229 + + + K L +KS L ++ E T + D + Sbjct: 173 GNILQIV---------TIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQDDE--- 220 Query: 230 GNKGFHSKFVSVDENRFFEEKQIAT------FPYIVGRYRVRADEIYGRSPAMEALPTIR 283 ++ +E E PYI R E YGRS E L + Sbjct: 221 --------YLRYEEVEGIEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLN 272 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP 343 L + + +++ + + L R F + G Sbjct: 273 SLETITEAITKMAKVASKVVGLVNPNGITQPRRLNKAATGEFVAGRVEDINFLQLTKGQD 332 Query: 344 LPYHEELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEF 402 + + +++ + FLL+ V + +A E E A +G + E Sbjct: 333 FTIAKSVADAIEQRLGWAFLLN-SAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQEL 391 Query: 403 IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 ++ ++ L S G +P+ P +E L + Q E + Q VN + Sbjct: 392 QLPIVRVLMNQLQSAGMIPDLPKEAVEPTVSTGLE---ALGRGQDLEKLT---QAVNMMT 445 Query: 463 ELGVKTGDPSCMDHMDTDRVSRFSLWATNTP-AVLIRDTAEVEDIRQQREVQRRVMEEQH 521 L + DP ++ + L A A L+ E + + +Q Sbjct: 446 GLQPLSQDPD----INLPTLKLRLLNALGIDTAGLLLTQDE------KIQRMAEQSSQQA 495 Query: 522 LQQQLQQTSQDIGAKAAGRAME 543 + Q ++GA A E Sbjct: 496 VVQGASAAGANMGAAVGQGAGE 517 >gi|194473831|ref|YP_002048655.1| head-to-tail joining protein [Morganella phage MmP1] gi|194307052|gb|ACF42034.1| head-to-tail joining protein [Morganella phage MmP1] Length = 543 Score = 106 bits (265), Expect = 1e-20, Method: Composition-based stats. Identities = 77/558 (13%), Positives = 166/558 (29%), Gaps = 69/558 (12%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSEACIKLSSLL 61 + ++ LKN R E + P +NA W + G+ L+S L Sbjct: 17 KAAYDRLKNDRAPYETRAENCAKYTIPSLFPKSSDNASTDYTTPWQSAGARGLNNLASKL 76 Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC 121 + P Q W L S + + + E+ +K V V + + E + + Sbjct: 77 MLALF-PMQTWMKLTISEFSAKELVGNEEGLAK-VDAALSMVERIIMNYIETNS--YRVA 132 Query: 122 LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTF 181 L ++ G Y+ + I+ +P + + V Sbjct: 133 LFEGLKQLIVAGNVLLYLPPPEESDEGYNPIKVYKLP--SFVCQRDSFGNV--------- 181 Query: 182 TVDQIVSK----WGDKVLSSKMKSALA-----RNENERFTIIHAVYPKSLTDKKKDKGNK 232 QIV++ +G L ++ + + +E T+ +Y + + Sbjct: 182 --LQIVTEDKIAFG--ALDEDIRKMVEASGGEKKPDEEITVYTHIYLDDESGQYL----- 232 Query: 233 GFHSKFVSVDENRFFEEKQ------IATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLN 286 K+ V+ PYI R + E YGRS E L ++ L Sbjct: 233 ----KYEEVEGEEI---AGTDAAYPYEANPYIPVRMVRLSGESYGRSYCEEYLGDLKSLE 285 Query: 287 ETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPY 346 + + ++ + + + + F ++ Sbjct: 286 NLHEAMVKMSMIAAKVVGLVNPAGMTQIRQVSKADTGDYVPGKPEDIHFLQLEKQADFSV 345 Query: 347 HEELNR-LKESIRSLFLLDLFQVLDDKASR-SAAESMEKTREKGAFVGPLIGGLQSEFIG 404 + + ++ + F+L+ + A R +A E E +G + L E Sbjct: 346 AKTIADNIEARLSFAFMLN--SAVQRTAERVTAEEIRYVASELEDTLGGVYSNLSQELQL 403 Query: 405 AMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVEL 464 ++ L+ L + +PE P +E + + + + + L Sbjct: 404 PIVKVLLNQLQATAKIPELPQEAVEPAISTGLEAIG------RGQDLDRLERCIAAWAAL 457 Query: 465 GVKTGDPSCMDHMDTDRVSRFSLWATNTP-AVLIRDTAEVEDIRQQREVQRRVMEEQHLQ 523 DP ++ + A A ++ + + + +Q+ +M + Sbjct: 458 APMANDPD----INLSTIKLRIANAIGIDTAGILLTEEQKQQKLAEAAMQQGMMTGANQL 513 Query: 524 QQLQQTSQDIGAKAAGRA 541 +A +A Sbjct: 514 GGGMAGMATESPEALAQA 531 >gi|68299738|ref|YP_249587.1| Head-to-tail joining protein [Vibriophage VP4] gi|66473277|gb|AAY46286.1| head-to-tail joining protein [Vibriophage VP4] Length = 532 Score = 105 bits (263), Expect = 1e-20, Method: Composition-based stats. Identities = 80/512 (15%), Positives = 155/512 (30%), Gaps = 55/512 (10%) Query: 13 FNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 +N LKN RG E+ + P + + W + G+ L+S L Sbjct: 18 YNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLA 77 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 + P G + L S + + ++ V + E + F L + Sbjct: 78 LFPVGSSFFKLNVSELEVKQSITS-PEELTEIATGLAMVERICMNYMESNS--FRPTLHA 134 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVD 184 ++ G Y+ + +G + L N + + + V Sbjct: 135 AIKQLLVAGNVLLYIPSTEQVEGQSNAPKL--YKLHNFVVERDAYDNV-----------L 181 Query: 185 QIV--SKWGDKVLSSKMKSALAR-----NENERFTIIHAVYPKSLTDKKKDKGNKGFHSK 237 QIV K L ++ +L N +E TI VY +D F S Sbjct: 182 QIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHVY--------RDPEAMVFRSY 233 Query: 238 FVSVDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQF 295 E E + + P+I R +E YGRS E L ++ L + + Sbjct: 234 QEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKM 293 Query: 296 GRLSLHPPTIA-VSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLK 354 +S + Q K + A ++ +FQ ++ + + ++ Sbjct: 294 SMISSKVLFFVNPNGVTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIE 353 Query: 355 ESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDIL 414 + + F+L+ V +A E E +G + L E ++ L L Sbjct: 354 KRLSYAFMLN-SAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKEL 412 Query: 415 DSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCM 474 + +P P +E + + ++ +++L Sbjct: 413 QATSKIPNLPKEAVEPAIATGLEALG------RGHDLNKLNVFIDYMIKLAGLQD----- 461 Query: 475 DHMDTDRVSRFSLWATNTPAV-LIRDTAEVED 505 D ++ V + LI + + Sbjct: 462 DDINLLDVKMRLANSLGMDTTGLILTQQDKQA 493 >gi|281416195|ref|YP_003347930.1| head-to-tail joining protein [Vibrio phage N4] gi|325171309|ref|YP_004251280.1| head-to-tail joining protein [Vibrio phage ICP3] gi|237701502|gb|ACR16495.1| head-to-tail joining protein [Vibrio phage N4] gi|323512015|gb|ADX87477.1| head-to-tail joining protein [Vibrio phage ICP3] gi|323512160|gb|ADX87619.1| head-to-tail joining protein [Vibrio phage ICP3_2008_A] gi|323512208|gb|ADX87666.1| head-to-tail joining protein [Vibrio phage ICP3_2007_A] Length = 532 Score = 105 bits (262), Expect = 2e-20, Method: Composition-based stats. Identities = 80/512 (15%), Positives = 155/512 (30%), Gaps = 55/512 (10%) Query: 13 FNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 +N LKN RG E+ + P + + W + G+ L+S L Sbjct: 18 YNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLA 77 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 + P G + L S + + ++ V + E + F L + Sbjct: 78 LFPVGSSFFKLNVSELEVKQSITS-PEELTEIATGLAMVERICMNYMESNS--FRPTLHA 134 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVD 184 ++ G Y+ + +G + L N + + + V Sbjct: 135 AIKQLLVAGNVLLYIPSTEQVEGQSNAPKL--YKLHNFVVERDAYDNV-----------L 181 Query: 185 QIV--SKWGDKVLSSKMKSALAR-----NENERFTIIHAVYPKSLTDKKKDKGNKGFHSK 237 QIV K L ++ +L N +E TI VY +D F S Sbjct: 182 QIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVY--------RDPEAMVFRSY 233 Query: 238 FVSVDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQF 295 E E + + P+I R +E YGRS E L ++ L + + Sbjct: 234 QEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKM 293 Query: 296 GRLSLHPPTIA-VSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLK 354 +S + Q K + A ++ +FQ ++ + + ++ Sbjct: 294 SMISSKVLFFVNPNGVTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIE 353 Query: 355 ESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDIL 414 + + F+L+ V +A E E +G + L E ++ L L Sbjct: 354 KRLSYAFMLN-SAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKEL 412 Query: 415 DSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCM 474 + +P P +E + + ++ +++L Sbjct: 413 QATSKIPNLPKEAVEPAIATGLEALG------RGHDLNKLNVFIDYMIKLAGLQD----- 461 Query: 475 DHMDTDRVSRFSLWATNTPAV-LIRDTAEVED 505 D ++ V + LI + + Sbjct: 462 DDINLLDVKMRLANSLGMDTTGLILTQQDKQA 493 >gi|323512062|gb|ADX87523.1| head-to-tail joining protein [Vibrio phage ICP3_2009_B] gi|323512111|gb|ADX87571.1| head-to-tail joining protein [Vibrio phage ICP3_2009_A] Length = 532 Score = 105 bits (262), Expect = 2e-20, Method: Composition-based stats. Identities = 80/512 (15%), Positives = 155/512 (30%), Gaps = 55/512 (10%) Query: 13 FNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 +N LKN RG E+ + P + + W + G+ L+S L Sbjct: 18 YNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLA 77 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 + P G + L S + + ++ V + E + F L + Sbjct: 78 LFPVGSSFFKLNVSELEVKQSITS-PEELTEIATGLAMVERICMNYMESNS--FRPTLHA 134 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVD 184 ++ G Y+ + +G + L N + + + V Sbjct: 135 AIKQLLVAGNVLLYIPSTEQVEGQSNAPKL--YKLHNFVVERDAYDNV-----------L 181 Query: 185 QIV--SKWGDKVLSSKMKSALAR-----NENERFTIIHAVYPKSLTDKKKDKGNKGFHSK 237 QIV K L ++ +L N +E TI VY +D F S Sbjct: 182 QIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVY--------RDPEAMVFRSY 233 Query: 238 FVSVDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQF 295 E E + + P+I R +E YGRS E L ++ L + + Sbjct: 234 QEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKM 293 Query: 296 GRLSLHPPTIA-VSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLK 354 +S + Q K + A ++ +FQ ++ + + ++ Sbjct: 294 SMISSKVLFFVNPNGVTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIE 353 Query: 355 ESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDIL 414 + + F+L+ V +A E E +G + L E ++ L L Sbjct: 354 KRLSYAFMLN-SAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKEL 412 Query: 415 DSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCM 474 + +P P +E + + ++ +++L Sbjct: 413 QATSKIPNLPKEAVEPAIATGLEALG------RGHDLNKLNVFIDYMIKLAGLQD----- 461 Query: 475 DHMDTDRVSRFSLWATNTPAV-LIRDTAEVED 505 D ++ V + LI + + Sbjct: 462 DDINLLDVKMRLANSLGMDTTGLILTQQDKQA 493 >gi|194100340|ref|YP_002003770.1| gp8 [Enterobacteria phage EcoDS1] gi|193201335|gb|ACF15814.1| gp8 [Enterobacteria phage EcoDS1] Length = 522 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 93/564 (16%), Positives = 169/564 (29%), Gaps = 68/564 (12%) Query: 1 MNQR---SAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTT 49 M +R +A+ + ++ LKN R + P + W + Sbjct: 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQSV 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ L++ L + P W L S + +A ++ V E V L Sbjct: 61 GARCLNNLAAKLMLALFPQS-PWMRLTVSEYEAKTLSQDSEAAAR-VDEGLAMVERVLMA 118 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 + E + F L ++ G Y+ E+G +R L + + + Sbjct: 119 YMETNS--FRVPLFEALKQLIVSGNCLLYIPE--PEQGTYSPMR--MYRLVSYVVQRDAF 172 Query: 170 NVVDSVYREFTFTVDQIV--SKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK 227 + QIV K L +KS L ++ E T + D + Sbjct: 173 GNI-----------LQIVTLDKVAFSALPEDVKSQLNTDDYEPDTELEVYTHIYRQDDE- 220 Query: 228 DKGNKGFHSKFVSVDENRFFEEKQIAT------FPYIVGRYRVRADEIYGRSPAMEALPT 281 ++ +E E PYI R E YGRS E L Sbjct: 221 ----------YLRYEEVEGIEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGD 270 Query: 282 IRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG 341 + L + + +++ + + L R F + G Sbjct: 271 LNSLETITEAITKMAKVASKVVGLVNPNGITQPRRLNKAATGEFVAGRVEDINFLQLTKG 330 Query: 342 NPLPYHEELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQS 400 + + +++ + FLL+ V + +A E E A +G + Sbjct: 331 QDFTIAKSVADAIEQRLGWAFLLN-SAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQ 389 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E ++ ++ L S G +P+ P +E L + Q E + Q VN Sbjct: 390 ELQLPIVRVLMNQLQSAGMIPDLPKEAVEPTVSTGLE---ALGRGQDLEKLT---QAVNM 443 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTP-AVLIRDTAEVEDIRQQREVQRRVMEE 519 + L DP ++ + L A A L+ E + + + Sbjct: 444 MTGLQPLQQDPD----INLPTLKLRLLNALGIDTAGLLLTQDE------KLQRMAEQSAQ 493 Query: 520 QHLQQQLQQTSQDIGAKAAGRAME 543 + ++GA A E Sbjct: 494 GAVVNGASAAGANMGAAVGQGAGE 517 >gi|317487284|ref|ZP_07946079.1| hypothetical protein HMPREF0179_03442 [Bilophila wadsworthia 3_1_6] gi|316921474|gb|EFV42765.1| hypothetical protein HMPREF0179_03442 [Bilophila wadsworthia 3_1_6] Length = 554 Score = 103 bits (256), Expect = 1e-19, Method: Composition-based stats. Identities = 79/560 (14%), Positives = 168/560 (30%), Gaps = 65/560 (11%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLL 61 + R+ L R PY + ++ + G+ L+S L Sbjct: 21 ETRYTELSQDRAPYLDRARRCAELTIPYLIPPDDLAQGQELPSLYQSVGANGVTNLASKL 80 Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSK-KVREWCDQVTDTLFGFRERSRSGFVG 120 + PP + L + + D + K+ + ++ + + SG Sbjct: 81 LLTMLPPNEPCFRLRVNNLVVEREEENADKEFRTKIEKALSRIEQAVLA--DIEASGDRP 138 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 + ++ G + D +KGL PLS + + + E T Sbjct: 139 VVAEGNQHLIVAGN---VLYHDDPKKGLRL------FPLSRYVVERDPMGTPVEIVVEET 189 Query: 181 FTVDQIVSKWGDKVLSSKMKSALAR------NENERFTIIHAVYPKSLTDKKKDKGNKGF 234 +D + + ++ +++ A ++R + +Y KK Sbjct: 190 VNLDTL-----PEDVAERIREAADTLGQPSIKGDDRKDVN--IYTHLKRGPKK------- 235 Query: 235 HSKFVSVDENRF-FEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNE 291 S + + E P++ R A E YGRS L + L Sbjct: 236 WSVYQECRGVKLPGSEGSYKLEACPWLPVRMYSIAGENYGRSFVELQLGDLGSLESLCQS 295 Query: 292 LAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPY-HEEL 350 L + +S + L F VQ G ++ Sbjct: 296 LVEGSAVSAKVVGLVNPNGVTDPKALAESANGDMIEGNADDVAFLQVQKGADFQVVAAQI 355 Query: 351 NRLKESIRSLFLLDLFQVLD----DKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 RL++ +++ F ++D D +A E +E +G + + EF Sbjct: 356 QRLEQRLKTA-----FLMMDGVRRDAERVTAEEIRVIAQELETGLGGVYTLISQEFQLPY 410 Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 I+ + + Q +PE P + E A + Q + ++ G Sbjct: 411 IASRMATMTRQKRIPELPKGTVTPSIVTGFE----------AIGRGNDKQKLLEFLKAGT 460 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEV-EDIRQQREVQRRVMEEQHLQQ 524 + S + ++ A L++D E+ ++ + ++ + M + L Sbjct: 461 ELMGESFLGLLNPQNAVTRLASAMGISTEGLVKDEEELAQERQAAQQQAQGQMMMEKLGP 520 Query: 525 QLQQTSQDIGAKAAGRAMEK 544 + + + A++ Sbjct: 521 EALRQIGGMAQAGNAEALQG 540 >gi|281416306|ref|YP_003347546.1| head-to-tail joining protein [Klebsiella phage KP32] gi|262410425|gb|ACY66690.1| head-to-tail joining protein [Klebsiella phage KP32] Length = 461 Score = 100 bits (248), Expect = 8e-19, Method: Composition-based stats. Identities = 72/474 (15%), Positives = 139/474 (29%), Gaps = 42/474 (8%) Query: 68 PGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC-LQSFY 126 P Q W L S + L + +K V E V + + E S L Sbjct: 6 PMQSWMKLTISEYEAKNLLGDAEGLAK-VDEGLSMVERIIMNYIE---SNSYRVTLFECL 61 Query: 127 TSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQI 186 + G Y+ + + + R L++ + + V + T+D+I Sbjct: 62 KQLCVAGNALLYL-PEPEGYTPMKLYR-----LNSYVVQRDAFGNVLQIV-----TLDKI 110 Query: 187 V-SKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENR 245 + + V S + + E+ + VY D +SK+ V E Sbjct: 111 AFNALPEDVRSQVEAAQGEQKEDAEIDVYTHVYLNEAGDG---------YSKYEEVAEEV 161 Query: 246 F-FEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 E + PYI R E YGRS E L ++ L + + ++ Sbjct: 162 VPGSEAEYPLEECPYIPVRMVRIDGESYGRSYVEEYLGDLKSLENLQESIVKMAMITAKV 221 Query: 303 PTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNR-LKESIRSLF 361 + + L R+ F ++ + ++ ++ + F Sbjct: 222 IGLVDPAGITQVRRLTAAQSGAFVPGRKQDIEFLQLEKSGDFTVAKNVSDTIEARLSYAF 281 Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 +L+ V +A E E +G + L E ++ L L + +P Sbjct: 282 MLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIP 340 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 E P +E + + + + + L GD D ++ Sbjct: 341 ELPKEAVEPTISTGLEAIG------RGQDLDKLERCIAAWSALKALEGD----DDLNLAN 390 Query: 482 VSRFSLWATNTP-AVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIG 534 + A A ++ + + Q+ Q + Q T Sbjct: 391 LKLRIANAIGLDTAGMLLTQEQKNALMAQQGAQIATQQGAAALGQGIATQATAS 444 >gi|310005791|gb|ADP00177.1| head-tail connector protein [Cyanophage NATL2A-133] Length = 528 Score = 98.6 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 82/556 (14%), Positives = 168/556 (30%), Gaps = 62/556 (11%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYP---YKNNAQLRM------WDTTGSEACIKLSSL 60 + R+N L R + E P +N W + G++ + LSS Sbjct: 5 RQRYNKLSTGREQFLNVAYECAELTIPTLIMRNETPPNYAQFKTPWQSIGAKGVVTLSSK 64 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 L + PP + L S + E ++ ++ + S Sbjct: 65 LMLGLLPPSTSFFKLQLDDSKLGVEVPPE--SKSELDLSFAKIERMIMEAIAASTDR--V 120 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 + + +V G YM D + PL+ + + + + Sbjct: 121 QIFTALKHLVVTGNALLYMGKDGMKM----------YPLNRYVVERDGNGDPVEIVTKEK 170 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 + + + + + ++ I + K K ++ H + Sbjct: 171 INKELLPKL-PLPLKGDGVVDDEQQGKD--VDIYTCI----KLTPKGWKWHQEVHDIMIP 223 Query: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 E + +K P++ R+ E YGR E L ++ L + L + + Sbjct: 224 GSEGKAPAKK----CPFLPLRFVTVDGEDYGRGRVEEFLGDLKSLEALMQALVEGSAAAA 279 Query: 301 HPPTIAVSEAKQRNFDL---KPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESI 357 + + L G + G G + Q + + ++ +N L++ + Sbjct: 280 KVVFTVSPSSVTKPQTLANAGNGAIIQGRPDDIG--VVQVGKTADFQTAYQLVNTLEKRL 337 Query: 358 RSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQ 417 F L + D +A E E +G L L +EF+ + R++ L Sbjct: 338 AEAF---LIMNVRDSERTTAEEVRMTQMELEQQLGGLFSLLTTEFLLPYLHRKMHTLTQS 394 Query: 418 GNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDH- 476 +P P V + L + Q + +Q + T+ + T P + Sbjct: 395 KQIPALPKGLVKPTI---VAGINALGRGQ---DRDALVQFITTIAQ----TMGPEALQRF 444 Query: 477 MDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAK 536 ++ D + A +V ++ + E Q+ + QQ G Sbjct: 445 VNADEAIKRLAAAQGI---------DVLNLVKSMEEQQAEQQAAQQQQMQASLMDQAGQL 495 Query: 537 AAGRAMEKKLTHDMME 552 A M+ + E Sbjct: 496 AGTPMMDPTKNPEGFE 511 >gi|313892489|ref|ZP_07826078.1| head-to-tail joining protein [Dialister microaerophilus UPII 345-E] gi|313119068|gb|EFR42271.1| head-to-tail joining protein [Dialister microaerophilus UPII 345-E] Length = 516 Score = 98.6 bits (244), Expect = 2e-18, Method: Composition-based stats. Identities = 72/506 (14%), Positives = 144/506 (28%), Gaps = 73/506 (14%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSEACIKLSSLL 61 + + LK R E + P + + + + G+ L+S L Sbjct: 14 KAVYERLKQARTPYIERAVECAKYTIPSLFPRDGSTGSTKFETPYQSVGARGVNNLASKL 73 Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC 121 + PP + L+ A Q E ++ + + DQ + Sbjct: 74 MLALFPPNANYFKLSPGDEAQQ-----ELDQTPQAKAQVDQALMKMESKIVE-----YAE 123 Query: 122 LQSFYTSV-----VEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVY 176 + ++ V TG + E G++ L+ + + V + Sbjct: 124 AHQYRVTLAEALKVLIVTGNDLLFLPPKEGGMKL------YKLNTYVLERDALGNVIQIV 177 Query: 177 REFTFTVDQIVSKWGDKVLSSKMKSALARNE-----NERFTIIHAVYPKSLTDKKKDKGN 231 V K L ++K + ++ + + I VY + Sbjct: 178 ---------AVDKISYVALPDEVKRMVDKSGTTPTTSTQVEIYTHVYLEDDQ-------- 220 Query: 232 KGFHSKFVSVDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 + S + E+ P+I R E YGRS E L + L Sbjct: 221 --YLSYQEYKGQIIPQSEQSYPKDKTPWIPLRMVKVDGESYGRSFVEEYLGDFKSLENLT 278 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDL---KPGYMNIGALSREGRSLFQPVQFGNPLPY 346 + + ++ + + R L K G G + G Q ++ + Sbjct: 279 KSIVEASLVAANILFLVNPNGVTRVRHLAKAKSGDFVSGRIEDIG--TLQINKYADLQVV 336 Query: 347 HEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 + ++ + F+L+ V +A E E +G + L E + Sbjct: 337 SSTIEQITARLSYAFMLN-SAVQRQGERVTAEEIRYVASELEDTLGGVYSILSQELQLPL 395 Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 + R L L S G LP E P +E + + + + + Sbjct: 396 VRRLLAQLMSLGQLPALEDGLVEPTITTGLEALG------RGHDLNKLITFMQLI----- 444 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNT 492 +P + + ++ A Sbjct: 445 -QQNPQQAQAIKWNEMTIMEATALGL 469 >gi|254505325|ref|ZP_05117473.1| hypothetical protein SADFL11_PLAS23 [Labrenzia alexandrii DFL-11] gi|222436169|gb|EEE42851.1| hypothetical protein SADFL11_PLAS23 [Labrenzia alexandrii DFL-11] Length = 490 Score = 96.6 bits (239), Expect = 9e-18, Method: Composition-based stats. Identities = 78/517 (15%), Positives = 166/517 (32%), Gaps = 61/517 (11%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYP-----YKNNAQLRM---WDTTGSEACIKLS 58 K +++R+ L+ +R + P +NA ++ + G+ + L+ Sbjct: 2 KSLKERYQNLQIKREPFLKRARDCAALTIPTLLPPEGHNATSKLPQPYQGLGARCVVTLA 61 Query: 59 SLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREW-CDQVTDTLFGFRERSRSG 117 S + P GQ + GL + L + + E T+ + E+ Sbjct: 62 SRMLVAFIPTGQPFFGLEVP---PELLLQEGLMEAPPDLEKGFALATNLITKEIEKKA-- 116 Query: 118 FVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYR 177 S ++ TG ++ + IR L + + + Sbjct: 117 -WRKPTSLTLELLV-STG-----NALERYMPDNSIR--VYRLDQYVVVRDLSGNL----- 162 Query: 178 EFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSK 237 + + V+K L + +S L ++ + I + K Sbjct: 163 -VELILREKVNK---ASLPEQTQSYLKASQEDDVEIFTCAKRHPDGWEIKQ--------- 209 Query: 238 FVSVDENRFFEEKQIA-TFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFG 296 V+ + T P+ R+ E YGR E + L+ + Sbjct: 210 --EVEGQIIEGMGGVTPTNPFNPLRWSAVPGEDYGRGKVEEHFSDLTYLDLLSKSMVDGS 267 Query: 297 RLSLHPPTIA---VSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRL 353 ++ T+ + + R + ++ + + E L Q +E+ R+ Sbjct: 268 AMATRHITMVRPNAAGSNLRKRFAEAKNGDVISGNPEDVDLKQFANVTGMQIAQQEIARI 327 Query: 354 KESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDI 413 + + FLL ++ + +A E E + +G + L + + A I + Sbjct: 328 TQELAQAFLLS-SSMIRNAERVTAQEVRMIAEELESVLGGVYSYLSQDMMSARIEALMTS 386 Query: 414 LDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSC 473 + + G LP P +L V L + + V + LQ + + P Sbjct: 387 MMAAGQLPPVLQMTQP---VLTVG-LEALERDKDVMRVQTVLQTLQAL--------PPDF 434 Query: 474 MDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQR 510 +D++D + + + P ++ E + RQQR Sbjct: 435 LDYLDIPDLLKTFMIGLGLPGK-VKTEQEAQQTRQQR 470 >gi|282857730|ref|ZP_06266939.1| head-to-tail joining protein [Pyramidobacter piscolens W5455] gi|282584400|gb|EFB89759.1| head-to-tail joining protein [Pyramidobacter piscolens W5455] Length = 534 Score = 96.6 bits (239), Expect = 1e-17, Method: Composition-based stats. Identities = 72/504 (14%), Positives = 149/504 (29%), Gaps = 46/504 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTG----FLYPYKNNAQLRM---WDTTGSEA 53 + + + + RF L R E+ + +L+P ++ + + G+E Sbjct: 10 LRRSARTTFKARFELLAGIRESYCQRAEQCSALTDPYLFPKDGVTGEKVASPYQSVGAEG 69 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 LSS + ++I PP + L + + + ++ +++ E Sbjct: 70 VTNLSSRILNIILPPNRPPFRLRVEKNPALPEEKRNWQQIEEGLAQLEKMVCDHIETLE- 128 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 R + + +V ++GIR S L N +S + + V Sbjct: 129 DRVVIAEAIPH------------LLVTGNVLLHVRKDGIRLHS--LRNYVVSRDPRGNVA 174 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKG 233 + + L S EN+R A Y + T K+ + Sbjct: 175 EIIVREKVDPRFL-------ALPLAT-STTDAPENDRRPEDKASYKELFTQIKRTENG-- 224 Query: 234 FHSKFVSVDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNE 291 S VD + P++ R + E YGR + L + L Sbjct: 225 -WSLQQEVDGKFVSKHGHYKKDECPWLPLRMYRVSGESYGRGYVEKYLGDHKSLEALTKA 283 Query: 292 LAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELN 351 + + + + L+ VQ N + + Sbjct: 284 IVEGAAACAKVVFLVSPNGTLKAKQLEEAGNLAILTGSAAEVSTVQVQKANDFQIAKAMA 343 Query: 352 R-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRE 410 L++ + +LL+ + + +A E +E +G L L EF + Sbjct: 344 DNLQQRLSRAYLLN-SAIQRNAERVTAEEIRYMAQELETALGGLYSMLSMEFQHPYVKLR 402 Query: 411 LDILDSQGNLPECEGADNPPVSLLK-VEYTSPLFKYQQAESVASALQGVNTVVELGVKTG 469 + + LP+ + +K V L + Q + + V KT Sbjct: 403 MKYMKEDALLPDLDQQYQEGKVGVKIVTGIDALGRGQ---DASRLTEWAGIVF----KTI 455 Query: 470 DPSC-MDHMDTDRVSRFSLWATNT 492 P + +++ + + Sbjct: 456 GPQVALPYINASAFMKALANSMGI 479 >gi|325272831|ref|ZP_08139168.1| head-to-tail joining protein [Pseudomonas sp. TJI-51] gi|324102036|gb|EGB99545.1| head-to-tail joining protein [Pseudomonas sp. TJI-51] Length = 450 Score = 95.9 bits (237), Expect = 2e-17, Method: Composition-based stats. Identities = 73/493 (14%), Positives = 156/493 (31%), Gaps = 63/493 (12%) Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 + PP + L E + L V+ ++ + E + Sbjct: 3 LLPPNSPFFRL-EIDEFTEEKLTSNPQMHADVQAGLAKIERAVQTEIETTA--IRVTGFE 59 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV-DSVYREFTFTV 183 ++ G G Y+ + G+++ PL + + V D V Sbjct: 60 LLKHLIVGGNGLVYL-------PQQGGMKF--YPLDRYVVRRDPMGNVLDIV-------- 102 Query: 184 DQIVSKWGDKVLSSKMKSALA---------RNENERFTIIHAVYPKSLTDKKKDKGNKGF 234 + + VL + +S + R+ N+ +I + K T Sbjct: 103 --VKEEVSLAVLPEEARSLVEPGDDSGDTPRDHNKNVSIYTHITLKGETWN--------- 151 Query: 235 HSKFVSVDENRF-FEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNE 291 + V ++ R+ E YGRS E L I+ L Sbjct: 152 --VYQEVKGQIVPGSRGTYPKDKCAWLPIRFVKIDGENYGRSYVEEYLGDIKSLEGLSQA 209 Query: 292 LAQFGRLSLHPPTIAVSEAKQRNFDLK--PGYMNIGALSREGRSLFQPVQFGNPLPYHEE 349 + + S + + +L P + ++ + ++L Q + G+ E Sbjct: 210 IVEGSAASAKVLFLVNPNGVTSSSELAEAPNGEFVDGVASDVQAL-QLQKSGDFRVALET 268 Query: 350 LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISR 409 +N + E + F+L+ + + +A E E A +G + L EF +++R Sbjct: 269 INTITERLEFAFMLN-SAIQRNGERVTAEEIRYMAGELEAALGGVYSILSQEFQLPLVNR 327 Query: 410 ELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTG 469 + + + LPE P + +E + + Q ++T++++ Sbjct: 328 IMFSMQRRKKLPELPKGTVSPTIVTGMEALG------RGNDLTKLDQFISTIMQI----- 376 Query: 470 DPSCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQ 528 P ++ A L++ EV+ +QQ+++Q+ + Q Sbjct: 377 -PDAASRINWGNYMTRRATALGIDTDGLVKTDQEVQQEQQQQQMQQAMQSGVAPAVQAAG 435 Query: 529 TSQDIGAKAAGRA 541 + G +A Sbjct: 436 RMMEKGQPDGSQA 448 >gi|158425212|ref|YP_001526504.1| head-to-tail joining protein [Azorhizobium caulinodans ORS 571] gi|158332101|dbj|BAF89586.1| head-to-tail joining protein [Azorhizobium caulinodans ORS 571] Length = 511 Score = 92.4 bits (228), Expect = 2e-16, Method: Composition-based stats. Identities = 75/504 (14%), Positives = 146/504 (28%), Gaps = 53/504 (10%) Query: 12 RFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLSS 63 R+ L R + P N + G+ L S L Sbjct: 11 RYTQLATIRSPYLERARDCATLTIPSLMPRAGHGAANDLPTPFQGMGARGVNNLGSKLLL 70 Query: 64 LITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQ 123 + PP Q + L A Q ++ R +V + Q+ + E Sbjct: 71 ALMPPNQPFFRLMLDDFALQELTGQDGMR-TEVEKALGQIERAVQTEVETGA--IRVSAF 127 Query: 124 SFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTV 183 ++ G Y++ G + R L + + V + Sbjct: 128 EALKQLLVAGNVLLYVQP----TGGVKVYR-----LDRYVVKRDPSGNVLEIV------- 171 Query: 184 DQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDE 243 I + L +++ L +R + + + + F V Sbjct: 172 --IHERVSPLALPEELQRKL---GEQRKGVQDTI----DLYTWIRRESGKFV-VHQEVKG 221 Query: 244 NRFFEEKQ---IATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 + P+I R+ E YGR E + +R L + + + Sbjct: 222 EKVPGTDGEWPTDKAPFIALRWAKIDGEDYGRGHVEEYIGDLRSLEALTRAIVEGAAAAA 281 Query: 301 HPPTIAVSEA--KQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIR 358 + +R P M + + ++E ++ Q +F + E + RL+ + Sbjct: 282 KVLFLVNPNGVTNERTISEAPN-MAVRSGNKEDVNVLQVEKFNDFRVALETVGRLEIRLS 340 Query: 359 SLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQG 418 FLL + D +A E E +G + L EF ++ R + ++ Sbjct: 341 QAFLLT-SSIQRDAERVTAEEIRVMAGELEDALGGVYSILAQEFQLPLVRRLIFQMEQDE 399 Query: 419 NLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMD 478 LP P L+K + + + + + V +L PS D Sbjct: 400 RLPSL------PPDLVKPSIITGMEALGRGHDLNRLMMFAKVVNDLLGPGALPSYAD--- 450 Query: 479 TDRVSRFSLWATNTPAVLIRDTAE 502 ++ + A + I + E Sbjct: 451 ARKLIERAGVALSVDTSDILKSDE 474 >gi|291334465|gb|ADD94119.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1161] gi|291334522|gb|ADD94175.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201] gi|291334658|gb|ADD94305.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695] gi|291334712|gb|ADD94358.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C890] gi|291336438|gb|ADD95993.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073] Length = 86 Score = 90.1 bits (222), Expect = 9e-16, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 35/90 (38%), Gaps = 5/90 (5%) Query: 406 MISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELG 465 MI R ++ + +++EY SPL K Q++ ++S ++ + + L Sbjct: 1 MIDRTFALILRKNLFRPAPEFLAGQD--IEIEYVSPLAKAQKSTELSSIMRAIEILGSLS 58 Query: 466 VKTGDPSCMDHMDTDRVSRFSLWATNTPAV 495 DH++ D++ R P Sbjct: 59 NVA---PVFDHINMDKLVRHLADIVGVPQK 85 >gi|291334262|gb|ADD93925.1| hypothetical protein [uncultured marine bacterium MedDCM-OCT-S08-C235] Length = 155 Score = 89.7 bits (221), Expect = 1e-15, Method: Composition-based stats. Identities = 22/113 (19%), Positives = 46/113 (40%), Gaps = 4/113 (3%) Query: 251 QIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEA 310 + PY+V R+ A E+YGR P + ++P I+ N + + + ++++ + Sbjct: 41 GEGSNPYVVFRWSKAAGEVYGRGPLLNSMPAIKTCNLVIEMILENAQMAISGMYQMEDDG 100 Query: 311 KQR--NFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLF 361 L PG + + S G + GN L ++++I + Sbjct: 101 IINVDTIQLLPGTIIPRSPSSRGLEPIK--NAGNFNVADLVLKDMRQNINEHY 151 >gi|33300841|ref|NP_877469.1| head-tail connector protein [Pseudomonas phage phiKMV] gi|195546675|ref|YP_002117756.1| hypothetical protein PT5_gp34 [Pseudomonas phage PT5] gi|33284812|emb|CAD44221.1| head-tail connector protein [Enterobacteria phage phiKMV] gi|158187636|gb|ABW23113.1| conserved hypothetical phage protein [Pseudomonas phage PT5] Length = 510 Score = 85.8 bits (211), Expect = 2e-14, Method: Composition-based stats. Identities = 61/443 (13%), Positives = 129/443 (29%), Gaps = 44/443 (9%) Query: 46 WDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTD 105 + + G+ L++ L+ + P G + +E A + D +V +V Sbjct: 48 FQSAGALLVNNLAAKLARSLFPTGIPFFR-SELTDAIRREADSRDTDITEVTAALARVDR 106 Query: 106 TLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS 165 ++ S + L ++ G Y ++D ++ L + + Sbjct: 107 KATQRLFQNAS--LAVLTQVIKLLIVTGNALLYRDSDAATV--------VAWSLRSYAVR 156 Query: 166 VNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK 225 + + + + + ++ L ++ + +T + Sbjct: 157 RDATGRWMDIVLKQRYKSKDLDEEYKQD-LMRAGRNLSGSGSVDLYTHVQ---------- 205 Query: 226 KKDKGNKGFHSKFVSVDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 +K + + +D R +E + PYIV + + E YGR + + Sbjct: 206 RKKGTAMEYAELYHEIDGVRVGKEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFA 265 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNF---------DLKPGYMNIGALSREGRSL 334 +L+ +L + SL V EAK D PG G Sbjct: 266 KLSLLSEKLGLYELESLEV-LNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERG--- 321 Query: 335 FQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPL 394 + + L + + F+ D +A E E +G Sbjct: 322 ----DYNKMAAIQQSLQAVVVRLNQAFMY--GANQRDAERVTAEEVRITAEEAENTLGGT 375 Query: 395 IGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASA 454 L + L +D L + P + S Q + + Sbjct: 376 YSLLAENLQSPLAYVCLSEVDDA-LLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQV 434 Query: 455 LQGVNTVVELGVKTGDPSCMDHM 477 + G+ + +L + P MD + Sbjct: 435 IAGLAPIAQLDPRISLPKMMDTI 457 >gi|195546737|ref|YP_002117815.1| head-tail connector protein [Pseudomonas phage PT2] gi|165880746|gb|ABY71001.1| head-tail connector protein [Pseudomonas phage PT2] Length = 510 Score = 84.7 bits (208), Expect = 4e-14, Method: Composition-based stats. Identities = 60/443 (13%), Positives = 129/443 (29%), Gaps = 44/443 (9%) Query: 46 WDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTD 105 + + G+ L++ L+ + P G + +E A + D +V +V Sbjct: 48 FQSAGALLVNNLAAKLARSLFPTGIPFFR-SELTDAIRREADSRDTDITEVTAALARVDR 106 Query: 106 TLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS 165 ++ S + L ++ G Y ++ ++ L + + Sbjct: 107 KATQRLFQNAS--LAVLTQVIKLLIVTGNALLYRDSAAATV--------VAWSLRSYAVR 156 Query: 166 VNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK 225 + + + + + ++ L ++ + +T + Sbjct: 157 RDATGRWMDIVLKQRYKSKDLDEEYKQD-LMRAGRNLSGSGSVDLYTHVQ---------- 205 Query: 226 KKDKGNKGFHSKFVSVDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 +K+ + + +D R +E + PYIV + + E YGR + + Sbjct: 206 RKNGTAMEYAELYHEIDGVRVGKEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFA 265 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNF---------DLKPGYMNIGALSREGRSL 334 +L+ +L + SL V EAK D PG G Sbjct: 266 KLSLLSEKLGLYELESLEV-LNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERG--- 321 Query: 335 FQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPL 394 + + L + + F+ D +A E E +G Sbjct: 322 ----DYNKMAAIQQSLQAVVVRLNQAFMY--GANQRDAERVTAEEVRITAEEAENTLGGT 375 Query: 395 IGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASA 454 L + L +D L + P + S Q + + Sbjct: 376 YSLLAENLQSPLAYVCLSEVDDA-LLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQV 434 Query: 455 LQGVNTVVELGVKTGDPSCMDHM 477 + G+ + +L + P MD + Sbjct: 435 IAGLAPIAQLDPRISLPKMMDTI 457 >gi|225626357|ref|YP_002727853.1| putative head-tail connector protein [Pseudomonas phage phikF77] gi|225594866|emb|CAX63151.1| putative head-tail connector protein [Pseudomonas phage phikF77] Length = 510 Score = 84.3 bits (207), Expect = 5e-14, Method: Composition-based stats. Identities = 68/449 (15%), Positives = 128/449 (28%), Gaps = 56/449 (12%) Query: 46 WDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTD 105 + + G+ L++ L+ + P G + +E A + D +V +V Sbjct: 48 FQSAGALLVNNLAAKLARSLFPTGIPFFR-SELTDAIRREADSRDTDITEVTAALARVDR 106 Query: 106 TLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS 165 ++ S + L ++ G Y +D ++ L + + Sbjct: 107 KATQRLFQNAS--LAVLTQVIKLLIVTGNALLYRNSDEATV--------VAWSLRSYAVR 156 Query: 166 VNHQNV-VDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-----RFTIIHAVYP 219 + +D V + ++ K L K L R + V Sbjct: 157 RDATGRWMDIV----------LKQRYKSKDLDEAYKQDLMRAGRNLSGSGSVDLYTHVQ- 205 Query: 220 KSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAME 277 +K + + +D R EE + PYIV + + E YGR + Sbjct: 206 ------RKKGTAMEYAELYHEIDGVRVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVED 259 Query: 278 ALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNF---------DLKPGYMNIGALS 328 + +L+ +L + SL V EAK D PG Sbjct: 260 YIGDFAKLSLLSEKLGLYELESLEV-LNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAY 318 Query: 329 REGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKG 388 G + + L + + F+ D +A E E Sbjct: 319 ERG-------DYNKMAAIQQSLQAVVVRLNQAFMY--GANQRDAERVTAEEVRITAEEAE 369 Query: 389 AFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQA 448 +G L + L +D L + P + S Q Sbjct: 370 NTLGGTYSLLAENLQSPLAYVCLSEVDDA-LLQGLITKQHKPAIETGLPALSRSAAVQSM 428 Query: 449 ESVASALQGVNTVVELGVKTGDPSCMDHM 477 + + + G+ + +L + P MD + Sbjct: 429 LNASQVIAGLAPIAQLDPRISLPKMMDTI 457 >gi|125999995|ref|YP_001039666.1| head portal-like protein [Erwinia amylovora phage Era103] gi|121621851|gb|ABM63425.1| head portal-like protein [Enterobacteria phage Era103] Length = 517 Score = 84.3 bits (207), Expect = 5e-14, Method: Composition-based stats. Identities = 66/468 (14%), Positives = 138/468 (29%), Gaps = 60/468 (12%) Query: 5 SAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY------KNNAQLRMWDTTGSEACIKLS 58 + I + L +R E + F PY + + W G+ A LS Sbjct: 8 NKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDDLSSQNAWQDDGASATNFLS 67 Query: 59 SLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGF 118 + LS ++ P + + + + L E ++ V F Sbjct: 68 NKLSQVLFPAQRSFFRIDLT-PEGIKQLDNEAMTQSTAQKLLSDVEKA--AMLYGESLQF 124 Query: 119 VGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV-DSVY- 176 + + ++ G D +VPL + + ++ V D V+ Sbjct: 125 RPAVVEAFKHLIVTGN-VMMYHPDKTSPI-------QAVPLHHYCVRRDNNGTVLDIVFL 176 Query: 177 -----REFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGN 231 F ++ + + K ++++ HA ++ K + Sbjct: 177 QEKALETFEPSIRMAIQ--------ASRKGKQYKDKDNVKLYTHA--KRTKDGKYLIRQ- 225 Query: 232 KGFHSKFVSVDENRFFEEKQIAT--FPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 S D+ +E + P+++ ++ E YGR A + + Sbjct: 226 --------SADDVPVGKESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLS 277 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGR-SLFQPVQFGNPLPYHE 348 LA+ L + + G EG + Q ++ + P Sbjct: 278 EALARGMALMADVKYLVKPGSYTDINQFVEGGSGAVLHGVEGDIHIVQLGKYADYTPIQA 337 Query: 349 ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 LN ++ I +F+++ D +A E +G + + F G + + Sbjct: 338 VLNDYRQRIGRVFMMEA-MTRRDAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGPL-A 395 Query: 409 REL-----DILDSQGNLPECEGADNPPVSLLKVE-------YTSPLFK 444 R IL S+ P + +++ Y S + Sbjct: 396 RWFMNGISSILTSKNVSPTILTGIEALGRMAELDKLGTFNGYVSMTAQ 443 >gi|167600476|ref|YP_001671975.1| head-tail connector protein [Pseudomonas phage LUZ19] gi|161168339|emb|CAP45503.1| head-tail connector protein [Pseudomonas phage LUZ19] Length = 510 Score = 83.9 bits (206), Expect = 7e-14, Method: Composition-based stats. Identities = 60/443 (13%), Positives = 128/443 (28%), Gaps = 44/443 (9%) Query: 46 WDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTD 105 + + G+ L++ L+ + P G + +E A + D +V +V Sbjct: 48 FQSAGALLVNNLAAKLARSLFPTGIPFFR-SELTDAIRREADSRDTDITEVTAALARVDR 106 Query: 106 TLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS 165 ++ S + L ++ G Y ++ ++ L + + Sbjct: 107 KATQRLFQNAS--LAVLTQVIKLLIVTGNALLYRDSAAATV--------VAWSLRSYAVR 156 Query: 166 VNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK 225 + + + + + ++ L ++ + +T + Sbjct: 157 RDATGRWMDIVLKQRYKSKDLDEEYKQD-LMRAGRNLSGSGSVDLYTHVQ---------- 205 Query: 226 KKDKGNKGFHSKFVSVDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 +K + + +D R +E + PYIV + + E YGR + + Sbjct: 206 RKKGTAMEYAELYHEIDGVRVGKEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFA 265 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNF---------DLKPGYMNIGALSREGRSL 334 +L+ +L + SL V EAK D PG G Sbjct: 266 KLSLLSEKLGLYELESLEV-LNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERG--- 321 Query: 335 FQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPL 394 + + L + + F+ D +A E E +G Sbjct: 322 ----DYNKMAAIQQSLQAVVVRLNQAFMY--GANQRDAERVTAEEVRITAEEAENTLGGT 375 Query: 395 IGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASA 454 L + L +D L + P + S Q + + Sbjct: 376 YSLLAENLQSPLAYVCLSEVDDA-LLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQV 434 Query: 455 LQGVNTVVELGVKTGDPSCMDHM 477 + G+ + +L + P MD + Sbjct: 435 IAGLAPIAQLDPRISLPKMMDTI 457 >gi|311875235|emb|CBX44494.1| bacteriophage head-to-tail connecting protein [Erwinia phage phiEa1H] gi|311875356|emb|CBX45097.1| head-to-tail connecting protein [Erwinia phage phiEa100] Length = 517 Score = 83.5 bits (205), Expect = 7e-14, Method: Composition-based stats. Identities = 65/468 (13%), Positives = 138/468 (29%), Gaps = 60/468 (12%) Query: 5 SAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY------KNNAQLRMWDTTGSEACIKLS 58 + I + L +R E + F PY + + W G+ A LS Sbjct: 8 NKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDDLSSQNAWQDDGASATNFLS 67 Query: 59 SLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGF 118 + LS ++ P + + + + L E ++ V F Sbjct: 68 NKLSQVLFPAQRSFFRIDLT-PEGIKQLDNEAMTQSTAQKLLSDVEKA--AMLYGESLQF 124 Query: 119 VGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV-DSVY- 176 + + ++ G D +VPL + + ++ + D V+ Sbjct: 125 RPAVVEAFKHLIVTGN-VMMYHPDKTSPI-------QAVPLHHYCVRRDNNGTILDIVFL 176 Query: 177 -----REFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGN 231 F ++ + + K ++++ HA ++ K + Sbjct: 177 QEKALETFEPSIRMAIQ--------ASRKGKQYKDKDNVKLYTHA--KRTKDGKYLIRQ- 225 Query: 232 KGFHSKFVSVDENRFFEEKQIAT--FPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 S D+ +E + P+++ ++ E YGR A + + Sbjct: 226 --------SADDVPVGKESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLS 277 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGR-SLFQPVQFGNPLPYHE 348 LA+ L + + G EG + Q ++ + P Sbjct: 278 EALARGMALMADVKYLVKPGSYTDINQFVEGGSGAVLHGVEGDIHIVQLGKYADYTPIQA 337 Query: 349 ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 LN ++ I +F+++ D +A E +G + + F G + + Sbjct: 338 VLNDYRQRIGRVFMMEA-MTRRDAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGPL-A 395 Query: 409 REL-----DILDSQGNLPECEGADNPPVSLLKVE-------YTSPLFK 444 R IL S+ P + +++ Y S + Sbjct: 396 RWFMNGISSILTSKNVSPTILTGIEALGRMAELDKLGTFNGYVSMTAQ 443 >gi|291335778|gb|ADD95380.1| hypothetical protein [uncultured phage MedDCM-OCT-S05-C429] Length = 315 Score = 83.2 bits (204), Expect = 1e-13, Method: Composition-based stats. Identities = 49/296 (16%), Positives = 99/296 (33%), Gaps = 19/296 (6%) Query: 256 PYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNF 315 P++V + E YGR E L ++ L L + + + + + Sbjct: 30 PWLVLTFNSVDGEQYGRGRVEEFLGDLKSLEGLSQALVEGAAAASKVIFLVSPSSTTKPA 89 Query: 316 DLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASR 375 + GA+ + Q VQ G + N + R L L + + Sbjct: 90 TIAKAG--NGAIVQGRAEDVQVVQVGKTADFSTAANMSQTIERRLLEAFLVMNVRNAERV 147 Query: 376 SAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLK 435 +A E E +G + L F+ + R L +L LP+ P + Sbjct: 148 TAEEVRLTQLELEQQLGGIFSLLTVSFLIPYLDRTLLVLQRTNELPKLPKDIIRPTIVAG 207 Query: 436 VEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCM-DHMDTDRVSRFSLWATNTPA 494 V + L + Q E++ Q + T+ + T P + ++ + A Sbjct: 208 V---NALGRGQDREALT---QFMGTIAQ----TIGPEALGQFINPLEAIKRLAAAQGIDV 257 Query: 495 -VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHD 549 L++ ++ ++E ++ ++Q L Q Q + A M+ + + Sbjct: 258 LNLVKTQEQLAG---EKEEAMQMQQQQTLLNQAGQFANS--KLADTENMQGMMQGE 308 >gi|291334263|gb|ADD93926.1| hypothetical protein [uncultured marine bacterium MedDCM-OCT-S08-C235] Length = 130 Score = 82.4 bits (202), Expect = 2e-13, Method: Composition-based stats. Identities = 25/126 (19%), Positives = 51/126 (40%), Gaps = 11/126 (8%) Query: 368 VLDDKAS--RSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEG 425 +L D SA E E+ + +G G LQ+E + ++ R + IL QG + Sbjct: 1 MLGDPNRTPMSATEVAERMADLSRQIGSSFGRLQAEMVTPVLQRVIHILKKQGRI----N 56 Query: 426 ADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMD-HMDTDRVSR 484 +K++ TSPL + Q + + + + V P ++ +D++ ++ Sbjct: 57 IPTVNGREIKIQSTSPLAQAQANQDINGFNRFLELVG----ARFGPQLINLLVDSNEATK 112 Query: 485 FSLWAT 490 + Sbjct: 113 YLAENL 118 >gi|167565008|ref|ZP_02357924.1| head-to-tail joining protein [Burkholderia oklahomensis EO147] Length = 509 Score = 81.6 bits (200), Expect = 3e-13, Method: Composition-based stats. Identities = 81/510 (15%), Positives = 155/510 (30%), Gaps = 69/510 (13%) Query: 9 IQDRFNYLKNQRGELNYWMEELTGFLYPY----KNNAQLRM----WDTTGSEACIKLSSL 60 ++DR+ L R + P ++ + + G +SS Sbjct: 4 LKDRYQELVPDRDPYFRRAQACAALTVPSVCPPDGQTSQQILPQSYTSFGHRGATNVSSK 63 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 L PPG + S + ++ + Q + E + Sbjct: 64 LMMAFMPPGDSAFNIEVSTQVL--LQEGVLSPPPEIVKGLAQCEQLINAKIE--ALNWRR 119 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 +V G Y++ D R LS + V Sbjct: 120 QTYLSLLHLVVAGNVGEYIQPDG---------RLKIFSLSQFVCVRDFNGRV-------- 162 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 K + L ++ A+ E E T+ + + ++ ++ Sbjct: 163 MEA-VTAEKLKVRELPKDLQRVTAKKEREDVTLY----------TRFEWVDENRYAVHQD 211 Query: 241 VDENRFFEEKQIAT----FPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFG 296 +D+ K P+ + + E YGRS + + L++T +L + G Sbjct: 212 LDDAVV---KPYQEYNGIMPFNALAWELVPGESYGRSHVEQNYSDLIALDKTSQQLLECG 268 Query: 297 RLSLHP-----PTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYH---E 348 ++ P A ++R + + G + +G QP QF N Sbjct: 269 AIAARNLIFVAPNAAGGNLRKRIMEARNGSVISARGGTQGD--VQPFQFNNMAAMQSLNA 326 Query: 349 ELNRLKESIRSLFL--LDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 E LK + FL DL D +A E E +G + L E IG Sbjct: 327 EKQDLKRDLAVAFLLTNDL---RRDAERVTAYELQMLVTEIEQSLGGVYSYLGPEMIGWR 383 Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 + + + + S+ LP+ G D+ +++ + L K + + V S L +N + Sbjct: 384 LKKLVAQMQSKDELPKI-GKDSTQITVTTG--LAALGKDAKLKKVHSFLSLLNETPQ--- 437 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAVL 496 ++ D + + A P + Sbjct: 438 -AFQQEAAAYVKFDTILTPAAAALGFPQSI 466 >gi|158345057|ref|YP_001522822.1| putative head-tail connector protein [Pseudomonas phage LKD16] gi|114796410|emb|CAK25966.1| putative head-tail connector protein [Pseudomonas phage LKD16] Length = 510 Score = 80.5 bits (197), Expect = 7e-13, Method: Composition-based stats. Identities = 66/449 (14%), Positives = 126/449 (28%), Gaps = 56/449 (12%) Query: 46 WDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTD 105 + + G+ L++ L+ + P G + +E A + D +V +V Sbjct: 48 FQSAGALLVNNLAAKLARSLFPTGIPFFR-SELTDAIRREADSRDTDITEVTAALARVDR 106 Query: 106 TLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS 165 ++ S + L ++ G Y +D ++ L + + Sbjct: 107 KATQRLFQNAS--LAVLTQVIKLLIVTGNALLYRNSDEATV--------VAWSLRSYAVR 156 Query: 166 VNHQNV-VDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-----RFTIIHAVYP 219 + +D V + ++ K L K L R + V Sbjct: 157 RDATGRWMDIV----------LKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQR 206 Query: 220 KSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAME 277 + + + +D R E + PYIV + + E YGR + Sbjct: 207 RK-------GTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVED 259 Query: 278 ALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNF---------DLKPGYMNIGALS 328 + +L+ +L + SL V EAK D PG Sbjct: 260 YIGDFAKLSLLSEKLGLYELESLEV-LNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAY 318 Query: 329 REGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKG 388 G + + L + + F+ D +A E E Sbjct: 319 ERG-------DYNKMAAIQQSLQAVVVRLNQAFM--YGANQRDAERVTAEEVRITAEEAE 369 Query: 389 AFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQA 448 +G L + L +D L + P + S Q Sbjct: 370 NTLGGTYSLLAENLQSPLAYVCLSEVDDA-LLQGLITKQHKPAIETGLPALSRSAAVQSM 428 Query: 449 ESVASALQGVNTVVELGVKTGDPSCMDHM 477 + + + G+ + +L + P MD + Sbjct: 429 LNASQVIAGLAPIAQLDPRISLPKMMDTI 457 >gi|254505047|ref|ZP_05117198.1| hypothetical protein SADFL11_5087 [Labrenzia alexandrii DFL-11] gi|222441118|gb|EEE47797.1| hypothetical protein SADFL11_5087 [Labrenzia alexandrii DFL-11] Length = 400 Score = 80.5 bits (197), Expect = 8e-13, Method: Composition-based stats. Identities = 45/260 (17%), Positives = 92/260 (35%), Gaps = 17/260 (6%) Query: 254 TFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIA---VSEA 310 T P+ R+ E YGR E + L+ + ++ T+ + + Sbjct: 135 TNPFNPLRWSAVPGEDYGRGKVEEHFSDLTYLDLLSKSMVDGSAMATRHITMVRPNAAGS 194 Query: 311 KQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLD 370 R + ++ + + E L Q +E+ R+ + + FLL ++ Sbjct: 195 NLRKRFAEAKNGDVISGNPEDVDLKQFANVTGMQIAQQEIARITQELAQAFLLS-SSMIR 253 Query: 371 DKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPP 430 + +A E E + +G + L + + A I + + + G LP P Sbjct: 254 NAERVTAQEVRMIAEELESVLGGVYSYLSQDMMSARIEALMTSMMAAGQLPPVLQMTQP- 312 Query: 431 VSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWAT 490 +L V L + + V + LQ + + P +D++D + + + Sbjct: 313 --VLTVG-LEALERDKDVMRVQTVLQTLQAL--------PPDFLDYLDIPDLLKTFMIGL 361 Query: 491 NTPAVLIRDTAEVEDIRQQR 510 P ++ E + RQQR Sbjct: 362 GLPGK-VKTEQEAQQTRQQR 380 >gi|291334897|gb|ADD94534.1| T7-like head to tail connector [uncultured phage MedDCM-OCT-S08-C159] Length = 416 Score = 79.3 bits (194), Expect = 2e-12, Method: Composition-based stats. Identities = 41/247 (16%), Positives = 80/247 (32%), Gaps = 15/247 (6%) Query: 249 EKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVS 308 ++ P+I R+ E YGR E + L + + + S + Sbjct: 120 RSKLDVSPWIPLRFIRVDGEDYGRGYVEEYRGDLISLESLMQAIIEGAAASAKTLFLVNP 179 Query: 309 EAKQRNFDLK--PGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLF 366 R L P L+ + S+ Q + G+ + R++ + FL+ Sbjct: 180 NGVTRAATLAKAPNGAIREGLASDI-SVMQVGKSGDFSVAFSAIQRIEGRLEFAFLMAR- 237 Query: 367 QVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGA 426 V D +AAE +E +G + L EF + R + +L QG +P+ Sbjct: 238 SVQRDAERVTAAEVSLMAQELENSLGGIYSILTQEFQLPYLRRRMHLLVRQGKVPKLPDE 297 Query: 427 DNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCM-DHMDTDRVSRF 485 P + + Q + + + + G P M +++ D + Sbjct: 298 LVKPKIVTGL---------QGLGRGNDRNKLIEFIGTVAQALG-PDVMRQYVNVDEAVKR 347 Query: 486 SLWATNT 492 + Sbjct: 348 LATSIGI 354 >gi|315518948|dbj|BAJ51825.1| putative head to tail joining protein [Ralstonia phage RSB2] Length = 531 Score = 78.5 bits (192), Expect = 3e-12, Method: Composition-based stats. Identities = 84/555 (15%), Positives = 169/555 (30%), Gaps = 64/555 (11%) Query: 13 FNYLKNQRGELNYWMEELTGFLYPY-----KNNAQLRM---WDTTGSEACIKLSSLLSSL 64 + L+N R E+ + P +N + + G+ L++ L Sbjct: 19 YTRLENDRAPYITRAEKNAQYTIPSLFPKSSDNYSTDYPTPYQSVGARGLNNLAAKLVLS 78 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREW----CDQVTDTLFGFRERSRSGFVG 120 + P G+ +H L S + V E V + E + G Sbjct: 79 LIPVGEPFHRLTISEFDVKE-TAGGTGEEGSVMERAQVGLSMVERIITAHGESA--GLRP 135 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV-DSVYREF 179 ++ G G + + L N + + V ++ ++ Sbjct: 136 MASELMKQLLVAGNGLVCLPPQE--------VACKLYKLHNFVVERDSVGNVLQTIAKDV 187 Query: 180 TFTVDQIVSKWGDKVLSSKMKSALARN---ENERFTIIHAVYPKSLTDKKKDKGNKGFHS 236 T L ++K+AL N T+ Y +D+ Sbjct: 188 T----------AYVALPEEVKAALPEGDYQPNSPITMYTHCYRDLESDQWLA-------- 229 Query: 237 KFVSVDENRF-FEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 + V+ E PYI R + E YGRS E + + L + Sbjct: 230 -YQEVEGEVIPGSENTYPKEGNPYIPIRMYKQDGENYGRSFVEEYIGDLVSLENISKAIV 288 Query: 294 QFGRLSLHPPTIA-VSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNR 352 QF + + K + +E +FQ +F + + Sbjct: 289 QFAIACSKILFLVKPGSSTSVRRVAKAASGDFVPGKKEDIEVFQMEKFADFQTAKSVADG 348 Query: 353 LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELD 412 +++ + FLL+ V +A E + E + +G + L +EF ++ R L Sbjct: 349 IEQRLSFAFLLN-SSVQRSGERVTAEEIRFVSAELESTLGGVYSVLATEFQLPIVRRWLI 407 Query: 413 ILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPS 472 L + G +P+ P + ++ + Q +A+ + V+ Sbjct: 408 DLQATGKIPDLPTEALKPQIITGIDAIG---RGQDQAKLAAFQSLIQPFVQ--------R 456 Query: 473 CMDHMDTDRVSRFSLWATNT-PAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQ 531 + +D D + + A+ PA LI +++ R +E + + + Sbjct: 457 VSNRVDWDGLLLKAANASGLDPAGLILTDQQMQA-RATQEGITQGLVQGGASAGATAGQG 515 Query: 532 DIGAKAAGRAMEKKL 546 A +++ L Sbjct: 516 MGAAMTDPEGIQQAL 530 >gi|291334524|gb|ADD94177.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201] gi|291334656|gb|ADD94303.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695] gi|291334710|gb|ADD94356.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C890] gi|291336436|gb|ADD95991.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073] Length = 95 Score = 74.7 bits (182), Expect = 4e-11, Method: Composition-based stats. Identities = 14/93 (15%), Positives = 35/93 (37%), Gaps = 13/93 (13%) Query: 38 KNNAQLR---MWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSK 94 ++ R ++D + ++ L++ L ++T P W L F + Sbjct: 12 RSKGDKRTELIFDGSPLQSVELLAASLHGMLTNPSTPWFSLR--------FKQNDMENED 63 Query: 95 KVREWCDQVTDTLFGFRERSRSGFVGCLQSFYT 127 + +EW + T+ ++ ++S F + Sbjct: 64 EAKEWLEDATEVMYSAF--NKSNFQQEYLNCIM 94 >gi|157828579|ref|YP_001494821.1| hypothetical protein A1G_03995 [Rickettsia rickettsii str. 'Sheila Smith'] gi|157801060|gb|ABV76313.1| hypothetical protein A1G_03995 [Rickettsia rickettsii str. 'Sheila Smith'] Length = 111 Score = 73.5 bits (179), Expect = 8e-11, Method: Composition-based stats. Identities = 27/112 (24%), Positives = 50/112 (44%), Gaps = 9/112 (8%) Query: 175 VYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKG---- 230 +YR F+ + +KW D K LA+N +E I+H V P+S + K Sbjct: 1 MYRLFSMPIKAASAKWPDFA---DFKERLAKNPDETVKILHIVSPQSENQRGKGGKGKGL 57 Query: 231 --NKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALP 280 + S+++ + E + + + FP+ V + ++YG +PA A+ Sbjct: 58 MTTLAYSSEYIYLSEQKIISQSGYSYFPFFVTLWIKGEGQVYGYAPAHHAIS 109 >gi|165933293|ref|YP_001650082.1| hypothetical protein RrIowa_0838 [Rickettsia rickettsii str. Iowa] gi|165908380|gb|ABY72676.1| hypothetical protein RrIowa_0838 [Rickettsia rickettsii str. Iowa] Length = 111 Score = 73.1 bits (178), Expect = 1e-10, Method: Composition-based stats. Identities = 27/112 (24%), Positives = 49/112 (43%), Gaps = 9/112 (8%) Query: 175 VYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKG---- 230 +YR F+ + +KW D K LA+N +E I+H V P+S + K Sbjct: 1 MYRLFSMPIKAASAKWPDFA---DFKERLAKNPDETVKILHIVSPQSENQRGKGGKGKGL 57 Query: 231 --NKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALP 280 + S+++ + E + + FP+ V + ++YG +PA A+ Sbjct: 58 MTTLAYSSEYIYLSEQKIISQSGYLYFPFFVTLWIKGEGQVYGYAPAHHAIS 109 >gi|167841465|ref|ZP_02468149.1| head-to-tail joining protein [Burkholderia thailandensis MSMB43] Length = 519 Score = 66.6 bits (161), Expect = 1e-08, Method: Composition-based stats. Identities = 63/523 (12%), Positives = 144/523 (27%), Gaps = 67/523 (12%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYP------YKNNAQLRM---WDTTGSEACIKLSSL 60 + + L R L E+ + F P N + + + G++ L++ Sbjct: 6 EQAWESLAGLRRPLLTRCEKYSAFTLPTIITPQGYNEELEELQTDFQSVGAQGVNNLANK 65 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 L + P + + + + D + + ++E + R G Sbjct: 66 LMLALFAPSRPFFRYQVAAALMNQLKQTLDVQEQDLQEMLAEGERNC--IRTLDAMGVRP 123 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 L ++ G C + D + + L + + + Sbjct: 124 KLYEAMKHLIITGN-CLLILGDDPKDTPMRV-----LSLKRYAVKRSMSGKL------LQ 171 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 + + V ++ L +++ + + + D K F Sbjct: 172 LIIHETV-RF--DELDDEVQKIAVESSSRYANV-------DPNDPNSCPEVKYFTWVRWD 221 Query: 241 VDENRFFEE-----------KQIA---TFPYIVGRYRVRADEIYGRSPAMEALPTIRRLN 286 N PYI + + D YG + + L+ Sbjct: 222 GTANYIVTHHVDNVELPAKFSGKYTDQDLPYIPLTWELHDDNDYGTGLVEQMAGDLAALS 281 Query: 287 ETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPY 346 + L+ + + R D+ + GA + P+ G Sbjct: 282 ALSEAEVKGAILASEFRWLVNPAGQTRPADI--ADSDNGAALPGTKDDVVPLNSGTGQAM 339 Query: 347 H---EELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFI 403 + I FLL ++ D +A E + E +G + L +F Sbjct: 340 QYIDTVATKYVNRIGRNFLLS-SSIVRDAERVTAEEIRMQANELETSLGGVYSRLAVDFQ 398 Query: 404 GAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVE 463 M + G + G D P+ + ++ L + +++ ALQ + V Sbjct: 399 KPM---AYWLTKRAGV--QLAGKDIEPMVITGLD---ALSRNGDLDNLKLALQDLAAVSG 450 Query: 464 LGVKTGDPSCMDHMDTDRVSRFSLWATNTP-AVLIRDTAEVED 505 + P + ++ +++ A ++ + Sbjct: 451 M-----PPQALAVLNLTAIAKAIFMGRGVTMADYVKSQEQQAA 488 >gi|108862014|ref|YP_654130.1| 29 [Enterobacteria phage K1-5] gi|40787100|gb|AAR90071.1| 29 [Enterobacteria phage K1-5] Length = 516 Score = 65.4 bits (158), Expect = 3e-08, Method: Composition-based stats. Identities = 64/490 (13%), Positives = 152/490 (31%), Gaps = 60/490 (12%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYPY------KNNAQLRMWDTTGSEACIKLSS 59 I + N+R + + PY N W G++A L++ Sbjct: 13 RSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGDNETSQNGWQGVGAQATNHLAN 72 Query: 60 LLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFV 119 L+ ++ P + + + + + + +++ + T + +E + F Sbjct: 73 KLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAM---KELEQRQFR 129 Query: 120 GCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQN-VVDSVY-- 176 + + ++ G C + ++P+ + ++ + ++D + Sbjct: 130 PAVVEAFKHLIVAG-SCMLYKPSKGAIS--------AIPMHHYVVNRDTNGDLLDIILLQ 180 Query: 177 ----REFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK-KKDKGN 231 R F +V + K K + + +T HA Y + K+ + Sbjct: 181 EKALRTFDPATRAVVE------VGLKGKKCKEDDSVKLYT--HAKYLGDGFWELKQSADD 232 Query: 232 KGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNE 291 ++ EK P+I ++ E +GR A + + + Sbjct: 233 IPVGKV------SKIKSEK----LPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEA 282 Query: 292 LAQFGRLSLHPPTIAVSEAKQRNFD--LKPGYMNIGALSREGRSLFQPVQFGNPLPYHEE 349 +A+ L + A Q + D + G + E + Q ++ + P Sbjct: 283 VARGAALMADIKYLIRPGA-QTDVDHFVNSGTGEVVTGVEEDIHIVQLGKYADLTPISAV 341 Query: 350 LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISR 409 L I +F+++ D +A E E +G + + + Sbjct: 342 LEVYTRRIGVVFMMET-MTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPV--- 397 Query: 410 ELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVEL----G 465 + L G E +D ++ L + + + +A+ Q ++ ++ Sbjct: 398 AMWGLLEAG---ESFTSDLVDPVIITG--IEALGRMAELDKLANFAQYMSLPLQWPEPVL 452 Query: 466 VKTGDPSCMD 475 P MD Sbjct: 453 AAVKWPDYMD 462 >gi|312062873|gb|ADQ12735.1| putative Head-tail connector protein [Acinetobacter phage phiAB1] Length = 518 Score = 65.0 bits (157), Expect = 3e-08, Method: Composition-based stats. Identities = 32/293 (10%), Positives = 78/293 (26%), Gaps = 19/293 (6%) Query: 255 FPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVN--ELAQFGRLSLHPPTIAVSEAKQ 312 PYI + + YGR E +L+E Q L + A Sbjct: 235 CPYIPVTWSYMNGDAYGRGYVEEYAGDFAKLSELSQGLTEYQIESLIIRHVYNA-QGGFD 293 Query: 313 RNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDK 372 + + + + ++ + L + + + F+ + + Sbjct: 294 VESAVNSRNGDWISGNVNAVQNYESGSYQKMNEVRLGLEAIMQRLNVAFMYT--GNMREG 351 Query: 373 ASRSAAESMEKTREKGAFVGPLIGGL-QSEFIGAMISRELDILDSQGNLPECEGADNPPV 431 +A E E +G + L Q+ + + +L + Sbjct: 352 DRVTAYEIARNADEAEQVLGGVYSQLSQNMHL-PL---AYLLLYEVRK----DFIQAIDR 403 Query: 432 SLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATN 491 +++ + L ++ + L N + + + D + L + Sbjct: 404 QEIELNILTGLQALSRSSENQALLVAANEIATVAQVFS--QVSKRFNLDAIVDKILLSNG 461 Query: 492 TP-AVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAME 543 + + E+ + E QR ++ Q +A + Sbjct: 462 IDISEITYSEEEMRA--KAMEEQRAAEAQRQQVIQQAGAQLGGNQLENTQAAQ 512 >gi|294661422|ref|YP_003347633.2| head-tail connector protein [Klebsiella phage KP34] gi|291195554|gb|ACY66713.2| head-tail connector protein [Klebsiella phage KP34] Length = 531 Score = 64.3 bits (155), Expect = 5e-08, Method: Composition-based stats. Identities = 74/517 (14%), Positives = 162/517 (31%), Gaps = 49/517 (9%) Query: 38 KNNAQLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVR 97 + R + +TG++ ++ + + P G + ++S + A + + + Sbjct: 44 RRRPLERDYQSTGAQLVNTAATKIVGALFPQGTSFFRFSKSSDLDEFISSLGSAATAESK 103 Query: 98 EWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISV 157 +V +T + + G+ LQ+ +V G Y++ + + + Sbjct: 104 --LAEVENTA-SQKVFEKDGYAAKLQAVKLLLVT-GNALEYIDERTGKSIVYSVRNFT-- 157 Query: 158 PLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAV 217 + + V + + S + L ++ R+++ + + Sbjct: 158 ------VRRDGSGNV------LRLIIRERAS---VQDLPESFQNTFYRDKDPYGDV--DI 200 Query: 218 YPKSLTDKKKDKGNKGFHS--KFVSVDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRS 273 Y + K+ + S + D +R + PY V + + + E YGR Sbjct: 201 YTAACRKVKRTEEGVEVVSYEVYQEADGHRIGDSSTYPELELPYNVLVWNLVSGEHYGRG 260 Query: 274 PAMEALPTIRRLNETVN--ELAQFGRLSLHPPTIAVSEAKQRNFDLKPGY----MNIGAL 327 + RL+ + L P A S F + G Sbjct: 261 LVEDYAGDFARLSVLSEALTNYEVESARLIPLIDASSGLDVDEFATSETGEAVQVGGGGS 320 Query: 328 SREGRSLFQPVQFGNPLPYH---EELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKT 384 + +S + G+ + L++ + F+ +A E + Sbjct: 321 NGNSKSPVTAYEGGSAQKIQWIASNIQMLEQKLSRAFMYT--GNSRQGERVTAYEIRQNA 378 Query: 385 REKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEY-TSPLF 443 +E A +G L ++ R+L L + P + + V + V TS L Sbjct: 379 KEAEAAMGGGFSILSDTWL-----RKLAYLYTALVYPRFKLYLSEGVVSINVTVGTSALA 433 Query: 444 KYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEV 503 K A+ + A Q + + + ++ P + D + A + T E Sbjct: 434 KAAAADKLLEAAQSMQLAIPV-LEQITP----RFNKDACVDWYFDAYGIVSEPFMYTEEQ 488 Query: 504 EDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 +QQ + + Q QLQ + A + Sbjct: 489 LQQKQQVQDASADVSAGAAQDQLQGLTAADPTVAGKQ 525 >gi|158345175|ref|YP_001522882.1| putative head-tail connector protein [Enterobacteria phage LKA1] gi|114796471|emb|CAK25009.1| putative head-tail connector protein [Pseudomonas phage LKA1] Length = 514 Score = 64.3 bits (155), Expect = 5e-08, Method: Composition-based stats. Identities = 61/474 (12%), Positives = 129/474 (27%), Gaps = 59/474 (12%) Query: 46 WDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCD---Q 102 + + G+ L++ L+ + PPG+ + + + +S+ D + Sbjct: 50 FQSAGAFLVNNLTAKLALTLFPPGRPSFQIELDDTLQELAAANGIDQSELHSRTADLERR 109 Query: 103 VTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNV 162 T LF S+ L +V G FY E + + + + Sbjct: 110 ATRRLFVNASLSK------LHRILKLLVVTGNALFYREPGTGKMLV--------WTMQSY 155 Query: 163 YMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSL 222 + V ++ + + ++ ++ + +T+I Sbjct: 156 TVRRTSHGDPAVVVLRQQMPFRELTPEIQADAQAKQIAKR-DSDKCDLYTVIE------- 207 Query: 223 TDKKKDKGNKGFHSKFVSVDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALP 280 + N + + ++ R E PY+ + V E YGR E Sbjct: 208 ---WQPTPNGKRCAVWHELEGKRVGPESSYPAHLCPYVPVAWNVPDGEHYGRGYVEEYSG 264 Query: 281 TIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNF---------DLKPGYMNIGALSREG 331 RL+ L + +L V EAK D PG + A G Sbjct: 265 DFARLSILSERLGLYEFEALS-LLNLVDEAKGGAVDDYRDAETGDFVPGQVGSVASYERG 323 Query: 332 RSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFV 391 + + + + F+ + D + E E + Sbjct: 324 -------DYNKIAQASASVESIVMRLNRAFMYT--GQVRDAERVTVEEIRTVAEEAENLL 374 Query: 392 GPLIGGLQSEFIGAMISRELDILDS--QGNLPECEGADNPPVSLLKVEYTSPLFKYQQAE 449 G + L + + G L P + L + + Sbjct: 375 GGVYSLLAETLQAPLAYLTMYEASRGNGGMLLGIAQGVYRPSI---ITGIPALTRNIETA 431 Query: 450 SVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEV 503 ++ A Q + +V V+ D +++ + + +V Sbjct: 432 NILRATQEASAIVPALVQLSK-----RFDPEKLVERIFANNSVDLSTLSKDPDV 480 >gi|83571754|ref|YP_425006.1| putative head-tail connector [Enterobacteria phage K1E] gi|83308205|emb|CAJ29437.1| gp29 protein [Enterobacteria phage K1E] Length = 516 Score = 64.3 bits (155), Expect = 5e-08, Method: Composition-based stats. Identities = 64/490 (13%), Positives = 152/490 (31%), Gaps = 60/490 (12%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYPY------KNNAQLRMWDTTGSEACIKLSS 59 I + +R + + PY N W G++A L++ Sbjct: 13 RSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDNETSQNGWQGVGAQATNHLAN 72 Query: 60 LLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFV 119 L+ ++ P + + + + + + +++ + T + +E + F Sbjct: 73 KLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAM---KELEQRQFR 129 Query: 120 GCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQN-VVDSVY-- 176 + + ++ G C + ++P+ + ++ + ++D + Sbjct: 130 PAVVEAFKHLIVAG-SCMLYKPSKGAIS--------AIPMHHYVVNRDTNGDLLDIILLQ 180 Query: 177 ----REFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK-KKDKGN 231 R F +V + K K + + +T HA Y + K+ + Sbjct: 181 EKSLRTFDPATRAVVE------VGLKGKKCKEDDSIKLYT--HAKYLGEGFWELKQSADD 232 Query: 232 KGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNE 291 ++ EK P+I ++ E +GR A + + + Sbjct: 233 IPVGKV------SKIKSEK----LPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEA 282 Query: 292 LAQFGRLSLHPPTIAVSEAKQRNFD--LKPGYMNIGALSREGRSLFQPVQFGNPLPYHEE 349 +A+ L + A Q + D + G + E + Q ++ + P Sbjct: 283 VARGAALMADIKYLIRPGA-QTDVDHFVNSGTGEVVTGVEEDIHIVQLGKYADLTPISAV 341 Query: 350 LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISR 409 L I +F+++ D +A E E +G + + + Sbjct: 342 LEVYTRRIGVVFMMET-MTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPV--- 397 Query: 410 ELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVEL----G 465 + L G+ PV + +E L + + + +A+ Q ++ ++ Sbjct: 398 AMWGLLEAGD--SFTSDLVDPVIITGIE---ALGRMAELDKLANFAQYMSLPLQWPEPVL 452 Query: 466 VKTGDPSCMD 475 P MD Sbjct: 453 AAVKWPDYMD 462 >gi|31711672|ref|NP_853590.1| head portal protein [Enterobacteria phage SP6] gi|31505676|gb|AAP48769.1| gp30 [Enterobacteria phage SP6] gi|40787047|gb|AAR90021.1| 29 [Enterobacteria phage SP6] Length = 515 Score = 64.3 bits (155), Expect = 5e-08, Method: Composition-based stats. Identities = 55/453 (12%), Positives = 132/453 (29%), Gaps = 72/453 (15%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYPY------KNNAQLRMWDTTGSEACIKLSS 59 I + +R + PY N W G++A L++ Sbjct: 12 RSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQGVGAQATNHLAN 71 Query: 60 LLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFV 119 L+ ++ P + + + + + + +++ + T + +R F Sbjct: 72 KLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQ---FR 128 Query: 120 GCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV-DSVYRE 178 + + ++ G C + +VP+ + ++ + + D + Sbjct: 129 PAIVEVFKHLIVAGN-CLLYKPSKGAMS--------AVPMHHYVVNRDTNGDLMDVI--- 176 Query: 179 FTFTVDQIVSKWGDKVLSSKMKSALAR-------NENERFTII-HAVYPKSLTDKKKDKG 230 ++ + + + A+ E++ + HA Y Sbjct: 177 -------LLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQY-----------A 218 Query: 231 NKGFHSKFVSVDENRFFEEKQIAT--FPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 +GF S D+ +E +I + P+I ++ E +GR A + + + Sbjct: 219 GEGFWKINQSADDIPVGKESRIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFL 278 Query: 289 VNELAQFGRLSLHPPTIA-VSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYH 347 +A+ L + + + G + E + Q ++ + P Sbjct: 279 SEAMARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVAEDIHIVQLGKYADLTPIS 338 Query: 348 EELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIG----------- 396 L I +F+++ D +A E E +G + Sbjct: 339 AVLEVYTRRIGVIFMMET-MTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIA 397 Query: 397 ---------GLQSEFIGAMISRELDILDSQGNL 420 SE + +I ++ L L Sbjct: 398 MWGLQEAGDSFTSELVDPVIVTGIEALGRMAEL 430 >gi|296532334|ref|ZP_06895072.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] gi|296267358|gb|EFH13245.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] Length = 72 Score = 63.5 bits (153), Expect = 1e-07, Method: Composition-based stats. Identities = 15/67 (22%), Positives = 28/67 (41%), Gaps = 5/67 (7%) Query: 9 IQDRFNYLKNQRGELNYWMEELTGFLY---PYKNNAQLRMWDTTGSEACIKLSSLLSSLI 65 I R+ +R +E + P A ++D T +A +L++ L + + Sbjct: 8 ILPRYQAALARRRPWEGVWQECYDHVLAQTPGSGGAM--LYDATAPDAAEQLAASLLAEL 65 Query: 66 TPPGQKW 72 TPP +W Sbjct: 66 TPPWSRW 72 >gi|197935883|ref|YP_002213719.1| head portal-like protein [Ralstonia phage RSB1] gi|197927046|dbj|BAG70388.1| head portal-like protein [Ralstonia phage RSB1] Length = 514 Score = 62.0 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 66/530 (12%), Positives = 143/530 (26%), Gaps = 60/530 (11%) Query: 19 QRGELNYWMEELTGFLYPYKN-NAQLRM---WDTTGSEACIKLSSLLSSLITPPGQKWHG 74 +R E + + L P N Q + + + GSE LS+ L + P + Sbjct: 24 RRSERYASWTQPS--LCPPDGFNEQTELQNDYQSVGSECVNSLSNRLVLNLFAPSRP--- 78 Query: 75 LAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGT 134 + A K D ++ + ++ + L ++ G Sbjct: 79 -FMRYDVPPAIAAKLDIDPAVLQTQLSKAERDSVKLLDQLSTR--PKLFEAIKHLIVIGN 135 Query: 135 GCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKV 194 + D +VP+ + + ++ + K+ Sbjct: 136 VLVILGKDKTTP-------LRTVPIKKFRCKRSPSGKLVTLAIKECL-------KF--DE 179 Query: 195 LSSKMKSALARNENERFTIIHAVYPKSLTDKK--KDKGNKGFHSKFVSVD-ENRFFEEKQ 251 L K++ L ++ P + D + + + V ++ Sbjct: 180 LDEKVQQKLLEQSPTKYQFT----PNNPPDCEWYTEVCLQPDGRYAVRTQVDDAMLTGHG 235 Query: 252 IA------TFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTI 305 PY V + + YG + ++ Q L+ + Sbjct: 236 YDAMYTEEEMPYRVLTWELPDGWHYGIGLVEQHAGDFAAISTMSASQLQSAILASEFRWL 295 Query: 306 AVSEAKQRN---FDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEEL-NRLKESIRSLF 361 + + + G + G+ + L + + ++ + F Sbjct: 296 VNPAGITQPEDMVNSQNGDVVPGSPDDVVAVTAATAGVASALQVQDLILSKYVTRVGRAF 355 Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 LL D +A E E +G + L +F + + G Sbjct: 356 LL-ASAAQRDAERVTAEEIRRDVLELETSLGGVYSRLAVDFQKPL---AYWLARMLGVKL 411 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 G ++ L L + E++ ALQ + V ++ G S ++T Sbjct: 412 SDTGIQPTIITGLD-----ALSRNSDLENLMRALQQLLIVSQIVAGGGPLSV--TLNTTS 464 Query: 482 VSRFSLWATNTPAVLIRDTAEVEDI----RQQREVQRRVMEEQHLQQQLQ 527 ++ A + E + Q R+ + QQ Sbjct: 465 IAASIFAGNGVDADTYVNDQETQQALMEQEQARQESLAAAPNRARNQQGA 514 >gi|289976621|gb|ADD21666.1| head-to-tail joining protein [Caulobacter phage Cd1] Length = 509 Score = 57.7 bits (138), Expect = 5e-06, Method: Composition-based stats. Identities = 63/501 (12%), Positives = 125/501 (24%), Gaps = 61/501 (12%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEE-----LTGFLYPYKNNAQ-LRMWDTT---GSEACIK 56 AK R++ L N+R +E + P + + T G +A Sbjct: 4 AKQASARWSQLDNKRRGFIERLETYASWTIAKLCTPSGYDQNHSELSHGTQAVGGQAVNH 63 Query: 57 LSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRS 116 L++ + + P + + L S + + R Sbjct: 64 LANKIMLALFAPSRPFFRLDPSDKMQKELAAANVNEQALA---LILSQGEKRAIQALDRM 120 Query: 117 GFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVY 176 L +++ G D + + + V Sbjct: 121 ALRPKLYEAIKNLIVLGNVMLEFTKDTMRVIG----------IKRYCVRRSASGEV---- 166 Query: 177 REFTFTVDQIVSKWGDKVLSSKMKSALARN-----ENERFTIIHAVYPKSLTDKKKDKG- 230 + + L ++ R E+ ++ + + D + + Sbjct: 167 --LELIIKDTMQ---FDELEPSVQEECRRQGMRPLEDAEVSLYRWIVRQDNGDYRMTQHV 221 Query: 231 -NKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 N KF P+ V + + D YG + L Sbjct: 222 DNIELSKKFQGKWSKDKL--------PFRVLTWDLSDDAHYGTGLVEDYRGDFAGLTMLS 273 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGR-SLFQPVQFGNPLPYHE 348 Q LS + + D + +G SL Q + + Sbjct: 274 TAQVQAAILSSEFRWLVNPAGMTKPEDFRDSENGAAIPGVQGDVSLVQSGKAADLQVILS 333 Query: 349 ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 I FL+ + D +A E + E +G L +F M Sbjct: 334 VNAEYINRIARGFLMG-SAMTRDAERVTAEEIRMQASELETSLGGAYSRLAVDFQIPM-- 390 Query: 409 RELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKT 468 ++ EG D P + ++ S + + + + V LG T Sbjct: 391 -AYWLMKKVDM--SIEGTDVEPSIVTGLDALS------RGGDLENLKLFLADVAGLG--T 439 Query: 469 GDPSCMDHMDTDRVSRFSLWA 489 P + + + + A Sbjct: 440 LPPPVLAVLKVEPLLAAFATA 460 >gi|332800729|emb|CBY88569.1| hypothetical protein [Pantoea phage LIMEzero] Length = 522 Score = 57.3 bits (137), Expect = 6e-06, Method: Composition-based stats. Identities = 62/493 (12%), Positives = 155/493 (31%), Gaps = 58/493 (11%) Query: 43 LRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQ 102 R + + G+ L + L+ + P Q++ + Q + + +V + Sbjct: 57 TRDYQSVGALLVNNLVARLAEFLFPSNQRFVRVKP-----QNLTDAQREKMGQVNQGLIL 111 Query: 103 VTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNV 162 + T+ R ++ G+ +Q+ V G Y ++D + Y L N Sbjct: 112 IEKTV-SERAKANGGYADLIQAIAHQAVT-GNVALYRDSDSE--------TYRVYGLENF 161 Query: 163 YMSVNHQNVV-DSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE----RFTIIHAV 217 + + + VV D++ I + L ++ ++ L + + ++ Sbjct: 162 VVQRDGRGVVVDAI----------IKERLQYDSLPAEFQAQLKAQNFQCGGNKRIWLYTR 211 Query: 218 YPKSLTDKK------KDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYG 271 + + GN S + + ++ EK P+I + +++ E YG Sbjct: 212 VLRVKRGNNYGYEITQQIGNMS-GSVY--TPGDDYYPEK---VCPWIFPVWSLKSGEHYG 265 Query: 272 RSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAV-SEAKQRNFDLKPGYMNIGALSRE 330 R + RL+ A + + ++ + S + + I + Sbjct: 266 RGIVEDHAGDFARLSMLSESSALYMQEAMRILWLLSGSGGNADDIEAAETGQVISLQTGT 325 Query: 331 GRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAF 390 + + +E+ ++ + + F+ D +A E + Sbjct: 326 KLEGVEVGDYQKVQQARDEIGQIVQRLSQAFMYT--GEFRDSERTTATEIQQVATSAERA 383 Query: 391 VGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPV--SLLKVEYTSPLFKYQQA 448 +G +Q++ L I + L E + P + +L+++ + L ++ Sbjct: 384 MGGPYS-MQAK--------TLQIPLAYVLLSEIDDTLVPDIVGKILELQVVAGLDALGRS 434 Query: 449 ESVASALQGVN-TVVELGVKTGDPSCMD-HMDTDRVSRFSLWATNTPAVLIRDTAEVEDI 506 + +Q ++ + +D V + R + E Sbjct: 435 IEASQLIQALSDAQAAIAAVANINQVAQGVLDPKAVLETIFSSNGVALDDYRTSPEELQA 494 Query: 507 RQQREVQRRVMEE 519 + Q+ Q Sbjct: 495 KAQQINQMTAEAG 507 >gi|115304377|ref|YP_762669.1| PfWMP4_39 [Cyanophage Pf-WMP4] gi|113201871|gb|ABI33183.1| PfWMP4_39 [Phormidium phage Pf-WMP4] Length = 641 Score = 57.0 bits (136), Expect = 8e-06, Method: Composition-based stats. Identities = 82/607 (13%), Positives = 164/607 (27%), Gaps = 90/607 (14%) Query: 9 IQDRFNYLKNQRGELNYWMEELTGFLYPY---KNNAQLRMWDTTGSE------------- 52 + ++ +++R + +E + N + R + TTG++ Sbjct: 29 VISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRHRINTGHT 88 Query: 53 --ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAF------LYKEDARSKKVREWCDQ-V 103 L + T P W L L K + +R+ + V Sbjct: 89 FEVVETLVA-YFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYV 147 Query: 104 TDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVY 163 + + R G+ ++ + F DV +R + +V+ Sbjct: 148 RNLVLYGVSTYRLGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVW 207 Query: 164 MSVNHQNVVDSVYR-----------------EFTFT-VDQIVS-KWGDKVLSSKMKSALA 204 + + + R + T V+Q V K+ D + Sbjct: 208 LDTSGGKNTGTFVRLRHTREELHELVTSGYYDLDLTQVEQYVDYKFADPDTPKDVNGTDT 267 Query: 205 RNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATF---PYIVGR 261 + II F + + P++ Sbjct: 268 SG----WDIIE-------YYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTTT 316 Query: 262 YRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEA--KQRNFDLKP 319 D +YG S L + LN N L ++ V + K+ + KP Sbjct: 317 LLPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDVKAKP 376 Query: 320 GYMNIGALSREGRSLFQPVQFGNP-LPYHEELNRLKESIRSLFLLDLFQVLDDKA----- 373 G + QP+ G + +++ES S++ L A Sbjct: 377 GAV----FKVAQHGSLQPIDMGRQDFVVTYQEAQVQES--SVYRNTSTGPLIGNAAPRGG 430 Query: 374 -SRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVS 432 +AAE G + + ++ ++++ +L PE P Sbjct: 431 ERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQ 490 Query: 433 LLKVEYTSP-------LFKYQQAESVASALQGVNTVVELGVKTGD-PSCMDHMDTDRVSR 484 + SP F A V + V +++L +G P +D + Sbjct: 491 MDGFFEVSPEYLHYPYKFLALGANYVVERERMVTDLLQLLDISGRVPQIGQSLDYALILE 550 Query: 485 FSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQL-QQTSQDIGAKAAGRAME 543 L +R T + I++ L + +G +A+ Sbjct: 551 DLLRQ-------MRFTDPMRYIKKAEAPPAAPPIAPAEPGALPPEMMNSVGGGLNDQAIA 603 Query: 544 KKLTHDM 550 D+ Sbjct: 604 GMTPEDV 610 >gi|320158420|ref|YP_004190798.1| head-to-tail joining protein [Vibrio vulnificus MO6-24/O] gi|319933732|gb|ADV88595.1| head-to-tail joining protein [Vibrio vulnificus MO6-24/O] Length = 437 Score = 56.6 bits (135), Expect = 1e-05, Method: Composition-based stats. Identities = 66/459 (14%), Positives = 125/459 (27%), Gaps = 50/459 (10%) Query: 67 PPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFY 126 PP + L S A L D++ + Q + E R L Sbjct: 5 PPSHPFVRLGVSNE-LIAKLDLTDSKKGDLETALSQTEQLI--VTELERRALRSLLYEDI 61 Query: 127 TSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQI 186 ++ G G Y+ + R+ L + + Q + Sbjct: 62 KHLLVTGNGLLYVGSKES--------RF--YRLDKYVVERDDQGAPTRIVVCEKINFR-- 109 Query: 187 VSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD--KKKDKGNKGFHSKFVSVDEN 244 L M+ A+ + P+ + + + S Sbjct: 110 -------KLPDAMQFAIREKRRLKGD------PRKDLNLFTMIELKGDQWRSYQEVEGMR 156 Query: 245 RFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 E P+IV E YGRS E + + L V + Q + Sbjct: 157 VPDSESNYRKDRTPWIVCTMNRLDGEDYGRSFCEEHIGDMNTLESLVKAITQASIAASKV 216 Query: 303 PTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELN-RLKESIRSLF 361 + A R L + + + + L ++ + F Sbjct: 217 IFMVKPNASTRASTLSKAKNGDYIQGDREDVGCLQLDKAHDMAIAQNLKAEIQAGLSEAF 276 Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 L+ V D +A E T+ +G L L +++ L ++ G LP Sbjct: 277 LMS-SAVRRDAERVTAEEIRMMTQMLEESLGGLYSQLAQSLQLPLVNVLLGHMERDGILP 335 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 P+ + VE + ++ V+ V ++G + M Sbjct: 336 HFPEGTFEPIVITGVEGLG------REAELSRLNTFVSLVQQVGAEQAAKE----MHLGE 385 Query: 482 VSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRRVMEE 519 + + L++ E +Q+E+Q M + Sbjct: 386 LFKRYAANLQIETKGLMKTAEE-----KQQELQAEQMNQ 419 >gi|229604951|ref|YP_002875651.1| putative head-tail connector protein [Vibrio phage VP93] gi|227976996|gb|ACP44098.1| putative head-tail connector protein [Vibrio phage VP93] Length = 510 Score = 55.4 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 59/496 (11%), Positives = 142/496 (28%), Gaps = 62/496 (12%) Query: 15 YLKNQRGELNYWMEELTGFLYPYKNNAQLRM---WDTTGSEACIKLSSLLSSLITPPGQK 71 L ++R T F K+ ++ + + + G+ L+S L+ + P G Sbjct: 20 TLSSERYAF---WTVPTVFTRENKDGERVSLQRDFQSHGAMLVNNLASKLTRTLFPTGMS 76 Query: 72 WHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVE 131 + ++++ + + + + + ++ + GF ++ Sbjct: 77 FFRISDTDK-MREIIAQLGSENAQLSAVFTGIEREAMTLLTTHA-GFAQLTHLMKL-LII 133 Query: 132 FGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV-DSVYREFTFTVDQIVSKW 190 G Y + R + + + + V ++ RE + Sbjct: 134 TGNALLYRDPLTG--------RMTVYSVRDYAVRRDGAGRVLCTILRE----------RV 175 Query: 191 GDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEK 250 + + + + + + + + D +D Sbjct: 176 PIQDVPEEFRPTGYTDPTTDVWLYTKIQ-RETRDAGDVFV------ITQQIDGKPVGTLS 228 Query: 251 QIAT--FPYIVGRYRVRADEIYGRSPAME---ALPTIRRLNETVNELAQFGRLSLHPPTI 305 PYI + + + E YGR + A + L + + ++ + Sbjct: 229 VYPEKLCPYIPAVWNLVSGEHYGRGHVEDHAGAFARVSELTQALTLYEIEAMRVVN--LV 286 Query: 306 AVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDL 365 + + A EG + + +L + + F+ Sbjct: 287 SPKSTADVDALNDAETGEYVAGDGEGIKAHEAGEARKIAEVVNDLQMVLAELARAFMYT- 345 Query: 366 FQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEG 425 + D +A E RE +G + L +E + ++ L + PE Sbjct: 346 -GNVRDAERVTAEEIKNNVREAEENMGGIYATL-AEILHIPLAHILTVEAR----PELLA 399 Query: 426 ADNPPVSLLKVEY-TSPLFKY---QQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 L ++ T+ + + Q+ VA+ + + V+ K +P DR Sbjct: 400 LLQANAVSLDIQVGTAAINRSIVVQRLGLVANDINLILPVLAQATKRTNP--------DR 451 Query: 482 VSRFSLWATNT-PAVL 496 V L P + Sbjct: 452 VIDLILAGHGVDPTEI 467 >gi|291334412|gb|ADD94067.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1035] Length = 64 Score = 55.4 bits (132), Expect = 2e-05, Method: Composition-based stats. Identities = 11/41 (26%), Positives = 19/41 (46%) Query: 3 QRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQL 43 AK + RF+ LK+QR +E+ ++ P K + Sbjct: 4 SEKAKILLSRFDRLKSQRQNWESHWQEVADYMQPRKADVTK 44 >gi|57237581|ref|YP_178595.1| hypothetical protein CJE0579 [Campylobacter jejuni RM1221] gi|57166385|gb|AAW35164.1| hypothetical protein CJE0579 [Campylobacter jejuni RM1221] Length = 512 Score = 55.0 bits (131), Expect = 3e-05, Method: Composition-based stats. Identities = 70/518 (13%), Positives = 162/518 (31%), Gaps = 64/518 (12%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGF-------LYPYKNNAQLRMWDTTGSEAC 54 N + + K+ +EL + + + ++ Sbjct: 7 NDERVSFLTQLISESKSGYENYKPHFKELQDAYLLENKVMQKLRKRNKSSIYIP------ 60 Query: 55 IKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERS 114 K+++ + LIT + +E + + ++ +D + W + + Sbjct: 61 -KINAKVKYLITSLNDVYFN-SERMADIETYINSDD---TIIELWQNAID------FYSG 109 Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEE----GIRYISVPLSNVYMSVNHQN 170 + Q + V+ GT + +E I + L+ S + Sbjct: 110 KINMFKIFQPLFLDVLLVGTSIAKVTWHKGMPRIERVDIDSIFFDPNALN----SEDVGY 165 Query: 171 VVDSVYREFTFTVDQI--VSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKD 228 +V+ +Y T +QI K G + K +E ++ + +Y + D+ Sbjct: 166 IVNEIY----LTYNQIHERQKLGFYKKNEIKKLFDEDDEYKKVKLYD-IYERKNDDEWVV 220 Query: 229 KGNKGFHSKFVSVD---ENRFFEEKQIATFPYIVGRYRVRADE----IYGRSPAMEALPT 281 S + ++ Q + ++ + + +E YG A+P Sbjct: 221 -------STLFENNLLRNEVTLQDGQPFIWGSMLPQLKKIDNENYVSAYGEPIMASAMPL 273 Query: 282 IRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG 341 +N T N L R + P + D++ I +G + P Sbjct: 274 QDEINITRNLLIDAVRTHIMPKIMMPKSMGVSREDIETLGKPIYTDDPKGVQILPPPNVN 333 Query: 342 NPLPYHEELN-RLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQS 400 + + L L E I + Q ++ +A E K +E G + Sbjct: 334 SAGMNLQLLESELTEVIGVSPQNNGAQTAQNE---TATEISIKAQEGGRR-SADYIRQYN 389 Query: 401 E-FIGAMISRELDILDSQGN--LPECEGADNPPVSLLKVEY-TSPLFKYQQAESVASALQ 456 E FI + R ++ G ++ P K++ T + K + + +++Q Sbjct: 390 ETFIEPLFDRFAMLVFKYGEDSFFNGFQREDIPSFRFKIQTGTGAMNKEIRRAGIQASMQ 449 Query: 457 GVNTVVELGVKTGDPS-CMDHMDT-DRVSRFSLWATNT 492 + + ++ + GD + ++ +++ L Sbjct: 450 VFSQLYQMYMSIGDANSAYGIINASKELTKELLPILGV 487 >gi|149408206|ref|YP_001294640.1| hypothetical protein ORF047 [Pseudomonas phage PA11] Length = 584 Score = 54.6 bits (130), Expect = 4e-05, Method: Composition-based stats. Identities = 40/295 (13%), Positives = 85/295 (28%), Gaps = 22/295 (7%) Query: 252 IATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAK 311 + P +R R D ++ P + R++ N A L + PP + E Sbjct: 299 FGSAPIYHVGWRFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLKIIGEV- 357 Query: 312 QRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQV-LD 370 F PG + + + + V + ++ + + + + + Sbjct: 358 -EEFVWGPGAEIHLDQGGDVQEIAKNVNYIINADNQIQMLEDRMELYAG--APREAMGIR 414 Query: 371 DKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPP 430 ++A E + G + + E + +++ L+ + Sbjct: 415 TPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRNM---DGSDVIRVM 471 Query: 431 VSLLKV-EYTSPL---------FKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD 480 + L V E+ S + A Q + +V + + H Sbjct: 472 DTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQMILPHTSGK 531 Query: 481 RVSRFSLWATNTPA-VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIG 534 ++ F T + R V +Q E Q V + Q Q Q + Sbjct: 532 ALATFVDDVTGLQGYEIFRPNVAVA---EQAETQSLVAQAQEDLQLQAQMPAEGA 583 >gi|308071876|emb|CBW54797.1| putative head-tail connector protein [Pantoea phage LIMElight] Length = 529 Score = 53.5 bits (127), Expect = 9e-05, Method: Composition-based stats. Identities = 52/464 (11%), Positives = 124/464 (26%), Gaps = 48/464 (10%) Query: 44 RMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQV 103 R + + G+ L+S ++ + P + + ++ Q + ++ Sbjct: 52 RDYQSKGAMLVNNLASKVTQALFPQNNAFFEIGQTAEMLQVAQEMGADAKQAASKFAGIE 111 Query: 104 TDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVY 163 + L ++ G Y + + + + + + Sbjct: 112 VRASARVFLNAG---YSALSHAMKLLIITGNALVYRDPTNKQ--------FHTYSVRDYV 160 Query: 164 MSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + + V + + + + + LS + + E T+ V Sbjct: 161 VKRDGSGKVLCLILKERIALQDLPEDF---RLS---RLQYRTDPFEDVTLYTKV------ 208 Query: 224 DKKKDKGNKGFHSKFVSVDENRFFEEKQIATF--PYIVGRYRVRADEIYGRSPAMEALPT 281 +K G + + V++ + PYI + + E YGR + Sbjct: 209 -TRKHNGARVMYEVTQEVEDYPIGTPSTYPEYLCPYIPLTWNLVTGENYGRGHVEDFAGD 267 Query: 282 IRRLNETVNELAQFGRLSLH-------PPTIAVSEAKQRN-FDLKPGYMNIGALSREGRS 333 RL+E + + I + + + G N G Sbjct: 268 FARLSELSESSLLYEVEMMRLINIIDPGAGIDLDDFMDADCGKAVAGKSNAAG---NGVV 324 Query: 334 LFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGP 393 + ++ L + + F+ D +A E E +G Sbjct: 325 AHEGGNAQKLAAVQNDIANLVQQLSIAFMYT--GNTRDAERVTAEEIRANVSEANQTLGG 382 Query: 394 LIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVAS 453 + L SE + ++ L + + P L V L + +V Sbjct: 383 VYANL-SEVLHLQLAHILSVEEE----PALLQLLMVQGIKLDVSVG--LASLNRQANVER 435 Query: 454 ALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLI 497 LQ + +++ + S + D + + Sbjct: 436 -LQYLANALQIVLPVLTQSS-KRFNPDLIIDAMCQGYGVDREAL 477 >gi|157828580|ref|YP_001494822.1| hypothetical protein A1G_04000 [Rickettsia rickettsii str. 'Sheila Smith'] gi|157801061|gb|ABV76314.1| hypothetical protein A1G_04000 [Rickettsia rickettsii str. 'Sheila Smith'] Length = 59 Score = 52.7 bits (125), Expect = 2e-04, Method: Composition-based stats. Identities = 10/42 (23%), Positives = 17/42 (40%) Query: 101 DQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEAD 142 + + S F + F+ ++ FGT FY+E D Sbjct: 4 QMIEKAIMDIFNNPASNFYNQIHQFFLNLAAFGTAIFYVEED 45 >gi|315929405|gb|EFV08607.1| hypothetical protein CSS_1407 [Campylobacter jejuni subsp. jejuni 305] Length = 512 Score = 50.0 bits (118), Expect = 0.001, Method: Composition-based stats. Identities = 67/517 (12%), Positives = 157/517 (30%), Gaps = 62/517 (11%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGF-------LYPYKNNAQLRMWDTTGSEAC 54 N + + K+ +EL + + + ++ Sbjct: 7 NDERVSFLTQLISESKSGYENYKPHFKELQDAYLLENKVMQKLRKRNKSSIYIP------ 60 Query: 55 IKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERS 114 K+++ + LIT + +E + + ++ +D + W + + Sbjct: 61 -KINAKVKYLITSLNDVYFN-SERMADIETYINSDD---TIIELWQNAID------FYSG 109 Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEE----GIRYISVPLSNVYMSVNHQN 170 + Q + V+ GT + +E I + L+ S + Sbjct: 110 KINMFKIFQPLFLDVLLVGTSIAKVTWHKGMPRIERVDIDSIFFDPNALN----SEDVGY 165 Query: 171 VVDSVYREFTFTVDQI--VSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKD 228 +V+ +Y T +QI K G K +E ++ + +Y + D+ Sbjct: 166 IVNEIY----LTYNQIHERQKLGFYKKIEIKKLFDEDDEYKKVKLYD-IYERKNDDEWVV 220 Query: 229 KGNKGFHSKFVSVD---ENRFFEEKQIATFPYIVGRYRVRADE----IYGRSPAMEALPT 281 S + ++ Q + ++ + + +E YG A+P Sbjct: 221 -------STLFENNLLRNEVTLQDGQPFIWGSMLPQLKKIDNENYVSAYGEPIMASAMPL 273 Query: 282 IRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG 341 +N T N L R + P + D++ I +G + P Sbjct: 274 QDEINITRNLLIDAVRTHIMPKIMMPKSMGVSREDIETLGKPIYTDDPKGVQILPPPNVN 333 Query: 342 NPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 + + L E + + +A E K +E G +E Sbjct: 334 SAGMNLQLLE--SELTEVTGVSPQNNGAQTAQNETATEISIKAQEGGRR-SADYIRQYNE 390 Query: 402 -FIGAMISRELDILDSQGN--LPECEGADNPPVSLLKVEY-TSPLFKYQQAESVASALQG 457 FI + R ++ G ++ P K++ T + K + + +++Q Sbjct: 391 TFIEPLFDRFAMLVFKYGEDSFFNGFQREDIPSFRFKIQTGTGAMNKEIRRAGIQASMQV 450 Query: 458 VNTVVELGVKTGDPS-CMDHMDT-DRVSRFSLWATNT 492 + + ++ + GD + ++ +++ L Sbjct: 451 FSQLYQMYMSIGDANSAYGIINASKELTKELLPILGV 487 >gi|281306687|ref|YP_003345493.1| predicted phage head-tail connector protein [Pseudomonas phage phi-2] gi|271277992|emb|CBH51598.1| predicted phage head-tail connector protein [Pseudomonas phage phi-2] Length = 518 Score = 50.0 bits (118), Expect = 0.001, Method: Composition-based stats. Identities = 60/474 (12%), Positives = 142/474 (29%), Gaps = 61/474 (12%) Query: 46 WDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYK--EDARSKKVREWCD-Q 102 + + G+ L++ L + + P G + S + A + + + + D + Sbjct: 51 FQSVGALLTNNLTAKLVASLFPSGVPFFKNMPSKTLLAAAVEQSINEQEVNNMLARLDRE 110 Query: 103 VTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNV 162 T+ LF ++ L ++ G Y + + + + Sbjct: 111 ATERLFVQATTAK------LTRLLKLLIITGNALAYRDPKTGKM--------TVWSIRSY 156 Query: 163 YMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNEN---------ERFTI 213 + +R QI ++ L +++ + + FT+ Sbjct: 157 VVRRAADGE----FRHVVL--KQI-MRF--DELPEHVQADYTAKKPGQYKPDRMMDYFTV 207 Query: 214 IHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATF--PYIVGRYRVRADEIYG 271 I K+ NK + +D R E P+IV + + E YG Sbjct: 208 IE---------KQPGAVNKRVV-VWNEIDGLRVGPESSYPEHLAPWIVTVWNLADGEHYG 257 Query: 272 RSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFD--LKPGYMNIGALSR 329 R + +++ +L + +L V E+ D + + Sbjct: 258 RGLVEDFTGDFAKVSLVSEQLGLYELEALS-LLNVVDESAGGVIDEYQESDTGDYVRGKT 316 Query: 330 EGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGA 389 + ++ + E + + + + F+ +A E +E + Sbjct: 317 AAITSYERGDYNKINAVRESIGEVIQRLSMAFMYT--GNTRQAERVTAEEIRAVAKEAES 374 Query: 390 FVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAE 449 +G + L G + + D +L G + + L + + + Sbjct: 375 TLGGVYSLLAETLQGPLAYLCM--ADVADDL--MMGLVTKQYKPVILTGIPALSRAVEMQ 430 Query: 450 SVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEV 503 ++ +A Q + +V + D +D +V+ + + I EV Sbjct: 431 NLLAATQEIAAIVP-ALTQLDT----RVDGSKVADLIYNSRSVDVSRIFKEPEV 479 >gi|153951607|ref|YP_001398216.1| hypothetical protein JJD26997_1133 [Campylobacter jejuni subsp. doylei 269.97] gi|153952365|ref|YP_001397542.1| hypothetical protein JJD26997_0326 [Campylobacter jejuni subsp. doylei 269.97] gi|152939053|gb|ABS43794.1| conserved hypothetical protein [Campylobacter jejuni subsp. doylei 269.97] gi|152939811|gb|ABS44552.1| hypothetical protein JJD26997_0326 [Campylobacter jejuni subsp. doylei 269.97] Length = 507 Score = 49.3 bits (116), Expect = 0.002, Method: Composition-based stats. Identities = 64/507 (12%), Positives = 157/507 (30%), Gaps = 56/507 (11%) Query: 9 IQDRFNYLKNQRGELNYWMEELTG-------FLYPYKNNAQLRMWDTTGSEACIKLSSLL 61 + + K+ +EL + + + ++ K+++ + Sbjct: 12 LTQLISESKSGYENYKPHFKELQDAYLLENKIMQKLRKRNKSSIYIP-------KINAKV 64 Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC 121 LIT + + +E + + ++ +D + W + + + Sbjct: 65 KYLITSLNEVYFN-SERMADIETYINSDD---TIIELWQNAID------FYSGKINMFKI 114 Query: 122 LQSFYTSVVEFGTGCFYMEADVDEKGLEE----GIRYISVPLSNVYMSVNHQNVVDSVYR 177 Q + V+ GT + +E I + L+ S + +V+ +Y Sbjct: 115 FQPLFLDVLLVGTSIAKLTWHKGMPRIERVGIDSIFFDPNALN----SEDVGYIVNEIY- 169 Query: 178 EFTFTVDQI--VSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFH 235 T ++I K G K +E ++ + +Y + D H Sbjct: 170 ---LTYNEIYERQKLGFYKKLETPKLLDEEDEYKKVKLYD-IYERKNDDAWVVSTLFENH 225 Query: 236 SKFVSVDENRFFEEKQIATFPYIVGRYRVRADE----IYGRSPAMEALPTIRRLNETVNE 291 + ++ Q + ++ + + +E YG A+P +N T N Sbjct: 226 ----LLRNEVILQDGQPFVWGSMLPQLKKIDNENYVSAYGEPIMASAMPLQDEINITRNL 281 Query: 292 LAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELN 351 L R + P + D++ + +G + P + + L Sbjct: 282 LIDAVRTHIMPKIMLPKSMGVSREDIETLGKPLYTDDPKGVQILPPPDVNSAGMNLQLLE 341 Query: 352 RLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE-FIGAMISRE 410 E + + +A E K +E G +E FI + R Sbjct: 342 --SELTEVTGVSPQNNGAQTAHNETATEISIKAQEGGRR-SADYIRQYNETFIEPLFDRF 398 Query: 411 LDILDSQGN--LPECEGADNPPVSLLKVEY-TSPLFKYQQAESVASALQGVNTVVELGVK 467 ++ G + ++ P K++ T + K + + +++Q + + ++ + Sbjct: 399 AMLVFKYGEDNFFKGFQREDIPSFRFKIQTGTGAMNKEIRRAGIQASMQVFSQLYQMYMS 458 Query: 468 TGDPS-CMDHMDT-DRVSRFSLWATNT 492 GD + ++ +++ L Sbjct: 459 IGDTNSAYGIINASKELTKELLPILGV 485 >gi|283956319|ref|ZP_06373799.1| hypothetical protein C1336_000250090 [Campylobacter jejuni subsp. jejuni 1336] gi|283792039|gb|EFC30828.1| hypothetical protein C1336_000250090 [Campylobacter jejuni subsp. jejuni 1336] Length = 512 Score = 49.3 bits (116), Expect = 0.002, Method: Composition-based stats. Identities = 66/517 (12%), Positives = 158/517 (30%), Gaps = 62/517 (11%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGF-------LYPYKNNAQLRMWDTTGSEAC 54 N + + + K+ +EL + + + ++ Sbjct: 7 NDKRVSFLTQLISESKSGYENYKPHFKELQDAYLLENKVMQKLRKRNKSSIYIP------ 60 Query: 55 IKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERS 114 K+++ + LIT + +E + + ++ +D + W + + Sbjct: 61 -KINAKVKYLITSLNDVYFN-SERMADIETYINSDD---TIIELWQNAID------FYSG 109 Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEE----GIRYISVPLSNVYMSVNHQN 170 + Q + V+ GT + +E I + L+ S + Sbjct: 110 KINMFKIFQPLFLDVLLVGTSIAKVTWHKGMPRIERVDIDSIFFDPNALN----SEDVGY 165 Query: 171 VVDSVYREFTFTVDQIVSK--WGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKD 228 +V+ +Y T +QI + G K +E ++ + +Y + D+ Sbjct: 166 IVNEIY----LTYNQIHERQNLGFYKNIEIQKLFDEDDEYKKVKLYD-IYERKNDDEWVV 220 Query: 229 KGNKGFHSKFVSVD---ENRFFEEKQIATFPYIVGRYRVRADE----IYGRSPAMEALPT 281 S + ++ Q + ++ + + +E YG A+P Sbjct: 221 -------STLFENNLLRNKVTLQDGQPFVWGSMLPQLKKIDNENYVSAYGEPIMASAMPL 273 Query: 282 IRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG 341 +N T N L R + P + D++ I +G + P Sbjct: 274 QDEINITRNLLIDAVRTHIMPKIMMPKSMGVSREDIETLGKPIYTDDPKGVQILPPPNVN 333 Query: 342 NPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 + + L E + + +A E K +E G +E Sbjct: 334 SAGMNLQLLE--SELTEVTGVSPQNNGAQTAQNETATEISIKAQEGGRR-SADYIRQYNE 390 Query: 402 -FIGAMISRELDILDSQGN--LPECEGADNPPVSLLKVEY-TSPLFKYQQAESVASALQG 457 FI + R ++ G ++ P K++ T + K + + +++Q Sbjct: 391 TFIEPLFDRFAMLVFKYGEDNFFNGFQREDIPSFRFKIQTGTGAMNKEIRRAGIQASMQV 450 Query: 458 VNTVVELGVKTGDPS-CMDHMDT-DRVSRFSLWATNT 492 + + ++ + GD + ++ +++ L Sbjct: 451 FSQLYQMYMSIGDANSAYGIINASKELTKELLPILGV 487 >gi|327273550|ref|XP_003221543.1| PREDICTED: dnaJ homolog subfamily C member 2-like [Anolis carolinensis] Length = 619 Score = 47.7 bits (112), Expect = 0.005, Method: Composition-based stats. Identities = 18/111 (16%), Positives = 41/111 (36%), Gaps = 6/111 (5%) Query: 194 VLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFH--SKFVSVDENRFFEEKQ 251 L K + + ++F H V P++ ++ + S + + ++ E+ Sbjct: 506 KLDPHQKDDINKKAFDKFKKEHGVVPQADNATPSERFEAPYGDSSPWTTEEQK--LLEQA 563 Query: 252 IATFPYIVG-RYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLH 301 + T+P R+ A + GRS + + + L E V ++ Sbjct: 564 LKTYPVNTPERWEKIAASVPGRS-KKDCMKRYKELVEMVKAKKAAQEQVVN 613 >gi|157828622|ref|YP_001494864.1| hypothetical protein A1G_04250 [Rickettsia rickettsii str. 'Sheila Smith'] gi|157801103|gb|ABV76356.1| hypothetical protein A1G_04250 [Rickettsia rickettsii str. 'Sheila Smith'] Length = 56 Score = 46.6 bits (109), Expect = 0.012, Method: Composition-based stats. Identities = 11/54 (20%), Positives = 25/54 (46%), Gaps = 1/54 (1%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEAC 54 M+ + F+ LK++R + N +EL ++ P + ++D+T + Sbjct: 1 MHDNELNKKIEYFDNLKSKREKWNQRWDELKRYVCP-QTERNKVIFDSTSIGSL 53 >gi|319956914|ref|YP_004168177.1| hypothetical protein Nitsa_1175 [Nitratifractor salsuginis DSM 16511] gi|319419318|gb|ADV46428.1| hypothetical protein Nitsa_1175 [Nitratifractor salsuginis DSM 16511] Length = 561 Score = 43.9 bits (102), Expect = 0.077, Method: Composition-based stats. Identities = 71/428 (16%), Positives = 127/428 (29%), Gaps = 44/428 (10%) Query: 73 HGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEF 132 + + + EW + R + + + Sbjct: 78 FAKLTPQVPTPESIKDVQKLQRALDEWTTK------------RINLYTRFKPSVLDALIY 125 Query: 133 GTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVY--REFTFTVDQIVSKW 190 GT + + +R V L N+Y+ N NV D Y T T+ + ++ Sbjct: 126 GTPIMKIYWADGQ------LRIERVKLKNMYLDPNASNVFDIQYCVHRVTTTIGNLRQQF 179 Query: 191 GDKVLSSKMKSALARNEN-----ERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENR 245 G K K K+ + +E+ + A + D + + K + S + D Sbjct: 180 GRKF---KWKNYIGDSEDGTSYLSSADLGDASRI-EVRDVYRYQSGKWYVSTVLPGDAFV 235 Query: 246 FFEEKQIATFPYIV----GRY----RVRADEIYGRSPAMEALPTIRRLNETVNELAQFGR 297 +E P+I+ ++ A E YG S +P T N+ Sbjct: 236 RLDEPLKDGLPFIIGSVEPQFVRLDESNAVEAYGGSFIEPMIPLQEEYTVTRNQQIDAIA 295 Query: 298 LSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESI 357 SL +A + DL I S Q + + + L+ + + Sbjct: 296 ESLSKRFLATKTSGLNEKDLLSNRTKISVSSLNEVKELQAPRIDPSIFGIDRLDSEMQEV 355 Query: 358 RSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQ 417 + + +++A T E A + ++ L F I R + ++ Sbjct: 356 SGITKYNQGLNDPHNLNQTATGVSILTEEGNAVIADIVRALNESFFEPAIRRMVRLIYKY 415 Query: 418 GNLPECEGADNPPVSLLKVE-------YTSPLFKYQQAESVASALQGVNTVVELGVKTGD 470 G P G D V + L A + +ALQ V EL Sbjct: 416 GESPIFYGLDRTKDLRFYVTINAGVGAVNNELLLNNIAAAEGAALQNVKLAAELQDAERA 475 Query: 471 PSCMDHMD 478 MD +D Sbjct: 476 KRYMDVLD 483 >gi|119703755|ref|NP_002283.3| laminin subunit beta-2 precursor [Homo sapiens] gi|156630892|sp|P55268|LAMB2_HUMAN RecName: Full=Laminin subunit beta-2; AltName: Full=Laminin B1s chain; AltName: Full=Laminin-11 subunit beta; AltName: Full=Laminin-14 subunit beta; AltName: Full=Laminin-15 subunit beta; AltName: Full=Laminin-3 subunit beta; AltName: Full=Laminin-4 subunit beta; AltName: Full=Laminin-7 subunit beta; AltName: Full=Laminin-9 subunit beta; AltName: Full=S-laminin subunit beta; Short=S-LAM beta; Flags: Precursor gi|119585362|gb|EAW64958.1| laminin, beta 2 (laminin S), isoform CRA_a [Homo sapiens] gi|119585363|gb|EAW64959.1| laminin, beta 2 (laminin S), isoform CRA_a [Homo sapiens] gi|225000494|gb|AAI72384.1| Laminin, beta 2 (laminin S) [synthetic construct] Length = 1798 Score = 43.5 bits (101), Expect = 0.11, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 72/219 (32%), Gaps = 25/219 (11%) Query: 345 PYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKT-----REKGAFVGPLIGGLQ 399 ++EL L +S++ L+ D A +E + + G + ++ Sbjct: 1507 QANQELQELIQSVKD--FLNQEGADPDSIEMVATRVLELSIPASAEQIQHLAGAIAERVR 1564 Query: 400 SEF-IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGV 458 S + A+++R + + L + K Q+AE+V +AL+ Sbjct: 1565 SLADVDAILARTVGDVRRAEQLLQDARRARSWAEDEK----------QKAETVQAALEEA 1614 Query: 459 NTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVM 517 + + D DT++ + + + E RQ + + Sbjct: 1615 QRAQGIAQGAIRGAVADTRDTEQTLYQVQERMAGA-ERALSSAGERA--RQLDALLEALK 1671 Query: 518 EEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMENSYG 556 ++ T+++ A GRA E ++ G Sbjct: 1672 LKRAGNSLAASTAEETAGSAQGRAQE---AEQLLRGPLG 1707 >gi|8170714|gb|AAB34682.2| laminin beta 2 chain [Homo sapiens] Length = 1798 Score = 43.5 bits (101), Expect = 0.11, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 72/219 (32%), Gaps = 25/219 (11%) Query: 345 PYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKT-----REKGAFVGPLIGGLQ 399 ++EL L +S++ L+ D A +E + + G + ++ Sbjct: 1507 QANQELQELIQSVKD--FLNQEGADPDSIEMVATRVLELSIPASAEQIQHLAGAIAERVR 1564 Query: 400 SEF-IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGV 458 S + A+++R + + L + K Q+AE+V +AL+ Sbjct: 1565 SLADVDAILARTVGDVRRAEQLLQDARRARSWAEDEK----------QKAETVQAALEEA 1614 Query: 459 NTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVM 517 + + D DT++ + + + E RQ + + Sbjct: 1615 QRAQGIAQGAIRGAVADTRDTEQTLYQVQERMAGA-ERALSSAGERA--RQLDALLEALK 1671 Query: 518 EEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMENSYG 556 ++ T+++ A GRA E ++ G Sbjct: 1672 LKRAGNSLAASTAEETAGSAQGRAQE---AEQLLRGPLG 1707 >gi|1335202|emb|CAA56130.1| beta2/S laminin chain [Homo sapiens] Length = 1798 Score = 43.5 bits (101), Expect = 0.11, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 72/219 (32%), Gaps = 25/219 (11%) Query: 345 PYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKT-----REKGAFVGPLIGGLQ 399 ++EL L +S++ L+ D A +E + + G + ++ Sbjct: 1507 QANQELQELIQSVKD--FLNQEGADPDSIEMVATRVLELSIPASAEQIQHLAGAIAERVR 1564 Query: 400 SEF-IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGV 458 S + A+++R + + L + K Q+AE+V +AL+ Sbjct: 1565 SLADVDAILARTVGDVRRAEQLLQDARRARSWAEDEK----------QKAETVQAALEEA 1614 Query: 459 NTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVM 517 + + D DT++ + + + E RQ + + Sbjct: 1615 QRAQGIAQGAIRGAVADTRDTEQTLYQVQERMAGA-ERALSSAGERA--RQLDALLEALK 1671 Query: 518 EEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMENSYG 556 ++ T+++ A GRA E ++ G Sbjct: 1672 LKRAGNSLAASTAEETAGSAQGRAQE---AEQLLRGPLG 1707 >gi|1103585|emb|CAA92279.1| laminin beta 2 chain [Homo sapiens] Length = 1798 Score = 43.5 bits (101), Expect = 0.11, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 72/219 (32%), Gaps = 25/219 (11%) Query: 345 PYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKT-----REKGAFVGPLIGGLQ 399 ++EL L +S++ L+ D A +E + + G + ++ Sbjct: 1507 QANQELQELIQSVKD--FLNQEGADPDSIEMVATRVLELSIPASAEQIQHLAGAIAERVR 1564 Query: 400 SEF-IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGV 458 S + A+++R + + L + K Q+AE+V +AL+ Sbjct: 1565 SLADVDAILARTVGDVRRAEQLLQDARRARSWAEDEK----------QKAETVQAALEEA 1614 Query: 459 NTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVM 517 + + D DT++ + + + E RQ + + Sbjct: 1615 QRAQGIAQGAIRGAVADTRDTEQTLYQVQERMAGA-ERALSSAGERA--RQLDALLEALK 1671 Query: 518 EEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMENSYG 556 ++ T+++ A GRA E ++ G Sbjct: 1672 LKRAGNSLAASTAEETAGSAQGRAQE---AEQLLRGPLG 1707 >gi|332816911|ref|XP_003309859.1| PREDICTED: LOW QUALITY PROTEIN: laminin subunit beta-2-like [Pan troglodytes] Length = 1792 Score = 43.5 bits (101), Expect = 0.11, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 73/224 (32%), Gaps = 24/224 (10%) Query: 341 GNPLPYHEELNRLKESIRSLF-LLDLFQVLDDKASRSAAESMEKT-----REKGAFVGPL 394 + + L+E I+S+ L+ D A +E + + G + Sbjct: 1494 ASRGQVEQANQELRELIQSVKDFLNQEGADPDSIEMVATRVLELSIPASAEQIQHLAGAI 1553 Query: 395 IGGLQSEF-IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVAS 453 ++S + A+++R + + L + K Q+AE+V + Sbjct: 1554 AERVRSLADVDAILARTVGDVRRAEQLLQDARRARSWAEDEK----------QKAETVQA 1603 Query: 454 ALQGVNTVVELGVKTGDPSCMDHMDTDR-VSRFSLWATNTPAVLIRDTAEVEDIRQQREV 512 AL+ + + D DT++ + + + E RQ + Sbjct: 1604 ALEEAQRAQGIAQGAIRGAVADTRDTEQTLYQVQERMAGA-EQALSSAGERA--RQLDAL 1660 Query: 513 QRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMENSYG 556 + ++ T++++ A GRA E ++ G Sbjct: 1661 LEALKLKRAGNSLAASTAEEMAGSAQGRAQE---AEQLLRGPLG 1701 >gi|310829195|ref|YP_003961552.1| anaerobic ribonucleoside-triphosphate reductase [Eubacterium limosum KIST612] gi|308740929|gb|ADO38589.1| anaerobic ribonucleoside-triphosphate reductase [Eubacterium limosum KIST612] Length = 774 Score = 42.3 bits (98), Expect = 0.24, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 58/189 (30%), Gaps = 29/189 (15%) Query: 54 CIKLSSLLSSLITP--PGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFR 111 ++ L + TP P Q + E + + D + + LF Sbjct: 370 LKATAAGLGNGETPIFPVQI-FKVKEGIN-----YNETDPNYDLFKLAIKTSSMRLFPNF 423 Query: 112 ERSRSGFVGCLQS---FYTSVVEFG--TGCFY--MEADVDEKGLEEGIRYISVPLSNVYM 164 + F + T V G T + + + + S+ L + + Sbjct: 424 SFLDAPFNLQYYEEGDYNTEVAYMGCRTRVMGNHYDPQNETTCGRGNLSFTSINLPRIAL 483 Query: 165 SVNHQNVVDSVYR----EFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPK 220 N +D+ YR + V Q++ ++ A R N F + V+ Sbjct: 484 ESN--GSLDTFYRLLDERVSLVVKQLLHRFKI--------QAAKRGRNYPFLMGQGVWID 533 Query: 221 SLTDKKKDK 229 S + + D+ Sbjct: 534 SESLGRDDR 542 >gi|134300245|ref|YP_001113741.1| flagellar biosynthesis/type III secretory pathway protein-like protein [Desulfotomaculum reducens MI-1] gi|134052945|gb|ABO50916.1| Flagellar biosynthesis/type III secretory pathway protein-like protein [Desulfotomaculum reducens MI-1] Length = 238 Score = 41.9 bits (97), Expect = 0.31, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 42/133 (31%), Gaps = 9/133 (6%) Query: 419 NLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMD 478 LP + L E L + Q AE + A Q +++ + Sbjct: 21 ELPPPPSEEVNQEKQLSPEEIMVLAQQQAAEMINRAKQEAKQIIQQTQSKAEAEA----- 75 Query: 479 TDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEE--QHLQQQLQQTSQDIGAK 536 R R + AE E IRQQ R +E + +++ D+ Sbjct: 76 --RQMREQAKQAGWQEGITASQAEAEKIRQQASDVLRQSKEIYRQTLGKMEAEIVDLAVD 133 Query: 537 AAGRAMEKKLTHD 549 A R + +L + Sbjct: 134 IAERVVLTQLAVE 146 >gi|239907145|ref|YP_002953886.1| hypothetical protein DMR_25090 [Desulfovibrio magneticus RS-1] gi|239797011|dbj|BAH76000.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 682 Score = 41.6 bits (96), Expect = 0.39, Method: Composition-based stats. Identities = 22/129 (17%), Positives = 49/129 (37%), Gaps = 13/129 (10%) Query: 427 DNPPVSLLKVEYTSPLFK-----YQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 D P +K ++ S + + + +Q + +P +D ++ Sbjct: 520 DFNPRPDIKGDF-SVVARGATALMSKEVQSQRLIQFMTMCAS------NPQFAPMLDVNK 572 Query: 482 VSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRA 541 R + PA ++ D A VE + Q+R++ +V EQ + + S + A Sbjct: 573 GLRQVATSMQIPADIVYDQATVE-LNQERQMAMQVRIEQATKLETLLNSMNSRGITPDAA 631 Query: 542 MEKKLTHDM 550 +++ L + Sbjct: 632 LQRMLAEAL 640 >gi|220916211|ref|YP_002491515.1| integrase family protein [Anaeromyxobacter dehalogenans 2CP-1] gi|219954065|gb|ACL64449.1| integrase family protein [Anaeromyxobacter dehalogenans 2CP-1] Length = 466 Score = 40.8 bits (94), Expect = 0.57, Method: Composition-based stats. Identities = 63/376 (16%), Positives = 118/376 (31%), Gaps = 47/376 (12%) Query: 166 VNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTI--IHAVYPKSLT 223 + VD TV QI ++ +LS + ++ +E + + +H V + Sbjct: 89 EEKRGEVDGTAER---TVAQIAQQYRTDILSHRERA------DEAWNVIRVHVVE--AQP 137 Query: 224 DKKKDKGNKGFHSKFVSVDENRFFEEKQIATF-PYIVGRYRVRADEIYGRSPAMEALPTI 282 D K+ + + + D + P + + I G A L Sbjct: 138 DPKRLSFGEWVARQVKASDVATVVRHAKRQRMVPATTRKGEMMTRRIGGAGAARVVL--- 194 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 R L + G L P +A + + Y+ G + ++ F ++ Sbjct: 195 RELKSIFAHAVETGDLDASPAVVAKTRTFGIRATSRSRYLKAGEV----KAFFDALELTA 250 Query: 343 PLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGL---- 398 L + RL ++R L+ L ++ A K V P+ G L Sbjct: 251 LLDGTAKRQRLSPTMRLALAFQLYVPLRSQSLIGAQWIEIDLDAKRWTVPPVAGRLKMRK 310 Query: 399 ----QSE-FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQ--QAESV 451 ++E F+ + S + +L E A + P L SPL + + +A++V Sbjct: 311 EEREEAEGFVVPLPSTAVAMLKRL-----REEAGDSPWVL-----ASPLDRKRHIEAKAV 360 Query: 452 ASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTP-AVLIRDTAEVEDIRQQR 510 AL + T L + + D R R R V +R Sbjct: 361 VRALSRLQTGDRLALGSRVT----VHDLRRTWRTFAMDLGVDNVTAERSLGHVAVLRASG 416 Query: 511 EVQRRVMEEQHLQQQL 526 + + + Sbjct: 417 FGGAADVYGRAQMVEQ 432 >gi|296225177|ref|XP_002758379.1| PREDICTED: laminin subunit beta-2 [Callithrix jacchus] Length = 1798 Score = 40.8 bits (94), Expect = 0.58, Method: Composition-based stats. Identities = 32/224 (14%), Positives = 69/224 (30%), Gaps = 24/224 (10%) Query: 341 GNPLPYHEELNRLKESIRSL-FLLDLFQVLDDKASRSAAESMEKT-----REKGAFVGPL 394 + + L+E I+S+ L+ D A +E + + G + Sbjct: 1500 ASRGQVEQANQELRELIQSVKAFLNQEGADPDSIEMVATRVLELSIPASAEQIQHLAGAI 1559 Query: 395 IGGLQSEF-IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVAS 453 ++S + +++R + + L + K Q+AE+V + Sbjct: 1560 AERVRSLADVDVILARTVGDVRRAEQLLQDARRARSRAENEK----------QKAETVQA 1609 Query: 454 ALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQ 513 AL+ + + D DT++ + E QQ + Sbjct: 1610 ALEEAQRAQGVAQGAIWGAVADTQDTEQTLHQVQERMAGAEQALSSAGERA---QQLDAL 1666 Query: 514 RRVMEEQHLQQQ-LQQTSQDIGAKAAGRAMEKKLTHDMMENSYG 556 ++ + T+++ A GRA E ++ G Sbjct: 1667 LEALKLKRAGNSLAASTAEETAGSAQGRAQE---AEKLLRGPLG 1707 >gi|300712297|ref|YP_003738111.1| hypothetical protein HacjB3_14700 [Halalkalicoccus jeotgali B3] gi|299125980|gb|ADJ16319.1| hypothetical protein HacjB3_14700 [Halalkalicoccus jeotgali B3] Length = 421 Score = 40.8 bits (94), Expect = 0.65, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 60/176 (34%), Gaps = 15/176 (8%) Query: 60 LLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFV 119 +S IT P E+ + + + A+ W + + + S + Sbjct: 121 GTASEITHPHAP--LSGEATPVLEDLIDYQTAQYVDFHAWLGR-----YALFDASLIDYE 173 Query: 120 GCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREF 179 + +V FGT + + + + + + + + N + Y E+ Sbjct: 174 SQIPDLLAAVDAFGTAAITLFTESSVRSHQVLVYDYERSPGRLVLFAYNPNYTAATYEEY 233 Query: 180 TFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFH 235 T+TV+ V G + + A ++F +H Y +++ +++ G Sbjct: 234 TYTVE--VDTSGASPVPRPTEHA----GYDQF--VHNEYDRAIRTRRESAGAGPLA 281 >gi|332715438|ref|YP_004442904.1| Threonine dehydratase [Agrobacterium sp. H13-3] gi|325062123|gb|ADY65813.1| Threonine dehydratase [Agrobacterium sp. H13-3] Length = 339 Score = 40.4 bits (93), Expect = 0.76, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 56/159 (35%), Gaps = 23/159 (14%) Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 + R L L E G LK+E P+ ++ ++ + L +T + G+ Sbjct: 36 VERT--PLVRSDFLSERCGH----PVHLKLETLQPIGAFKLRGAMNAILSLDDTTRQRGL 89 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAVL----IRDTAEVEDIRQQREVQR---RVMEE 519 T + R ++ PA + + +VE IR R R ++ Sbjct: 90 VTASTG-----NHGRAVAYAADKLGIPATICMSALVPANKVEAIRALGAEIRIVGRSQDD 144 Query: 520 QHLQQQLQQTSQDIGAK-----AAGRAMEKKLTHDMMEN 553 + + S+ + A A A + + +++EN Sbjct: 145 AQEEVERLTKSRGLTAIPPFDHADVVAGQGTIGLEVVEN 183 >gi|126340420|ref|XP_001364805.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 621 Score = 40.4 bits (93), Expect = 0.77, Method: Composition-based stats. Identities = 19/109 (17%), Positives = 37/109 (33%), Gaps = 2/109 (1%) Query: 194 VLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIA 253 L K + + ++F H V P S + ++ E + E+ + Sbjct: 508 KLDPHQKDDINKKAFDKFKKEHGVVPHSDSAAPSERFEGLCTDFIPWTTEEQKLLEQALK 567 Query: 254 TFPYIVG-RYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLH 301 T+P R+ A + GRS + + + L E V ++ Sbjct: 568 TYPVNTPERWEKIASTVPGRS-KKDCMKRYKELVEMVKAKKAAQEQVMN 615 >gi|296283404|ref|ZP_06861402.1| peptidase, M16 family protein [Citromicrobium bathyomarinum JL354] Length = 945 Score = 40.4 bits (93), Expect = 0.83, Method: Composition-based stats. Identities = 54/328 (16%), Positives = 91/328 (27%), Gaps = 40/328 (12%) Query: 74 GLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS--FYTSVVE 131 G + SA L AR + R + + + + F + + S Sbjct: 291 GSGSADSAALDVLTAIMARGQSSRLY----DALVRTGKAVDSAMFYSESEEGGYVASFAV 346 Query: 132 FGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWG 191 E D K E IR V + + + DS+ R T + G Sbjct: 347 TNPTADADEVDALLKAELEKIRTQPVSAAELA-EAKSELFADSLRRRE--TARGRAFELG 403 Query: 192 DKVLSSKMKSALARNENERFTIIHAVYPKS--LTDKKKDKGNKGFHSKFVSVDENRFFEE 249 + ++S+ A ++R I AV P+ K N ++V+ +EN Sbjct: 404 EALVSTGNPRAA----DDRLAAIAAVTPEDVQRAAAKWLASNARVDMRYVAGEENPE--- 456 Query: 250 KQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSE 309 P + +R + LP R A R PT+ E Sbjct: 457 --AYANPVPMPTFRSLPA---ATGEPLSVLPEGER----QQPPAAGAR-----PTVVAPE 502 Query: 310 AKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVL 369 ++ + G ++ P+ L + Sbjct: 503 IVEQ-------TLTNGIDVVAAQTGEVPIATMTVLVPGGASTDTRAKAGVAQFA-ASLAD 554 Query: 370 DDKASRSAAESMEKTREKGAFVGPLIGG 397 A+ SA E + GA G G Sbjct: 555 QGTANMSAQEIAARLESLGASFGATAGR 582 >gi|229845187|ref|ZP_04465321.1| potassium efflux protein KefA [Haemophilus influenzae 6P18H1] gi|229811898|gb|EEP47593.1| potassium efflux protein KefA [Haemophilus influenzae 6P18H1] Length = 1107 Score = 40.4 bits (93), Expect = 0.84, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 50/149 (33%), Gaps = 19/149 (12%) Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGD 470 + SQG L P LK + L Q+ + + + + Sbjct: 16 FTLSVSQGVLGANSTNVLPTEQSLKAD----LANAQKMSEGEAKKRLLAELQTSIDLLQQ 71 Query: 471 PSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRV---MEEQHLQQ--- 524 ++ D + + + + ++ AE++ +++Q+E + Q Q Sbjct: 72 IQAQQKIN-DALQTTLSHSE---SEIRKNNAEIQALKKQQETATSTDYNAQSQDDLQNSL 127 Query: 525 -----QLQQTSQDIGAKAAGRAMEKKLTH 548 QLQ T +GA A A + ++ Sbjct: 128 AKLNDQLQDTQNALGAANAQLAGQNSISE 156 >gi|145635631|ref|ZP_01791328.1| potassium efflux protein KefA [Haemophilus influenzae PittAA] gi|145267104|gb|EDK07111.1| potassium efflux protein KefA [Haemophilus influenzae PittAA] Length = 1112 Score = 40.4 bits (93), Expect = 0.84, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 50/149 (33%), Gaps = 19/149 (12%) Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGD 470 + SQG L P LK + L Q+ + + + + Sbjct: 21 FTLSVSQGVLGANSTNVLPTEQSLKAD----LANAQKMSEGEAKKRLLAELQTSIDLLQQ 76 Query: 471 PSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRV---MEEQHLQQ--- 524 ++ D + + + + ++ AE++ +++Q+E + Q Q Sbjct: 77 IQAQQKIN-DALQTTLSHSE---SEIRKNNAEIQALKKQQETATSTDYNAQSQDDLQNSL 132 Query: 525 -----QLQQTSQDIGAKAAGRAMEKKLTH 548 QLQ T +GA A A + ++ Sbjct: 133 AKLNDQLQDTQNALGAANAQLAGQNSISE 161 >gi|39794437|gb|AAH64251.1| dnajc2-prov protein [Xenopus (Silurana) tropicalis] Length = 635 Score = 40.4 bits (93), Expect = 0.91, Method: Composition-based stats. Identities = 20/110 (18%), Positives = 39/110 (35%), Gaps = 3/110 (2%) Query: 194 VLSSKMKSALARNENERFTIIHAVYPKS-LTDKKKDKGNKGFHSKFVSVDENRFFEEKQI 252 L + K + + ++F H V P+S ++ E + E+ + Sbjct: 521 KLDPQQKDDINKKAFDKFKKEHRVVPQSVDNAVPSERFEGPAADMSPWTTEEQKLLEQAL 580 Query: 253 ATFPYIVG-RYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLH 301 T+P R+ A+ + GRS + + + L E V L+ Sbjct: 581 KTYPVNTPERWEKIAEAVPGRS-KKDCMKRYKELVEMVKAKKAAQEQVLN 629 >gi|313747464|ref|NP_001186412.1| dnaJ homolog subfamily C member 2 [Xenopus (Silurana) tropicalis] gi|325530079|sp|Q6P2Y3|DNJC2_XENTR RecName: Full=DnaJ homolog subfamily C member 2 Length = 620 Score = 40.0 bits (92), Expect = 0.96, Method: Composition-based stats. Identities = 20/110 (18%), Positives = 39/110 (35%), Gaps = 3/110 (2%) Query: 194 VLSSKMKSALARNENERFTIIHAVYPKS-LTDKKKDKGNKGFHSKFVSVDENRFFEEKQI 252 L + K + + ++F H V P+S ++ E + E+ + Sbjct: 506 KLDPQQKDDINKKAFDKFKKEHRVVPQSVDNAVPSERFEGPAADMSPWTTEEQKLLEQAL 565 Query: 253 ATFPYIVG-RYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLH 301 T+P R+ A+ + GRS + + + L E V L+ Sbjct: 566 KTYPVNTPERWEKIAEAVPGRS-KKDCMKRYKELVEMVKAKKAAQEQVLN 614 >gi|326778851|ref|ZP_08238116.1| YcaO-domain protein [Streptomyces cf. griseus XylebKG-1] gi|326659184|gb|EGE44030.1| YcaO-domain protein [Streptomyces cf. griseus XylebKG-1] Length = 777 Score = 40.0 bits (92), Expect = 1.1, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 36/109 (33%), Gaps = 19/109 (17%) Query: 256 PYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNF 315 P R+ V A + GR+ A AL +L +L+ P AV Sbjct: 659 PSAAPRWAVGAG-LSGRAAAASAL----------RDLLGQAQLAAEDPGEAVDTGDPLVV 707 Query: 316 DLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLD 364 DL PG + +G S + + L L+ + R + Sbjct: 708 DLAPGAIAVGGGSVAADAAET--------TFDAVLEALRSAGRDALYVP 748 >gi|182438202|ref|YP_001825921.1| hypothetical protein SGR_4409 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178466718|dbj|BAG21238.1| hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 771 Score = 40.0 bits (92), Expect = 1.2, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 36/109 (33%), Gaps = 19/109 (17%) Query: 256 PYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNF 315 P R+ V A + GR+ A AL +L +L+ P AV Sbjct: 653 PSAAPRWAVGAG-LSGRAAAASAL----------RDLLGQAQLAAEDPGEAVDTGDPLVV 701 Query: 316 DLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLD 364 DL PG + +G S + + L L+ + R + Sbjct: 702 DLAPGAIAVGGGSVAADAAET--------TFDAVLEALRSAGRDALYVP 742 >gi|148747833|ref|YP_001285799.1| portal protein [Phormidium phage Pf-WMP3] gi|146230066|gb|ABQ12474.1| portal protein [Phormidium phage Pf-WMP3] Length = 651 Score = 39.6 bits (91), Expect = 1.5, Method: Composition-based stats. Identities = 84/637 (13%), Positives = 183/637 (28%), Gaps = 124/637 (19%) Query: 9 IQDRFNYLKNQRGELNYWMEE--------------LTGFLYPYKNNAQL----RMWDTTG 50 ++ + + R E L + + ++ Sbjct: 25 VKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKITTGKA 84 Query: 51 SEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 EA + + L S T P + W + + L S+ ++ + G Sbjct: 85 FEAIETIHAYLMSA-TFPNKNWFDVVPAKPGQDNLL-----VSRLIKRYVQD--KLTEGK 136 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGC------------FYMEAD-------VDEKGLEEG 151 + + F+ L SV+ + D +E+ ++ Sbjct: 137 FRAAYANFLRQLLITGNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSS 196 Query: 152 IRYISVPLSNVYMSVN----HQNVVDSVYREFTFTVDQIVS------KWGDKVLSSKMKS 201 + + + + + N ++ + R+ T T I++ +G L ++ Sbjct: 197 PDFEVLDMFDCFYDPNVTDPNRG---AFIRKLTKTKADILNLLSEGYYYGVDPL-DVVEH 252 Query: 202 ALARNENERFTII-------------HAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFE 248 + + ++ H NK +H V++ N Sbjct: 253 KCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNEVLR 312 Query: 249 EKQIATFPYIVGR------YRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 PY GR Y A + Y L + LN N+ L++ Sbjct: 313 ---FEQNPYWCGRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQ 369 Query: 303 PTIAVSEAKQRNFD--LKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSL 360 S+ + D +PG + + + + + L Q N ++E + L+ +I Sbjct: 370 MYTLRSDGLLQPEDVYTEPGKVFLVSDHGDLQPLAN--QSSNFSITYQESSFLESTIDKN 427 Query: 361 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPL-IGGLQSEF----IGAMISRELDILD 415 F + A+RS G + G+ + ++ + + ++ Sbjct: 428 FGT--GNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQ 485 Query: 416 SQGNLPEC---EGADNPPVSLLKVEYTSPLFKYQQAESVAS---------ALQGVNTVVE 463 + P G + +++ L K + + S + + Sbjct: 486 QFTDQPGMVRVAGDEAGAYEYYELD-VEDLQKEVRLVPIGSDHVIERKQYIEDRLTFIQA 544 Query: 464 LGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQ 523 + P +D R+ L E E +Q++ Q ++ L Sbjct: 545 VA---QVPEMGQLVDYKRILVDLLQHWGF--------EEPEAYLKQQDQQAPANPQEALL 593 Query: 524 QQLQQTSQDIGAKAAGRAMEKKLTHD----MMENSYG 556 Q +D+G +A ++ +L D MM YG Sbjct: 594 SQA----KDVGGQAMSNMLQNQLQADGGTQMMSEMYG 626 >gi|156546841|ref|XP_001606394.1| PREDICTED: hypothetical protein [Nasonia vitripennis] Length = 886 Score = 39.6 bits (91), Expect = 1.5, Method: Composition-based stats. Identities = 19/79 (24%), Positives = 31/79 (39%), Gaps = 6/79 (7%) Query: 475 DHMDTDRVSRFSLWATNTPAVLIRDTAEV------EDIRQQREVQRRVMEEQHLQQQLQQ 528 D D D + + + IR EV E++R+QRE R E + Q ++ Sbjct: 123 DAPDLDLAADYPAKKQISAPGEIRREYEVQLQMVEEEMRRQREKDRLASEAIIRKIQQEE 182 Query: 529 TSQDIGAKAAGRAMEKKLT 547 Q + A + + K L Sbjct: 183 EQQKLVQLAQDQLLAKTLA 201 >gi|332023899|gb|EGI64119.1| Guanine nucleotide-binding protein-like 3-like protein [Acromyrmex echinatior] Length = 546 Score = 39.2 bits (90), Expect = 1.6, Method: Composition-based stats. Identities = 15/51 (29%), Positives = 30/51 (58%) Query: 502 EVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMME 552 EVE +++QRE +++ +E +++ +Q ++D +A K+L H ME Sbjct: 50 EVEAMKKQREEEKQKQKEAARERKREQLAKDGLQGLVKQAENKQLAHKSME 100 >gi|325266040|ref|ZP_08132726.1| S-adenosylmethionine:tRNA ribosyltransferase-isomerase [Kingella denitrificans ATCC 33394] gi|324982678|gb|EGC18304.1| S-adenosylmethionine:tRNA ribosyltransferase-isomerase [Kingella denitrificans ATCC 33394] Length = 340 Score = 39.2 bits (90), Expect = 1.7, Method: Composition-based stats. Identities = 22/148 (14%), Positives = 43/148 (29%), Gaps = 25/148 (16%) Query: 410 ELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESV----------------AS 453 +L+ G LP P Y + KYQ A + Sbjct: 134 VYTLLEEYGALPLPPYIVRPADDNDDARYQTVYAKYQGAVAAPTAGLHFTHEILSALQQK 193 Query: 454 ALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQ 513 ++ + +G T P +D++ ++ A V I+ + Sbjct: 194 GVEFAEVTLHVGAGTFQPVRVDNIAEHKMHSEWFD---------VPEATVAKIQAAKARG 244 Query: 514 RRVMEEQHLQQQLQQTSQDIGAKAAGRA 541 RV + +++ G+ AG+ Sbjct: 245 NRVWSVGTTSLRAIESAARSGSLHAGQG 272 >gi|325171218|ref|YP_004251190.1| hypothetical protein ViPhICP2p19 [Vibrio phage ICP2] gi|323512244|gb|ADX87701.1| conserved hypothetical protein [Vibrio phage ICP2] gi|323512316|gb|ADX87772.1| hypothetical protein TU12-16_00090 [Vibrio phage ICP2_2006_A] Length = 581 Score = 38.9 bits (89), Expect = 2.3, Method: Composition-based stats. Identities = 78/596 (13%), Positives = 171/596 (28%), Gaps = 112/596 (18%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFL------------YPYKNNAQLRMWDTTGSEA 53 A+ I + + +QR E EL ++ P+KN TT + Sbjct: 20 AEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNK-------TTLPKL 72 Query: 54 CIKLSSLLSSLITP---PGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 C L S P ++W ++ +++A+ ++++ D Sbjct: 73 CQI-RDNLHSNYISALFPNERWLK-------WEGKSLQDEAKRDAIQQYMDNKVKE---- 120 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQN 170 S F + +++G +E + EE + ++ ++ Sbjct: 121 -----SDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRAVRIDPKD 175 Query: 171 VV-DSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDK 229 +V + V +F + I VL+ + +++ E ++ A+ + + Sbjct: 176 IVFNPVAVDFAHSPKII-----RTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGT 230 Query: 230 GNKG-------------------FHSKFVSV------------------------DENRF 246 + F S +V V D Sbjct: 231 YTREDCEKAVGFSMDGFGNLYDYFQSPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFV 290 Query: 247 FEE----KQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 EE A P +R+R D +Y P + R++ N A L P Sbjct: 291 IEEKENPSWFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFP 350 Query: 303 PTIA---VSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPY-HEELNRLKESIR 358 P V E + Y+N Q +Q + ++ + R Sbjct: 351 PMKVKGDVEEFVWGPMEQI--YINGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPR 408 Query: 359 SLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQG 418 + ++A E + G I + + +++ L+I Sbjct: 409 EAM------GIRTPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNL 462 Query: 419 NLPECEGADNPPVSL---LKVEYTSPLFKYQ----QAESVASALQGVNTVVELGVKTGDP 471 ++ + + + + V K + A A Q V +++ + Sbjct: 463 DVADTIRVFDSDDKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQ 522 Query: 472 SCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQ 527 H+ T+ +++ + I V + Q + ++++ Q Sbjct: 523 DIKPHVSTENLAKMLEHNLSLGGWDIFKPN-VAVMEAQTTSALVNQSQAQIEEEAQ 577 >gi|307293763|ref|ZP_07573607.1| integral membrane sensor signal transduction histidine kinase [Sphingobium chlorophenolicum L-1] gi|306879914|gb|EFN11131.1| integral membrane sensor signal transduction histidine kinase [Sphingobium chlorophenolicum L-1] Length = 451 Score = 38.9 bits (89), Expect = 2.6, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 57/177 (32%), Gaps = 16/177 (9%) Query: 392 GPLIGGLQSEFIGAMISRELDILDSQG-NLPECEGADNPPVSLLKVEYTSPLF---KYQQ 447 GP + AM SR +L+ + L P++ L+V S + + Sbjct: 223 GPGDVRQLTMAFNAMRSRIFAMLNEKDRMLGAIGHDLRTPLASLRVRAESVEDEGERARM 282 Query: 448 AESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLW---ATNTPAVL-------- 496 +E++ + + ++ L +D ++ + +P L Sbjct: 283 SETIDEMNRMLEDILSLARAGRSTEAQQKVDLSALADAVVEDFLELGSPVDLADSERVVA 342 Query: 497 -IRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMME 552 +R +R E E H+ + + + + G + + +MME Sbjct: 343 NVRPQQIRRALRNLIENAIVYGERAHVSVERGEGAIRLVVADDGPGISEDRMEEMME 399 >gi|62768239|gb|AAY00027.1| SA1_PKSC [uncultured bacterial symbiont of Discodermia dissoluta] Length = 3592 Score = 38.5 bits (88), Expect = 2.8, Method: Composition-based stats. Identities = 20/119 (16%), Positives = 34/119 (28%), Gaps = 20/119 (16%) Query: 392 GPLIGGLQSEFIGAMISRELDILDSQGNLPECEGA--DNPPVSLLKVEYTSPLFKYQQAE 449 P E + +++ REL LP+ D SL+ VE L + Sbjct: 2545 SPTAAR--EELLVSLLQRELQGALRMHALPDPTVGFFDLGMDSLMAVELRGRLNRA---- 2598 Query: 450 SVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQ 508 + + + D+ +T ++R P R V R Sbjct: 2599 ------------FDGDYILSNTAVFDYPNTVELARHIASGLGVPPEDERPRPRVFSQRD 2645 >gi|167534917|ref|XP_001749133.1| hypothetical protein [Monosiga brevicollis MX1] gi|163772286|gb|EDQ85939.1| predicted protein [Monosiga brevicollis MX1] Length = 802 Score = 38.5 bits (88), Expect = 2.8, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 57/200 (28%), Gaps = 22/200 (11%) Query: 277 EALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQ 336 +A+P + + S PP VS+ G +S + ++ Sbjct: 515 KAIPDPDMAQKRRRRSLKAQPQSSLPPLKRVSDVHVE------GTRPWYQISNQDEAVEA 568 Query: 337 PVQFGNPLPYHEELNR---------LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREK 387 Q + L E ++ IR LF++ + ++ E Sbjct: 569 VDQLISQLAGWGEQTDEERHDVLMLMQRRIRDR----LFEMRQQSPEVT-TAFDDRIEEI 623 Query: 388 GAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQ 447 F+ G + + + + ++ ++ P SL V L ++ + Sbjct: 624 LNFMMQAPGTEEDDRSAEAVQQRFSVVVDAAISGRLAHWESTPASL--VALVILLDQFPR 681 Query: 448 AESVASALQGVNTVVELGVK 467 + S + V Sbjct: 682 SIHANSKRMFAGDDMAKAVV 701 >gi|325116269|emb|CBZ51822.1| serine:pyruvate/alanine:glyoxylate aminotransferase [Neospora caninum Liverpool] Length = 371 Score = 38.5 bits (88), Expect = 3.3, Method: Composition-based stats. Identities = 28/134 (20%), Positives = 47/134 (35%), Gaps = 14/134 (10%) Query: 393 PLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVA 452 P Q E MI L G LP PP + SPL + Sbjct: 228 PSAVRHQ-ETCVKMIEDYFQALKDTG-LPTDAYGHRPPAEFRYFAFRSPLAQ-------P 278 Query: 453 SALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREV 512 S Q + + DP + M+ R +LW ++RDT + + +R + + Sbjct: 279 SHEQFFRQMCVVSADPDDPERVAEMNCVLTDREALWR-----SVLRDTKQAKKLRARLKR 333 Query: 513 QRRVMEEQHLQQQL 526 +V E + +++ Sbjct: 334 TAQVAETREQLRRV 347 >gi|240277638|gb|EER41146.1| conserved hypothetical protein [Ajellomyces capsulatus H143] Length = 537 Score = 37.7 bits (86), Expect = 5.6, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 52/133 (39%), Gaps = 4/133 (3%) Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 + P+ +V S + ++ S++S+ Q +E+ + + + + D Sbjct: 284 PIAAVELIPIETPRVSPASV--EAEELRSMSSSRQKRLLKMEIAKLKDEKAILAK-ELDE 340 Query: 482 VSRFSLWATNTPAVLIRDTAEVEDIRQQREV-QRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 T A L + EV + ++ QR V +E+ + +Q Q+ AA Sbjct: 341 ARTTIKEGGGTDAELEKTREEVRRLTKENASLQRTVQQERSQAEYTRQQYQNASTSAAQS 400 Query: 541 AMEKKLTHDMMEN 553 AME + + + N Sbjct: 401 AMELQQLEEELAN 413 >gi|148239654|ref|YP_001225041.1| glycyl-tRNA synthetase beta subunit [Synechococcus sp. WH 7803] gi|147848193|emb|CAK23744.1| Glycyl-tRNA synthetase beta subunit [Synechococcus sp. WH 7803] Length = 719 Score = 37.3 bits (85), Expect = 7.1, Method: Composition-based stats. Identities = 31/139 (22%), Positives = 56/139 (40%), Gaps = 27/139 (19%) Query: 359 SLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPL--IGGLQSEFIGAMISRELDIL 414 F +DL Q + + + E R++ + L G LQ + A++ R L Sbjct: 558 DGFAIDLVQAVCGEGVSTERLLEDPVDARDRLLLLKTLRESGRLQD--LQAVVQRA-SRL 614 Query: 415 DSQGNLPECEGA----------DNPPVS--LLKVEYTSPLFK-------YQQAESVASAL 455 +G+LP + + D+P + L+++E SPL + Q+ + A AL Sbjct: 615 AEKGDLPPSKLSVEGIVDAFLFDSPSEAALLVELEALSPLAQAKDYERLAQRLQGAARAL 674 Query: 456 Q-GVNTVVELGVKTGDPSC 473 + + + V DPS Sbjct: 675 EAFFDGSDSVMVMAEDPSV 693 >gi|218778476|ref|YP_002429794.1| hypothetical protein Dalk_0621 [Desulfatibacillum alkenivorans AK-01] gi|218759860|gb|ACL02326.1| protein of unknown function DUF323 [Desulfatibacillum alkenivorans AK-01] Length = 918 Score = 36.9 bits (84), Expect = 9.8, Method: Composition-based stats. Identities = 19/104 (18%), Positives = 36/104 (34%), Gaps = 3/104 (2%) Query: 442 LFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTA 501 L + S+++ Q + L D +D + D + + Sbjct: 121 LAQTAGNGSLSALDQLAQMMNTLKQILSDEDIIDSNNPDDALSQIHRLLRGISEKLGIDQ 180 Query: 502 EVEDIRQQREVQRRVMEEQHLQQ---QLQQTSQDIGAKAAGRAM 542 EVED R+ V++ E + + A+AAG+A+ Sbjct: 181 EVEDAREGVAVRQAEEGEDAELIASPEADGAGKGGDAEAAGKAL 224 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.308 0.137 0.378 Lambda K H 0.267 0.0425 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 9,916,318,831 Number of Sequences: 14124377 Number of extensions: 415648651 Number of successful extensions: 2096225 Number of sequences better than 10.0: 2040 Number of HSP's better than 10.0 without gapping: 1243 Number of HSP's successfully gapped in prelim test: 932 Number of HSP's that attempted gapping in prelim test: 2017655 Number of HSP's gapped (non-prelim): 50075 length of query: 556 length of database: 4,842,793,630 effective HSP length: 144 effective length of query: 412 effective length of database: 2,808,883,342 effective search space: 1157259936904 effective search space used: 1157259936904 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.1 bits) S2: 84 (36.9 bits)