BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Schaffer, Alejandro A.,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of
PSI-BLAST protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.

Query= gi|254781213|ref|YP_003065626.1| head-to-tail joining protein, putative [Candidatus Liberibacter asiaticus str. psy62]
         (556 letters)

Database: nr
           14,124,377 sequences; 4,842,793,630 total letters

Searching..................................................done

>gi|254781213|ref|YP_003065626.1| head-to-tail joining protein, putative [Candidatus Liberibacter asiaticus str. psy62]
 gi|254040890|gb|ACT57686.1| head-to-tail joining protein, putative [Candidatus Liberibacter asiaticus str. psy62]
 gi|317120678|gb|ADV02501.1| putative phage-related head-to-tail joining protein [Liberibacter phage SC1]
 gi|317120822|gb|ADV02643.1| putative phage-related head-to-tail joining protein [Candidatus Liberibacter asiaticus]
          Length = 556

 Score = 445 bits (1144), Expect = e-123, Method: Composition-based stats.
Identities = 556/556 (100%), Positives = 556/556 (100%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSL 60 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSL Sbjct: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSL 60 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG Sbjct: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT Sbjct: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS Sbjct: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 Query: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL Sbjct: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 Query: 301 HPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSL 360 HPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSL Sbjct: 301 HPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSL 360 Query: 361 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL 420 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL Sbjct: 361 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL 420 Query: 421 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD 480 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD Sbjct: 421 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD 480 Query: 481 RVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 RVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR Sbjct: 481 RVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 Query: 541 AMEKKLTHDMMENSYG 556 
AMEKKLTHDMMENSYG Sbjct: 541 AMEKKLTHDMMENSYG 556 >gi|226940462|ref|YP_002795536.1| Bbp21 [Laribacter hongkongensis HLHK9] gi|226715389|gb|ACO74527.1| Bbp21 [Laribacter hongkongensis HLHK9] Length = 555 Score = 406 bits (1044), Expect = e-111, Method: Composition-based stats. Identities = 122/553 (22%), Positives = 219/553 (39%), Gaps = 37/553 (6%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-----------AQLRMWDTTGSEACI 55 K + R+ LK +R E++ +L P ++D TG+ A Sbjct: 8 KRVSARWEALKKERSSWMSHWSEISDYLLPRSGRFFVEDRNKGNKRHKNIYDNTGTRALR 67 Query: 56 KLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSR 115 L++ + + +T P + W L S + S V+ W VT + ++ Sbjct: 68 VLAAGMMAGMTSPARPWFRLTTSD--------PQLDESAAVKAWLADVTRIMQMV--FAK 117 Query: 116 SGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSV 175 S L S Y + FGT + D + I + + ++ +++ V+++ Sbjct: 118 SNTYRALHSCYEELGAFGTAGTIVLPDFN-----GVIHHHVLTAGEFAIAADYRGQVNTL 172 Query: 176 YREFTFTVDQIVSKWGDKVLSSKMKSALAR-NENERFTIIHAVYPKSLTD-KKKDKGNKG 233 YREF TV Q+V ++G S+ ++ R +E T+IHA+ P++ ++D N Sbjct: 173 YREFQMTVGQMVGEFGLSACSATVQRLHERWCLDEWITVIHAIEPRTDRHKGRQDARNMA 232 Query: 234 FHSKFVSVD--ENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNE 291 + S + E + E FP + R+ +IYG SPAME+L I++L Sbjct: 233 WRSVYFEPGNREGQVLRESGFREFPALCPRWSTSGGDIYGNSPAMESLGDIKQLQHEQLR 292 Query: 292 LAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG-NPLPYHEEL 350 Q PP S + R+ D PG ++ + + G + ++ Sbjct: 293 KGQVIDYKTKPPLQVPSSMRARDIDTLPGGVSFVDAGTPNGGIRSAFEVGLDLSHLLADI 352 Query: 351 NRLKESIRSLFLLDLFQVLDDK--ASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 ++E I+ F DLF +L + +A E E+ EK +GP++ L +E + +I Sbjct: 353 QDVRERIKGSFYADLFLMLANGSNPQMTATEVAERHEEKLLMLGPVLERLHNEILDPLIE 412 Query: 409 RELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKT 468 + G +P L VE+ S L + Q+A + S + V + + Sbjct: 413 MTFSRMVEAGIVPPPPEELQG--VDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAG-- 468 Query: 469 GDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQ 528 P +D D DR + LI V IRQQR ++ ++ + Q Sbjct: 469 
IKPEVLDKFDADRWADAYADMLGIDPELIVPGDRVALIRQQRAQAQQAQQQAAMLQMGAD 528 Query: 529 TSQDIGAKAAGRA 541 +Q +G+ + Sbjct: 529 AAQKLGSVDTSQP 541 >gi|242279813|ref|YP_002991942.1| hypothetical protein Desal_2347 [Desulfovibrio salexigens DSM 2638] gi|242122707|gb|ACS80403.1| conserved hypothetical protein [Desulfovibrio salexigens DSM 2638] Length = 555 Score = 402 bits (1033), Expect = e-110, Method: Composition-based stats. Identities = 146/564 (25%), Positives = 238/564 (42%), Gaps = 43/564 (7%) Query: 8 DIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-----------AQLRMWDTTGSEACIK 56 R L+ +R ++++ ++ P K ++ D+T + A Sbjct: 8 QYLRRLQGLRQERNSWESHWQDISDYILPRKGVYDGHRPNDGRVRSGKIIDSTATRALRI 67 Query: 57 LSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRS 116 L++ L +T P + W L S ++ AR K VREW +V +T++ R +RS Sbjct: 68 LAAGLQGGLTSPARPWFRLGISD--------RDLARHKSVREWISKVENTMY--RALARS 117 Query: 117 GFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVY 176 F C+ S YT + FGTG Y E D E GIR+ ++ ++ + Q VD+VY Sbjct: 118 NFYSCIHSLYTELAGFGTGILYCEPD-----DERGIRFRTLTAGEYCLATDAQGRVDTVY 172 Query: 177 REFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK-KDKGNKGFH 235 REF T Q+ ++G + L + + S+L N + F ++H V P+ D D N F Sbjct: 173 REFKMTARQLEKRFGMQNLPATVHSSLNMNRDHWFDVLHVVQPRDEFDIALMDTMNMPFE 232 Query: 236 SKF-VSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQ 294 S F ++ E PY+ R+ A ++YGRSPAM+ L ++ L E Q Sbjct: 233 SVFLLNGHGGHVLSESGFMENPYMAPRWDTSAMDVYGRSPAMDVLADVKMLMEMSKSQIQ 292 Query: 295 FGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP--LPYHEELNR 352 L+L PP R +L PG N +++ P+ P ++ Sbjct: 293 AVHLTLRPPMKVP-SMYSRRLNLLPGGQNPVEQNQQ--DSVSPLYQVRPDLAGVSNKIQD 349 Query: 353 LKESIRSLFLLDLFQVLDDKASR--SAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRE 410 ++ +IR F D+F ++ R +AAE E+ EK +GP+I +E + +I R Sbjct: 350 VRTAIREGFYNDIFMMMAGTNRRTITAAEVAERHEEKLIQLGPVIERQHTELLDPLIDRV 409 Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGD 470 IL G LPE + +K++Y S L + Q+ S V L + Sbjct: 410 
FGILMRSGQLPEAPSVLEG--ADIKIDYISVLAQAQKMVGTQSIQSLAQFVGNLAKA--N 465 Query: 471 PSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTS 530 P +D +D DR P ++R EVE +R R + M + Q Q + Sbjct: 466 PEVLDKVDMDRAVDDYAELIGVPNGIVRSGDEVEKLRNMR----KDMLIKEQQLQQSLQA 521 Query: 531 QDIGAKAAGRAMEKKLTHDMMENS 554 +GA L ++M+ Sbjct: 522 ASMGAGIVKDLSYSGLNPELMQGM 545 >gi|187476929|ref|YP_784953.1| phage head-tail connector protein [Bordetella avium 197N] gi|115421515|emb|CAJ48024.1| Putative phage head-tail connector protein [Bordetella avium 197N] Length = 555 Score = 401 bits (1030), Expect = e-109, Method: Composition-based stats. Identities = 129/556 (23%), Positives = 232/556 (41%), Gaps = 37/556 (6%) Query: 3 QRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-----------AQLRMWDTTGS 51 Q K + R+ LK +R +E++ +L P + D TG+ Sbjct: 4 QTERKLLLSRWGQLKAERESWISHWKEISDYLLPRSGRFFINDRNRGGKRHNNILDNTGT 63 Query: 52 EACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFR 111 A L++ + + +T P + W L S E S V+ W VT + Sbjct: 64 RALRVLAAGMMAGMTSPARPWFRLTTS--------IPELDESAAVKAWLANVTRLMLMV- 114 Query: 112 ERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNV 171 ++S L S Y + FGT + D ++ IR+ ++ ++ ++Q Sbjct: 115 -FAKSNTYRALHSTYEELGLFGTASSIVLPDF-----KDVIRHHTLSAGEYAIAADNQGR 168 Query: 172 VDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNE-NERFTIIHAVYPKSLTDK-KKDK 229 VD++YREF TV Q+V ++G S+ +++ R + T+IHA+ P++ D K+D Sbjct: 169 VDTLYREFQITVAQMVREFGKDKCSTTVRNLFDRGALEQWVTVIHAIEPRADRDPNKRDD 228 Query: 230 GNKGFHSKFVS--VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNE 287 N + S +V DE R E +F + R+ + +IYG SPAMEAL +R+L Sbjct: 229 RNMAWKSVYVELGADETRTLRESGYRSFRALCPRWALAGGDIYGNSPAMEALGDVRQLQH 288 Query: 288 TVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG-NPLPY 346 AQ +PP AK ++ PG ++ ++ + + + Sbjct: 289 EQLRKAQGIDYKSNPPLQLPVSAKNQDISTVPGGLSYVDVAAPNGGIRTAFEVNLDLSHL 348 Query: 347 HEELNRLKESIRSLFLLDLFQVLDDKAS--RSAAESMEKTREKGAFVGPLIGGLQSEFIG 404 ++ ++E I++ F DLF +L + + +A E E+ EK +GP++ + +E + Sbjct: 349 
LADIVDVRERIKASFYADLFLMLANGTNPKMTATEVAERHEEKLLMLGPVLERMHNEILD 408 Query: 405 AMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVEL 464 +I + LP L VE+ S L + Q+A + S + V + + Sbjct: 409 PLIELTFQRMVEANILPPPPQEMQG--VDLNVEFVSMLAQAQRAIATNSVDRFVGNLGVV 466 Query: 465 GVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQ 524 P +D + DR + LI +V IR+QR Q++ ++ L Sbjct: 467 AK--IKPEVLDKFNADRWADTYADMLGIDPELIVPGNQVALIRKQRAEQQQAAQQAALLN 524 Query: 525 QLQQTSQDIGAKAAGR 540 Q T+ +G+ + Sbjct: 525 QGADTAAKLGSVDTSK 540 >gi|327252184|gb|EGE63856.1| bbp21 [Escherichia coli STEC_7v] Length = 559 Score = 399 bits (1026), Expect = e-109, Method: Composition-based stats. Identities = 131/559 (23%), Positives = 237/559 (42%), Gaps = 40/559 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGP--------VKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D D + IR + P+ + Y++ + + Sbjct: 113 M--FNKSNLYQSLPQLYGSLGTYSTGAMAVLEDDD-----DIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L +Q + +PP IA + K + L PG + + F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMIAPTSLKNQRASLLPGDITYIDQITG-QDGFRPAYLVNPS 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSA--AESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ 
L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQ 520 + +L P +D ++ D+ + +I +VE RQQR Q++ + Sbjct: 463 IGQLAQA--KPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMM 520 Query: 521 HLQQQLQQTSQDIGAKAAG 539 + Q ++ + Sbjct: 521 EMGMAAAQGAKTLSEAKTS 539 >gi|301019343|ref|ZP_07183529.1| conserved hypothetical protein [Escherichia coli MS 196-1] gi|299882260|gb|EFI90471.1| conserved hypothetical protein [Escherichia coli MS 196-1] Length = 559 Score = 398 bits (1022), Expect = e-108, Method: Composition-based stats. Identities = 130/559 (23%), Positives = 237/559 (42%), Gaps = 40/559 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEANRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGP--------VKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 M--FNKSNLYQSLPQLYGSLGTYSTGAMAVLED-----DEDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L +Q + +PP +A + K + L PG + + F+P NP Sbjct: 286 
LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQITG-QDGFRPAYLVNPS 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSA--AESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEG--IPLKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQ 520 + +L P +D ++ D+ + +I +VE RQQR Q++ + Sbjct: 463 IGQLAQA--KPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMM 520 Query: 521 HLQQQLQQTSQDIGAKAAG 539 + Q ++ + Sbjct: 521 AVGMAAAQGAKTLSEAKTS 539 >gi|117624712|ref|YP_853625.1| putative tail protein [Escherichia coli APEC O1] gi|115513836|gb|ABJ01911.1| putative tail protein [Escherichia coli APEC O1] Length = 559 Score = 398 bits (1022), Expect = e-108, Method: Composition-based stats. Identities = 122/523 (23%), Positives = 222/523 (42%), Gaps = 38/523 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGP--------VKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 M--FNKSNLYQSLPQLYGSLGTYSTGAMAVLED-----DEDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 
Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L +Q +PP +A + + ++ L PG + L Q Sbjct: 286 LQLLQKRKSQIIDKVTNPPMVAPTTLRTQSVSLLPGGVTYVDQLTGQEGLRPVYQVNPNT 345 Query: 344 LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQSE 401 ++ +++I S + +DLF +L + +RS +E EK +GP++ L E Sbjct: 346 ADLISDIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDE 405 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + +I R ++ + LP A LKVEY S + + Q++ ++S VN + Sbjct: 406 CLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNFI 463 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 +L G P +D ++ D+ + +I +VE Sbjct: 464 GQLA--QGKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|294492610|gb|ADE91366.1| conserved hypothetical protein [Escherichia coli IHE3034] gi|323948685|gb|EGB44590.1| hypothetical protein ERKG_04908 [Escherichia coli H252] Length = 559 Score = 396 bits (1018), Expect = e-108, Method: Composition-based stats. Identities = 123/524 (23%), Positives = 223/524 (42%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGP--------VKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 M--FNKSNLYQSLPQLYGSLGTYSTGAMAVLED-----DEDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 
DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L +Q + +PP +A + K + L PG + + F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQITG-QDGFRPAYLVNPS 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRSFSMMVRKNMLPPPPDVMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLAQA--KPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|300898427|ref|ZP_07116768.1| conserved hypothetical protein [Escherichia coli MS 198-1] gi|300357894|gb|EFJ73764.1| conserved hypothetical protein [Escherichia coli MS 198-1] Length = 559 Score = 396 bits (1017), Expect = e-108, Method: Composition-based stats. 
Identities = 122/524 (23%), Positives = 222/524 (42%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGP--------VKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 M--FNKSNLYQSLPQLYGSLGTYSTGAMAVLED-----DEDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVS--VDENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEFGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L +Q + +PP +A + K + L PG + + F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQITG-QDGFRPAYLVNPS 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDVMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLAQA--KPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|89152428|ref|YP_512261.1| putative head-to-tail-joining protein [Escherichia phage phiV10] gi|74055451|gb|AAZ95900.1| putative head-to-tail-joining protein [Escherichia phage phiV10] Length = 559 Score = 396 bits (1017), Expect = e-108, Method: 
Composition-based stats. Identities = 123/524 (23%), Positives = 223/524 (42%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGP--------VKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 M--FNKSNLYQSLPQLYGSLGTYSTGAMAVLDD-----DEDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L +Q + +PP +A + K + L PG + + F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQITG-QDGFRPAYLVNPS 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRSFSMMVRKNMLPPPPDVMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLAQV--KPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|323156133|gb|EFZ42292.1| bbp21 [Escherichia coli EPECa14] Length = 559 Score = 395 bits (1015), Expect = e-108, Method: Composition-based stats. 
Identities = 124/524 (23%), Positives = 224/524 (42%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGP--------VKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 M--FNKSNLYQSLPQLYGSLGTYSTGAMAVLED-----DEDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIDVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L +Q + +PP +A + K + L PG + + F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQITG-QDGFRPAYLVNPS 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLAQA--KPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|324008560|gb|EGB77779.1| hypothetical protein HMPREF9532_01747 [Escherichia coli MS 57-2] Length = 559 Score = 395 bits (1015), Expect = e-108, Method: Composition-based stats. 
Identities = 124/524 (23%), Positives = 224/524 (42%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGP--------VKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 M--FNKSNLYQSLPQLYGSLGTYSTGAMAVLED-----DEDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L +Q + +PP +A + K + L PG + + F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQITG-QDGFRPAYLVNPS 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRSFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLAQA--KPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|331648176|ref|ZP_08349266.1| conserved hypothetical protein [Escherichia coli M605] gi|331043036|gb|EGI15176.1| conserved hypothetical protein [Escherichia coli M605] Length = 559 Score = 395 bits (1014), Expect = e-107, Method: 
Composition-based stats. Identities = 124/524 (23%), Positives = 224/524 (42%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGP--------VKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 M--FNKSNLYQSLPQLYGSLGTYSTGAMAVLED-----DEDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L +Q + +PP +A + K + L PG + + F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQITG-QDGFRPAYLVNPS 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLAQA--KPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|298381718|ref|ZP_06991317.1| hypothetical protein ECFG_01455 [Escherichia coli FVEC1302] gi|298279160|gb|EFI20674.1| hypothetical protein ECFG_01455 [Escherichia coli FVEC1302] Length = 559 Score = 395 bits (1014), 
Expect = e-107, Method: Composition-based stats. Identities = 124/524 (23%), Positives = 223/524 (42%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGP--------VKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 + S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 M--FNNSNLYQSLPQLYGSLGTYSTGAMAVLED-----DEDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L +Q + +PP +A + K + L PG + + F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQITG-QDGFRPAYLVNPS 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLAQA--KPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|301046408|ref|ZP_07193568.1| conserved hypothetical protein [Escherichia coli MS 185-1] gi|300301634|gb|EFJ58019.1| conserved hypothetical protein [Escherichia coli MS 185-1] Length = 559 Score 
= 394 bits (1013), Expect = e-107, Method: Composition-based stats. Identities = 124/524 (23%), Positives = 224/524 (42%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEANRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGP--------VKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 M--FNKSNLYQSLPQLYGSLGTYSTGAMAVLED-----DEDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L +Q + +PP +A + K + L PG + + F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQITG-QDGFRPAYLVNPS 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLAQA--KPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|218700990|ref|YP_002408619.1| putative head-to-tail-joining protein [Escherichia coli IAI39] gi|218370976|emb|CAR18803.1| putative head-to-tail-joining protein [Escherichia 
coli IAI39] Length = 559 Score = 394 bits (1013), Expect = e-107, Method: Composition-based stats. Identities = 130/559 (23%), Positives = 236/559 (42%), Gaps = 40/559 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGP--------VKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 + S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 M--FNNSNLYQSLPQLYGSLGTYSTGAMAVLED-----DEDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L +Q + +PP +A + K + L PG + + F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQITG-QDGFRPAYLVNPS 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQ 520 + +L P +D ++ D+ + +I +VE RQQR Q++ + Sbjct: 463 IGQLAQA--KPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMM 520 Query: 521 HLQQQLQQTSQDIGAKAAG 539 + Q ++ + Sbjct: 521 AMGMVAAQGAKTLSEAKTS 539 
>gi|320175046|gb|EFW50159.1| putative tail protein [Shigella dysenteriae CDC 74-1112] Length = 559 Score = 394 bits (1011), Expect = e-107, Method: Composition-based stats. Identities = 124/524 (23%), Positives = 223/524 (42%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGP--------VKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 + S L Y S+ + TG + D E+ IR + P+ + Y++ + + Sbjct: 113 M--FNESNLYQSLPQLYGSLGTYSTGAMAVLED-----DEDIIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L +Q + +PP +A + K + L PG + + F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPTSLKNQRASLLPGDITYIDQITG-QDGFRPAYLVNPS 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLAQA--KPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|332344354|gb|AEE57688.1| conserved hypothetical protein [Escherichia 
coli UMNK88] Length = 559 Score = 392 bits (1007), Expect = e-107, Method: Composition-based stats. Identities = 123/524 (23%), Positives = 222/524 (42%), Gaps = 40/524 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M + + + + +F L+++R EL+ ++ P + R+ D+T Sbjct: 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDST 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ A L+S + S IT P + W LA + V+ W + V + + Sbjct: 61 GTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGP--------VKLWLEAVQNRMND 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ + TG + D E+ IR + + + Y++ + + Sbjct: 113 M--FNKSNLYQSLPQLYGSLGTYSTGAMAVLED-----DEDIIRTMPFTIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTDK-KK 227 VD+ +R+F+ TV Q+V ++G +S +KS E ++H+VYP D K Sbjct: 166 GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 Query: 228 DKGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + V D ++ E FP + R+ V +++YG S P M AL ++ Sbjct: 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L +Q + +PP +A K + L PG + + F+P NP Sbjct: 286 LQLLQKRKSQLIDKATNPPMVAPISLKNQRASLLPGDITYIDQITG-QDGFRPAYLVNPS 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQS 400 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L Sbjct: 345 TADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I R ++ + LP A LKVEY S + + Q++ ++S VN Sbjct: 405 ECLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + +L P +D ++ D+ + +I +VE Sbjct: 463 IGQLAQA--KPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|315122900|ref|YP_004063389.1| head-to-tail joining protein, putative [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496302|gb|ADR52901.1| 
head-to-tail joining protein, putative [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 555 Score = 391 bits (1004), Expect = e-106, Method: Composition-based stats. Identities = 396/555 (71%), Positives = 458/555 (82%), Gaps = 1/555 (0%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSL 60 MN S K I+ F +LK+QR ELN MEELT LYPYK + RMWDTTGSEACIKLSSL Sbjct: 1 MN-NSIKKIKTCFEHLKSQREELNTRMEELTSLLYPYKQEPKSRMWDTTGSEACIKLSSL 59 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 LSSLITPPGQKWHGL+E F +QAFLY+EDA +KK+R WCDQVTD LFGFRERSRSGFV Sbjct: 60 LSSLITPPGQKWHGLSEPFFRHQAFLYEEDAGAKKIRGWCDQVTDVLFGFRERSRSGFVS 119 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 CLQSFYTS+VEFGTGCFY+EADVDE GLEEGIRYI+VPL++VY+SVNHQN VDS+YR F Sbjct: 120 CLQSFYTSIVEFGTGCFYIEADVDETGLEEGIRYIAVPLADVYLSVNHQNEVDSIYRTFE 179 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 FT +QI KWG KVLS KMKS+ + E ++F IIHAVYPKSL +KKKDKGNK FHSKFV Sbjct: 180 FTAEQIGGKWGYKVLSDKMKSSYEKKEPDKFKIIHAVYPKSLAEKKKDKGNKNFHSKFVC 239 Query: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 +DEN FFEEKQI T PYI+GRYRVRADEIYG+SPAMEALP IRRLNE NELAQ+ RLSL Sbjct: 240 IDENVFFEEKQITTLPYIIGRYRVRADEIYGKSPAMEALPAIRRLNEISNELAQYARLSL 299 Query: 301 HPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSL 360 HP +A +EAKQ F +K ++N GA+S++G++LFQP+Q GNPLP++EEL R++ SI SL Sbjct: 300 HPAYLAPTEAKQLEFKIKSRHINTGAMSKDGKALFQPLQVGNPLPFYEELKRIQGSIHSL 359 Query: 361 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL 420 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI RELDILD+Q NL Sbjct: 360 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIKRELDILDAQHNL 419 Query: 421 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD 480 PE D+ P LLKVEYTSPLFKYQQAESVAS LQG NTV+ELG KTG+P MDH+D D Sbjct: 420 PELTDYDHSPFHLLKVEYTSPLFKYQQAESVASVLQGTNTVLELGAKTGNPEPMDHIDID 479 Query: 481 RVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 
+VSRF+LWA+ +PA LIRD EV+ R+ R+ Q M+ + QQ +Q + GAKA + Sbjct: 480 KVSRFALWASGSPAHLIRDVDEVKQRRKDRDDQMEAMQNRQDAQQQEQMGMEAGAKAVSK 539 Query: 541 AMEKKLTHDMMENSY 555 A+EKK+T+D+MENSY Sbjct: 540 AIEKKMTNDLMENSY 554 >gi|41179382|ref|NP_958690.1| Bbp21 [Bordetella phage BPP-1] gi|45569514|ref|NP_996583.1| hypothetical protein BMP-1p20 [Bordetella phage BMP-1] gi|45580765|ref|NP_996631.1| hypothetical protein BIP-1p20 [Bordetella phage BIP-1] gi|40950121|gb|AAR97687.1| Bbp21 [Bordetella phage BPP-1] Length = 555 Score = 389 bits (999), Expect = e-106, Method: Composition-based stats. Identities = 128/559 (22%), Positives = 230/559 (41%), Gaps = 38/559 (6%) Query: 1 MNQRS-AKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-----------AQLRMWDT 48 M +++ K + R+ L+ +R +E++ +L P + D Sbjct: 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDN 60 Query: 49 TGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLF 108 TG+ A L++ + + +T P + W L S E S V+ W VT + Sbjct: 61 TGTRALRVLAAGMMAGMTSPARPWFRLTTS--------IPELDESAAVKAWLANVTRLML 112 Query: 109 GFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNH 168 ++S L S Y + FGT + D D + + S+ ++ ++ Sbjct: 113 MI--FAKSNTYRALHSMYEELGAFGTASSIVLPDFDA-----VVYHHSLTAGEYAIAADN 165 Query: 169 QNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNE-NERFTIIHAVYPKSLTDK-K 226 Q V+++YREF TV Q+V ++G S+ ++S R + T+IHA+ P++ D K Sbjct: 166 QGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSK 225 Query: 227 KDKGNKGFHSKFVS--VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRR 284 +D N + S + DE R E +F + R+ + +IYG SPAMEAL +R+ Sbjct: 226 RDDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQ 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG-NP 343 L AQ +PP AK ++ PG ++ + + + + Sbjct: 286 LQHEQLRKAQAIDYKSNPPLQLPVSAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDL 345 Query: 344 LPYHEELNRLKESIRSLFLLDLFQVLDDK--ASRSAAESMEKTREKGAFVGPLIGGLQSE 401 ++ ++E I++ F DLF +L + +A E E+ EK +GP++ + +E Sbjct: 346 SHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNE 405 Query: 402 
FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + +I + LP L VE+ S L + Q+A + S + V + Sbjct: 406 ILDPLIELTFQRMVEANILPPPPQEMQG--VDLNVEFVSMLAQAQRAIATNSVDRFVGNL 463 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQH 521 + P +D D DR + LI +V IR+QR Q++ ++ Sbjct: 464 GAVAG--IKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAA 521 Query: 522 LQQQLQQTSQDIGAKAAGR 540 L Q T+ +G+ + Sbjct: 522 LLNQGADTAAKLGSVDTSK 540 >gi|315121938|ref|YP_004062427.1| head-to-tail joining protein, putative [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495340|gb|ADR51939.1| head-to-tail joining protein, putative [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 555 Score = 389 bits (998), Expect = e-106, Method: Composition-based stats. Identities = 399/555 (71%), Positives = 457/555 (82%), Gaps = 1/555 (0%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSL 60 MN S K I+ F +LK+QR ELN MEELT LYPYK + RMWDTTGSEACIKLSSL Sbjct: 1 MN-NSIKKIKTCFEHLKSQREELNTRMEELTSLLYPYKQEPKSRMWDTTGSEACIKLSSL 59 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 LSSLITPPGQKWHGL+E F +QAFLY+EDA +KK+R WCDQVTD LFGFRERSRSGFV Sbjct: 60 LSSLITPPGQKWHGLSEPFFRHQAFLYEEDAGAKKIRGWCDQVTDVLFGFRERSRSGFVS 119 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 CLQSFYTS+VEFGTGCFY+EADVDE GLEEGIRYI+VPL++VY+SVNHQN VDS+YR F Sbjct: 120 CLQSFYTSIVEFGTGCFYIEADVDETGLEEGIRYIAVPLADVYLSVNHQNEVDSIYRTFE 179 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 FT +QI KWG KVLS KMKS+ + E ++F IIHAVYPKSL +KKKDKGNK FHSKFV Sbjct: 180 FTAEQIGGKWGYKVLSDKMKSSYEKKEPDKFKIIHAVYPKSLAEKKKDKGNKNFHSKFVC 239 Query: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 +DEN FFEEKQI T PYI+GRYRVRADEIYG+SPAMEALP IRRLNE NELAQ+ RLSL Sbjct: 240 IDENVFFEEKQITTLPYIIGRYRVRADEIYGKSPAMEALPAIRRLNEISNELAQYARLSL 299 Query: 301 HPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSL 360 HP +A EAKQ F K YMNIGA+S++G++LFQP+Q 
GNPLP++EEL R++ SI SL Sbjct: 300 HPAYLAPPEAKQLEFKNKSRYMNIGAMSKDGKALFQPLQVGNPLPFYEELKRIQGSIHSL 359 Query: 361 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL 420 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI RELDILD+Q NL Sbjct: 360 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIKRELDILDAQHNL 419 Query: 421 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD 480 PE D+ P LLKVEYTSPLFKYQQAESVAS LQG NTV+ELG KTG+P MDH+D D Sbjct: 420 PELTDYDHSPFHLLKVEYTSPLFKYQQAESVASVLQGTNTVLELGAKTGNPEPMDHIDID 479 Query: 481 RVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 +VSRF+LWA+ +PA LIRD EV+ R+ R+ Q M+ + QQ +Q + GAKA + Sbjct: 480 KVSRFALWASGSPAHLIRDVDEVKQRRKDRDDQMEAMQNRQDAQQQEQMGMEAGAKAVSK 539 Query: 541 AMEKKLTHDMMENSY 555 A+EKK+T+D+MENSY Sbjct: 540 AIEKKMTNDLMENSY 554 >gi|215487822|ref|YP_002330253.1| predicted phage head-tail connector protein [Escherichia coli O127:H6 str. E2348/69] gi|215265894|emb|CAS10303.1| predicted phage head-tail connector protein [Escherichia coli O127:H6 str. E2348/69] Length = 556 Score = 386 bits (990), Expect = e-105, Method: Composition-based stats. 
Identities = 120/530 (22%), Positives = 214/530 (40%), Gaps = 38/530 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + LKN+R +L+ F+ P + ++ D T Sbjct: 1 MAETEKERLLKQLAQLKNERTSFESHWRDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPT 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 GS A LSS + S IT P + W LA + V+ W + V + Sbjct: 61 GSMAQRILSSGMMSGITSPARPWFKLATPDPDMMDYGP--------VKIWLEVVQRRMNE 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ FGTG + D ++ IR + P+ + Y++ + + Sbjct: 113 V--FNKSNLYQSLPVMYASLGTFGTGAMAVLED-----DQDVIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTD-KKK 227 VD+ R+F+ TV Q+V ++G +S+ +K E + H + P D K Sbjct: 166 GSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVKVNHCITPNVNRDSGKM 225 Query: 228 DKGNKGFHSKFVS--VDENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK + S + D ++ E FP + R+ V +++Y S P M AL ++ Sbjct: 226 DSKNKPYRSVYFESGGDSDKLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L AQ + +PP +A + K + L PG + + Sbjct: 286 LQVEQKRKAQLIDKATNPPMVAPTSLKNQRVSLLPGDVTYLDVLTGQDGFKPAYLVNPNT 345 Query: 344 LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQSE 401 ++ +++I S + +DLF +L +RS +E EK +GP++ L E Sbjct: 346 ADLLADIQDTRQTINSAYFVDLFMMLQKINTRSMPVEAVIEMKEEKLLMLGPVLERLNDE 405 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + +I R I+ + LPE L++EY S + + Q++ + S Q V + Sbjct: 406 ALNPLIDRVFSIMARKNMLPEPPDVLQGMP--LRIEYISVMAQAQKSIGLTSLSQTVGFI 463 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQRE 511 +L P +D +D D+ + +I +V+ IR++R Sbjct: 464 GQLAQ--FKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERA 511 >gi|30387383|ref|NP_848212.1| hypothetical protein epsilon15p04 [Enterobacteria phage epsilon15] gi|30266038|gb|AAO06067.1| 4 [Salmonella phage epsilon15] Length = 556 Score = 384 bits (987), Expect = e-104, Method: Composition-based stats. 
Identities = 120/530 (22%), Positives = 215/530 (40%), Gaps = 38/530 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDTT 49 M + + + + LKN+R +L+ F+ P + ++ D T Sbjct: 1 MAETEKERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPT 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 GS A LSS + S IT P + W LA + V+ W + V + Sbjct: 61 GSMAQRILSSGMMSGITSPARPWFKLATPDPDMMDYGP--------VKIWLEVVQRRMNE 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ FGTG + D ++ IR + P+ + Y++ + + Sbjct: 113 V--FNKSNLYQSLPVMYASLGTFGTGAMAVMED-----DQDVIRTMPFPIGSYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTD-KKK 227 VD+ R+F+ TV Q+V ++G +S+ +K E + H + P D K Sbjct: 166 GSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITPNVNRDSGKM 225 Query: 228 DKGNKGFHSKFVS--VDENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK + S + D ++ E FP + R+ V +++Y S P M AL ++ Sbjct: 226 DSKNKPYRSVYFESGGDSDKLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L AQ + +PP +A + K + L PG + + Sbjct: 286 LQVEQKRKAQLIDKATNPPMVAPTSLKNQRVSLLPGDVTYLDVISGQDGFKPAYLVNPNT 345 Query: 344 LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQSE 401 ++ +++I S + +DLF +L + +RS +E EK +GP++ L E Sbjct: 346 ADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDE 405 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + +I R I+ + LPE L++EY S + + Q++ + S Q V + Sbjct: 406 ALNPLIDRVFSIMARKNMLPEPPDVLQGMP--LRIEYISVMAQAQKSIGLTSLSQTVGFI 463 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQRE 511 +L P +D +D D+ + +I +V+ IR++R Sbjct: 464 GQLAQ--FKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERA 511 >gi|291336934|gb|ADD96462.1| hypothetical protein ALOHA_HF400048F7ctg1g11 [uncultured organism MedDCM-OCT-S09-C787] Length = 450 Score = 383 bits (982), Expect = e-104, Method: Composition-based stats. 
Identities = 103/464 (22%), Positives = 207/464 (44%), Gaps = 27/464 (5%) Query: 34 LYPYKNNAQLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARS 93 + ++D + ++ L++ L ++T P W L F + Sbjct: 11 TRSKGDKRTELIFDGSPLQSVELLAASLHGMLTNPSTPWFSLR--------FKQNDMENE 62 Query: 94 KKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIR 153 + +EW + T+ ++ ++S F + Y ++ FGT ++E D E+ ++ Sbjct: 63 DEAKEWLEDATEVMYS--AFNKSNFQQEIFELYHDLITFGTAAMFIEED-----DEDILK 115 Query: 154 YISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTI 213 + + ++ ++++ N + +D+V+R+F+ + ++ K+GD +S + + ++ E I Sbjct: 116 FSTRHINEIFIAENDKGRIDTVFRKFSLSARAVMQKFGD--VSINIATKAKKDPYEEVEI 173 Query: 214 IHAVYPKSLTDK-KKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGR 272 +HAVYP+S D K+DK N F S ++ + FP++V RY + EIYGR Sbjct: 174 MHAVYPRSDFDPRKQDKENMPFESVYLDAESGDELSVSGFREFPFVVPRYLKASHEIYGR 233 Query: 273 SPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGR 332 SPAM ALP ++ LNE + + + PP + + PG +N R Sbjct: 234 SPAMTALPDVKMLNEMSKTTIKSAQKQVDPPLLVPDDGFMLPVRTIPGGLNFYR--AGTR 291 Query: 333 SLFQPVQFGNPLPYHEEL-NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFV 391 + + G P + + + SIR+ F ++ ++ +A E +++ EK + Sbjct: 292 DRIETLNIGANTPLGLNMEEQRRNSIRNAFYVNQL-MMQSGPQMTATEVIQRNEEKMRLL 350 Query: 392 GPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESV 451 GP++G LQSE + +I R ++ + +++EY SPL K Q++ + Sbjct: 351 GPVLGRLQSELLKPLIDRTFALILRKNLFRPAPEFLAGQD--IEIEYVSPLAKAQKSTEL 408 Query: 452 ASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAV 495 +S ++ + + L DH++ D++ R P Sbjct: 409 SSIMRAIEILGSLSNVA---PVFDHINMDKLVRHLADIVGVPQK 449 >gi|317152045|ref|YP_004120093.1| Bacteriophage head-to-tail connecting protein [Desulfovibrio aespoeensis Aspo-2] gi|316942296|gb|ADU61347.1| Bacteriophage head-to-tail connecting protein [Desulfovibrio aespoeensis Aspo-2] Length = 603 Score = 381 bits (978), Expect = e-103, Method: Composition-based stats. 
Identities = 130/523 (24%), Positives = 210/523 (40%), Gaps = 33/523 (6%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-------------AQLRMWDT 48 + A+ +Q RF L+ R EL+ ++ P KN+ R++D+ Sbjct: 3 AKELARSLQTRFKGLEEARQPWLAAWRELSDYMLPRKNSFTGIDPGSTRGRSGDERIFDS 62 Query: 49 TGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLF 108 T S A L+S L L+T P W + + VR + Q + + Sbjct: 63 TPSHALELLASSLGGLLTNPAMPWFDIRARD--------PDQGDGAGVRTFLQQARERMI 114 Query: 109 GFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNH 168 +GF + Y V GT Y+EAD D +R+ + PL VY + + Sbjct: 115 ALFNTEDTGFQTNVHELYLDVALLGTAVMYVEADPD-----TVVRFCTRPLGEVYAAESA 169 Query: 169 QNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK-K 227 + VDSVYR +T + Q +WG S + + ++ I+HAV+P++ D Sbjct: 170 RGAVDSVYRRYTLSARQTAREWG-AACSGETRRKAEERPDDTVEILHAVFPRTDRDPYGV 228 Query: 228 DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNE 287 + F S +V EE PY+V R+ A E YGR P AL R LN Sbjct: 229 GAAHFPFASVYVETGAEHVLEESGYLEMPYLVPRWAKAAGETYGRGPGQTALSDTRVLNA 288 Query: 288 TVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYH 347 PP + + PG ++ R PV + Sbjct: 289 MARTALMAAEKMSDPPLMVPDDGFLGPVHSGPGGLSYYRAGSPDRIEPLPVNV-DLAATE 347 Query: 348 EELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 + + +ESIR +FL D + + +A E++ + EK +GP++G LQ+EF+ +I Sbjct: 348 TMMQQRRESIRRIFLGDQLTP--EGPAVTATEALIRQSEKMRVLGPVLGRLQAEFLSPLI 405 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 R I+ G LP P ++V YTSP+ + Q+ + + + L Sbjct: 406 RRVFRIMLRAGALPPFPQGFGPDD--IEVRYTSPVARAQKEFEARGLSRTMEYLAPLVGA 463 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQR 510 + MD+ DTDR +R TP+ +R +V + R + Sbjct: 464 SDPFGIMDNFDTDRAARHVAELFGTPSDYLRPEKDVAETRAAK 506 >gi|304398403|ref|ZP_07380277.1| phage head-tail connector protein [Pantoea sp. aB] gi|304354269|gb|EFM18642.1| phage head-tail connector protein [Pantoea sp. aB] Length = 553 Score = 380 bits (976), Expect = e-103, Method: Composition-based stats. 
Identities = 136/573 (23%), Positives = 236/573 (41%), Gaps = 41/573 (7%) Query: 1 MNQRSAKD-IQDRFNYLKNQRGELNYWMEELTGFLYPY-----------KNNAQLRMWDT 48 M + + K + + LK++R + +L+ ++ P N + D Sbjct: 1 MAEETLKQRLNKQLGLLKSERTTFDPHWRDLSDYISPRSSRFLVSDANRDNRRNTNIVDP 60 Query: 49 TGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLF 108 T + A LSS + S IT P + W L+ S A + + V+ W + V + Sbjct: 61 TCTLAERTLSSGMMSGITSPARPWFTLSVSDPAMKDYGP--------VKVWLEDVQRRMN 112 Query: 109 GFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNH 168 ++S L Y + +GT + D E+ IR P+ + Y+S + Sbjct: 113 EV--FNKSNLYQSLPIVYAQLGTYGTAAMAILED-----DEDIIRTYPFPIGSYYVSNSA 165 Query: 169 QNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLT-DKK 226 + VD+VYREF T Q+V ++G +S +K A E +IHAVYP K Sbjct: 166 RLSVDTVYREFRMTTRQLVEQFGLDNVSETVKGQWATQNTESWHDVIHAVYPNVSRQTGK 225 Query: 227 KDKGNKGFHSKFVSV-DENRFFEEKQIATFPYIVGRYRVRADEIYGR-SPAMEALPTIRR 284 D NK + S + +++ E FP + R+ V ++ YG P M AL ++ Sbjct: 226 MDAKNKRYKSVYFEKAGDDKVLRESGFDEFPILAPRWEVNGEDAYGSNCPGMTALGQVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L +Q + +PP + S K + PG + G+ +P+ NP Sbjct: 286 LQLEQKRKSQLIDKATNPPMVGPSSLKTQRVSQLPGAVTYVD-QLTGQDGLKPLYMVNPN 344 Query: 344 -LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSA--AESMEKTREKGAFVGPLIGGLQS 400 ++ ++ IRS + +DLF +L + +RS E EK +GP++ L Sbjct: 345 TADLLNDIQDTRDIIRSAYFVDLFLMLQNINTRSMPVEAVNELREEKLLMLGPVLERLND 404 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 EF+ +I R I+ +G LP + L++EY S + + Q++ V S + V Sbjct: 405 EFLDPLIDRAFAIMQRKGMLPPAPEVLQG--TALRIEYISVMAQAQKSIGVNSMERFVGF 462 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQ 520 V + P +D +D D++ + +I EV+ IRQQR Q + ++ Sbjct: 463 VGGMAQA--KPEALDKLDIDKIIDSYGDSIGVSPSVIVPDEEVQKIRQQRAEQIQQQQQM 520 Query: 521 HLQQQLQQTSQDIGAK-AAGRAMEKKLTHDMME 552 + Q +++D+ G L M + Sbjct: 521 QMAQAAVASAKDLSQANLEGPNALSALAGGMQQ 553 >gi|309702812|emb|CBJ02143.1| putative phage protein [Escherichia coli ETEC H10407] Length = 
559 Score = 379 bits (974), Expect = e-103, Method: Composition-based stats. Identities = 116/523 (22%), Positives = 209/523 (39%), Gaps = 38/523 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYP-----------YKNNAQLRMWDTT 49 M + + + + LK++R +L+ F+ P + ++ D T Sbjct: 1 MAETEKERLLKQLAQLKSERTSFESHWRDLSDFINPRGSRFLTSDVNRDDRRNTKIIDPT 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 GS A LSS + S IT P + W LA + V+ W + V + Sbjct: 61 GSMAQRILSSGMMSGITSPARPWFKLATPDPDMMDYGP--------VKVWLEVVQRRMNE 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 ++S L Y S+ FGT + D ++ IR + P+ Y++ + + Sbjct: 113 V--FNKSNLYQSLPVMYASLGTFGTAAMAVLED-----DQDVIRTMPFPIGCYYLANSPR 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTD-KKK 227 VD+ +R+F+ TV Q+V ++G +SS ++ E + H + P D K Sbjct: 166 GSVDTSFRQFSMTVRQLVQEFGLDNVSSSVQGMWQNGTYETWIEVNHCITPNVNRDTGKM 225 Query: 228 DKGNKGFHSKFVSVDE--NRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 D NK F S + ++ E FP + R+ V +++Y S P M AL ++ Sbjct: 226 DSKNKPFRSVYFETGGDADKLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP- 343 L AQ + +PP +A + K + L PG + + Sbjct: 286 LQVEQKRKAQLIDKATNPPMVAPTSLKTQRVSLLPGDVTYLDVLSGQDGFKPAYLVNPNT 345 Query: 344 LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQSE 401 ++ +++I S + +DLF +L + +RS +E EK +GP++ L E Sbjct: 346 ADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDE 405 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + +I R ++ + LP A LKVEY S + + Q++ ++S VN + Sbjct: 406 CLNPLIDRAFSMMVRKNMLPPPPDAMEGMP--LKVEYISVMAQAQKSIGLSSLASTVNFI 463 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 +L P +D ++ D+ + +I +VE Sbjct: 464 GQLAQA--KPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVE 504 >gi|262043566|ref|ZP_06016679.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. 
rhinoscleromatis ATCC 13884] gi|259039100|gb|EEW40258.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 560 Score = 376 bits (966), Expect = e-102, Method: Composition-based stats. Identities = 115/522 (22%), Positives = 206/522 (39%), Gaps = 38/522 (7%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYP-----------YKNNAQLRMWDTTG 50 + + +Q + L N R + EL+ F+ P + ++ D T Sbjct: 3 AETLKEQLQKQQAQLTNDRSSFDPHWRELSDFINPRGSRFLVTDVNRDDRRNTKIVDPTA 62 Query: 51 SEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 + A LSS + S IT P + W LA + V+ W + V + Sbjct: 63 TLAARTLSSGMMSGITSPARPWFKLATPDPDMMDYGP--------VKLWLEVVQRRMNEV 114 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQN 170 ++S L Y S+ + TG + D + IR + P+ + YM+ + + Sbjct: 115 --FNKSNIYQSLPLLYASLGNYSTGAMAVLEDDS-----DVIRTMMFPIGSYYMANSARG 167 Query: 171 VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTDK-KKD 228 VD+ +R+F+ TV Q+V ++G +S +K E +IHAVYP D K + Sbjct: 168 SVDTCFRKFSMTVRQLVMEFGLNNVSDSVKGMWDSGNYESWIEVIHAVYPNIDRDTAKLN 227 Query: 229 KGNKGFHSKFVSV--DENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRRL 285 NK S + V D ++ E FP + R+ V +++YG S P M AL ++ L Sbjct: 228 SKNKPVKSVYYEVGGDSDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMIALGQVKAL 287 Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP-L 344 +Q + +PP + S + + L PG + Sbjct: 288 QLEQKRKSQLIDKATNPPMVGPSSLRNQRVSLLPGDITYIDQVTGQDGFKPAYLVNPNTA 347 Query: 345 PYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQSEF 402 ++ ++ I S + +DLF +L + +RS +E EK +GP++ L E Sbjct: 348 DLLADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEC 407 Query: 403 IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 + +I R I+ + LP L++EY S + + Q++ ++S V + Sbjct: 408 LNPLIDRTFSIMARKNLLPPPPDVLQGMP--LRIEYISVMAQAQKSIGLSSLSSTVGFIG 465 Query: 463 ELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 +L P +D ++ D+ + +I +VE Sbjct: 466 QLAQA--KPEALDKLNVDQAIDAFAEMSGVSPTVIVPQEQVE 505 >gi|310005679|gb|ADP00067.1| head-tail 
connector protein [Cyanophage 9515-10a] Length = 534 Score = 373 bits (958), Expect = e-101, Method: Composition-based stats. Identities = 84/561 (14%), Positives = 168/561 (29%), Gaps = 57/561 (10%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYP---YKNNA------QLRMWDTTGSEACIK 56 K+ + R+N L R + E P +N+ W + G++ + Sbjct: 1 MKNARQRYNKLSTDREQFLNVAYECAELTIPTLLMRNDKPPAYAQFKTPWQSVGAKGVVT 60 Query: 57 LSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRS 116 L+S L + PP + L S + E ++ ++ + + S Sbjct: 61 LASKLMLGLLPPSTSFFKLQLDDSKLGIEIPPEAKS--EMDLSFAKIERQIMD--AIAAS 116 Query: 117 GFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVY 176 + S +V G YM PL+ + + V + Sbjct: 117 TDRVQIFSAIKHLVVTGNALLYMGKQG----------MKMYPLNRYVVERDGNGDVIEIV 166 Query: 177 REFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHS 236 + + D I + D + N ++ + V K +H Sbjct: 167 TKEKVSRDLIPIELNDD----SVVDDDTNNADKDVDVYTCV--------KLGAKGWYWHQ 214 Query: 237 KFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFG 296 + + + P++ R+ E YGRS E L ++ L + L + Sbjct: 215 EVHDILIPGSEGKAPKDKNPFLPLRFVTVDGEDYGRSRVEEFLGDLKSLEALMQALVEGS 274 Query: 297 RLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKES 356 + + + L GA+ + +Q G + + Sbjct: 275 AAAAKVVFTVSPSSVTKPGTLANAG--NGAIIQGRPDDIGVIQVGKTADFRTAFELVNTL 332 Query: 357 IRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDS 416 + L L + +A E E +G L L +EF+ ++R++ L Sbjct: 333 EKRLSEAFLILNVRQSERTTAEEVRMTQMELEQQLGGLFSLLTTEFLIPYLNRKMHSLTL 392 Query: 417 QGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDH 476 +P+ P V + L + Q +++ V V + G + + Sbjct: 393 AKKIPKIPKNVVNPTI---VAGINALGRGQDRDAL------VQFVTTIAQTMGPEALAQY 443 Query: 477 MDTDRVSRFSLWATNTPA-VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGA 535 ++ D + A L++ E++ +QQ + Q Q G Sbjct: 444 INPDEAIKRLAAAQGIDVLNLVKSMEELDAQKQQAQQ----------QAMQQNLMGQAGQ 493 Query: 536 KAAGRAMEKKLTHDMMENSYG 556 A M+ ++ME G Sbjct: 494 LAGAPLMDPSKNPEVMEALPG 514 >gi|61806424|ref|YP_214201.1| T7-like head-to-tail connector [Prochlorococcus 
phage P-SSP7] gi|61374349|gb|AAX44203.1| T7-like head-to-tail connector [Prochlorococcus phage P-SSP7] gi|265525461|gb|ACY76227.1| head-tail connector protein [Prochlorococcus phage P-SSP7] Length = 522 Score = 373 bits (958), Expect = e-101, Method: Composition-based stats. Identities = 74/557 (13%), Positives = 168/557 (30%), Gaps = 54/557 (9%) Query: 8 DIQDRFNYLKNQRGELNYWMEELTGFLYPY--KNNAQLRM--------WDTTGSEACIKL 57 ++R+N L R E + PY ++ R W + G++ C+ L Sbjct: 2 KARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLTVPWQSVGAKCCVTL 61 Query: 58 SSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSG 117 ++ L + PP + L L + ++ ++ + + + S Sbjct: 62 AAKLMLAVLPPQTSFFKLQVRDDKLGEELDPQIRS--ELDLSFSKMERMIMDY--IAASN 117 Query: 118 FVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYR 177 + ++ G +M D + PL+ ++ + V + Sbjct: 118 DRVAVHQALKHLIVGGNALIFMGKDG----------LKTFPLTRYVINRDGDGNVLEIVT 167 Query: 178 EFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSK 237 + + + + + ++ + + N++ V + K G +H + Sbjct: 168 KELISRKVLDIELPEPKPNTGIDESSTTNDD--------VTIYTYVKLDKSSGRWVWHQE 219 Query: 238 FVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGR 297 P++ R+ E YGR E L ++ L+ L + Sbjct: 220 AFDKIIPDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAA 279 Query: 298 LSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESI 357 + + + + + GA+ + +Q G + N Sbjct: 280 AASKVVFLVSPSSTTKPATIAKAG--NGAIVQGRPEDVAVIQVGKTADFSTAANMATAIE 337 Query: 358 RSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQ 417 + L L + + +A E E +G + L EF+ ++R L +L Sbjct: 338 KRLLEAFLVMNVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRS 397 Query: 418 GNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHM 477 +P+ P V + L + Q ES+ + V + G + M ++ Sbjct: 398 NQIPKLPKDIVRPTI---VAGVNALGRGQDRESLTA------FVGTIAQTLGPEALMQYL 448 Query: 478 DTDRVSRFSLWATNTPA-VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAK 536 + + A L++ ++ + +Q + Q Q G Sbjct: 449 NPLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQ----------QAAQQSLVDQAGQM 498 Query: 537 
AAGRAMEKKLTHDMMEN 553 M+ +M+ Sbjct: 499 TGSPLMDPTKNPQLMDE 515 >gi|310005857|gb|ADP00242.1| head-tail connector protein [Cyanophage Syn26] Length = 521 Score = 373 bits (957), Expect = e-101, Method: Composition-based stats. Identities = 75/554 (13%), Positives = 169/554 (30%), Gaps = 54/554 (9%) Query: 11 DRFNYLKNQRGELNYWMEELTGFLYPY--KNNAQLRM--------WDTTGSEACIKLSSL 60 +++N L + R + + + PY ++ R W + G++ + L++ Sbjct: 5 EKYNQLSSARRQFLDKAVQCSELTLPYLIDDDISSRPNHKSLAVPWQSVGAKCVVTLAAK 64 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 L + PP + L L + ++ ++ + + + S Sbjct: 65 LMLAVLPPQTSFFKLQVRDDKLGQELDPQIRS--ELDLSFAKMERMIMEY--IAASNDRV 120 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 + ++ G YM D + PL+ + + V + + Sbjct: 121 AIHQALKHLIVGGNALIYMHKDG----------LKTFPLTRYVVERDGDGNVLCIVTKEL 170 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 + + + + +S + +E ++ V ++ KD G +H + Sbjct: 171 ISRKVLDIELPEPEPNSVV--------DESHSVADDVTIYTMVKLDKDSGRWVWHQEAFD 222 Query: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 P++ R+ E YGR E L ++ L+ L + + Sbjct: 223 KIIPDTRSTAPKKASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQALIEGAAAAS 282 Query: 301 HPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSL 360 + + + + GA+ + +Q G + N + + + Sbjct: 283 KVIFLVSPSSTTKPATIAKAG--NGAIVQGRPEDVAVIQVGKTADFATAANMAQGIEKRM 340 Query: 361 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL 420 L + + +A E E +G + L EF+ ++R L +L + Sbjct: 341 LEAFLVMNVRNAERVTAEEVRLTQLELEQQLGGIFSLLTVEFLIPYLNRTLLVLQRSNQI 400 Query: 421 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD 480 P+ P V + L + Q ES+ + + G + M +++ Sbjct: 401 PKLPKDIVRPTI---VAGVNALGRGQDRESLT------QFIGTIAQTLGPEALMQYINPQ 451 Query: 481 RVSRFSLWATNTPA-VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAG 539 + A L++ ++ + Q + QQ Q G A Sbjct: 452 EAIKRLAAAQGIDVLNLVKTEQQMAEEMQAAQA----------QQTQQSLVDQAGQLAGT 501 Query: 540 RAMEKKLTHDMMEN 553 M+ MM Sbjct: 502 
PLMDPSKNPQMMPE 515 >gi|291335391|gb|ADD95005.1| head tail connector protein [uncultured phage MedDCM-OCT-S04-C24] Length = 526 Score = 373 bits (957), Expect = e-101, Method: Composition-based stats. Identities = 74/503 (14%), Positives = 155/503 (30%), Gaps = 46/503 (9%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQL----------RMWDTTGSEACIKLSS 59 + R++ L + R + + + PY W +TG++ + L+S Sbjct: 4 KQRYDRLSSSRSQFLNAARQASELTIPYLIREDEHTTKGALKLTTPWQSTGAKGVVTLAS 63 Query: 60 LLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFV 119 L + PP + L + L E ++ ++ T+ + SG Sbjct: 64 KLMLALLPPQTSFFKLQVNDVNLPDELGPEIRS--ELDLSFAKIERTVME--SIAESGDR 119 Query: 120 GCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREF 179 + +V G +M D PL+ + + V + + Sbjct: 120 VVVHQALKHLVVAGNALIFMSKDG----------LKLYPLNRYVVDRDGNGNVIEIVTKE 169 Query: 180 TFTVDQIVSKWGD--KVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSK 237 T + I + + + +E H K D +H + Sbjct: 170 TISKKLIKKFYPEYEDKAQDSVVDDGHIPNDECVIYTHV---------KLDNNRWVWHQE 220 Query: 238 FVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGR 297 + + P++V R+ E+YGR E L ++ L + + Sbjct: 221 LEGKILPKSMGKAPFDANPWLVLRFNHVDGEVYGRGRVEEFLGDLKSLEALSQAIVEGSA 280 Query: 298 LSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESI 357 + + + L GA+ + VQ G + + Sbjct: 281 AAAKVVFTVSPSSTTKPQTLAKAG--NGAIIQGRPEDIGVVQVGKTADFSTAYQMIGSLT 338 Query: 358 RSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQ 417 + L L + D +A E E +G L L EF+ ++R+L++ Sbjct: 339 QRLNEAFLILNVRDSERTTAEEVRMTQLELEQQLGGLFSLLTVEFLVPYLNRKLNVAQKT 398 Query: 418 GNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHM 477 G++P ++ V + L + Q ES+A + + G + ++ Sbjct: 399 GDIPRLPQGGIVRPTI--VAGINALGRGQDRESLA------QFLTVIAQTMGPDAIAQYI 450 Query: 478 DTDRVSRFSLWATNTPA-VLIRD 499 + D V + ++ L++ Sbjct: 451 NPDEVIKRLAASSGIDVLNLVKS 473 >gi|291334411|gb|ADD94066.1| hypothetical protein ALOHA_HF400048F7ctg1g11 [uncultured phage MedDCM-OCT-S04-C1035] Length = 467 Score = 372 bits (954), Expect = e-100, Method: 
Composition-based stats. Identities = 111/484 (22%), Positives = 217/484 (44%), Gaps = 27/484 (5%) Query: 64 LITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQ 123 ++T P W L F ++ + + W + T+ ++ ++S F + Sbjct: 1 MLTNPSTPWFSL--------KFKNEDMEGEDEAKLWLESATEVMYS--AFNQSNFQQEIF 50 Query: 124 SFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTV 183 Y ++ FGT ++E D ++ +++ + ++ +Y+S N + +D+V+R+F + Sbjct: 51 ELYHDLITFGTAAMFIEEDDEDN-----LKFSTRHINEIYISENEKGRIDTVFRKFRISA 105 Query: 184 DQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK-DKGNKGFHSKFVSVD 242 + K+G +S+ + ++ E I+HAVYP+ + KK D N F S ++ D Sbjct: 106 RAAIRKFG--NVSNNIAVIAKKDPYEEVEILHAVYPRDDYNPKKQDTENMQFESIYLDAD 163 Query: 243 ENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 FP++V RY + EIYGRSPAM ALP ++ LNE + + + + P Sbjct: 164 SGEELSVSGFREFPFVVPRYLKASHEIYGRSPAMTALPDVKMLNEMSKTIIKSAQKQVDP 223 Query: 303 PTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEEL-NRLKESIRSLF 361 P + + PG +N R +P+ G + + + SIR+ F Sbjct: 224 PLLVPDDGFLLPVRTVPGGLNFYR--AGTRDRIEPLNIGANNTLGLNMEEQRRNSIRNAF 281 Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 ++ ++ D +A E +++ EK +GP++G LQSE + +I R IL + Sbjct: 282 YVNQL-MMQDGPQMTATEVIQRNEEKMRLLGPVLGRLQSELLKPLIDRSFAILMRRNLFA 340 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 + + +++EY SPL K Q++ ++S ++ + + L DH++ D+ Sbjct: 341 QPPEFLSGQD--IEIEYVSPLAKAQKSTELSSIMRAIEIMGSLSNVA---PVFDHINMDK 395 Query: 482 VSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRA 541 + R P +++ +E+ RQ + Q+ M++ QQL + + A Sbjct: 396 LVRHLTNIVGVPQKILKPQSELNAERQAQAQQQEQMQQMQQVQQLAEAGGKVAPLAKALP 455 Query: 542 MEKK 545 E + Sbjct: 456 EEAQ 459 >gi|212710818|ref|ZP_03318946.1| hypothetical protein PROVALCAL_01886 [Providencia alcalifaciens DSM 30120] gi|212686515|gb|EEB46043.1| hypothetical protein PROVALCAL_01886 [Providencia alcalifaciens DSM 30120] Length = 550 Score = 371 bits (953), Expect = e-100, Method: Composition-based stats. 
Identities = 123/521 (23%), Positives = 212/521 (40%), Gaps = 38/521 (7%) Query: 5 SAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTTGSEA 53 +D+ + + LKN+R +EL + P + ++ D +++ Sbjct: 3 LKQDLLKQLSQLKNERQSFEPHWKELAEYTRPRSTRFSTSEVNRGDRRNTKIIDQEAAKS 62 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 LSS + S IT P +KW LA + V+ W + V + Sbjct: 63 ERTLSSGMMSGITSPARKWFRLATPDPDMMNYSP--------VKMWLEVVEQRMNEV--F 112 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 +RS L Y+ + F T + D E IR + P+ + Y++ VD Sbjct: 113 NRSNIYQSLPQTYSDIGTFATSALAVLEDN-----ERVIRTVPFPIGSYYIANGPDLTVD 167 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALARNEN-ERFTIIHAVYPKSLT-DKKKDKGN 231 + +REF+ TV Q+V ++G +S ++KS + T+IH+VYP K D N Sbjct: 168 TCFREFSMTVRQLVMEFGLDNVSEQVKSMWDSGNYSQWITVIHSVYPNLNRISGKLDAKN 227 Query: 232 KGFHSKFVS--VDENRFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRRLNET 288 K F S + D +R E FP + R+ V +++YG S P M AL +++ L Sbjct: 228 KLFKSVYFEIGGDSDRVLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMIALGSVKALQLL 287 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQF-GNPLPYH 347 AQ +PP A + K + L PG + ++ + + Q + Sbjct: 288 QRRKAQQIDKVTNPPMQAPASIKNQRISLVPGGITYLPMAGADQMIKPIFQVQADINGLI 347 Query: 348 EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQSEFIGA 405 ++ + I+ + DLF +L + +RS +E EK +GP++ L SE + Sbjct: 348 ADIGDTRNQIKEAYFSDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLQRLDSELLDK 407 Query: 406 MISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELG 465 +I+R I+ + LP LKVEY S + + Q++ V S + V V L Sbjct: 408 LINRTFAIMARKNLLPVPPEEMQGMQ--LKVEYISVMAQAQKSVGVNSVERFVGFVGGLA 465 Query: 466 VKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDI 506 P +D ++TD + + ++ +V I Sbjct: 466 KL--KPEALDKLNTDEIIDNYAESIGISPTIVSSNDQVAAI 504 >gi|167041083|gb|ABZ05844.1| hypothetical protein ALOHA_HF400048F7ctg1g11 [uncultured marine microorganism HF4000_48F7] Length = 552 Score = 370 bits (949), Expect = e-100, Method: Composition-based stats. 
Identities = 114/516 (22%), Positives = 217/516 (42%), Gaps = 36/516 (6%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-----------AQLRMWDTTG 50 A + + LK++RG +++ + P + + + R++++T Sbjct: 1 MSSDAATLVQEYEALKSERGNWENMWQDIAELMIPRRADFTNRYRAPGEQRRDRIYESTA 60 Query: 51 SEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 A ++ +S L + +T W L +E ++++V+ W + T Sbjct: 61 VRALVRGASGLHNTLTSSTVPWFALETED--------RELMKNRQVQLWLEDATRRCNSV 112 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQN 170 RS F +Y ++ FGTGC Y+ + G + S L + Y++ Sbjct: 113 FNAPRSMFHQSAHEYYLDLLAFGTGCMYVTQEPGM-----GPVFKSYFLGHTYIAEGKTG 167 Query: 171 VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKG 230 ++DSVYR F T + ++G+K L ++ A + RF ++H V P+S + Sbjct: 168 MIDSVYRRFDDTARSLYKQFGNK-LPDEIVKAADKEPFRRFELLHIVRPRSNAPGGRTSK 226 Query: 231 NKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVN 290 K F S +V + + +E PYIV R++ + E+YGR P +EALP +R +NE Sbjct: 227 QKPFLSVYVHAESRKVVQEGGFDEMPYIVSRWQKNSMEVYGRGPGIEALPDVRMVNEMER 286 Query: 291 ELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEE- 349 + + PP + + PG +N + P+Q G + +E Sbjct: 287 VGLIALQKVVDPPLLVPDDGFLSPIRTTPGGLNYYRAGLGPQDRIAPLQTGGRVDLNEAK 346 Query: 350 LNRLKESIRSLFLLDLFQVLDDKA------SRSAAESMEKTREKGAFVGPLIGGLQSEFI 403 + +++ +I F LDL ++ A SA E + R++ +GP++ ++EF+ Sbjct: 347 IGQVRAAIERTFYLDLLELPGPTAADGDVLRFSATEIAARQRDRLNILGPIVARQEAEFL 406 Query: 404 GAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVE 463 G ++ R L ++ LP + KV Y++P+ Q+A +AS Q + +V Sbjct: 407 GPLVIRTLSVMLRAEMLPPPPQVLL--DADFKVSYSNPVAIAQRAGELASISQLIQFLVP 464 Query: 464 LGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRD 499 DP+ + T RV+ + + + Sbjct: 465 FAQL--DPTVIQRFQTGRVAELAAEILKVSPSVFKS 498 >gi|268589375|ref|ZP_06123596.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] gi|291315402|gb|EFE55855.1| conserved hypothetical protein [Providencia rettgeri DSM 1131] Length = 550 Score = 370 bits (949), Expect = e-100, Method: Composition-based stats. 
Identities = 122/521 (23%), Positives = 213/521 (40%), Gaps = 38/521 (7%) Query: 5 SAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTTGSEA 53 +D+ + + LKN+R +EL + P + ++ D +++ Sbjct: 3 LKQDLLKQLSQLKNERQSFEPHWKELAEYTRPRSTRFNTSEVNRGDRRNTKIIDQEAAKS 62 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 LSS + S IT P +KW LA + V+ W + V + Sbjct: 63 ERTLSSGMMSGITSPARKWFRLATPDPDMMNYSP--------VKMWLEVVEQRMNEV--F 112 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 +RS L Y+ + F T + D E IR + P+ + Y++ VD Sbjct: 113 NRSNIYQSLPQTYSDIGTFATSALAVLEDN-----ERVIRTVPFPIGSYYIANGPDLTVD 167 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALARNEN-ERFTIIHAVYPKSLT-DKKKDKGN 231 + +REF+ TV Q+V ++G +S ++KS + T+IH+VYP K D N Sbjct: 168 TCFREFSMTVRQLVMEFGLDKVSEQVKSLWDSGNYSQWITVIHSVYPNLNRISGKLDAKN 227 Query: 232 KGFHSKFVSVDEN--RFFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRRLNET 288 K F S + + + R E FP + R+ V +++YG S P M AL +++ L Sbjct: 228 KLFKSVYFEMGGDSERVLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMIALGSVKALQLL 287 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQF-GNPLPYH 347 AQ +PP A + K + L PG + ++ + + Q + Sbjct: 288 QRRKAQQIDKVTNPPMQAPASIKNQRISLVPGGITYLPMAGADQMIKPIFQVQADINGLI 347 Query: 348 EELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQSEFIGA 405 ++ + I+ + DLF +L + +RS +E EK +GP++ L SE + Sbjct: 348 ADIGDTRNQIKEAYFSDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLQRLDSELLDK 407 Query: 406 MISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELG 465 +I+R I+ + LP LKVEY S + + Q++ V+S + V V L Sbjct: 408 LINRTFAIMARKNLLPVPPEEMQGMQ--LKVEYISVMAQAQKSVGVSSIERFVGFVGGLA 465 Query: 466 VKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDI 506 P +D ++TD + + ++ +V I Sbjct: 466 QM--KPEALDKLNTDEMIDNYAESIGVSPTIVSSNDQVAAI 504 >gi|323699782|ref|ZP_08111694.1| phage head-tail connector protein [Desulfovibrio sp. 
ND132] gi|323459714|gb|EGB15579.1| phage head-tail connector protein [Desulfovibrio desulfuricans ND132] Length = 579 Score = 369 bits (947), Expect = e-100, Method: Composition-based stats. Identities = 133/569 (23%), Positives = 223/569 (39%), Gaps = 41/569 (7%) Query: 3 QRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN--------------AQLRMWDT 48 A+ + RF+ L+ R +ELT ++ P KN+ R++D+ Sbjct: 4 TELARSLLKRFSGLEEARRPWVSSWQELTEYMLPRKNSFAGPGGHTLGRGRAGDERIFDS 63 Query: 49 TGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLF 108 T A L+S L L+T P W ++ + + +VR + + + + Sbjct: 64 TPLHALELLASSLGGLLTNPSLPWFDISV--------KDRAKGDADEVRAFMQEARERMV 115 Query: 109 GFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNH 168 +GF + Y V GT Y+EAD +R+ + PL V+++ + Sbjct: 116 AVFNSEDTGFQAHVHELYLDVALLGTAVMYVEADP-----TSVVRFSARPLGEVFVAESA 170 Query: 169 QNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK-KK 227 + VD+VYR + T Q + +WG S + + E ++HAV+P+ D Sbjct: 171 RGQVDTVYRRYEVTARQAIQEWG-AACSDETRRKGEDRPEEPVEVLHAVFPRMDRDPAGF 229 Query: 228 DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNE 287 + F S ++ V + EE PY+V R+ A E YGR P AL +R LN Sbjct: 230 GSAHFPFASVYMEVKNSHVLEESGYLEMPYMVPRWAKAAGETYGRGPGQTALSDVRVLNA 289 Query: 288 TVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYH 347 PP + + PG ++ R PV Sbjct: 290 MARTALMAAEKMSDPPLMVPDDGFLGPVRSGPGGLSYYRAGSTDRIEALPVNVDLRA-AE 348 Query: 348 EELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 E +N +ESI +FL D + + +A E++ + EK +GP++G LQ+EF+ +I Sbjct: 349 EMMNGRRESIGRIFLSDQLAP--EGPAVTATEAVIRQAEKMRVLGPVLGRLQTEFLSPLI 406 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 R ++ G LP +P L+V YTS + + Q+ Q + + L Sbjct: 407 RRVFRVMLRGGALPPFPEGLSPDD--LEVRYTSSVTRAQKQYEAQGLAQVMEYLSPLVGG 464 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQ 527 MD+ DTDRV+R N P+ ++ V + R Q++ QQ Sbjct: 465 RDAFGIMDNFDTDRVARHVAELFNIPSDYLKSEDRVVEGRTQKQRV-------ASSQQTA 517 Query: 528 QTSQDIGAKAAGRAMEKKLTHDMMENSYG 556 T + 
A A + + +G Sbjct: 518 STVANAAAIAKTLSEAYTDRPSALTELWG 546 >gi|212703348|ref|ZP_03311476.1| hypothetical protein DESPIG_01391 [Desulfovibrio piger ATCC 29098] gi|212673194|gb|EEB33677.1| hypothetical protein DESPIG_01391 [Desulfovibrio piger ATCC 29098] Length = 611 Score = 366 bits (940), Expect = 4e-99, Method: Composition-based stats. Identities = 117/544 (21%), Positives = 217/544 (39%), Gaps = 37/544 (6%) Query: 9 IQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRM----------WDTTGSEACIKLS 58 + R+ L +R + E L P + + D TG A L+ Sbjct: 41 LARRYRALLERRSPWDTAWESLAEHFLPTRFRTDDSLDDRPLLNRSLVDATGILAMRTLA 100 Query: 59 SLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGF 118 + L +T P + W LA + +RS + + D+V + R F Sbjct: 101 AGLQGGMTSPARPWFRLALDD--------PDLSRSHAGQRYLDEVEARMRVV--LQRCNF 150 Query: 119 VGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYRE 178 + + Y + FGT + AD L G R++ + + + VD+V+ Sbjct: 151 YNAMHTIYAELGTFGTAFVFELAD-----LRHGFRFVPLCAGQYVLDTDAARRVDTVFHR 205 Query: 179 FTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK-DKGNKGFHSK 237 ++ Q+V +G + L ++ A R ++R +IHAV P++ + + + S Sbjct: 206 MHMSLRQMVQSFGPEALPENLRLAARRTPDQRHAVIHAVLPRTERRPRLAGPCHMPWASV 265 Query: 238 FVSVDEN---RFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQ 294 + +E FP R+ V A+++YGRSPAM+ALP R L + + Sbjct: 266 YWLEGREGQVVPLKESGFMGFPGFGPRWDVAANDVYGRSPAMDALPDCRMLQQMGITTLK 325 Query: 295 FGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGAL-SREGRSLFQPVQFGNPLPYHE--ELN 351 ++ PP + + DL PG +N + + + P+ P + Sbjct: 326 AIHKAVDPPMSVHAGLRSVGLDLTPGGINFVDSLPGQNQPVATPLLQVKPDLAQARSAME 385 Query: 352 RLKESIRSLFLLDLFQ-VLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRE 410 +++ IR+ DLF+ +L+ ++ +A+E + EK +GP++ L E + +I R Sbjct: 386 AVQQQIRAGLYNDLFRLILEGRSKVTASEIAAREEEKLLLIGPVLERLHDELLIPLIDRT 445 Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGD 470 ++ + LP C + LKVE+ S L + Q+ +++ Q + L + Sbjct: 446 FRLMLALDMLPPCPPELSG--RHLKVEFVSLLAQAQKLVGISATDQYLAL--TLKAASAW 501 Query: 471 PSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTS 
530 P +D +D D + + P L R E +R RE R+ ++ L Q+ Sbjct: 502 PEALDSVDVDNLLDNYAESLGLPVNLTRPREERARLRAGREEARQTEQQLALLQKAADLG 561 Query: 531 QDIG 534 + Sbjct: 562 HTLA 565 >gi|262043408|ref|ZP_06016533.1| hypothetical protein HMPREF0484_3551 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|259039234|gb|EEW40380.1| hypothetical protein HMPREF0484_3551 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 515 Score = 366 bits (939), Expect = 6e-99, Method: Composition-based stats. Identities = 116/520 (22%), Positives = 193/520 (37%), Gaps = 42/520 (8%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKN--------------NAQLRMWD 47 A + R + LK R E + YP + + R+ D Sbjct: 1 MDELAVKLVKRADTLKANRQVHESVWRECYDYTYPLRGAGLSDEVLDAQSAKSKVARLLD 60 Query: 48 TTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTL 107 T +++ L+S L S +TP +W L W + Sbjct: 61 GTATDSARMLASALMSGMTPANAQWLNLDSESLP------------DDAAAWLSTCATLV 108 Query: 108 FGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYM-SV 166 + + F VV G Y++ D +E G + PL+ Y+ S Sbjct: 109 --WENIHAANFDAEGYEANLDVVCAGWFALYIDEDREE----GGFSFQQWPLAQCYVTST 162 Query: 167 NHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPK-SLTDK 225 +VD++YR + T +Q + ++G +S K+ A A+ +++F +H ++P+ + Sbjct: 163 RRDGIVDTIYRRYQLTAEQAIKEFGADKVSKKISDAAAKKPDDKFEFLHCIFPRENYVVN 222 Query: 226 KKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRL 285 + N F S V V E FP V R+ YG P +ALP + L Sbjct: 223 ARLAKNLRFASYNVEVSGKLIVRESGYHEFPCCVPRWMKIPGTPYGIGPVYDALPDCKEL 282 Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG-NPL 344 NET L++ IA + +K G I + +P+ G + Sbjct: 283 NETKRMEKAAQDLAIAGMWIAEDDGVLNPRTVKVGPRRIIVANS--VDSMKPLLTGADFN 340 Query: 345 PYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIG 404 RL+ SIR + + D Q D + +A E + +GP+ G Q+E++ Sbjct: 341 VAFTAEERLQASIRKIMMADQLQ-PQDGPAMTATEVHVRVALIRQLLGPVYGRFQAEYLQ 399 Query: 405 AMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVEL 464 ++ R + G P + + V Y SPL + QQ E+V + + V L Sbjct: 400 
PLVERCFGLAFRAGVFPPAPESLQ--NANFNVRYISPLARAQQLENVTAIERLGANVANL 457 Query: 465 GVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVE 504 + P D +DTD +R A PA +IR + VE Sbjct: 458 AQVS--PDVTDLVDTDEATRVIADALGVPAKVIRSSDAVE 495 >gi|85059164|ref|YP_454866.1| hypothetical protein SG1186 [Sodalis glossinidius str. 'morsitans'] gi|84779684|dbj|BAE74461.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 541 Score = 366 bits (938), Expect = 7e-99, Method: Composition-based stats. Identities = 115/523 (21%), Positives = 198/523 (37%), Gaps = 42/523 (8%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKN--------------NAQLRMWD 47 A + R + LK+ R E + YP + + ++ D Sbjct: 1 MDELAVKLITRADTLKSHRQRHESVWRECYDYTYPLRGAGFSADVLDAQSAKSKVAKLLD 60 Query: 48 TTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTL 107 T +++ L+S L S +TP +W L + W + Sbjct: 61 GTATDSARMLASALMSGMTPANAQWLNLDSESLP------------DDAKAWLSGCATLV 108 Query: 108 FGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS-V 166 + + F VV G Y+ DE E G + PLS Y++ Sbjct: 109 --WENIHAANFDAEGYEANLDVVCAGWFVLYI----DENREEGGYMFQQWPLSQCYVAST 162 Query: 167 NHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPK-SLTDK 225 +VD++YR + T +Q ++++G+ +S K++ A +++F +HA++P+ + Sbjct: 163 RKDGIVDTIYRCYQMTAEQAIAEFGEAGVSEKIRRAAKDKPDDKFDFLHAIFPRKNYVVN 222 Query: 226 KKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRL 285 + + F S V R E FP V R+ + YG P +ALP + L Sbjct: 223 ARLAKHLRFASFHVERQGKRIVRESGYHEFPVCVPRWMKISGGAYGIGPVYDALPDCKEL 282 Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLP 345 NET L++ IA + + +K G I + +P+ G Sbjct: 283 NETKRMEKAAQDLAISGMWIAEDDGVINPYSVKVGPRRII--VASSVNSMKPLLTGADFH 340 Query: 346 YHEELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIG 404 L+ SIR + + D Q D + +A E + +GP+ G Q+E++ Sbjct: 341 VAFTAEDRLQASIRKIMMADQLQ-PQDGPAMTATEVHVRVALIRQLLGPVYGRFQAEYLQ 399 Query: 405 AMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVEL 464 ++ R I G P + + V Y SPL + Q+ E V + + V +L Sbjct: 400 
PLVERCFGIAFRAGVFPAPPDSMQ--TAHFNVRYISPLARAQKLEDVTAIERLGANVAQL 457 Query: 465 GVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIR 507 + P +D +DTD R A PA +IR A+V +R Sbjct: 458 SQVS--PEVVDLVDTDEAMRVVADALGVPAKVIRSAADVTSLR 498 >gi|218886173|ref|YP_002435494.1| hypothetical protein DvMF_1072 [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218757127|gb|ACL08026.1| conserved hypothetical protein [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 595 Score = 366 bits (938), Expect = 7e-99, Method: Composition-based stats. Identities = 125/548 (22%), Positives = 218/548 (39%), Gaps = 59/548 (10%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-------------------- 40 M + +D ++ ++L+ QR ++ ++ P + Sbjct: 1 MTSQRLRDAREAVDFLERQRSPWEEAWRDIAAYVLPRRGRMHGRDPLGASAPGAVGGSSG 60 Query: 41 ----------AQLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKED 90 R+ D T + A L++ + +T P + W L + A D Sbjct: 61 VSGTHRSTDMRGGRVIDATATRAVRILAAGMQGGLTSPARPWFRLRLADGA--------D 112 Query: 91 ARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEE 150 A S R W D V L+ +RS F + YT + FG+ Y E D E Sbjct: 113 AESGPARRWLDAVEQRLY--WALARSNFYQASHALYTELAAFGSADLYQEVDP-----ER 165 Query: 151 GIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENER 210 R+ ++ + + VD+V R T Q+ ++G+ LS+ + L + N Sbjct: 166 LTRFAALTCGEFSWACDAAGRVDTVARRMLMTARQLAERYGEAHLSTGTRRMLRKEPNRH 225 Query: 211 FTIIHAVYPKSLTDKKKDKG-NKGFHSKFVSVDE--NRFFEEKQIATFPYIVGRYRVRAD 267 ++H V P+++ G + F S D E FP++ R+ V Sbjct: 226 VEVVHLVRPRAVRTPGHGSGLHMPFESLVFEADGAAGDLLHEGGFEEFPHLAARWDVTGS 285 Query: 268 EIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGAL 327 ++YGRSP M+ LP ++ L E ++PP + ++ +L PG N A Sbjct: 286 DVYGRSPGMDVLPDVKMLQEMARSQLLAIHKVVNPPMRVPT-GFKQRLNLIPGAQNYVAP 344 Query: 328 SREGRSLFQPVQFGNPLPYHEE--LNRLKESIRSLFLLDLFQV--LDDKASRSAAESMEK 383 + P+ NP ++ +++++R F DLF + D +++ +AAE E+ Sbjct: 345 GQ--PEAVAPLYQINPDIAAVTRKIDDVRKAVREGFFNDLFLMFTADGRSNVTAAEVAER 402 Query: 384 TREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLF 443 +EK +GP+I Q+E + +++R IL G LP 
++VEY S L Sbjct: 403 GQEKLLMLGPVIERHQTELLDPLLTRTYGILRRAGALPPNPPELEG--LEMRVEYVSALA 460 Query: 444 KYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEV 503 + Q+ + S Q V L P +D +D D+ PA ++R AEV Sbjct: 461 QAQRLGAAQSIRQFAAEVTALSATA--PGVLDKIDFDQAVDELASIGGVPARVVRSDAEV 518 Query: 504 EDIRQQRE 511 +R +RE Sbjct: 519 LRLRAERE 526 >gi|85059667|ref|YP_455369.1| hypothetical protein SG1689 [Sodalis glossinidius str. 'morsitans'] gi|84780187|dbj|BAE74964.1| hypothetical phage protein [Sodalis glossinidius str. 'morsitans'] Length = 517 Score = 364 bits (935), Expect = 2e-98, Method: Composition-based stats. Identities = 115/523 (21%), Positives = 199/523 (38%), Gaps = 42/523 (8%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKN--------------NAQLRMWD 47 A + R + LK+ R E + YP + + ++ D Sbjct: 1 MDELAVKLITRADALKSHRQRHESVWSECYDYTYPLRGAGFSADVLDAQSAKSKVAKLLD 60 Query: 48 TTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTL 107 T +++ L+S L S +TP +W L A + + W + Sbjct: 61 GTATDSARMLASALMSGMTPANAQWLNLDCESLA------------DEDKAWLSTCATLV 108 Query: 108 FGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS-V 166 + + F VV G Y+ DE E G + PLS Y++ Sbjct: 109 --WENIHAANFDAEGYEENLDVVCAGWFVLYI----DENREEGGYTFQQWPLSQCYVAST 162 Query: 167 NHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK 226 +VD++YR + T +Q ++++G+ +S K++ A +++F +HA++P++ Sbjct: 163 RKDGIVDTIYRCYQMTAEQAIAEFGEAGVSEKIRRAARDKPDDKFDFLHAIFPRTNYGVN 222 Query: 227 KD-KGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRL 285 + F S V R E FP V R+ YG P +ALP + L Sbjct: 223 ACLAKHLRFASFHVERQGKRIVRESGYHEFPVCVPRWMKIPGGAYGIGPVYDALPDCKEL 282 Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLP 345 NET L++ I+ + + +K G I + +P+ G Sbjct: 283 NETKRMEKAAQDLAISGMWISEDDGVINPYSVKVGPRRII--VASSVNSMKPLLTGADFQ 340 Query: 346 YHEELNR-LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIG 404 L+ SIR + + D Q D + +A E + +GP+ G Q+E++ Sbjct: 341 VAFTAEDRLQASIRKIMMADQLQ-PQDGPAMTATEVHVRVALIRQLLGPVYGRFQAEYLQ 399 Query: 405 
AMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVEL 464 ++ R I G P + + V Y SPL + Q+ E V + + V +L Sbjct: 400 PLVERCFGIAFRAGVFPPPPDSMQ--TAHFNVLYISPLARAQKLEDVTAVERLGANVAQL 457 Query: 465 GVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIR 507 + P +D +DTD +R A PA +IR A+V +R Sbjct: 458 SQVS--PEVVDLVDTDEATRVVADALGVPAKVIRSAADVTSLR 498 >gi|288959388|ref|YP_003449729.1| phage head-tail connector protein [Azospirillum sp. B510] gi|288911696|dbj|BAI73185.1| phage head-tail connector protein [Azospirillum sp. B510] Length = 535 Score = 360 bits (923), Expect = 4e-97, Method: Composition-based stats. Identities = 135/552 (24%), Positives = 220/552 (39%), Gaps = 30/552 (5%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M A++I R L R EL ++ P + R++D T Sbjct: 1 MADARAEEIIRRRESLAALRSPWEGVWSELGEYVRPLRTGFAGGPPQSGAKPSSRLFDAT 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 A L++ L +IT P W + E + V+ W V + Sbjct: 61 AGMANNNLAAGLYGMITNPANSWFNI--------KHEIDELNEVQAVKLWMATVERAMRQ 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 + F + Y + FGT FY++ G+ Y LS ++S N + Sbjct: 113 ALAANGLAFYSRVFGLYLDLPAFGTAVFYIDEQPG-----RGLWYSHRRLSECFVSENDR 167 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK-D 228 +D+VYR+FT+T Q +WGD+ K+ + F +HAV P D +K Sbjct: 168 EEIDTVYRDFTWTARQAQQRWGDRAGREVAKAIEKGEPDRPFRWLHAVEPNPDFDPRKLG 227 Query: 229 KGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 K F S +V VD+ E PY V R+ YG S A+ A+ I+ +N Sbjct: 228 ARFKPFRSVYVGVDDRHVVAEGGYDELPYQVPRWAPSDAGTYGDSAAVLAIADIKMVNAM 287 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE 348 + ++ PP +A E R PG + G + G L +P+Q G + Sbjct: 288 GKTTIVGAQKAVDPPLLAPDEFSVRGLRTSPGGITYGGVDMGGNQLLKPLQTGARVDLGL 347 Query: 349 EL-NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 EL + + +IR F L ++ R+A E ME EK + P +G +Q+EF+ + Sbjct: 348 ELEEQRRGAIREAFHWSLLLMVQQ-PGRTATEVMEHQEEKLRLMAPHLGRIQAEFLDPAL 406 Query: 408 
SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 R +L+ G LP L+++Y SPL + +A A+ ++ + + + Sbjct: 407 GRVFSLLNRTGQLPPPPDVLRQYPG-LRLDYVSPLARAAKAAEGAAVIRTLEALGPIAQL 465 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQ 527 P MD+ DTD ++R A PA ++ D +VE +R R Q++ Sbjct: 466 R--PEVMDNFDTDEIARGISDAYGLPAKMMLDPRQVEQMRSARAQQQQQAVALEQSAVAA 523 Query: 528 QTSQDIGAKAAG 539 +D+ A A Sbjct: 524 GALKDMSAAGAA 535 >gi|295096867|emb|CBK85957.1| Bacteriophage head to tail connecting protein [Enterobacter cloacae subsp. cloacae NCTC 9394] Length = 541 Score = 359 bits (922), Expect = 5e-97, Method: Composition-based stats. Identities = 113/523 (21%), Positives = 195/523 (37%), Gaps = 42/523 (8%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKN--------------NAQLRMWD 47 A + R + LK R + E + YP + + ++ D Sbjct: 1 MDELAVKLIKRSDTLKANRQQHESVWRECYDYTYPLRGAGFSDEVLDAQSAKHKVAKLLD 60 Query: 48 TTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTL 107 T +++ L+S L S +TP +W L + W + + Sbjct: 61 GTATDSARMLASALMSGMTPANAQWLNLDSESLP------------DDAKAWLSECATLV 108 Query: 108 FGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYM-SV 166 + + F VV G Y++ D +E G + PL+ Y+ S Sbjct: 109 --WENIHAANFDAEGYEANLDVVCAGWFVLYIDEDREE----GGYTFQQWPLAQCYVTST 162 Query: 167 NHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK 226 +VD++YR + T +Q + ++G +S K++ A + +++F +H ++P+ Sbjct: 163 RKDGIVDTIYRRYQLTAEQAIKEFGADKVSEKIRDAAKKKADDKFDFLHCIFPRETYMVD 222 Query: 227 -KDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRL 285 + N F S V V + E FP V R+ YG P +ALP + L Sbjct: 223 ARLAKNMRFASYNVDVSNKQIVRESGYHEFPCCVPRWMKIPGGSYGIGPVYDALPDCKEL 282 Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG-NPL 344 NET L++ IA + +K G I + +P+ G + Sbjct: 283 NETKRMEKAAQDLAISGMWIAEDDGVLNPRTVKVGPRRIIVANS--VDSMKPLLTGSDFS 340 Query: 345 PYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIG 404 RL+ SIR + + D Q D + +A E + +GP+ G Q+E++ Sbjct: 341 
VAFTAEERLQASIRKIMMADQLQ-PQDGPAMTATEVHVRVALIRQLLGPVYGRFQAEYLQ 399 Query: 405 AMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVEL 464 ++ R I G + + V Y SPL + Q+ E V + + V L Sbjct: 400 LLVVRCFGIAFRAGIFSPPPESLQ--NANFNVRYISPLARAQKLEDVTAIERLGANVANL 457 Query: 465 GVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIR 507 +D +DTD +R A PA +IR + V D+R Sbjct: 458 AG--ISQDVVDLIDTDEATRVVADALGVPAKVIRSSDAVADLR 498 >gi|332160969|ref|YP_004297546.1| hypothetical protein YE105_C1347 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325665199|gb|ADZ41843.1| Hypothetical phage protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330862125|emb|CBX72289.1| hypothetical protein YEW_AK02260 [Yersinia enterocolitica W22703] Length = 534 Score = 359 bits (922), Expect = 6e-97, Method: Composition-based stats. Identities = 116/523 (22%), Positives = 201/523 (38%), Gaps = 42/523 (8%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQL--------------RMWD 47 +A + R + LK R E + YP + + R+ D Sbjct: 1 MDDTAARLVKRVSSLKAARQLHESVWRECYDYTYPLRGSGFSTEVLDAQSAKSKVARLLD 60 Query: 48 TTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTL 107 T +++ L+S L S +TP +W L + S R W Sbjct: 61 GTATDSARILASALMSGMTPANAQWLDL------------GSENLSDDERSWLSTCATL- 107 Query: 108 FGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVN 167 + + F VV G Y++ D + G + PL+ V+++ + Sbjct: 108 -TWENIHAANFDAEGYEANIDVVCAGWFALYVDED----TEQGGYTFNQWPLAQVFVASS 162 Query: 168 H-QNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK 226 VV++VYR + T +Q V ++G +S K++ A + +++F IHA++P+ Sbjct: 163 RRDGVVNTVYRCYQLTAEQAVKEFGRDNVSHKIQDAANKKPDDKFEFIHAIFPRDGYIGN 222 Query: 227 -KDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRL 285 + N F S V V E + E FP V R+ YG P +ALP + L Sbjct: 223 ARLAKNLPFASFNVEVAEKKVVRESGYHEFPVCVPRWMKIPGTPYGVGPVYDALPDCKEL 282 Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG-NPL 344 NET L++ IA + + G I + + +P+ G + Sbjct: 283 NETKRMEKAAQDLAIAGMWIAEDDGVLNPRTVNVGPRKIIVANS--VNSMKPLLTGADFN 
340 Query: 345 PYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIG 404 RL+ IR + + D Q D + +A E + +GP+ G Q+E++ Sbjct: 341 VAFTAEERLQAQIRKILMADQLQ-PQDGPAMTATEVHVRVALIRQLLGPVYGRFQAEYLQ 399 Query: 405 AMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVEL 464 ++ R I G P+ + + Y SPL + Q+ E V + + + +L Sbjct: 400 PLVERCFGIAFRAGVFPQMPESMAQAN--FNIRYISPLARAQKLEDVTAIERLGANIAQL 457 Query: 465 GVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIR 507 +P +D+MD D +R A PA ++R A+V +R Sbjct: 458 A--AINPEVIDNMDADAAARVVSDALGVPAKVLRSAADVTALR 498 >gi|83313332|ref|YP_423596.1| hypothetical protein amb4233 [Magnetospirillum magneticum AMB-1] gi|82948173|dbj|BAE53037.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 545 Score = 357 bits (915), Expect = 4e-96, Method: Composition-based stats. Identities = 107/504 (21%), Positives = 193/504 (38%), Gaps = 43/504 (8%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPYKN----------NAQLRMWDTTGSEACIK 56 + R+ K +R +E + P ++ R++D T + + Sbjct: 31 SFLLRRYRKAKERRSTWESHWQECYDYALPLRDGMFHSSVPGERKADRLFDGTAPDCVDQ 90 Query: 57 LSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRS 116 L++ L S +TPP +W GLA + A + +++ + RS Sbjct: 91 LAASLLSELTPPWAQWFGLAAGDQMPE-------ADRDQAAPLLERIAAVMQSH--FDRS 141 Query: 117 GFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVY 176 F + Y V GT E G R+ SVPL V + +D + Sbjct: 142 NFAIEMHQCYLDAVTGGTASLMFEEAP--PGEPSAFRFTSVPLGQVVLEEGPAGRLDVTF 199 Query: 177 REFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHS 236 R +V + +++ VL ++ A A + + R ++ AV P +G + + Sbjct: 200 RRSELSVAALKARFPRAVLPREVIKAAADDPDLRLGVVEAVVP--------VRGGYSYAA 251 Query: 237 KFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFG 296 + Q ++ P++ R+ E+YGRSP M+ALP I+ N+ V + + Sbjct: 252 VLDDDGSDLVLGRGQFSSSPFLNFRWLKAPGEVYGRSPVMKALPDIKTANKVVELVLKNA 311 Query: 297 RLSLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLK 354 +++ A + L PG + A+ G G L+ L+ Sbjct: 312 TIAVTGIWQADDDGVLNPANIKLVPGTIIPKAVGSAGLQPLTA--PGRFDTSQLVLDDLR 369 Query: 355 
ESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDIL 414 IR + D + +A E +++ + +G G LQSE + +I R + IL Sbjct: 370 GRIRHALMGDKLSQPA-SPALTATEVLQRADDMARLLGATYGRLQSELLTPLILRAIHIL 428 Query: 415 DSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCM 474 +G +P + ++Y SPL + Q + L + + LG PS + Sbjct: 429 RRRGEIPP----LQVDGRTIDLQYRSPLAQNQGRRDARNVLNWLGALSSLG-----PSAL 479 Query: 475 DHMDTDRVSRFSLWATNTPAVLIR 498 +D+D +R+ A N P+ LIR Sbjct: 480 ATVDSDAAARWLARAFNVPSELIR 503 >gi|298485985|ref|ZP_07004059.1| hypothetical protein PSA3335_1414 [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298159462|gb|EFI00509.1| hypothetical protein PSA3335_1414 [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 533 Score = 354 bits (907), Expect = 3e-95, Method: Composition-based stats. Identities = 135/523 (25%), Positives = 210/523 (40%), Gaps = 42/523 (8%) Query: 5 SAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN--------------AQLRMWDTTG 50 +A I + LK+ R + YP + + + RM D T Sbjct: 3 TAAQICKTLSTLKSLRSPHESVWRDCFDHSYPIRGSGFCIEQITAMEAQMRKARMIDGTT 62 Query: 51 SEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 ++A LSS + S +TP W G+ S + R W D D L + Sbjct: 63 TDAARILSSGIMSGLTPANSLWFGMDVG------------QESDEERRWLDGSADIL--W 108 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVN-HQ 169 + S F T VV G Y++ D + G + P+++VY S + Sbjct: 109 QNIHASNFDAAAFEGLTDVVCAGWFALYIDQD----MEKGGFTFDLWPIASVYCSASKAG 164 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD-KKKD 228 +D+VYR + T +Q V+++G+ LS + E IHA+YP++ + Sbjct: 165 GKIDTVYRTYKLTAEQAVNEFGEDNLSETTRKLAKEKPQELVEFIHAIYPRTTHMVGARL 224 Query: 229 KGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 N S V V E P +V R+ + D +Y P +ALP R LNE Sbjct: 225 AKNMPVASCKVEVAAKTLVSESGYHEMPVVVPRWMMIPDSVYAVGPVFDALPDSRTLNEL 284 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE 348 G L++ IA + +K G I + +P+Q G+ Y E Sbjct: 285 CRMDLAAGDLAIAGMWIAEDDGVLNPRTVKVGPRKIIVANS--VDSMKPLQSGSNFQYAE 342 Query: 349 
E-LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 + RL+ SIR + + D Q D + +A E + +GP+ G LQ+E++ MI Sbjct: 343 TKIARLQGSIRKILMADQLQA-QDGPAMTATEVHVRVNLIRQLLGPVYGRLQTEYLQPMI 401 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 R I G L + + V Y SPL + Q+ E V++ Q V L V Sbjct: 402 ERCFGIAYRAGVLGQAPESLAG--RDFTVRYLSPLARSQKLEEVSAIDQFVQ--GALIVA 457 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQR 510 DPS MD++D D RF A P+ +IR A+ + +R+ R Sbjct: 458 QADPSVMDNIDMDEAQRFKGEALGVPSSVIRSKADRDKLREDR 500 >gi|220903991|ref|YP_002479303.1| hypothetical protein Ddes_0717 [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] gi|219868290|gb|ACL48625.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774] Length = 597 Score = 352 bits (904), Expect = 7e-95, Method: Composition-based stats. Identities = 119/547 (21%), Positives = 208/547 (38%), Gaps = 40/547 (7%) Query: 9 IQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLR-------------MWDTTGSEACI 55 + R+ L +R + + L P + + + + D TG A Sbjct: 8 LARRYQALLRRRMPWDTAWQSLADHFLPTRCRLRPQGGGAEEGPMLNSGLVDATGILAMR 67 Query: 56 KLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSR 115 L++ L +T P + W L + + ARS+ + W D+V + R Sbjct: 68 TLAAGLQGGLTSPARPWFRLGLDDA--------DLARSRPGQAWLDEVAARMRSV--FHR 117 Query: 116 SGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSV 175 F + + Y + FGT + AD +G R++ + + + VD+V Sbjct: 118 CNFYNAMHTLYAELATFGTAFVFELADP-----RDGFRFMPLCAGEYVLDCDAGRRVDTV 172 Query: 176 YREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT-DKKKDKGNKGF 234 +R + ++ QIV +G L ++ A+ RN +ER +I AVYP+ + Sbjct: 173 FRRSSMSLRQIVQTFGPAALPESLREAVRRNADERRNVIQAVYPRDDRIHGILTASHMPV 232 Query: 235 HSKFVSVD---ENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNE 291 S + E FP R+ V +++YGRSPAM+ALP R L + Sbjct: 233 ASVYWLEGRDGGEHALRESGFRHFPGFGPRWDVAGNDVYGRSPAMDALPDCRMLQQMGIT 292 Query: 292 LAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE--- 348 + ++ PP + + DL PG +N + Sbjct: 293 
TLKAIHKAVDPPMSVSAGLRSVGLDLTPGGINYVDSAPGQSPQAATPLLQVNPDLSTARR 352 Query: 349 ELNRLKESIRSLFLLDLFQ-VLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 + ++ IRS DLF+ +L+ ++ +A+E + EK +GP++ L E ++ Sbjct: 353 AMESVQNQIRSGLYNDLFKLILEGRSGVTASEIAAREEEKLVLIGPVLERLHDELFIPLM 412 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 R + + LP C + LKVE+ S L + Q+ V++A Q + L Sbjct: 413 DRTFECMRELDMLPPCPPELSG--RRLKVEFVSLLAQAQKLVGVSAADQYLAL--TLRAS 468 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQ 527 T P +D ++ D + + P L R E E +R R R +Q Sbjct: 469 TAWPEALDTLNVDHLLDNYADSLGLPISLTRPPEEREQMRAARAEAARGAALADSLKQGV 528 Query: 528 QTSQDIG 534 Q + Sbjct: 529 DLVQQLA 535 >gi|303257564|ref|ZP_07343576.1| conserved hypothetical protein [Burkholderiales bacterium 1_1_47] gi|302859534|gb|EFL82613.1| conserved hypothetical protein [Burkholderiales bacterium 1_1_47] Length = 548 Score = 352 bits (903), Expect = 8e-95, Method: Composition-based stats. Identities = 114/564 (20%), Positives = 228/564 (40%), Gaps = 39/564 (6%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYP-----------YKNNAQLRMWDTTGSEACI 55 K I RF LK +R ++ + P + ++ D + Sbjct: 6 KLINQRFESLKQERSSWEDLWRDIRDYCLPDLGCFPGEDATQGSKRYRKILDAEAIDCAD 65 Query: 56 KLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSR 115 L++ L ++ P + W L + + ++ V+EW +V D L S+ Sbjct: 66 VLAAGLLGGVSSPSRPWLRLTT--------MDPDLDKNPAVKEWMTKVQDLL--LLYFSK 115 Query: 116 SGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSV 175 + L Y + FGT C ++ E+ I ++ + +++ + VD++ Sbjct: 116 AECYNALHQSYLELPVFGTACTIVKPHP-----EQLISLQNLTIGEYWLAEDDYGKVDTM 170 Query: 176 YREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK-KKDKGNKGF 234 YR + T Q+V +WG + +++ ++ A ++ RF +IHA+ P+ + K+D N + Sbjct: 171 YRRLSLTAKQMVQQWGFEAVNNDVRQAFEKDPFTRFNVIHAIEPRIERNPDKRDNKNMPW 230 Query: 235 HSKFVSVDE-NRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 S + ++ E FP + R+ +YGR P +AL + L LA Sbjct: 231 QSVYFQEGVQDKVLSESGFRNFPALCPRWMTSGGSVYGRGPGAKALSAQKSLQRLHLRLA 290 Query: 294 
QFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN--PLPYHEELN 351 + PP + S K + KPG A++ + + + + P + Sbjct: 291 ELVDYGTRPPILYPSTLKDQLSQFKPGGR--VAVNPQEAPIIRSMWEVRTDPQAMLALIQ 348 Query: 352 RLKESIRSLFLLDLFQVL---DDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 ++ I+ +F +++FQ++ ++ R+A E +EK +GP++ L +E + +++ Sbjct: 349 STRQDIQRIFFVNVFQMIAATANQTDRTATEVQALEQEKVMMLGPVLERLHTELLDPLVT 408 Query: 409 RELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKT 468 + LPE L +EY S L + Q+ S ++ + L Sbjct: 409 NAFGFMVEYNMLPEVPEELYG--RELSIEYVSVLAEAQKNASANGIVRTAQQIGLLA--Q 464 Query: 469 GDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQ 528 +P +D +D D P LI +V IRQQR Q++ + QQ Sbjct: 465 INPQAVDKLDVDATIDQLADMNGVPPSLIVTGQKVALIRQQRAEQQQAQMQAAQLQQAMT 524 Query: 529 TSQDIGAKAAGRAMEKKLTHDMME 552 + +D+G A + +++ + + + Sbjct: 525 SLKDLGQAADSQGLQEAFSEEGAQ 548 >gi|23015763|ref|ZP_00055531.1| hypothetical protein Magn03010200 [Magnetospirillum magnetotacticum MS-1] Length = 543 Score = 351 bits (899), Expect = 2e-94, Method: Composition-based stats. 
Identities = 107/518 (20%), Positives = 196/518 (37%), Gaps = 43/518 (8%) Query: 9 IQDRFNYLKNQRGELNYWMEELTGFLYPYKN----------NAQLRMWDTTGSEACIKLS 58 + R+ K +R +E + P ++ R++D T + +L+ Sbjct: 33 LLRRYRKAKERRSTWESHWQECYDYALPLRDGMFHAGVPGERKADRLFDGTAPDCVDQLA 92 Query: 59 SLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGF 118 + L S +TPP +W GL + A +V ++V + RS F Sbjct: 93 ASLLSELTPPWAQWFGLTAGDQMPE-------AERDQVAPLLERVAAVMQSH--FDRSNF 143 Query: 119 VGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYRE 178 + Y V GT E G R+ SVPL V + +D +R Sbjct: 144 AIEMHQCYLDAVTGGTASLLFEE--AAPGEASAFRFTSVPLGQVVLEEGPAGRLDVTFRR 201 Query: 179 FTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKF 238 +V + +++ VLS + A A + + R ++ AV P +G + + Sbjct: 202 SEMSVAALKARFARAVLSGHLIKAAADDPDLRLGVVEAVIP--------VRGGYSYAAVL 253 Query: 239 VSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRL 298 + ++ P++ R+ E+YGRSP M+ALP I+ N+ V + + + Sbjct: 254 DDESSDVVLGRGSFSSSPFLNFRWLKAPGEVYGRSPVMKALPDIKTANKVVELVLKNATI 313 Query: 299 SLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKES 356 ++ A + L PG + A+ G G L+ L+ Sbjct: 314 AVTGIWQADDDGVLNPANIKLVPGTIIPKAVGSAGLQPLTA--PGRFDTSQLVLDDLRGR 371 Query: 357 IRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDS 416 IR + D S +A E ++++ + +G G LQSE + +I R + IL Sbjct: 372 IRHALMGDKLSQPA-SPSLTATEVLQRSDDMARLLGATYGRLQSELLTPLIMRAIHILRR 430 Query: 417 QGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDH 476 +G +P + + ++Y SPL + Q + L + + LG P+ + Sbjct: 431 RGEIPP----LSVDGRVFDLQYRSPLAQNQGRRDARNVLSWLGALSSLG-----PAALAT 481 Query: 477 MDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQR 514 +D +R+ A N P+ L+R +E + + Sbjct: 482 VDAAAAARWLGRAFNVPSELVRPASEQQAGAMDPDPAA 519 >gi|254251745|ref|ZP_04945063.1| hypothetical protein BDAG_00942 [Burkholderia dolosa AUO158] gi|124894354|gb|EAY68234.1| hypothetical protein BDAG_00942 [Burkholderia dolosa AUO158] Length = 539 Score = 350 bits (898), Expect = 3e-94, Method: Composition-based stats. 
Identities = 108/565 (19%), Positives = 211/565 (37%), Gaps = 46/565 (8%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLR--------------MW 46 M + + R +K++R E P + + ++ Sbjct: 1 MIDSLGETLAKRLETMKSKRQVHELVWRECFMLTDPVRASGLDGPQMDANQIAQAVALIF 60 Query: 47 DTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDT 106 D+T ++A L + + S +TP W + + + W D ++ Sbjct: 61 DSTATDAKRTLEASIMSGMTPANSLWFTMTVN------------GADDEGERWLDSASEV 108 Query: 107 LFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSV 166 L ++ + F G Y+ DE G+ + P++ VY + Sbjct: 109 L--WQNIHSANFDSEAADAVAD-GMAGWFALYI----DENRDAGGLYFEHWPMAGVYCAS 161 Query: 167 N-HQNVVDSVYREFTFTVDQIVSKWGD--KVLSSKMKSALARNENERFTIIHAVYPKSLT 223 + VD V+R + T +Q V ++ L ++ E + A+YP+ + Sbjct: 162 SKPGGTVDIVFRCYQLTAEQCVREFNRRGDSLPQEIVDKAKNKPEELVDLCQAIYPRDVH 221 Query: 224 D-KKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 N S + ++ + E P +V R++ + +YG P ++ALP I Sbjct: 222 MVGALRAKNMPIASVTFACNQKQVIRESGYHEMPVVVARWKKIPNSVYGVGPLLDALPDI 281 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 R LN+ V L++ IA + +K G + + +P+Q + Sbjct: 282 RTLNDIVKLEYANLDLAVSGMWIAEDDGVLNPRTVKVGPRKVIVANS--VDSMKPLQPAS 339 Query: 343 PLPYHEE-LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 E + +L+ IR + D Q D + +A E + +GP+ G LQ+E Sbjct: 340 NFQLAETRIEKLQGQIRKTLMADQLQ-PQDGPAMTATEVHVRVDLIRQLLGPIYGRLQAE 398 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 ++ +I+R + G P + V+Y SPL + Q+ E V++ + + V Sbjct: 399 YLQPLIARCFGLAYRAGVFPPPPDSLGGRN--FSVQYQSPLARAQKLEEVSAIERLMGDV 456 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQH 521 + P +D++D D R + P ++R + +V RQQ++ ++Q Sbjct: 457 TVIAQV--KPEALDNIDGDEAVRLTAKNLGVPDSIVRTSDQVTQYRQQKQAAAAQQQQQQ 514 Query: 522 LQQQ-LQQTSQDIGAKAAGRAMEKK 545 L + + IG+ AA R + + Sbjct: 515 LGMEVQGDVMKSIGSAAASRMVANQ 539 >gi|38424264|gb|AAR19412.1| head-tail connector protein [uncultured cyanophage] Length = 517 Score = 350 bits (897), Expect = 4e-94, Method: 
Composition-based stats. Identities = 72/552 (13%), Positives = 160/552 (28%), Gaps = 57/552 (10%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQ----------LRMWDTTGSEACIKLSS 59 + R++ L ++R + + + PY W + G++ + L+S Sbjct: 4 KTRYDELSSERTQFLDEARQASELTLPYLIRGHEETYIGMKQLKTPWQSVGAKGVVTLAS 63 Query: 60 LLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFV 119 L + PP + L S + ++ +V T+ + S Sbjct: 64 KLMLALLPPQTSFFKLQLDESQIGEEFGPDIKS--ELDLSFAKVERTILE--NIAASDDR 119 Query: 120 GCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREF 179 + +V G +M D PL+ + + V + + Sbjct: 120 VAVHQALQHLVVAGNALIFMGKDG----------LKVFPLNRYVVERDGNGNVLEIVTKE 169 Query: 180 TFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFV 239 + + + + + +E H + +H + Sbjct: 170 RISKKLLAEEMPE--YEEPVNEDSNFRPDECDVYTHVRRENNRV---------VWHQEVH 218 Query: 240 SVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLS 299 + + I P++ R+ E YGR + + ++ L L + + Sbjct: 219 GKVLPKSISKAPIDANPWLPLRFNTVDGEAYGRGRVGQFIGDLKSLEALSQALVEGSAAA 278 Query: 300 LHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRS 359 + + + L GA+ +Q G + + R Sbjct: 279 AKVVFVVAPSSTTKPATLASAG--NGAIVSGRPDDIGVIQVGKTADFGTAFQMTQVYERR 336 Query: 360 LFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGN 419 L L + +A E E +G L L EF+ ++R+L + + Sbjct: 337 LSEAFLILNPRNAERVTAEEVRMTQLELEQQLGGLFSLLTVEFLVPYLNRKLSVAQKRNE 396 Query: 420 LPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDT 479 +P P V + L + Q A S+A + + G + +++ Sbjct: 397 IPRIPKGIVKPTI---VAGVNALGRGQDAISLA------QFLQTIAQTMGPEAIAQYINP 447 Query: 480 DRVSRFSLWATNTP-AVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAA 538 V + A L+R E++ +Q + ++ + Q + +T Sbjct: 448 TEVVKRLAAAQGIDILNLVRSMEELQANQQAEQQMQQQQMQAEQQTAMLKT--------- 498 Query: 539 GRAMEKKLTHDM 550 M+ + Sbjct: 499 -PMMDPTKNPQL 509 >gi|262043663|ref|ZP_06016772.1| hypothetical protein HMPREF0484_3791 [Klebsiella pneumoniae subsp. 
rhinoscleromatis ATCC 13884] gi|259039001|gb|EEW40163.1| hypothetical protein HMPREF0484_3791 [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] Length = 554 Score = 349 bits (894), Expect = 9e-94, Method: Composition-based stats. Identities = 139/570 (24%), Positives = 252/570 (44%), Gaps = 40/570 (7%) Query: 1 MNQRSAKD--------IQDRFNYLKNQRGELNYWMEELTGFLYPYKN----------NAQ 42 M+ + ++ I ++ R +E+ + P Sbjct: 1 MSDQKTQENESERIGRILREQKSMETDRSVFEQHWQEIAERILPRSAEFKGTRQKGGKRT 60 Query: 43 LRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQ 102 + D TG+ A K + + S+ITP QKWH L+ + A ++V+ + + Sbjct: 61 EKAIDATGALALQKFGAAIESVITPRTQKWHTLS----------NERFANDEEVQRYFQE 110 Query: 103 VTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNV 162 V D LF R + F Y S FGTGC +++ + +G RY + L + Sbjct: 111 VRDILFRLRYAPWANFASQSHEHYISSGAFGTGCTFVD-----NVIGKGPRYCTYHLREI 165 Query: 163 YMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSL 222 Y + N Q ++D V+R++ T Q + ++G++ L ++++ + +++F +H V P Sbjct: 166 YFTENFQGMIDVVHRKYCMTARQAIQQFGEENLPQQVRTTARNDPSKQFNFLHRVEPNDK 225 Query: 223 TD-KKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPT 281 D ++DK F S + ++ ++ +E + PY + RY E+YGRSPAM LP Sbjct: 226 RDMSRQDKEGMPFRSVHICMEGSKIVQEGGYWSQPYAISRYYTAPGEVYGRSPAMVVLPD 285 Query: 282 IRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG 341 I+ LNE + + ++++ PP + + + F + PG +N G ++R+G+ L P+ Sbjct: 286 IKLLNEINRAIIEGAQMAVRPPMLLPEDGILQPFKMMPGALNFGGMNRDGKPLALPLNTA 345 Query: 342 NPLPYHEEL-NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQS 400 L + +++I F + LFQ+L D +A E+M + +EKG + P G +Q+ Sbjct: 346 TDFSVAMTLAEQKRQTINDGFFITLFQILVDNPQMTATEAMLRAQEKGQLLAPTAGRIQA 405 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 EF+G +I RE+DI G LPE +EYTSPL + Q +E + + VN Sbjct: 406 EFLGTLILREIDIAYQNGLLPEPPEQLKEIGGEYDIEYTSPLVRLQMSEEASGIMNVVNA 465 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQ 520 +G D + ++ D RF A+ P +++ E+ + Q ++ + Sbjct: 466 
AGTIG--QFDQNIARTLNGDAALRFIAKASGAPLQVVKTEDEMAAQDAADQQQLQLQQLL 523 Query: 521 HLQQQLQQTSQDIGAK---AAGRAMEKKLT 547 ++D A A L Sbjct: 524 AAAPVAATAAKDFAQANQIAQTPAPSPALQ 553 >gi|46581008|ref|YP_011816.1| hypothetical protein DVU2604 [Desulfovibrio vulgaris str. Hildenborough] gi|46450429|gb|AAS97076.1| conserved hypothetical protein [Desulfovibrio vulgaris str. Hildenborough] gi|311234693|gb|ADP87547.1| hypothetical protein Deval_2404 [Desulfovibrio vulgaris RCH1] Length = 569 Score = 348 bits (893), Expect = 1e-93, Method: Composition-based stats. Identities = 112/521 (21%), Positives = 201/521 (38%), Gaps = 49/521 (9%) Query: 17 KNQRGELNYWMEELTGFLYPY-------------KNNAQLRMWDTTGSEACIKLSSLLSS 63 + +R E+ F+ P R+ D T + A L++ + Sbjct: 16 ERERRVWEPLWREVEDFVLPRCIDSPRRADEAGDTARRGPRIIDGTATRAVRILAAGMQG 75 Query: 64 LITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQ 123 +T P + W L + + + R W D V L+ +RS F + Sbjct: 76 GLTSPARPWFRLRLADEDMEEAGPE--------RRWLDVVERRLYA--ALARSNFYAAVH 125 Query: 124 SFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTV 183 YT + FG+ Y EAD + +R+ + + + + VD+V R + Sbjct: 126 GLYTELAAFGSADMYHEADP-----QRVMRFSCLACGDFAWACDAAGRVDTVVRRLRMSA 180 Query: 184 DQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD---------KKKDKGNKGF 234 Q+ ++G+ LS +++ L R+ ++H V P+ + N + Sbjct: 181 RQMAQRYGEARLSRRVRRMLRRDPERSVPLVHMVRPRVRRNAGEAGKTASGGLGGVNMPW 240 Query: 235 HSKFVSVDE-NRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 S + E FP++ R+ V +IYGRSP M+ LP ++ L E Sbjct: 241 QSLTWETEGAEGLLHEGGFEEFPHLAARWDVAGGDIYGRSPGMDVLPDVKMLQEMARSQL 300 Query: 294 QFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEE--LN 351 ++PP ++ +L PG N + P+ NP + Sbjct: 301 LAIHKVVNPPMRVP-SGFKQRLNLIPGGQNYVTPGQG--ESVGPLYQINPDIGAVTHKME 357 Query: 352 RLKESIRSLFLLDLFQV--LDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISR 409 ++ ++R F DLF + + +++ +AAE +E+ EK +GP+I QSE + ++ R Sbjct: 358 DVRRAVREGFFNDLFLMFTAEGRSNITAAEVLERGEEKLLMLGPVIERHQSELLDPLLER 417 Query: 410 
ELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTG 469 IL G LP ++VEY S L + Q+ + + + + V L Sbjct: 418 TYGILRRGGLLPPPPPELAG--RSMRVEYVSALAQAQRVVTAQAIRRFASDVSALAGVA- 474 Query: 470 DPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQR 510 P +D +D ++ PA ++R AEV +R R Sbjct: 475 -PQVLDKVDFEQAVDELAAIAGVPARVVRSDAEVATLRAAR 514 >gi|120601696|ref|YP_966096.1| hypothetical protein Dvul_0646 [Desulfovibrio vulgaris DP4] gi|120561925|gb|ABM27669.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4] Length = 569 Score = 348 bits (893), Expect = 1e-93, Method: Composition-based stats. Identities = 112/521 (21%), Positives = 201/521 (38%), Gaps = 49/521 (9%) Query: 17 KNQRGELNYWMEELTGFLYPY-------------KNNAQLRMWDTTGSEACIKLSSLLSS 63 + +R E+ F+ P R+ D T + A L++ + Sbjct: 16 ERERRVWEPLWREVEDFVLPRCIDSPRRADEAGDTARRGPRIIDGTATRAVRILAAGMQG 75 Query: 64 LITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQ 123 +T P + W L + + + R W D V L+ +RS F + Sbjct: 76 GLTSPARPWFRLRLADEDMEEAGPE--------RRWLDVVERRLYA--ALARSNFYAAVH 125 Query: 124 SFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTV 183 YT + FG+ Y EAD + +R+ + + + + VD+V R + Sbjct: 126 GLYTELAAFGSADMYHEADP-----QRVMRFSCLACGDFAWACDAAGRVDTVVRRLRMSA 180 Query: 184 DQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD---------KKKDKGNKGF 234 Q+ ++G+ LS +++ L R+ ++H V P+ + N + Sbjct: 181 RQMAQRYGEARLSRRVRRMLRRDPERSVPLVHMVRPRVRRNAGEAGKTASGGLGGVNMPW 240 Query: 235 HSKFVSVDE-NRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 S + E FP++ R+ V +IYGRSP M+ LP ++ L E Sbjct: 241 QSLTWETEGAEGLLHEGGFEEFPHLAARWDVAGGDIYGRSPGMDVLPDVKMLQEMARSQL 300 Query: 294 QFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEE--LN 351 ++PP ++ +L PG N + P+ NP + Sbjct: 301 LAIHKVVNPPMRVP-SGFKQRLNLIPGGQNYVTPGQG--ESVGPLYQINPDIGAVTHKME 357 Query: 352 RLKESIRSLFLLDLFQV--LDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISR 409 ++ ++R F DLF + + +++ +AAE +E+ EK +GP+I QSE + ++ R Sbjct: 358 DVRRAVREGFFNDLFLMFTAEGRSNITAAEVLERGEEKLLMLGPVIERHQSELLDPLLER 417 Query: 410 
ELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTG 469 IL G LP ++VEY S L + Q+ + + + + V L Sbjct: 418 TYGILRRGGLLPPPPPELAG--RSMRVEYVSALAQAQRVVTAQAIRRFASDVSALAGVA- 474 Query: 470 DPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQR 510 P +D +D ++ PA ++R AEV +R R Sbjct: 475 -PQVLDKVDFEQAVDELAAIAGVPARVVRSDAEVATLRAAR 514 >gi|227355860|ref|ZP_03840253.1| tail protein [Proteus mirabilis ATCC 29906] gi|227164179|gb|EEI49076.1| tail protein [Proteus mirabilis ATCC 29906] Length = 554 Score = 347 bits (891), Expect = 2e-93, Method: Composition-based stats. Identities = 122/523 (23%), Positives = 205/523 (39%), Gaps = 39/523 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDTT 49 M+ + + + N L+ +R EL+ F P + ++ D T Sbjct: 1 MSTPLKEQLLQQLNQLETERSSFEPHWRELSDFTRPRSTRFTASDVNRGDRRNSKIIDPT 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 S A LSS + S IT P + W LA + V+ W + + Sbjct: 61 ASLASSVLSSGMMSGITSPARPWFRLATPDPDLMDYGP--------VKLWLETTEQRMNE 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 +RS L Y + FGT + D + IR + PL + Y++ + Sbjct: 113 V--FNRSNLYQSLPLMYGDLGTFGTAAMAVVED-----SQRIIRTVHFPLGSYYIANSPS 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNEN-ERFTIIHAVYPKSLT-DKKK 227 VD YR+FT TV Q+V ++G +S +KS ++ + ++HAVYP K Sbjct: 166 LSVDVCYRKFTMTVRQLVMEFGVDSVSDTVKSMWNSSQYSQWIEVVHAVYPNLERQTGKL 225 Query: 228 DKGNKGFHSKFVSVDENR--FFEEKQIATFPYIVGRYRVRADEIYGRS-PAMEALPTIRR 284 + +K F S ++ V + E FP + R+ V +++YG S P M AL + Sbjct: 226 EAKHKPFKSVYLEVAGDHEKVLRESGYDEFPIMAPRWEVNGEDVYGSSCPGMLALGGTKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIG--ALSREGRSLFQPVQFGN 342 L AQ +PP + K + + PG +N A VQ Sbjct: 286 LQLMQKRKAQMIDKLTNPPLQVPASLKNQRVNTIPGGINYLDEANPTNKIQTIFDVQPVA 345 Query: 343 PLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSA--AESMEKTREKGAFVGPLIGGLQS 400 E++ ++ I + + +DLF+++ +RS +E EK +GP++ L S Sbjct: 346 LKALLEDVQDTRQLIDTAYFVDLFRMMQMVNTRSMPIEAVVEMREEKLLQLGPVLQRLDS 405 Query: 401 
EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 E + +I+R IL ++ LP LKVEY S + + Q++ V S + Sbjct: 406 ELLDKLINRTFSILVNKNLLPVAPDEMQGMD--LKVEYISVMAQAQKSIGVGSIERFAGF 463 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEV 503 V L P +D ++ D A ++ +V Sbjct: 464 VGNLAKV--KPEALDKLNADDAIDNYASAIGVSPTIVATNEQV 504 >gi|310005791|gb|ADP00177.1| head-tail connector protein [Cyanophage NATL2A-133] Length = 528 Score = 346 bits (888), Expect = 5e-93, Method: Composition-based stats. Identities = 76/554 (13%), Positives = 151/554 (27%), Gaps = 56/554 (10%) Query: 11 DRFNYLKNQRGELNYWMEELTGFLYP---YKNNA------QLRMWDTTGSEACIKLSSLL 61 R+N L R + E P +N W + G++ + LSS L Sbjct: 6 QRYNKLSTGREQFLNVAYECAELTIPTLIMRNETPPNYAQFKTPWQSIGAKGVVTLSSKL 65 Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC 121 + PP + L S + E ++ ++ + + S Sbjct: 66 MLGLLPPSTSFFKLQLDDSKLGVEVPPE--SKSELDLSFAKIERMIME--AIAASTDRVQ 121 Query: 122 LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTF 181 + + +V G YM D PL+ + + + + Sbjct: 122 IFTALKHLVVTGNALLYMGKDG----------MKMYPLNRYVVERDGNGDPVEIVTKEKI 171 Query: 182 TVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSV 241 + + + + + I + K +H + + Sbjct: 172 NKELLPKLPLPLKGDGVVD---DEQQGKDVDIYTCI--------KLTPKGWKWHQEVHDI 220 Query: 242 DENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLH 301 + P++ R+ E YGR E L ++ L + L + + Sbjct: 221 MIPGSEGKAPAKKCPFLPLRFVTVDGEDYGRGRVEEFLGDLKSLEALMQALVEGSAAAAK 280 Query: 302 PPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLF 361 + + L GA+ + VQ G + + + L Sbjct: 281 VVFTVSPSSVTKPQTLANAG--NGAIIQGRPDDIGVVQVGKTADFQTAYQLVNTLEKRLA 338 Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 L + D +A E E +G L L +EF+ + R++ L +P Sbjct: 339 EAFLIMNVRDSERTTAEEVRMTQMELEQQLGGLFSLLTTEFLLPYLHRKMHTLTQSKQIP 398 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 P V + L + Q +++ V + + G + ++ D Sbjct: 399 ALPKGLVKPTI---VAGINALGRGQDRDAL------VQFITTIAQTMGPEALQRFVNADE 
449 Query: 482 VSRFSLWATNTPA-VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 + A L++ E Q+ + QQ G A Sbjct: 450 AIKRLAAAQGIDVLNLVKSM----------EEQQAEQQAAQQQQMQASLMDQAGQLAGTP 499 Query: 541 AMEKKLTHDMMENS 554 M+ + E Sbjct: 500 MMDPTKNPEGFEQM 513 >gi|144899435|emb|CAM76299.1| head-to-tail joining protein [Magnetospirillum gryphiswaldense MSR-1] Length = 502 Score = 346 bits (886), Expect = 8e-93, Method: Composition-based stats. Identities = 114/507 (22%), Positives = 201/507 (39%), Gaps = 44/507 (8%) Query: 9 IQDRFNYLKNQRGELNYWMEELTGFLYPYKN----------NAQLRMWDTTGSEACIKLS 58 ++ R+ K +R +E + P ++ R++D T ++A +L+ Sbjct: 17 LRQRYRKAKERRATWEAHWQECYDYALPLRDAVLHQPNPGEKKGDRLFDGTAADAVDQLA 76 Query: 59 SLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGF 118 + L S +TPP +W GL ++A ++V D+V L RS F Sbjct: 77 ASLLSELTPPWAQWFGLTAGP-------DLDEAERQQVAPLLDKVGAILQSH--FDRSNF 127 Query: 119 VGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYRE 178 + Y VV GT C E + G R+ +VPL+ + +DS +R Sbjct: 128 AVEMHQCYLDVVTGGTACLLFEE--AQPGEASAFRFTAVPLAQAVLEEGPDGKLDSSFRR 185 Query: 179 FTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKF 238 T+ + ++ L + + RF +I AV P G+ + + Sbjct: 186 SELTLAALRQRFPAAQLDPSLIRRGEEDPQARFAVIEAVIPNQR-------GHYDYAAIL 238 Query: 239 VSVDENR--FFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFG 296 ++ E + P+I R+ EIYGRSP M+ALP I+ N+ V + + Sbjct: 239 EDATDDDEALLAEGRFGQSPFINFRWLKAPGEIYGRSPVMKALPDIKTANKVVELVLKNA 298 Query: 297 RLSLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLK 354 +++ A + L PG + A+ G + G L+ L+ Sbjct: 299 TIAVTGIWQADDDGVLNPANIKLIPGTIIPKAVGSAGLQPLE--SPGRFDISQLVLDDLR 356 Query: 355 ESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDIL 414 IR L D D+ +A E +E++ + +G G LQSE + +I R + IL Sbjct: 357 GRIRHALLADKLGQADN-PKMTATEVLERSADMARLLGATYGRLQSELLTPLILRAVTIL 415 Query: 415 DSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCM 474 +G +P L++++Y SPL + Q + L ++ + +LG P+ M Sbjct: 416 
RRRGEIPP----LLVDGHLVELQYRSPLAQSQAQRDAHNVLSWLSALAQLG-----PAGM 466 Query: 475 DHMDTDRVSRFSLWATNTPAVLIRDTA 501 +D +++ A N PA L+ Sbjct: 467 AVVDPAAAAQWLGRAFNIPADLMVAPQ 493 >gi|78357592|ref|YP_389041.1| hypothetical protein Dde_2550 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219997|gb|ABB39346.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 549 Score = 345 bits (885), Expect = 1e-92, Method: Composition-based stats. Identities = 122/548 (22%), Positives = 224/548 (40%), Gaps = 44/548 (8%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFL---------------YPYKNNAQLRM 45 M+ + ++ + Y+++QRGE + E+ ++ P R+ Sbjct: 1 MSISTLEEARGAAAYIESQRGEWDSRWREVADYVTGAGYGGGSWQEGTARPE-GRRGQRI 59 Query: 46 WDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTD 105 D T + A L++ L +TPP + W L + S +VR W D V Sbjct: 60 IDATATRALRVLAAGLQGGLTPPARPWFRLRLADRGLM--------ESAEVRRWLDDVEA 111 Query: 106 TLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS 165 L+ + S F + +T++ +G+ YMEAD + +R+ VP + + Sbjct: 112 ALYA--ALAGSNFYQNSHALFTALAAYGSADMYMEADP-----QRVMRFCVVPHGDFAWA 164 Query: 166 VNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK 225 + VD+V R F+ T Q K+G LS ++ A ++ V P++ D Sbjct: 165 CDAAGRVDTVVRRFSMTAAQAAQKYGSDRLSRTVRRLAAVQPYAPVALVQLVRPRARRDP 224 Query: 226 K-KDKGNKGFHSKFVSVDENR-FFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 + +D NK + S E R A FP++ R+ V ++YG SP M+ LP ++ Sbjct: 225 RRQDSLNKPYESLTWEAQEPRRLLHVSGYAEFPHLCARWEVNGGQLYGHSPVMDVLPDVK 284 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP 343 L E ++PP + ++ +L PG N + P+ P Sbjct: 285 MLQEMARSQLLAVHKVVNPPMRVPT-GFKQRLNLIPGAQNYV--NPAQPDALSPLYQIRP 341 Query: 344 --LPYHEELNRLKESIRSLFLLDLFQVLDD--KASRSAAESMEKTREKGAFVGPLIGGLQ 399 ++ ++ SIR ++F + +++ +AAE ME+++EK +GP++ Q Sbjct: 342 DIQAVTYKIEDVRRSIREGLFTEMFLLFAGESRSNVTAAEIMERSQEKLLLLGPVVERHQ 401 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVN 459 ++ + +I R +L G LP LKVEY S L + Q+ + Q 
Sbjct: 402 TDILDPLIGRAFGLLARAGRLPPAPDVLAG--RDLKVEYVSALAQAQRLSAAQGVRQLAG 459 Query: 460 TVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEE 519 V P +D +D D+ PA ++R +V+ +R++R +++ Sbjct: 460 DVSRFAAMA--PEVLDKIDFDQAVDELASIAGAPAGIVRSDEDVQLLRRERALKQAEQAG 517 Query: 520 QHLQQQLQ 527 + L + Sbjct: 518 RALLESAG 525 >gi|291335893|gb|ADD95488.1| T7-like head to tail connector [uncultured phage MedDCM-OCT-S08-C41] Length = 527 Score = 344 bits (881), Expect = 3e-92, Method: Composition-based stats. Identities = 80/560 (14%), Positives = 168/560 (30%), Gaps = 57/560 (10%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYP----------YKNNAQLRMWDTTGSEACI 55 ++R++ L + R + E + P + W + G+++ + Sbjct: 1 MSKAKERYSQLSSDRHQFLDIAVECSELTLPHLITDDLRVRQNHKRLTTPWQSVGAKSVV 60 Query: 56 KLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSR 115 L++ L + PP + L L E ++ ++ + + + Sbjct: 61 TLAAKLMLALLPPQTSFFKLQVRDDQLGEELPMEVRS--ELDLSFSKMERMVMD--KIAA 116 Query: 116 SGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSV 175 S + ++ G +M D + PL+ +S + V + Sbjct: 117 SSDRVVVHQALKHLIVGGNALIFMGKDG----------LKNFPLNRFVVSRDGNGYVCEI 166 Query: 176 YREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFH 235 + G + + N +E + V ++D G +H Sbjct: 167 VTKELVNRKL----LGIDPMPDPHTVSGKGNNDEDAEVYTYVR-------RQDNGGWVWH 215 Query: 236 SKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQF 295 + + P++V R+ E YGR E L +R L L + Sbjct: 216 QEVDDKIIDGSRSTAPKDASPWLVLRFNAVDGEDYGRGRVEEFLGDLRSLEALSQALIEG 275 Query: 296 GRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKE 355 + + A + + GA+ + VQ G + ++ Sbjct: 276 SAAAAKVVFLVNPAATTKPSTIAKAG--NGAIVQGRPEDVSVVQVGKTADFGTASQMAQQ 333 Query: 356 SIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILD 415 R L L + +A E E +G L L EF+ ++R L ++ Sbjct: 334 IERRLGEAFLLLNIRQSERTTAEEVRLTQLELEQQLGGLFSLLTVEFLKPYLARTLMVMQ 393 Query: 416 SQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMD 475 G LP+ P V + L + Q ES+ + + + G + M Sbjct: 394 
RSGQLPKIPREYVQPQI---VAGVNALGRGQDRESLTA------FIGTIAQTLGPEALMK 444 Query: 476 HMDTDRVSRFSLWATNTPA-VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIG 534 ++D + A L++ +++ QQ++ + Q+ Sbjct: 445 YIDASEAIKRLAAAQGIDVLNLVKTPQQMQQDMQQQQAMSSQQQLLGQAGQMM------- 497 Query: 535 AKAAGRAMEKKLTHDMMENS 554 + M+ D E + Sbjct: 498 ---SAPLMDPSKNPDAAEMA 514 >gi|310005702|gb|ADP00089.1| head-tail connector protein [Cyanophage NATL1A-7] Length = 543 Score = 342 bits (878), Expect = 6e-92, Method: Composition-based stats. Identities = 75/552 (13%), Positives = 165/552 (29%), Gaps = 52/552 (9%) Query: 8 DIQDRFNYLKNQRGELNYWMEELTGFLYPY----------KNNAQLRMWDTTGSEACIKL 57 +DR+ L R + + E + PY ++ W + G+++ + L Sbjct: 2 KARDRYAQLTRGRTQFLHTAVECSRLTLPYLVQEDLSSRPEHQKLHTPWQSVGAKSVVNL 61 Query: 58 SSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSG 117 ++ L + PP + L + + ++ ++ + + S S Sbjct: 62 AAKLMLALLPPQTSFFKLQIQDNKIGVEFDPKIRS--EMDLSFAKMERMVMDY--ISASN 117 Query: 118 FVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYR 177 + ++ G +M D + PL+ + + + + Sbjct: 118 DRVVVHQALKHLIVSGNALIFMGKDG----------LKNYPLNRYVCNRDGNGNICEIVT 167 Query: 178 EFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSK 237 + + + + +S + +++ ++ D G +H + Sbjct: 168 KELISRKILGQDLPVPLPNSPGEDGYKTGSDDQDVEVYTYVRLD------DNGRWVWHQE 221 Query: 238 FVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGR 297 T P++V R+ E YGR E L IR L L + Sbjct: 222 AFDNILPGSRSTAPKNTSPWLVLRFNTVDGEDYGRGRVEEFLGDIRSLEGLSQSLVEGSA 281 Query: 298 LSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESI 357 + + + + + GA+ + +Q G + ++ + Sbjct: 282 AASKVVFLVSPSSTTKPKTIADAG--NGAIVQGRPDDVGVIQVGKTADFRTAQEQMMQLE 339 Query: 358 RSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQ 417 + + L + +A E E +G L L EF+ ++R L IL Sbjct: 340 KRINEAFLVLNVRQSERTTAEEVRLTQMELEQQLGGLFSLLTVEFLEPYLNRTLHILQRN 399 Query: 418 GNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHM 477 +P+ P + + L + Q ES+ + L G + ++ Sbjct: 400 
KEIPKIPKESVRPQI---IAGVNALGRGQDEESL------IRFAQTLSQTVGPEMMVKYL 450 Query: 478 DTDRVSRFSLWATNTPA-VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAK 536 D + A A LI+ + +QQ+ + Q + + G Sbjct: 451 DPGEYVKRLAAAQGIDALNLIKSPETMAQEKQQQMQ----------EMQQGELLKQAGQL 500 Query: 537 AAGRAMEKKLTH 548 A M+ Sbjct: 501 AGTPMMDPSKNP 512 >gi|330007155|ref|ZP_08305897.1| hypothetical protein HMPREF9538_03586 [Klebsiella sp. MS 92-3] gi|328535502|gb|EGF61962.1| hypothetical protein HMPREF9538_03586 [Klebsiella sp. MS 92-3] Length = 559 Score = 342 bits (878), Expect = 6e-92, Method: Composition-based stats. Identities = 125/533 (23%), Positives = 206/533 (38%), Gaps = 51/533 (9%) Query: 1 MNQRSAKD-IQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-----------AQLRMWDT 48 M + S K LKN+R EL F+ P R+ D Sbjct: 1 MAELSPKQHYLKHLGQLKNERTSFEEHWRELAEFIDPRSTRFLTTERNNGSKRNTRIVDP 60 Query: 49 TGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLF 108 T S+A L S + S IT P + W LA + V+ W D V + Sbjct: 61 TASKAARTLQSGMLSGITSPTRPWFKLATPDPEMMQYGP--------VKRWLDVVMTRMN 112 Query: 109 GFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNH 168 +RS L Y + FGT + D E+ IR +P+ + Y+S +H Sbjct: 113 DVM--NRSNVYQSLPIIYRHLGVFGTAAMAVLED-----DEDVIRTHPLPIGSYYLSNSH 165 Query: 169 QNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLT-DKK 226 + VD+ YR F+ T QIV ++G +S+ ++ A E F ++H P + K Sbjct: 166 RLSVDTTYRVFSMTARQIVMQFGLDNVSNAVRGAWDNANYEAWFDVVHLTEPNIDRVNGK 225 Query: 227 KDKGNKGFHSKFVS--VDENRFFEEKQIATFPYIVGRYRVRADEIYGR-SPAMEALPTIR 283 + NK F S + D ++ E P + R+ + +++YG P M AL T + Sbjct: 226 LNSRNKAFKSVYFELSGDGDKLLREAGFDEPPILSPRWEINGEDVYGSNCPGMMALGTGK 285 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP 343 L A ++PP +A + K + +L PG + + L +P +P Sbjct: 286 ALQLEQIRKANAIDKLVNPPMVAPTGLKNKLINLAPGGVTYVDEVDATK-LVRPAYAVSP 344 Query: 344 L--PYHEELNRLKESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQ 399 + ++ I + F DLF + +RS EK +GP++ L Sbjct: 345 QLNDMLGSIADDRQMIEACFFSDLFNLFSTINTRSMPVEAVAAMQDEKLLQLGPVLERLN 404 Query: 400 
SEFIGAM-----ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASA 454 E + R +I+ + PE + LKVEY S L + Q++ ++S Sbjct: 405 DE-----FLDPFVDRTFNIMARRNLFPEPPEELQG--TPLKVEYVSILAQAQKSIGISSV 457 Query: 455 LQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIR 507 + V V L +P+ +D ++ D+ PA ++ EV+ R Sbjct: 458 ERFVGFVGNLAKA--NPAALDKLNIDQTIDEYGNMLGVPATIVNSDDEVQATR 508 >gi|288957023|ref|YP_003447364.1| hypothetical protein AZL_001820 [Azospirillum sp. B510] gi|288909331|dbj|BAI70820.1| hypothetical protein AZL_001820 [Azospirillum sp. B510] Length = 534 Score = 342 bits (877), Expect = 1e-91, Method: Composition-based stats. Identities = 106/505 (20%), Positives = 199/505 (39%), Gaps = 44/505 (8%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPYK----------NNAQLRMWDTTGSEACIK 56 + + DR+ + +RG ++ P R++D T +A + Sbjct: 21 EALLDRYRGARERRGVWESHWQDCYDHALPNGRPFHGGGTAGERRVNRLFDGTAPDAVEQ 80 Query: 57 LSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRS 116 L++ L S +TPP +W G A + + D+ + RS Sbjct: 81 LAASLLSELTPPWSRWFGFRPGPDLTGAERDR-------IAPLLDRAAGIIQAH--FDRS 131 Query: 117 GFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVY 176 F + + +V GT ME G +R+ +VPL++ + +D+ + Sbjct: 132 NFAVEVHQAFLDLVTVGTASLLMEE--AAPGAVSSLRFTAVPLADAVLEEGPDGRLDATF 189 Query: 177 REFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHS 236 R T+ QI+ ++ L +++ A + + RF ++ AV P D + Sbjct: 190 RRSEATLAQILQRFPGAGLPDELRRRAAEDPDHRFPLVEAVVP--------DGAAYRWGV 241 Query: 237 KFVSV-DENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQF 295 S + + + + A P++ R+ E YGRSP M+ALP I+ N+ V + + Sbjct: 242 VLDSGLADPSWLAQGRFAQSPFVNFRWLKAPGETYGRSPVMKALPDIKTANKVVELVLKN 301 Query: 296 GRLSLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRL 353 +++ A + L PG + A+ G + G L+ L Sbjct: 302 ASIAVTGIWQADDDGVLNPSTIRLVPGTIIPKAVGSAGLTPL--ANPGRFDVSQLVLDDL 359 Query: 354 KESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDI 413 + IR L+D +D A +A E +E++ E +G G LQ+E + ++ R + I Sbjct: 360 RGRIRHALLVDRLGPVD-SARMTATEVLERSVEMARLLGATYGRLQAELMTPLLLRAVSI 418 
Query: 414 LDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSC 473 L +G +P L+++++ SPL + Q V + L+ +++V LG + Sbjct: 419 LRRRGEIP----DITVDGRLVELQHRSPLAQAQAQRDVQATLRWLDSVKALGPEAEAVVD 474 Query: 474 MDHMDTDRVSRFSLWATNTPAVLIR 498 + + A PA L+R Sbjct: 475 -----AAATAHWLGEAFGVPAKLMR 494 >gi|239787361|emb|CAX83837.1| Head-to-tail joining protein [uncultured bacterium] Length = 524 Score = 341 bits (873), Expect = 3e-91, Method: Composition-based stats. Identities = 107/512 (20%), Positives = 192/512 (37%), Gaps = 46/512 (8%) Query: 2 NQRSAKDI-QDRFNYLKNQRGELNYWMEELTGFLYPYK----------NNAQLRMWDTTG 50 N A+ + RF + +R +E F P + R++D T Sbjct: 5 NDPDAQRVVLKRFEKARERRNVWEGHWQECYDFALPSRGGPLLSSQPGAKRTDRLFDGTA 64 Query: 51 SEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 + +L++ L + +TPP +W GLA +K Sbjct: 65 PDCVDQLAASLLAQLTPPWAQWFGLAAGPDLTPEEREVAAPVLEKAGAALQS-------- 116 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQN 170 RS F + Y +V GT E G R+ ++PL+ + + + + Sbjct: 117 -HFDRSNFAIEMHQCYLDLVTAGTASLLFEEAPL--GSASAFRFTAIPLAQLALEESVEG 173 Query: 171 VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKG 230 +D+ +R T+ I ++ L M + + RF ++ AV P ++ Sbjct: 174 RLDTTFRSSEMTISAIRERFPKAQLPESMGRKSKDDADARFKVVEAVLP--------ERH 225 Query: 231 NKGFHSKFVSVD--ENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 +H+ E + P+I R+ E+YGRSP M++LP I+ N+ Sbjct: 226 GYAYHAILDGEGTGGAETLAEGRFEMSPFINFRWLKAPGEVYGRSPVMKSLPDIKTANKV 285 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREGRSLFQPVQFGNPLPY 346 V + + +++ A + L PG + A+ G + + G Sbjct: 286 VELVLKNATIAVTGIWQADDDGVLNPANIKLVPGTIIPKAVGSAGLTPLE--TPGRFDIS 343 Query: 347 HEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 L L++ I L D +D + +A E +E++ E +G G LQSE + + Sbjct: 344 QLMLTDLRQRISHALLADRLGQID-APNMTATEVLERSAEMARLLGATYGRLQSELLTPL 402 Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 + R + IL +G +P + +++ Y SPL + E + LQ + V+ G Sbjct: 403 
VMRAVAILKRRGEIP----GLSIDGHQIELIYKSPLANERGREDAKNTLQWLTAVMSFG- 457 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAVLIR 498 P +D +R+ A N PA L+R Sbjct: 458 ----PPANQVVDLGAAARWLAKALNVPAELLR 485 >gi|167032756|ref|YP_001667987.1| putative tail protein [Pseudomonas putida GB-1] gi|166859244|gb|ABY97651.1| putative tail protein [Pseudomonas putida GB-1] Length = 564 Score = 339 bits (869), Expect = 7e-91, Method: Composition-based stats. Identities = 107/540 (19%), Positives = 200/540 (37%), Gaps = 50/540 (9%) Query: 1 MNQRSAKDI-QDRFNYLKNQRGELNYWMEELTGFLYPYK-----------NNAQLRMWDT 48 M S + + + R + LK +R + +E++ F+ P + + ++ + Sbjct: 1 MATDSPRKLAEKRLSALKTERSSWDTNAKEISDFILPMRSRVMCDDTNRGDRRNNKIINN 60 Query: 49 TGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLF 108 + A +S + S IT P + W LA A F V+ W + T + Sbjct: 61 RATMASRTTASGMMSGITSPARPWFNLAPVARAIMEFGP--------VKSWFYECTQRMR 112 Query: 109 GFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNH 168 RS L + Y + FGTGC +++ D IR + Y+S Sbjct: 113 DV--FLRSNLYQVLPTCYQEMATFGTGCIWVDEHPD-----TVIRCEAFTWGEYYISNGA 165 Query: 169 QNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT--DKK 226 ++YREF +TV+Q+V ++G + LS K+ N ++F ++ + Sbjct: 166 DGRAAAIYREFKWTVNQLVQEFGVEALSPSSKALYENNNGDQFISCAQRVELNMNANPDR 225 Query: 227 KDKGNKGFHSKFVSVD--ENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRR 284 N F + + E++ FP + R+ + YG P L ++ Sbjct: 226 AGSRNLPFSALTWEAGAPGDMVLEDRGYHEFPAMAVRWESMPGDAYGTGPGRICLGDVKA 285 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRS--LFQPVQFGN 342 L + A+ +PP A E K + PG + + Sbjct: 286 LQLYERQAARMTETGANPPLQAPVELKGQPSSTIPGGVTYVPMVGGQNQMAPIYQPNAAW 345 Query: 343 PLPYHEELNRLKESIRSLFLLDLFQVLDD-KASRSAAESMEKTREKGAFVGPLIGGLQSE 401 P ++ + I F +DLF ++ R+A E + EK +GP++ + E Sbjct: 346 LSPIQAKIQEHEGRINEAFFVDLFLMVSQLDTVRTATEIAARKEEKMLMLGPVLERINDE 405 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPP-------------VSLLKVEYTSPLFKYQQA 448 + +I R +I+ Q +P G + S ++ EY S L + Q++ Sbjct: 406 LLDPLIDRTFNIMLRQS-IPIWAGIIDGDPLLPPPPEELINANSEIQAEYVSILAQAQKS 464 Query: 
449 ESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQ 508 ++V + L P +D +++D++ A ++R EV IR+ Sbjct: 465 QNVLGLERFATLAGNLSGAF--PEVLDKVNSDQLIEEYADAIGVIPTVVRGADEVAAIRE 522 >gi|302339294|ref|YP_003804500.1| head-to-tail joining protein [Spirochaeta smaragdinae DSM 11293] gi|301636479|gb|ADK81906.1| head-to-tail joining protein, putative [Spirochaeta smaragdinae DSM 11293] Length = 560 Score = 339 bits (869), Expect = 7e-91, Method: Composition-based stats. Identities = 120/523 (22%), Positives = 224/523 (42%), Gaps = 38/523 (7%) Query: 3 QRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLR----------MWDTTGSE 52 ++SA++I F LK +R +E+T ++P ++ ++D T Sbjct: 4 EKSAQEIIQTFEQLKQERSTWEDEYQEITEQIFPRRSVWTDNKGRASRSGGLIYDGTPIS 63 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 A L++ L + P +W L + R+W + V + ++ E Sbjct: 64 ALNLLANGLVGYLVSPATRWFKLRPTQDELLQIRG--------ARQWLEIVENLIYD--E 113 Query: 113 RSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV 172 +RS F + ++ G Y++ D+ + Y +Y++ + + Sbjct: 114 FNRSNFYEEIVEYFRDGGSIGIATIYVQEDIGRRMA----NYSCRHPKEIYIAEDRFGYI 169 Query: 173 DSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK-KKDKGN 231 D+V+R F T ++ ++G + LS +++ R+ ER IIHAVYP+ + KK + Sbjct: 170 DTVFRRFFPTAKELEEEFGREALSDGVQNLCERSPYERVEIIHAVYPRKKRNPRKKGNRD 229 Query: 232 KGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNE 291 F S +V N E+ PY+V R+ +DE+YGR P +AL ++RLN + Sbjct: 230 MKFASAYVEGGSNHKIRERGYERLPYVVWRWSTNSDEVYGRGPGYDALVDVKRLNRLSRD 289 Query: 292 LAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEE-L 350 + + ++++ PP + + + + P +N + + + G + Sbjct: 290 MLKQSQMAVDPPLAVPEKMRGK-VNWVPRGLNYY---QNPNEVPVALNPGMQFQVGLDRE 345 Query: 351 NRLKESIRSLFLLDLFQVLDDKA-SRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISR 409 +++ I F+ D F +L+ +A E ME+ EK A +G +IG + SEF+ +I Sbjct: 346 QHMQQIIEKHFMTDFFLMLEQAPKEMTATEVMERQSEKAAVLGTVIGRISSEFLDPIIDI 405 Query: 410 ELDILDSQGNL----PECEGADNPPVSLLKVEYTSPLFKYQQAESV-ASALQGVNTVVEL 464 DI L PE A ++++Y PL + Q+ V A Q +N V + Sbjct: 406 
TFDIAMKGKRLPPPPPEFAEAMYKTNGGIEIDYLGPLAQAQKKFHVTQGAQQSLNAVAPI 465 Query: 465 GVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIR 507 +P D ++ D+++ L A P I D +V+ IR Sbjct: 466 --MQINPQVADLINWDQLTMEILHAYGMPQKAIVDLRDVQKIR 506 >gi|221213955|ref|ZP_03586928.1| conserved hypothetical protein [Burkholderia multivorans CGD1] gi|221166132|gb|EED98605.1| conserved hypothetical protein [Burkholderia multivorans CGD1] Length = 549 Score = 338 bits (867), Expect = 1e-90, Method: Composition-based stats. Identities = 135/546 (24%), Positives = 235/546 (43%), Gaps = 31/546 (5%) Query: 4 RSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN--------------AQLRMWDTT 49 + + + +K +R ++ F+ P + RM+D+T Sbjct: 7 KLLEALNADHGRMKEKRQSYEAVWNDVIDFMMPRLDKFGQMPRPDSEKGRERSQRMFDST 66 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 A + + S+ITP Q WH L S A V+ + V LF Sbjct: 67 APLALRNFVAAMDSMITPATQVWHRLKTSNDAL--------NEVPSVKAYLQAVVRALFA 118 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 R R + GF + + Y S+ FG G +E DV GI Y +VP+ ++ + N+ Sbjct: 119 VRYRWQGGFTTQMGATYQSIGLFGPGALMIEHDVG-----HGIVYRNVPMQRLWFAENNA 173 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK-D 228 ++D + + T+ Q ++G + LS M++AL R+ + T H V P++ D +K D Sbjct: 174 GLIDKTHVLWRLTLRQAAQRFGRENLSPSMQTALERDPEKTHTFYHVVEPRADRDPRKLD 233 Query: 229 KGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 N F S ++ +R + TFP+ +GR+ V D++YG SPA +A+P IR N+ Sbjct: 234 GRNMRFGSYWLDEGRDRIIQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDIRMANDM 293 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE 348 + + + PP +A + FDL+ G +N G L G + +P+ G Sbjct: 294 AKTNIRGAQKMVDPPLLASEDGVLEGFDLRSGSLNWGGLDERGNEMVKPLLTGKQAQIGI 353 Query: 349 EL-NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 E +++I F + LFQ+L D +A E +++ +EKG + P +G Q+E +G +I Sbjct: 354 EFSQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQAELLGPLI 413 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 RE+DIL G P + 
+ VEY SPL K +A A+ LQ + + + Sbjct: 414 QREVDILAEAGQFPPMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQLGVVA-- 471 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQ 527 DP+ ++ R+ + P + E++ ++ + Sbjct: 472 QFDPNAAKLVNGHRIGKLLADFGGVPVEALNTDEELQASAAAEAQAAQMQQVLEAAPVAA 531 Query: 528 QTSQDI 533 +D+ Sbjct: 532 GAIKDL 537 >gi|148724480|ref|YP_001285446.1| head to tail connector [Cyanophage Syn5] gi|145588125|gb|ABP87944.1| head to tail connector [Synechococcus phage Syn5] Length = 542 Score = 334 bits (855), Expect = 4e-89, Method: Composition-based stats. Identities = 72/560 (12%), Positives = 165/560 (29%), Gaps = 40/560 (7%) Query: 10 QDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLL 61 Q R++ ++ R + PY + + + GS+ LSS L Sbjct: 6 QARYSAMRADREDFLDMARRCAALTLPYLLTEDGHASGGRLQQPYQSLGSKGVNALSSKL 65 Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC 121 + P + L + + + ++ ++ + ++ + S Sbjct: 66 MLSLFPIQTSFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVM--QQIAESSDRVQ 123 Query: 122 LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTF 181 L + ++ G + PL + + V + Sbjct: 124 LTAAMKHLIVTGNVLVFAGKKT----------LKVYPLDRYVIERDGDGNVIEIITRELV 173 Query: 182 TVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK----KKDKGNKGFHSK 237 + +++ + L S + +F + ++ + K G +H + Sbjct: 174 DRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCKLVDGQHRWHQE 233 Query: 238 FVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGR 297 + + P++ R+ V E YGR E + L+ L + Sbjct: 234 CDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSA 293 Query: 298 LSLHPPTIAVSEAKQRNFDL-KPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKES 356 + + A + L + G I E S+ Q + + E + L + Sbjct: 294 AAAKVVFMVSPSATTKPQSLARAGTGAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQR 353 Query: 357 IRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDS 416 I FL + +A E E E + + G L E + ++R+L ++ Sbjct: 354 ISDAFL---ILNVRQSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQR 410 Query: 417 QGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDH 476 LP P V + 
+ + ++ + + +G G + Sbjct: 411 SKQLPSLPKGLVMPTV---VAGLGGVGRGEDRAAL------IEFMQTVGQAMGPEALQQF 461 Query: 477 MDTDRVSRFSLWATNTPA-VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQT--SQDI 533 +D + A+ L++ + + QQ + Q+ QL ++ + + Sbjct: 462 IDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQAGQLAKSPIGEKM 521 Query: 534 GAKAAGRAMEKKLTHDMMEN 553 + E E+ Sbjct: 522 MQQINAPGQEAPAGPQTGED 541 >gi|48697195|ref|YP_024925.1| hypothetical protein BcepC6B_gp05 [Burkholderia phage BcepC6B] gi|47779001|gb|AAT38364.1| gp05 [Burkholderia phage BcepC6B] Length = 549 Score = 331 bits (847), Expect = 2e-88, Method: Composition-based stats. Identities = 135/514 (26%), Positives = 233/514 (45%), Gaps = 31/514 (6%) Query: 4 RSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN--------------AQLRMWDTT 49 + + + +K +R ++ +L P + +M+D+T Sbjct: 7 KILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFDST 66 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 A + + S+ITP Q WH L A V+ + V TLF Sbjct: 67 APLALRNFVAAMDSMITPATQLWHRLKTGNDAL--------NEIASVKAYLQGVVRTLFA 118 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 R R + GFV + + Y S+ FG G +E DV + GI Y +VP+ ++ + N+ Sbjct: 119 ARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVGK-----GIVYRNVPMQRLWFAENNS 173 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK-D 228 ++D + ++ T+ Q ++G + LS M+S L ++ + HAV P++ D +K D Sbjct: 174 GLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADRDPRKLD 233 Query: 229 KGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 N F S ++ +R + TFP+ +GR+ V D++YG SPA +A+P +R N+ Sbjct: 234 GRNMQFASYWLDEGRDRIVQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDM 293 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE 348 + + + PP +A + FDL+ G +N G L+ +G + +P+ G Sbjct: 294 AKTNIRGAQKLVDPPLLANEDGVLDGFDLRSGALNWGGLNDKGEEMVKPLLTGKQAQIGI 353 Query: 349 EL-NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 E +++I F + LFQ+L D +A E +++ +EKG + P +G QSE +G MI Sbjct: 354 
EFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGPMI 413 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 +RE+DIL G LP+ + + VEY SPL K +A A+ LQ + + + Sbjct: 414 AREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQLGIVS-- 471 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTA 501 DP+ + R++R P + Sbjct: 472 QFDPAAAKVPNGARIARLLADYGGVPVEAMSTDE 505 >gi|54302247|ref|YP_132240.1| putative head-tail connector protein [Photobacterium profundum SS9] gi|46915668|emb|CAG22440.1| hypothetical protein PBPRB0567 [Photobacterium profundum SS9] Length = 552 Score = 330 bits (846), Expect = 3e-88, Method: Composition-based stats. Identities = 111/574 (19%), Positives = 208/574 (36%), Gaps = 42/574 (7%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNA-----------QLRMWDTTG 50 + + F L + EL ++ P + + D + Sbjct: 1 MKTIRQQCDSIFQGLDSDYAPWESHYRELANYIQPRRQRFSKDSVNRGGAHNSNIIDPSA 60 Query: 51 SEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 + A + + S IT P KW L K+ + VR + D D + G Sbjct: 61 TLAMRVAAGGMYSGITNPVTKWLRLNVED--------KDLNKYHIVRLYLDTCADLILGM 112 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQN 170 + S F + S + ++ + E D +R+ P+ + + + + Sbjct: 113 --LASSNFYNVVPSMFMDLLTYSGSSVGFEKDPL-----TVMRFYPNPIGSYRLGIGPRQ 165 Query: 171 VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENER-FTIIHAVYPKSLTDKKK-D 228 V + R+ + V Q+V K+G +S +KSA + + I H V+ + Sbjct: 166 NVSTHGRKVEYRVSQVVEKFGLDNVSQSIKSAYRSGKYNQLTEIRHLVFDNPDFVPRAFS 225 Query: 229 KGNKGFHSKFVSVDENR--FFEEKQIATFPYIVGRYRVRADEIYGR-SPAMEALPTIRRL 285 K S + ++R F FP++ R+ V ++ YG P M AL +I+ L Sbjct: 226 AVRKPICSIWYDPADDRNPFLRRSGFDEFPFVTPRWEVIGNDTYGSFGPGMLALGSIKGL 285 Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLP 345 + + + L PP + S K L PG + + + Q PL Sbjct: 286 QKDQRDKYEAQDKMLKPPMVGPSSLKNNPRSLLPGAVTFVDNQQGQQGFTPAFQTNFPLN 345 Query: 346 YHEE-LNRLKESIRSLFLLDLFQVL--DDKASRSAAESMEKTREKGAFVGPLIGGLQSEF 402 Y E + + I S F DLF + K++ +A E + EK +GP++ E Sbjct: 346 
YQLESIRDTRAIIDSAFFKDLFLAVIDIGKSNTTATEIAARKEEKLLMLGPVLNRFNEEG 405 Query: 403 IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 + ++S ++ +G LPE + +EY L + Q+A ++S + V + Sbjct: 406 LDPIVSASFYEMNRRGMLPEPPPEL--DGVDVNIEYVGLLQQAQKAVGISSIERTVGFIG 463 Query: 463 ELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHL 522 L +D +D D V T T ++ + +V+ R R Q++ + + Sbjct: 464 NLAGVR--QDVLDKVDFDSVVDIYTDITGTTPRILFNEQQVKATRDARIQQQQREQMAAM 521 Query: 523 QQQLQQTSQDIGAKAAGRAMEKKLTHDMMENSYG 556 ++D A + + + + N G Sbjct: 522 ----AAPAKDGAEAAKLLSETRTDESNGLSNFLG 551 >gi|303328393|ref|ZP_07358830.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861387|gb|EFL84324.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 567 Score = 329 bits (843), Expect = 8e-88, Method: Composition-based stats. Identities = 112/521 (21%), Positives = 193/521 (37%), Gaps = 46/521 (8%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNA-------------QLRMWDTTGSEA 53 K + R+ L +R ++L P + D+TG A Sbjct: 6 KKLHQRWEMLVEKRRPWISTWKDLAALYLPTGYRDADDGNARGGKNLLNPEVVDSTGIYA 65 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 L++ + +T P + W GL R W D+V + + Sbjct: 66 LRTLAAGMQGGMTSPARPWFGLRLEGGDSGDGGIT-------ARAWIDEVVERMRTI--L 116 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 S F G + Y + FGT C + E+ G + + V+ VD Sbjct: 117 HTSNFYGVIYQAYAQLAAFGTACVF------ERADMSGFTFDCCQAGTFVLDVDAGGRVD 170 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALAR--NENERFTIIHAVYPKSLTDKKKDKGN 231 +V R+ T Q+ ++G+ L +K++L N R + HAVYP+ +++ N Sbjct: 171 TVMRKIWLTARQMAQEFGEDALPDMVKTSLNNASMGNVRHAVFHAVYPRREPGLRRETIN 230 Query: 232 ---KGFHSKFV-----SVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 + F S + E +FP+ R+ V + ++YG SPAM+ +P R Sbjct: 231 GARRPFASVYWMRGMSGAGGYHPLRESGFDSFPFFGVRWNVLSGDVYGTSPAMDTMPDCR 290 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP 343 L + + + PP +E + DL PG +N ++ + PV P Sbjct: 291 
MLQQMAKTTLKGVHKMVDPPVNVAAELQSVGVDLTPGGVNYVSMMGNNGAAVTPVLKVQP 350 Query: 344 L--PYHEELNRLKESIRSLFLLDLFQVLDDKASRS--AAESMEKTREKGAFVGPLIGGLQ 399 + ++++ I+ DLF++L R A E + EK +GP++ L Sbjct: 351 DVAAAQAMIQQVQQQIKEGLYNDLFRMLLGTNRRQITATEVDAREAEKMILIGPVLERLH 410 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVN 459 E +I R ++D LP LKVE+ S L + Q+ S Q + Sbjct: 411 DELFIPLIDRTFALMDKFNALPPVPEELAGRG--LKVEFISTLAQAQKLVSTGGIQQLLA 468 Query: 460 TVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDT 500 + DPS +D ++ DR+ A ++R Sbjct: 469 FIGGAAQV--DPSVLDALNGDRLVDKYNEYLGVDAGVLRPQ 507 >gi|332875224|ref|ZP_08443057.1| hypothetical protein HMPREF0022_02690 [Acinetobacter baumannii 6014059] gi|332736668|gb|EGJ67662.1| hypothetical protein HMPREF0022_02690 [Acinetobacter baumannii 6014059] Length = 547 Score = 327 bits (837), Expect = 4e-87, Method: Composition-based stats. Identities = 106/527 (20%), Positives = 204/527 (38%), Gaps = 39/527 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKN-------------NAQLRMWD 47 M++ A+ + R + LK R L E + P + + + D Sbjct: 1 MSELVAR-LCKRLSELKAARNRLEPHWSECYRYAAPERQQSFIGDDVTDTRKTQRAELLD 59 Query: 48 TTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTL 107 +T SEA L S + S TP W + + A + +W D+V Sbjct: 60 STLSEATQLLVSSIISGTTPANALWFKAVPN-------GVDDPAELTEGEKWLDEVCQ-- 110 Query: 108 FGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS-V 166 F +R + + + V G G Y + D + G + + + Y++ Sbjct: 111 FIWRNIHGANYDSEIFDLVLDCVVAGWGVMYADVD---RHAGGGYVFQTWDIGQCYLAST 167 Query: 167 NHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK 226 VD++YRE+ T+ +V+++G+ +S K+++ + + ++ V P+ K Sbjct: 168 RQDQKVDTLYREYEMTMAALVNEYGENKVSEKVRNTYKSKPDCKVKVLWVVEPRKTGYIK 227 Query: 227 KDK----GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 D+ F S V VDE E FP+++ R+R + +YG ALP Sbjct: 228 GDRQLMPKEMPFASYHVEVDEKIVLRETGYNEFPFVIPRFRKIPNSVYGTGQVSIALPDA 287 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 + N+ + + + +S V + ++ G I ++ + + + G Sbjct: 288 
KTANKLMRDTLRSAEISTLGMYAGVDDGTFNPRTVRLGGGKIIVVN--DVNSLKRIDDGK 345 Query: 343 PLPYHEE-LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 + L L+ +IR + D Q D + +A E + +GPL G Q+E Sbjct: 346 GYQVGVDLLAHLQGAIRKKMMADQLQ-PADGPAMTATEVHVRVDLIRQQLGPLYGRWQAE 404 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + ++ R + G + E K + S L + QQ E V + + + + Sbjct: 405 LLTPLLERTFGLAYRAGVIGEAPEEMQGRNLSFK--FISALARSQQLEEVTAIERFLAGM 462 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQ 508 + DPS +D++D D V++ S P ++R +++ IR+ Sbjct: 463 SNVA--QIDPSILDNVDMDAVAQVSGMGLGVPTAILRTQDQIDAIRK 507 >gi|293609619|ref|ZP_06691921.1| predicted protein [Acinetobacter sp. SH024] gi|292828071|gb|EFF86434.1| predicted protein [Acinetobacter sp. SH024] Length = 547 Score = 326 bits (836), Expect = 5e-87, Method: Composition-based stats. Identities = 106/527 (20%), Positives = 204/527 (38%), Gaps = 39/527 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKN-------------NAQLRMWD 47 M++ A+ + R + LK R L E + P + + + D Sbjct: 1 MSELVAR-LCKRLSELKAARNRLEPHWSECYRYAAPERQQSFIGDDVTDTRKTQRAELLD 59 Query: 48 TTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTL 107 +T SEA L S + S TP W + + A + +W D+V Sbjct: 60 STLSEATQLLVSSIISGTTPANALWFKAVPN-------GVDDPAELTEGEKWLDEVCQ-- 110 Query: 108 FGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS-V 166 F +R + + + V G G Y + D + G + + + Y++ Sbjct: 111 FIWRNIHGANYDSEIFDLVLDCVVAGWGVMYADVD---RHAGGGYVFQTWDIGQCYLAST 167 Query: 167 NHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK 226 VD++YRE+ T+ +V+++G+ +S K+++ + + ++ V P+ K Sbjct: 168 RQDQKVDTLYREYEMTMAALVNEYGENKVSEKVRNTYKSKPDCKVKVLWVVEPRKTGYIK 227 Query: 227 KDK----GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 D+ F S V VDE E FP+++ R+R + +YG ALP Sbjct: 228 GDRQLMPKEMPFASYHVEVDEKNVLRETGYNEFPFVIPRFRKIPNSVYGTGQVSIALPDA 287 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 + N+ + + + +S V + ++ G I ++ + + + G Sbjct: 288 
KTANKLMRDTLRSAEISTLGMYAGVDDGTFNPRTVRLGGGKIIVVN--DVNSLKRIDDGK 345 Query: 343 PLPYHEE-LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 + L L+ +IR + D Q D + +A E + +GPL G Q+E Sbjct: 346 GYQVGVDLLAHLQGAIRKKMMADQLQ-PADGPAMTATEVHVRVDLIRQQLGPLYGRWQAE 404 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + ++ R + G + E K + S L + QQ E V + + + + Sbjct: 405 LLTPLLERTFGLAYRAGVIGEAPEEMQGRNLSFK--FISALARSQQLEEVTAIERFLAGM 462 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQ 508 + DPS +D++D D V++ S P ++R +++ IR+ Sbjct: 463 SNVA--QIDPSILDNVDMDAVAQVSGMGLGVPTAILRTQDQIDAIRK 507 >gi|169795385|ref|YP_001713178.1| putative phage related protein [Acinetobacter baumannii AYE] gi|169148312|emb|CAM86177.1| conserved hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 547 Score = 325 bits (833), Expect = 1e-86, Method: Composition-based stats. Identities = 106/527 (20%), Positives = 202/527 (38%), Gaps = 39/527 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKN-------------NAQLRMWD 47 M++ A+ + R + LK R L E + P + + + D Sbjct: 1 MSELVAR-LCKRLSELKAARNRLEPHWSECYRYAAPERQQSFIGDDVTDTRKTQRAELLD 59 Query: 48 TTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTL 107 +T SEA L S + S TP W + + A +W D+V Sbjct: 60 STLSEATQLLVSSIISGTTPANALWFKAVPN-------GVDDPAELTDGEKWLDEVCQ-- 110 Query: 108 FGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS-V 166 F +R + + + V G G Y + D + G + + + Y++ Sbjct: 111 FIWRNIHGANYDSEIFDLVLDCVVAGWGVMYADVD---RHAGGGYVFQTWDIGQCYLAST 167 Query: 167 NHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK 226 VD++YRE+ T+ +V+++G+ +S K+++ + + ++ V P+ K Sbjct: 168 RQDQKVDTLYREYEMTMAALVNEYGENKVSEKVRNTYKSKPDCKVKVLWVVEPRKTGYIK 227 Query: 227 KDK----GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 D+ F S V VDE E FP+++ R+R +YG ALP Sbjct: 228 GDRQLMPKEMPFASYHVEVDEKIILRETGYNEFPFVIPRFRKIPHSVYGTGQVSIALPDA 287 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 + N+ + 
+ + +S V + ++ G I ++ + + + G Sbjct: 288 KTANKLMRDTLRSAEISTLGMYAGVDDGTFNPRTVRLGGGKIIVVN--DVNSLKRIDDGK 345 Query: 343 PLPYHEE-LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 + L L+ +IR + D Q D + +A E + +GPL G Q+E Sbjct: 346 GYQVGVDLLAHLQGAIRKKMMADQLQ-PADGPAMTATEVHVRVDLIRQQLGPLYGRWQAE 404 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + ++ R + G + E K + S L + QQ E V + + + + Sbjct: 405 LLTPLLERTFGLAYRAGVIGEAPEEMQGRNLSFK--FISALARSQQLEEVTAIERFLQGL 462 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQ 508 + DPS +D++D D V++ S P ++R +++ IR+ Sbjct: 463 SSVAEL--DPSILDNVDMDAVAQVSGMGLGVPTAILRTQDQIDAIRK 507 >gi|18640510|ref|NP_570351.1| head-tail connector protein [Synechococcus phage P60] gi|18478740|gb|AAL73289.1| head-tail connector protein [Synechococcus phage P60] Length = 555 Score = 325 bits (832), Expect = 2e-86, Method: Composition-based stats. Identities = 76/542 (14%), Positives = 164/542 (30%), Gaps = 45/542 (8%) Query: 8 DIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSS 59 Q ++ L+ R + + PY + W + GS+ L+S Sbjct: 4 SAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQGGYLPTPWQSVGSKGVNVLAS 63 Query: 60 LLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFV 119 L + P + L + + E ARS ++ ++ + ++ + S Sbjct: 64 KLMLSLFPVNTSFFKLQINDAEIDNLGMDEQARS-EIDLSLSRIERIVT--QDIAESSDR 120 Query: 120 GCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREF 179 L+ ++ G Y PL +S + + V + E Sbjct: 121 VHLEMAMKHLIVTGNALLYQGKK----------NLKLYPLDRFVVSRDGEGNVMEIVTEE 170 Query: 180 TFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKS---------LTDKKKDKG 230 + ++ + A E+ + A + T + G Sbjct: 171 QIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYVCRKDG 230 Query: 231 NKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVN 290 +H + P+I R+ + E YGR E + ++ L Sbjct: 231 QVKWHQECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQ 290 Query: 291 ELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEEL 350 + + S + A + +L GA+ + VQ + L Sbjct: 291 
AMVEGSAASAKVVFMVSPSATTKPQNLALAA--NGAIIQGRPDDVSVVQANKAADFRTVL 348 Query: 351 NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRE 410 +++ + + L + +A E +E +G + L +E + ++R+ Sbjct: 349 EMIQKLEQRISDAFLMLQVRQSERTTATEVQATVQELNEQIGGIYSNLTTELLQPYLARK 408 Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGD 470 L +L Q LP+ P + + + Q + Q + + L G Sbjct: 409 LHLLQKQRKLPQLPKDLVQPTVVAGLWGV---GRGQDKQ------QLMEFITTLAQTMGP 459 Query: 471 PSCMDHMDTDRVSRFSLWATNTPA-VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQT 529 M +++ + A LI ++ + + Q++ M + L Q Q Sbjct: 460 EIAMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQL---GDQQKQDMVQASLINQAGQL 516 Query: 530 SQ 531 ++ Sbjct: 517 AK 518 >gi|225158777|ref|ZP_03725094.1| hypothetical protein ObacDRAFT_8203 [Opitutaceae bacterium TAV2] gi|224802612|gb|EEG20867.1| hypothetical protein ObacDRAFT_8203 [Opitutaceae bacterium TAV2] Length = 562 Score = 324 bits (830), Expect = 3e-86, Method: Composition-based stats. Identities = 114/566 (20%), Positives = 218/566 (38%), Gaps = 42/566 (7%) Query: 4 RSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN------------AQLRMWDTTGS 51 + A+D+ R+ +++ + ++ P K + ++D+T + Sbjct: 8 KLAEDLIGRYEAGLSRQANWRSRWHDAARYILPSKGDILSMGDKHGGEAQTTDIYDSTAN 67 Query: 52 EACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFR 111 E+ + ++ L S + P G+ W + S V EW D T Sbjct: 68 ESALVYAAGLLSSLVPAGELWFRFSARP-----------GASAPVVEWFDDCTHR--AAA 114 Query: 112 ERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGI-RYISVPLSNVYMSVNHQN 170 S F + + + F + E +G G+ + +VP+ + + + Sbjct: 115 ALHASNFYLGIHEDFMDMAGFSIASLFCEEGAALRGQRGGLLNFTNVPVGTFVIEEDAEG 174 Query: 171 VVDSVYREFTFTVDQIVSKWGDKVLS----SKMKSALARNENERFTIIHAVYPKSLTDKK 226 +VD+V+REF FT Q KWG+ LS + S A + ++RF IIHAVYP+ + Sbjct: 175 LVDTVFREFRFTARQCAQKWGEDKLSKPMLDALNSKTASDRDKRFQIIHAVYPRRDGKQG 234 Query: 227 KDKGNK-GFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRL 285 G K S +V EE P V R +EIYGR P + +P I+ + Sbjct: 235 PGIGKKRPIASVYVDKQAIHVIEEGGFYEMPIAVARLLRGNNEIYGRGPGDQVMPEIKLV 294 Query: 286 
NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLP 345 N +L ++PP +A ++ R D +PG + S + Sbjct: 295 NRMERDLLLSLEQQVNPPWLAPQDSSWRP-DNRPGGVFYWDASNPNNKPERLRDTARLDI 353 Query: 346 YHEELNRLKESIRSLFLLDLFQVLDDKASR----SAAESMEKTREKGAFVGPLIGGLQSE 401 + LN +E IR + +D+F++L + + +A E + +EK P+ + E Sbjct: 354 GDKVLNDKREVIRRAWFVDMFKMLSNPDAMKRDKTAFEVAQLMQEKLVLFHPMFARITQE 413 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + ++ R +IL G +++Y S + +A + Q ++ + Sbjct: 414 KLNPVLERVFNILMRAGIFAPPP-MAEGESLEYEIDYVSKIALAIKAAQNGALAQMMDLI 472 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQH 521 + T DP+ ++ + +R + P EV ++ Q + + + + Sbjct: 473 GGMA--TFDPTVALVINWKKAARGVARNSGLPQEWQNSEEEVAEMMQAQAQANQAAQLEQ 530 Query: 522 L---QQQLQQTSQDIGAKAAGRAMEK 544 + Q +Q +G +A A + Sbjct: 531 MASAANQAAGAAQKLGPQAQQAATDA 556 >gi|221201497|ref|ZP_03574536.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] gi|221207947|ref|ZP_03580953.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221172132|gb|EEE04573.1| conserved hypothetical protein [Burkholderia multivorans CGD2] gi|221178765|gb|EEE11173.1| conserved hypothetical protein [Burkholderia multivorans CGD2M] Length = 549 Score = 320 bits (819), Expect = 5e-85, Method: Composition-based stats. 
Identities = 139/546 (25%), Positives = 238/546 (43%), Gaps = 31/546 (5%) Query: 4 RSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN--------------AQLRMWDTT 49 + + + +K +R ++ FL P + RM+D+T Sbjct: 7 KLLEALNADHGRMKEKRQSYEATWNDVIDFLMPRLDKFGQLPRPDSEKGRERSQRMFDST 66 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 A + + S+ITP Q WH L S + V+ + +V LF Sbjct: 67 APLALRNFVAAMDSMITPATQLWHRLKASNDVL--------NENAAVKAYLQEVVRVLFA 118 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 R R + GFV + + Y SV FG G +E DV + GI Y +VP+ ++ + N+ Sbjct: 119 VRYRWQGGFVTQMGATYQSVGLFGPGALMIEHDVGQ-----GIVYRNVPMQRLWFAENNA 173 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK-D 228 ++D + ++ T+ Q ++G + LS M+SAL R+ + H V P++ D +K D Sbjct: 174 GIIDKTHVQWELTLRQAAQRFGRENLSPSMQSALERDPEKSAIFYHIVEPRADRDPRKLD 233 Query: 229 KGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 N F S ++ +R + TFP+ +GR+ V + YG SPA +A+P R +N+ Sbjct: 234 GRNMRFGSYWLDEGRDRIIQNSGFRTFPFAIGRFYVGTGDAYGGSPACDAMPDTRMVNDM 293 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE 348 + + + PP + + FDL+ G +N G L +G + +P+ G Sbjct: 294 AKTNIRGAQKLVDPPLLVSEDGSLEGFDLRSGSLNWGGLDEKGNEMVKPLLMGKQAQIGI 353 Query: 349 EL-NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 E +++I F + LFQ+L D +A E +++ +EKG + P +G QSE +G +I Sbjct: 354 EFTQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGPLI 413 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 RELDIL LPE + +++EY SPL K +A A+ LQ + + + Sbjct: 414 ERELDILAEAAQLPEMPRELINAGANVEIEYDSPLNKAMRAGESAATLQWLQQLSVVA-- 471 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQ 527 D M + R++R A P + E++ +V + Sbjct: 472 QFDLRAMKAPNGLRIARMLADAGGVPVEAMNTDEELQAQEAAEAQAMQVQQALAAAPVAA 531 Query: 528 QTSQDI 533 +D+ Sbjct: 532 GAIKDL 537 >gi|294648400|ref|ZP_06725899.1| phage protein [Acinetobacter haemolyticus ATCC 19194] gi|292825705|gb|EFF84409.1| phage protein [Acinetobacter 
haemolyticus ATCC 19194] Length = 558 Score = 318 bits (815), Expect = 1e-84, Method: Composition-based stats. Identities = 116/574 (20%), Positives = 225/574 (39%), Gaps = 44/574 (7%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKN----------------NAQLRMWDTT 49 A+ + R + LK+ R + ++ + P + A+ ++DTT Sbjct: 3 AQQLLKRLSQLKSDRIKHEAHWKDCYKYCAPERQQSFADASATALEQERKQARTDLFDTT 62 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 E L S + S T P W S ++ + +W QV LF Sbjct: 63 SVEGIQLLVSSIVSGTTSPVSIWFKSVPS-------GVDTPSQLTEGEQWLSQVDQFLF- 114 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYM-SVNH 168 R S F + F T +V G Y D + G + + + N Y+ S Sbjct: 115 -RNIHASNFDSEVTDFLTDLVVAGWAVLY----ADTNREKGGFTFNTWSIGNCYISSTQA 169 Query: 169 QNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLT----- 223 ++D++YREF + +QIVS++G +S K+++AL + +++FT++ A++P+ Sbjct: 170 NGLIDTIYREFELSAEQIVSEFGIDNVSDKVRTALEKKPDQKFTLVQAIFPRDSKLIKGE 229 Query: 224 DKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 + K+ + F S + +E FP +V R++ D YG + + Sbjct: 230 EGKRVSTSMPFASYTIEAQSKHILKESGFEEFPCVVSRFKKIPDSHYGLGMGSMVISDAK 289 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP 343 N+ + Q L+L IA ++ L+ I A + + + G+ Sbjct: 290 TANQIMKLSLQTAELNLGGLWIAQNDGNINPHTLRIRPNAIIAANT--VDSIKRLDTGSA 347 Query: 344 LP--YHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 + L + I+ + D + +A E + + +G + +QSE Sbjct: 348 SVGLGLDFLQHFQAKIKRTLMSDQL-TPQGSSPLTATEIQARVQVYRNQLGSIFSRMQSE 406 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 ++ ++ R + G LP + + +P+ Q+ E V + + V Sbjct: 407 YLQVLLERTWGLAMRSGVLPPAPEELMQASR-ISFNFINPMAASQKLEWVTAIQNLMLNV 465 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQH 521 ++ D + MD+++ D + + A + P IR E+ ++RQ ++ Q++ M+EQ Sbjct: 466 SQMA--QIDQTVMDNLNLDAMVQVMADALSVPVEAIRTDEEIAELRQAKQEQQQAMQEQQ 523 Query: 522 LQQQLQQTSQDIGAKAAGRAMEKKLTHDMMENSY 555 QQ L G A + K +T D + + Sbjct: 524 
QQQALMSQVGQTGLDIA-KDQAKNMTPDQLGAMF 556 >gi|48696640|ref|YP_024419.1| hypothetical protein VP2p04 [Vibrio phage VP2] gi|48696684|ref|YP_024978.1| hypothetical protein VP5_gp03 [Vibrio phage VP5] gi|40806147|gb|AAR92065.1| hypothetical protein [Vibrio phage VP5] gi|40950038|gb|AAR97629.1| hypothetical protein [Vibrio phage VP2] Length = 547 Score = 316 bits (809), Expect = 6e-84, Method: Composition-based stats. Identities = 108/558 (19%), Positives = 202/558 (36%), Gaps = 44/558 (7%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN--------------AQLRMWDTTGSE 52 I R ++LK R + + + ++ P +++ ++D+T + Sbjct: 4 SKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGD 63 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 LSS L +T P KW LA F KE + R+W + T ++ Sbjct: 64 GLETLSSSLHGSLTSPATKWFELA--------FRDKELNSDDECRKWLENATHDVYS--A 113 Query: 113 RSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV 172 S F Y + +G + +++ E + + S P+ + Y + + V Sbjct: 114 LQDSNFNLEANETYIDLCGYGNAIMV---EEEDEDEEGSVVFQSSPIQDSYFEEDSRGQV 170 Query: 173 DSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGN- 231 + YR F +T QI ++GD+ + N+ V KK N Sbjct: 171 VNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNA 230 Query: 232 --------KGFHSKFVSVDENRFF-EEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTI 282 + F K++ + EE P R+R A +G P+ ALP + Sbjct: 231 GTVLAPTERPFGKKWILKEGAVQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDV 290 Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 N V + + + P + + DL + + + Sbjct: 291 LTANRYVELVLRSSEKVIDPAIMVTERGLISDIDLGASGLTVVRDMESMKPFESR---AR 347 Query: 343 PLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEF 402 +L L+ ++R ++ +D Q+ D + +A E + +GP +G L+++F Sbjct: 348 FDVSSIQLTDLRSAVRRIYYVDQLQM-KDSPAMTATEVQVRYELMQRLLGPTLGRLENDF 406 Query: 403 IGAMISRELDILDSQGNLPECE-GADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + MI R +I G L E + + + YT PL + Q+ + AS + + Sbjct: 407 LSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGST 466 Query: 462 
VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQH 521 +L +P +D D D + R P L+R A+V IR+ R ++ E+ Sbjct: 467 AQLAE--INPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQTQQKAEQAA 524 Query: 522 LQQQLQQTSQDIGAKAAG 539 + + + G A Sbjct: 525 IAEAEGNAMEAQGKGQAA 542 >gi|42526662|ref|NP_971760.1| head-to-tail joining protein, putative [Treponema denticola ATCC 35405] gi|41816855|gb|AAS11641.1| head-to-tail joining protein, putative [Treponema denticola ATCC 35405] Length = 560 Score = 312 bits (799), Expect = 9e-83, Method: Composition-based stats. Identities = 123/531 (23%), Positives = 227/531 (42%), Gaps = 48/531 (9%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFL---------------YPYKNNAQLRMW 46 ++ DI+ F+ LK++R +++ ++ P ++ + + Sbjct: 7 SKELLDDIKGLFDILKDKRSMHEAEWQDVCTYIGSNVFDWSENKEEIKRPKRHTGRPSEY 66 Query: 47 DTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDT 106 KL S L P W L+ + + + V++W +Q Sbjct: 67 -------LKKLVSGLMGYTISPNVTWLKLSLNNTEMLEY--------AGVKDWLEQSEKA 111 Query: 107 LFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSV 166 L+ E +R+ + F ++ FG G ++ E IR++++ +Y++ Sbjct: 112 LYE--EFNRNNLYSQVSLFISNAASFGHGVMLIDE-----KKENSIRFLTIAEPEIYIAE 164 Query: 167 NHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALA--RNENERFTIIHAVYPKSLTD 224 N +D+V+R F+ TV I++++G++ +S ++K+ + +N+ I+HAV P+ D Sbjct: 165 NEYGDIDTVFRYFSMTVKNIIARFGEENVSEQIKNDAKDIKGKNKEIKILHAVLPRDDYD 224 Query: 225 K-KKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIR 283 + K D N F S ++ +D N EE PY V + YG SPA EA+P +R Sbjct: 225 ESKLDGKNMEFASYYIDMDNNTILEESGYYELPYSVFIWEKETSSAYGGSPAREAIPDMR 284 Query: 284 RLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP 343 LN+ + +L PP + + P N + P+ G Sbjct: 285 LLNKVEEARLKLAQLVSEPPMNVPDSMRGFE-SVVPAGYNYYERPDM---IMTPINIGAN 340 Query: 344 LPYH-EELNRLKESIRSLFLLDLFQVLD-DKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 P E + ++ +R F +D +L A ++A E +E EK A + LI Q++ Sbjct: 341 FPITLETIQDIESRLRDKFHVDFMLMLQAQTAQKTATEVIELQGEKSALLSSLIVN-QNK 399 Query: 402 
FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + ++ R L+I+ QG PE N ++L V++ PL + Q+ +Q + Sbjct: 400 ALSEIVIRTLNIMYRQGRFPEPPNILNGSDAVLNVDFVGPLAQAQKRYHQTGGVQTSLAI 459 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREV 512 + +P +D++DTD++ + L P IR+ EVE IRQQR Sbjct: 460 SQPI-IQMNPEVLDYIDTDKLLKNVLDTNGFPQSAIREDDEVEKIRQQRAE 509 >gi|282848877|ref|ZP_06258267.1| hypothetical protein HMPREF1035_1386 [Veillonella parvula ATCC 17745] gi|282581382|gb|EFB86775.1| hypothetical protein HMPREF1035_1386 [Veillonella parvula ATCC 17745] Length = 575 Score = 307 bits (787), Expect = 3e-81, Method: Composition-based stats. Identities = 110/514 (21%), Positives = 201/514 (39%), Gaps = 40/514 (7%) Query: 8 DIQDRFNYLKNQRGELNYWMEELTGFLYP----------YKNNAQLRMWDTTGSEACIKL 57 ++ +F+ L N + + L + P ++ + E+C Sbjct: 25 KLRKKFSQLFNAQQRYVNKWKHLRDYQLPFIGQFDGEEDQSEPYNGKILNPVAWESCQIF 84 Query: 58 SSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSG 117 +S + S +TPP +KW L + A + +V E D+ + L+ ++S Sbjct: 85 ASGVMSGLTPPSRKWFKLTMEN--------IDVAANSQVAELLDEREEILYAV--LAKSN 134 Query: 118 FVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYR 177 F + Y + G + AD E G+R+ S P+ +S N + +V+ R Sbjct: 135 FYSVVHQVYMELP-MGQAPMGIFAD-----SESGVRFTSYPIGTYAISTNSKEIVNIFGR 188 Query: 178 EFTFTVDQIVSKWGDKVLSSKMKSALARNE--NERFTIIHAVYPKSLTDKKKDKGNKGFH 235 ++ TVDQIV ++G + +K+ + FT+ V P K + N + Sbjct: 189 KYKMTVDQIVEQFGYENCPDNIKNIYDNGNSLQQSFTVNWLVEPNKDRKDKLGRRNMPYS 248 Query: 236 SKFVSVDE--NRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 S + + +P + R+ YG+ A A P + L + + Sbjct: 249 SIYWVEGSNSDEVLYHGGFEEWPIPIARHTSMDLNGYGKGAAWFAQPDSQMLQKLEFDYL 308 Query: 294 QFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRL 353 L + PP A S+ +L PG + + +F N ++ Sbjct: 309 TAVELGVKPPMQAPSD-VISTVNLYPGGITEIEGQHKVEPMFAV--QSNLQDIQNKIAVT 365 Query: 354 KESIRSLFLLDLFQVLDDKASRSAA--ESMEKTREKGAFVGPLIGGLQSEFIGAMISREL 411 ++SI+ + DLF +LD E ME+T+EK +GP++ L SEF+ +I R Sbjct: 366 
EDSIKRAYSADLFLMLDQIDKGQMTAREVMERTQEKLQQLGPVVERLLSEFLNPIIERVY 425 Query: 412 DILDSQGNLPECEGAD---NPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKT 468 +LD G P E + +K+EY SPL + Q+ S+ + Q ++ L Sbjct: 426 AVLDRAGVFPPVEDEELLDQLNGQEVKIEYISPLAQAQKMSSLVNIEQYFAFIMSLAQA- 484 Query: 469 GDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAE 502 +P+ ++ + + + PA +IR E Sbjct: 485 -NPNIVNKFNFEEAANTYGVNLGVPAKIIRSDDE 517 >gi|317120721|gb|ADV02543.1| putative phage-related head-to-tail joining protein [Liberibacter phage SC2] gi|317120782|gb|ADV02603.1| putative phage-related head-to-tail joining protein [Candidatus Liberibacter asiaticus] Length = 539 Score = 305 bits (782), Expect = 9e-81, Method: Composition-based stats. Identities = 205/543 (37%), Positives = 297/543 (54%), Gaps = 24/543 (4%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN--AQLRMWDTTGSEACIKLSS 59 N+ K + RF LK QR E+ +E+ + PY+ ++WDTT + A KL+S Sbjct: 14 NKEFIKKLIARFESLKAQRSEIEPIRQEIIDLVCPYRGKASEDKKIWDTTATSASDKLAS 73 Query: 60 LLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFV 119 LL +LITP G +WHGL +F ++ + +RE CD LF RE SGF Sbjct: 74 LLHNLITPFGSRWHGLVAPDPQSGSFFASQENKL--IREQCDHFVMELFAQRELPASGFN 131 Query: 120 GCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREF 179 CL+ FYT VV FG GCFY+ G+RYISVP+S++ S NH+NVVD+V+ EF Sbjct: 132 LCLKDFYTEVVLFGMGCFYVSEREG-----GGLRYISVPVSSIVCSANHENVVDTVFEEF 186 Query: 180 TFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFV 239 + T + + KWG LS KMK L R++ +++ AV+P D +G+ V Sbjct: 187 SLTPENVAKKWGYDALSDKMKEDLDRSDPQKYEFFQAVFPDKEDD------YEGYKKVIV 240 Query: 240 SVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLS 299 S+DENR EE PYIVGRY +G SP +ALP+IRRLN ++ + + Sbjct: 241 SIDENRIIEEGYHRVMPYIVGRYEASPSNPFGYSPTHKALPSIRRLNALSASVSLYSEKA 300 Query: 300 LHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPL-PYHEELNRLKESIR 358 L+P + + + + F KP +N G + R+GR P G+ P HEE+ RL+ IR Sbjct: 301 LNPAVLTSEDTRGKTFSTKPKTVNHGWMDRQGRPRAVPFFTGSDARPSHEEMQRLQMQIR 360 Query: 359 
SLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQG 418 L+LLDLFQVL D+ASRSA ESMEKT EKG F+ ++GGLQ+EF+G+M+ RE+DIL Sbjct: 361 ELYLLDLFQVLADRASRSATESMEKTLEKGIFISAIVGGLQAEFVGSMVKREIDILYQDQ 420 Query: 419 NLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMD 478 G LKV YTSPL+KYQ+AE + +QG+ E+ TGDP+ + + Sbjct: 421 ------GDIRGLGKDLKVSYTSPLYKYQKAEELNGIVQGIRVNAEIASMTGDPTPLMMFN 474 Query: 479 TDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAA 538 +++ + P VL+ + + +++ Q + + L + ++ + GA A Sbjct: 475 PYLCGKYAADGSGVPEVLVLSEEDTKQKLIEKQKQAEASQMKQLTME--ESIKTGGAIAQ 532 Query: 539 GRA 541 RA Sbjct: 533 DRA 535 >gi|291334466|gb|ADD94120.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1161] Length = 330 Score = 303 bits (775), Expect = 6e-80, Method: Composition-based stats. Identities = 84/336 (25%), Positives = 157/336 (46%), Gaps = 29/336 (8%) Query: 1 MNQR-SAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLR----------MWDTT 49 M Q AK++ R++ LK+QR +E+ ++ P K + ++D + Sbjct: 1 MAQTDKAKNLLKRYDRLKSQRQNWESHWQEVADYMQPRKADVTKTRSKGDKRTELIFDGS 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 ++ L++ L ++T P W L F ++ + + W + TD ++ Sbjct: 61 PLQSVELLAASLHGMLTNPSTPWFTLR--------FKDEDIDNEDEAKLWLEASTDAMYT 112 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 +RS F + Y ++ FGT ++E D E+ I++ + ++ V+++ N + Sbjct: 113 --AFNRSNFQQEIFELYHDLITFGTAAMFIEED-----DEDIIKFSTRHINEVFIAENDK 165 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK-KKD 228 +D+V+R+F+ + ++ K+GD +S + + ++ E I+HAVYP+S D K+D Sbjct: 166 GRIDTVFRKFSLSARAVMQKFGD--VSINIATKAKKDPYEEVEIMHAVYPRSDFDPRKQD 223 Query: 229 KGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 K N F S ++ + FP++V RY + EIYGRSPAM ALP ++ LNE Sbjct: 224 KENMPFESVYLDAESGDELSVSGFREFPFVVPRYLKASHEIYGRSPAMTALPDVKMLNEM 283 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNI 324 + + + PP + + PG +N Sbjct: 284 SKTTIKSAQKQVDPPLLVPDDGFMLPVRTIPGGLNF 319 >gi|290968647|ref|ZP_06560185.1| 
hypothetical protein HMPREF0889_0287 [Megasphaera genomosp. type_1 str. 28L] gi|290781300|gb|EFD93890.1| hypothetical protein HMPREF0889_0287 [Megasphaera genomosp. type_1 str. 28L] Length = 577 Score = 302 bits (772), Expect = 1e-79, Method: Composition-based stats. Identities = 106/515 (20%), Positives = 208/515 (40%), Gaps = 40/515 (7%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-----------AQLRMWDTTGSEACI 55 + + L Q+ + +++ + PY +++ ++A Sbjct: 27 QSCVKMLDSLFKQQQKYIPLWKDIRNYELPYDGELGDDVIGAPAMHDEEIYNGITAQARD 86 Query: 56 KLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSR 115 ++ + S +TPP +KW A + ++ + V D+ + + G S+ Sbjct: 87 TFAAGIQSGLTPPSRKWFRFAPTDASLDNNID--------VARVLDERCEIMEGV--LSQ 136 Query: 116 SGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSV 175 S F + S Y + FG + AD E+G+ +++ + + + Q +++ Sbjct: 137 SNFYNVIHSAYKELP-FGQSPVGVFAD------EKGVYFVNYTIGTYALGADGQGRINTF 189 Query: 176 YREFTFTVDQIVSKWGDKVLSSKMKS--ALARNENERFTIIHAVYPKSLTDKKKDKGNKG 233 R+ + QIVS +GD V++ ++ + +T+ VYP + Sbjct: 190 ARKVKMSAAQIVSLYGDSVVTDSVREAVKANGGHEDYYTVCWLVYPNPKAKPTGGNHDMK 249 Query: 234 FHSKFVSVDEN--RFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNE 291 F S + K + V RY V+ + YG PA +ALP R L + + Sbjct: 250 FLSVHWLEGSDPNSLLAAKGFEEWAIPVARYNVKGIDAYGIGPAWDALPESRMLQKMEYD 309 Query: 292 LAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELN 351 A LS+ PP + Q +L PG + + ++ Sbjct: 310 GAIALELSIKPP-LVGPAELQGRINLFPGAYTPSINPNDNVHSIYSGGL-DLNSLQAKIT 367 Query: 352 RLKESIRSLFLLDLFQVLD--DKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISR 409 ++++ I+ ++ DLF +L+ ++ +A E M + +EK A +GP+I LQ+EF+ +I R Sbjct: 368 QIEDRIKRIYSTDLFLMLNELNRGQMTAQEVMARNQEKMAQLGPVIERLQNEFLSDIIER 427 Query: 410 ELDILDSQGNLPECEGADNP--PVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 ++L+ P +K+EY SPL + Q+ + + QGV+ V +L Sbjct: 428 VYNLLERNQVFPPLPDDVQQTLQGQEIKIEYLSPLAQAQKMSGLTAIEQGVSFVGQLAQL 487 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAE 502 DP+ + ++ D+ L P+ +IR E Sbjct: 488 --DPNVILRVNFDKAVENYLDKLGVPSTMIRTEDE 520 
>gi|46580131|ref|YP_010939.1| hypothetical protein DVU1721 [Desulfovibrio vulgaris str. Hildenborough] gi|46449547|gb|AAS96198.1| hypothetical protein DVU_1721 [Desulfovibrio vulgaris str. Hildenborough] gi|311233876|gb|ADP86730.1| hypothetical protein Deval_1575 [Desulfovibrio vulgaris RCH1] Length = 550 Score = 302 bits (772), Expect = 1e-79, Method: Composition-based stats. Identities = 101/570 (17%), Positives = 204/570 (35%), Gaps = 45/570 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK------------NNAQLRMWDT 48 M K++ + +++ R +++ +L P + + + + Sbjct: 1 MRSALLKELSEVAEHVEGLRKRREAQWRDISEWLMPMRGIYEGQDGADVIASRGKGLLNR 60 Query: 49 TGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLF 108 G+ A ++ ++ +TP W + R W D V ++ Sbjct: 61 EGTRALKVAATGMTGGMTPAALPWFRWSLRDDV--------QNERTGARAWLDTVEASIN 112 Query: 109 GFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNH 168 GF + + + FG + + R+ S + ++++ Sbjct: 113 SV--LRACGFYQAIHACNMEFLAFGPLLLF-----QDNSQGALCRFESCTVGTWAVALDA 165 Query: 169 QNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALA-RNENERFTIIHAVYPKSLT-DKK 226 +D+V R T Q+ ++G L+ L +ER ++H V P++ + Sbjct: 166 DGGLDTVVRRLKLTARQMEQRFGRDRLTPATVKLLETNKGHERVEVVHVVRPRTERQHGR 225 Query: 227 KDKGNKGFHSKFVSVDE-NRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRL 285 D N F S + E PY Y ++YG +P + LP +++L Sbjct: 226 IDARNMPFASYMYEATGADDVLSESGYHEMPYFFAAYD-DTLDLYGSAPGDDCLPDVKQL 284 Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN--P 343 E + + ++PPT + ++ ++ PG N A+S P+ Sbjct: 285 QELEKQKLVGLQKVINPPTRKPAS-FKQRLNVNPGGEN--AVSGGDPHGIGPLYEVRIDL 341 Query: 344 LPYHEELNRLKESIRSLFLLDLFQVLD---DKASRSAAESMEKTREKGAFVGPLIGGLQS 400 EE+ + + IR + F + + E +E+ RE+ +GP + ++ Sbjct: 342 NQVREEIATVVDRIRQTTMASYFADMPLELRPKDMTYGEYLERKRERLQLMGPSLEAYEA 401 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 + + +I R +LD G LP A V+++ + Y SPL + + S + Sbjct: 402 KVLTPVIFRTFALLDRAGMLPPPPDAL-GEVAVVDISYISPLAQALRQTGAESTRALLMD 460 Query: 461 
VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQ 520 V++L DP +D +D D+ P ++R +V +RQQR+ + + Sbjct: 461 VMQLAEA--DPGVLDKVDMDQAVDELAKGIGAPGRVVRSDEDVAAMRQQRDEAKAREAQA 518 Query: 521 HLQQQLQQTSQDIGAKAAGRAMEKKLTHDM 550 Q + G L HD+ Sbjct: 519 QEAITAMQGLAKVAGTRTGPG---TLAHDL 545 >gi|26989003|ref|NP_744428.1| head-to-tail joining protein [Pseudomonas putida KT2440] gi|24983824|gb|AAN67892.1|AE016421_4 head-to-tail joining protein [Pseudomonas putida KT2440] Length = 524 Score = 296 bits (758), Expect = 6e-78, Method: Composition-based stats. Identities = 72/541 (13%), Positives = 157/541 (29%), Gaps = 43/541 (7%) Query: 11 DRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLS 62 + L R + + + P W + L + L Sbjct: 15 SLYAKLAPDRETFLQRARDCSKYSIPTLIPPAGHASGTKFYTPWQAVAARGVNNLGAKLL 74 Query: 63 SLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCL 122 + PP + L + L V+ ++ + E + Sbjct: 75 MALLPPNSPFFRLEI-DEFTEEKLTSNPQMHADVQAGLAKIERAVQT--EIETTAIRVTG 131 Query: 123 QSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFT 182 ++ G G Y+ + G+++ PL + + V + + + Sbjct: 132 FELLKHLIVGGNGLVYL-------PQQGGMKF--YPLDRYVVRRDPMGNVLDIVVKEEVS 182 Query: 183 VDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVD 242 + + + V R+ N+ +I + K T + + Sbjct: 183 LAVLPEEARSLVEPGDDSGDTPRDHNKNVSIYTHITLKGET--------WNVYQEVKGQI 234 Query: 243 ENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 ++ R+ E YGRS E L I+ L + + S Sbjct: 235 VPGSRGTYPKDKCAWLPIRFVKIDGENYGRSYVEEYLGDIKSLEGLSQAIVEGSAASAKV 294 Query: 303 PTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLF 361 + + +L Q + G+ E +N + E + F Sbjct: 295 LFLVNPNGVTSSSELAEAPNGEFVDGVASDVQALQLQKSGDFRVALETINTITERLEFAF 354 Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 +L+ + + +A E E A +G + L EF +++R + + + LP Sbjct: 355 MLN-SAIQRNGERVTAEEIRYMAGELEAALGGVYSILSQEFQLPLVNRIMFSMQRRKKLP 413 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 E P + +E L + + Q ++T++++ P ++ Sbjct: 
414 ELPKGTVSPTIVTGME---ALGRG---NDLTKLDQFISTIMQI------PDAASRINWGN 461 Query: 482 VSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 A L++ EV+ +QQ+++Q+ + Q + G + Sbjct: 462 YMTRRATALGIDTDGLVKTDQEVQQEQQQQQMQQAMQSGVAPAVQAAGRMMEKGQPDGSQ 521 Query: 541 A 541 A Sbjct: 522 A 522 >gi|209966578|ref|YP_002299493.1| hypothetical protein RC1_3320 [Rhodospirillum centenum SW] gi|209960044|gb|ACJ00681.1| conserved hypothetical protein [Rhodospirillum centenum SW] Length = 521 Score = 296 bits (757), Expect = 8e-78, Method: Composition-based stats. Identities = 112/487 (22%), Positives = 195/487 (40%), Gaps = 42/487 (8%) Query: 23 LNYWMEELTGFLYPYKNNAQLR----------MWDTTGSEACIKLSSLLSSLITPPGQKW 72 ++ + P ++D T ++A +L++ L + +TPP +W Sbjct: 39 WEPLWQDCYDHVLPQNARFTRDAGPGERRGELLFDGTAADAADQLAASLLAQLTPPWSRW 98 Query: 73 HGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEF 132 GLA A V ++ + L RS F + VV Sbjct: 99 AGLAPGP-------DLSAAERALVAPLLERASADLQAH--LDRSNFAVEAHQAFLDVVTG 149 Query: 133 GTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGD 192 GTGC +E G +R+ +VPL+++ + + +D+V+R T T+ Q+ +++G Sbjct: 150 GTGCLLVEEAP--PGAPSALRFTAVPLADLVLEEGAEGRLDTVFRRLTPTLAQLAARFGT 207 Query: 193 KVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQI 252 L ++ A + + R ++ AV P D + D E + Sbjct: 208 DALPGALRRRAAADPDARAAVVEAVLP----DPGGGACRWAVA---LEDDPPVLLAEGRF 260 Query: 253 ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQ 312 A P+I R+ E+YGRSP M+ALP IR N+ V + + +++ A + Sbjct: 261 AEPPFIAFRWMKAPGEVYGRSPVMKALPDIRTANKVVELVLKNASVAVTGIWQADDDGVL 320 Query: 313 RN--FDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLD 370 L PG + A+ G + G L+ L+ IR L D + Sbjct: 321 NPGTIRLVPGAIIPKAVGSAGLTPL--ASPGRFDVSQLVLDDLRAHIRHALLADRLGPVQ 378 Query: 371 DKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPP 430 +A E +E++ E +G G LQSE + ++ R L +L +G +P+ Sbjct: 379 -GPRMTATEVLERSAEMARMLGATYGRLQSELLVPLVRRCLSLLRRRGAVPDLAAD---- 433 Query: 431 
VSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWAT 490 L+ V+ SPL + QQ + L+ + +V LG + M +D + +RF A Sbjct: 434 GRLVAVQILSPLARAQQRRDAEAVLRWLESVTGLG-----DAAMRAVDLEACARFLADAA 488 Query: 491 NTPAVLI 497 PA L+ Sbjct: 489 GVPAALL 495 >gi|9634032|ref|NP_052106.1| head-to-tail joining protein [Yersinia phage phiYeO3-12] gi|6599023|emb|CAB63627.1| head-to-tail joining protein [Yersinia phage phiYeO3-12] Length = 535 Score = 292 bits (748), Expect = 8e-77, Method: Composition-based stats. Identities = 75/528 (14%), Positives = 145/528 (27%), Gaps = 45/528 (8%) Query: 1 MNQRSAKDI-----QDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWD 47 M + + ++ L N R E + P ++ W Sbjct: 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQ 60 Query: 48 TTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTL 107 G+ L+S L + P W L S + + D KV E V + Sbjct: 61 AVGARGLNNLASKLMLALFPMQS-WMKLTISEYEAKQLVGDPDG-LAKVDEGLSMVERII 118 Query: 108 FGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVN 167 + E + + L ++ G Y+ + R LS+ + + Sbjct: 119 MNYIESN--SYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYR-----LSSYVVQRD 171 Query: 168 HQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK 227 V + + V S+ K+ + +E + VY + Sbjct: 172 AYGNVLQIVTRDQIAFGA----LPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGDYL 227 Query: 228 DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNE 287 V+ + PYI R E YGRS E L +R L Sbjct: 228 KYEE------VEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLEN 281 Query: 288 TVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPLPY 346 + + +S + + L + RE Q + + Sbjct: 282 LQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGDFVPGRREDIDFLQLEKQADFTVA 341 Query: 347 HEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 ++++ + F+L V +A E E +G + L E + Sbjct: 342 KAVSDQIEARLSYAFML-NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPL 400 Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 + L L + +PE P +E + + + + ++ L Sbjct: 401 VRVLLKQLQATSQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCISAWAALAP 
454 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQ 513 GDP ++ + A ++ + + + Q Q Sbjct: 455 MQGDPD----INLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQ 498 >gi|189427230|ref|YP_001949780.1| gp8 [Salmonella phage phiSG-JL2] gi|189085883|gb|ACD75698.1| gp8 [Salmonella phage phiSG-JL2] Length = 535 Score = 292 bits (748), Expect = 8e-77, Method: Composition-based stats. Identities = 76/528 (14%), Positives = 146/528 (27%), Gaps = 45/528 (8%) Query: 1 MNQRSAKDI-----QDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWD 47 M + + ++ L N R E + P ++ W Sbjct: 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQ 60 Query: 48 TTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTL 107 G+ L+S L + P W L S + + D KV E V + Sbjct: 61 AVGARGLNNLASKLMLALFPMQS-WMKLTISEYEAKQLVGDPDG-LAKVDEGLSMVERII 118 Query: 108 FGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVN 167 + E + + L ++ G Y+ + R LS+ + + Sbjct: 119 MNYIESN--SYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYR-----LSSYVVQRD 171 Query: 168 HQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK 227 V + + V S+ K+ + +E + VY + Sbjct: 172 AYGNVLQIVTRDQIAFGA----LPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGDYL 227 Query: 228 DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNE 287 V+ + PYI R E YGRS E L +R L Sbjct: 228 KYEE------VEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLEN 281 Query: 288 TVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPLPY 346 + + +S + + L + RE Q + + Sbjct: 282 LQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGDFVPGRREDIDFLQLEKQADFTVA 341 Query: 347 HEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 ++++ + F+L F V +A E E +G + L E + Sbjct: 342 KAVSDQIEARLSYAFML-NFAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPL 400 Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 + L L + +PE P +E + + + + ++ L Sbjct: 401 VRVLLKQLQATSQIPELPKEAGEPTISTGLEAIG------RGQDLDKLERCISAWAALAP 454 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQ 513 GDP ++ + A ++ + + + Q Q Sbjct: 455 
MQGDPD----INLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQ 498 >gi|292670769|ref|ZP_06604195.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] gi|292647390|gb|EFF65362.1| conserved hypothetical protein [Selenomonas noxia ATCC 43541] Length = 567 Score = 292 bits (747), Expect = 1e-76, Method: Composition-based stats. Identities = 98/510 (19%), Positives = 190/510 (37%), Gaps = 40/510 (7%) Query: 15 YLKNQRGELNYWMEELTGFLYPYKNNAQLR------------MWDTTGSEACIKLSSLLS 62 + +R + ++L+ ++ P + + D EA K ++ L Sbjct: 27 QMMTERTQFESTWKQLSKYINPTRGRFDDEDKTQDGRRRDYFLLDPYPMEASGKCAAGLH 86 Query: 63 SLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCL 122 S +T P + W L + V+ W ++ D L G ++S L Sbjct: 87 SGLTSPSRPWFALGLQDKELAEY--------HTVKLWLEECQDVLMGI--YAKSNIYNML 136 Query: 123 QSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFT 182 + + +FGTG + D + G+ +V+ + V R+F Sbjct: 137 LNIEAELTQFGTGAALLLEDFN-----TGVWARPYTCGEYAGNVDARGRVVQFARKFKLN 191 Query: 183 VDQIVSKWGDKVLSSKMKSAL-ARNENERFTIIHAVYPKSLTDKKKDK--GNKGFHSKFV 239 Q+V ++G+ V+S +++A A+N + F + + + + + K F Sbjct: 192 AWQMVDEFGEDVVSDAVRNAYRAKNLKDYFPVTMLIEKNADYNPDSNALLNFKYKSYYFE 251 Query: 240 SVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLS 299 + F + P+++ R+ V A+ IYG P AL +L + + Sbjct: 252 DSQTDVFLKVSGYHEVPFLMPRWTVIANGIYGVGPGHNALGNCMQLQKIEKINMRLLEHR 311 Query: 300 LHPPTIAVSEAKQRNFDLKPG--YMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESI 357 P I S + PG + ++ R L++ G+ + + ++ I Sbjct: 312 SDPALIVPSS--VGKVNRLPGKETLVPDSMINGIRPLYEA--TGDRGEVMQTIQYKQQQI 367 Query: 358 RSLFLLDLFQVL--DDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILD 415 + F DLF +L D +A E E+ EK + P++ + +E + + R +I Sbjct: 368 GAAFYNDLFVMLAQQDNPQMTAREVAERHEEKLLMLSPVLEQMHNEVLAPLTRRAFEICY 427 Query: 416 SQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMD 475 G LP +K E+ S L + Q+A + + + L P MD Sbjct: 428 RNGLLPPLPEELRGQEGSIKAEFISLLAQAQKAVGTNAMEKTLAIAGNL--MGASPEIMD 485 Query: 476 HMDTDRVSRFSLWATNTPAVLIRDTAEVED 505 ++D D R + TP ++RD +V+ Sbjct: 486 
NLDLDAAIREHAQMSGTPETIMRDEQDVQK 515 >gi|17570823|ref|NP_523332.1| head-to-tail joining protein [Enterobacteria phage T3] gi|138413|sp|P20323|VHTJ_BPT3 RecName: Full=Head-to-tail joining protein gi|15714|emb|CAA35152.1| 8 [Enterobacteria phage T3] gi|17384307|emb|CAC86295.1| head-to-tail joining protein [Enterobacteria phage T3] Length = 535 Score = 292 bits (746), Expect = 1e-76, Method: Composition-based stats. Identities = 76/528 (14%), Positives = 147/528 (27%), Gaps = 45/528 (8%) Query: 1 MNQRSAKDI-----QDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWD 47 M + + ++ L N R E + P ++ W Sbjct: 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQ 60 Query: 48 TTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTL 107 G+ L+S L + P W L S + + D KV E V + Sbjct: 61 AVGARGLNNLASKLMLALFPMQS-WMKLTISEYEAKQLVGDPDG-LAKVDEGLSMVERII 118 Query: 108 FGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVN 167 + E + + L ++ G Y+ + R LS+ + + Sbjct: 119 MNYIESN--SYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYR-----LSSYVVQRD 171 Query: 168 HQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK 227 V + + V S+ KS + +E + VY + Sbjct: 172 AYGNVLQIVTRDQIAFGA----LPEDVRSAVEKSGGEKKMDEMVDVYTHVYLDEESGDYL 227 Query: 228 DKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNE 287 + V+ + PYI R E YGRS E L +R L Sbjct: 228 KYE------EVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLEN 281 Query: 288 TVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPLPY 346 + + +S + + L + RE Q + + Sbjct: 282 LQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGDFVPGRREDIDFLQLEKQADFTVA 341 Query: 347 HEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 ++++ + F+L+ V +A E E +G + L E + Sbjct: 342 KAVSDQIEARLSYAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPL 400 Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 + L L + +PE P +E + + + + ++ L Sbjct: 401 VRVLLKQLQATSQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCISAWAALAP 454 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQ 513 GDP ++ + 
A ++ + + + Q Q Sbjct: 455 MQGDPD----INLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQ 498 >gi|194100448|ref|YP_002003821.1| gp8 [Klebsiella phage K11] gi|193201387|gb|ACF15865.1| gp8 [Klebsiella phage K11] Length = 535 Score = 290 bits (741), Expect = 6e-76, Method: Composition-based stats. Identities = 81/564 (14%), Positives = 154/564 (27%), Gaps = 52/564 (9%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEA 53 + AK + ++ LKN R E + P + W + G+ Sbjct: 10 AEEGAKAV---YDRLKNDRQPYETRAESCAQYTIPSLFPKDSDNASTDYTTPWQSVGARG 66 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 L+S L + P W L S + L + KV E V + + E Sbjct: 67 LNNLASKLMLALFPMQS-WMKLTISEYEAKNLLG-DAEGLAKVDEGLSMVERIIMNYIES 124 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 + + L + G Y+ L++ + + V Sbjct: 125 N--SYRVTLFECLKQLCVAGNALLYLPEPEGYTP------MKLYRLNSYVVQRDAFGNVL 176 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKG 233 + + + V S + + E+ + VY D Sbjct: 177 QIVTLDKIAFNA----LPEDVRSQVEAAQGEQKEDAEVDVYTHVYLNESGDGYSKYE--- 229 Query: 234 FHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 + E + PYI R E YGRS E L ++ L + Sbjct: 230 ---EVAEAVVPGSEAEYPLEECPYIPVRMVRIDGESYGRSYVEEYLGDLKSLENLQESIV 286 Query: 294 QFGRLSLHPPTIAVSEAKQRNFDLKPGY-MNIGALSREGRSLFQPVQFGNPLPYHEELNR 352 + ++ + + L ++ Q + G+ + Sbjct: 287 KMAMITAKVIGLVDPAGITQVRRLTAAQSGAFVPGRKQDIEFLQLEKSGDFTVAKNVSDT 346 Query: 353 LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELD 412 ++ + F+L V +A E E +G + L E ++ L Sbjct: 347 IEARLSYAFML-NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLK 405 Query: 413 ILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPS 472 L + +PE P +E + + + + + L GD Sbjct: 406 QLQATQQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCIAAWSALKALEGD-- 457 Query: 473 CMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQD 532 D ++ + A + +E + +M +Q Q QQ + Sbjct: 458 --DDLNLANLKLRIANAIGLDT---------AGMLLTQEQKNALMAQQGAQIATQQGAAA 506 Query: 533 
IGAKAAGRAMEKKLTHDMMENSYG 556 +G A +A +S G Sbjct: 507 LGQGMAAQATASPEAMAAAADSVG 530 >gi|119637774|ref|YP_919010.1| Head-to-tail joining protein [Yersinia phage Berlin] gi|194100496|ref|YP_002003341.1| gp8 [Yersinia phage Yepe2] gi|119391805|emb|CAJ70678.1| hypothetical protein [Yersinia phage Berlin] gi|193201229|gb|ACF15710.1| gp8 [Yersinia phage Yepe2] Length = 535 Score = 289 bits (738), Expect = 1e-75, Method: Composition-based stats. Identities = 81/526 (15%), Positives = 154/526 (29%), Gaps = 44/526 (8%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEA 53 + AK + ++ LKN R E + P + W G+ Sbjct: 11 AENGAKAV---YDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARG 67 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 L+S L + P W L S + + + A KV E V L + E Sbjct: 68 LNNLASKLMLALFPMQT-WMKLTISEFEAKQLVA-QPAELAKVEEGLSMVERILMNYIES 125 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 + + L +V G Y+ + R LS+ + + V Sbjct: 126 N--SYRVTLFETLKQLVVAGNALLYIPEPEGTYNPMKLYR-----LSSYVVQRDAFGTVL 178 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKG 233 + + K L +++++ ++ + + VY D++ + K Sbjct: 179 QIVT---------LDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHIYLDEESGEYLKY 229 Query: 234 FHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 + V+ + PYI R E YGRS E L +R L + Sbjct: 230 --EEIDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIV 287 Query: 294 QFGRLSLHPPTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPLPYHEELNR 352 + +S + + L + + E S Q + + + Sbjct: 288 KMSMISAKVIGLVNPAGITQVRRLTKAQTGDFVSGRPEDISFLQLEKAADFSVARAVSEQ 347 Query: 353 LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELD 412 ++ + F+L V +A E E +G + L E M+ L Sbjct: 348 IEGRLSYAFML-NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLK 406 Query: 413 ILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPS 472 L + +PE P +E L + Q + + + L GDP Sbjct: 407 QLQATNQIPELPKEAVEPTISTGME---ALGRGQ---DLDKLERCIAAWSALAPMQGDPD 460 Query: 473 
CMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRRVM 517 ++ + A +++ E + + + Sbjct: 461 ----INIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTAMQ 502 >gi|260557979|ref|ZP_05830191.1| Bbp21 [Acinetobacter baumannii ATCC 19606] gi|260408489|gb|EEX01795.1| Bbp21 [Acinetobacter baumannii ATCC 19606] Length = 555 Score = 288 bits (737), Expect = 2e-75, Method: Composition-based stats. Identities = 98/521 (18%), Positives = 199/521 (38%), Gaps = 33/521 (6%) Query: 9 IQDRFNYLKNQR-GELNYWMEELTGFLYP---------YKNNAQ--LRMWDTTGSEACIK 56 ++ RF+ + R +++ + EL + P K++ ++ D TG ++ Sbjct: 1 MKKRFDAVWQLRVNDMDDYCAELALHVLPAAIKTIKNQEKHDRSAWSKIVDNTGKDSLKT 60 Query: 57 LSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRS 116 L++ + S P +KW L + + Q +VR+W V D + S+S Sbjct: 61 LAAGMVSGTCSPSRKWFTLQAADESLQK--------DIEVRQWLKAVEDACY--VAFSKS 110 Query: 117 GFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVY 176 + Y FG G + + + + I + ++ + N + VY Sbjct: 111 NVYRTVHHIYMQEGAFGIGA-ALAPEHGRNSKAQLMDLIPLTFGEFAITTDEFNKPNGVY 169 Query: 177 REFTFTVDQIVSKWGDKVLSSKMKSALARNENE-RFTIIHAVYPKSLTDKKKDKGNKGFH 235 R+F T +V +G +S +K+A E F + HA+Y + N F Sbjct: 170 RKFKLTSINMVKYFGLDNVSDAIKNAFENKNYEQEFEVCHAIYERVDAKGY-GPKNMPFA 228 Query: 236 SKFVS-VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQ 294 S + ++ E + F I GR+ V + ++YG PA + + +R L + ++A Sbjct: 229 SIYYEPSSSDKLLRESGLMGFQVICGRWTVSSSDVYGEGPASDCIGDLRALQKGHQQIAV 288 Query: 295 FGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRL- 353 + PP + K + P + S + + + ++ Sbjct: 289 GVDYQVRPPLLLPDYLKGHERETLPNGIAFYQASPTSQVAQVQAMLNVQFDLNGVMAQIA 348 Query: 354 --KESIRSLFLLDLFQVLD--DKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISR 409 +E ++ F DLF +LD DK +A E E+ EK +GP++ E + ++ Sbjct: 349 QCQERVKRAFHTDLFMMLDAFDKGKMTATEVYERKSEKMLMLGPVVERQIDELLRPLVEI 408 Query: 410 ELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTG 469 ++ + + + + +++ + S L Q++ A + + + ++ Sbjct: 409 CVERVLANSEYLRQIAPEAIQNADVEINFVSILALAQKSSGSAILERALAMIGQVAQV-- 466 Query: 470 DPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQR 
510 DP +D +DTD+ + R V+ IR R Sbjct: 467 DPQVLDKVDTDKFMDEYAEINGVSPDIFRPQRIVDQIRSDR 507 >gi|212671411|ref|YP_002308410.1| head-to-tail joining protein [Kluyvera phage Kvp1] gi|211997255|gb|ACJ14572.1| head-to-tail joining protein [Kluyvera phage Kvp1] Length = 535 Score = 286 bits (732), Expect = 6e-75, Method: Composition-based stats. Identities = 79/522 (15%), Positives = 145/522 (27%), Gaps = 46/522 (8%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEA 53 + AK + ++ LKN R E + P + W G+ Sbjct: 11 AENGAKAV---YDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARG 67 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 L+S L + P W L S + + + A KV E V L + E Sbjct: 68 LNNLASKLMLALFPMQT-WMKLTISEFEAKQLVA-QPAELAKVEEGLSMVERILMNYIES 125 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 + + L +V G Y+ + R LS+ + + V Sbjct: 126 N--SYRVTLFETLKQLVVAGNALLYIPEPEGTYNPMKLYR-----LSSYVVQRDAFGTVL 178 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKG 233 + + K L ++++L + + VY D++ + K Sbjct: 179 QIVT---------LDKTAYAALPEDVRNSLDSGTEHKGDEMIDVYTHIYLDEESGEYLKY 229 Query: 234 FHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 + V+ + + PYI R E YGRS E L +R L + Sbjct: 230 --EEIDGVEVDGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIV 287 Query: 294 QFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRL 353 + +S + + L G +Q + Sbjct: 288 KMSMISAKVIGLVNPAGITQVRRLTKAQT--GDFVSGRPEDISFLQLEKAADFSVAKAVS 345 Query: 354 KESIRSLFLLDLFQ--VLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISREL 411 ++ L + V +A E E +G + L E M+ L Sbjct: 346 EQIEGRLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLL 405 Query: 412 DILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDP 471 L + +PE P +E L + Q + + + L DP Sbjct: 406 KQLQATNQIPELPKEAVEPTISTGME---ALGRGQ---DLDKLERCIAAWSALAPMQNDP 459 Query: 472 SCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREV 512 ++ + A +++ E + + Sbjct: 460 D----INIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQ 497 
>gi|312436374|gb|ADQ83183.1| head to tail joining protein [Yersinia phage Yep-phi] Length = 535 Score = 286 bits (731), Expect = 7e-75, Method: Composition-based stats. Identities = 81/526 (15%), Positives = 155/526 (29%), Gaps = 44/526 (8%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEA 53 + AK + ++ LKN R E + P + W G+ Sbjct: 11 AENGAKAV---YDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARG 67 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 L+S L + P W L S + + + A KV E V L + E Sbjct: 68 LNNLASKLMLALFPMQT-WMKLTISEFEAKQLVA-QPAELAKVEEGLSMVERILMNYIES 125 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 + + L +V G Y+ + R LS+ + + V Sbjct: 126 N--SYRVTLFETLKQLVVAGNALLYIPEPEGTYNPMKLYR-----LSSYVVQRDAFGTVL 178 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKG 233 + + K L +++++ ++ + + VY D++ + K Sbjct: 179 QIVT---------LDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHIYLDEESGEYLKY 229 Query: 234 FHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 + V+ + PYI R E YGRS E L +R L + Sbjct: 230 --EEIDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIV 287 Query: 294 QFGRLSLHPPTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPLPYHEELNR 352 + +S + + L + + E S Q + + + Sbjct: 288 KMSMISAKVIGLVNPAGITQVRRLTKAQTGDFVSGRPEDISFLQLEKAADFSVARAVSEQ 347 Query: 353 LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELD 412 ++ + F+L V +A E E +G + L E M+ L Sbjct: 348 IEGRLSYAFML-NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLK 406 Query: 413 ILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPS 472 L + +PE P +E L + Q + + ++ L GDP Sbjct: 407 QLQATNQIPELPKEAVEPTISTGME---ALGRGQ---DLDKLERCISAWSALAPMQGDPD 460 Query: 473 CMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRRVM 517 ++ + A +++ E + + + Sbjct: 461 ----INIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTAMQ 502 >gi|119386466|ref|YP_917521.1| putative head-tail connector protein [Paracoccus denitrificans PD1222] gi|119377061|gb|ABL71825.1| putative head-tail 
connector protein [Paracoccus denitrificans PD1222] Length = 558 Score = 286 bits (731), Expect = 8e-75, Method: Composition-based stats. Identities = 108/565 (19%), Positives = 203/565 (35%), Gaps = 41/565 (7%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN-----------AQLRMWDTTG 50 NQ+ K + R + + EL + P + R+ D T Sbjct: 6 NQQLRKTLDYRRQAMNQEFDYWQGHFRELRDAIQPTRGRFEASERRSDSSINKRILDNTA 65 Query: 51 SEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 A L + L S +T P + W L S +V++W +V ++ Sbjct: 66 QMALRTLRAGLMSGVTSPSRPWFRLGLRGSTADE-------AEFEVKDWLHEVQRRMYEV 118 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQN 170 S L + Y + +GT + D E+ +R ++ + + + Sbjct: 119 M--RGSNIYRMLDTTYGDLGLYGTAANLVVPDF-----EDVVRGHNLQVGRFRLGEDGNG 171 Query: 171 VVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALA-RNENERFTIIHAVYPKSLTDKKKDK 229 V ++YRE V IV WG +S ++ A + FTI H + ++ D K + Sbjct: 172 RVIALYRELKMPVRGIVETWGLDAVSQSVRRAWDTGEYYQTFTICHMIDKRADGDPKAMQ 231 Query: 230 GN-KGFHSKFVSVD--ENRFFEEKQIATFPYIVGRYRVRADEIY-GRSPAMEALPTIRRL 285 + + + S + +D +F + P + R+ E + SP M AL R L Sbjct: 232 SSGRPWASIYWEMDAPSGQFLQIGGHRVKPLLAPRWEQVEGEAWSASSPGMVALGDARSL 291 Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNP-- 343 + + A + +PP I + F PG A +P P Sbjct: 292 QVSQEQKAIAIQKMHNPPLIGGAVQGGMFFKNVPGGFTAMATQDLSTGGIRPAYEVRPDI 351 Query: 344 LPYHEELNRLKESIRSLFLLDLFQV----LDDKASRSAAESMEKTREKGAFVGPLIGGLQ 399 ++ + + F DLFQ+ LD ++ +A E E+ EK +GP++ L Sbjct: 352 QGLIIDIQESQRRVEVAFYKDLFQMTALALDGRSQITAREIAERHEEKLMALGPVLESLD 411 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVN 459 E + +I + LPE +KVEY S L + Q+A + + + + Sbjct: 412 HELLQPLIEATFAYMQEADILPEAPEGIVGNP--IKVEYISLLAQAQKAIGIGAIERTIG 469 Query: 460 TVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEE 519 L P +D +D +++ R P ++ E+ ++R+ + + Sbjct: 470 FAGTLA--QIKPDVIDMIDGEQMMREFADQVGGPPGILLSPDELREVREAKARAAAQAQA 527 Query: 520 QHLQQQLQQTSQDIGAKAAGRAMEK 544 + + + + ++A M+ Sbjct: 528 IEAAEPMAG-AAKLISEATLNGMDA 551 
>gi|187736539|ref|YP_001878651.1| hypothetical protein Amuc_2060 [Akkermansia muciniphila ATCC BAA-835] gi|187426591|gb|ACD05870.1| hypothetical protein Amuc_2060 [Akkermansia muciniphila ATCC BAA-835] Length = 544 Score = 285 bits (728), Expect = 1e-74, Method: Composition-based stats. Identities = 119/547 (21%), Positives = 214/547 (39%), Gaps = 52/547 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNA-----------QLRMWDTT 49 M +R+A+ + + L QR W + L ++ P + N RM DTT Sbjct: 1 MEERTAE-LNSVYKSLAAQRAPWETWWDRLRDYVLPRRLNREGEVSLPNRDAMDRMTDTT 59 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 EAC KL+S S ITP W + +D + W +Q ++ Sbjct: 60 AVEACQKLASGHMSYITPSHDVWFK----------WSAPDDRGGDEAEAWYNQCSEI--A 107 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 +E S S F + + V GTG + D + + + ++P + N + Sbjct: 108 LKELSVSNFYTEIHECFLDRVALGTGSLFTGTSSDGR-----LLFTNIPCGQFACAENAE 162 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERF---TIIHAVYPKSLTDKK 226 VD+ REFT+T Q S +G K L K + L R N +H V P++ ++ Sbjct: 163 GRVDTYVREFTYTAHQARSMFGVKALGPKAREVLERGGNPYATTLRFLHVVRPRTRRSRR 222 Query: 227 -KDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRL 285 + + F S ++S+D+ EE FPY+V R+ YG +P P I+++ Sbjct: 223 REQASHMPFESVYLSLDDQVIVEEGGYMEFPYLVTRFLKWGSGPYGLAPGRLVFPAIQQV 282 Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLP 345 L G ++ P + + DL+ G + L + Sbjct: 283 QFLNRILDTLGEVAAFPRIL-ELANQIGEVDLRAGGRTVITPEAASLHLPREWATQGKYD 341 Query: 346 YHEE-LNRLKESIRSLFLLDLFQVLDD-KASRSAAESMEKTREKGAFVGPLIGGLQSEFI 403 + L + +++IR + L + ++ + + +A E M + E+ P S+ + Sbjct: 342 VGMDRLAQKQDAIRRAYYLPMLELWSGHRGNMTATEVMARENERVLMFSPSFTLFVSD-L 400 Query: 404 GAMISRELDILDSQGNLPECE-------GADNPPVSLLKVEYTSPLF---KYQQAESVAS 453 + ++R +L G P + V +V Y S + + Q+E + Sbjct: 401 YSTMTRIFSLLFRMGKFPRPPRAVLRVGRDGSVAVGEPRVVYQSKIALVLRRLQSEGMDR 460 Query: 454 ALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQ 513 +LQ +N +++ P DH+D D R S P ++R A+V +R++RE Sbjct: 461 
SLQRLNMMMQAA-----PDLADHVDWDHCFRLSARVDGAPESMLRPWADVRAMRKEREDL 515 Query: 514 RRVMEEQ 520 ++ Sbjct: 516 QQGASLA 522 >gi|194100286|ref|YP_002003484.1| gp8 [Enterobacteria phage BA14] gi|193201281|gb|ACF15761.1| gp8 [Enterobacteria phage BA14] Length = 535 Score = 285 bits (728), Expect = 2e-74, Method: Composition-based stats. Identities = 78/522 (14%), Positives = 145/522 (27%), Gaps = 46/522 (8%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEA 53 + AK + ++ LKN R E + P + W G+ Sbjct: 11 AENGAKAV---YDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARG 67 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 L+S L + P W L S + + + A KV E V L + E Sbjct: 68 LNNLASKLMLALFPMQT-WMKLTISEFEAKQLVA-QPAELAKVEEGLSMVERILMNYIES 125 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 + + L +V G Y+ + R LS+ + + V Sbjct: 126 N--SYRVTLFETLKQLVVAGNALLYIPEPEGTYNPMKLYR-----LSSYVVQRDAFGTVL 178 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKG 233 + + K L +++++ + + + VY D++ + K Sbjct: 179 QIVT---------LDKTAYAALPEDVRNSMDSGQEHKGDEMIDVYTHIYLDEESGEYLKY 229 Query: 234 FHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 + V+ + PYI R E YGRS E L +R L + Sbjct: 230 --EEIDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIV 287 Query: 294 QFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRL 353 + +S + + L G +Q + Sbjct: 288 KMSMISAKVIGLVNPAGITQVRRLTKAQT--GDFVSGRPEDISFLQLEKAADFSVAKAVS 345 Query: 354 KESIRSLFLLDLFQ--VLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISREL 411 ++ L + V +A E E +G + L E M+ L Sbjct: 346 EQIEGRLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLL 405 Query: 412 DILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDP 471 L + +PE P +E L + Q + + + L DP Sbjct: 406 KQLQATNQIPELPKEAVEPTISTGME---ALGRGQ---DLDKLERCIAAWSALAPMQNDP 459 Query: 472 SCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREV 512 ++ + A +++ E + + Sbjct: 460 D----INIATIKLRIANAIGIDTSGILKTPEEKQQEMAESAQ 497 
>gi|326536132|ref|YP_004300566.1| gp8 [Enterobacteria phage 285P] gi|256861521|gb|ACV32477.1| gp8 [Enterobacteria phage 285P] Length = 535 Score = 285 bits (728), Expect = 2e-74, Method: Composition-based stats. Identities = 78/522 (14%), Positives = 146/522 (27%), Gaps = 46/522 (8%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEA 53 + AK + ++ LKN R E + P + W G+ Sbjct: 11 AENGAKAV---YDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARG 67 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 L+S L + P W L S + + + A KV E V L + E Sbjct: 68 LNNLASKLMLALFPMQT-WMKLTISEFEAKQLVA-QPAELAKVEEGLSMVERILMNYIES 125 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 + + L +V G Y+ + R LS+ + + V Sbjct: 126 N--SYRVTLFETLKQLVVAGNALLYIPEPEGTYNPMKLYR-----LSSYVVQRDAFGTVL 178 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKG 233 + + K L +++++ + + + VY D++ + K Sbjct: 179 QIVT---------LDKTAYAALPEDVRNSMDSGQEHKGDEMIDVYTHIYLDEESGEYLKY 229 Query: 234 FHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 + V+ + + PYI R E YGRS E L +R L + Sbjct: 230 --EEIDGVEVDGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIV 287 Query: 294 QFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRL 353 + +S + + L G +Q + Sbjct: 288 KMSMISAKVIGLVNPAGITQVRRLTKAQT--GDFVSGRPEDISFLQLEKAADFSVAKAVS 345 Query: 354 KESIRSLFLLDLFQ--VLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISREL 411 ++ L + V +A E E +G + L E M+ L Sbjct: 346 EQIEGRLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLL 405 Query: 412 DILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDP 471 L + +PE P +E L + Q + + + L DP Sbjct: 406 KQLQATNQIPELPKEAVEPTISTGME---ALGRGQ---DLDKLERCIAAWSALAPMQNDP 459 Query: 472 SCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREV 512 ++ + A +++ E + + Sbjct: 460 D----INIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQ 497 >gi|326633070|ref|YP_004306681.1| predicted head to tail joining protein [Salmonella phage Vi06] gi|301170543|emb|CBV65231.1| predicted 
head to tail joining protein [Salmonella phage Vi06] Length = 536 Score = 284 bits (727), Expect = 2e-74, Method: Composition-based stats. Identities = 80/567 (14%), Positives = 151/567 (26%), Gaps = 54/567 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSE 52 + + AK + + LKN R + + P + W G+ Sbjct: 8 LAEDGAKSV---YERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYTTPWQAVGAR 64 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 L+S L + P W L S + L D KV E V + + E Sbjct: 65 GLNNLASKLMLALFPMQT-WMRLTISEYEAKQLLSDPDG-LAKVDEGLSMVERIIMNYIE 122 Query: 113 RSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV 172 + + L +V G Y+ LS+ + + V Sbjct: 123 SN--SYRVTLFEALKQLVVAGNVLLYLPEPDGSNYNP----MKLYRLSSYVVQRDAFGNV 176 Query: 173 DSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNK 232 + + V + + +E + +Y + + Sbjct: 177 LQMVTRDQIAFGA----LPEDVRKAVEGQGGDKKPDEVIDVYTHIYLDEESGEYLRYEE- 231 Query: 233 GFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNEL 292 ++ PYI R E YGRS E L +R L + Sbjct: 232 -----AEGMEVQGSDGSYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAI 286 Query: 293 AQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEE--- 349 + +S + + L G +Q + Sbjct: 287 VKMSMISSKVIGLVNPAGITQPRRLTKAQT--GDFVTGRPEDISFLQLEKQADFTVAKSV 344 Query: 350 LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISR 409 + ++ + F+L+ V +A E E +G + L E ++ Sbjct: 345 SDAIEARLSFAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRV 403 Query: 410 ELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTG 469 L L + +PE P +E + + + + V + Sbjct: 404 LLKQLQATQQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCVAAWAAMAPMRD 457 Query: 470 DPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQT 529 DP ++ + A I T E +++ M +Q +Q + Sbjct: 458 DPD----INLAMIKLRIANAIGIDTSGILLTEE---------QRQQKMAQQSMQLGMDSG 504 Query: 530 SQDIGAKAAGRAMEKKLTHDMMENSYG 556 + +G A +A +S G Sbjct: 505 AAALGQGMAAQATASPEAMASAADSVG 531 >gi|68299738|ref|YP_249587.1| Head-to-tail joining protein [Vibriophage VP4] 
gi|66473277|gb|AAY46286.1| head-to-tail joining protein [Vibriophage VP4] Length = 532 Score = 284 bits (726), Expect = 3e-74, Method: Composition-based stats. Identities = 75/514 (14%), Positives = 154/514 (29%), Gaps = 41/514 (7%) Query: 13 FNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 +N LKN RG E+ + P + + W + G+ L+S L Sbjct: 18 YNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLA 77 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 + P G + L S + + ++ V + E + F L + Sbjct: 78 LFPVGSSFFKLNVSELEVKQSITS-PEELTEIATGLAMVERICMNYMESN--SFRPTLHA 134 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVD 184 ++ G Y+ + +G + + N + + + V + E Sbjct: 135 AIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLH--NFVVERDAYDNVLQIVTEDKIARA 192 Query: 185 QIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDEN 244 + V S + +N +E TI V +D F S E Sbjct: 193 A----LPEDVRKSLEDAQGDQNPSEEVTIYTHV--------YRDPEAMVFRSYQEIDGEI 240 Query: 245 RFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 E + + P+I R +E YGRS E L ++ L + + +S Sbjct: 241 VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKV 300 Query: 303 PTIAVSEAKQRNFDL-KPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLF 361 + + K + A ++ +FQ ++ + + +++ + F Sbjct: 301 LFFVNPNGVTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAF 360 Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 +L+ V +A E E +G + L E ++ L L + +P Sbjct: 361 MLN-SAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIP 419 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 P +E L + + ++ +++L D ++ Sbjct: 420 NLPKEAVEPAIATGLE---ALGRG---HDLNKLNVFIDYMIKLAGLQDDD-----INLLD 468 Query: 482 VSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQR 514 V + LI + + + Sbjct: 469 VKMRLANSLGMDTTGLILTQQDKQAKMAEASTAA 502 >gi|281416195|ref|YP_003347930.1| head-to-tail joining protein [Vibrio phage N4] gi|325171309|ref|YP_004251280.1| head-to-tail joining protein [Vibrio phage ICP3] gi|237701502|gb|ACR16495.1| 
head-to-tail joining protein [Vibrio phage N4] gi|323512015|gb|ADX87477.1| head-to-tail joining protein [Vibrio phage ICP3] gi|323512160|gb|ADX87619.1| head-to-tail joining protein [Vibrio phage ICP3_2008_A] gi|323512208|gb|ADX87666.1| head-to-tail joining protein [Vibrio phage ICP3_2007_A] Length = 532 Score = 284 bits (726), Expect = 3e-74, Method: Composition-based stats. Identities = 75/514 (14%), Positives = 155/514 (30%), Gaps = 41/514 (7%) Query: 13 FNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 +N LKN RG E+ + P + + W + G+ L+S L Sbjct: 18 YNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLA 77 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 + P G + L S + + ++ V + E + F L + Sbjct: 78 LFPVGSSFFKLNVSELEVKQSITS-PEELTEIATGLAMVERICMNYMESN--SFRPTLHA 134 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVD 184 ++ G Y+ + +G + + N + + + V + E Sbjct: 135 AIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLH--NFVVERDAYDNVLQIVTEDKIARA 192 Query: 185 QIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDEN 244 + V S ++ +N +E TI V +D F S E Sbjct: 193 A----LPEDVRKSLEEAQGDQNPSEEVTIYTHV--------YRDPEAMVFRSYQEIDGEI 240 Query: 245 RFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 E + + P+I R +E YGRS E L ++ L + + +S Sbjct: 241 VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKV 300 Query: 303 PTIAVSEAKQRNFDL-KPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLF 361 + + K + A ++ +FQ ++ + + +++ + F Sbjct: 301 LFFVNPNGVTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAF 360 Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 +L+ V +A E E +G + L E ++ L L + +P Sbjct: 361 MLN-SAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIP 419 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 P +E L + + ++ +++L D ++ Sbjct: 420 NLPKEAVEPAIATGLE---ALGRG---HDLNKLNVFIDYMIKLAGLQDDD-----INLLD 468 Query: 482 VSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQR 514 V + LI + + + Sbjct: 469 
VKMRLANSLGMDTTGLILTQQDKQAKMAEASTAA 502 >gi|303327895|ref|ZP_07358334.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] gi|302861721|gb|EFL84656.1| conserved hypothetical protein [Desulfovibrio sp. 3_1_syn3] Length = 554 Score = 284 bits (726), Expect = 3e-74, Method: Composition-based stats. Identities = 96/549 (17%), Positives = 196/549 (35%), Gaps = 36/549 (6%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKN---------NAQLRMWDTTGSE 52 + K+++ +L++ R + EL + P + + +++ + Sbjct: 4 ARMDLKEVKQLVGHLESLRAKRLAQQRELGRLILPSRGLFQGEDTESLRESNLFNPAANR 63 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 A K ++ ++ ITP G W AFL + D + E+ D V + L Sbjct: 64 ALRKAAAGMTQAITPAGNPWFK--------HAFLLRRDREATGGNEYVDTVDNMLRTV-- 113 Query: 113 RSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV 172 S GF + SF ++ FG E RY +++ + Sbjct: 114 LSAGGFYRAIHSFNKELLGFGCALLGCEESP-----RTVARYFCQTCGTYCAALDEDGNL 168 Query: 173 DSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK-KKDKGN 231 D+V R T ++ ++G+ LS + L ++ + + H V ++ D + D+ N Sbjct: 169 DAVARRLLMTPRELARRFGEDRLSDVSRQKLKKDSYDPVAVRHVVQRRTARDPERADRSN 228 Query: 232 KGFHSKFVSVDE-NRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVN 290 + S + F + + P+ + +YG P EAL + + Sbjct: 229 MPWGSWWYEEGGAADFLDVGGFRSMPFFFTVWEEARG-VYGTGPGDEALADQKGIEGWEL 287 Query: 291 ELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNI-GALSREGRSLFQPVQFGNP-LPYHE 348 A + P + + D PG + G + V FG E Sbjct: 288 RKAVGVEKMIDPV-LVSQGPLKAYVDTSPGAVIPSGGFGADSLKPLYEVNFGPAVQHVQE 346 Query: 349 ELNRLKESIRSLFLLDLFQVLD---DKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGA 405 E++++ + + + ++F + A + E M++ R +GP + G + + Sbjct: 347 EISQISLRLEDVMMANIFASMSLETRPAGMTMTEYMDRRRRSAELMGPTVSGYEPRILSP 406 Query: 406 MISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELG 465 ++ +L+ G LP + P + L V Y SP+ + + + + Sbjct: 407 VLENTFGLLEEYGLLPGPPDGLS-PFASLNVSYQSPMAQMLEQSGAVAIQSLFELAAPM- 464 Query: 466 VKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQ 525 P D +D ++ PA ++R V +RQQR + ++Q + + Sbjct: 465 
-LRAVPDLADKIDFEQAIDELAQRLGVPASVVRSDETVAAMRQQRAEAQAAQQQQMAEAR 523 Query: 526 LQQTSQDIG 534 + Q +G Sbjct: 524 MLQQVAALG 532 >gi|323512062|gb|ADX87523.1| head-to-tail joining protein [Vibrio phage ICP3_2009_B] gi|323512111|gb|ADX87571.1| head-to-tail joining protein [Vibrio phage ICP3_2009_A] Length = 532 Score = 284 bits (726), Expect = 3e-74, Method: Composition-based stats. Identities = 75/514 (14%), Positives = 155/514 (30%), Gaps = 41/514 (7%) Query: 13 FNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 +N LKN RG E+ + P + + W + G+ L+S L Sbjct: 18 YNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLA 77 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 + P G + L S + + ++ V + E + F L + Sbjct: 78 LFPVGSSFFKLNVSELEVKQSITS-PEELTEIATGLAMVERICMNYMESN--SFRPTLHA 134 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVD 184 ++ G Y+ + +G + + N + + + V + E Sbjct: 135 AIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLH--NFVVERDAYDNVLQIVTEDKIARA 192 Query: 185 QIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDEN 244 + V S ++ +N +E TI V +D F S E Sbjct: 193 A----LPEDVRKSLEEAQGDQNPSEEVTIYTHV--------YRDPEAMVFRSYQEIDGEI 240 Query: 245 RFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 E + + P+I R +E YGRS E L ++ L + + +S Sbjct: 241 VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKV 300 Query: 303 PTIAVSEAKQRNFDL-KPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLF 361 + + K + A ++ +FQ ++ + + +++ + F Sbjct: 301 LFFVNPNGVTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAF 360 Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 +L+ V +A E E +G + L E ++ L L + +P Sbjct: 361 MLN-SAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIP 419 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 P +E L + + ++ +++L D ++ Sbjct: 420 NLPKEAVEPAIATGLE---ALGRG---HDLNKLNVFIDYMIKLAGLQDDD-----INLLD 468 Query: 482 VSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQR 514 V + LI + + + Sbjct: 469 
VKMRLANSLGMDTTGLILTQQDKQAKMAEASTAA 502 >gi|194473831|ref|YP_002048655.1| head-to-tail joining protein [Morganella phage MmP1] gi|194307052|gb|ACF42034.1| head-to-tail joining protein [Morganella phage MmP1] Length = 543 Score = 284 bits (725), Expect = 4e-74, Method: Composition-based stats. Identities = 74/541 (13%), Positives = 153/541 (28%), Gaps = 45/541 (8%) Query: 13 FNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 ++ LKN R E + P + W + G+ L+S L Sbjct: 20 YDRLKNDRAPYETRAENCAKYTIPSLFPKSSDNASTDYTTPWQSAGARGLNNLASKLMLA 79 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 + P W L S + + + E+ KV V + + E + + L Sbjct: 80 LFPMQT-WMKLTISEFSAKELVGNEEG-LAKVDAALSMVERIIMNYIETN--SYRVALFE 135 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVD 184 ++ G Y+ + I+ +P + + V + E Sbjct: 136 GLKQLIVAGNVLLYLPPPEESDEGYNPIKVYKLP--SFVCQRDSFGNVLQIVTEDKIAFG 193 Query: 185 QIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDEN 244 + + + S + +E T+ +Y + + + Sbjct: 194 ALD----EDIRKMVEASGGEKKPDEEITVYTHIYLDDESGQYLKYEE------VEGEEIA 243 Query: 245 RFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPT 304 PYI R + E YGRS E L ++ L + + ++ Sbjct: 244 GTDAAYPYEANPYIPVRMVRLSGESYGRSYCEEYLGDLKSLENLHEAMVKMSMIAAKVVG 303 Query: 305 IAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLL 363 + + + + E Q + + + ++ + F+L Sbjct: 304 LVNPAGMTQIRQVSKADTGDYVPGKPEDIHFLQLEKQADFSVAKTIADNIEARLSFAFML 363 Query: 364 DLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPEC 423 + V +A E E +G + L E ++ L+ L + +PE Sbjct: 364 N-SAVQRTAERVTAEEIRYVASELEDTLGGVYSNLSQELQLPIVKVLLNQLQATAKIPEL 422 Query: 424 EGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVS 483 P +E + + + + + L DP ++ + Sbjct: 423 PQEAVEPAISTGLEAIG------RGQDLDRLERCIAAWAALAPMANDPD----INLSTIK 472 Query: 484 RFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAME 543 A I E +++ + E +QQ + + +G AG A E Sbjct: 473 LRIANAIGIDT---------AGILLTEEQKQQKLAEAAMQQGMMTGANQLGGGMAGMATE 523 
Query: 544 K 544 Sbjct: 524 S 524 >gi|37956836|gb|AAP34103.1| gene 8 [Enterobacteria phage T7] gi|37956889|gb|AAP34155.1| gene 8 [Enterobacteria phage T7] Length = 536 Score = 280 bits (717), Expect = 3e-73, Method: Composition-based stats. Identities = 76/535 (14%), Positives = 144/535 (26%), Gaps = 50/535 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSE 52 + + AK + + LKN R + + P + W G+ Sbjct: 8 LAEDGAKSV---YERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGAR 64 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 L+S L + P W L S + L D KV E V + + E Sbjct: 65 GLNNLASKLMLALFPMQT-WMRLTISEYEAKQLLSDPDG-LAKVDEGLSMVERIIMNYIE 122 Query: 113 RSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV 172 + + L +V G Y+ LS+ + + V Sbjct: 123 SN--SYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNP----MKLYRLSSYVVQRDAFGNV 176 Query: 173 DSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNK 232 + + + + + +E + +Y + + Sbjct: 177 LQMVTRDQIAFGA----LPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYEE- 231 Query: 233 GFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNEL 292 ++ PYI R E YGRS E L +R L + Sbjct: 232 -----VEDMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAI 286 Query: 293 AQFGRLSLHPPTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPLPYHEELN 351 + +S + + L + E S Q + + + Sbjct: 287 VKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSD 346 Query: 352 RLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISREL 411 ++ + F+L+ V +A E E +G + L E ++ L Sbjct: 347 AIEARLSFAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLL 405 Query: 412 DILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDP 471 L + +PE P +E + + + + V L DP Sbjct: 406 KQLQATQQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCVTAWAALAPMRNDP 459 Query: 472 SCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQL 526 ++ + A I T E +++ M +Q +Q + Sbjct: 460 D----INLAMIKLRIANAIGIDTSGILLTEE---------QKQQKMAQQSMQMGM 501 >gi|194100395|ref|YP_002003970.1| gp8 [Enterobacteria phage 13a] 
gi|193201442|gb|ACF15919.1| gp8 [Enterobacteria phage 13a] Length = 536 Score = 280 bits (717), Expect = 3e-73, Method: Composition-based stats. Identities = 77/539 (14%), Positives = 148/539 (27%), Gaps = 51/539 (9%) Query: 1 MNQRSA----KDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDT 48 M ++ + + + LKN R + + P + + W Sbjct: 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQA 60 Query: 49 TGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLF 108 G+ L+S L + P W L S + L D KV E V + Sbjct: 61 VGARGLNNLASKLMLALFPMQT-WMRLTISEYEAKQLLSDPDG-LAKVDEGLSMVERIIM 118 Query: 109 GFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNH 168 + E + + L +V G Y+ LS+ + + Sbjct: 119 NYIESN--SYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNP----MKLYRLSSYVVQRDA 172 Query: 169 QNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKD 228 V + + + + + +E + +Y +D Sbjct: 173 FGNVLQMVTRDQIAFGA----LPEDIRKAVEGQGGEKKADETIDVYTHIYLD------ED 222 Query: 229 KGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 G + + ++ PYI R E YGRS E L +R L Sbjct: 223 SGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENL 282 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPLPYH 347 + + +S + + L + E S Q + + Sbjct: 283 QEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAK 342 Query: 348 EELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 + ++ + F+L+ V +A E E +G + L E ++ Sbjct: 343 AVSDAIEARLSFAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLV 401 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVK 467 L L + +PE P +E + + + + V L Sbjct: 402 RVLLKQLQATQQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCVAAWAALAPM 455 Query: 468 TGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQL 526 DP ++ + A I T E +++ M +Q +Q + Sbjct: 456 RDDPD----INLAMIKLRIANAIGIDTSGILLTEE---------QKQQKMAQQSMQMGM 501 >gi|212703247|ref|ZP_03311375.1| hypothetical protein DESPIG_01289 [Desulfovibrio piger ATCC 29098] gi|212673291|gb|EEB33774.1| hypothetical protein DESPIG_01289 
[Desulfovibrio piger ATCC 29098] Length = 552 Score = 280 bits (717), Expect = 3e-73, Method: Composition-based stats. Identities = 98/564 (17%), Positives = 197/564 (34%), Gaps = 42/564 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNN---------AQLRMWDTTGS 51 M + K+++ +L+ R + E+ + P + + + Sbjct: 1 MAAPTLKELKQLVAHLEGLRSKRLAQQWEIGKLILPSRGLFQGEETECLRDANLLNPAAQ 60 Query: 52 EACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFR 111 A K ++ ++ ITP W FL + D E+ D V + Sbjct: 61 RALGKAAAGMTQAITPASSPWFR--------HQFLDRADREVTGGNEYVDVVDARIRAV- 111 Query: 112 ERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNV 171 + GF + +F ++ FG +A R+ ++++ Sbjct: 112 -LAAGGFYSAIHAFNRELLGFGCALLSCDA-----SARTVARFACQTCGTYAVALDEDRT 165 Query: 172 VDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK-DKG 230 + V R T ++ ++G L + L ++ V + D ++ D Sbjct: 166 LSCVVRRLRMTPVEMSRRFGRDRLCEATRQKLESQPYAPIEVVQVVRKREERDPERGDNR 225 Query: 231 NKGFHS-KFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 N F S + E + P+ + +YG P +AL + + Sbjct: 226 NMPFASFWYEDQGGTELLRESGFRSMPFFFSTWEDARG-VYGTGPGDDALADQKGIEAWE 284 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRS--LFQPVQFGNP-LPY 346 A + + PP +A +R+ PG + + + V FG Sbjct: 285 KRKAVGIEMMIQPPLLAP-GTLKRHVRAMPGSVISDTAYGQSNALRPLYEVNFGPAVGAV 343 Query: 347 HEELNRLKESIRSLFLLDLFQVLD---DKASRSAAESMEKTREKGAFVGPLIGGLQSEFI 403 +E+ ++ + + ++F + A + E M++ R +GP + + + Sbjct: 344 QQEIEQISMRLEDVMKANIFANMSLETRPAGMTMTEYMDRRRRAAELMGPTVSSYEPRVL 403 Query: 404 GAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVE 463 I R +LD +G LP + P + L V Y SP+ + + + S Q ++ V Sbjct: 404 TLCIERVYQLLDEEGLLPPPPQGLS-PWATLNVSYQSPMAQMLEQAAAVSIGQFMDQVGP 462 Query: 464 LGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQ 523 P+ +D +D D++ PA +IR +V IRQQRE ++ ++ Sbjct: 463 WAQSQ--PTILDKLDLDQMVDELAQRLGVPASIIRSDEQVAAIRQQREQAAAAQQQAAME 520 Query: 524 QQLQQTSQDIG-----AKAAGRAM 542 Q+ ++ +G AG+ M Sbjct: 521 VQMMESMAKMGNVKTEGTVAGKVM 544 >gi|9627467|ref|NP_041995.1| head-tail 
connector protein [Enterobacteria phage T7] gi|138414|sp|P03728|VHTJ_BPT7 RecName: Full=Head-to-tail joining protein gi|15602|emb|CAA24425.1| unnamed protein product [Enterobacteria phage T7] gi|37956678|gb|AAP33948.1| gene 8 [Enterobacteria phage T7] gi|265524999|gb|ACY75862.1| head-to-tail joining protein [Enterobacteria phage T7] Length = 536 Score = 280 bits (715), Expect = 6e-73, Method: Composition-based stats. Identities = 76/535 (14%), Positives = 144/535 (26%), Gaps = 50/535 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSE 52 + + AK + + LKN R + + P + W G+ Sbjct: 8 LAEDGAKSV---YERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGAR 64 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 L+S L + P W L S + L D KV E V + + E Sbjct: 65 GLNNLASKLMLALFPMQT-WMRLTISEYEAKQLLSDPDG-LAKVDEGLSMVERIIMNYIE 122 Query: 113 RSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV 172 + + L +V G Y+ LS+ + + V Sbjct: 123 SN--SYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNP----MKLYRLSSYVVQRDAFGNV 176 Query: 173 DSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNK 232 + + + + + +E + +Y + + Sbjct: 177 LQMVTRDQIAFGA----LPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYEE- 231 Query: 233 GFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNEL 292 ++ PYI R E YGRS E L +R L + Sbjct: 232 -----VEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAI 286 Query: 293 AQFGRLSLHPPTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPLPYHEELN 351 + +S + + L + E S Q + + + Sbjct: 287 VKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSD 346 Query: 352 RLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISREL 411 ++ + F+L+ V +A E E +G + L E ++ L Sbjct: 347 AIEARLSFAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLL 405 Query: 412 DILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDP 471 L + +PE P +E + + + + V L DP Sbjct: 406 KQLQATQQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCVTAWAALAPMRDDP 459 Query: 472 SCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQL 526 ++ 
+ A I T E +++ M +Q +Q + Sbjct: 460 D----INLAMIKLRIANAIGIDTSGILLTEE---------QKQQKMAQQSMQMGM 501 >gi|158425212|ref|YP_001526504.1| head-to-tail joining protein [Azorhizobium caulinodans ORS 571] gi|158332101|dbj|BAF89586.1| head-to-tail joining protein [Azorhizobium caulinodans ORS 571] Length = 511 Score = 279 bits (714), Expect = 7e-73, Method: Composition-based stats. Identities = 63/510 (12%), Positives = 131/510 (25%), Gaps = 50/510 (9%) Query: 4 RSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACI 55 + A R+ L R + P N + G+ Sbjct: 3 KPATTAAGRYTQLATIRSPYLERARDCATLTIPSLMPRAGHGAANDLPTPFQGMGARGVN 62 Query: 56 KLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSR 115 L S L + PP Q + L A L +D +V + Q+ + E Sbjct: 63 NLGSKLLLALMPPNQPFFRLMLDDFAL-QELTGQDGMRTEVEKALGQIERAVQTEVETGA 121 Query: 116 SGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSV 175 ++ G Y++ L + + V + Sbjct: 122 --IRVSAFEALKQLLVAGNVLLYVQPTGGV---------KVYRLDRYVVKRDPSGNVLEI 170 Query: 176 YREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFH 235 + + + K+ + + +++ G H Sbjct: 171 VIHERVSPLALPEELQRKLGEQRKGVQDTIDLYTWI--------------RRESGKFVVH 216 Query: 236 SKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQF 295 + E P+I R+ E YGR E + +R L + + Sbjct: 217 QEVKGEKVPGTDGEWPTDKAPFIALRWAKIDGEDYGRGHVEEYIGDLRSLEALTRAIVEG 276 Query: 296 GRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELN---R 352 + + + A+ + +Q + L R Sbjct: 277 AAAAAKVLFLVNPNGVTNERTISEA--PNMAVRSGNKEDVNVLQVEKFNDFRVALETVGR 334 Query: 353 LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELD 412 L+ + FLL + D +A E E +G + L EF ++ R + Sbjct: 335 LEIRLSQAFLLTSS-IQRDAERVTAEEIRVMAGELEDALGGVYSILAQEFQLPLVRRLIF 393 Query: 413 ILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPS 472 ++ LP P + +E L + + + + G + Sbjct: 394 QMEQDERLPSLPPDLVKPSIITGME---ALGRG------HDLNRLMMFAKVVNDLLGPGA 444 Query: 473 CMDHMDTDRVSRFSLWATNTPA-VLIRDTA 501 + D ++ + A + +++ Sbjct: 445 LPSYADARKLIERAGVALSVDTSDILKSDE 474 
>gi|37956731|gb|AAP34000.1| gene 8 [Enterobacteria phage T7] gi|37956781|gb|AAP34049.1| gene 8 [Enterobacteria phage T7] Length = 536 Score = 279 bits (713), Expect = 1e-72, Method: Composition-based stats. Identities = 76/535 (14%), Positives = 144/535 (26%), Gaps = 50/535 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSE 52 + + AK + + LKN R + + P + W G+ Sbjct: 8 LAEDGAKSV---YERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGAR 64 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 L+S L + P W L S + L D KV E V + + E Sbjct: 65 GLNNLASKLMLALFPMQT-WMRLTISEYEAKQLLSDPDG-LAKVDEGLSMVERIIMNYIE 122 Query: 113 RSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV 172 + + L +V G Y+ LS+ + + V Sbjct: 123 SN--SYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNP----MKLYRLSSYVVQRDAFGNV 176 Query: 173 DSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNK 232 + + + + + +E + +Y + + Sbjct: 177 LQMVTRDQIAFGA----LPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYEE- 231 Query: 233 GFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNEL 292 ++ PYI R E YGRS E L +R L + Sbjct: 232 -----VEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAI 286 Query: 293 AQFGRLSLHPPTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPLPYHEELN 351 + +S + + L + E S Q + + + Sbjct: 287 VKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSD 346 Query: 352 RLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISREL 411 ++ + F+L+ V +A E E +G + L E ++ L Sbjct: 347 AIEARLSFAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLL 405 Query: 412 DILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDP 471 L + +PE P +E + + + + V L DP Sbjct: 406 KQLQATQQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCVTAWAALAPMRDDP 459 Query: 472 SCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQL 526 ++ + A I T E +++ M +Q +Q + Sbjct: 460 D----INLAMIKLRIANAIGIDTSGILLTEE---------QKQQKMVQQSMQMGM 501 >gi|30387485|ref|NP_848294.1| head-to-tail joining protein [Yersinia pestis phage phiA1122] gi|30314122|gb|AAP20530.1| 
head-to-tail joining protein [Yersinia pestis phage phiA1122] Length = 536 Score = 277 bits (709), Expect = 3e-72, Method: Composition-based stats. Identities = 77/535 (14%), Positives = 146/535 (27%), Gaps = 50/535 (9%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSE 52 + + AK + + LKN R + + P + W G+ Sbjct: 8 LAEDGAKSV---YERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGAR 64 Query: 53 ACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRE 112 L+S L + P W L S + L D KV E V + + E Sbjct: 65 GLNNLASKLMLALFPMQT-WMRLTISEYEAKQLLSDPDG-LAKVDEGLSMVERIIMNYIE 122 Query: 113 RSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV 172 + + L +V G Y+ LS+ + + V Sbjct: 123 SN--SYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNP----MKLYRLSSYVVQRDAFGNV 176 Query: 173 DSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNK 232 + + + + + +E + +Y + G Sbjct: 177 LQMVTRDQIAFGA----LPEDIRKAVEGQGGEKKADETIDVYTHIYLD------EASGEY 226 Query: 233 GFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNEL 292 + + ++ PYI R E YGRS E L +R L + Sbjct: 227 LRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAI 286 Query: 293 AQFGRLSLHPPTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPLPYHEELN 351 + +S + + L + E S Q + + + Sbjct: 287 VKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSD 346 Query: 352 RLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISREL 411 ++ + F+L+ V +A E E +G + L E ++ L Sbjct: 347 AIEARLSFAFMLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLL 405 Query: 412 DILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDP 471 L + +PE P +E + + + + V L DP Sbjct: 406 KQLQATQQIPELPKEAVEPTISTGLEAIG------RGQDLDKLERCVTAWAALAPMRDDP 459 Query: 472 SCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQL 526 ++ + A I T E +++ M +Q +Q + Sbjct: 460 D----INLAMIKLRIANAIGIDTSGILLTEE---------QKQQKMAQQSMQMGM 501 >gi|77118196|ref|YP_338118.1| head to tail connector [Enterobacteria phage K1F] gi|72527940|gb|AAZ72992.1| head to tail connector [Enterobacteria phage K1F] 
gi|83308148|emb|CAJ29381.1| gp8 protein [Enterobacteria phage K1F] Length = 522 Score = 277 bits (708), Expect = 3e-72, Method: Composition-based stats. Identities = 88/559 (15%), Positives = 165/559 (29%), Gaps = 52/559 (9%) Query: 1 MNQRS---AKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTT 49 M +R A+ + ++ LKN R + P + W Sbjct: 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAV 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ L++ L + P W L S + +A ++ V E V L Sbjct: 61 GARCLNNLAAKLMLALFP-QSPWMRLTVSEYEAKTLSQDSEAAAR-VDEGLAMVERVLMA 118 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 + E + F L ++ G Y+ E+G +R L + + + Sbjct: 119 YMETN--SFRVPLFEALKQLIVSGNCLLYIPEP--EQGTYSPMRM--YRLVSYVVQRDAF 172 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDK 229 + + + K L +KS L ++ E T + T + Sbjct: 173 GNILQIVT---------IDKVAFSALPEDVKSQLNADDYEPDTELEV-----YTHIYRQD 218 Query: 230 GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 + + ++ + PYI R E YGRS E L + L Sbjct: 219 DEYLRYEEVEGIEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETIT 278 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEE 349 + + +++ + + L G +Q + Sbjct: 279 EAITKMAKVASKVVGLVNPNGITQPRRLNKAAT--GEFVAGRVEDINFLQLTKGQDFTIA 336 Query: 350 ---LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 + +++ + FLL V + +A E E A +G + E + Sbjct: 337 KSVADAIEQRLGWAFLL-NSAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPI 395 Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 + ++ L S G +P+ P +E L + Q E + Q VN + L Sbjct: 396 VRVLMNQLQSAGMIPDLPKEAVEPTVSTGLE---ALGRGQDLEKLT---QAVNMMTGLQP 449 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTP-AVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQ 525 + DP ++ + L A A L+ E I++ E + Q Sbjct: 450 LSQDPD----INLPTLKLRLLNALGIDTAGLLLTQDE--KIQRMAEQSSQQAVVQGASAA 503 Query: 526 LQQTSQDIGAKAAGRAMEK 544 +G A + Sbjct: 504 GANMGAAVGQGAGEDMAQA 522 >gi|194100340|ref|YP_002003770.1| gp8 [Enterobacteria 
phage EcoDS1] gi|193201335|gb|ACF15814.1| gp8 [Enterobacteria phage EcoDS1] Length = 522 Score = 277 bits (708), Expect = 4e-72, Method: Composition-based stats. Identities = 86/559 (15%), Positives = 164/559 (29%), Gaps = 52/559 (9%) Query: 1 MNQRS---AKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTT 49 M +R A+ + ++ LKN R + P + W + Sbjct: 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQSV 60 Query: 50 GSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFG 109 G+ L++ L + P W L S + +A ++ V E V L Sbjct: 61 GARCLNNLAAKLMLALFP-QSPWMRLTVSEYEAKTLSQDSEAAAR-VDEGLAMVERVLMA 118 Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 + E + F L ++ G Y+ E+G +R L + + + Sbjct: 119 YMETN--SFRVPLFEALKQLIVSGNCLLYIPEP--EQGTYSPMRM--YRLVSYVVQRDAF 172 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDK 229 + + + K L +KS L ++ E T + T + Sbjct: 173 GNILQIVT---------LDKVAFSALPEDVKSQLNTDDYEPDTELEV-----YTHIYRQD 218 Query: 230 GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 + + ++ + PYI R E YGRS E L + L Sbjct: 219 DEYLRYEEVEGIEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETIT 278 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEE 349 + + +++ + + L G +Q + Sbjct: 279 EAITKMAKVASKVVGLVNPNGITQPRRLNKAAT--GEFVAGRVEDINFLQLTKGQDFTIA 336 Query: 350 ---LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 + +++ + FLL V + +A E E A +G + E + Sbjct: 337 KSVADAIEQRLGWAFLL-NSAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPI 395 Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 + ++ L S G +P+ P +E L + Q E + Q VN + L Sbjct: 396 VRVLMNQLQSAGMIPDLPKEAVEPTVSTGLE---ALGRGQDLEKLT---QAVNMMTGLQP 449 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTP-AVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQ 525 DP ++ + L A A L+ E +++ E + Sbjct: 450 LQQDPD----INLPTLKLRLLNALGIDTAGLLLTQDE--KLQRMAEQSAQGAVVNGASAA 503 Query: 526 LQQTSQDIGAKAAGRAMEK 544 +G A + Sbjct: 504 GANMGAAVGQGAGEDMAQA 522 >gi|29366727|ref|NP_813772.1| head-tail connector 
protein [Pseudomonas phage gh-1] gi|29243586|gb|AAO73165.1|AF493143_26 head-tail connector protein [Pseudomonas phage gh-1] Length = 543 Score = 274 bits (700), Expect = 3e-71, Method: Composition-based stats. Identities = 82/570 (14%), Positives = 166/570 (29%), Gaps = 58/570 (10%) Query: 1 MNQRSAKDIQDR-----FNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWD 47 M + + + + + LKN R E P + W Sbjct: 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTTPWQ 60 Query: 48 TTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTL 107 G+ LS+ + + P W L S + + + ++ V + V L Sbjct: 61 AVGARGLNNLSAKVMLALFPLQS-WMKLKVSEWQAKQLV-SDPSQLAVVEQGLGMVERIL 118 Query: 108 FGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVN 167 + E + + L + GT Y+ ++ L N + + Sbjct: 119 MSYMEAN--SYRVTLFELIRQLALAGTALIYLPPPDASSNSYNPMKL--YTLHNHVVQRD 174 Query: 168 HQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKK 227 V + + K L ++++L+ + + + T Sbjct: 175 AFGNVLQIVT---------LDKVAYAALPEDVRNSLSGGQEYKPEQE----LEVYTHIYI 221 Query: 228 DKGNKGFHSKFVSVDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRL 285 D + F S + Q P+I R+ R E YGRS E L + L Sbjct: 222 DDESGDFLSYQEIEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSL 281 Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPL 344 + +F +S + + L + A + Q + + Sbjct: 282 ESLNEAMIKFAMISSKVVGLVNPNGITQVRRLVKAQTGDFVAGRKADIEFLQLEKTADFT 341 Query: 345 PYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIG 404 + ++ + +F+L V +A E E +G + L E Sbjct: 342 VAKSVADAIEARLSYVFML-NSAVQRSGERVTAEEIRYVASELEDTLGGVYSILSQELQL 400 Query: 405 AMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVEL 464 ++ L+ L + +P P E L + Q + Q +N V + Sbjct: 401 PIVRVLLNQLQATQQIPNLPQEAVEPTVTTGAE---ALGRGQ---DLDKLTQFLNAVATV 454 Query: 465 GVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQ 524 GDP ++ + + A + + + ++ L+Q Sbjct: 455 SQLNGDPD----LNVNNIKLRLANAIGIDT---------AGLLLTEAEKAQAQSQEMLKQ 501 Query: 525 QLQQTSQDIGAKAAGRAMEKKLTHDMMENS 554 + IG+ A +A + + ME++ Sbjct: 502 
GGLNAAAGIGSGVAAQATA---SPEAMESA 528 >gi|317487284|ref|ZP_07946079.1| hypothetical protein HMPREF0179_03442 [Bilophila wadsworthia 3_1_6] gi|316921474|gb|EFV42765.1| hypothetical protein HMPREF0179_03442 [Bilophila wadsworthia 3_1_6] Length = 554 Score = 263 bits (672), Expect = 6e-68, Method: Composition-based stats. Identities = 72/549 (13%), Positives = 153/549 (27%), Gaps = 46/549 (8%) Query: 11 DRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLS 62 R+ L R PY + ++ + G+ L+S L Sbjct: 22 TRYTELSQDRAPYLDRARRCAELTIPYLIPPDDLAQGQELPSLYQSVGANGVTNLASKLL 81 Query: 63 SLITPPGQKWHGLAESFSAYQAFLYKEDAR-SKKVREWCDQVTDTLFGFRERSRSGFVGC 121 + PP + L + + D K+ + ++ + + SG Sbjct: 82 LTMLPPNEPCFRLRVNNLVVEREEENADKEFRTKIEKALSRIEQAVLA--DIEASGDRPV 139 Query: 122 LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTF 181 + ++ G ++ + PLS + + + E T Sbjct: 140 VAEGNQHLIVAGNVLYHDDPKKG---------LRLFPLSRYVVERDPMGTPVEIVVEETV 190 Query: 182 TVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSV 241 +D + ++ +++ A T K+ + + V Sbjct: 191 NLDTLPED-----VAERIREAADTLGQPSIKGDDRKDVNIYTHLKRGPKKWSVYQECRGV 245 Query: 242 DENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLH 301 ++ P++ R A E YGRS L + L L + +S Sbjct: 246 KLPGSEGSYKLEACPWLPVRMYSIAGENYGRSFVELQLGDLGSLESLCQSLVEGSAVSAK 305 Query: 302 PPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYH---EELNRLKESIR 358 + L G + +Q + ++ RL++ ++ Sbjct: 306 VVGLVNPNGVTDPKALAESA--NGDMIEGNADDVAFLQVQKGADFQVVAAQIQRLEQRLK 363 Query: 359 SLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQG 418 + FL+ + V D +A E +E +G + + EF I+ + + Q Sbjct: 364 TAFLM-MDGVRRDAERVTAEEIRVIAQELETGLGGVYTLISQEFQLPYIASRMATMTRQK 422 Query: 419 NLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMD 478 +PE P + E + + + + G + S + ++ Sbjct: 423 RIPELPKGTVTPSIVTGFEAI---------GRGNDKQKLLEFL-KAGTELMGESFLGLLN 472 Query: 479 TDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRRVMEEQH----LQQQLQQTSQDI 533 A L++D E+ RQ + Q + + Sbjct: 473 PQNAVTRLASAMGISTEGLVKDEEELAQERQAAQQQAQGQMMMEKLGPEALRQIGGMAQA 
532 Query: 534 GAKAAGRAM 542 G A + M Sbjct: 533 GNAEALQGM 541 >gi|282857730|ref|ZP_06266939.1| head-to-tail joining protein [Pyramidobacter piscolens W5455] gi|282584400|gb|EFB89759.1| head-to-tail joining protein [Pyramidobacter piscolens W5455] Length = 534 Score = 262 bits (668), Expect = 1e-67, Method: Composition-based stats. Identities = 67/503 (13%), Positives = 142/503 (28%), Gaps = 45/503 (8%) Query: 8 DIQDRFNYLKNQRGELNYWMEELTGFLYPY-------KNNAQLRMWDTTGSEACIKLSSL 60 + RF L R E+ + PY + + G+E LSS Sbjct: 17 TFKARFELLAGIRESYCQRAEQCSALTDPYLFPKDGVTGEKVASPYQSVGAEGVTNLSSR 76 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 + ++I PP + L + L +E +++ E Q+ + E Sbjct: 77 ILNIILPPNRPPFRLRVEKNPA---LPEEKRNWQQIEEGLAQLEKMVCDHIETLE--DRV 131 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 + ++ G ++ D GIR S L N +S + + V + Sbjct: 132 VIAEAIPHLLVTGNVLLHVRKD--------GIRLHS--LRNYVVSRDPRGNVAEIIVREK 181 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 + + ++ + + S Sbjct: 182 VDPRFLALPLATSTTDAPENDRRPEDKASYKELFTQIKRTENG-----------WSLQQE 230 Query: 241 VDENRFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRL 298 VD + P++ R + E YGR + L + L + + Sbjct: 231 VDGKFVSKHGHYKKDECPWLPLRMYRVSGESYGRGYVEKYLGDHKSLEALTKAIVEGAAA 290 Query: 299 SLHPPTIAVSEAKQRNFDL-KPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESI 357 + + L + G + I S S Q + + + L++ + Sbjct: 291 CAKVVFLVSPNGTLKAKQLEEAGNLAILTGSAAEVSTVQVQKANDFQIAKAMADNLQQRL 350 Query: 358 RSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQ 417 +LL+ + + +A E +E +G L L EF + + + Sbjct: 351 SRAYLLN-SAIQRNAERVTAEEIRYMAQELETALGGLYSMLSMEFQHPYVKLRMKYMKED 409 Query: 418 GNLPECEGADNPPVSLLKVE-YTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDH 476 LP+ + +K+ L + Q A + + G + + Sbjct: 410 ALLPDLDQQYQEGKVGVKIVTGIDALGRGQDASRLT------EWAGIVFKTIGPQVALPY 463 Query: 477 MDTDRVSRFSLWATNTP-AVLIR 498 ++ + + L++ Sbjct: 464 INASAFMKALANSMGIDGVSLLK 486 >gi|326536937|ref|YP_004306344.1| head-tail connector protein 
[Pseudomonas phage phiIBB-PF7A] gi|318054513|gb|ADV35689.1| head-tail connector protein [Pseudomonas phage phiIBB-PF7A] Length = 535 Score = 261 bits (666), Expect = 3e-67, Method: Composition-based stats. Identities = 76/566 (13%), Positives = 151/566 (26%), Gaps = 58/566 (10%) Query: 1 MNQRSA----KDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDT 48 M + + + ++ LK+ R E P + + Sbjct: 1 MAETRTGLAEEGAKAVYDRLKSDRAPYETRAENCAKVTIPSLFPKESDNSSTNYTTPYQA 60 Query: 49 TGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLF 108 G+ L++ + + P + W L S + + + V + V L Sbjct: 61 VGARGVNNLAAKVHMALFPL-EPWMKLKVSEWQAKQLVT-DPEELAMVEQGLSMVERILM 118 Query: 109 GFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNH 168 + E + + L +V G GC Y+ L N + + Sbjct: 119 SYMEAN--SYRTTLHELIRQLVIAGAGCLYLPPPESSSQGSP---MKLYTLHNHVVQRDA 173 Query: 169 QNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKD 228 V + + + K +E + VY + Sbjct: 174 FGNVLQICTLDRVAFAALPEDV-------RTKLDGEHKPDEEIEVYTHVYLDDESGDYLS 226 Query: 229 KGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 + + + + P++ R+ R E YGRS E + L Sbjct: 227 ------YQEIDGEEVEGTDGQYPREAMPWVAVRWTKRDGEHYGRSHVEEYQGDLDSLENL 280 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE 348 + +F ++ + + L GA ++ + +Q + Sbjct: 281 HEAMIKFSMIASKVVGLVNPNGITQVRRLTKAQT--GAFVPGRKADIEFLQLDKAADFSV 338 Query: 349 ELNRLKESIRSLFLLDLFQ--VLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 + + L + + V + +A E RE +G + L E + Sbjct: 339 AKSVADAIEQRLSYVFMLNSAVQRNGERVTAEEIRYVARELEDTLGGVYSILSQELQLPI 398 Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 I L+ L + +P+ P VE L + Q + + LQ + V L Sbjct: 399 IRILLNQLQATQQIPDMPKEAVEPTVSTGVE---ALGRGQDLDKMTQFLQALQLVAPLEN 455 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQ--------RRVM 517 ++ + A L+ E + + Q Sbjct: 456 DQD-------LNITTIKLRLANAMGLDTSGLLLTQEEKAQKQAEMMAQTGGENLAGAAGA 508 Query: 518 EEQHLQQQLQQTSQDIGAKAAGRAME 543 + Q T QD A M+ Sbjct: 509 GAGAMMTQDPDTMQD---AMATAGMD 531 
>gi|313892489|ref|ZP_07826078.1| head-to-tail joining protein [Dialister microaerophilus UPII 345-E] gi|313119068|gb|EFR42271.1| head-to-tail joining protein [Dialister microaerophilus UPII 345-E] Length = 516 Score = 259 bits (662), Expect = 8e-67, Method: Composition-based stats. Identities = 64/506 (12%), Positives = 137/506 (27%), Gaps = 48/506 (9%) Query: 5 SAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIK 56 + + + LK R E + P + + + G+ Sbjct: 9 RKETAKAVYERLKQARTPYIERAVECAKYTIPSLFPRDGSTGSTKFETPYQSVGARGVNN 68 Query: 57 LSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRS 116 L+S L + PP + L+ A Q L + +V + ++ + + E + Sbjct: 69 LASKLMLALFPPNANYFKLSPGDEA-QQELDQTPQAKAQVDQALMKMESKIVEYAEAHQ- 126 Query: 117 GFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVY 176 + L ++ G ++ L+ + + V + Sbjct: 127 -YRVTLAEALKVLIVTGNDLLFLPPKEGG--------MKLYKLNTYVLERDALGNVIQIV 177 Query: 177 REFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHS 236 + D+V KS + + I VY + + Sbjct: 178 AVDKISYVA----LPDEVKRMVDKSGTTPTTSTQVEIYTHVYLEDDQYLS--------YQ 225 Query: 237 KFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFG 296 ++ + + P+I R E YGRS E L + L + + Sbjct: 226 EYKGQIIPQSEQSYPKDKTPWIPLRMVKVDGESYGRSFVEEYLGDFKSLENLTKSIVEAS 285 Query: 297 RLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKES 356 ++ + + R L G +Q + +++ Sbjct: 286 LVAANILFLVNPNGVTRVRHLAKA--KSGDFVSGRIEDIGTLQINKYADLQVVSSTIEQI 343 Query: 357 IRSLFLLDLFQ--VLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDIL 414 L + V +A E E +G + L E ++ R L L Sbjct: 344 TARLSYAFMLNSAVQRQGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRRLLAQL 403 Query: 415 DSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCM 474 S G LP E P +E L + + + + + + +P Sbjct: 404 MSLGQLPALEDGLVEPTITTGLE---ALGRG---HDLNKLITFMQLIQQ------NPQQA 451 Query: 475 DHMDTDRVSRFSLWATNTP-AVLIRD 499 + + ++ A +++ Sbjct: 452 QAIKWNEMTIMEATALGLDVTNIVKT 477 >gi|326424990|ref|YP_004286212.1| virion structural protein [Pseudomonas phage phi15] 
gi|325048394|emb|CBZ42007.1| virion structural protein [Pseudomonas phage phi15] Length = 533 Score = 258 bits (658), Expect = 2e-66, Method: Composition-based stats. Identities = 71/517 (13%), Positives = 138/517 (26%), Gaps = 44/517 (8%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLY----P----YKNNAQLRMWDTTGSEACIKLS 58 + + ++ LK R E P + W G+ LS Sbjct: 11 EGAKATYDRLKTDRSPYETRAENCAKVTIGSLFPAESDNASTNYATPWQAVGARGVNNLS 70 Query: 59 SLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGF 118 + + + P + W L S + L V V + + E + + Sbjct: 71 AKVHLALFPL-EPWMKLKVSEWQAKQMLGN-PEDLAAVEAGLSMVERVMMSYMEAN--SY 126 Query: 119 VGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYRE 178 L +V G Y+ +G + N + V + Sbjct: 127 RTTLHELIRQLVVAGNALLYLPNPEGTQGSP----MKMYTMHNYVCQRDSFGNVLQIVTL 182 Query: 179 FTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKF 238 + K+ R +E + VY G+ + + Sbjct: 183 DKVAFAALPEDVRSKL-------DGDRTPDEEVEVYTHVYRDDE------SGDFLSYQEV 229 Query: 239 VSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRL 298 + + + P+I R+ R E YGRS E L ++ L + +F + Sbjct: 230 DGEEIEGTDGQYPVDAMPWIAVRWTKRDGEHYGRSHVEEYLGDLQSLENLSEAMIKFSMI 289 Query: 299 SLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIR 358 + + + L GA ++ + +Q ++ Sbjct: 290 ASKVIGLVNPNGVTQVRRLTSAQT--GAFVPGRKADIEFLQLEKAADFNIAKAVADNIES 347 Query: 359 SLFLLDLFQ--VLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDS 416 L + + V +A E RE +G + L E ++ L+ L + Sbjct: 348 RLSYVFMLNSAVQRGGERVTAEEIRYVARELEDTLGGVYSILSQELQLPIVRILLNQLQA 407 Query: 417 QGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDH 476 +P+ P E L + Q + LQ +N + + D Sbjct: 408 TQQIPDLPTEAVEPTVSTGAE---ALGRGQ---DLDKMLQFLNALTMVTPLENDQD---- 457 Query: 477 MDTDRVSRFSLWATNTP-AVLIRDTAEVEDIRQQREV 512 ++ + A LI E + Sbjct: 458 LNVKTLKLRIAQAIGVDTTNLILTEDEKAQRMAENMA 494 >gi|325272831|ref|ZP_08139168.1| head-to-tail joining protein [Pseudomonas sp. TJI-51] gi|324102036|gb|EGB99545.1| head-to-tail joining protein [Pseudomonas sp. 
TJI-51] Length = 450 Score = 245 bits (625), Expect = 1e-62, Method: Composition-based stats. Identities = 66/481 (13%), Positives = 145/481 (30%), Gaps = 35/481 (7%) Query: 63 SLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCL 122 + PP + L + L V+ ++ + E + Sbjct: 1 MALLPPNSPFFRLEI-DEFTEEKLTSNPQMHADVQAGLAKIERAVQT--EIETTAIRVTG 57 Query: 123 QSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFT 182 ++ G G Y+ + G+++ PL + + V + + + Sbjct: 58 FELLKHLIVGGNGLVYL-------PQQGGMKF--YPLDRYVVRRDPMGNVLDIVVKEEVS 108 Query: 183 VDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVD 242 + + + V R+ N+ +I + K T + + Sbjct: 109 LAVLPEEARSLVEPGDDSGDTPRDHNKNVSIYTHITLKGET--------WNVYQEVKGQI 160 Query: 243 ENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 ++ R+ E YGRS E L I+ L + + S Sbjct: 161 VPGSRGTYPKDKCAWLPIRFVKIDGENYGRSYVEEYLGDIKSLEGLSQAIVEGSAASAKV 220 Query: 303 PTIAVSEAKQRNFDLKPG-YMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLF 361 + + +L Q + G+ E +N + E + F Sbjct: 221 LFLVNPNGVTSSSELAEAPNGEFVDGVASDVQALQLQKSGDFRVALETINTITERLEFAF 280 Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 +L+ + + +A E E A +G + L EF +++R + + + LP Sbjct: 281 MLN-SAIQRNGERVTAEEIRYMAGELEAALGGVYSILSQEFQLPLVNRIMFSMQRRKKLP 339 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 E P + +E L + + Q ++T++++ P ++ Sbjct: 340 ELPKGTVSPTIVTGME---ALGRG---NDLTKLDQFISTIMQI------PDAASRINWGN 387 Query: 482 VSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 A L++ EV+ +QQ+++Q+ + Q + G + Sbjct: 388 YMTRRATALGIDTDGLVKTDQEVQQEQQQQQMQQAMQSGVAPAVQAAGRMMEKGQPDGSQ 447 Query: 541 A 541 A Sbjct: 448 A 448 >gi|118590948|ref|ZP_01548348.1| hypothetical protein SIAM614_19846 [Stappia aggregata IAM 12614] gi|118436470|gb|EAV43111.1| hypothetical protein SIAM614_19846 [Stappia aggregata IAM 12614] Length = 567 Score = 243 bits (619), Expect = 6e-62, Method: Composition-based stats. 
Identities = 115/565 (20%), Positives = 218/565 (38%), Gaps = 43/565 (7%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYP--YKN----------------------NA 41 D++ + +R + ++ + P + + Sbjct: 4 VDDLKTELQSARAERQWVEADWQDYVTYTAPDMERAFNRPGGVSARDGMSALRGSAARDR 63 Query: 42 QLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCD 101 +++D T +L+S + SL P G WHG+ A + E+ + Sbjct: 64 SRKLYDPTAVWLLDRLASGIGSLTMPEGFPWHGVGFGDPFAPAPSQAD-------EEFFE 116 Query: 102 QVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFY-MEADVDEKGLEEGIRYISVPLS 160 V D LF R RSGF +S S V+ GTG + +E + + + Y VPL Sbjct: 117 LVRDHLFRVRYSGRSGFALANRSRLLSTVKLGTGVLFPVENEDSLADIRTPVHYRYVPLY 176 Query: 161 NVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALA-RNENERFTIIHAVYP 219 +Y+ ++ Q +R T Q V ++ KV + A + +N +T +HA + Sbjct: 177 EIYLVIDAQGNDCGFFRVRTLKAWQAVKEYAGKVSPKVKEDAADAKRKNTDYTFVHACFL 236 Query: 220 KSLTD-KKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEA 278 + + D F S D +P ++ R+ YG P + Sbjct: 237 REGGHAQATDTRKSRFESIHFEEDSGHICRRGGFFEYPLVISRWDRDGLSPYGSPPQAKL 296 Query: 279 LPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPV 338 + I+ L + ++ PP + A++R DL PG +N G + +GR LF+P+ Sbjct: 297 MSDIKSLQSLARDGLIASSQAVRPP--IATHAQERQLDLNPGRINPGLIDEQGRPLFRPM 354 Query: 339 -QFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGG 397 NP ++ ++E +R DL+Q L + R+A E+ + +E +GP Sbjct: 355 IDTVNPGAADAQIETIREKLRVGLYGDLWQTLLEGNGRTATEANIRRKEMADMIGPFSTN 414 Query: 398 LQSEFIGAMISRELDILDSQGNLP---ECEGADNPPVSLLKVEYTSPLFKYQQAESVASA 454 + + A+ RE+ IL +G + + + T+P+ + ++A + Sbjct: 415 IMA-GNEALFEREIGILGRRGAFAPGSPLAPPQSVLEGDVTLTPTAPIDQMREAGHFEAI 473 Query: 455 LQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQR 514 + + DPS +D D + + A PA L R EVE +RQ+R ++ Sbjct: 474 MGFQEYLGIAAGA--DPSILDLHDREAEYDLTRRALGLPAKLRRRPEEVEALRQERAAEQ 531 Query: 515 RVMEEQHLQQQLQQTSQDIGAKAAG 539 + ++ + + + ++D Sbjct: 532 QQQQQLATGESMARIARDGAPLLQA 556 >gi|291334897|gb|ADD94534.1| T7-like head to tail connector [uncultured phage MedDCM-OCT-S08-C159] Length = 416 Score = 239 bits (610), 
Expect = 7e-61, Method: Composition-based stats. Identities = 54/394 (13%), Positives = 112/394 (28%), Gaps = 36/394 (9%) Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 + S + +V G Y+ PLS + Sbjct: 1 MNQIEISNDRVAMFEALKHLVVSGNVLLYLTDKG----------LKVYPLSKFVCKRDEV 50 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDK 229 V + + T + + + +++ K K + I+ + D Sbjct: 51 GNVLEILTKETVHPQALPADFLEQI---KKKENYDAVTMKEDLDIYTYIQRVNDDVF--- 104 Query: 230 GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 ++ + ++ P+I R+ E YGR E + L + Sbjct: 105 ----WYQECKGEKIPNTDGRSKLDVSPWIPLRFIRVDGEDYGRGYVEEYRGDLISLESLM 160 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE- 348 + + S + R L GA+ S +Q G + Sbjct: 161 QAIIEGAAASAKTLFLVNPNGVTRAATLAKA--PNGAIREGLASDISVMQVGKSGDFSVA 218 Query: 349 --ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 + R++ + FL+ V D +AAE +E +G + L EF Sbjct: 219 FSAIQRIEGRLEFAFLMARS-VQRDAERVTAAEVSLMAQELENSLGGIYSILTQEFQLPY 277 Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 + R + +L QG +P+ P + + Q + + + + Sbjct: 278 LRRRMHLLVRQGKVPKLPDELVKPKIVTGL---------QGLGRGNDRNKLIEFIGTVAQ 328 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTP-AVLIRD 499 G +++ D + + A L++ Sbjct: 329 ALGPDVMRQYVNVDEAVKRLATSIGIDTANLVKT 362 >gi|256845624|ref|ZP_05551082.1| predicted protein [Fusobacterium sp. 3_1_36A2] gi|256719183|gb|EEU32738.1| predicted protein [Fusobacterium sp. 3_1_36A2] Length = 550 Score = 238 bits (607), Expect = 2e-60, Method: Composition-based stats. 
Identities = 80/544 (14%), Positives = 192/544 (35%), Gaps = 32/544 (5%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLY--------PYKNNAQLRMWDTTGSEACIKLS 58 + ++ F+ KN + ++ E+ + R ++ ++ L Sbjct: 8 EKLEYYFDNAKNYKEDIRGLYNEVYEYTDVNFSIKDSGTVEKQSKRGVESVILKSQNFLC 67 Query: 59 SLLSSLITPPGQKWHGLAESFSAYQAFLYKE----DARSKKVREWCDQVTDTLFGFRERS 114 + + S I +W + + A++ + + S ++ + + +DT++ Sbjct: 68 NFIMSSIFSKSGRWATVKVNQEAFKKLSGVDGEAAEGLSNEINKVLENNSDTVY--FTND 125 Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDS 174 + + ++ GTG + D Y L N+Y+ ++ + Sbjct: 126 NTNYYTETSKALLDCIKVGTGIRKIIELKDNTKC---FTYAYQNLDNIYILEDNLGKPNI 182 Query: 175 VYREF-TFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKG 233 +++ + ++ I +G +++ K E+ II V +D Sbjct: 183 IFKVYVEKNLNDINDLFGHLPITTP-KGLNEDKLEEKINIIECVVGVFD----EDTSTYK 237 Query: 234 FHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 ++ + E ++ PY V R+++ + +G +E L + L + + Sbjct: 238 YYHGLFTEAFEEMLYEGELNYNPYTVFRWKINSSNPWGIGIGLENLDLFKELKDLKEKRK 297 Query: 294 QFGRLSLHPPT-IAVSEAKQRNFDLKPGYMNIGALS-REGRSLFQPVQFGNP-LPYHEEL 350 + + PP S LK N G + +P+ G LP +++ Sbjct: 298 KHADKIVSPPLNFYGSTDLINKVSLKANAKNYGGSGIGGDKYGVEPINIGTNLLPVEKDI 357 Query: 351 NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRE 410 ++K+ IR +F+ + D +RSA E + + +E + Sbjct: 358 EQVKQEIREVFMSQPLGDVSDTKNRSATEMSLRHEMFRKEFSGTYELINTELLEPTFMNA 417 Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGD 470 I+D +G L E ++ +++Y + L + ++ V + +N + L + Sbjct: 418 YYIMDGKGLLNTTEDESYI--NISQIQYINELTRNAGSDEVINT---INFYMTLSQVVPE 472 Query: 471 PSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTS 530 D + ++ P ++ E++ + Q++ ME+ L Q+ Sbjct: 473 TQRQFIFKIDELIDWASKKMRVPLDVLNSKEEIKQLIAQQQEL-EQMEKMALIQEGIGKR 531 Query: 531 QDIG 534 QD+G Sbjct: 532 QDVG 535 >gi|296537022|ref|ZP_06899017.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] gi|296262651|gb|EFH09281.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] Length = 368 Score = 236 
bits (601), Expect = 9e-60, Method: Composition-based stats. Identities = 80/352 (22%), Positives = 135/352 (38%), Gaps = 19/352 (5%) Query: 113 RSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV 172 RS F + + +V GTG +E G +R+ +VPL + + Sbjct: 34 LDRSNFAVEMHQAFLDLVVAGTGVLLVEEAP--PGALSALRFTAVPLREAVLEEGESGRL 91 Query: 173 DSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNK 232 D++YR I +++ VL + + E R ++ AV+P ++G Sbjct: 92 DTIYRAMALEAAAIAARYPGAVLPPGLGAGSPAQEAPRHRVVEAVWP--------ERGGS 143 Query: 233 GFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNEL 292 + + E + P+I R+ E YGR P M+ALP IR N+ V + Sbjct: 144 AYLAVLEHDGRAWPLAEGRFQDSPFIAFRWLKAPGEAYGRGPVMKALPDIRTANKVVELV 203 Query: 293 AQFGRLSLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEEL 350 + ++ A + L PG + A G + GN L Sbjct: 204 LKNASIAATGIWQAEDDGVLNPATVRLVPGAIIPKAPGSSGLTPLAA--PGNFDVSQLVL 261 Query: 351 NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRE 410 + L+ IR+ L D A+ +A E +E++ + +G G LQ+E + +I R Sbjct: 262 DDLRGRIRAALLADRLGPP-GTAAMTATEVLERSAQTARLLGATYGRLQAELLTPLIGRC 320 Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 L IL +G +P ++ Y SPL + Q A+ L + V Sbjct: 321 LSILRRRGEVPP----LLLDGREARLTYHSPLARVQGRSDAANTLLFLQAVA 368 >gi|325971684|ref|YP_004247875.1| hypothetical protein SpiBuddy_1857 [Spirochaeta sp. Buddy] gi|324026922|gb|ADY13681.1| hypothetical protein SpiBuddy_1857 [Spirochaeta sp. Buddy] Length = 571 Score = 234 bits (596), Expect = 3e-59, Method: Composition-based stats. 
Identities = 98/526 (18%), Positives = 197/526 (37%), Gaps = 30/526 (5%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNY-WMEELT-------GFLYPYKNNAQLRMWDTTGSEA 53 + AK I +++ LK R + E F +++++T+G A Sbjct: 27 DDPLAKAIAAKWSRLKTLRQKTEALRWEACAFVQHRMNEFSDSNNPIKPVKLYNTSGILA 86 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 + + P +W L + ++ + ++ + +F E Sbjct: 87 LDTFINGYHGNLITPSMRWFKLTLTGENFE-----DSDTIHGANDYMEISETQMFA--EL 139 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 +++ F + V GT ++ DV+ + ++ + ++ N +D Sbjct: 140 NKTNFYPLDKLATKDAVVQGTSAEWVYDDVESGTCV----FETIAPWDFWIDKNANGKID 195 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDK---- 229 +++ FT T + ++ DK + ++ + + A+YP+ +K K Sbjct: 196 TIFIRFTMTSADALDRFKDKTPPNILRDVETDAGHNEHEFVLAIYPRKKLRSEKGKVLIS 255 Query: 230 GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 K F + E+ EE FP V + YG M+ L ++RLN Sbjct: 256 TEKPFAAVTYYPVEDCIVEESGYDDFPVAVHVFEQDGTSAYGMGLVMKYLTELKRLNSMS 315 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQ-FGNPLPYHE 348 + + + PP K R F PG N + Q VQ G + Sbjct: 316 RDHLETVQKVAKPPMSIPESLKGR-FSGDPGARNYMGNMDAKPEIIQTVQDIGW---LSQ 371 Query: 349 ELNRLKESIRSLFLLDLFQ-VLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 E+ L+E I LF DLF ++ +A ++ E+ A + ++G Q I ++ Sbjct: 372 EITELEEKIGRLFFNDLFNYLMRQDKVLTATQTQAIKSEELALLASILGTTQYMKINPIV 431 Query: 408 SRELDILDSQGNLPECEGAD-NPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 R I+ LP+ +L++++ PL K + ++ LQ ++ Sbjct: 432 KRVFRIMVKGNRLPKPPKELLRIKNALMRIDLDGPLAKNVKMFAMQDGLQASLEWMQALH 491 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREV 512 + +D+++TD R + A P ++R+ EVE +R+Q++ Sbjct: 492 AMQMTNTLDNINTDIFVRKAFIAAGMPQSVLRELGEVEQMRKQKQA 537 >gi|307946242|ref|ZP_07661577.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307769906|gb|EFO29132.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 519 Score = 234 bits (596), Expect = 3e-59, Method: Composition-based stats. 
Identities = 85/525 (16%), Positives = 174/525 (33%), Gaps = 37/525 (7%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK---------NNAQLRMWDTTGS 51 M SA ++ R N + +R ++E + P++ + ++D T Sbjct: 1 MVDLSA--LKKRRNGAQRERDAFQPLLDEAYQYAIPFRKSAAKTGKGDKRVNDVFDHTAI 58 Query: 52 EACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFR 111 ++ + + + + P GQ L ++ K+ + ++ + F Sbjct: 59 DSAFRFAGKVQQDLWPAGQDNFELEPGPVVL------DENERDKMSKQLAPISKIVQAF- 111 Query: 112 ERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNV 171 F + DE ISVP+ + + N Sbjct: 112 -FDDGDFDMAFHEMALDLSAGNGAMLLNPPGPDEPEKLWEP--ISVPIEELLIENGPNNR 168 Query: 172 VDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGN 231 + +++ + +V + W + +K L + D Sbjct: 169 ISAIFWKRKMSVRVLQDTWPEGKFGENLKKLLKEKPEGEIDV--------NVDTVWVPKE 220 Query: 232 KGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNE 291 + + + + + T P++ RY E YGR P M A+PTI+ LN Sbjct: 221 RRWRMIVWCNKQETAVFQNESRTCPWLFARYFRVPGEAYGRGPVMLAMPTIKTLNTAARL 280 Query: 292 LAQFGRLSLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREGRSLFQPVQFGNPLPY-HE 348 Q +++ V + L+PG A + L + Sbjct: 281 QLQAAAIAMLGIYTTVDDGVFNPDLASLEPGAFWKVARNGGALGPSINRFPDPRLDLSNL 340 Query: 349 ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 LN ++ ++ ++D D A RSA E +E+ + + G L E + + Sbjct: 341 VLNDMRMGVK-ATMMDQSLPADGAAVRSATEILERVKRLASDHLGAYGRLVKEIVIPAVK 399 Query: 409 RELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKT 468 R ++I ++G + L++V SPL ++A+ V +Q + V+ +G Sbjct: 400 RAMEIAYNKGLI---SDEIPIDQLLVRVRVKSPLALAREAQRVEKVIQWLQMVISIGAAV 456 Query: 469 GDPSCMDHM-DTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREV 512 G P + + + P + I E E+ ++Q + Sbjct: 457 GQPGFLQQIAKVETALTQIGRDLGVPEMFIVSEKEREEKKKQDQD 501 >gi|254505325|ref|ZP_05117473.1| hypothetical protein SADFL11_PLAS23 [Labrenzia alexandrii DFL-11] gi|222436169|gb|EEE42851.1| hypothetical protein SADFL11_PLAS23 [Labrenzia alexandrii DFL-11] Length = 490 Score = 233 bits (595), Expect = 4e-59, Method: Composition-based stats. 
Identities = 71/521 (13%), Positives = 149/521 (28%), Gaps = 61/521 (11%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKL 57 K +++R+ L+ +R + P + + + G+ + L Sbjct: 1 MKSLKERYQNLQIKREPFLKRARDCAALTIPTLLPPEGHNATSKLPQPYQGLGARCVVTL 60 Query: 58 SSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSG 117 +S + P GQ + GL E + + T+ + +E + Sbjct: 61 ASRMLVAFIPTGQPFFGLEVPPELLLQEGLMEAPPDLE--KGFALATNLIT--KEIEKKA 116 Query: 118 FVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYR 177 + +V G D L + + + + Sbjct: 117 WRKPTSLTLELLVSTGNALERYMPDNS---------IRVYRLDQYVVVRDLSGNLVELIL 167 Query: 178 EFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSK 237 L + +S L ++ + I + Sbjct: 168 REKVN---------KASLPEQTQSYLKASQEDDVEIFTC------------AKRHPDGWE 206 Query: 238 FVSVDENRFFEEKQ--IATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQF 295 E + E T P+ R+ E YGR E + L+ + Sbjct: 207 IKQEVEGQIIEGMGGVTPTNPFNPLRWSAVPGEDYGRGKVEEHFSDLTYLDLLSKSMVDG 266 Query: 296 GRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLF---QPVQFGNPLPYHEELNR 352 ++ T+ A N + G + Q +E+ R Sbjct: 267 SAMATRHITMVRPNAAGSNLRKRFAEAKNGDVISGNPEDVDLKQFANVTGMQIAQQEIAR 326 Query: 353 LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELD 412 + + + FLL ++ + +A E E + +G + L + + A I + Sbjct: 327 ITQELAQAFLL-SSSMIRNAERVTAQEVRMIAEELESVLGGVYSYLSQDMMSARIEALMT 385 Query: 413 ILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPS 472 + + G LP PV + +E L + + V + LQ + + P Sbjct: 386 SMMAAGQLPPVLQ-MTQPVLTVGLE---ALERDKDVMRVQTVLQTLQAL--------PPD 433 Query: 473 CMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQ 513 +D++D + + + P ++ E + RQQR + Sbjct: 434 FLDYLDIPDLLKTFMIGLGLPGK-VKTEQEAQQTRQQRLMA 473 >gi|281416306|ref|YP_003347546.1| head-to-tail joining protein [Klebsiella phage KP32] gi|262410425|gb|ACY66690.1| head-to-tail joining protein [Klebsiella phage KP32] Length = 461 Score = 231 bits (589), Expect = 2e-58, Method: Composition-based stats. 
Identities = 68/496 (13%), Positives = 131/496 (26%), Gaps = 41/496 (8%) Query: 62 SSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGC 121 + P W L S + L + KV E V + + E + + Sbjct: 1 MLALFPMQS-WMKLTISEYEAKNLLG-DAEGLAKVDEGLSMVERIIMNYIESN--SYRVT 56 Query: 122 LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTF 181 L + G Y+ L++ + + V + Sbjct: 57 LFECLKQLCVAGNALLYLPEPEGYTP------MKLYRLNSYVVQRDAFGNVLQIVTLDKI 110 Query: 182 TVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSV 241 + + V S + + E+ + VY D Sbjct: 111 AFNA----LPEDVRSQVEAAQGEQKEDAEIDVYTHVYLNEAGDGYSKYEE------VAEE 160 Query: 242 DENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLH 301 E + PYI R E YGRS E L ++ L + + ++ Sbjct: 161 VVPGSEAEYPLEECPYIPVRMVRIDGESYGRSYVEEYLGDLKSLENLQESIVKMAMITAK 220 Query: 302 PPTIAVSEAKQRNFDLKPGY-MNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSL 360 + + L ++ Q + G+ + ++ + Sbjct: 221 VIGLVDPAGITQVRRLTAAQSGAFVPGRKQDIEFLQLEKSGDFTVAKNVSDTIEARLSYA 280 Query: 361 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL 420 F+L V +A E E +G + L E ++ L L + + Sbjct: 281 FML-NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQI 339 Query: 421 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD 480 PE P +E + + + + + L GD D ++ Sbjct: 340 PELPKEAVEPTISTGLEAIG------RGQDLDKLERCIAAWSALKALEGD----DDLNLA 389 Query: 481 RVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 + A + +E + +M +Q Q QQ + +G A + Sbjct: 390 NLKLRIANAIGLDT---------AGMLLTQEQKNALMAQQGAQIATQQGAAALGQGIATQ 440 Query: 541 AMEKKLTHDMMENSYG 556 A +S G Sbjct: 441 ATASPEAMAAAADSVG 456 >gi|253583086|ref|ZP_04860294.1| predicted protein [Fusobacterium varium ATCC 27725] gi|251834978|gb|EES63531.1| predicted protein [Fusobacterium varium ATCC 27725] Length = 517 Score = 228 bits (582), Expect = 1e-57, Method: Composition-based stats. 
Identities = 92/522 (17%), Positives = 183/522 (35%), Gaps = 46/522 (8%) Query: 20 RGELNYWMEELTGFLYPYKNNAQLRM--------WDTTGSEACIKLSSLLSSLITPPGQK 71 + ++ E+ + P + ++ +++ S+A + +S + +K Sbjct: 23 KSKIEPLYNEILAYTDPMNSVTTSKLEGTLEGTYVNSSISDAQTSFKNFISYALFGIKKK 82 Query: 72 WHGLAESFSAYQA--FLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSV 129 W + + +E D TD +F + S + + T Sbjct: 83 WAKSDVIKPLLAKKYQGQELIDMIQSYKEKLDVQTDEIFDY--ILASNYEKEIGRALTDW 140 Query: 130 VEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYR-EFTFTVDQIVS 188 E GTGC+ E + + R+ VPL+ + + + Q+ + V+R F +++ I S Sbjct: 141 GELGTGCWKYEE---QNSEKVPFRHQYVPLNELLFNEDLQHRPNIVFRYNFKYSLWDIRS 197 Query: 189 KWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFE 248 + LS NENE T+I V P + TD F + Sbjct: 198 LYKKADLSC----YDGINENEEVTVIECVMPVAETDT--------FEWILFDERMDNVLY 245 Query: 249 EKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTI-AV 307 K PY + R+ V + ++GR + L RL N A+ + PP + Sbjct: 246 RKIYNYNPYTIFRFTVMPNNVWGRGLGVTCLDYYERLCYCENLRARQSIRIVEPPLLLVG 305 Query: 308 SEAKQRNFDLKPGYMNIGALSREGRSLFQPVQF-GNPLPYHEELNRLKESIRSLFLLDLF 366 + FDL P +N G G++ P+ G LP +++ R + I+++ + Sbjct: 306 DKRLIDGFDLDPNGLNWGGDGITGQANAVPMNTTGTLLPLDQDIQRYTQVIQAIHFNNPM 365 Query: 367 QVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGA 426 ++++ +R AE + + L E + ++ IL + + + + Sbjct: 366 GSVENRTTRGNAEMGYRMQLFNQKFSDATSNLYDEVLIPTFAKPKQILQDKNIVKKIDED 425 Query: 427 DNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFS 486 + ++ + L + E + + TV ++ D F Sbjct: 426 -----KYFQAKFVNLLTETVDMEEIQKLSTYIQTV----QGFYPEVRTATLNKDNTLNFI 476 Query: 487 LWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQ 528 P L ++QR+ +M +Q LQ Q Sbjct: 477 ADTFTVPVYL-------RATKEQRQESEEMMMKQALQMQAVA 511 >gi|315518948|dbj|BAJ51825.1| putative head to tail joining protein [Ralstonia phage RSB2] Length = 531 Score = 221 bits (563), Expect = 2e-55, Method: Composition-based stats. 
Identities = 75/559 (13%), Positives = 155/559 (27%), Gaps = 61/559 (10%) Query: 13 FNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 + L+N R E+ + P + + + G+ L++ L Sbjct: 19 YTRLENDRAPYITRAEKNAQYTIPSLFPKSSDNYSTDYPTPYQSVGARGLNNLAAKLVLS 78 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREW---CDQVTDTLFGFRERSRSGFVGC 121 + P G+ +H L S + + V + E +G Sbjct: 79 LIPVGEPFHRLTISEFDVKETAGGTGEEGSVMERAQVGLSMVERIITAHGE--SAGLRPM 136 Query: 122 LQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTF 181 ++ G G + + L N + + V + Sbjct: 137 ASELMKQLLVAGNGLVCLPPQE--------VACKLYKLHNFVVERDSVGNVLQTIAK-DV 187 Query: 182 TVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSV 241 T + + L N T+ Y +D+ + + Sbjct: 188 TAYVALPEEVKAALPEG-----DYQPNSPITMYTHCYRDLESDQWLA------YQEVEGE 236 Query: 242 DENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLH 301 PYI R + E YGRS E + + L + QF Sbjct: 237 VIPGSENTYPKEGNPYIPIRMYKQDGENYGRSFVEEYIGDLVSLENISKAIVQFAIACSK 296 Query: 302 PPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEE---LNRLKESIR 358 + + + G + + Q + + +++ + Sbjct: 297 ILFLVKPGSSTSVRRVAKAAS--GDFVPGKKEDIEVFQMEKFADFQTAKSVADGIEQRLS 354 Query: 359 SLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQG 418 FLL+ V +A E + E + +G + L +EF ++ R L L + G Sbjct: 355 FAFLLNSS-VQRSGERVTAEEIRFVSAELESTLGGVYSVLATEFQLPIVRRWLIDLQATG 413 Query: 419 NLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMD 478 +P+ P + ++ + + Q +A+ + + +D Sbjct: 414 KIPDLPTEALKPQIITGID---AIGRGQDQAKLAAFQSLIQ--------PFVQRVSNRVD 462 Query: 479 TDRVSRFSLWATNT-PAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKA 537 D + + A+ PA LI + + ++ + Q L Q GA A Sbjct: 463 WDGLLLKAANASGLDPAGLILTD----------QQMQARATQEGITQGLVQGGASAGATA 512 Query: 538 AGRAMEKKLTHDMMENSYG 556 + ++ + G Sbjct: 513 GQGMGAAMTDPEGIQQALG 531 >gi|167565008|ref|ZP_02357924.1| head-to-tail joining protein [Burkholderia oklahomensis EO147] Length = 509 Score = 210 bits (533), Expect = 6e-52, Method: Composition-based stats. 
Identities = 79/556 (14%), Positives = 153/556 (27%), Gaps = 64/556 (11%) Query: 9 IQDRFNYLKNQRGELNYWMEELTGFLYPY----KNNAQLRM----WDTTGSEACIKLSSL 60 ++DR+ L R + P ++ + + G +SS Sbjct: 4 LKDRYQELVPDRDPYFRRAQACAALTVPSVCPPDGQTSQQILPQSYTSFGHRGATNVSSK 63 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 L PPG + S + ++ + Q + + + Sbjct: 64 LMMAFMPPGDSAFNIEVSTQVLLQEG--VLSPPPEIVKGLAQCEQLINA--KIEALNWRR 119 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 +V G Y++ D R LS + V Sbjct: 120 QTYLSLLHLVVAGNVGEYIQPDG---------RLKIFSLSQFVCVRDFNGRVMEAVTAEK 170 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 V + L ++ A+ E E T+ + D+ H Sbjct: 171 LKVRE---------LPKDLQRVTAKKEREDVTLYT-------RFEWVDENRYAVHQDLDD 214 Query: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 + E I P+ + + E YGRS + + L++T +L + G ++ Sbjct: 215 AVVKPYQEYNGI--MPFNALAWELVPGESYGRSHVEQNYSDLIALDKTSQQLLECGAIAA 272 Query: 301 HPPTIAVSE---AKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYH---EELNRLK 354 R ++ ++ + + QP QF N E LK Sbjct: 273 RNLIFVAPNAAGGNLRKRIMEARNGSVISARGGTQGDVQPFQFNNMAAMQSLNAEKQDLK 332 Query: 355 ESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDIL 414 + FLL + D +A E E +G + L E IG + + + + Sbjct: 333 RDLAVAFLLTN-DLRRDAERVTAYELQMLVTEIEQSLGGVYSYLGPEMIGWRLKKLVAQM 391 Query: 415 DSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCM 474 S+ LP+ + L K + + V S L +N + Sbjct: 392 QSKDELPKIGKDSTQITVTTGLA---ALGKDAKLKKVHSFLSLLNETPQAFQ----QEAA 444 Query: 475 DHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIG 534 ++ D + + A P + + + ++ Q ++ Sbjct: 445 AYVKFDTILTPAAAALGFPQSI-----------KTAQEVQQEQAAAQEQAMQADMARAAA 493 Query: 535 AKAAGRAMEKKLTHDM 550 AG+ L Sbjct: 494 GPVAGQIAANTLAPAQ 509 >gi|291335778|gb|ADD95380.1| hypothetical protein [uncultured phage MedDCM-OCT-S05-C429] Length = 315 Score = 209 bits (531), Expect = 1e-51, Method: Composition-based stats. 
Identities = 48/325 (14%), Positives = 96/325 (29%), Gaps = 22/325 (6%) Query: 229 KGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNET 288 G +H + + P++V + E YGR E L ++ L Sbjct: 3 NGRWVWHQEVLDKIIPNTRSTAPKNASPWLVLTFNSVDGEQYGRGRVEEFLGDLKSLEGL 62 Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE 348 L + + + + + + GA+ + Q VQ G + Sbjct: 63 SQALVEGAAAASKVIFLVSPSSTTKPATIAKAG--NGAIVQGRAEDVQVVQVGKTADFST 120 Query: 349 ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 N + R L L + + +A E E +G + L F+ + Sbjct: 121 AANMSQTIERRLLEAFLVMNVRNAERVTAEEVRLTQLELEQQLGGIFSLLTVSFLIPYLD 180 Query: 409 RELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKT 468 R L +L LP+ P V + L + Q E++ + + Sbjct: 181 RTLLVLQRTNELPKLPKDIIRPTI---VAGVNALGRGQDREALT------QFMGTIAQTI 231 Query: 469 GDPSCMDHMDTDRVSRFSLWATNTPA-VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQ 527 G + ++ + A L++ ++ +++ QQ Q Sbjct: 232 GPEALGQFINPLEAIKRLAAAQGIDVLNLVKTQEQLAGEKEE----------AMQMQQQQ 281 Query: 528 QTSQDIGAKAAGRAMEKKLTHDMME 552 G A + + + MM+ Sbjct: 282 TLLNQAGQFANSKLADTENMQGMMQ 306 >gi|125999995|ref|YP_001039666.1| head portal-like protein [Erwinia amylovora phage Era103] gi|121621851|gb|ABM63425.1| head portal-like protein [Enterobacteria phage Era103] Length = 517 Score = 208 bits (529), Expect = 2e-51, Method: Composition-based stats. 
Identities = 60/536 (11%), Positives = 145/536 (27%), Gaps = 42/536 (7%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPY------KNNAQLRMWDTTGSEACIKLSSL 60 I + L +R E + F PY + + W G+ A LS+ Sbjct: 10 SKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDDLSSQNAWQDDGASATNFLSNK 69 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 LS ++ P + + + + + E ++ V + E + F Sbjct: 70 LSQVLFPAQRSFFRIDLTPEGIKQL-DNEAMTQSTAQKLLSDVEKAAMLYGESLQ--FRP 126 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 + + ++ G Y I +VPL + + ++ V + Sbjct: 127 AVVEAFKHLIVTGNVMMY------HPDKTSPI--QAVPLHHYCVRRDNNGTVLDIV---F 175 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 + + ++ + + +++ ++ ++ K + + Sbjct: 176 LQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTKDGKYLIRQSADDVPVGKE 235 Query: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 P+++ ++ E YGR A + + LA+ L Sbjct: 236 STVTE-------DKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMA 288 Query: 301 HPPTIAVSEAKQRNFDLKPGYMNIGALS-REGRSLFQPVQFGNPLPYHEELNRLKESIRS 359 + + G + Q ++ + P LN ++ I Sbjct: 289 DVKYLVKPGSYTDINQFVEGGSGAVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGR 348 Query: 360 LFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGN 419 +F+++ D +A E +G + + F G L G Sbjct: 349 VFMMEAMTR-RDAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGP-----LARWFMNGI 402 Query: 420 LPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDT 479 + P L +E L + + + + + V+ + + Sbjct: 403 SSILTSKNVSPTILTGIE---ALGRMAELDKLGTFNGYVSMTAQW-----PEPLQQAIKW 454 Query: 480 DRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGA 535 + + + + E+ Q ++ Q + G Sbjct: 455 PDFTDWVQGQISANFPFFKTQDELNAEAQAQQEQEATKYAAEQAGKAIPDMVKNGQ 510 >gi|311875235|emb|CBX44494.1| bacteriophage head-to-tail connecting protein [Erwinia phage phiEa1H] gi|311875356|emb|CBX45097.1| head-to-tail connecting protein [Erwinia phage phiEa100] Length = 517 Score = 207 bits (527), Expect = 4e-51, Method: Composition-based stats. 
Identities = 59/536 (11%), Positives = 145/536 (27%), Gaps = 42/536 (7%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPY------KNNAQLRMWDTTGSEACIKLSSL 60 I + L +R E + F PY + + W G+ A LS+ Sbjct: 10 SKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDDLSSQNAWQDDGASATNFLSNK 69 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 LS ++ P + + + + + E ++ V + E + F Sbjct: 70 LSQVLFPAQRSFFRIDLTPEGIKQL-DNEAMTQSTAQKLLSDVEKAAMLYGESLQ--FRP 126 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 + + ++ G Y I +VPL + + ++ + + Sbjct: 127 AVVEAFKHLIVTGNVMMY------HPDKTSPI--QAVPLHHYCVRRDNNGTILDIV---F 175 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 + + ++ + + +++ ++ ++ K + + Sbjct: 176 LQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTKDGKYLIRQSADDVPVGKE 235 Query: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 P+++ ++ E YGR A + + LA+ L Sbjct: 236 STVTE-------DKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMA 288 Query: 301 HPPTIAVSEAKQRNFDLKPGYMNIGALS-REGRSLFQPVQFGNPLPYHEELNRLKESIRS 359 + + G + Q ++ + P LN ++ I Sbjct: 289 DVKYLVKPGSYTDINQFVEGGSGAVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGR 348 Query: 360 LFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGN 419 +F+++ D +A E +G + + F G L G Sbjct: 349 VFMMEAMTR-RDAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGP-----LARWFMNGI 402 Query: 420 LPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDT 479 + P L +E L + + + + + V+ + + Sbjct: 403 SSILTSKNVSPTILTGIE---ALGRMAELDKLGTFNGYVSMTAQW-----PEPLQQAIKW 454 Query: 480 DRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGA 535 + + + + E+ Q ++ Q + G Sbjct: 455 PDFTDWVQGQISANFPFFKTQDELNAEAQAQQEQEATKYAAEQAGKAIPDMVKNGQ 510 >gi|167841465|ref|ZP_02468149.1| head-to-tail joining protein [Burkholderia thailandensis MSMB43] Length = 519 Score = 206 bits (525), Expect = 6e-51, Method: Composition-based stats. 
Identities = 58/506 (11%), Positives = 138/506 (27%), Gaps = 39/506 (7%) Query: 13 FNYLKNQRGELNYWMEELTGFLYPY---------KNNAQLRMWDTTGSEACIKLSSLLSS 63 + L R L E+ + F P + + + G++ L++ L Sbjct: 9 WESLAGLRRPLLTRCEKYSAFTLPTIITPQGYNEELEELQTDFQSVGAQGVNNLANKLML 68 Query: 64 LITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQ 123 + P + + + + D + + ++E + R G L Sbjct: 69 ALFAPSRPFFRYQVAAALMNQLKQTLDVQEQDLQEMLAEGERNC--IRTLDAMGVRPKLY 126 Query: 124 SFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTV 183 ++ G + D + + L + + + + T Sbjct: 127 EAMKHLIITGNCLLILGDDPKDTP------MRVLSLKRYAVKRSMSGKLLQLIIHETVRF 180 Query: 184 DQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDE 243 D++ + + S + A + + D H Sbjct: 181 DELDDEVQKIAVESSSRYANVDPNDPNSCPEVKYFTWVRWDG--TANYIVTHHVDNVELP 238 Query: 244 NRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPP 303 +F + PYI + + D YG + + L+ + L+ Sbjct: 239 AKFSGKYTDQDLPYIPLTWELHDDNDYGTGLVEQMAGDLAALSALSEAEVKGAILASEFR 298 Query: 304 TIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYH---EELNRLKESIRSL 360 + + R D+ + GA + P+ G + I Sbjct: 299 WLVNPAGQTRPADI--ADSDNGAALPGTKDDVVPLNSGTGQAMQYIDTVATKYVNRIGRN 356 Query: 361 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL 420 FLL ++ D +A E + E +G + L +F M + G Sbjct: 357 FLL-SSSIVRDAERVTAEEIRMQANELETSLGGVYSRLAVDFQKPM---AYWLTKRAGV- 411 Query: 421 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTD 480 + G D P+ + ++ S + + + + + + P + ++ Sbjct: 412 -QLAGKDIEPMVITGLDALS------RNGDLDNLKLALQDLAAVSGM--PPQALAVLNLT 462 Query: 481 RVSRFSLWATNTP-AVLIRDTAEVED 505 +++ A ++ + Sbjct: 463 AIAKAIFMGRGVTMADYVKSQEQQAA 488 >gi|259419010|ref|ZP_05742927.1| hypothetical protein SCH4B_4395 [Silicibacter sp. TrichCH4B] gi|259345232|gb|EEW57086.1| hypothetical protein SCH4B_4395 [Silicibacter sp. TrichCH4B] Length = 506 Score = 198 bits (503), Expect = 2e-48, Method: Composition-based stats. 
Identities = 74/512 (14%), Positives = 157/512 (30%), Gaps = 35/512 (6%) Query: 8 DIQDRFNYLKNQRGEL-NYWMEELTGFLYPYKNNAQLR----------MWDTTGSEACIK 56 + RF+ K+ R + E+ F + + ++ T E + Sbjct: 4 EFDRRFSVAKSHRKQHVEEDGREVYKFCFNGREREWDNNSSYKDEPEEIFVETPGEVAEE 63 Query: 57 LSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRS 116 S L S +TP W + + +++ + + S Sbjct: 64 FSGDLFSTMTPENSPWSEFEAGNAVDEDDEAAAKEELEELEKAISK---------SLRSS 114 Query: 117 GFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVY 176 + + + G ++ D L I + +VP+ +Y++ + D + Sbjct: 115 NYYDEGPTAFQD-AVVGNVAMWV----DRPTLNGAINFEAVPIPQLYVTPGPLGIEDR-F 168 Query: 177 REFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHS 236 R F + + D ++ + ++ N ++H + + ++ Sbjct: 169 RRQRFHYRNLKVLFPDAKFPRAIEDKIKKSSNALAVVVHGFWRTFEDVENPVWRHEIRVD 228 Query: 237 KFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFG 296 + + +VGR+ A +GR P + LP R+ +E V + Sbjct: 229 GKPIGLDKDVGSIGAVN---LVVGRFNPYAGSAWGRGPGRKLLPVFRQYDELVRMNMEGL 285 Query: 297 RLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKES 356 +L PP + + + QPV FG +L++ Sbjct: 286 DRTLDPPFTYPHDGMLDLSQGLENGVGY-PTMPGTKDALQPVLFGTLDYGFFSEEKLEQK 344 Query: 357 IRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDS 416 IR F + K SA++ + + ++ + EF ++SR + Sbjct: 345 IRDGFYREKE--QAGKTPPSASQYIGQENKQVRRMARPATKTWREFGVGLLSRVEWLERQ 402 Query: 417 QGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDH 476 G E ++ SPL + Q + V +A + + + G Sbjct: 403 PGGSLEGAELPLIDSGVVNARPISPLERAQAMQDVTTADMIIGMIN---ERLGPEQAAML 459 Query: 477 MDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQ 508 + R V R AE+E + + Sbjct: 460 IKGTDTYRKIKEVLKDQIVEFRSEAEIEALIK 491 >gi|108862014|ref|YP_654130.1| 29 [Enterobacteria phage K1-5] gi|40787100|gb|AAR90071.1| 29 [Enterobacteria phage K1-5] Length = 516 Score = 197 bits (500), Expect = 5e-48, Method: Composition-based stats. 
Identities = 55/488 (11%), Positives = 139/488 (28%), Gaps = 45/488 (9%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPY------KNNAQLRMWDTTGSEACIKLSSL 60 I + N+R + + PY N W G++A L++ Sbjct: 14 SKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGDNETSQNGWQGVGAQATNHLANK 73 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 L+ ++ P + + + + + + + ++ QV +E + F Sbjct: 74 LAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKK-TELATIFAQVETR--AMKELEQRQFRP 130 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 + + ++ G+ Y + ++P+ + ++ + + + Sbjct: 131 AVVEAFKHLIVAGSCMLYKPSKGA---------ISAIPMHHYVVNRDTNGDLLDIILLQE 181 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK-KKDKGNKGFHSKFV 239 + V+ +K + ++ HA Y + K+ + Sbjct: 182 KALRTFDPAT-RAVVEVGLKGKKCKEDDSVKLYTHAKYLGDGFWELKQSADDIPVGKVSK 240 Query: 240 SVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLS 299 E P+I ++ E +GR A + + + +A+ L Sbjct: 241 IKSEK----------LPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALM 290 Query: 300 LHPPTIAVSEAKQRNFDLKP-GYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIR 358 + A+ G + E + Q ++ + P L I Sbjct: 291 ADIKYLIRPGAQTDVDHFVNSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIG 350 Query: 359 SLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQG 418 +F+++ D +A E E +G + + + + L G Sbjct: 351 VVFMMETMTR-RDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPV---AMWGLLEAG 406 Query: 419 NLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMD 478 PV + +E L + + + +A+ Q ++ + + Sbjct: 407 E--SFTSDLVDPVIITGIE---ALGRMAELDKLANFAQYMSL-----PLQWPEPVLAAVK 456 Query: 479 TDRVSRFS 486 + Sbjct: 457 WPDYMDWV 464 >gi|83571754|ref|YP_425006.1| putative head-tail connector [Enterobacteria phage K1E] gi|83308205|emb|CAJ29437.1| gp29 protein [Enterobacteria phage K1E] Length = 516 Score = 196 bits (499), Expect = 5e-48, Method: Composition-based stats. 
Identities = 52/487 (10%), Positives = 138/487 (28%), Gaps = 43/487 (8%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPY------KNNAQLRMWDTTGSEACIKLSSL 60 I + +R + + PY N W G++A L++ Sbjct: 14 SKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDNETSQNGWQGVGAQATNHLANK 73 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 L+ ++ P + + + + + + + ++ QV +E + F Sbjct: 74 LAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKK-TELATIFAQVETR--AMKELEQRQFRP 130 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 + + ++ G+ Y + ++P+ + ++ + + + Sbjct: 131 AVVEAFKHLIVAGSCMLYKPSKGA---------ISAIPMHHYVVNRDTNGDLLDIILLQE 181 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 ++ + + + E +I K +G Sbjct: 182 KSLRT----FDPATRAVVEVGLKGKKCKEDDSI-----KLYTHAKYLGEGFWELKQSADD 232 Query: 241 VDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSL 300 + + + K P+I ++ E +GR A + + + +A+ L Sbjct: 233 IPVGKVSKIK-SEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMA 291 Query: 301 HPPTIAVSEAKQRNFDLKP-GYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRS 359 + A+ G + E + Q ++ + P L I Sbjct: 292 DIKYLIRPGAQTDVDHFVNSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGV 351 Query: 360 LFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGN 419 +F+++ D +A E E +G + + + + L G+ Sbjct: 352 VFMMETMTR-RDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPV---AMWGLLEAGD 407 Query: 420 LPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDT 479 PV + +E L + + + +A+ Q ++ + + Sbjct: 408 --SFTSDLVDPVIITGIE---ALGRMAELDKLANFAQYMSL-----PLQWPEPVLAAVKW 457 Query: 480 DRVSRFS 486 + Sbjct: 458 PDYMDWV 464 >gi|31711672|ref|NP_853590.1| head portal protein [Enterobacteria phage SP6] gi|31505676|gb|AAP48769.1| gp30 [Enterobacteria phage SP6] gi|40787047|gb|AAR90021.1| 29 [Enterobacteria phage SP6] Length = 515 Score = 189 bits (480), Expect = 1e-45, Method: Composition-based stats. 
Identities = 55/489 (11%), Positives = 138/489 (28%), Gaps = 47/489 (9%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLYPY------KNNAQLRMWDTTGSEACIKLSSL 60 I + +R + PY N W G++A L++ Sbjct: 13 SKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQGVGAQATNHLANK 72 Query: 61 LSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVG 120 L+ ++ P + + + + + + L + ++ +V T + + F Sbjct: 73 LAQVLFPAQRSFFRVDLT-AKGEKVLDDRGLKKTQLATIFARVETT--AMKALEQRQFRP 129 Query: 121 CLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFT 180 + + ++ G Y + +VP+ + ++ + + V Sbjct: 130 AIVEVFKHLIVAGNCLLYKPSKGA---------MSAVPMHHYVVNRDTNGDLMDVI---- 176 Query: 181 FTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 ++ + + + + E +GF S Sbjct: 177 LLQEKALRTFDPATRMAIEVGMKGKKCKED--------DNVKLYTHAQYAGEGFWKINQS 228 Query: 241 VDENRFFEEKQIAT--FPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRL 298 D+ +E +I + P+I ++ E +GR A + + + +A+ L Sbjct: 229 ADDIPVGKESRIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAAL 288 Query: 299 SLHPPTIAVSEAKQRNFDLKP-GYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESI 357 + ++ G + E + Q ++ + P L I Sbjct: 289 MADIKYLIRPGSQTDVDHFVNSGTGEVITGVAEDIHIVQLGKYADLTPISAVLEVYTRRI 348 Query: 358 RSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQ 417 +F+++ D +A E E +G + + + L Sbjct: 349 GVIFMMETMTR-RDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPI---AMWGLQEA 404 Query: 418 GNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHM 477 G+ PV + +E L + + + +A+ Q ++ + Sbjct: 405 GD--SFTSELVDPVIVTGIE---ALGRMAELDKLANFAQYMSLPQTW-----PEPAQRAI 454 Query: 478 DTDRVSRFS 486 + Sbjct: 455 RWGDYMDWV 463 >gi|13186164|emb|CAC33475.1| hypothetical protein [Legionella pneumophila] Length = 519 Score = 185 bits (469), Expect = 2e-44, Method: Composition-based stats. 
Identities = 80/501 (15%), Positives = 172/501 (34%), Gaps = 48/501 (9%) Query: 3 QRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPY---------------KNNAQLRMWD 47 + + N K+ ++ + P ++D Sbjct: 27 KLDVNRLCRMRNDAKSDLDMWRSILQTAYHYSMPDYNPFENYGLAGFLTPGQQYNADIYD 86 Query: 48 TTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTL 107 T A +L+ + + P GQ+W + + D Sbjct: 87 LTLPIAHKRLADKMLMNMVPQGQQWVKFTPGDEFGEPGTPLYQRALDATQRMTD------ 140 Query: 108 FGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVN 167 F+ RS F + V TG + +E + +RY +VP + V + Sbjct: 141 HFFKIIDRSNFYLAVGESLQD-VLISTGIIAI----NEGNRKRPVRYEAVPPAQVMFQGD 195 Query: 168 HQNVVDSVYR-EFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKK 226 + VD+++R + ++ I S W + + L + ++ I + +K Sbjct: 196 AEGQVDAIFRDWYQVRIENIKSMWPKAEV-----AKLNKKPEDKVDIWECAWIDYEAPEK 250 Query: 227 KDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLN 286 + + V E+ +++P++V R R EI GR P++ A PT +N Sbjct: 251 E------RYQYVVMTSSKDVLLEQSNSSWPWVVYRMRRLTGEIRGRGPSLSAYPTAATIN 304 Query: 287 ETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR-EGRSLFQPVQFGNPLP 345 + + + +P +A S++ P +I + +G +P + + Sbjct: 305 QALEDELVAAAFQANPMYMAASDSAFNQQTFTPRPGSIVPVQMVQGEWPIKPFEQSGNIQ 364 Query: 346 YHEEL-NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIG 404 ++ L N ++ I L + + +R+A E+ + E ++ LQ+EF Sbjct: 365 FNALLVNDFRQQINELLYAFPLGAV-NSPTRTATEAEIRYTENLESFSAMVPRLQNEFFI 423 Query: 405 AMISRELDILDS-----QGNLPECE--GADNPPVSLLKVEYTSPLFKYQQAESVASALQG 457 +I R L +++ N+P+ + +L + + +PL + A+ L Sbjct: 424 PVIQRTLWVINKVLPETFANIPDDIRNKMISVDGQILGLSFDTPLMTAKGQVKTAALLGF 483 Query: 458 VNTVVELGVKTGDPSCMDHMD 478 L + + +D + Sbjct: 484 YQAAASLLGQEAATASLDPVK 504 >gi|320158420|ref|YP_004190798.1| head-to-tail joining protein [Vibrio vulnificus MO6-24/O] gi|319933732|gb|ADV88595.1| head-to-tail joining protein [Vibrio vulnificus MO6-24/O] Length = 437 Score = 183 bits (464), Expect = 6e-44, Method: Composition-based stats. 
Identities = 63/474 (13%), Positives = 122/474 (25%), Gaps = 43/474 (9%) Query: 63 SLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCL 122 + PP + L S D++ + Q + E R L Sbjct: 1 MALFPPSHPFVRLGVSNELIAKL-DLTDSKKGDLETALSQTEQLI--VTELERRALRSLL 57 Query: 123 QSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFT 182 ++ G G Y+ + L + + Q + Sbjct: 58 YEDIKHLLVTGNGLLYVGSKESRF----------YRLDKYVVERDDQGAPTRIVVCEKIN 107 Query: 183 VDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVD 242 ++ + + R + FT+I + + + Sbjct: 108 FRKLPDAMQFAIREKRRLKGDPRKDLNLFTMIELKGD-----------QWRSYQEVEGMR 156 Query: 243 ENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 + P+IV E YGRS E + + L V + Q + Sbjct: 157 VPDSESNYRKDRTPWIVCTMNRLDGEDYGRSFCEEHIGDMNTLESLVKAITQASIAASKV 216 Query: 303 PTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELN---RLKESIRS 359 + A R L G + R +Q N ++ + Sbjct: 217 IFMVKPNASTRASTLSKA--KNGDYIQGDREDVGCLQLDKAHDMAIAQNLKAEIQAGLSE 274 Query: 360 LFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGN 419 FL+ V D +A E T+ +G L L +++ L ++ G Sbjct: 275 AFLM-SSAVRRDAERVTAEEIRMMTQMLEESLGGLYSQLAQSLQLPLVNVLLGHMERDGI 333 Query: 420 LPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDT 479 LP P+ + VE L + + + + + V V M Sbjct: 334 LPHFPEGTFEPIVITGVEG---LGREAELSRLNTFVSLVQQVGA-------EQAAKEMHL 383 Query: 480 DRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQD 532 + + L++ E + Q Q + + + + Q Sbjct: 384 GELFKRYAANLQIETKGLMKTAEEKQQELQ--AEQMNQIVQTATPEVVHGAMQQ 435 >gi|289976621|gb|ADD21666.1| head-to-tail joining protein [Caulobacter phage Cd1] Length = 509 Score = 182 bits (461), Expect = 2e-43, Method: Composition-based stats. 
Identities = 63/501 (12%), Positives = 132/501 (26%), Gaps = 49/501 (9%) Query: 6 AKDIQDRFNYLKNQRGELNYWMEELTGFLYP---------YKNNAQLRMWDTTGSEACIK 56 AK R++ L N+R +E + ++ G +A Sbjct: 4 AKQASARWSQLDNKRRGFIERLETYASWTIAKLCTPSGYDQNHSELSHGTQAVGGQAVNH 63 Query: 57 LSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRS 116 L++ + + P + + L S Q L + + + Q + R Sbjct: 64 LANKIMLALFAPSRPFFRLDPSD-KMQKELAAANVNEQALALILSQGEKR--AIQALDRM 120 Query: 117 GFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVY 176 L +++ G D + + + + V + Sbjct: 121 ALRPKLYEAIKNLIVLGNVMLEFTKDT----------MRVIGIKRYCVRRSASGEVLELI 170 Query: 177 REFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHS 236 + T D++ + + + + R + ++ + + + Sbjct: 171 IKDTMQFDEL-----EPSVQEECRRQGMRPLEDAEVSLYRWIVRQDNGDYRMTQH----- 220 Query: 237 KFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFG 296 +F + P+ V + + D YG + L Q Sbjct: 221 VDNIELSKKFQGKWSKDKLPFRVLTWDLSDDAHYGTGLVEDYRGDFAGLTMLSTAQVQAA 280 Query: 297 RLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKES 356 LS + + D GA + VQ G L+ E Sbjct: 281 ILSSEFRWLVNPAGMTKPEDF--RDSENGAAIPGVQGDVSLVQSGKAADLQVILSVNAEY 338 Query: 357 IRSLF--LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDIL 414 I + L + D +A E + E +G L +F M ++ Sbjct: 339 INRIARGFLMGSAMTRDAERVTAEEIRMQASELETSLGGAYSRLAVDFQIPM---AYWLM 395 Query: 415 DSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCM 474 EG D P + ++ S + + + + V LG T P + Sbjct: 396 KKVDM--SIEGTDVEPSIVTGLDALS------RGGDLENLKLFLADVAGLG--TLPPPVL 445 Query: 475 DHMDTDRVSRFSLWATNTPAV 495 + + + A + Sbjct: 446 AVLKVEPLLAAFATARRIKSS 466 >gi|149408206|ref|YP_001294640.1| hypothetical protein ORF047 [Pseudomonas phage PA11] Length = 584 Score = 176 bits (447), Expect = 6e-42, Method: Composition-based stats. 
Identities = 70/578 (12%), Positives = 159/578 (27%), Gaps = 58/578 (10%) Query: 3 QRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLR---MW-DTTGSEAC---- 54 SA+ + ++ NQR + +EL +++ W ++T Sbjct: 15 DSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTTTSNQGLPWKNSTTLPKLCQIR 74 Query: 55 IKLSSLLSSLITPP--GQKWHGLAESFSAYQAFLYKEDARSKKVRE--WCDQVTDTLFGF 110 L S S + P +W G + S + S K RE + +V+ ++ + Sbjct: 75 DNLHSNYFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDY 134 Query: 111 RERSRSGFVGCLQSF-YTSVVEFGTGCFYMEAD-------------------------VD 144 + + F Y + + Y+ Sbjct: 135 IDYGNA-FATVSFEAKYKEMTDGTLVPDYIGPRLVRISPLDIVFNPLATSISDTFKIVRS 193 Query: 145 EKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALA 204 K E +R Y + + ++V+ G V + Sbjct: 194 VKTKGELMRLAQDEPEQSYWLEALKRREEICRHLGGYSVEDFDKAAGFDV--DGFGNLYE 251 Query: 205 RNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRV 264 ++ I+ + + + N+ S + + P +R Sbjct: 252 YYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAPIYHVGWRF 311 Query: 265 RADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNI 324 R D ++ P + R++ N A L + PP + + F PG Sbjct: 312 RPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLKII--GEVEEFVWGPGAEIH 369 Query: 325 GALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKT 384 + + + + V + ++ + + + ++A E + Sbjct: 370 LDQGGDVQEIAKNVNYIINADNQIQMLEDRMEL-YAGAPREAMGIRTPGEKTAFEVQQLG 428 Query: 385 REKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKV-EYTSPLF 443 G + + E + +++ L+ + + L V E+ S Sbjct: 429 NAAGRIFQEKVTTFEVELLEPVLNAMLETATRN---MDGSDVIRVMDTDLGVKEFMSVTR 485 Query: 444 ---------KYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPA 494 + A Q + +V + + H ++ F T Sbjct: 486 EDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQMILPHTSGKALATFVDDVTGLQG 545 Query: 495 -VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQ 531 + R V + + + + + E+ LQ Q+ Sbjct: 546 YEIFRPNVAVAEQAETQSLVAQAQEDLQLQAQMPAEGA 583 >gi|197935883|ref|YP_002213719.1| head portal-like protein [Ralstonia phage RSB1] gi|197927046|dbj|BAG70388.1| head portal-like protein [Ralstonia phage RSB1] Length = 514 Score = 
174 bits (440), Expect = 4e-41, Method: Composition-based stats. Identities = 54/517 (10%), Positives = 132/517 (25%), Gaps = 42/517 (8%) Query: 13 FNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 + L + + E + P + + + GSE LS+ L Sbjct: 12 WTALDGRANTVIRRSERYASWTQPSLCPPDGFNEQTELQNDYQSVGSECVNSLSNRLVLN 71 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 + P + + + D ++ + + + L Sbjct: 72 LFAPSRPFMRYDVPPAIAAKL----DIDPAVLQTQLSKAERD--SVKLLDQLSTRPKLFE 125 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVD 184 ++ G + D +VP+ + + ++ + D Sbjct: 126 AIKHLIVIGNVLVILGKDKTTP-------LRTVPIKKFRCKRSPSGKLVTLAIKECLKFD 178 Query: 185 QIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDEN 244 ++ K K+L N + + + + Sbjct: 179 ELDEKVQQKLLEQSPTKYQFTPNNPPDCEWYTEVCLQPDGRYAVRTQVDDAMLTGHGYDA 238 Query: 245 RFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPT 304 + EE PY V + + YG + ++ Q L+ Sbjct: 239 MYTEE----EMPYRVLTWELPDGWHYGIGLVEQHAGDFAAISTMSASQLQSAILASEFRW 294 Query: 305 IAVSEAKQRNFDLK---PGYMNIGALSREGRSLFQPVQFGNPLPYH-EELNRLKESIRSL 360 + + D+ G + G+ + L L++ + Sbjct: 295 LVNPAGITQPEDMVNSQNGDVVPGSPDDVVAVTAATAGVASALQVQDLILSKYVTRVGRA 354 Query: 361 FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNL 420 FLL D +A E E +G + L +F + + G Sbjct: 355 FLLASAA-QRDAERVTAEEIRRDVLELETSLGGVYSRLAVDFQKPL---AYWLARMLGV- 409 Query: 421 PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV-KTGDPSCMDHMDT 479 + P + ++ S + + + ++ + ++ + G ++T Sbjct: 410 -KLSDTGIQPTIITGLDALS------RNSDLENLMRALQQLLIVSQIVAGGGPLSVTLNT 462 Query: 480 DRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRV 516 ++ A + E + ++E R+ Sbjct: 463 TSIAASIFAGNGVDADTYVNDQETQQALMEQEQARQE 499 >gi|294661422|ref|YP_003347633.2| head-tail connector protein [Klebsiella phage KP34] gi|291195554|gb|ACY66713.2| head-tail connector protein [Klebsiella phage KP34] Length = 531 Score = 167 bits (422), Expect = 5e-39, Method: Composition-based stats. 
Identities = 70/506 (13%), Positives = 148/506 (29%), Gaps = 45/506 (8%) Query: 38 KNNAQLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVR 97 + R + +TG++ ++ + + P G + ++S + Sbjct: 44 RRRPLERDYQSTGAQLVNTAATKIVGALFPQGTSFFRFSKSSDL--DEFISSLGSAATAE 101 Query: 98 EWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISV 157 +V +T + + G+ L ++ G Y++ + I Sbjct: 102 SKLAEVENTA-SQKVFEKDGYAAKL-QAVKLLLVTGNALEYIDERTGKS--------IVY 151 Query: 158 PLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAV 217 + N + + V + +V + + + ++ I A Sbjct: 152 SVRNFTVRRDGSGNVLRLIIRERASVQDLPESFQNTFYR-------DKDPYGDVDIYTAA 204 Query: 218 YPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIA--TFPYIVGRYRVRADEIYGRSPA 275 K ++ + + D +R + PY V + + + E YGR Sbjct: 205 CRKVKRTEEGVEVVSY--EVYQEADGHRIGDSSTYPELELPYNVLVWNLVSGEHYGRGLV 262 Query: 276 MEALPTIRRLNETVNEL--AQFGRLSLHPPTIAVSEAKQRNFDLKPGY----MNIGALSR 329 + RL+ L + L P A S F + G + Sbjct: 263 EDYAGDFARLSVLSEALTNYEVESARLIPLIDASSGLDVDEFATSETGEAVQVGGGGSNG 322 Query: 330 EGRSLFQPVQFGNPLPYH---EELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTRE 386 +S + G+ + L++ + F+ +A E + +E Sbjct: 323 NSKSPVTAYEGGSAQKIQWIASNIQMLEQKLSRAFMYTG--NSRQGERVTAYEIRQNAKE 380 Query: 387 KGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKV-EYTSPLFKY 445 A +G L ++ R+L L + P + + V + V TS L K Sbjct: 381 AEAAMGGGFSILSDTWL-----RKLAYLYTALVYPRFKLYLSEGVVSINVTVGTSALAKA 435 Query: 446 QQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVED 505 A+ + A Q + + + + D + A + T E Sbjct: 436 AAADKLLEAAQSMQLAIPV-----LEQITPRFNKDACVDWYFDAYGIVSEPFMYTEEQLQ 490 Query: 506 IRQQREVQRRVMEEQHLQQQLQQTSQ 531 +QQ + + Q QLQ + Sbjct: 491 QKQQVQDASADVSAGAAQDQLQGLTA 516 >gi|291334523|gb|ADD94176.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201] gi|291334657|gb|ADD94304.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695] gi|291334711|gb|ADD94357.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C890] gi|291336437|gb|ADD95992.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073] Length = 193 
Score = 166 bits (421), Expect = 6e-39, Method: Composition-based stats. Identities = 52/189 (27%), Positives = 97/189 (51%), Gaps = 8/189 (4%) Query: 137 FYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLS 196 ++E D E+ +++ + ++ ++++ N + +D+V+R+F+ + ++ K+GD +S Sbjct: 1 MFIEED-----DEDILKFSTRHINEIFIAENDKGRIDTVFRKFSLSARAVMQKFGD--VS 53 Query: 197 SKMKSALARNENERFTIIHAVYPKSLTDK-KKDKGNKGFHSKFVSVDENRFFEEKQIATF 255 + + ++ E I+HAVYP+S D K+DK N F S ++ + F Sbjct: 54 INIATKAKKDPYEEVEIMHAVYPRSDFDPRKQDKENMPFESVYLDAESGDELSVSGFREF 113 Query: 256 PYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNF 315 P++V RY + EIYGRSPAM ALP ++ LNE + + + PP + + Sbjct: 114 PFVVPRYLKASHEIYGRSPAMTALPDVKMLNEMSKTTIKSAQKQVDPPLLVPDDGFMLPV 173 Query: 316 DLKPGYMNI 324 PG +N Sbjct: 174 RTIPGGLNF 182 >gi|33300841|ref|NP_877469.1| head-tail connector protein [Pseudomonas phage phiKMV] gi|195546675|ref|YP_002117756.1| hypothetical protein PT5_gp34 [Pseudomonas phage PT5] gi|33284812|emb|CAD44221.1| head-tail connector protein [Enterobacteria phage phiKMV] gi|158187636|gb|ABW23113.1| conserved hypothetical phage protein [Pseudomonas phage PT5] Length = 510 Score = 161 bits (407), Expect = 3e-37, Method: Composition-based stats. 
Identities = 56/476 (11%), Positives = 137/476 (28%), Gaps = 38/476 (7%) Query: 13 FNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 + L++ G + E PY + + G+ L++ L+ Sbjct: 9 WEKLRD--GSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARS 66 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 + P G + + A + D +V +V ++ S + L Sbjct: 67 LFPTGIPFFRSELTD-AIRREADSRDTDITEVTAALARVDRKATQRLFQNAS--LAVLTQ 123 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVD 184 ++ G Y ++D ++ L + + + + + + Sbjct: 124 VIKLLIVTGNALLYRDSDAAT--------VVAWSLRSYAVRRDATGRWMDIVLKQRYKSK 175 Query: 185 QIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDEN 244 + ++ ++ + + + + + + ++K + + +D Sbjct: 176 DLDEEYKQDLMRAGRNLSGSGSVDLYTHV-----------QRKKGTAMEYAELYHEIDGV 224 Query: 245 RFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 R +E + PYIV + + E YGR + + +L+ +L + SL Sbjct: 225 RVGKEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEV 284 Query: 303 PTIAVSE-AKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLF 361 + + + E ++ + + L + + F Sbjct: 285 LNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF 344 Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 + D +A E E +G L + L +D L Sbjct: 345 MYG--ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDA-LLQ 401 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHM 477 + P + S Q + + + G+ + +L + P MD + Sbjct: 402 GLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTI 457 >gi|225626357|ref|YP_002727853.1| putative head-tail connector protein [Pseudomonas phage phikF77] gi|225594866|emb|CAX63151.1| putative head-tail connector protein [Pseudomonas phage phikF77] Length = 510 Score = 161 bits (407), Expect = 3e-37, Method: Composition-based stats. 
Identities = 57/476 (11%), Positives = 135/476 (28%), Gaps = 38/476 (7%) Query: 13 FNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 + L++ G + E PY + + G+ L++ L+ Sbjct: 9 WEKLRD--GSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARS 66 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 + P G + + A + D +V +V ++ S + L Sbjct: 67 LFPTGIPFFRSELTD-AIRREADSRDTDITEVTAALARVDRKATQRLFQNAS--LAVLTQ 123 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVD 184 ++ G Y +D ++ L + + + + + + Sbjct: 124 VIKLLIVTGNALLYRNSDEAT--------VVAWSLRSYAVRRDATGRWMDIVLKQRYKSK 175 Query: 185 QIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDEN 244 + + ++ + + + + + + ++K + + +D Sbjct: 176 DLDEAYKQDLMRAGRNLSGSGSVDLYTHV-----------QRKKGTAMEYAELYHEIDGV 224 Query: 245 RFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 R EE + PYIV + + E YGR + + +L+ +L + SL Sbjct: 225 RVGEEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEV 284 Query: 303 PTIAVSE-AKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLF 361 + + + E ++ + + L + + F Sbjct: 285 LNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF 344 Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 + D +A E E +G L + L +D L Sbjct: 345 MYG--ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDA-LLQ 401 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHM 477 + P + S Q + + + G+ + +L + P MD + Sbjct: 402 GLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTI 457 >gi|195546737|ref|YP_002117815.1| head-tail connector protein [Pseudomonas phage PT2] gi|165880746|gb|ABY71001.1| head-tail connector protein [Pseudomonas phage PT2] Length = 510 Score = 161 bits (407), Expect = 3e-37, Method: Composition-based stats. 
Identities = 55/476 (11%), Positives = 135/476 (28%), Gaps = 38/476 (7%) Query: 13 FNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 + L++ G + E PY + + G+ L++ L+ Sbjct: 9 WEKLRD--GSVESRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARS 66 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 + P G + + A + D +V +V ++ S + L Sbjct: 67 LFPTGIPFFRSELTD-AIRREADSRDTDITEVTAALARVDRKATQRLFQNAS--LAVLTQ 123 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVD 184 ++ G Y ++ L + + + + + + Sbjct: 124 VIKLLIVTGNALLY--------RDSAAATVVAWSLRSYAVRRDATGRWMDIVLKQRYKSK 175 Query: 185 QIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDEN 244 + ++ ++ + + + + + + ++K+ + + +D Sbjct: 176 DLDEEYKQDLMRAGRNLSGSGSVDLYTHV-----------QRKNGTAMEYAELYHEIDGV 224 Query: 245 RFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 R +E + PYIV + + E YGR + + +L+ +L + SL Sbjct: 225 RVGKEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEV 284 Query: 303 PTIAVSE-AKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLF 361 + + + E ++ + + L + + F Sbjct: 285 LNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF 344 Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 + D +A E E +G L + L +D L Sbjct: 345 MYG--ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDA-LLQ 401 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHM 477 + P + S Q + + + G+ + +L + P MD + Sbjct: 402 GLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTI 457 >gi|167600476|ref|YP_001671975.1| head-tail connector protein [Pseudomonas phage LUZ19] gi|161168339|emb|CAP45503.1| head-tail connector protein [Pseudomonas phage LUZ19] Length = 510 Score = 160 bits (405), Expect = 5e-37, Method: Composition-based stats. 
Identities = 55/476 (11%), Positives = 134/476 (28%), Gaps = 38/476 (7%) Query: 13 FNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 + L++ G + E PY + + G+ L++ L+ Sbjct: 9 WEKLRD--GSVESRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARS 66 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 + P G + + A + D +V +V ++ S + L Sbjct: 67 LFPTGIPFFRSELTD-AIRREADSRDTDITEVTAALARVDRKATQRLFQNAS--LAVLTQ 123 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVD 184 ++ G Y ++ L + + + + + + Sbjct: 124 VIKLLIVTGNALLY--------RDSAAATVVAWSLRSYAVRRDATGRWMDIVLKQRYKSK 175 Query: 185 QIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDEN 244 + ++ ++ + + + + + + ++K + + +D Sbjct: 176 DLDEEYKQDLMRAGRNLSGSGSVDLYTHV-----------QRKKGTAMEYAELYHEIDGV 224 Query: 245 RFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 R +E + PYIV + + E YGR + + +L+ +L + SL Sbjct: 225 RVGKEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEV 284 Query: 303 PTIAVSE-AKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLF 361 + + + E ++ + + L + + F Sbjct: 285 LNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF 344 Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 + D +A E E +G L + L +D L Sbjct: 345 MYG--ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDA-LLQ 401 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHM 477 + P + S Q + + + G+ + +L + P MD + Sbjct: 402 GLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTI 457 >gi|254505047|ref|ZP_05117198.1| hypothetical protein SADFL11_5087 [Labrenzia alexandrii DFL-11] gi|222441118|gb|EEE47797.1| hypothetical protein SADFL11_5087 [Labrenzia alexandrii DFL-11] Length = 400 Score = 159 bits (403), Expect = 8e-37, Method: Composition-based stats. 
Identities = 57/425 (13%), Positives = 120/425 (28%), Gaps = 51/425 (12%) Query: 94 KKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIR 153 + + T+ + +E + + +V G D Sbjct: 5 PDLEKGFALATNLIT--KEIEKKAWRKPTSLTLELLVSTGNALERYMPDNS--------- 53 Query: 154 YISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTI 213 L + + + + L + +S L ++ + I Sbjct: 54 IRVYRLDQYVVVRDLSGNLVELILREKVN---------KASLPEQTQSYLKASQEDDVEI 104 Query: 214 IHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQ--IATFPYIVGRYRVRADEIYG 271 + E + E T P+ R+ E YG Sbjct: 105 FTC------------AKRHPDGWEIKQEVEGQIIEGMGGVTPTNPFNPLRWSAVPGEDYG 152 Query: 272 RSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREG 331 R E + L+ + ++ T+ A N + G + Sbjct: 153 RGKVEEHFSDLTYLDLLSKSMVDGSAMATRHITMVRPNAAGSNLRKRFAEAKNGDVISGN 212 Query: 332 RSLF---QPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKG 388 Q +E+ R+ + + FLL ++ + +A E E Sbjct: 213 PEDVDLKQFANVTGMQIAQQEIARITQELAQAFLL-SSSMIRNAERVTAQEVRMIAEELE 271 Query: 389 AFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQA 448 + +G + L + + A I + + + G LP PV + +E L + + Sbjct: 272 SVLGGVYSYLSQDMMSARIEALMTSMMAAGQLPPVLQ-MTQPVLTVGLE---ALERDKDV 327 Query: 449 ESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQ 508 V + LQ + + P +D++D + + + P ++ E + RQ Sbjct: 328 MRVQTVLQTLQAL--------PPDFLDYLDIPDLLKTFMIGLGLPGK-VKTEQEAQQTRQ 378 Query: 509 QREVQ 513 QR + Sbjct: 379 QRLMA 383 >gi|158345057|ref|YP_001522822.1| putative head-tail connector protein [Pseudomonas phage LKD16] gi|114796410|emb|CAK25966.1| putative head-tail connector protein [Pseudomonas phage LKD16] Length = 510 Score = 159 bits (402), Expect = 1e-36, Method: Composition-based stats. 
Identities = 55/476 (11%), Positives = 134/476 (28%), Gaps = 38/476 (7%) Query: 13 FNYLKNQRGELNYWMEELTGFLYPY--------KNNAQLRMWDTTGSEACIKLSSLLSSL 64 + L++ G + E PY + + G+ L++ L+ Sbjct: 9 WEKLRD--GSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARS 66 Query: 65 ITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQS 124 + P G + + A + D +V +V ++ S + L Sbjct: 67 LFPTGIPFFRSELTD-AIRREADSRDTDITEVTAALARVDRKATQRLFQNAS--LAVLTQ 123 Query: 125 FYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVD 184 ++ G Y +D ++ L + + + + + + Sbjct: 124 VIKLLIVTGNALLYRNSDEAT--------VVAWSLRSYAVRRDATGRWMDIVLKQRYKSK 175 Query: 185 QIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDEN 244 + + ++ + + + + + + +++ + + +D Sbjct: 176 DLDDVYKQDLMRAGRNLSGSGSVDLYTHV-----------QRRKGTAMDYAEMYHEIDGV 224 Query: 245 RFFEEKQI--ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 R E + PYIV + + E YGR + + +L+ +L + SL Sbjct: 225 RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEV 284 Query: 303 PTIAVSE-AKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLF 361 + + + E ++ + + L + + F Sbjct: 285 LNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF 344 Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 + D +A E E +G L + L +D L Sbjct: 345 MYG--ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDA-LLQ 401 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHM 477 + P + S Q + + + G+ + +L + P MD + Sbjct: 402 GLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTI 457 >gi|158345175|ref|YP_001522882.1| putative head-tail connector protein [Enterobacteria phage LKA1] gi|114796471|emb|CAK25009.1| putative head-tail connector protein [Pseudomonas phage LKA1] Length = 514 Score = 158 bits (400), Expect = 2e-36, Method: Composition-based stats. 
Identities = 51/517 (9%), Positives = 124/517 (23%), Gaps = 52/517 (10%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLY------PYKNNAQLRM----WDTTGS 51 ++ A + + R E+ F P Q + + + G+ Sbjct: 1 MRQQASAMWAEYRDSTAIR-----KAEDFAKFTIASLMVDPLDKTHQAEVVEYDFQSAGA 55 Query: 52 EACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFR 111 L++ L+ + PPG+ + Q ++ + Sbjct: 56 FLVNNLTAKLALTLFPPGRPSFQIELDD-TLQELAAANGIDQSELHSRTADLERRATRRL 114 Query: 112 ERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNV 171 + S + L +V G FY E + + + + + Sbjct: 115 FVNAS--LSKLHRILKLLVVTGNALFYREPGTG--------KMLVWTMQSYTVRRTSHGD 164 Query: 172 VDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGN 231 V ++ + + ++ + + I P Sbjct: 165 PAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQPTPNGKR-------- 216 Query: 232 KGFHSKFVSVDENRFFEEKQIAT--FPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 + + ++ R E PY+ + V E YGR E RL+ Sbjct: 217 ---CAVWHELEGKRVGPESSYPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILS 273 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLK-PGYMNIGALSREGRSLFQPVQFGNPLPYHE 348 L + +L + D + + + ++ + Sbjct: 274 ERLGLYEFEALSLLNLVDEAKGGAVDDYRDAETGDFVPGQVGSVASYERGDYNKIAQASA 333 Query: 349 ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 + + + F+ + D + E E +G + L + Sbjct: 334 SVESIVMRLNRAFMYTGQ--VRDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAPLAY 391 Query: 409 RELDILDS--QGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 + G L P + + + + A+ L+ + Sbjct: 392 LTMYEASRGNGGMLLGIAQGVYRPSIITGIPALT------RNIETANILRATQEASAIVP 445 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEV 503 D +++ + + +V Sbjct: 446 ALV--QLSKRFDPEKLVERIFANNSVDLSTLSKDPDV 480 >gi|312062873|gb|ADQ12735.1| putative Head-tail connector protein [Acinetobacter phage phiAB1] Length = 518 Score = 148 bits (374), Expect = 2e-33, Method: Composition-based stats. 
Identities = 55/505 (10%), Positives = 131/505 (25%), Gaps = 57/505 (11%) Query: 46 WDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTD 105 + + G+ +L+S L+S + P + + S + + K + + + Sbjct: 58 YQSVGAYLVNRLASRLASTLFPVSTSFFRIEPSQE-LKDLVDKRGTST------LIDLEN 110 Query: 106 TLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS 165 + S + ++ G + L R L N + Sbjct: 111 KACRRLFFNAS--YAQIVQALRLLIITG----------EVLLLRRDNRLRVFSLKNYALL 158 Query: 166 VNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDK 225 N+ V + + L ++ ++ L + ++ K + Sbjct: 159 RNNVGEVLEIITREPKRYRE---------LDAETQALLQDRNEDETLDLYTRIRKRNING 209 Query: 226 KKDKGNKGFHSKFVSVDENRFFEEKQIAT--FPYIVGRYRVRADEIYGRSPAMEALPTIR 283 +D R + PYI + + YGR E Sbjct: 210 VISWK------ITQEIDGVRLPNYEIYRDKLCPYIPVTWSYMNGDAYGRGYVEEYAGDFA 263 Query: 284 RLNETVN--ELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG 341 +L+E Q L + A + + + + ++ + Sbjct: 264 KLSELSQGLTEYQIESLIIRHVYNA-QGGFDVESAVNSRNGDWISGNVNAVQNYESGSYQ 322 Query: 342 NPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 L + + + F+ + + +A E E +G + L Sbjct: 323 KMNEVRLGLEAIMQRLNVAFMYTG--NMREGDRVTAYEIARNADEAEQVLGGVYSQLSQN 380 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + L + + + + L ++ S ++ + L N + Sbjct: 381 MHLPLAYLLLYE-VRKDFIQAIDRQEIELNILTGLQALS------RSSENQALLVAANEI 433 Query: 462 VELGVKTGDPSCMDHMDTDRVSRFSLWATNTP-AVLIRDTAEVEDIRQQREVQRRVMEEQ 520 + + D + L + + + E+ + + +Q Sbjct: 434 ATVAQVFS--QVSKRFNLDAIVDKILLSNGIDISEITYSEEEMRAKAMEEQRAAEAQRQQ 491 Query: 521 HLQQQLQQTSQ------DIGAKAAG 539 +QQ Q AAG Sbjct: 492 VIQQAGAQLGGNQLENTQAAQLAAG 516 >gi|229604951|ref|YP_002875651.1| putative head-tail connector protein [Vibrio phage VP93] gi|227976996|gb|ACP44098.1| putative head-tail connector protein [Vibrio phage VP93] Length = 510 Score = 148 bits (374), Expect = 2e-33, Method: Composition-based stats. 
Identities = 50/460 (10%), Positives = 122/460 (26%), Gaps = 43/460 (9%) Query: 42 QLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCD 101 R + + G+ L+S L+ + P G + ++ + + + + + ++ Sbjct: 47 LQRDFQSHGAMLVNNLASKLTRTLFPTGMSFFRIS-DTDKMREIIAQLGSENAQLSAVFT 105 Query: 102 QVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSN 161 + GF ++ G Y + R + + Sbjct: 106 GIEREAMTLLTTHA-GFAQLTH-LMKLLIITGNALLYRDPLTG--------RMTVYSVRD 155 Query: 162 VYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKS 221 + + V + + ++ + ++ + Sbjct: 156 YAVRRDGAGRVLCTILRERVPIQDVPEEFRPTGYTDPTTDVW----------LYTKIQRE 205 Query: 222 LTDKKKDKGNKGFHSKFVSVDENRFFEEKQIAT--FPYIVGRYRVRADEIYGRSPAMEAL 279 D +D PYI + + + E YGR + Sbjct: 206 TRDAG------DVFVITQQIDGKPVGTLSVYPEKLCPYIPAVWNLVSGEHYGRGHVEDHA 259 Query: 280 PTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLK-PGYMNIGALSREGRSLFQPV 338 R++E L + ++ + ++ L A EG + Sbjct: 260 GAFARVSELTQALTLYEIEAMRVVNLVSPKSTADVDALNDAETGEYVAGDGEGIKAHEAG 319 Query: 339 QFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGL 398 + +L + + F+ + D +A E RE +G + L Sbjct: 320 EARKIAEVVNDLQMVLAELARAFMYTG--NVRDAERVTAEEIKNNVREAEENMGGIYATL 377 Query: 399 QSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVE-YTSPLFKYQQAESVASALQG 457 +E + ++ L + PE L ++ T+ + + + + Sbjct: 378 -AEILHIPLAHILTVEAR----PELLALLQANAVSLDIQVGTAAINRSIVVQRLGLVAND 432 Query: 458 VNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLI 497 +N ++ + + DRV L I Sbjct: 433 INLILPVLA-----QATKRTNPDRVIDLILAGHGVDPTEI 467 >gi|115304377|ref|YP_762669.1| PfWMP4_39 [Cyanophage Pf-WMP4] gi|113201871|gb|ABI33183.1| PfWMP4_39 [Phormidium phage Pf-WMP4] Length = 641 Score = 148 bits (372), Expect = 3e-33, Method: Composition-based stats. 
Identities = 62/539 (11%), Positives = 135/539 (25%), Gaps = 68/539 (12%) Query: 9 IQDRFNYLKNQRGELNYWMEELTGFLYPY---KNNAQLRMWDTTGSE------------- 52 + ++ +++R + +E + N + R + TTG++ Sbjct: 29 VISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRHRINTGHT 88 Query: 53 --ACIKLSSLLSSLITPPGQKWHGLA------ESFSAYQAFLYKEDARSKKVREWCDQVT 104 L + P W L + L K + +R+ + Sbjct: 89 FEVVETLVAYFKGATFP-SDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYV 147 Query: 105 DT-----LFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPL 159 + +R + + + + G DV +R + Sbjct: 148 RNLVLYGVSTYRLGWDTSMERQFKRTFVETGDIFGGW----EDVAVNRQRSELRIEPLSP 203 Query: 160 SNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYP 219 +V++ + + T +++ + + + Sbjct: 204 YDVWLDTSG-GKNTGTFVRLRHTREELHELVTSGYYDLDLTQVEQYVDYKFADPDTPKDV 262 Query: 220 KSLTDKKKDKGNKG---------FHSKFVSVDENRFFEEKQIATF---PYIVGRYRVRAD 267 D F + + P++ D Sbjct: 263 NGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTTTLLPDRD 322 Query: 268 EIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDL--KPGYMNIG 325 +YG S L + LN N L ++ V + + D+ KPG + Sbjct: 323 SVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDVKAKPGAVFKV 382 Query: 326 ALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQV------LDDKASRSAAE 379 A QP+ G + + S++ +AAE Sbjct: 383 AQHGS----LQPIDMG-RQDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAE 437 Query: 380 SMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYT 439 G + + ++ ++++ +L PE P + Sbjct: 438 IQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEV 497 Query: 440 SP--LF-----KYQQAESVASALQGVNTVVELGVKTG-DPSCMDHMDTDRVSRFSLWAT 490 SP L A V + V +++L +G P +D + L Sbjct: 498 SPEYLHYPYKFLALGANYVVERERMVTDLLQLLDISGRVPQIGQSLDYALILEDLLRQM 556 >gi|148747833|ref|YP_001285799.1| portal protein [Phormidium phage Pf-WMP3] gi|146230066|gb|ABQ12474.1| portal protein [Phormidium phage Pf-WMP3] Length = 651 Score = 141 bits (356), Expect = 2e-31, Method: Composition-based stats. 
Identities = 76/635 (11%), Positives = 171/635 (26%), Gaps = 120/635 (18%) Query: 9 IQDRFNYLKNQRG----ELNYWM------EELTGFLYPY--------KNNAQLRMWDTTG 50 ++ + + R E +L + + ++ Sbjct: 25 VKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKITTGKA 84 Query: 51 SEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGF 110 EA + + L S P + W + + L + + Sbjct: 85 FEAIETIHAYLMSATFP-NKNWFDVVPAKPGQDNLLVSRL------------IKRYVQDK 131 Query: 111 RERSRSGFVGCLQSFYTSVVEFGTGCFYME-----ADVDEKGLEEGIRYISVPLSNVYMS 165 + F +F ++ G + A+V +K + P V Sbjct: 132 LTEGK--FRAAYANFLRQLLITGNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSE 189 Query: 166 V-----NHQNVVDSVYREFTFTVDQIVSK-----------------------WGDKVLSS 197 + V ++ F ++ +G L Sbjct: 190 EREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIRKLTKTKADILNLLSEGYYYGVDPLDV 249 Query: 198 KMKSALARNENERFTI------------IHAVYPKSLTDKKKDKGNKGFHSKFVSVDENR 245 ++ ++ + H NK +H V++ N Sbjct: 250 VEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNE 309 Query: 246 FFEEKQIATF---PYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP 302 +Q + P+++G Y A + Y L + LN N+ L++ Sbjct: 310 VLRFEQNPYWCGRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQ 369 Query: 303 PTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFL 362 S+ + D+ + +S G Q N ++E + L+ +I F Sbjct: 370 MYTLRSDGLLQPEDVYTEPGKVFLVSDHGDLQPLANQSSNFSITYQESSFLESTIDKNFG 429 Query: 363 LDLF---QVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDS--- 416 + +AAE G + + ++ + ++ + + ++ Sbjct: 430 TGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTD 489 Query: 417 -----------QGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELG 465 G E +++ ++ + + L + V ++ Sbjct: 490 QPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLTFIQAVAQV- 548 Query: 466 VKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQ 525 P +D R+ L E E +Q++ Q ++ L Q Sbjct: 549 -----PEMGQLVDYKRILVDLLQHWGF--------EEPEAYLKQQDQQAPANPQEALLSQ 595 Query: 526 LQQTSQDIGAKAAGRAMEKKLTHD----MMENSYG 556 ++D+G +A ++ +L D MM YG Sbjct: 596 ----AKDVGGQAMSNMLQNQLQADGGTQMMSEMYG 626 
>gi|281306687|ref|YP_003345493.1| predicted phage head-tail connector protein [Pseudomonas phage phi-2] gi|271277992|emb|CBH51598.1| predicted phage head-tail connector protein [Pseudomonas phage phi-2] Length = 518 Score = 140 bits (353), Expect = 5e-31, Method: Composition-based stats. Identities = 54/470 (11%), Positives = 133/470 (28%), Gaps = 37/470 (7%) Query: 38 KNNAQLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVR 97 N + + G+ L++ L + + P G + S + A + + ++V Sbjct: 43 SNQTVQHDFQSVGALLTNNLTAKLVASLFPSGVPFFKNMPSKTLLAAAVEQSINE-QEVN 101 Query: 98 EWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISV 157 ++ + L ++ G Y + + Sbjct: 102 NMLARLDREATERLFVQATT--AKLTRLLKLLIITGNALAYRDPKTG--------KMTVW 151 Query: 158 PLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAV 217 + + + V + D++ + K + FT+I Sbjct: 152 SIRSYVVRRAADGEFRHVVLKQIMRFDELPEHVQADYTAKKPGQYKPDRMMDYFTVIE-- 209 Query: 218 YPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIAT--FPYIVGRYRVRADEIYGRSPA 275 K+ NK + +D R E P+IV + + E YGR Sbjct: 210 -------KQPGAVNKRV-VVWNEIDGLRVGPESSYPEHLAPWIVTVWNLADGEHYGRGLV 261 Query: 276 MEALPTIRRLNETVNE--LAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRS 333 + +++ + L + LSL + + + + + Sbjct: 262 EDFTGDFAKVSLVSEQLGLYELEALSLLNVVDESAGGVIDEYQ-ESDTGDYVRGKTAAIT 320 Query: 334 LFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGP 393 ++ + E + + + + F+ +A E +E + +G Sbjct: 321 SYERGDYNKINAVRESIGEVIQRLSMAFMYTG--NTRQAERVTAEEIRAVAKEAESTLGG 378 Query: 394 LIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVAS 453 + L G + + + + P + + L + + +++ + Sbjct: 379 VYSLLAETLQGPLAYLCMADVADDLMMGLVTKQYKP----VILTGIPALSRAVEMQNLLA 434 Query: 454 ALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEV 503 A Q + +V + D +D +V+ + + I EV Sbjct: 435 ATQEIAAIVP-ALTQLDT----RVDGSKVADLIYNSRSVDVSRIFKEPEV 479 >gi|308071876|emb|CBW54797.1| putative head-tail connector protein [Pantoea phage LIMElight] Length = 529 Score = 135 bits (340), Expect = 1e-29, Method: Composition-based stats. 
Identities = 53/487 (10%), Positives = 126/487 (25%), Gaps = 44/487 (9%) Query: 42 QLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCD 101 R + + G+ L+S ++ + P + + ++ + A +K+ Sbjct: 50 LQRDYQSKGAMLVNNLASKVTQALFPQNNAFFEIGQTAEML-QVAQEMGADAKQAASKFA 108 Query: 102 QVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSN 161 + + L ++ G Y + + + + + + Sbjct: 109 GIEVRASARVFLNAG--YSALSHAMKLLIITGNALVYRDPTNKQ--------FHTYSVRD 158 Query: 162 VYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKS 221 + + V + + + + + L + E T+ V Sbjct: 159 YVVKRDGSGKVLCLILKERIALQDLPEDFRLSRL------QYRTDPFEDVTLYTKV---- 208 Query: 222 LTDKKKDKGNKGFHSKFVSVDENRFFEEKQIAT--FPYIVGRYRVRADEIYGRSPAMEAL 279 +K G + + V++ PYI + + E YGR + Sbjct: 209 ---TRKHNGARVMYEVTQEVEDYPIGTPSTYPEYLCPYIPLTWNLVTGENYGRGHVEDFA 265 Query: 280 PTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALS-----REGRSL 334 RL+E + + I A D G Sbjct: 266 GDFARLSELSESSLLYEVEMMRLINIIDPGAGIDLDDFMDADCGKAVAGKSNAAGNGVVA 325 Query: 335 FQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPL 394 + ++ L + + F+ D +A E E +G + Sbjct: 326 HEGGNAQKLAAVQNDIANLVQQLSIAFMYTG--NTRDAERVTAEEIRANVSEANQTLGGV 383 Query: 395 IGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVE-YTSPLFKYQQAESVAS 453 L SE + ++ L + + P L V + L + E + Sbjct: 384 YANL-SEVLHLQLAHILSVEEE----PALLQLLMVQGIKLDVSVGLASLNRQANVERLQY 438 Query: 454 ALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQ 513 + V+ + ++ + D + + T + Q+++ Sbjct: 439 LANALQIVLPVLTQSS-----KRFNPDLIIDAMCQGYGVDREALSYTEDQLQQLQEQQDA 493 Query: 514 RRVMEEQ 520 Q Sbjct: 494 SAQQSAQ 500 >gi|332800729|emb|CBY88569.1| hypothetical protein [Pantoea phage LIMEzero] Length = 522 Score = 135 bits (340), Expect = 2e-29, Method: Composition-based stats. 
Identities = 55/528 (10%), Positives = 152/528 (28%), Gaps = 51/528 (9%) Query: 7 KDIQDRFNYLKNQRGELNYWMEELTGFLY--------PYKNNAQLRM-----WDTTGSEA 53 + + R+ + + + + N R + + G+ Sbjct: 13 ESLWQRYRD-----TNVVTKARDYSRYTLSKLVSEYDALDANDTSRAQITRDYQSVGALL 67 Query: 54 CIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRER 113 L + L+ + P Q++ + Q + + +V + + T+ + Sbjct: 68 VNNLVARLAEFLFPSNQRFVRVKP-----QNLTDAQREKMGQVNQGLILIEKTVSERAKA 122 Query: 114 SRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVD 173 + L G Y ++D + Y L N + + + VV Sbjct: 123 NGG--YADLIQAIAHQAVTGNVALYRDSDSET--------YRVYGLENFVVQRDGRGVVV 172 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKG 233 + D + +++ ++ + + + + + G Sbjct: 173 DAIIKERLQYDSLPAEFQAQLKAQNFQCGGNKRI--WLYTRVLRVKRGNNYGYEITQQIG 230 Query: 234 FHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELA 293 S V + ++ EK P+I + +++ E YGR + RL+ A Sbjct: 231 NMSGSVYTPGDDYYPEK---VCPWIFPVWSLKSGEHYGRGIVEDHAGDFARLSMLSESSA 287 Query: 294 QFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGR-SLFQPVQFGNPLPYHEELNR 352 + + ++ + + + +L + + + +E+ + Sbjct: 288 LYMQEAMRILWLLSGSGGNADDIEAAETGQVISLQTGTKLEGVEVGDYQKVQQARDEIGQ 347 Query: 353 LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELD 412 + + + F+ D +A E + +G +Q++ + ++ L Sbjct: 348 IVQRLSQAFMYTGE--FRDSERTTATEIQQVATSAERAMGGPYS-MQAKTLQIPLAYVLL 404 Query: 413 ILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVN--TVVELGVKTGD 470 +P+ +L+++ + L ++ + +Q ++ V + Sbjct: 405 SEIDDTLVPDI------VGKILELQVVAGLDALGRSIEASQLIQALSDAQAAIAAVANIN 458 Query: 471 PSCMDHMDTDRVSRFSLWATNTPAVLIR-DTAEVEDIRQQREVQRRVM 517 +D V + R E++ QQ Sbjct: 459 QVAQGVLDPKAVLETIFSSNGVALDDYRTSPEELQAKAQQINQMTAEA 506 >gi|239907145|ref|YP_002953886.1| hypothetical protein DMR_25090 [Desulfovibrio magneticus RS-1] gi|239797011|dbj|BAH76000.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 682 Score = 113 bits (282), Expect = 9e-23, Method: Composition-based stats. 
Identities = 60/619 (9%), Positives = 146/619 (23%), Gaps = 118/619 (19%) Query: 5 SAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSE--------ACIK 56 A + F + R E + A + S+ Sbjct: 9 LASKLAKEFRDAQRARQPWEMKWLERYRMYMGEYDEAVANSFSANASKLFVNKCRVKVDT 68 Query: 57 LSSLLSSLITPPG--QKWHGLAESFSAYQAFLYKE-------DARSKKVREWCDQV---- 103 + S L ++ P + W + + + + ++ + Sbjct: 69 IVSRLMEILFPQAGDRNW-SIEPTPEPVLEPAMMDFIAGVRRAYGDAEAVKFLQDIAKQR 127 Query: 104 TDTLFGFRE------RSRSGFVGCLQSFYTSVVEFG-----------------TGCFYME 140 ++ + G+ ++ +G T E Sbjct: 128 SEAMSRVIADQLAESPDHVGYRATIREVILDGAIYGMGIHKGPLVDERKRRVWTAKLVAE 187 Query: 141 ADVDEKGLEEGIR------------YISVPLSNVYMSVNHQNVVDSV---YREFTFTVDQ 185 VD + ++ Y V + Y + + Y E+ Sbjct: 188 PGVDGRAIQREAWVLDTSPVERRPYYRRVSPWSFYWDQSANRRMGDCRYGYEEYRMVYGD 247 Query: 186 IVSK-----WGDKVLSSKMKSALARNENERFTIIHAVYPKSLT----------------- 223 ++ + V+ + + + E T Sbjct: 248 VLELAGRTGFDGDVVRAYLAEKRDGDATEYDFESQLRSINGGTPEPQLQGRWRVLERYGW 307 Query: 224 -----------DKKKDKGNKGFHSKFVSVDENRFFEEK---QIATFPYIVGRYRVRADEI 269 D D + + + + FP+ + + Sbjct: 308 LRGDELEECGVDLGNDPVQADYFCNVWMLGGKIIKAVRAPIRGVEFPFQIFPMFRDDSSL 367 Query: 270 YGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR 329 G +N V + R+SL P A Q+ D Sbjct: 368 CGLGVTGVYRDAQSAINAVVRAMMDNARMSLGPIGGVNVPALQQTLDADNIRGGTWLKFD 427 Query: 330 EGRSLFQPVQ-------FGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESME 382 G + + + + L + + + + + + + D A + Sbjct: 428 TGEDMSKAITFWQASSHTSDYLALAKYFDDMGDELTVPRWVHGDGNVSDAAR-TLGGLSM 486 Query: 383 KTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPL 442 + ++ E ++ P+ +G + Sbjct: 487 LMNAMSINLAEMVKIFDDEVTSQFVTALYHWNMDFNPRPDIKGDFSVVARG--------- 537 Query: 443 FKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAE 502 ++ V S + + + +P +D ++ R + PA ++ D A Sbjct: 538 ATALMSKEVQS-QRLIQFMTMCAS---NPQFAPMLDVNKGLRQVATSMQIPADIVYDQAT 593 Query: 503 VEDIRQQREVQRRVMEEQH 521 V ++ Q+R++ +V EQ Sbjct: 594 V-ELNQERQMAMQVRIEQA 611 >gi|325171218|ref|YP_004251190.1| hypothetical protein ViPhICP2p19 [Vibrio phage ICP2] 
gi|323512244|gb|ADX87701.1| conserved hypothetical protein [Vibrio phage ICP2] gi|323512316|gb|ADX87772.1| hypothetical protein TU12-16_00090 [Vibrio phage ICP2_2006_A] Length = 581 Score = 107 bits (266), Expect = 6e-21, Method: Composition-based stats. Identities = 68/583 (11%), Positives = 160/583 (27%), Gaps = 82/583 (14%) Query: 3 QRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLR---MWD--TTGSEACI-- 55 A+ I + + +QR E EL +++ W TT + C Sbjct: 17 DGLAEQIANTWQNWNSQRQEWLSQKSELRNYIFATDTTTTTNSTLPWKNKTTLPKLCQIR 76 Query: 56 -KLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERS 114 L S S + P ++W ++ +++A+ ++++ D + Sbjct: 77 DNLHSNYISALFP-NERWLK-------WEGKSLQDEAKRDAIQQYMDN---------KVK 119 Query: 115 RSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVV-D 173 S F + +++G +E + EE + ++ +++V + Sbjct: 120 ESDFRTIMSQLLLDYIDYGNCFATVEYVKETTKDEESGATRDTYFGPRAVRIDPKDIVFN 179 Query: 174 SVYREFTFTVDQIVSKWGDKVLSSKMKSALAR---------------------------- 205 V +F + I + + L + Sbjct: 180 PVAVDFAHSPKIIRTVLNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKA 239 Query: 206 ---------NENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENR--FFEEKQIAT 254 N + F + D D + F +R EEK+ + Sbjct: 240 VGFSMDGFGNLYDYFQSPYVEVLTFYGD-YHDTQSGTFKRNMKVTIIDRMFVIEEKENPS 298 Query: 255 F----PYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEA 310 + P +R+R D +Y P + R++ N A L PP + Sbjct: 299 WFAQAPIFHCGWRIRQDNLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPMKVKGD- 357 Query: 311 KQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLD 370 F P + P ++ K + Sbjct: 358 -VEEFVWGPMEQIYI-NGDGDVEMMAPNTQALQADMQIQILEAKM-EEFAGAPREAMGIR 414 Query: 371 DKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPP 430 ++A E + G I + + +++ L+I ++ + + Sbjct: 415 TPGEKTAFEVQQLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRVFDSD 474 Query: 431 VSLLKVEYTS-------PLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVS 483 + + + A A Q V +++ + H+ T+ ++ Sbjct: 475 DKVATFMNVNKDDITAKGRLRPVGARHFAEQAQVVQSLMGIANTPVWQDIKPHVSTENLA 534 Query: 484 RFSLWATNTPA-VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQ 525 + + + 
+ V + + + + + + Q Sbjct: 535 KMLEHNLSLGGWDIFKPNVAVMEAQTTSALVNQSQAQIEEEAQ 577 >gi|291334262|gb|ADD93925.1| hypothetical protein [uncultured marine bacterium MedDCM-OCT-S08-C235] Length = 155 Score = 91.4 bits (225), Expect = 4e-16, Method: Composition-based stats. Identities = 22/111 (19%), Positives = 45/111 (40%), Gaps = 4/111 (3%) Query: 251 QIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEA 310 + PY+V R+ A E+YGR P + ++P I+ N + + + ++++ + Sbjct: 41 GEGSNPYVVFRWSKAAGEVYGRGPLLNSMPAIKTCNLVIEMILENAQMAISGMYQMEDDG 100 Query: 311 KQR--NFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRS 359 L PG + + S G + GN L ++++I Sbjct: 101 IINVDTIQLLPGTIIPRSPSSRGLEPIK--NAGNFNVADLVLKDMRQNINE 149 >gi|9964612|ref|NP_064741.1| gp5 [Roseobacter phage SIO1] gi|9944303|gb|AAG02587.1|AF189021_5 gp5 [Roseobacter phage SIO1] Length = 271 Score = 85.6 bits (210), Expect = 2e-14, Method: Composition-based stats. Identities = 26/238 (10%), Positives = 60/238 (25%), Gaps = 26/238 (10%) Query: 272 RSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREG 331 P + R++ N A +P + +FD +P Sbjct: 1 MGPLDNLVGMQYRIDHLENLKADVFDQIAYPVLKIRGD--VEDFDFEPNARIYLG-DEGD 57 Query: 332 RSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQV-LDDKASRSAAESMEKTREKGAF 390 P + + ++ + + + + ++A E + G Sbjct: 58 VGYLVPDSTALNADFQ--IQNIEAKMEMMAGAPREAMGIRSAGEKTAFEVGQLMTAAGRI 115 Query: 391 VGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLL--------------KV 436 + F+ +++ L+ + + N L K+ Sbjct: 116 FQHKTAHFERVFLEPILNAMLETARRNMDYEDTAKVLNEDTGLYFFTQITRDDIKANGKI 175 Query: 437 EYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPA 494 ++A+ V + K DP+ H+ +R PA Sbjct: 176 VPMGARHFAERAQRVQNLTTMYQI------KASDPTVAAHLSGKEFARLLADELGEPA 227 >gi|291334465|gb|ADD94119.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1161] gi|291334522|gb|ADD94175.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201] gi|291334658|gb|ADD94305.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695] gi|291334712|gb|ADD94358.1| hypothetical protein [uncultured 
phage MedDCM-OCT-S04-C890] gi|291336438|gb|ADD95993.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073] Length = 86 Score = 82.2 bits (201), Expect = 2e-13, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 35/90 (38%), Gaps = 5/90 (5%) Query: 406 MISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELG 465 MI R ++ + +++EY SPL K Q++ ++S ++ + + L Sbjct: 1 MIDRTFALILRKNLFRPAPEFLAGQD--IEIEYVSPLAKAQKSTELSSIMRAIEILGSLS 58 Query: 466 VKTGDPSCMDHMDTDRVSRFSLWATNTPAV 495 DH++ D++ R P Sbjct: 59 NVA---PVFDHINMDKLVRHLADIVGVPQK 85 >gi|291334263|gb|ADD93926.1| hypothetical protein [uncultured marine bacterium MedDCM-OCT-S08-C235] Length = 130 Score = 81.4 bits (199), Expect = 4e-13, Method: Composition-based stats. Identities = 24/120 (20%), Positives = 50/120 (41%), Gaps = 7/120 (5%) Query: 371 DKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPP 430 ++ SA E E+ + +G G LQ+E + ++ R + IL QG + Sbjct: 6 NRTPMSATEVAERMADLSRQIGSSFGRLQAEMVTPVLQRVIHILKKQGRINIP----TVN 61 Query: 431 VSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWAT 490 +K++ TSPL + Q + + + + V G + G +D++ +++ Sbjct: 62 GREIKIQSTSPLAQAQANQDINGFNRFLELV---GARFGPQLINLLVDSNEATKYLAENL 118 >gi|9964610|ref|NP_064740.1| gp3 [Roseobacter phage SIO1] gi|9944301|gb|AAG02585.1|AF189021_3 gp3 [Roseobacter phage SIO1] Length = 282 Score = 74.5 bits (181), Expect = 5e-11, Method: Composition-based stats. 
Identities = 31/260 (11%), Positives = 69/260 (26%), Gaps = 35/260 (13%) Query: 1 MNQR--SAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLR---MW-DTTGSEAC 54 M A +I +R+ N R E +EL ++Y W ++T + Sbjct: 11 MIDPHSLAVEIANRWTSWNNARSEKVKEWKELRNYIYATDTRTTSNNKLPWSNSTTTPKL 70 Query: 55 IKLSSLLS----SLITPPGQKWHGLAESF---------SAYQAFLYKEDARSKKVREWCD 101 +++ L + + P ++W + S QA++ + +S V Sbjct: 71 TQIADNLHANYFAALFP-QKRWFRFEATDADSDTKIKRSIIQAYMQNKLRQSDFVNTTSK 129 Query: 102 QVTDTLFGFRERSRSGFVGCLQSFYTS-----------VVEFGTGCFYMEADVDEKGLEE 150 V D + + F + Y VV Sbjct: 130 LVNDYIQYGNCFATVDFERKVTK-YEDGDRIVNYVGPKVVRISPFDICFNPLAANFSDTP 188 Query: 151 GIRYISVPLSNV---YMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNE 207 I + L + + + + + ++ + D S + + Sbjct: 189 KIVRSVLTLGEIQRMVENDSSKGYMADIFNKMLGNRGSARGNEVDINKSEGFVADGFASL 248 Query: 208 NERFTIIHAVYPKSLTDKKK 227 + + + D Sbjct: 249 TDYYESDYVEVLTFYGDIYD 268 >gi|291334524|gb|ADD94177.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1201] gi|291334656|gb|ADD94303.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C695] gi|291334710|gb|ADD94356.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C890] gi|291336436|gb|ADD95991.1| hypothetical protein [uncultured organism MedDCM-OCT-S04-C1073] Length = 95 Score = 74.1 bits (180), Expect = 6e-11, Method: Composition-based stats. Identities = 13/94 (13%), Positives = 33/94 (35%), Gaps = 10/94 (10%) Query: 34 LYPYKNNAQLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESFSAYQAFLYKEDARS 93 + ++D + ++ L++ L ++T P W L F + Sbjct: 11 TRSKGDKRTELIFDGSPLQSVELLAASLHGMLTNPSTPWFSLR--------FKQNDMENE 62 Query: 94 KKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYT 127 + +EW + T+ ++ ++S F + Sbjct: 63 DEAKEWLEDATEVMYS--AFNKSNFQQEYLNCIM 94 >gi|296532334|ref|ZP_06895072.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] gi|296267358|gb|EFH13245.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] Length = 72 Score = 69.1 bits (167), Expect = 2e-09, Method: Composition-based stats. 
Identities = 13/72 (18%), Positives = 30/72 (41%), Gaps = 1/72 (1%) Query: 2 NQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYK-NNAQLRMWDTTGSEACIKLSSL 60 + + + I R+ +R +E + + ++D T +A +L++ Sbjct: 1 MRPTPETILPRYQAALARRRPWEGVWQECYDHVLAQTPGSGGAMLYDATAPDAAEQLAAS 60 Query: 61 LSSLITPPGQKW 72 L + +TPP +W Sbjct: 61 LLAELTPPWSRW 72 >gi|170719076|ref|YP_001784230.1| hypothetical protein HSM_0898 [Haemophilus somnus 2336] gi|168827205|gb|ACA32576.1| Haemophilus-specific protein, uncharacterized [Haemophilus somnus 2336] Length = 725 Score = 60.2 bits (144), Expect = 9e-07, Method: Composition-based stats. Identities = 40/402 (9%), Positives = 102/402 (25%), Gaps = 23/402 (5%) Query: 165 SVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD 224 S + +D++ ++ + ++ N+ +A+ Sbjct: 257 SSDMDGYLDTLRTLSGLEKASNDKRYEVWTYHGGIPVSVLEQANQSLEEGYALELTEEQK 316 Query: 225 KKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRR 284 +K + + + A FPY V ++G Sbjct: 317 SEKAEIDGVIVMTGNGKILSVNLNPLDTAEFPYSVYTCEPDVACVFGFGIPYLCRDAQEI 376 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFD----LKPGYMNIGALSREGRSLFQPVQF 340 LN + G L++ I V+ + D +KP + + F+ + Sbjct: 377 LNTAWRGMIDNGVLTI-GSQIVVNSSVLSPVDKSWEIKPNKLWRTNDRASANASFEAQRA 435 Query: 341 GNPLPYHEELNRLKESIR--SLFLLDL--FQVLDDKASRSAAESMEKTREKGAFVGPLIG 396 + L I+ F+ + ++ ++ + Sbjct: 436 FGVFNFESRQQELANIIQLAKSFMDEESGLPMIAQGEQGQVTPTLGGMSMLMNAANAVRR 495 Query: 397 GLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQ 456 E+ + + + + + + TS L + Q Sbjct: 496 RQVKEWDDQVTKPLIRRFYEYNMAMD-DDPNIKGDMQVVARGTSAL--------LVKETQ 546 Query: 457 GVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRR 515 + P D ++ + + + A ++ + E QQ E Sbjct: 547 TAQIIDIFQKFGNHPQLSYAFDWYDGAKTLMQSMSMGAKTMLLSREDYEQKLQQIEQANA 606 Query: 516 VMEE----QHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMEN 553 + Q Q+Q + + M+ + + + Sbjct: 607 TQPQDPEILKSQMQMQLAQKKQQHEMQLEQMKLQHAMQIEQM 648 >gi|291335814|gb|ADD95414.1| hypothetical protein [uncultured phage MedDCM-OCT-S05-C849] Length = 55 Score = 59.8 bits (143), Expect = 
1e-06, Method: Composition-based stats. Identities = 7/63 (11%), Positives = 16/63 (25%), Gaps = 10/63 (15%) Query: 110 FRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQ 169 + S + ++ G +M D + PL+ + + Sbjct: 1 MEYIAASNDRVAIHQALKHLIVGGNALIFMHKDG----------LKTFPLTRYVVERDGD 50 Query: 170 NVV 172 V Sbjct: 51 GNV 53 >gi|113461527|ref|YP_719596.1| hypothetical protein HS_1384 [Haemophilus somnus 129PT] gi|112823570|gb|ABI25659.1| hemophilus-specific protein, uncharacterized [Haemophilus somnus 129PT] Length = 688 Score = 56.4 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 46/418 (11%), Positives = 104/418 (24%), Gaps = 41/418 (9%) Query: 165 SVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD 224 S + +D++ ++ + ++ N+ +A+ Sbjct: 220 SSDMDGYLDTLRTLSGLEKASNDKRYEVWTYHGGIPVSVLEQANQSLEEGYALELTEEQK 279 Query: 225 KKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRR 284 +K + + + A FPY V ++G Sbjct: 280 SEKAEIDGVIVMTGNGKILSVNLNPLDTAEFPYSVYTCEPDVACVFGFGIPYLCRDAQEI 339 Query: 285 LNETVNELAQFGRLSLHPPTIAVSEAKQRNFD----LKPGYMNIGALSREGRSLFQPVQF 340 LN + G L++ I V+ + D +KP + + F+ + Sbjct: 340 LNTAWRGMIDNGVLTI-GSQIVVNSSVLSPVDKSWEIKPNKLWRTNDRASANASFEAQRA 398 Query: 341 GNPLPYHEELNRLKESIR--SLFLLDL--FQVLDDKASRSAAESMEKTREKGAFVGPLIG 396 + L I+ F+ + ++ ++ + Sbjct: 399 FGVFNFESRQQELANIIQLAKSFMDEESGLPMIAQGEQGQVTPTLGGMSMLMNAANAVRR 458 Query: 397 GLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQ 456 E+ + + + + + TS L + Q Sbjct: 459 RQVKEWDDQVTKPLIRRFYEYNMAMN-DDPNIKGDMQVVARGTSAL--------LVKETQ 509 Query: 457 GVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAV-LIRDTAEVEDIRQQREVQRR 515 + P D ++ + + + A ++ + E QQ E Sbjct: 510 TAQIIDIFQKFGNHPQLSYAFDWYDGAKTLMQSMSMGAKTMLLSREDYEQKLQQIEQANA 569 Query: 516 VMEE---------------------QHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMME 552 + L+Q Q + I + EK+L MME Sbjct: 570 TQPQDPEILKSQMQMQLAQQKQQHEMQLEQMKLQHAMQI-EQMKVAIKEKELEVKMME 626 >gi|291334412|gb|ADD94067.1| hypothetical protein [uncultured phage MedDCM-OCT-S04-C1035] Length = 
64 Score = 56.4 bits (134), Expect = 1e-05, Method: Composition-based stats. Identities = 13/44 (29%), Positives = 21/44 (47%), Gaps = 1/44 (2%) Query: 1 MNQ-RSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQL 43 M Q AK + RF+ LK+QR +E+ ++ P K + Sbjct: 1 MAQSEKAKILLSRFDRLKSQRQNWESHWQEVADYMQPRKADVTK 44 >gi|294083946|ref|YP_003550703.1| putative portal protein [Candidatus Puniceispirillum marinum IMCC1322] gi|292663518|gb|ADE38619.1| putative portal protein [Candidatus Puniceispirillum marinum IMCC1322] Length = 697 Score = 55.2 bits (131), Expect = 3e-05, Method: Composition-based stats. Identities = 33/319 (10%), Positives = 86/319 (26%), Gaps = 18/319 (5%) Query: 237 KFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFG 296 V+ N + +++ P++ YG + A + T + Sbjct: 323 HKVTKAGNVLLDIEEVKRRPFVTFCPLPIPHAFYGSNFAEKLCATQNARTVLTRSILDHA 382 Query: 297 RLSLHPPTIAVSEAKQRNFDLKP----GYMNIGALSREGRSLFQPVQFGNPLPYHEELNR 352 ++ +P + V +L G +N+ P+ + Sbjct: 383 MITNNPRYMVVKGGLSNPRELIDNRVGGLVNVSRPDAISAMPQAPLNPFVFQTLQQLDQD 442 Query: 353 LKES--IRSLFLLDLFQVLDDKASRSAAESMEKTREKGA-FVGPLIGGLQSEFIGAMISR 409 L+++ + L + + S + E + ++ + + Sbjct: 443 LEDNTGVSRLSQGLNKDAISKQNSAAMVEQLATMSQQRQKILARHFAQFVKSLFHEIYRL 502 Query: 410 ELDILDSQGNL----------PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVN 459 ++ D Q + P + LK+ Y + Q+ ++ + Sbjct: 503 VVENEDQQKIVEISGAYVEVDPRSWSDKRDVMVELKLGYGEQDAEAQKMLALHTLFSQDP 562 Query: 460 TVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEE 519 + + + + + + P +L + Q + + ME Sbjct: 563 NIQPMYGMENRFAMLKKILEQQGILNVEEFLTPPQMLQPPQPDPAAEMQAQM-AMKQMEL 621 Query: 520 QHLQQQLQQTSQDIGAKAA 538 Q Q + +T A Sbjct: 622 QERQTAVAETKATTDQAVA 640 >gi|167583563|ref|YP_001671753.1| portal protein [Enterobacteria phage phiEco32] gi|164375401|gb|ABY52809.1| portal protein [Enterobacteria phage phiEco32] Length = 747 Score = 54.1 bits (128), Expect = 7e-05, Method: Composition-based stats. 
Identities = 45/419 (10%), Positives = 102/419 (24%), Gaps = 26/419 (6%) Query: 137 FYMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLS 196 +++ + + +++ + ++T T+D S Sbjct: 208 IFVDEHATSFADAQYFCHRVRRSKEDLVAMGFPKDEIEAFNDWTDTMDTTQSTVAWSRTD 267 Query: 197 SKMKSALARNEN-ERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATF 255 + + E + VY + D NK V +++ Sbjct: 268 WRQDIDADIGTDTEDIASMVWVYEHYIRTGVLD-KNKESKLYQVIQAGEHILHTEEVTHI 326 Query: 256 PYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNF 315 P++ YG+S V + + A+ A R Sbjct: 327 PFVTFCPYPIPGSFYGQSVYDITKDIQDLRTALVRGYIDNVNNANYGRYKALVGAYDRRS 386 Query: 316 DLKPGYMNIGALSREGRSLFQP---VQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDK 372 L + + R+ P + G + L K Sbjct: 387 LLDNRPGGVVEMERQDAIDLFPYHNLPQGIDGLLGMSEELKETRTGVTKLGMGINPDVFK 446 Query: 373 ASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP----------- 421 + A + + + + ++ ++ G +P Sbjct: 447 NDNAYATVGLMMNAAQNRLRMVCRNIAHNGMVELMRGIYSLIRENGEVPIEVQTPRGMVQ 506 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 +L V SP K ++A+ + S Q + +L G DR Sbjct: 507 VNPKQLPARHNLQVVVAISPNEKAERAQKLISLKQLIAADAQLAPLFGLEQ-------DR 559 Query: 482 VSR-FSLWATNTPA--VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKA 537 + + + ++ + + +Q +SQ + A A Sbjct: 560 YMTAQIFELMGIKDTHKYLLPLEQYQPPEPSPMEILQLEMTKAQVENVQASSQKMIADA 618 >gi|291334599|gb|ADD94249.1| hypothetical protein Daci_1943 [uncultured phage MedDCM-OCT-S04-C136] Length = 741 Score = 53.7 bits (127), Expect = 8e-05, Method: Composition-based stats. 
Identities = 48/455 (10%), Positives = 121/455 (26%), Gaps = 46/455 (10%) Query: 76 AESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTG 135 + Y+ E + ++ + V + +F E ++ F L+ + V+ Sbjct: 141 KVDYETYENLSIVEKEALQDTKDEIETVEEEVFE-DESAKEKFEEVLKQYEMQGVDISQV 199 Query: 136 CF----YMEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDS--VYREFTFTVDQIVSK 189 + ++ S+P + + + + D+ V + T +V+ Sbjct: 200 QVPNFNLYNCKIKRIKKTGRVKIESIPPEEFLIDRSAKTIEDADFVSHKVLMTRSDLVAM 259 Query: 190 -WGDKVLSSKMKSALARNENERFTIIHAVYP-----KSLTDKKKDKGNKGFHSKFVSVDE 243 + + KS L +E + V + T +K + + D Sbjct: 260 GYPQDEVDELPKSDLDIYNDEETVRLADVDDYRISSSTDTSTEKVLVYESYVKYDYDEDG 319 Query: 244 --------------NRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 + + P++ YGRS + + + Sbjct: 320 IAELRKIVSAGADGHHILSNMPCDSVPFVTITPIPMPHRFYGRSISELVEDVQLMKSTVM 379 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG-------N 342 +L L+ + + + L I + + QP+Q Sbjct: 380 RQLLDNMYLTNNNRVAVMDGMVNMDDLLTTRPGGIVRTKQPPNQVMQPLQAQPISQQAFP 439 Query: 343 PLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQS-- 400 L Y + + + + L+ K + M++T+ + + + Sbjct: 440 LLSYLDSVREGRTGVSKEAQGLSPDTLNAKTATGVNALMQQTQMRSELIARVFAETGVKD 499 Query: 401 ------EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASA 454 E + +E I+ S +P + L + + Sbjct: 500 LFKKIFELMVKYQDKEKIIMMSNQYIPVRPTEWKDR---FNISIVVGLGTGSKEQQTIML 556 Query: 455 LQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWA 489 + ++ G M ++ + Sbjct: 557 NSILERQLQAFQIQGGKE-MPMVNLKNMYNTLTKM 590 >gi|313113989|ref|ZP_07799544.1| hypothetical protein HMPREF9436_01396 [Faecalibacterium cf. prausnitzii KLE1255] gi|310623691|gb|EFQ07091.1| hypothetical protein HMPREF9436_01396 [Faecalibacterium cf. prausnitzii KLE1255] Length = 649 Score = 52.5 bits (124), Expect = 2e-04, Method: Composition-based stats. 
Identities = 40/396 (10%), Positives = 114/396 (28%), Gaps = 27/396 (6%) Query: 79 FSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFY 138 + + + +K + ++ T+ + + + ++ GTG Sbjct: 106 DNYPEPNVLPRAEDDEKTAKALSKILPTV-----LEQCDYETVYSDTWWRKLKTGTGVKG 160 Query: 139 MEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDS--VYREFTFTVDQIVSKWGDKVLS 196 + D + +G I SV L +Y +++ D+ ++ DQ+ ++ Sbjct: 161 VFWDPEARGGLGEICIRSVNLLMLYWEPGVEDIQDTPHLFSLSLMDNDQLEGRYPQMAGH 220 Query: 197 SKMKSALARNENERFTIIHAVYPKSLTDKKK---------DKGNKGFHSKFVSVDENRFF 247 + +A+ ++ KK + + + + Sbjct: 221 TGSSMDVAKYIHDDSIDTGDKSVVVDWYYKKALEGGQTVLHYCKYCNGVVLYASENDPQY 280 Query: 248 EEKQIAT---FPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPT 304 ++ +P++ D G T ++E + + + +L+ Sbjct: 281 AQRGFYDHGKYPFVFDPLFREEDSPAGFGYIDVMKDTQTAIDEMNHAMDENVKLAAKARY 340 Query: 305 IAVSEAKQRNFDLK-PGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLK-ESIRSLFL 362 + A +L G + + R F+P+Q + ++ + Sbjct: 341 VLSDTAGVNEEELADFGKDIVHVVGRLTDDSFRPLQTNVLSGNCISYRDARVSELKEISG 400 Query: 363 LDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPE 422 + +AA ++ +E G+ + + ++++ + Sbjct: 401 NRDVSQGGTTSGLTAASAIAALQEAGSKLSRDMLKSAYRTFAKECYLVIELMRQFY---D 457 Query: 423 CEGADNPPVSLLKVEYT---SPLFKYQQAESVASAL 455 E VEY + + + +V Sbjct: 458 EERVYRITGESGGVEYVPFSNAMLQAVPGGNVGGVQ 493 >gi|157828579|ref|YP_001494821.1| hypothetical protein A1G_03995 [Rickettsia rickettsii str. 'Sheila Smith'] gi|157801060|gb|ABV76313.1| hypothetical protein A1G_03995 [Rickettsia rickettsii str. 'Sheila Smith'] Length = 111 Score = 51.7 bits (122), Expect = 3e-04, Method: Composition-based stats. 
Identities = 29/112 (25%), Positives = 52/112 (46%), Gaps = 9/112 (8%) Query: 175 VYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKG- 233 +YR F+ + +KW D K LA+N +E I+H V P+S + K KG Sbjct: 1 MYRLFSMPIKAASAKWPDFA---DFKERLAKNPDETVKILHIVSPQSENQRGKGGKGKGL 57 Query: 234 -----FHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALP 280 + S+++ + E + + + FP+ V + ++YG +PA A+ Sbjct: 58 MTTLAYSSEYIYLSEQKIISQSGYSYFPFFVTLWIKGEGQVYGYAPAHHAIS 109 >gi|329663665|ref|NP_001039712.2| laminin subunit beta-2 [Bos taurus] Length = 1802 Score = 51.7 bits (122), Expect = 3e-04, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 63/207 (30%), Gaps = 22/207 (10%) Query: 354 KESIRSLF-LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEF--IGAMISRE 410 + ++ + R A E+ ++ + G ++ + +I Sbjct: 1464 QAELQRALAEGGGILSQVAETRRQAGEAQQRAQAALDKAHASRGQVEQANQELRQLIQNV 1523 Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV-VELGVKTG 469 D L +G P+ V L + SP Q A +A ++ + V L G Sbjct: 1524 KDFLSQEGADPDSIEMVATRVLELSI-PASPEQIQQLAGEIAERVRSLADVDTILARTVG 1582 Query: 470 DPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQT 529 D R A R AE E +Q+ + E+ + Q Sbjct: 1583 D------------VRR-AEQLLNDARRARSRAEGE---KQKAETVQAALEEAQRAQGAAQ 1626 Query: 530 SQDIGAKAAGRAMEKKLTHDMMENSYG 556 GA + E+ L H + E G Sbjct: 1627 GAIQGAVVDTQDTEQTL-HQVQERMAG 1652 >gi|297459157|ref|XP_001790228.2| PREDICTED: laminin, beta 2 [Bos taurus] Length = 1803 Score = 51.7 bits (122), Expect = 3e-04, Method: Composition-based stats. 
Identities = 36/207 (17%), Positives = 63/207 (30%), Gaps = 22/207 (10%) Query: 354 KESIRSLF-LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEF--IGAMISRE 410 + ++ + R A E+ ++ + G ++ + +I Sbjct: 1465 QAELQRALAEGGGILSQVAETRRQAGEAQQRAQAALDKAHASRGQVEQANQELRQLIQNV 1524 Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV-VELGVKTG 469 D L +G P+ V L + SP Q A +A ++ + V L G Sbjct: 1525 KDFLSQEGADPDSIEMVATRVLELSI-PASPEQIQQLAGEIAERVRSLADVDTILARTVG 1583 Query: 470 DPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQT 529 D R A R AE E +Q+ + E+ + Q Sbjct: 1584 D------------VRR-AEQLLNDARRARSRAEGE---KQKAETVQAALEEAQRAQGAAQ 1627 Query: 530 SQDIGAKAAGRAMEKKLTHDMMENSYG 556 GA + E+ L H + E G Sbjct: 1628 GAIQGAVVDTQDTEQTL-HQVQERMAG 1653 >gi|297488687|ref|XP_002697087.1| PREDICTED: laminin, beta 2 (laminin S) [Bos taurus] gi|296474911|gb|DAA17026.1| laminin, beta 2 (laminin S) [Bos taurus] Length = 1802 Score = 51.7 bits (122), Expect = 3e-04, Method: Composition-based stats. Identities = 36/207 (17%), Positives = 63/207 (30%), Gaps = 22/207 (10%) Query: 354 KESIRSLF-LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEF--IGAMISRE 410 + ++ + R A E+ ++ + G ++ + +I Sbjct: 1464 QAELQRALAEGGGILSQVAETRRQAGEAQQRAQAALDKAHASRGQVEQANQELRQLIQNV 1523 Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV-VELGVKTG 469 D L +G P+ V L + SP Q A +A ++ + V L G Sbjct: 1524 KDFLSQEGADPDSIEMVATRVLELSI-PASPEQIQQLAGEIAERVRSLADVDTILARTVG 1582 Query: 470 DPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQT 529 D R A R AE E +Q+ + E+ + Q Sbjct: 1583 D------------VRR-AEQLLNDARRARSRAEGE---KQKAETVQAALEEAQRAQGAAQ 1626 Query: 530 SQDIGAKAAGRAMEKKLTHDMMENSYG 556 GA + E+ L H + E G Sbjct: 1627 GAIQGAVVDTQDTEQTL-HQVQERMAG 1652 >gi|21234402|ref|NP_640321.1| hypothetical protein VpV262p60 [Vibrio phage VpV262] gi|21064915|gb|AAM28399.1| hypothetical protein [Vibrio phage VpV262] Length = 599 Score = 51.4 bits (121), Expect = 4e-04, Method: Composition-based stats. 
Identities = 53/599 (8%), Positives = 159/599 (26%), Gaps = 94/599 (15%) Query: 8 DIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLR---MW-DTTGSEACIKLSSLLSS 63 ++ F ++N R + + +EL ++ + ++T KL+ L Sbjct: 24 ELVVLFTNMENARAQKDREDKELMDYIDATDTRKTSNSKLPFKNSTTI---NKLA-HLHL 79 Query: 64 LITP-------PGQKWHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRS 116 +IT P + W + +A +++ + + S Sbjct: 80 MITTSYMEHLLPNRNWVDFVGFDN------DSVNAEKREIARS--------YVRGKVEAS 125 Query: 117 GFVGCLQSFYTSVVEFGTGC--------FYMEADVDEKGLEEGIRYISVPLSNVYMSVNH 168 G ++ G + A+ G + S+V+ V Sbjct: 126 NLEGVIERMVDDFAVRGFCVAHTRHVKRMTVTAENQVIKNYSGTVTERLSPSDVFWDVTA 185 Query: 169 QNV------VDSVYREFTFTVDQIVSKWGDKVLSS------------------------- 197 ++ + +Y + + + + Sbjct: 186 DSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLREERRTIREALADGYNGRRKF 245 Query: 198 -KMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEK------ 250 + + + D ++ ++ +++ ++V + + K Sbjct: 246 DSLHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTW 305 Query: 251 -QIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSE 309 V ++ + P +L++ N LHP V + Sbjct: 306 DGSQNLHIAVYEFQKDT--LCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHPSLKKVGD 363 Query: 310 AKQRNFDLKPGYMNIGALSREGRSLFQPVQF-GNPLPYHEELNRLKESIRSLFLLDLFQV 368 +++ P ++ + + + + P + L +++ Sbjct: 364 VREKGMRGGPNHVFEVEETGDVQYMTPPAEVLQPDNQLSITLQLMEDL---SGAPKESIG 420 Query: 369 LDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADN 428 ++ E + + + + E + +++ L+ + + + N Sbjct: 421 QRTAGEKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKTFN 480 Query: 429 PPVS-----LLKVEYTSPLFK--YQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 + + + + + Q A A + + + + HM + Sbjct: 481 SELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILGGPLGAALAPHMSRTK 540 Query: 482 ---VSRFSLW--ATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGA 535 + A I + + R ++ ++ E Q+++ + D G Sbjct: 541 LFNAVEYLGDLDAYGIFTFGIGVQEDQQLARMAQKSTQQTEETALTQEEVGGPTTDTGQ 599 >gi|165933293|ref|YP_001650082.1| hypothetical protein RrIowa_0838 [Rickettsia rickettsii str. 
Iowa] gi|165908380|gb|ABY72676.1| hypothetical protein RrIowa_0838 [Rickettsia rickettsii str. Iowa] Length = 111 Score = 51.0 bits (120), Expect = 6e-04, Method: Composition-based stats. Identities = 29/112 (25%), Positives = 51/112 (45%), Gaps = 9/112 (8%) Query: 175 VYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKG- 233 +YR F+ + +KW D K LA+N +E I+H V P+S + K KG Sbjct: 1 MYRLFSMPIKAASAKWPDFA---DFKERLAKNPDETVKILHIVSPQSENQRGKGGKGKGL 57 Query: 234 -----FHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALP 280 + S+++ + E + + FP+ V + ++YG +PA A+ Sbjct: 58 MTTLAYSSEYIYLSEQKIISQSGYLYFPFFVTLWIKGEGQVYGYAPAHHAIS 109 >gi|157828580|ref|YP_001494822.1| hypothetical protein A1G_04000 [Rickettsia rickettsii str. 'Sheila Smith'] gi|157801061|gb|ABV76314.1| hypothetical protein A1G_04000 [Rickettsia rickettsii str. 'Sheila Smith'] Length = 59 Score = 50.6 bits (119), Expect = 7e-04, Method: Composition-based stats. Identities = 10/42 (23%), Positives = 17/42 (40%) Query: 101 DQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEAD 142 + + S F + F+ ++ FGT FY+E D Sbjct: 4 QMIEKAIMDIFNNPASNFYNQIHQFFLNLAAFGTAIFYVEED 45 >gi|319776214|ref|YP_004138702.1| hypothetical protein HICON_18250 [Haemophilus influenzae F3047] gi|317450805|emb|CBY87027.1| Putative uncharacterized protein [Haemophilus influenzae F3047] Length = 731 Score = 50.2 bits (118), Expect = 8e-04, Method: Composition-based stats. 
Identities = 47/421 (11%), Positives = 107/421 (25%), Gaps = 45/421 (10%) Query: 165 SVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTD 224 S + VD++ +++ + + NE D Sbjct: 260 SNDMDGYVDTLRTLSGLETQSKDNRYELWTYHGGIPLNVLSGANELLG--EDNKLNIPDD 317 Query: 225 KKKDKGNKGFHSKFVSVDENRFFEEK----QIATFPYIVGRYRVRADEIYGRSPAMEALP 280 ++ N V + A FPY V ++G Sbjct: 318 EESRAANLEIEGVIVMAGNGKILSVNLNPLDTAEFPYSVYTCEPDVCCLFGFGIPYLCRD 377 Query: 281 TIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFD----LKPGYMNIGALSREGRSLFQ 336 LN + G L + P V+ + D L P + + F+ Sbjct: 378 AQEILNTAWRGMIDNGILGI-GPQAVVNSSVLTPVDGNWELAPYKLWKTNDRATVNAQFE 436 Query: 337 PVQFGNPLPYHEELNRLKESIR--SLFLLDL--FQVLDDKASRSAAESMEKTREKGAFVG 392 + L I+ F+ + ++ ++ Sbjct: 437 AQRAFGIFDIGSRQQELANIIQLSKSFMDEESGLPMIAQGEQGQVTPTLGGMSMLM-NAA 495 Query: 393 PLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVA 452 + Q + +++ L + N+ E + + TS L + Sbjct: 496 NAVRRRQVKEWDDSVTKPLIRRFYEYNMNMSEDSSIKGDMQVVARGTSAL--------LV 547 Query: 453 SALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNT-PAVLIRDTAEVEDIRQQRE 511 Q + P M D ++ + + + ++ E E Q+ + Sbjct: 548 KETQTAQIIDIFQKFGQHPQLMYAFDWYDGAKTLMQSMSMGTQTMLIPREEYEQKLQEIQ 607 Query: 512 VQRRVMEE--------------------QHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMM 551 + + + +Q++ +Q + + EK+L ++ Sbjct: 608 EAQAQQPQDPEILKVQMQMQIAQQKQQHEMQLEQMRTQAQLQIEQMKVQIREKELEIKVL 667 Query: 552 E 552 E Sbjct: 668 E 668 >gi|56551276|ref|YP_162115.1| hypothetical protein ZMO0380 [Zymomonas mobilis subsp. mobilis ZM4] gi|56542850|gb|AAV89004.1| hypothetical protein ZMO0380 [Zymomonas mobilis subsp. mobilis ZM4] Length = 729 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. 
Identities = 42/342 (12%), Positives = 95/342 (27%), Gaps = 36/342 (10%) Query: 244 NRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPP 303 + +++ P++V RA + G S A + + R + + + + P Sbjct: 317 DVLLSIEEVDEAPFVVWTPFPRAHRMIGNSLAEKVMDIQRVKSVLMRQALDGVYQTNAPR 376 Query: 304 TIAVSEAKQRN-----FDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKE--- 355 + + ++PG + L L E + +E Sbjct: 377 MAVNVDGLTEDTFDDLLTIRPGAIVRYRGGIPPTPLNAGFDIQKSLGMIEYMQSAQESRT 436 Query: 356 ---SIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELD 412 + D + A+ + +G L + + MI+ Sbjct: 437 GITRLNQGLDADSLNKTATGQALLQAQGQQMEEYVARNFAQSLGRLFQKKLWLMIASGDP 496 Query: 413 ILDS-QGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDP 471 + +G + A PP ++V L ++ + +A Q ++ + Sbjct: 497 MAIKVEGLYKTVDPALWPPDMRVRVTV--GLGSGRKDQRLAYRQQLLSIQQQALAVGLTG 554 Query: 472 SCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQR-------EVQR---------- 514 S + + + R P + D Q + Sbjct: 555 SKQIYNNIAAMIRDCG--LGNPTDYLIDPDIRLAGNQAENPVNNNSAAAQNSSGSVGNNP 612 Query: 515 ---RVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMEN 553 + Q + Q Q+ + D A A++K+ T + Sbjct: 613 DYTELKARQDINLQGQKMAADQERSMAEFALKKQETEAKLAM 654 >gi|241760934|ref|ZP_04759023.1| hypothetical protein ZmobDRAFT_0099 [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241374553|gb|EER64014.1| hypothetical protein ZmobDRAFT_0099 [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 729 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. 
Identities = 41/341 (12%), Positives = 93/341 (27%), Gaps = 34/341 (9%) Query: 244 NRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPP 303 + +++ P++V RA + G S A + + R + + + + P Sbjct: 317 DVLLSIEEVDEAPFVVWTPFPRAHRMIGNSLAEKVMDIQRVKSVLMRQALDGVYQTNAPR 376 Query: 304 TIAVSEAKQRN-----FDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKE--- 355 + + ++PG + L L E + +E Sbjct: 377 MAVNVDGLTEDTFDDLLTIRPGAIVRYRGGIPPTPLNAGFDIQKSLGMIEYMQSAQESRT 436 Query: 356 ---SIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELD 412 + D + A+ + +G L + + MI+ Sbjct: 437 GITRLNQGLDADSLNKTATGQALLQAQGQQMEEYVARNFAQSLGRLFQKKLWLMIASGDP 496 Query: 413 ILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPS 472 + L + P ++V T L ++ + +A Q ++ + S Sbjct: 497 MAIKVEGLYKTVDPALWPP-DMRVRVTVGLGSGRKDQRLAYRQQLLSIQQQALAVGLTGS 555 Query: 473 CMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQR-------EVQR----------- 514 + + + R P + D Q + Sbjct: 556 KQIYNNIAAMIRDCG--LGNPTDYLIDPDIRLAGNQAENPVNNNSAAAQNSSGSVGNNPD 613 Query: 515 --RVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMEN 553 + Q + Q Q+ + D A A++K+ T + Sbjct: 614 YTELKARQDINLQGQKMAADQERSMAEFALKKQETEAKLAM 654 >gi|260753098|ref|YP_003225991.1| hypothetical protein Za10_0861 [Zymomonas mobilis subsp. mobilis NCIMB 11163] gi|258552461|gb|ACV75407.1| hypothetical protein Za10_0861 [Zymomonas mobilis subsp. mobilis NCIMB 11163] Length = 729 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. 
Identities = 41/341 (12%), Positives = 93/341 (27%), Gaps = 34/341 (9%) Query: 244 NRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPP 303 + +++ P++V RA + G S A + + R + + + + P Sbjct: 317 DVLLSIEEVDEAPFVVWTPFPRAHRMIGNSLAEKVMDIQRVKSVLMRQALDGVYQTNAPR 376 Query: 304 TIAVSEAKQRN-----FDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKE--- 355 + + ++PG + L L E + +E Sbjct: 377 MAVNVDGLTEDTFDDLLTIRPGAIVRYRGGIPPTPLNAGFDIQKSLGMIEYMQSAQESRT 436 Query: 356 ---SIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELD 412 + D + A+ + +G L + + MI+ Sbjct: 437 GITRLNQGLDADSLNKTATGQALLQAQGQQMEEYVARNFAQSLGRLFQKKLWLMIASGDP 496 Query: 413 ILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPS 472 + L + P ++V T L ++ + +A Q ++ + S Sbjct: 497 MAIKVEGLYKTVDPALWPP-DMRVRVTVGLGSGRKDQRLAYRQQLLSIQQQALAVGLTGS 555 Query: 473 CMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQR-------EVQR----------- 514 + + + R P + D Q + Sbjct: 556 KQIYNNIAAMIRDCG--LGNPTDYLIDPDIRLAGNQAENPVNNNSAAAQNSSGSVGNNPD 613 Query: 515 --RVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMEN 553 + Q + Q Q+ + D A A++K+ T + Sbjct: 614 YTELKARQDINLQGQKMAADQERSMAEFALKKQETEAKLAM 654 >gi|157828622|ref|YP_001494864.1| hypothetical protein A1G_04250 [Rickettsia rickettsii str. 'Sheila Smith'] gi|157801103|gb|ABV76356.1| hypothetical protein A1G_04250 [Rickettsia rickettsii str. 'Sheila Smith'] Length = 56 Score = 48.3 bits (113), Expect = 0.003, Method: Composition-based stats. Identities = 11/55 (20%), Positives = 25/55 (45%), Gaps = 1/55 (1%) Query: 1 MNQRSAKDIQDRFNYLKNQRGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACI 55 M+ + F+ LK++R + N +EL ++ P ++D+T + + Sbjct: 1 MHDNELNKKIEYFDNLKSKREKWNQRWDELKRYVCPQ-TERNKVIFDSTSIGSLV 54 >gi|316995429|gb|ADU79210.1| hypothetical protein EcP1_gp59 [Enterobacter phage EcP1] Length = 719 Score = 47.5 bits (111), Expect = 0.006, Method: Composition-based stats. 
Identities = 55/456 (12%), Positives = 130/456 (28%), Gaps = 76/456 (16%) Query: 139 MEADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDS---VYREFTFTVDQIVSKWGDKVL 195 ME + K L+ + + NVY+ + Q +D V F ++ ++ K L Sbjct: 220 MEKVTETKVLQNQPYVEVLNIENVYIDPSCQGDMDKATFVIHRFETSIAELKKSGNYKNL 279 Query: 196 SSKMKSALAR-----NENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVS---------- 240 +++E T Y S +K+ + + + Sbjct: 280 DKLTVKDSDELIPSISDDEIKTSTPTDYNISGKSRKRFNVTEYWGYYDIDDSGVLTPIVV 339 Query: 241 ---VDENRFFEEKQIA--TFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQF 295 D E P++V Y +YG A + + + Sbjct: 340 AYVGDVKIRCSENPYPHGKPPFVVIPYLPMDSSVYGEPDAELIYDNQAIIGASTRAMIDL 399 Query: 296 GRLSLHP---------------PTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQF 340 S + +A +A+ + P I ++ P Sbjct: 400 VARSANGQNIIRKDVFDPVNYRKFMAGEDAQSNPLN-VPLAEAIRTVTTPEVPSIIPGLI 458 Query: 341 GNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPL---IGG 397 E L+ + ++ + S ++ + +G Sbjct: 459 QQQNNEAESLSGV-KAFSEGISSGSLGDVAAGIRGVLDASSKREMSILRRLKKGMVDLGR 517 Query: 398 LQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQG 457 + ++ E I + + LKV+ ++P + Q++ +A +Q Sbjct: 518 MIIAMNQEFLTDEEIIRITNDAFVHVKREALAGDFDLKVDISTPEAEQQKSNQLAFLVQT 577 Query: 458 VNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQ-------R 510 + + +++ L + +V + ++ Sbjct: 578 IGNTIPF----------------EITKVLLTEI----SRLNKMPDVAQMIKEFEPTPDPL 617 Query: 511 EVQRRVMEEQHLQQQLQQTSQ------DIGAKAAGR 540 E Q++ +E LQQ++++++ G+ A + Sbjct: 618 EEQKKQLELAKLQQEIKESAAREAYYLQRGSLATSQ 653 >gi|209548748|ref|YP_002280665.1| hypothetical protein Rleg2_1145 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209534504|gb|ACI54439.1| hypothetical protein Rleg2_1145 [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 612 Score = 47.1 bits (110), Expect = 0.007, Method: Composition-based stats. 
Identities = 48/507 (9%), Positives = 110/507 (21%), Gaps = 71/507 (14%) Query: 72 WHGLAESFSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSG------FVGCLQSF 125 WH + VR+ +T+ F +S F + Sbjct: 133 WHTFEVDDGVLGERRPSPFDKVWDVRDRTPYLTNQGFSAEMIWKSREEWKLIFEDKAEEI 192 Query: 126 YTSVV------EFGTGCFYMEAD----------VDEKGLEEGIRYISVPLSNVYMSVNHQ 169 S++ G+G + + I++ + Y+ + Sbjct: 193 -DSLINAGAPLVGGSGYSLLGERLRLVNGGSYYDKQFDELCVIKFDYRVAAKFYVYTSKD 251 Query: 170 NVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDK 229 V + + K E+ ++ Y D Sbjct: 252 GKVFQTFDR---------------KEAEKNSQRGEEISEEKGYKVYTCYFSGDVM--LDW 294 Query: 230 GNKGFHSKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETV 289 + P + R + YG A N T+ Sbjct: 295 FESPYQLN---------PARGDFVDTPIVAFR-EELTGKPYGI--IRAARDPQNLYNRTL 342 Query: 290 NELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEE 349 + + + + + I ++ + F+ E Sbjct: 343 SLIYWHSTSNRVVMDKGAVDKISKVATEIARADGIIEVNPGKKFDFE-NNTQRIQHLREI 401 Query: 350 LNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGL------QSEFI 403 L ++ + + + ++S + + + ++ + Sbjct: 402 LQVADMDVQKALGIYDEMMGVETNAKSGIAIQRRQAASQTTIALMFDRFLDAKYRWADKL 461 Query: 404 GAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESV------ASALQG 457 ++ + + L K + V S + Sbjct: 462 LWLVRATF---TDKNVFNVTDDDGVVKSVSLNEAVKGADGKDVTRQDVRVGTYDVSIEET 518 Query: 458 VNTVVELGVKTGD--PSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRR 515 ++ + + ++ + L P + EVE QQR Sbjct: 519 MDVSSQNEESRIKMFELFTAGITPEQFTPGLLDIAGVPKNA-KLRKEVEASVQQRLANEA 577 Query: 516 VMEEQHLQQQLQQTSQDIGAKAAGRAM 542 M EQ + G A A Sbjct: 578 QMREQMQKLGGGPQGITQGPAGAQPAA 604 >gi|153212119|ref|ZP_01947936.1| hypothetical protein A55_1887 [Vibrio cholerae 1587] gi|124116915|gb|EAY35735.1| hypothetical protein A55_1887 [Vibrio cholerae 1587] Length = 740 Score = 47.1 bits (110), Expect = 0.007, Method: Composition-based stats. 
Identities = 38/391 (9%), Positives = 93/391 (23%), Gaps = 48/391 (12%) Query: 201 SALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQI-------- 252 ++ H P+ + + + F S VD + Sbjct: 297 QPTYKDRRYEIWEYHGPIPREVLQEAGLLTEEEFESTPSEVDGVIVMSGCGLILKAGINP 356 Query: 253 ---ATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSE 309 +PY V I+G LN + G ++ + Sbjct: 357 FDTEEWPYSVYCAEEDVSCIFGYGIPHLCSDAQSILNTAWRAMIDNGVATVGDQIVVNQS 416 Query: 310 AKQ---RNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIR--SLFLLD 364 A ++ P + + F+ + I F+ + Sbjct: 417 ALMPADNDWSFSPLKVWKTTDKASVSAQFEAQKAFGVFSLQNRQAEYANIISMAKAFMDE 476 Query: 365 L--FQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPE 422 ++ ++ + Q + +++ L N+ Sbjct: 477 ESGLPMISQGEQGQVTPTLGGMSMLM-NAANAVRRRQVKEWDDSVTKPLIRRFYAWNMQF 535 Query: 423 CEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMD-HMDTDR 481 + + + T+ L Q + + L + + D + + Sbjct: 536 SKKNEIKGDMQIIARGTTAL-----LVKETQTAQLIELMDRLSSRPDAEAAFDFYFVYES 590 Query: 482 VSRFSLWATNTPAVLIRDTAEVEDIRQQREV--------------------QRRVMEEQH 521 + + + ++R E E +Q + QR M+ Sbjct: 591 LVKSM--SMGA-RSVLRPREEYEAKLKQIQEAQQNQPQDPQLVIKEMEIALQREKMQHDE 647 Query: 522 LQQQLQQTSQDIGAKAAGRAMEKKLTHDMME 552 + + +A E ++ M+E Sbjct: 648 TLAKFSAAMKQQETQAMLYREEMRMQQAMLE 678 >gi|149018527|gb|EDL77168.1| laminin, beta 2 [Rattus norvegicus] Length = 1801 Score = 46.7 bits (109), Expect = 0.010, Method: Composition-based stats. 
Identities = 32/206 (15%), Positives = 63/206 (30%), Gaps = 23/206 (11%) Query: 354 KESIRSLFLLDLFQVLD-DKASRSAAESMEKTREKGAFVGPLIGGLQSEF--IGAMISRE 410 + ++ + + + R A E+ ++ + G ++ + +I Sbjct: 1463 QAELQRALVEGGGILSRVSETRRQAEEAQQRAQAALDKANASRGQVEQANQELRELIQNV 1522 Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQ-AESVASALQGVNTV-VELGVKT 468 D L +G P+ V + + SP + Q+ A +A ++ + V L Sbjct: 1523 KDFLSQEGADPDSIEMVATRVLDISI-PASP-EQIQRLASEIAERVRSLADVDTILAHTM 1580 Query: 469 GDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQ 528 GD R A R AE E Q+ + E+ + Q Sbjct: 1581 GD------------VRR-AEQLLQDAQRARSRAEGER---QKAETVQAALEEAQRAQGAA 1624 Query: 529 TSQDIGAKAAGRAMEKKLTHDMMENS 554 GA + E+ L + Sbjct: 1625 QGAIRGAVVDTKNTEQTLQQVQERMA 1650 >gi|6981142|ref|NP_037106.1| laminin subunit beta-2 precursor [Rattus norvegicus] gi|126371|sp|P15800|LAMB2_RAT RecName: Full=Laminin subunit beta-2; AltName: Full=Laminin chain B3; AltName: Full=Laminin-11 subunit beta; AltName: Full=Laminin-14 subunit beta; AltName: Full=Laminin-15 subunit beta; AltName: Full=Laminin-3 subunit beta; AltName: Full=Laminin-4 subunit beta; AltName: Full=Laminin-7 subunit beta; AltName: Full=Laminin-9 subunit beta; AltName: Full=S-laminin subunit beta; Short=S-LAM beta; Flags: Precursor gi|57251|emb|CAA34561.1| precursor (AA -35 to 1766) [Rattus norvegicus] Length = 1801 Score = 46.7 bits (109), Expect = 0.010, Method: Composition-based stats. 
Identities = 32/206 (15%), Positives = 63/206 (30%), Gaps = 23/206 (11%) Query: 354 KESIRSLFLLDLFQVLD-DKASRSAAESMEKTREKGAFVGPLIGGLQSEF--IGAMISRE 410 + ++ + + + R A E+ ++ + G ++ + +I Sbjct: 1463 QAELQRALVEGGGILSRVSETRRQAEEAQQRAQAALDKANASRGQVEQANQELRELIQNV 1522 Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQ-AESVASALQGVNTV-VELGVKT 468 D L +G P+ V + + SP + Q+ A +A ++ + V L Sbjct: 1523 KDFLSQEGADPDSIEMVATRVLDISI-PASP-EQIQRLASEIAERVRSLADVDTILAHTM 1580 Query: 469 GDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQ 528 GD R A R AE E Q+ + E+ + Q Sbjct: 1581 GD------------VRR-AEQLLQDAQRARSRAEGER---QKAETVQAALEEAQRAQGAA 1624 Query: 529 TSQDIGAKAAGRAMEKKLTHDMMENS 554 GA + E+ L + Sbjct: 1625 QGAIRGAVVDTKNTEQTLQQVQERMA 1650 >gi|226290|prf||1505373A laminin-like adhesive protein Length = 1801 Score = 46.7 bits (109), Expect = 0.010, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 63/206 (30%), Gaps = 23/206 (11%) Query: 354 KESIRSLFLLDLFQVLD-DKASRSAAESMEKTREKGAFVGPLIGGLQSEF--IGAMISRE 410 + ++ + + + R A E+ ++ + G ++ + +I Sbjct: 1463 QAELQRALVEGGGILSRVSETRRQAEEAQQRAQAALDKANASRGQVEQANQELRELIQNV 1522 Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQ-AESVASALQGVNTV-VELGVKT 468 D L +G P+ V + + SP + Q+ A +A ++ + V L Sbjct: 1523 KDFLSQEGADPDSIEMVATRVLDISI-PASP-EQIQRLASEIAERVRSLADVDTILAHTM 1580 Query: 469 GDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQ 528 GD R A R AE E Q+ + E+ + Q Sbjct: 1581 GD------------VRR-AEQLLQDAQRARSRAEGER---QKAETVQAALEEAQRAQGAA 1624 Query: 529 TSQDIGAKAAGRAMEKKLTHDMMENS 554 GA + E+ L + Sbjct: 1625 QGAIRGAVVDTKNTEQTLQQVQERMA 1650 >gi|218778476|ref|YP_002429794.1| hypothetical protein Dalk_0621 [Desulfatibacillum alkenivorans AK-01] gi|218759860|gb|ACL02326.1| protein of unknown function DUF323 [Desulfatibacillum alkenivorans AK-01] Length = 918 Score = 46.3 bits (108), Expect = 0.013, Method: Composition-based stats. 
Identities = 21/147 (14%), Positives = 45/147 (30%), Gaps = 15/147 (10%) Query: 407 ISRELDILDSQGNLPEC----EGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 I+ +L L +G P + + + L + S+++ Q + Sbjct: 82 INSDLKRLYKEGKNPSGVIIGPENNFIMSDEAREALLATLAQTAGNGSLSALDQLAQMMN 141 Query: 463 ELGVKTGDPSCMDHMDTD-------RVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRR 515 L D +D + D R+ R + + + R+ + Sbjct: 142 TLKQILSDEDIIDSNNPDDALSQIHRLLRGISEKLGIDQEV----EDAREGVAVRQAEEG 197 Query: 516 VMEEQHLQQQLQQTSQDIGAKAAGRAM 542 E + + A+AAG+A+ Sbjct: 198 EDAELIASPEADGAGKGGDAEAAGKAL 224 >gi|307545235|ref|YP_003897714.1| Haemophilus-specific protein, uncharacterized [Halomonas elongata DSM 2581] gi|307217259|emb|CBV42529.1| Haemophilus-specific protein, uncharacterized [Halomonas elongata DSM 2581] Length = 749 Score = 46.3 bits (108), Expect = 0.013, Method: Composition-based stats. Identities = 30/325 (9%), Positives = 71/325 (21%), Gaps = 51/325 (15%) Query: 247 FEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIA 306 + PY ++ +G+ N T L +S P Sbjct: 377 INRDPLERRPYHKSSFQPVPGSFWGQGIPELMADVQDVCNATARGLVNNLAISSGPQVEV 436 Query: 307 VSEAKQ---RNFDLKPGYM--NIGALSREGRSLFQPVQFGNPLP-YHEELNRLKESIRSL 360 + Q D+ P + ++ + Q + + + Sbjct: 437 YEDRLQPQEDPTDIYPWKIWRTKASIETGNNPALRFFQPQSNASELLAVYEQFEYRADES 496 Query: 361 FLLDLFQ---VLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQ 417 + + A ++A+ + I + + +I Sbjct: 497 TNIPRYMYGSDEAGGAGQTASGLSMLMESANKGIKDAIRHIDRGVLRRVIEALWLHNMQF 556 Query: 418 GNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHM 477 + + + +S + +Q Q ++L D + H Sbjct: 557 -----SDDNSIKGDASVVARGSSAMLIREQ------TNQLRQQFLQLTANDYDMGILGHD 605 Query: 478 DTDRVSRFSLWATNTPAVLIRDTAEVED------------------------------IR 507 ++ + P LI E++ R Sbjct: 606 GRRKLLESIAEKLDLPG-LIPSEEEMQKNLAQQRQDQQAQLQMEQAKAEAEAAEKQARAR 664 Query: 508 QQREVQRRVMEEQHLQQQLQQTSQD 532 + + E QQ+ Sbjct: 665 EANADAAQTEAETQQSQQMAPLEAQ 689 >gi|83646950|ref|YP_435385.1| chaperone activity ATPase ATP-binding subunit [Hahella chejuensis KCTC 2396] 
gi|83634993|gb|ABC30960.1| ATPase with chaperone activity, ATP-binding subunit [Hahella chejuensis KCTC 2396] Length = 919 Score = 45.6 bits (106), Expect = 0.021, Method: Composition-based stats. Identities = 34/208 (16%), Positives = 63/208 (30%), Gaps = 13/208 (6%) Query: 298 LSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQ---PVQFGNPLPYHEELNRLK 354 L++H +A + L Y+ L +G SL + + + Sbjct: 380 LAIHHNVRISDDAIIQAVKLSARYIPGRQLPDKGVSLLDTACARVSLSQSATPSLIEDTR 439 Query: 355 ESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDIL 414 I+ + ++ +S E++E E+ A + + Q+E IL Sbjct: 440 RRIQQIDTNLDLISQENISSGEYHETLELLTEEKAVLEASLAA-QTEQWEKEKDLIAKIL 498 Query: 415 DSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGD-PSC 473 + + L + A P + E ++ N L GD P Sbjct: 499 EVRTKLEQDYQAKKGPEDAGD--------RLSDEEVAELQVEFKNLFAALASAQGDQPLM 550 Query: 474 MDHMDTDRVSRFSLWATNTPAVLIRDTA 501 M H+D V+ T P + Sbjct: 551 MPHVDGQAVAEVVANWTGIPVGKMVSDE 578 >gi|281357154|ref|ZP_06243643.1| hypothetical protein Vvad_PD2246 [Victivallis vadensis ATCC BAA-548] gi|281316185|gb|EFB00210.1| hypothetical protein Vvad_PD2246 [Victivallis vadensis ATCC BAA-548] Length = 752 Score = 45.6 bits (106), Expect = 0.023, Method: Composition-based stats. 
Identities = 35/304 (11%), Positives = 82/304 (26%), Gaps = 39/304 (12%) Query: 262 YRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAK----QRNFDL 317 YR D I+G A R +N + L+ P I ++A + Sbjct: 426 YRANIDSIWGEGIADLLHHVQRSVNSLMRSRNNNLALAGAPQVIINTDAVRLKPGEPLQI 485 Query: 318 KPGYMNIGALSR--EGRSLFQPVQFGNPLP-YHEELNRLKESIRSLFLLDLFQV-----L 369 P + S + F+ +Q + EL + + + + Sbjct: 486 TPFKQWFVSGSGYYGAQKPFELMQIPDVSDSLSRELEKELVFADRISGIPEYSQGVSKGA 545 Query: 370 DDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNP 429 ++ A+ +A+ + I + +I + + N Sbjct: 546 ENGAAGTASGLSMLLDAASNQIKDPINNIDEGLYEPLIRDLYY-----DKIND-PEVPNS 599 Query: 430 PVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVS---RFS 486 K+ + + +S + + V++ P + + + R Sbjct: 600 AKGDFKIHARGAIGLAFKEQSQIRRREFFSLVLQ------SPLLQQILKPEGIVALTREV 653 Query: 487 LWATNTPAVLIRDTAE------------VEDIRQQREVQRRVMEEQHLQQQLQQTSQDIG 534 + + P I + + Q + + +++ Q Q S Sbjct: 654 VRTLDMPVNDIVTSETEFAAQQQQLQLQQQAAAAQSDELQAAIQQIDEQLAAGQISPQEA 713 Query: 535 AKAA 538 +A Sbjct: 714 DRAK 717 >gi|31982223|ref|NP_032509.2| laminin subunit beta-2 precursor [Mus musculus] gi|19913504|gb|AAH26051.1| Laminin, beta 2 [Mus musculus] gi|148689344|gb|EDL21291.1| laminin, beta 2, isoform CRA_a [Mus musculus] gi|148689345|gb|EDL21292.1| laminin, beta 2, isoform CRA_a [Mus musculus] Length = 1799 Score = 45.6 bits (106), Expect = 0.024, Method: Composition-based stats. 
Identities = 32/206 (15%), Positives = 63/206 (30%), Gaps = 23/206 (11%) Query: 354 KESIRSLFLLDLFQVLD-DKASRSAAESMEKTREKGAFVGPLIGGLQSEF--IGAMISRE 410 + ++ + + + R A E+ ++ + G ++ + +I Sbjct: 1461 QAELQRALVEGGGILSRVSETRRQAEEAQQRAQAALDKANASRGQVEQANQELRELIQNV 1520 Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQ-AESVASALQGVNTV-VELGVKT 468 D L +G P+ V + + SP + Q+ A +A ++ + V L Sbjct: 1521 KDFLSQEGADPDSIEMVATRVLDISI-PASP-EQIQRLASEIAERVRSLADVDTILAHTM 1578 Query: 469 GDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQ 528 GD R A R AE E Q+ + E+ + Q Sbjct: 1579 GD------------VRR-AEQLLQDAHRARSRAEGER---QKAETVQAALEEAQRAQGAA 1622 Query: 529 TSQDIGAKAAGRAMEKKLTHDMMENS 554 GA + E+ L + Sbjct: 1623 QGAIWGAVVDTQNTEQTLQRVQERMA 1648 >gi|291618425|ref|YP_003521167.1| Hypothetical Protein PANA_2872 [Pantoea ananatis LMG 20103] gi|291153455|gb|ADD78039.1| Hypothetical Protein PANA_2872 [Pantoea ananatis LMG 20103] gi|327394819|dbj|BAK12241.1| hypothetical protein PAJ_2161 [Pantoea ananatis AJ13355] Length = 353 Score = 44.8 bits (104), Expect = 0.039, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 54/187 (28%), Gaps = 21/187 (11%) Query: 362 LLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLP 421 L FQ + S +A + +EK + +Q E L G Sbjct: 150 LGTGFQAVGSGISAAAPSVTQMAKEKLQQNNINLDNMQQE--------LETTLRQTGKPE 201 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAES-VASALQGVNTVVELGVKTGDPSCMDHMDTD 480 + + Q AE+ + T H DT Sbjct: 202 LQPENLKQDANN----------EAQNAENQANNTANHPQTADTDLANWFKGVIARHSDTL 251 Query: 481 RVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEE--QHLQQQLQQTSQDIGAKAA 538 + + A + E E I Q E + + Q L++Q +Q +++ G +AA Sbjct: 252 QAADRDALKNIIKARTGKSDQEAEQIVNQAEQSYQQAMQKYQELKKQAEQKAREAGEQAA 311 Query: 539 GRAMEKK 545 + Sbjct: 312 KATAKAS 318 >gi|301770389|ref|XP_002920595.1| PREDICTED: laminin subunit beta-2-like [Ailuropoda melanoleuca] Length = 1797 Score = 44.0 bits (102), Expect = 0.058, Method: Composition-based stats. 
Identities = 45/236 (19%), Positives = 73/236 (30%), Gaps = 29/236 (12%) Query: 325 GALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKT 384 G L G + G EL R+L + R A E+ ++ Sbjct: 1437 GGLGCSGVVAMADLALGRARHTQAELQ------RALAEGGGILSHVAETRRQAGEAQQRA 1490 Query: 385 REKGAFVGPLIGGLQ--SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPL 442 R G ++ ++ + +I D L +G P+ V L + SP Sbjct: 1491 RAALDKANASRGQVEKANQELRELIQSVKDFLSQEGADPDSIEMVATRVLELSI-PASP- 1548 Query: 443 FKYQQ-AESVASALQGVNTV-VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDT 500 + Q A +A ++ + V L GD R A R Sbjct: 1549 EQIQHLAGEIAERVRSLADVDTILARTVGD------------VRR-AEQLLQDARRARSR 1595 Query: 501 AEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMENSYG 556 AE E +Q+ + E+ + Q GA + E+ L H + E G Sbjct: 1596 AEGE---KQKAETVQAALEEAQRAQGAAQGAIQGAVVDTQDTERTL-HQVQEKMAG 1647 >gi|281338355|gb|EFB13939.1| hypothetical protein PANDA_009358 [Ailuropoda melanoleuca] Length = 1805 Score = 44.0 bits (102), Expect = 0.058, Method: Composition-based stats. Identities = 45/236 (19%), Positives = 73/236 (30%), Gaps = 29/236 (12%) Query: 325 GALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKT 384 G L G + G EL R+L + R A E+ ++ Sbjct: 1445 GGLGCSGVVAMADLALGRARHTQAELQ------RALAEGGGILSHVAETRRQAGEAQQRA 1498 Query: 385 REKGAFVGPLIGGLQ--SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPL 442 R G ++ ++ + +I D L +G P+ V L + SP Sbjct: 1499 RAALDKANASRGQVEKANQELRELIQSVKDFLSQEGADPDSIEMVATRVLELSI-PASP- 1556 Query: 443 FKYQQ-AESVASALQGVNTV-VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDT 500 + Q A +A ++ + V L GD R A R Sbjct: 1557 EQIQHLAGEIAERVRSLADVDTILARTVGD------------VRR-AEQLLQDARRARSR 1603 Query: 501 AEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMENSYG 556 AE E +Q+ + E+ + Q GA + E+ L H + E G Sbjct: 1604 AEGE---KQKAETVQAALEEAQRAQGAAQGAIQGAVVDTQDTERTL-HQVQEKMAG 1655 >gi|282598927|ref|YP_003358477.1| N4 gp59-like protein [Pseudomonas phage LIT1] gi|259048687|emb|CAZ66336.1| N4 gp59-like protein [Pseudomonas phage LIT1] Length = 726 Score = 44.0 bits (102), Expect = 0.072, 
Method: Composition-based stats. Identities = 42/421 (9%), Positives = 115/421 (27%), Gaps = 19/421 (4%) Query: 142 DVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKS 201 D +++ + Y + ++ + ++S+ S +++ Sbjct: 253 DPSCGSDFSKAKFLIETFESSYAELKADGRYQNL-DKIQVEGQNLLSEPDYTGPSEGVRN 311 Query: 202 ALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGR 261 ++++ + ++H + + + +V PY+V Sbjct: 312 FDFQDKSRKRLVVHEYWG-YYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVVVN 370 Query: 262 YRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEA--KQRNFDLKP 319 Y R ++YG S + R + + S + + A Sbjct: 371 YIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRFDR 430 Query: 320 G---YMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRS 376 G N GA R + + Y L + + + + + Sbjct: 431 GENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAALGDT 490 Query: 377 AAESMEKTREKGAFVGPLIGGLQSEFIG----------AMISRELDILDSQGNLPECEGA 426 A ++ L + I + + + + + Sbjct: 491 ATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDIRRD 550 Query: 427 DNPPVSLLKVEYTSPLFKYQQAESVASALQGVN-TVVELGVKTGDPSCMDHMDTDRVSRF 485 D LK++ ++ + + LQ + + + + M+ ++ Sbjct: 551 DLAGNFDLKLDISTAEEDNAKVNDLTFMLQTMGPNMDPMMAQQIMGQIMELKKMPDFAKR 610 Query: 486 SLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGA-KAAGRAMEK 544 P + + A++E + Q +++ H +G +A RA+ Sbjct: 611 IREFQPQPDPIAQQKAQLELMLLQAQIEAERARAAHYMSGAGLQDSKVGTEQAKARALAS 670 Query: 545 K 545 + Sbjct: 671 Q 671 >gi|73985821|ref|XP_533831.2| PREDICTED: similar to Laminin beta-2 chain precursor (S-laminin) (Laminin B1s chain) [Canis familiaris] Length = 1801 Score = 43.7 bits (101), Expect = 0.082, Method: Composition-based stats. 
Identities = 42/285 (14%), Positives = 82/285 (28%), Gaps = 31/285 (10%) Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 R+ L + + A++E P + + QP G Sbjct: 1384 RKHKANQQALGKLSARTHSLSLTAINELVCGPPGDAPCATSPCGGAGCLDEDGQPRCGGL 1443 Query: 343 PLPYHEELNRL--------KESIRSLF-LLDLFQVLDDKASRSAAESMEKTREKGAFVGP 393 + L + ++ + R A E+ ++ + Sbjct: 1444 GCNGAVAMADLALGRARHTQAELQRALAEGGGILSQVAETRRQAGEAQQRAQAALDKANA 1503 Query: 394 LIGGLQ--SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQ-AES 450 G ++ ++ + +I D L +G P+ V L + SP + Q A + Sbjct: 1504 SRGQVEKANQELRELIQSVKDFLSQEGADPDSIEMVATRVLELSI-PASP-EQIQHLAGA 1561 Query: 451 VASALQGVNTV-VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQ 509 +A ++ + V L GD R A R AE E +Q Sbjct: 1562 IAERVRSLADVDTILARTVGD------------VRR-AEQLLQDARRARSRAEGE---KQ 1605 Query: 510 REVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMENS 554 + + E+ + Q GA + E+ L + + Sbjct: 1606 KAETVQAALEEAQRAQGAAQGAIQGAVVDTQDTERTLHQVQAKMA 1650 >gi|282599474|ref|YP_003358364.1| N4 gp59-like protein [Pseudomonas phage LUZ7] gi|259048573|emb|CAZ66223.1| N4 gp59-like protein [Pseudomonas phage LUZ7] Length = 720 Score = 42.9 bits (99), Expect = 0.16, Method: Composition-based stats. 
Identities = 38/429 (8%), Positives = 105/429 (24%), Gaps = 30/429 (6%) Query: 142 DVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKS 201 D G +++ + Y + ++ + I+S+ S +++ Sbjct: 247 DPSCNGDMNKAKFVVESFESSYAELKADGRYSNL-EKINEQNSDILSQPDYATGSESVRN 305 Query: 202 ALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIVGR 261 + + + ++H + + + + V PY+V Sbjct: 306 FDFADRSRKRLVVHEYWG-YYDIHGDGELHSIVATWVGQVLIRLELNPFPDGKIPYVVAA 364 Query: 262 YRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPP--TIAVSEAKQRNFDLKP 319 Y D +YG S + + + + S + + + Sbjct: 365 YLPVKDSVYGDSDGSLLIDNQKIVGAISRGMIDIMAQSANGQVGFQKGALDITNRRRYER 424 Query: 320 G---YMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRS 376 G N G + Y +L+ + + + Sbjct: 425 GETYEFNPGNNPATAIYTHTFQEIPRSAEYMLNQQQLEAESMTGVKAFNTGISGQALGDT 484 Query: 377 AAESMEKTREKGAFVGPLIGGLQS---EFIGAMISRELDILDSQGNLPECEGADNPPVSL 433 A ++ L E +I+ + LD + + Sbjct: 485 ATGIRGALDAASKRELGILRRLSDCLIEVGRRVIAMNAEFLDDEEVIRITNEGFVTVRRD 544 Query: 434 LKVEY-----TSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHM----DTDRVSR 484 + S + VA + T+ + + + ++ Sbjct: 545 -DLAGEFDLRLSISTAEEDNAKVADLSFMLQTMGPNLEWGMNQLILSEIAELKKMPDLAH 603 Query: 485 FSLWATNTPAVL--IRDTAEVEDIRQQREVQRRVMEEQHL--------QQQLQQTSQDIG 534 P + + E+ + Q + ++ Q ++ +G Sbjct: 604 RIRKYQPEPDPIAQRKAELEIALLEAQVQETLAKAQQAASTGYLNTSKAGTEGQKARALG 663 Query: 535 AKAAGRAME 543 ++A ++ Sbjct: 664 SQADLADLD 672 >gi|119943823|ref|YP_941503.1| pentapeptide repeat-containing protein [Psychromonas ingrahamii 37] gi|119862427|gb|ABM01904.1| pentapeptide repeat protein [Psychromonas ingrahamii 37] Length = 976 Score = 42.5 bits (98), Expect = 0.17, Method: Composition-based stats. 
Identities = 18/202 (8%), Positives = 64/202 (31%), Gaps = 7/202 (3%) Query: 330 EGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGA 389 Q + + + ++ + + ++ + + A E ++K + Sbjct: 389 GDEEYKQVMGDNLSAFAEGKKQQAEQEMDEAIDKQVAELRANGMDKQADELLDKIKNPPQ 448 Query: 390 FVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKV-EYTSPLFKYQQA 448 + + + + I + + L + + ++ +V + + + Q+ Sbjct: 449 DIELPEDAKKLQALTDKILPGISAMKEAPKLDDLDLTKLNLKAMDEVQAHMEAMAEKQKK 508 Query: 449 ESVASALQGVNTVVELGVKTGDPSCMDHMDT--DRVSRFSLWATNTPAVLIRDTAEVEDI 506 E++ Q ++ + + P + +D ++ P ++ VE Sbjct: 509 EALLKVEQQLDELKQ--QAAQQPEMAEQLDPSIKQLEEMLASIDAIP--VLTRPDTVEQD 564 Query: 507 RQQREVQRRVMEEQHLQQQLQQ 528 Q + E+ Q+++ Sbjct: 565 TQLSAQLAQAAEQLTEQKKMMA 586 >gi|152982158|ref|YP_001354469.1| hypothetical protein mma_2779 [Janthinobacterium sp. Marseille] gi|151282235|gb|ABR90645.1| Uncharacterized conserved protein (possible phage related tail length tape measure protein) [Janthinobacterium sp. Marseille] Length = 901 Score = 42.1 bits (97), Expect = 0.25, Method: Composition-based stats. 
Identities = 36/274 (13%), Positives = 71/274 (25%), Gaps = 26/274 (9%) Query: 301 HPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQF-GNPLPYHEELNRLKESIRS 359 P A E QR KP + + ++ Q + L R + ++ + Sbjct: 339 APKIQADPELLQRL--TKPKAVKPAQDTTGAQTTLMKAQLDAEFALLKDGLTRQQTALDA 396 Query: 360 LFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGN 419 L V D ++A E E E L Q G + L Sbjct: 397 ALEDRLVSVRDYYTQKTAIEQREVDAEIARKQQELARSQQVATTGKSENDRLR--AKAEV 454 Query: 420 LPECEGADNPPVSLLKVEYTSPLFKYQQAESVA-SALQGVNTVVELGVKTGDPSCMDHM- 477 +E + Q +A + Q + ++ D + Sbjct: 455 AKSEADLITLNNRRTDIEQANARKAAQAERELADALAQAREELAQITGTATDADRQAAIE 514 Query: 478 ----------------DTDRVSRFSLWATNTPAVLIRDTAE---VEDIRQQREVQRRVME 518 D + + A L A+ V + + + + + Sbjct: 515 RSYRDLRARLAAESDTDGVSLVDRLIDVKAAQANLAALEAQWRQVTERLRNAQEAIQTQQ 574 Query: 519 EQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMME 552 + L + Q Q + + ++L M + Sbjct: 575 QAGLLTEAQARQQIVALQQQSATEMERLLPTMQQ 608 >gi|238793398|ref|ZP_04637024.1| Uncharacterized mscS family protein [Yersinia intermedia ATCC 29909] gi|238727367|gb|EEQ18895.1| Uncharacterized mscS family protein [Yersinia intermedia ATCC 29909] Length = 1121 Score = 42.1 bits (97), Expect = 0.26, Method: Composition-based stats. 
Identities = 27/228 (11%), Positives = 70/228 (30%), Gaps = 31/228 (13%) Query: 332 RSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESM----EKTREK 387 + +P+ + + E ++ + L L + +R +ES+ ++ E Sbjct: 103 QEGDKPLPVPSNMSTSELEQQVLQISSQLLELSRLSQQEQDRAREISESLSQLPQQQSEA 162 Query: 388 GAFVGPLIGGLQSEFI--GAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKY 445 + + LQ++ + + +L + A+ +S L L + Sbjct: 163 RRILAEISARLQAQSNPANPVAQAQFALLQ-AEAVARKAKANELELSQLSANNRQELSRL 221 Query: 446 Q------QAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATN-TPAVLIR 498 + + E V + LQ + + + + + P +I+ Sbjct: 222 RAELYKKRQERVDAQLQSLRNNLNNQRQQAAEQAL------ERTELLAEQGGDLPESIIK 275 Query: 499 ---DTAEVEDIRQQREV--------QRRVMEEQHLQQQLQQTSQDIGA 535 E+ Q+ QR+ + + +Q T ++ Sbjct: 276 ELQTNRELSQALNQQAQRIDLISSQQRQAVAQTQQVRQALSTIREQAQ 323 >gi|254729487|ref|YP_003084169.1| hypothetical protein PSS2_gp025 [Cyanophage PSS2] gi|254211639|gb|ACT65587.1| hypothetical protein [Cyanophage PSS2] gi|265524837|gb|ACY75729.1| predicted protein [Cyanophage PSS2] Length = 518 Score = 42.1 bits (97), Expect = 0.26, Method: Composition-based stats. 
Identities = 54/519 (10%), Positives = 142/519 (27%), Gaps = 82/519 (15%) Query: 20 RGELNYWMEELTGFLYPYKNNAQLRMWDTTGSEACIKLSSLLSSLITPPGQKWHGLAESF 79 R E +E PYK ++ + +A + + Q + E Sbjct: 48 RAEYLP--QEPGERDTPYKQRLGRSIYPSFYRDAIRAFA-----GLLSNYQ----IHEMP 96 Query: 80 SAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYM 139 ++ + D R + ++ + + + + + G Sbjct: 97 ASMEDADDNVDRRGSSLNKFLNSLDQLV---------------------LRDGGAAVLV- 134 Query: 140 EADVDEKGLEEGIRYISVPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKM 199 E EEG + + + + + G +V++ + Sbjct: 135 -EMPPETLDEEGNSLETSAMEEIEAARAP---WLVPIERQNLINWRTKVVDGREVVTMAV 190 Query: 200 KSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYIV 259 + ++ + V K + ++ E + + T P + Sbjct: 191 IRTIEERQDPK-NAFGTVLEPIYLLLTPGAWQKIRLVRGATMKWEMVVEAEGVTTLPVVP 249 Query: 260 GRYRVRADEIY-GRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLK 318 + + G S + L + + T+ L P ++Q Sbjct: 250 LVWYGATGSQFAGGSLPLSGLADLSIQHFTLRSDLVELIHRLALPVPVRKGSQQLPDGSY 309 Query: 319 P----GYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKAS 374 P G + L G F + + + E+ ++ + L ++ + Sbjct: 310 PPMVLGPNSGMDLPENGDFKFAELSGSSLAQHQVEVEHVEALMDRSSLSFMYGSTGNG-- 367 Query: 375 RSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLL 434 R+A E++ + + + V LI Q+ + + L + + ++ Sbjct: 368 RTATEAVLQGSQVASQVRTLIENKQA------MFGLIMKLWTTYMAEDLSEEAGLDIND- 420 Query: 435 KVEYTSPLFKYQQAESVASALQG-----VNTVVELGVKTGDPSCMDHMDTDRVSRFSLWA 489 + + + +A+ V + L ++ LG + +D + Sbjct: 421 -----NLIARPLEAQEVQAYLALFGGDLLSHETTLGELQKGQALSQDIDLE--------- 466 Query: 490 TNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQ 528 E+ + +R+ + E + + Sbjct: 467 -----------EEIARVTDERKARAEEAMEMMQETGGED 494 >gi|302527178|ref|ZP_07279520.1| von Willebrand factor [Streptomyces sp. AA4] gi|302436073|gb|EFL07889.1| von Willebrand factor [Streptomyces sp. AA4] Length = 652 Score = 41.7 bits (96), Expect = 0.31, Method: Composition-based stats. 
Identities = 27/212 (12%), Positives = 54/212 (25%), Gaps = 37/212 (17%) Query: 341 GNPLPYHEELNRLKESIRSLFLLDLFQ-------VLDDKASRSAAESME------KTREK 387 G + L ++ R D LD +AA E ++ E Sbjct: 83 GTLQEVQQLLQEALQAERRELFPDPDDEARFREAQLDALPPGTAAAVRELNEYDWRSDEA 142 Query: 388 GAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPL--FKY 445 + L E + A + + G +L + L Sbjct: 143 RQKYEQIRDLLGREMLDARFQGMKQAMQNAG-----PEDVERINQMLG--DLNALLSAHA 195 Query: 446 QQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVED 505 Q A + + + G + + D + + ++ +E Sbjct: 196 QGASDID--ERFSEFMRRHGEFFPENP----QNVDELIDVLAARSAAAQRMLNSMSE--- 246 Query: 506 IRQQREVQRRVMEEQ----HLQQQLQQTSQDI 533 +QR + ++ L QQL + Sbjct: 247 --EQRAELAELAQQAFGDPRLAQQLSALDSQL 276 >gi|221504668|gb|EEE30341.1| ATP-dependent RNA helicase, putative [Toxoplasma gondii VEG] Length = 522 Score = 41.7 bits (96), Expect = 0.31, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 54/190 (28%), Gaps = 21/190 (11%) Query: 309 EAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQV 368 A+ + F L+ G + + + + L+ I F + + Sbjct: 216 SAETQAFQLRRGAEIVIGTPGRVKDCLEKAYTVLNQCNYVVLDEADRMIDMGFEEIVNFI 275 Query: 369 LDDKAS---RSAAESMEKTREKGAFVGPLIGGLQ---SEFIGAMISREL-DILDSQGNLP 421 LD + +S E++ +E A G + L S + + R L + Sbjct: 276 LDQIPTSNLKSNDEALILQQEMQAKAGHRLYRLTQMFSATMPPAVERLARKYLRQPSYIS 335 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 + +VE+ K Q+ + V + P M ++ + Sbjct: 336 IGDPGAGKRAIEQRVEFVPEARKKQRLQDV------LENAT--------PPVMVFVNQKK 381 Query: 482 VSRFSLWATN 491 + Sbjct: 382 SADALAKVLG 391 >gi|302035504|ref|YP_003795826.1| putative phage tail length tape measure protein [Candidatus Nitrospira defluvii] gi|300603568|emb|CBK39898.1| putative Phage tail length tape measure protein [Candidatus Nitrospira defluvii] Length = 901 Score = 41.7 bits (96), Expect = 0.32, Method: Composition-based stats. 
Identities = 36/274 (13%), Positives = 71/274 (25%), Gaps = 26/274 (9%) Query: 301 HPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQF-GNPLPYHEELNRLKESIRS 359 P A E QR KP + + ++ Q + L R + ++ + Sbjct: 339 APKIQADPELLQRL--TKPKAVKPAQDTTGAQTTLMKAQLDAEFALLKDGLARQQTALDA 396 Query: 360 LFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGN 419 L V D ++A E E E L Q G + L Sbjct: 397 ALEDRLVSVRDYYTQKTALEQREVDAEIARKQQELARSQQVVTTGKSENDRLKAKAEVAK 456 Query: 420 LPECEGADNPPVSLLKVEYTSPLFKYQQAESVA-SALQGVNTVVELGVKTGDPSCMDHM- 477 +E + Q +A + Q + ++ D + Sbjct: 457 --AEADLITLNNRRTDIEQANARKAAQAERELADALAQAREELAQITGTATDTDRQAAIE 514 Query: 478 ----------------DTDRVSRFSLWATNTPAVLIRDTAE---VEDIRQQREVQRRVME 518 D + + A L A+ V + + + + + Sbjct: 515 RSYRDLRARLAAESDADGVSLIDRLINVKAAQANLAALEAQWRQVTERLRNAQEAIQTQQ 574 Query: 519 EQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMME 552 + L + Q Q + + ++L M + Sbjct: 575 QAGLLTEAQARQQIVALQQQSATEMERLLPTMQQ 608 >gi|71281799|ref|YP_269191.1| sensor histidine kinase/response regulator [Colwellia psychrerythraea 34H] gi|71147539|gb|AAZ28012.1| sensor histidine kinase/response regulator [Colwellia psychrerythraea 34H] Length = 784 Score = 41.7 bits (96), Expect = 0.34, Method: Composition-based stats. 
Identities = 17/211 (8%), Positives = 57/211 (27%), Gaps = 11/211 (5%) Query: 346 YHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGA 405 ++ + + S E + + + + + + Sbjct: 110 IRLPEQVVEGKVMKSSFGLSIPNKRVGNNDSQNEQSNRRQLGTLIIATNLATIHARLWQT 169 Query: 406 MISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELG 465 + L+ + + +E + K + + L Sbjct: 170 GFNILLNQTLLVVLIMLVIMFILQRLITRHLESMAGYSKAIGDGDLEAPLTL-----SRR 224 Query: 466 VKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQ 525 ++ R ++ + R E + +R R+ ++++E + + Q Sbjct: 225 QPNFPDELNQLVNALNDMRLAIRH-----DINRREEEKQALRYNRDQLQQMVERRTMSLQ 279 Query: 526 LQQTSQDIGAKAAGRAMEKKLTHDMMENSYG 556 + + KA + + ++H++ G Sbjct: 280 QAKEIAEEANKAKSQFL-ATMSHEIRTPMNG 309 >gi|238750060|ref|ZP_04611563.1| Uncharacterized mscS family protein [Yersinia rohdei ATCC 43380] gi|238711604|gb|EEQ03819.1| Uncharacterized mscS family protein [Yersinia rohdei ATCC 43380] Length = 1113 Score = 41.3 bits (95), Expect = 0.38, Method: Composition-based stats. Identities = 23/222 (10%), Positives = 65/222 (29%), Gaps = 19/222 (8%) Query: 332 RSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESM----EKTREK 387 + + +P+ + + E ++ + L L+ + +R +ES+ ++ E Sbjct: 97 QEVDKPLPVPSNMSTSELEQQVLQISSQLLELNRLSQQEQDRAREISESLSQLPQQQSEA 156 Query: 388 GAFVGPLIGGLQSEF--IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKY 445 + + LQ++ + + +L + A+ +S L L + Sbjct: 157 RRILAEIGSRLQAQSSPTNPVTQAQFALLQ-AEAVARKAKANELELSQLSANNRQELSRL 215 Query: 446 QQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATN-----TPAVLIRDT 500 + + + L + + L + Sbjct: 216 RAELYKKRQERVDAQLQTLRNNLNNQRQQAAEKALERTELLAEQGGDLPESITQQLQINR 275 Query: 501 AEVEDIRQQRE-------VQRRVMEEQHLQQQLQQTSQDIGA 535 + + QQ + QR+ + + +Q T ++ Sbjct: 276 ELSQALNQQAQRIDLISSQQRQAVAQTQQVRQALSTIREQAQ 317 >gi|21436526|emb|CAD29630.1| putative chitin binding protein [Anopheles gambiae] Length = 567 Score = 41.3 bits (95), Expect = 0.41, Method: Composition-based stats. 
Identities = 33/253 (13%), Positives = 66/253 (26%), Gaps = 35/253 (13%) Query: 294 QFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRL 353 + S P ++ + + P + + +PV+F Sbjct: 91 KGVPSSASPVYMSPASSLMTKATSLPLGVPPFRPIPKPTPEAEPVRFDP----------- 139 Query: 354 KESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDI 413 +R F L Q D +S + G + I Sbjct: 140 -SVLRRNFALKTAQTPDPS-----FQSQLMNQTSSFHRGGAAIRTAPASPFPSAPNQQII 193 Query: 414 LDSQG-NLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDP- 471 Q + + P S+ ++ T P+ + ++ + ++ TG P Sbjct: 194 YKEQNLQVQKVPAFQAMPESVSRIS-TGPVVQVDNKLQPSAIKNSIMSIPPRRQMTGKPG 252 Query: 472 -----------SCMDHMD-TDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEE 519 + +D A N +I+ E +RQQ ++ E+ Sbjct: 253 PTIATGSATTGDAAEEIDLMGHTVEELAAAANVSVEVIK---EAIRVRQQELRAQKQYEK 309 Query: 520 QHLQQQLQQTSQD 532 Q Q Sbjct: 310 QQAAFAQTQFLAQ 322 >gi|156046663|ref|XP_001589710.1| hypothetical protein SS1G_09432 [Sclerotinia sclerotiorum 1980] gi|154693827|gb|EDN93565.1| hypothetical protein SS1G_09432 [Sclerotinia sclerotiorum 1980 UF-70] Length = 1631 Score = 41.0 bits (94), Expect = 0.51, Method: Composition-based stats. 
Identities = 23/228 (10%), Positives = 61/228 (26%), Gaps = 11/228 (4%) Query: 320 GYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLF--QVLDDKASRSA 377 G I + Q G+ P L L++ + + + +++ + Sbjct: 1257 GERGISPVGASRNRGLSSPQSGSNTPDMARLRELEQQLAASMHAHQEIKEAFENREQEAE 1316 Query: 378 AESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVE 437 + EK + + + M+ R D L Sbjct: 1317 SAYREKLSQLENDYQSAV--HYVKGTEKMLKRMKDELSRYKQDNTRLKEQLTAAEERSAA 1374 Query: 438 YTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLI 497 SP + + ++ + + + D V + N+ + L+ Sbjct: 1375 SRSPTSWESERAGLVGQIETLQSEINSSAAQMHKELAD------VQKELQDTQNSHSDLM 1428 Query: 498 RDTAEV-EDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEK 544 R E+ + + E R + + + + + +++ Sbjct: 1429 RSHEELKKQLASTSEQARHELGQLQEENAQLEKRAQDAEEKVSLLLDQ 1476 >gi|119509792|ref|ZP_01628936.1| PBS lyase HEAT-like repeat protein [Nodularia spumigena CCY9414] gi|119465527|gb|EAW46420.1| PBS lyase HEAT-like repeat protein [Nodularia spumigena CCY9414] Length = 936 Score = 41.0 bits (94), Expect = 0.53, Method: Composition-based stats. Identities = 26/224 (11%), Positives = 69/224 (30%), Gaps = 22/224 (9%) Query: 343 PLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEF 402 + L +++ ++ + L + D +AA+++ + + K + S++ Sbjct: 107 RRAAAQALGQMQAKEQAPQVALLLKDSDPDVRYAAAQALGQMQAKEVVPQVALLLKDSDW 166 Query: 403 IGAMIS-RELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGV--- 458 + + L + ++ +P+ ++ L + Q E V + Sbjct: 167 NVRNAAAQALGQMQAKEVVPQVALLLKDSDPNVRRAAAYALGQMQAKEVVPQVALLLKDS 226 Query: 459 ---------NTVVELGVKTGDPSCMDHM-DTDRVSRFSLW-ATN-------TPAVLIRDT 500 + ++ K P + D+D R + A P V + Sbjct: 227 DWNVRNAAAQALGQMQAKEVVPQVALLLKDSDWNVRNAAAQALGQMQAKEVVPQVALLLK 286 Query: 501 AEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEK 544 ++R M+ + Q+ +D + A + Sbjct: 287 DSDWNVRNAAAQALGQMQAKEQAPQVALLLKDSDSDVRSVAAQA 330 Score = 39.8 bits (91), Expect = 1.2, Method: Composition-based stats. 
Identities = 28/206 (13%), Positives = 63/206 (30%), Gaps = 29/206 (14%) Query: 373 ASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS-RELDILDSQGNLPECEGADNPPV 431 + +AAE++ + + K + SE + + L + ++ P+ Sbjct: 75 SRSAAAEALGQMQAKEVVPQLALLLKDSETYVRRAAAQALGQMQAKEQAPQVALLLKDSD 134 Query: 432 SLLKVEYTSPLFKYQQAESVASALQGV------------NTVVELGVKTGDPSCMDHM-D 478 ++ L + Q E V + + ++ K P + D Sbjct: 135 PDVRYAAAQALGQMQAKEVVPQVALLLKDSDWNVRNAAAQALGQMQAKEVVPQVALLLKD 194 Query: 479 TDRVSRFSLW-ATN-------TPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTS 530 +D R + A P V + ++R M+ + + Q+ Sbjct: 195 SDPNVRRAAAYALGQMQAKEVVPQVALLLKDSDWNVRNAAAQALGQMQAKEVVPQVALLL 254 Query: 531 QD-------IGAKAAGRAMEKKLTHD 549 +D A+A G+ K++ Sbjct: 255 KDSDWNVRNAAAQALGQMQAKEVVPQ 280 Score = 37.5 bits (85), Expect = 6.7, Method: Composition-based stats. Identities = 32/239 (13%), Positives = 73/239 (30%), Gaps = 35/239 (14%) Query: 343 PLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEF 402 E L +++ L L + + R+AA+++ + + K P + L + Sbjct: 76 RSAAAEALGQMQAKEVVPQLALLLKDSETYVRRAAAQALGQMQAKEQ--APQVALLLKDS 133 Query: 403 IGAMIS----RELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGV 458 + + L + ++ +P+ ++ L + Q E V + Sbjct: 134 -DPDVRYAAAQALGQMQAKEVVPQVALLLKDSDWNVRNAAAQALGQMQAKEVVPQVALLL 192 Query: 459 N------------TVVELGVKTGDPSCMDHM-DTDRVSRFSLW-ATN-------TPAVLI 497 + ++ K P + D+D R + A P V + Sbjct: 193 KDSDPNVRRAAAYALGQMQAKEVVPQVALLLKDSDWNVRNAAAQALGQMQAKEVVPQVAL 252 Query: 498 RDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQD-------IGAKAAGRAMEKKLTHD 549 ++R M+ + + Q+ +D A+A G+ K+ Sbjct: 253 LLKDSDWNVRNAAAQALGQMQAKEVVPQVALLLKDSDWNVRNAAAQALGQMQAKEQAPQ 311 >gi|291334641|gb|ADD94289.1| portal protein [uncultured phage MedDCM-OCT-S04-C64] Length = 755 Score = 40.6 bits (93), Expect = 0.64, Method: Composition-based stats. 
Identities = 47/426 (11%), Positives = 115/426 (26%), Gaps = 60/426 (14%) Query: 176 YREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFH 235 + E T +++ + + N T + + D + Sbjct: 245 FDESAMTEEELARRNKTDEEEPFDYVSEESMRNYFITECYIKIDRDGDDIAE-LLRVTLA 303 Query: 236 SKFVSVDENRFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQF 295 + +R +++ P+ + + YG S A + R + ++ Sbjct: 304 GGNYTSGSSRLLGIEEVDHMPFATCSPILMPHKFYGLSIADITMDLQRIKSVLTRQMLDN 363 Query: 296 GRLSLHPPTIAVSEAKQRN--FDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRL 353 L+ + T + +PG + P+ + Sbjct: 364 TYLANNSRTAVNDSHVNLDDLLTSRPGGVVRYKGEGSASQYITPIPHNPLPNEAYTMMGY 423 Query: 354 KESIRSL-------FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 + +R L + + AA + + R K + ++G + + + + Sbjct: 424 LDDVRRQRTGVGDETAGLGENSLSNVNTGVAALAFDAKRMKIELIARILGEVGFKDVFRL 483 Query: 407 ISRELDILDSQGNLPECEGADN---------PPVSLLKVEYTSPLFKYQQAESVASAL-Q 456 I + L + L G + ++V + + ++ ++ + + + Sbjct: 484 IHKLLMKHQDRKMLLNVAGNFQAINPSEWRKRENTSVQV-GVGSVSRERRMVALETIMAK 542 Query: 457 GVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRD----------------- 499 + G+ T + + R Sbjct: 543 QNELIANGGMGTLVQPFQVY----QTLRDIADGFGLQPQAYFTDPRTLPPPPPPQPDAQA 598 Query: 500 ------------TAEVEDIRQQREVQRRVMEEQ------HLQQQLQQTSQDIGAKAAGRA 541 AE + R Q +V + E+Q L+QQ Q DI + A Sbjct: 599 ELALTHARALVMDAESKMQRNQIDVAKAQAEQQIKFRELELRQQELQLKADIERQKAELV 658 Query: 542 MEKKLT 547 + ++ T Sbjct: 659 LLQRET 664 >gi|3540281|gb|AAC34383.1| All-1 related protein [Takifugu rubripes] Length = 4823 Score = 40.6 bits (93), Expect = 0.64, Method: Composition-based stats. 
Identities = 20/244 (8%), Positives = 64/244 (26%), Gaps = 18/244 (7%) Query: 329 REGRSLFQPVQFGNPLPYHEELNRLKESIRSLFL-------------LDLFQVLDDKASR 375 + Q G+ ++ + L ++ + L + Sbjct: 3209 SGPSTPSHVYQVGSANQLQQKKDHLNLQKQTGLMGNQQSMVQQQQQQPLLTPQRQGSVTD 3268 Query: 376 SAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLK 435 M E + Q M+ + + Q G P V Sbjct: 3269 DKPSMMNIKEEGKTIDISVQQQQQQAVQNPMMQSQDSSMQLQVTGQPHPGQQQPVVMGHN 3328 Query: 436 VEYTSPLFKYQQAESVASALQGVNT-VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPA 494 + + + ++Q+ +++ ++ + ++ + + N P Sbjct: 3329 PQQQALMAQHQKQQAMMGIIRAQQQGITAQRPALQPGQIRTPVNIQAIIAQNPQLRNLPP 3388 Query: 495 VLIRDTAEVEDIRQQREVQR---RVMEEQHLQQQL-QQTSQDIGAKAAGRAMEKKLTHDM 550 + ++Q + + M + ++ Q+ +G + ++ + M Sbjct: 3389 NQQIQHIQAIIAQRQIQQGQMLRMAMGQGQIRPQMPPGQVLQVGQQHQSNMLQPGVNSQM 3448 Query: 551 MENS 554 + Sbjct: 3449 QQGM 3452 >gi|195471922|ref|XP_002088251.1| GE18474 [Drosophila yakuba] gi|194174352|gb|EDW87963.1| GE18474 [Drosophila yakuba] Length = 1037 Score = 40.6 bits (93), Expect = 0.66, Method: Composition-based stats. Identities = 28/210 (13%), Positives = 62/210 (29%), Gaps = 19/210 (9%) Query: 349 ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFV------GPLIGGLQSEF 402 + +R L++ L L + S + S ++ + G L Sbjct: 146 SAETSRTEMRDLYMKLLRNALGQSKNPSLSLSHKQKLARRQLQVQSQAQGQSYQQLARTT 205 Query: 403 IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVV 462 I G ++ E + + Q++E + +Q + + Sbjct: 206 DEEQIQGLAQSQQQSGLKQSLNQNEDQEDQ----EDVTSQAQAQKSERQSQLIQSTQSEI 261 Query: 463 ELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHL 522 + ++ + + NT D E + + Q E Q + + Sbjct: 262 QGQSQSQVQAQS----QAEAISQLQESENT-----TDDQEQAESQDQAESQAQAQTQVQS 312 Query: 523 QQQLQQTSQDIGAKAAGRAMEKKLTHDMME 552 Q Q Q++ G +A +++ L E Sbjct: 313 QAQEQESLVQAGDQAKEDPIDQSLHQAQAE 342 >gi|307108830|gb|EFN57069.1| hypothetical protein CHLNCDRAFT_143822 [Chlorella variabilis] Length = 796 Score = 40.6 bits (93), Expect = 0.70, Method: Composition-based stats. 
Identities = 16/102 (15%), Positives = 38/102 (37%), Gaps = 9/102 (8%) Query: 444 KYQQAESV-ASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAE 502 + SV ++ + + G + + + ++D +R + P + A Sbjct: 699 RAPAMGSVRSAVRRFAAAFEDDGGFNAEDAALSNVDPKEAARRAAD----PVSALDIAAT 754 Query: 503 VEDIRQQREVQRRVMEEQHLQQ----QLQQTSQDIGAKAAGR 540 V ++ Q+ + + + +Q Q+ Q G AAG+ Sbjct: 755 VREVFQRVAAAQPQLMQAGSEQLTPVQMAALQQIFGQAAAGQ 796 >gi|22124533|ref|NP_667956.1| hypothetical protein y0619 [Yersinia pestis KIM 10] gi|150260593|ref|ZP_01917321.1| putative membrane transport protein [Yersinia pestis CA88-4125] gi|218927566|ref|YP_002345441.1| hypothetical protein YPO0363 [Yersinia pestis CO92] gi|229840234|ref|ZP_04460393.1| putative membrane transport protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842312|ref|ZP_04462467.1| putative membrane transport protein [Yersinia pestis biovar Orientalis str. India 195] gi|229903949|ref|ZP_04519062.1| putative membrane transport protein [Yersinia pestis Nepal516] gi|21957330|gb|AAM84207.1|AE013664_2 putative periplasmic binding transport protein [Yersinia pestis KIM 10] gi|115346177|emb|CAL19045.1| putative membrane transport protein [Yersinia pestis CO92] gi|149290001|gb|EDM40078.1| putative membrane transport protein [Yersinia pestis CA88-4125] gi|229679719|gb|EEO75822.1| putative membrane transport protein [Yersinia pestis Nepal516] gi|229690622|gb|EEO82676.1| putative membrane transport protein [Yersinia pestis biovar Orientalis str. India 195] gi|229696600|gb|EEO86647.1| putative membrane transport protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|320013772|gb|ADV97343.1| putative membrane transport protein [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 1119 Score = 40.6 bits (93), Expect = 0.73, Method: Composition-based stats. 
Identities = 26/228 (11%), Positives = 62/228 (27%), Gaps = 25/228 (10%) Query: 327 LSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTRE 386 + L P + L + + L Q + S S + ++ E Sbjct: 100 AQEGDKPLPVPSNLSTSDLEQQVLQVSSQLLELNRLSQQEQDRAREISESLGQLPQQQSE 159 Query: 387 KGAFVGPLIGGLQSEFI-GAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKY 445 + + +QS+ +++ L + + +S L L + Sbjct: 160 ARRMLAEIGPRIQSQSNPSTPVAQAQLTLLQAEAVARKAKVNELELSQLSANNRQELSRL 219 Query: 446 Q------QAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATN-----TPA 494 Q + V + LQ + + + + + Sbjct: 220 QVELYKKREARVQAQLQSLRNNLNNQRQQAAEQAL------ERTELLAEQGGDLPESITQ 273 Query: 495 VLIRDTAEVEDIRQQRE-------VQRRVMEEQHLQQQLQQTSQDIGA 535 L R+ + + QQ + QR+ + + +Q T ++ Sbjct: 274 QLQRNRELSQALNQQVQRIDLISSQQRQAVAQTQQVRQALNTIREQAQ 321 >gi|108809911|ref|YP_653827.1| hypothetical protein YPA_3921 [Yersinia pestis Antiqua] gi|108813468|ref|YP_649235.1| hypothetical protein YPN_3308 [Yersinia pestis Nepal516] gi|165926747|ref|ZP_02222579.1| mechanosensitive ion channel domain protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165936580|ref|ZP_02225148.1| mechanosensitive ion channel domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|166011886|ref|ZP_02232784.1| mechanosensitive ion channel domain protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166213988|ref|ZP_02240023.1| mechanosensitive ion channel domain protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167400559|ref|ZP_02306068.1| mechanosensitive ion channel domain protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167419121|ref|ZP_02310874.1| mechanosensitive ion channel domain protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167423312|ref|ZP_02315065.1| mechanosensitive ion channel domain protein [Yersinia pestis biovar Mediaevalis str. 
K1973002] gi|167469168|ref|ZP_02333872.1| hypothetical protein YpesF_15059 [Yersinia pestis FV-1] gi|270489063|ref|ZP_06206137.1| transporter, small conductance mechanosensitive ion channel (MscS) family protein [Yersinia pestis KIM D27] gi|294502472|ref|YP_003566534.1| membrane transport protein [Yersinia pestis Z176003] gi|108777116|gb|ABG19635.1| membrane transport protein [Yersinia pestis Nepal516] gi|108781824|gb|ABG15882.1| putative membrane transport protein [Yersinia pestis Antiqua] gi|165915696|gb|EDR34305.1| mechanosensitive ion channel domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|165921370|gb|EDR38594.1| mechanosensitive ion channel domain protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165989245|gb|EDR41546.1| mechanosensitive ion channel domain protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166204783|gb|EDR49263.1| mechanosensitive ion channel domain protein [Yersinia pestis biovar Antiqua str. B42003004] gi|166963115|gb|EDR59136.1| mechanosensitive ion channel domain protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167049927|gb|EDR61335.1| mechanosensitive ion channel domain protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167057482|gb|EDR67228.1| mechanosensitive ion channel domain protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|262360502|gb|ACY57223.1| membrane transport protein [Yersinia pestis D106004] gi|262364449|gb|ACY61006.1| membrane transport protein [Yersinia pestis D182038] gi|270337567|gb|EFA48344.1| transporter, small conductance mechanosensitive ion channel (MscS) family protein [Yersinia pestis KIM D27] gi|294352931|gb|ADE63272.1| membrane transport protein [Yersinia pestis Z176003] Length = 1113 Score = 40.6 bits (93), Expect = 0.77, Method: Composition-based stats. 
Identities = 26/228 (11%), Positives = 62/228 (27%), Gaps = 25/228 (10%) Query: 327 LSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTRE 386 + L P + L + + L Q + S S + ++ E Sbjct: 94 AQEGDKPLPVPSNLSTSDLEQQVLQVSSQLLELNRLSQQEQDRAREISESLGQLPQQQSE 153 Query: 387 KGAFVGPLIGGLQSEFI-GAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKY 445 + + +QS+ +++ L + + +S L L + Sbjct: 154 ARRMLAEIGPRIQSQSNPSTPVAQAQLTLLQAEAVARKAKVNELELSQLSANNRQELSRL 213 Query: 446 Q------QAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATN-----TPA 494 Q + V + LQ + + + + + Sbjct: 214 QVELYKKREARVQAQLQSLRNNLNNQRQQAAEQAL------ERTELLAEQGGDLPESITQ 267 Query: 495 VLIRDTAEVEDIRQQRE-------VQRRVMEEQHLQQQLQQTSQDIGA 535 L R+ + + QQ + QR+ + + +Q T ++ Sbjct: 268 QLQRNRELSQALNQQVQRIDLISSQQRQAVAQTQQVRQALNTIREQAQ 315 >gi|45440372|ref|NP_991911.1| hypothetical protein YP_0518 [Yersinia pestis biovar Microtus str. 91001] gi|229836622|ref|ZP_04456788.1| putative membrane transport protein [Yersinia pestis Pestoides A] gi|45435228|gb|AAS60788.1| putative membrane transport protein [Yersinia pestis biovar Microtus str. 91001] gi|229706306|gb|EEO92314.1| putative membrane transport protein [Yersinia pestis Pestoides A] Length = 1119 Score = 40.6 bits (93), Expect = 0.78, Method: Composition-based stats. 
Identities = 26/228 (11%), Positives = 62/228 (27%), Gaps = 25/228 (10%) Query: 327 LSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTRE 386 + L P + L + + L Q + S S + ++ E Sbjct: 100 AQEGDKPLPVPSNLSTSDLEQQVLQVSSQLLELNRLSQQEQDRAREISESLGQLPQQQSE 159 Query: 387 KGAFVGPLIGGLQSEFI-GAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKY 445 + + +QS+ +++ L + + +S L L + Sbjct: 160 ARRMLAEIGPRIQSQSNPSTPVAQAQLTLLQAEAVARKAKVNELELSQLSANNRQELSRL 219 Query: 446 Q------QAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATN-----TPA 494 Q + V + LQ + + + + + Sbjct: 220 QVELYKKREARVQAQLQSLRNNLNNQRQQAAEQAL------ERTELLAEQGGDLPESITQ 273 Query: 495 VLIRDTAEVEDIRQQRE-------VQRRVMEEQHLQQQLQQTSQDIGA 535 L R+ + + QQ + QR+ + + +Q T ++ Sbjct: 274 QLQRNRELSQALNQQVQRIDLISSQQRQAVAQTQQVRQALNTIREQAQ 321 >gi|310286713|ref|YP_003937971.1| Permease protein of ABC transporter system [Bifidobacterium bifidum S17] gi|309250649|gb|ADO52397.1| Permease protein of ABC transporter system [Bifidobacterium bifidum S17] Length = 1139 Score = 40.6 bits (93), Expect = 0.78, Method: Composition-based stats. 
Identities = 33/267 (12%), Positives = 69/267 (25%), Gaps = 29/267 (10%) Query: 297 RLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSR---EGRSLFQPVQFGNPLPYHEELNRL 353 L+ A S+ + G+ + + Sbjct: 275 SLASDYTFFAPSDGVTGDIYTAISLTVSGSTDEDAFGDDYDTLVRDVADR--IEATVQTK 332 Query: 354 KESIRSLFLLD----LFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISR 409 +++ R L+D A R ++ + E+ + Q++ + Sbjct: 333 RQNERRQTLVDAAQKKLDQAKTDAYRQLDDAQMQITEQTEELK--TRREQAKTTKQSLED 390 Query: 410 ELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTG 469 +L L+ Q + V Q + + QG+ T + Sbjct: 391 QLTQLEDQ------SEQLQDGKDQVNV------GLLQARQGQSQLQQGIATAQTMNDLAA 438 Query: 470 DPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRR------VMEEQHLQ 523 + D + A P ++ + + Q R + LQ Sbjct: 439 QGARAAEQAADAADQAVAGAQGLPETVLEPLRKAAKTARDLATQARSKADESAAQLTQLQ 498 Query: 524 QQLQQTSQDIGAKAAGRAMEKKLTHDM 550 QL Q + I A A ++ T + Sbjct: 499 SQLSQVNATIAQLEAQSATLQRQTEQL 525 >gi|153949019|ref|YP_001402618.1| hypothetical protein YpsIP31758_3664 [Yersinia pseudotuberculosis IP 31758] gi|170026025|ref|YP_001722530.1| hypothetical protein YPK_3811 [Yersinia pseudotuberculosis YPIII] gi|152960514|gb|ABS47975.1| mechanosensitive ion channel domain protein [Yersinia pseudotuberculosis IP 31758] gi|169752559|gb|ACA70077.1| MscS Mechanosensitive ion channel [Yersinia pseudotuberculosis YPIII] Length = 1113 Score = 40.6 bits (93), Expect = 0.80, Method: Composition-based stats. 
Identities = 26/228 (11%), Positives = 62/228 (27%), Gaps = 25/228 (10%) Query: 327 LSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTRE 386 + L P + L + + L Q + S S + ++ E Sbjct: 94 AQEGDKPLPVPSNLSTSDLEQQVLQVSSQLLELNRLSQQEQDRAREISESLGQLPQQQSE 153 Query: 387 KGAFVGPLIGGLQSEFI-GAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKY 445 + + +QS+ +++ L + + +S L L + Sbjct: 154 ARRMLAEIGPRIQSQSNPSTPVAQAQLTLLQAEAVARKAKVNELELSQLSANNRQELSRL 213 Query: 446 Q------QAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATN-----TPA 494 Q + V + LQ + + + + + Sbjct: 214 QVELYKKREARVQAQLQSLRNNLNNQRQQAAEQAL------ERTELLAEQGGDLPESITQ 267 Query: 495 VLIRDTAEVEDIRQQRE-------VQRRVMEEQHLQQQLQQTSQDIGA 535 L R+ + + QQ + QR+ + + +Q T ++ Sbjct: 268 QLQRNRELSQALNQQVQRIDLISSQQRQAVAQTQQVRQALNTIREQAQ 315 >gi|145600859|ref|YP_001164935.1| hypothetical protein YPDSF_3612 [Yersinia pestis Pestoides F] gi|162418708|ref|YP_001605289.1| hypothetical protein YpAngola_A0711 [Yersinia pestis Angola] gi|145212555|gb|ABP41962.1| membrane transport protein [Yersinia pestis Pestoides F] gi|162351523|gb|ABX85471.1| mechanosensitive ion channel domain protein [Yersinia pestis Angola] Length = 1113 Score = 40.6 bits (93), Expect = 0.80, Method: Composition-based stats. 
Identities = 26/228 (11%), Positives = 62/228 (27%), Gaps = 25/228 (10%) Query: 327 LSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTRE 386 + L P + L + + L Q + S S + ++ E Sbjct: 94 AQEGDKPLPVPSNLSTSDLEQQVLQVSSQLLELNRLSQQEQDRAREISESLGQLPQQQSE 153 Query: 387 KGAFVGPLIGGLQSEFI-GAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKY 445 + + +QS+ +++ L + + +S L L + Sbjct: 154 ARRMLAEIGPRIQSQSNPSTPVAQAQLTLLQAEAVARKAKVNELELSQLSANNRQELSRL 213 Query: 446 Q------QAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATN-----TPA 494 Q + V + LQ + + + + + Sbjct: 214 QVELYKKREARVQAQLQSLRNNLNNQRQQAAEQAL------ERTELLAEQGGDLPESITQ 267 Query: 495 VLIRDTAEVEDIRQQRE-------VQRRVMEEQHLQQQLQQTSQDIGA 535 L R+ + + QQ + QR+ + + +Q T ++ Sbjct: 268 QLQRNRELSQALNQQVQRIDLISSQQRQAVAQTQQVRQALNTIREQAQ 315 >gi|186893774|ref|YP_001870886.1| hypothetical protein YPTS_0441 [Yersinia pseudotuberculosis PB1/+] gi|186696800|gb|ACC87429.1| MscS Mechanosensitive ion channel [Yersinia pseudotuberculosis PB1/+] Length = 1113 Score = 40.6 bits (93), Expect = 0.81, Method: Composition-based stats. 
Identities = 26/228 (11%), Positives = 62/228 (27%), Gaps = 25/228 (10%) Query: 327 LSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTRE 386 + L P + L + + L Q + S S + ++ E Sbjct: 94 AQEGDKPLPVPSNLSTSDLEQQVLQVSSQLLELNRLSQQEQDRAREISESLGQLPQQQSE 153 Query: 387 KGAFVGPLIGGLQSEFI-GAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKY 445 + + +QS+ +++ L + + +S L L + Sbjct: 154 ARRMLAEIGPRIQSQSNPSTPVAQAQLTLLQAEAVARKAKVNELELSQLSANNRQELSRL 213 Query: 446 Q------QAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATN-----TPA 494 Q + V + LQ + + + + + Sbjct: 214 QVELYKKREARVQAQLQSLRNNLNNQRQQAAEQAL------ERTELLAEQGGDLPESITQ 267 Query: 495 VLIRDTAEVEDIRQQRE-------VQRRVMEEQHLQQQLQQTSQDIGA 535 L R+ + + QQ + QR+ + + +Q T ++ Sbjct: 268 QLQRNRELSQALNQQVQRIDLISSQQRQAVAQTQQVRQALNTIREQAQ 315 >gi|160700609|ref|YP_001552284.1| hypothetical protein BA3_0015 [Thalassomonas phage BA3] gi|157787728|gb|ABV74300.1| hypothetical protein BA3_0015 [Thalassomonas phage BA3] Length = 711 Score = 40.2 bits (92), Expect = 0.87, Method: Composition-based stats. 
Identities = 41/330 (12%), Positives = 77/330 (23%), Gaps = 50/330 (15%) Query: 272 RSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGY-------MNI 324 RS + R N + + L+ P I + D + Sbjct: 353 RSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTY 412 Query: 325 GALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKT 384 + + P E I+S + + S + + Sbjct: 413 IPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQ 472 Query: 385 REKGAFVGPLIGGLQ---SEFIGAMISRELDILDSQG----NLPECEGADNPPVSLL--- 434 R+ I L ++ I D++ P+ + Sbjct: 473 RQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDE 532 Query: 435 -----------------KVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHM 477 V T P F Q+ E+ + +Q V D +M Sbjct: 533 ESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMAD-LIAQNM 591 Query: 478 DTDRVSRFSLWATN--TPAVLIRDTAEVEDIRQ-----------QREVQRRVMEEQHLQQ 524 D P ++ E E I + Q+ + + + Sbjct: 592 DWPGA-DVIAERLKKIVPPNVL-SKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAE 649 Query: 525 QLQQTSQDIGAKAAGRAMEKKLTHDMMENS 554 +Q KA E + M+E+ Sbjct: 650 ADTAQAQADMLKAQLETEEAQKQLAMIEDM 679 >gi|301107205|ref|XP_002902685.1| conserved hypothetical protein [Phytophthora infestans T30-4] gi|262098559|gb|EEY56611.1| conserved hypothetical protein [Phytophthora infestans T30-4] Length = 1082 Score = 40.2 bits (92), Expect = 0.89, Method: Composition-based stats. 
Identities = 29/219 (13%), Positives = 71/219 (32%), Gaps = 10/219 (4%) Query: 340 FGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQ 399 E+ RL+E +R L + + E +I Sbjct: 526 TVKFDDVGREVTRLQEEVRLLKAGSAAAPTSENERSMLHTLSTRLEEAMIQAKDVIT--Y 583 Query: 400 SEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAES--VASALQG 457 + + + L + +G + S + E T+ L + QQ++ + + Sbjct: 584 KDGVIQSLKERLQLASKRGA--DTIALLQQERSEFEREKTNLLAQLQQSKDSSASKKDEE 641 Query: 458 VNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVM 517 V+ + + + +++ A + RD + R +++V + Sbjct: 642 VSRLQAENMALEQQKAALTVKVAQLTLELETARSQWTQDARDREHRAEKRCEKQVAQAEE 701 Query: 518 EEQH----LQQQLQQTSQDIGAKAAGRAMEKKLTHDMME 552 + + +QQQ+ Q ++ K A + + ++ E Sbjct: 702 QLEQATTAMQQQMAQFRAELDMKVAKQRVAAQVACRAGE 740 >gi|51594767|ref|YP_068958.1| hypothetical protein YPTB0415 [Yersinia pseudotuberculosis IP 32953] gi|51588049|emb|CAH19655.1| Small Conductance Mechanosensitive Ion Channel (MscS) Family Protein [Yersinia pseudotuberculosis IP 32953] Length = 1119 Score = 40.2 bits (92), Expect = 0.97, Method: Composition-based stats. 
Identities = 26/228 (11%), Positives = 63/228 (27%), Gaps = 25/228 (10%) Query: 327 LSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTRE 386 + L P + L + + L Q + S S + ++ E Sbjct: 100 AQEGDKPLPVPSNLSTSDLEQQVLQVSSQLLELNRLSQQEQDRAREISESLGQLPQQQSE 159 Query: 387 KGAFVGPLIGGLQSEFI-GAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKY 445 + + +QS+ + +++ L + + +S L L + Sbjct: 160 ARRMLAEIGPRIQSQSNPSSPVAQAQLTLLQAEAVARKAKVNELELSQLSANNRQELSRL 219 Query: 446 Q------QAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATN-----TPA 494 Q + V + LQ + + + + + Sbjct: 220 QVELYKKREARVQAQLQSLRNNLNNQRQQAAEQAL------ERTELLAEQGGDLPESITQ 273 Query: 495 VLIRDTAEVEDIRQQRE-------VQRRVMEEQHLQQQLQQTSQDIGA 535 L R+ + + QQ + QR+ + + +Q T ++ Sbjct: 274 QLQRNRELSQALNQQVQRIDLISSQQRQAVAQTQQVRQALNTIREQAQ 321 >gi|172087805|ref|YP_206390.2| fused chromosome partitioning protein: nucleotide hydrolase [Vibrio fischeri ES114] gi|171902388|gb|AAW87502.2| fused chromosome partitioning protein: predicted nucleotide hydrolase [Vibrio fischeri ES114] Length = 1488 Score = 40.2 bits (92), Expect = 0.97, Method: Composition-based stats. 
Identities = 34/285 (11%), Positives = 88/285 (30%), Gaps = 14/285 (4%) Query: 282 IRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG 341 + N + AQ + P + + + + + L + G +S + Sbjct: 165 FKAFNSVTDYHAQMFDYGVLPKKLRNTSDRSKFYRLIEASL-YGGISSAITRSLRDYLLP 223 Query: 342 NPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 + ++ ++R + + ++ A + + Sbjct: 224 QNGGVKKAFQDMEAALRENRMTLEAIKTTQSDRDLFKHLITESTNYVASDYMRHANDRRK 283 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + ++ ++++++Q +L + N S L++ S Q ++ + LQ V T Sbjct: 284 KVEQTLTHRVELMNAQRSLVDLSSVLNNMQSELELLTESESGLEQDYQAASDHLQLVQTA 343 Query: 462 ----------VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQRE 511 E + + M + + A + EV+ ++ Q Sbjct: 344 VRQQEKIERYSEDLEELTERLEEQVMVVEEAAEQLAMA---EEQALLTEEEVDSLKTQLA 400 Query: 512 VQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMENSYG 556 ++ ++ Q + Q + KA + LT D + G Sbjct: 401 DYQQALDMQQTRALQYQQAVKALEKAQQLTANESLTQDNAVDLQG 445 >gi|295103621|emb|CBL01165.1| Site-specific recombinases, DNA invertase Pin homologs [Faecalibacterium prausnitzii SL3/3] Length = 849 Score = 39.8 bits (91), Expect = 1.1, Method: Composition-based stats. 
Identities = 23/175 (13%), Positives = 58/175 (33%), Gaps = 14/175 (8%) Query: 386 EKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKY 445 EK P G + + I + + G + + K++ + K Sbjct: 523 EKVEVHAPTGGR--TRYRQQRIDIYFNFI---GEYHPPAEEISEEERVRKIDEQAEAKKN 577 Query: 446 QQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFS------LWATNT---PAVL 496 ++ + + ++ + GDP + ++++R + Sbjct: 578 EKRQKSVQRYRERQNELKAAAQAGDPEAIAKLESERERKRLQGAKRRAELKAIREADPEY 637 Query: 497 IRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMM 551 +R E E IR ++ + + + + + ++T +++ A A E D M Sbjct: 638 LRTMEEKERIRLEKMQEAERRKAEKQKNKAKRTRKELKALAEAGDPEAIAERDAM 692 >gi|197336681|ref|YP_002158030.1| chromosome partition protein MukB [Vibrio fischeri MJ11] gi|197313933|gb|ACH63382.1| chromosome partition protein MukB [Vibrio fischeri MJ11] Length = 1490 Score = 39.8 bits (91), Expect = 1.1, Method: Composition-based stats. Identities = 33/278 (11%), Positives = 87/278 (31%), Gaps = 14/278 (5%) Query: 282 IRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFG 341 + N + AQ + P + + + + + L + G +S + Sbjct: 167 FKAFNSVTDYHAQMFDYGVLPKKLRNTSDRSKFYRLIEASL-YGGISSAITRSLRDYLLP 225 Query: 342 NPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 + ++ ++R + + ++ A + + Sbjct: 226 QNGGVKKAFQDMEAALRENRMTLEAIKTTQSDRDLFKHLITESTNYVASDYMRHANDRRK 285 Query: 402 FIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTV 461 + ++ ++++++Q +L + N S L++ S Q ++ + LQ V T Sbjct: 286 KVEQTLTHRVELMNAQRSLVDLSSVLNNMQSELELLTESESGLEQDYQAASDHLQLVQTA 345 Query: 462 ----------VELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQRE 511 E + + M + + A + EV+ ++ Q Sbjct: 346 VRQQEKIERYSEDLEELTERLEEQVMVVEEAAEQLAMA---EEQALLTEEEVDSLKTQLA 402 Query: 512 VQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHD 549 ++ ++ Q + Q + KA + + LT D Sbjct: 403 DYQQALDMQQTRALQYQQAVKALEKAQQLTVNESLTQD 440 >gi|221481559|gb|EEE19941.1| DEAD-box helicase family protein [Toxoplasma gondii GT1] Length = 1158 Score = 39.8 bits (91), Expect = 1.1, Method: Composition-based stats. 
Identities = 25/190 (13%), Positives = 54/190 (28%), Gaps = 21/190 (11%) Query: 309 EAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQV 368 A+ + F L+ G + + + + L+ I F + + Sbjct: 852 SAETQAFQLRRGAEIVIGTPGRVKDCLEKAYTVLNQCNYVVLDEADRMIDMGFEEIVNFI 911 Query: 369 LDDKAS---RSAAESMEKTREKGAFVGPLIGGLQ---SEFIGAMISREL-DILDSQGNLP 421 LD + +S E++ +E A G + L S + + R L + Sbjct: 912 LDQIPTSNLKSNDEALILQQEMQAKAGHRLYRLTQMFSATMPPAVERLARKYLRQPSYIS 971 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 + +VE+ K Q+ + V + P M ++ + Sbjct: 972 IGDPGAGKRAIEQRVEFVPEARKKQRLQDV------LENAT--------PPVMVFVNQKK 1017 Query: 482 VSRFSLWATN 491 + Sbjct: 1018 SADALAKVLG 1027 >gi|237843843|ref|XP_002371219.1| DEAD-box ATP-dependent RNA helicase, putative [Toxoplasma gondii ME49] gi|211968883|gb|EEB04079.1| DEAD-box ATP-dependent RNA helicase, putative [Toxoplasma gondii ME49] Length = 1158 Score = 39.8 bits (91), Expect = 1.1, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 54/190 (28%), Gaps = 21/190 (11%) Query: 309 EAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQV 368 A+ + F L+ G + + + + L+ I F + + Sbjct: 852 SAETQAFQLRRGAEIVIGTPGRVKDCLEKAYTVLNQCNYVVLDEADRMIDMGFEEIVNFI 911 Query: 369 LDDKAS---RSAAESMEKTREKGAFVGPLIGGLQ---SEFIGAMISREL-DILDSQGNLP 421 LD + +S E++ +E A G + L S + + R L + Sbjct: 912 LDQIPTSNLKSNDEALILQQEMQAKAGHRLYRLTQMFSATMPPAVERLARKYLRQPSYIS 971 Query: 422 ECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDR 481 + +VE+ K Q+ + V + P M ++ + Sbjct: 972 IGDPGAGKRAIEQRVEFVPEARKKQRLQDV------LENAT--------PPVMVFVNQKK 1017 Query: 482 VSRFSLWATN 491 + Sbjct: 1018 SADALAKVLG 1027 >gi|239927556|ref|ZP_04684509.1| hypothetical protein SghaA1_04984 [Streptomyces ghanaensis ATCC 14672] gi|291435900|ref|ZP_06575290.1| predicted protein [Streptomyces ghanaensis ATCC 14672] gi|291338795|gb|EFE65751.1| predicted protein [Streptomyces ghanaensis ATCC 14672] Length = 1629 Score = 39.8 bits (91), Expect = 1.2, Method: Composition-based 
stats. Identities = 32/272 (11%), Positives = 68/272 (25%), Gaps = 19/272 (6%) Query: 289 VNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHE 348 N Q G+ + A+ L + N R+ Sbjct: 376 TNATLQGGQAASQGAQKALQ-MAGAQQSLAAAHRNAARQIRQAEEGVADAVRNAAEASER 434 Query: 349 ELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMIS 408 ++K++ R L RSAAE + E A Q + A Sbjct: 435 AAQQVKQAKR---GLADAVQQAADRQRSAAEQVRSAEESLADAQRTARQAQQDLTQARAD 491 Query: 409 RELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKT 468 + D + L ++ V ++ +T + + + V + Sbjct: 492 AARQLEDLESRLANASLSERDAVLAVQEAHT----RLIRMREAGESASYVE--QQRAQLA 545 Query: 469 GDPSCMDHMDTDRVSRFS------LWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHL 522 D + D ++ + R ++ Q +Q L Sbjct: 546 YDQAVQRLADQRAETKRLSAEKKKADKAGVEGSDLVLD---AQERLRQAEQGVAKGQQQL 602 Query: 523 QQQLQQTSQDIGAKAAGRAMEKKLTHDMMENS 554 + + ++ A ++ + N Sbjct: 603 AKAREDAARQAVQSQRDIAEAQQRVAEAQRNV 634 >gi|152987165|ref|YP_001351358.1| methyl-accepting chemotaxis protein [Pseudomonas aeruginosa PA7] gi|150962323|gb|ABR84348.1| methyl-accepting chemotaxis protein [Pseudomonas aeruginosa PA7] Length = 708 Score = 39.8 bits (91), Expect = 1.2, Method: Composition-based stats. 
Identities = 35/283 (12%), Positives = 82/283 (28%), Gaps = 19/283 (6%) Query: 282 IRRLNETVNELAQFGRLSLHPPT------IAVSEAKQRNFDLKPGYMNIGALSREGRSLF 335 LN + +L+ + + ++Q L+ N Sbjct: 257 QETLNGMSEAMQTALTDALNNIMAPAIQTLVSTTSQQSTQVLEKLVGNFMDGMTSVGREQ 316 Query: 336 QPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLI 395 + ++ + E + LF L+++ R + +++ + + Sbjct: 317 GLQMQQAAADVNAAVSGMSERLNQLF-----SSLNEQQGRQMEVAQQQSAAFETQLQRIS 371 Query: 396 GGLQSEFIGAMISRELDILDS--QGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVAS 453 G +E A + + L S L G +V + L + ++ A Sbjct: 372 G--SAEERQAQMEQRFAELMSGLTNQLQTQLGTAQQRDEERQVLFERLLGQASSSQ-TAM 428 Query: 454 ALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQ 513 Q ++ E + H + ++V + NT + R+Q Q Sbjct: 429 LEQFSSSTREQMQAMAEAGNERHSNLEKVFSRLMMNLNTQLD---SQMGAAEQREQARQQ 485 Query: 514 RRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMENSYG 556 R + + Q+ + + + +L D + G Sbjct: 486 RFQEQLDQVSTHQQELLSGLASAVQATQQQSRLMADQHQQLLG 528 >gi|221485690|gb|EEE23971.1| membrane attachment protein, putative [Toxoplasma gondii GT1] Length = 4912 Score = 39.8 bits (91), Expect = 1.2, Method: Composition-based stats. 
Identities = 25/238 (10%), Positives = 66/238 (27%), Gaps = 20/238 (8%) Query: 326 ALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTR 385 A + Q L +++ + L + + E + Sbjct: 4469 AGIEQHAKSVQAQAQAWESEVAMVLAEMQDLVSELQAANRTNSPANVRH----EVVANLA 4524 Query: 386 EKGAFVGPLIGGLQSEF-IGAMISRELDILDSQGNLPECEGADNPPVS----LLKVEYTS 440 + + G + R +L+ +L P + L+ + S Sbjct: 4525 AVNSLLHNTESDQVETVDTGPELMRATTLLNRAQSLLRTAVDPGDPDTHENADLEAQAES 4584 Query: 441 PLFKYQQAESVASALQGVNTVVELG------VKTGDPSCMDHMDTDRVSRFSLWATNTPA 494 + Q+ + + V G G P + + A Sbjct: 4585 LSGRLQEHVDKHNLNRFEQFVSSTGSGLWSLENLGLPPM-----VEAALAALARTQSEAA 4639 Query: 495 VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMME 552 L+R+ + ++ + Q+ + + + L+ +T + +G + E + ++ Sbjct: 4640 DLMREWSRIQGLDQEAQADLQTRLRERLEAVSAETRKALGMLRSSLLSEVQRNDAKLQ 4697 >gi|313115193|ref|ZP_07800677.1| hypothetical protein HMPREF9436_02547 [Faecalibacterium cf. prausnitzii KLE1255] gi|310622471|gb|EFQ05942.1| hypothetical protein HMPREF9436_02547 [Faecalibacterium cf. prausnitzii KLE1255] Length = 604 Score = 39.8 bits (91), Expect = 1.3, Method: Composition-based stats. 
Identities = 38/355 (10%), Positives = 93/355 (26%), Gaps = 21/355 (5%) Query: 79 FSAYQAFLYKEDARSKKVREWCDQVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFY 138 + + + +A + V + ++ + + ++ GTG Sbjct: 81 DNYPEPNVLPREADDEDTARALSSVLPVV-----LEQADYEQVYSDCWWRKLKQGTGVTG 135 Query: 139 MEADVDEKGLEEGIRYISVPLSNVYMSV---NHQNVVDSVYREFTFTVDQIVSKWGDKVL 195 + D +G I SV L +Y + Q D T Sbjct: 136 IFWDPAMRGGIGDIAVRSVNLLMLYWEPGVADIQASPDFFSLSLEDTARLCAQYPQLAGH 195 Query: 196 SSKMKSALARNENERFTIIHAVYPKSLTDKKKDK-GNKGFHSKFVS-------VDENRFF 247 ++ + +E K+ D+ G H + Sbjct: 196 TASVLDVPRYIHDEGQDTSSKSVVVDWYYKRPDETGRMVLHYCKFCNGVVLYASQNDPAL 255 Query: 248 EEKQIAT---FPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPT 304 E + +P++ V D G +++ + + + LS Sbjct: 256 AESGLYDHGQYPFVFDPLFVEEDSPAGFGYIDVMKDCQTAIDKMNHAMDENVLLSAKQRY 315 Query: 305 IAVSEAKQRNFDLKPGYMNIGALSRE-GRSLFQPVQFGNPLPYHEELNRLK-ESIRSLFL 362 + A +L +I + F+P+Q + + E ++ + Sbjct: 316 VLSDTAGVNEEELADFSRDIVHVVGRLNDDSFRPLQTAGLQGNSLSYRQSRIEELKEISG 375 Query: 363 LDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQ 417 +AA ++ +E G+ + + ++++ Sbjct: 376 NRDMTQGGTAGGVTAASAIAALQEAGSKLSRDMLKSAYRAFAKQCYLIIELMRQF 430 >gi|218288465|ref|ZP_03492755.1| ATPase AAA-2 domain protein [Alicyclobacillus acidocaldarius LAA1] gi|218241438|gb|EED08612.1| ATPase AAA-2 domain protein [Alicyclobacillus acidocaldarius LAA1] Length = 676 Score = 39.8 bits (91), Expect = 1.3, Method: Composition-based stats. 
Identities = 25/231 (10%), Positives = 65/231 (28%), Gaps = 5/231 (2%) Query: 311 KQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPY-HEELNRLKESIRSLFLLDLFQVL 369 L G G L + G+ L+ ++ + L FQ + Sbjct: 159 FIDEIHLLVGAGASQGGLDAGNILKPALARGDIQVIGATTLDEYRQIEKDPALERRFQPV 218 Query: 370 DDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNP 429 + + E I A ++ + + + + Sbjct: 219 MVDEPSVEEAVQILEGLRPRYEAYHGVRYTDEAIRACVTLSHRYIGDRFLPDKAIDLMDE 278 Query: 430 PVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWA 489 S ++Y + E +A+ + + + + + ++ +++ A Sbjct: 279 AGSKANLQYGG--DRASIEERLAAIAREKE--AAIRQEAYERAAELKVEEEKLRAELAHA 334 Query: 490 TNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGR 540 V + D ++ I + + + Q +L+ D+ A G+ Sbjct: 335 AGASDVPVVDEEQIAAIVEAKTGIPVTRMQADEQAKLKNLEADLAAVVIGQ 385 >gi|149728671|ref|XP_001498255.1| PREDICTED: laminin, beta 2 (laminin S) [Equus caballus] Length = 1801 Score = 39.8 bits (91), Expect = 1.3, Method: Composition-based stats. Identities = 26/226 (11%), Positives = 60/226 (26%), Gaps = 21/226 (9%) Query: 346 YHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGA 405 + +++ + + + ++ ++ RE V + E Sbjct: 1477 LSQVAETRRQAGEAQQQAQAALDKANASRGQVEQANQELRELIQSVKDFLS---QEGADP 1533 Query: 406 MISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLF-----KYQQAESVASALQGVNT 460 + + ++P + E L + V A Q + Sbjct: 1534 DSIEMVATRVLELSIPASPEQIQHLAGAI-AERVRSLADVDTILARTVGDVRRAEQLLQD 1592 Query: 461 VVEL-----GVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRR 515 G K + + + R A + DT + E Q + + Sbjct: 1593 ARRARSRAEGEKQKAETVQAAL--EEAQRAQGAAQGAIQGAVVDTQDTEQTLHQVQERMA 1650 Query: 516 VMEEQ-----HLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMENSYG 556 E+ QQL + + K AG ++ + ++ G Sbjct: 1651 GAEQALSSAGERAQQLDGLLEALKLKRAGNSLAASSAEETAGSAQG 1696 >gi|221502938|gb|EEE28648.1| membrane attachment protein, putative [Toxoplasma gondii VEG] Length = 4798 Score = 39.4 bits (90), Expect = 1.5, Method: Composition-based stats. 
Identities = 25/238 (10%), Positives = 66/238 (27%), Gaps = 20/238 (8%) Query: 326 ALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTR 385 A + Q L +++ + L + + E + Sbjct: 4355 AGIEQHAKSVQAQAQAWESEVAMVLAEMQDLVSELQAANRTNSPANVRH----EVVANLA 4410 Query: 386 EKGAFVGPLIGGLQSEF-IGAMISRELDILDSQGNLPECEGADNPPVS----LLKVEYTS 440 + + G + R +L+ +L P + L+ + S Sbjct: 4411 AVNSLLHNTESDQVETVDTGPELMRATTLLNRAQSLLRTAVDPGDPDTHENADLEAQAES 4470 Query: 441 PLFKYQQAESVASALQGVNTVVELG------VKTGDPSCMDHMDTDRVSRFSLWATNTPA 494 + Q+ + + V G G P + + A Sbjct: 4471 LSGRLQEHVDKHNLNRFEQFVSSTGSGLWSLENLGLPPM-----VEAALAALARTQSEAA 4525 Query: 495 VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMME 552 L+R+ + ++ + Q+ + + + L+ +T + +G + E + ++ Sbjct: 4526 DLMREWSRIQGLDQEAQADLQTRLRERLEAVSAETRKALGMLRSSLLSEVQRNDAKLQ 4583 >gi|237842841|ref|XP_002370718.1| membrane attachment protein, putative [Toxoplasma gondii ME49] gi|211968382|gb|EEB03578.1| membrane attachment protein, putative [Toxoplasma gondii ME49] Length = 4900 Score = 39.4 bits (90), Expect = 1.5, Method: Composition-based stats. Identities = 25/238 (10%), Positives = 66/238 (27%), Gaps = 20/238 (8%) Query: 326 ALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTR 385 A + Q L +++ + L + + E + Sbjct: 4457 AGIEQHAKSVQAQAQAWESEVAMVLAEMQDLVSELQAANRTNSPANVRH----EVVANLA 4512 Query: 386 EKGAFVGPLIGGLQSEF-IGAMISRELDILDSQGNLPECEGADNPPVS----LLKVEYTS 440 + + G + R +L+ +L P + L+ + S Sbjct: 4513 AVNSLLHNTESDQVETVDTGPELMRATTLLNRAQSLLRTAVDPGDPDTQENADLEAQAES 4572 Query: 441 PLFKYQQAESVASALQGVNTVVELG------VKTGDPSCMDHMDTDRVSRFSLWATNTPA 494 + Q+ + + V G G P + + A Sbjct: 4573 LSGRLQEHMDKHNLNRFEQFVSSTGSGLWSLENLGLPPM-----VEAALAALARTQSEAA 4627 Query: 495 VLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMME 552 L+R+ + ++ + Q+ + + + L+ +T + +G + E + ++ Sbjct: 4628 DLMREWSRIQGLDQEAQADLQTRLRERLEAVSAETRKALGMLQSSLLSEVQRNDAKLQ 4685 >gi|332286581|ref|YP_004418492.1| metallopeptidase [Pusillimonas sp. 
T7-7] gi|330430534|gb|AEC21868.1| metallopeptidase [Pusillimonas sp. T7-7] Length = 475 Score = 39.4 bits (90), Expect = 1.5, Method: Composition-based stats. Identities = 25/222 (11%), Positives = 66/222 (29%), Gaps = 25/222 (11%) Query: 337 PVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREK---GAFVGP 393 P E+ L+ I L D + +A++ + Sbjct: 21 PTLTQKQADAREQRAELRARI--AGLQDEIDRSESSRRDAASQLKASETAISASNRRLAE 78 Query: 394 LIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVAS 453 L + E + E I++ + L + + SP ++ + Sbjct: 79 LAER-RHEAERELKDIERQIVEQKQQLQARQHELGEQMRAQYAGGLSPWAALLSGDNPQA 137 Query: 454 ALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQRE-- 511 + ++ + + D ++ DR++R R + ++ Q + Sbjct: 138 IGRDLSYLGYITQAQADAVIAVNLALDRLARLQA----------RSEEQTRELAQLAQDT 187 Query: 512 -------VQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKL 546 ++ +Q L++ + G A+ + +++L Sbjct: 188 TEEKNKLEAQKAERQQVLKRIEAELQAQRGQAASLKQNDERL 229 >gi|307943499|ref|ZP_07658843.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307773129|gb|EFO32346.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 859 Score = 39.4 bits (90), Expect = 1.6, Method: Composition-based stats. 
Identities = 25/268 (9%), Positives = 78/268 (29%), Gaps = 21/268 (7%) Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLF-----QPVQF 340 E + + K+ ++P N ++S++ S + + Sbjct: 545 EEIARLTQELREALNEYMQALAEQMKRNPQAMQPFNSNQQSMSQQDLSEMLDRIEELART 604 Query: 341 GNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQS 400 G+ E L ++++ + +L + D E + + E L+ Sbjct: 605 GSRDAARELLAQMQQMLENLQAGRPQMMPPDGMDGEMMEMLNELSEMIQKQQQLMDQTHQ 664 Query: 401 EFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 + S + + + + Q + + QG Sbjct: 665 ----------FNQQQSPNGQQQQGQNRPGQQGQQQPGQGNQMTAEQLQQMLDQLRQGQGN 714 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQ 520 + + + D + + ++ + + + D + ++ + ++ + Sbjct: 715 LAQQLQELMDQLGQNGVGENQALGEAGKSMG------DAQQSLGDGQGEQALGQQGQALE 768 Query: 521 HLQQQLQQTSQDIGAKAAGRAMEKKLTH 548 L+Q Q ++ + + G M + + Sbjct: 769 SLRQGAQGLAEQMMGQGNGPGMAQGPSP 796 >gi|325277058|ref|ZP_08142716.1| hypothetical protein G1E_25906 [Pseudomonas sp. TJI-51] gi|324097808|gb|EGB95996.1| hypothetical protein G1E_25906 [Pseudomonas sp. TJI-51] Length = 328 Score = 39.4 bits (90), Expect = 1.7, Method: Composition-based stats. Identities = 16/133 (12%), Positives = 34/133 (25%), Gaps = 11/133 (8%) Query: 414 LDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSC 473 L G + + S + + + Q + Sbjct: 172 LQQSGKAELNPSNLEQQADQAQAQGESA-GRIAAEDPTQAVDQLKQWFDRVRKA--GEPA 228 Query: 474 MDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQ--QREVQRRVMEEQHLQQQLQQTSQ 531 + D + T + E + I R Q+ + Q L+ Q +Q ++ Sbjct: 229 LSAADKQALVNIVAARTG------KSQQEAQQIVDNYARAYQQAAEQVQVLKDQAEQQAR 282 Query: 532 DIGAKAAGRAMEK 544 + AA + Sbjct: 283 EAAQVAASNVSKA 295 >gi|221633791|ref|YP_002523017.1| signal recognition particle protein [Thermomicrobium roseum DSM 5159] gi|221156000|gb|ACM05127.1| signal recognition particle protein [Thermomicrobium roseum DSM 5159] Length = 488 Score = 39.0 bits (89), Expect = 1.9, Method: Composition-based stats. 
Identities = 19/109 (17%), Positives = 38/109 (34%), Gaps = 8/109 (7%) Query: 442 LFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTA 501 L + QQ + + Q + + +G + D R + R+ Sbjct: 333 LRQLQQVKKMGPLTQLLEMIPGMGQLLRQQQVQ--ISDDEYKRIEAIILSMTPEERRNPD 390 Query: 502 EVEDIRQQREVQ------RRVMEEQHLQQQLQQTSQDIGAKAAGRAMEK 544 + R++R Q V + +Q+Q+ ++G AAGR+ Sbjct: 391 IINYSRRRRIAQGSGTTIAEVSQLLTQFKQMQRMMAELGQLAAGRSRGP 439 >gi|120611311|ref|YP_970989.1| methyl-accepting chemotaxis sensory transducer [Acidovorax citrulli AAC00-1] gi|120589775|gb|ABM33215.1| methyl-accepting chemotaxis sensory transducer [Acidovorax citrulli AAC00-1] Length = 541 Score = 39.0 bits (89), Expect = 1.9, Method: Composition-based stats. Identities = 29/239 (12%), Positives = 61/239 (25%), Gaps = 33/239 (13%) Query: 325 GALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKT 384 A S Q G + +L ++R + +R AA+ T Sbjct: 288 IAAGNNDLSARTEQQAGALQQTAASMEQLTSTVRQ----------NADNARHAAQLAGST 337 Query: 385 REKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFK 444 E G ++G + S DS + + G + + + + Sbjct: 338 SEVAQRGGAMVGQMVSTMGAVT--------DSSRRIVDIIGVIDGIAFQTNILALNAAVE 389 Query: 445 YQQAES-----------VASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTP 493 +A V S Q + + + D S ++ R+ + Sbjct: 390 AARAGEQGRGFAVVANEVRSLAQRSASAAKEIKQLIDTSVQQVGESSRLVNQAGTTMG-- 447 Query: 494 AVLIRDTAEVEDIRQQREVQRRVMEEQ-HLQQQLQQTSQDIGAKAAGRAMEKKLTHDMM 551 ++ +V + Q+ + Q + A E + Sbjct: 448 -EVVDSVQQVARLIQEIASANQEQAAGIDQVNQAVTHMDQATQQNAALVEEATAAAQSL 505 >gi|218189914|gb|EEC72341.1| hypothetical protein OsI_05562 [Oryza sativa Indica Group] Length = 2184 Score = 39.0 bits (89), Expect = 2.2, Method: Composition-based stats. 
Identities = 23/193 (11%), Positives = 54/193 (27%), Gaps = 20/193 (10%) Query: 354 KESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISREL-- 411 ++ + + D A+ +A + E + + QSE + + Sbjct: 128 QQQQAKMNMAGPSTRDQDVAANTA-KMQELMSLQAQAQAQMFKRQQSEHLQQAEKQAEQG 186 Query: 412 ---DILDSQGNL--PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 + G++ P P L P+ Q +++A + + Sbjct: 187 QPSNSEQRSGDMRPPSMPPQGVPGQQLSSAGMVRPMQPMQGQAGMSNAG---ANPMAMAQ 243 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQL 526 + + D + PA + + + ++ R + E + Q Sbjct: 244 LQAIQAWAKEHNVD---------LSNPANVTLISQILPMLQSNRMAAMQKQNEVGMASQQ 294 Query: 527 QQTSQDIGAKAAG 539 Q + A G Sbjct: 295 QSVPSQMNNDAPG 307 >gi|311276733|ref|XP_003135336.1| PREDICTED: nik-related protein kinase-like [Sus scrofa] Length = 1353 Score = 39.0 bits (89), Expect = 2.3, Method: Composition-based stats. Identities = 28/223 (12%), Positives = 64/223 (28%), Gaps = 31/223 (13%) Query: 333 SLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVG 392 QP + + + + +F+ Q + + A++ ++ + Sbjct: 388 EPSQPRWLPDREEPQVKALQHLQGAARVFMPLQAQDSAPRPLQGQAQAHQRLQGAARVFM 447 Query: 393 PLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVA 452 PL ++++ L Q P + L+ + +P + + +A Sbjct: 448 PLQAQVKAKASRP--------LQMQMKAPPRPRRTAWMLMPLQAQVKAP----RPLQVLA 495 Query: 453 SALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEV--EDIRQQR 510 + + G P R +V + RQ R Sbjct: 496 QIPREQQAQTQPQASEGPQDLDQ----------------VPEE-FRGHDQVPEQQQRQGR 538 Query: 511 EVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMEN 553 +++ + Q +QQL+Q + +A E E Sbjct: 539 VPEQQQRQNQVPEQQLEQNGIPEQPEVQEQAAEPTQAETEAEE 581 >gi|41052581|dbj|BAD07923.1| SNF2 domain/helicase domain-containing protein-like [Oryza sativa Japonica Group] gi|41052776|dbj|BAD07645.1| SNF2 domain/helicase domain-containing protein-like [Oryza sativa Japonica Group] gi|222622037|gb|EEE56169.1| hypothetical protein OsJ_05089 [Oryza sativa Japonica Group] Length = 2200 Score = 39.0 bits (89), Expect = 2.3, Method: Composition-based 
stats. Identities = 23/193 (11%), Positives = 54/193 (27%), Gaps = 20/193 (10%) Query: 354 KESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISREL-- 411 ++ + + D A+ +A + E + + QSE + + Sbjct: 128 QQQQAKMNMAGPSTRDQDVAANTA-KMQELMSLQAQAQAQMFKRQQSEHLQQAEKQAEQG 186 Query: 412 ---DILDSQGNL--PECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGV 466 + G++ P P L P+ Q +++A + + Sbjct: 187 QPSNSEQRSGDMRPPSMPPQGVPGQQLSSAGMVRPMQPMQGQAGMSNAG---ANPMAMAQ 243 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQL 526 + + D + PA + + + ++ R + E + Q Sbjct: 244 LQAIQAWAKEHNVD---------LSNPANVTLISQILPMLQSNRMAAMQKQNEVGMASQQ 294 Query: 527 QQTSQDIGAKAAG 539 Q + A G Sbjct: 295 QSVPSQMNNDAPG 307 >gi|168051357|ref|XP_001778121.1| predicted protein [Physcomitrella patens subsp. patens] gi|162670443|gb|EDQ57011.1| predicted protein [Physcomitrella patens subsp. patens] Length = 878 Score = 38.6 bits (88), Expect = 2.5, Method: Composition-based stats. Identities = 12/95 (12%), Positives = 27/95 (28%), Gaps = 13/95 (13%) Query: 461 VVELGVKTGDPS---CMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVM 517 ++GV+ +P M+T + ++ + Q Sbjct: 742 AQQMGVQQMNPQQLNAAQQMNTQQQLN--AQQMGV--------QQMNPQQLNAAQQMNTQ 791 Query: 518 EEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMME 552 ++ QQ Q AA + +++ M Sbjct: 792 QQLSAQQMGMQQMNPQQLNAAQQMSTQQMNPQQMS 826 >gi|291232347|ref|XP_002736118.1| PREDICTED: myeloid/lymphoid or mixed-lineage leukemia 4-like, partial [Saccoglossus kowalevskii] Length = 3264 Score = 38.6 bits (88), Expect = 2.5, Method: Composition-based stats. 
Identities = 31/261 (11%), Positives = 66/261 (25%), Gaps = 40/261 (15%) Query: 327 LSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTRE 386 + S + G P+ + N++ + L + +A + + Sbjct: 24 MRSPIPSPAPLLSTGPMAPHMQVPNQMSGQL-----LPGMMPVRGQAPGYSGIPGIMLGQ 78 Query: 387 KGAFVGPLIGGLQ-----SEFIGAMISRELDILDSQ-GNLPECEGADNPPVSLLKVEY-- 438 + G +Q ++ + L + G + G P ++V Sbjct: 79 GLPHMQGPPGQMQGPPVTTQGQLGQMQGPLGQMQGPPGQMQGPPGQMQGPPGQMQVPPGQ 138 Query: 439 -TSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMD-------------HMDTDRVSR 484 P + Q V + Q G G P M + ++ Sbjct: 139 MQGPPGQMQGPP-VTTQGQLGQMQGPPGQMQGPPGQMQGPPGQMQGPPGQMQVPPGQMQV 197 Query: 485 FSLWATNTPAVLIRDTAEVEDIRQQREVQRRVME----EQH---LQQQLQQTSQDI---- 533 P +++ Q + M+ + H Q Q + + Sbjct: 198 PPGQMQGLPVTTQVPPGQMQGPPGQMQGPPGQMQGPPGQMHGPPGQMQGPHGAMQMFEAL 257 Query: 534 -GAKAAGRAMEKKLTHDMMEN 553 G ++T D M Sbjct: 258 PGQMLHSPRGPGEVTMDRMSM 278 >gi|40556094|ref|NP_955179.1| CNPV156 hypothetical protein [Canarypox virus] gi|40233919|gb|AAR83502.1| CNPV156 hypothetical protein [Canarypox virus] Length = 832 Score = 38.6 bits (88), Expect = 2.7, Method: Composition-based stats. 
Identities = 25/222 (11%), Positives = 76/222 (34%), Gaps = 11/222 (4%) Query: 336 QPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLI 395 + ++ E+ + E+ + ++ ++ + E E+ K + ++ Sbjct: 465 KTLEIAMQKIVEIEVQEIIENAIRESEMQESEMKENAER-AMQEIAER-EMKEIAMQEIV 522 Query: 396 GGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKY------QQAE 449 E I + + + E + ++E + + + Sbjct: 523 ER---EMQEIAIQEIAERAMQEIAIQEIAERAMQESVMQEIEMQEITERTIQEITERAMQ 579 Query: 450 SVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQ 509 +A + E + S M ++ ++ ++ + + +++ A E Q+ Sbjct: 580 EIAIQESAKRAMQESAERAMQESVMQEIEMQEIAERAMQESVMQEIEMQERAMQERAMQE 639 Query: 510 REVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMM 551 R +Q R M+E +Q++ Q + R M+++ ++ Sbjct: 640 RAMQERAMQEIEMQERAMQERAMQEIEMQEREMQERAMQEIE 681 >gi|196229374|ref|ZP_03128239.1| hypothetical protein CfE428DRAFT_1404 [Chthoniobacter flavus Ellin428] gi|196226606|gb|EDY21111.1| hypothetical protein CfE428DRAFT_1404 [Chthoniobacter flavus Ellin428] Length = 514 Score = 38.6 bits (88), Expect = 2.8, Method: Composition-based stats. Identities = 15/142 (10%), Positives = 47/142 (33%), Gaps = 5/142 (3%) Query: 416 SQGNLPECEGADNPPVSLLKVEYTSPLFKYQ-QAESVASALQGVNTVVELGVKTGDPS-- 472 L + L+ T+P + + +++ Q V + + Sbjct: 180 KADALKKLAEQLQKGAEQLRANATNPEEAGKSKLRELSALEQMVQDMQKSPAGLTPEEQQ 239 Query: 473 -CMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQ 531 ++ + ++ + + + E+E Q+ Q+ + +++QL+ + Sbjct: 240 ALAKALEQNEATKEAAKSLAA-GDQAKAAEELEKEMQKLAEQKDGATSEEIRKQLEDAVK 298 Query: 532 DIGAKAAGRAMEKKLTHDMMEN 553 + + +KL + E+ Sbjct: 299 QLAQQKQLSEAMQKLAQQLKES 320 >gi|320589539|gb|EFX02000.1| myosin class 2 heavy chain [Grosmannia clavigera kw1407] Length = 2564 Score = 38.6 bits (88), Expect = 3.0, Method: Composition-based stats. 
Identities = 27/208 (12%), Positives = 62/208 (29%), Gaps = 28/208 (13%) Query: 347 HEELNRLKESIRSLFLLDLFQVLDDKASRSAAESME-KTREKGAFVGPLIGGLQSEFIGA 405 ++ +++ + + + SAAE +E + E + L+SE + Sbjct: 1657 QIVEEAVERQLQTTAEAVPARRQEAEEGGSAAEMLEARVMELELRLQAERANLESELLLR 1716 Query: 406 MI-SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVEL 464 + + + + ++VE + Q+ + L+ E Sbjct: 1717 RTAEDKTAEMGRK---------LELAETKIEVEIMNRSAYDQRVADLEDRLRHQEEKTEA 1767 Query: 465 GVKTGDPSCMDHMDTDRVSR-FSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQ 523 MD R + L T E +R++ E + + ++ Sbjct: 1768 -----------EMDVRRSAEGRLSEVQG---QLRISTEERTRLREELEERGQQLKAAEET 1813 Query: 524 QQLQQTSQDIGAKAAGRAMEKKLTHDMM 551 T + A +A E+ D+ Sbjct: 1814 --TGTTLMRLAVLEAAQAREETAHSDLQ 1839 >gi|238018524|ref|ZP_04598950.1| hypothetical protein VEIDISOL_00351 [Veillonella dispar ATCC 17748] gi|237864995|gb|EEP66285.1| hypothetical protein VEIDISOL_00351 [Veillonella dispar ATCC 17748] Length = 1214 Score = 38.3 bits (87), Expect = 3.2, Method: Composition-based stats. Identities = 12/107 (11%), Positives = 37/107 (34%), Gaps = 2/107 (1%) Query: 443 FKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAE 502 + Q+ ++ + Q + + + + ++ + + AE Sbjct: 467 AEAQRQAAIQAEQQRLAAQQAEQARIAEAQRQAALKAEQ--DRIAAQQAEQQRIAAEQAE 524 Query: 503 VEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHD 549 + + Q+R+ EQ Q+ + AA +A ++++ + Sbjct: 525 AQRQAALQAEQQRIAAEQAEAQRQAALKAEQERIAAEQAEQQRIAAE 571 >gi|89052971|ref|YP_508422.1| hypothetical protein Jann_0480 [Jannaschia sp. CCS1] gi|88862520|gb|ABD53397.1| hypothetical protein Jann_0480 [Jannaschia sp. CCS1] Length = 850 Score = 38.3 bits (87), Expect = 3.8, Method: Composition-based stats. 
Identities = 40/276 (14%), Positives = 81/276 (29%), Gaps = 14/276 (5%) Query: 286 NETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLP 345 E + ++ + + + + M+ L R + + +Q G Sbjct: 530 QEMREAMDEYMQELADNTEFGDD--TDQPDEGERQEMSNADLDEMLRRIEELMQEGRMAE 587 Query: 346 YHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGA 405 E L L+E + ++ + D E+ME +E L E Sbjct: 588 AMEMLQALQEMLENMEITQGEGG-GDGPQTPGQEAMEGLQETLRGQQELSDDSFQELQDQ 646 Query: 406 M-ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT---- 460 +R + QGN P+ + + + AE L+ + Sbjct: 647 FNPNRPGQQSEQQGNAPQGNQPGQEGQNPGDIAG-GDSGQGSLAERQQELLRQLEEQARR 705 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWAT---NTPAVLIRDTAEVEDIRQQREVQRRVM 517 + G + GD + R + A L + +E +R+ + Sbjct: 706 LPGTGTEAGDEALEQLDGAGRAMDEAAEALERGGIAEALDLQSEAMEALREGMTQLSEAL 765 Query: 518 EEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMEN 553 ++ + Q ++ G A R M+ L N Sbjct: 766 AQEQGAEPGQGPAE--GNMAESRPMQDPLGRQAGNN 799 >gi|326317367|ref|YP_004235039.1| methyl-accepting chemotaxis sensory transducer with Cache sensor [Acidovorax avenae subsp. avenae ATCC 19860] gi|323374203|gb|ADX46472.1| methyl-accepting chemotaxis sensory transducer with Cache sensor [Acidovorax avenae subsp. avenae ATCC 19860] Length = 541 Score = 38.3 bits (87), Expect = 3.9, Method: Composition-based stats. 
Identities = 28/239 (11%), Positives = 61/239 (25%), Gaps = 33/239 (13%) Query: 325 GALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKT 384 A S Q G + +L ++R + +R AA+ T Sbjct: 288 IAAGNNDLSARTEQQAGALQQTAASMEQLASTVRH----------NADNARHAAQLAGST 337 Query: 385 REKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFK 444 E G ++G + S DS + + G + + + + Sbjct: 338 SEVAQRGGAMVGQMVSTMGAVT--------DSSRRIVDIIGVIDGIAFQTNILALNAAVE 389 Query: 445 YQQAES-----------VASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTP 493 +A V S Q + + + D S ++ R+ + Sbjct: 390 AARAGEQGRGFAVVASEVRSLAQRSASAAKEIKQLIDTSVQQVGESSRLVNQAGTTMG-- 447 Query: 494 AVLIRDTAEVEDIRQQREVQRRVMEEQ-HLQQQLQQTSQDIGAKAAGRAMEKKLTHDMM 551 ++ +V + Q+ + Q + A + + Sbjct: 448 -EVVESVQQVARLIQEIASANQEQAAGIDQVNQAVTHMDQATQQNAALVEQATAAAQSL 505 >gi|301780748|ref|XP_002925791.1| PREDICTED: LOW QUALITY PROTEIN: laminin subunit alpha-5-like [Ailuropoda melanoleuca] Length = 3514 Score = 38.3 bits (87), Expect = 3.9, Method: Composition-based stats. Identities = 41/315 (13%), Positives = 81/315 (25%), Gaps = 41/315 (13%) Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 R L E L + L P L EG Sbjct: 2166 RTLAEVERLLGEMRARDLGAPRAVAEAELDAARRLLARVQEQLTSRWEGNQGLAARARDR 2225 Query: 343 PLPYHEELNRLKESIRSL-FLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSE 401 + L L+ ++ + L+ + +++ + +E L LQ+ Sbjct: 2226 LAQHEAGLMDLRGALNRAVGTTREAEELNSRNQERLEDALHRKQELSRDNATLRATLQAA 2285 Query: 402 F-IGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQA-ESVASALQGVN 459 A +S L +D + S V + Q+A E+ + + Sbjct: 2286 SDTLAQLSGLLPAMDQAREVSAAPPRGTEAGSDGIVRGVNQDHFIQRAIEAANAYSSILQ 2345 Query: 460 TVVELGVKTGDPSCMDHMDTDRVSRF--------SLWATNTPAVLIRDTAEV-------- 503 V G ++ V R L T+ ++ + Sbjct: 2346 AVQAAEGAAGQARQQENDTWAMVVRRGLAPRAWELLTNTSALLEVVLREQQRLGHVRVTL 2405 Query: 504 ---------EDIRQQREVQRRVMEEQHLQQQLQQTSQDI-------------GAKAAGRA 541 R++++ R + L +TS+ I A+ R Sbjct: 2406 QGTGTQLRDAQARKEQQATRIQEVQAMLAMDTDETSKKIARAKAVAAEAQDTAARVQSRI 2465 Query: 542 MEKKLTHDMMENSYG 556 + + + + 
YG Sbjct: 2466 QDMQKHLERWQGQYG 2480 >gi|295103136|emb|CBL00680.1| hypothetical protein [Faecalibacterium prausnitzii SL3/3] Length = 594 Score = 38.3 bits (87), Expect = 4.0, Method: Composition-based stats. Identities = 32/302 (10%), Positives = 78/302 (25%), Gaps = 18/302 (5%) Query: 133 GTGCFYMEADVDEKGLEEGIRYISVPLSNVYMS---VNHQNVVDSVYREFTFTVDQIVSK 189 GTG + D G I SV L +Y + Q+ D + T Sbjct: 130 GTGVTGIFWDPAAHGGLGDIAVRSVNLLMLYWEPGVQDIQDSPDLFHLSLEDTARLTAQ- 188 Query: 190 WGDKVLSSKMKSALARNENER---------FTIIHAVYPKSLTDKKKDKGNKGFHSKFVS 240 + + + R +E + P + + Sbjct: 189 YPQLTGHAAGVVDVPRYIHEDGQTTANKSVVVDWYYKRPDENGKLRLHYCKLCNGVVLYA 248 Query: 241 VDENRFFEEKQIAT---FPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGR 297 + + + +P++ V D G +++ + + + Sbjct: 249 SQNDPALAARGLYDHGKYPFVFDPLFVEEDSPAGFGYIDVMKDCQNAIDKMNHAMDENVL 308 Query: 298 LSLHPPTIAVSEAKQRNFDLKPGYMNIGALSRE-GRSLFQPVQFGNPLPYHEELNRLK-E 355 L+ + A +L +I + F+P+Q + E Sbjct: 309 LASRQRYVLSDTAGVNEEELADLSRDIVHVVGRLNEDSFRPLQTAGLQGNSLSYRNSRIE 368 Query: 356 SIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILD 415 ++ + +AA ++ +E G+ + + ++++ Sbjct: 369 ELKEISGNRDLTQGGTTGGVTAASAIAALQEAGSKLSRDMLKSAYRAFARQCYLIIELMR 428 Query: 416 SQ 417 Sbjct: 429 QF 430 >gi|195035865|ref|XP_001989392.1| GH11701 [Drosophila grimshawi] gi|193905392|gb|EDW04259.1| GH11701 [Drosophila grimshawi] Length = 1857 Score = 37.9 bits (86), Expect = 4.2, Method: Composition-based stats. 
Identities = 24/199 (12%), Positives = 66/199 (33%), Gaps = 7/199 (3%) Query: 351 NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRE 410 +++E + L L + S + E + + + + Q E + E Sbjct: 747 EQIRELNQQLDELTTQLNVQKADSSALDEMLNAQQSQNVDSKTQLEQFQVELQQLKTANE 806 Query: 411 LDILDSQGNLPECEGADNPPVSLLK----VEYTSPLFKYQQAESVASALQGVNTVVELGV 466 + + + E + S Q +E+ + + + + + G Sbjct: 807 TVLKEKAAMEQQMEQELGKLRQQTQELLLASGDSKSQNLQLSEAKVALEKQLEALQQSGE 866 Query: 467 KTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQL 526 S + ++ ++ R + L + + + +Q ++ +++ +QQ Sbjct: 867 AQLQASQAEIVNKEQQLRELEKS---KEQLQQQLEQQTKLHEQLIASQQELQQSQTKQQA 923 Query: 527 QQTSQDIGAKAAGRAMEKK 545 +Q++Q + ME K Sbjct: 924 EQSAQLAQETSKVVEMEAK 942 >gi|258649127|ref|ZP_05736596.1| peptidyl-prolyl cis-trans isomerase [Prevotella tannerae ATCC 51259] gi|260850781|gb|EEX70650.1| peptidyl-prolyl cis-trans isomerase [Prevotella tannerae ATCC 51259] Length = 739 Score = 37.9 bits (86), Expect = 4.2, Method: Composition-based stats. Identities = 14/146 (9%), Positives = 37/146 (25%), Gaps = 25/146 (17%) Query: 425 GADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSC-MDHMDTDRVS 483 +VE S + K Q+ + S + + + D ++T Sbjct: 51 ETLTIQDFQNRVEQLSNIAKMQKQRAGQS-----DALTDQEQDQIREQVWSDFVNTS-AI 104 Query: 484 RFSLWATNTPAVLIRDTAEVE-DIRQQREVQRRVMEEQHLQQQLQQT------------- 529 + +V+ +R + ++M + Q Sbjct: 105 KHETDKAGIQV----TDEDVQDALRTGQAQSLQMMAQMGFANQQTGRFDVNALQDFLKNY 160 Query: 530 SQDIGAKAAGRAMEKKLTHDMMENSY 555 +++ A + M+ + Sbjct: 161 DKNMAQLAQSGQQAYMEQYQMLRQIW 186 >gi|163815658|ref|ZP_02207030.1| hypothetical protein COPEUT_01838 [Coprococcus eutactus ATCC 27759] gi|158448963|gb|EDP25958.1| hypothetical protein COPEUT_01838 [Coprococcus eutactus ATCC 27759] Length = 438 Score = 37.9 bits (86), Expect = 4.5, Method: Composition-based stats. 
Identities = 27/223 (12%), Positives = 63/223 (28%), Gaps = 26/223 (11%) Query: 344 LPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFI 403 + ++ + L LD +A+ + + ++K +E LQ+E Sbjct: 59 AEAKANAEKYQKKVDK--LTATVNELDKQATDISTQIVQKKQEADD--------LQTEID 108 Query: 404 GAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVE 463 + + VEY L E + + V+ + Sbjct: 109 ETQTKLAEAQVSEDNQYVAMKKRIQYLYEEGDVEYIDALMSSASFEDSLNKSEYVDQLSS 168 Query: 464 LGVKTGDPSCMDHMDTDR----VSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRR---- 515 K D D + A L + ++++ + Q+ + Sbjct: 169 YDQKQLDKLVKTKNDIAEYEQTLKDDLADVKKVQADLEQKQSDLDAVISQKNEEINKYSG 228 Query: 516 -VMEEQHLQQQLQ-------QTSQDIGAKAAGRAMEKKLTHDM 550 +Q L ++ +I + A R E++ ++ Sbjct: 229 DAAMQQALAEEYARQESELDDKLAEIARQEAARLEEERKQEEL 271 >gi|221271428|dbj|BAH15181.1| portal protein [Serratia phage KSP100] Length = 374 Score = 37.9 bits (86), Expect = 4.6, Method: Composition-based stats. Identities = 28/250 (11%), Positives = 64/250 (25%), Gaps = 16/250 (6%) Query: 315 FDLKPGYMNI-GALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKA 373 D +PG + A+ P+ G + + + K Sbjct: 29 LDNRPGGVVEENAIGMVDLFPHHPLPAGVDSILEQIEQAKERRTGVTRIGMGLSPEVFKN 88 Query: 374 SRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSL 433 S A + + + F+ + +L N + + Sbjct: 89 DNSFATVDMMMSAAQNRMRMVARNVAQNFMTQLFLAIYRLLKENENSTLPIEVNGAMKEV 148 Query: 434 L--------KVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRF 485 + KV + + ++ E + +Q + + + + ++R Sbjct: 149 MPALWPDRDKVIVAVAIGQNERRERANNLVQLSQFLT--ANPLLSGTTFTAENANHLARE 206 Query: 486 SLWATNT--PAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIG---AKAAGR 540 A I +V+ E Q + Q++Q + I A GR Sbjct: 207 LTLAMGFYDVNNFITPMEQVQPQGPTPEQQAEQQRIELESQRVQLELKKIENDMQVAMGR 266 Query: 541 AMEKKLTHDM 550 ++ Sbjct: 267 MQAEQTEAQA 276 >gi|75761880|ref|ZP_00741807.1| Phage protein [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|228905318|ref|ZP_04069295.1| hypothetical protein bthur0014_63940 [Bacillus thuringiensis IBL 4222] gi|228937950|ref|ZP_04100577.1| hypothetical protein 
bthur0008_6260 [Bacillus thuringiensis serovar berliner ATCC 10792] gi|228970830|ref|ZP_04131470.1| hypothetical protein bthur0003_6170 [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228977404|ref|ZP_04137799.1| hypothetical protein bthur0002_6190 [Bacillus thuringiensis Bt407] gi|74490640|gb|EAO53929.1| Phage protein [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|228782381|gb|EEM30564.1| hypothetical protein bthur0002_6190 [Bacillus thuringiensis Bt407] gi|228788955|gb|EEM36894.1| hypothetical protein bthur0003_6170 [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228821741|gb|EEM67742.1| hypothetical protein bthur0008_6260 [Bacillus thuringiensis serovar berliner ATCC 10792] gi|228854317|gb|EEM98998.1| hypothetical protein bthur0014_63940 [Bacillus thuringiensis IBL 4222] gi|326938429|gb|AEA14325.1| Phage protein [Bacillus thuringiensis serovar chinensis CT-43] Length = 707 Score = 37.9 bits (86), Expect = 4.6, Method: Composition-based stats. Identities = 39/318 (12%), Positives = 75/318 (23%), Gaps = 15/318 (4%) Query: 133 GTGCFYMEADVDEKGLEEGIRYISVPLSNVYMSVNH--QNVVDSVYREFTFTVDQIVSKW 190 G G E D ++ IR VY+ + + +D I ++ Sbjct: 162 GEGEVGFEED-MQRLYTGEIRCRICDPLTVYIDPAAEMDEEIRWIVERKPRDIDYIKERY 220 Query: 191 GDKVLSSK---MKSALARNENERFTIIHAVYPK---SLTDKKKDKGNKGFHSKFVSVDEN 244 G V + + +A F P K G K Sbjct: 221 GKDVAADENVGFAAAFDVTPQNGFNSTSKKRPNMAMVDEMWVKPCGKHPNGLKVTIAGGQ 280 Query: 245 RFFEEKQIATFPYIVGRYRVRADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPT 304 ++ P+ + + + + LP R +N + A R + Sbjct: 281 LLDIDENAGDIPFFIFGDIPIPGSVKAEAFIKDMLPIQREINIMRSMFATHARKMGNSMW 340 Query: 305 IAVSEAKQ--RNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFL 362 + + + G + R + P Y LN I L Sbjct: 341 LVPMGSSVDEDEITNEEGGIVHYTPIEGVRPE-RVGAPDIPSFYDRILNNHDADIDDLSG 399 Query: 363 LDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPE 422 + + E+ + + ++ R L ++ Sbjct: 400 AREISQGRLPSGLDTYSGLSLMVEQENEKLAVSSQNYEHGMKRLLQRVLLLMKKHYTEER 459 Query: 423 CEGADNPPVSLLKVEYTS 440 P +E S 
Sbjct: 460 MARILGPDN---DIELVS 474 >gi|195448509|ref|XP_002071689.1| GK10116 [Drosophila willistoni] gi|194167774|gb|EDW82675.1| GK10116 [Drosophila willistoni] Length = 1733 Score = 37.9 bits (86), Expect = 4.8, Method: Composition-based stats. Identities = 22/261 (8%), Positives = 59/261 (22%), Gaps = 53/261 (20%) Query: 293 AQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNR 352 Q + P + + + N + Sbjct: 558 EQARQQMAQNPMMMQQRQMSEDLARQQAAQNP----------------MMMQQRQMAEEQ 601 Query: 353 LKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELD 412 ++ + ++ + + + +R M + R+ + E Sbjct: 602 ARQQMSQNPMMMQQRQMAEDLARQQVAQMMQQRQMAEEQARQHMAQNPMMMQQRQMAEEQ 661 Query: 413 ILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPS 472 P + + Q AE +P Sbjct: 662 ARQQAAQNPMM------------------MQQRQMAED-----------QARQQMAQNPM 692 Query: 473 CMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQD 532 M +++ ++ ++ + E R+ M + + Q +Q ++D Sbjct: 693 MMQ---QRQMAEDLARQQAAQNPMM-----MQQRQMAEEQARQQMAQNPMMMQQRQMAED 744 Query: 533 IGAKAAGRAMEKKLTHDMMEN 553 + + A + M E Sbjct: 745 LARQQAEQNPMMMQQRQMAEE 765 >gi|320352670|ref|YP_004194009.1| type 11 methyltransferase [Desulfobulbus propionicus DSM 2032] gi|320121172|gb|ADW16718.1| Methyltransferase type 11 [Desulfobulbus propionicus DSM 2032] Length = 586 Score = 37.9 bits (86), Expect = 5.0, Method: Composition-based stats. 
Identities = 26/250 (10%), Positives = 63/250 (25%), Gaps = 12/250 (4%) Query: 287 ETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPY 346 TV ++ + V++ P + G + +Q Sbjct: 196 LTVMDVLEGVSPDYAVIAQRVADPFIMQTTAAPFAKDYGIDLKNIAGRYQQYLTARIGMA 255 Query: 347 HEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAM 406 ++ + V + + AE+ + E+ A + + Sbjct: 256 ETTAQTAEQRAQRA----ETAVQNAEQRVQQAETAAQNAEQRAQRAETAVQNAEQRVQRA 311 Query: 407 ISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAE-SVASALQGVNTVVELG 465 + V + S + Q+AE +V +A Q V Sbjct: 312 ETAAQSAEQRAQRAETAVQNAEQRVQQAETAAQSAEQRAQRAETAVQNAEQRVQRAETAA 371 Query: 466 V-----KTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQ 520 + + + ++ + + A ++ +QR Q + Sbjct: 372 QSAEQRAQRAETAVQ--NAEQRVQQAEIAVQNAEQRVQRAETAAQSAEQRVQQAETAVQN 429 Query: 521 HLQQQLQQTS 530 Q+ Q + Sbjct: 430 AEQRVQQAIA 439 >gi|312131267|ref|YP_003998607.1| hypothetical protein Lbys_2592 [Leadbetterella byssophila DSM 17132] gi|311907813|gb|ADQ18254.1| hypothetical protein Lbys_2592 [Leadbetterella byssophila DSM 17132] Length = 1080 Score = 37.9 bits (86), Expect = 5.2, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 72/217 (33%), Gaps = 18/217 (8%) Query: 346 YHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGA 405 ++E+ LK ++ L + + + + +K + L+ S+ + Sbjct: 555 LNKEIQDLKNQLQELLEK------QSRFEQQSPQLQQKMEMIQKMLNELMESKDSKVLEE 608 Query: 406 MISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELG 465 + LD + + N L E L +Q+ + + N + EL Sbjct: 609 LKKMMEKSLDEKSL--DQLEKFNKNQRNLDKELDRTLKLFQELQRKQKIEETSNELKELA 666 Query: 466 VKT-------GDPSCMDHMD--TDRVSRFSLWATNTPAVLIRDTAEVEDIRQQ-REVQRR 515 + +P + ++ + + + N L + ++D + + E Q++ Sbjct: 667 EEQEKLSEADANPQDQEKINQKFEDIKKKLEDIENRSNELNKSFDPMDDKQSEISEDQKQ 726 Query: 516 VMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMME 552 +E Q + + A + M +++ M Sbjct: 727 AKKELSQQNKDAASKAQKNAAKKMKQMAEEMEQQMQS 763 Score = 37.1 bits (84), Expect = 7.8, Method: Composition-based stats. 
Identities = 24/194 (12%), Positives = 53/194 (27%), Gaps = 16/194 (8%) Query: 345 PYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIG 404 +++ +++ + L +VL++ EK+ ++ L E Sbjct: 582 QLQQKMEMIQKMLNELMESKDSKVLEELKKMMEKSLDEKSLDQLEKFNKNQRNLDKE--L 639 Query: 405 AMISRELDILDSQGNLPECE---GADNPPVSLLKVEYTSPLFK---YQQAESVASALQGV 458 + L + + E L +P + Q+ E + L+ + Sbjct: 640 DRTLKLFQELQRKQKIEETSNELKELAEEQEKLSEADANPQDQEKINQKFEDIKKKLEDI 699 Query: 459 NTVVELGVKTGDPSCMDHM-----DTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQ 513 K+ DP D D + + A + +Q + Sbjct: 700 ENRSNELNKSFDP-MDDKQSEISEDQKQAKKELSQQNKDAAS--KAQKNAAKKMKQMAEE 756 Query: 514 RRVMEEQHLQQQLQ 527 + QQ Q Sbjct: 757 MEQQMQSAEMQQAQ 770 >gi|317485513|ref|ZP_07944390.1| hypothetical protein HMPREF0179_01743 [Bilophila wadsworthia 3_1_6] gi|316923193|gb|EFV44402.1| hypothetical protein HMPREF0179_01743 [Bilophila wadsworthia 3_1_6] Length = 699 Score = 37.5 bits (85), Expect = 5.4, Method: Composition-based stats. Identities = 53/454 (11%), Positives = 112/454 (24%), Gaps = 53/454 (11%) Query: 99 WCD-QVTDTLFGFRERSRSGFVGCLQSFYTSVVEFGTGCFYMEADVDEKGLEEGIRYI-S 156 W + F R + E +G ++D E I Sbjct: 162 WLESDKCRYAFFQRWMDLFDLQCLYPEREKEIGEAFSGLSAHDSDYSYMDDEADIVEQDK 221 Query: 157 VPLSNVYMSVNHQNVVDSVYREFTFTVDQIVSKWGDKVLSSKMKSALARNENERFTIIHA 216 L + S + + V + + + + D Sbjct: 222 RVLGSTRWSDPERRRIRPVQLWYPVLEKAVFALFPDGQCVEVNTKLPDAQVYMLVRNAQQ 281 Query: 217 VYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPYI-----------VGRYRVR 265 + S+ + K F + DE F Q P+I V R Sbjct: 282 LITTSVRKLRV----KTFIGSYELSDEPSPFPHGQYPFIPFIGYLDRYLNPFGVPRMLSG 337 Query: 266 ADEIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIG 325 +E + +M + L + + + L + LKPG + Sbjct: 338 QNEEINKRRSMN----LAMLQKRRIIVEEGAADDLQDLYE-EANKPDGFMVLKPGGRSKM 392 Query: 326 ALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDL--------FQVLDDKASRSA 377 + + Q +E+ ++ + ++ A Sbjct: 393 EIIEGAQ--LSQYQIQVLEQSEKEIQQISGANDEAMGYTSNANSGKAIELRRQQSSTIMA 450 Query: 378 AESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVE 437 + R +I +Q + 
+ R D + + + VE Sbjct: 451 SLFGNYRRSMSRLGQLVIANVQGAWTAEKVLRITDKMTNAERFVTVNQKVLGESGDV-VE 509 Query: 438 YTSPLFKYQQAESVASA-------LQGVNTVVELGVKTGDPSCMDHM-----------DT 479 + + + V+ A Q +N ++E K P + ++ + Sbjct: 510 IRNDITQGMYDVIVSDAPATDSVREQNMNLLIEW-CKQSPPEVIPYLMGMAMEMSNLPNK 568 Query: 480 DRVSRFSLWATNT-PAVLIRDTAEVEDIRQQREV 512 D++ P + E++ QQ Sbjct: 569 DQLMMKLKPMMGITPEEMDMSPEELQQRAQQEAE 602 >gi|197294333|ref|YP_001798874.1| hypothetical protein PAa_0204 [Candidatus Phytoplasma australiense] gi|171853660|emb|CAM11539.1| Conserved hypothetical protein [Candidatus Phytoplasma australiense] Length = 1164 Score = 37.5 bits (85), Expect = 5.4, Method: Composition-based stats. Identities = 43/393 (10%), Positives = 100/393 (25%), Gaps = 60/393 (15%) Query: 198 KMKSALARNENERFTIIHAVYPKSLTDKKKDKGNKGFHSKFVSVDENRFFEEKQIATFPY 257 K K A N++ + A K + ++ + + + + Sbjct: 667 KTKQAKLDEINKKIGTLTANKDNLEKTIKDLENDQTVTNYKKIKNRTDWGVRSSSKEIQF 726 Query: 258 I------VGRYRVRAD-EIYGRSPAMEALPTIRRLNETVNELAQFGRLSLHP-PTIAVSE 309 Y++ + Y A E Q + L P Sbjct: 727 PRFWNDKPFTYKIVPKIDFYETGFTQNARGYQAYREEIDKNTIQGESIELSPGKYYCEPS 786 Query: 310 AKQR--NFDLKPGYMNIGALSREGRSLFQPVQFG----NPLPYHEELNRLKESIRSLFLL 363 R + + + E + N +EL+ + ++++ L Sbjct: 787 INMRNIPYSTHGANLIFSEPTSETEPPQNLFKLDEAKENLKNISQELSNYETNLKNAQLE 846 Query: 364 DLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPEC 423 + + S +E I L++E I + + Sbjct: 847 YNQLLENQTPDDS------LQQELNNK-ANHIKTLKNEMQQLEI--------KEQSFRSE 891 Query: 424 EGADNPPVSLLKVEYTSPLFKYQQAESVAS-----ALQGVNTVVE--LGVKTGDPSCMDH 476 LK +YT+ L K QQ + + + + D + + Sbjct: 892 IDTLKLENKNLKEKYTNDLTKIQQELDATKTENEQLEKEMQEIQAELIKNGNADDALVKQ 951 Query: 477 MDTDRV-SRFS--------LWATNTPAVLIRDTAEVEDIRQQREVQRRV----------- 516 ++ + T ++ + E++ +RQ+ + Q Sbjct: 952 LNHKEAQIKELKGKINTLEANETKLQTIIKQKDEEIKQLRQKVQEQAEQIIKLTTEIENN 1011 Query: 517 ----MEEQHLQQQLQQTSQDIGAKAAGRAMEKK 545 ++ QQL+ + + + K Sbjct: 1012 IEIFKQQAMKIQQLEGAIAGLEGASGSLGSDNK 1044 >gi|307104056|gb|EFN52312.1| hypothetical protein CHLNCDRAFT_58914 
[Chlorella variabilis] Length = 740 Score = 37.5 bits (85), Expect = 5.5, Method: Composition-based stats. Identities = 42/256 (16%), Positives = 78/256 (30%), Gaps = 16/256 (6%) Query: 306 AVSEAKQRNFDLKP-GYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLD 364 A + + + ++ L+P G G L Q L + ++E I Sbjct: 142 AAANSVEADYVLEPQGPQPPGLHQDGQEELEISRQSEALLAILQAGEHVEEVIAQHRADI 201 Query: 365 LFQVLDD-KASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPEC 423 +L AAE +EK + L L++E + S L +LD ++ + Sbjct: 202 DDSMLQLLARRMKAAELLEKQEAVLQGLQLLYRRLKAEVDRQLASPGLRLLDELMSILDL 261 Query: 424 EGADNPPVSLLKVEYTSPLFKYQQA-------ESVASALQGVNTVVELGVKTGDPSCMDH 476 D + + E + + +A G + D D Sbjct: 262 GEGDLGSPAAAREERRAQAAAHLRAAFSGSLVGDADVLSLAAQLSASGGSQLADQLVADP 321 Query: 477 MDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQQQLQQTSQDIGAK 536 +D F A L+R E + Q+R E Q+ + S ++ Sbjct: 322 VDP---MVFMAEA----TELLRRVEEQHTQLEAYLQQQRQEEGTGQSQEAVRASLEVEQL 374 Query: 537 AAGRAMEKKLTHDMME 552 R L + ++ Sbjct: 375 LEQRQAAVALVQECLQ 390 >gi|301118911|ref|XP_002907183.1| conserved hypothetical protein [Phytophthora infestans T30-4] gi|262105695|gb|EEY63747.1| conserved hypothetical protein [Phytophthora infestans T30-4] Length = 2213 Score = 37.5 bits (85), Expect = 5.9, Method: Composition-based stats. 
Identities = 22/181 (12%), Positives = 54/181 (29%), Gaps = 13/181 (7%) Query: 383 KTREKGAFVGPLIG-GLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSP 441 +E + + E +++ L + ++ + + Sbjct: 989 MAQEHEKQLAEQHKYRGEVEAERQRLTQVLQ--QESTRFQNLRKEAGEARAQIEAQAMAA 1046 Query: 442 LFKY--QQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRD 499 + + Q + A + + + + ++R L+ Sbjct: 1047 MKQREQQLLDEKARVEEELQLQFSKINEENIELRATVDNLKDINRRKSTEIG---RLMAT 1103 Query: 500 TAEVEDIRQQREVQRRVMEE-----QHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMENS 554 + E E Q R + + M++ +QQ++ S+ + AK + KL EN Sbjct: 1104 SQEAEQQIQSRMQEAQKMQQLTEEVARAKQQMETLSKTLAAKESAHDEAMKLQSAEFENQ 1163 Query: 555 Y 555 Y Sbjct: 1164 Y 1164 >gi|145525567|ref|XP_001448600.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124416155|emb|CAK81203.1| unnamed protein product [Paramecium tetraurelia] Length = 891 Score = 37.5 bits (85), Expect = 6.4, Method: Composition-based stats. Identities = 26/245 (10%), Positives = 69/245 (28%), Gaps = 28/245 (11%) Query: 323 NIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESME 382 NI + + Q ++ + + + + E ++ + + D+ + E Sbjct: 379 NITQQREQRQPTVQQIETIHEEELKQTIQEITEELKKPHKKHMTKAQRDEQKKRKKEIQR 438 Query: 383 KTREKGAFVGPLIGGLQSEFIG---------AMISRELDILDSQGNLPECEGADNPPVSL 433 + G + E + + D + L + Sbjct: 439 LHEDIERIKKEKGGDFEYEKTDSDSEDRFRRKKLQNQFDDMFKPRQLSRRQSHQFDENEG 498 Query: 434 LKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTP 493 ++Y S Q+ E + + + + + + Sbjct: 499 EDLDYASE----QEIEDRTKIQRVEQIIQQKRDPNYQYNPQEFWQQE------------- 541 Query: 494 AVLIRDTAEVEDIR-QQREVQR-RVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMM 551 + +V I QQR+ + + + L+QQ+Q +Q +++ Sbjct: 542 VKINVKKPQVSSIASQQRQQEMFYQFQREKLEQQMQMINQKYSQSPTDPPQQQQANPLQY 601 Query: 552 ENSYG 556 + S+G Sbjct: 602 QMSHG 606 >gi|332186618|ref|ZP_08388361.1| HAMP domain protein [Sphingomonas sp. S17] gi|332013270|gb|EGI55332.1| HAMP domain protein [Sphingomonas sp. S17] Length = 609 Score = 37.5 bits (85), Expect = 6.4, Method: Composition-based stats. 
Identities = 31/242 (12%), Positives = 65/242 (26%), Gaps = 13/242 (5%) Query: 309 EAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEELNRLKESIRS-LFLLDLFQ 367 + + S Q + ++ + +++ Sbjct: 316 STVVTAASSINNGAGDIRQASDDLSQRTEQQAASLEETAAAMDEITTTVKETAAGASQAN 375 Query: 368 VLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGAD 427 + +A A ES E R +G I SE I +I+ I L G + Sbjct: 376 RIVGEAREEARESGEIVRRAVQAMGG-IERASSE-ISEIIAVIDGISFQTNLLALNAGVE 433 Query: 428 NPPVSLLK----VEYTSPLFKYQQAESVASALQGVNTVVELGVKTGDPSCMDHMD-TDRV 482 V + Q++ A ++ T V+ G + D R+ Sbjct: 434 AARAGDAGKGFAVVASEVRALAQRSADAAKDVKTRITASSDQVEEGVRLVGETGDALQRI 493 Query: 483 SRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVM-----EEQHLQQQLQQTSQDIGAKA 537 + + + QQ M + + +Q ++ + ++A Sbjct: 494 IQRIAEIDGLVSNIANSADRQATGLQQVNTAVAEMDGMTQQNAAMVEQATAAARSLASEA 553 Query: 538 AG 539 G Sbjct: 554 DG 555 >gi|330830183|ref|YP_004393135.1| phage tail tape measure protein, TP901 family [Aeromonas veronii B565] gi|328805319|gb|AEB50518.1| Phage tail tape measure protein, TP901 family [Aeromonas veronii B565] Length = 811 Score = 37.5 bits (85), Expect = 6.5, Method: Composition-based stats. Identities = 11/89 (12%), Positives = 26/89 (29%) Query: 464 LGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREVQRRVMEEQHLQ 523 + D +DT + + + ++ R Q +R ++ Q Sbjct: 22 AASGQSRITAKDLVDTKKRIKELEAQSGQIDGYRTLGQQIGATRAQLTQAQRDAQQMAQQ 81 Query: 524 QQLQQTSQDIGAKAAGRAMEKKLTHDMME 552 + ++A +A +K E Sbjct: 82 FAKVEQPTKAMSRAMEQAKQKVRDLSQQE 110 >gi|327271670|ref|XP_003220610.1| PREDICTED: nuclear receptor coactivator 6-like [Anolis carolinensis] Length = 2035 Score = 37.1 bits (84), Expect = 7.9, Method: Composition-based stats. 
Identities = 28/261 (10%), Positives = 65/261 (24%), Gaps = 19/261 (7%) Query: 291 ELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYHEEL 350 + P L PG+ + +S G P Sbjct: 465 SNFMVMQQQNQGPQGLHPGLGGMPKRLPPGFPSGQTNQNFMQSQVPSTAPGTPASTGAPQ 524 Query: 351 NRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMISRE 410 + +S + + ++ S G + ++ Sbjct: 525 LQTSQSAQHTGGQGN-GLSQNQMQVQHGPSNMMQSNLMGLHGNMNNQQAGNSGVPQVN-- 581 Query: 411 LDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVASALQGVNTVVELGVKTGD 470 + + G + + + V + Q S+ Q + + +L ++ Sbjct: 582 MGSMQ--GQPSQGPQSQLMGMHQPIVSTQGQMVNIQPQGSLNPQNQMILSRAQLMPQSQM 639 Query: 471 PSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIRQQREV---------QRRVMEEQH 521 + + + TP + + + ++ Q+ M EQ Sbjct: 640 MVAPQNQNLGPTQQRM-----TPPKQMLPQQGQQMMAAHNQMMGPQGQVLLQQNSMMEQM 694 Query: 522 LQQQLQQTSQDIGAKAAGRAM 542 + Q+Q Q GA+ M Sbjct: 695 MTNQMQGNKQQFGAQNQSNVM 715 >gi|260802925|ref|XP_002596342.1| hypothetical protein BRAFLDRAFT_76142 [Branchiostoma floridae] gi|229281597|gb|EEN52354.1| hypothetical protein BRAFLDRAFT_76142 [Branchiostoma floridae] Length = 2545 Score = 37.1 bits (84), Expect = 8.2, Method: Composition-based stats. 
Identities = 18/271 (6%), Positives = 74/271 (27%), Gaps = 26/271 (9%) Query: 288 TVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGNPLPYH 347 + + + + ++ ++ + L E + Sbjct: 308 LQDSSKEALQDKNRVIDQLNHALRTKDQLIQQLNQDKADLVAEKVKPLEAQVQNLTQELR 367 Query: 348 EELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEFIGAMI 407 + +++ I +++ ++ E ++ ++ + + + Sbjct: 368 VKEGNMQDDINRYQQQVEVSKKNNQEIQALLEDQQRKLDEYEIAAGQMTRDHDKKEKEIK 427 Query: 408 SRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLFKYQQAESVAS-------------- 453 E +L+++ E + S +K++ + L + + + + + Sbjct: 428 ELEKLVLEAEDENEELKRKLQDMDSDVKLQEQNALKRDKAIQGLTAAIQNKSKEIDELCE 487 Query: 454 -ALQGVNTVVELGVKTGDPSCMDHMDTDR----VSRFSLWATNTPAVLIRDTAEVEDIR- 507 + ++ + + +S T + AE + ++ Sbjct: 488 QIEELQQSLAQARETAHKAQLQQFQGVEEQQQALSDKEAEITGLQGKVHEKDAENQQLKK 547 Query: 508 ------QQREVQRRVMEEQHLQQQLQQTSQD 532 Q+ + ++ +E Q +D Sbjct: 548 SLRKKEQEIDQLQQAAQEADDQADEALRDKD 578 >gi|329297591|ref|ZP_08254927.1| hypothetical protein Pstas_15486 [Plautia stali symbiont] Length = 337 Score = 36.7 bits (83), Expect = 9.1, Method: Composition-based stats. 
Identities = 31/207 (14%), Positives = 52/207 (25%), Gaps = 21/207 (10%) Query: 326 ALSREGRSLFQPVQFGNPLPYHEELNRLKESIRSLF--LLDLFQVLDDKASRSAAESMEK 383 A + ++ I F L F L + S A Sbjct: 96 AQREGALHGLLMFGVSTLITLWLAISLASGIIGGAFNILGSGFNALGNGISAVAPSVTNM 155 Query: 384 TREKGAFVGPLIGGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYTSPLF 443 +EK + LQ+E L G V + Sbjct: 156 AKEKLQENNINLDDLQNELQTT--------LRQTGK-----PELQSENLQQDVNSEANNA 202 Query: 444 KYQQAESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEV 503 + Q ++ + N + H DT + + A + E Sbjct: 203 QNQAKQTAQNPQNAGNDIANWIRGV----LSRHADTLQAADRDALKNIIKARTGKSDQEA 258 Query: 504 EDIRQQREVQRRVMEE--QHLQQQLQQ 528 E I Q E + + Q L+Q+ +Q Sbjct: 259 EQIVNQTEQSYQQAMQKYQQLKQEAEQ 285 >gi|322367864|ref|ZP_08042434.1| Patched family protein [Haladaptatus paucihalophilus DX253] gi|320552571|gb|EFW94215.1| Patched family protein [Haladaptatus paucihalophilus DX253] Length = 1255 Score = 36.7 bits (83), Expect = 9.6, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 75/233 (32%), Gaps = 25/233 (10%) Query: 340 FGNPLPYHEELNRLKESIRSLFLLDLFQVLDDKA----SRSAAESMEKTREKGAFVGPLI 395 + L++ L + S ES + + KG + Sbjct: 193 QQRSDELNRSKQDLQQRGEELKEEGQELKQRGQTLQQRSDELNESKAQLQAKGQELQAQA 252 Query: 396 GGLQSEFIGAMISRELDILDSQGNLPECEGADNPPVSLLKVEYT----SPLFKYQQAESV 451 L +E + ++ ++ L E + L+V + + ES+ Sbjct: 253 KQL-NESKAQLRNQSEELKQRAQELNESRAELEQRQANLEVRAQELNQTQRELAARNESL 311 Query: 452 ASALQGVNTVVELGVKTGDPSCMDHMDT--DRVSRFSLWATNTPAVLIRDTAEVEDIRQQ 509 + + G D +D+ + + A L ++A ++ RQ+ Sbjct: 312 QERRATIEEAHQNG-TINDTEYEQRLDSLREEQAELKADQ----AQLANESAALQQDRQE 366 Query: 510 REVQRRVMEE---------QHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMEN 553 EV + +E+ L+QQ +Q + G A RA ++ + ++ + Sbjct: 367 LEVDAQQLEQRAAELESDKAELEQQSEQLQESAGQLQAERAELEQRSAELQQE 419 >gi|149278197|ref|ZP_01884335.1| hypothetical protein PBAL39_11587 [Pedobacter sp. BAL39] gi|149230963|gb|EDM36344.1| hypothetical protein PBAL39_11587 [Pedobacter sp. 
BAL39] Length = 1110 Score = 36.7 bits (83), Expect = 9.7, Method: Composition-based stats. Identities = 42/303 (13%), Positives = 92/303 (30%), Gaps = 52/303 (17%) Query: 283 RRLNETVNELAQFGRLSLHPPTIAVSEAKQRNFDLKPGYMNIGALSREGRSLFQPVQFGN 342 ++L+E L Q ++ E+K+ L + + F + Sbjct: 502 KKLDEGSQTLKQQMAKAIKLAGTVEKESKKLGETLL---------------DKKQLTFDD 546 Query: 343 PLPYHEELNRLKESIRSLFLLDLFQVLDDKASRSAAESMEKTREKGAFVGPLIGGLQSEF 402 + L++ K+ ++ + E+ +EK + L + + Sbjct: 547 KKQVEQLLDKRKQLEAAVKEIQQLNQQQTSDKAENNTLTEELKEKQRQIDELFNHVLDDK 606 Query: 403 IGAMISRELDILDSQGNLPECEGA--DNPPVSLLKVEYTSPLFKYQQAESVASALQGVNT 460 A++ + ++D + LK E L Y+Q E + Q ++ Sbjct: 607 TKALLEKLQQMMDQNNKEQTHDELSKMQVDNKSLKKELDRILELYKQLEYEQNLQQNIDQ 666 Query: 461 VVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTA-----EVEDIRQQREV--- 512 + EL K S + N P ++ E E IR++ + Sbjct: 667 LKELAKKQEALS-----KKSTAAEQKTADRNAPKEELKKQQRENAAEFEQIRKELQQLKE 721 Query: 513 ----------------------QRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDM 550 ++ E+ L++ Q + + KAAG+ + + Sbjct: 722 KNEQLEHPNDFSMPEKESADIKSQQEQSEESLEKNNLQKAAEHQKKAAGQLEQMAKKMEE 781 Query: 551 MEN 553 M+ Sbjct: 782 MQQ 784 >gi|254675300|ref|NP_598708.3| nuclear mitotic apparatus protein 1 [Mus musculus] Length = 2094 Score = 36.7 bits (83), Expect = 9.8, Method: Composition-based stats. 
Identities = 16/107 (14%), Positives = 37/107 (34%), Gaps = 12/107 (11%) Query: 448 AESVASALQGVNTVVELGVKTGDPSCMDHMDTDRVSRFSLWATNTPAVLIRDTAEVEDIR 507 SV++ Q + + G T A L + E+ ++ Sbjct: 470 QSSVSNLSQAKEELEQASQAQGAQLTAQ----------LTSMTGLNATLQQRDQELASLK 519 Query: 508 QQREVQRRVMEEQHLQQQLQQTSQDIGAKAAGRAMEKKLTHDMMENS 554 +Q + ++ M + +Q+ Q +Q + + + KL +E + Sbjct: 520 EQAKKEQAQMLQTMQEQE--QAAQGLRQQVEQLSSSLKLKEQQLEEA 564

Database: nr
Posted date: May 22, 2011 12:22 AM
Number of letters in database: 999,999,966
Number of sequences in database: 2,987,313

Database: /data/usr2/db/fasta/nr.01
Posted date: May 22, 2011 12:30 AM
Number of letters in database: 999,999,796
Number of sequences in database: 2,903,041

Database: /data/usr2/db/fasta/nr.02
Posted date: May 22, 2011 12:36 AM
Number of letters in database: 999,999,281
Number of sequences in database: 2,904,016

Database: /data/usr2/db/fasta/nr.03
Posted date: May 22, 2011 12:41 AM
Number of letters in database: 999,999,960
Number of sequences in database: 2,935,328

Database: /data/usr2/db/fasta/nr.04
Posted date: May 22, 2011 12:46 AM
Number of letters in database: 842,794,627
Number of sequences in database: 2,394,679

Lambda K H
0.308 0.122 0.289

Lambda K H
0.267 0.0371 0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,813,804,080
Number of Sequences: 14124377
Number of extensions: 97158073
Number of successful extensions: 4014518
Number of sequences better than 10.0: 10000
Number of HSP's better than 10.0 without gapping: 24508
Number of HSP's successfully gapped in prelim test: 6741
Number of HSP's that attempted gapping in prelim test: 2193837
Number of HSP's gapped (non-prelim): 691387
length of query: 556
length of database: 4,842,793,630
effective HSP length: 144
effective length of query: 412
effective length of database: 2,808,883,342
effective search space: 1157259936904
effective search space used: 1157259936904
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.3 bits)
S2: 84 (37.1 bits)
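
The search-space and bit-score figures in the statistics footer can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST output. It assumes the usual length correction (the effective HSP length is subtracted once from the query and once per database sequence) and the standard raw-to-bit conversion S' = (Lambda*S - ln K) / ln 2. It also assumes that the first Lambda/K pair is the ungapped set and the second the gapped set, which the flattened report does not label. The function names are illustrative only.

```python
import math

# Values copied from the report footer.
QUERY_LEN = 556
DB_LETTERS = 4_842_793_630
DB_SEQS = 14_124_377
EFF_HSP_LEN = 144

# Assumed labelling: first Lambda/K pair = ungapped, second = gapped.
UNGAPPED = (0.308, 0.122)
GAPPED = (0.267, 0.0371)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # The correction is applied once for every database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


eff_q, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
)
print("effective length of query:   ", eff_q)
print("effective length of database:", eff_db)
print("effective search space:      ", space)
print("S1 in bits:", round(bit_score(41, *UNGAPPED), 1))
print("S2 in bits:", round(bit_score(84, *GAPPED), 1))
```

Under these assumptions the computed values agree with the footer: 412, 2,808,883,342 and 1157259936904 for the lengths and search space, and 21.3 and 37.1 bits for S1 and S2. The per-hit Expect values are not reproduced here, because the report applies composition-based statistics to them.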