BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Schaffer, Alejandro A.,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of
PSI-BLAST protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.

Query= gi|254781145|ref|YP_003065558.1| hypothetical protein CLIBASIA_05245 [Candidatus Liberibacter asiaticus str. psy62]
         (350 letters)

Database: nr
           14,124,377 sequences; 4,842,793,630 total letters

Searching..................................................done

>gi|254781145|ref|YP_003065558.1| hypothetical protein CLIBASIA_05245 [Candidatus Liberibacter asiaticus str. psy62]
 gi|254040822|gb|ACT57618.1| hypothetical protein CLIBASIA_05245 [Candidatus Liberibacter asiaticus str. psy62]
          Length = 350

 Score = 292 bits (746), Expect = 6e-77, Method: Composition-based stats.
 Identities = 350/350 (100%), Positives = 350/350 (100%)

Query: 1   MALNYFIHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPD 60
           MALNYFIHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPD
Sbjct: 1   MALNYFIHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPD 60

Query: 61  SVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEM 120
           SVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEM
Sbjct: 61  SVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEM 120

Query: 121 IEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGE 180
           IEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGE
Sbjct: 121 IEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGE 180

Query: 181 SLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGM 240
           SLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGM
Sbjct: 181 SLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGM 240

Query: 241 DIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQS 300
           DIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQS
Sbjct: 241 DIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQS 300

Query: 301 GVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY 350
           GVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY
Sbjct: 301 GVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY 350

>gi|315122535|ref|YP_004063024.1| hypothetical protein CKC_03935 [Candidatus Liberibacter solanacearum CLso-ZC1]
 gi|313495937|gb|ADR52536.1| hypothetical protein CKC_03935 [Candidatus Liberibacter solanacearum CLso-ZC1]
          Length = 637

 Score = 274 bits (699), Expect = 2e-71, Method: Composition-based stats.
Identities = 293/343 (85%), Positives = 315/343 (91%), Gaps = 1/343 (0%) Query: 9 MLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSP 68 +LI D +VEVLEH+ R+D E +HD+RIRRKYSQGKVCVDAV PDEFLIHPD+ DIEKSP Sbjct: 156 LLISDPEVEVLEHTQRKDREEIIHDIRIRRKYSQGKVCVDAVPPDEFLIHPDATDIEKSP 215 Query: 69 IVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYV 128 IVGRKLYLTRSDLISMGYDR+ IN L + SSQ EN+W+ K +SD ALEMIEYYELYV Sbjct: 216 IVGRKLYLTRSDLISMGYDRKYINQLQVASSQGNENSWQLSKYHHSDTALEMIEYYELYV 275 Query: 129 TIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIE 188 T+DYD DGIAELRRV+M GGTGKDNIL NEEW+ELPFTCLRA+RAPHCF+GESLA+SIIE Sbjct: 276 TLDYDNDGIAELRRVVMVGGTGKDNILVNEEWDELPFTCLRAIRAPHCFVGESLASSIIE 335 Query: 189 IQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGI 248 IQKIKTVLLRQTLDNLYWQNQPQTIVQEGSI+DPESVLNPQFGKPIRV +GMDIRSVLGI Sbjct: 336 IQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIVDPESVLNPQFGKPIRVVSGMDIRSVLGI 395 Query: 249 HSVPMIEK-SFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVEL 307 HSVPMI SFSMLHYLDQELVDRTGISDISSG SPEILQNMTATATSLIEQSGVGQVEL Sbjct: 396 HSVPMIADKSFSMLHYLDQELVDRTGISDISSGLSPEILQNMTATATSLIEQSGVGQVEL 455 Query: 308 IVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY 350 IVRTLAQGLE LFRGLLRLIIQHQDKVRMVRLRDQW+SFDPR+ Sbjct: 456 IVRTLAQGLERLFRGLLRLIIQHQDKVRMVRLRDQWISFDPRH 498 >gi|291334599|gb|ADD94249.1| hypothetical protein Daci_1943 [uncultured phage MedDCM-OCT-S04-C136] Length = 741 Score = 273 bits (698), Expect = 3e-71, Method: Composition-based stats. 
Identities = 102/350 (29%), Positives = 194/350 (55%), Gaps = 12/350 (3%) Query: 8 HMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKS 67 ++K +++ ++ S + +++ +I+R G+V ++++ P+EFLI + IE + Sbjct: 183 EEVLKQYEMQGVDISQVQVPNFNLYNCKIKRIKKTGRVKIESIPPEEFLIDRSAKTIEDA 242 Query: 68 PIVGRKLYLTRSDLISMGYDRESINNLPIIS---SQNIENTWKFPKNQY-----SDKALE 119 V K+ +TRSDL++MGY ++ ++ LP + E + Y +D + E Sbjct: 243 DFVSHKVLMTRSDLVAMGYPQDEVDELPKSDLDIYNDEETVRLADVDDYRISSSTDTSTE 302 Query: 120 MIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIG 179 + YE YV DYD DGIAELR+++ AG G +IL N + +PF + + PH F G Sbjct: 303 KVLVYESYVKYDYDEDGIAELRKIVSAGADG-HHILSNMPCDSVPFVTITPIPMPHRFYG 361 Query: 180 ESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAG 239 S++ + ++Q +K+ ++RQ LDN+Y N + V +G +++ + +L + G +R Sbjct: 362 RSISELVEDVQLMKSTVMRQLLDNMYLTNNNRVAVMDG-MVNMDDLLTTRPGGIVRTKQ- 419 Query: 240 MDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQ 299 + + + + P+ +++F +L YLD RTG+S + G SP+ L TAT + + Q Sbjct: 420 PPNQVMQPLQAQPISQQAFPLLSYLDSVREGRTGVSKEAQGLSPDTLNAKTATGVNALMQ 479 Query: 300 SGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 + ELI R A+ G++ LF+ + L++++QDK +++ + +Q++ P Sbjct: 480 QTQMRSELIARVFAETGVKDLFKKIFELMVKYQDKEKIIMMSNQYIPVRP 529 >gi|227822448|ref|YP_002826420.1| hypothetical protein NGR_c19030 [Sinorhizobium fredii NGR234] gi|227341449|gb|ACP25667.1| hypothetical protein NGR_c19030 [Sinorhizobium fredii NGR234] Length = 684 Score = 252 bits (644), Expect = 4e-65, Method: Composition-based stats. 
Identities = 185/361 (51%), Positives = 248/361 (68%), Gaps = 16/361 (4%) Query: 6 FIHMLIKDSDVEVLEHSHREDGGE--------KVHDLRIRRKYSQGKVCVDAVSPDEFLI 57 + LI D +VEV+E S + E + ++IRR+ +G + AV +EFLI Sbjct: 152 ALVQLIGDDEVEVVEQSRTTEKIETPQGMVEQPSYSVKIRRRLERGTPRLAAVPLEEFLI 211 Query: 58 HPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKN------ 111 HP+++ I SPI G + RSDLI+ GYDR+ I LP + + + +F + Sbjct: 212 HPEAISIADSPIAGIATRMRRSDLIATGYDRDLIEGLPASTGDSGRDDEEFTRRRGVFEA 271 Query: 112 -QYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRA 170 KALE ++YYELYV +D D DGIAELRR+++AGGTG++++L NEEW+E+PF L Sbjct: 272 KDAVPKALEEVDYYELYVKVDADDDGIAELRRLVLAGGTGEEHLLSNEEWDEVPFADLII 331 Query: 171 MRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQF 230 R PH G S+ + EIQ++KTVL+RQTLDNLYWQN Q IVQEG+I +PESVLNP+F Sbjct: 332 ERRPHQREGGSVTDDMAEIQRVKTVLMRQTLDNLYWQNNQQPIVQEGAIANPESVLNPKF 391 Query: 231 GKPIRVAAGMDIRSVLGIHS-VPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNM 289 +PIRV+ G+D R+ LG + ++SF+ML YLDQE DRTGISD SSG +P+ L NM Sbjct: 392 AQPIRVSQGIDARAALGYTMVPFVAKESFAMLSYLDQEATDRTGISDASSGLAPDALTNM 451 Query: 290 TATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 TA AT+LIEQ+G+GQ EL+VRT AQGL +F+GLLRL+I+HQD+ R VRLR QWV+FDPR Sbjct: 452 TARATALIEQAGIGQTELMVRTFAQGLRRVFKGLLRLVIKHQDRPRAVRLRGQWVTFDPR 511 Query: 350 Y 350 + Sbjct: 512 H 512 >gi|150397041|ref|YP_001327508.1| hypothetical protein Smed_1838 [Sinorhizobium medicae WSM419] gi|150028556|gb|ABR60673.1| hypothetical protein Smed_1838 [Sinorhizobium medicae WSM419] Length = 683 Score = 250 bits (638), Expect = 2e-64, Method: Composition-based stats. 
Identities = 184/359 (51%), Positives = 247/359 (68%), Gaps = 15/359 (4%) Query: 7 IHMLIKDSDVEVLEHSHRED--------GGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIH 58 + L+ D +VEVLE S + + + ++IRR+ +G + AV +EFLIH Sbjct: 153 LIQLVGDDEVEVLEQSQTVERMETPQGVVEQPSYSVKIRRRAERGTPRLAAVPLEEFLIH 212 Query: 59 PDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPI------ISSQNIENTWKFPKNQ 112 PD++ I SPI G + + RSDL++MG+DR+ I+ LP + F Sbjct: 213 PDAISIADSPITGFAMRMRRSDLVAMGHDRDLIDGLPAAEAGGRDDEASTRRRDAFETKD 272 Query: 113 YSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMR 172 KALE ++YYELYV +D D DGIAELRR++ AGGT ++N+L NEEW+E+PF L R Sbjct: 273 AVPKALEEVDYYELYVKVDADDDGIAELRRLVFAGGTSEENLLSNEEWDEVPFADLTVER 332 Query: 173 APHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGK 232 PH G S+ + EIQ++KTVL+RQTLDNLYWQN Q IVQEG+I +PE+VLNP+FG+ Sbjct: 333 RPHQREGGSVTGDMAEIQRVKTVLMRQTLDNLYWQNNQQPIVQEGAIANPEAVLNPKFGQ 392 Query: 233 PIRVAAGMDIRSVLGIHS-VPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTA 291 PIRV+ G+D R+ LG + ++SF+ML YLDQE DRTGISD SSG +P+ LQNMTA Sbjct: 393 PIRVSQGIDARAALGYTMVPFVAKESFAMLSYLDQEATDRTGISDASSGMAPDALQNMTA 452 Query: 292 TATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPRY 350 AT+L+EQ+G+GQ EL+VRT AQGL +FRGLLRL+++HQD+ R VRLR QWV+FDPR+ Sbjct: 453 RATALVEQAGIGQTELMVRTFAQGLRRVFRGLLRLVVKHQDRPRAVRLRGQWVTFDPRH 511 >gi|294083946|ref|YP_003550703.1| putative portal protein [Candidatus Puniceispirillum marinum IMCC1322] gi|292663518|gb|ADE38619.1| putative portal protein [Candidatus Puniceispirillum marinum IMCC1322] Length = 697 Score = 229 bits (584), Expect = 4e-58, Method: Composition-based stats. 
Identities = 84/356 (23%), Positives = 162/356 (45%), Gaps = 23/356 (6%) Query: 7 IHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEK 66 + ML+ DV++++ S +D G I +V ++ + P+E ++ +E+ Sbjct: 180 LDMLLAQDDVDLID-SSTDDVGMV--SGTIGVTRDTSQVVIETIPPEELIVEAQCKSLEE 236 Query: 67 SPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWK---------FPKNQYSDKA 117 S + T S+L M D + ++++ +E + + S Sbjct: 237 STFSAHRTRKTLSELREMYPDSDKLDDIGDHEDVEMETDPEILARHDGVSENRGFSSHGY 296 Query: 118 LEMI---EYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAP 174 + + YE Y+ +D +G GIA+L +V AG +L EE PF + P Sbjct: 297 QDQVRHILCYEAYIMLDVEGSGIAKLHKVTKAGNV----LLDIEEVKRRPFVTFCPLPIP 352 Query: 175 HCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPI 234 H F G + A + Q +TVL R LD+ N P+ +V +G + +P +++ + G + Sbjct: 353 HAFYGSNFAEKLCATQNARTVLTRSILDHAMITNNPRYMVVKGGLSNPRELIDNRVGGLV 412 Query: 235 RVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNM-TATA 293 V+ I + + P+ F L LDQ+L D TG+S +S G + + + +A Sbjct: 413 NVSRPDAISA---MPQAPLNPFVFQTLQQLDQDLEDNTGVSRLSQGLNKDAISKQNSAAM 469 Query: 294 TSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 + + +++ R AQ ++ LF + RL+++++D+ ++V + +V DPR Sbjct: 470 VEQLATMSQQRQKILARHFAQFVKSLFHEIYRLVVENEDQQKIVEISGAYVEVDPR 525 >gi|160897386|ref|YP_001562968.1| hypothetical protein Daci_1943 [Delftia acidovorans SPH-1] gi|160362970|gb|ABX34583.1| conserved hypothetical protein [Delftia acidovorans SPH-1] Length = 763 Score = 228 bits (581), Expect = 8e-58, Method: Composition-based stats. 
Identities = 103/332 (31%), Positives = 157/332 (47%), Gaps = 20/332 (6%) Query: 30 KVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDR- 88 + D+ +R G+V V+ V P+EFLI + IE + VG ++ T S+L SMGY Sbjct: 230 MLWDVVCKRVKKGGRVRVENVPPEEFLISRKAKSIEDASFVGHRVARTISELKSMGYKNV 289 Query: 89 ESINNLPIISSQNIENTWKFPKNQ-----------YSDKALEMIEYYELYVTIDYDGDGI 137 + I + +S N+E + + D + I E Y+ DYDGDGI Sbjct: 290 DDITSDDQAASLNMERIERLSWDDEMAYLQMDNVQSMDTSQRQIWVTECYLRCDYDGDGI 349 Query: 138 AELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLL 197 AELR+V+ AG IL NE + PF + + PH F G S+A +E Q+I T+LL Sbjct: 350 AELRKVVRAGN----QILENEVCDVAPFVSITPVPMPHKFFGLSVADLALEGQRINTILL 405 Query: 198 RQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKS 257 R LDN + + EG + + + +L + G +R+ + + + Sbjct: 406 RNQLDNNNLEVNGRYFAVEGQV-NLDDLLTSRPGGVVRMKSAGMAGRLD--QGAGNSGLN 462 Query: 258 FSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLE 317 M+ Y+ D TG + + G + L N TAT + I +++LI R A G Sbjct: 463 LQMMEYMKGFQEDSTGWTRYNQGSDGDSL-NQTATGVNQIVNRADMRLDLIARNYADGFR 521 Query: 318 ILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 LFR +L+L Q+Q MV+LR +WV PR Sbjct: 522 ELFRLMLKLCSQYQQTEDMVKLRGKWVPVSPR 553 >gi|291334641|gb|ADD94289.1| portal protein [uncultured phage MedDCM-OCT-S04-C64] Length = 755 Score = 221 bits (562), Expect = 2e-55, Method: Composition-based stats. 
Identities = 96/353 (27%), Positives = 173/353 (49%), Gaps = 14/353 (3%) Query: 10 LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVD--IEKS 67 L+ D +V+ + E ++ + G + ++ V P+EF I ++ +E + Sbjct: 157 LLSDPNVQRELIEDSIEQTEFGLNVEFKVIEKMGSIRIEPVPPEEFGIARNARSPYVEDT 216 Query: 68 PIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIEN--------TWKFPKNQYSDKALE 119 + + S+L++MGYD E I +LP S E + P + S++++ Sbjct: 217 NFCYHRTLKSFSELVAMGYDVELIRSLPFDESAMTEEELARRNKTDEEEPFDYVSEESMR 276 Query: 120 MIEYYELYVTIDYDGDGIAELRRVIMAGG---TGKDNILCNEEWNELPFTCLRAMRAPHC 176 E Y+ ID DGD IAEL RV +AGG +G +L EE + +PF + PH Sbjct: 277 NYFITECYIKIDRDGDDIAELLRVTLAGGNYTSGSSRLLGIEEVDHMPFATCSPILMPHK 336 Query: 177 FIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRV 236 F G S+A +++Q+IK+VL RQ LDN Y N +T V + + + + + G Sbjct: 337 FYGLSIADITMDLQRIKSVLTRQMLDNTYLANNSRTAVNDSHVNLDDLLTSRPGGVVRYK 396 Query: 237 AAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSL 296 G + + I P+ ++++M+ YLD RTG+ D ++G L N+ +L Sbjct: 397 GEGSASQYITPIPHNPLPNEAYTMMGYLDDVRRQRTGVGDETAGLGENSLSNVNTGVAAL 456 Query: 297 IEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 + ++ELI R L + G + +FR + +L+++HQD+ ++ + + + +P Sbjct: 457 AFDAKRMKIELIARILGEVGFKDVFRLIHKLLMKHQDRKMLLNVAGNFQAINP 509 >gi|148257059|ref|YP_001241644.1| hypothetical protein BBta_5791 [Bradyrhizobium sp. BTAi1] gi|146409232|gb|ABQ37738.1| putative exported protein of unknown function [Bradyrhizobium sp. BTAi1] Length = 557 Score = 219 bits (558), Expect = 4e-55, Method: Composition-based stats. 
Identities = 87/335 (25%), Positives = 147/335 (43%), Gaps = 18/335 (5%) Query: 30 KVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGR-KLYLTRSDLISMGYDR 88 HD+ I + V V P+EF I + I + T + LI+ G+D Sbjct: 18 TTHDVTIVTTRKFAQARVMGVPPEEFGIERGARSIRDCNYCFHEIVTKTEAQLIAEGFDA 77 Query: 89 ESINNLPIISSQ--------NIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAEL 140 I +L + + + ++ ++ E YV +DY+G+G L Sbjct: 78 AQIRSLGDYAGTTRVETLARDTVDEQSRASASAANSGTRLVRITEHYVRMDYEGEGRPCL 137 Query: 141 RRVIMAGGTGKDNI----LCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVL 196 ++I G G+ C ++ +PF + H F G S+A ++ +Q+ KT L Sbjct: 138 YQIITGGDQGEILRKDGQDCITPFDAIPFAATTPVPMTHRFFGRSIADLVMPLQREKTAL 197 Query: 197 LRQTLDNLYWQNQPQTIVQEGSIIDP--ESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMI 254 R LDNLY N P+ V E + + +L + G +R + + + Sbjct: 198 KRGALDNLYLHNNPRVEVAEANAGPNTLDDLLVSRPGGVVRTKTAGGLNWQV---VPDIT 254 Query: 255 EKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ 314 + ML Y+D EL R+G+S + G LQN +ATA + + + +++LI R +A+ Sbjct: 255 SSIYPMLQYIDAELESRSGLSKQAQGIDANALQNQSATAVAQVFSASQMRIKLIARIMAE 314 Query: 315 GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 G+ +F L I +H + + VRLR+ WV DPR Sbjct: 315 GVRDMFGLLHATIRKHGQQRQTVRLRNAWVQVDPR 349 >gi|167600438|ref|YP_001671938.1| portal protein [Pseudomonas phage LUZ24] gi|161168301|emb|CAP45466.1| portal protein [Pseudomonas phage LUZ24] Length = 706 Score = 201 bits (511), Expect = 1e-49, Method: Composition-based stats. 
Identities = 94/354 (26%), Positives = 175/354 (49%), Gaps = 26/354 (7%) Query: 10 LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPI 69 ++ D D E+L S EDG + ++IR+ + ++ V + P+ FL+ + I+ + Sbjct: 163 ILADPDTEILAQSVDEDG---TYSIKIRKDKKKREIKVTCIKPENFLVDRLATCIDDARF 219 Query: 70 VGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFP-------------KNQYSDK 116 + + T SDL +G + ++ LP + ++ + + + Sbjct: 220 LCHREKYTVSDLRLLGVPEDVLDELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAE 279 Query: 117 ALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHC 176 A + E Y +D DGDGI+ELRR++ G I+ NE W+ PF L A R H Sbjct: 280 ANREVWASECYTLLDVDGDGISELRRILYVGD----YIISNEPWDSRPFADLNAYRIAHK 335 Query: 177 FIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRV 236 F G S+ I +IQ+I++VL+R +DN+Y NQ +++V +G + + + N G Sbjct: 336 FHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVNLEDLLTNEAAGIVRVK 395 Query: 237 AAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEIL-QNMTATATS 295 + S++ + + + + + ML L+ + RTGI+D + G L N A + + Sbjct: 396 ----AMNSIMPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVN 451 Query: 296 LIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 + + Q++LI R A+ G++ LF+ L I++Q++ + +LR +WV+ +P Sbjct: 452 QLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAINP 505 >gi|27476052|ref|NP_775254.1| putative portal protein [Pseudomonas phage PaP3] gi|27414482|gb|AAL85568.1| ORF.04 [Pseudomonas phage PaP3] Length = 705 Score = 199 bits (506), Expect = 5e-49, Method: Composition-based stats. 
Identities = 96/354 (27%), Positives = 174/354 (49%), Gaps = 26/354 (7%) Query: 10 LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPI 69 ++ D D +L S +DG + ++IR+ + ++ V V P+ FL+ + I+ + Sbjct: 162 ILSDPDTSILAQSVDDDG---TYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARF 218 Query: 70 VGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFP-------------KNQYSDK 116 + + T SDL +G + I LP + ++ + + + Sbjct: 219 LCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAE 278 Query: 117 ALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHC 176 A + E Y +D DGDGI+ELRR++ G I+ NE W+ PF L A R H Sbjct: 279 ANREVWASECYTLLDVDGDGISELRRILYVGD----YIISNEPWDCRPFADLNAYRIAHK 334 Query: 177 FIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRV 236 F G S+ I +IQ+I++VL+R +DN+Y NQ +++V +G + + + N G +RV Sbjct: 335 FHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVNLEDLLTNEAAG-IVRV 393 Query: 237 AAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEIL-QNMTATATS 295 + I + + + + + ML L+ + RTGI+D + G L N A + + Sbjct: 394 KSMNSIT---PLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVN 450 Query: 296 LIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 + + Q++LI R A+ G++ LF+ L I++Q++ + +LR +WV+ +P Sbjct: 451 QLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNP 504 >gi|221199509|ref|ZP_03572553.1| putative portal protein [Burkholderia multivorans CGD2M] gi|221205589|ref|ZP_03578604.1| putative portal protein [Burkholderia multivorans CGD2] gi|221174427|gb|EEE06859.1| putative portal protein [Burkholderia multivorans CGD2] gi|221180794|gb|EEE13197.1| putative portal protein [Burkholderia multivorans CGD2M] Length = 807 Score = 190 bits (482), Expect = 3e-46, Method: Composition-based stats. 
Identities = 97/339 (28%), Positives = 164/339 (48%), Gaps = 24/339 (7%) Query: 26 DGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMG 85 + ++H++ + R G V ++AV P++FL+ S I ++ T SDL + G Sbjct: 268 EQLPRLHNVVLTRSKKAGHVAIEAVMPEDFLVSARSRRIRD-GFCAHRVRKTLSDLKAEG 326 Query: 86 YDR-ESINNLPIISSQNIEN------------TWKFPKNQYSDKALEMIEYYELYVTIDY 132 Y+ E I++ P + ++ + + D++ +E YE Y+ ID Sbjct: 327 YENVELIDSEPNAVAADLSELALARQNEQNRVVTNALDDGFGDESQREVELYECYLPIDV 386 Query: 133 DGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKI 192 DGDGI+E R++ AG IL NE + PF + + P IG S+A + IQ+I Sbjct: 387 DGDGISEWRKITKAGN----AILDNEVVDGPPFALVSPISIPGLLIGRSIADLAMPIQRI 442 Query: 193 KTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVP 252 KT LR DN+ Q + + +G + + ++ + G +R+ + I + +P Sbjct: 443 KTKFLRGLDDNMQIQINGRVGLVDGKVNVND-WMDNRPGGGVRIKSADAIVPIK--QGLP 499 Query: 253 MIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTL 312 I + +L Y+D +RTGI+ S G + L N TA I +V++I R Sbjct: 500 DIAGAMQLLQYVDAMSQERTGITKYSQGLDADTL-NHTADGIKRITARADLRVKMIARKF 558 Query: 313 AQ-GLEILFRGLLRLIIQHQDKVRMVRL-RDQWVSFDPR 349 A+ G+ LFR + +L++QHQDK + L + +WV DPR Sbjct: 559 AETGVTDLFRLIQKLLMQHQDKPMSIALSKGKWVDIDPR 597 >gi|288817860|ref|YP_003432207.1| putative portal protein [Hydrogenobacter thermophilus TK-6] gi|288787259|dbj|BAI69006.1| putative portal protein [Hydrogenobacter thermophilus TK-6] Length = 618 Score = 185 bits (470), Expect = 6e-45, Method: Composition-based stats. 
Identities = 79/364 (21%), Positives = 157/364 (43%), Gaps = 29/364 (7%) Query: 8 HMLIKDSDVEVLEHSHR------EDGGEKVHDL--RIRRKYSQGKVCVDAVSPDEFLIHP 59 +++ ++++ +H +D G ++ + +I R S+ + C++ V EF+ HP Sbjct: 149 EIVLGWDELQLAQHDPTAVVESAQDLGNGIYRVALKISRL-SKNQPCLENVPATEFIFHP 207 Query: 60 DSVDIEKSPIVGRKLYLTRSDLI---SMGYDR--ESINNLPIISSQNIENTWKF------ 108 ++ ++ SP V + +T L G + + + + Sbjct: 208 STLSVKDSPFVAHRKVVTVDYLKRKEKEGIYKNVDKVIESASSDDLRYTQMADYYLKPYK 267 Query: 109 ---PKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPF 165 D A + YE Y D + DG+ L VI+ G + + PF Sbjct: 268 KYAVSESDQDLARRKVLLYECYTKYDINNDGL--LEDVIITVGNNTILRIQENIYGRPPF 325 Query: 166 TCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESV 225 L + P+ G+S A + +IQ +KT L+ Q + N+ N + + + + + V Sbjct: 326 FVLAPILEPYQLWGKSFADVLKDIQDLKTALVNQIIVNVGMNNDYKIAINDTLVNVQDIV 385 Query: 226 LNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEI 285 + + A ++++ + + P+ SF+ L Y++ +RTGI+ + G Sbjct: 386 NDKPVIRM--KAGADIRQAIMPLPTQPLAPWSFNFLEYIEGTKENRTGITRYNQGLDGRS 443 Query: 286 LQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQWV 344 L N TA+ S+I Q+ ++ELI R A+ G++ LF L+ L Q D+ ++RL ++ + Sbjct: 444 L-NKTASGISMIMQAANQRLELIARIFAETGIKDLFSFLVYLNQQFIDQKTVIRLTNKSL 502 Query: 345 SFDP 348 P Sbjct: 503 PIAP 506 >gi|308751459|gb|ADO44942.1| hypothetical protein Hydth_0542 [Hydrogenobacter thermophilus TK-6] Length = 618 Score = 185 bits (470), Expect = 6e-45, Method: Composition-based stats. 
Identities = 79/364 (21%), Positives = 157/364 (43%), Gaps = 29/364 (7%) Query: 8 HMLIKDSDVEVLEHSHR------EDGGEKVHDL--RIRRKYSQGKVCVDAVSPDEFLIHP 59 +++ ++++ +H +D G ++ + +I R S+ + C++ V EF+ HP Sbjct: 149 EIVLGWDELQLAQHDPTAVVESAQDLGNGIYRVALKISRL-SKNQPCLENVPATEFIFHP 207 Query: 60 DSVDIEKSPIVGRKLYLTRSDLI---SMGYDR--ESINNLPIISSQNIENTWKF------ 108 ++ ++ SP V + +T L G + + + + Sbjct: 208 STLSVKDSPFVAHRKVVTVDYLKRKEKEGIYKNVDKVIESASSDDLRYTQMADYYLKPYK 267 Query: 109 ---PKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPF 165 D A + YE Y D + DG+ L VI+ G + + PF Sbjct: 268 KYAVSESDQDLARRKVLLYECYTKYDINNDGL--LEDVIITVGNNTILRIQENIYGRPPF 325 Query: 166 TCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESV 225 L + P+ G+S A + +IQ +KT L+ Q + N+ N + + + + + V Sbjct: 326 FVLAPILEPYQLWGKSFADVLKDIQDLKTALVNQIIVNVGMNNDYKIAINDTLVNVQDIV 385 Query: 226 LNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEI 285 + + A ++++ + + P+ SF+ L Y++ +RTGI+ + G Sbjct: 386 NDKPVIRM--KAGADIRQAIMPLPTQPLAPWSFNFLEYIEGTKENRTGITRYNQGLDGRS 443 Query: 286 LQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQWV 344 L N TA+ S+I Q+ ++ELI R A+ G++ LF L+ L Q D+ ++RL ++ + Sbjct: 444 L-NKTASGISMIMQAANQRLELIARIFAETGIKDLFSFLVYLNQQFIDQKTVIRLTNKSL 502 Query: 345 SFDP 348 P Sbjct: 503 PIAP 506 >gi|167583563|ref|YP_001671753.1| portal protein [Enterobacteria phage phiEco32] gi|164375401|gb|ABY52809.1| portal protein [Enterobacteria phage phiEco32] Length = 747 Score = 182 bits (461), Expect = 7e-44, Method: Composition-based stats. 
Identities = 67/366 (18%), Positives = 149/366 (40%), Gaps = 30/366 (8%) Query: 2 ALNYFIHMLIKD--SDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHP 59 AL ++ L ++E+ + + D+++ + + +V V+ V ++ + Sbjct: 157 ALAAYVQGLEAGGLKNLEIFTEENEDGTV----DVKVTYEQTVKRVKVEYVPSEQIFVDE 212 Query: 60 DSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISS-------------QNIENTW 106 + + ++ ++ DL++MG+ ++ I + + Sbjct: 213 HATSFADAQYFCHRVRRSKEDLVAMGFPKDEIEAFNDWTDTMDTTQSTVAWSRTDWRQDI 272 Query: 107 KFPKNQYSDKALEMIEYYELYVTI-DYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPF 165 ++ M+ YE Y+ D + ++L +VI AG +IL EE +PF Sbjct: 273 DADIGTDTEDIASMVWVYEHYIRTGVLDKNKESKLYQVIQAGE----HILHTEEVTHIPF 328 Query: 166 TCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESV 225 P F G+S+ +IQ ++T L+R +DN+ N + G+ D S+ Sbjct: 329 VTFCPYPIPGSFYGQSVYDITKDIQDLRTALVRGYIDNVNNANYGRYKALVGA-YDRRSL 387 Query: 226 LNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEI 285 L+ + G + + I + + +L ++ RTG++ + G +P++ Sbjct: 388 LDNRPGGVVEMERQDAID---LFPYHNLPQGIDGLLGMSEELKETRTGVTKLGMGINPDV 444 Query: 286 LQNMTA-TATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQW 343 +N A L+ + ++ ++ R +A G+ L RG+ LI ++ + V+ Sbjct: 445 FKNDNAYATVGLMMNAAQNRLRMVCRNIAHNGMVELMRGIYSLIRENGEVPIEVQTPRGM 504 Query: 344 VSFDPR 349 V +P+ Sbjct: 505 VQVNPK 510 >gi|260753098|ref|YP_003225991.1| hypothetical protein Za10_0861 [Zymomonas mobilis subsp. mobilis NCIMB 11163] gi|258552461|gb|ACV75407.1| hypothetical protein Za10_0861 [Zymomonas mobilis subsp. mobilis NCIMB 11163] Length = 729 Score = 177 bits (449), Expect = 2e-42, Method: Composition-based stats. 
Identities = 98/356 (27%), Positives = 164/356 (46%), Gaps = 21/356 (5%) Query: 2 ALNYFIHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDS 61 AL + + D+++ + D G +++ + R Q + + +E+ + + Sbjct: 167 ALAALLMEAEDNPDIQI---TLNNDDGSGQYEVTVTRYQLQKRYVDMPIPSEEYRVSART 223 Query: 62 VDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPII-----SSQNIENTWKFPK--NQYS 114 + + Y T SDLISMG+DR+ + +LP S + W+ + S Sbjct: 224 RHEDDADYQAHVSYKTLSDLISMGFDRDIVESLPSDKSFPNSDGRSDARWRDESFLSGSS 283 Query: 115 DKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAP 174 D+A + YE YV ID DGDGIAEL ++ +L EE +E PF Sbjct: 284 DQANREVLLYEEYVRIDRDGDGIAELLQIFRVKDV----LLSIEEVDEAPFVVWTPFPRA 339 Query: 175 HCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSI--IDPESVLNPQFGK 232 H IG SLA +++IQ++K+VL+RQ LD +Y N P+ V + + +L + G Sbjct: 340 HRMIGNSLAEKVMDIQRVKSVLMRQALDGVYQTNAPRMAVNVDGLTEDTFDDLLTIRPGA 399 Query: 233 PIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTAT 292 +R +++ I+KS M+ Y+ RTGI+ ++ G + L N TAT Sbjct: 400 IVRYRG---GIPPTPLNAGFDIQKSLGMIEYMQSAQESRTGITRLNQGLDADSL-NKTAT 455 Query: 293 ATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 +L++ G E + R AQ L LF+ L L+I D +++ + + DP Sbjct: 456 GQALLQAQGQQMEEYVARNFAQSLGRLFQKKLWLMIASGD-PMAIKVEGLYKTVDP 510 >gi|56551276|ref|YP_162115.1| hypothetical protein ZMO0380 [Zymomonas mobilis subsp. mobilis ZM4] gi|56542850|gb|AAV89004.1| hypothetical protein ZMO0380 [Zymomonas mobilis subsp. mobilis ZM4] Length = 729 Score = 177 bits (449), Expect = 2e-42, Method: Composition-based stats. 
Identities = 98/356 (27%), Positives = 164/356 (46%), Gaps = 21/356 (5%) Query: 2 ALNYFIHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDS 61 AL + + D+++ + D G +++ + R Q + + +E+ + + Sbjct: 167 ALAALLMEAEDNPDIQI---TLNNDDGSGQYEVTVTRYQLQKRYVDMPIPSEEYRVSART 223 Query: 62 VDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPII-----SSQNIENTWKFPK--NQYS 114 + + Y T SDLISMG+DR+ + +LP S + W+ + S Sbjct: 224 RHEDDADYQAHVSYKTLSDLISMGFDRDIVESLPSDKSFPNSDGRSDARWRDESFLSGSS 283 Query: 115 DKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAP 174 D+A + YE YV ID DGDGIAEL ++ +L EE +E PF Sbjct: 284 DQANREVLLYEEYVRIDRDGDGIAELLQIFRVKDV----LLSIEEVDEAPFVVWTPFPRA 339 Query: 175 HCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSI--IDPESVLNPQFGK 232 H IG SLA +++IQ++K+VL+RQ LD +Y N P+ V + + +L + G Sbjct: 340 HRMIGNSLAEKVMDIQRVKSVLMRQALDGVYQTNAPRMAVNVDGLTEDTFDDLLTIRPGA 399 Query: 233 PIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTAT 292 +R +++ I+KS M+ Y+ RTGI+ ++ G + L N TAT Sbjct: 400 IVRYRG---GIPPTPLNAGFDIQKSLGMIEYMQSAQESRTGITRLNQGLDADSL-NKTAT 455 Query: 293 ATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 +L++ G E + R AQ L LF+ L L+I D +++ + + DP Sbjct: 456 GQALLQAQGQQMEEYVARNFAQSLGRLFQKKLWLMIASGD-PMAIKVEGLYKTVDP 510 >gi|241760934|ref|ZP_04759023.1| hypothetical protein ZmobDRAFT_0099 [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241374553|gb|EER64014.1| hypothetical protein ZmobDRAFT_0099 [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 729 Score = 176 bits (445), Expect = 5e-42, Method: Composition-based stats. 
Identities = 98/356 (27%), Positives = 164/356 (46%), Gaps = 21/356 (5%) Query: 2 ALNYFIHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDS 61 AL + + D+++ + D G +++ + R Q + + +E+ + + Sbjct: 167 ALAALLMEAEDNPDIQI---TLNSDNGSGQYEVTVTRYQLQKRYVDMPIPSEEYRVSART 223 Query: 62 VDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPII-----SSQNIENTWKFPK--NQYS 114 + + Y T SDLISMG+DR+ + +LP S + W+ + S Sbjct: 224 RHEDDADYQAHVSYKTLSDLISMGFDRDIVESLPSDKSFPNSDGRSDARWRDESFLSGSS 283 Query: 115 DKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAP 174 D+A + YE YV ID DGDGIAEL ++ +L EE +E PF Sbjct: 284 DQANREVLLYEEYVRIDRDGDGIAELLQIFRVKDV----LLSIEEVDEAPFVVWTPFPRA 339 Query: 175 HCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSI--IDPESVLNPQFGK 232 H IG SLA +++IQ++K+VL+RQ LD +Y N P+ V + + +L + G Sbjct: 340 HRMIGNSLAEKVMDIQRVKSVLMRQALDGVYQTNAPRMAVNVDGLTEDTFDDLLTIRPGA 399 Query: 233 PIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTAT 292 +R +++ I+KS M+ Y+ RTGI+ ++ G + L N TAT Sbjct: 400 IVRYRG---GIPPTPLNAGFDIQKSLGMIEYMQSAQESRTGITRLNQGLDADSL-NKTAT 455 Query: 293 ATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 +L++ G E + R AQ L LF+ L L+I D +++ + + DP Sbjct: 456 GQALLQAQGQQMEEYVARNFAQSLGRLFQKKLWLMIASGD-PMAIKVEGLYKTVDP 510 >gi|316934283|ref|YP_004109265.1| putative portal protein [Rhodopseudomonas palustris DX-1] gi|315601997|gb|ADU44532.1| putative portal protein [Rhodopseudomonas palustris DX-1] Length = 673 Score = 171 bits (433), Expect = 1e-40, Method: Composition-based stats. 
Identities = 76/352 (21%), Positives = 144/352 (40%), Gaps = 20/352 (5%) Query: 7 IHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEK 66 + DVEV + E G + R + V+ V P+EF + Sbjct: 160 AQAITSQEDVEV-DLELDEATG--TYSGSWTRVTDTSGLRVEVVPPEEFYSDASKKRRQD 216 Query: 67 SPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKN--------QYSDKAL 118 GRK TR++LIS GY R+ ++ + + S ++ + L Sbjct: 217 GTR-GRKTLKTRAELISEGYPRDKVSKVRVSSEIEFDSERQERDRETNDGIGSDAPQSEL 275 Query: 119 EMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFI 178 + I +E ++ + GDG A L R++ A G ++ E + F +R PH Sbjct: 276 DQILVHETFIQLSLKGDGKASLYRIVHADG----HLFEMGEVADDNFLDFVPLRRPHSQF 331 Query: 179 GESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAA 238 G + + I+ Q +TV+ R LD+ N P+ V S+ +P+ +L+ + + V Sbjct: 332 GNNFSKRIVPTQNARTVITRSILDHAATVNNPRWTVLNNSLSNPKELLDARLRGVVNVKN 391 Query: 239 GMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATAT-SLI 297 I + + F +L L + TGIS +S G + + + + + + + Sbjct: 392 RDAIGI---LPYPQLNNAVFPLLEMLKTNKEETTGISSLSQGLNKDAISSQNSQGMVNDL 448 Query: 298 EQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 + ++I R A L LF +++I++Q + ++ + + + DPR Sbjct: 449 ITVSQTRQKIIARNFAMFLHDLFLAARKVVIENQTRKKVWEFDNNFQNIDPR 500 >gi|307308935|ref|ZP_07588618.1| hypothetical protein SinmeBDRAFT_4502 [Sinorhizobium meliloti BL225C] gi|306900569|gb|EFN31182.1| hypothetical protein SinmeBDRAFT_4502 [Sinorhizobium meliloti BL225C] Length = 677 Score = 156 bits (393), Expect = 6e-36, Method: Composition-based stats. 
Identities = 68/341 (19%), Positives = 141/341 (41%), Gaps = 12/341 (3%) Query: 16 VEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSV-DIEK----SPIV 70 +E + GG +V D++IR + + V V P++ ++ D+ D E + + Sbjct: 173 IEESGEPYTIPGGVQVRDVKIRTVTRRSCINVFPVDPEDAVLSTDAQFDPETGGIRAKLQ 232 Query: 71 GRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTI 130 G + ++RS LI +G+D+ +++ +P ++ + + K+ ++A + V Sbjct: 233 GHRKIMSRSVLIDLGFDKATVDRIPGVNEKTDGIALERLKDVSGERAFDKDMVEVYTVYT 292 Query: 131 DYDGDGIAELRRVIMAGGTGKDNILCNEEWNEL-PFTCLRAMRAPHCFIGESLAASIIEI 189 D + R+ G + +L EE P+ G+ +A I E Sbjct: 293 RLKLDTTSRHYRITFGGDSANPILLDYEETTRFYPYAAFVPYPLAGTLFGQGIADRIGED 352 Query: 190 QKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIH 249 + + + R D+L P T+V + + + + N GK IR ++ + + Sbjct: 353 HEKISKMERAVQDSLNMSVFPITVVDD-DVSSIDDLTNLHPGKVIRSSSPNGG--INFVQ 409 Query: 250 SVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIV 309 ++ ++ L+Q+L TG+ LQ TATA + +E + Sbjct: 410 HPFTGAQATGIIERLEQKLDFSTGVGPQMMTLDASDLQRTTATAINQRSNQQQTLIETVS 469 Query: 310 RTLAQ-GLEILFRGLLRLIIQHQDKVRMV--RLRDQWVSFD 347 R A+ G L + ++ L++Q D+ + + RL ++ D Sbjct: 470 RFFAETGYRYLTKVIVDLLVQKPDESQELIGRLTGNFIPVD 510 >gi|291334834|gb|ADD94474.1| hypothetical protein CLIBASIA_05245 [uncultured phage MedDCM-OCT-S06-C1041] Length = 265 Score = 155 bits (390), Expect = 1e-35, Method: Composition-based stats. 
Identities = 82/263 (31%), Positives = 133/263 (50%), Gaps = 12/263 (4%) Query: 84 MGYDRESINNLPIISSQNIEN--------TWKFPKNQYSDKALEMIEYYELYVTIDYDGD 135 MGYD E I +LP S E + P + S++++ E Y+ ID DGD Sbjct: 1 MGYDVELIRSLPFDESAMTEEELARRNKTDEEEPFDYVSEESMRNYFITECYIKIDRDGD 60 Query: 136 GIAELRRVIMAGG---TGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKI 192 IAEL RV +AGG +G +L EE + +PF + PH F G S+A +++Q+I Sbjct: 61 DIAELLRVTLAGGNYTSGSSRLLGIEEVDHMPFATCSPILMPHKFYGLSIADITMDLQRI 120 Query: 193 KTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVP 252 K+VL RQ LDN Y N +T V + + + + + G G + + I P Sbjct: 121 KSVLTRQMLDNTYLANNSRTAVNDSHVNLDDLLTSRPGGVVRYKGEGSASQYITPIPHNP 180 Query: 253 MIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTL 312 + ++++M+ YLD RTG+ D ++G L N+ +L + ++ELI R L Sbjct: 181 LPNEAYTMMGYLDDVRRQRTGVGDETAGLGENSLSNVNTGVAALAFDAKRMKIELIARIL 240 Query: 313 AQ-GLEILFRGLLRLIIQHQDKV 334 + G + +FR + +L+++HQD+ Sbjct: 241 GEVGFKDVFRLIHKLLMKHQDRK 263 >gi|316995429|gb|ADU79210.1| hypothetical protein EcP1_gp59 [Enterobacter phage EcP1] Length = 719 Score = 151 bits (380), Expect = 2e-34, Method: Composition-based stats. 
Identities = 53/348 (15%), Positives = 108/348 (31%), Gaps = 19/348 (5%) Query: 10 LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSV-DIEKSP 68 L + V +E E E K Q + V+ ++ + I P D++K+ Sbjct: 206 LEQGFAVRAVETGEMEKVTE--------TKVLQNQPYVEVLNIENVYIDPSCQGDMDKAT 257 Query: 69 IVGRKLYLTRSDLISMGYDRESI-------NNLPIISSQNIENTWKFPKNQYSDKALEMI 121 V + + ++L G + + L S + T S K+ + Sbjct: 258 FVIHRFETSIAELKKSGNYKNLDKLTVKDSDELIPSISDDEIKTSTPTDYNISGKSRKRF 317 Query: 122 EYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGES 181 E + D D G+ V G + PF + + GE Sbjct: 318 NVTEYWGYYDIDDSGVLTPIVVAYVGDVKIRCSENPYPHGKPPFVVIPYLPMDSSVYGEP 377 Query: 182 LAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPE-SVLNPQFGKPIRVAAGM 240 A I + Q I R +D + Q I+++ Sbjct: 378 DAELIYDNQAIIGASTRAMIDLVARSANGQNIIRKDVFDPVNYRKFMAGEDAQSNPLNVP 437 Query: 241 DIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQS 300 ++ + + + ++ + E +G+ S G S L A + + Sbjct: 438 LAEAIRTVTTPEVPSIIPGLIQQQNNEAESLSGVKAFSEGISSGSL-GDVAAGIRGVLDA 496 Query: 301 GVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQ-WVSFD 347 + I+R L +G+ L R ++ + + ++R+ + +V Sbjct: 497 SSKREMSILRRLKKGMVDLGRMIIAMNQEFLTDEEIIRITNDAFVHVK 544 >gi|119952228|ref|YP_950537.1| 94 kDa protein [Enterobacteria phage N4] gi|117650947|gb|ABK54420.1| 94 kDa protein [Enterobacteria phage N4] Length = 763 Score = 145 bits (365), Expect = 1e-32, Method: Composition-based stats. 
Identities = 57/345 (16%), Positives = 125/345 (36%), Gaps = 15/345 (4%) Query: 17 EVLEHSHR--EDGGEKVHDLRIRRKYSQGKVCVDAVS------PDEFLIHPDSV-DIEKS 67 E ++ S R ++ G+ + ++ ++ +V + P+ +I P DI K+ Sbjct: 202 EAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEMLNPENIIIDPSCQGDINKA 261 Query: 68 PIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKN----QYSDKALEMIEY 123 ++DL+ ++N + SS + Q SD + + Sbjct: 262 MFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVA 321 Query: 124 YELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLA 183 YE + D +G+G+ E G T +LPF + M GE A Sbjct: 322 YEYWGFWDIEGNGVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDA 381 Query: 184 ASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIR 243 + + Q + ++R +D L Q + +G + S + + Sbjct: 382 ELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGMLDALNSRRYREGEDYEYNPTQNPAQ 441 Query: 244 SVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVG 303 ++ + + + +M +QE TG+ + G + E A + + Sbjct: 442 MIIEHKFPELPQSALTMATLQNQEAESLTGVKAFAGGVTGESY-GDVAAGIRGVLDAASK 500 Query: 304 QVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRD-QWVSFD 347 + I+R LA+G+ + ++ + + +VR+ + ++V+ Sbjct: 501 REMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIK 545 >gi|227822445|ref|YP_002826417.1| hypothetical protein NGR_c19000 [Sinorhizobium fredii NGR234] gi|227341446|gb|ACP25664.1| hypothetical protein NGR_c19000 [Sinorhizobium fredii NGR234] Length = 361 Score = 144 bits (362), Expect = 2e-32, Method: Composition-based stats. 
Identities = 116/189 (61%), Positives = 146/189 (77%), Gaps = 1/189 (0%) Query: 163 LPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDP 222 +PF L R PH G S+ + EIQ++KTVL+RQTLDNLYWQN Q IVQEG+I +P Sbjct: 1 MPFADLIIERRPHQREGGSVTDDMAEIQRVKTVLMRQTLDNLYWQNNQQPIVQEGAIANP 60 Query: 223 ESVLNPQFGKPIRVAAGMDIRSVLGIHS-VPMIEKSFSMLHYLDQELVDRTGISDISSGF 281 ESVLNP+FG+PIRV+ G+D R+ LG + ++SF+ML YLDQE DRTGISD SSG Sbjct: 61 ESVLNPKFGQPIRVSQGIDARAALGYTMVPFVAKESFAMLSYLDQEATDRTGISDASSGL 120 Query: 282 SPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRD 341 +P+ L NMTA AT+LIEQ+G+GQ EL+VRT AQGL +F+GLLRL+I+HQD+ R VRLR Sbjct: 121 APDALTNMTARATALIEQAGIGQTELMVRTFAQGLRRVFKGLLRLVIKHQDRPRAVRLRG 180 Query: 342 QWVSFDPRY 350 QWV+FDPR+ Sbjct: 181 QWVTFDPRH 189 >gi|237651609|ref|YP_002899079.1| putative portal protein [Roseophage DSS3P2] gi|220898079|gb|ACL81337.1| N4 94kDa-like protein [Silicibacter phage DSS3phi2] Length = 800 Score = 143 bits (361), Expect = 3e-32, Method: Composition-based stats. Identities = 48/341 (14%), Positives = 113/341 (33%), Gaps = 16/341 (4%) Query: 19 LEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVD-IEKSPIVGRKLYLT 77 ++ + + + + R + V + V+ + P EK+ + T Sbjct: 239 MQQLVKAEPDGIIETIEERMVKNCPSVRIINVA--NLFVDPSCEGEWEKAQYMIYTYEAT 296 Query: 78 RSDLISMGYDRESIN-----------NLPIISSQNIENTWKFPKNQYSDKALEMIEYYEL 126 S+L + ++++ N ++ + + + YE Sbjct: 297 PSELKAKKNYYQNLDKVNWESAKIQSNHGNPDHESNTPNNDMRTSGTGSADKQKVLVYEY 356 Query: 127 YVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASI 186 + D +G+ V G T + PF + M GE+ A+ + Sbjct: 357 WGLYDIYANGVMVPIVVTWVGETIIEMRENPFPDKRPPFVIVPYMPILKSVFGEADASLL 416 Query: 187 IEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVL 246 + Q+I + R +D + QT +G + Q G ++ Sbjct: 417 QDNQRIIGAVTRGVIDLMGRSANAQTGYAKGFLDPVNKRRFTQGEDFEFNPNGDPKANIR 476 Query: 247 GIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVE 306 + + + + + + E TG+ S G S + AT S + Sbjct: 477 QMEYPEIPRSAHETIQWQNAEAEALTGVKSFSGGISGDAY-GRVATGIRGALDSASQREM 535 Query: 307 
LIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRD-QWVSF 346 I+R LA+G++ + ++ + + + ++R+ + ++V Sbjct: 536 SILRRLAKGIQDIGMKMISMNGKFLSEKEIIRVTNREFVEV 576 >gi|282599474|ref|YP_003358364.1| N4 gp59-like protein [Pseudomonas phage LUZ7] gi|259048573|emb|CAZ66223.1| N4 gp59-like protein [Pseudomonas phage LUZ7] Length = 720 Score = 143 bits (360), Expect = 4e-32, Method: Composition-based stats. Identities = 49/320 (15%), Positives = 112/320 (35%), Gaps = 9/320 (2%) Query: 34 LRIRRKYSQGKVCVDAVSPDEFLIHPDSV-DIEKSPIVGRKLYLTRSDLISMGYDR--ES 90 +++++ + + +I P D+ K+ V + ++L + G E Sbjct: 224 VKVQKTIVN-QPTLKVCDFRNIVIDPSCNGDMNKAKFVVESFESSYAELKADGRYSNLEK 282 Query: 91 INNLPII---SSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAG 147 IN + ++D++ + + +E + D GDG G Sbjct: 283 INEQNSDILSQPDYATGSESVRNFDFADRSRKRLVVHEYWGYYDIHGDGELHSIVATWVG 342 Query: 148 GTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQ 207 L ++P+ + G+S + +I+ QKI + R +D + Sbjct: 343 QVLIRLELNPFPDGKIPYVVAAYLPVKDSVYGDSDGSLLIDNQKIVGAISRGMIDIMAQS 402 Query: 208 NQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQE 267 Q Q+G++ + ++ + + ML+ E Sbjct: 403 ANGQVGFQKGALDITNRRRYERGETYEFNPGNNPATAIYTHTFQEIPRSAEYMLNQQQLE 462 Query: 268 LVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLI 327 TG+ ++G S + L TAT + + I+R L+ L + R ++ + Sbjct: 463 AESMTGVKAFNTGISGQAL-GDTATGIRGALDAASKRELGILRRLSDCLIEVGRRVIAMN 521 Query: 328 IQHQDKVRMVRLRDQ-WVSF 346 + D ++R+ ++ +V+ Sbjct: 522 AEFLDDEEVIRITNEGFVTV 541 >gi|308516960|emb|CBW47065.1| structural protein, N4 gp59-like [Roseovarius sp. 217 phage 1] Length = 801 Score = 141 bits (354), Expect = 2e-31, Method: Composition-based stats. 
Identities = 52/341 (15%), Positives = 110/341 (32%), Gaps = 16/341 (4%) Query: 19 LEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSV-DIEKSPIVGRKLYLT 77 + + V + R + V + ++ + P D E+S + T Sbjct: 239 IGQPVTAEPDGVVETVEERMVKNCPSVRIVNIA--NLFVDPSCEGDWEQSQYMVYTYEAT 296 Query: 78 RSDLI-SMGYDRESIN----------NLPIISSQNIENTWKFPKNQYSDKALEMIEYYEL 126 +S+L+ G + N N ++ + + + YE Sbjct: 297 KSELMAKKGTYQNLENVNWESAKIQSNAGNPDHESNTPNNDMRTSGTGATDKQKVLVYEY 356 Query: 127 YVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASI 186 + D +GI V G T + PF + M GE+ A+ + Sbjct: 357 WGLYDIYDNGIMVPIVVTWVGETIIEMRENPFPDKRPPFVIVPYMPILKSVFGEADASLL 416 Query: 187 IEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVL 246 + Q+I + R +D + QT +G + G ++ Sbjct: 417 QDNQRIIGAVTRGVIDLMGRSANAQTGYAKGFLDPVNKRRFVNGEDFEFNPNGDPKANIR 476 Query: 247 GIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVE 306 + + + + + E TG+ S G S + AT S + Sbjct: 477 QMEYPEIPRSAHETIQMQNAEAEALTGVKSFSGGISGDAY-GSVATGIRGALDSAATREM 535 Query: 307 LIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRD-QWVSF 346 I+R LA+G++ + ++ + + + +VR+ + ++V Sbjct: 536 SILRRLAKGMQAIGTKMIAMNAKFLSEKEIVRVTNEEFVEV 576 >gi|237651526|ref|YP_002898997.1| putative portal protein [Roseophage EE36P1] gi|220898158|gb|ACL81415.1| N4 gp59 protein [Sulfitobacter phage EE36phi1] Length = 800 Score = 141 bits (354), Expect = 2e-31, Method: Composition-based stats. 
Identities = 47/342 (13%), Positives = 115/342 (33%), Gaps = 18/342 (5%) Query: 19 LEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVD-IEKSPIVGRKLYLT 77 ++ + + + + R + V + V+ + P EK+ + T Sbjct: 239 MQQLVKAEPDGVIETIEERMVKNCPSVRIINVA--NLFVDPSCEGEWEKAQYMIYTYEAT 296 Query: 78 RSDLISMGYDRESIN-----------NLPIISSQNIENTWKFPKNQYSDKALEMIEYYEL 126 S+L + ++++ N ++ + + + YE Sbjct: 297 PSELKAKKDYYQNLDQVNWESAKIQSNHGNPDHESKTPNNDMRTSGTGSADKQKVLVYEY 356 Query: 127 YVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASI 186 + D +G+ V G T + PF + M GE+ A+ + Sbjct: 357 WGLYDIYNNGVMVPIVVTWVGETIIEMRENPFPDKRPPFVIVPYMPILKSVFGEADASLL 416 Query: 187 IEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVL 246 + Q+I + R +D + QT +G + G+ D ++ + Sbjct: 417 QDNQRIIGAVTRGVIDLMGRSANAQTGYAKGFLDPVNKR-RFTNGEDFEFNPNGDPKANI 475 Query: 247 -GIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQV 305 + + + + + + E TG+ S G + + AT S + Sbjct: 476 RQMEYPEIPRSAHETIQWQNAEAEALTGVKSFSGGITGDAY-GRVATGIRGALDSAAQRE 534 Query: 306 ELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRD-QWVSF 346 I+R LA+G++ + ++ + + + ++R+ + ++V Sbjct: 535 MSILRRLAKGIQDIGMKMIAMNGKFLSEKEIIRVTNREFVEV 576 >gi|307545235|ref|YP_003897714.1| Haemophilus-specific protein, uncharacterized [Halomonas elongata DSM 2581] gi|307217259|emb|CBV42529.1| Haemophilus-specific protein, uncharacterized [Halomonas elongata DSM 2581] Length = 749 Score = 138 bits (348), Expect = 9e-31, Method: Composition-based stats. 
Identities = 51/320 (15%), Positives = 102/320 (31%), Gaps = 30/320 (9%) Query: 44 KVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMG----YDRESINNLPIISS 99 + + VSP + PD+ I+ + + TRS L + Y ++I + Sbjct: 240 RPEFERVSPFDMYPSPDATSIDDGAFIIERARFTRSQLNQLIGVPSYSEDAIRQVLHQYG 299 Query: 100 QNIENTWKFPKNQYSDKALEMIEYYELYVTID-----------------YDGDGIAEL-- 140 Q W + + ++ E+ TID D I + Sbjct: 300 QGGLRDWLWSDGERAELEGRGHEWLTPGETIDGLIYSGGAQGVTLLQWGISPDEIEDPLA 359 Query: 141 ---RRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLL 197 I+ G + + P+ P F G+ + + ++Q + Sbjct: 360 EYEVEAILIGQHVIRVRINRDPLERRPYHKSSFQPVPGSFWGQGIPELMADVQDVCNATA 419 Query: 198 RQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAG---MDIRSVLGIHSVPMI 254 R ++NL + PQ V E + E + K R A + ++ Sbjct: 420 RGLVNNLAISSGPQVEVYEDRLQPQEDPTDIYPWKIWRTKASIETGNNPALRFFQPQSNA 479 Query: 255 EKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ 314 + ++ + + T I G TA+ S++ +S ++ +R + + Sbjct: 480 SELLAVYEQFEYRADESTNIPRYMYGSDEAGGAGQTASGLSMLMESANKGIKDAIRHIDR 539 Query: 315 G-LEILFRGLLRLIIQHQDK 333 G L + L +Q D Sbjct: 540 GVLRRVIEALWLHNMQFSDD 559 >gi|282598927|ref|YP_003358477.1| N4 gp59-like protein [Pseudomonas phage LIT1] gi|259048687|emb|CAZ66336.1| N4 gp59-like protein [Pseudomonas phage LIT1] Length = 726 Score = 136 bits (342), Expect = 5e-30, Method: Composition-based stats. 
Identities = 50/300 (16%), Positives = 105/300 (35%), Gaps = 8/300 (2%) Query: 54 EFLIHPDS-VDIEKSPIVGRKLYLTRSDLISMGYDR--ESINNLPI---ISSQNIENTWK 107 +I P D K+ + + ++L + G + + I + Sbjct: 249 NIVIDPSCGSDFSKAKFLIETFESSYAELKADGRYQNLDKIQVEGQNLLSEPDYTGPSEG 308 Query: 108 FPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTC 167 + DK+ + + +E + D GDG+ G +P+ Sbjct: 309 VRNFDFQDKSRKRLVVHEYWGYYDIHGDGVLHPIVATWVGAVMIRMEENPFPDKRIPYVV 368 Query: 168 LRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLN 227 + + GES A +I+ Q+I + R +D + Q V +G++ Sbjct: 369 VNYIPRKRDLYGESDGALLIDNQRIIGAVTRGMIDTMARSANGQVGVMKGALDVTNRRRF 428 Query: 228 PQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQ 287 + +V + + + M++ E TG+ ++G S L Sbjct: 429 DRGENYEFNPGADPRAAVHMHTFPEIPQSAQYMINLQQAEAESMTGVKAFNAGISGAAL- 487 Query: 288 NMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQ-WVSF 346 TATA + + I+R L+ G+ + R ++ + + D V +VR+ ++ +V Sbjct: 488 GDTATAVRGALDAASKRELGILRRLSAGIIEIGRKIIAMNAEFLDDVEVVRITNEHFVDI 547 >gi|326573143|gb|EGE23112.1| putative portal protein [Moraxella catarrhalis CO72] Length = 806 Score = 134 bits (337), Expect = 2e-29, Method: Composition-based stats. 
Identities = 48/313 (15%), Positives = 100/313 (31%), Gaps = 5/313 (1%) Query: 37 RRKYSQGKVCVDAVSPDEFLIHPDSV-DIEKSPIVGRKLYLTRSDLISMGYDRES-INNL 94 R ++ V + + I P + E + V + S+L G + Sbjct: 288 RVIVNKPTVDICNLK--NVFIDPTCRGNFENAQFVVHAYESSLSELKKQGIYQNLGYLME 345 Query: 95 PIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNI 154 + N + ++ D A + YE + D +G G T Sbjct: 346 QQSQADNSIDKPSDDVFKFQDNARRKLTVYEYWGYWDIHDNGETTAIVCAWVGDTIIRME 405 Query: 155 LCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIV 214 +LPF + G A + + Q+I + R +D L QT Sbjct: 406 ENPFPKGKLPFVVFNYLPEEESIWGIPNAELLGDNQEILGAVTRGMIDLLGKSANSQTAF 465 Query: 215 QEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGI 274 + + V V + + M+H ++ E +G+ Sbjct: 466 PKNFLDSANKVKYSTGQDYEYNQGFDPRVHVHTHTFPEIPNSAMMMVHSMNNEAESLSGV 525 Query: 275 SDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKV 334 SS +ATA + + + I+R +++G + R ++ + + + Sbjct: 526 KAFSSQGISASHLGDSATAARGVLDAVSKREMSILRRISEGFIQMGRFIMAMNSEFLSEK 585 Query: 335 RMVRLRD-QWVSF 346 +VR+ + ++V+ Sbjct: 586 EIVRITNKEFVTI 598 >gi|326567485|gb|EGE17600.1| putative portal protein [Moraxella catarrhalis BC1] Length = 806 Score = 134 bits (337), Expect = 2e-29, Method: Composition-based stats. 
Identities = 48/313 (15%), Positives = 100/313 (31%), Gaps = 5/313 (1%) Query: 37 RRKYSQGKVCVDAVSPDEFLIHPDSV-DIEKSPIVGRKLYLTRSDLISMGYDRES-INNL 94 R ++ V + + I P + E + V + S+L G + Sbjct: 288 RVIVNKPTVDICNLK--NVFIDPTCRGNFENAQFVVHAYESSLSELKKQGIYQNLGYLME 345 Query: 95 PIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNI 154 + N + ++ D A + YE + D +G G T Sbjct: 346 QQSQADNSIDKPSDDVFKFQDNARRKLTVYEYWGYWDIHDNGETTAIVCAWVGDTIIRME 405 Query: 155 LCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIV 214 +LPF + G A + + Q+I + R +D L QT Sbjct: 406 ENPFPKGKLPFVVFNYLPEEESIWGIPNAELLGDNQEILGAVTRGMIDLLGKSANSQTAF 465 Query: 215 QEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGI 274 + + V V + + M+H ++ E +G+ Sbjct: 466 PKNFLDSANKVKYSTGQDYEYNQGFDPRVHVHTHTFPEIPNSAMMMVHSMNNEAESLSGV 525 Query: 275 SDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKV 334 SS +ATA + + + I+R +++G + R ++ + + + Sbjct: 526 KAFSSQGISASHLGDSATAARGVLDAVSKREMSILRRISEGFIQMGRFIMAMNSEFLSEK 585 Query: 335 RMVRLRD-QWVSF 346 +VR+ + ++V+ Sbjct: 586 EIVRITNKEFVTI 598 >gi|326562389|gb|EGE12709.1| putative portal protein [Moraxella catarrhalis 103P14B1] Length = 806 Score = 133 bits (335), Expect = 3e-29, Method: Composition-based stats. 
Identities = 48/313 (15%), Positives = 100/313 (31%), Gaps = 5/313 (1%) Query: 37 RRKYSQGKVCVDAVSPDEFLIHPDSV-DIEKSPIVGRKLYLTRSDLISMGYDRES-INNL 94 R ++ V + + I P + E + V + S+L G + Sbjct: 288 RVIVNKPTVDICNLK--NVFIDPTCKGNFENAQFVVHAYESSLSELKKQGIYQNLGYLME 345 Query: 95 PIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNI 154 + N + ++ D A + YE + D +G G T Sbjct: 346 QHAQADNSIDKPSDDVFKFQDNARRKLTVYEYWGYWDIHDNGETTAIVCAWVGDTIIRME 405 Query: 155 LCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIV 214 +LPF + G A + + Q+I + R +D L QT Sbjct: 406 ENPFPKGKLPFVVFNYLPEEESIWGIPNAELLGDNQEILGAVTRGMIDLLGKSANSQTAF 465 Query: 215 QEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGI 274 + + V V + + M+H ++ E +G+ Sbjct: 466 PKNFLDSANKVKYSTGQDYEYNQGFDPRVHVHTHTFPEIPNSAMMMVHSMNNEAESLSGV 525 Query: 275 SDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKV 334 SS +ATA + + + I+R +++G + R ++ + + + Sbjct: 526 KAFSSQGISASHLGDSATAARGVLDAVSKREMSILRRISEGFIQMGRFIMAMNSEFLSEK 585 Query: 335 RMVRLRD-QWVSF 346 +VR+ + ++V+ Sbjct: 586 EIVRITNKEFVTI 598 >gi|113461527|ref|YP_719596.1| hypothetical protein HS_1384 [Haemophilus somnus 129PT] gi|112823570|gb|ABI25659.1| hemophilus-specific protein, uncharacterized [Haemophilus somnus 129PT] Length = 688 Score = 131 bits (329), Expect = 2e-28, Method: Composition-based stats. 
Identities = 50/384 (13%), Positives = 114/384 (29%), Gaps = 53/384 (13%) Query: 1 MALNYFIHM---LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLI 57 +AL+Y + +++ V+ ++ D G + S+ V V P +F+ Sbjct: 111 LALHYAAVLGTGILRGPVVDTIDERIWSDDGMGNWSAQ---TKSKIVPKVRLVLPWDFVP 167 Query: 58 HPDSVDIEKSPIVGRKLYLTRSDLISM----GYDRESINNL----------PIISSQNIE 103 + ++ V + YLT+ L ++ Y +++ L Sbjct: 168 DMTAPTLKDCQFVFERSYLTKKQLQNLLNNPYYLADTVQALIESEASETHTSSSDMDGYL 227 Query: 104 NTWKFPKNQYSDKALEMIEYYELYVTIDY---------------------DGDGIAELRR 142 +T + + E + + I AE+ Sbjct: 228 DTLRTLSGLEKASNDKRYEVWTYHGGIPVSVLEQANQSLEEGYALELTEEQKSEKAEIDG 287 Query: 143 VIMAGGTGKDNILCNEEWN--ELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQT 200 VI+ G GK + + E P++ C G + + Q+I R Sbjct: 288 VIVMTGNGKILSVNLNPLDTAEFPYSVYTCEPDVACVFGFGIPYLCRDAQEILNTAWRGM 347 Query: 201 LDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLG-------IHSVPM 253 +DN Q +V + + + K R + + Sbjct: 348 IDNGVLTIGSQIVVNSSVLSPVDKSWEIKPNKLWRTNDRASANASFEAQRAFGVFNFESR 407 Query: 254 IEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLA 313 ++ +++ + + +G+ I+ G ++ T S++ + V+ Sbjct: 408 QQELANIIQLAKSFMDEESGLPMIAQGEQGQV--TPTLGGMSMLMNAANAVRRRQVKEWD 465 Query: 314 QGL-EILFRGLLRLIIQHQDKVRM 336 + + L R + D + Sbjct: 466 DQVTKPLIRRFYEYNMAMNDDPNI 489 >gi|170719076|ref|YP_001784230.1| hypothetical protein HSM_0898 [Haemophilus somnus 2336] gi|168827205|gb|ACA32576.1| Haemophilus-specific protein, uncharacterized [Haemophilus somnus 2336] Length = 725 Score = 130 bits (326), Expect = 4e-28, Method: Composition-based stats. 
Identities = 50/384 (13%), Positives = 114/384 (29%), Gaps = 53/384 (13%) Query: 1 MALNYFIHM---LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLI 57 +AL+Y + +++ V+ ++ D G + S+ V V P +F+ Sbjct: 148 LALHYAAVLGTGILRGPVVDTIDERIWSDDGMGNWSAQ---TKSKIVPKVRLVLPWDFVP 204 Query: 58 HPDSVDIEKSPIVGRKLYLTRSDLISM----GYDRESINNL----------PIISSQNIE 103 + ++ V + YLT+ L ++ Y +++ L Sbjct: 205 DMTAPTLKDCQFVFERSYLTKKQLQNLLNNPYYLADTVQALIESEASETHTSSSDMDGYL 264 Query: 104 NTWKFPKNQYSDKALEMIEYYELYVTIDY---------------------DGDGIAELRR 142 +T + + E + + I AE+ Sbjct: 265 DTLRTLSGLEKASNDKRYEVWTYHGGIPVSVLEQANQSLEEGYALELTEEQKSEKAEIDG 324 Query: 143 VIMAGGTGKDNILCNEEWN--ELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQT 200 VI+ G GK + + E P++ C G + + Q+I R Sbjct: 325 VIVMTGNGKILSVNLNPLDTAEFPYSVYTCEPDVACVFGFGIPYLCRDAQEILNTAWRGM 384 Query: 201 LDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLG-------IHSVPM 253 +DN Q +V + + + K R + + Sbjct: 385 IDNGVLTIGSQIVVNSSVLSPVDKSWEIKPNKLWRTNDRASANASFEAQRAFGVFNFESR 444 Query: 254 IEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLA 313 ++ +++ + + +G+ I+ G ++ T S++ + V+ Sbjct: 445 QQELANIIQLAKSFMDEESGLPMIAQGEQGQV--TPTLGGMSMLMNAANAVRRRQVKEWD 502 Query: 314 QGL-EILFRGLLRLIIQHQDKVRM 336 + + L R + D + Sbjct: 503 DQVTKPLIRRFYEYNMAMDDDPNI 526 >gi|319776214|ref|YP_004138702.1| hypothetical protein HICON_18250 [Haemophilus influenzae F3047] gi|317450805|emb|CBY87027.1| Putative uncharacterized protein [Haemophilus influenzae F3047] Length = 731 Score = 124 bits (311), Expect = 2e-26, Method: Composition-based stats. 
Identities = 48/386 (12%), Positives = 117/386 (30%), Gaps = 55/386 (14%) Query: 1 MALNYFIHM---LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLI 57 + L+Y + +++ V+V+E + I ++ V V P +F+ Sbjct: 151 LCLHYAAALGTGILRAPVVDVVESKAWKQDSLGNWVGEI---VNKTIPAVRLVLPWDFVP 207 Query: 58 HPDSVDIEKSPIVGRKLYLTRSDLISM----GYDRESINNL----------PIISSQNIE 103 + ++ V + ++T+ L ++ Y +ES+ L Sbjct: 208 DMTAPTLKDCQFVFERSHVTKKQLQALAKNPYYLKESVLELCELDGGDTRTASNDMDGYV 267 Query: 104 NTWKFPKNQYSDKALEMIEYYELYV------------------TIDYDGDGIA-----EL 140 +T + + E + + ++ D + E+ Sbjct: 268 DTLRTLSGLETQSKDNRYELWTYHGGIPLNVLSGANELLGEDNKLNIPDDEESRAANLEI 327 Query: 141 RRVIMAGGTGKDNILCNEEWN--ELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLR 198 VI+ G GK + + E P++ C G + + Q+I R Sbjct: 328 EGVIVMAGNGKILSVNLNPLDTAEFPYSVYTCEPDVCCLFGFGIPYLCRDAQEILNTAWR 387 Query: 199 QTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMI---- 254 +DN PQ +V + + K + + + I Sbjct: 388 GMIDNGILGIGPQAVVNSSVLTPVDGNWELAPYKLWKTNDRATVNAQFEAQRAFGIFDIG 447 Query: 255 ---EKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRT 311 ++ +++ + + +G+ I+ G ++ T S++ + V+ Sbjct: 448 SRQQELANIIQLSKSFMDEESGLPMIAQGEQGQV--TPTLGGMSMLMNAANAVRRRQVKE 505 Query: 312 LAQGL-EILFRGLLRLIIQHQDKVRM 336 + + L R + + + Sbjct: 506 WDDSVTKPLIRRFYEYNMNMSEDSSI 531 >gi|153212119|ref|ZP_01947936.1| hypothetical protein A55_1887 [Vibrio cholerae 1587] gi|124116915|gb|EAY35735.1| hypothetical protein A55_1887 [Vibrio cholerae 1587] Length = 740 Score = 123 bits (309), Expect = 3e-26, Method: Composition-based stats. 
Identities = 50/351 (14%), Positives = 108/351 (30%), Gaps = 43/351 (12%) Query: 24 REDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLIS 83 + G + + I R S AV P +F+ + I+ S + YLTR L+ Sbjct: 196 MDQTGIEQWEAVIERSAS---PSARAVMPWDFVPDMSATSIDDSEFTFERSYLTRKKLLK 252 Query: 84 -----MGYDRESINNLPIISSQNIE----------NTWKFPKNQYSDKALEMIEYYELYV 128 GY +++ L ++ N + E +E + Sbjct: 253 TMTEQAGYVAKNVRELAEKEPRDSHALTEDVLGTINQIRALNGLQPTYKDRRYEIWEYHG 312 Query: 129 TIDYD-------------GDGIAELRRVIMAGGTGKDNILCNEEWNEL--PFTCLRAMRA 173 I + +E+ VI+ G G ++ P++ A Sbjct: 313 PIPREVLQEAGLLTEEEFESTPSEVDGVIVMSGCGLILKAGINPFDTEEWPYSVYCAEED 372 Query: 174 PHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKP 233 C G + + Q I R +DN Q +V + +++ ++ + K Sbjct: 373 VSCIFGYGIPHLCSDAQSILNTAWRAMIDNGVATVGDQIVVNQSALMPADNDWSFSPLKV 432 Query: 234 IRVAAGMDIRSVLGIHSVPMI-------EKSFSMLHYLDQELVDRTGISDISSGFSPEIL 286 + + + + + +++ + + +G+ IS G ++ Sbjct: 433 WKTTDKASVSAQFEAQKAFGVFSLQNRQAEYANIISMAKAFMDEESGLPMISQGEQGQV- 491 Query: 287 QNMTATATSLIEQSGVGQVELIVRTLAQGL-EILFRGLLRLIIQHQDKVRM 336 T S++ + V+ + + L R +Q K + Sbjct: 492 -TPTLGGMSMLMNAANAVRRRQVKEWDDSVTKPLIRRFYAWNMQFSKKNEI 541 >gi|209544596|ref|YP_002276825.1| hypothetical protein Gdia_2465 [Gluconacetobacter diazotrophicus PAl 5] gi|209532273|gb|ACI52210.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 730 Score = 119 bits (299), Expect = 5e-25, Method: Composition-based stats. 
Identities = 49/318 (15%), Positives = 94/318 (29%), Gaps = 32/318 (10%) Query: 48 DAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWK 107 + V P I+ + + + ++ L Y E + + + Sbjct: 235 ETVDPLRLCINYKAKSFATAARMTEEIDL---------YPWEIEERIRAGLFLDEDYGTN 285 Query: 108 FPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNIL------------ 155 S + + E + D DGDG AE V +A +G+ + Sbjct: 286 HDDG--SQDEDAPVTFLEQHRRWDLDGDGYAEPYIVTIARDSGQLARIVAGFDADGVMFD 343 Query: 156 ----CNEEWNELPFTC-LRAMRAPHC-FIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQ 209 + +P+ + + +P + + + L Q D + N Sbjct: 344 PVTHRIRKIEAVPYYTRFQFIPSPQSAIYAMGFGSLLYPLNGAINTSLNQMFDAGHLANA 403 Query: 210 PQTIVQEG-SIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQEL 268 + G S+ K + +++ + F +L YL + Sbjct: 404 GGGFIGSGMSLNTGSVRFQVGEYKVVNTPGATLRENMVPLQFPGPSPALFQLLQYLVEAG 463 Query: 269 VDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLII 328 + I DI SG P N + Q G+ I + + + L F L RL Sbjct: 464 REIASIKDILSGAMP--GGNTPGILGLAVIQQGMKVFSAIFKRVHRALGAEFDKLYRLNR 521 Query: 329 QHQDKVRMVRLRDQWVSF 346 + RL +Q+ Sbjct: 522 LYLPDDAGYRLGEQYFEV 539 >gi|162149432|ref|YP_001603893.1| hypothetical protein GDI_3670 [Gluconacetobacter diazotrophicus PAl 5] gi|161788009|emb|CAP57613.1| hypothetical protein GDI3670 [Gluconacetobacter diazotrophicus PAl 5] Length = 907 Score = 117 bits (293), Expect = 2e-24, Method: Composition-based stats. 
Identities = 46/318 (14%), Positives = 91/318 (28%), Gaps = 33/318 (10%) Query: 48 DAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWK 107 + V P I ++ +P + ++ L Y E + + E Sbjct: 213 ETVDPLRLCIDYNAKSFAAAPRITEEIDL---------YPWEVEEKIRAGLFLDDEYGCN 263 Query: 108 FPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNIL------------ 155 + + E + D DGDG AE V +A +G+ + Sbjct: 264 HDAGD---DEDAPVTFLEQHRRYDLDGDGYAEPYIVTIARDSGRLARIVAGFESEGVIFG 320 Query: 156 ----CNEEWNELPFTC-LRAMRAPHC-FIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQ 209 + + + + +P + + L Q D + N Sbjct: 321 AADHRIRRIDAVAYYTKFPFIPSPDSAIYDIGFGTLLHPLNAAVNTSLNQMFDAAHLANA 380 Query: 210 PQTIVQEG-SIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQEL 268 + G S+ K + +++ + F +L +L Sbjct: 381 GGGFIGSGMSLNSGSVRFQIGEYKVVNTPGATLRENLVPMQFSGPNPVLFQLLGFLVDAG 440 Query: 269 VDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLII 328 + + DI SG P N+ + Q G+ I + + + L + FR L RL Sbjct: 441 REIASVKDILSGAMP--GGNVPGVLGLAVIQQGLKVFSAIFKRIHRSLGMEFRKLYRLNR 498 Query: 329 QHQDKVRMVRLRDQWVSF 346 + R ++ Sbjct: 499 IYLPDEAGFRAGAEYFRV 516 >gi|115304377|ref|YP_762669.1| PfWMP4_39 [Cyanophage Pf-WMP4] gi|113201871|gb|ABI33183.1| PfWMP4_39 [Phormidium phage Pf-WMP4] Length = 641 Score = 117 bits (293), Expect = 2e-24, Method: Composition-based stats. 
Identities = 49/339 (14%), Positives = 106/339 (31%), Gaps = 24/339 (7%) Query: 23 HREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLI 82 D D+ + R + ++ ++ +SP + + V +L TR +L Sbjct: 176 ETGDIFGGWEDVAVNR--QRSELRIEPLSPYDVWLDTSGGK-NTGTFV--RLRHTREELH 230 Query: 83 SM----GYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIA 138 + YD + + + + N ++IEYY + +G Sbjct: 231 ELVTSGYYDLDLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYY---GPLLVEGVQFW 287 Query: 139 ELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLR 198 + V G + ++ W PF + G S+ + + VL Sbjct: 288 CVHAVFY--GKQLIRLSDSKYWCGSPFVTTTLLPDRDSVYGMSVLHPNLGALHVLNVLTN 345 Query: 199 QTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSF 258 LDNL + E I+ E + + G +VA ++ + ++ + Sbjct: 346 GRLDNLVLHINKMWTLVEDGILKRED-VKAKPGAVFKVAQHGSLQPIDMGRQDFVVT--Y 402 Query: 259 SMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLE- 317 + + T + +P + +TA + +G ++ + + Sbjct: 403 QEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTL 462 Query: 318 ILFRGLLRLIIQHQDKVRMVRL------RDQWVSFDPRY 350 L + L+ Q +R+ D + P Y Sbjct: 463 PLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEY 501 >gi|209544682|ref|YP_002276911.1| hypothetical protein Gdia_2553 [Gluconacetobacter diazotrophicus PAl 5] gi|209532359|gb|ACI52296.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 707 Score = 117 bits (292), Expect = 3e-24, Method: Composition-based stats. 
Identities = 46/318 (14%), Positives = 91/318 (28%), Gaps = 33/318 (10%) Query: 48 DAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWK 107 + V P I ++ +P + ++ L Y E + + E Sbjct: 202 ETVDPLRLCIDYNAKSFAAAPRITEEIDL---------YPWEVEEKIRAGLFLDDEYGCN 252 Query: 108 FPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNIL------------ 155 + + E + D DGDG AE V +A +G+ + Sbjct: 253 HDAGD---DEDAPVTFLEQHRRYDLDGDGYAEPYIVTIARDSGRLARIVAGFESEGVIFG 309 Query: 156 ----CNEEWNELPFTC-LRAMRAPHC-FIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQ 209 + + + + +P + + L Q D + N Sbjct: 310 AADHRIRRIDAVAYYTKFPFIPSPDSAIYDIGFGTLLHPLNAAVNTSLNQMFDAAHLANA 369 Query: 210 PQTIVQEG-SIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQEL 268 + G S+ K + +++ + F +L +L Sbjct: 370 GGGFIGSGMSLNSGSVRFQIGEYKVVNTPGATLRENLVPMQFSGPNPVLFQLLGFLVDAG 429 Query: 269 VDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLII 328 + + DI SG P N+ + Q G+ I + + + L + FR L RL Sbjct: 430 REIASVKDILSGAMP--GGNVPGVLGLAVIQQGLKVFSAIFKRIHRSLGMEFRKLYRLNR 487 Query: 329 QHQDKVRMVRLRDQWVSF 346 + R ++ Sbjct: 488 IYLPDEAGFRAGAEYFRV 505 >gi|239907145|ref|YP_002953886.1| hypothetical protein DMR_25090 [Desulfovibrio magneticus RS-1] gi|239797011|dbj|BAH76000.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 682 Score = 115 bits (287), Expect = 1e-23, Method: Composition-based stats. 
Identities = 39/325 (12%), Positives = 86/325 (26%), Gaps = 28/325 (8%) Query: 44 KVCVDAVSPDEFLIHPDS-VDIEKSPIVGRKLYLTRSDLISM----GYDRESINNL---- 94 + VSP F + + + + D++ + G+D + + Sbjct: 210 RPYYRRVSPWSFYWDQSANRRMGDCRYGYEEYRMVYGDVLELAGRTGFDGDVVRAYLAEK 269 Query: 95 --PIISSQNIENTWKFPKNQYSDKALE-MIEYYELYVTIDYDGDGI------------AE 139 + + E+ + + L+ E Y + D Sbjct: 270 RDGDATEYDFESQLRSINGGTPEPQLQGRWRVLERYGWLRGDELEECGVDLGNDPVQADY 329 Query: 140 LRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQ 199 V M GG + E PF R G + + Q ++R Sbjct: 330 FCNVWMLGGKIIKAVRAPIRGVEFPFQIFPMFRDDSSLCGLGVTGVYRDAQSAINAVVRA 389 Query: 200 TLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFS 259 +DN P V ++ N + G ++ G D+ + + Sbjct: 390 MMDNARMSLGPIGGVNVPALQQTLDADNIRGGTWLKFDTGEDMSKAITFWQASSHTSDYL 449 Query: 260 MLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEIL 319 L ++ D + G T S++ + + +V+ + Sbjct: 450 ALAKYFDDMGDELTVPRWVHGDGNVSDAARTLGGLSMLMNAMSINLAEMVKIFDDEVTSQ 509 Query: 320 F-RGLLRLIIQHQDKVRMVRLRDQW 343 F L + + ++ + Sbjct: 510 FVTALYHWNMDFNPRPD---IKGDF 531 >gi|227821703|ref|YP_002825673.1| hypothetical protein NGR_c11350 [Sinorhizobium fredii NGR234] gi|227340702|gb|ACP24920.1| hypothetical protein NGR_c11350 [Sinorhizobium fredii NGR234] Length = 348 Score = 103 bits (257), Expect = 3e-20, Method: Composition-based stats. 
Identities = 71/169 (42%), Positives = 96/169 (56%), Gaps = 15/169 (8%) Query: 6 FIHMLIKDSDVEVLEHSHREDGG--------EKVHDLRIRRKYSQGKVCVDAVSPDEFLI 57 + L+ D DVEVLE ++ ++++RIRR G + AV +EFLI Sbjct: 152 ALVQLVADDDVEVLEQESYQEQIDTPQGPQSVTLYNVRIRRTKEYGCTKLAAVPLEEFLI 211 Query: 58 HPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIE-------NTWKFPK 110 HPD++ I+ SPI G K L RSDL++MGYDRE ++ SS N E F + Sbjct: 212 HPDAMSIDDSPITGIKTRLRRSDLVAMGYDREKVDKFATASSSNEEETEEFARRREPFDE 271 Query: 111 NQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEE 159 KAL+ ++YYELYV ID D DGIAELRR+ AGG + N+L +EE Sbjct: 272 KDEIIKALQEVDYYELYVKIDVDDDGIAELRRMCFAGGLAEVNLLDDEE 320 >gi|228905598|ref|ZP_04069542.1| hypothetical protein bthur0014_66580 [Bacillus thuringiensis IBL 4222] gi|228854038|gb|EEM98752.1| hypothetical protein bthur0014_66580 [Bacillus thuringiensis IBL 4222] Length = 707 Score = 103 bits (257), Expect = 3e-20, Method: Composition-based stats. Identities = 49/301 (16%), Positives = 110/301 (36%), Gaps = 13/301 (4%) Query: 42 QGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISM-GYDRESINNLPIISSQ 100 G++ P I P + E+ + + + G D + N+ ++ Sbjct: 177 TGEIRCRICDPLTVYIDPAAEMDEEIRWIVERKPRDIDYIQERYGKDVAADENVGFAAAF 236 Query: 101 NIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEW 160 ++ F + M++ + +V +AGG D +E Sbjct: 237 DVTPQNGFNSTSKKRPNMAMVDEMWVKPC-----GKHPNGLKVTIAGGQLLDI---DENA 288 Query: 161 NELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSII 220 ++PF + P E+ ++ IQ+ ++ + +V GS + Sbjct: 289 GDIPFFIFGDIPIPGSVKAEAFIKDMLPIQREINIMRSMFATHARKMGNSMWLVPMGSSV 348 Query: 221 DPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSG 280 D + + N + G I ++ + + + +L+ D ++ D +G +IS G Sbjct: 349 DEDEITNEEGG--IVHYTPIEGARPERVGAPDIPSFYDRILNNHDADIDDLSGAREISQG 406 Query: 281 FSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLR 340 P L T + SL+ + ++ + + G++ L + +L L+ +H + RM R+ Sbjct: 407 RLPSGL--DTYSGLSLMVEQENEKLAVSSQNYEHGMKRLLQRVLMLMKKHYTEERMARIL 464 Query: 341 D 341 Sbjct: 465 G 465 >gi|291529975|emb|CBK95560.1| 
hypothetical protein EUS_02210 [Eubacterium siraeum 70/3] Length = 534 Score = 103 bits (255), Expect = 6e-20, Method: Composition-based stats. Identities = 49/314 (15%), Positives = 108/314 (34%), Gaps = 17/314 (5%) Query: 42 QGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLP-IISSQ 100 G + + P DIE+S + + R L M + + ++ Sbjct: 139 MGDIAIRNADILNLFWEPGIKDIEESANLFYVTLVDRERLNLMYPELCEDDTESVAGGTE 198 Query: 101 NIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEW 160 N+E K S K + YY+ + +G +L G + +E Sbjct: 199 NVEKYKTEDKTDDSAKVEVIDWYYKKTI------NGRKQLCYCKFCGDRVIYSSEDDESC 252 Query: 161 -------NELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTI 213 + PF G + + Q L + L + ++ + Sbjct: 253 ADGFYKHSRYPFVMDTLFVQEGTPCGFGYIDVMRDAQMYIDKLSQVVLAHTVMMSRKRYF 312 Query: 214 VQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTG 273 +++ S ++ + + + + VA + + I + P+ + L + EL + +G Sbjct: 313 IRQNSAVNEAEFADLK-NRFVHVAGNLGEEDIREIKAEPLDSSVMNALSFKIDELKETSG 371 Query: 274 ISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDK 333 D S G + A+A + ++++G +++ + + ++ LI Q D Sbjct: 372 NRDFSQGSVSNGVTA--ASAIAALQEAGSKLSRDMIKGTYFAFQQVCYLIIELIRQFYDT 429 Query: 334 VRMVRLRDQWVSFD 347 R R+ + +FD Sbjct: 430 PRSFRITGGYDAFD 443 >gi|330958837|gb|EGH59097.1| hypothetical genomic island protein [Pseudomonas syringae pv. maculicola str. ES4326] Length = 699 Score = 102 bits (254), Expect = 8e-20, Method: Composition-based stats. 
Identities = 53/389 (13%), Positives = 113/389 (29%), Gaps = 62/389 (15%) Query: 14 SDVEVLEHSHREDGGEKVHDLRIRRKYSQ-GKVCVDAVSPDEFLIHPDSV--DIEKSPIV 70 + +V + G D+R+ + + G+V VD + P + + PD+ D + V Sbjct: 120 KETQVFADGMIQQRG--YFDIRMSYEDTILGEVRVDILDPLDVIPDPDANSYDPDDWADV 177 Query: 71 GRKLYLTRSDLISM-------------------------------GYDRESINNLPIISS 99 ++T+ ++ ++ G D ++ Sbjct: 178 TVTRFMTQIEIEALYGTSAKKSIEDEESDSGLIGIDGTDHDRNGFGDDEGFVDEFLSDEK 237 Query: 100 QNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEE 159 + Q+ + + + L ++I GG + Sbjct: 238 DKPGKRHRVVDRQFWQMDMAEVIITPTGDIRLVEDVKPEVLAQMIENGGIQSKRRIKRVR 297 Query: 160 W-----------NELPFTCL---RAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLY 205 W + PF L I Q++ + Q L L Sbjct: 298 WLVSTKETVLHDDWSPFNHFTVVPFFPTFRRGHTRGLVDDAIGPQQLLNKAMSQYLHVLN 357 Query: 206 WQNQPQTIVQEGSIIDPESVLNPQFGKP-----IRVAAGMDIRSVLGIHSVPMIEKSFSM 260 I G++ + G + + I + + Sbjct: 358 TSANSGWITVAGTLANMRDEELANRGSETGLHLMIKSKTPVEDRPQKIQPNQVPTGIDRL 417 Query: 261 LHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILF 320 + L TGI++ SG + + A + + Q+ + + LA+ ++L Sbjct: 418 IDRAGALLEQSTGINEAMSGNQGNEVSGI---AIQTRQFAAQQQLAVPLDNLARTRQMLA 474 Query: 321 RGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 +L +I D+ R++R+ DPR Sbjct: 475 TRMLEMIQVFYDQPRIIRIT----ETDPR 499 >gi|319956914|ref|YP_004168177.1| hypothetical protein Nitsa_1175 [Nitratifractor salsuginis DSM 16511] gi|319419318|gb|ADV46428.1| hypothetical protein Nitsa_1175 [Nitratifractor salsuginis DSM 16511] Length = 561 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. 
Identities = 47/301 (15%), Positives = 99/301 (32%), Gaps = 27/301 (8%) Query: 43 GKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNI 102 G++ ++ V + P++ ++ ++ T +L + N S Sbjct: 137 GQLRIERVKLKNMYLDPNASNVFDIQYCVHRVTTTIGNLRQQFGRKFKWKNYIGDSEDGT 196 Query: 103 ENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNE 162 S + + Y+ V L + Sbjct: 197 SYLSSADLGDASRIEVRDVYRYQSGKWY------------VSTVLPGDAFVRLDEPLKDG 244 Query: 163 LPFTCLRAMRAPHCF--------IGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIV 214 LPF G S +I +Q+ TV Q +D + + + Sbjct: 245 LPFIIGSVEPQFVRLDESNAVEAYGGSFIEPMIPLQEEYTVTRNQQIDAIAESLSKRFLA 304 Query: 215 QEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGI 274 + S ++ + +L+ + + V + + + F + LD E+ + +GI Sbjct: 305 TKTSGLNEKDLLSNRTKISVSSLNE-----VKELQAPRIDPSIFG-IDRLDSEMQEVSGI 358 Query: 275 SDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDK 333 + + G + N TAT S++ + G + IVR L + E R ++RLI ++ + Sbjct: 359 TKYNQGLNDPHNLNQTATGVSILTEEGNAVIADIVRALNESFFEPAIRRMVRLIYKYGES 418 Query: 334 V 334 Sbjct: 419 P 419 >gi|75761880|ref|ZP_00741807.1| Phage protein [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|228905318|ref|ZP_04069295.1| hypothetical protein bthur0014_63940 [Bacillus thuringiensis IBL 4222] gi|228937950|ref|ZP_04100577.1| hypothetical protein bthur0008_6260 [Bacillus thuringiensis serovar berliner ATCC 10792] gi|228970830|ref|ZP_04131470.1| hypothetical protein bthur0003_6170 [Bacillus thuringiensis serovar thuringiensis str. T01001] gi|228977404|ref|ZP_04137799.1| hypothetical protein bthur0002_6190 [Bacillus thuringiensis Bt407] gi|74490640|gb|EAO53929.1| Phage protein [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|228782381|gb|EEM30564.1| hypothetical protein bthur0002_6190 [Bacillus thuringiensis Bt407] gi|228788955|gb|EEM36894.1| hypothetical protein bthur0003_6170 [Bacillus thuringiensis serovar thuringiensis str. 
T01001] gi|228821741|gb|EEM67742.1| hypothetical protein bthur0008_6260 [Bacillus thuringiensis serovar berliner ATCC 10792] gi|228854317|gb|EEM98998.1| hypothetical protein bthur0014_63940 [Bacillus thuringiensis IBL 4222] gi|326938429|gb|AEA14325.1| Phage protein [Bacillus thuringiensis serovar chinensis CT-43] Length = 707 Score = 101 bits (252), Expect = 1e-19, Method: Composition-based stats. Identities = 49/301 (16%), Positives = 110/301 (36%), Gaps = 13/301 (4%) Query: 42 QGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISM-GYDRESINNLPIISSQ 100 G++ P I P + E+ + + + G D + N+ ++ Sbjct: 177 TGEIRCRICDPLTVYIDPAAEMDEEIRWIVERKPRDIDYIKERYGKDVAADENVGFAAAF 236 Query: 101 NIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEW 160 ++ F + M++ + +V +AGG D +E Sbjct: 237 DVTPQNGFNSTSKKRPNMAMVDEMWVKPC-----GKHPNGLKVTIAGGQLLDI---DENA 288 Query: 161 NELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSII 220 ++PF + P E+ ++ IQ+ ++ + +V GS + Sbjct: 289 GDIPFFIFGDIPIPGSVKAEAFIKDMLPIQREINIMRSMFATHARKMGNSMWLVPMGSSV 348 Query: 221 DPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSG 280 D + + N + G I ++ + + + +L+ D ++ D +G +IS G Sbjct: 349 DEDEITNEEGG--IVHYTPIEGVRPERVGAPDIPSFYDRILNNHDADIDDLSGAREISQG 406 Query: 281 FSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLR 340 P L T + SL+ + ++ + + G++ L + +L L+ +H + RM R+ Sbjct: 407 RLPSGL--DTYSGLSLMVEQENEKLAVSSQNYEHGMKRLLQRVLLLMKKHYTEERMARIL 464 Query: 341 D 341 Sbjct: 465 G 465 >gi|167749268|ref|ZP_02421395.1| hypothetical protein EUBSIR_00219 [Eubacterium siraeum DSM 15702] gi|167657761|gb|EDS01891.1| hypothetical protein EUBSIR_00219 [Eubacterium siraeum DSM 15702] Length = 534 Score = 101 bits (252), Expect = 1e-19, Method: Composition-based stats. 
Identities = 49/314 (15%), Positives = 107/314 (34%), Gaps = 17/314 (5%) Query: 42 QGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLP-IISSQ 100 G + + P DIE+S + + R L M + + Sbjct: 139 MGDIAIRNADILNLFWEPGIKDIEESANLFYVTLVDRERLNLMYPELCGEEPESVAGGTG 198 Query: 101 NIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEW 160 N+E K S K + YY+ + +G +L G + +E Sbjct: 199 NVEKYKTEDKTDDSAKVEVVDWYYKKTI------NGRKQLCYCKFCGDRVIYSSEDDESC 252 Query: 161 -------NELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTI 213 + PF G + + Q L + L++ ++ + Sbjct: 253 ADGFYKHSRYPFVMDTLFVQEGTPCGFGYIDVMRDAQMYIDKLSQVVLEHTVMMSRKRYF 312 Query: 214 VQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTG 273 +++ S ++ + + + + VA + + I + P+ + L + EL + +G Sbjct: 313 IRQNSAVNEAEFADLK-NRFVHVAGNLGEEDIREIKAEPLDSSVMNALSFKIDELKETSG 371 Query: 274 ISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDK 333 D S G + A+A + ++++G +++ + + ++ LI Q D Sbjct: 372 NRDFSQGSVSNGVTA--ASAIAALQEAGSKLSRDMIKGTYFAFQQVCYLIIELIRQFYDT 429 Query: 334 VRMVRLRDQWVSFD 347 R R+ + +FD Sbjct: 430 PRSFRITGGYDAFD 443 >gi|291556862|emb|CBL33979.1| hypothetical protein ES1_09090 [Eubacterium siraeum V10Sc8a] Length = 534 Score = 101 bits (252), Expect = 1e-19, Method: Composition-based stats. 
Identities = 49/314 (15%), Positives = 106/314 (33%), Gaps = 17/314 (5%) Query: 42 QGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLP-IISSQ 100 G + + P DIE+S + + R L M + + Sbjct: 139 MGDIAIRNADILNLFWEPGIKDIEESANLFYVTLVDRERLNLMYPELCGEEPESVAGGTG 198 Query: 101 NIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEW 160 N+E K S K + YY+ + +G +L G + +E Sbjct: 199 NVEKYKTEDKTDDSAKVEVVDWYYKKTI------NGRKQLCYCKFCGDRVIYSSEDDESC 252 Query: 161 -------NELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTI 213 + PF G + + Q L + L++ ++ + Sbjct: 253 ADGFYKHSRYPFVMDTLFVQEGTPCGFGYIDVMRDAQMYIDKLSQVVLEHTVMMSRKRYF 312 Query: 214 VQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTG 273 +++ S ++ + + + + VA + + I + P+ + L EL + +G Sbjct: 313 IRQNSAVNEAEFADLK-NRFVHVAGNLGEEDIREIKAEPLDSSVMNALSLKIDELKETSG 371 Query: 274 ISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDK 333 D S G + A+A + ++++G +++ + + ++ LI Q D Sbjct: 372 NRDFSQGSVSNGVTA--ASAIAALQEAGSKLSRDMIKGTYFAFQQVCYLIIELIRQFYDT 429 Query: 334 VRMVRLRDQWVSFD 347 R R+ + +FD Sbjct: 430 PRSFRITGGYDAFD 443 >gi|148747833|ref|YP_001285799.1| portal protein [Phormidium phage Pf-WMP3] gi|146230066|gb|ABQ12474.1| portal protein [Phormidium phage Pf-WMP3] Length = 651 Score = 100 bits (248), Expect = 4e-19, Method: Composition-based stats. 
Identities = 51/353 (14%), Positives = 123/353 (34%), Gaps = 20/353 (5%) Query: 1 MALNYFIHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPD 60 +AL + + V+V ++ ++ + + + + + P+ Sbjct: 156 LALPWRVETAEVKKKVQVRTPLFEDE---PTFEVVSEEREVKSSPDFEVLDMFDCFYDPN 212 Query: 61 SVDIEKSPIVGRKLYLTRSDLISM---GYDR-----ESINNLPIISSQNIENTWKFPKNQ 112 D + + RKL T++D++++ GY + + + +S ++ + Sbjct: 213 VTDPNRGAFI-RKLTKTKADILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGV 271 Query: 113 YSD--KALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRA 170 + + +E E + I + V+ G N W PF Sbjct: 272 TTSLWSPHQNVELLEYWGDIHLEN--KTYHDVVVTIMGNEVLRFEQNPYWCGRPFVIGTY 329 Query: 171 MRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQF 230 + + + ++ Q LDNL ++ ++ PE V + Sbjct: 330 IPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDVY-TEP 388 Query: 231 GKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMT 290 GK V+ D++ + S I + +L+ + G + + + +T Sbjct: 389 GKVFLVSDHGDLQPLANQSSNFSIT--YQESSFLESTIDKNFGTGNYVGANAARSGERVT 446 Query: 291 ATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKVRMVRLRDQ 342 A + + ++G ++ I + + + L +L ++ L+ Q D+ MVR+ Sbjct: 447 AAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGD 499 >gi|257459274|ref|ZP_05624388.1| conserved hypothetical protein [Campylobacter gracilis RM3268] gi|257443287|gb|EEV18416.1| conserved hypothetical protein [Campylobacter gracilis RM3268] Length = 516 Score = 97.3 bits (240), Expect = 3e-18, Method: Composition-based stats. 
Identities = 51/302 (16%), Positives = 112/302 (37%), Gaps = 23/302 (7%) Query: 33 DLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESIN 92 ++ +S+ + +D VS + P + + + ++YL+ D++S G R Sbjct: 124 SCAVKVYWSKDRAMIDEVSLQDLYFDPGARGLNDISYLVHRIYLSSEDILSYG-KRGIFR 182 Query: 93 NLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKD 152 + + + +F + + LY EL R ++ G+ Sbjct: 183 IENKEAFADKKPYERFEIYEIYELRGGKWYVSSLY---------ENELLRDLIELRDGQP 233 Query: 153 NILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQT 212 I+ LP GE S++ +Q V D + Q P+ Sbjct: 234 FIV----GYMLPQIRCTDEEIYVSAYGEPALMSMLPLQNELNVNRNSITDVIRQQVAPKI 289 Query: 213 IVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRT 272 I+ + S+++ + + D S + + I + + L ++ E+ + + Sbjct: 290 ILGKASMVERGELESVG------TPIYADQPSAVQVLPAGDIGGAMAALQVIENEMSEVS 343 Query: 273 GISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQ 331 G+S +G ++ TAT S++ G +++ +RT + E +F L L+ ++ Sbjct: 344 GVSPQQNG--ATTVRKETATMASIMANEGSVRLQGYIRTFNETFFEPIFERLAFLVWKYA 401 Query: 332 DK 333 D Sbjct: 402 DP 403 >gi|154174760|ref|YP_001409087.1| hypothetical protein CCV52592_0034 [Campylobacter curvus 525.92] gi|153793129|gb|EAU00312.2| conserved hypothetical protein [Campylobacter curvus 525.92] Length = 554 Score = 96.5 bits (238), Expect = 5e-18, Method: Composition-based stats. 
Identities = 49/300 (16%), Positives = 102/300 (34%), Gaps = 28/300 (9%) Query: 41 SQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDL----ISMGYDRESINNLPI 96 +G ++ V D+ P++ D + ++ L+ DL YD+E+ N L Sbjct: 134 RKGLPVIEEVELDDIFFDPEAKDHDDIRYYVNRISLSYEDLGNLAKQKIYDKEATNELIS 193 Query: 97 ISSQNIENTWK-FPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNIL 155 + + Y + + + D + + I+ Sbjct: 194 RDEAKERRYDRLEIYDVYECENDKWYLSTIADNALLRDKVELKDGCPFILG--------- 244 Query: 156 CNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQ 215 +P + + C GE ASI+ +Q+ +D + +P+ IV Sbjct: 245 -----YMVPQVRDFSEQNFVCAYGEPPLASILPLQEEMNFARNSLIDAMNMHLKPKAIVP 299 Query: 216 EGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGIS 275 + I + + + P I + + +D E+ + +G+S Sbjct: 300 LSANISRTDLETIGK------PVYAQTPAQITFVPPPNIGSAQINISLIDNEMSEASGVS 353 Query: 276 DISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDKV 334 +G + TAT S++ G +V+ VR+ + +E LF L L+ ++ Sbjct: 354 PQQNG--ATTPRKETATMASIMANEGSVRVQGYVRSFNETFIEPLFERLAMLVWKYGASE 411 >gi|237748191|ref|ZP_04578671.1| conserved hypothetical protein [Oxalobacter formigenes OXCC13] gi|229379553|gb|EEO29644.1| conserved hypothetical protein [Oxalobacter formigenes OXCC13] Length = 798 Score = 95.3 bits (235), Expect = 1e-17, Method: Composition-based stats. 
Identities = 39/329 (11%), Positives = 106/329 (32%), Gaps = 19/329 (5%) Query: 21 HSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPD-SV--DIEKSPIVGRKLYLT 77 RE + R + +D V + L+ P + D E++ + + + + Sbjct: 181 EEIRETMAALQERAEVGRTE---GLVIDRVLTENLLVDPSIAEFWDYEQADWMVQIVPMK 237 Query: 78 RSDLISMG---YDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDG 134 ++ + D+ +I + S + + F + ++ I E++ Sbjct: 238 KAVAEGLYGYKLDKATIYKHRDMRSSSTGSGRLFSGGKQTNDDDSQICILEIWDKQSQRV 297 Query: 135 DGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKT 194 +AE + + PF L F+G S+ +Q Sbjct: 298 YTMAEGCEFWLRDPYSPPKVGERWY----PFFLLPFQTVDGHFVGPSIVDLTERLQDEHN 353 Query: 195 VLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKP--IRVAAGMDIRSVLGIHSVP 252 + ++ E + + + + G+ I ++++ P Sbjct: 354 SARDRYNEHRDLIKPGYIASAELNEKTLKRFTDSELGEITLIDAGGQPIQQAIMPKSYPP 413 Query: 253 MIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTL 312 + + +D ++ + +++ TAT ++++QS G+V + Sbjct: 414 IDPAVYD----TSPVRLDWEMVTGLQDASRSSVVKPKTATEANILQQSLSGRVSEFRDQV 469 Query: 313 AQGLEILFRGLLRLIIQHQDKVRMVRLRD 341 L+ + + ++IQ ++ ++ Sbjct: 470 EDFLQQIAQYTAEILIQELQPEQVEKIMG 498 >gi|121534832|ref|ZP_01666652.1| hypothetical protein TcarDRAFT_1284 [Thermosinus carboxydivorans Nor1] gi|121306627|gb|EAX47549.1| hypothetical protein TcarDRAFT_1284 [Thermosinus carboxydivorans Nor1] Length = 610 Score = 93.0 bits (229), Expect = 5e-17, Method: Composition-based stats. 
Identities = 52/351 (14%), Positives = 110/351 (31%), Gaps = 49/351 (13%) Query: 43 GKVCVDAVSPDEFLIHPDSV--DIEKSPIVGRKLYLTRSDLISMGYD-RESINNLPIIS- 98 GK + VSP + + P+S D+ + + R ++++ DL + + I Sbjct: 144 GKAVIKRVSPFDIYVDPESREPDLSDAEYICRAKWVSKDDLKRTYPEFADEIEAFAERYD 203 Query: 99 ---------------SQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGD-------- 135 + + + Y ++ + D Sbjct: 204 RDEEEECDEDLEPLWYSREKKKCRLVEIWYKRHTMKEYYVIGPGQIVTKDELLPGMMVTH 263 Query: 136 ----GIAELRRVIMAGGTGKDNILCNEEWNELPFT-CLRAMRAPHCFIGESLAASIIEIQ 190 E+R + G +++ + PF I + + +IQ Sbjct: 264 KFRVPQTEIRCSAIIGDVELEDVPSPYQHGRFPFAPYFAYYVGEEGEIPAGVVRDLQDIQ 323 Query: 191 KIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHS 250 + + Q L + +++ G + +L + V D S Sbjct: 324 REQNKRRSQLLHLINTMANRGWLLRRGQEDTKKKLLESGSTPGVVVEYDTDPPKPFDSTS 383 Query: 251 VPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVR 310 VP F L D + +GI++ G EI + A L +++ V QV + Sbjct: 384 VPTTFAEFEQLG--DADFRQISGINEAMLGQ--EIPSGTSGRAIELRQRTAVTQVAGLFD 439 Query: 311 TLAQGLEILFRGLLR-------LIIQHQDKVRMVRLRD-----QWVSFDPR 349 L + + + LL +I Q+ + + R+ ++V+ + R Sbjct: 440 NL-RATKEMVLYLLWGSEGAPGIIPQYYTEEKTFRIIGESGKDEFVTINQR 489 >gi|225155390|ref|ZP_03723882.1| hypothetical protein ObacDRAFT_9438 [Opitutaceae bacterium TAV2] gi|224803846|gb|EEG22077.1| hypothetical protein ObacDRAFT_9438 [Opitutaceae bacterium TAV2] Length = 672 Score = 92.3 bits (227), Expect = 1e-16, Method: Composition-based stats. 
Identities = 43/361 (11%), Positives = 107/361 (29%), Gaps = 38/361 (10%) Query: 12 KDSDVEVLEHSHRE-DGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIV 70 D + EV+ S + G+ V + + S+ ++ ++A++ + ++ + + + Sbjct: 100 TDFETEVVIGSDYMLESGKVVF--KAFWETSRKRLKIEAINRYDVIVPNWTGRLADCDWI 157 Query: 71 GRKLYLTRSDLISM------GYDRESINNLPIISSQNIE----NTWKFPKNQYSDKALEM 120 ++ + D ++IN L + N KF + + + + Sbjct: 158 VHVQRFSKHAFRRLVKRMAWTIDDDTINALAGQDATNTGAASAEQSKFQRQGITSPSKDD 217 Query: 121 IEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEE---------WNELP----FTC 167 + + DG + + +L + + LP F Sbjct: 218 EIVL--WEVYSRNDDGAW---IIKTYSPVRPEQVLRPDFGLPYNQGVFADSLPPPPFFEI 272 Query: 168 LRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLN 227 ++ + + + + D + P S + S L Sbjct: 273 SCELKDRGYYDSRGIVKRVAPFEASLCKDWNTVKDYQTLTSTPILTASARSDVGNNSTLR 332 Query: 228 PQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQ 287 Q G+ + + + + + + Q G+ D +G + Sbjct: 333 FQPGQVLPF-------PLSAVQMPTLPVDTQQGMLGTRQTAEQLVGVPDFGTGSQQPSGE 385 Query: 288 NMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFD 347 TA SLI V++ R + L + ++ Q+ + + D + Sbjct: 386 RKTAKEVSLIANVMGQSVDMRARIFRKELAHGLAIMWAILSQYAREELDYFVLDNLIQIP 445 Query: 348 P 348 P Sbjct: 446 P 446 >gi|315929405|gb|EFV08607.1| hypothetical protein CSS_1407 [Campylobacter jejuni subsp. jejuni 305] Length = 512 Score = 92.3 bits (227), Expect = 1e-16, Method: Composition-based stats. 
Identities = 47/296 (15%), Positives = 106/296 (35%), Gaps = 27/296 (9%) Query: 42 QGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDL---ISMGYDRESINNLPIIS 98 +G ++ V D P++++ E + ++YLT + + +G+ ++ Sbjct: 138 KGMPRIERVDIDSIFFDPNALNSEDVGYIVNEIYLTYNQIHERQKLGFYKKIEIKKLFDE 197 Query: 99 SQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNE 158 + + E + + + + + + I + + NE Sbjct: 198 DDEYKKVKLY-DIYERKNDDEWVVSTLFENNLLRNEVTLQDGQPFIWGSMLPQLKKIDNE 256 Query: 159 EWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGS 218 + GE + AS + +Q + +D + P+ ++ + Sbjct: 257 NYV--------------SAYGEPIMASAMPLQDEINITRNLLIDAVRTHIMPKIMMPKSM 302 Query: 219 IIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDIS 278 + E + GKPI ++ + P + + L L+ EL + TG+S + Sbjct: 303 GVSREDIETL--GKPIYTDDPKGVQILP----PPNVNSAGMNLQLLESELTEVTGVSPQN 356 Query: 279 SGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDK 333 +G + QN TAT S+ Q G + +R + +E LF L+ ++ + Sbjct: 357 NG--AQTAQNETATEISIKAQEGGRRSADYIRQYNETFIEPLFDRFAMLVFKYGED 410 >gi|209548332|ref|YP_002280249.1| hypothetical protein Rleg2_0727 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209534088|gb|ACI54023.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 711 Score = 91.9 bits (226), Expect = 1e-16, Method: Composition-based stats. 
Identities = 43/304 (14%), Positives = 94/304 (30%), Gaps = 10/304 (3%) Query: 39 KYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISM-GYDRESINNLPII 97 + +VC+D V +FL P + + V R++ +T ++ G + + Sbjct: 184 VIADERVCIDYVHWSDFLHSP-ARRWKDVTWVARRVPMTDEEMEKRFGAEAMASRAAEGA 242 Query: 98 SSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYD-GDGIAELRRVIMAGGTGKDNILC 156 + ++ + +N+ + E + DG V C Sbjct: 243 AGNKADSQAERLENEGKTH---VWEIWCKSENYTVWIADGSPVALEVSEPPLDLTHFWPC 299 Query: 157 NEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQ-TLDNLYWQNQPQTI-V 214 + + P + I + K L Q L Y Sbjct: 300 PRPAYGT-MSTSSLIPVPDYVYYQQQCDEIDLLTKRINKLTDQLRLKVFYPSGDGAVSPA 358 Query: 215 QEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGI 274 E ++ ++ + ++++ + + + + + Q + D I Sbjct: 359 IEKAMRPENDMVMVPIPEWAAFTDKGGSKAIVTLPIDEVQKVIVACMQARKQLIEDVYQI 418 Query: 275 SDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLII-QHQDK 333 + IS + + TATA + Q G ++ LA+ + R +I Q Q + Sbjct: 419 TGISDIVRGDTQASETATAQRIKSQWGSIRIRDRQAELARFARDIIRLAGEIICDQFQPE 478 Query: 334 VRMV 337 M+ Sbjct: 479 TLML 482 >gi|283956319|ref|ZP_06373799.1| hypothetical protein C1336_000250090 [Campylobacter jejuni subsp. jejuni 1336] gi|283792039|gb|EFC30828.1| hypothetical protein C1336_000250090 [Campylobacter jejuni subsp. jejuni 1336] Length = 512 Score = 91.9 bits (226), Expect = 1e-16, Method: Composition-based stats. 
Identities = 48/304 (15%), Positives = 102/304 (33%), Gaps = 43/304 (14%) Query: 42 QGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLIS---MGYDRESINNLPIIS 98 +G ++ V D P++++ E + ++YLT + + +G+ + Sbjct: 138 KGMPRIERVDIDSIFFDPNALNSEDVGYIVNEIYLTYNQIHERQNLGFYKNIEIQKLFDE 197 Query: 99 SQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNE 158 + + + Y K + L+ Sbjct: 198 DDEYKKVKLY--DIYERKNDDEWVVSTLF---------------------ENNLLRNKVT 234 Query: 159 EWNELPFTCLRAMRAPHCF--------IGESLAASIIEIQKIKTVLLRQTLDNLYWQNQP 210 + PF + GE + AS + +Q + +D + P Sbjct: 235 LQDGQPFVWGSMLPQLKKIDNENYVSAYGEPIMASAMPLQDEINITRNLLIDAVRTHIMP 294 Query: 211 QTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVD 270 + ++ + + E + GKPI ++ + P + + L L+ EL + Sbjct: 295 KIMMPKSMGVSREDIETL--GKPIYTDDPKGVQILP----PPNVNSAGMNLQLLESELTE 348 Query: 271 RTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQ 329 TG+S ++G + QN TAT S+ Q G + +R + +E LF L+ + Sbjct: 349 VTGVSPQNNG--AQTAQNETATEISIKAQEGGRRSADYIRQYNETFIEPLFDRFAMLVFK 406 Query: 330 HQDK 333 + + Sbjct: 407 YGED 410 >gi|149408206|ref|YP_001294640.1| hypothetical protein ORF047 [Pseudomonas phage PA11] Length = 584 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. 
Identities = 45/317 (14%), Positives = 95/317 (29%), Gaps = 31/317 (9%) Query: 45 VCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYD--------------RES 90 + +SP + + +P + I + T+ +L+ + D E Sbjct: 165 PRLVRISPLDIVFNPLATSISD-TFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEI 223 Query: 91 INNLPIISSQNIENTWKFPKNQ----YSDKALEMIEYYELYVTIDYDGDGIAELRRVIMA 146 +L S ++ + F + Y + +E E Y G + R+I Sbjct: 224 CRHLGGYSVEDFDKAAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITV 283 Query: 147 GGTGKDNILC--NEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNL 204 + + P + P +++ +Q L D + Sbjct: 284 VDRSTEVRNESIPTWFGSAPIYHVGWRFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAV 343 Query: 205 YWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYL 264 QP II G I + G D++ + +V I + + + L Sbjct: 344 DLIIQPPLK-----IIGEVEEFVWGPGAEIHLDQGGDVQEI--AKNVNYIINADNQIQML 396 Query: 265 DQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLA-QGLEILFRGL 323 + + G + G TA + + + V T + LE + + Sbjct: 397 EDRMELYAGAPREAMGI--RTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAM 454 Query: 324 LRLIIQHQDKVRMVRLR 340 L ++ D ++R+ Sbjct: 455 LETATRNMDGSDVIRVM 471 >gi|57237581|ref|YP_178595.1| hypothetical protein CJE0579 [Campylobacter jejuni RM1221] gi|57166385|gb|AAW35164.1| hypothetical protein CJE0579 [Campylobacter jejuni RM1221] Length = 512 Score = 91.1 bits (224), Expect = 2e-16, Method: Composition-based stats. 
Identities = 46/296 (15%), Positives = 106/296 (35%), Gaps = 27/296 (9%) Query: 42 QGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDL---ISMGYDRESINNLPIIS 98 +G ++ V D P++++ E + ++YLT + + +G+ +++ Sbjct: 138 KGMPRIERVDIDSIFFDPNALNSEDVGYIVNEIYLTYNQIHERQKLGFYKKNEIKKLFDE 197 Query: 99 SQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNE 158 + + E + + + + + + I + + NE Sbjct: 198 DDEYKKVKLY-DIYERKNDDEWVVSTLFENNLLRNEVTLQDGQPFIWGSMLPQLKKIDNE 256 Query: 159 EWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGS 218 + GE + AS + +Q + +D + P+ ++ + Sbjct: 257 NYV--------------SAYGEPIMASAMPLQDEINITRNLLIDAVRTHIMPKIMMPKSM 302 Query: 219 IIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDIS 278 + E + GKPI ++ + P + + L L+ EL + G+S + Sbjct: 303 GVSREDIETL--GKPIYTDDPKGVQILP----PPNVNSAGMNLQLLESELTEVIGVSPQN 356 Query: 279 SGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLIIQHQDK 333 +G + QN TAT S+ Q G + +R + +E LF L+ ++ + Sbjct: 357 NG--AQTAQNETATEISIKAQEGGRRSADYIRQYNETFIEPLFDRFAMLVFKYGED 410 >gi|283852987|ref|ZP_06370245.1| hypothetical protein DFW101DRAFT_2815 [Desulfovibrio sp. FW1012B] gi|283571597|gb|EFC19599.1| hypothetical protein DFW101DRAFT_2815 [Desulfovibrio sp. FW1012B] Length = 614 Score = 91.1 bits (224), Expect = 3e-16, Method: Composition-based stats. 
Identities = 49/374 (13%), Positives = 105/374 (28%), Gaps = 54/374 (14%) Query: 15 DVEVLEHSHREDGGEKVHDLRIRRKYSQ-------GKVCVDAVSPDEFLIHPDS-VDIEK 66 + E R + + L + + G+V V P F ++P + DI+ Sbjct: 117 EQEQQAVLERSVLNGETYGLAVEKVVFDPDLEYGLGEVRTVVVDPFAFGVYPTACPDIQD 176 Query: 67 SPIVGRKLYLTRSDLISMGYDR---------------ESINNLPIISSQNIENTWKFPK- 110 + V +T + + + +F + Sbjct: 177 AEAVLHFTPMTLREAARRWPEAAGRLTSDAALLADLGDGRREAATGDGSRRGLFARFGEV 236 Query: 111 -------NQYSDKALEMIEYYELYV-TIDYDGDGIAEL---RRVIMAGGTGKDNILCNEE 159 Q + + E +V DG R V +AG G Sbjct: 237 VRTLAGAGQTDGPSEDTTLVCECWVKDYAMTSDGPRYPGCIRCVTVAGAGGLVLSDRGNP 296 Query: 160 ----------------WNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDN 203 ++ PF ++ P G S + E+Q L Q + Sbjct: 297 SVNPALTPDEAMATYLYDRFPFALANSLTDPASLWGASDFEQLAELQTEVNKCLSQLTYH 356 Query: 204 LYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHY 263 +P+ I S + + N + A+ + + + + S+L Sbjct: 357 KDRCARPKIINPRDSGVANAAFTNR--LGIVNPASMAAAQGIRYLEFANNTKDIESVLAI 414 Query: 264 LDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGL 323 + +G+ ++ SP+ + A S + + + +R ++ + R Sbjct: 415 YRELFSQISGLGELERAGSPDHPV-IAYKAISALIEQAATLLRGKIRNYSRLVRERGRMF 473 Query: 324 LRLIIQHQDKVRMV 337 L + + R + Sbjct: 474 LSHMQNWYTEERWI 487 >gi|313113989|ref|ZP_07799544.1| hypothetical protein HMPREF9436_01396 [Faecalibacterium cf. prausnitzii KLE1255] gi|310623691|gb|EFQ07091.1| hypothetical protein HMPREF9436_01396 [Faecalibacterium cf. prausnitzii KLE1255] Length = 649 Score = 89.2 bits (219), Expect = 8e-16, Method: Composition-based stats. 
Identities = 44/317 (13%), Positives = 107/317 (33%), Gaps = 25/317 (7%) Query: 43 GKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNI 102 G++C+ +V+ P DI+ +P + + L SS ++ Sbjct: 172 GEICIRSVNLLMLYWEPGVEDIQDTPHLFSLSLMDNDQLEGRYPQ----MAGHTGSSMDV 227 Query: 103 ENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNE 162 DK++ + YY+ + G L G + ++ + Sbjct: 228 AKYIHDDSIDTGDKSVVVDWYYKKAL-----EGGQTVLHYCKYCNGVVLYASENDPQYAQ 282 Query: 163 L--------PFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIV 214 PF R G + + Q + +N+ + + ++ Sbjct: 283 RGFYDHGKYPFVFDPLFREEDSPAGFGYIDVMKDTQTAIDEMNHAMDENVKLAAKARYVL 342 Query: 215 QEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGI 274 + + ++ E + + + V + S + + + S EL + +G Sbjct: 343 SDTAGVNEEELADFGKD-IVHVVGRLTDDSFRPLQTNVLSGNCISYRDARVSELKEISGN 401 Query: 275 SDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKV 334 D+S G + L A+A + ++++G ++++ + ++ L+ Q D+ Sbjct: 402 RDVSQGGTTSGLTA--ASAIAALQEAGSKLSRDMLKSAYRTFAKECYLVIELMRQFYDEE 459 Query: 335 RMVRLRD-----QWVSF 346 R+ R+ ++V F Sbjct: 460 RVYRITGESGGVEYVPF 476 >gi|153951607|ref|YP_001398216.1| hypothetical protein JJD26997_1133 [Campylobacter jejuni subsp. doylei 269.97] gi|153952365|ref|YP_001397542.1| hypothetical protein JJD26997_0326 [Campylobacter jejuni subsp. doylei 269.97] gi|152939053|gb|ABS43794.1| conserved hypothetical protein [Campylobacter jejuni subsp. doylei 269.97] gi|152939811|gb|ABS44552.1| hypothetical protein JJD26997_0326 [Campylobacter jejuni subsp. doylei 269.97] Length = 507 Score = 88.4 bits (217), Expect = 1e-15, Method: Composition-based stats. 
Identities = 50/306 (16%), Positives = 108/306 (35%), Gaps = 47/306 (15%) Query: 42 QGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDL---ISMGYDRESINNLPIIS 98 +G ++ V D P++++ E + ++YLT +++ +G+ ++ P + Sbjct: 136 KGMPRIERVGIDSIFFDPNALNSEDVGYIVNEIYLTYNEIYERQKLGFYKK--LETPKLL 193 Query: 99 SQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNE 158 + E + Y K + L+ ++++L NE Sbjct: 194 DEEDEYKKVKLYDIYERKNDDAWVVSTLF-----------------------ENHLLRNE 230 Query: 159 EW--NELPFTCLRAMRAPHCF--------IGESLAASIIEIQKIKTVLLRQTLDNLYWQN 208 + PF + GE + AS + +Q + +D + Sbjct: 231 VILQDGQPFVWGSMLPQLKKIDNENYVSAYGEPIMASAMPLQDEINITRNLLIDAVRTHI 290 Query: 209 QPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQEL 268 P+ ++ + + E + D + I P + + L L+ EL Sbjct: 291 MPKIMLPKSMGVSREDIETLGK------PLYTDDPKGVQILPPPDVNSAGMNLQLLESEL 344 Query: 269 VDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQ-GLEILFRGLLRLI 327 + TG+S ++G + N TAT S+ Q G + +R + +E LF L+ Sbjct: 345 TEVTGVSPQNNG--AQTAHNETATEISIKAQEGGRRSADYIRQYNETFIEPLFDRFAMLV 402 Query: 328 IQHQDK 333 ++ + Sbjct: 403 FKYGED 408 >gi|303245700|ref|ZP_07331983.1| conserved hypothetical protein [Desulfovibrio fructosovorans JJ] gi|302492963|gb|EFL52828.1| conserved hypothetical protein [Desulfovibrio fructosovorans JJ] Length = 602 Score = 88.0 bits (216), Expect = 2e-15, Method: Composition-based stats. 
Identities = 46/374 (12%), Positives = 103/374 (27%), Gaps = 54/374 (14%) Query: 15 DVEVLEHSHREDGGEKVHDLRIRRKYSQ-------GKVCVDAVSPDEFLIHP-DSVDIEK 66 + E R + + + + + G+V V P F ++P DI+ Sbjct: 117 EQEQQAIFERSVINGETYGVAVEKVVFDPDLEYGLGEVRTVVVDPFAFGVYPTSCPDIQD 176 Query: 67 SPIVGRKLYLTRSDLISMGYDR---------------ESINNLPIISSQNIENTWKFPK- 110 + V ++ + + +F + Sbjct: 177 AEAVLHFTPMSLREAKRRWPKAAGKLTSDAALLAQLGDGRREAITGDGSRQGLFGRFGEV 236 Query: 111 -------NQYSDKALEMIEYYELYVT-IDYDGDGIAEL---RRVIMAGGTGKDNILCNEE 159 + + + E + DGD R V +AG Sbjct: 237 VRTIVGASGGDGPSDDATLVCECWARDYTMDGDMPRYPGFIRCVTVAGAGEVVLSDQGNP 296 Query: 160 ----------------WNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDN 203 ++ PF ++ P G S + E+Q L Q + Sbjct: 297 SINPELQEAEAVASYLYDRFPFALANSLTDPASLWGASDFEQLAELQLEVNKCLSQLTYH 356 Query: 204 LYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHY 263 +P+ I S + + N Q + A+ + + + + S+L Sbjct: 357 KDRCARPKIINPRDSGVANAAFTNRQ--GIVNPASMAAAQGIRYLEFTNNTKDIESVLGI 414 Query: 264 LDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGL 323 + +GI +I +P+ + A + + + + +R ++ + R Sbjct: 415 YREMFSQISGIGEIERATAPDHPV-IAYKAIAALIEQAATLLRGKIRNYSRLIRERGRMF 473 Query: 324 LRLIIQHQDKVRMV 337 L + + R + Sbjct: 474 LSHMQNWYTEERWI 487 >gi|327189473|gb|EGE56633.1| hypothetical protein RHECNPAF_608006 [Rhizobium etli CNPAF512] Length = 694 Score = 87.6 bits (215), Expect = 2e-15, Method: Composition-based stats. 
Identities = 42/298 (14%), Positives = 87/298 (29%), Gaps = 8/298 (2%) Query: 44 KVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIE 103 +VC+D V +FL P + + V R++ +T ++ + Sbjct: 177 RVCIDYVHWSDFLHSP-ARRWKDVTWVARRVPMTDEEMEKRFGREAM--ASGAAQAAAGG 233 Query: 104 NTWKFPKNQYSDKALEMIEYYELYVTIDYD-GDGIAELRRVIMAGGTGKDNILCNEEWNE 162 + ++ + E + DG V C Sbjct: 234 KGASQAERAENEGKTHVWEIWCKSENYTVWIADGSPVALEVSEPPLELTHFWPCPRPAYG 293 Query: 163 LPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQ-TLDNLYWQNQPQTI-VQEGSII 220 + + P + I + K L Q L Y E ++ Sbjct: 294 TV-STSSLIPVPDYVYYQQQCDEIDLLTKRINKLTDQLRLKVFYPSGDGAISPAIEKAMR 352 Query: 221 DPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSG 280 ++ + ++V+ + + + + + Q + D I+ IS Sbjct: 353 PENDMVMVPIPEWAAFTDKGGSKAVVTLPIDEVQKVIVACMAARKQLIEDVYQITGISDI 412 Query: 281 FSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLII-QHQDKVRMV 337 + + TATA + Q G ++ LA+ + R +I Q Q + M+ Sbjct: 413 VRGDTQASETATAQRIKSQWGSIRIRDRQAELARFARDIIRLAGEIICDQFQPETLML 470 >gi|86356737|ref|YP_468629.1| hypothetical protein RHE_CH01094 [Rhizobium etli CFN 42] gi|86280839|gb|ABC89902.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 701 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. 
Identities = 41/298 (13%), Positives = 86/298 (28%), Gaps = 8/298 (2%) Query: 44 KVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIE 103 +VC+D V +FL P + + V R++ + ++ + Sbjct: 177 RVCIDYVHWSDFLHSP-ARRWKDVTWVARRVPMADEEMEKRFGREAM--ASGAAQAAAGG 233 Query: 104 NTWKFPKNQYSDKALEMIEYYELYVTIDYD-GDGIAELRRVIMAGGTGKDNILCNEEWNE 162 + ++ + E + DG V C Sbjct: 234 KGASQAERAENEGKTHVWEIWCKSENYTVWIADGSPVALEVSEPPLELTHFWPCPRPAYG 293 Query: 163 LPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQ-TLDNLYWQNQPQTI-VQEGSII 220 + + P + I + K L Q L Y E ++ Sbjct: 294 TV-STSSLIPVPDYVYYQQQCDEIDLLTKRINKLTDQLRLKVFYPSGDGAISPAIEKAMR 352 Query: 221 DPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSG 280 ++ + ++V+ + + + + + Q + D I+ IS Sbjct: 353 PENDMVMVPIPEWAAFTDKGGSKAVVTLPIDEVQKVIVACMAARKQLIEDVYQITGISDI 412 Query: 281 FSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLII-QHQDKVRMV 337 + + TATA + Q G ++ LA+ + R +I Q Q + M+ Sbjct: 413 VRGDTQASETATAQRIKSQWGSIRIRDRQAELARFARDIIRLAGEIICDQFQPETLML 470 >gi|239905065|ref|YP_002951804.1| hypothetical protein DMR_04270 [Desulfovibrio magneticus RS-1] gi|239794929|dbj|BAH73918.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 584 Score = 84.9 bits (208), Expect = 2e-14, Method: Composition-based stats. 
Identities = 49/374 (13%), Positives = 103/374 (27%), Gaps = 54/374 (14%) Query: 15 DVEVLEHSHREDGGEKVHDLRIRRKYSQ-------GKVCVDAVSPDEFLIHP-DSVDIEK 66 + E R + + L + + G+V V P F ++P +DI++ Sbjct: 115 EQEQQAVFERSVINGETYGLAVEKVVFDPELEYGLGEVRTVNVDPFAFGVYPTSCLDIQE 174 Query: 67 SPIVGRKLYLTRSD------------------LISMGYDRESINNLPIISSQNIENTWKF 108 + V ++ L +G R I + Sbjct: 175 AEAVLHFAPMSLRQAARRWPEAAGQLKSDAATLADLGDGRREILLGDGRRQGLFTRFGEV 234 Query: 109 PKNQYSDKA-----LEMIEYYELYVT-IDYDGDGIAEL---RRVIMAGGTGKDNILCNEE 159 + + + E + DG R V++AG Sbjct: 235 LRQLAGAGGGDALGQDTVLVCECWARDYTMTDDGPLYPGFIRCVVVAGPGSLVLSDQPNP 294 Query: 160 ----------------WNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDN 203 ++ PF ++ P G S + E+Q L Q + Sbjct: 295 SINPALPLDQAMASYLYDRYPFALANSLTDPTTIWGASDFEQLAELQLEINKCLSQLTYH 354 Query: 204 LYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHY 263 +P+ I S +D + N + + + + + S+L Sbjct: 355 KDRCARPKIINPRDSGVDNAAFTNR--LGIVNPTSMAAAQGIRYLEFANNTRDIESVLTL 412 Query: 264 LDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGL 323 + +GI ++ SP+ A + + + + +R ++ + R Sbjct: 413 YRELFSQISGIGELERAASPDHPVVA-YKAIAALIEQASTLLRGKIRNYSRLVRERGRMF 471 Query: 324 LRLIIQHQDKVRMV 337 L + K R + Sbjct: 472 LSHMQNWYAKERWI 485 >gi|145642402|ref|ZP_01797960.1| Haemophilus-specific protein, uncharacterized [Haemophilus influenzae R3021] gi|145272901|gb|EDK12789.1| Haemophilus-specific protein, uncharacterized [Haemophilus influenzae 22.4-21] Length = 313 Score = 83.8 bits (205), Expect = 3e-14, Method: Composition-based stats. 
Identities = 28/208 (13%), Positives = 64/208 (30%), Gaps = 12/208 (5%) Query: 139 ELRRVIMAGGTGKDNILCNEEWN--ELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVL 196 E+ VI+ G GK + + E P++ C G + + Q+I Sbjct: 22 EIEGVIVMAGNGKILSVNLNPLDTAEFPYSVYTCEPDVCCLFGFGIPYLCRDAQEILNTA 81 Query: 197 LRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMI-- 254 R +DN PQ +V + + K + + I Sbjct: 82 WRGMIDNGILGIGPQAVVNSSVLTPVDGNWELAPYKLWKTNDRATANAQFEAQRAFGIFD 141 Query: 255 -----EKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIV 309 ++ +++ + + +G+ I+ G ++ T S++ + V Sbjct: 142 IGSRQQELANIIQLSKSFMDEESGLPMIAQGEQGQV--TPTLGGMSMLMNAANAVRRRQV 199 Query: 310 RTLAQGL-EILFRGLLRLIIQHQDKVRM 336 + + + L R + + + Sbjct: 200 KEWDDSVTKPLIRRFYEYNMNMSEDASI 227 >gi|288957023|ref|YP_003447364.1| hypothetical protein AZL_001820 [Azospirillum sp. B510] gi|288909331|dbj|BAI70820.1| hypothetical protein AZL_001820 [Azospirillum sp. B510] Length = 534 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 33/217 (15%), Positives = 65/217 (29%), Gaps = 9/217 (4%) Query: 134 GDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIK 193 DG A V++ G + L + + PF R ++AP G S + K Sbjct: 232 PDGAAYRWGVVLDSGLADPSWLAQGRFAQSPFVNFRWLKAPGETYGRSPVMKALPDIKTA 291 Query: 194 TVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPM 253 ++ L N + + LNP + + G+ + Sbjct: 292 NKVVELVLKNASIAVTGIWQADDDGV------LNPSTIRLVPGTIIPKAVGSAGL-TPLA 344 Query: 254 IEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATAT-SLIEQSGVGQVELIVRTL 312 F + + +L R + + P MTAT + R Sbjct: 345 NPGRFDVSQLVLDDLRGRIRHALLVDRLGPVDSARMTATEVLERSVEMARLLGATYGRLQ 404 Query: 313 AQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 A+ + L + ++ + + + + + V R Sbjct: 405 AELMTPLLLRAVSILRRRGEIPD-ITVDGRLVELQHR 440 >gi|225155663|ref|ZP_03724152.1| hypothetical protein ObacDRAFT_9274 [Opitutaceae bacterium TAV2] gi|224803636|gb|EEG21870.1| hypothetical protein ObacDRAFT_9274 [Opitutaceae bacterium TAV2] Length = 657 Score = 81.1 bits (198), Expect = 2e-13, Method: Composition-based 
stats. Identities = 36/282 (12%), Positives = 83/282 (29%), Gaps = 12/282 (4%) Query: 46 CVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDR-----ESINNLPIISSQ 100 + V D FL + + + +V + + M +R E + S Sbjct: 248 RSEIVPSDRFLCPVNVASPDDAKLVAELYDKDIAWIEDMWIERPWAIWEEVKGEFTQSGA 307 Query: 101 NIENTWKFPKNQYSDKALEMIE--YYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNE 158 + + + + + + E + D G + + + + + Sbjct: 308 DEKTEGESKAKEDATHDDKESLRKIIECWGRRDVLGLEGPQEFVIFIDEDSERAVFYEFT 367 Query: 159 ----EWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIV 214 + PFT + R + G+S+ + + Q+ P V Sbjct: 368 AKVCPDFKRPFTTIAVGRTRRRWWGKSIPEKVAQYQEKIDENFNGEAYRNLMNANPLKGV 427 Query: 215 QEGSIIDPESVLNPQFGKPIRVA-AGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTG 273 + ++ E L K + V K+ + ++ + Sbjct: 428 NPDATVEEEEDLVFDPEKVYHLKLNKKMEDFVSFAKLPDADFKTRDIAQFVFWFVQRWLH 487 Query: 274 ISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQG 315 ISD+ +G + +N TAT + ++ R + +G Sbjct: 488 ISDVGTGDYEALPENNTATGIEINREASQSTSRRWNRRINEG 529 >gi|254523473|ref|ZP_05135528.1| hypothetical protein SSKA14_2606 [Stenotrophomonas sp. SKA14] gi|219721064|gb|EED39589.1| hypothetical protein SSKA14_2606 [Stenotrophomonas sp. SKA14] Length = 696 Score = 79.2 bits (193), Expect = 8e-13, Method: Composition-based stats. 
Identities = 39/353 (11%), Positives = 99/353 (28%), Gaps = 57/353 (16%) Query: 41 SQGKVCVDAVSPDEFLIHPDSVD--IEKSPIVGRKLYLTRSDLISMGYDRESINNLPIIS 98 +G+V + P + L PD+ + +LT S + Y +++ + + S Sbjct: 145 DEGEVSLTTFDPRDVLPDPDATSYNPDSWADCSITRWLTHSQI-EQNYGKDAADEIRDSS 203 Query: 99 SQNIENTWKFPKNQ--------YSDKALEMIEY-----YELYVTIDYDGDGIAELRRVIM 145 + N W + A+ M Y + Y +D + Sbjct: 204 MAYVHNNWGDEQGMMRDAFGNMPPSYAMNMGWYGEEGTWRRYRVVDRQSHEYQQTLVAKW 263 Query: 146 AGGTGKDNILCNEE------------------------------------WNELPFTCLR 169 I E FT + Sbjct: 264 PATGDLRIIEGFEPELIGWLIEQGVHVMRRRIRRVRWQVCAPEVCVYDKLSPYDHFTVIP 323 Query: 170 AMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQ 229 + + ++Q + + Q + + S+ + Sbjct: 324 YFPYFRRGKTVGMLDNAAQVQDLINKFVSQYAHIVNASANGGWQGEANSLENMTDEEFTS 383 Query: 230 FGKP--IRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQ 287 G + + + I +M+ +L + + T +++ + G + Sbjct: 384 RGGETGLVLLRKPGTQPFQKIEPNQPPRGIENMIDFLQRNMQTVTAVNESAMG---QGSA 440 Query: 288 NMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLR 340 +M+ A + + + + + L++ ++L L+L+ + R++R+ Sbjct: 441 DMSGIAIQSRQFAAQQALGIALDNLSRTRQMLAERTLKLVQRFYTAPRVIRIA 493 >gi|145589308|ref|YP_001155905.1| hypothetical protein Pnuc_1125 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] gi|145047714|gb|ABP34341.1| hypothetical protein Pnuc_1125 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] Length = 653 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. 
Identities = 36/307 (11%), Positives = 91/307 (29%), Gaps = 16/307 (5%) Query: 45 VCVDAVSPDEFLIHPDS---VDIEKSPIVGRKLYLTRSD---LISMGYDRESINNLPIIS 98 + +D V + LI P D + + + + + RS L I Sbjct: 197 LVIDRVLTENLLIDPSICEFWDYTDADWICQIIPMKRSQAEALYKKNLANAKIYQPGQGE 256 Query: 99 SQNIENTW----KFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNI 154 + + + + I E++ + + E + Sbjct: 257 PSHKKAKRLASMQMNAGSGPVTDDQQIAVLEIWDRVTQRVYTMVEGATEWLREPYSPPRA 316 Query: 155 LCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIV 214 PF L F+G SL +Q + + Sbjct: 317 GERWY----PFFLLPYQVIDGQFVGPSLVDLTERLQDEHNEARDRFNQHRDLCIPGWVAS 372 Query: 215 QEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGI 274 + + + + +FG+ V + + I K +++ D + Sbjct: 373 ADINEKTIKKHSDSRFGEITIVDTEGKPLNQVIIPRG--HPKIDPIVYDTSAVRYDWEQV 430 Query: 275 SDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKV 334 + + +++ TAT ++++++ G+V + L+ + + ++++Q K Sbjct: 431 TGLQDAARSTVVRPKTATEANILQRALSGRVFEFKDQIEDWLQEIAQYSAQVLLQELTKE 490 Query: 335 RMVRLRD 341 ++ R Sbjct: 491 QVERYMG 497 >gi|313115193|ref|ZP_07800677.1| hypothetical protein HMPREF9436_02547 [Faecalibacterium cf. prausnitzii KLE1255] gi|310622471|gb|EFQ05942.1| hypothetical protein HMPREF9436_02547 [Faecalibacterium cf. prausnitzii KLE1255] Length = 604 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. 
Identities = 34/264 (12%), Positives = 88/264 (33%), Gaps = 21/264 (7%) Query: 96 IISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNIL 155 S ++ S K++ + YY+ D G L G Sbjct: 196 TASVLDVPRYIHDEGQDTSSKSVVVDWYYKR-----PDETGRMVLHYCKFCNGVVLYASQ 250 Query: 156 CN--------EEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQ 207 + + + PF G + + Q + +N+ Sbjct: 251 NDPALAESGLYDHGQYPFVFDPLFVEEDSPAGFGYIDVMKDCQTAIDKMNHAMDENVLLS 310 Query: 208 NQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQE 267 + + ++ + + ++ E + + + V ++ S + + + S S +E Sbjct: 311 AKQRYVLSDTAGVNEEELADFSRD-IVHVVGRLNDDSFRPLQTAGLQGNSLSYRQSRIEE 369 Query: 268 LVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLI 327 L + +G D++ G A+A + ++++G ++++ + ++ L+ Sbjct: 370 LKEISGNRDMTQG--GTAGGVTAASAIAALQEAGSKLSRDMLKSAYRAFAKQCYLIIELM 427 Query: 328 IQHQDKVRMVRLRDQ-----WVSF 346 Q D+ R+ R+ + +V F Sbjct: 428 RQFYDEQRVFRIVGESGESRFVPF 451 >gi|295103136|emb|CBL00680.1| hypothetical protein [Faecalibacterium prausnitzii SL3/3] Length = 594 Score = 77.6 bits (189), Expect = 2e-12, Method: Composition-based stats. 
Identities = 41/317 (12%), Positives = 110/317 (34%), Gaps = 25/317 (7%) Query: 43 GKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNI 102 G + V +V+ P DI+ SP + + L + ++ Sbjct: 147 GDIAVRSVNLLMLYWEPGVQDIQDSPDLFHLSLEDTARLTAQYPQ----LTGHAAGVVDV 202 Query: 103 ENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCN----- 157 ++K++ + YY+ D +G L + G + Sbjct: 203 PRYIHEDGQTTANKSVVVDWYYKR-----PDENGKLRLHYCKLCNGVVLYASQNDPALAA 257 Query: 158 ---EEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIV 214 + + PF G + + Q + +N+ ++ + ++ Sbjct: 258 RGLYDHGKYPFVFDPLFVEEDSPAGFGYIDVMKDCQNAIDKMNHAMDENVLLASRQRYVL 317 Query: 215 QEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGI 274 + + ++ E + + + V ++ S + + + S S + +EL + +G Sbjct: 318 SDTAGVNEEELADLSRD-IVHVVGRLNEDSFRPLQTAGLQGNSLSYRNSRIEELKEISGN 376 Query: 275 SDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKV 334 D++ G + + A+A + ++++G ++++ + ++ L+ Q D+ Sbjct: 377 RDLTQGGTTGGVTA--ASAIAALQEAGSKLSRDMLKSAYRAFARQCYLIIELMRQFYDEQ 434 Query: 335 RMVRLRD-----QWVSF 346 R+ R+ ++V F Sbjct: 435 RVFRITGQRGESEFVPF 451 >gi|160945640|ref|ZP_02092866.1| hypothetical protein FAEPRAM212_03169 [Faecalibacterium prausnitzii M21/2] gi|158443371|gb|EDP20376.1| hypothetical protein FAEPRAM212_03169 [Faecalibacterium prausnitzii M21/2] Length = 594 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. 
Identities = 41/317 (12%), Positives = 110/317 (34%), Gaps = 25/317 (7%) Query: 43 GKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNI 102 G + V +V+ P DI+ SP + + L + ++ Sbjct: 147 GDIAVRSVNLLMLYWEPGVQDIQDSPDLFHLSLEDTARLTAQYPQ----LAGHAAGVVDV 202 Query: 103 ENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCN----- 157 ++K++ + YY+ D +G L + G + Sbjct: 203 PRYIHEDGQTTANKSVVVDWYYKR-----PDENGKLRLHYCKLCNGVVLYASQNDPALAA 257 Query: 158 ---EEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIV 214 + + PF G + + Q + +N+ ++ + ++ Sbjct: 258 RGLYDHGKYPFVFDPLFVEEDSPAGFGYIDVMKDCQNAIDKMNHAMDENVLLASRQRYVL 317 Query: 215 QEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGI 274 + + ++ E + + + V ++ S + + + S S + +EL + +G Sbjct: 318 SDTAGVNEEELADLSRD-IVHVVGRLNEDSFRPLQTAGLQGNSLSYRNSRIEELKEISGN 376 Query: 275 SDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKV 334 D++ G + + A+A + ++++G ++++ + ++ L+ Q D+ Sbjct: 377 RDLTQGGTTGGVTA--ASAIAALQEAGSKLSRDMLKSAYRAFAKQCYLIIELMRQFYDEQ 434 Query: 335 RMVRLRD-----QWVSF 346 R+ R+ ++V F Sbjct: 435 RVFRITGQRGESEFVPF 451 >gi|295102644|emb|CBL00189.1| hypothetical protein FP2_29200 [Faecalibacterium prausnitzii L2-6] Length = 588 Score = 74.5 bits (181), Expect = 2e-11, Method: Composition-based stats. 
Identities = 26/202 (12%), Positives = 75/202 (37%), Gaps = 5/202 (2%) Query: 143 VIMAGGTGKDNILCNEEWNE--LPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQT 200 V++ + ++ PF G + E Q + Sbjct: 240 VVLYASENDPALAERGFYDHGRYPFVFDALFMEEDSPAGFGYIDVMKECQTAIDKMNHAM 299 Query: 201 LDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSM 260 +N+ ++ + ++ + + ++ E + + I V ++ S + + + S S Sbjct: 300 DENVLLSSRQRYVLSDTAGVNEEELTDLSRD-IIHVVGRLNDDSFRPLQTAGLQGNSLSY 358 Query: 261 LHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILF 320 + +EL + +G D++ G A+A + ++++G ++++ + Sbjct: 359 RNSRIEELKEISGNRDMTQG--GTAGGVTAASAIAALQEAGSKLSRDMLKSAYRAFAKEC 416 Query: 321 RGLLRLIIQHQDKVRMVRLRDQ 342 ++ L+ Q D+ R+ R+ + Sbjct: 417 CLIIELMRQFYDEERIFRITGK 438 >gi|257438498|ref|ZP_05614253.1| hypothetical protein FAEPRAA2165_01042 [Faecalibacterium prausnitzii A2-165] gi|257199077|gb|EEU97361.1| hypothetical protein FAEPRAA2165_01042 [Faecalibacterium prausnitzii A2-165] Length = 578 Score = 74.2 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 72/188 (38%), Gaps = 8/188 (4%) Query: 164 PFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPE 223 PF G + E Q + +N+ ++ + ++ + + ++ E Sbjct: 264 PFVFDPLFMEEDSPAGFGYIDVMKECQTAIDRMNHAMDENVLLASKQRYVLSDTAGVNEE 323 Query: 224 SVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSP 283 + + + VA + S + + + S S + +EL + +G D++ G Sbjct: 324 ELADLSRD-IVHVAGRLGDESFRPLQTAGLQGNSLSYRNSRIEELKEISGNRDMTQG--G 380 Query: 284 EILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRD-- 341 A+A + ++++G ++++ + ++ L+ Q D+ R+ R+ Sbjct: 381 TAGGVTAASAIAALQEAGSKLSRDMLKSAYRAFARECYLIIDLMRQFYDEERVFRVIGPA 440 Query: 342 ---QWVSF 346 ++V F Sbjct: 441 GGREFVPF 448 >gi|332142316|ref|YP_004428054.1| hypothetical genomic island protein [Alteromonas macleodii str. 'Deep ecotype'] gi|327552338|gb|AEA99056.1| hypothetical genomic island protein [Alteromonas macleodii str. 
'Deep ecotype'] Length = 700 Score = 73.4 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 50/370 (13%), Positives = 108/370 (29%), Gaps = 55/370 (14%) Query: 32 HDLRIRR-KYSQGKVCVDAVSPDEFLIHPDSVDIE--KSPIVGRKLYLTRSDLISMG--- 85 +D+R+ + +G++ +D +P + PD+ + K V +++ + Sbjct: 133 YDVRLNHDEVIEGEIAIDTENPIAVIPDPDATSYDPKKWSEVFITRWMSPQQIGEQYGED 192 Query: 86 -------------YDRESINNLPIISSQNIE------------------------NTWKF 108 Y R+SI + E + Sbjct: 193 KRTEVINRAAGAHYGRDSIELSKHTFGSDEETAADTNSIADGATVRNVRVVERQYYKTRI 252 Query: 109 PKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVI-------MAGGTGKDNILCNEEWN 161 + ++ E + E + + +I T +L ++ Sbjct: 253 IQEFIEPRSGETRKIPEQWTPEHIENVRQTFGLEIIKRKKRSVRWTITADSVVLHDDWSP 312 Query: 162 ELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIID 221 FT + L +II Q+ Q L + +V EGS+ + Sbjct: 313 YRTFTVVPYFPIYRRGKPIGLVRNIISPQEFLNKTRSQELHIINTTANSGWLVPEGSLTN 372 Query: 222 PESVLNPQFGKPI--RVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISS 279 + G + I + I P+ + ++ + +G++D Sbjct: 373 MSPEELAEEGAKTGSVITYNPQIGAPEKIKPNPVPTGVDRISTKGAMDIKEISGMNDAIL 432 Query: 280 GFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRL 339 G + + A G Q+++ L ++L +L LI + R+ + Sbjct: 433 GSENAEVSGI---ALQEKTARGQIQLQVPFSNLEFSRKLLAEKILELIQDFYTQERVFFI 489 Query: 340 RDQWVSFDPR 349 D PR Sbjct: 490 TDYMEPEQPR 499 >gi|221271428|dbj|BAH15181.1| portal protein [Serratia phage KSP100] Length = 374 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. 
Identities = 30/153 (19%), Positives = 64/153 (41%), Gaps = 6/153 (3%) Query: 198 RQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKS 257 R +DN+ N + +G D S+L+ + G + A + Sbjct: 2 RGYIDNIMSANYGRFRAVKGQ-YDKRSLLDNRPGGVVEENAIGMVDLFPHHPLP---AGV 57 Query: 258 FSMLHYLDQELVDRTGISDISSGFSPEILQNMTA-TATSLIEQSGVGQVELIVRTLAQ-G 315 S+L ++Q RTG++ I G SPE+ +N + ++ + ++ ++ R +AQ Sbjct: 58 DSILEQIEQAKERRTGVTRIGMGLSPEVFKNDNSFATVDMMMSAAQNRMRMVARNVAQNF 117 Query: 316 LEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDP 348 + LF + RL+ ++++ + + P Sbjct: 118 MTQLFLAIYRLLKENENSTLPIEVNGAMKEVMP 150 >gi|325171218|ref|YP_004251190.1| hypothetical protein ViPhICP2p19 [Vibrio phage ICP2] gi|323512244|gb|ADX87701.1| conserved hypothetical protein [Vibrio phage ICP2] gi|323512316|gb|ADX87772.1| hypothetical protein TU12-16_00090 [Vibrio phage ICP2_2006_A] Length = 581 Score = 72.6 bits (176), Expect = 8e-11, Method: Composition-based stats. Identities = 38/345 (11%), Positives = 94/345 (27%), Gaps = 39/345 (11%) Query: 16 VEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLY 75 VE ++ + +++ D + + P + + +P +VD SP + + Sbjct: 144 VEYVKETTKDEESGATRD-------TYFGPRAVRIDPKDIVFNPVAVDFAHSPKII-RTV 195 Query: 76 LTRSDLISM--------------GYDRESINNLPIISSQNIENTWKFPKNQ----YSDKA 117 L +L+ M RE L + ++ E F + Y Sbjct: 196 LNEGELLQMEQDQPENASLASAIARRREFRRGLGTYTREDCEKAVGFSMDGFGNLYDYFQ 255 Query: 118 LEMIEYYELYVTIDYDGDG--IAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPH 175 +E Y G ++ I+ + + + P Sbjct: 256 SPYVEVLTFYGDYHDTQSGTFKRNMKVTIIDRMFVIEEKENPSWFAQAPIFHCGWRIRQD 315 Query: 176 CFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIR 235 +++ +Q L D P + + P+ Sbjct: 316 NLYAMGPLDNLVGMQYRIDHLENLKADVFDLIAFPPM--------KVKGDVEEFVWGPME 367 Query: 236 VAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATS 295 V + ++ + L+ ++ + G + G TA Sbjct: 368 QIYINGDGDVEMMAPNTQALQADMQIQILEAKMEEFAGAPREAMGIR--TPGEKTAFEVQ 425 Query: 296 LIEQSGVGQVELIVRTLA-QGLEILFRGLLRLIIQHQDKVRMVRL 339 ++ + + + +E + +L + ++ D +R+ Sbjct: 426 
QLQNAAGRIFQEKIMNFEVMLMEKVLNAMLEISRRNLDVADTIRV 470 >gi|239787361|emb|CAX83837.1| Head-to-tail joining protein [uncultured bacterium] Length = 524 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 37/272 (13%), Positives = 79/272 (29%), Gaps = 28/272 (10%) Query: 75 YLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDG 134 +T S + + ++ S + + +K + ++ Y + +D +G Sbjct: 183 EMTISAIRERFPKAQLPESMGRKSKDDADARFKVVEAVLPERHG-----YAYHAILDGEG 237 Query: 135 DGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKT 194 G AE L + PF R ++AP G S + K Sbjct: 238 TGGAET--------------LAEGRFEMSPFINFRWLKAPGEVYGRSPVMKSLPDIKTAN 283 Query: 195 VLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMI 254 ++ L N + + LNP K + G+ + Sbjct: 284 KVVELVLKNATIAVTGIWQADDDGV------LNPANIKLVPGTIIPKAVGSAGL-TPLET 336 Query: 255 EKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATAT-SLIEQSGVGQVELIVRTLA 313 F + + +L R + ++ NMTAT + R + Sbjct: 337 PGRFDISQLMLTDLRQRISHALLADRLGQIDAPNMTATEVLERSAEMARLLGATYGRLQS 396 Query: 314 QGLEILFRGLLRLIIQHQDKVRMVRLRDQWVS 345 + L L + ++ + + + + + Sbjct: 397 ELLTPLVMRAVAILKRRGEIPG-LSIDGHQIE 427 >gi|326203482|ref|ZP_08193346.1| hypothetical protein Cpap_1526 [Clostridium papyrosolvens DSM 2782] gi|325986302|gb|EGD47134.1| hypothetical protein Cpap_1526 [Clostridium papyrosolvens DSM 2782] Length = 660 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. 
Identities = 36/210 (17%), Positives = 64/210 (30%), Gaps = 6/210 (2%) Query: 142 RVIMAGGTGKDNILCNEEWNEL-----PFTCLRAMRAPHCFIGESLAASIIEIQKIKTVL 196 +IMAG P + P F S+ +I IQ+ L Sbjct: 292 HIIMAGDNLLHYGEFIYRVGNDGKYGFPLVMQVCVETPGRFWPVSIIERLIPIQRSFNAL 351 Query: 197 LRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEK 256 + D L + V++ +D + + F + I + I Sbjct: 352 KNRKKDILNRKAIGNWAVEDDGNVDVDDLEEEGFYPGKIHFYSRGGKPPQEIQNRSSITD 411 Query: 257 SFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGL 316 L E +G+S +S P N AT I++S ++ L + Sbjct: 412 FDVEEQRLLDEFTTISGVSPFASQSLPPTGSNSGAT-LEKIKESDDTRIGLTAENINIAA 470 Query: 317 EILFRGLLRLIIQHQDKVRMVRLRDQWVSF 346 ++ LR+ Q R++R + Sbjct: 471 IASYKIDLRMYRQFAKTPRLLRHVGKNDEV 500 >gi|83313332|ref|YP_423596.1| hypothetical protein amb4233 [Magnetospirillum magneticum AMB-1] gi|82948173|dbj|BAE53037.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 545 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 61/206 (29%), Gaps = 9/206 (4%) Query: 145 MAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNL 204 + G D +L +++ PF R ++AP G S + K ++ L N Sbjct: 252 VLDDDGSDLVLGRGQFSSSPFLNFRWLKAPGEVYGRSPVMKALPDIKTANKVVELVLKNA 311 Query: 205 YWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYL 264 + + LNP K + G+ F + Sbjct: 312 TIAVTGIWQADDDGV------LNPANIKLVPGTIIPKAVGSAGLQ-PLTAPGRFDTSQLV 364 Query: 265 DQELVDRTGISDISSGFSPEILQNMTATAT-SLIEQSGVGQVELIVRTLAQGLEILFRGL 323 +L R + + S +TAT + R ++ L L Sbjct: 365 LDDLRGRIRHALMGDKLSQPASPALTATEVLQRADDMARLLGATYGRLQSELLTPLILRA 424 Query: 324 LRLIIQHQDKVRMVRLRDQWVSFDPR 349 + ++ + + +++ + + R Sbjct: 425 IHILRRRGEIP-PLQVDGRTIDLQYR 449 >gi|281357154|ref|ZP_06243643.1| hypothetical protein Vvad_PD2246 [Victivallis vadensis ATCC BAA-548] gi|281316185|gb|EFB00210.1| hypothetical protein Vvad_PD2246 [Victivallis vadensis ATCC BAA-548] Length = 752 Score = 71.1 bits (172), Expect = 2e-10, Method: Composition-based stats. 
Identities = 40/263 (15%), Positives = 78/263 (29%), Gaps = 15/263 (5%) Query: 73 KLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALE----MIEYYELYV 128 ++ D + P+ + + + + + + + Sbjct: 329 VRSMSPVQAAEFYPDSPQV---PLSPEEFEQKLQQAAAEGIAQSEEDAAAVKVRIAGVDG 385 Query: 129 TIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIE 188 D D + V A I C+ GE +A + Sbjct: 386 VEDIDQLEDLKFYEVY-AVVVRNHCIYCSLSAASRYIYSASYRANIDSIWGEGIADLLHH 444 Query: 189 IQKIKTVLLRQTLDNLYWQNQPQTIVQEGS-IIDPESVLNPQFGKPIRVAAGMDIRSVLG 247 +Q+ L+R +NL PQ I+ + + P L K V+ + Sbjct: 445 VQRSVNSLMRSRNNNLALAGAPQVIINTDAVRLKPGEPLQITPFKQWFVSGSGYYGAQKP 504 Query: 248 ---IHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFS--PEILQNMTATATSLIEQSGV 302 + + + L +GI + S G S E TA+ S++ + Sbjct: 505 FELMQIPDVSDSLSRELEKELVFADRISGIPEYSQGVSKGAENGAAGTASGLSMLLDAAS 564 Query: 303 GQVELIVRTLAQGL-EILFRGLL 324 Q++ + + +GL E L R L Sbjct: 565 NQIKDPINNIDEGLYEPLIRDLY 587 >gi|241763592|ref|ZP_04761643.1| conserved hypothetical genomic island protein [Acidovorax delafieldii 2AN] gi|241367185|gb|EER61539.1| conserved hypothetical genomic island protein [Acidovorax delafieldii 2AN] Length = 718 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. 
Identities = 46/360 (12%), Positives = 104/360 (28%), Gaps = 55/360 (15%) Query: 32 HDLRIRRKYS-QGKVCVDAVSPDEFLIHPDSV--DIEKSPIVGRKLYLTRSDLISMGYDR 88 +DLR+ + +G++ + + P + + PD+ D +K V +LT ++ S+ Sbjct: 150 YDLRMNFDKNIKGEIDLATLDPRDVIPDPDAKSYDPDKWADVMVTRWLTLDEIESLYGRN 209 Query: 89 ESINNLPIISSQNIENTWKFPKNQYS--------------------------DKALEMIE 122 + + D+ + E Sbjct: 210 ARDLAEKSGDESSDWGFQDGETERSKFGGIRFPGQYDAFGAHDDGLKRFRVIDRQRFVFE 269 Query: 123 YYELYVT------IDYDGDGIAELRRVIMAGGTGKDNILCNEEW-----------NELPF 165 + V + D + + G + W P+ Sbjct: 270 MTDCLVFPEAGNIVVMDTLSQESIDTALKDGAVKARRMHRRVRWVVATYSTTLFDQYSPY 329 Query: 166 TCLRAMRAPHCFI---GESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDP 222 + F + I Q++ + Q + + V+E S+ + Sbjct: 330 DHFTVIPYFAYFRRGETRGMVDDAIGPQEVLNKAVSQEVHIINTTANSGWTVEENSLTNM 389 Query: 223 ESVL--NPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSG 280 + + + V + I + ++ + L D T + D G Sbjct: 390 STEELNDVGAKTGLIVEYKKGSQRPEKIQPNQVPPGIDKLIAMSTKALKDVT-VPDAMRG 448 Query: 281 FSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLR 340 + + A + + Q+ + + L +L + LL+LI ++ D RM R+ Sbjct: 449 QEGNAVSGI---AKQADQFASQQQLAVPLDNLTYTRNLLAKRLLKLIQRYYDSYRMFRIT 505 >gi|218782387|ref|YP_002433705.1| hypothetical protein Dalk_4559 [Desulfatibacillum alkenivorans AK-01] gi|218763771|gb|ACL06237.1| hypothetical protein Dalk_4559 [Desulfatibacillum alkenivorans AK-01] Length = 704 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. 
Identities = 26/246 (10%), Positives = 76/246 (30%), Gaps = 9/246 (3%) Query: 101 NIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEW 160 + +F + I Y+ + D + + I G + + Sbjct: 273 ETQEWEEFDPENIEQLKVNYILKYKTPFEYNTMMDKKVKWLQFI--GDEILYDGDSPMPY 330 Query: 161 NELPFTC--LRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGS 218 + + + + + Q+ Q L+ L QP T +++G+ Sbjct: 331 DGFSVVTSIANTDPSRRSNNHFGVIRLMKDPQREINKRWSQALNLLNNMVQPGTDIEDGA 390 Query: 219 IIDPESVLNPQF---GKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGIS 275 + D + + G I + + + + M + TGI+ Sbjct: 391 VPDIDQYSEARKTPGGVGIVSSGALRDGKIKERSAPQFPSAPMQMEQMSQDIIRKITGIN 450 Query: 276 DISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVR 335 G + + ++ G+ ++ + + + +F+ ++ +I ++ + Sbjct: 451 PDLLGQ--DSGRQEPGVVVQTRQRQGLILLQKLFKEHKRVRREIFKRVIAIISKYMPDGQ 508 Query: 336 MVRLRD 341 ++R+ Sbjct: 509 ILRILG 514 >gi|169334552|ref|ZP_02861745.1| hypothetical protein ANASTE_00955 [Anaerofustis stercorihominis DSM 17244] gi|169259269|gb|EDS73235.1| hypothetical protein ANASTE_00955 [Anaerofustis stercorihominis DSM 17244] Length = 648 Score = 69.9 bits (169), Expect = 5e-10, Method: Composition-based stats. 
Identities = 48/327 (14%), Positives = 113/327 (34%), Gaps = 22/327 (6%) Query: 35 RIRRKYSQGKVCVDAVSPDEFLIHP-DSVDIEKSPIVGRKLYLTRSDLISM------GYD 87 +I +G + + +SP +F + DIE ++ ++ +M GY+ Sbjct: 183 KINVAIKEGGINYEIISPFDFFPSNVYAKDIESLDYAIWYKVMSVKEIENMFNITVEGYE 242 Query: 88 RESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAG 147 + S+ + YS+ + + E+ + + + R ++ Sbjct: 243 NNVV---SYSKSKTNVGGLGSKGHGYSESSKNIDLSAEVISYFEKPTNRYPKGRYIVCTK 299 Query: 148 GTGKDN-----ILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLD 202 I + ELPF +++ F GES+ +I +Q+ + + + Sbjct: 300 DNVLHMGDLPYINAEDGERELPFVIQKSL-DYGEFFGESIINRLIPLQRRFNNIKNRKQE 358 Query: 203 NLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLH 262 L Q ++GSI D + V++ + + + + S Sbjct: 359 YLNRVAIGQITYEKGSI-DEDDVIDMGLAPGAVIPRRQGSEEPSYLRTPALPSTILSDEK 417 Query: 263 YLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRG 322 ++ + +G+S++S + A SL++ ++ L + + + Sbjct: 418 ATEELFITLSGVSEMSRNSYNPK-NVTSGVALSLLQDQDDTRLALNYENMYDTRIKIAKQ 476 Query: 323 LLRLIIQHQDKVRM---VRLRDQWVSF 346 LR++ R+ V ++ Q V Sbjct: 477 TLRILKNSVTTPRLSKYVDMKGQ-VEV 502 >gi|296537022|ref|ZP_06899017.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] gi|296262651|gb|EFH09281.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] Length = 368 Score = 69.9 bits (169), Expect = 5e-10, Method: Composition-based stats. 
Identities = 28/199 (14%), Positives = 63/199 (31%), Gaps = 9/199 (4%) Query: 145 MAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNL 204 + G+ L + + PF R ++AP G + + ++ L N Sbjct: 148 VLEHDGRAWPLAEGRFQDSPFIAFRWLKAPGEAYGRGPVMKALPDIRTANKVVELVLKNA 207 Query: 205 YWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYL 264 ++ +++P +V + A + + +F + + Sbjct: 208 SIAATGIWQAEDDGVLNPATV-------RLVPGAIIPKAPGSSGLTPLAAPGNFDVSQLV 260 Query: 265 DQELVDRTGISDISSGFSPEILQNMTATAT-SLIEQSGVGQVELIVRTLAQGLEILFRGL 323 +L R + ++ P MTAT Q+ R A+ L L Sbjct: 261 LDDLRGRIRAALLADRLGPPGTAAMTATEVLERSAQTARLLGATYGRLQAELLTPLIGRC 320 Query: 324 LRLIIQHQDKVRMVRLRDQ 342 L ++ + + ++ L + Sbjct: 321 LSILRRRGEVPPLL-LDGR 338 >gi|23015763|ref|ZP_00055531.1| hypothetical protein Magn03010200 [Magnetospirillum magnetotacticum MS-1] Length = 543 Score = 69.5 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 54/191 (28%), Gaps = 8/191 (4%) Query: 145 MAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNL 204 + D +L ++ PF R ++AP G S + K ++ L N Sbjct: 252 VLDDESSDVVLGRGSFSSSPFLNFRWLKAPGEVYGRSPVMKALPDIKTANKVVELVLKNA 311 Query: 205 YWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYL 264 + + LNP K + G+ F + Sbjct: 312 TIAVTGIWQADDDGV------LNPANIKLVPGTIIPKAVGSAGLQ-PLTAPGRFDTSQLV 364 Query: 265 DQELVDRTGISDISSGFSPEILQNMTATAT-SLIEQSGVGQVELIVRTLAQGLEILFRGL 323 +L R + + S ++TAT + R ++ L L Sbjct: 365 LDDLRGRIRHALMGDKLSQPASPSLTATEVLQRSDDMARLLGATYGRLQSELLTPLIMRA 424 Query: 324 LRLIIQHQDKV 334 + ++ + + Sbjct: 425 IHILRRRGEIP 435 >gi|117924319|ref|YP_864936.1| hypothetical protein Mmc1_1012 [Magnetococcus sp. MC-1] gi|117608075|gb|ABK43530.1| conserved hypothetical protein [Magnetococcus sp. MC-1] Length = 671 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. 
Identities = 41/338 (12%), Positives = 107/338 (31%), Gaps = 13/338 (3%) Query: 8 HMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKS 67 L+ V +++ + +G E D + V V + +F+ P + ++ Sbjct: 136 DYLLPGRGVAWVQYRPQIEGSEPGRDGEPVPLITDESVEVVHLHWTDFVHEP-ARHWKEV 194 Query: 68 PIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELY 127 V R++Y+++ LI + L + + + + E ++ Sbjct: 195 TWVARRVYMSKEALIERFGQKGEQVPLAFLP----QGKRNEASMLAAQNRGAVWEIWDRA 250 Query: 128 VTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCL----RAMRAPHCFIGESLA 183 + Y DG + I+ L P + P + + A Sbjct: 251 SSSVYWLDGSDKG---ILLDWEPDPLGLEGFFPCPRPLLATRSTDSMIPVPDYLLYQDQA 307 Query: 184 ASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIR 243 + +I + ++L R + + + + ++ Sbjct: 308 IELDQITERLSLLTRAVKVSGVYNGELGDRIGSLLQSTGNQLIPVDNWALFGERG-GLRG 366 Query: 244 SVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVG 303 + + +++ + + I+ IS + TATA S+ Q G Sbjct: 367 QIEYLPLTDVVQAITVLSSVRESIKSVIYEITGISDIVRGVSKASETATAQSIKSQWGGR 426 Query: 304 QVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRD 341 +++ + + + LFR + ++++H + ++ Sbjct: 427 RLQERQSQVQRFVRDLFRMVGEIMVEHFQPQTIAKMVG 464 >gi|144899435|emb|CAM76299.1| head-to-tail joining protein [Magnetospirillum gryphiswaldense MSR-1] Length = 502 Score = 66.8 bits (161), Expect = 5e-09, Method: Composition-based stats. 
Identities = 30/215 (13%), Positives = 61/215 (28%), Gaps = 9/215 (4%) Query: 136 GIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTV 195 G + ++ + +L + + PF R ++AP G S + K Sbjct: 230 GHYDYAAILEDATDDDEALLAEGRFGQSPFINFRWLKAPGEIYGRSPVMKALPDIKTANK 289 Query: 196 LLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIE 255 ++ L N + + LNP K I G+ Sbjct: 290 VVELVLKNATIAVTGIWQADDDGV------LNPANIKLIPGTIIPKAVGSAGLQ-PLESP 342 Query: 256 KSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATAT-SLIEQSGVGQVELIVRTLAQ 314 F + + +L R + ++ MTAT R ++ Sbjct: 343 GRFDISQLVLDDLRGRIRHALLADKLGQADNPKMTATEVLERSADMARLLGATYGRLQSE 402 Query: 315 GLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 L L + ++ + + ++ + V R Sbjct: 403 LLTPLILRAVTILRRRGEIPPLL-VDGHLVELQYR 436 >gi|171914969|ref|ZP_02930439.1| hypothetical protein VspiD_27370 [Verrucomicrobium spinosum DSM 4136] Length = 711 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 38/305 (12%), Positives = 93/305 (30%), Gaps = 21/305 (6%) Query: 54 EFLIHPDSVDIE--KSPI----------VGRKLYLTRSDLISMGYDRESINNLPIISSQN 101 + P + D++ + + ++ LT ++ + + + + S Sbjct: 278 DIAFDPTAPDLDLHHTDFFHSFTKGVLDIAKEYGLTDAETRELYHAAKERSESLKPESAR 337 Query: 102 IENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNIL------ 155 E+ P + ++ + + + D G + + M + L Sbjct: 338 SESAPADPDQDDPNGSIPNLPVRLIEGFMRVDALGKGQASNIYMVFAPQCEMCLKLDYLG 397 Query: 156 CNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQ 215 +LP R P +G ++Q L + + + P T Sbjct: 398 NITPKGKLPVHAHTINRLPWRIVGRGFFERFDKVQTFVDDLFNRINWHDRKSSDPITGFD 457 Query: 216 EGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHS---VPMIEKSFSMLHYLDQELVDRT 272 + + + + F + D + I + +++ ML + Q + RT Sbjct: 458 KSKLAQEDEEEDEPFNSEKPLNLKPDSKLDEAIQFKALPDLNDRTKEMLQMMVQMVQLRT 517 Query: 273 GISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQD 332 GI+ + G + + TAT + ++ + L + L L+ + D Sbjct: 518 GITAANQGDVAGLPEASTATGIKQLMSRAAVLLKSPIDQLKRSFTCDLEYSLLLLYTNLD 577 Query: 333 KVRMV 337 + Sbjct: 578 EDETF 582 >gi|209966578|ref|YP_002299493.1| 
hypothetical protein RC1_3320 [Rhodospirillum centenum SW] gi|209960044|gb|ACJ00681.1| conserved hypothetical protein [Rhodospirillum centenum SW] Length = 521 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 54/190 (28%), Gaps = 10/190 (5%) Query: 130 IDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEI 189 + D G A V + +L + E PF R M+AP G S + Sbjct: 232 VLPDPGGGACRWAVALEDD--PPVLLAEGRFAEPPFIAFRWMKAPGEVYGRSPVMKALPD 289 Query: 190 QKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIH 249 + ++ L N + + LNP + + A G+ Sbjct: 290 IRTANKVVELVLKNASVAVTGIWQADDDGV------LNPGTIRLVPGAIIPKAVGSAGL- 342 Query: 250 SVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATAT-SLIEQSGVGQVELI 308 + F + + +L + ++ P MTAT + Sbjct: 343 TPLASPGRFDVSQLVLDDLRAHIRHALLADRLGPVQGPRMTATEVLERSAEMARMLGATY 402 Query: 309 VRTLAQGLEI 318 R ++ L Sbjct: 403 GRLQSELLVP 412 >gi|291334833|gb|ADD94473.1| hypothetical protein [uncultured phage MedDCM-OCT-S06-C1041] Length = 110 Score = 59.9 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 11/70 (15%), Positives = 28/70 (40%), Gaps = 2/70 (2%) Query: 10 LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVD--IEKS 67 L+ D +V+ + E ++ + G + ++ V P+EF I ++ +E + Sbjct: 38 LLSDPNVQREIIEDSVEETEFGLNVEFKVIEKMGSIRIEPVPPEEFGIARNARSPYVEDT 97 Query: 68 PIVGRKLYLT 77 + + Sbjct: 98 NFCYHRTLKS 107 >gi|75760981|ref|ZP_00740986.1| Phage protein [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|74491524|gb|EAO54735.1| Phage protein [Bacillus thuringiensis serovar israelensis ATCC 35646] Length = 304 Score = 59.9 bits (143), Expect = 6e-07, Method: Composition-based stats. 
Identities = 29/197 (14%), Positives = 64/197 (32%), Gaps = 9/197 (4%) Query: 42 QGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISM-GYDRESINNLPIISSQ 100 G++ P I P + E+ + + + G D + N+ ++ Sbjct: 116 TGEIRCRICDPLTVYIDPAAEMDEEIRWIVERKPRDIDYIQERYGKDVAADENVGFAAAF 175 Query: 101 NIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEW 160 ++ F + M++ + +V +AGG D +E Sbjct: 176 DVTPQNGFNSTSKKRPNMAMVDEMWVKPC-----GKHPNGLKVTIAGGQLLDI---DENA 227 Query: 161 NELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSII 220 ++PF + P E+ ++ IQ+ ++ + +V GS + Sbjct: 228 GDIPFFIFGDIPIPGSVKAEAFIKDMLPIQREINIMRSMFATHARKMGNSMWLVPMGSSV 287 Query: 221 DPESVLNPQFGKPIRVA 237 D + + N + G I Sbjct: 288 DEDEITNEEGGLFIITN 304 >gi|145642444|ref|ZP_01797998.1| Haemophilus-specific protein, uncharacterized [Haemophilus influenzae R3021] gi|145272864|gb|EDK12756.1| Haemophilus-specific protein, uncharacterized [Haemophilus influenzae 22.4-21] Length = 308 Score = 59.9 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 19/149 (12%), Positives = 48/149 (32%), Gaps = 20/149 (13%) Query: 1 MALNYFIHM---LIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLI 57 + L+Y + +++ V+V+E + I ++ V P +F+ Sbjct: 151 LCLHYAAVLGTGILRAPVVDVVESKAWKQDSLGNWVGEI---VNKTIPAARLVLPWDFVP 207 Query: 58 HPDSVDIEKSPIVGRKLYLTRSDLISM----GYDRESINNL----------PIISSQNIE 103 + ++ V + ++T+ L ++ Y +ES+ L Sbjct: 208 DMTAPTLKDCQFVFERSHVTKKQLQALAKNPYYLKESVLELCELDGGDTRTASNDMDGYV 267 Query: 104 NTWKFPKNQYSDKALEMIEYYELYVTIDY 132 +T + + E + + I Sbjct: 268 DTLRTLSGLETQSKDNRYELWTYHGGIPL 296 >gi|169795385|ref|YP_001713178.1| putative phage related protein [Acinetobacter baumannii AYE] gi|169148312|emb|CAM86177.1| conserved hypothetical protein; putative phage related protein [Acinetobacter baumannii AYE] Length = 547 Score = 59.1 bits (141), Expect = 1e-06, Method: Composition-based stats. 
Identities = 39/239 (16%), Positives = 71/239 (29%), Gaps = 12/239 (5%) Query: 95 PIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMA---GGTGK 151 + + D ++++ E T GD + + A + Sbjct: 189 NEYGENKVSEKVRNTYKSKPDCKVKVLWVVEPRKTGYIKGDRQLMPKEMPFASYHVEVDE 248 Query: 152 DNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQ 211 IL +NE PF R + PH G + + K L+R TL + Sbjct: 249 KIILRETGYNEFPFVIPRFRKIPHSVYGTGQVSIALPDAKTANKLMRDTLRSAEISTLGM 308 Query: 212 TIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDR 271 + +P + + GK I V ++ + + + L ++ Sbjct: 309 YAGVDDGTFNPRT-VRLGGGKIIVVNDVNSLKRIDDGKGYQVGVDLLAHLQGAIRKK--- 364 Query: 272 TGISDISSGFSPEILQNMTATATSLIEQSGVGQVE-LIVRTLAQGLEILFRGLLRLIIQ 329 ++ P MTAT + Q+ L R A+ L L L + Sbjct: 365 ----MMADQLQPADGPAMTATEVHVRVDLIRQQLGPLYGRWQAELLTPLLERTFGLAYR 419 >gi|317009831|gb|ADU80411.1| mosaic CUP1551/CUP0957-like protein [Helicobacter pylori India7] Length = 602 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 33/306 (10%), Positives = 96/306 (31%), Gaps = 26/306 (8%) Query: 40 YSQGKVCVDAVSPDEFLIHPDSVDIE--KSPIVGRKLYLTRSDLISMGYDRESINNLPII 97 + ++ + A+ P+ F+I S D + + L +T + + + +D I N + Sbjct: 132 ENNVEIDIKALKPESFVIDYFSTDKNALDARRFHKMLEITEQEALLL-FDESVIINYSNV 190 Query: 98 SSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNI-LC 156 + + I + E + + E R + G L Sbjct: 191 NHERIAS------------------VIESWYKEFNEETKSYEWNRYLWNRSAGIYKSELK 232 Query: 157 NEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQE 216 + PF + L I +Q + + + +E Sbjct: 233 PFKNGACPFIISKLYTDELNNYY-GLFRDIKPMQDFINYAENRM---GNMMGSFKAMFEE 288 Query: 217 GSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISD 276 +++D + + I + ++ +Q+ ++ Sbjct: 289 DAVVDVAEFVETMSLDNAIAKVRPNALKDHKIQFMNNQADLSALSAKAEQKRQLLRLLAG 348 Query: 277 ISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRM 336 ++ + + A + ++SG+ ++ ++ ++F+ + I ++ K ++ Sbjct: 349 LNDESLGIAVNRQSGVAIAQRKESGLMGLQTFLKATDDMDRLVFKLAISFICEYFTKEQV 408 Query: 337 VRLRDQ 342 ++ D+ Sbjct: 
409 FKIVDR 414 >gi|316933862|ref|YP_004108844.1| hypothetical protein Rpdx1_2520 [Rhodopseudomonas palustris DX-1] gi|315601576|gb|ADU44111.1| hypothetical protein Rpdx1_2520 [Rhodopseudomonas palustris DX-1] Length = 770 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 38/313 (12%), Positives = 99/313 (31%), Gaps = 12/313 (3%) Query: 33 DLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESIN 92 D + + Q KVC++AV +FL + D + VG++ +LT+ ++ + S + Sbjct: 159 DEKTDKAKVQEKVCLEAVHRRDFLHD-LARDWSEVDWVGKRSWLTKLEMRKR-FKPVSGD 216 Query: 93 NLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKD 152 + + + KA +E++ +AE +++ Sbjct: 217 AYQQAAYAVRQQQGDAEADDGKAKAG----VWEIWCKSRNKVVWVAEGCDLVLDEDEPHL 272 Query: 153 NILCNEEWNELPFTCLRA---MRAPHCFIGESLAASIIEIQKIKTVLLRQ-TLDNLYWQN 208 + + L+ + P + I E+ + L + + Y Sbjct: 273 QLEGFFPCPRPAYGTLQPGSLIPVPDYAQYKDQLEEINELTGRISALCQAVRVRGFYPAG 332 Query: 209 QPQT--IVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQ 266 + + + + G +++ + ++ ++ Q Sbjct: 333 AGDLGDAIDTAVNSVDDGQILVPVSNWSLLGNGSPKDTIVWLPLDQVVSTIKELVGMRRQ 392 Query: 267 ELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRL 326 + D I+ +S + + T A L Q G ++ L + L + + Sbjct: 393 LIDDVYQITGLSDIMRGSTVASETLGAQKLKSQYGSVRIRDKQEELVRFARDLTAIVAEI 452 Query: 327 IIQHQDKVRMVRL 339 ++ ++ + Sbjct: 453 AAENFAPQTLLDM 465 >gi|308061501|gb|ADO03389.1| mosaic CUP1551/CUP0957-like protein [Helicobacter pylori Cuz20] Length = 601 Score = 57.2 bits (136), Expect = 3e-06, Method: Composition-based stats. 
Identities = 29/310 (9%), Positives = 88/310 (28%), Gaps = 32/310 (10%) Query: 39 KYSQGKVCVDAVSPDEFLIHPDSVDIE--KSPIVGRKLYLTRSDLISMGYDRESINNLPI 96 K ++ + A+ P+ F+I S D + K+ + + I N Sbjct: 131 KEKNVEIEIKAIKPESFIIDYFSTDKNALDARR-FHKMLEVSEQEALLLFGDSVIINYSF 189 Query: 97 ISSQN----IENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKD 152 ++ + IE+ +K + + + Sbjct: 190 VNHERIASVIESWYKEFNEETKSYEWNRYLWNRSAGIYKAEK------------------ 231 Query: 153 NILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQT 212 + PF + L I +Q + + Sbjct: 232 ---KPFKNGVCPFVVSKLYTDELNNYY-GLFRDIKPMQDFINYAENRM---GNMMGSFKA 284 Query: 213 IVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRT 272 + +E +++D + + I + ++ +Q+ Sbjct: 285 MFEEDAVVDVAEFVETMSLDNAIAKVRPNALKDHKIQFMNNQADLSALSQKAEQKRQLLR 344 Query: 273 GISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQD 332 ++ ++ + + A + +SG+ ++ ++ + ++F+ + I + Sbjct: 345 LLAGLNDESLGMAVNRQSGVAIAQRRESGLMGLQTFLKATDEMDRLVFKLAVSFICDYFT 404 Query: 333 KVRMVRLRDQ 342 K ++ ++ D+ Sbjct: 405 KEQVFKIVDR 414 >gi|152982725|ref|YP_001353895.1| hypothetical protein mma_2205 [Janthinobacterium sp. Marseille] gi|151282802|gb|ABR91212.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille] Length = 685 Score = 56.1 bits (133), Expect = 8e-06, Method: Composition-based stats. 
Identities = 38/328 (11%), Positives = 97/328 (29%), Gaps = 34/328 (10%) Query: 12 KDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVG 71 + + L+ S + E + + + V D+F I + ++ + Sbjct: 81 AEPQDQELQESEADQYEEIAWEQTV----------CERVQWDDFRILGAAKTWDEVCAIA 130 Query: 72 RKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTID 131 K TR D I + ++ + + + + E+ + K E+ E + Sbjct: 131 FKHRFTREDCIEK-FGKD-VGKAITLDNVDDEDVKQSDTTADLFKTAEIWEIW------- 181 Query: 132 YDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLR----AMRAPHCFIGESLAASII 187 + + + K + ++ F + A+ I +L Sbjct: 182 ----NKDDKEVIWICKTYSKPCKIQDDPLQLSGFFPIPRPLYAIENDQSLIPAALYTQYE 237 Query: 188 EIQKIKTVLLRQTLDNLYWQNQPQTIVQEG------SIIDPESVLNPQFGKPIRVAAGMD 241 + K + ++ L + + I + ++ L P Sbjct: 238 QQAKELNRI-SIRINKLIEALKVRGIYDSTLSELSELMKAADNELIPAQNVAAIAERAGL 296 Query: 242 IRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSG 301 +++ + + + DQ I+ I+ T A + Q G Sbjct: 297 DKAIFMMPIETIAAVIKYLYEQRDQTKQVIYEITGIADIMRGATDARETMGAQQIKTQWG 356 Query: 302 VGQVELIVRTLAQGLEILFRGLLRLIIQ 329 +++ + R + + + L R +I + Sbjct: 357 TQRLQRMQREVQRYIRDLIRLKAEIISE 384 >gi|317013629|gb|ADU81065.1| mosaic CUP1551/CUP0957-like protein [Helicobacter pylori Gambia94/24] Length = 603 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. 
Identities = 31/312 (9%), Positives = 94/312 (30%), Gaps = 32/312 (10%) Query: 39 KYSQGKVCVDAVSPDEFLIHPDSVDIE--KSPIVGRKLYLTRSDLISMGYDRESINNLPI 96 K ++ + A+ P+ F+I S D + + L +T + + + + + N Sbjct: 131 KEKNVEIDIKALKPESFVIDYFSTDKNALDARRFHKMLEITEQEALLL-FGESVMVNYSS 189 Query: 97 ISSQNI----ENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKD 152 + + I E+ +K + + Sbjct: 190 ANHERIASVIESWYKEYNQNSQSYEWNRYLWSRSAGIYKSE------------------- 230 Query: 153 NILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQT 212 L + PF + L I +Q + + Sbjct: 231 --LKPFKSGACPFIVSKLYTDELNNYY-GLFRDIKPMQDFINYAENRM---GNMMGSFKA 284 Query: 213 IVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRT 272 + +E +++D + + I + ++ +Q+ Sbjct: 285 MFEEDAVVDVAEFVETMSLDNAIAKVRPNALKDHKIQFMNNQADLSALSQKAEQKRQLLR 344 Query: 273 GISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQD 332 ++ ++ + + A + ++SG+ ++ ++ + ++F+ + I ++ Sbjct: 345 LLAGLNDESLGMAVNRQSGVAIAQRKESGLMGLQTFLKATDEMDRLIFKLAVSFICEYFT 404 Query: 333 KVRMVRLRDQWV 344 K ++ ++ D+ V Sbjct: 405 KEQVFKIVDRKV 416 >gi|15320615|ref|NP_203459.1| virion structural protein [Myxococcus phage Mx8] gi|15281725|gb|AAK94380.1|AF396866_45 virion structural protein [Myxococcus phage Mx8] Length = 663 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. 
Identities = 43/341 (12%), Positives = 108/341 (31%), Gaps = 8/341 (2%) Query: 11 IKDSDVEVLEHSHREDGG-EKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPI 69 ++ +V ++ E G E + ++ + V D + + L P + + Sbjct: 141 VEWEEVAGVDAILDEATGAELAAAVPPTQRKAYECVETDYLHWQDVLWSP-ARVWHEVRW 199 Query: 70 VGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVT 129 + + L + + +D + NL + + K+ S + E +E++ Sbjct: 200 LAFRNLLDMREFNAR-FDADGSRNLWASVPKVGKPK--DGKDGQSCHPWDRAEVWEIWDK 256 Query: 130 IDYDGDGIAELRRVIMAGGTGKDNILCNEEWNEL---PFTCLRAMRAPHCFIGESLAASI 186 D E ++ + + +T + + P + + L I Sbjct: 257 GGRKVDWYVEGYSAVLDTQPDPLGLESFFPCPKPLLANWTTDKVVPRPDFVLAQDLYKEI 316 Query: 187 IEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVL 246 + T+L R + + ++ L P G V Sbjct: 317 DLVSTRITLLERAIRVVGVYDKSSGLTIGRLLSEAAQNDLIPVENWLTFADKGGLRGVVD 376 Query: 247 GIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVE 306 P++ S+ Y + + ++ ++ TA A + + G +++ Sbjct: 377 WFPLEPVVAALTSLRDYRRELVDALHQVTGMADIMRGASDPRETAMAQGVKAKFGSIRLQ 436 Query: 307 LIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFD 347 + +A+ + R +I +H D ++ + +FD Sbjct: 437 RLQDEVARFASDIQRLKAEVIAEHYDVASILAQANAEFTFD 477 >gi|226227231|ref|YP_002761337.1| hypothetical protein GAU_1825 [Gemmatimonas aurantiaca T-27] gi|226090422|dbj|BAH38867.1| hypothetical protein [Gemmatimonas aurantiaca T-27] Length = 799 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. 
Identities = 40/189 (21%), Positives = 72/189 (38%), Gaps = 7/189 (3%) Query: 156 CNEEWNELPFTCLRAMRAP--HCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTI 213 EE +LP R P G S + + + L+ LY N P T Sbjct: 378 EREEPLDLPVAQCRFFEDPADQDPYGLSPVEWLAPMDEAVATQTIAWLEYLYRFNHPNTF 437 Query: 214 VQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTG 273 + GS+I P LN + G PIR A I + P +S +++ + + +G Sbjct: 438 LPLGSVIQPGQ-LNIRDGTPIRYNAAAGKLEYESIPTFP--SESTALIDKYEAWMRTLSG 494 Query: 274 ISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDK 333 + + + G + +++ I + + + +V + L R L+LI H Sbjct: 495 LENAARGVADPSVKS--GIHAERIIEQALVALTQVVSNVQDFLLRRGRIRLQLIATHYTA 552 Query: 334 VRMVRLRDQ 342 R++R+ Sbjct: 553 PRLLRINGD 561 >gi|291336985|gb|ADD96509.1| hypothetical protein [uncultured organism MedDCM-OCT-S11-C235] Length = 694 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 45/369 (12%), Positives = 87/369 (23%), Gaps = 64/369 (17%) Query: 12 KDSDVEVLEHSHREDGGEKVHDLRIR------RKYSQGKVCVDAVS-----------PDE 54 D V V+ + V D +R RK +G+V V V ++ Sbjct: 186 NDDAVFVMAQNLLATQNYTVSDAEVRALISDLRKKGEGRVTVPMVHKDRPTVVALKVGED 245 Query: 55 FLIHPDSVDIEKSPIVGRKLYLTRSDLIS-------MGYDRESINNLPIISSQ-----NI 102 F D+ DI+K+ + + Y+T + E + + Sbjct: 246 FFAPADTTDIQKARRLYYRQYMTAEQIQDAVVSQDWDKRWAEEVIESAKGNMTSGNFLEN 305 Query: 103 ENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIM---------AGGTGKDN 153 Q + E + G+ + +I + Sbjct: 306 TTNRSKRPGQLDLDTENLYEVVHAFERRVDPKTGVPGIYIIIFSPHLMSDESGEEIVAKH 365 Query: 154 ILCNEEWNELPFTCLRAMRAPHCFIG-ESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQT 212 L N ++PF +R Q+ + +D Y P Sbjct: 366 ELLNYGHCQMPFVLMRREFLSRRVDDSRGYGEIAHTWQRQIKMEWDGRVDRSYLATMPPL 425 Query: 213 IVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRT 272 + G P + M S S + + + Sbjct: 426 MHPFGRAPV--------KWGPGVMVPRMRADDYQYAESPKYDSGSKEIEESIRKTADRYF 477 Query: 273 GISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQD 332 G + + +VR + + +L+L Q Sbjct: 478 GRPVE-----------------EANVAYAQMRQQNMVRKWLDHWREVTQQVLQLCQQFLP 520 
Query: 333 KVRMVRLRD 341 + R+ Sbjct: 521 EPFYFRVVG 529 >gi|154175505|ref|YP_001408187.1| hypothetical protein CCV52592_0386 [Campylobacter curvus 525.92] gi|112802353|gb|EAT99697.1| hypothetical protein CCV52592_0386 [Campylobacter curvus 525.92] Length = 576 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 30/304 (9%), Positives = 83/304 (27%), Gaps = 25/304 (8%) Query: 40 YSQGKVCVDAVSPDEFLIHPDS--VDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPII 97 + + V + D F I P S D + L+SM ++ + Sbjct: 138 KKEKAITVSTIPSDMFYIDPYSCEEDASDAKYFI--------KLMSMDFEDAKV------ 183 Query: 98 SSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCN 157 K + + + YE ++ + G T Sbjct: 184 ---YFGQKANALKLNIISRYRKRVNIYEFWIKEPDSQSQNGYTWNRYIMGDTLVLLRYEK 240 Query: 158 EEW--NELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQ 215 + PF + ++ + + + + + Sbjct: 241 SPFANGMHPFAVCKLKIDDENRWY-GFFRNLKPQIDFINFAENRMAN---MIGSSKILYE 296 Query: 216 EGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGIS 275 ++ D ++ V + I V + ++ + +S Sbjct: 297 SDAVDDADTFAKEINIDNAVVRVKNGALADKKIEIVNNQPQISNLSAKVADARATAQRLS 356 Query: 276 DISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVR 335 ++ + ++ +A +G+ ++ + A +++F + LI ++ D + Sbjct: 357 GLNDETLGLAVNRLSGSAIEQRNNAGIVSLQGFLSASAAMDKMIFLKAIDLITRYFDAEQ 416 Query: 336 MVRL 339 + R+ Sbjct: 417 VFRI 420 >gi|299534277|ref|ZP_07047626.1| hypothetical protein CTS44_25721 [Comamonas testosteroni S44] gi|298717735|gb|EFI58743.1| hypothetical protein CTS44_25721 [Comamonas testosteroni S44] Length = 724 Score = 53.7 bits (127), Expect = 4e-05, Method: Composition-based stats. 
Identities = 30/318 (9%), Positives = 77/318 (24%), Gaps = 27/318 (8%) Query: 52 PDEFLIHPDSVDI--EKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFP 109 P I P + + + + + + + Sbjct: 156 PLSVYIDPFAQCPVASDMRYCFLTDLIPTEQFKREYPNAKVTDGVEWQGVGDTYKQGWVR 215 Query: 110 KNQYSDKALEMIEYYELYVTIDYDG------------------DGIAELRRVIMAGGTGK 151 + I + + DG + R+V A TG Sbjct: 216 DDGIIVAEYYRIVLTSDTLVLMQDGSTAWKSDLSEDAKAVSAKTRPSMRRKVKWAKITGC 275 Query: 152 DNILCNE-EWNELPF--TCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQN 208 D + E + +P + + + + + ++ + + + + Sbjct: 276 DVLEEAEIPGSWIPVFPVYGQELDVEGQVHRWGVIRNAKDPARMYNFWMTSATEEVAMRP 335 Query: 209 QPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMD----IRSVLGIHSVPMIEKSFSMLHYL 264 + + +G E + PM + +L Sbjct: 336 KTPWVGAKGQFEGVEQQWTNANRSSQAYLEYEPVSLNGQLAPPPQRQPMADVPVGVLQMA 395 Query: 265 DQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLL 324 + + + + A ++ G V L + ++ R L+ Sbjct: 396 MHARDNLKSTTGLYDASLGAQGNETSGRAILARQKEGDTANYHFVDNLNRAIKHCGRVLV 455 Query: 325 RLIIQHQDKVRMVRLRDQ 342 +I D R++R+R + Sbjct: 456 EMIPHIYDGERVIRIRGE 473 >gi|307564867|ref|ZP_07627392.1| conserved hypothetical protein [Prevotella amnii CRIS 21A-A] gi|307346403|gb|EFN91715.1| conserved hypothetical protein [Prevotella amnii CRIS 21A-A] Length = 658 Score = 53.4 bits (126), Expect = 5e-05, Method: Composition-based stats. 
Identities = 33/268 (12%), Positives = 81/268 (30%), Gaps = 17/268 (6%) Query: 90 SINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGT 149 I + N + + + E + + +D + G Sbjct: 300 KIEEEDFEKEVTLVNAQRMQMAEATGMPPEEVPLVKATWFMD----DYWYFYYLTPFGDI 355 Query: 150 GKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQ 209 K+ E P+ +A I S A +I+ Q+ L+ + + Sbjct: 356 LKEG-ETPFEHGSHPYV-FKAYPFIDGEIH-SFVADVIDQQRYTNRLITLYDWIMRASAK 412 Query: 210 PQTIVQEGSIIDPESVLNP-----QFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYL 264 ++ E S+ D S+ + +F I + + + +L+ Sbjct: 413 GVLLMPEDSLPDGVSMEDIAESWAEFNGVIVFKPSKSGQIPHQVANNSTNIGITELLNLQ 472 Query: 265 DQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLL 324 + D +G++ G +A + Q+ + ++ + + + Sbjct: 473 LKFFEDISGVNGALQG--KPGYAGTSAAKYNQETQNATMSLLDMLECFSYFVVDGAYKDV 530 Query: 325 RLIIQHQDKVRMVRLRDQ---WVSFDPR 349 + I Q D R+ + + + +DP+ Sbjct: 531 KNIQQFYDGKRVFNIAGKTSAQIEYDPK 558 >gi|325971684|ref|YP_004247875.1| hypothetical protein SpiBuddy_1857 [Spirochaeta sp. Buddy] gi|324026922|gb|ADY13681.1| hypothetical protein SpiBuddy_1857 [Spirochaeta sp. Buddy] Length = 571 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. 
Identities = 41/349 (11%), Positives = 108/349 (30%), Gaps = 37/349 (10%) Query: 5 YFIHMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSV-D 63 Y + L V + V+D G + ++P +F I ++ Sbjct: 145 YPLDKLATKDAVVQGTSAEW------VYD-----DVESGTCVFETIAPWDFWIDKNANGK 193 Query: 64 IEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEY 123 I+ + + +T +D + D+ P +++E ++++ + Sbjct: 194 IDT---IFIRFTMTSADALDRFKDK-----TPPNILRDVETDAGHNEHEFVLAIYPRKKL 245 Query: 124 YELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLA 183 + E + +D I+ +++ P + G L Sbjct: 246 RSEKGKVLI----STEKPFAAVTYYPVEDCIVEESGYDDFPVAVHVFEQDGTSAYGMGLV 301 Query: 184 ASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIR 243 + K + R L+ + +P + E + Sbjct: 302 MKYLTELKRLNSMSRDHLETVQKVAKPPMSIPESLKGRFSGDPGARNYMG------NMDA 355 Query: 244 SVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVG 303 I +V I + L++++ + + + +TAT T I+ + Sbjct: 356 KPEIIQTVQDIGWLSQEITELEEKIGRLFFNDLFNYLMRQD--KVLTATQTQAIKSEELA 413 Query: 304 QVELIVRT-----LAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFD 347 + I+ T + ++ +FR +++ + ++R+++ + D Sbjct: 414 LLASILGTTQYMKINPIVKRVFRIMVKGNRLPKPPKELLRIKNALMRID 462 >gi|109948103|ref|YP_665331.1| mosaic CUP1551/CUP0957-like protein [Helicobacter acinonychis str. Sheeba] gi|109715324|emb|CAK00332.1| conserved hypothetical mosaic CUP1551/CUP0957-like protein [Helicobacter acinonychis str. Sheeba] Length = 600 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. 
Identities = 29/309 (9%), Positives = 88/309 (28%), Gaps = 30/309 (9%) Query: 39 KYSQGKVCVDAVSPDEFLIHPDSVDIE--KSPIVGRKLYLTRSDLISMGYDRESINNLPI 96 + ++ + A++P+ F+I S D + + L +T + + + D ++ Sbjct: 131 EEKNIEIGIKALNPESFIIDHFSTDKNALDARRFHKMLEITEQEALLLFGDSVMVDYSNR 190 Query: 97 ISS---QNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDN 153 IE+ +K + + Sbjct: 191 HHERIASVIESWYKEYDKEKKSYEWNRYL---------------------WSRNAGVYKS 229 Query: 154 ILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTI 213 PF + L I +Q + + + Sbjct: 230 ERRPFSNGACPFIVAKLYMDECNHYY-GLFRDIKPMQDFINYAENRM---GNMMGSFKAM 285 Query: 214 VQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTG 273 +E +++D + + I + ++ +Q+ Sbjct: 286 FEEDAVVDIAEFVETMSLDNAIAKVRPNALKENKIQFMNNQADLSALSQKAEQKRQLLRL 345 Query: 274 ISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDK 333 ++ ++ + + A + +SG+ ++ ++ ++FR + I ++ K Sbjct: 346 LAGLNDESLGMAVNRQSGVAIAQRRESGLMGLQSFLKATDDMDRLVFRLAVSFICEYFKK 405 Query: 334 VRMVRLRDQ 342 ++ ++ D+ Sbjct: 406 EQVFKIVDR 414 >gi|238801662|ref|YP_002922718.1| gp46 [Burkholderia phage BcepIL02] gi|237688037|gb|ACR15039.1| gp46 [Burkholderia phage BcepIL02] Length = 775 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. 
Identities = 28/198 (14%), Positives = 65/198 (32%), Gaps = 12/198 (6%) Query: 156 CNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQ 215 N PFT + R + + + +Q L + L Y + + +++ Sbjct: 343 SPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKAL---YILSANKVMME 399 Query: 216 EGSIIDPES-VLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGI 274 EG++ D E + V + +V + + Q + G+ Sbjct: 400 EGAVDDIEEFRREIARPDSVNVVKNGKLGAVKLDVDRDLAPAHLELASRSIQMIQQVGGV 459 Query: 275 SDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKV 334 +D G + + + A ++ G + L + L LI Q+ + Sbjct: 460 TDEMLGRTTNAVSGV---AIQARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQYMTEE 516 Query: 335 RMVRLRD-----QWVSFD 347 + R+ + ++V+ + Sbjct: 517 KQFRITNSRGNPEYVAVN 534 >gi|298385365|ref|ZP_06994923.1| hypothetical protein HMPREF9007_02030 [Bacteroides sp. 1_1_14] gi|298261506|gb|EFI04372.1| hypothetical protein HMPREF9007_02030 [Bacteroides sp. 1_1_14] Length = 656 Score = 49.5 bits (116), Expect = 7e-04, Method: Composition-based stats. Identities = 31/268 (11%), Positives = 80/268 (29%), Gaps = 17/268 (6%) Query: 90 SINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGT 149 I+ EN + + E + + +D + G Sbjct: 298 KIDEEDYAQVVLAENEERMRMAKEVGMPEEEVPLIKATWFVD----DYWYFYYLSPFGD- 352 Query: 150 GKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQ 209 E P+ +A I S A +I+ Q+ L+ + + Sbjct: 353 ILREGETPYEHGSHPYV-FKAYPFIDGEIH-SFVADVIDQQRYTNRLITLYDWIMRASAK 410 Query: 210 PQTIVQEGSIIDPESVLNP-----QFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYL 264 ++ E S+ D S+ + +F I + + + +L+ Sbjct: 411 GVLMMPEDSLPDGVSIDDIAESWTEFNGVIVYRPSKSGKVPEQVANNSTNIGIAELLNMQ 470 Query: 265 DQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLL 324 + D +G++ G +A+ + ++ + ++ + + + Sbjct: 471 LKFFEDISGVTGALQG--KPGYSGESASHYNQQTENATKSLLDLLECFSCFVVDGAYKDV 528 Query: 325 RLIIQHQDKVRMVRLRDQ---WVSFDPR 349 + + Q D R+ + + + +DP+ Sbjct: 529 KNMQQFYDTKRVFNIAGRSGAQIEYDPK 556 >gi|221633562|ref|YP_002522788.1| phage domain-containing protein [Thermomicrobium roseum DSM 5159] 
gi|221156112|gb|ACM05239.1| phage domain protein [Thermomicrobium roseum DSM 5159] Length = 429 Score = 49.5 bits (116), Expect = 7e-04, Method: Composition-based stats. Identities = 42/295 (14%), Positives = 89/295 (30%), Gaps = 48/295 (16%) Query: 35 RIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSP---IVGRKLYLTRSDLISMGYDRESI 91 ++ ++ + V + P + + + + + V + L + L E++ Sbjct: 106 KVTWDAARRRPRVTPIDPAQLV---AATRPDDAREVVAVAHEYPLEPAAL-------EAV 155 Query: 92 NNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGK 151 L + + + AE R+++AG T Sbjct: 156 FGL-------------------------RLPRLGPEGWVTVREEWTAERYRLLVAGETVH 190 Query: 152 DNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQ 211 D + +P+ + AP GES A ++++ + L L P Sbjct: 191 D---DANPYGWIPYVLVPNSPAPGGPWGESDLADLLDVCRELNRRLTVLSRILQVSGNPI 247 Query: 212 TIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDR 271 +++ + + + +L + S + + L Q L D Sbjct: 248 VVLENVTASEGIRAEEGAVWEL----PEGSRAYLLDMLSGGGVALHLEYVRLLFQVLHDL 303 Query: 272 TGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRL 326 + + G L A L Q V +VE R+ + L + +L L Sbjct: 304 AEVPRAAFGDHGRDLSG---AALELELQPLVHKVERKRRSWERALRQRAQRVLDL 355 >gi|269836055|ref|YP_003318283.1| phage portal protein, SPP1 [Sphaerobacter thermophilus DSM 20745] gi|269785318|gb|ACZ37461.1| phage portal protein, SPP1 [Sphaerobacter thermophilus DSM 20745] Length = 452 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. 
Identities = 28/198 (14%), Positives = 62/198 (31%), Gaps = 10/198 (5%) Query: 134 GDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIK 193 D AE R +AG + +P+ + PH GES A ++++ + Sbjct: 190 EDWTAERVRFEVAG---VIVRDEPNPYGWIPYVIFPNIAKPHSLWGESDLADLLDVCREL 246 Query: 194 TVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPM 253 + L P +++ + + G + + + Sbjct: 247 NRRMTVISRILQVSGNPIVVLEN---VTGSDGIRADEGAVWELPEDSKAYLLDMLS---- 299 Query: 254 IEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLA 313 + Y++ +++ + +N++ TA + Q V +V+ R Sbjct: 300 GGGVRLHIDYVELLYRALYDLAETPRSAFGDSGRNLSGTALEVEIQPLVQKVQRKRRVWD 359 Query: 314 QGLEILFRGLLRLIIQHQ 331 R LL L+ + Sbjct: 360 SVYRRRNRMLLDLMERFG 377 >gi|147668978|ref|YP_001213796.1| phage portal protein, SPP1 [Dehalococcoides sp. BAV1] gi|146269926|gb|ABQ16918.1| phage portal protein, SPP1 [Dehalococcoides sp. BAV1] Length = 454 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 64/199 (32%), Gaps = 16/199 (8%) Query: 156 CNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQ 215 + +PF +R P F G S I+E Q+ L Q L P +++ Sbjct: 197 KPNPYGFIPFVIFPNLREPKRFWGVSDLDEIMEPQRELNRALSQLSRILELSGNPIAVLE 256 Query: 216 EGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGIS 275 ++ + + G + +L + + + + L + L D Sbjct: 257 N---VEQSEDIAVRPGAVWNL-PEDTRAYLLDLLQGGGVGLHINYVDLLYRTLHDIAEAP 312 Query: 276 DISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVR 335 + G S L A L + ++R++A + +LRL+ ++ R Sbjct: 313 RAAFGGSGRDLSG-VALEIELQPLLQRVWRKRLIRSVA-YRKRSG-MILRLLEKY----R 365 Query: 336 MVRLRD-----QWVSFDPR 349 + + PR Sbjct: 366 GLDFNGVDPSISFSPVLPR 384 >gi|160700609|ref|YP_001552284.1| hypothetical protein BA3_0015 [Thalassomonas phage BA3] gi|157787728|gb|ABV74300.1| hypothetical protein BA3_0015 [Thalassomonas phage BA3] Length = 711 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. 
Identities = 34/312 (10%), Positives = 81/312 (25%), Gaps = 28/312 (8%) Query: 57 IHPDS--VDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYS 114 I PD+ D +++ ++ D + + + Sbjct: 203 IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSE 262 Query: 115 DKALEMIEYYELYV------TIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNEL----- 163 E + + +D D + EL ++ + W ++ Sbjct: 263 YFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANV 322 Query: 164 -------PFTCLRAMRAPHC-------FIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQ 209 P T + + I S+ + Q++ + + + Sbjct: 323 LEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPK 382 Query: 210 PQTIVQEGSIIDPESVLNPQFGKPIRV-AAGMDIRSVLGIHSVPMIEKSFSMLHYLDQEL 268 I EG++ E K + + G P + L + Sbjct: 383 APFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSV 442 Query: 269 VDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLII 328 + + + A ++ G + L + + + + L+ +I Sbjct: 443 EKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIP 502 Query: 329 QHQDKVRMVRLR 340 D R+VRL+ Sbjct: 503 HIYDTERVVRLK 514 >gi|270307724|ref|YP_003329782.1| phage domain protein [Dehalococcoides sp. VS] gi|270153616|gb|ACZ61454.1| phage domain protein [Dehalococcoides sp. VS] Length = 454 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. 
Identities = 30/181 (16%), Positives = 57/181 (31%), Gaps = 18/181 (9%) Query: 156 CNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQ 215 + +PF +R P F G S I+E Q+ L Q L P +++ Sbjct: 197 KPNPYGFIPFVIYPNLREPKRFWGVSDLDEIMEPQRELNRALSQLSRILELSGNPIAVLE 256 Query: 216 EGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGIS 275 ++ + + G + +L + + + + L + L D Sbjct: 257 N---VEQSEDIAVRPGAVWNL-PEDTRAYLLDLLQGGGVGLHINYVDLLYRTLHDIAEAP 312 Query: 276 DISSGFSPEILQNMTATATS------------LIEQSG-VGQVELIVRTLAQGLEILFRG 322 + G S L + A LI + + +I+R L + F G Sbjct: 313 RAAFGGSGRDLSGI-ALEIELQPLLQRVWRKRLIRSAAYRKRSAMILRLLEKYRGQDFSG 371 Query: 323 L 323 + Sbjct: 372 V 372 >gi|264677592|ref|YP_003277498.1| hypothetical protein CtCNB1_1456 [Comamonas testosteroni CNB-2] gi|262208104|gb|ACY32202.1| hypothetical protein CtCNB1_1456 [Comamonas testosteroni CNB-2] Length = 543 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 52/186 (27%), Gaps = 14/186 (7%) Query: 164 PFTCLRAMRAPHCFI-------GESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQE 216 PF R + P + +IQ + L Y + + I ++ Sbjct: 145 PFKHNRFLMVPIWGYRRARDGLAYGAWRGMRDIQDDLNKRRSKAL---YALSVNRIIAEK 201 Query: 217 GSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISD 276 G++ D + + + + R + + + + + Q+ Sbjct: 202 GAVDDWDDLRD----EAARPDGIIIKNPQRELKFDNNMGDFQANVELAAQDAQLIRNAGG 257 Query: 277 ISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRM 336 ++ + A + G L + + L I Q + ++ Sbjct: 258 VTDENLGRDTNANSGRAILAKQDQGSLTTSEFFDNLLLAIRQAGQLRLSHIEQFYTEEKV 317 Query: 337 VRLRDQ 342 +R+ + Sbjct: 318 IRIVGE 323 >gi|237750676|ref|ZP_04581156.1| conserved hypothetical protein [Helicobacter bilis ATCC 43879] gi|229373766|gb|EEO24157.1| conserved hypothetical protein [Helicobacter bilis ATCC 43879] Length = 556 Score = 46.0 bits (107), Expect = 0.008, Method: Composition-based stats. 
Identities = 30/312 (9%), Positives = 86/312 (27%), Gaps = 33/312 (10%) Query: 44 KVCVDAVSPDEFLIHPDSVDIE--KSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQN 101 K+ + + + I P S + K+ + D++ + L + Sbjct: 140 KITIKHIPINALYIDPYSQKEDGSDCKYY-HKV---------LYNDKDDMIELYGKREYD 189 Query: 102 IENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWN 161 I N + E + Y+E +V + R I + Sbjct: 190 I------INNVGMNAYRERVRYFESFVL----NPKTRKYDRFIWDKTGIMQTDTSIFDLR 239 Query: 162 ELPFTCLR-AMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSII 220 P + + + + F + ++ Q + + + + + ++ Sbjct: 240 HCPIVIRKLYVDSANAFY--GIFRNVKPHQDYVNFAENRMAN---MLGSQKILYEMSAVD 294 Query: 221 DPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSG 280 + E V S I S+ ++ + + Sbjct: 295 NAEEFSKHVSLDNAVVGVRDGALSSSKIQFQNHSNDVASLSSKSNEHRQIARMQAGFNDE 354 Query: 281 FSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLR 340 ++ + +G+ ++ + + +F L I ++ DK ++ R+ Sbjct: 355 ALGQVTSRASGVVVQQRTNAGLMGIQRFLTASDLFDKSVFSVCLEYITKYFDKAQVFRIV 414 Query: 341 DQ-----WVSFD 347 ++ + + Sbjct: 415 EEDTFENYFEIN 426 >gi|302339294|ref|YP_003804500.1| head-to-tail joining protein [Spirochaeta smaragdinae DSM 11293] gi|301636479|gb|ADK81906.1| head-to-tail joining protein, putative [Spirochaeta smaragdinae DSM 11293] Length = 560 Score = 45.7 bits (106), Expect = 0.010, Method: Composition-based stats. 
Identities = 21/185 (11%), Positives = 45/185 (24%), Gaps = 13/185 (7%) Query: 148 GTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQ 207 G ++ + + LP+ R G + K L R L Sbjct: 238 EGGSNHKIRERGYERLPYVVWRWSTNSDEVYGRGPGYDALVDVKRLNRLSRDMLKQSQMA 297 Query: 208 NQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQ- 266 P V E G ++ + + + + Sbjct: 298 VDPPLAVPEKMRGKVNW---VPRGLNYYQNPNEVPVALNPGMQFQVGLDREQHMQQIIEK 354 Query: 267 -ELVDRTGISDISSGFSPEILQNMTATA-TSLIEQSGVGQVELIVRTLAQGLEILFRGLL 324 + D + + MTAT + +I R ++ L+ + Sbjct: 355 HFMTDFF-------LMLEQAPKEMTATEVMERQSEKAAVLGTVIGRISSEFLDPIIDITF 407 Query: 325 RLIIQ 329 + ++ Sbjct: 408 DIAMK 412 >gi|38640357|ref|NP_944280.1| Bcep22gp51 [Burkholderia phage Bcep22] gi|33860424|gb|AAQ54984.1| Bcep22gp51 [Burkholderia phage Bcep22] Length = 776 Score = 45.7 bits (106), Expect = 0.010, Method: Composition-based stats. Identities = 33/343 (9%), Positives = 92/343 (26%), Gaps = 64/343 (18%) Query: 63 DIEKSPIVGRKLYL-----------TRSDLISMGYDR------ESINNLPIISSQNIENT 105 D++ + R ++ + L + D + I+ + S E + Sbjct: 200 DMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERS 259 Query: 106 WKFPKNQYSDKALEMIEYYELYVTIDY-----------------DGDGIAELRRVIMAGG 148 A + + E + + D + + V Sbjct: 260 MNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRA 319 Query: 149 TGKDNILCNEEWNEL-----------PFTC----LRAM---RAPHCFIGESLAASIIEIQ 190 + + + P+ + R + + + +Q Sbjct: 320 VLAVSPMMRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQ 379 Query: 191 KIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHS 250 L + L Y + + +++EG++ D + + + + Sbjct: 380 DDVNKRLSKAL---YILSTNKVLMEEGAVDDIDEFRREAARPDAVMTVKNGKLGAVKMDV 436 Query: 251 -VPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIV 309 + + Q + G++D G + + + A ++ G + Sbjct: 437 DRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVSGV---AIQARQEQGSVATNKLF 493 Query: 310 RTLAQGLEILFRGLLRLIIQHQDKVRMVRLRD-----QWVSFD 347 L + L LI Q+ + + R+ + ++V+ + Sbjct: 494 DNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVN 536 
>gi|190573931|ref|YP_001971776.1| hypothetical protein Smlt1958 [Stenotrophomonas maltophilia K279a] gi|190011853|emb|CAQ45473.1| putative phage protein [Stenotrophomonas maltophilia K279a] Length = 723 Score = 45.3 bits (105), Expect = 0.013, Method: Composition-based stats. Identities = 28/216 (12%), Positives = 60/216 (27%), Gaps = 9/216 (4%) Query: 130 IDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEI 189 + E+ I G P+T R + L + + Sbjct: 326 YSLSDAVVEEMWCAIFTEGGLLQLKRSPFRHGRFPYTPYWCYRRNRDGMEYGLVRGVRDS 385 Query: 190 QKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVL---NPQFGKPIRVAAGMDIRSVL 246 Q+ + + L + + Q +EG+I + + + + Sbjct: 386 QEDLNKRMSKLL---WALSTNQLFYEEGAIDEDRIEEVKREIAKPNGVIPLKNNGLDRIK 442 Query: 247 GIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVE 306 ++ + E +L + D TG++ G A +Q G Sbjct: 443 VERNLDVAEAQIKLLELDAAHIHDGTGVNRELLGRETNAASGR---AILAKQQEGAVSTA 499 Query: 307 LIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQ 342 + G+++ L L Q + R R+ + Sbjct: 500 ELFDNYRLGIQLSGEKQLSLTEQFMTEERQFRIVGE 535 >gi|18071218|ref|NP_542303.1| hypothetical protein PBC5p43 [Sinorhizobium phage PBC5] gi|17940324|gb|AAL49568.1|AF448724_5 unknown [Sinorhizobium phage PBC5] Length = 749 Score = 45.3 bits (105), Expect = 0.014, Method: Composition-based stats. 
Identities = 26/226 (11%), Positives = 67/226 (29%), Gaps = 18/226 (7%) Query: 133 DGDGI------AELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASI 186 +GDG + + N PFT + R + + +I Sbjct: 314 EGDGEIIEKVSMRMYVALFTSAGLLWLSPSPYRHNRYPFTPIWNKRRGRDGMPYGMIRNI 373 Query: 187 IEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVL 246 +IQ + L L + + I+ +G++ D + + + Sbjct: 374 RDIQSDINKRASKALHIL---SSNKVIMDDGAVEDINELAEEIARPDAIIVKQQGKEFKI 430 Query: 247 GIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVE 306 + + ++ L G++D + G + + A ++ G Sbjct: 431 DTD-RELGQWHLELMSRNISMLQQVGGVTDENLGRTTNAVSGK---AIIARQEQGSLATA 486 Query: 307 LIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRD-----QWVSFD 347 + ++ L + Q + + R+ + ++V+ + Sbjct: 487 GLFDNHRYAQQVRGEKTLANMEQFMSEEKKFRITNKRGTPEYVAVN 532 >gi|264678783|ref|YP_003278690.1| hypothetical protein CtCNB1_2648 [Comamonas testosteroni CNB-2] gi|262209296|gb|ACY33394.1| hypothetical protein CtCNB1_2648 [Comamonas testosteroni CNB-2] Length = 747 Score = 44.9 bits (104), Expect = 0.017, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 52/186 (27%), Gaps = 14/186 (7%) Query: 164 PFTCLRAMRAPHCFI-------GESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQE 216 PF R + P + +IQ + L Y + + I ++ Sbjct: 351 PFKHNRFLMVPIWGYRRARDGLAYGAWRGMRDIQDDLNKRRSKAL---YALSVNRIIAEK 407 Query: 217 GSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISD 276 G++ D + + + + R + + + + + Q+ Sbjct: 408 GAVDDWDDLRD----EAARPDGIIIKNPQRELKFDNNMGDFQANVELAAQDAQLIRNAGG 463 Query: 277 ISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRM 336 ++ + A + G L + + L I Q + ++ Sbjct: 464 VTDENLGRDTNANSGRAILAKQDQGSLTTSEFFDNLLLAIRQAGQLRLSHIEQFYTEEKV 523 Query: 337 VRLRDQ 342 +R+ + Sbjct: 524 IRIVGE 529 >gi|57234878|ref|YP_181101.1| phage domain-containing protein [Dehalococcoides ethenogenes 195] gi|57225326|gb|AAW40383.1| phage domain protein [Dehalococcoides ethenogenes 195] Length = 303 Score = 43.0 bits (99), Expect = 0.079, Method: Composition-based stats. 
Identities = 17/82 (20%), Positives = 24/82 (29%), Gaps = 1/82 (1%) Query: 156 CNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQ 215 + +PF +R P F G S I+E Q+ L Q L P V Sbjct: 197 KPNPYGFIPFVIYPNLREPKRFWGVSDLDEIMEPQRELNRALSQLSRILELSGNP-IAVL 255 Query: 216 EGSIIDPESVLNPQFGKPIRVA 237 E + + P Sbjct: 256 ENVEQSEDIAVRPGGFTATVRF 277 >gi|300361373|ref|ZP_07057550.1| SPP1 family phage portal protein [Lactobacillus gasseri JV-V03] gi|300353992|gb|EFJ69863.1| SPP1 family phage portal protein [Lactobacillus gasseri JV-V03] Length = 468 Score = 41.4 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 26/220 (11%), Positives = 60/220 (27%), Gaps = 9/220 (4%) Query: 113 YSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMR 172 Y + + +Y D + D A + D++ +++ P+ + A+ Sbjct: 157 YDNTVNREPLAFVMYEYYDTESDWQARGKIYYANKVYDFDDMKISDDDTVNPYKMVPAVE 216 Query: 173 APHCFIGESLAASIIEIQKIKTVLLRQ------TLDNLYWQNQPQTIVQEGSIIDPESVL 226 + + + + +L Q DN Y + + + Sbjct: 217 FYENEERQGVLDPVKTLLNAYDKVLSQKANQNEYFDNAYLALFNVHLKTD---KKTGKPI 273 Query: 227 NPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEIL 286 + + V + +YL + +S + + Sbjct: 274 LDLVNNRFLYLPNTTPGTEPKLEFVSKPDNDGMQENYLKRLEDLIYQVSMVPNLNDQAFA 333 Query: 287 QNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRL 326 N + A S + VR + L LFR + + Sbjct: 334 GNQSGVALQYKLLSLQNKTANQVRKFKKSLRQLFRVIFSV 373 >gi|258517297|ref|YP_003193519.1| hypothetical protein Dtox_4229 [Desulfotomaculum acetoxidans DSM 771] gi|257781002|gb|ACV64896.1| hypothetical protein Dtox_4229 [Desulfotomaculum acetoxidans DSM 771] Length = 508 Score = 41.0 bits (94), Expect = 0.24, Method: Composition-based stats. 
Identities = 35/276 (12%), Positives = 77/276 (27%), Gaps = 36/276 (13%) Query: 103 ENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAEL---------RRVIMAGG---TG 150 E + ++ ++ + + +D + L R + + G Sbjct: 150 EQVVQVIRDPLNNNLVREYVIQAAHDWLDDQDNAKRSLVSQRISATKRIIQITGDIPQDQ 209 Query: 151 KDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQP 210 + + W +P + G+S I K +L L + P Sbjct: 210 AAYMEEDNPWGFIPIVHFKNEGDDTREFGQSDLEPIEPFFKAYHDVLLHALQGSKMHSTP 269 Query: 211 QT--------IVQEGSIIDPESVLNPQFGKPIRVAAGM-----DIRSVLGIHSVPMIEKS 257 + + + G I + + I I + Sbjct: 270 RLKFKLKDIAGFLRNNFGVTDPYAFASQGGTISLDGHEFFLFSEDEDAEFIEVKSAIGDA 329 Query: 258 FSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVR---TLAQ 314 +L +L +VD + + + G T ++ S +++ V I R + Sbjct: 330 TQLLQFLFYCIVDASETPEFAFGVH-------TPSSLSSVKEQMPILVRKIARKREQFTE 382 Query: 315 GLEILFRGLLRL-IIQHQDKVRMVRLRDQWVSFDPR 349 + L R +L + + K +W +PR Sbjct: 383 SWQRLARMVLAMTAMAGNKKAGSYATVLEWDEVNPR 418 >gi|150390340|ref|YP_001320389.1| hypothetical protein Amet_2578 [Alkaliphilus metalliredigens QYMF] gi|149950202|gb|ABR48730.1| hypothetical protein Amet_2578 [Alkaliphilus metalliredigens QYMF] Length = 498 Score = 41.0 bits (94), Expect = 0.24, Method: Composition-based stats. 
Identities = 34/228 (14%), Positives = 69/228 (30%), Gaps = 28/228 (12%) Query: 141 RRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQT 200 + G ++ W +P + G+S I K ++ Sbjct: 189 IEITGDKPEGIESGTFPNTWGFIPIIHFKNEPDETMKFGQSDIEPIEPYIKAYHDVMLHA 248 Query: 201 LDNLYWQNQPQTIV----QEGSIIDPESVLNP----QFGKPIRVAAGM-----DIRSVLG 247 L + P+ + G + + + +P + G I + Sbjct: 249 LKGSKMHSTPKLKLKLKDVAGFLANNFGIEDPVKFAKEGGNINLDGHEILFFTQDEDAQF 308 Query: 248 IHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVEL 307 I + +L + +VD + + G T +A + +++ V Sbjct: 309 IEVKSATGDAKQLLKMIFYCIVDISETPEFIFGVH-------TPSALASVKEQMPIMVNK 361 Query: 308 IVR---TLAQGLEILFRGLLRLIIQ---HQDKVRMVRLRDQWVSFDPR 349 I R A+ ++L R +L + Q ++ V L W DPR Sbjct: 362 IKRKREQFAEQWQLLARMVLAMSSQVRGYKFSDYTVSL--GWDEVDPR 407 >gi|253583086|ref|ZP_04860294.1| predicted protein [Fusobacterium varium ATCC 27725] gi|251834978|gb|EES63531.1| predicted protein [Fusobacterium varium ATCC 27725] Length = 517 Score = 41.0 bits (94), Expect = 0.28, Method: Composition-based stats. Identities = 19/214 (8%), Positives = 55/214 (25%), Gaps = 9/214 (4%) Query: 116 KALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPH 175 E + E + + + DN+L + +N P+T R P+ Sbjct: 211 NENEEVTVIECVMPVAETDTFE------WILFDERMDNVLYRKIYNYNPYTIFRFTVMPN 264 Query: 176 CFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIR 235 G L + ++ + +P ++ + L+P G Sbjct: 265 NVWGRGLGVTCLDYYERLCYCENLRARQSIRIVEPPLLLVGDKRLIDGFDLDP-NGLNWG 323 Query: 236 VAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATS 295 + + +++ + + Q + ++ + Sbjct: 324 GDGITGQANAVPMNTTGTLLPLDQDIQRYTQVI-QAIHFNNPMGSVENRTTRGNAEMGYR 382 Query: 296 LIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + + + + L F +++ Sbjct: 383 MQLFN-QKFSDATSNLYDEVLIPTFAKPKQILQD 415 >gi|254240166|ref|ZP_04933488.1| portal protein [Pseudomonas aeruginosa 2192] gi|126193544|gb|EAZ57607.1| portal protein [Pseudomonas aeruginosa 2192] Length = 773 Score = 40.6 bits (93), Expect = 0.33, Method: Composition-based stats. 
Identities = 32/230 (13%), Positives = 70/230 (30%), Gaps = 8/230 (3%) Query: 123 YYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESL 182 I ++ +RR G + P+ R I Sbjct: 292 IALASGRISPKKVTVSRVRRSYWLGPHCLHDGPSPYTHRHFPYVPFFGFREDATGIPYGY 351 Query: 183 AASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDI 242 + Q + + + + +G++ ++ L Q +P Sbjct: 352 VRGMKYAQDSLNSGMSKLRWGMSVT---RVERTKGAVDMTDAQLRRQIARPDADIVLNAE 408 Query: 243 RSVLGIHSVPMIEKSFSMLHYLDQELVD-RTGISDISSGFSPEILQNMTATATSLIEQSG 301 + +++ +++ Q L D R I +S+ + + TAT+ +Q Sbjct: 409 HFASNRGARFEVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTATSGIQEQQQI 468 Query: 302 VGQVELIVR---TLAQGLEILFRGLLRLIIQHQDKVRM-VRLRDQWVSFD 347 + I R G ++ LL +I++ + R V + V+ D Sbjct: 469 EQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTAD 518 >gi|320103517|ref|YP_004179108.1| hypothetical protein Isop_1979 [Isosphaera pallida ATCC 43644] gi|319750799|gb|ADV62559.1| hypothetical protein Isop_1979 [Isosphaera pallida ATCC 43644] Length = 454 Score = 40.6 bits (93), Expect = 0.33, Method: Composition-based stats. Identities = 20/179 (11%), Positives = 44/179 (24%), Gaps = 12/179 (6%) Query: 157 NEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQE 216 + LPF F L + ++ + Q L + + P Sbjct: 251 PNPYGRLPFAFAHDELVTRDFWDGGLGDFLADLDREIDREWSQ-LAWIGQFDLP-IGFLR 308 Query: 217 GSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISD 276 + + P P+ A + + S + L ++ G+ Sbjct: 309 DASPTARLIARPGHFNPLVAARPGEKPDAFYLRSEYDPTRRLDGLERYLFLALELLGVPR 368 Query: 277 ISSGFSPEILQNMTATATSL------IEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQ 329 + Q + + +L + + +L A L + Q Sbjct: 369 AAIRLE----QGRSPSGAALVAEHWPLLTRARRRRDLFAVLEADLLATMLHCAGTWYRQ 423 >gi|239835186|ref|ZP_04683512.1| Hypothetical protein OINT_3000019 [Ochrobactrum intermedium LMG 3301] gi|239821162|gb|EEQ92733.1| Hypothetical protein OINT_3000019 [Ochrobactrum intermedium LMG 3301] Length = 772 Score = 40.6 bits (93), Expect = 0.36, Method: Composition-based stats. 
Identities = 25/199 (12%), Positives = 63/199 (31%), Gaps = 16/199 (8%) Query: 156 CNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQ 215 N+ P T + R + + + +IQ + L Y + + I++ Sbjct: 343 SPYRHNQFPLTPIWGYRRGRNNLPYGIIRRLKDIQVDVNKRASKAL---YILSSNKIIME 399 Query: 216 EGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGIS 275 EG+ D ++ + + L + + ++ + +G++ Sbjct: 400 EGATDDLDAFTEEASRPDAVLVVKTGKKVELNAE-RELAQGHLELMSRSIGMIQQASGVT 458 Query: 276 DISSGFSPEILQNMTATATSLIEQSGVGQVELIVRT--LAQGLEILFRGLLRLIIQHQDK 333 D G + + + A ++ G A+ + +L I Q + Sbjct: 459 DEVLGRTTNAVSGI---AIQRRQEQGSLATAKFFDNLMFAEQVR--GEKVLANIEQFMSE 513 Query: 334 VRMVRLRD-----QWVSFD 347 + R+ + Q+V + Sbjct: 514 KKSFRITNTRGTPQYVDIN 532 >gi|291335183|gb|ADD94807.1| hypothetical protein [uncultured phage MedDCM-OCT-S12-C102] Length = 574 Score = 40.6 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 44/354 (12%), Positives = 90/354 (25%), Gaps = 60/354 (16%) Query: 22 SHREDGGEKVHDLRIRRKYSQGKVCVDAVSP-DEFLIHPDSVDIEKSPIVGRKLYLTRSD 80 GE + ++ + + + + A+ P E L+ P++ D ++ + R+ Y + ++ Sbjct: 96 KEVAKTGETIFEVP---QVIKNQPSIVALRPYFEVLMPPETQDWHRARAIFRRDYYSVAE 152 Query: 81 LISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGI--- 137 + + +E + S + +D + I Sbjct: 153 IEEKA-------TNGGWDEEFVEKIKRTAGKNSSVWDTGLSPVTGDSEKLDDRSNLIEIV 205 Query: 138 -AELRRVIMAGGTGKDNILCNEEWNEL--------------------PFTCLRAMRAPHC 176 A RRV G G + + ++ PF C + Sbjct: 206 HAYSRRVTENGNPGIYQTVYSPYMHKDERGKECFAQHELVTEAGGTYPFECFTREKTRRS 265 Query: 177 -FIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIR 235 ++ + Q Q D + P V + + G I Sbjct: 266 PIESRGVSEIVKTWQSEYKAQADQVFDRSSFDTLPALKVP----LRYGQRIKIGPGVQIS 321 Query: 236 VAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATS 295 DI + + L Q DR S E Sbjct: 322 EQRPGDISW---MDTPKRGADLAFQLMDQIQVRTDRYFGRPNSQVAPVET---------- 368 Query: 296 LIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWVSFDPR 349 + + V + + + + L + D R + PR Sbjct: 369 
------QLRQQAYVHRWLRHMSTVVNRMWDLTQKFDDDERFATVTGTGKPI-PR 415 >gi|332981152|ref|YP_004462593.1| hypothetical protein Mahau_0568 [Mahella australiensis 50-1 BON] gi|332698830|gb|AEE95771.1| hypothetical protein Mahau_0568 [Mahella australiensis 50-1 BON] Length = 503 Score = 40.3 bits (92), Expect = 0.44, Method: Composition-based stats. Identities = 25/236 (10%), Positives = 60/236 (25%), Gaps = 18/236 (7%) Query: 128 VTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASII 187 + L ++ W +P + G+S + Sbjct: 183 CIVTQHISKERRLVQITGDMPPDVQPGEEKNPWGFIPIVHFKNEGDETREFGQSDLEPVE 242 Query: 188 EIQKIKTVLLRQTLDNLYWQNQPQTI--------VQEGSIIDPESVLNPQFGKPIRVAAG 239 K ++ + + P+ + + G I + Sbjct: 243 PFLKAYHDVMLHAMQGSKMHSTPRLKLKLKDVSRFLANNFGITDPADFAAKGGTINLDGH 302 Query: 240 MDIRSVLGIHSVPM-----IEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATAT 294 + + + I + +L L +VD + + S G + Sbjct: 303 ELLIFQDEEDAGFIEVNSAIGDAKDLLQLLFYCIVDTSETPEFSFGVHTPSSLSSVKEQM 362 Query: 295 SLIEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKV-RMVRLRDQWVSFDPR 349 ++ + ++ + + L R +L + Q + K +W DPR Sbjct: 363 PILVR----RIARKREHFTEAWQRLARIVLAMTAQAEGKKFSTYATTLEWDDIDPR 414 >gi|56692922|ref|YP_164304.1| portal protein [Pseudomonas phage F116] gi|48527508|gb|AAT45883.1| portal protein [Pseudomonas phage F116] Length = 772 Score = 39.9 bits (91), Expect = 0.61, Method: Composition-based stats. 
Identities = 29/229 (12%), Positives = 59/229 (25%), Gaps = 7/229 (3%) Query: 123 YYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESL 182 I ++ +RR G + P+ R I Sbjct: 292 IALASGRISPKKVTVSRVRRSYWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPYGY 351 Query: 183 AASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDI 242 + Q + + + +T + I + Sbjct: 352 VRGMKYAQDSLNSGVSKLRWGMSVARVERTKGAVAMTDAQFRRQIARPDADIVLDENHMA 411 Query: 243 RSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGV 302 + + L +S+I++GF + TAT+ +Q Sbjct: 412 KPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGF---QGRKGTATSGIQEQQQIE 468 Query: 303 GQVELIVR---TLAQGLEILFRGLLRLIIQHQDKVRM-VRLRDQWVSFD 347 + I R G ++ LL +I++ + R V + V+ D Sbjct: 469 QSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTAD 517 >gi|332704584|ref|ZP_08424672.1| hypothetical protein Desaf_3493 [Desulfovibrio africanus str. Walvis Bay] gi|332554733|gb|EGJ51777.1| hypothetical protein Desaf_3493 [Desulfovibrio africanus str. Walvis Bay] Length = 809 Score = 39.9 bits (91), Expect = 0.64, Method: Composition-based stats. Identities = 11/76 (14%), Positives = 27/76 (35%), Gaps = 1/76 (1%) Query: 43 GKVCVDAVSPDEFLIHP-DSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQN 101 G+V V P F ++P D ++ + V ++ D+ + + Sbjct: 184 GEVRTVVVDPFHFGVYPVDCKKLQDAEGVLHFYPMSVRQARRKWPDQAPLIRPDADLLKE 243 Query: 102 IENTWKFPKNQYSDKA 117 + +T + + D+ Sbjct: 244 LGDTRRLIGGEGRDQN 259 >gi|297605545|ref|NP_001057333.2| Os06g0264200 [Oryza sativa Japonica Group] gi|53793155|dbj|BAD54363.1| zinc finger protein-like [Oryza sativa Japonica Group] gi|255676906|dbj|BAF19247.2| Os06g0264200 [Oryza sativa Japonica Group] Length = 481 Score = 39.5 bits (90), Expect = 0.76, Method: Composition-based stats. 
Identities = 32/274 (11%), Positives = 71/274 (25%), Gaps = 28/274 (10%) Query: 77 TRSDLISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDG 136 T ++L D E++ + + ++ + + + DG Sbjct: 213 TDAELREFAADMEALLGRGLDDGNDEDSFCMETLGLIEPVDDD-------AGRVKVEADG 265 Query: 137 IAELRRVIMAG---GTGKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIK 193 A + T +L + P AA+ + Q + Sbjct: 266 DAGMTLAWCHELDTETSSGEMLDIDFDCGSPQAATTPDEKVGS---SGPAAADDDAQLQQ 322 Query: 194 TVL---LRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHS 250 + L L W P T + + +S + +L Sbjct: 323 SNLALSLNYEAIIESWGTSPWTDGERPHVKLDDSWPRDYSVRATPCTPYASSHRILH--- 379 Query: 251 VPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVR 310 L D L R + + + A + G+ + R Sbjct: 380 ---------NLAGTDDLLRRRAAVQGVWMAAAGVFGHGGEEQALTPRLGMDGGREARVSR 430 Query: 311 TLAQGLEILFRGLLRLIIQHQDKVRMVRLRDQWV 344 + LF +R ++ + + R++ ++V Sbjct: 431 YREKRRTRLFSKKIRYEVRKLNAEKRPRMKGRFV 464 >gi|300087306|ref|YP_003757828.1| phage portal protein SPP1 [Dehalogenimonas lykanthroporepellens BL-DC-9] gi|299527039|gb|ADJ25507.1| phage portal protein, SPP1 [Dehalogenimonas lykanthroporepellens BL-DC-9] Length = 423 Score = 39.1 bits (89), Expect = 0.91, Method: Composition-based stats. Identities = 26/195 (13%), Positives = 53/195 (27%), Gaps = 12/195 (6%) Query: 158 EEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEG 217 W +P+ + P G S ++ Q L Q L P V E Sbjct: 187 NPWRFIPYLVFPNLPRPKSSWGMSDLENLTGPQLELERALSQLSRILELSGNP-IAVLEN 245 Query: 218 SIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDI 277 + + P + +L + + + L + L D + Sbjct: 246 VEESSDIAVAP---GAVWHLPEEARAYLLDLLQGGGGQLHLDYIDLLFRVLHDLAEVPRA 302 Query: 278 SSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQG--LEILFRGLLRLIIQHQDKVR 335 + G + + +L + + + L + L L ++ + Sbjct: 303 AFGGVGRDI-----SGVALELELQPLLHRVWRKRLVRTGVYRRRAEMALALYGRYLGRDF 357 Query: 336 -MVRLRDQWVSFDPR 349 V ++ W PR Sbjct: 358 NGVDVQVDWAPVLPR 372 >gi|256845624|ref|ZP_05551082.1| predicted protein [Fusobacterium sp. 
3_1_36A2] gi|256719183|gb|EEU32738.1| predicted protein [Fusobacterium sp. 3_1_36A2] Length = 550 Score = 39.1 bits (89), Expect = 0.92, Method: Composition-based stats. Identities = 25/210 (11%), Positives = 57/210 (27%), Gaps = 6/210 (2%) Query: 119 EMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFI 178 E I E V + + + + + +L E N P+T R Sbjct: 217 EKINIIECVVGVFDEDTSTYKYYHGLFT--EAFEEMLYEGELNYNPYTVFRWKINSSNPW 274 Query: 179 GESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAA 238 G + +++ K L + + P + + + L Sbjct: 275 GIGIGLENLDLFKELKDLKEKRKKHADKIVSPPLNFYGSTDLINKVSLKA---NAKNYGG 331 Query: 239 GMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISSGFSPEILQNMTATATSLIE 298 G+ + + + ++Q + + +N +AT SL Sbjct: 332 SGIGGDKYGVEPINIGTNLLPVEKDIEQVKQEIREVFMSQPLGDVSDTKNRSATEMSLRH 391 Query: 299 QSGVGQVELIVRTLA-QGLEILFRGLLRLI 327 + + + + LE F ++ Sbjct: 392 EMFRKEFSGTYELINTELLEPTFMNAYYIM 421 >gi|313113968|ref|ZP_07799523.1| site-specific recombinase, phage integrase family [Faecalibacterium cf. prausnitzii KLE1255] gi|310623670|gb|EFQ07070.1| site-specific recombinase, phage integrase family [Faecalibacterium cf. prausnitzii KLE1255] Length = 377 Score = 39.1 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 12/99 (12%), Positives = 29/99 (29%), Gaps = 5/99 (5%) Query: 11 IKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIV 70 + +DV++ E + G D +I+ ++ V V + + L+ + V Sbjct: 207 LTWADVDLKEATITVHSGYNFKDKKIKDPKTEAGVRVVNIP--KILVDYLKTQQDDCLYV 264 Query: 71 GRK---LYLTRSDLISMGYDRESINNLPIISSQNIENTW 106 +T ++ + N Sbjct: 265 LHTVKGHRMTEQAWKTLWSSYMADLNAKYGYHGEESKKR 303 >gi|75760980|ref|ZP_00740985.1| Phage protein [Bacillus thuringiensis serovar israelensis ATCC 35646] gi|74491523|gb|EAO54734.1| Phage protein [Bacillus thuringiensis serovar israelensis ATCC 35646] Length = 287 Score = 39.1 bits (89), Expect = 1.1, Method: Composition-based stats. 
Identities = 8/45 (17%), Positives = 22/45 (48%) Query: 297 IEQSGVGQVELIVRTLAQGLEILFRGLLRLIIQHQDKVRMVRLRD 341 + + ++ + + G++ L + +L L+ +H + RM R+ Sbjct: 1 MVEQENEKLAVSSQNYEHGMKRLLQRVLMLMKKHYTEERMARILG 45 >gi|54302247|ref|YP_132240.1| putative head-tail connector protein [Photobacterium profundum SS9] gi|46915668|emb|CAG22440.1| hypothetical protein PBPRB0567 [Photobacterium profundum SS9] Length = 552 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 25/270 (9%), Positives = 72/270 (26%), Gaps = 9/270 (3%) Query: 82 ISMGYDRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELR 141 + Y + + + + + +Y+ ++ + + + Sbjct: 172 RKVEYRVSQVVEKFGLDNVSQSIKSAYRSGKYNQLTEIRHLVFDNPDFVPRAFSAVRKPI 231 Query: 142 R-VIMAGGTGKDNILCNEEWNELPFTCLRAMRAPHCFIG-ESLAASIIEIQKIKTVLLRQ 199 + ++ L ++E PF R + G + K R Sbjct: 232 CSIWYDPADDRNPFLRRSGFDEFPFVTPRWEVIGNDTYGSFGPGMLALGSIKGLQKDQRD 291 Query: 200 TLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFS 259 + +P + +P S+L + + ++ Sbjct: 292 KYEAQDKMLKPPMVGPSSLKNNPRSLLP----GAVTFVDNQQGQQGFTPAFQTNFPLNYQ 347 Query: 260 MLHYLD-QELVDRTGISDISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTL-AQGLE 317 + D + ++D D+ N TAT + ++ + + ++ +GL+ Sbjct: 348 LESIRDTRAIIDSAFFKDLFLAVIDIGKSNTTATEIAARKEEKLLMLGPVLNRFNEEGLD 407 Query: 318 ILFRG-LLRLIIQHQDKVRMVRLRDQWVSF 346 + + + L V+ Sbjct: 408 PIVSASFYEMNRRGMLPEPPPELDGVDVNI 437 >gi|319956966|ref|YP_004168229.1| oligopeptidase a [Nitratifractor salsuginis DSM 16511] gi|319419370|gb|ADV46480.1| oligopeptidase A [Nitratifractor salsuginis DSM 16511] Length = 651 Score = 38.7 bits (88), Expect = 1.4, Method: Composition-based stats. 
Identities = 28/177 (15%), Positives = 54/177 (30%), Gaps = 19/177 (10%) Query: 8 HMLIKDSDVEVLEHSHREDGGEKVHDLRIRRKYSQGKVCVDA---VSP------------ 52 ++L E++ + G DL R GKV + Sbjct: 155 NLLDATKAYELIIEDPEDVAGIPESDLAAARFEEDGKVQWRFTLQIPSYLAYMTYGPNRQ 214 Query: 53 --DEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLPIISSQNIENTWKFPK 110 +E + E + ++ R L L + +G+D + L +++ F + Sbjct: 215 LREELYRAYTTRAPENAQVIDRILELRQQKAKLLGFDNYAEYALQTRDARDEWEVTDFLE 274 Query: 111 NQYSDKALEMIEYYELYVTIDYDGDGIAEL--RRVIMAGGTGKDNILCNEEWNELPF 165 + E + DGI +L V G K ++ +E P+ Sbjct: 275 KLTELSLPQGRAELEELRRFARELDGIEDLASYDVAYYGEKLKKHLYDFDESETKPY 331 >gi|260827316|ref|XP_002608611.1| hypothetical protein BRAFLDRAFT_115635 [Branchiostoma floridae] gi|229293962|gb|EEN64621.1| hypothetical protein BRAFLDRAFT_115635 [Branchiostoma floridae] Length = 513 Score = 38.3 bits (87), Expect = 1.6, Method: Composition-based stats. Identities = 27/240 (11%), Positives = 69/240 (28%), Gaps = 19/240 (7%) Query: 22 SHREDGGEKVHDLRIRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDL 81 D +DL + G+V +D +S E++ GR R +L Sbjct: 269 EETRDIVMPTYDLTESTLETMGRVSLDMLSVQGNTGPRWVNKTEQALWRGRDSRRERLNL 328 Query: 82 ISMGY-DRESINNLPIISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAEL 140 + +G + I+ + K+ + + ++++ ++ DG A Sbjct: 329 VDLGRKYPDLIDAALTNFFFFRDEEAKY---GPKVQHISFFDFFKYKYQLNIDGTVAAYR 385 Query: 141 RRVIMAGGT----GKDNILCNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVL 196 ++AG + + + + P+ R + ++ Sbjct: 386 LPYLLAGDSAVFKHESVYYEHFYSDLEPYVHYIPFR-----------KDLTDLVPKIRWA 434 Query: 197 LRQTLDNLYWQNQPQTIVQEGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEK 256 R D + ++ + + + + + G+ VP + Sbjct: 435 KRNDDDARQIAENGREYARKNLLANSIFCYYERLFREYASRQVDQPQVREGMEEVPQPTE 494 >gi|308071876|emb|CBW54797.1| putative head-tail connector protein [Pantoea phage LIMElight] Length = 529 Score = 38.3 bits (87), Expect = 1.8, Method: Composition-based stats. 
Identities = 27/217 (12%), Positives = 63/217 (29%), Gaps = 6/217 (2%) Query: 100 QNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNILCNEE 159 Q++ ++ + QY E + Y + +G + V Sbjct: 181 QDLPEDFRLSRLQYRTDPFEDVTLYT---KVTRKHNGARVMYEVTQEVEDYPIGTPSTYP 237 Query: 160 WNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQEGSI 219 P+ L G + L +L + I+ G+ Sbjct: 238 EYLCPYIPLTWNLVTGENYGRGHVEDFAGDFARLSELSESSLLYEVEMMRLINIIDPGAG 297 Query: 220 IDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISDISS 279 ID + ++ GK + + V+ + + + LV + I+ + + Sbjct: 298 IDLDDFMDADCGKAVAGKSNAAGNGVVAHEGGN--AQKLAAVQNDIANLVQQLSIAFMYT 355 Query: 280 GFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGL 316 G + + + +TA + + L++ L Sbjct: 356 GNTRDA-ERVTAEEIRANVSEANQTLGGVYANLSEVL 391 >gi|319440825|ref|ZP_07989981.1| hypothetical protein CvarD4_03568 [Corynebacterium variabile DSM 44702] Length = 542 Score = 36.8 bits (83), Expect = 4.9, Method: Composition-based stats. Identities = 32/230 (13%), Positives = 59/230 (25%), Gaps = 17/230 (7%) Query: 114 SDKALEMIEYYELYVTIDYDGDGIAE--------LRRVIMAGGTGKDNILCNEEWNELPF 165 S IEY + D GD L V+ A G I Sbjct: 209 SRHTPGRIEYTLMAGRDDNLGDTEPLANHPSTVGLAAVVDADGGVATGITRIAAVYIPNV 268 Query: 166 TCLRAMRAPHCFIGES---LAASIIEIQKIKTVLLRQTLDNLYWQNQP-----QTIVQEG 217 + A R L A + + + L + +G Sbjct: 269 QPIPAFRRSGQLRNMGRPDLPADTYGLLDMLDEVWTDLKRELRTAKARVIVPEMMLDFKG 328 Query: 218 SIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPM-IEKSFSMLHYLDQELVDRTGISD 276 + E + + + + +E+ L +E++ R S Sbjct: 329 AGRGMEFDPEREIYSAVADTPASIENGSPMVVQPQIRVEQYLRACDALVREVLRRASYSP 388 Query: 277 ISSGFSPEILQNMTATATSLIEQSGVGQVELIVRTLAQGLEILFRGLLRL 326 + G + +TA ++ + + R GL L ++ L Sbjct: 389 GTFGLNDNTSGAVTAREIEANSRATLQTFKAKARHWKAGLAHLAAAMVEL 438 >gi|269926874|ref|YP_003323497.1| hypothetical protein Tter_1769 [Thermobaculum terrenum ATCC BAA-798] gi|269790534|gb|ACZ42675.1| hypothetical protein Tter_1769 [Thermobaculum terrenum ATCC BAA-798] Length = 435 Score = 36.8 bits (83), Expect = 5.1, Method: Composition-based stats. 
Identities = 21/140 (15%), Positives = 43/140 (30%), Gaps = 6/140 (4%) Query: 157 NEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQE 216 +P+ +R P F GES +I + + L + + V E Sbjct: 193 PNPLGRIPYVIFPNIRRPFSFWGESDLVDLIGPARELNKRMS-VLAWVLEVSGNPIAVLE 251 Query: 217 GSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGISD 276 + D + G+ + A +L + ++ + L + L D Sbjct: 252 NAEADG---IRVGPGQLWELPAESKA-YLLDLLQGGGVKLHIEYVDLLYRALHDIAETPR 307 Query: 277 ISSGFSPEILQNMTATATSL 296 + G S ++ A + Sbjct: 308 TAFGDSGRVISGA-ALEVEM 326 >gi|259419010|ref|ZP_05742927.1| hypothetical protein SCH4B_4395 [Silicibacter sp. TrichCH4B] gi|259345232|gb|EEW57086.1| hypothetical protein SCH4B_4395 [Silicibacter sp. TrichCH4B] Length = 506 Score = 36.4 bits (82), Expect = 7.1, Method: Composition-based stats. Identities = 29/275 (10%), Positives = 66/275 (24%), Gaps = 31/275 (11%) Query: 36 IRRKYSQGKVCVDAVSPDEFLIHPDSVDIEKSPIVGRKLYLTRSDLISMGYDRESINNLP 95 + R G + +AV + + P + IE R+ +L + D + + Sbjct: 136 VDRPTLNGAINFEAVPIPQLYVTPGPLGIED---RFRRQRFHYRNLKVLFPDAKFPRAIE 192 Query: 96 IISSQNIENTWKFPKNQYSDKALEMIEYYELYVTIDYDGDGIAELRRVIMAGGTGKDNIL 155 ++ + + + +D G + G + Sbjct: 193 DKIKKSSNALAVVVHGFWRTFEDVENPVWRHEIRVDGKPIG--------LDKDVGSIGAV 244 Query: 156 CNEEWNELPFTCLRAMRAPHCFIGESLAASIIEIQKIKTVLLRQTLDNLYWQNQPQTIVQ 215 R G ++ + + L+R ++ L P Sbjct: 245 N--------LVVGRFNPYAGSAWGRGPGRKLLPVFRQYDELVRMNMEGLDRTLDPPFTYP 296 Query: 216 EGSIIDPESVLNPQFGKPIRVAAGMDIRSVLGIHSVPMIEKSFSMLHYLDQELVDRTGIS 275 ++D L G ++ + + FS + Sbjct: 297 HDGMLDLSQGLENGVG---YPTMPGTKDALQPVLFGTLDYGFFSEEKLEQKIRDGFYREK 353 Query: 276 DISSGFSPEILQNMTATATSLIEQSGVGQVELIVR 310 + + T + S QV + R Sbjct: 354 EQA---------GKTPPSASQYIGQENKQVRRMAR 379 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: 
/data/usr2/db/fasta/nr.02
  Posted date: May 22, 2011 12:36 AM
  Number of letters in database: 999,999,281
  Number of sequences in database: 2,904,016

Database: /data/usr2/db/fasta/nr.03
  Posted date: May 22, 2011 12:41 AM
  Number of letters in database: 999,999,960
  Number of sequences in database: 2,935,328

Database: /data/usr2/db/fasta/nr.04
  Posted date: May 22, 2011 12:46 AM
  Number of letters in database: 842,794,627
  Number of sequences in database: 2,394,679

Lambda     K        H
0.308      0.116    0.263

Lambda     K        H
0.267      0.0354   0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,541,087,360
Number of Sequences: 14124377
Number of extensions: 48387896
Number of successful extensions: 175407
Number of sequences better than 10.0: 187
Number of HSP's better than 10.0 without gapping: 95
Number of HSP's successfully gapped in prelim test: 92
Number of HSP's that attempted gapping in prelim test: 175171
Number of HSP's gapped (non-prelim): 214
length of query: 350
length of database: 4,842,793,630
effective HSP length: 140
effective length of query: 210
effective length of database: 2,865,380,850
effective search space: 601729978500
effective search space used: 601729978500
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.3 bits)
S2: 81 (36.0 bits)
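
The footer statistics above are related by simple arithmetic, and the following Python sketch reproduces them from the values printed in this report. It is an illustration and not part of the BLAST output: the function names are hypothetical, it assumes the usual Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2, E approximately equal to search space times 2^-S'), and it assumes the second Lambda/K pair in the footer is the gapped one. Because this search used composition-based statistics, the E-values reported per hit can differ somewhat from this uncorrected estimate.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LENGTH = 350
DB_LETTERS = 4_842_793_630
DB_SEQUENCES = 14_124_377
EFFECTIVE_HSP_LENGTH = 140
# Assumption: the second Lambda/K pair in the footer is the gapped one.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0354


def effective_lengths(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # The length adjustment is subtracted once per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def approx_evalue(bits, search_space):
    """Uncorrected Karlin-Altschul E-value estimate for a bit score."""
    return search_space * 2.0 ** (-bits)


eff_query, eff_db, search_space = effective_lengths(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print(eff_query, eff_db, search_space)

# Raw score 104 is the Comamonas testosteroni hit, reported as 44.9 bits.
bits_104 = bit_score(104, GAPPED_LAMBDA, GAPPED_K)
print(round(bits_104, 1), approx_evalue(bits_104, search_space))
```

The three printed lengths agree with the "effective length of query", "effective length of database" and "effective search space" lines of the footer, and the raw scores 104, 99 and 81 convert to the 44.9, 43.0 and 36.0 bits shown in the report.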